Kaczmarz method
The Kaczmarz method or Kaczmarz's algorithm is an iterative algorithm for solving linear equation systems . It was first discovered by the Polish mathematician Stefan Kaczmarz,[1] and was rediscovered in the field of image reconstruction from projections by Richard Gordon, Robert Bender, and Gabor Herman in 1970, where it is called the Algebraic Reconstruction Technique (ART).[2] ART includes the positivity constraint, making it nonlinear.[3]
The Kaczmarz method is applicable to any linear system of equations, but its computational advantage relative to other methods depends on the system being sparse. It has been demonstrated to be superior, in some biomedical imaging applications, to other methods such as the filtered backprojection method.[4]
It has many applications ranging from computed tomography (CT) to signal processing. It can be obtained also by applying to the hyperplanes, described by the linear system, the method of successive projections onto convex sets (POCS).[5][6]
Algorithm 1: Randomized Kaczmarz algorithm
Let be a linear system, let be the th row of complex-valued matrix , and let be arbitrary complex-valued initial approximation to the solution of . For compute:
where denotes complex conjugation of , and is chosen from the set at random, with probability proportional to .
Under such circumstances converges exponentially fast to the solution of , and the rate of convergence depends only on the scaled condition number .
Theorem
Let be the solution of . Then Algorithm 1 converges to in expectation, with the average error:
Proof
We have
for all
Using the fact that we can write (1) as
for all
The main point of the proof is to view the left hand side in (2) as an expectation of some random variable. Namely, recall that the solution space of the equation of is the hyperplane , whose normal is Define a random vector Z whose values are the normals to all the equations of , with probabilities as in our algorithm:
with probability
Then (2) says that
for all
The orthogonal projection onto the solution space of a random equation of is given by
Now we are ready to analyze our algorithm. We want to show that the error reduces at each step in average (conditioned on the previous steps) by at least the factor of The next approximation is computed from as where are independent realizations of the random projection The vector is in the kernel of It is orthogonal to the solution space of the equation onto which projects, which contains the vector (recall that is the solution to all equations). The orthogonality of these two vectors then yields To complete the proof, we have to bound from below. By the definition of , we have
where are independent realizations of the random vector
Thus
Now we take the expectation of both sides conditional upon the choice of the random vectors (hence we fix the choice of the random projections and thus the random vectors and we average over the random vector ). Then
By (3) and the independence,
Taking the full expectation of both sides, we conclude that
Algorithm 2: Randomized Kaczmarz algorithm with relaxation
Given a real or complex matrix and a real or complex vector , respectively, the Kaczmarz's algorithm iteratively computes an approximation of the solution of the linear systems of equations . It does so by converging to the vector (where is the conjugate transpose of ) without the need to invert the matrix , which is algorithm's main advantage, especially when the matrix has a large number of rows.[7] Most generally, algorithm is defined as follows:
where , is the i-th row of the matrix , is the i-th component of the vector , and is a relaxation parameter. The above formulae gives a simple iteration routine. There are various ways for choosing the i-th equation and the relaxation parameter at the k-th iteration.[4]
If the linear system is consistent, the ART converges to the minimum-norm solution, provided that the iterations start with the zero vector. There are versions of the ART that converge to a regularized weighted least squares solution when applied to a system of inconsistent equations and, at least as far as initial behavior is concerned, at a lesser cost than other iterative methods, such as the conjugate gradient method.[8]
Advances
Recently, a randomized version of the Kaczmarz method for overdetermined linear systems was introduced by Strohmer and Vershynin[9] in which the i-th equation is selected with probability proportional to . The superiority of this selection was illustrated with the reconstruction of a bandlimited function from its nonuniformly spaced sampling values. However, it has been pointed out[10] that the reported success by Strohmer and Vershynin depends on the specific choices that were made there in translating the underlying problem, whose geometrical nature is to find a common point of a set of hyperplanes, into a system of algebraic equations. There will always be legitimate algebraic representations of the underlying problem for which the selection method in [9] will perform in an inferior manner.[9][10][11]
Notes
- ↑ Kaczmarz (1937)
- ↑ Gordon, Bender & Herman (1970)
- ↑ Gordon (2011)
- ↑ 4.0 4.1 Herman (2009)
- ↑ Censor & Zenios (1997)
- ↑ Aster, Borchers & Thurber (2004)
- ↑ Chong & Zak (2008:226)
- ↑ See Herman (2009) and references therein.
- ↑ 9.0 9.1 9.2 Strohmer & Vershynin (2009)
- ↑ 10.0 10.1 Censor, Herman & Jiang (2009)
- ↑ Strohmer & Vershynin (2009b)
References
- Kaczmarz, Stefan (1937), "Angenäherte Auflösung von Systemen linearer Gleichungen" (PDF), Bulletin International de l'Académie Polonaise des Sciences et des Lettres. Classe des Sciences Mathématiques et Naturelles. Série A, Sciences Mathématiques, 35: 355–357
- Chong, Edwin K. P.; Zak, Stanislaw H. (2008), An Introduction to Optimization (3rd ed.), John Wiley & Sons, pp. 226–230
- Gordon, Richard; Bender, Robert; Herman, Gabor (1970), "Algebraic reconstruction techniques (ART) for threedimensional electron microscopy and x-ray photography", Journal of Theoretical Biology 29: 471–481, doi:10.1016/0022-5193(70)90109-8, PMID 5492997
- Gordon, Richard (2011), Stop breast cancer now! Imagining imaging pathways towards search, destroy, cure and watchful waiting of premetastasis breast cancer. In: Breast Cancer - A Lobar Disease, editor: Tibor Tot, Springer, pp. 167–203
- Herman, Gabor (2009), Fundamentals of computerized tomography: Image reconstruction from projection (2nd ed.), Springer
- Censor, Yair; Zenios, S.A. (1997), Parallel optimization: theory, algorithms, and applications, New York: Oxford University Press
- Aster, Richard; Borchers, Brian; Thurber, Clifford (2004), Parameter Estimation and Inverse Problems, Elsevier
- Strohmer, Thomas; Vershynin, Roman (2009), "A randomized Kaczmarz algorithm for linear systems with exponential convergence" (PDF), Journal of Fourier Analysis and Applications 15: 262–278, doi:10.1007/s00041-008-9030-4
- Censor, Yair; Herman, Gabor; Jiang, M. (2009), "A note on the behavior of the randomized Kaczmarz algorithm of Strohmer and Vershynin", Journal of Fourier Analysis and Applications 15: 431–436, doi:10.1007/s00041-009-9077-x
- Strohmer, Thomas; Vershynin, Roman (2009b), "Comments on the randomized Kaczmarz method", Journal of Fourier Analysis and Applications 15: 437–440, doi:10.1007/s00041-009-9082-0
- Vinh Nguyen, Quang; Lumban Gaol, Ford (2011), "Proceedings of the 2011 2nd International Congress on Computer Applications and Computational Science", Springer 2: 465–469
External links
- A randomized Kaczmarz algorithm with exponential convergence
- Comments on the randomized Kaczmarz method
|