Bauer-Fike theorem

From Wikipedia, the free encyclopedia

In mathematics, the Bauer-Fike theorem is a standard result in the perturbation theory of the eigenvalue of a complex-valued diagonalizable matrix. In its substance, it states an absolute upper bound for the deviation of one perturbed matrix eigenvalue from a properly chosen eigenvalue of the exact matrix. Informally speaking, what it says is that the sensitivity of the eigenvalues is estimated by the condition number of the matrix of eigenvectors.

Contents

[edit] Theorem (Friedrich L. Bauer, C.T.Fike - 1960)

Let A\in\mathbb{C}^{n,n} be a diagonalizable matrix, and be V\in\mathbb{C}^{n,n} the non singular eigenvector matrix such that A = VΛV − 1. Be moreover μ an eigenvalue of the matrix A + δA; then an eigenvalue \lambda\in\sigma(A) exists such that:

|\lambda-\mu|\leq\kappa_p (V)\|\delta A\|_p

where \kappa_p(V)=\|V\|_p\|V^{-1}\|_p is the usual condition number in p-norm.

[edit] Proof

If \mu\in\sigma(A), we can choose λ = μ and the thesis is trivially verified (since \kappa_p(V)\geq 1).

So, be \mu\notin\sigma(A). Then det(\Lambda-\mu I)\ \ne\  0. μ being an eigenvalue of A + δA, we have det(A + δA − μI) = 0 and so

0=\ det(V^{-1})\ det(A+\delta A-\mu I)\ det(V)=det(\Lambda+V^{-1}\delta AV-\mu I)=
=det(\Lambda-\mu I)\ det[(\Lambda-\mu I)^{-1}V^{-1}\delta AV +I]

and, since det(\Lambda-\mu I)\ \ne\  0 as stated above, we must have

det[(\Lambda-\mu I)^{-1}V^{-1}\delta AV +I]=\ 0

which reveals the value -1 to be an eigenvalue of the matrix (Λ − μI) − 1V − 1δAV.

For each consistent matrix norm, we have |\lambda|\leq\|A\|, so, being all p-norms consistent, we can write:

1\leq\|(\Lambda-\mu I)^{-1}V^{-1}\delta AV\|_p\leq\|(\Lambda-\mu I)^{-1}\|_p\|V^{-1}\|_p\|V\|_p\|\delta A\|_p=
=\|(\Lambda-\mu I)^{-1}\|_p\ \kappa_p(V)\|\delta A\|_p

But (Λ − μI) − 1 being a diagonal matrix, the p-norm is easily computed, and yields:

\|(\Lambda-\mu I)^{-1}\|_p\ =\ max_{\|\mathbf{x}\|_p\ne 0} \frac{\|(\Lambda-\mu I)^{-1}\mathbf{x}\|_p}{\|\mathbf{x}\|_p}\ =
=max_{\lambda\in\sigma(A)}\frac{1}{|\lambda -\mu|}\ =\ \frac{1}{min_{\lambda\in\sigma(A)}|\lambda-\mu|}

whence:

min_{\lambda\in\sigma(A)}|\lambda-\mu|\leq\ \kappa_p(V)\|\delta A\|_p.



The theorem can also be reformulated to better suit numerical methods. In fact, dealing with real eigensystem problems, one often has an exact matrix A, but knows only an approximate eigenvalue-eigenvector couple, (\tilde{\lambda},\tilde{\mathbf{v}}), and needs to bound the error. The following version comes in help.

[edit] Theorem (Friedrich L. Bauer, C.T.Fike - 1960) (alternative statement)

Let A\in\mathbb{C}^{n,n} be a diagonalizable matrix, and be V\in\mathbb{C}^{n,n} the non singular eigenvector matrix such as A = VΛV − 1. Be moreover (\tilde{\lambda},\mathbf{\tilde{v}}) an approximate eigenvalue-eigenvector couple, and \mathbf{r}=A\mathbf{\tilde{v}}-\tilde{\lambda}\mathbf{\tilde{v}}; then an eigenvalue \lambda\in\sigma(A) exists such that:

|\lambda-\tilde{\lambda}|\leq\kappa_p (V)\frac{\|\mathbf{r}\|_p}{\|\mathbf{\tilde{v}}\|_p}

where \kappa_p(V)=\|V\|_p\|V^{-1}\|_p is the usual condition number in p-norm.

[edit] Proof

We can assume \tilde{\lambda}\notin\sigma(A) (otherwise, we can choose \lambda=\tilde{\lambda} and theorem is proven, since \kappa_p(V)\geq 1). Then (A-\tilde{\lambda} I)^{-1} exists, so we can write:

\mathbf{\tilde{v}}=(A-\tilde{\lambda} I)^{-1}\mathbf{r}=V(D-\tilde{\lambda} I)^{-1}V^{-1}\mathbf{r}

since A is diagonalizable; taking the p-norm of both sides, we obtain:

\|\mathbf{\tilde{v}}\|_p=\|V(D-\tilde{\lambda} I)^{-1}V^{-1}\mathbf{r}\|_p \leq \|V\|_p \|(D-\tilde{\lambda} I)^{-1}\|_p \|V^{-1}\|_p \|\mathbf{r}\|_p=

=\kappa_p(V)\|(D-\tilde{\lambda} I)^{-1}\|_p \|\mathbf{r}\|_p.

But, since (D-\tilde{\lambda} I)^{-1} is a diagonal matrix, the p-norm is easily computed, and yields:

\|(D-\tilde{\lambda} I)^{-1}\|_p=max_{\|\mathbf{x}\|_p \ne 0}\frac{\|(D-\tilde{\lambda} I)^{-1}\mathbf{x}\|_p}{\|\mathbf{x}\|_p}=
=max_{\lambda\in\sigma(A)} \frac{1}{|\lambda-\tilde{\lambda}|}=\frac{1}{min_{\lambda\in\sigma(A)}|\lambda-\tilde{\lambda}|}

whence:

min_{\lambda\in\sigma(A)}|\lambda-\tilde{\lambda}|\leq\kappa_p(V)\frac{\|\mathbf{r}\|_p}{\|\mathbf{\tilde{v}}\|_p}.



The Bauer-Fike theorem, in both versions, yields an absolute bound. The following corollary, which, besides all the hypothesis of Bauer-Fike theorem, requires also the non-singularity of A, turns out to be useful whenever a relative bound is needed.

[edit] Corollary

Be A\in\mathbb{C}^{n,n} a non-singular, diagonalizable matrix, and be V\in\mathbb{C}^{n,n} the non singular eigenvector matrix such as A = VΛV − 1. Be moreover μ an eigenvalue of the matrix A + δA; then an eigenvalue \lambda\in\sigma(A) exists such that:

\frac{|\lambda-\mu|}{|\lambda|}\leq\kappa_p (V)\|A^{-1}\delta A\|_p

(Note: \|A^{-1}\delta A\|can be formally viewed as the "relative variation of A", just as | λ − μ | | λ | − 1 is the relative variation of λ.)


[edit] Proof

Since μ is an eigenvalue of (A+δA) and det(A)\ne 0, we have, left-multiplying by A − 1:

-A^{-1}(A+\delta A)\mathbf{v}=-\mu A^{-1}\mathbf{v}

that is, putting\tilde{A}=\mu A^{-1} and \tilde{\delta A}=-A^{-1}\delta A:

(\tilde{A}+\tilde{\delta A}-I)\mathbf{v}=\mathbf{0}

which means that\tilde{\mu}=1is an eigenvalue of(\tilde{A}+\tilde{\delta A}), with \mathbf{v}eigenvector. Now, the eigenvalues of \tilde{A}are \frac{\mu}{\lambda_i}, while its eigenvector matrix is the same as A. Applying Bauer-Fike theorem to the matrix\tilde{A}+\tilde{\delta A} and to its eigenvalue\tilde{\mu}=1, we obtain:

min_{\lambda\in\sigma(A)}\left|\frac{\mu}{\lambda}-1\right|=\ min_{\lambda\in\sigma(A)}\frac{|\lambda-\mu|}{|\lambda|}\leq\kappa_p (V)\|A^{-1}\delta A\|_p



[edit] Remark

If A is normal, V is a unitary matrix, and \|V\|_2=\|V^{-1}\|_2=1, so that κ2(V) = 1.

The Bauer-Fike theorem then becomes:

\exists\lambda\in\sigma(A): |\lambda-\mu|\leq\|\delta A\|_2
(\exists\lambda\in\sigma(A): |\lambda-\tilde{\lambda}|\leq\frac{\|\mathbf{r}\|_2}{\|\mathbf{\tilde{v}}\|_2} in the alternative formulation)

which obviously remains true if A is a Hermitian matrix. In this case, however, a much stronger result holds, known as the Weyl theorem.

[edit] References

  1. F. L. Bauer and C. T. Fike. Norms and exclusion theorems. Numer. Math. 2 (1960), 137-141.
  1. S. C. Eisenstat and I. C. F. Ipsen. Three absolute perturbation bounds for matrix eigenvalues imply relative bounds. SIAM Journal on Matrix Analysis and Applications Vol. 20, N. 1 (1998), 149-158
In other languages