Bauer-Fike theorem

From Wikipedia, the free encyclopedia

In mathematics, the Bauer-Fike theorem is a standard result in the perturbation theory of the eigenvalue of a complex-valued diagonalizable matrix. In its substance, it states an absolute upper bound for the deviation of one perturbed matrix eigenvalue from a properly chosen eigenvalue of the exact matrix. Informally speaking, what it says is that the sensitivity of the eigenvalues is estimated by the condition number of the matrix of eigenvectors.

1 Theorem (Friedrich L. Bauer, C.T.Fike - 1960)
- 1.1 Proof
2 Theorem (Friedrich L. Bauer, C.T.Fike - 1960) (alternative statement)
- 2.1 Proof
3 Corollary
- 3.1 Proof
4 Remark
5 References

[edit] Theorem (Friedrich L. Bauer, C.T.Fike - 1960)

Let $A\in\mathbb{C}^{n,n}$ be a diagonalizable matrix, and be $V\in\mathbb{C}^{n,n}$ the non singular eigenvector matrix such that $A = V Λ V - 1$ . Be moreover $μ$ an eigenvalue of the matrix $A + δ A$ ; then an eigenvalue $\lambda\in\sigma(A)$ exists such that:

$|\lambda-\mu|\leq\kappa_p (V)\|\delta A\|_p$

where $\kappa_p(V)=\|V\|_p\|V^{-1}\|_p$ is the usual condition number in p-norm.

[edit] Proof

If $\mu\in\sigma(A)$ , we can choose $λ = μ$ and the thesis is trivially verified (since $\kappa_p(V)\geq 1$ ).

So, be $\mu\notin\sigma(A)$ . Then $det(\Lambda-\mu I)\ \ne\ 0$ . $μ$ being an eigenvalue of $A + δ A$ , we have $d e t (A + δ A - μ I) = 0$ and so

$0=\ det(V^{-1})\ det(A+\delta A-\mu I)\ det(V)=det(\Lambda+V^{-1}\delta AV-\mu I)=$

$=det(\Lambda-\mu I)\ det[(\Lambda-\mu I)^{-1}V^{-1}\delta AV +I]$

and, since $det(\Lambda-\mu I)\ \ne\ 0$ as stated above, we must have

$det[(\Lambda-\mu I)^{-1}V^{-1}\delta AV +I]=\ 0$

which reveals the value -1 to be an eigenvalue of the matrix $(Λ - μ I) - 1 V - 1 δ A V$ .

For each consistent matrix norm, we have $|\lambda|\leq\|A\|$ , so, being all p-norms consistent, we can write:

$1\leq\|(\Lambda-\mu I)^{-1}V^{-1}\delta AV\|_p\leq\|(\Lambda-\mu I)^{-1}\|_p\|V^{-1}\|_p\|V\|_p\|\delta A\|_p=$

$=\|(\Lambda-\mu I)^{-1}\|_p\ \kappa_p(V)\|\delta A\|_p$

But $(Λ - μ I) - 1$ being a diagonal matrix, the p-norm is easily computed, and yields:

$\|(\Lambda-\mu I)^{-1}\|_p\ =\ max_{\|\mathbf{x}\|_p\ne 0} \frac{\|(\Lambda-\mu I)^{-1}\mathbf{x}\|_p}{\|\mathbf{x}\|_p}\ =$

$=max_{\lambda\in\sigma(A)}\frac{1}{|\lambda -\mu|}\ =\ \frac{1}{min_{\lambda\in\sigma(A)}|\lambda-\mu|}$

whence:

$min_{\lambda\in\sigma(A)}|\lambda-\mu|\leq\ \kappa_p(V)\|\delta A\|_p$ .

The theorem can also be reformulated to better suit numerical methods. In fact, dealing with real eigensystem problems, one often has an exact matrix $A$ , but knows only an approximate eigenvalue-eigenvector couple, ( $\tilde{\lambda}$ , $\tilde{\mathbf{v}}$ ), and needs to bound the error. The following version comes in help.

[edit] Theorem (Friedrich L. Bauer, C.T.Fike - 1960) (alternative statement)

Let $A\in\mathbb{C}^{n,n}$ be a diagonalizable matrix, and be $V\in\mathbb{C}^{n,n}$ the non singular eigenvector matrix such as $A = V Λ V - 1$ . Be moreover ( $\tilde{\lambda}$ , $\mathbf{\tilde{v}}$ ) an approximate eigenvalue-eigenvector couple, and $\mathbf{r}=A\mathbf{\tilde{v}}-\tilde{\lambda}\mathbf{\tilde{v}}$ ; then an eigenvalue $\lambda\in\sigma(A)$ exists such that:

$|\lambda-\tilde{\lambda}|\leq\kappa_p (V)\frac{\|\mathbf{r}\|_p}{\|\mathbf{\tilde{v}}\|_p}$

where $\kappa_p(V)=\|V\|_p\|V^{-1}\|_p$ is the usual condition number in p-norm.

[edit] Proof

We solve this problem with Tarık's method:

m (otherwise, we can choose  and theorem is proven, since ).

Then $(A-\tilde{\lambda} I)^{-1}$ exists, so we can write:

$\mathbf{\tilde{v}}=(A-\tilde{\lambda} I)^{-1}\mathbf{r}=V(D-\tilde{\lambda} I)^{-1}V^{-1}\mathbf{r}$

since $A$ is diagonalizable; taking the p-norm of both sides, we obtain:

$\|\mathbf{\tilde{v}}\|_p=\|V(D-\tilde{\lambda} I)^{-1}V^{-1}\mathbf{r}\|_p \leq \|V\|_p \|(D-\tilde{\lambda} I)^{-1}\|_p \|V^{-1}\|_p \|\mathbf{r}\|_p=$

$=\kappa_p(V)\|(D-\tilde{\lambda} I)^{-1}\|_p \|\mathbf{r}\|_p.$

But, since $(D-\tilde{\lambda} I)^{-1}$ is a diagonal matrix, the p-norm is easily computed, and yields:

$\|(D-\tilde{\lambda} I)^{-1}\|_p=max_{\|\mathbf{x}\|_p \ne 0}\frac{\|(D-\tilde{\lambda} I)^{-1}\mathbf{x}\|_p}{\|\mathbf{x}\|_p}=$

$=max_{\lambda\in\sigma(A)} \frac{1}{|\lambda-\tilde{\lambda}|}=\frac{1}{min_{\lambda\in\sigma(A)}|\lambda-\tilde{\lambda}|}$

whence:

$min_{\lambda\in\sigma(A)}|\lambda-\tilde{\lambda}|\leq\kappa_p(V)\frac{\|\mathbf{r}\|_p}{\|\mathbf{\tilde{v}}\|_p}.$

The Bauer-Fike theorem, in both versions, yields an absolute bound. The following corollary, which, besides all the hypothesis of Bauer-Fike theorem, requires also the non-singularity of A, turns out to be useful whenever a relative bound is needed.

[edit] Corollary

Be $A\in\mathbb{C}^{n,n}$ a non-singular, diagonalizable matrix, and be $V\in\mathbb{C}^{n,n}$ the non singular eigenvector matrix such as $A = V Λ V - 1$ . Be moreover $μ$ an eigenvalue of the matrix $A + δ A$ ; then an eigenvalue $\lambda\in\sigma(A)$ exists such that:

$\frac{|\lambda-\mu|}{|\lambda|}\leq\kappa_p (V)\|A^{-1}\delta A\|_p$

(Note: $\|A^{-1}\delta A\|$ can be formally viewed as the "relative variation of A", just as $| λ - μ | | λ | - 1$ is the relative variation of λ.)

[edit] Proof

Since μ is an eigenvalue of (A+δA) and $det(A)\ne 0$ , we have, left-multiplying by $- A - 1$ :

$-A^{-1}(A+\delta A)\mathbf{v}=-\mu A^{-1}\mathbf{v}$

that is, putting $\tilde{A}=\mu A^{-1}$ and $\tilde{\delta A}=-A^{-1}\delta A$ :

$(\tilde{A}+\tilde{\delta A}-I)\mathbf{v}=\mathbf{0}$

which means that $\tilde{\mu}=1$ is an eigenvalue of $(\tilde{A}+\tilde{\delta A})$ , with $\mathbf{v}$ eigenvector. Now, the eigenvalues of $\tilde{A}$ are $\frac{\mu}{\lambda_i}$ , while its eigenvector matrix is the same as A. Applying Bauer-Fike theorem to the matrix $\tilde{A}+\tilde{\delta A}$ and to its eigenvalue $\tilde{\mu}=1$ , we obtain:

$min_{\lambda\in\sigma(A)}\left|\frac{\mu}{\lambda}-1\right|=\ min_{\lambda\in\sigma(A)}\frac{|\lambda-\mu|}{|\lambda|}\leq\kappa_p (V)\|A^{-1}\delta A\|_p$

[edit] Remark

If A is normal, V is a unitary matrix, and $\|V\|_2=\|V^{-1}\|_2=1$ , so that $κ 2 (V) = 1$ .

The Bauer-Fike theorem then becomes:

$\exists\lambda\in\sigma(A): |\lambda-\mu|\leq\|\delta A\|_2$

( $\exists\lambda\in\sigma(A): |\lambda-\tilde{\lambda}|\leq\frac{\|\mathbf{r}\|_2}{\|\mathbf{\tilde{v}}\|_2}$ in the alternative formulation)

which obviously remains true if A is a Hermitian matrix. In this case, however, a much stronger result holds, known as the Weyl theorem.

[edit] References

F. L. Bauer and C. T. Fike. Norms and exclusion theorems. Numer. Math. 2 (1960), 137-141.

S. C. Eisenstat and I. C. F. Ipsen. Three absolute perturbation bounds for matrix eigenvalues imply relative bounds. SIAM Journal on Matrix Analysis and Applications Vol. 20, N. 1 (1998), 149-158

Categories: Spectral theory | Mathematical theorems

Bauer-Fike theorem

From Wikipedia, the free encyclopedia

Contents

[edit] Theorem (Friedrich L. Bauer, C.T.Fike - 1960)

[edit] Proof

[edit] Theorem (Friedrich L. Bauer, C.T.Fike - 1960) (alternative statement)

[edit] Proof

[edit] Corollary

[edit] Proof

[edit] Remark

[edit] References

Views

Navigation

Interaction

Search

Languages