Diagonalizable matrix
From Wikipedia, the free encyclopedia
In linear algebra, a square matrix A is called diagonalizable if it is similar to a diagonal matrix, i.e. if there exists an invertible matrix P such that P −1AP is a diagonal matrix. If V is a finite-dimensional vector space, then a linear map T : V → V is called diagonalizable if there exists a basis of V with respect to which T is represented by a diagonal matrix. Diagonalization is the process of finding a corresponding diagonal matrix for a diagonalizable matrix or linear map.
Diagonalizable matrices and maps are of interest because diagonal matrices are especially easy to handle: their eigenvalues and eigenvectors are known and one can raise a diagonal matrix to a power by simply raising the diagonal entries to that same power.
The Jordan-Chevalley decomposition expresses an operator as the sum of its diagonal part and its nilpotent part.
Contents |
[edit] Characterization
The fundamental fact about diagonalizable maps and matrices is expressed by the following:
- An n-by-n matrix A over the field F is diagonalizable if and only if the sum of the dimensions of its eigenspaces is equal to n, which is the case if and only if there exists a basis of Fn consisting of eigenvectors of A. If such a basis has been found, one can form the matrix P having these basis vectors as columns, and P -1AP will be a diagonal matrix. The diagonal entries of this matrix are the eigenvalues of A.
- A linear map T : V → V is diagonalizable if and only if the sum of the dimensions of its eigenspaces is equal to dim(V), which is the case if and only if there exists a basis of V consisting of eigenvectors of T. With respect to such a basis, T will be represented by a diagonal matrix. The diagonal entries of this matrix are the eigenvalues of T.
Another characterization: A matrix or linear map is diagonalizable over the field F if and only if its minimal polynomial is a product of distinct linear factors over F.
The following sufficient (but not necessary) condition is often useful.
- An n-by-n matrix A is diagonalizable over the field F if it has n distinct eigenvalues in F, i.e. if its characteristic polynomial has n distinct roots in F.
- A linear map T : V → V with n=dim(V) is diagonalizable if it has n distinct eigenvalues, i.e. if its characteristic polynomial has n distinct roots in F.
As a rule of thumb, over C almost every matrix is diagonalizable. More precisely: the set of complex n-by-n matrices that are not diagonalizable over C, considered as a subset of Cn×n, has Lebesgue measure zero. One can also say that the diagonalizable matrices form a dense subset with respect to the Zariski topology: the complement lies inside the set where the discriminant of the characteristic polynomial vanishes, which is a hypersurface. From that follows also density in the usual (strong) topology given by a norm.
The same is not true over R. As n increases, it becomes (in some sense) less and less likely that a randomly selected real matrix is diagonalizable over R.
[edit] Examples
[edit] Diagonalizable matrices
- Involutions are diagonalizable over the reals (and indeed any field of characteristic not 2), with 1's and -1's on the diagonal
- Finite order endomorphisms (including involutions) are diagonalizable over the complex numbers, or any algebraically closed field where the characteristic of the field doesn't divide the order of the endomorphism (as the roots of unity are distinct), with roots of unity on the diagonal. This is a part of representation theory of cyclic groups.
- Projections are diagonalizable, with 0's and 1's on the diagonal.
[edit] Matrices that are not diagonalizable
Some matrices are not diagonalizable over any field, most notably nilpotent matrices. This happens more generally if the algebraic and geometric multiplicities of an eigenvalue do not coincide. For instance, consider
This matrix is not diagonalizable: there is no matrix U such that U − 1CU is a diagonal matrix. Indeed, C has one eigenvalue (namely zero) and this eigenvalue has algebraic multiplicity 2 and geometric multiplicity 1.
Some real matrices are not diagonalizable over the reals. Consider for instance the matrix
The matrix B does not have any real eigenvalues, so there is no real matrix Q such that Q − 1BQ is a diagonal matrix. However, we can diagonalize B if we allow complex numbers. Indeed, if we take
then Q − 1BQ is diagonal.
[edit] How to diagonalize a matrix
Consider a matrix
This matrix has eigenvalues
So A is a 3-by-3 matrix with 3 different eigenvalues, therefore it is diagonalizable.
If we want to diagonalize A, we need to compute the corresponding eigenvectors. They are
One can easily check that Avk = λkvk.
Now, let P be the matrix with these eigenvectors as its columns:
Then P diagonalizes A, as a simple computation confirms:
Note that the eigenvalues λk appear in the diagonal matrix.
[edit] An application
Diagonalization can be used to compute the powers of a matrix A efficiently, provided the matrix is diagonalizable. Suppose we have found that
is a diagonal matrix. Then, as the matrix product is associative,
and the latter is easy to calculate since it only involves the powers of a diagonal matrix.
This is particularly useful in finding closed form expressions for terms of a linear recursive sequences, such as the Fibonacci numbers.
[edit] Particular application
For example, consider the following matrix:
Calculating the various powers of M reveals a surprising pattern:
The above phenomenon can be explained by diagonalizing M. To accomplish this, we need a basis of R2 consisting of eigenvectors of M. One such eigenvector basis is given by
where ei denotes the standard basis of Rn. The reverse change of basis is given by
Straightforward calculations show that
Thus, a and b are the eigenvalues corresponding to u and v, respectively. By linearity of matrix multiplication, we have that
Switching back to the standard basis, we have
The preceding relations, expressed in matrix form, are
thereby explaining the above phenomenon.
[edit] Quantum mechanical application
In quantum mechanical and quantum chemical computations matrix diagonalization is one of the most frequently applied numerical processes. The basic reason is that the time-independent Schrödinger equation is an eigenvalue equation, albeit in most of the physical situations on an infinite dimensional space (a Hilbert space). A very common approximation is to truncate Hilbert space to finite dimension, after which the Schrödinger equation can be formulated as an eigenvalue problem of a real symmetric, or complex Hermitian, matrix. Formally this approximation is founded on the variational principle, valid for Hamiltonians that are bounded from below. But also first-order perturbation theory for degenerate states leads to a matrix eigenvalue problem.
[edit] See also
[edit] External links
[edit] References
- Roger A. Horn and Charles R. Johnson, Matrix Analysis, Chapter 1, Cambridge University Press, 1985. ISBN 0-521-30586-1 (hardback), ISBN 0-521-38632-2 (paperback).