In linear algebra, an n-by-n (square) matrix A is called invertible or non-singular if there exists an n-by-n matrix B such that
where In denotes the n-by-n identity matrix and the multiplication used is ordinary matrix multiplication. If this is the case, then the matrix B is uniquely determined by A and is called the inverse of A, denoted by A−1. It follows from the theory of matrices that if
for square matrices A and B, then also
Non-square matrices (m-by-n matrices for which m ≠ n) do not have an inverse. However, in some cases such a matrix may have a left inverse or right inverse. If A is m-by-n and the rank of A is equal to n, then A has a left inverse: an n-by-m matrix B such that BA=I. If A has rank m, then it has a right inverse: an n-by-m matrix B such that AB=I.
While the most common case is that of matrices over the real or complex numbers, all these definitions can be given for matrices over any commutative ring.
A square matrix that is not invertible is called singular or degenerate. A square matrix is singular if and only if its determinant is 0.
Matrix inversion is the process of finding the matrix B that satisfies the prior equation for a given invertible matrix A.
Contents |
Let A be a square n by n matrix over a field (for example the field of real numbers). Then the following statements are equivalent:
In general, a square matrix over a commutative ring is invertible if and only if its determinant is a unit in that ring.
A matrix that is its own inverse, i.e. and , is called an involution.
If A1, A2, ..., An are nonsingular square matrices over a field, then
It becomes evident why this is the case if one attempts to find an inverse for the product of the Ais from first principles, that is, that we wish to determine B such that
where B is the inverse matrix of the product. To remove A1 from the product, we can then write
which would reduce the equation to
Likewise, then, from
which simplifies to
If one repeat the process up to An, the equation becomes
but B is the inverse matrix, i.e. so the property is established.
Over the field of real numbers, the set of singular n-by-n matrices, considered as a subset of , is a null set, i.e., has Lebesgue measure zero. This is true because singular matrices are the roots of the polynomial function in the entries of the matrix given by the determinant. Thus in the language of measure theory, almost all n-by-n matrices are invertible. Intuitively, this means that if you pick a random square matrix over the reals, the probability that it will be singular is zero.
Furthermore the n-by-n invertible matrices are a dense open set in the topological space of all n-by-n matrices. Equivalently, the set of singular matrices is closed and nowhere dense in the space of n-by-n matrices.
In practice however, one may encounter non-invertible matrices. And in numerical calculations, matrices which are invertible, but close to a non-invertible matrix, can still be problematic; such matrices are said to be ill-conditioned.
Gauss–Jordan elimination is an algorithm that can be used to determine whether a given matrix is invertible and to find the inverse. An alternative is the LU decomposition which generates an upper and a lower triangular matrices which are easier to invert. For special purposes, it may be convenient to invert matrices by treating mn-by-mn matrices as m-by-m matrices of n-by-n matrices, and applying one or another formula recursively (other sized matrices can be padded out with dummy rows and columns). For other purposes, a variant of Newton's method may be convenient (particularly when dealing with families of related matrices, so inverses of earlier matrices can be used to seed generating inverses of later matrices).
Writing another special matrix of cofactors, known as an adjugate matrix, can also be an efficient way to calculate the inverse of small matrices, but this recursive method is inefficient for large matrices. To determine the inverse, we calculate a matrix of cofactors:
where |A| is the determinant of A, Cij is the matrix cofactor, and AT represents the matrix transpose.
For most practical applications, it is not necessary to invert a matrix to solve a system of linear equations; however, for a unique solution, it is necessary that the matrix involved be invertible.
Decomposition techniques like LU decomposition are much faster than inversion, and various fast algorithms for special classes of linear systems have also been developed.
The cofactor equation listed above yields the following result for 2×2 matrices. Inversion of these matrices can be done easily as follows: [2]
This is possible because 1/(ad-bc) is the reciprocal of the determinant of the matrix in question, and the same strategy could be used for other matrix sizes.
Matrices can also be inverted blockwise by using the following analytic inversion formula:
|
where A, B, C and D are matrix sub-blocks of arbitrary size. (A and D must, of course, be square, so that they can be inverted). This strategy is particularly advantageous if A is diagonal and D−CA-1B (the Schur complement of A) is a small matrix, since they are the only matrices requiring inversion. This technique was reinvented several times and is due to Hans Bolz (1923), who used it for the inversion of geodetic matrices, and Tadeusz Banachiewicz (1937), who generalized it and proved its correctness.
The nullity theorem says that the nullity of A equals the nullity of the sub-block in the lower right of the inverse matrix, and that the nullity of B equals the nullity of the sub-block in the upper right of the inverse matrix.
The inversion procedure that led to Equation (1) performed matrix block operations that operated on C and D first. Instead, if A and B are operated on first, the result is
|
Equating Equations (1) and (2) leads to
|
where Equation (3) is the matrix inversion lemma, which is equivalent to the binomial inverse theorem.
First, multiply the RHS of Equation (3) by the inverse of the LHS to get
Note that
If we can show that , then the terms would be cancelled out. This is accomplished by noting
Hence we have shown that is indeed equal to . After the cancellation of the term, all there is left is the identity matrix; and the proof is completed.
Suppose that the matrix A depends on a parameter t. Then the derivative of the inverse of A with respect to t is given by
This formula can be found by differentiating the identity
Some of the properties of inverse matrices are shared by (Moore-Penrose) pseudoinverses, which can be defined for any m-by-n matrix.
Matrix inversion plays a significant role in computer graphics, particularly in 3D graphics rendering and 3D simulations. Examples include screen-to-world ray casting, world-to-subspace-to-world object transformations, and physical simulations. The problem is usually the numerical complexity of calculating the inverses of 3×3 and 4×4 matrices. Compared to matrix multiplication or creation of rotation matrices, matrix inversion is several orders of magnitude slower. There are existing solutions which use hand-crafted assembly language routines and SIMD processor extensions (SSE, SSE2, Altivec) that address this problem and which achieve a performance improvement of as much as five times.