Minimal polynomial (linear algebra)

From Wikipedia, the free encyclopedia

For the minimal polynomial of an algebraic element of a field, see minimal polynomial (field theory).

In linear algebra, the minimal polynomial of an n-by-n matrix A over a field F is the monic polynomial p(x) over F of least degree such that p(A)=0. Any other polynomial q with q(A) = 0 is a (polynomial) multiple of p.

The following three statements are equivalent:

  1. λ∈F is a root of p(x),
  2. λ is a root of the characteristic polynomial of A,
  3. λ is an eigenvalue of A.

The multiplicity of a root λ of p(x) is the size of the largest Jordan block corresponding to λ.

The minimal polynomial is not always the same as the characteristic polynomial. Consider the matrix 4In, which has characteristic polynomial (x − 4)n. However, the minimal polynomial is x − 4, since 4I − 4I = 0 as desired, so they are different for n\ge 2. That the minimal polynomial always divides the characteristic polynomial is a consequence of the Cayley–Hamilton theorem.

[edit] Formal definition

Given an endomorphism T on a vector space V over a field \mathbb{K}, let IT be the set defined as

\mathit{I}_T = \{ p \in \mathbb{K}[t] \; | \; p(T) = O \}

where \mathbb{K}[t] is the space of all polynomials over the field \mathbb{K}. It is easy to show that I T is a proper ideal of \mathbb{K}.

Thus it must be the polynomial of least degree in I T .

[edit] How to compute it

Let I T, v be defined as

\mathit{I}_{T, v} = \{ p \in \mathbb{K}[t] \; | \; v \in \mbox{Ker} \; p(T) \} = \{ p \in \mathbb{K}[t] \; | \; p(T)(v) = O \}

This definition satisfies the properties of a proper ideal. Let μT,v be the monic polynomial which generates it.

Properties

  • It is easy to see that IT, v divides IT.
  • If d is the greatest natural number such that v, T(v), ... , Td(v) are linearly independent, then
\alpha_0 v + \alpha_1 T(v) + \cdots + \alpha_n T^d (v) + T^{d+1} (v) = O for some alpha in \mathbb{K} and
\mu_{T,v} (t) = \alpha_0  + \alpha_1 t + \cdots + \alpha_n t^d  + t^{d+1}
  • Given a basis {v1,..., vn} the minimal polynomial is the least commond multiple of \mu_{T,v_i} for all i = 1, ... , n.

[edit] Example

Define T on R3 with the matrix

\begin{bmatrix} 1 & -1 & -1 \\ 1 & -2 & 1 \\ 0 & 1 & -3 \end{bmatrix}

Take

e_1 = \begin{vmatrix} 1 \\ 0 \\ 0 \end{vmatrix}

We see that e1, T(e1), T2(e1) are linearly independent while by adding T3(e1) to this list they are no more. Thus it's easy to find \mu_{T,e_1}, repeat it with e2 and e3 and calculate the least common multiple of them.