Sturm's theorem
From Wikipedia, the free encyclopedia
In mathematics, Sturm's theorem is a symbolic procedure to determine the number of unique real roots of a polynomial. It was named for its discoverer, Jacques Charles François Sturm.
Whereas the fundamental theorem of algebra readily yields the number of real or complex roots of a polynomial, counted according to their multiplicities, Sturm's theorem deals with real roots and disregards their multiplicities.
Contents |
[edit] Sturm chains
To apply Sturm's theorem, we first construct a Sturm chain or sequence from a square-free polynomial
A Sturm chain or Sturm sequence is the sequence of intermediary results when applying Euclid's algorithm to X and its derivative X1 = X′.
To obtain the Sturm chain, compute
That is, successively take the remainders with polynomial division and change their signs. Each Xi is a polynomial of degree at least one, hence the algorithm terminates. The final polynomial, Xr, is the GCD of X and its derivative. If X only has simple roots, Xr will be a constant. The Sturm chain then is
[edit] Statement
Let σ(ξ) be the number of sign changes (zeroes are not counted) in the sequence
where X is a square-free polynomial. Sturm's theorem then states that for two real numbers a < b, the number of unique roots in the half-open interval (a,b] is σ(a)−σ(b).
[edit] Applications
This can be used to compute the total number of real roots a polynomial has (to use, for example, as an input to a numerical root finder) by choosing a and b appropriately. For example, a bound due to Cauchy says that all real roots of a polynomial with coefficients ai are in the interval [−M,M], where
Alternatively, we can use the fact that for large x, the sign of a polynomial
is sgn(an), whereas sgn(P(−x)) is sgn((−1)nan).
In this way, simply counting the sign changes in the leading coefficients in the Sturm chain readily gives the number of distinct real roots of a polynomial.
We can also determine the multiplicity of a given root, say ξ, with the help of Sturm's theorem. Indeed, suppose we know a and b bracketing ξ, with σ(a)−σ(b) = 1. Then, ξ has multiplicity m precisely when ξ is a root with multiplicity m−1 of Xr (since it is the GCD of X and its derivative).
[edit] Generalized Sturm chains
Let ξ be in the compact interval [a,b]. A generalized Sturm chain over [a,b] is a finite sequence of real polynomials (X0,X1,…,Xr) such that:
- X(a)X(b) ≠ 0
- sgn(Xr) is constant on [a,b]
- If Xi(ξ) = 0 for 1 ≤ i ≤ r−1, then Xi−1(ξ)Xi+1(ξ) < 0
One can check that each Sturm chain is indeed a generalized Sturm chain.