Fundamental theorem of algebra

From Wikipedia, the free encyclopedia

In mathematics, the fundamental theorem of algebra states that every non-constant single-variable polynomial with complex coefficients has at least one complex root. Equivalently, the field of complex numbers is algebraically closed.

Sometimes, this theorem is stated as: every non-zero single-variable polynomial, with complex coefficients, has exactly as many complex roots as its degree, if each root is counted up to its multiplicity. Although this at first appears to be a stronger statement, it is an easy consequence of the other form of the theorem, through the use of successive polynomial division by linear factors.

In spite of its name, there is no known purely algebraic proof of the theorem, and many mathematicians believe that such a proof does not exist.[1] Besides, it is not fundamental for modern algebra; its name was given at a time in which algebra was basically about solving polynomial equations with real or complex coefficients.

Contents

[edit] History

Peter Rothe (Petrus Roth), in his book Arithmetica Philosophica (published in 1608), wrote that a polynomial equation of degree n (with real coefficients) may have n solutions. Albert Girard, in his book L'invention nouvelle en l'Algèbre (published in 1629), asserted that a polynomial equation of degree n has n solutions, but he did not state that they had to be real numbers. Furthermore, he added that his assertion holds “unless the equation is incomplete”, by which he meant that no coefficient is equal to 0. However, when he explains in detail what he means, it is clear that he actually believes that his assertion is always true; for instance, he shows that the equation x4 = 4x − 3, although incomplete, has four solutions (counting multiplicities): 1 (twice), −1 + i2, and −1 − i2.

As will be mentioned again below, it follows from the fundamental theorem of algebra that every non-constant polynomial with real coefficients can be written as a product of polynomials with real coefficients whose degree is either 1 or 2. However, in 1702 Leibniz said that no polynomial of the type x4 + a4 (with a real and distinct from 0) can be written in such a way. Later, Nikolaus Bernoulli made the same assertion concerning the polynomial x4 − 4x3 + 2x2 + 4x + 4, but he got a letter from Euler in 1742[2] in which he was told that his polynomial happened to be equal to

(x^2-(2+\alpha)x+1+\sqrt{7}+\alpha)(x^2-(2-\alpha)x+1+\sqrt{7}-\alpha),

where α is the square root of 4 + 2√7, whereas

x^4+a^4=(x^2+a\sqrt{2}x+a^2)(x^2-a\sqrt{2}x+a^2).\,

A first attempt at proving the theorem was made by d'Alembert in 1746, but his proof was incomplete. Among other problems, it assumed implicitly a theorem (now known as Puiseux's theorem) which would be proved only more than a century later (and furthermore the proof assumed the fundamental theorem of algebra). Other attempts were made by Euler (1749), de Foncenex (1759), Lagrange (1772), and Laplace (1795). These last four attempts assumed implicitly Girard's assertion; to be more precise, the existence of solutions was assumed and all that remained to be proved was that their form was a + bi for some real numbers a and b. In modern terms, Euler, de Foncenex, Lagrange, and Laplace were assuming the existence of a splitting field of the polynomial p(z).

At the end of the 18th century two new proofs were published which did not assume the existence of roots. One of them, due to James Wood and mainly algebraic, was published in 1798 and it was totally ignored. Wood's proof had an algebraic gap.[3] The other one was published by Gauss in 1799 and it was mainly geometric, but it had a topological gap. A rigorous proof was published by Argand in 1806; it was here that, for the first time, the fundamental theorem of algebra was stated for polynomials with complex coefficients, rather than just real coefficients. Gauss produced two other proofs in 1816 and another version of his original proof in 1849.

The first textbook containing a proof of the theorem was Cauchy's Cours d'analyse de l'École Royale Polytechnique (1821). It contained Argand's proof, although Argand is not credited for it.

None of the proofs mentioned so far is constructive. It was Weierstrass who raised for the first time, in the middle of the 19th century, the problem of finding a constructive proof of the fundamental theorem of algebra. He presented his solution, that amounts in modern terms to a combination of the Durand-Kerner method with the homotopy continuation principle, in 1891. Another proof of this kind was obtained by Hellmuth Kneser in 1940 and simplified by his son Martin Kneser in 1981.

[edit] Proofs

All proofs below involve some analysis, at the very least the concept of continuity of real or complex functions. Some also use differentiable or even analytic functions.

Some proofs of the theorem only prove that any non-constant polynomial with real coefficients has some complex root. This is enough to establish the theorem in the general case because, given a non-constant polynomial p(z) with complex coefficients, the polynomial

q(z)=p(z)\overline{p(\overline{z})}

has only real coefficients and, if z is a zero of q(z), then either z or its conjugate is a root of p(z).

A large number of non-algebraic proofs of the theorem use the fact (sometimes called “growth lemma”) that an n-th degree polynomial function p(z) whose dominant coefficient is 1 behaves like zn when |z| is large enough. A more precise statement is: there is some positive real number R such that

\tfrac{1}{2}|z^n|<|p(z)|<\tfrac{3}{2}|z^n|

when |z| > R.

[edit] Complex-analytic proofs

Find a closed disk D of radius r centered at the origin such that |p(z)| > |p(0)| whenever |z| ≥ r. The minimum of |p(z)| on D, which must exist since D is compact, is therefore achieved at some point z0 in the interior of D, but not at any point of its boundary. The minimum modulus principle implies then that p(z0) = 0. In other words, z0 is a zero of p(z).

Another analytic proof can be obtained along this line of thought observing that, since |p(z)| > |p(0)| outside D, the minimum of |p(z)| on the whole complex plane is achieved at z0. If |p(z0)| > 0, then 1/p is a bounded holomorphic function in the entire complex plane since, for each complex number z, |1/p(z)| ≤ |1/p(z0)|. Applying Liouville's theorem, which states that a bounded entire function must be constant, this would imply that 1/p is constant and therefore that p is constant. This gives a contradiction, and hence p(z0) = 0.

Yet another analytic proof uses argument principle. Let R be a positive real number large enough so that every root of p(z) has absolute value smaller than R; such a number must exist because every non-constant polynomial function of degree n has at most n zeros. For each r > R, consider the number

\frac{1}{2\pi i}\int_{c(r)}\frac{p'(z)}{p(z)}\,dz,

where c(r) is the circle centered at 0 with radius r oriented counterclockwise; then the argument principle says that this number is the number N of zeros of p(z) in the open ball centered at 0 with radius r, which, since r > R, is the total number of zeros of p(z). On the other hand, the integral of n/z along c(r) divided by 2πi is equal to n. But the difference between the two numbers is

\frac{1}{2\pi i}\int_{c(r)}\frac{p'(z)}{p(z)}-\frac{n}{z}\,dz=\frac{1}{2\pi i}\int_{c(r)}\frac{zp'(z)-np(z)}{zp(z)}\,dz.

The numerator of the rational expression being integrated has degree at most n − 1 and the degree of the denominator is n + 1. Therefore, the number above tends to 0 as r tends to +∞. But the number is also equal to N − n and so N = n.

[edit] Topological proofs

Let z0 ∈ C be such that the minimum of |p(z)| on the whole complex plane is achieved at z0; it was seen at the proof which uses Liouville's theorem that such a number must exist. We can write p(z) as a polynomial in z − z0: there is some natural number k and there are some complex numbers ck, ck + 1, …, cn such that ck ≠ 0 and that

p(z)=p(z_0)+c_k(z-z_0)^k+c_{k+1}(z-z_0)^{k+1}+ \cdots +c_n(z-z_0)^n.

It follows that if a is a kth root of −p(z0)/ck and if t is positive and sufficiently small, then |p(z0 + ta)| < |p(z0)|, which is impossible, since |p(z0)| is the minimum of |p| on D.

For another topological proof by contradiction, suppose that p(z) has no zeros. Choose a large positive number R such that, for |z| = R, the leading term zn of p(z) dominates all other terms combined; in other words, such that |z|n > |an − 1zn −1 + ··· + a0|. As z traverses the circle given by the equation |z| = R once counter-clockwise, p(z), like zn, winds n times counter-clockwise around 0. At the other extreme, with |z| = 0, the “curve” p(z) is simply the single (nonzero) point p(0), whose winding number is clearly 0. If the loop followed by z is continuously deformed between these extremes, the path of p(z) also deforms continuously. We can explicitly write such a deformation as H(Reiθ,t) = p((1 − t)Reiθ) where t is greater than or equal to 0 and less than or equal to 1. If one views the variable t as time, then at time zero the curve is p(z) and at time one the curve is p(0). Clearly at every point t, p(z) cannot be zero by the original assumption, therefore during the deformation, the curve never crosses zero. Therefore the winding number of the curve around zero should never change. However, given that the winding number started as n and ended as 0, this is absurd. Therefore, p(z) has at least one zero.

[edit] Algebraic proofs

These proofs use two facts about real numbers that require only a small amount of analysis (more precisely, the intermediate value theorem):

  • every polynomial with odd degree and real coefficients has some real root;
  • every non-negative real number has a square root.

The second fact, together with the quadratic formula, implies the theorem for real quadratic polynomials.

As mentioned above, it suffices to check the statement “every non-constant polynomial p(z) with real coefficients has a complex root”. This statement can be proved by induction on the greatest non-negative integer k such that 2k divides the degree n of p(z). Let a be the coefficient of zn in p(z) and let F be a splitting field of p(z) (seen as a polynomial with complex coefficients); in other words, the field F contains C and there are elements z1, z2, …, zn in F such that

p(z)=a(z-z_1)(z-z_2) \cdots (z-z_n).

If k = 0, then n is odd, and therefore p(z) has a real root. Now, suppose that n = 2km (with m odd and k > 0) and that the theorem is already proved when the degree of the polynomial has the form 2k − 1m′ with m′ odd. For a real number t, define:

q_t(z)=\prod_{1\le i<j\le n}\left(z-z_i-z_j-tz_iz_j\right).\,

Then the coefficients of qt(z) are symmetric polynomials in the zi's with real coefficients. Therefore, they can be expressed as polynomials with real coefficients in the elementary symmetric polynomials, that is, in −a1, a2, …, (−1)nan. So qt(z) has in fact real coefficients. Furthermore, the degree of qt(z) is n(n − 1)/2 = 2k − 1m(n − 1), and m(n − 1) is an odd number. So, using the induction hypothesis, qt has at least one complex root; in other words, zi + zj + tzizj is complex for two distinct elements i and j from {1,…,n}. Since there are more real numbers than pairs (i,j), one can find distinct real numbers t and s such that zi + zj + tzizj and zi + zj + szizj are complex (for the same i and j). So, both zi + zj and zizj are complex numbers. It is easy to check that every complex number has a complex square root, thus every complex polynomial of degree 2 has a complex root by the quadratic formula. It follows that zi and zj are complex numbers, since they are roots of the quadratic polynomial z2 − (zi + zj)z + zizj.

Another algebraic proof of the fundamental theorem can be given using Galois theory. It suffices to show that C has no proper finite field extension.[4] Let K/C be a finite extension. Since the normal closure of K over R still has a finite degree over C (or R), we may assume without loss of generality that K is a normal extension of R (hence it is a Galois extension, as every algebraic extension of a field of characteristic 0 is separable). Let G be the Galois group of this extension, and let H be a Sylow 2-group of G, so that the order of H is a power of 2, and the index of H in G is odd. By the fundamental theorem of Galois theory, there exists a subextension L of K/R such that Gal(K/L) = H. As [L:R] = [G:H] is odd, and there are no nonlinear irreducible real polynomials of odd degree, we must have L = R, thus [K:R] and [K:C] are powers of 2. Assuming for contradiction [K:C] > 1, the 2-group Gal(K/C) contains a subgroup of index 2, thus there exists a subextension M of C of degree 2. However, C has no extension of degree 2, because every quadratic complex polynomial has a complex root, as mentioned above.

[edit] Corollaries

Since the fundamental theorem of algebra can be seen as the statement that the field of complex numbers is algebraically closed, it follows that any theorem concerning algebraically closed fields applies to the field of complex numbers. Here are a few more consequences of the theorem, which are either about the field of real numbers or about the relationship between the field of real numbers and the field of complex numbers:

  • The field of complex numbers is the algebraic closure of the field of real numbers.
  • Every polynomial in one variable x with real coefficients is the product of a constant, polynomials of the form x + a with a real, and polynomials of the form x2 + ax + b with a and b real and a2 − 4b < 0 (which is the same thing as saying that the polynomial x2 + ax + b has no real roots).
  • Every rational function in one variable x, with real coefficients, can be written as the sum of a polynomial function with rational functions of the form a/(x − b)n (where n is a natural number, and a and b are real numbers), and rational functions of the form (ax + b)/(x2 + cx + d)n (where n is a natural number, and a, b, c, and d are real numbers such that c2 − 4d < 0). A corollary of this is that every rational function in one variable and real coefficients has an elementary primitive.
  • Every algebraic extension of the real field is isomorphic either to the real field or to the complex field.

[edit] Notes

  1. ^ See §1.9 of R. Remmert's text The fundamental theorem of Algebra.
  2. ^ See section Le rôle d'Euler in C. Gilain's article Sur l'histoire du théorème fondamental de l'algèbre: théorie des équations et calcul intégral.
  3. ^ Concerning Wood's proof, see the article A forgotten paper on the fundamental theorem of algebra, by Frank Smithies.
  4. ^ A proof of the fact that this suffices can be seen here.

[edit] References

[edit] Historic sources

  • Gauss, Carl Friedrich (1799), Demonstratio nova theorematis omnem functionem algebraicam rationalem integram unius variabilis in factores reales primi vel secundi gradus resolvi posse, Helmstedt: C. G. Fleckeisen 
  • Weierstraß, Karl (1891). "Neuer Beweis des Satzes, dass jede ganze rationale Function einer Veränderlichen dargestellt werden kann als ein Product aus linearen Functionen derselben Veränderlichen". Sitzungsberichte der königlich preussischen Akademie der Wissenschaften zu Berlin: 1085–1101. 

[edit] Recent literature

  • Gersten, S.M. & Stallings, John R. (1988), “On Gauss's First Proof of the Fundamental Theorem of Algebra”, Proceedings of the AMS 103 (1): 331–332, ISSN 0002-9939 
  • Gilain, Christian (1991), “Sur l'histoire du théorème fondamental de l'algèbre: théorie des équations et calcul intégral”, Archive for History of Exact Sciences 42 (2): 91–136, ISSN 0003-9519 
  • Netto, Eugen & Le Vavasseur, Raymond (1916), “Les fonctions rationnelles §80–88: Le théorème fondamental”, in Meyer, François & Molk, Jules, Encyclopédie des Sciences Mathématiques Pures et Appliquées, tome I, vol. 2, Éditions Jacques Gabay, 1992, ISBN 2-87647-101-9 
  • Smithies, Frank (2000), “A forgotten paper on the fundamental theorem of algebra”, Notes & Records of the Royal Society 54 (3): 333–341, ISSN 0035-9149 

[edit] External links