Schwartz-Zippel lemma and testing polynomial identities

From Wikipedia, the free encyclopedia

Polynomial identity testing is the problem of determining whether a given multivariate polynomial is the 0-polynomial or identically equal to 0. The input to the problem is an n-variable polynomial over a field F. It can occur in the following forms:

Algebraic form 
e.g. Is (x_1 + 3x_2 - x_3)(3x_1 + x_4 - 1) \cdots (x_7 - x_2) \equiv 0. To solve this, we can multiply it out and check that all the co-efficients are 0. However, this takes exponential time.
Determinant of a matrix with polynomial entries
Let p(x_1,x_2, \ldots, x_n) be the determinant of the polynomial matrix.
Read-once branching program 
also called binary decision diagrams.

Currently, there is no known sub-exponential time algorithm that can solve this problem deterministically. However, there are randomized polynomial algorithms for testing polynomial identities. The first of these algorithms was discovered independently by Schwartz and Zippel.

Contents

[edit] Schwartz-Zippel theorem

The Schwartz-Zippel theorem is a tool commonly used in probabilistic polynomial identity testing. It bounds the probability that a non-zero polynomial will have roots at randomly selected test points. The formal statement is as follows:

Theorem 1 (Schwartz-Zippel). Let P\in F[x_1,x_2,\ldots,x_n] be a polynomial of degree d\geq0 over a field, F. Let S be a finite subset of F and let r_1,r_2,\ldots,r_n be selected randomly from S. Then

\Pr[P(r_1,r_2,\ldots,r_n)=0]\leq\frac{d}{|S|}.

In the single variable case, this follows directly from the fact that a polynomial of degree d can have no more than d roots. It seems logical, then, to think that a similar statement would hold for multivariable polynomials. This is, in fact, the case.

Proof. The proof is by mathematical induction on n. For n = 1, as was mentioned before, P can have at most d roots. This gives us the base case. Now, assume that the theorem holds for all polynomials in n − 1 variables. We can then consider P to be a polynomial in x1 by writing it as

P(x_1,\dots,x_n)=\sum_{i=0}^d x_1^i P_i(x_2,\dots,x_n)

Since P is not identically 0, there is some i such that Pi is not identically 0. Take the largest such i. Then \deg P_i\leq d-i. Now we randomly pick r_2,\dots,r_n from S. By the induction hypothesis, \Pr[P_i(r_2,\ldots,r_n)=0]\leq\frac{d-i}{|S|}. If P_i(r_2,\ldots,r_n)\neq 0,then P(x_1,r_2,\ldots,r_n) is of degree i so

\Pr[P(r_1,r_2,\ldots,r_n)=0|P_i(r_2,\ldots,r_n)\neq 0]\leq\frac{i}{|S|}

If we denote the event P(r_1,r_2,\ldots,r_n)=0 by A and the event P_i(r_2,\ldots,r_n)=0 by B, we have

\Pr[A] =\Pr[A\cap B]+\Pr[A\cap B^c]
=\Pr[B]\Pr[A|B]+\Pr[B^c]\Pr[A|B^c]
\leq \Pr[B]+\Pr[A|B^c]
\leq \frac{d-i}{|S|}+\frac{i}{|S|}=\frac{d}{|S|}

[edit] Applications

The importance of the Schwartz-Zippel Theorem and Testing Polynomial Identities follows from algorithms which are obtained to problems that can be reduced to the problem of polynomial identity testing.

[edit] Comparison of two polynomials

Given a pair of polynomials p1(x) and p2(x), is

p_1(x) \equiv p_2(x)?

This problem can be solved by reducing it to the problem of polynomial identity testing. It is equivalent to checking if

[p_1(x) - p_2(x)] \equiv 0

Hence if we can determine that

p(x) \equiv 0,

where

p(x) = p_1(x)\;-\;p_2(x),

then we can determine whether the two polynomials are equivalent.

Comparison of two polynomials (and therefore testing polynomial identities) also has applications in 2D-compression, where the problem of finding the equality of two 2D-texts A and B is reduced to the problem of comparing equality of two polynomials PolyA(x,y) and PolyB(x,y)

[edit] Primality testing

Given n \in \mathbb{Z^+}, is n a prime number?

A simple randomized algorithm developed by Manindra Agrawal and Somenath Biswas can determine probabilistically whether n is prime and uses polynomial identity testing to do so.

They propose that all prime numbers n (and only prime numbers) satisfy the following polynomial identity

(1+z)^n = 1+z^n (\mbox{mod}\;n).

Let

\mathcal{P}_n(z) = (1+z)^n - 1 -z^n.\,

Then \mathcal{P}_n(z) = 0\;(\mbox{mod}\;n) iff n is prime. The proof can be found in [4]. However, since this polynomial has degree n, and since n may or may not be a prime, the Schwartz-Zippel method would not work. Agrawal and Biswas use a more sophisticated technique, which divides \mathcal{P}_n by a random monic polynomial of small degree.

Prime numbers are used in a number of applications such hash table sizing, pseudorandom number generators and in key generation for cryptography. Therefore finding very large prime numbers (on the order of 1010,000) becomes very important and efficient primality testing algorithms are required.

[edit] Perfect matching

Let G = (V,E) be a graph of n vertices where n is even. Does G contain a perfect matching?

Theorem 2 (Tutte): A Tutte polynomial is not a 0-polynomial if and only if there exists a perfect matching.

A subset D of E is called a matching if each vertex in V is incident with at most one edge in D. A matching is perfect if each vertex in V has exactly one edge that is incident to it in D. Create a Tutte matrix A in the following way:

A = \begin{bmatrix} a_{11} & a_{12} & \cdots & a_{1\mathit{n}} \\ a_{21} & a_{22} & \cdots & a_{2\mathit{n}} \\ \vdots & \vdots & \ddots & \vdots \\ a_{\mathit{n}1} & a_{\mathit{n}2} & \ldots & a_{\mathit{nn}} \end{bmatrix}

where

a_{ij} = \begin{cases} x_{ij}\;\;\mbox{if}\;(i,j) \in E \mbox{ and } i<j\\ -x_{ji}\;\;\mbox{if}\;(i,j) \in E \mbox{ and } i>j\\ 0\;\;\;\;\mbox{otherwise} \end{cases}

The Tutte polynomial (in the variables xij, i<j ) is then defined as the determinant of this skew-symmetric matrix which coincides with the square of the pfaffian of the matrix A and is non-zero (as polynomial) if and only if a perfect matching exists. We can then use polynomial identity testing to find whether G contains a perfect matching. In the special case of a bipartite graph on n = m + m vertices this matrix takes the form of a block matrix

A = \begin{pmatrix} 0 & X \\ -X^t & 0 \end{pmatrix}

if the first m rows (resp. columns) are indexed with the first subset of the bipartition and the last m rows with the complementary subset. In this case the pfaffian coincides with the usual determinant of the m × m matrix X (up to sign).

[edit] References

  1. Computational Complexity Theory. Rudich, Wigderson, editors. AMS, 2004.
  2. Effective Polynomial Computation. Zippel. Kluwer Academic Publishers, 1993.
  3. Piotr Berman, Marek Karpinski, Lawrence L. Larmore, Wojciech Plandowski and Wojciech Rytter, On the Complexity of Pattern Matching for Highly Compressed Two-Dimensional Texts, J. Comput. Syst. Sci. 65 (2002), 332-350.
  4. Manindra Agrawal, Somenath Biswas. Primality and Identity Testing via Chinese Remaindering. In Journal of the ACM (JACM), pages 429 - 443, 2003.
  5. W.T. Tutte. "The factorization of linear graphs." In The Journal of the London Mathematical Society 22, pages 107-111, 1947.
  6. J. Schwartz, Fast probabilistic algorithms for verification of polynomial identities, Journal of the ACM, vol. 27, pages 701 - 717, 1980.