Inclusion-exclusion principle

From Wikipedia, the free encyclopedia

In combinatorial mathematics, the inclusion-exclusion principle (also known as the sieve principle) states that if A1, ..., An are finite sets, then

\left|\bigcup_{i=1}^n A_i\right|=\sum_{i=1}^n\left|A_i\right| -\sum_{i,j\,:\,i < j}\left|A_i\cap A_j\right|+\sum_{i,j,k\,:\,i<j<k}\left|A_i\cap A_j\cap A_k\right|-\ \cdots\cdots\ \pm \left|A_1\cap\cdots\cap A_n\right|

where |A| denotes the cardinality of the set A. For example, taking n = 2, we get a special case of double counting: in words, we can count the size of the union of sets A and B by adding |A| and |B| and then subtracting the size of their intersection. The name comes from the idea that the principle is based on over-generous inclusion, followed by compensating exclusion. When n > 2 the exclusion of the pairwise intersections is (possibly) too severe, and the correct formula is as shown with alternating signs.
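The alternating sum can be checked numerically on small sets. The following is a minimal sketch, not an efficient algorithm; the function name and the example sets are illustrative assumptions rather than part of the statement above.

```python
from itertools import combinations

def union_size_by_inclusion_exclusion(sets):
    """Return |A_1 ∪ ... ∪ A_n| computed from the alternating sum."""
    total = 0
    for k in range(1, len(sets) + 1):
        # Intersections of k sets are added for odd k, subtracted for even k.
        sign = 1 if k % 2 == 1 else -1
        for combo in combinations(sets, k):
            total += sign * len(set.intersection(*combo))
    return total

# Hypothetical example data.
example_sets = [{1, 2, 3, 4}, {3, 4, 5}, {4, 5, 6, 7}]
print(union_size_by_inclusion_exclusion(example_sets))  # 11 - 5 + 1 = 7
print(len(set().union(*example_sets)))                  # direct count: 7
```

The loop visits every non-empty subfamily of the sets, so it evaluates 2^n - 1 intersections; it is meant only to make the formula concrete.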

This formula is attributed to Abraham de Moivre; it is sometimes also named for Joseph Sylvester or Henri Poincaré.

Inclusion-exclusion illustrated for three sets

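For the case of three sets A, B, C the inclusion-exclusion principle reads

\left|A\cup B\cup C\right|=\left|A\right|+\left|B\right|+\left|C\right|-\left|A\cap B\right|-\left|A\cap C\right|-\left|B\cap C\right|+\left|A\cap B\cap C\right|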

Proof

To prove the inclusion-exclusion principle in general, let X be a superset of all A1, ..., An. The formula follows by first proving the identity

1_{\bigcup_{i=1}^n A_i}=\sum_{i=1}^n 1_{A_i} -\sum_{i,j\,:\,i<j}1_{A_i\cap A_j}
+\sum_{i,j,k\,:\,i<j<k}1_{A_i\cap A_j\cap A_k}-\ \cdots\cdots\ \pm 1_{A_1\cap\cdots\cap A_n}

which is shown by manipulating indicator functions, and then summing over all x in X.
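One common way to carry out the indicator-function manipulation is sketched here; it may differ in detail from the argument the source has in mind. A point lies outside the union exactly when it lies outside every A_i, so, with the convention that the empty intersection is X,

1-1_{\bigcup_{i=1}^n A_i}=\prod_{i=1}^n\left(1-1_{A_i}\right)=\sum_{S\,:\,S\subseteq\{1,\dots,n\}}(-1)^{\left|S\right|}\prod_{i\in S}1_{A_i}=\sum_{S\,:\,S\subseteq\{1,\dots,n\}}(-1)^{\left|S\right|}\,1_{\bigcap_{i\in S}A_i}

The term for the empty S equals 1 and cancels against the 1 on the left. Multiplying the remaining terms by -1 gives the identity for the indicator of the union, and summing both sides over the points of X replaces each indicator by the cardinality of the corresponding set.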

Other forms

The principle is sometimes stated in the form that says that if

g(A)=\sum_{S\,:\,S\subseteq A}f(S)

then

f(A)=\sum_{S\,:\,S\subseteq A}(-1)^{\left|A\right|-\left|S\right|}g(S)

In that form it is seen to be the Möbius inversion formula for the incidence algebra of the partially ordered set of all subsets of A.
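The inversion can be checked by brute force on a small ground set. This is a minimal sketch under stated assumptions: the helper names (`subsets`, `zeta_transform`, `mobius_inversion`) and the choice of f are hypothetical and serve only to exercise the two formulas.

```python
from itertools import combinations

def subsets(s):
    """Yield every subset of s as a frozenset."""
    items = sorted(s)
    for k in range(len(items) + 1):
        for combo in combinations(items, k):
            yield frozenset(combo)

def zeta_transform(f, ground):
    """g(A) = sum of f(S) over all S ⊆ A, for every A ⊆ ground."""
    return {A: sum(f[S] for S in subsets(A)) for A in subsets(ground)}

def mobius_inversion(g, ground):
    """f(A) = sum of (-1)^(|A|-|S|) g(S) over all S ⊆ A."""
    return {
        A: sum((-1) ** (len(A) - len(S)) * g[S] for S in subsets(A))
        for A in subsets(ground)
    }

ground = frozenset({1, 2, 3})
# An arbitrary function on subsets, chosen only for illustration.
f = {S: 10 * len(S) + sum(S) for S in subsets(ground)}
g = zeta_transform(f, ground)
recovered = mobius_inversion(g, ground)
print(recovered == f)  # the inversion recovers f from g
```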

The inclusion-exclusion principle can also be used in probability where it becomes:

\Pr\left(\bigcup_{i=1}^n A_i\right)=\sum_{i=1}^n \Pr\left(A_i\right) -\sum_{i,j\,:\,i<j}\Pr\left(A_i\cap A_j\right)+\sum_{i,j,k\,:\,i<j<k}\Pr\left(A_i\cap A_j\cap A_k\right)-\ \cdots\cdots\ \pm \Pr\left(\bigcap_{i=1}^n A_i\right).

According to the Bonferroni inequalities, truncating the right-hand side after its first k sums gives alternately an upper bound (k odd) and a lower bound (k even) for the left-hand side. This can be used in cases where the full formula is too cumbersome.
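The alternating bounds can be observed on a toy example. The sketch below assumes a uniform probability on a finite sample space, so that each probability is a ratio of set sizes; the function name and the events are hypothetical.

```python
from fractions import Fraction
from itertools import combinations

def bonferroni_partial_sums(events, sample_space_size):
    """Return the partial sums obtained by truncating the formula after
    1, 2, ..., n of its sums, assuming a uniform probability."""
    partial = Fraction(0)
    sums = []
    for k in range(1, len(events) + 1):
        # s_k is the sum of Pr(intersection) over all k-element subfamilies.
        s_k = sum(
            Fraction(len(set.intersection(*combo)), sample_space_size)
            for combo in combinations(events, k)
        )
        partial += (-1) ** (k + 1) * s_k
        sums.append(partial)
    return sums

# Hypothetical events in the sample space {0, 1, ..., 11}.
events = [{0, 1, 2, 3, 4, 5}, {4, 5, 6, 7}, {5, 7, 8, 9}, {0, 5, 9, 10}]
exact = Fraction(len(set().union(*events)), 12)
partial_sums = bonferroni_partial_sums(events, 12)
for k, value in enumerate(partial_sums, start=1):
    kind = "upper" if k % 2 == 1 else "lower"
    print(k, value, kind, "bound for", exact)
```

The last partial sum uses the full formula and therefore equals the exact probability.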

Applications

In many cases where the principle could give an exact formula (in particular, counting prime numbers using the sieve of Eratosthenes), the resulting formula is of little practical use because it contains too many terms. Even if each term individually can be estimated accurately, the accumulation of errors may mean that the inclusion-exclusion formula is not directly applicable. In number theory, this difficulty was addressed by Viggo Brun. After a slow start, his ideas were taken up by others, and a large variety of sieve methods developed. These may, for example, seek upper bounds for the "sieved" sets rather than an exact formula.
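The growth in the number of terms can be seen in a small experiment. The sketch below counts the integers from 1 to n that are divisible by no prime up to the square root of n (the number 1 together with the primes above the square root) by inclusion-exclusion over those small primes, and also reports how many terms were summed. The function names are hypothetical, and the approach is purely illustrative rather than a practical way to count primes.

```python
from itertools import combinations
from math import isqrt, prod

def primes_up_to(m):
    """Primes <= m by trial division (adequate for small m)."""
    return [p for p in range(2, m + 1)
            if all(p % d for d in range(2, isqrt(p) + 1))]

def count_unsieved(n):
    """Return (count, terms): the number of integers in 1..n divisible by
    no prime <= sqrt(n), and the number of terms in the alternating sum."""
    small_primes = primes_up_to(isqrt(n))
    total = 0
    terms = 0
    for k in range(len(small_primes) + 1):
        for combo in combinations(small_primes, k):
            # n // prod(combo) counts the multiples of every prime in combo.
            total += (-1) ** k * (n // prod(combo))
            terms += 1
    return total, terms

print(count_unsieved(100))  # (22, 16): the number 1 plus the 21 primes in (10, 100]
```

With j small primes the sum has 2^j terms, which is the excess of terms described above.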

Derangements

A well-known application of the inclusion-exclusion principle is to the combinatorial problem of counting all derangements of a finite set. A derangement of a set A is a bijection from A onto itself that has no fixed points. Via the inclusion-exclusion principle one can show that if A has n elements, with n at least 1, then the number of derangements is [n! / e], where [x] denotes the nearest integer to x.

This is also known as the subfactorial of n, written !n. It follows that if all bijections are assigned the same probability then the probability that a random bijection is a derangement quickly approaches 1/e as n grows.
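As a sketch of where the count comes from: for each set of k points there are (n - k)! bijections fixing at least those points, and there are n!/(k!(n - k)!) such sets, so inclusion-exclusion gives the alternating sum of n!/k! over k = 0, ..., n. The code below evaluates that sum and compares it with a brute-force count and with the nearest integer to n!/e. The function names are hypothetical.

```python
from itertools import permutations
from math import e, factorial

def derangements_by_inclusion_exclusion(n):
    """Sum over k of (-1)^k * n!/k!, the inclusion-exclusion count."""
    return sum((-1) ** k * (factorial(n) // factorial(k)) for k in range(n + 1))

def derangements_by_brute_force(n):
    """Count permutations of 0..n-1 with no fixed point (small n only)."""
    return sum(
        all(p[i] != i for i in range(n))
        for p in permutations(range(n))
    )

for n in range(1, 8):
    print(n,
          derangements_by_inclusion_exclusion(n),
          derangements_by_brute_force(n),
          round(factorial(n) / e))
```

The floating-point comparison with n!/e is reliable only for small n; the integer sum is exact for any n.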

Counting intersections

The principle of inclusion-exclusion, combined with de Morgan's theorem, can be used to count the size of an intersection of sets as well. Let \overline{A_k} represent the complement of A_k with respect to some universal set A such that A_k \subseteq A for each k. Then we have

\bigcap_{i=1}^N A_i = \overline{\bigcup_{i=1}^N \overline{A_i} }

thereby turning the problem of finding an intersection into the problem of finding a union.
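The reduction can be sketched in code as follows: take complements within the universal set, count their union by inclusion-exclusion, and subtract the result from the size of the universal set. The function name and the example sets are hypothetical.

```python
from itertools import combinations

def intersection_size_via_complements(sets, universe):
    """Return |A_1 ∩ ... ∩ A_N| as |universe| minus the size of the union
    of the complements, the latter computed by inclusion-exclusion."""
    complements = [universe - s for s in sets]
    union_of_complements = 0
    for k in range(1, len(complements) + 1):
        for combo in combinations(complements, k):
            union_of_complements += (-1) ** (k + 1) * len(set.intersection(*combo))
    return len(universe) - union_of_complements

# Hypothetical example: multiples of 2, 3 and 5 among 1..30.
universe = set(range(1, 31))
sets = [{m for m in universe if m % d == 0} for d in (2, 3, 5)]
print(intersection_size_via_complements(sets, universe))  # 1, namely {30}
```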

References

This article incorporates material from principle of inclusion-exclusion on PlanetMath, which is licensed under the GFDL.