Poisson binomial distribution

Poisson binomial
Parameters	$\mathbf{p}\in [0,1]^n$ — success probabilities for each of the n trials
Support	k ∈ { 0, …, n }
pmf	$\sum\limits_{A\in F_k} \prod\limits_{i\in A} p_i \prod\limits_{j\in A^c} (1-p_j)$
CDF	$\sum\limits_{l=0}^k \sum\limits_{A\in F_l} \prod\limits_{i\in A} p_i \prod\limits_{j\in A^c} (1-p_j)$
Mean	$\sum\limits_{i=1}^n p_i$
Variance	$\sigma^2 =\sum\limits_{i=1}^n (1 - p_i)p_i$
Skewness	$\frac{1}{\sigma^3}\sum\limits_{i=1}^n ( 1-2p_i ) ( 1-p_i ) p_i$
Ex. kurtosis	$\frac{1}{\sigma^4}\sum\limits_{i=1}^n ( 1 - 6(1 - p_i)p_i )( 1 - p_i )p_i$
MGF	$\prod\limits_{j=1}^n (1-p_j+p_j e^t)$
CF	$\prod\limits_{j=1}^n (1-p_j+p_j e^{it})$

In probability theory and statistics, the Poisson binomial distribution is the discrete probability distribution of a sum of independent Bernoulli trials that are not necessarily identically distributed. The concept is named after Siméon Denis Poisson.

In other words, it is the probability distribution of the number of successes in a sequence of n independent yes/no experiments with success probabilities $p_1, p_2, \dots , p_n$ . The ordinary binomial distribution is a special case of the Poisson binomial distribution, when all success probabilities are the same, that is $p_1 = p_2 = \cdots = p_n$ .

Mean and variance

Since a Poisson binomial distributed variable is a sum of n independent Bernoulli distributed variables, its mean and variance will simply be sums of the mean and variance of the n Bernoulli distributions:

\mu = \sum\limits_{i=1}^n p_i

\sigma^2 =\sum\limits_{i=1}^n (1-p_i) p_i

For fixed values of the mean ( $\mu$ ) and size (n), the variance is maximal when all success probabilities are equal and we have a binomial distribution. When the mean is fixed, the variance is bounded from above by the variance of the Poisson distribution with the same mean which is attained asymptotically as n tends to infinity.

Probability mass function

The probability of having k successful trials out of a total of n can be written as the sum ^[1]

\Pr(K=k) = \sum\limits_{A\in F_k} \prod\limits_{i\in A} p_i \prod\limits_{j\in A^c} (1-p_j)

where $F_k$ is the set of all subsets of k integers that can be selected from {1,2,3,...,n}. For example, if n = 3, then $F_2=\left\{ \{1,2\},\{1,3\},\{2,3\} \right\}$ . $A^c$ is the complement of $A$ , i.e. $A^c =\{1,2,3,\dots,n\}\setminus A$ .

$F_k$ will contain $n!/((n-k)!k!)$ elements, the sum over which is infeasible to compute in practice unless the number of trials n is small (e.g. if n = 30, $F_{15}$ contains over 10²⁰ elements). There are luckily more efficient ways to calculate $\Pr(K=k)$ .

As long as none of the success probabilities are equal to one, one can calculate the probability of k successes using the recursive formula ^[2] ^[3]

\Pr (K=k)= \begin{cases} \prod\limits_{i=1}^n (1-p_i) & k=0 \\ \frac{1}{k} \sum\limits_{i=1}^k (-1)^{i-1}\Pr (K=k-i)T(i) & k>0 \\ \end{cases}

where

T(i)=\sum\limits_{j=1}^n \left( \frac{p_j}{1-p_j} \right)^i.

The recursive formula is not numerically stable, and should be avoided if $n$ is greater than approximately 20. Another possibility is using the discrete Fourier transform ^[4]

\Pr (K=k)=\frac{1}{n+1} \sum\limits_{l=0}^n C^{-lk} \prod\limits_{m=1}^n \left( 1+(C^l-1) p_m \right)

where $C=\exp \left( \frac{2i\pi }{n+1} \right)$ and $i=\sqrt{-1}$ .

Still other methods are described in .^[5]

Entropy

There is no simple formula for the entropy of a Poisson binomial distribution, but the entropy can be upper bounded by that entropy of a binomial distribution with the same number parameter and the same mean. Therefore the entropy can also be upper bounded by the entropy of a Poisson distribution with the same mean. ^[6]

The Shepp-Olkin conjecture, due to Lawrence Shepp and Ingram Olkin in 1981, states that the entropy of a Poisson binomial distribution is a concave function of the success probabilities $p_1, p_2, \dots , p_n$ . ^[7]

References

↑ Wang, Y. H. (1993). "On the number of successes in independent trials". Statistica Sinica 3 (2): 295–312.
↑ Shah, B. K. (1994). "On the distribution of the sum of independent integer valued random variables". American Statistician 27 (3): 123–124. JSTOR 2683639.
↑ Chen, X. H.; A. P. Dempster; J. S. Liu (1994). "Weighted finite population sampling to maximize entropy". Biometrika 81 (3): 457. doi:10.1093/biomet/81.3.457.
↑ Fernandez, M.; S. Williams (2010). "Closed-Form Expression for the Poisson-Binomial Probability Density Function". IEEE Transactions on Aerospace Electronic Systems 46: 803–817. doi:10.1109/TAES.2010.5461658.
↑ Chen, S. X.; J. S. Liu (1997). "Statistical Applications of the Poisson-Binomial and conditional Bernoulli distributions". Statistica Sinica 7: 875–892.
↑ Harremoës, P. (2001). "Binomial and Poisson distributions as maximum entropy distributions". IEEE Transactions on Information Theory 47 (5): 2039–2041. doi:10.1109/18.930936.
↑ Shepp, Lawrence; Olkin, Ingram (1981). "Entropy of the sum of independent Bernoulli random variables and of the multinomial distribution". In Gani, J.; Rohatgi, V.K. Contributions to probability: A collection of papers dedicated to Eugene Lukacs. New York,: Academic Press,. pp. 201–206. ISBN 0-12-274460-8. MR 0618689.

Probability distributions

Discrete univariate with finite support

Benford Bernoulli Beta-binomial binomial categorical hypergeometric Poisson binomial Rademacher discrete uniform Zipf Zipf–Mandelbrot

Discrete univariate with infinite support

beta negative binomial Borel Conway–Maxwell–Poisson discrete phase-type Delaporte extended negative binomial Gauss–Kuzmin geometric logarithmic negative binomial parabolic fractal Poisson Skellam Yule–Simon zeta

Continuous univariate supported on a bounded interval, e.g. [0,1]

Arcsine ARGUS Balding–Nichols Bates Beta Beta rectangular Irwin–Hall Kumaraswamy logit-normal Noncentral beta raised cosine Triangular U-quadratic uniform Wigner semicircle

[[List of probability distributions#Supported_on_semi-infinite_intervals.2C_usually_.5B0.2C.E2.88.9E.29|Continuous univariate supported on a semi-infinite interval, usually [0,∞)]]

Benini
Benktander 1st kind
Benktander 2nd kind
Beta prime
Burr
chi-squared
chi
Coxian
Dagum
Davis
EL
Erlang
exponential
F
folded normal
Flory-Schulz
Fréchet
Gamma
Gamma/Gompertz
generalized inverse Gaussian
Gompertz
half-logistic
half-normal
Hotelling's T-squared
hyper-Erlang
hyperexponential
hypoexponential
inverse chi-squared (scaled inverse chi-squared)
inverse Gaussian
inverse gamma
Kolmogorov
Lévy
log-Cauchy
log-Laplace
log-logistic
log-normal
matrix-exponential
Maxwell–Boltzmann
Maxwell–Jüttner
Mittag–Leffler
Nakagami
noncentral chi-squared
Pareto
phase-type
Poly-Weibull
Rayleigh
relativistic Breit–Wigner
Rice
Rosin–Rammler
shifted Gompertz
truncated normal
type-2 Gumbel
Weibull
Wilks' lambda

Continuous univariate supported on the whole real line (−∞, ∞)

Cauchy exponential power Fisher's z generalized normal generalized hyperbolic geometric stable Gumbel Holtsmark hyperbolic secant Johnson SU Landau Laplace Linnik logistic noncentral t normal (Gaussian) normal-inverse Gaussian skew normal slash stable Student's t type-1 Gumbel Tracy–Widom variance-gamma Voigt

Continuous univariate with support whose type varies

generalized extreme value generalized Pareto Tukey lambda q-Gaussian q-exponential q-Weibull shifted log-logistic

Mixed continuous-discrete univariate distributions

rectified Gaussian

Multivariate (joint)

Discrete Ewens multinomial Dirichlet-multinomial negative multinomial Continuous Dirichlet Generalized Dirichlet multivariate normal Multivariate stable multivariate Student normal-scaled inverse gamma normal-gamma Matrix-valued inverse matrix gamma inverse-Wishart matrix normal matrix t matrix gamma normal-inverse-Wishart normal-Wishart Wishart

Directional

Univariate (circular) directional Circular uniform univariate von Mises wrapped normal wrapped Cauchy wrapped exponential wrapped Lévy Bivariate (spherical) Kent Bivariate (toroidal) bivariate von Mises Multivariate von Mises–Fisher Bingham

Degenerate and singular

Degenerate discrete degenerate Dirac delta function Singular Cantor

Families

Circular compound Poisson elliptical exponential natural exponential location-scale maximum entropy mixture Pearson Tweedie wrapped

Some common univariate probability distributions

Continuous	beta Cauchy chi-squared exponential F gamma Laplace log-normal normal Pareto Student's t uniform Weibull

Discrete	Bernoulli binomial discrete uniform geometric hypergeometric negative binomial Poisson

List of probability distributions

Poisson binomial distribution

Mean and variance

Probability mass function

Entropy

See also

References