Borel distribution

Borel distribution
Parameters \mu \in [0,1]
Support n \in \{1, 2, 3,\ldots\}
pmf \frac{e^{-\mu n}(\mu n)^{n-1}}{n!}
Mean \frac{1}{1-\mu}
Variance \frac{\mu}{(1-\mu)^3}

The Borel distribution is a discrete probability distribution, arising in contexts including branching processes and queueing theory.

If the number of offspring that an organism has is Poisson-distributed, and if the average number of offspring of each organism is no bigger than 1, then the descendants of each individual will ultimately become extinct. The number of descendants that an individual ultimately has in that situation is a random variable distributed according to a Borel distribution.

Definition

A discrete random variable X is said to have a Borel distribution[1][2] with parameter μ ∈ [0,1] if the probability mass function of X is given by

P_\mu(n)= \Pr(X=n)= \frac{e^{-\mu n}(\mu n)^{n-1}}{n!}

for n = 1, 2, 3, ... .
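As an illustration, the mass function can be evaluated numerically. The following is a minimal sketch, not a reference implementation: the function name `borel_pmf` and the truncation at 2000 terms are choices made here for the example. It works on the log scale so that large n does not overflow, and it treats μ = 0 as the degenerate case X = 1.

```python
import math

def borel_pmf(n: int, mu: float) -> float:
    """P(X = n) for a Borel(mu) variable, n = 1, 2, 3, ..."""
    if n < 1:
        return 0.0
    if mu == 0.0:
        # Degenerate case: no offspring, so X = 1 with probability 1.
        return 1.0 if n == 1 else 0.0
    # Logarithm of e^{-mu n} (mu n)^{n-1} / n!
    log_p = -mu * n + (n - 1) * math.log(mu * n) - math.lgamma(n + 1)
    return math.exp(log_p)

mu = 0.5
# Truncated sums; the tail beyond n = 2000 is negligible for mu = 0.5.
total = sum(borel_pmf(n, mu) for n in range(1, 2000))
mean = sum(n * borel_pmf(n, mu) for n in range(1, 2000))
print(total, mean)
```

For μ = 0.5 the two printed values should be close to 1 and to 1/(1 − μ) = 2 respectively.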

Derivation and branching process interpretation

If a Galton–Watson branching process has common offspring distribution Poisson with mean μ, then the total number of individuals in the branching process has Borel distribution with parameter μ.

Let X  be the total number of individuals in a Galton–Watson branching process. Then a correspondence between the total size of the branching process and a hitting time for an associated random walk[3][4][5] gives

\Pr(X=n)=\frac{1}{n}\Pr(S_n=n-1)

where Sn = Y1 + … + Yn, and Y1, …, Yn are independent identically distributed random variables whose common distribution is the offspring distribution of the branching process. In the case where this common distribution is Poisson with mean μ, the random variable Sn has Poisson distribution with mean μn, leading to the mass function of the Borel distribution given above.
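Written out as a worked step, the Poisson case reads as follows. Since Sn is Poisson with mean μn,

\Pr(S_n=n-1)=\frac{e^{-\mu n}(\mu n)^{n-1}}{(n-1)!},

and dividing by n gives

\Pr(X=n)=\frac{1}{n}\cdot\frac{e^{-\mu n}(\mu n)^{n-1}}{(n-1)!}=\frac{e^{-\mu n}(\mu n)^{n-1}}{n!}.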

Since the mth generation of the branching process has mean size μ^{m−1}, the mean of X for μ < 1 is

1+\mu+\mu^2+\cdots = \frac{1}{1-\mu}.
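The branching process interpretation and the mean can be checked by simulation. The sketch below is illustrative only: the helper names `poisson_sample` and `total_progeny`, the seed, and the sample size are assumptions of this example. It uses only the standard library and draws Poisson variates by Knuth's multiplication method, which is adequate for small μ.

```python
import math
import random

def poisson_sample(mu: float, rng: random.Random) -> int:
    """Draw from Poisson(mu) by Knuth's multiplication method."""
    threshold = math.exp(-mu)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count

def total_progeny(mu: float, rng: random.Random) -> int:
    """Total size of a Poisson(mu) Galton-Watson process with one ancestor."""
    alive, total = 1, 1
    while alive > 0:
        # Process one individual: it is replaced by its children.
        children = poisson_sample(mu, rng)
        alive += children - 1
        total += children
    return total

rng = random.Random(0)
mu = 0.5
samples = [total_progeny(mu, rng) for _ in range(20000)]
sample_mean = sum(samples) / len(samples)
frac_one = samples.count(1) / len(samples)
print(sample_mean, 1 / (1 - mu))      # empirical mean vs 1/(1 - mu)
print(frac_one, math.exp(-mu))        # empirical Pr(X = 1) vs e^{-mu}
```

The loop terminates with probability 1 only when μ ≤ 1, and for μ close to 1 individual runs can be very long, so the example keeps μ well below 1.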

Queueing theory interpretation

In an M/D/1 queue with arrival rate μ and common service time 1, the distribution of a typical busy period of the queue is Borel with parameter μ.[6]

Properties

If Pμ(n) is the probability mass function of a Borel(μ) random variable, then the mass function P*μ(n) of a size-biased sample from the distribution (i.e. the mass function proportional to nPμ(n)) is given by

P_\mu^*(n)=(1-\mu)\frac{e^{-\mu n}(\mu n)^{n-1}}{(n-1)!}.

Aldous and Pitman [7] show that

P_\mu(n)=\frac{1}{\mu}\int_0^{\mu}P_\lambda^*(n) \, d\lambda.

In words, this says that a Borel(μ) random variable has the same distribution as a size-biased Borel(μU) random variable, where U has the uniform distribution on [0,1].
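The integral relation can be checked numerically. The following sketch assumes a simple midpoint rule for the integral; the function names and the choice μ = 0.7 are illustrative and not taken from the cited paper.

```python
import math

def borel_pmf(n: int, lam: float) -> float:
    """Borel(lam) mass function e^{-lam n}(lam n)^{n-1}/n! for lam > 0."""
    return math.exp(-lam * n + (n - 1) * math.log(lam * n) - math.lgamma(n + 1))

def size_biased_pmf(n: int, lam: float) -> float:
    """Size-biased mass function (1 - lam) * n * P_lam(n)."""
    return (1.0 - lam) * n * borel_pmf(n, lam)

def averaged_size_biased(n: int, mu: float, steps: int = 20000) -> float:
    """Midpoint-rule approximation of (1/mu) * integral_0^mu P*_lam(n) d lam."""
    h = mu / steps
    return sum(size_biased_pmf(n, (j + 0.5) * h) for j in range(steps)) * h / mu

mu = 0.7
# Compare P_mu(n) with the averaged size-biased mass function for small n.
gaps = [abs(borel_pmf(n, mu) - averaged_size_biased(n, mu)) for n in range(1, 8)]
print(max(gaps))
```

The printed gap should be small and limited only by the quadrature error. The midpoint rule is used so that the integrand is never evaluated at λ = 0.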

This relation leads to various useful formulas, including

E(1/X) = 1-\mu/2.
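One way to obtain this formula from the two displays above, for 0 < μ ≤ 1: dividing the size-biased mass function by n gives

\frac{P_\lambda^*(n)}{n}=(1-\lambda)P_\lambda(n),

so summing the integral relation over n, and using the fact that P_\lambda sums to 1, yields

E(1/X)=\sum_{n\ge 1}\frac{P_\mu(n)}{n}=\frac{1}{\mu}\int_0^{\mu}\sum_{n\ge 1}\frac{P_\lambda^*(n)}{n}\,d\lambda=\frac{1}{\mu}\int_0^{\mu}(1-\lambda)\,d\lambda=1-\frac{\mu}{2}.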

Borel–Tanner distribution

The Borel–Tanner distribution generalizes the Borel distribution. Let k be a positive integer. If X1, X2, …, Xk are independent and each has Borel distribution with parameter μ, then their sum W = X1 + X2 + … + Xk is said to have Borel–Tanner distribution with parameters μ and k.[2][6][8] This gives the distribution of the total number of individuals in a Poisson–Galton–Watson process starting with k individuals in the first generation, or of the time taken for an M/D/1 queue to empty starting with k jobs in the queue. The case k = 1 is simply the Borel distribution above.

Generalizing the random walk correspondence given above for k = 1,[4][5]

\Pr(W=n)=\frac{k}{n}\Pr(S_n=n-k)

where Sn has Poisson distribution with mean μn. As a result, the probability mass function is given by

\Pr(W=n)=\frac{k}{n}\frac{e^{-\mu n}(\mu n)^{n-k}}{(n-k)!}

for n = k, k + 1, ... .
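A short numerical check of this mass function is sketched below. The function name `borel_tanner_pmf`, the value μ = 0.4 and the cutoff at n = 60 are choices made for the example. It confirms that k = 1 gives the Borel mass function at n = 1, and that convolving two independent Borel(μ) variables reproduces the case k = 2.

```python
import math

def borel_tanner_pmf(n: int, mu: float, k: int) -> float:
    """P(W = n) for Borel-Tanner(mu, k), n = k, k + 1, ..."""
    if n < k:
        return 0.0
    if mu == 0.0:
        # Degenerate case: each of the k ancestors has no offspring.
        return 1.0 if n == k else 0.0
    # Logarithm of (k/n) e^{-mu n} (mu n)^{n-k} / (n-k)!
    log_p = (math.log(k) - math.log(n) - mu * n
             + (n - k) * math.log(mu * n) - math.lgamma(n - k + 1))
    return math.exp(log_p)

mu = 0.4
# k = 1 is the Borel mass function; index 0 is padded with zero.
borel = [0.0] + [borel_tanner_pmf(n, mu, 1) for n in range(1, 60)]
# Convolution of two independent Borel(mu) variables, for n = 2, ..., 59.
conv = [sum(borel[i] * borel[n - i] for i in range(1, n)) for n in range(2, 60)]
direct = [borel_tanner_pmf(n, mu, 2) for n in range(2, 60)]
max_gap = max(abs(a - b) for a, b in zip(conv, direct))
print(max_gap)
```

Because each convolution sum is finite, the two lists should agree up to floating-point rounding.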

References

  1. Borel, Émile (1942). "Sur l’emploi du théorème de Bernoulli pour faciliter le calcul d’une infinité de coefficients. Application au problème de l’attente à un guichet". C. R. Acad. Sci. 214: 452–456.
  2. Tanner, J. C. (1961). "A derivation of the Borel distribution". Biometrika 48: 222–224. doi:10.1093/biomet/48.1-2.222. JSTOR 2333154.
  3. Otter, R. (1949). "The Multiplicative Process". The Annals of Mathematical Statistics 20 (2): 206. doi:10.1214/aoms/1177730031.
  4. Dwass, Meyer (1969). "The Total Progeny in a Branching Process and a Related Random Walk". Journal of Applied Probability (Applied Probability Trust) 6 (3): 682–686. JSTOR 3212112.
  5. Pitman, Jim (1997). "Enumerations Of Trees And Forests Related To Branching Processes And Random Walks". Microsurveys in Discrete Probability: DIMACS Workshop (41).
  6. Haight, F. A.; Breuer, M. A. (1960). "The Borel-Tanner distribution". Biometrika 47: 143. doi:10.1093/biomet/47.1-2.143. JSTOR 2332966.
  7. Aldous, D.; Pitman, J. (1998). "Tree-valued Markov chains derived from Galton-Watson processes". Annales de l'Institut Henri Poincare (B) Probability and Statistics 34 (5): 637. doi:10.1016/S0246-0203(98)80003-4.
  8. Tanner, J. C. (1953). "A Problem of Interference Between Two Queues". Biometrika 40: 58–69. doi:10.1093/biomet/40.1-2.58. JSTOR 2333097.

External links