Laplace distribution

Laplace
Probability density function

Cumulative distribution function

Parameters \mu location (real)
b > 0 scale (real)
Support x \in (-\infty; +\infty)\,
PDF \frac{1}{2\,b} \exp \left(-\frac{|x-\mu|}b \right) \,
CDF \begin{cases}
      \frac12 \exp \left( \frac{x-\mu}{b} \right) & \mbox{if }x < \mu
             \\[8pt]
          1-\frac12 \exp \left( -\frac{x-\mu}{b} \right) & \mbox{if }x \geq \mu
       \end{cases}
Mean \mu
Median \mu
Mode \mu
Variance 2b^2
Skewness 0
Ex. kurtosis 3
Entropy 1+\log(2b)
MGF \frac{\exp(\mu\,t)}{1-b^2\,t^2}\,\!\text{ for }|t|<1/b\,
CF \frac{\exp(\mu\,i\,t)}{1+b^2\,t^2}\,\!

In probability theory and statistics, the Laplace distribution is a continuous probability distribution named after Pierre-Simon Laplace. It is also sometimes called the double exponential distribution, because it can be thought of as two exponential distributions (with an additional location parameter) spliced together back-to-back, although the term 'double exponential distribution' is also sometimes used to refer to the Gumbel distribution. The difference between two independent identically distributed exponential random variables is governed by a Laplace distribution, as is a Brownian motion evaluated at an exponentially distributed random time. Increments of Laplace motion or a variance gamma process evaluated over the time scale also have a Laplace distribution.

Characterization

Probability density function

A random variable has a Laplace(μ, b) distribution if its probability density function is

f(x\mid\mu,b) = \frac{1}{2b} \exp \left( -\frac{|x-\mu|}{b} \right) \,\!
    = \frac{1}{2b}
    \left\{\begin{matrix}
      \exp \left( -\frac{\mu-x}{b} \right) & \text{if }x < \mu
      \\[8pt]
      \exp \left( -\frac{x-\mu}{b} \right) & \text{if }x \geq \mu
    \end{matrix}\right.

Here, μ is a location parameter and b > 0 is a scale parameter, sometimes referred to as the diversity. If μ = 0 and b = 1, the density on the positive half-line is exactly that of an exponential distribution scaled by 1/2.

The probability density function of the Laplace distribution is also reminiscent of the normal distribution; however, whereas the normal distribution is expressed in terms of the squared difference from the mean μ, the Laplace density is expressed in terms of the absolute difference from the mean. Consequently the Laplace distribution has fatter tails than the normal distribution.
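
The tail comparison can be checked numerically. The following is a minimal sketch, not a reference implementation; the function names `laplace_pdf` and `normal_pdf` are illustrative, and the two densities are matched to the same variance (an assumption made here so the comparison is fair, using the variance 2b^2 of the Laplace distribution).

```python
import math


def laplace_pdf(x, mu=0.0, b=1.0):
    """Density of Laplace(mu, b) at x; the scale b must be positive."""
    if b <= 0:
        raise ValueError("scale b must be positive")
    return math.exp(-abs(x - mu) / b) / (2.0 * b)


def normal_pdf(x, mu=0.0, sigma=1.0):
    """Density of the normal distribution with mean mu and std sigma."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


# Match variances: the Laplace variance is 2*b**2, so b = sigma / sqrt(2).
sigma = 1.0
b = sigma / math.sqrt(2.0)
for x in (4.0, 6.0, 8.0):
    # Far from the mean the Laplace density is larger (fatter tails).
    print(x, laplace_pdf(x, 0.0, b), normal_pdf(x, 0.0, sigma))
```

Far from the mean the Laplace density decays like exp(−|x|/b), while the normal density decays like exp(−x²/(2σ²)), which is why the printed Laplace values dominate.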

Differential equation

The pdf of the Laplace distribution is a solution of the following differential equation:

\begin{cases}
  \left\{\begin{array}{l}
  b f'(x)+f(x)=0 \\[8pt]
  f(0)=\frac{e^{\frac{\mu}{b}}}{2b}\end{array}\right\} & \text{if } x \geq \mu \\[8pt]
  \left\{\begin{array}{l}
  b f'(x)-f(x)=0 \\[8pt]
  f(0)=\frac{e^{-\frac{\mu}{b}}}{2b}\end{array}\right\} & \text{if } x < \mu
  \end{cases}

Cumulative distribution function

The Laplace density is easy to integrate once the two symmetric cases x < μ and x ≥ μ are distinguished, since the absolute value can then be removed. Its cumulative distribution function is as follows:

\begin{align}
F(x) &= \int_{-\infty}^x \!\!f(u)\,\mathrm{d}u  = \begin{cases}
             \frac12 \exp \left( \frac{x-\mu}{b} \right) & \mbox{if }x < \mu \\
             1-\frac12 \exp \left( -\frac{x-\mu}{b} \right) & \mbox{if }x \geq \mu
            \end{cases} \\
&=\tfrac{1}{2} + \tfrac{1}{2} \sgn(x-\mu) \left(1-\exp \left(-\frac{|x-\mu|}{b} \right ) \right ).
\end{align}

The inverse cumulative distribution function is given by

F^{-1}(p) = \mu - b\,\sgn(p-0.5)\,\ln(1 - 2|p-0.5|).
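
As a rough illustration, the two formulas above translate directly into code. This is a sketch under the assumption that plain floating-point arithmetic is accurate enough; the names `laplace_cdf` and `laplace_quantile` are illustrative.

```python
import math


def laplace_cdf(x, mu=0.0, b=1.0):
    """Cumulative distribution function of Laplace(mu, b)."""
    if x < mu:
        return 0.5 * math.exp((x - mu) / b)
    return 1.0 - 0.5 * math.exp(-(x - mu) / b)


def laplace_quantile(p, mu=0.0, b=1.0):
    """Inverse CDF of Laplace(mu, b) for a probability 0 < p < 1."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    # sgn(p - 0.5); its value at p = 0.5 is irrelevant because log(1) = 0.
    sign = 1.0 if p >= 0.5 else -1.0
    return mu - b * sign * math.log(1.0 - 2.0 * abs(p - 0.5))


# Round trip: applying the CDF to a quantile should recover p.
for p in (0.05, 0.25, 0.5, 0.9):
    x = laplace_quantile(p, mu=1.5, b=2.0)
    print(p, x, laplace_cdf(x, mu=1.5, b=2.0))
```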

Generating random variables according to the Laplace distribution

Given a random variable U drawn from the uniform distribution on the open interval (−1/2, 1/2), the random variable

X=\mu - b\,\sgn(U)\,\ln(1 - 2|U|)

has a Laplace distribution with parameters μ and b. This follows from the inverse cumulative distribution function given above.
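
A minimal sketch of this inverse-transform sampler, using only Python's standard `random` module. The function name `sample_laplace` is illustrative. Because `random()` returns values in [0, 1), the endpoint U = −1/2 (where the logarithm is undefined) is rejected explicitly.

```python
import math
import random


def sample_laplace(mu, b, rng=random):
    """Draw one Laplace(mu, b) variate by inverting the CDF."""
    u = rng.random() - 0.5          # uniform on [-0.5, 0.5)
    while u == -0.5:                # avoid log(0) at the endpoint
        u = rng.random() - 0.5
    sign = 1.0 if u >= 0.0 else -1.0
    return mu - b * sign * math.log(1.0 - 2.0 * abs(u))


rng = random.Random(12345)          # fixed seed, for reproducibility only
draws = [sample_laplace(3.0, 2.0, rng) for _ in range(5)]
print(draws)
```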

A Laplace(0, b) variate can also be generated as the difference of two i.i.d. Exponential(1/b) random variables. Equivalently, a Laplace(0, 1) random variable can be generated as the logarithm of the ratio of two i.i.d. random variables that are uniform on (0, 1).
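
Both constructions can be sketched with the standard library. The function names are illustrative, and `expovariate` is parameterised by the rate, so the rate 1/b is passed for scale b. The uniform draws are shifted to (0, 1] to keep the ratio finite and positive.

```python
import math
import random


def laplace_from_exponentials(b, rng=random):
    """Laplace(0, b) as the difference of two i.i.d. exponentials with rate 1/b."""
    rate = 1.0 / b
    return rng.expovariate(rate) - rng.expovariate(rate)


def standard_laplace_from_uniforms(rng=random):
    """Laplace(0, 1) as the log of the ratio of two i.i.d. uniforms."""
    u1 = 1.0 - rng.random()         # in (0, 1], so the log is defined
    u2 = 1.0 - rng.random()
    return math.log(u1 / u2)


rng = random.Random(2024)           # fixed seed, for reproducibility only
print(laplace_from_exponentials(1.5, rng))
print(standard_laplace_from_uniforms(rng))
```

The second construction is a special case of the first, since the negative logarithm of a uniform variable on (0, 1) is exponentially distributed with rate 1.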

Parameter estimation

Given N independent and identically distributed samples x_1, x_2, ..., x_N, the maximum likelihood estimator \hat{\mu} of μ is the sample median,[1] and the maximum likelihood estimator of b is

\hat{b} = \frac{1}{N} \sum_{i = 1}^{N} |x_i - \hat{\mu}|

(revealing a link between the Laplace distribution and least absolute deviations).
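
The two estimators can be sketched as follows. The function name `fit_laplace` is illustrative. For an even number of samples the likelihood is maximised by any point between the two middle values; `statistics.median` returns their midpoint, which is one valid choice.

```python
import statistics


def fit_laplace(samples):
    """Maximum likelihood estimates (mu_hat, b_hat) for i.i.d. Laplace data."""
    if not samples:
        raise ValueError("at least one sample is required")
    mu_hat = statistics.median(samples)
    # b_hat is the mean absolute deviation from the sample median.
    b_hat = sum(abs(x - mu_hat) for x in samples) / len(samples)
    return mu_hat, b_hat


mu_hat, b_hat = fit_laplace([1.0, 2.0, 3.0, 4.0, 10.0])
print(mu_hat, b_hat)
```

In this small example the median is 3 and the absolute deviations are 2, 1, 0, 1 and 7, so the scale estimate is 11/5.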

Moments

\mu_r' = \bigg({\frac{1}{2}}\bigg) \sum_{k=0}^r \bigg[{\frac{r!}{k! (r-k)!}} b^k \mu^{(r-k)} k! \{1 + (-1)^k\}\bigg]
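
As a sanity check, the raw-moment formula can be evaluated directly. This is a sketch only; the function name `laplace_raw_moment` is illustrative. The factor 1 + (−1)^k removes the odd-k terms, and the binomial coefficient r!/(k!(r−k)!) is computed with `math.comb`.

```python
import math


def laplace_raw_moment(r, mu, b):
    """r-th raw moment E[X^r] of X ~ Laplace(mu, b), for integer r >= 0."""
    total = 0.0
    for k in range(r + 1):
        total += (math.comb(r, k) * b**k * mu**(r - k)
                  * math.factorial(k) * (1 + (-1)**k))
    return 0.5 * total


mu, b = 3.0, 2.0
mean = laplace_raw_moment(1, mu, b)
variance = laplace_raw_moment(2, mu, b) - mean**2
print(mean, variance)               # compare with mu and 2*b**2
```

With μ = 0 and b = 1 the fourth raw moment is 4! = 24, and dividing by the squared variance (2² = 4) and subtracting 3 gives the excess kurtosis of 3.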

Related distributions

Relation to the exponential distribution

A Laplace random variable can be represented as the difference of two i.i.d. exponential random variables.[2] One way to show this is with characteristic functions. The characteristic function of a sum of independent random variables is the product of their individual characteristic functions, and a characteristic function uniquely determines its distribution.

Consider two i.i.d. random variables X, Y ~ Exponential(λ). The characteristic functions of X and −Y are

\frac{\lambda }{-i t+\lambda }, \quad \frac{\lambda }{i t+\lambda }

respectively. Multiplying these characteristic functions gives the characteristic function of the sum of the random variables X + (−Y):

\frac{\lambda ^2}{(-i t+\lambda ) (i t+\lambda )} = \frac{\lambda ^2}{t^2+\lambda ^2}.

This is the same as the characteristic function for Z ~ Laplace(0,1/λ), which is

\frac{1}{1+\frac{t^2}{\lambda ^2}}.
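
The identity can also be checked numerically with complex arithmetic. This is a sketch only; the function names are illustrative, and the characteristic function of −Y at t is obtained as that of Y at −t.

```python
import cmath


def exponential_cf(t, lam):
    """Characteristic function of Exponential(lam) at t."""
    return lam / (lam - 1j * t)


def laplace_cf(t, mu, b):
    """Characteristic function of Laplace(mu, b) at t."""
    return cmath.exp(1j * mu * t) / (1.0 + (b * t) ** 2)


def difference_cf(t, lam):
    """Characteristic function of X - Y for i.i.d. X, Y ~ Exponential(lam)."""
    return exponential_cf(t, lam) * exponential_cf(-t, lam)


lam = 2.0
for t in (-3.0, 0.0, 1.0, 2.5):
    # The two columns should agree up to rounding error.
    print(t, difference_cf(t, lam), laplace_cf(t, 0.0, 1.0 / lam))
```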

Sargan distributions

Sargan distributions are a system of distributions of which the Laplace distribution is a core member. A pth order Sargan distribution has density[3][4]

f_p(x)=\tfrac{\alpha}{2} \exp(-\alpha |x|)  \frac{\displaystyle 1+\sum_{j=1}^p \beta_j \alpha^j |x|^j}{\displaystyle 1+\sum_{j=1}^p j!\beta_j},

for parameters α > 0, βj ≥ 0. The Laplace distribution results for p = 0.

Applications

The Laplacian distribution has been used in speech recognition to model priors on DFT coefficients.[5]

The addition of noise drawn from a Laplacian distribution, with scaling parameter appropriate to a function's sensitivity, to the output of a statistical database query is the most common means to provide differential privacy in statistical databases.
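
A minimal sketch of this idea for a counting query follows. It assumes the standard Laplace mechanism, in which the noise scale is the query's sensitivity divided by a privacy parameter ε; the parameter ε, the default sensitivity of 1 and the name `private_count` are assumptions made for illustration and are not taken from the text above.

```python
import random


def private_count(true_count, epsilon, sensitivity=1.0, rng=None):
    """Return true_count plus Laplace noise of scale sensitivity / epsilon.

    Assumption: a counting query changes by at most `sensitivity` when
    one record is added or removed (1 for a simple count).
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    rng = rng or random
    b = sensitivity / epsilon
    # Laplace(0, b) noise as the difference of two exponentials with rate 1/b.
    noise = rng.expovariate(1.0 / b) - rng.expovariate(1.0 / b)
    return true_count + noise


rng = random.Random(42)             # fixed seed, for reproducibility only
print(private_count(100, epsilon=0.5, rng=rng))
```

A smaller ε gives a larger noise scale, so individual answers are less accurate while repeated answers still average out to the true count.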

The least absolute deviations estimate arises as the maximum likelihood estimate if the errors have a Laplace distribution.

History

This distribution is often referred to as Laplace's first law of errors. Laplace published it in 1774, noting that the frequency of an error could be expressed as an exponential function of its magnitude once its sign was disregarded.[6][7]

Keynes published a paper in 1911 based on his earlier thesis wherein he showed that the Laplace distribution minimised the absolute deviation from the median.[8]


References

  1. Robert M. Norton (May 1984). "The Double Exponential Distribution: Using Calculus to Find a Maximum Likelihood Estimator". The American Statistician (American Statistical Association) 38 (2): 135–136. doi:10.2307/2683252. JSTOR 2683252.
  2. Kotz, Samuel; Kozubowski, Tomasz J.; Podgórski, Krzysztof (2001). The Laplace distribution and generalizations: a revisit with applications to Communications, Economics, Engineering and Finance. Birkhäuser. p. 23 (Proposition 2.2.2, Equation 2.2.8). ISBN 9780817641665.
  3. Everitt, B.S. (2002) The Cambridge Dictionary of Statistics, CUP. ISBN 0-521-81099-X
  4. Johnson, N.L., Kotz S., Balakrishnan, N. (1994) Continuous Univariate Distributions, Wiley. ISBN 0-471-58495-9. p. 60
  5. Eltoft, T.; Taesu Kim; Te-Won Lee (2006). "On the multivariate Laplace distribution". IEEE Signal Processing Letters 13 (5): 300–303. doi:10.1109/LSP.2006.870353.
  6. Laplace, P-S. (1774). Mémoire sur la probabilité des causes par les évènements. Mémoires de l’Académie Royale des Sciences Présentés par Divers Savans, 6, 621–656
  7. Wilson EB (1923) First and second laws of error. JASA 18, 143
  8. Keynes JM (1911) The principal averages and the laws of error which lead to them. J Roy Stat Soc, 74, 322–331
