Split normal distribution

In probability theory and statistics, the split normal distribution also known as the two-piece normal distribution results from joining at the mode the corresponding halves of two normal distributions with the same mode but different variances. It is claimed by Johnson et al.^[1] that this distribution was introduced by Gibbons and Mylroie^[2] and by John.^[3] But these are two of several independent rediscoveries of the Zweiseitige Gauss'sche Gesetz introduced in the posthumously published Kollektivmasslehre (1897) ^[4] of Gustav Theodor Fechner (1801-1887).^[5]

Split-normal
Notation	$\mathcal{SN}(\mu,\,\sigma_1,\sigma_2)$
Parameters	$\mu \in \Re$ — mode (location, real) $\sigma_1 > 0$ — left-hand-side standard deviation (scale, real) $\sigma_2 > 0$ — right-hand-side standard deviation (scale, real)
Support	$x \in \Re$
PDF	$A \exp \left(- \frac {(x-\mu)^2}{2 \sigma_1^2}\right) \quad \text{if } x< \mu$ $A \exp \left(- \frac {(x-\mu)^2}{2 \sigma_2^2}\right) \quad \text{otherwise,}$ $\text{where} \quad A= \sqrt{2/\pi} (\sigma_1+\sigma_2)^{-1}$
Mean	$\mu+\sqrt{2 / \pi}(\sigma_2-\sigma_1)$
Mode	$\mu$
Variance	$(1-2 / \pi)(\sigma_2-\sigma_1)^2 + \sigma_1 \sigma_2$
Skewness	$\gamma_3 = \sqrt{\frac{2}{\pi}}(\sigma_2-\sigma_1)\left[\left(\frac{4}{\pi}-1\right)(\sigma_2-\sigma_1)^2 + \sigma_1 \sigma_2\right]$

Definition

The split normal distribution arises from merging two opposite halves of two probability density functions (PDFs) of normal distributions in their common mode.

The PDF of the split normal distribution is given by^[1]

f(x;\mu,\sigma_1,\sigma_2)= A \exp (- \frac {(x-\mu)^2}{2 \sigma_1^2}) \quad \text{if } x< \mu

f(x;\mu,\sigma_1,\sigma_2)= A \exp (- \frac {(x-\mu)^2}{2 \sigma_2^2}) \quad \text{otherwise}

where

\quad A= \sqrt{2/\pi} (\sigma_1+\sigma_2)^{-1}.

Discussion

The split normal distribution results from merging two halves of normal distributions. In a general case the 'parent' normal distributions can have different variances which implies that the joined PDF would not be continuous. To ensure that the resulting PDF integrates to 1, the normalizing constant A is used.

In a special case when $\sigma_1^2=\sigma_2^2=\sigma_{*}^2$ the split normal distribution reduces to normal distribution with variance $\sigma_{*}^2$ .

When σ₂≠σ₁ the constant A it is different from the constant of normal distribution. However, when $\sigma_1^2=\sigma_2^2=\sigma_{*}^2$ the constants are equal.

The sign of its third central moment is determined by the difference (σ₂-σ₁). If this difference is positive, the distribution is skewed to the right and if negative, then it is skewed to the left.

Other properties of the split normal density were discussed by Johnson et al.^[1] and Julio.^[6]

Alternative formulations

The formulation discussed above originates from John.^[3] The literature offers two mathematically equivalent alternative parameterizations . Britton, Fisher and Whitley^[7] offer a parameterization if terms of mode, dispersion and normed skewness, denoted with $\mathcal{SN}(\mu,\, \sigma^2,\gamma)$ . The parameter μ is the mode and has equivalent to the mode in John’s formulation. The parameter σ ²>0 informs about the dispersion (scale) and should not be confused with variance. The third parameter, γ ∈ (-1,1), is the normalized skew.

The second alternative parameterization is used in the Bank of England’s communication and is written in terms of mode, dispersion and unnormed skewness and is denoted with $\mathcal{SN}(\mu,\, \sigma^2,\xi)$ . In this formulation the parameter μ is the mode and is identical as in John’s ^[3] and Britton, Fisher and Whitley’s ^[7] formulation. The parameter σ ² informs about the dispersion (scale) and is the same as in the Britton, Fisher and Whitley’s formulation. The parameter ξ equals the difference between the distribution’s mean and mode and can be viewed as unnormed measure of skewness.

The three parameterizations are mathematically equivalent, meaning that there is a strict relationship between the parameters and that it is possible to go from one parameterization to another. The following relationships hold:^[8]

\begin{align} \sigma^2 &= \sigma_1^2(1+\gamma)= \sigma_2^2(1-\gamma) \\ \gamma &= \frac{\sigma_2-\sigma_1}{\sigma_2+\sigma_1} \\ \xi &=\sqrt{2 / \pi}(\sigma_2-\sigma_1) \\ \gamma &= \operatorname{sgn}(\xi) \sqrt{1-\left( \frac{\sqrt{1+2\beta}-1}{\beta} \right)^2}, \quad \text{where} \quad \beta = \frac{\pi\xi^2}{2\sigma^2}. \end{align}

Multivariate Extensions

The multivariate generalization of the split normal distribution was proposed by Villani and Larsson.^[9] They assume that each of the principal components has univariate split normal distribution with a different set of parameters μ, σ₂ and σ₁.

Estimation of parameters

John^[3] proposes to estimate the parameters using maximum likelihood method. He shows that the likelihood function can be expressed in an intensive form, in which the scale parameters σ₁ and σ₂ are a function of the location parameter μ. The likelihood in its intensive form is:

L(\mu) = -\left[\sum_{x_i: x_i<\mu} (x_i-\mu)^2 \right]^{1/3} - \left[\sum_{x_i: x_i>\mu} (x_i-\mu)^2 \right]^{1/3}

and has to be maximized numerically with respect to a single parameter μ only.

Given the maximum likelihood estimator $\hat{\mu}$ the other parameters take values:

\hat{\sigma}_1^2 = \frac{-L(\mu)}{N} \left[\sum_{x_i: x_i<\mu} (x_i-\mu)^2 \right]^{2/3},

\hat{\sigma}_2^2 = \frac{-L(\mu)}{N} \left[\sum_{x_i: x_i>\mu} (x_i-\mu)^2 \right]^{2/3},

where N is the number of observations.

Villani and Larsson^[9] propose to use either maximum likelihood method or bayesian estimation and provide some analytical results for either univariate and multivariate case.

Applications

The split normal distribution has been used mainly in econometrics and time series. A remarkable area of application is the construction of the fan chart, a representation of the inflation forecast distribution reported by inflation targeting central banks around the globe.^[6]^[10]

References

↑ 1.0 1.1 1.2 Johnson, N.L., Kotz, S. and Balakrishnan, N. (1994). Continuous Univariate Distributions, Volume 1. John Wiley & Sons. p. 173. ISBN 0-471-58495-9.
↑ Gibbons, J.F.; Mylroie, S. (1973). "Estimation of impurity profiles in ion-implanted amorphous targets using joined half-Gaussian distributions". Applied Physics Letters 22: 568–569. doi:10.1063/1.1654511.
↑ 3.0 3.1 3.2 3.3 John, S. (1982). "The three-parameter two-piece normal family of distributions and its fitting". Communications in Statistics - Theory and Methods 11 (8): 879–885. doi:10.1080/03610928208828279.
↑ Fechner, G.T. (ed. Lipps, G.F.) (1897). Kollectivmasslehre. Engelmann, Leipzig.
↑ Wallis, K.F. (2014). The two-piece normal, binormal, or double Gaussian distribution: its origin and rediscoveries. Statistical Science, vol. 29, no. 1, pp.106-112. doi:10.1214/13-STS417.
↑ 6.0 6.1 Juan Manuel Julio (2007). The Fan Chart: The Technical Details Of The New Implementation. Banco de la República. Retrieved 2010-09-11, direct link
↑ 7.0 7.1 Britton, E.; P. Fisher, Whitley, J. (1998). "The inflation report projections: understanding the fan chart". Quarterly Bulletin. February 1998: 30–37.
↑ Banerjee, N.; A. Das (2011). Fan Chart: Methodology and its Application to Inflation Forecasting in India. Reserve Bank of India Working Paper Series.
↑ 9.0 9.1 Villani, Mattias; Rolf Larsson (2006). "The Multivariate Split Normal Distribution and Asymmetric Principal Components Analysis". Communications in Statistics - Theory and Methods 35 (6): 1123–1140. doi:10.1080/03610920600672252. ISSN 0361-0926.
↑ Bank of England, Inflation Report

Probability distributions

Discrete univariate with finite support

Benford Bernoulli Beta-binomial binomial categorical hypergeometric Poisson binomial Rademacher discrete uniform Zipf Zipf–Mandelbrot

Discrete univariate with infinite support

beta negative binomial Borel Conway–Maxwell–Poisson discrete phase-type Delaporte extended negative binomial Gauss–Kuzmin geometric logarithmic negative binomial parabolic fractal Poisson Skellam Yule–Simon zeta

Continuous univariate supported on a bounded interval, e.g. [0,1]

Arcsine ARGUS Balding–Nichols Bates Beta Beta rectangular Irwin–Hall Kumaraswamy logit-normal Noncentral beta raised cosine Triangular U-quadratic uniform Wigner semicircle

[[List of probability distributions#Supported_on_semi-infinite_intervals.2C_usually_.5B0.2C.E2.88.9E.29|Continuous univariate supported on a semi-infinite interval, usually [0,∞)]]

Benini
Benktander 1st kind
Benktander 2nd kind
Beta prime
Burr
chi-squared
chi
Coxian
Dagum
Davis
EL
Erlang
exponential
F
folded normal
Flory-Schulz
Fréchet
Gamma
Gamma/Gompertz
generalized inverse Gaussian
Gompertz
half-logistic
half-normal
Hotelling's T-squared
hyper-Erlang
hyperexponential
hypoexponential
inverse chi-squared (scaled inverse chi-squared)
inverse Gaussian
inverse gamma
Kolmogorov
Lévy
log-Cauchy
log-Laplace
log-logistic
log-normal
matrix-exponential
Maxwell–Boltzmann
Maxwell–Jüttner
Mittag–Leffler
Nakagami
noncentral chi-squared
Pareto
phase-type
Poly-Weibull
Rayleigh
relativistic Breit–Wigner
Rice
Rosin–Rammler
shifted Gompertz
truncated normal
type-2 Gumbel
Weibull
Wilks' lambda

Continuous univariate supported on the whole real line (−∞, ∞)

Cauchy exponential power Fisher's z generalized normal generalized hyperbolic geometric stable Gumbel Holtsmark hyperbolic secant Johnson SU Landau Laplace Linnik logistic noncentral t normal (Gaussian) normal-inverse Gaussian skew normal slash stable Student's t type-1 Gumbel Tracy–Widom variance-gamma Voigt

Continuous univariate with support whose type varies

generalized extreme value generalized Pareto Tukey lambda q-Gaussian q-exponential q-Weibull shifted log-logistic

Mixed continuous-discrete univariate distributions

rectified Gaussian

Multivariate (joint)

Discrete Ewens multinomial Dirichlet-multinomial negative multinomial Continuous Dirichlet Generalized Dirichlet multivariate normal Multivariate stable multivariate Student normal-scaled inverse gamma normal-gamma Matrix-valued inverse matrix gamma inverse-Wishart matrix normal matrix t matrix gamma normal-inverse-Wishart normal-Wishart Wishart

Directional

Univariate (circular) directional Circular uniform univariate von Mises wrapped normal wrapped Cauchy wrapped exponential wrapped Lévy Bivariate (spherical) Kent Bivariate (toroidal) bivariate von Mises Multivariate von Mises–Fisher Bingham

Degenerate and singular

Degenerate discrete degenerate Dirac delta function Singular Cantor

Families

Circular compound Poisson elliptical exponential natural exponential location-scale maximum entropy mixture Pearson Tweedie wrapped