Log-logistic distribution

From Wikipedia, the free encyclopedia

Log-logistic
Probability density function
α = 1, values of β as shown in legend
α = 1, values of β as shown in legend
Cumulative distribution function
α = 1, values of β as shown in legend
α = 1, values of β as shown in legend
Parameters α > 0 scale
β > 0 shape
Support x\in[0,\infty)
Probability density function (pdf)  \frac{ (\beta/\alpha)(x/\alpha)^{\beta-1} }
                       { \left[ 1+(x/\alpha)^{\beta} \right]^2 }
Cumulative distribution function (cdf) { 1 \over 1+(x/\alpha)^{-\beta} }
Mean {\alpha\,\pi/\beta \over \sin(\pi/\beta)}
if β > 1, else undefined
Median \alpha\,
Mode \alpha\left(\frac{\beta-1}{\beta+1}\right)^{1/\beta}
if β > 1, 0 otherwise
Variance See main text
Skewness
Excess kurtosis
Entropy
Moment-generating function (mgf)
Characteristic function

In probability and statistics, the log-logistic distribution (known as the Fisk distribution in economics) is a continuous probability distribution for a non-negative random variable. It is used in survival analysis as a parametric model for events whose rate increases initially and decreases later, for example mortality from cancer following diagnosis or treatment. It has also been used in hydrology to model stream flow and precipitation, and in economics as a simple model of the distribution of wealth or income.

The log-logistic distribution is the probability distribution of a random variable whose logarithm has a logistic distribution. It is similar in shape to the log-normal distribution but has heavier tails. Its cumulative distribution function can be written in closed form, unlike that of the log-normal.

Contents

[edit] Characterisation

There are several different parameterizations of the distribution in use. The one shown here gives reasonably interpretable parameters and a simple form for the cumulative distribution function.[1][2] The parameter α > 0 is a scale parameter and is also the median of the distribution. The parameter β > 0 is a shape parameter. The distribution is unimodal when β > 1 and its dispersion decreases as β increases.

The cumulative distribution function is

\begin{align} 
F(x; \alpha, \beta) & = {       1         \over 1+(x/\alpha)^{-\beta} } \\    
                    & = {(x/\alpha)^\beta \over 1+(x/\alpha)^   \beta  } \\
                    & = {x^\beta \over \alpha^\beta+x^\beta}
\end{align}

where x > 0, α > 0, β > 0.

The probability density function is

f(x; \alpha, \beta) = \frac{ (\beta/\alpha)(x/\alpha)^{-\beta-1} }
                      { \left[ 1+(x/\alpha)^{-\beta} \right]^2 }.

[edit] Properties

[edit] Moments

The kth raw moment exists only when k < β, when it is given by[3][4]

\begin{align}
\operatorname{E}(X^k) 
&  = \alpha^k\,\operatorname{B}(1-k/\beta,\, 1+k/\beta) \\
& = \alpha^k\, {k\,\pi/\beta \over \sin(k\,\pi/\beta)}
\end{align}

where B() is the beta function. Expressions for the mean, variance, skewness and kurtosis can be derived from this. Writing b = π / β for convenience, the mean is

 \operatorname{E}(X) = \alpha b / \sin b , \quad \beta>1,

and the variance is

 \operatorname{Var}(X) = \alpha^2 \left( 2b / \sin 2b -b^2 / \sin^2 b \right), \quad \beta>2.

Explicit expressions for the kurtosis and variance are lengthy.[5] As β tends to infinity the mean tends to α, the variance and skewness tend to zero and the excess kurtosis tends to 6/5 (see also related distributions below).

[edit] Quantiles

The quantile function (inverse cumulative distribution function) is :

F^{-1}(p;\alpha, \beta) = \alpha\left( \frac{p}{1-p} \right)^{1/\beta}.

It follows that the median is α, the lower quartile is 31 / βα and the upper quartile is 3 − 1 / βα.

[edit] Applications

Hazard function. α = 1, values of β as shown in legend
Hazard function. α = 1, values of β as shown in legend

[edit] Survival analysis

The log-logistic distribution provides one parametric model for survival analysis. Unlike the more commonly-used Weibull distribution, it can have a non-monotonic hazard function: when β > 1, the hazard function is unimodal (when β ≤ 1, the hazard decreases monotonically). The fact that the cumulative distribution function can be written in closed form is particularly useful for analysis of survival data with censoring.[6] The log-logistic distribution can be used as the basis of an accelerated failure time model by allowing β to differ between groups, or more generally by introducing covariates that affect β but not α by modelling log(β) as a linear function of the covariates.[7]

The survival function is

S(t) = 1 - F(t) = [1+(t/\alpha)^{\beta}]^{-1},\,

and so the hazard function is

 h(t) = \frac{f(t)}{S(t)} = \frac{(\beta/\alpha)(x/\alpha)^{\beta-1}}
                                       {[1+(x/\alpha)^{\beta}]}.

[edit] Hydrology

The log-logistic distribution has been used in hydrology for modelling stream flow rates and precipitation.[1][2]

[edit] Economics

The log-logistic has been used as a simple model of the distribution of wealth or income in economics, where it is known as the Fisk distribution.[8] Its Gini coefficient is 1 / β.[9]

[edit] Related distributions

  • If X has a log-logistic distribution with scale parameter α and shape parameter β then Y = log(X) has a logistic distribution with location parameter log(α) and scale parameter β.
  • As the shape parameter β of the log-logistic distribution increases, its shape increasingly resembles that of a (very narrow) logistic distribution. Informally, as β→∞,
LL(\alpha, \beta) \to 
       L(\alpha,\alpha/\beta).
  • The log-logistic distribution with shape parameter β = 1 and scale parameter α is the same as the generalized Pareto distribution with location parameter μ = 0, shape parameter ξ = 1 and scale parameter α:
LL(\alpha,1) = GPD(1,\alpha,1).\,

[edit] Generalizations

Several different distributions are sometimes referred to as the generalized log-logistic distribution, as they contain the log-logistic as a special case.[9] These include the Burr Type XII distribution (also known as the Singh-Maddala distribution) and the Dagum distribution, both of which include a second shape parameter. Both are in turn special cases of the even more general generalized beta distribution of the second kind. Another more straightforward generalization of the log-logistic is given in the next section.

[edit] Shifted log-logistic distribution

Shifted log-logistic
Probability density function
μ = 0,σ = 1, values of ξ as shown in legend
μ = 0,σ = 1, values of ξ as shown in legend
Cumulative distribution function
μ = 0,σ = 1, values of ξ as shown in legend
μ = 0,σ = 1, values of ξ as shown in legend
Parameters \mu \in (-\infty,+\infty) \, location (real)

\sigma \in (0,+\infty)    \, scale (real)
\xi\in (-\infty,+\infty)  \, shape (real)

Support x \geqslant \mu -\sigma/\xi\,\;(\xi > 0)

x \leqslant \mu -\sigma/\xi\,\;(\xi < 0)
x \in (-\infty, +\infty)   \,\;(\xi = 0)

Probability density function (pdf) \frac{(1+\xi z)^{-(1/\xi +1)}}{\sigma\left(1 + (1+\xi z)^{-1/\xi}\right)^2}

where z=(x-\mu)/\sigma\,

Cumulative distribution function (cdf) \left(1+(1 + \xi z)^{-1/\xi}\right)^{-1} \,

where z=(x-\mu)/\sigma\,

Mean \mu + \frac{\sigma}{\xi}(\alpha \csc(\alpha)-1)

where \alpha= \pi \xi\,

Median \mu \,
Mode \mu + \frac{\sigma}{\xi}\left[\left(\frac{1-\xi}{1+\xi}\right)^\xi - 1 \right]
Variance  \frac{\sigma^2}{\xi^2}[2\alpha \csc(2 \alpha) - (\alpha \csc(\alpha))^2]

where \alpha= \pi \xi\,

Skewness
Excess kurtosis
Entropy
Moment-generating function (mgf)
Characteristic function

The shifted log-logistic distribution is also known as the generalized log-logistic, the generalized logistic, or the three-parameter log-logistic distribution.[10] [11][12] It can be obtained from the log-logistic distribution by addition of a shift parameter δ: if X has a log-logistic distribution then X + δ has a shifted log-logistic distribution. So Y has a shifted log-logistic distribution if log(Y − δ) has a logistic distribution. The shift parameter adds a location parameter to the scale and shape parameters of the (unshifted) log-logistic.

The properties of this distribution are straightforward to derive from those of the log-logistic distribution. However, an alternative parameterisation, similar to that used for the generalized Pareto distribution and the generalized extreme value distribution, gives more interpretable parameters and also aids their estimation.

In this parameterisation, the cumulative distribution function of the shifted log-logistic distribution is

F(x; \mu,\sigma,\xi) = \frac{1}{ 1 + \left(1+ \frac{\xi(x-\mu)}{\sigma}\right)^{-1/\xi}}

for  1 + \xi(x-\mu)/\sigma \geqslant 0, where \mu\in\mathbb R is the location parameter, \sigma>0\, the scale parameter and \xi\in\mathbb R the shape parameter. Note that some references use  \kappa =  - \xi\,\! to parameterise the shape.[10][13]

The probability density function is

 f(x; \mu,\sigma,\xi) = \frac{\left(1+\frac{\xi(x-\mu)}{\sigma}\right)^{-(1/\xi +1)}}
{\sigma\left[1 + \left(1+\frac{\xi(x-\mu)}{\sigma}\right)^{-1/\xi}\right]^2} .

again, for  1 + \xi(x-\mu)/\sigma \geqslant 0.

The shape parameter ξ is often restricted to lie in [-1,1], when the probability density function is bounded. When | ξ | > 1, it has an asymptote at x = μ − σ / ξ. Reversing the sign of ξ reflects the pdf and the cdf about x = 0..

[edit] Related distributions

  • When μ = σ / ξ, the shifted log-logistic reduces to the log-logistic distribution.
  • When ξ → 0, the shifted log-logistic reduces to the logistic distribution.
  • The shifted log-logistic with shape parameter ξ = 1 is the same as the generalized Pareto distribution with shape parameter ξ = 1.

[edit] Applications

The three-parameter log-logistic distribution is used in hydrology for modelling flood frequency.[10][13][14]

[edit] See also

[edit] References

  1. ^ a b Shoukri, M.M.; Mian, I.U.M. & Tracy, D.S. (1988), “Sampling Properties of Estimators of the Log-Logistic Distribution with Application to Canadian Precipitation Data”, The Canadian Journal of Statistics 16 (3): 223-236, <http://links.jstor.org/sici?sici=0319-5724%28198809%2916%3A3%3C223%3ASPOEOT%3E2.0.CO%3B2-E> 
  2. ^ a b Ashkar, Fahim & Mahdi, Smail (2006), “Fitting the log-logistic distribution by generalized moments”, Journal of Hydrology 328: 694-703, DOI 10.1016/j.jhydrol.2006.01.014 
  3. ^ Tadikamalla, Pandu R. & Johnson, Norman L. (1982), “Systems of Frequency Curves Generated by Transformations of Logistic Variables”, Biometrika 69 (2): 461-465, <http://links.jstor.org/sici?sici=0006-3444%28198208%2969%3A2%3C461%3ASOFCGB%3E2.0.CO%3B2-Y> 
  4. ^ Tadikamalla, Pandu R. (1980), “A Look at the Burr and Related Distributions”, International Statistical Review 48 (3): 337-344, <http://links.jstor.org/sici?sici=0306-7734%28198012%2948%3A3%3C337%3AALATBA%3E2.0.CO%3B2-Z> 
  5. ^ McLaughlin, Michael P. (2001), A Compendium of Common Probability Distributions, p. A-37, <http://www.causascientia.org/math_stat/Dists/Compendium.pdf>. Retrieved on 15 February 2008 
  6. ^ Bennett, Steve (1983), “Log-Logistic Regression Models for Survival Data”, Applied Statistics 32 (2): 165-171, <http://links.jstor.org/sici?sici=0035-9254%281983%2932%3A2%3C165%3ALRMFSD%3E2.0.CO%3B2-F> 
  7. ^ Collett, Dave (2003), Modelling Survival Data in Medical Research (2nd ed.), CRC press, ISBN 1584883251 
  8. ^ Fisk, P.R. (1961), “The Graduation of Income Distributions”, Econometrica 29: 171-185, <http://links.jstor.org/sici?sici=0012-9682%28196104%2929%3A2%3C171%3ATGOID%3E2.0.CO%3B2-Y> 
  9. ^ a b Kleiber, C. & Kotz, S (2003), Statistical Size Distributions in Economics and Actuarial Sciences, Wiley, ISBN 0471150649 
  10. ^ a b c Hosking, Jonathan R. M. & Wallis, James R (1997), Regional Frequency Analysis: An Approach Based on L-Moments, Cambridge University Press, ISBN 0521430453 
  11. ^ Venter, Gary G. (Spring 1994), “Introduction to selected papers from the variability in reserves prize program”, Casualty Actuarial Society Forum 1: 91-101, <http://www.casact.org/pubs/forum/94spforum/94spf091.pdf> 
  12. ^ Geskus, Ronald B. (2001), “Methods for estimating the AIDS incubation time distribution when date of seroconversion is censored”, Statistics in Medicine 20 (5): 795-812, DOI 10.1002/sim.700 
  13. ^ a b Robson, A. & Reed, D. (1999), Flood Estimation Handbook, vol. 3: "Statistical Procedures for Flood Frequency Estimation", Wallingford, UK: Institute of Hydrology, ISBN 0948540893 
  14. ^ Ahmad, M. I.; Sinclair, C. D. & Werritty, A. (1988), “Log-logistic flood frequency analysis”, Journal of Hydrology 98: 205-224, DOI 10.1016/0022-1694(88)90015-7