Tukey lambda distribution

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Template:Short description Template:Probability distribution Formalized by John Tukey, the Tukey lambda distribution is a continuous, symmetric probability distribution defined in terms of its quantile function. It is typically used to identify an appropriate distribution (see the comments below) and not used in statistical models directly.

The Tukey lambda distribution has a single shape parameter, Template:Mvar, and as with other probability distributions, it can be transformed with a location parameter, Template:Mvar, and a scale parameter, Template:Mvar. Since the general form of probability distribution can be expressed in terms of the standard distribution, the subsequent formulas are given for the standard form of the function.

Quantile function

For the standard form of the Tukey lambda distribution, the quantile function, Q(p), (i.e. the inverse function to the cumulative distribution function) and the quantile density function, q= dQ dp , are

 Q( p ;λ )={1 λ [ pλ(1p)λ ] ,  if  λ0,ln(p 1p ),  if  λ=0.


q( p ;λ )= dQ dp=pλ1+( 1p )λ1.

For most values of the shape parameter, Template:Mvar, the probability density function (PDF) and cumulative distribution function (CDF) must be computed numerically. The Tukey lambda distribution has a simple, closed form for the CDF and / or PDF only for a few exceptional values of the shape parameter, for example: Template:Mvar Script error: No such module "Check for unknown parameters". { 2, 1, Template:SfracScript error: No such module "Check for unknown parameters"., 0 } (see uniform distribution [ cases λ = 1 Script error: No such module "Check for unknown parameters". and λ = 2 Script error: No such module "Check for unknown parameters". ] and the logistic distribution [ case λ = 0 Script error: No such module "Check for unknown parameters". ].

However, for any value of Template:Mvar both the CDF and PDF can be tabulated for any number of cumulative probabilities, Template:Mvar, using the quantile function Template:Mvar to calculate the value Template:Mvar, for each cumulative probability Template:Mvar, with the probability density given by Template:Sfrac, the reciprocal of the quantile density function. As is the usual case with statistical distributions, the Tukey lambda distribution can readily be used by looking up values in a prepared table.

Moments

The Tukey lambda distribution is symmetric around zero, therefore the expected value of this distribution, if it exists, is equal to zero. The variance exists for λ > −Template:Sfrac Script error: No such module "Check for unknown parameters". Script error: No such module "Check for unknown parameters". , and except when λ = 0 Script error: No such module "Check for unknown parameters". , is given by the formula

Var[ X ]=2λ2( 1 1+2λ  Γ(λ+1)2  Γ(2λ+2)  ).

More generally, the Template:Mvar-th order moment is finite when λ > Template:Sfrac Script error: No such module "Check for unknown parameters". and is expressed (except when λ = 0 Script error: No such module "Check for unknown parameters". ) in terms of the beta function Β(x,y) Script error: No such module "Check for unknown parameters". :

μnE[ Xn ]=1λnk=0n (1)k (nk) B( λ k+1 , (nk) λ+1 ).

Due to symmetry of the density function, all moments of odd orders, if they exist, are equal to zero.

L-moments

Differently from the central moments, L-moments can be expressed in a closed form. For λ>1 , the  rth L-moment,  r , is given by[1]

r= 1+(1)r λ k=0r1 (1)r1k (r1k) (r+k1k) (1 k+1+λ )=(1+(1)r) Γ(1+λ) Γ(r1λ)  Γ(1λ) Γ(r+1+λ) .

The first six L-moments can be presented as follows:[1]

1=0 ,
2=2 λ  [ 1 1+λ +2 2+λ  ] ,
3=0 ,
4=2 λ  [1 1+λ +12 2+λ 30 3+λ +20 4+λ  ] ,
5=0 ,
6=2 λ  [ 1 1+λ +30 2+λ 210 3+λ +560 4+λ 630 5+λ +252 6+λ  ].

Comments

Probability density plots of Tukey lambda distributions
Probability density plots of Tukey lambda distributions

The Tukey lambda distribution is actually a family of distributions that can approximate a number of common distributions. For example,

λ ≈ −1 Script error: No such module "Check for unknown parameters". approx. Cauchy C( 0, π ) Script error: No such module "Check for unknown parameters".
λ = 0 Script error: No such module "Check for unknown parameters". exactly logistic
λ ≈ 0.14 Script error: No such module "Check for unknown parameters". approx. normal N( 0, 2.142± ) Script error: No such module "Check for unknown parameters".
λ = Template:Sfrac Script error: No such module "Check for unknown parameters". Script error: No such module "Check for unknown parameters". strictly concave (-shaped)
λ = 1 Script error: No such module "Check for unknown parameters". exactly uniform U( −1, +1 ) Script error: No such module "Check for unknown parameters".
λ = 2 Script error: No such module "Check for unknown parameters". exactly uniform U(Template:Sfrac Script error: No such module "Check for unknown parameters". , +Template:Sfrac Script error: No such module "Check for unknown parameters". ) Script error: No such module "Check for unknown parameters".

The most common use of this distribution is to generate a Tukey lambda PPCC plot of a data set. Based on the value for Template:Mvar with best correlation, as shown on the PPCC plot, an appropriate model for the data is suggested. For example, if the best-fit of the curve to the data occurs for a value of Template:Mvar at or near 0.14Script error: No such module "Check for unknown parameters"., then empirically the data could be well-modeled with a normal distribution. Values of Template:Mvar less than 0.14 suggests a heavier-tailed distribution.

A milepost at λ = 0 Script error: No such module "Check for unknown parameters". (logistic) would indicate quite fat tails, with the extreme limit at λ = −1 Script error: No such module "Check for unknown parameters". , approximating Cauchy and small sample versions of the [[Student's t distribution|Student's Template:Mvar]]. That is, as the best-fit value of Template:Mvar varies from thin tails at 0.14Script error: No such module "Check for unknown parameters". towards fat tails −1Script error: No such module "Check for unknown parameters"., a bell-shaped PDF with increasingly heavy tails is suggested. Similarly, an optimal curve-fit value of Template:Mvar greater than 0.14Script error: No such module "Check for unknown parameters". suggests a distribution with exceptionally thin tails (based on the point of view that the normal distribution itself is thin-tailed to begin with; the exponential distribution is often chosen as the exemplar of tails intermediate between fat and thin).

Except for values of Template:Mvar approaching 0Script error: No such module "Check for unknown parameters". and those below, all the PDF functions discussed have finite support, between   Template:Sfrac   and   Template:Sfrac .

Since the Tukey lambda distribution is a symmetric distribution, the use of the Tukey lambda PPCC plot to determine a reasonable distribution to model the data only applies to symmetric distributions. A histogram of the data should provide evidence as to whether the data can be reasonably modeled with a symmetric distribution.[2]

References

<templatestyles src="Reflist/styles.css" />

  1. a b Script error: No such module "Citation/CS1".
  2. Script error: No such module "Citation/CS1".

Script error: No such module "Check for unknown parameters".

External links

  • Script error: No such module "citation/CS1".

Template:Include-USGov Template:ProbDistributions