Normal distribution

In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is^[1]^[2]^[3] $f (x) = \frac{1}{\sqrt{2 π σ^{2}}} e^{- \frac{(x - μ)^{2}}{2 σ^{2}}} .$

The parameter $μ$ is the mean or expectation of the distribution (and also its median and mode), while the parameter $σ^{2}$ is the variance. The standard deviation of the distribution is $σ$ (sigma). A random variable with a Gaussian distribution is said to be normally distributed and is called a normal deviate.
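As a minimal sketch, the density formula above can be evaluated directly. The function name `normal_pdf` and the default arguments are illustrative choices, not part of any particular library.

```python
import math

def normal_pdf(x: float, mu: float = 0.0, sigma: float = 1.0) -> float:
    """Density of a normal distribution with mean mu and standard deviation sigma."""
    if sigma <= 0:
        raise ValueError("sigma must be positive")
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

# The density peaks at the mean and is symmetric about it.
peak = normal_pdf(0.0)                      # equals 1 / sqrt(2 * pi)
left = normal_pdf(1.0, mu=3.0, sigma=2.0)   # one sigma below the mean
right = normal_pdf(5.0, mu=3.0, sigma=2.0)  # one sigma above the mean
```

Here `left` and `right` agree because the two points are equally far from the mean.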

Normal distributions are important in statistics and are often used in the natural and social sciences to represent real-valued random variables whose distributions are not known.^[4]^[5] Their importance is partly due to the central limit theorem. It states that, under some conditions, the average of many samples (observations) of a random variable with finite mean and variance is itself a random variable—whose distribution converges to a normal distribution as the number of samples increases. Therefore, physical quantities that are expected to be the sum of many independent processes, such as measurement errors, often have distributions that are nearly normal.^[6]

Moreover, Gaussian distributions have some unique properties that are valuable in analytic studies. For instance, any linear combination of a fixed collection of independent normal deviates is a normal deviate. Many results and methods, such as propagation of uncertainty and least squares^[7] parameter fitting, can be derived analytically in explicit form when the relevant variables are normally distributed.

A normal distribution is sometimes informally called a bell curve.^[8]^[9] However, many other distributions are bell-shaped (such as the Cauchy, Student's $t$, and logistic distributions). (For other names, see Naming.)

The univariate probability distribution is generalized for vectors in the multivariate normal distribution and for matrices in the matrix normal distribution.

Definitions

Standard normal distribution

The simplest case of a normal distribution is known as the standard normal distribution or unit normal distribution. This is a special case when $μ = 0$ and $σ^{2} = 1$, and it is described by this probability density function (or density):^[10] $φ (z) = \frac{e^{- z^{2} / 2}}{\sqrt{2 π}} .$ The variable $z$ has a mean of 0 and a variance and standard deviation of 1. The density $φ (z)$ has its peak $\frac{1}{\sqrt{2 π}}$ at $z = 0$ and inflection points at $z = + 1$ and $z = - 1$.

Although the density above is most commonly known as the standard normal, a few authors have used that term to describe other versions of the normal distribution. Carl Friedrich Gauss, for example, once defined the standard normal as $φ (z) = \frac{e^{- z^{2}}}{\sqrt{π}},$ which has a variance of $\frac{1}{2}$, and Stephen Stigler^[11] once defined the standard normal as $φ (z) = e^{- π z^{2}},$ which has a simple functional form and a variance of $σ^{2} = \frac{1}{2 π} .$

General normal distribution

Every normal distribution is a version of the standard normal distribution whose domain has been stretched by a factor $σ$ (the standard deviation) and then translated by $μ$ (the mean value): $f (x ∣ μ, σ^{2}) = \frac{1}{σ} φ (\frac{x - μ}{σ}) .$

The probability density must be scaled by $1 / σ$ so that the integral is still 1.

If $Z$ is a standard normal deviate, then $X = σ Z + μ$ will have a normal distribution with expected value $μ$ and standard deviation $σ$. This is equivalent to saying that the standard normal distribution $Z$ can be scaled/stretched by a factor of $σ$ and shifted by $μ$ to yield a different normal distribution, called $X$. Conversely, if $X$ is a normal deviate with parameters $μ$ and $σ^{2}$, then this $X$ distribution can be re-scaled and shifted via the formula $Z = (X - μ) / σ$ to convert it to the standard normal distribution. This variate is also called the standardized form of $X$.
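The two transformations can be illustrated with a short simulation. This is only a sketch: the helper names `from_standard` and `standardize`, the seed, the sample size, and the parameter values are arbitrary assumptions.

```python
import random
import statistics

def from_standard(z: float, mu: float, sigma: float) -> float:
    """Map a standard normal deviate Z to X = sigma * Z + mu."""
    return sigma * z + mu

def standardize(x: float, mu: float, sigma: float) -> float:
    """Map X back to its standardized form Z = (X - mu) / sigma."""
    return (x - mu) / sigma

rng = random.Random(0)          # fixed seed so the run is repeatable
mu, sigma = 10.0, 3.0
zs = [rng.gauss(0.0, 1.0) for _ in range(100_000)]
xs = [from_standard(z, mu, sigma) for z in zs]

sample_mean = statistics.fmean(xs)   # should be close to mu
sample_sd = statistics.pstdev(xs)    # should be close to sigma
# Standardizing the transformed values recovers the original deviates
# up to floating-point rounding.
round_trip_error = max(abs(standardize(x, mu, sigma) - z) for x, z in zip(xs, zs))
```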

Notation

The probability density of the standard Gaussian distribution (standard normal distribution, with zero mean and unit variance) is often denoted with the Greek letter $\phi$ (phi).^[12] The alternative form of the Greek letter phi, $\varphi$, is also used quite often.

The normal distribution is often referred to as $N (μ, σ^{2})$ or $𝒩 (μ, σ^{2})$.^[13] Thus when a random variable $X$ is normally distributed with mean $μ$ and standard deviation $σ$, one may write

$X \sim 𝒩 (μ, σ^{2}) .$

Alternative parameterizations

Some authors advocate using the precision $τ$ as the parameter defining the width of the distribution, instead of the standard deviation $σ$ or the variance $σ^{2}$. The precision is normally defined as the reciprocal of the variance, $τ = 1 / σ^{2}$.^[14] The formula for the distribution then becomes $f (x) = \sqrt{\frac{τ}{2 π}} e^{- τ (x - μ)^{2} / 2} .$

This choice is claimed to have advantages in numerical computations when $σ$ is very close to zero, and simplifies formulas in some contexts, such as in the Bayesian inference of variables with multivariate normal distribution.

Alternatively, the reciprocal of the standard deviation $τ^{'} = 1 / σ$ might be defined as the precision, in which case the expression of the normal distribution becomes $f (x) = \frac{τ^{'}}{\sqrt{2 π}} e^{- (τ^{'})^{2} (x - μ)^{2} / 2} .$

According to Stigler, this formulation is advantageous because of a much simpler and easier-to-remember formula, and simple approximate formulas for the quantiles of the distribution.
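The parameterizations above describe the same density. The following sketch checks this numerically at one point. The function names and test values are illustrative assumptions.

```python
import math

def pdf_sigma(x: float, mu: float, sigma: float) -> float:
    """Usual form, parameterized by the standard deviation sigma."""
    return math.exp(-((x - mu) ** 2) / (2 * sigma**2)) / math.sqrt(2 * math.pi * sigma**2)

def pdf_tau(x: float, mu: float, tau: float) -> float:
    """Precision form, with tau = 1 / sigma^2."""
    return math.sqrt(tau / (2 * math.pi)) * math.exp(-tau * (x - mu) ** 2 / 2)

def pdf_tau_prime(x: float, mu: float, tau_prime: float) -> float:
    """Alternative form, with tau' = 1 / sigma."""
    return tau_prime / math.sqrt(2 * math.pi) * math.exp(-(tau_prime**2) * (x - mu) ** 2 / 2)

sigma = 0.5
values = (
    pdf_sigma(0.3, 1.0, sigma),
    pdf_tau(0.3, 1.0, 1 / sigma**2),
    pdf_tau_prime(0.3, 1.0, 1 / sigma),
)
# All three entries of `values` agree up to rounding.
```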

Normal distributions form an exponential family with natural parameters $θ_{1} = \frac{μ}{σ^{2}}$ and $θ_{2} = \frac{- 1}{2 σ^{2}}$, and natural statistics $x$ and $x^{2}$. The dual expectation parameters for normal distribution are $η_{1} = μ$ and $η_{2} = μ^{2} + σ^{2}$.
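A small sketch of the conversions between the usual parameters and the natural and expectation parameters follows. The inverse map is obtained by solving the two definitions for $μ$ and $σ^{2}$, and the function names are hypothetical.

```python
def to_natural(mu: float, sigma2: float) -> tuple[float, float]:
    """(mu, sigma^2) -> (theta1, theta2) = (mu / sigma^2, -1 / (2 sigma^2))."""
    return mu / sigma2, -1.0 / (2.0 * sigma2)

def from_natural(theta1: float, theta2: float) -> tuple[float, float]:
    """Invert to_natural: sigma^2 = -1 / (2 theta2), mu = theta1 * sigma^2."""
    if theta2 >= 0:
        raise ValueError("theta2 must be negative")
    sigma2 = -1.0 / (2.0 * theta2)
    return theta1 * sigma2, sigma2

def to_expectation(mu: float, sigma2: float) -> tuple[float, float]:
    """(mu, sigma^2) -> (eta1, eta2) = (E[x], E[x^2]) = (mu, mu^2 + sigma^2)."""
    return mu, mu * mu + sigma2

theta = to_natural(2.0, 4.0)      # (0.5, -0.125)
back = from_natural(*theta)       # (2.0, 4.0)
eta = to_expectation(2.0, 4.0)    # (2.0, 8.0)
```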

Cumulative distribution function

The cumulative distribution function (CDF) of the standard normal distribution, usually denoted with the capital Greek letter $Φ$, is the integral $Φ (x) = \frac{1}{\sqrt{2 π}} \int_{- \infty}^{x} e^{- t^{2} / 2} d t .$

Error function

The related error function $\erf (x)$ gives the probability that a random variable with a normal distribution of mean 0 and variance 1/2 falls in the range $[- x, x]$. That is: $\erf (x) = \frac{1}{\sqrt{π}} \int_{- x}^{x} e^{- t^{2}} d t = \frac{2}{\sqrt{π}} \int_{0}^{x} e^{- t^{2}} d t .$

These integrals cannot be expressed in terms of elementary functions, and are often said to be special functions. However, many numerical approximations are known; see below for more.

The two functions are closely related, namely $Φ (x) = \frac{1}{2} [1 + \erf (\frac{x}{\sqrt{2}})] .$

For a generic normal distribution with density $f$, mean $μ$ and variance $σ^{2}$, the cumulative distribution function is $F (x) = Φ (\frac{x - μ}{σ}) = \frac{1}{2} [1 + \erf (\frac{x - μ}{σ \sqrt{2}})] .$
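The relation to the error function gives a direct way to evaluate the CDF numerically. The sketch below uses `math.erf` from the Python standard library. The function names are illustrative.

```python
import math

def std_normal_cdf(x: float) -> float:
    """Phi(x) = (1 + erf(x / sqrt(2))) / 2."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def normal_cdf(x: float, mu: float, sigma: float) -> float:
    """F(x) = Phi((x - mu) / sigma) for a normal distribution with sigma > 0."""
    return std_normal_cdf((x - mu) / sigma)

at_mean = normal_cdf(7.0, mu=7.0, sigma=2.5)   # half the mass lies below the mean
upper = std_normal_cdf(1.96)                   # roughly 0.975
```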

The complement of the standard normal cumulative distribution function, $Q (x) = 1 - Φ (x)$, is often called the Q-function, especially in engineering texts.^[15]^[16] It gives the probability that the value of a standard normal random variable $X$ will exceed $x$: $P (X > x)$. Other definitions of the $Q$-function, all of which are simple transformations of $Φ$, are also used occasionally.^[17]
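In floating-point arithmetic, computing $1 - Φ (x)$ literally loses all precision far in the upper tail. One common workaround, sketched here, is to use the complementary error function through the identity $Q (x) = \frac{1}{2} \operatorname{erfc} (x / \sqrt{2})$, which follows from the relation between $Φ$ and $\erf$. The function name `q_function` is an illustrative choice.

```python
import math

def q_function(x: float) -> float:
    """Q(x) = 1 - Phi(x), evaluated as erfc(x / sqrt(2)) / 2."""
    return 0.5 * math.erfc(x / math.sqrt(2.0))

# Naive evaluation: erf saturates to 1.0 in double precision, so the
# difference collapses to zero well before the true tail probability does.
naive_tail = 1.0 - 0.5 * (1.0 + math.erf(10.0 / math.sqrt(2.0)))
stable_tail = q_function(10.0)   # tiny but nonzero, on the order of 1e-23
```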

The graph of the standard normal cumulative distribution function $Φ$ has 2-fold rotational symmetry around the point (0,1/2); that is, $Φ (- x) = 1 - Φ (x)$. Its antiderivative (indefinite integral) can be expressed as follows: $\int Φ (x) d x = x Φ (x) + φ (x) + C .$

The cumulative distribution function of the standard normal distribution can be expanded by integration by parts into a series: $Φ (x) = \frac{1}{2} + \frac{1}{\sqrt{2 π}} \cdot e^{- x^{2} / 2} [x + \frac{x^{3}}{3} + \frac{x^{5}}{3 \cdot 5} + \dots + \frac{x^{2 n + 1}}{(2 n + 1)!!} + \dots],$ where $!!$ denotes the double factorial.
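The series can be summed term by term, since each term is the previous one multiplied by $x^{2} / (2 n + 1)$. The following is a sketch only: the stopping tolerance, the term limit, and the name `cdf_series` are assumptions, and the result is compared against the `math.erf` form as a sanity check.

```python
import math

def cdf_series(x: float, tol: float = 1e-16, max_terms: int = 500) -> float:
    """Phi(x) from the double-factorial series, summed until terms are negligible."""
    term = x        # x^(2n+1) / (2n+1)!! for n = 0
    total = x
    for n in range(1, max_terms):
        term *= x * x / (2 * n + 1)   # ratio of consecutive terms
        total += term
        if abs(term) <= tol * abs(total):
            break
    return 0.5 + math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi) * total

def cdf_erf(x: float) -> float:
    """Reference value via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

max_diff = max(abs(cdf_series(x) - cdf_erf(x)) for x in (-3.0, -0.5, 0.0, 2.0))
```

All terms of the bracketed sum share the sign of $x$, so there is no cancellation inside the sum itself.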

An asymptotic expansion of the cumulative distribution function for large $x$ can also be derived using integration by parts.^[18]

A quick approximation to the standard normal distribution's cumulative distribution function can be found by using a Taylor series approximation: $Φ (x) \approx \frac{1}{2} + \frac{1}{\sqrt{2 π}} \sum_{k = 0}^{n} \frac{(- 1)^{k} x^{(2 k + 1)}}{2^{k} k! (2 k + 1)} .$
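As a sketch, the truncated sum can be coded directly from the formula. The name `cdf_taylor` is illustrative. Because the terms alternate in sign, this form is best suited to small $|x|$. For larger arguments the partial sums may suffer from cancellation.

```python
import math

def cdf_taylor(x: float, n: int) -> float:
    """Approximate Phi(x) with terms k = 0..n of the Taylor series about 0."""
    total = 0.0
    for k in range(n + 1):
        total += (-1) ** k * x ** (2 * k + 1) / (2**k * math.factorial(k) * (2 * k + 1))
    return 0.5 + total / math.sqrt(2.0 * math.pi)

reference = 0.5 * (1.0 + math.erf(1.0 / math.sqrt(2.0)))   # Phi(1) via erf
error_n2 = abs(cdf_taylor(1.0, 2) - reference)     # three terms
error_n10 = abs(cdf_taylor(1.0, 10) - reference)   # eleven terms
```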

Recursive computation with Taylor series expansion

The recursive nature of the $e^{a x^{2}}$ family of derivatives may be used to easily construct a rapidly converging Taylor series expansion using recursive entries about any point of known value of the distribution, $Φ (x_{0})$ : $Φ (x) = \sum_{n = 0}^{\infty} \frac{Φ^{(n)} (x_{0})}{n!} (x - x_{0})^{n},$ where: $\begin{aligned} Φ^{(0)} (x_{0}) & = \frac{1}{\sqrt{2 π}} \int_{- \infty}^{x_{0}} e^{- t^{2} / 2} d t \\ Φ^{(1)} (x_{0}) & = \frac{1}{\sqrt{2 π}} e^{- x_{0}^{2} / 2} \\ Φ^{(n)} (x_{0}) & = - (x_{0} Φ^{(n - 1)} (x_{0}) + (n - 2) Φ^{(n - 2)} (x_{0})), & n \geq 2 . \end{aligned}$
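The recurrence lends itself to a short loop that carries the two most recent derivatives forward. This is a sketch under stated assumptions: the function name, the fixed number of terms, and the use of `math.erf` to supply the known value $Φ (x_{0})$ are all illustrative choices.

```python
import math

def cdf_taylor_about(x: float, x0: float, cdf_x0: float, n_terms: int = 40) -> float:
    """Taylor expansion of Phi about x0, given the known value cdf_x0 = Phi(x0)."""
    d_prev2 = cdf_x0                                              # Phi^(0)(x0)
    d_prev1 = math.exp(-0.5 * x0 * x0) / math.sqrt(2 * math.pi)   # Phi^(1)(x0)
    h = x - x0
    total = d_prev2 + d_prev1 * h
    h_pow_over_fact = h                   # h^n / n! for n = 1
    for n in range(2, n_terms):
        # Phi^(n)(x0) = -(x0 * Phi^(n-1)(x0) + (n - 2) * Phi^(n-2)(x0))
        d_n = -(x0 * d_prev1 + (n - 2) * d_prev2)
        h_pow_over_fact *= h / n
        total += d_n * h_pow_over_fact
        d_prev2, d_prev1 = d_prev1, d_n
    return total

def cdf_erf(x: float) -> float:
    """Reference value via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

approx = cdf_taylor_about(1.5, x0=1.0, cdf_x0=cdf_erf(1.0))
difference = abs(approx - cdf_erf(1.5))
```

The fixed `n_terms` is adequate for moderate $|x - x_{0}|$. A more careful version would stop adaptively.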

Using the Taylor series and Newton's method for the inverse function

An application of the above Taylor series expansion is to use Newton's method to reverse the computation. That is, if we have a value for the cumulative distribution function, $Φ (x)$, but do not know the $x$ needed to obtain it, we can use Newton's method to find $x$, and use the Taylor series expansion above to minimize the number of computations. Newton's method is well suited to this problem because the first derivative of $Φ (x)$ is the standard normal density $φ (x)$, which is readily available in closed form.

To solve, select a known approximate solution, $x_{0}$, to the desired $Φ (x)$. $x_{0}$ may be a value from a distribution table, or an intelligent estimate followed by a computation of $Φ (x_{0})$ using any desired means to compute. Use this value of $x_{0}$ and the Taylor series expansion above to minimize computations.

Repeat the following process until the difference between the computed $Φ (x_{n})$ and the desired value, which we will call $Φ (desired)$, is below a chosen acceptably small error, such as 10⁻⁵, 10⁻¹⁵, etc.: $x_{n + 1} = x_{n} - \frac{Φ (x_{n}, x_{0}, Φ (x_{0})) - Φ (desired)}{Φ^{'} (x_{n})},$ where $Φ (x, x_{0}, Φ (x_{0}))$ is the $Φ (x)$ from a Taylor series solution using $x_{0}$ and $Φ (x_{0})$, and $Φ^{'} (x_{n}) = \frac{1}{\sqrt{2 π}} e^{- x_{n}^{2} / 2} .$

When the repeated computations converge to an error below the chosen acceptably small value, $x$ will be the value needed to obtain a $Φ (x)$ of the desired value, $Φ (desired)$.
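The procedure can be sketched as follows, with the Taylor expansion about a fixed $x_{0}$ supplying $Φ (x_{n})$ at each Newton step. The function names, the default starting point $x_{0} = 0$ with $Φ (0) = 1/2$, the tolerance, and the fixed number of Taylor terms are assumptions made for illustration. With a fixed term count the expansion is only reliable for moderate $|x - x_{0}|$.

```python
import math

def phi(x: float) -> float:
    """Standard normal density, which is the derivative of Phi."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def cdf_taylor_about(x: float, x0: float, cdf_x0: float, n_terms: int = 60) -> float:
    """Phi(x) from the recursive Taylor expansion about x0, given Phi(x0)."""
    d_prev2, d_prev1 = cdf_x0, phi(x0)
    h = x - x0
    total = d_prev2 + d_prev1 * h
    h_pow_over_fact = h
    for n in range(2, n_terms):
        d_n = -(x0 * d_prev1 + (n - 2) * d_prev2)
        h_pow_over_fact *= h / n
        total += d_n * h_pow_over_fact
        d_prev2, d_prev1 = d_prev1, d_n
    return total

def inverse_cdf_newton(p: float, x0: float = 0.0, cdf_x0: float = 0.5,
                       tol: float = 1e-12, max_iter: int = 50) -> float:
    """Find x with Phi(x) = p by Newton's method, starting from x0."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    x = x0
    for _ in range(max_iter):
        error = cdf_taylor_about(x, x0, cdf_x0) - p
        if abs(error) < tol:
            return x
        x -= error / phi(x)   # Newton step with Phi'(x) = phi(x)
    raise RuntimeError("Newton iteration did not converge")

upper_quantile = inverse_cdf_newton(0.975)   # close to 1.96
lower_quantile = inverse_cdf_newton(0.1)     # close to -1.28
```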

Standard deviation and coverage

Figure (standard deviation diagram): For the normal distribution, the values less than one standard deviation from the mean account for 68.27% of the set, while two standard deviations from the mean account for 95.45%, and three standard deviations account for 99.73%.

About 68% of values drawn from a normal distribution are within one standard deviation $σ$ from the mean; about 95% of the values lie within two standard deviations; and about 99.7% are within three standard deviations.^[8] This is known as the 68–95–99.7 (empirical) rule, or the 3-sigma rule.
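These percentages can be reproduced numerically. The sketch below assumes the identity that the probability of falling within $n$ standard deviations of the mean equals $\erf (n / \sqrt{2})$ and uses `math.erf` from the Python standard library. The helper name `coverage` is illustrative.

```python
import math

def coverage(n: float) -> float:
    """Probability that a normal deviate lies within n standard deviations of its mean."""
    return math.erf(n / math.sqrt(2.0))

# Prints values that round to 68.27%, 95.45% and 99.73%.
for n in (1, 2, 3):
    print(f"within {n} sigma: {100 * coverage(n):.2f}%")
```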

More precisely, the probability that a normal deviate lies in the range between $μ - n σ$ and $μ + n σ$ is given by $F (μ + n σ) - F (μ - n σ) = Φ (n) - Φ (- n) = \erf (\frac{n}{\sqrt{2}}) .$ To 12 significant digits, the values for $n = 1, 2, \dots, 6$ are:

$n$

p = F (μ + n σ) - F (μ - n σ)

1 - p

or 1 in (1 - p)

OEIS

1