97.5th percentile point

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Template:Short description Template:Use dmy dates

File:NormalDist1.96.svg
95% of the area under the normal distribution lies within 1.96 standard deviations (σ for standard deviation) away from the mean.

In probability and statistics, the 97.5th percentile point of the standard normal distribution is a number commonly used for statistical calculations. The approximate value of this number is 1.96, meaning that 95% of the area under a normal curve lies within approximately 1.96 standard deviations of the mean. Because of the central limit theorem, this number is used in the construction of approximate 95% confidence intervals. Its ubiquity is due to the arbitrary but common convention of using confidence intervals with 95% probability in science and frequentist statistics, though other probabilities (90%, 99%, etc.) are sometimes used.[1][2][3][4] This convention seems particularly common in medical statistics,[5][6][7] but is also common in other areas of application, such as earth sciences,[8] social sciences and business research.[9]

There is no single accepted name for this number; it is also commonly referred to as the "standard normal deviate", "normal score" or "Z score" for the 97.5 percentile point, the .975 point, or just its approximate value, 1.96.

If X has a standard normal distribution, i.e. X ~ N(0,1),

P(X>1.96)0.025,
P(X<1.96)0.975,

and as the normal distribution is symmetric,

P(1.96<X<1.96)0.95.

One notation for this number is z.975.[10] From the probability density function of the standard normal distribution, the exact value of z.975 is determined by

12πz.975z.975ex2/2dx=0.95.

Its square, about 3.84146, is the 95th percentile point of a chi-squared distribution with 1 degree of freedom, often used for testing 2×2 contingency tables.

History

File:Youngronaldfisher2.JPG
Ronald Fisher

The use of this number in applied statistics can be traced to the influence of Ronald Fisher's classic textbook, Statistical Methods for Research Workers, first published in 1925:

<templatestyles src="Template:Blockquote/styles.css" />

"The value for which P = .05, or 1 in 20, is 1.96 or nearly 2; it is convenient to take this point as a limit in judging whether a deviation is to be considered significant or not."[11]

Script error: No such module "Check for unknown parameters".

In Table 1 of the same work, he gave the more precise value 1.959964.[12] In 1970, the value truncated to 20 decimal places was calculated to be

1.95996 39845 40054 23552...[13][14]

The commonly used approximate value of 1.96 is therefore accurate to better than one part in 50,000, which is more than adequate for applied work.

Some people even use the value of 2 in the place of 1.96, reporting a 95.4% confidence interval as a 95% confidence interval. This is not recommended but is occasionally seen.[15]

Software functions

The inverse of the standard normal cumulative distribution function (CDF) can be used to compute the value. The following is a table of function calls that return 1.96 in some commonly used applications:

Application Function call
Excel NORM.S.INV(0.975)
MATLAB norminv(0.975)
R qnorm(0.975)
Python (SciPy) scipy.stats.norm.ppf(0.975)
SAS probit(0.025);
SPSS x = COMPUTE IDF.NORMAL(0.975,0,1).
Stata invnormal(0.975)
Wolfram Language (Mathematica) InverseCDF[NormalDistribution[0, 1], 0.975][16][17]

See also

References

<templatestyles src="Reflist/styles.css" />

  1. Script error: No such module "citation/CS1".
  2. Script error: No such module "citation/CS1".
  3. Script error: No such module "citation/CS1".
  4. Script error: No such module "Citation/CS1".
  5. Script error: No such module "citation/CS1".
  6. Script error: No such module "citation/CS1".
  7. Script error: No such module "citation/CS1".
  8. Script error: No such module "citation/CS1".
  9. Script error: No such module "citation/CS1".
  10. Script error: No such module "citation/CS1".
  11. Script error: No such module "citation/CS1".
  12. Script error: No such module "citation/CS1"., Table 1
  13. Script error: No such module "citation/CS1".
  14. Script error: No such module "citation/CS1".Script error: No such module "Check for unknown parameters".
  15. Script error: No such module "citation/CS1".
  16. InverseCDF, Wolfram Language Documentation Center.
  17. NormalDistribution, Wolfram Language Documentation Center.

Script error: No such module "Check for unknown parameters".

Further reading

  • Script error: No such module "citation/CS1".