Probability-generating function: Difference between revisions

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search
imported>David Eppstein
 
imported>DreamRimmer bot II
m Bot: Implementing outcome of RfC: converting list-defined references from {{reflist|refs=…}} to <references>…</references> for VisualEditor compatibility
 
Line 34: Line 34:
Probability generating functions are particularly useful for dealing with functions of [[statistical independence|independent]] random variables. For example:
Probability generating functions are particularly useful for dealing with functions of [[statistical independence|independent]] random variables. For example:


* If <math>X_i, i=1,2,\cdots,N</math> is a sequence of independent (and not necessarily identically distributed) random variables that take on natural-number values, and
{{bullet list
| If <math>X_i, i=1,2,\cdots,N</math> is a sequence of independent (and not necessarily identically distributed) random variables that take on natural-number values, and
<math display="block">S_N = \sum_{i=1}^N a_i X_i,</math> where the <math>a_i</math> are constant natural numbers, then the probability generating function is given by
<math display="block">S_N = \sum_{i=1}^N a_i X_i,</math> where the <math>a_i</math> are constant natural numbers, then the probability generating function is given by
<math display="block">G_{S_N}(z) = \operatorname{E}(z^{S_N}) = \operatorname{E} \left( z^{\sum_{i=1}^N a_i X_i,} \right) = G_{X_1}( z^{a_1})G_{X_2}(z^{a_2})\cdots G_{X_N}(z^{a_N}).</math>
<math display="block">G_{S_N}(z) = \operatorname{E}(z^{S_N}) = \operatorname{E} \left( z^{\sum_{i=1}^N a_i X_i,} \right) = G_{X_1}( z^{a_1})G_{X_2}(z^{a_2})\cdots G_{X_N}(z^{a_N}).</math>
* In particular, if <math>X</math> and <math>Y</math> are independent random variables:
| In particular, if <math>X</math> and <math>Y</math> are independent random variables:
<math display="block">G_{X+Y}(z) = G_X(z) \cdot G_Y(z)</math> and
<math display="block">G_{X+Y}(z) = G_X(z) \cdot G_Y(z)</math> and
<math display="block">G_{X-Y}(z) = G_X(z) \cdot G_Y(1/z).</math>
<math display="block">G_{X-Y}(z) = G_X(z) \cdot G_Y(1/z).</math>
* In the above, the number <math>N</math> of independent random variables in the sequence is fixed. Assume <math>N</math> is discrete random variable taking values on the non-negative integers, which is independent of the <math>X_i</math>, and consider the probability generating function <math>G_N</math>.  If the <math>X_i</math> are not only independent but also identically distributed with common probability generating function <math>G_X = G_{X_i}</math>, then
| In the above, the number <math>N</math> of independent random variables in the sequence is fixed. Assume <math>N</math> is discrete random variable taking values on the non-negative integers, which is independent of the <math>X_i</math>, and consider the probability generating function <math>G_N</math>.  If the <math>X_i</math> are not only independent but also identically distributed with common probability generating function <math>G_X = G_{X_i}</math>, then
<math display="block">G_{S_N}(z) = G_N(G_X(z)).</math> This can be seen, using the [[law of total expectation]], as follows:
<math display="block">G_{S_N}(z) = G_N(G_X(z)).</math> This can be seen, using the [[law of total expectation]], as follows:
<math display="block">
<math display="block">
Line 49: Line 50:
</math>
</math>
This last fact is useful in the study of [[Galton&ndash;Watson process]]es and [[compound Poisson process]]es.
This last fact is useful in the study of [[Galton&ndash;Watson process]]es and [[compound Poisson process]]es.
* When the <math>X_i</math> are not supposed identically distributed (but still independent and independent of <math>N</math>), we have
| When the <math>X_i</math> are not supposed identically distributed (but still independent and independent of <math>N</math>), we have
<math display="block">G_{S_N}(z) = \sum_{n \ge 1} f_n \prod_{i=1}^n G_{X_i}(z),</math> where <math>f_n = \Pr(N=n).</math> For identically distributed <math>X_i</math>s, this simplifies to the identity stated before, but the general case is sometimes useful to obtain a decomposition of <math>S_N</math> by means of generating functions.
<math display="block">G_{S_N}(z) = \sum_{n \ge 1} f_n \prod_{i=1}^n G_{X_i}(z),</math> where <math>f_n = \Pr(N=n).</math> For identically distributed <math>X_i</math>s, this simplifies to the identity stated before, but the general case is sometimes useful to obtain a decomposition of <math>S_N</math> by means of generating functions.
}}


==Examples==
==Examples==


* The probability generating function of an almost surely [[degenerate distribution|constant random variable]], i.e. one with <math>Pr(X=c) = 1</math> and <math>Pr(X\neq c) = 0</math> is <math display="block">G(z) = z^c. </math>
* The probability generating function of an almost surely [[degenerate distribution|constant random variable]], i.e. one with <math>\Pr(X=c) = 1</math> and <math>\Pr(X\neq c) = 0</math> is <math display="block">G(z) = z^c. </math>
* The probability generating function of a [[binomial distribution|binomial random variable]], the number of successes in <math>n</math> trials, with probability <math>p</math> of success in each trial, is <math display="block">G(z) = \left[(1-p) + pz\right]^n. </math> '''Note''': it is the <math>n</math>-fold product of the probability generating function of a [[Bernoulli distribution|Bernoulli random variable]] with parameter <math>p</math>. {{pb}}  So the probability generating function of a [[fair coin]], is <math display="block">G(z) = 1/2 + z/2. </math>
* The probability generating function of a [[binomial distribution|binomial random variable]], the number of successes in <math>n</math> trials, with probability <math>p</math> of success in each trial, is <math display="block">G(z) = \left[(1-p) + pz\right]^n. </math> '''Note''': it is the <math>n</math>-fold product of the probability generating function of a [[Bernoulli distribution|Bernoulli random variable]] with parameter <math>p</math>. {{pb}}  So the probability generating function of a [[fair coin]], is <math display="block">G(z) = \frac{1}{2} + \frac{z}{2}. </math>
* The probability generating function of a [[negative binomial distribution|negative binomial random variable]] on <math>\{0,1,2 \cdots\}</math>, the number of failures until the <math>r^{th}</math> success with probability of success in each trial <math>p</math>, is <math display="block">G(z) = \left(\frac{p}{1 - (1-p)z}\right)^r,</math> which converges for <math>|z| < \frac{1}{1-p}</math>. {{pb}} '''Note''' that this is the <math>r</math>-fold product of the probability generating function of a [[geometric distribution|geometric random variable]] with parameter <math>1-p</math> on <math>\{0,1,2,\cdots\}</math>.
* The probability generating function of a [[negative binomial distribution|negative binomial random variable]] on <math>\{0,1,2 \cdots\}</math>, the number of failures until the <math>r^{th}</math> success with probability of success in each trial <math>p</math>, is <math display="block">G(z) = \left(\frac{p}{1 - (1-p)z}\right)^r,</math> which converges for <math>|z| < \frac{1}{1-p}</math>. {{pb}} '''Note''' that this is the <math>r</math>-fold product of the probability generating function of a [[geometric distribution|geometric random variable]] with parameter <math>1-p</math> on <math>\{0,1,2,\cdots\}</math>.
* The probability generating function of a [[Poisson distribution|Poisson random variable]] with rate parameter <math>\lambda</math> is <math display="block">G(z) = e^{\lambda(z - 1)}.</math>
* The probability generating function of a [[Poisson distribution|Poisson random variable]] with rate parameter <math>\lambda</math> is <math display="block">G(z) = e^{\lambda(z - 1)}.</math>
Line 76: Line 78:


==Notes==
==Notes==
{{reflist|refs=
<references></references>
 
}}


==References==
==References==
 
{{refbegin}}
* {{ cite book | last1 = Johnson | first1 = Norman Lloyd | last2 = Kotz | first2 = Samuel | last3 = Kemp | first3 = Adrienne W. |author3-link=Adrienne W. Kemp| title = Univariate Discrete Distributions | date = 1992 | publisher = J. Wiley & Sons | isbn = 978-0-471-54897-3 | edition = 2nd | series = Wiley series in probability and mathematical statistics | location = New York }}
* {{ cite book | last1 = Johnson | first1 = Norman Lloyd | last2 = Kotz | first2 = Samuel | last3 = Kemp | first3 = Adrienne W. |author3-link=Adrienne W. Kemp| title = Univariate Discrete Distributions | date = 1992 | publisher = J. Wiley & Sons | isbn = 978-0-471-54897-3 | edition = 2nd | series = Wiley series in probability and mathematical statistics | location = New York }}
{{refend}}


{{Theory of probability distributions}}
{{Theory of probability distributions}}

Latest revision as of 11:08, 18 December 2025

Template:Short description In probability theory, the probability generating function of a discrete random variable is a power series representation (the generating function) of the probability mass function of the random variable. Probability generating functions are often employed for their succinct description of the sequence of probabilities Pr(X = i) in the probability mass function for a random variable X, and to make available the well-developed theory of power series with non-negative coefficients.

Definition

Univariate case

If X is a discrete random variable taking values x in the non-negative integers {0,1, ...}, then the probability generating function of X is defined as [1]

G(z)=E(zX)=x=0p(x)zx, where p is the probability mass function of X. Note that the subscripted notations GX and pX are often used to emphasize that these pertain to a particular random variable X, and to its distribution. The power series converges absolutely at least for all complex numbers z with |z|<1; the radius of convergence being often larger.

Multivariate case

If X = (X1,...,Xd)Script error: No such module "Check for unknown parameters". is a discrete random variable taking values (x1, ..., xd)Script error: No such module "Check for unknown parameters". in the Template:Mvar-dimensional non-negative integer lattice {0,1, ...}dScript error: No such module "Check for unknown parameters"., then the probability generating function of XScript error: No such module "Check for unknown parameters". is defined as G(z)=G(z1,,zd)=E(z1X1zdXd)=x1,,xd=0p(x1,,xd)z1x1zdxd, where Template:Mvar is the probability mass function of Template:Mvar. The power series converges absolutely at least for all complex vectors z=(z1,...zd)d with max{|z1|,...,|zd|}1.

Properties

Power series

Probability generating functions obey all the rules of power series with non-negative coefficients. In particular, G(1)=1, where G(1)=limx1,x<1G(x), x approaching 1 from below, since the probabilities must sum to one. So the radius of convergence of any probability generating function must be at least 1, by Abel's theorem for power series with non-negative coefficients.

Probabilities and expectations

The following properties allow the derivation of various basic quantities related to X:

  1. The probability mass function of X is recovered by taking derivatives of G, p(k)=Pr(X=k)=G(k)(0)k!.
  2. It follows from Property 1 that if random variables X and Y have probability-generating functions that are equal, GX=GY, then pX=pY. That is, if X and Y have identical probability-generating functions, then they have identical distributions.
  3. The normalization of the probability mass function can be expressed in terms of the generating function by E[1]=G(1)=i=0p(i)=1. The expectation of X is given by E[X]=G(1). More generally, the kthfactorial moment, E[X(X1)(Xk+1)] of X is given by E[X!(Xk)!]=G(k)(1),k0. So the variance of X is given by Var(X)=G(1)+G(1)[G(1)]2. Finally, the Template:Mvar-th raw moment of X is given by E[Xk]=(zz)kG(z)|z=1
  4. GX(et)=MX(t) where X is a random variable, GX(t) is the probability generating function (of X) and MX(t) is the moment-generating function (of X).

Functions of independent random variables

Probability generating functions are particularly useful for dealing with functions of independent random variables. For example:

Template:Bullet list

Examples

  • The probability generating function of an almost surely constant random variable, i.e. one with Pr(X=c)=1 and Pr(Xc)=0 is G(z)=zc.
  • The probability generating function of a binomial random variable, the number of successes in n trials, with probability p of success in each trial, is G(z)=[(1p)+pz]n. Note: it is the n-fold product of the probability generating function of a Bernoulli random variable with parameter p. Template:Pb So the probability generating function of a fair coin, is G(z)=12+z2.
  • The probability generating function of a negative binomial random variable on {0,1,2}, the number of failures until the rth success with probability of success in each trial p, is G(z)=(p1(1p)z)r, which converges for |z|<11p. Template:Pb Note that this is the r-fold product of the probability generating function of a geometric random variable with parameter 1p on {0,1,2,}.
  • The probability generating function of a Poisson random variable with rate parameter λ is G(z)=eλ(z1).

Related concepts

The probability generating function is an example of a generating function of a sequence: see also formal power series. It is equivalent to, and sometimes called, the z-transform of the probability mass function.

Other generating functions of random variables include the moment-generating function, the characteristic function and the cumulant generating function. The probability generating function is also equivalent to the factorial moment generating function, which as E[zX] can also be considered for continuous and other random variables.

Script error: No such module "Unsubst".

Notes

  1. Script error: No such module "citation/CS1".

References

<templatestyles src="Refbegin/styles.css" />

  • Script error: No such module "citation/CS1".

Template:Theory of probability distributions