<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>http://debianws.lexgopc.com/wiki143/index.php?action=history&amp;feed=atom&amp;title=Regular_conditional_probability</id>
	<title>Regular conditional probability - Revision history</title>
	<link rel="self" type="application/atom+xml" href="http://debianws.lexgopc.com/wiki143/index.php?action=history&amp;feed=atom&amp;title=Regular_conditional_probability"/>
	<link rel="alternate" type="text/html" href="http://debianws.lexgopc.com/wiki143/index.php?title=Regular_conditional_probability&amp;action=history"/>
	<updated>2026-05-06T17:34:13Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.43.1</generator>
	<entry>
		<id>http://debianws.lexgopc.com/wiki143/index.php?title=Regular_conditional_probability&amp;diff=7205458&amp;oldid=prev</id>
		<title>84.169.251.219: /* Conditional probability distribution */</title>
		<link rel="alternate" type="text/html" href="http://debianws.lexgopc.com/wiki143/index.php?title=Regular_conditional_probability&amp;diff=7205458&amp;oldid=prev"/>
		<updated>2024-11-03T21:14:10Z</updated>

		<summary type="html">&lt;p&gt;&lt;span class=&quot;autocomment&quot;&gt;Conditional probability distribution&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;In [[probability theory]], &amp;#039;&amp;#039;&amp;#039;regular conditional probability&amp;#039;&amp;#039;&amp;#039; is a concept that formalizes the notion of conditioning on the outcome of a [[random variable]]. The resulting &amp;#039;&amp;#039;&amp;#039;conditional probability distribution&amp;#039;&amp;#039;&amp;#039; is a parametrized family of probability measures called a [[Markov kernel]].&lt;br /&gt;
&lt;br /&gt;
== Definition ==&lt;br /&gt;
&lt;br /&gt;
=== Conditional probability distribution ===&lt;br /&gt;
&lt;br /&gt;
Consider two random variables &amp;lt;math&amp;gt;X, Y : \Omega \to \mathbb{R}&amp;lt;/math&amp;gt;. The &amp;#039;&amp;#039;conditional probability distribution&amp;#039;&amp;#039; of &amp;#039;&amp;#039;Y&amp;#039;&amp;#039; given &amp;#039;&amp;#039;X&amp;#039;&amp;#039; is a two variable function &amp;lt;math&amp;gt;\kappa_{Y\mid X}: \mathbb{R} \times \mathcal{B}(\mathbb{R}) \to [0,1]&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If the random variable &amp;#039;&amp;#039;X&amp;#039;&amp;#039; is discrete&lt;br /&gt;
:&amp;lt;math&amp;gt;\kappa_{Y\mid X}(x, A) = P(Y \in A \mid X = x) = \begin{cases}&lt;br /&gt;
\frac{P(Y \in A, X = x)}{P(X=x)} &amp;amp; \text{ if } P(X = x) &amp;gt; 0 \\[3pt]&lt;br /&gt;
\text{arbitrary value} &amp;amp; \text{ otherwise}.&lt;br /&gt;
\end{cases}&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If the random variables &amp;#039;&amp;#039;X&amp;#039;&amp;#039;, &amp;#039;&amp;#039;Y&amp;#039;&amp;#039; are continuous with density &amp;lt;math&amp;gt;f_{X,Y}(x,y)&amp;lt;/math&amp;gt;.&lt;br /&gt;
:&amp;lt;math&amp;gt;\kappa_{Y\mid X}(x, A) = \begin{cases}&lt;br /&gt;
\frac{\int_A f_{X,Y}(x, y) \, \mathrm{d}y}{\int_\mathbb{R} f_{X,Y}(x, y) \mathrm{d}y} &amp;amp;&lt;br /&gt;
\text{ if } \int_\mathbb{R} f_{X,Y}(x, y) \, \mathrm{d}y &amp;gt; 0 \\[3pt]&lt;br /&gt;
\text{arbitrary value} &amp;amp; \text{ otherwise}.&lt;br /&gt;
\end{cases}&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
A more general definition can be given in terms of [[conditional expectation]]. Consider a function &amp;lt;math&amp;gt; e_{Y \in A} : \mathbb{R} \to [0,1]&amp;lt;/math&amp;gt; satisfying&lt;br /&gt;
:&amp;lt;math&amp;gt;e_{Y \in A}(X(\omega)) = \operatorname E[1_{Y \in A} \mid X](\omega)&amp;lt;/math&amp;gt;&lt;br /&gt;
for almost all &amp;lt;math&amp;gt;\omega&amp;lt;/math&amp;gt;.&lt;br /&gt;
Then the conditional probability distribution is given by&lt;br /&gt;
:&amp;lt;math&amp;gt;\kappa_{Y\mid X}(x, A) = e_{Y \in A}(x).&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
As with conditional expectation, this can be further generalized to conditioning on a sigma algebra &amp;lt;math&amp;gt;\mathcal{F}&amp;lt;/math&amp;gt;. In that case the conditional distribution is a function &amp;lt;math&amp;gt;\Omega \times \mathcal{B}(\mathbb{R}) \to [0, 1]&amp;lt;/math&amp;gt;:&lt;br /&gt;
:&amp;lt;math&amp;gt; \kappa_{Y\mid\mathcal{F}}(\omega, A) = \operatorname E[1_{Y \in A} \mid \mathcal{F}](\omega)&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Regularity ===&lt;br /&gt;
&lt;br /&gt;
For working with &amp;lt;math&amp;gt;\kappa_{Y\mid X}&amp;lt;/math&amp;gt;, it is important that it be &amp;#039;&amp;#039;regular&amp;#039;&amp;#039;, that is:&lt;br /&gt;
# For almost all &amp;#039;&amp;#039;x&amp;#039;&amp;#039;, &amp;lt;math&amp;gt;A \mapsto \kappa_{Y\mid X}(x, A)&amp;lt;/math&amp;gt; is a probability measure&lt;br /&gt;
# For all &amp;#039;&amp;#039;A&amp;#039;&amp;#039;,  &amp;lt;math&amp;gt;x \mapsto \kappa_{Y\mid X}(x, A)&amp;lt;/math&amp;gt; is a measurable function&lt;br /&gt;
In other words &amp;lt;math&amp;gt;\kappa_{Y\mid X}&amp;lt;/math&amp;gt; is a [[Markov kernel]].&lt;br /&gt;
&lt;br /&gt;
The second condition holds trivially, but the proof of the first is more involved. It can be shown that if &amp;#039;&amp;#039;Y&amp;#039;&amp;#039; is a random element &amp;lt;math&amp;gt;\Omega \to S&amp;lt;/math&amp;gt; in a [[Radon space]] &amp;#039;&amp;#039;S&amp;#039;&amp;#039;, there exists a &amp;lt;math&amp;gt;\kappa_{Y\mid X}&amp;lt;/math&amp;gt; that satisfies the first condition.&amp;lt;ref&amp;gt;{{cite book |last1=Klenke |first1=Achim |title=Probability theory : a comprehensive course |date=30 August 2013 |location=London |isbn=978-1-4471-5361-0 |edition=Second}}&amp;lt;/ref&amp;gt; It is possible to construct more general spaces where a regular conditional probability distribution does not exist.&amp;lt;ref&amp;gt;Faden, A.M., 1985. The existence of regular conditional probabilities: necessary and sufficient conditions. &amp;#039;&amp;#039;The Annals of Probability&amp;#039;&amp;#039;, 13(1), pp.&amp;amp;nbsp;288–298.&amp;lt;/ref&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Relation to conditional expectation ===&lt;br /&gt;
For discrete and continuous random variables, the [[conditional expectation]] can be expressed as&lt;br /&gt;
:&amp;lt;math&amp;gt;&lt;br /&gt;
\begin{aligned}&lt;br /&gt;
\operatorname E[Y\mid X=x] &amp;amp;= \sum_y y \, P(Y=y\mid X=x) \\&lt;br /&gt;
\operatorname E[Y\mid X=x] &amp;amp;= \int y \, f_{Y\mid X}(x, y) \, \mathrm{d}y&lt;br /&gt;
\end{aligned}&lt;br /&gt;
&amp;lt;/math&amp;gt;&lt;br /&gt;
where &amp;lt;math&amp;gt;f_{Y\mid X}(x, y)&amp;lt;/math&amp;gt; is the [[conditional density]] of {{mvar|Y}} given {{mvar|X}}.&lt;br /&gt;
&lt;br /&gt;
This result can be extended to measure theoretical conditional expectation using the regular conditional probability distribution:&lt;br /&gt;
:&amp;lt;math&amp;gt;\operatorname E[Y\mid X](\omega) = \int y \, \kappa_{Y\mid\sigma(X)}(\omega, \mathrm{d}y).&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Formal definition==&lt;br /&gt;
&lt;br /&gt;
Let &amp;lt;math&amp;gt;(\Omega, \mathcal F, P)&amp;lt;/math&amp;gt; be a [[probability space]], and let &amp;lt;math&amp;gt;T:\Omega\rightarrow E&amp;lt;/math&amp;gt; be a [[random variable]], defined as a [[Borel measure|Borel-]][[measurable function]] from &amp;lt;math&amp;gt;\Omega&amp;lt;/math&amp;gt; to its [[Probability space#Random variables|state space]] &amp;lt;math&amp;gt;(E, \mathcal E)&amp;lt;/math&amp;gt;.&lt;br /&gt;
One should think of &amp;lt;math&amp;gt;T&amp;lt;/math&amp;gt; as a way to &amp;quot;disintegrate&amp;quot; the sample space &amp;lt;math&amp;gt;\Omega&amp;lt;/math&amp;gt; into &amp;lt;math&amp;gt;\{ T^{-1}(x) \}_{x \in E}&amp;lt;/math&amp;gt;.&lt;br /&gt;
Using the [[disintegration theorem]] from the measure theory, it allows us to &amp;quot;disintegrate&amp;quot; the measure &amp;lt;math&amp;gt;P&amp;lt;/math&amp;gt; into a collection of measures,&lt;br /&gt;
one for each &amp;lt;math&amp;gt;x \in E&amp;lt;/math&amp;gt;. Formally, a &amp;#039;&amp;#039;&amp;#039;regular conditional probability&amp;#039;&amp;#039;&amp;#039; is defined as a function &amp;lt;math&amp;gt;\nu:E \times\mathcal F \rightarrow [0,1],&amp;lt;/math&amp;gt; called a &amp;quot;transition probability&amp;quot;, where:&lt;br /&gt;
* For every &amp;lt;math&amp;gt;x \in E&amp;lt;/math&amp;gt;, &amp;lt;math&amp;gt;\nu(x, \cdot)&amp;lt;/math&amp;gt; is a probability measure on &amp;lt;math&amp;gt;\mathcal F&amp;lt;/math&amp;gt;. Thus we provide one measure for each &amp;lt;math&amp;gt;x \in E&amp;lt;/math&amp;gt;.&lt;br /&gt;
* For all &amp;lt;math&amp;gt;A\in\mathcal F&amp;lt;/math&amp;gt;, &amp;lt;math&amp;gt;\nu(\cdot, A)&amp;lt;/math&amp;gt; (a mapping &amp;lt;math&amp;gt;E \to [0,1]&amp;lt;/math&amp;gt;) is &amp;lt;math&amp;gt;\mathcal E&amp;lt;/math&amp;gt;-measurable, and&lt;br /&gt;
* For all &amp;lt;math&amp;gt;A\in\mathcal F&amp;lt;/math&amp;gt; and all &amp;lt;math&amp;gt;B\in\mathcal E&amp;lt;/math&amp;gt;&amp;lt;ref&amp;gt;D. Leao Jr. et al. &amp;#039;&amp;#039;Regular conditional probability, disintegration of probability and Radon spaces.&amp;#039;&amp;#039; Proyecciones. Vol. 23, No. 1, pp. 15–29, May 2004, Universidad Católica del Norte, Antofagasta, Chile [http://www.scielo.cl/pdf/proy/v23n1/art02.pdf PDF]&amp;lt;/ref&amp;gt;&lt;br /&gt;
:: &amp;lt;math&amp;gt;P\big(A\cap T^{-1}(B)\big) = \int_B \nu(x,A) \,(P\circ T^{-1})(\mathrm{d}x)&amp;lt;/math&amp;gt;&lt;br /&gt;
where &amp;lt;math&amp;gt;P\circ T^{-1}&amp;lt;/math&amp;gt; is the [[pushforward measure]] &amp;lt;math&amp;gt;T_*P&amp;lt;/math&amp;gt; of the distribution of the random element &amp;lt;math&amp;gt;T&amp;lt;/math&amp;gt;,&lt;br /&gt;
&amp;lt;math&amp;gt;x\in\operatorname{supp}T,&amp;lt;/math&amp;gt; i.e. the [[Support (measure theory)|support]] of the &amp;lt;math&amp;gt;T_*  P&amp;lt;/math&amp;gt;.&lt;br /&gt;
Specifically, if we take &amp;lt;math&amp;gt;B=E&amp;lt;/math&amp;gt;, then &amp;lt;math&amp;gt;A \cap T^{-1}(E) = A&amp;lt;/math&amp;gt;, and so&lt;br /&gt;
:&amp;lt;math&amp;gt;P(A) = \int_E \nu(x,A) \, (P\circ T^{-1})(\mathrm{d}x),&amp;lt;/math&amp;gt;&lt;br /&gt;
where &amp;lt;math&amp;gt;\nu(x, A)&amp;lt;/math&amp;gt; can be denoted, using more familiar terms &amp;lt;math&amp;gt;P(A\ |\ T=x)&amp;lt;/math&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Alternate definition==&lt;br /&gt;
{{disputeabout|&amp;#039;&amp;#039;&amp;#039;this way leads to irregular conditional probability&amp;#039;&amp;#039;&amp;#039;|Non-regular conditional probability|date=September 2009}}&lt;br /&gt;
Consider a [[Radon space]] &amp;lt;math&amp;gt; \Omega &amp;lt;/math&amp;gt; (that is a probability measure defined on a Radon space endowed with the Borel sigma-algebra) and a real-valued random variable &amp;#039;&amp;#039;T&amp;#039;&amp;#039;. As discussed above, in this case there exists a regular conditional probability with respect to &amp;#039;&amp;#039;T&amp;#039;&amp;#039;. Moreover, we can alternatively define the &amp;#039;&amp;#039;&amp;#039;regular conditional probability&amp;#039;&amp;#039;&amp;#039; for an event &amp;#039;&amp;#039;A&amp;#039;&amp;#039; given a particular value &amp;#039;&amp;#039;t&amp;#039;&amp;#039; of the random variable &amp;#039;&amp;#039;T&amp;#039;&amp;#039; in the following manner:&lt;br /&gt;
&lt;br /&gt;
:&amp;lt;math&amp;gt; P (A\mid T=t) = \lim_{U\supset \{T= t\}} \frac {P(A\cap U)}{P(U)},&amp;lt;/math&amp;gt;&lt;br /&gt;
&lt;br /&gt;
where the [[Limit (mathematics)|limit]] is taken over the [[Net (mathematics)|net]] of [[Open set|open]] [[Neighbourhood (mathematics)|neighborhoods]] &amp;#039;&amp;#039;U&amp;#039;&amp;#039; of &amp;#039;&amp;#039;t&amp;#039;&amp;#039; as they become [[Subset|smaller with respect to set inclusion]].  This limit is defined if and only if the probability space is [[Radon space|Radon]], and only in the support of &amp;#039;&amp;#039;T&amp;#039;&amp;#039;, as described in the article.   This is the restriction of the transition probability to the support of&amp;amp;nbsp;&amp;#039;&amp;#039;T&amp;#039;&amp;#039;.  To describe this limiting process rigorously:&lt;br /&gt;
&lt;br /&gt;
For every &amp;lt;math&amp;gt;\varepsilon &amp;gt; 0,&amp;lt;/math&amp;gt; there exists an open neighborhood &amp;#039;&amp;#039;U&amp;#039;&amp;#039; of the event {&amp;#039;&amp;#039;T&amp;#039;&amp;#039;&amp;amp;nbsp;=&amp;amp;nbsp;&amp;#039;&amp;#039;t&amp;#039;&amp;#039;}, such that for every open &amp;#039;&amp;#039;V&amp;#039;&amp;#039; with &amp;lt;math&amp;gt;\{T=t\} \subset V \subset U,&amp;lt;/math&amp;gt;&lt;br /&gt;
:&amp;lt;math&amp;gt;\left|\frac {P(A\cap V)}{P(V)}-L\right| &amp;lt; \varepsilon,&amp;lt;/math&amp;gt;&lt;br /&gt;
where &amp;lt;math&amp;gt;L = P (A\mid T=t)&amp;lt;/math&amp;gt; is the limit.&lt;br /&gt;
&lt;br /&gt;
==See also==&lt;br /&gt;
* [[Conditioning (probability)]]&lt;br /&gt;
* [[Disintegration theorem]]&lt;br /&gt;
* [[Adherent point]]&lt;br /&gt;
* [[Limit point]]&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&amp;lt;references/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==External links==&lt;br /&gt;
* [http://planetmath.org/regularconditionalprobability Regular Conditional Probability] on [http://planetmath.org/ PlanetMath]&lt;br /&gt;
&lt;br /&gt;
[[Category:Conditional probability]]&lt;br /&gt;
[[Category:Measure theory]]&lt;/div&gt;</summary>
		<author><name>84.169.251.219</name></author>
	</entry>
</feed>