HOME

TheInfoList



OR:

In mathematics, the disintegration theorem is a result in measure theory and
probability theory Probability theory is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expressing it through a set ...
. It rigorously defines the idea of a non-trivial "restriction" of a measure to a measure zero subset of the measure space in question. It is related to the existence of conditional probability measures. In a sense, "disintegration" is the opposite process to the construction of a product measure.


Motivation

Consider the unit square in the
Euclidean plane In mathematics, the Euclidean plane is a Euclidean space of dimension two. That is, a geometric setting in which two real quantities are required to determine the position of each point ( element of the plane), which includes affine notions of ...
R2, . Consider the
probability measure In mathematics, a probability measure is a real-valued function defined on a set of events in a probability space that satisfies measure properties such as ''countable additivity''. The difference between a probability measure and the more ge ...
μ defined on ''S'' by the restriction of two-dimensional Lebesgue measure λ2 to ''S''. That is, the probability of an event ''E'' ⊆ ''S'' is simply the area of ''E''. We assume ''E'' is a measurable subset of ''S''. Consider a one-dimensional subset of ''S'' such as the line segment ''L''''x'' = ×
, 1 The comma is a punctuation mark that appears in several variants in different languages. It has the same shape as an apostrophe or single closing quotation mark () in many typefaces, but it differs from them in being placed on the baseline o ...
''L''''x'' has μ-measure zero; every subset of ''L''''x'' is a μ- null set; since the Lebesgue measure space is a
complete measure space In mathematics, a complete measure (or, more precisely, a complete measure space) is a measure space in which every subset of every null set is measurable (having measure zero). More formally, a measure space (''X'', Σ, ''μ'') is comp ...
, E \subseteq L_ \implies \mu (E) = 0. While true, this is somewhat unsatisfying. It would be nice to say that μ "restricted to" ''L''''x'' is the one-dimensional Lebesgue measure λ1, rather than the zero measure. The probability of a "two-dimensional" event ''E'' could then be obtained as an integral of the one-dimensional probabilities of the vertical "slices" ''E'' ∩ ''L''''x'': more formally, if μ''x'' denotes one-dimensional Lebesgue measure on ''L''''x'', then \mu (E) = \int_ \mu_ (E \cap L_) \, \mathrm x for any "nice" ''E'' ⊆ ''S''. The disintegration theorem makes this argument rigorous in the context of measures on
metric space In mathematics, a metric space is a set together with a notion of '' distance'' between its elements, usually called points. The distance is measured by a function called a metric or distance function. Metric spaces are the most general set ...
s.


Statement of the theorem

(Hereafter, ''P''(''X'') will denote the collection of Borel probability measures on a
topological space In mathematics, a topological space is, roughly speaking, a geometrical space in which closeness is defined but cannot necessarily be measured by a numeric distance. More specifically, a topological space is a set whose elements are called po ...
(''X'', ''T'').) The assumptions of the theorem are as follows: * Let ''Y'' and ''X'' be two Radon spaces (i.e. a
topological space In mathematics, a topological space is, roughly speaking, a geometrical space in which closeness is defined but cannot necessarily be measured by a numeric distance. More specifically, a topological space is a set whose elements are called po ...
such that every Borel
probability measure In mathematics, a probability measure is a real-valued function defined on a set of events in a probability space that satisfies measure properties such as ''countable additivity''. The difference between a probability measure and the more ge ...
on ''M'' is inner regular e.g. separable metric spaces on which every probability measure is a
Radon measure In mathematics (specifically in measure theory), a Radon measure, named after Johann Radon, is a measure on the σ-algebra of Borel sets of a Hausdorff topological space ''X'' that is finite on all compact sets, outer regular on all Borel s ...
). * Let μ ∈ ''P''(''Y''). * Let π : ''Y'' → ''X'' be a Borel-
measurable function In mathematics and in particular measure theory, a measurable function is a function between the underlying sets of two measurable spaces that preserves the structure of the spaces: the preimage of any measurable set is measurable. This is in d ...
. Here one should think of π as a function to "disintegrate" ''Y'', in the sense of partitioning ''Y'' into \. For example, for the motivating example above, one can define \pi((a,b)) = a, (a,b) \in ,1times ,1/math>, which gives that \pi^(a) = a \times ,1/math>, a slice we want to capture. * Let \nu ∈ ''P''(''X'') be the pushforward measure This measure provides the distribution of x (which corresponds to the events \pi^(x)). The conclusion of the theorem: There exists a \nu-
almost everywhere In measure theory (a branch of mathematical analysis), a property holds almost everywhere if, in a technical sense, the set for which the property holds takes up nearly all possibilities. The notion of "almost everywhere" is a companion notion to ...
uniquely determined family of probability measures ''x''∈''X'' ⊆ ''P''(''Y''), which provides a "disintegration" of \mu into such that: * the function x \mapsto \mu_ is Borel measurable, in the sense that x \mapsto \mu_ (B) is a Borel-measurable function for each Borel-measurable set ''B'' ⊆ ''Y''; * μ''x'' "lives on" the fiber π−1(''x''): for \nu-
almost all In mathematics, the term "almost all" means "all but a negligible amount". More precisely, if X is a set, "almost all elements of X" means "all elements of X but those in a negligible subset of X". The meaning of "negligible" depends on the math ...
''x'' ∈ ''X'', \mu_ \left( Y \setminus \pi^ (x) \right) = 0, and so μ''x''(''E'') = μ''x''(''E'' ∩ π−1(''x'')); * for every Borel-measurable function ''f'' : ''Y'' →
, ∞ The comma is a punctuation mark that appears in several variants in different languages. It has the same shape as an apostrophe or single closing quotation mark () in many typefaces, but it differs from them in being placed on the baseline o ...
\int_ f(y) \, \mathrm \mu (y) = \int_ \int_ f(y) \, \mathrm \mu_ (y) \mathrm \nu (x). In particular, for any event ''E'' ⊆ ''Y'', taking ''f'' to be the indicator function of ''E'', \mu (E) = \int_ \mu_ \left( E \right) \, \mathrm \nu (x).


Applications


Product spaces

The original example was a special case of the problem of product spaces, to which the disintegration theorem applies. When ''Y'' is written as a
Cartesian product In mathematics, specifically set theory, the Cartesian product of two sets ''A'' and ''B'', denoted ''A''×''B'', is the set of all ordered pairs where ''a'' is in ''A'' and ''b'' is in ''B''. In terms of set-builder notation, that is : A\tim ...
''Y'' = ''X''1 × ''X''2 and π''i'' : ''Y'' → ''X''''i'' is the natural projection, then each fibre ''π''1−1(''x''1) can be canonically identified with ''X''2 and there exists a Borel family of probability measures \_ in ''P''(''X''2) (which is (π1)(μ)-almost everywhere uniquely determined) such that \mu = \int_ \mu_ \, \mu \left(\pi_1^(\mathrm d x_1) \right)= \int_ \mu_ \, \mathrm (\pi_)_ (\mu) (x_), which is in particular \int_ f(x_1,x_2)\, \mu(\mathrm d x_1,\mathrm d x_2) = \int_\left( \int_ f(x_1,x_2) \mu(\mathrm d x_2, x_1) \right) \mu\left( \pi_1^(\mathrm x_)\right) and \mu(A \times B) = \int_A \mu\left(B, x_1\right) \, \mu\left( \pi_1^(\mathrm x_)\right). The relation to
conditional expectation In probability theory, the conditional expectation, conditional expected value, or conditional mean of a random variable is its expected value – the value it would take “on average” over an arbitrarily large number of occurrences – given ...
is given by the identities \operatorname E(f, \pi_1)(x_1)= \int_ f(x_1,x_2) \mu(\mathrm d x_2, x_1), \mu(A\times B, \pi_1)(x_1)= 1_A(x_1) \cdot \mu(B, x_1).


Vector calculus

The disintegration theorem can also be seen as justifying the use of a "restricted" measure in vector calculus. For instance, in Stokes' theorem as applied to a vector field flowing through a
compact Compact as used in politics may refer broadly to a pact or treaty; in more specific cases it may refer to: * Interstate compact * Blood compact, an ancient ritual of the Philippines * Compact government, a type of colonial rule utilized in Britis ...
surface , it is implicit that the "correct" measure on Σ is the disintegration of three-dimensional Lebesgue measure λ3 on Σ, and that the disintegration of this measure on ∂Σ is the same as the disintegration of λ3 on ∂Σ.


Conditional distributions

The disintegration theorem can be applied to give a rigorous treatment of conditional probability distributions in statistics, while avoiding purely abstract formulations of conditional probability.


See also

* * * * *
Regular conditional probability In probability theory, regular conditional probability is a concept that formalizes the notion of conditioning on the outcome of a random variable. The resulting conditional probability distribution is a parametrized family of probability measures c ...


References

{{Measure theory Theorems in measure theory Probability theorems