HOME

TheInfoList



OR:

In
mathematics Mathematics is an area of knowledge that includes the topics of numbers, formulas and related structures, shapes and the spaces in which they are contained, and quantities and their changes. These topics are represented in modern mathematics ...
, the Radon–Nikodym theorem is a result in
measure theory In mathematics, the concept of a measure is a generalization and formalization of geometrical measures ( length, area, volume) and other common notions, such as mass and probability of events. These seemingly distinct concepts have many simil ...
that expresses the relationship between two measures defined on the same
measurable space In mathematics, a measurable space or Borel space is a basic object in measure theory. It consists of a set and a σ-algebra, which defines the subsets that will be measured. Definition Consider a set X and a σ-algebra \mathcal A on X. Then the ...
. A ''measure'' is a
set function In mathematics, especially measure theory, a set function is a function whose domain is a family of subsets of some given set and that (usually) takes its values in the extended real number line \R \cup \, which consists of the real numbers \R a ...
that assigns a consistent magnitude to the measurable subsets of a measurable space. Examples of a measure include area and volume, where the subsets are sets of points; or the probability of an event, which is a subset of possible outcomes within a wider
probability space In probability theory, a probability space or a probability triple (\Omega, \mathcal, P) is a mathematical construct that provides a formal model of a random process or "experiment". For example, one can define a probability space which models t ...
. One way to derive a new measure from one already given is to assign a density to each point of the space, then integrate over the measurable subset of interest. This can be expressed as :\nu(A) = \int_A f \, d\mu, where is the new measure being defined for any measurable subset and the function is the density at a given point. The integral is with respect to an existing measure , which may often be the canonical Lebesgue measure on the
real line In elementary mathematics, a number line is a picture of a graduated straight line (geometry), line that serves as visual representation of the real numbers. Every point of a number line is assumed to correspond to a real number, and every real ...
or the ''n''-dimensional
Euclidean space Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, that is, in Euclid's Elements, Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics ther ...
(corresponding to our standard notions of length, area and volume). For example, if represented mass density and was the Lebesgue measure in three-dimensional space , then would equal the total mass in a spatial region . The Radon–Nikodym theorem essentially states that, under certain conditions, any measure can be expressed in this way with respect to another measure on the same space. The function is then called the Radon–Nikodym derivative and is denoted by \tfrac. An important application is in
probability theory Probability theory is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expressing it through a set o ...
, leading to the
probability density function In probability theory, a probability density function (PDF), or density of a continuous random variable, is a function whose value at any given sample (or point) in the sample space (the set of possible values taken by the random variable) can ...
of a
random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...
. The theorem is named after
Johann Radon Johann Karl August Radon (; 16 December 1887 – 25 May 1956) was an Austrian mathematician. His doctoral dissertation was on the calculus of variations (in 1910, at the University of Vienna). Life RadonBrigitte Bukovics: ''Biography of Johan ...
, who proved the theorem for the special case where the underlying space is in 1913, and for
Otto Nikodym Otto Marcin Nikodym (3 August 1887 – 4 May 1974) (also Otton Martin Nikodým) was a Polish mathematician. Education and career Nikodym studied mathematics at the University of Jan Kazimierz (UJK) in Lvov (today's University of Lviv). Imm ...
who proved the general case in 1930. In 1936
Hans Freudenthal Hans Freudenthal (17 September 1905 – 13 October 1990) was a Jewish-German-born Dutch mathematician. He made substantial contributions to algebraic topology and also took an interest in literature, philosophy, history and mathematics education ...
generalized the Radon–Nikodym theorem by proving the
Freudenthal spectral theorem In mathematics, the Freudenthal spectral theorem is a result in Riesz space theory proved by Hans Freudenthal in 1936. It roughly states that any element dominated by a positive element in a Riesz space with the principal projection property can ...
, a result in
Riesz space In mathematics, a Riesz space, lattice-ordered vector space or vector lattice is a partially ordered vector space where the order structure is a lattice. Riesz spaces are named after Frigyes Riesz who first defined them in his 1928 paper ''Su ...
theory; this contains the Radon–Nikodym theorem as a special case. A
Banach space In mathematics, more specifically in functional analysis, a Banach space (pronounced ) is a complete normed vector space. Thus, a Banach space is a vector space with a metric that allows the computation of vector length and distance between vector ...
is said to have the
Radon–Nikodym property In mathematics, the Bochner integral, named for Salomon Bochner, extends the definition of Lebesgue integral to functions that take values in a Banach space, as the limit of integrals of simple functions. Definition Let (X, \Sigma, \mu) be a me ...
if the generalization of the Radon–Nikodym theorem also holds, ''
mutatis mutandis ''Mutatis mutandis'' is a Medieval Latin phrase meaning "with things changed that should be changed" or "once the necessary changes have been made". It remains unnaturalized in English and is therefore usually italicized in writing. It is used i ...
'', for functions with values in . All
Hilbert space In mathematics, Hilbert spaces (named after David Hilbert) allow generalizing the methods of linear algebra and calculus from (finite-dimensional) Euclidean vector spaces to spaces that may be infinite-dimensional. Hilbert spaces arise natural ...
s have the Radon–Nikodym property.


Formal description


Radon–Nikodym theorem

The Radon–Nikodym theorem involves a
measurable space In mathematics, a measurable space or Borel space is a basic object in measure theory. It consists of a set and a σ-algebra, which defines the subsets that will be measured. Definition Consider a set X and a σ-algebra \mathcal A on X. Then the ...
(X, \Sigma) on which two
σ-finite measure In mathematics, a positive (or signed) measure ''μ'' defined on a ''σ''-algebra Σ of subsets of a set ''X'' is called a finite measure if ''μ''(''X'') is a finite real number (rather than ∞), and a set ''A'' in Σ is of finite measur ...
s are defined, \mu and \nu. It states that, if \nu \ll \mu (that is, if \nu is
absolutely continuous In calculus, absolute continuity is a smoothness property of functions that is stronger than continuity and uniform continuity. The notion of absolute continuity allows one to obtain generalizations of the relationship between the two central ope ...
with respect to \mu), then there exists a \Sigma-
measurable function In mathematics and in particular measure theory, a measurable function is a function between the underlying sets of two measurable spaces that preserves the structure of the spaces: the preimage of any measurable set is measurable. This is in di ...
f : X \to [0, \infty), such that for any measurable set A \subseteq X, \nu(A) = \int_A f \, d\mu.


Radon–Nikodym derivative

The function f satisfying the above equality is , that is, if g is another function which satisfies the same property, then f = g . The function f is commonly written \frac and is called the . The choice of notation and the name of the function reflects the fact that the function is analogous to a derivative in calculus in the sense that it describes the rate of change of density of one measure with respect to another (the way the Jacobian determinant is used in multivariable integration).


Extension to signed or complex measures

A similar theorem can be proven for signed and
complex measure In mathematics, specifically measure theory, a complex measure generalizes the concept of measure by letting it have complex values. In other words, one allows for sets whose size (length, area, volume) is a complex number. Definition Formal ...
s: namely, that if \mu is a nonnegative σ-finite measure, and \nu is a finite-valued signed or complex measure such that \nu \ll \mu, that is, \nu is
absolutely continuous In calculus, absolute continuity is a smoothness property of functions that is stronger than continuity and uniform continuity. The notion of absolute continuity allows one to obtain generalizations of the relationship between the two central ope ...
with respect to \mu, then there is a \mu-integrable real- or complex-valued function g on X such that for every measurable set A, \nu(A) = \int_A g \, d\mu.


Examples

In the following examples, the set is the real interval ,1 and \Sigma is the Borel sigma-algebra on . # \mu is the length measure on . \nu assigns to each subset of , twice the length of . Then, \frac = 2. # \mu is the length measure on . \nu assigns to each subset of , the number of points from the set that are contained in . Then, \nu is not absolutely-continuous with respect to \mu since it assigns non-zero measure to zero-length points. Indeed, there is no derivative \frac: there is no finite function that, when integrated e.g. from (0.1 - \varepsilon) to (0.1 + \varepsilon), gives 1 for all \varepsilon > 0. # \mu = \nu + \delta_0, where \nu is the length measure on X and \delta_0 is the
Dirac measure In mathematics, a Dirac measure assigns a size to a set based solely on whether it contains a fixed element ''x'' or not. It is one way of formalizing the idea of the Dirac delta function, an important tool in physics and other technical fields. ...
on 0 (it assigns a measure of 1 to any set containing 0 and a measure of 0 to any other set). Then, \nu is absolutely continuous with respect to \mu, and \frac = 1_ – the derivative is 0 at x = 0 and 1 at x > 0.


Properties

* Let ''ν'', ''μ'', and ''λ'' be σ-finite measures on the same measurable space. If ''ν'' ≪ ''λ'' and ''μ'' ≪ ''λ'' (''ν'' and ''μ'' are both
absolutely continuous In calculus, absolute continuity is a smoothness property of functions that is stronger than continuity and uniform continuity. The notion of absolute continuity allows one to obtain generalizations of the relationship between the two central ope ...
with respect to ''λ''), then \frac = \frac+\frac \quad \lambda\text. * If ''ν'' ≪ ''μ'' ≪ ''λ'', then \frac=\frac\frac\quad\lambda\text. * In particular, if ''μ'' ≪ ''ν'' and ''ν'' ≪ ''μ'', then \frac=\left(\frac\right)^\quad\nu\text. * If ''μ'' ≪ ''λ'' and is a ''μ''-integrable function, then \int_X g\,d\mu = \int_X g\frac\,d\lambda. * If ''ν'' is a finite signed or complex measure, then = \left, \.


Applications


Probability theory

The theorem is very important in extending the ideas of
probability theory Probability theory is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expressing it through a set o ...
from probability masses and probability densities defined over real numbers to probability measures defined over arbitrary sets. It tells if and how it is possible to change from one probability measure to another. Specifically, the
probability density function In probability theory, a probability density function (PDF), or density of a continuous random variable, is a function whose value at any given sample (or point) in the sample space (the set of possible values taken by the random variable) can ...
of a
random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...
is the Radon–Nikodym derivative of the induced measure with respect to some base measure (usually the Lebesgue measure for
continuous random variable In probability theory and statistics, a probability distribution is the mathematical function that gives the probabilities of occurrence of different possible outcomes for an experiment. It is a mathematical description of a random phenomenon ...
s). For example, it can be used to prove the existence of
conditional expectation In probability theory, the conditional expectation, conditional expected value, or conditional mean of a random variable is its expected value – the value it would take “on average” over an arbitrarily large number of occurrences – give ...
for probability measures. The latter itself is a key concept in
probability theory Probability theory is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expressing it through a set o ...
, as
conditional probability In probability theory, conditional probability is a measure of the probability of an event occurring, given that another event (by assumption, presumption, assertion or evidence) has already occurred. This particular method relies on event B occur ...
is just a special case of it.


Financial mathematics

Amongst other fields,
financial mathematics Mathematical finance, also known as quantitative finance and financial mathematics, is a field of applied mathematics, concerned with mathematical modeling of financial markets. In general, there exist two separate branches of finance that require ...
uses the theorem extensively, in particular via the
Girsanov theorem In probability theory, the Girsanov theorem tells how stochastic processes change under changes in measure. The theorem is especially important in the theory of financial mathematics as it tells how to convert from the physical measure which desc ...
. Such changes of probability measure are the cornerstone of the
rational pricing Rational pricing is the assumption in financial economics that asset prices - and hence asset pricing models - will reflect the arbitrage-free price of the asset as any deviation from this price will be "arbitraged away". This assumption is use ...
of
derivatives The derivative of a function is the rate of change of the function's output relative to its input value. Derivative may also refer to: In mathematics and economics * Brzozowski derivative in the theory of formal languages * Formal derivative, an ...
and are used for converting actual probabilities into those of the risk neutral probabilities.


Information divergences

If ''μ'' and ''ν'' are measures over , and ''μ'' ≪ ''ν'' * The
Kullback–Leibler divergence In mathematical statistics, the Kullback–Leibler divergence (also called relative entropy and I-divergence), denoted D_\text(P \parallel Q), is a type of statistical distance: a measure of how one probability distribution ''P'' is different fr ...
from ''ν'' to ''μ'' is defined to be D_\text(\mu \parallel \nu) = \int_X \log \left( \frac \right) \; d\mu. * For ''α'' > 0, ''α'' ≠ 1 the Rényi divergence of order ''α'' from ''ν'' to ''μ'' is defined to be D_\alpha(\mu \parallel \nu) = \frac \log\left(\int_X\left(\frac\right)^\; d\mu\right).


The assumption of σ-finiteness

The Radon–Nikodym theorem above makes the assumption that the measure ''μ'' with respect to which one computes the rate of change of ''ν'' is σ-finite.


Negative example

Here is an example when ''μ'' is not σ-finite and the Radon–Nikodym theorem fails to hold. Consider the
Borel σ-algebra In mathematics, a Borel set is any set in a topological space that can be formed from open sets (or, equivalently, from closed sets) through the operations of countable union, countable intersection, and relative complement. Borel sets are nam ...
on the
real line In elementary mathematics, a number line is a picture of a graduated straight line (geometry), line that serves as visual representation of the real numbers. Every point of a number line is assumed to correspond to a real number, and every real ...
. Let the
counting measure In mathematics, specifically measure theory, the counting measure is an intuitive way to put a measure on any set – the "size" of a subset is taken to be the number of elements in the subset if the subset has finitely many elements, and infinity ...
, , of a Borel set be defined as the number of elements of if is finite, and otherwise. One can check that is indeed a measure. It is not -finite, as not every Borel set is at most a countable union of finite sets. Let be the usual Lebesgue measure on this Borel algebra. Then, is absolutely continuous with respect to , since for a set one has only if is the
empty set In mathematics, the empty set is the unique set having no elements; its size or cardinality (count of elements in a set) is zero. Some axiomatic set theories ensure that the empty set exists by including an axiom of empty set, while in other ...
, and then is also zero. Assume that the Radon–Nikodym theorem holds, that is, for some measurable function one has :\nu(A) = \int_A f \,d\mu for all Borel sets. Taking to be a
singleton set In mathematics, a singleton, also known as a unit set or one-point set, is a set with exactly one element. For example, the set \ is a singleton whose single element is 0. Properties Within the framework of Zermelo–Fraenkel set theory, the ...
, , and using the above equality, one finds : 0 = f(a) for all real numbers . This implies that the function , and therefore the Lebesgue measure , is zero, which is a contradiction.


Positive result

Assuming \nu\ll\mu, the Radon–Nikodym theorem also holds if \mu is localizable and \nu is ''accessible with respect to'' \mu, i.e., \nu(A)=\sup\ for all A\in\Sigma.


Proof

This section gives a measure-theoretic proof of the theorem. There is also a functional-analytic proof, using Hilbert space methods, that was first given by
von Neumann Von Neumann may refer to: * John von Neumann (1903–1957), a Hungarian American mathematician * Von Neumann family * Von Neumann (surname), a German surname * Von Neumann (crater), a lunar impact crater See also * Von Neumann algebra * Von Ne ...
. For finite measures and , the idea is to consider functions with . The supremum of all such functions, along with the
monotone convergence theorem In the mathematical field of real analysis, the monotone convergence theorem is any of a number of related theorems proving the convergence of monotonic sequences (sequences that are non-decreasing or non-increasing) that are also bounded. Inform ...
, then furnishes the Radon–Nikodym derivative. The fact that the remaining part of is singular with respect to follows from a technical fact about finite measures. Once the result is established for finite measures, extending to -finite, signed, and complex measures can be done naturally. The details are given below.


For finite measures

Constructing an extended-valued candidate First, suppose and are both finite-valued nonnegative measures. Let be the set of those extended-value measurable functions such that: :\forall A \in \Sigma:\qquad \int_A f\,d\mu \leq \nu(A) , since it contains at least the zero function. Now let , and suppose is an arbitrary measurable set, and define: :\begin A_1 &= \left\, \\ A_2 &= \left\. \end Then one has :\int_A\max\left\\,d\mu = \int_ f_1\,d\mu + \int_ f_2\,d\mu \leq \nu\left(A_1\right) + \nu\left(A_2\right) = \nu(A), and therefore, . Now, let be a sequence of functions in such that :\lim_\int_X f_n\,d\mu = \sup_ \int_X f\,d\mu. By replacing with the maximum of the first functions, one can assume that the sequence is increasing. Let be an extended-valued function defined as :g(x) := \lim_f_n(x). By Lebesgue's
monotone convergence theorem In the mathematical field of real analysis, the monotone convergence theorem is any of a number of related theorems proving the convergence of monotonic sequences (sequences that are non-decreasing or non-increasing) that are also bounded. Inform ...
, one has :\lim_ \int_A f_n\,d\mu = \int_A \lim_ f_n(x)\,d\mu(x) = \int_A g\,d\mu \leq \nu(A) for each , and hence, . Also, by the construction of , :\int_X g\,d\mu = \sup_\int_X f\,d\mu. Proving equality Now, since , :\nu_0(A) := \nu(A) - \int_A g\,d\mu defines a nonnegative measure on . To prove equality, we show that . Suppose ; then, since is finite, there is an such that . To derive a contradiction from , we look for a positive set for the signed measure (i.e. a measurable set , all of whose measurable subsets have non-negative measure), where also has positive -measure. Conceptually, we're looking for a set , where in every part of . A convenient approach is to use the Hahn decomposition for the signed measure . Note then that for every one has , and hence, :\begin \nu(A) &= \int_A g\,d\mu + \nu_0(A) \\ &\geq \int_A g\,d\mu + \nu_0(A\cap P)\\ &\geq \int_A g\,d\mu + \varepsilon\mu(A\cap P) = \int_A\left(g + \varepsilon 1_P\right)\,d\mu, \end where is the
indicator function In mathematics, an indicator function or a characteristic function of a subset of a set is a function that maps elements of the subset to one, and all other elements to zero. That is, if is a subset of some set , one has \mathbf_(x)=1 if x\i ...
of . Also, note that as desired; for if , then (since is absolutely continuous in relation to ) , so and :\nu_0(X) - \varepsilon\mu(X) = \left(\nu_0 - \varepsilon\mu\right)(N) \leq 0, contradicting the fact that . Then, since also :\int_X\left(g + \varepsilon1_P\right)\,d\mu \leq \nu(X) < +\infty, and satisfies :\int_X\left(g + \varepsilon 1_P\right)\,d\mu > \int_X g\,d\mu = \sup_\int_X f\,d\mu. This is
impossible Impossible, Imposible or Impossibles may refer to: Music * ''ImPossible'' (album), a 2016 album by Divinity Roxx * ''The Impossible'' (album) Groups * The Impossibles (American band), a 1990s indie-ska group from Austin, Texas * The Impossibl ...
because it violates the definition of a
supremum In mathematics, the infimum (abbreviated inf; plural infima) of a subset S of a partially ordered set P is a greatest element in P that is less than or equal to each element of S, if such an element exists. Consequently, the term ''greatest l ...
; therefore, the initial assumption that must be false. Hence, , as desired. Restricting to finite values Now, since is -integrable, the set is -
null Null may refer to: Science, technology, and mathematics Computing * Null (SQL) (or NULL), a special marker and keyword in SQL indicating that something has no value * Null character, the zero-valued ASCII character, also designated by , often use ...
. Therefore, if a is defined as :f(x) = \begin g(x) & \textg(x) < \infty \\ 0 & \text \end then has the desired properties. Uniqueness As for the uniqueness, let be measurable functions satisfying :\nu(A) = \int_A f\,d\mu = \int_A g\,d\mu for every measurable set . Then, is -integrable, and :\int_A(g - f)\,d\mu = 0. In particular, for or . It follows that :\int_X(g - f)^+\,d\mu = 0 = \int_X(g - f)^-\,d\mu, and so, that -almost everywhere; the same is true for , and thus, -almost everywhere, as desired.


For -finite positive measures

If and are -finite, then can be written as the union of a sequence of
disjoint sets In mathematics, two sets are said to be disjoint sets if they have no element in common. Equivalently, two disjoint sets are sets whose intersection is the empty set.. For example, and are ''disjoint sets,'' while and are not disjoint. A ...
in , each of which has finite measure under both and . For each , by the finite case, there is a -measurable function such that :\nu_n(A) = \int_A f_n\,d\mu for each -measurable subset of . The sum \left(\sum_n f_n 1_\right) := f of those functions is then the required function such that \nu(A) = \int_A f \, d\mu. As for the uniqueness, since each of the is -almost everywhere unique, so is .


For signed and complex measures

If is a -finite signed measure, then it can be Hahn–Jordan decomposed as where one of the measures is finite. Applying the previous result to those two measures, one obtains two functions, , satisfying the Radon–Nikodym theorem for and respectively, at least one of which is -integrable (i.e., its integral with respect to is finite). It is clear then that satisfies the required properties, including uniqueness, since both and are unique up to -almost everywhere equality. If is a
complex measure In mathematics, specifically measure theory, a complex measure generalizes the concept of measure by letting it have complex values. In other words, one allows for sets whose size (length, area, volume) is a complex number. Definition Formal ...
, it can be decomposed as , where both and are finite-valued signed measures. Applying the above argument, one obtains two functions, , satisfying the required properties for and , respectively. Clearly, is the required function.


The Lebesgue decomposition theorem

Lebesgue's decomposition theorem In mathematics, more precisely in measure theory, Lebesgue's decomposition theorem states that for every two σ-finite signed measures \mu and \nu on a measurable space (\Omega,\Sigma), there exist two σ-finite signed measures \nu_0 and \nu_1 s ...
shows that the assumptions of the Radon–Nikodym theorem can be found even in a situation which is seemingly more general. Consider a σ-finite positive measure \mu on the measure space (X,\Sigma) and a σ-finite signed measure \nu on \Sigma, without assuming any absolute continuity. Then there exist unique signed measures \nu_a and \nu_s on \Sigma such that \nu=\nu_a+\nu_s, \nu_a\ll\mu, and \nu_s\perp\mu. The Radon–Nikodym theorem can then be applied to the pair \nu_a,\mu.


See also

*
Girsanov theorem In probability theory, the Girsanov theorem tells how stochastic processes change under changes in measure. The theorem is especially important in the theory of financial mathematics as it tells how to convert from the physical measure which desc ...
*
Radon–Nikodym set In the theory of fair cake-cutting, the Radon–Nikodym set (RNS) is a geometric object that represents a cake, based on how different people evaluate the different parts of the cake. Example Suppose we have a cake made of four parts. There are ...


Notes


References

* Contains a proof for vector measures assuming values in a Banach space. * Contains a lucid proof in case the measure ''ν'' is not σ-finite. * * Contains a proof of the generalisation. * {{DEFAULTSORT:Radon-Nikodym theorem Theorems in measure theory Articles containing proofs Generalizations of the derivative Integral representations