In
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, the deficiency is a measure to compare a
statistical model
A statistical model is a mathematical model that embodies a set of statistical assumptions concerning the generation of Sample (statistics), sample data (and similar data from a larger Statistical population, population). A statistical model repre ...
with another statistical model. The concept was introduced in the 1960s by the French mathematician
Lucien Le Cam
Lucien Marie Le Cam (November 18, 1924 – April 25, 2000) was a mathematician and statistician.
Biography
Le Cam was born November 18, 1924, in Croze, France. His parents were farmers, and unable to afford higher education for him; his father d ...
, who used it to prove an approximative version of the
Blackwell–Sherman–Stein theorem. Closely related is the Le Cam distance, a
pseudometric for the maximum deficiency between two statistical models. If the deficiency of a model
in relation to
is zero, then one says
is ''better'' or ''more informative'' or ''stronger'' than
.
Introduction
Le Cam defined the statistical model more abstract than a probability space with a family of probability measures. He also didn't use the term "statistical model" and instead used the term "experiment". In his publication from 1964 he introduced the statistical experiment to a parameter set
as a triple
consisting of a set
, a
vector lattice
In mathematics, a Riesz space, lattice-ordered vector space or vector lattice is a partially ordered vector space where the order structure is a lattice.
Riesz spaces are named after Frigyes Riesz who first defined them in his 1928 paper ''Su ...
with unit
and a family of normalized
positive functional In mathematics, more specifically in functional analysis, a positive linear functional on an ordered vector space (V, \leq) is a linear functional f on V so that for all positive elements v \in V, that is v \geq 0, it holds that
f(v) \geq 0.
In oth ...
s
on
.
In his book from 1986 he omitted
and
.
This article follows his definition from 1986 and uses his terminology to emphasize the generalization.
Formulation
Basic concepts
Let
be a parameter space. Given an
abstract L1-space (i.e. a
Banach lattice
In the mathematical disciplines of in functional analysis and order theory, a Banach lattice is a complete normed vector space with a lattice order, \leq, such that for all , the implication \Rightarrow holds, where the absolute value is defin ...
such that for elements
also
holds) consisting of lineare positive functionals
. An ''experiment''
is a map
of the form
, such that
.
is the band induced by
and therefore we use the notation
. For a
denote the
. The
topological dual of an L-space with the conjugated norm
is called an ''abstract M-space''. It's also a lattice with unit defined through
for
.
Let
and
be two L-space of two experiments
and
, then one calls a positive, norm-preserving linear map, i.e.
for all
, a transition. The adjoint of a transitions is a positive linear map from the dual space
of
into the dual space
of
, such that the unit of
is the image of the unit of
ist.
Deficiency
Let
be a parameter space and
and
be two experiments indexed by
. Le
and
denote the corresponding L-spaces and let
be the set of all transitions from
to
.
The deficiency
of
in relation to
is the number defined in terms of
inf sup:
:
where
denoted the
total variation norm
In mathematics, the total variation identifies several slightly different concepts, related to the (local or global) structure of the codomain of a function or a measure. For a real-valued continuous function ''f'', defined on an interval 'a ...
. The factor
is just for computational purposes and is sometimes omitted.
Explanations
*
means that there exists a transition
such that
for all
.
* The deficiency measures how well
of
can be approximated by
in the sense of total variation.
* The deficiency is a norm for
.
Le Cam distance
The Le Cam distance is the following pseudometric
:
This induces an
equivalence relation
In mathematics, an equivalence relation is a binary relation that is reflexive, symmetric, and transitive. The equipollence relation between line segments in geometry is a common example of an equivalence relation. A simpler example is equ ...
and when
, then one says
and
are ''equivalent''. The equivalent class
of
is also called the ''type of
''.
Often one is interested in families of experiments
with
and
with
. If
as
, then one says
and
are ''asymptotically equivalent''.
Let
be a parameter space and
be the set of all types that are induced by
, then the Le Cam distance
is complete with respect to
. The condition
induces a
partial order
In mathematics, especially order theory, a partial order on a set is an arrangement such that, for certain pairs of elements, one precedes the other. The word ''partial'' is used to indicate that not every pair of elements needs to be comparable ...
on
, one says
is ''better'' or ''more informative'' or ''stronger'' than
.
References
Bibliography
*
*
* {{cite book
, first=Erik
, last=Torgersen
, title=Comparison of Statistical Experiments
, publisher=Cambridge University Press, United Kingdom
, date=1991
, doi=10.1017/CBO9780511666353
Statistical theory