HOME

TheInfoList



OR:

In
statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, the deficiency is a measure to compare a
statistical model A statistical model is a mathematical model that embodies a set of statistical assumptions concerning the generation of Sample (statistics), sample data (and similar data from a larger Statistical population, population). A statistical model repre ...
with another statistical model. The concept was introduced in the 1960s by the French mathematician
Lucien Le Cam Lucien Marie Le Cam (November 18, 1924 – April 25, 2000) was a mathematician and statistician. Biography Le Cam was born November 18, 1924, in Croze, France. His parents were farmers, and unable to afford higher education for him; his father d ...
, who used it to prove an approximative version of the Blackwell–Sherman–Stein theorem. Closely related is the Le Cam distance, a pseudometric for the maximum deficiency between two statistical models. If the deficiency of a model \mathcal in relation to \mathcal is zero, then one says \mathcal is ''better'' or ''more informative'' or ''stronger'' than \mathcal.


Introduction

Le Cam defined the statistical model more abstract than a probability space with a family of probability measures. He also didn't use the term "statistical model" and instead used the term "experiment". In his publication from 1964 he introduced the statistical experiment to a parameter set \Theta as a triple (X,E,(P_\theta)_) consisting of a set X, a
vector lattice In mathematics, a Riesz space, lattice-ordered vector space or vector lattice is a partially ordered vector space where the order structure is a lattice. Riesz spaces are named after Frigyes Riesz who first defined them in his 1928 paper ''Su ...
E with unit I and a family of normalized
positive functional In mathematics, more specifically in functional analysis, a positive linear functional on an ordered vector space (V, \leq) is a linear functional f on V so that for all positive elements v \in V, that is v \geq 0, it holds that f(v) \geq 0. In oth ...
s (P_\theta)_ on E. In his book from 1986 he omitted E and X. This article follows his definition from 1986 and uses his terminology to emphasize the generalization.


Formulation


Basic concepts

Let \Theta be a parameter space. Given an abstract L1-space (L,\, \cdot\, ) (i.e. a
Banach lattice In the mathematical disciplines of in functional analysis and order theory, a Banach lattice is a complete normed vector space with a lattice order, \leq, such that for all , the implication \Rightarrow holds, where the absolute value is defin ...
such that for elements x,y\geq 0 also \, x+y\, =\, x\, +\, y\, holds) consisting of lineare positive functionals \. An ''experiment'' \mathcal is a map \mathcal:\Theta \to L of the form \theta \mapsto P_, such that \, P_\, =1. L is the band induced by \ and therefore we use the notation L(\mathcal). For a \mu\in L(\mathcal) denote the \mu^=\mu \vee 0=\max(\mu,0). The topological dual M of an L-space with the conjugated norm \, u\, _M=\sup\ is called an ''abstract M-space''. It's also a lattice with unit defined through I \mu=\, \mu^+\, _L-\, \mu^-\, _L for \mu\in L. Let L(A) and L(B) be two L-space of two experiments A and B, then one calls a positive, norm-preserving linear map, i.e. \, T\mu^\, =\, \mu^\, for all \mu\in L(A), a transition. The adjoint of a transitions is a positive linear map from the dual space M_B of L(B) into the dual space M_A of L(A), such that the unit of M_A is the image of the unit of M_B ist.


Deficiency

Let \Theta be a parameter space and \mathcal:\theta \to P_\theta and \mathcal:\theta \to Q_\theta be two experiments indexed by \Theta. Le L(\mathcal) and L(\mathcal) denote the corresponding L-spaces and let \mathcal be the set of all transitions from L(\mathcal) to L(\mathcal). The deficiency \delta(\mathcal,\mathcal) of \mathcal in relation to \mathcal is the number defined in terms of inf sup: :\delta(\mathcal,\mathcal):=\inf\limits_\sup\limits_ \tfrac\, Q_-TP_\, _, where \, \cdot\, _ denoted the
total variation norm In mathematics, the total variation identifies several slightly different concepts, related to the (local or global) structure of the codomain of a function or a measure. For a real-valued continuous function ''f'', defined on an interval 'a ...
\, \mu\, _=\mu^+\mu^. The factor \tfrac is just for computational purposes and is sometimes omitted.


Explanations

* \delta(\mathcal,\mathcal)=0 means that there exists a transition T such that TP_=Q_ for all \theta \in \Theta. * The deficiency measures how well Q_ of P_ can be approximated by T in the sense of total variation. * The deficiency is a norm for Q_-TP_.


Le Cam distance

The Le Cam distance is the following pseudometric :\Delta(\mathcal,\mathcal):= \operatorname\left(\delta(\mathcal,\mathcal),\delta(\mathcal,\mathcal)\right). This induces an
equivalence relation In mathematics, an equivalence relation is a binary relation that is reflexive, symmetric, and transitive. The equipollence relation between line segments in geometry is a common example of an equivalence relation. A simpler example is equ ...
and when \Delta(\mathcal,\mathcal)=0, then one says \mathcal and \mathcal are ''equivalent''. The equivalent class C_ of \mathcal is also called the ''type of \mathcal''. Often one is interested in families of experiments (\mathcal_n)_ with \ and (\mathcal_n)_ with \. If \Delta(\mathcal_n,\mathcal_n)=0 as n\to \infty, then one says (\mathcal_n) and (\mathcal_n) are ''asymptotically equivalent''. Let \Theta be a parameter space and E(\Theta) be the set of all types that are induced by \Theta, then the Le Cam distance \Delta is complete with respect to E(\Theta). The condition \delta(\mathcal,\mathcal)=0 induces a
partial order In mathematics, especially order theory, a partial order on a set is an arrangement such that, for certain pairs of elements, one precedes the other. The word ''partial'' is used to indicate that not every pair of elements needs to be comparable ...
on E(\Theta), one says \mathcal is ''better'' or ''more informative'' or ''stronger'' than \mathcal.


References


Bibliography

* * * {{cite book , first=Erik , last=Torgersen , title=Comparison of Statistical Experiments , publisher=Cambridge University Press, United Kingdom , date=1991 , doi=10.1017/CBO9780511666353 Statistical theory