Local Asymptotic Normality
   HOME

TheInfoList



OR:

In
statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, local asymptotic normality is a property of a sequence of
statistical model A statistical model is a mathematical model that embodies a set of statistical assumptions concerning the generation of Sample (statistics), sample data (and similar data from a larger Statistical population, population). A statistical model repre ...
s, which allows this sequence to be asymptotically approximated by a normal location model, after an appropriate rescaling of the parameter. An important example when the local asymptotic normality holds is in the case of i.i.d sampling from a regular parametric model. The notion of local asymptotic normality was introduced by and is fundamental in the treatment of estimator and test efficiency.


Definition

A sequence of parametric statistical models is said to be locally asymptotically normal (LAN) at ''θ'' if there exist
matrices Matrix (: matrices or matrixes) or MATRIX may refer to: Science and mathematics * Matrix (mathematics), a rectangular array of numbers, symbols or expressions * Matrix (logic), part of a formula in prenex normal form * Matrix (biology), the ...
''rn'' and ''Iθ'' and a random
vector Vector most often refers to: * Euclidean vector, a quantity with a magnitude and a direction * Disease vector, an agent that carries and transmits an infectious pathogen into another living organism Vector may also refer to: Mathematics a ...
such that, for every converging sequence , : \ln \frac = h'\Delta_ - \frac12 h'I_\theta\,h + o_(1), where the derivative here is a Radon–Nikodym derivative, which is a formalised version of the
likelihood ratio A likelihood function (often simply called the likelihood) measures how well a statistical model explains observed data by calculating the probability of seeing that data under different parameter values of the model. It is constructed from the j ...
, and where ''o'' is a type of big O in probability notation. In other words, the local likelihood ratio must converge in distribution to a normal random variable whose mean is equal to minus one half the variance: : \ln \frac\ \ \xrightarrow\ \ \mathcal\Big( h'I_\theta\,h,\ h'I_\theta\,h\Big). The sequences of distributions P_ and P_ are
contiguous Contiguity or contiguous may refer to: *Contiguous data storage, in computer science *Contiguity (probability theory) *Contiguity (psychology) *Contiguous distribution of species, in biogeography *Geographic contiguity Geographic contiguity is t ...
.


Example

The most straightforward example of a LAN model is an iid model whose likelihood is twice continuously differentiable. Suppose is an iid sample, where each ''Xi'' has density function . The likelihood function of the model is equal to : p_(x_1,\ldots,x_n;\,\theta) = \prod_^n f(x_i,\theta). If ''f'' is twice continuously differentiable in ''θ'', then : \begin \ln p_ &\approx \ln p_ + \delta\theta'\frac + \frac12 \delta\theta' \frac \delta\theta \\ &= \ln p_ + \delta\theta' \sum_^n\frac + \frac12 \delta\theta' \bigg sum_^n\frac \biggdelta\theta . \end Plugging in \delta\theta=h/\sqrt, gives : \ln \frac = h' \Bigg(\frac \sum_^n\frac\Bigg) \;-\; \frac12 h' \Bigg( \frac1n \sum_^n - \frac \Bigg) h \;+\; o_p(1). By the
central limit theorem In probability theory, the central limit theorem (CLT) states that, under appropriate conditions, the Probability distribution, distribution of a normalized version of the sample mean converges to a Normal distribution#Standard normal distributi ...
, the first term (in parentheses) converges in distribution to a normal random variable , whereas by the
law of large numbers In probability theory, the law of large numbers is a mathematical law that states that the average of the results obtained from a large number of independent random samples converges to the true value, if it exists. More formally, the law o ...
the expression in second parentheses converges in probability to ''Iθ'', which is the
Fisher information matrix In mathematical statistics, the Fisher information is a way of measuring the amount of information that an observable random variable ''X'' carries about an unknown parameter ''θ'' of a distribution that models ''X''. Formally, it is the variance ...
: : I_\theta = \mathrm\bigg bigg= \mathrm\bigg bigg(\frac\bigg)\bigg(\frac\bigg)'\,\bigg Thus, the definition of the local asymptotic normality is satisfied, and we have confirmed that the parametric model with iid observations and twice continuously differentiable likelihood has the LAN property.


See also

*
Asymptotic distribution In mathematics and statistics, an asymptotic distribution is a probability distribution that is in a sense the limiting distribution of a sequence of distributions. One of the main uses of the idea of an asymptotic distribution is in providing appr ...


Notes


References

* * * {{DEFAULTSORT:Local Asymptotic Normality Asymptotic theory (statistics)