In
statistical theory
The theory of statistics provides a basis for the whole range of techniques, in both study design and data analysis, that are used within applications of statistics.
The theory covers approaches to statistical-decision problems and to statistical ...
, a pseudolikelihood is an
approximation
An approximation is anything that is intentionally similar but not exactly equality (mathematics), equal to something else.
Etymology and usage
The word ''approximation'' is derived from Latin ''approximatus'', from ''proximus'' meaning ''very ...
to the
joint probability distribution
Given two random variables that are defined on the same probability space, the joint probability distribution is the corresponding probability distribution on all possible pairs of outputs. The joint distribution can just as well be considered ...
of a collection of
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...
s. The practical use of this is that it can provide an approximation to the
likelihood function
The likelihood function (often simply called the likelihood) represents the probability of random variable realizations conditional on particular values of the statistical parameters. Thus, when evaluated on a given sample, the likelihood funct ...
of a set of observed data which may either provide a computationally simpler problem for
estimation
Estimation (or estimating) is the process of finding an estimate or approximation, which is a value that is usable for some purpose even if input data may be incomplete, uncertain, or unstable. The value is nonetheless usable because it is der ...
, or may provide a way of obtaining explicit estimates of model parameters.
The pseudolikelihood approach was introduced by
Julian Besag
Julian Ernst Besag FRS (26 March 1945 – 6 August 2010) was a British statistician known chiefly for his work in spatial statistics (including its applications to epidemiology, image analysis and agricultural science), and Bayesian inferenc ...
in the context of analysing data having
spatial dependence
Spatial analysis or spatial statistics includes any of the formal techniques which studies entities using their topological, geometric, or geographic properties. Spatial analysis includes a variety of techniques, many still in their early devel ...
.
Definition
Given a set of random variables
the pseudolikelihood of
is
:
in discrete case and
:
in continuous one.
Here
is a vector of variables,
is a vector of values,
is conditional density and
is the vector of parameters we are to estimate. The expression
above means that each variable
in the vector
has a corresponding value
in the vector
and
means that the coordinate
has been omitted. The expression
is the probability that the vector of variables
has values equal to the vector
. This probability of course depends on the unknown parameter
. Because situations can often be described using state variables ranging over a set of possible values, the expression
can therefore represent the probability of a certain state among all possible states allowed by the state variables.
The pseudo-log-likelihood is a similar measure derived from the above expression, namely (in discrete case)
:
One use of the pseudolikelihood measure is as an approximation for inference about a
Markov Markov (Bulgarian, russian: Марков), Markova, and Markoff are common surnames used in Russia and Bulgaria. Notable people with the name include:
Academics
*Ivana Markova (born 1938), Czechoslovak-British emeritus professor of psychology at t ...
or
Bayesian network
A Bayesian network (also known as a Bayes network, Bayes net, belief network, or decision network) is a probabilistic graphical model that represents a set of variables and their conditional dependencies via a directed acyclic graph (DAG). Bay ...
, as the pseudolikelihood of an assignment to
may often be computed more efficiently than the likelihood, particularly when the latter may require marginalization over a large number of variables.
Properties
Use of the pseudolikelihood in place of the true likelihood function in a
maximum likelihood
In statistics, maximum likelihood estimation (MLE) is a method of estimation theory, estimating the Statistical parameter, parameters of an assumed probability distribution, given some observed data. This is achieved by Mathematical optimization, ...
analysis can lead to good estimates, but a straightforward application of the usual likelihood techniques to derive information about estimation uncertainty, or for
significance testing, would in general be incorrect.
[Dodge, Y. (2003) ''The Oxford Dictionary of Statistical Terms'', Oxford University Press. ]
References
{{reflist
Statistical inference