statistical theory The theory of statistics provides a basis for the whole range of techniques, in both study design and data analysis, that are used within applications of statistics. The theory covers approaches to statistical-decision problems and to statistical ...

, a pseudolikelihood is an

approximation An approximation is anything that is intentionally similar but not exactly equality (mathematics), equal to something else. Etymology and usage The word ''approximation'' is derived from Latin ''approximatus'', from ''proximus'' meaning ''very ...

to the

joint probability distribution Given two random variables that are defined on the same probability space, the joint probability distribution is the corresponding probability distribution on all possible pairs of outputs. The joint distribution can just as well be considered ...

of a collection of

random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...

s. The practical use of this is that it can provide an approximation to the

likelihood function The likelihood function (often simply called the likelihood) represents the probability of random variable realizations conditional on particular values of the statistical parameters. Thus, when evaluated on a given sample, the likelihood funct ...

of a set of observed data which may either provide a computationally simpler problem for

estimation Estimation (or estimating) is the process of finding an estimate or approximation, which is a value that is usable for some purpose even if input data may be incomplete, uncertain, or unstable. The value is nonetheless usable because it is der ...

, or may provide a way of obtaining explicit estimates of model parameters. The pseudolikelihood approach was introduced by

Julian Besag Julian Ernst Besag FRS (26 March 1945 – 6 August 2010) was a British statistician known chiefly for his work in spatial statistics (including its applications to epidemiology, image analysis and agricultural science), and Bayesian inferenc ...

in the context of analysing data having

spatial dependence Spatial analysis or spatial statistics includes any of the formal techniques which studies entities using their topological, geometric, or geographic properties. Spatial analysis includes a variety of techniques, many still in their early devel ...

Definition

Given a set of random variables

X = X_1, X_2, \ldots, X_n

the pseudolikelihood of

X = x = (x_1,x_2, \ldots, x_n)

is :

L(\theta) := \prod_i \mathrm_\theta(X_i = x_i\mid X_j = x_j \text j \neq i)=\prod_i \mathrm _\theta (X_i = x_i \mid X_=x_)

in discrete case and :

L(\theta) := \prod_i p_\theta(x_i \mid x_j \text j \neq i)=\prod_i p _\theta (x_i \mid x_)=\prod _i p_\theta (x_i \mid x_1,\ldots, \hat x_i, \ldots, x_n)

in continuous one. Here

X

is a vector of variables,

x

is a vector of values,

p_\theta(\cdot \mid \cdot)

is conditional density and

\theta =(\theta_1, \ldots, \theta_p)

is the vector of parameters we are to estimate. The expression

X = x

above means that each variable

X_i

in the vector

X

has a corresponding value

x_i

in the vector

x

and

x_=(x_1, \ldots,\hat x_i, \ldots, x_n)

means that the coordinate

x_i

has been omitted. The expression

\mathrm _\theta(X = x)

is the probability that the vector of variables

X

has values equal to the vector

x

. This probability of course depends on the unknown parameter

\theta

. Because situations can often be described using state variables ranging over a set of possible values, the expression

\mathrm _\theta(X = x)

can therefore represent the probability of a certain state among all possible states allowed by the state variables. The pseudo-log-likelihood is a similar measure derived from the above expression, namely (in discrete case) :

l(\theta):=\log L(\theta) = \sum_i \log \mathrm_\theta(X_i = x_i\mid X_j = x_j \text j \neq i).

One use of the pseudolikelihood measure is as an approximation for inference about a

Markov Markov (Bulgarian, russian: Марков), Markova, and Markoff are common surnames used in Russia and Bulgaria. Notable people with the name include: Academics *Ivana Markova (born 1938), Czechoslovak-British emeritus professor of psychology at t ...

Bayesian network A Bayesian network (also known as a Bayes network, Bayes net, belief network, or decision network) is a probabilistic graphical model that represents a set of variables and their conditional dependencies via a directed acyclic graph (DAG). Bay ...

, as the pseudolikelihood of an assignment to

X_i

may often be computed more efficiently than the likelihood, particularly when the latter may require marginalization over a large number of variables.

Properties

Use of the pseudolikelihood in place of the true likelihood function in a

maximum likelihood In statistics, maximum likelihood estimation (MLE) is a method of estimation theory, estimating the Statistical parameter, parameters of an assumed probability distribution, given some observed data. This is achieved by Mathematical optimization, ...

analysis can lead to good estimates, but a straightforward application of the usual likelihood techniques to derive information about estimation uncertainty, or for significance testing, would in general be incorrect.Dodge, Y. (2003) ''The Oxford Dictionary of Statistical Terms'', Oxford University Press.

References

{{reflist Statistical inference