In
statistics, the Vuong closeness test is a
likelihood-ratio-based test for
model selection
Model selection is the task of selecting a statistical model from a set of candidate models, given data. In the simplest cases, a pre-existing set of data is considered. However, the task can also involve the design of experiments such that the ...
using the
Kullback–Leibler information criterion. This statistic makes probabilistic statements about two models. They can be
nested, strictly non-nested or partially non-nested (also called overlapping). The statistic tests the null hypothesis that the two models are equally close to the true data generating process, against the alternative that one model is closer. It cannot make any decision whether the "closer" model is the true model.
Technical description
With strictly non-nested models and
iid exogenous variables, model 1 (2) is preferred with significance level ''α'', if the
z statistic
In statistics, the standard score is the number of standard deviations by which the value of a raw score (i.e., an observed value or data point) is above or below the mean value of what is being observed or measured. Raw scores above the mean ...
:
with
:
exceeds the positive (falls below the negative) (1 − α)-quantile of the
standard normal distribution
In statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is
:
f(x) = \frac e^
The parameter \mu i ...
. Here ''K''
1 and ''K''
2 are the numbers of parameters in models 1 and 2 respectively.
The numerator is the difference between the maximum likelihoods of the two models, corrected for the number of coefficients analogous to the
BIC, the term in the denominator of the expression for ''Z'',
, is defined by setting
equal to either the mean of the squares of the pointwise log-likelihood ratios
, or to the sample variance of these values, where
:
For nested or partially non-nested (overlapping) models the statistic
:
has to be compared to critical values from a weighted sum of
chi squared distribution
In probability theory and statistics, the chi-squared distribution (also chi-square or \chi^2-distribution) with k degrees of freedom is the distribution of a sum of the squares of k independent standard normal random variables. The chi-square ...
s. This can be approximated by a
gamma distribution
In probability theory and statistics, the gamma distribution is a two- parameter family of continuous probability distributions. The exponential distribution, Erlang distribution, and chi-square distribution are special cases of the gamma dis ...
:
:
with
:
:
and
:
is a vector of
eigenvalue
In linear algebra, an eigenvector () or characteristic vector of a linear transformation is a nonzero vector that changes at most by a scalar factor when that linear transformation is applied to it. The corresponding eigenvalue, often denot ...
s of a
matrix
Matrix most commonly refers to:
* ''The Matrix'' (franchise), an American media franchise
** '' The Matrix'', a 1999 science-fiction action film
** "The Matrix", a fictional setting, a virtual reality environment, within ''The Matrix'' (franchi ...
of conditional
expectations. The computation is quite difficult, so that in the overlapping and nested case many authors only derive statements from a subjective evaluation of the Z statistic (is it subjectively "big enough" to accept my hypothesis?).
Improper use for zero-inflated models
Vuong's test for non-nested models has been used in model selection to compare a zero-inflated count model to its non-zero-inflated counterpart (e.g., zero-inflated Poisson model versus ordinary Poisson model). Wilson (2015) argues that such use of Vuong's test is invalid as a non-zero-inflated model is neither ''strictly non-nested'' nor ''partially non-nested'' in its zero-inflated counterpart. The core of the misunderstanding appears to be the terminology, which offers itself to being incorrectly understood to imply that all pairs of non-nested models are either ''strictly non-nested'' or ''partially non-nested'' (aka overlapping). Crucially, the definitions of ''strictly non-nested'' and ''partially non-nested'' in Vuong (1989) do ''not'' unite to mean "all pairs of models that are not ''nested''". In other words, there are non-nested models that are neither ''strictly non-nested'' nor ''partially non-nested''. The zero-inflated Poisson model and its non-zero-inflated counterpart are an example of such a pair of non-nested models. Consequently, Vuong's test is not a valid test for discriminating between them.
Example of strictly and partially non-nested models
Vuong (1989) gives two examples of strictly non-nested models:
* A pair of standard linear regression models with different distributional assumptions on the distribution of error terms (e.g., normally distributed and logistically distributed).
* A pair of standard linear regression models with the same distributional assumptions on the distribution of error terms but different functional forms such as
and
, where
and
is a non-degenerate real random vector.
Vuong (1989) also gives an intuitive example of partially non-nested (aka overlapping) models:
* A pair of standard linear regression models with some common explanatory variables and neither model nested in the other.
References
*
*
*{{cite journal
, last1 = Wilson, first1 = Paul
, year = 2015
, title = The Misuse of The Vuong Test For Non-Nested Models to Test for Zero-Inflation
, journal = Economics Letters, volume = 127
, issue = 2
, pages = 151–153
, doi = 10.1016/j.econlet.2014.12.029
, hdl = 2436/621118
, hdl-access= free
Statistical tests