The Durbin–Wu–Hausman test (also called Hausman specification test) is a

statistical hypothesis test A statistical hypothesis test is a method of statistical inference used to decide whether the data at hand sufficiently support a particular hypothesis. Hypothesis testing allows us to make probabilistic statements about population parameters. ...

in econometrics named after James Durbin, De-Min Wu, and

Jerry A. Hausman Jerry Allen Hausman (born May 5, 1946) is the John and Jennie S. MacDonald Professor of Economics at the Massachusetts Institute of Technology and a notable econometrician. He has published numerous influential papers in microeconometrics. Haus ...

. The test evaluates the consistency of an estimator when compared to an alternative, less efficient estimator which is already known to be consistent. It helps one evaluate if a statistical model corresponds to the data.

Details

Consider the linear model ''y'' = ''Xb'' + ''e'', where ''y'' is the dependent variable and ''X'' is vector of

regressor Dependent and independent variables are variables in mathematical modeling, statistical modeling and experimental sciences. Dependent variables receive this name because, in an experiment, their values are studied under the supposition or demand ...

s, ''b'' is a vector of coefficients and ''e'' is the error term. We have two estimators for ''b'': ''b''₀ and ''b''₁. Under the null hypothesis, both of these estimators are consistent, but ''b''₁ is efficient (has the smallest asymptotic variance), at least in the class of estimators containing ''b''₀. Under the

alternative hypothesis In statistical hypothesis testing, the alternative hypothesis is one of the proposed proposition in the hypothesis test. In general the goal of hypothesis test is to demonstrate that in the given condition, there is sufficient evidence supporting ...

, ''b''₀ is consistent, whereas ''b''₁ isn't. Then the Wu–Hausman

statistic A statistic (singular) or sample statistic is any quantity computed from values in a sample which is considered for a statistical purpose. Statistical purposes include estimating a population parameter, describing a sample, or evaluating a hypo ...

is: :

H=(b_-b_)'\big(\operatorname(b_)-\operatorname(b_)\big)^\dagger(b_-b_),

where ^† denotes the Moore–Penrose pseudoinverse. Under the null hypothesis, this statistic has asymptotically the chi-squared distribution with the number of degrees of freedom equal to the rank of matrix . If we reject the null hypothesis, it means that b₁ is inconsistent. This test can be used to check for the endogeneity of a variable (by comparing instrumental variable (IV) estimates to

ordinary least squares In statistics, ordinary least squares (OLS) is a type of linear least squares method for choosing the unknown parameters in a linear regression model (with fixed level-one effects of a linear function of a set of explanatory variables) by the prin ...

(OLS) estimates). It can also be used to check the validity of extra

instruments Instrument may refer to: Science and technology * Flight instruments, the devices used to measure the speed, altitude, and pertinent flight angles of various kinds of aircraft * Laboratory equipment, the measuring tools used in a scientific l ...

by comparing IV estimates using a full set of instruments ''Z'' to IV estimates that use a proper subset of ''Z''. Note that in order for the test to work in the latter case, we must be certain of the validity of the subset of ''Z'' and that subset must have enough instruments to identify the parameters of the equation. Hausman also showed that the covariance between an efficient estimator and the difference of an efficient and inefficient estimator is zero.

Derivation

Assuming joint normality of the estimators. :

\sqrt \begin b_1 -b\\ b_0 -b\end \xrightarrow \mathcal \left(\begin 0 \\ 0 \end, \begin\operatorname(b_1) & \operatorname(b_1,b_0) \\ \operatorname(b_1,b_0) & \operatorname(b_0)  \end\right)

Consider the function :

q=b_0-b_1\Rightarrow \operatornameq=0

By the delta method :

& \operatorname(q)=\operatorname(b_)+\operatorname(b_)-2\operatorname(b_1,b_0) \end

Using the commonly used result, showed by Hausman, that the covariance of an efficient estimator with its difference from an inefficient estimator is zero yields :

\operatorname(q)=\operatorname(b_0)-\operatorname(b_1)

The chi-squared test is based on the Wald criterion :

(b_1-b_0)'\big(\operatorname(b_0)-\operatorname(b_1)\big)^\dagger(b_1-b_0),

where ^† denotes the Moore–Penrose pseudoinverse

Panel data

The Hausman test can be used to differentiate between fixed effects model and random effects model in panel analysis. In this case, Random effects (RE) is preferred under the null hypothesis due to higher efficiency, while under the alternative Fixed effects (FE) is at least as consistent and thus preferred.

Details

Derivation

Panel data

See also

References

Further reading