__NOTOC__ A Newey–West estimator is used in

statistics Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...

and

econometrics Econometrics is the application of Statistics, statistical methods to economic data in order to give Empirical evidence, empirical content to economic relationships.M. Hashem Pesaran (1987). "Econometrics," ''The New Palgrave: A Dictionary of ...

to provide an estimate of the

covariance matrix In probability theory and statistics, a covariance matrix (also known as auto-covariance matrix, dispersion matrix, variance matrix, or variance–covariance matrix) is a square matrix giving the covariance between each pair of elements of ...

of the parameters of a regression-type model where the standard assumptions of

regression analysis In statistical modeling, regression analysis is a set of statistical processes for estimating the relationships between a dependent variable (often called the 'outcome' or 'response' variable, or a 'label' in machine learning parlance) and one ...

do not apply. It was devised by

Whitney K. Newey Whitney Kent Newey (born July 17, 1954) is the Jane Berkowitz Carlton and Dennis William Carlton Professor of Economics at the Massachusetts Institute of Technology and a well-known econometrician. He is best known for developing, with Kenneth D ...

and

Kenneth D. West Kenneth David West (born 1953) is the John D. MacArthur and Ragnar Frisch Professor of Economics in the Department of Economics at the University of Wisconsin. He is currently co-editor of the ''Journal of Money, Credit and Banking'', and has p ...

in 1987, although there are a number of later variants. The estimator is used to try to overcome

autocorrelation Autocorrelation, sometimes known as serial correlation in the discrete time case, is the correlation of a signal with a delayed copy of itself as a function of delay. Informally, it is the similarity between observations of a random variable ...

(also called serial correlation), and

heteroskedasticity In statistics, a sequence (or a vector) of random variables is homoscedastic () if all its random variables have the same finite variance. This is also known as homogeneity of variance. The complementary notion is called heteroscedasticity. The ...

in the error terms in the models, often for regressions applied to

time series In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. Exa ...

data. The abbreviation "HAC," sometimes used for the estimator, stands for "heteroskedasticity and autocorrelation consistent." There are a number of HAC estimators described in, and HAC estimator does not refer uniquely to Newey-West. One version of Newey-West Bartlett requires the user to specify the bandwidth and usage of the Bartlett Kernel from

Kernel density estimation In statistics, kernel density estimation (KDE) is the application of kernel smoothing for probability density estimation, i.e., a non-parametric method to estimate the probability density function of a random variable based on ''kernels'' as w ...

Regression models estimated with time series data often exhibit autocorrelation; that is, the

error term In mathematics and statistics, an error term is an additive type of error. Common examples include: * errors and residuals in statistics, e.g. in linear regression * the error term in numerical integration In analysis, numerical integration ...

s are correlated over time. The heteroscedastic consistent estimator of the error covariance is constructed from a term

X^\Sigma X

, where

X

is the design matrix for the regression problem and

\Sigma

is the covariance matrix of the residuals. The least squares estimator

b

is a

consistent estimator In statistics, a consistent estimator or asymptotically consistent estimator is an estimator—a rule for computing estimates of a parameter ''θ''0—having the property that as the number of data points used increases indefinitely, the result ...

\beta

. This implies that the

least squares The method of least squares is a standard approach in regression analysis to approximate the solution of overdetermined systems (sets of equations in which there are more equations than unknowns) by minimizing the sum of the squares of the res ...

residuals

e_i

are "point-wise" consistent estimators of their population counterparts

E_i

. The general approach, then, will be to use

X

and

e

to devise an estimator of

X^\Sigma X

. This means that as the time between error terms increases, the correlation between the error terms decreases. The estimator thus can be used to improve the

ordinary least squares In statistics, ordinary least squares (OLS) is a type of linear least squares method for choosing the unknown parameters in a linear regression model (with fixed level-one effects of a linear function of a set of explanatory variables) by the prin ...

(OLS)

regression Regression or regressions may refer to: Science * Marine regression, coastal advance due to falling sea level, the opposite of marine transgression * Regression (medicine), a characteristic of diseases to express lighter symptoms or less extent ( ...

when the residuals are heteroskedastic and/or autocorrelated. :

X^\Sigma X=\frac \sum^_ e_t^x_t x^_t + \frac \sum^_ \sum^_ w_ e_t e_(x_t x^_ + x_ x^_t)

w_\ell=1 - \frac\ell

where T is the sample size,

e_t

is the

t^

residual and

x_t

is the

t^

row of the design matrix, and

w_\ell

is the Bartlett Kernel and can be thought of as a weight that decreases with increasing separation between samples. Disturbances that are farther apart from each other are given lower weight, while those with equal subscripts are given a weight of 1. This ensures that second term converges (in some appropriate sense) to a finite matrix. This weighting scheme also ensures that the resulting covariance matrix is positive semi-definite. L=0 reduces the Newy-West estimator to

Huber–White standard error The topic of heteroskedasticity-consistent (HC) standard errors arises in statistics and econometrics in the context of linear regression and time series analysis. These are also known as heteroskedasticity-robust standard errors (or simply robus ...

. L specifies the "maximum lag considered for the control of autocorrelation. A common choice for L" is

T^

Software implementations

Julia Julia is usually a feminine given name. It is a Latinate feminine form of the name Julio and Julius. (For further details on etymology, see the Wiktionary entry "Julius".) The given name ''Julia'' had been in use throughout Late Antiquity (e.g. ...

, the CovarianceMatrices.jl package supports several types of heteroskedasticity and autocorrelation consistent covariance matrix estimation including Newey–West, White, and Arellano. In R, the packages sandwich and plm include a function for the Newey–West estimator. In

Stata Stata (, , alternatively , occasionally stylized as STATA) is a general-purpose statistical software package developed by StataCorp for data manipulation, visualization, statistics, and automated reporting. It is used by researchers in many fie ...

, the command newey produces Newey–West standard errors for coefficients estimated by OLS regression. In

MATLAB MATLAB (an abbreviation of "MATrix LABoratory") is a proprietary multi-paradigm programming language and numeric computing environment developed by MathWorks. MATLAB allows matrix manipulations, plotting of functions and data, implementation ...

, the command hac in the Econometrics toolbox produces the Newey–West estimator (among others). In

Python Python may refer to: Snakes * Pythonidae, a family of nonvenomous snakes found in Africa, Asia, and Australia ** ''Python'' (genus), a genus of Pythonidae found in Africa and Asia * Python (mythology), a mythical serpent Computing * Python (pro ...

, the statsmodels module includes functions for the covariance matrix using Newey-West. In

Gretl gretl is an open-source statistical package, mainly for econometrics. The name is an acronym for ''G''nu ''R''egression, ''E''conometrics and ''T''ime-series ''L''ibrary. It has both a graphical user interface (GUI) and a command-line interfa ...

, the option --robust to several estimation commands (such as ols) in the context of a time-series dataset produces Newey–West standard errors. In SAS, the Newey-West corrected standard errors can be obtained in PROC AUTOREG and PROC MODEL

Software implementations

See also

References

Further reading