The Chow test (), proposed by

econometrician Econometrics is an application of statistical methods to economic data in order to give empirical content to economic relationships. M. Hashem Pesaran (1987). "Econometrics", '' The New Palgrave: A Dictionary of Economics'', v. 2, p. 8 p. 8� ...

Gregory Chow in 1960, is a

statistical test A statistical hypothesis test is a method of statistical inference used to decide whether the data provide sufficient evidence to reject a particular hypothesis. A statistical hypothesis test typically involves a calculation of a test statistic. ...

of whether the true coefficients in two

linear regression In statistics, linear regression is a statistical model, model that estimates the relationship between a Scalar (mathematics), scalar response (dependent variable) and one or more explanatory variables (regressor or independent variable). A mode ...

s on different data sets are equal. In econometrics, it is most commonly used in

time series analysis In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. ...

to test for the presence of a structural break at a period which can be assumed to be known ''a priori'' (for instance, a major historical event such as a war). In

program evaluation Program evaluation is a systematic method for collecting, analyzing, and using information to answer questions about projects, policies and programs, particularly about their effectiveness and efficiency. In the public, private, and voluntar ...

, the Chow test is often used to determine whether the independent variables have different impacts on different subgroups of the population.

Illustrations

First Chow Test

Suppose that we model our data as :

y_t=a+bx_ + cx_ + \varepsilon.\,

If we split our data into two groups, then we have :

y_t=a_1+b_1x_ + c_1x_ + \varepsilon \,

and :

y_t=a_2+b_2x_ + c_2x_ + \varepsilon. \,

The

null hypothesis The null hypothesis (often denoted ''H''0) is the claim in scientific research that the effect being studied does not exist. The null hypothesis can also be described as the hypothesis in which no relationship exists between two sets of data o ...

of the Chow test asserts that

a_1=a_2

b_1=b_2

, and

c_1=c_2

, and there is the assumption that the model errors

\varepsilon

are independent and identically distributed from a

normal distribution In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is f(x) = \frac ...

with unknown

variance In probability theory and statistics, variance is the expected value of the squared deviation from the mean of a random variable. The standard deviation (SD) is obtained as the square root of the variance. Variance is a measure of dispersion ...

. Let

S_C

be the sum of squared residuals from the combined data,

S_1

be the sum of squared residuals from the first group, and

S_2

be the sum of squared residuals from the second group.

N_1

and

N_2

are the number of observations in each group and

k

is the total number of parameters (in this case 3, i.e. 2 independent variables coefficients + intercept). Then the Chow

test statistic Test statistic is a quantity derived from the sample for statistical hypothesis testing.Berger, R. L.; Casella, G. (2001). ''Statistical Inference'', Duxbury Press, Second Edition (p.374) A hypothesis test is typically specified in terms of a tes ...

is :

\frac.

The test statistic follows the ''F''-distribution with

k

and

N_1+N_2-2k

degrees of freedom In many scientific fields, the degrees of freedom of a system is the number of parameters of the system that may vary independently. For example, a point in the plane has two degrees of freedom for translation: its two coordinates; a non-infinite ...

. The same result can be achieved via dummy variables. Consider the two data sets which are being compared. Firstly there is the 'primary'

data set A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more table (database), database tables, where every column (database), column of a table represents a particular Variable (computer sci ...

i= and the 'secondary' data set i=. Then there is the union of these two sets: i=. If there is no structural change between the primary and secondary data sets a regression can be run over the union without the issue of biased estimators arising. Consider the regression:

y_t=\beta_0+\beta_1x_ + \beta_2x_ + ... + \beta_kx_ + \gamma_0D_t + \sum_^k\gamma_ix_D_t +  \varepsilon_t.\,

Which is run over i=. D is a dummy variable taking a value of 1 for i= and 0 otherwise. If both data sets can be explained fully by

(\beta_0,\beta_1,...,\beta_k)

then there is no use in the dummy variable as the data set is explained fully by the restricted equation. That is, under the assumption of no structural change we have a null and alternative hypothesis of:

H_0: \gamma_0=0,\gamma_1=0,...,\gamma_k=0

H_1: \text

The null hypothesis of joint insignificance of D can be run as an

F-test An F-test is a statistical test that compares variances. It is used to determine if the variances of two samples, or if the ratios of variances among multiple samples, are significantly different. The test calculates a Test statistic, statistic, ...

with

n-2(k+1)

degrees of freedom (DoF). That is:

F=\frac

. Remarks * The global sum of squares (SSE) is often called the Restricted Sum of Squares (RSSM) as we basically test a constrained model where we have

2k

assumptions (with

k

the number of regressors). * Some software like SAS will use a predictive Chow test when the size of a subsample is less than the number of regressors.

References

* * * * *

External links

{{commonscat, Chow test
Computing the Chow statistic
Series of FAQ explanations from the Stata Corporation at https://www.stata.com/support/faqs/

Series of FAQ explanations from the SAS Institute, SAS Corporation Time series statistical tests Regression diagnostics