In statistics, the residual sum of squares (RSS), also known as the sum of squared estimate of errors (SSE), is the

sum Sum most commonly means the total of two or more numbers added together; see addition. Sum can also refer to: Mathematics * Sum (category theory), the generic concept of summation in mathematics * Sum, the result of summation, the additio ...

of the

squares In Euclidean geometry, a square is a regular quadrilateral, which means that it has four equal sides and four equal angles (90-degree angles, π/2 radian angles, or right angles). It can also be defined as a rectangle with two equal-length ad ...

of residuals (deviations predicted from actual empirical values of data). It is a measure of the discrepancy between the data and an estimation model, such as a

linear regression In statistics, linear regression is a linear approach for modelling the relationship between a scalar response and one or more explanatory variables (also known as dependent and independent variables). The case of one explanatory variable is ...

. A small RSS indicates a tight fit of the model to the data. It is used as an

optimality criterion In statistics, an optimality criterion provides a measure of the fit of the data to a given hypothesis, to aid in model selection. A model is designated as the "best" of the candidate models if it gives the best value of an objective function m ...

in parameter selection and

model selection Model selection is the task of selecting a statistical model from a set of candidate models, given data. In the simplest cases, a pre-existing set of data is considered. However, the task can also involve the design of experiments such that the ...

. In general,

total sum of squares In statistical data analysis the total sum of squares (TSS or SST) is a quantity that appears as part of a standard way of presenting results of such analyses. For a set of observations, y_i, i\leq n, it is defined as the sum over all squared dif ...

explained sum of squares In statistics, the explained sum of squares (ESS), alternatively known as the model sum of squares or sum of squares due to regression (SSR – not to be confused with the residual sum of squares (RSS) or sum of squares of errors), is a quantity ...

+ residual sum of squares. For a proof of this in the multivariate

ordinary least squares In statistics, ordinary least squares (OLS) is a type of linear least squares method for choosing the unknown parameters in a linear regression model (with fixed level-one effects of a linear function of a set of explanatory variables) by the ...

(OLS) case, see partitioning in the general OLS model.

One explanatory variable

In a model with a single explanatory variable, RSS is given by: :

\operatorname = \sum_^n (y_i - f(x_i))^2

where ''y''_''i'' is the ''i''^th value of the variable to be predicted, ''x''_''i'' is the ''i''^th value of the explanatory variable, and

f(x_i)

is the predicted value of ''y''_''i'' (also termed

\hat

). In a standard linear simple

regression model In statistical modeling, regression analysis is a set of statistical processes for estimating the relationships between a dependent variable (often called the 'outcome' or 'response' variable, or a 'label' in machine learning parlance) and one o ...

y_i = \alpha + \beta x_i+\varepsilon_i\,

, where

\alpha

and

\beta

are

coefficient In mathematics, a coefficient is a multiplicative factor in some term of a polynomial, a series, or an expression; it is usually a number, but may be any expression (including variables such as , and ). When the coefficients are themselves ...

s, ''y'' and ''x'' are the regressand and the

regressor Dependent and independent variables are variables in mathematical modeling, statistical modeling and experimental sciences. Dependent variables receive this name because, in an experiment, their values are studied under the supposition or dema ...

, respectively, and ε is the

error term In mathematics and statistics, an error term is an additive type of error. Common examples include: * errors and residuals in statistics, e.g. in linear regression In statistics, linear regression is a linear approach for modelling the relati ...

. The sum of squares of residuals is the sum of squares of

\widehat_i

; that is :

\operatorname = \sum_^n (\widehat_i)^2 = \sum_^n (y_i - (\widehat + \widehat x_i))^2

where

\widehat

is the estimated value of the constant term

\alpha

and

\widehat

is the estimated value of the slope coefficient

\beta

Matrix expression for the OLS residual sum of squares

The general regression model with observations and explanators, the first of which is a constant unit vector whose coefficient is the regression intercept, is :

y = X \beta + e

where is an ''n'' × 1 vector of dependent variable observations, each column of the ''n'' × ''k'' matrix is a vector of observations on one of the ''k'' explanators,

\beta

is a ''k'' × 1 vector of true coefficients, and is an ''n''× 1 vector of the true underlying errors. The

estimator for

\beta

is :

X \hat \beta = y \iff

X^\operatorname X \hat \beta = X^\operatorname y \iff

\hat \beta = (X^\operatorname X)^X^\operatorname y.

The residual vector

\hat e = y - X \hat \beta = y - X (X^\operatorname X)^X^\operatorname y

; so the residual sum of squares is: :

\operatorname = \hat e ^\operatorname \hat e =  \,  \hat e \, ^2

, (equivalent to the square of the

norm Naturally occurring radioactive materials (NORM) and technologically enhanced naturally occurring radioactive materials (TENORM) consist of materials, usually industrial wastes or by-products enriched with radioactive elements found in the envir ...

of residuals). In full: :

\operatorname = y^\operatorname y - y^\operatorname X(X^\operatorname X)^ X^\operatorname y = y^\operatorname - X(X^\operatorname X)^ X^\operatorname y = y^\operatorname - H y

, where is the

hat matrix In statistics, the projection matrix (\mathbf), sometimes also called the influence matrix or hat matrix (\mathbf), maps the vector of response values (dependent variable values) to the vector of fitted values (or predicted values). It describes ...

, or the projection matrix in linear regression.

Relation with Pearson's product-moment correlation

The least-squares regression line is given by :

y=ax+b

, where

b=\bar-a\bar

and

a=\frac

, where

S_=\sum_^n(\bar-x_i)(\bar-y_i)

and

S_=\sum_^n(\bar-x_i)^2.

Therefore, :

& = \sum_^n (a(\bar-x_i)-(\bar-y_i))^2=a^2S_-2aS_+S_=S_-aS_=S_ \left(1-\frac \right) \end

where

S_=\sum_^n (\bar-y_i)^2 .

The Pearson product-moment correlation is given by

r=\frac;

therefore,

\operatorname=S_(1-r^2).

References

* {{cite book , title = Applied Regression Analysis , edition = 3rd , last1= Draper , first1=N.R. , last2=Smith , first2=H. , publisher = John Wiley , year = 1998 , isbn = 0-471-17082-8 Least squares Errors and residuals

One explanatory variable

Matrix expression for the OLS residual sum of squares

Relation with Pearson's product-moment correlation

See also

References