Two-step M-estimators deals with
M-estimation problems that require preliminary estimation to obtain the parameter of interest. Two-step M-estimation is different from usual M-estimation problem because asymptotic distribution of the second-step estimator generally depends on the first-step estimator. Accounting for this change in asymptotic distribution is important for valid inference.
Description
The class of two-step M-estimators includes
Heckman's sample selection estimator, weighted
non-linear least squares
Non-linear least squares is the form of least squares analysis used to fit a set of ''m'' observations with a model that is non-linear in ''n'' unknown parameters (''m'' ≥ ''n''). It is used in some forms of nonlinear regression. The ...
, and
ordinary least squares
In statistics, ordinary least squares (OLS) is a type of linear least squares method for choosing the unknown parameters in a linear regression model (with fixed level-one effects of a linear function of a set of explanatory variables) by the ...
with
generated regressors
In least squares estimation problems, sometimes one or more regressors
Dependent and independent variables are variables in mathematical modeling, statistical modeling and experimental sciences. Dependent variables receive this name because, ...
.
[Wooldridge, J.M., Econometric Analysis of Cross Section and Panel Data, MIT Press, Cambridge, Mass.]
To fix ideas, let
be an
i.i.d. sample.
and
are subsets of Euclidean spaces
and
, respectively. Given a function
, two-step M-estimator
is defined as:
:
where
is an M-estimate of a
nuisance parameter
Nuisance (from archaic ''nocence'', through Fr. ''noisance'', ''nuisance'', from Lat. ''nocere'', "to hurt") is a common law tort. It means that which causes offence, annoyance, trouble or injury. A nuisance can be either public (also "common") ...
that needs to be calculated in the first step.
Consistency
In classical deductive logic, a consistent theory is one that does not lead to a logical contradiction. The lack of contradiction can be defined in either semantic or syntactic terms. The semantic definition states that a theory is consistent ...
of two-step M-estimators can be verified by checking consistency conditions for usual M-estimators, although some modification might be necessary. In practice, the important condition to check is the
identification condition
In statistics, identifiability is a property which a statistical model, model must satisfy for precise statistical inference, inference to be possible. A model is identifiable if it is theoretically possible to learn the true values of this model' ...
.
If
where
is a non-random vector, then the identification condition is that