Redescending M-estimator
   HOME

TheInfoList



OR:

In
statistics Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
, redescending M-estimators are ''Ψ''-type
M-estimator In statistics, M-estimators are a broad class of extremum estimators for which the objective function is a sample average. Both non-linear least squares and maximum likelihood estimation are special cases of M-estimators. The definition of M-estima ...
s which have ''ψ'' functions that are non-decreasing near the origin, but decreasing toward 0 far from the origin. Their ''ψ'' functions can be chosen to redescend smoothly to zero, so that they usually satisfy ''ψ''(''x'') = 0 for all x with , ''x'', > ''r'', where ''r'' is referred to as the minimum rejection point. Due to these properties of the ''ψ'' function, these kinds of estimators are very efficient, have a high breakdown point and, unlike other outlier rejection techniques, they do not suffer from a masking effect. They are efficient because they completely reject gross outliers, and do not completely ignore moderately large outliers (like median).


Advantages

Redescending M-estimators have high breakdown points (close to 0.5), and their ''Ψ'' function can be chosen to redescend smoothly to 0. This means that moderately large outliers are not ignored completely, and greatly improves the efficiency of the redescending M-estimator. The redescending M-estimators are slightly more efficient than the Huber estimator for several symmetric, wider tailed distributions, but about 20% more efficient than the Huber estimator for the
Cauchy distribution The Cauchy distribution, named after Augustin Cauchy, is a continuous probability distribution. It is also known, especially among physicists, as the Lorentz distribution (after Hendrik Lorentz), Cauchy–Lorentz distribution, Lorentz(ian) fun ...
. This is because they completely reject gross outliers, while the Huber estimator effectively treats these the same as moderate outliers. As other M-estimators, but unlike other outlier rejection techniques, they do not suffer from masking effects.


Disadvantages

The M-estimating equation for a redescending estimator may not have a unique solution. Consequently, the initial point for an iterative solution must be chosen with care, e.g., by use of another estimator.


Choosing redescending ''Ψ'' functions

When choosing a redescending ''Ψ'' function, care must be taken such that it does not descend too steeply, which may have a very bad influence on the denominator in the expression for the asymptotic variance : \frac where ''F'' is the mixture model distribution. This effect is particularly harmful when a large negative value of ''ψ''′(''x'') combines with a large positive value of ''ψ''2(''x''), and there is a cluster of outliers near ''x''.


Examples

1. Hampel's three-part M estimators have ''Ψ'' functions which are odd functions and defined for any ''x'' by: :: \Psi(x)= \begin x, & 0\le , x, \le a \text\\ a\, \operatorname(x), & a\le , x, \le b \text\\ \frac\,\operatorname(x),& b\le , x, \le r \text\\ 0,& r\le , x, \qquad\, \text \end This function is plotted in the following figure for ''a'' = 1.645, ''b'' = 3 and ''r'' = 6.5. 2. Tukey's biweight or bisquare M estimators have ''Ψ'' functions for any positive ''k'', which defined by: :\Psi(x)=x(1-(x/k)^2)^2 ; \ , x, \le k This function is plotted in the following figure for ''k'' = 5. 3. Andrew's sine wave M estimator has the following Ψ function: :\Psi(x)=\sin{(x)};\ -\pi \le x \le\pi This function is plotted in the following figure.


References

* ''Redescending M-estimators'', Shevlyakov, G, Morgenthaler, S and Shurygin, A. M., J Stat Plann Inference 138:2906–2917, 2008. * ''Robust Estimation and Testing'', Robert G. Staudte and Simon J. Sheather, Wiley 1990. * ''Robust Statistics'',Huber, P., New York: Wiley, 1981.


See also

*
M-estimator In statistics, M-estimators are a broad class of extremum estimators for which the objective function is a sample average. Both non-linear least squares and maximum likelihood estimation are special cases of M-estimators. The definition of M-estima ...
*
Robust statistics Robust statistics are statistics with good performance for data drawn from a wide range of probability distributions, especially for distributions that are not normal. Robust statistical methods have been developed for many common problems, suc ...
Robust statistics M-estimators