Semiparametric Model
   HOME

TheInfoList



OR:

In
statistics Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
, a semiparametric model is a statistical model that has parametric and
nonparametric Nonparametric statistics is the branch of statistics that is not based solely on Statistical parameter, parametrized families of probability distributions (common examples of parameters are the mean and variance). Nonparametric statistics is based ...
components. A statistical model is a parameterized family of distributions: \ indexed by a
parameter A parameter (), generally, is any characteristic that can help in defining or classifying a particular system (meaning an event, project, object, situation, etc.). That is, a parameter is an element of a system that is useful, or critical, when ...
\theta. * A parametric model is a model in which the indexing parameter \theta is a vector in k-dimensional
Euclidean space Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, that is, in Euclid's Elements, Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics ther ...
, for some nonnegative integer k.. Thus, \theta is finite-dimensional, and \Theta \subseteq \mathbb^k. * With a nonparametric model, the set of possible values of the parameter \theta is a subset of some space V, which is not necessarily finite-dimensional. For example, we might consider the set of all distributions with mean 0. Such spaces are vector spaces with topological structure, but may not be finite-dimensional as vector spaces. Thus, \Theta \subseteq V for some possibly
infinite-dimensional space In mathematics, the dimension of a vector space ''V'' is the cardinality (i.e., the number of vectors) of a basis of ''V'' over its base field. p. 44, ยง2.36 It is sometimes called Hamel dimension (after Georg Hamel) or algebraic dimension to dis ...
V. * With a semiparametric model, the parameter has both a finite-dimensional component and an infinite-dimensional component (often a real-valued function defined on the real line). Thus, \Theta \subseteq \mathbb^k \times V, where V is an infinite-dimensional space. It may appear at first that semiparametric models include nonparametric models, since they have an infinite-dimensional as well as a finite-dimensional component. However, a semiparametric model is considered to be "smaller" than a completely nonparametric model because we are often interested only in the finite-dimensional component of \theta. That is, the infinite-dimensional component is regarded as a
nuisance parameter Nuisance (from archaic ''nocence'', through Fr. ''noisance'', ''nuisance'', from Lat. ''nocere'', "to hurt") is a common law tort. It means that which causes offence, annoyance, trouble or injury. A nuisance can be either public (also "common") ...
.. In nonparametric models, by contrast, the primary interest is in estimating the infinite-dimensional parameter. Thus the estimation task is statistically harder in nonparametric models. These models often use
smoothing In statistics and image processing, to smooth a data set is to create an approximating function that attempts to capture important patterns in the data, while leaving out noise or other fine-scale structures/rapid phenomena. In smoothing, the dat ...
or
kernel Kernel may refer to: Computing * Kernel (operating system), the central component of most operating systems * Kernel (image processing), a matrix used for image convolution * Compute kernel, in GPGPU programming * Kernel method, in machine learn ...
s.


Example

A well-known example of a semiparametric model is the
Cox proportional hazards model Proportional hazards models are a class of survival models in statistics. Survival models relate the time that passes, before some event occurs, to one or more covariates that may be associated with that quantity of time. In a proportional haz ...
. If we are interested in studying the time T to an event such as death due to cancer or failure of a light bulb, the Cox model specifies the following distribution function for T: : F(t) = 1 - \exp\left(-\int_0^t \lambda_0(u) e^ du\right), where x is the covariate vector, and \beta and \lambda_0(u) are unknown parameters. \theta = (\beta, \lambda_0(u)). Here \beta is finite-dimensional and is of interest; \lambda_0(u) is an unknown non-negative function of time (known as the baseline hazard function) and is often a
nuisance parameter Nuisance (from archaic ''nocence'', through Fr. ''noisance'', ''nuisance'', from Lat. ''nocere'', "to hurt") is a common law tort. It means that which causes offence, annoyance, trouble or injury. A nuisance can be either public (also "common") ...
. The set of possible candidates for \lambda_0(u) is infinite-dimensional.


See also

*
Semiparametric regression In statistics, semiparametric regression includes regression models that combine parametric and nonparametric models. They are often used in situations where the fully nonparametric model may not perform well or when the researcher wants to u ...
* Statistical model * Generalized method of moments


Notes


References

* * * *{{citation , first1= Anastasios A. , last1= Tsiatis , title= Semiparametric Theory and Missing Data , year= 2006 , publisher= Springer *Begun, Janet M.; Hall, W. J.; Huang, Wei-Min; Wellner, Jon A. (1983), "Information and asymptotic efficiency in parametric--nonparametric models", Annals of Statistics, 11 (1983), no. 2, 432--452 Mathematical and quantitative methods (economics)