Nonlinear Mixed-effects Model
   HOME

TheInfoList



OR:

Nonlinear mixed-effects models constitute a class of
statistical model A statistical model is a mathematical model that embodies a set of statistical assumptions concerning the generation of Sample (statistics), sample data (and similar data from a larger Statistical population, population). A statistical model repres ...
s generalizing linear mixed-effects models. Like linear mixed-effects models, they are particularly useful in settings where there are multiple measurements within the same
statistical unit In statistics, a unit is one member of a set of entities being studied. It is the main source for the mathematical abstraction of a "random variable". Common examples of a unit would be a single person, animal, plant, manufactured item, or country ...
s or when there are dependencies between measurements on related statistical units. Nonlinear mixed-effects models are applied in many fields including
medicine Medicine is the science and practice of caring for a patient, managing the diagnosis, prognosis, prevention, treatment, palliation of their injury or disease, and promoting their health. Medicine encompasses a variety of health care pract ...
,
public health Public health is "the science and art of preventing disease, prolonging life and promoting health through the organized efforts and informed choices of society, organizations, public and private, communities and individuals". Analyzing the det ...
,
pharmacology Pharmacology is a branch of medicine, biology and pharmaceutical sciences concerned with drug or medication action, where a drug may be defined as any artificial, natural, or endogenous (from within the body) molecule which exerts a biochemica ...
, and
ecology Ecology () is the study of the relationships between living organisms, including humans, and their physical environment. Ecology considers organisms at the individual, population, community, ecosystem, and biosphere level. Ecology overlaps wi ...
.


Definition

While any
statistical model A statistical model is a mathematical model that embodies a set of statistical assumptions concerning the generation of Sample (statistics), sample data (and similar data from a larger Statistical population, population). A statistical model repres ...
containing both fixed effects and
random effect In statistics, a random effects model, also called a variance components model, is a statistical model where the model parameters are random variables. It is a kind of hierarchical linear model, which assumes that the data being analysed are ...
s is an example of a nonlinear mixed-effects model, the most commonly used models are members of the class of nonlinear mixed-effects models for repeated measures :_ = f(\phi_,_) + \epsilon_,\quad i =1,\ldots, M, \, j = 1,\ldots, n_i where *M is the number of groups/subjects, *n_i is the number of observations for the ith group/subject, *f is a real-valued differentiable function of a group-specific parameter vector \theta_ and a covariate vector v_, *\phi_ is modeled as a linear mixed-effects model \phi_= \boldsymbol_\beta + \boldsymbol_ \boldsymbol_, where \beta is a vector of fixed effects and \boldsymbol_ is a vector of random effects associated with group i, and *\epsilon_ is a random variable describing additive noise.


Estimation

When the model is only nonlinear in fixed effects and the random effects are Gaussian,
maximum-likelihood estimation In statistics, maximum likelihood estimation (MLE) is a method of estimation theory, estimating the Statistical parameter, parameters of an assumed probability distribution, given some observed data. This is achieved by Mathematical optimization, ...
can be done using
nonlinear least squares Non-linear least squares is the form of least squares analysis used to fit a set of ''m'' observations with a model that is non-linear in ''n'' unknown parameters (''m'' ≥ ''n''). It is used in some forms of nonlinear regression. The ...
methods, although asymptotic properties of estimators and
test statistic A test statistic is a statistic (a quantity derived from the sample) used in statistical hypothesis testing.Berger, R. L.; Casella, G. (2001). ''Statistical Inference'', Duxbury Press, Second Edition (p.374) A hypothesis test is typically specif ...
s may differ from the conventional
general linear model The general linear model or general multivariate regression model is a compact way of simultaneously writing several multiple linear regression models. In that sense it is not a separate statistical linear model. The various multiple linear regre ...
. In the more general setting, there exist several methods for doing
maximum-likelihood estimation In statistics, maximum likelihood estimation (MLE) is a method of estimation theory, estimating the Statistical parameter, parameters of an assumed probability distribution, given some observed data. This is achieved by Mathematical optimization, ...
or
maximum a posteriori estimation In Bayesian statistics, a maximum a posteriori probability (MAP) estimate is an estimate of an unknown quantity, that equals the mode of the posterior distribution. The MAP can be used to obtain a point estimate of an unobserved quantity on the b ...
in certain classes of nonlinear mixed-effects models – typically under the assumption of normally distributed random variables. A popular approach is the Lindstrom-Bates algorithm which relies on iteratively optimizing a nonlinear problem, locally linearizing the model around this optimum and then employing conventional methods from linear mixed-effects models to do maximum likelihood estimation. Stochastic approximation of the expectation-maximization algorithm gives an alternative approach for doing maximum-likelihood estimation.


Applications


Example: Disease progression modeling

Nonlinear mixed-effects models have been used for modeling progression of disease. In
progressive disease Progressive disease or progressive illness is a disease or physical ailment whose course in most cases is the worsening, growth, or spread of the disease. This may happen until death, serious debility, or organ failure occurs. Some progressive di ...
, the temporal patterns of progression on outcome variables may follow a nonlinear temporal shape that is similar between patients. However, the stage of disease of an individual may not be known or only partially known from what can be measured. Therefore, a
latent Latency or latent may refer to: Science and technology * Latent heat, energy released or absorbed, by a body or a thermodynamic system, during a constant-temperature process * Latent variable, a variable that is not directly observed but inferred ...
time variable that describe individual disease stage (i.e. where the patient is along the nonlinear mean curve) can be included in the model.


Example: Modeling cognitive decline in Alzheimer's disease

Alzheimer's disease Alzheimer's disease (AD) is a neurodegeneration, neurodegenerative disease that usually starts slowly and progressively worsens. It is the cause of 60–70% of cases of dementia. The most common early symptom is difficulty in short-term me ...
is characterized by a progressive cognitive deterioration. However, patients may differ widely in cognitive ability and
reserve Reserve or reserves may refer to: Places * Reserve, Kansas, a US city * Reserve, Louisiana, a census-designated place in St. John the Baptist Parish * Reserve, Montana, a census-designated place in Sheridan County * Reserve, New Mexico, a US vi ...
, so
cognitive testing Cognitive tests are assessments of the cognitive capabilities of humans and other animals. Tests administered to humans include various forms of IQ tests; those administered to animals include the mirror test (a test of visual self-awareness) and ...
at a single time point can often only be used to coarsely group individuals in different stages of disease. Now suppose we have a set of longitudinal cognitive data (y_, \ldots, y_) from i=1,\ldots,M individuals that are each categorized as having either normal cognition (CN),
mild cognitive impairment Mild cognitive impairment (MCI) is a neurocognitive disorder which involves cognitive impairments beyond those expected based on an individual's age and education but which are not significant enough to interfere with instrumental activities of ...
(MCI) or
dementia Dementia is a disorder which manifests as a set of related symptoms, which usually surfaces when the brain is damaged by injury or disease. The symptoms involve progressive impairments in memory, thinking, and behavior, which negatively affe ...
(DEM) at the baseline visit (time t_ =0 corresponding to measurement y_ ). These longitudinal trajectories can be modeled using a nonlinear mixed effects model that allows differences in disease state based on baseline categorization: :_ = f_(t_ + A^_i \beta^ + A^_i \beta^ + b_i) + \epsilon_,\quad i =1,\ldots, M, \, j = 1,\ldots, n_i where *f_ is a function that models the mean time-profile of cognitive decline whose shape is determined by the parameters \tilde\beta, *t_ represents observation time (e.g. time since baseline in the study), *A^_i and A^_i are dummy variables that are 1 if individual i has MCI or dementia at baseline and 0 otherwise, *\beta^ and \beta^ are parameters that model the difference in disease progression of the MCI and dementia groups relative to the cognitively normal, *b_ is the difference in disease stage of individual i relative to his/her baseline category, and *\epsilon_ is a random variable describing additive noise. An example of such a model with an
exponential Exponential may refer to any of several mathematical topics related to exponentiation, including: *Exponential function, also: **Matrix exponential, the matrix analogue to the above * Exponential decay, decrease at a rate proportional to value *Exp ...
mean function fitted to longitudinal measurements of the Alzheimer's Disease Assessment Scale-Cognitive Subscale (ADAS-Cog) in shown in the box. As shown, the inclusion of fixed effects of baseline categorization (MCI or dementia relative to normal cognition) and the random effect of individual continuous disease stage b_ aligns the trajectories of cognitive deterioration to reveal a common pattern of cognitive decline.


Example: Growth analysis

Growth phenomena often follow nonlinear patters (e.g.
logistic growth A logistic function or logistic curve is a common S-shaped curve (sigmoid curve) with equation f(x) = \frac, where For values of x in the domain of real numbers from -\infty to +\infty, the S-curve shown on the right is obtained, with the ...
,
exponential growth Exponential growth is a process that increases quantity over time. It occurs when the instantaneous rate of change (that is, the derivative) of a quantity with respect to time is proportional to the quantity itself. Described as a function, a q ...
, and
hyperbolic growth When a quantity grows towards a singularity under a finite variation (a "finite-time singularity") it is said to undergo hyperbolic growth. More precisely, the reciprocal function 1/x has a hyperbola as a graph, and has a singularity at 0, meani ...
). Factors such as nutrient deficiency may both directly affect the measured outcome (e.g. organisms with lack of nutrients end up smaller), but possibly also timing (e.g. organisms with lack of nutrients grow at a slower pace). If a model fails to account for the differences in timing, the estimated population-level curves may smooth out finer details due to lack of
synchronization Synchronization is the coordination of events to operate a system in unison. For example, the conductor of an orchestra keeps the orchestra synchronized or ''in time''. Systems that operate with all parts in synchrony are said to be synchronou ...
between organisms. Nonlinear mixed-effects models enable simultaneous modeling of individual differences in growth outcomes and timing.


Example: Modeling human height

Models for estimating the mean curves of human height and weight as a function of age and the natural variation around the mean are used to create
growth chart A growth chart is used by pediatricians and other health care providers to follow a child's growth over time. Growth charts have been constructed by observing the growth of large numbers of healthy children over time. The height, weight, and head ...
s. The growth of children can however become desynchronized due to both genetic and environmental factors. For example, age at onset of puberty and its associated height spurt can vary several years between adolescents. Therefore,
cross-sectional studies In medical research, social science, and biology, a cross-sectional study (also known as a cross-sectional analysis, transverse study, prevalence study) is a type of observational study that analyzes data from a population, or a representative su ...
may underestimate the magnitude of the pubertal height spurt because age is not synchronized with biological development. The differences in biological development can be modeled using random effects \boldsymbol_i that describe a mapping of observed age to a
latent Latency or latent may refer to: Science and technology * Latent heat, energy released or absorbed, by a body or a thermodynamic system, during a constant-temperature process * Latent variable, a variable that is not directly observed but inferred ...
biological age using a so-called ''warping function'' v(\cdot, \boldsymbol_i). A simple nonlinear mixed-effects model with this structure is given by :_ = f_(v(t_, \boldsymbol_i)) + \epsilon_,\quad i =1,\ldots, M, \, j = 1,\ldots, n_i where *f_ is a function that represents the height development of a typical child as a function of age. Its shape is determined by the parameters \beta, *t_ is the age of child i corresponding to the height measurement y_, *v(\cdot, \boldsymbol_i) is a warping function that maps age to biological development to synchronize. Its shape is determined by the random effects \boldsymbol_i, *\epsilon_ is a random variable describing additive variation (e.g. consistent differences in height between children and measurement noise). There exists several methods and software packages for fitting such models. The so-called ''SITAR'' model can fit such models using warping functions that are
affine transformation In Euclidean geometry, an affine transformation or affinity (from the Latin, ''affinis'', "connected with") is a geometric transformation that preserves lines and parallelism, but not necessarily Euclidean distances and angles. More generally, ...
s of time (i.e. additive shifts in biological age and differences in rate of maturation), while the so-called ''pavpop'' model can fit models with smoothly-varying warping functions. An example of the latter is shown in the box.


Example: Population Pharmacokinetic/pharmacodynamic modeling

PK/PD models PK/PD modeling (pharmacokinetic/pharmacodynamic modeling) (alternatively abbreviated as PKPD or PK-PD modeling) is a technique that combines the two classical pharmacologic disciplines of pharmacokinetics and pharmacodynamics. It integrates a pharm ...
for describing exposure-response relationships such as the Emax model can be formulated as nonlinear mixed-effects models. The mixed-model approach allows modeling of both population level and individual differences in effects that have a nonlinear effect on the observed outcomes, for example the rate at which a compound is being metabolized or distributed in the body.


Example: COVID-19 epidemiological modeling

The platform of the nonlinear mixed effect models can be used to describe infection trajectories of subjects and understand some common features shared across the subjects. In epidemiological problems, subjects can be countries, states, or counties, etc. This can be particularly useful in estimating a future trend of the epidemic in an early stage of pendemic where nearly little information is known regarding the disease.


Example: Prediction of oil production curve of shale oil wells at a new location with latent kriging

The eventual success of petroleum development projects relies on a large degree of well construction costs. As for
unconventional oil Unconventional oil is petroleum produced or extracted using techniques other than the conventional method (oil well). Industry and governments across the globe are investing in unconventional oil sources due to the increasing scarcity of conventio ...
and gas reservoirs, because of very low permeability, and a flow mechanism very different from that of conventional reservoirs, estimates for the well construction cost often contain high levels of uncertainty, and oil companies need to make heavy investment in the drilling and completion phase of the wells. The overall recent commercial success rate of horizontal wells in the United States is known to be 65%, which implies that only 2 out of 3 drilled wells will be commercially successful. For this reason, one of the crucial tasks of petroleum engineers is to quantify the uncertainty associated with oil or gas production from shale reservoirs, and further, to predict an approximated production behavior of a new well at a new location given specific completion data before actual drilling takes place to save a large degree of well construction costs. The platform of the nonlinear mixed effect models can be extended to consider the spatial association by incorporating the geostatistical processes such as
Gaussian process In probability theory and statistics, a Gaussian process is a stochastic process (a collection of random variables indexed by time or space), such that every finite collection of those random variables has a multivariate normal distribution, i.e. e ...
on the second stage of the model as follows: :_ = \mu(t;\theta_,\theta_,\theta_) + \epsilon_,\quad \quad\quad\quad\quad\quad\quad\quad\quad\quad\quad\quad\quad i =1,\ldots, N, \, t = 1,\ldots, T_i, \theta_=\theta_(s_i) = \alpha_l + \sum_^\beta_x_j + \epsilon_(s_i) + \eta_(s_i), \quad \epsilon_(\cdot) \sim GWN(\sigma_l^2), \quad\quad l=1,2,3, \eta_(\cdot) \sim GP(0,K_(\cdot, \cdot)),\quad K_(s_i,s_j) = \gamma_l^2 \exp (-e^ \, s_i - s_j \, ^2), \quad\quad\quad l=1,2,3, \beta_, \lambda_,\tau_l,\sigma_l \sim N(0,\sigma_l^2 \tau_l^2 \lambda_^2 ),\quad \sigma,\lambda_,\tau_l,\sigma_l\sim C^(0,1), \quad\quad\quad\quad\quad\quad\quad l=1,2,3,\, j =1,\cdots, p, \alpha_l \sim \pi(\alpha)\propto 1, \quad \sigma_l^2 \sim \pi(\sigma^2) \propto 1/\sigma^2, \quad\quad\quad\quad\quad\quad\quad\quad\quad\quad\quad\quad\quad l=1,2,3, where *\mu(t;\theta_,\theta_,\theta_) is a function that models the mean time-profile of log-scaled oil production rate whose shape is determined by the parameters (\theta_,\theta_,\theta_). The function is obtained from taking logarithm to the rate decline curve used in
decline curve analysis Decline curve analysis is a means of predicting future oil well or gas well production based on past production history. Production decline curve analysis is a traditional means of identifying well production problems and predicting well performan ...
, *x_i = (x_,\cdots,x_)^ represents covariates obtained from the completion process of the
hydraulic fracturing Fracking (also known as hydraulic fracturing, hydrofracturing, or hydrofracking) is a well stimulation technique involving the fracturing of bedrock formations by a pressurized liquid. The process involves the high-pressure injection of "frack ...
and horizontal
directional drilling Directional drilling (or slant drilling) is the practice of drilling non-vertical bores. It can be broken down into four main groups: oilfield directional drilling, utility installation directional drilling, directional boring (horizontal dire ...
for the i-th well, *s_i = (s_,s_)^ represents the spatial location (longitude, latitude) of the i-th well, *\epsilon_(\cdot) represents the Gaussian white noise with error variance \sigma_l^2 (also called the nugget effect), *\eta_(\cdot) represents the
Gaussian process In probability theory and statistics, a Gaussian process is a stochastic process (a collection of random variables indexed by time or space), such that every finite collection of those random variables has a multivariate normal distribution, i.e. e ...
with Gaussian
covariance function In probability theory and statistics, the covariance function describes how much two random variables change together (their ''covariance'') with varying spatial or temporal separation. For a random field or stochastic process ''Z''(''x'') on a doma ...
K_(\cdot, \cdot), *\beta represents the horseshoe shrinkage prior. The
Gaussian process regression In statistics, originally in geostatistics, kriging or Kriging, also known as Gaussian process regression, is a method of interpolation based on Gaussian process governed by prior covariances. Under suitable assumptions of the prior, kriging giv ...
s used on the latent level (the second stage) eventually produce
kriging In statistics, originally in geostatistics, kriging or Kriging, also known as Gaussian process regression, is a method of interpolation based on Gaussian process governed by prior covariances. Under suitable assumptions of the prior, kriging giv ...
predictors for the curve parameters (\theta_,\theta_,\theta_), (i=1,\cdots,N), that dictate the shape of the mean curve \mu(t;\theta_,\theta_,\theta_) on the date level (the first level). As the
kriging In statistics, originally in geostatistics, kriging or Kriging, also known as Gaussian process regression, is a method of interpolation based on Gaussian process governed by prior covariances. Under suitable assumptions of the prior, kriging giv ...
techniques have been employed in the latent level, this technique is called latent kriging. The right panels show the prediction results of the latent kriging method applied to the two test wells in the Eagle Ford Shale Reservoir of South Texas.


Bayesian nonlinear mixed-effects model

The framework of Bayesian hierarchical modeling is frequently used in diverse applications. Particularly, Bayesian nonlinear mixed-effects models have recently received significant attention. A basic version of the Bayesian nonlinear mixed-effects models is represented as the following three-stage: ''Stage 1: Individual-Level Model'' _ = f(t_;\theta_,\theta_,\ldots,\theta_,\ldots,\theta_ ) + \epsilon_,\quad \epsilon_ \sim N(0, \sigma^2), \quad i =1,\ldots, N, \, j = 1,\ldots, M_i. ''Stage 2: Population Model'' \theta_= \alpha_l + \sum_^\beta_x_ + \eta_, \quad \eta_ \sim N(0, \omega_l^2), \quad i =1,\ldots, N, \, l=1,\ldots, K. ''Stage 3: Prior'' \sigma^2 \sim \pi(\sigma^2),\quad \alpha_l \sim \pi(\alpha_l), \quad (\beta_,\ldots,\beta_,\ldots,\beta_) \sim \pi(\beta_,\ldots,\beta_,\ldots,\beta_), \quad \omega_l^2 \sim \pi(\omega_l^2), \quad l=1,\ldots, K. Here, y_ denotes the continuous response of the i-th subject at the time point t_, and x_ is the b-th covariate of the i-th subject. Parameters involved in the model are written in Greek letters. f(t ; \theta_,\ldots,\theta_) is a known function parameterized by the K-dimensional vector (\theta_,\ldots,\theta_). Typically, f is a `nonlinear' function and describes the temporal trajectory of individuals. In the model, \epsilon_ and \eta_ describe within-individual variability and between-individual variability, respectively. If ''Stage 3: Prior'' is not considered, then the model reduces to a frequentist nonlinear mixed-effect model. A central task in the application of the Bayesian nonlinear mixed-effect models is to evaluate the posterior density: \pi(\_^,\sigma^2, \_^K, \_^,\_^K , \_^) \propto \pi(\_^, \_^,\sigma^2, \_^K, \_^,\_^K) = \underbrace_ \times \underbrace_ \times \underbrace_ The panel on the right displays Bayesian research cycle using Bayesian nonlinear mixed-effects model. A research cycle using the Bayesian nonlinear mixed-effects model comprises two steps: (a) standard research cycle and (b) Bayesian-specific workflow. Standard research cycle involves literature review, defining a problem and specifying the research question and hypothesis. Bayesian-specific workflow comprises three sub-steps: (b)–(i) formalizing prior distributions based on background knowledge and prior elicitation; (b)–(ii) determining the likelihood function based on a nonlinear function f ; and (b)–(iii) making a posterior inference. The resulting posterior inference can be used to start a new research cycle.


See also

*
Mixed model A mixed model, mixed-effects model or mixed error-component model is a statistical model containing both fixed effects and random effects. These models are useful in a wide variety of disciplines in the physical, biological and social sciences. ...
*
Fixed effects model In statistics, a fixed effects model is a statistical model in which the model parameters are fixed or non-random quantities. This is in contrast to random effects models and mixed models in which all or some of the model parameters are random ...
*
Generalized linear mixed model In statistics, a generalized linear mixed model (GLMM) is an extension to the generalized linear model (GLM) in which the linear predictor contains random effects in addition to the usual fixed effects. They also inherit from GLMs the idea of exte ...
*
Linear regression In statistics, linear regression is a linear approach for modelling the relationship between a scalar response and one or more explanatory variables (also known as dependent and independent variables). The case of one explanatory variable is call ...
*
Mixed-design analysis of variance In statistics, a mixed-design analysis of variance model, also known as a split-plot ANOVA, is used to test for differences between two or more independent groups whilst subjecting participants to repeated measures. Thus, in a mixed-design ANOVA mo ...
*
Multilevel model Multilevel models (also known as hierarchical linear models, linear mixed-effect model, mixed models, nested data models, random coefficient, random-effects models, random parameter models, or split-plot designs) are statistical models of parame ...
*
Random effects model In statistics, a random effects model, also called a variance components model, is a statistical model where the model parameters are random variables. It is a kind of hierarchical linear model, which assumes that the data being analysed are dra ...
*
Repeated measures design Repeated measures design is a research design that involves multiple measures of the same variable taken on the same or matched subjects either under different conditions or over two or more time periods. For instance, repeated measurements are c ...


References

{{Reflist Regression models