HOME

TheInfoList



OR:

In
probability theory Probability theory is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expressing it through a set o ...
, an exponentially modified Gaussian distribution (EMG, also known as exGaussian distribution) describes the sum of independent
normal Normal(s) or The Normal(s) may refer to: Film and television * ''Normal'' (2003 film), starring Jessica Lange and Tom Wilkinson * ''Normal'' (2007 film), starring Carrie-Anne Moss, Kevin Zegers, Callum Keith Rennie, and Andrew Airlie * ''Norma ...
and
exponential Exponential may refer to any of several mathematical topics related to exponentiation, including: *Exponential function, also: **Matrix exponential, the matrix analogue to the above * Exponential decay, decrease at a rate proportional to value *Exp ...
random variables. An exGaussian random variable ''Z'' may be expressed as , where ''X'' and ''Y'' are independent, ''X'' is Gaussian with mean ''μ'' and variance ''σ''2, and ''Y'' is exponential of rate ''λ''. It has a characteristic positive skew from the exponential component. It may also be regarded as a weighted function of a shifted exponential with the weight being a function of the normal distribution.


Definition

The
probability density function In probability theory, a probability density function (PDF), or density of a continuous random variable, is a function whose value at any given sample (or point) in the sample space (the set of possible values taken by the random variable) can ...
(pdf) of the exponentially modified
normal distribution In statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is : f(x) = \frac e^ The parameter \mu ...
is :f(x;\mu,\sigma,\lambda) = \frac e^ \operatorname \left(\frac\right), where erfc is the
complementary error function In mathematics, the error function (also called the Gauss error function), often denoted by , is a complex function of a complex variable defined as: :\operatorname z = \frac\int_0^z e^\,\mathrm dt. This integral is a special (non-elementary ...
defined as :\begin \operatorname(x) & = 1-\operatorname(x) \\ & = \frac \int_x^\infty e^\,dt. \end This density function is derived via
convolution In mathematics (in particular, functional analysis), convolution is a operation (mathematics), mathematical operation on two function (mathematics), functions ( and ) that produces a third function (f*g) that expresses how the shape of one is ...
of the normal and
exponential Exponential may refer to any of several mathematical topics related to exponentiation, including: *Exponential function, also: **Matrix exponential, the matrix analogue to the above * Exponential decay, decrease at a rate proportional to value *Exp ...
probability density functions.


Alternative forms for computation

An alternative but equivalent form of the EMG distribution is used for description of peak shape in
chromatography In chemical analysis, chromatography is a laboratory technique for the separation of a mixture into its components. The mixture is dissolved in a fluid solvent (gas or liquid) called the ''mobile phase'', which carries it through a system (a ...
. This is as follows where :h is the amplitude of Gaussian, :\tau=\frac is exponent relaxation time, ''\tau^2'' is a variance of
exponential Exponential may refer to any of several mathematical topics related to exponentiation, including: *Exponential function, also: **Matrix exponential, the matrix analogue to the above * Exponential decay, decrease at a rate proportional to value *Exp ...
probability density function. This function cannot be calculated for some values of parameters (for example, \tau=0) because of arithmetic overflow. Alternative, but equivalent form of writing the function was proposed by Delley: where \operatorname t = \exp t^2 \cdot \operatorname t is a scaled complementary error function In the case of this formula arithmetic overflow is also possible, region of overflow is different from the first formula, except for very small τ. For small τ it is reasonable to use asymptotic form of the second formula: Decision on formula usage is made on the basis of the parameter z = \frac\left(\frac - \frac\right): :for ''z'' < 0 computation should be made according to the first formula, :for 0 ≤ ''z'' ≤ 6.71·107 (in the case of
double-precision floating-point format Double-precision floating-point format (sometimes called FP64 or float64) is a floating-point number format, usually occupying 64 bits in computer memory; it represents a wide dynamic range of numeric values by using a floating radix point. Fl ...
) according to the second formula, :and for ''z'' > 6.71·107 according to the third formula.
Mode Mode ( la, modus meaning "manner, tune, measure, due measure, rhythm, melody") may refer to: Arts and entertainment * '' MO''D''E (magazine)'', a defunct U.S. women's fashion magazine * ''Mode'' magazine, a fictional fashion magazine which is ...
(position of apex, most probable value) is calculated using derivative of formula 2; the inverse of scaled complementary error function erfcxinv() is used for calculation. Approximate values are also proposed by Kalembet. Though the mode is at a value higher than that of the original Gaussian, the apex is always located on the original (unmodified) Gaussian.


Parameter estimation

There are three parameters: the
mean There are several kinds of mean in mathematics, especially in statistics. Each mean serves to summarize a given group of data, often to better understand the overall value (magnitude and sign) of a given data set. For a data set, the ''arithme ...
of the normal distribution (''μ''), the
standard deviation In statistics, the standard deviation is a measure of the amount of variation or dispersion of a set of values. A low standard deviation indicates that the values tend to be close to the mean (also called the expected value) of the set, while ...
of the normal distribution (''σ'') and the
exponential decay A quantity is subject to exponential decay if it decreases at a rate proportional to its current value. Symbolically, this process can be expressed by the following differential equation, where is the quantity and (lambda) is a positive rate ...
parameter (''τ'' = 1 / ''λ''). The shape ''K'' = ''τ'' / ''σ'' is also sometimes used to characterise the distribution. Depending on the values of the parameters, the distribution may vary in shape from almost normal to almost exponential. The parameters of the distribution can be estimated from the sample data with the method of moments as follows:Olivier J. and Norberg M. M. (2010) Positively skewed data: Revisiting the Box−Cox power transformation. Int. J. Psych. Res. 3 (1) 68−75. : m = \mu + \tau, : s^2 = \sigma^2 + \tau^2, : \gamma_1 = \frac, where ''m'' is the sample mean, ''s'' is the sample standard deviation, and ''γ''1 is the
skewness In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined. For a unimodal d ...
. Solving these for the parameters gives: : \hat\mu = m - s \left( \frac \right)^, : \hat = s^2 \left 1 - \left( \frac \right)^ \right : \hat\tau = s \left( \frac \right)^.


Recommendations

Ratcliff has suggested that there be at least 100 data points in the sample before the parameter estimates should be regarded as reliable. Vincent averaging may be used with smaller samples, as this procedure only modestly distorts the shape of the distribution. These point estimates may be used as initial values that can be refined with more powerful methods, including a least-squares optimization, which has shown to work for the Multimodal Exponentially Modified Gaussian (MEMG) case. A code implementation with analytical MEMG derivatives and an optional oscillation term for sound processing is released as part of an open-source project.


Confidence intervals

There are currently no published tables available for significance testing with this distribution. The distribution can be simulated by forming the sum of two random variables one drawn from a normal distribution and the other from an exponential.


Skew

The value of the
nonparametric skew In statistics and probability theory, the nonparametric skew is a statistic occasionally used with random variables that take real values.Arnold BC, Groeneveld RA (1995) Measuring skewness with respect to the mode. The American Statistician 49 ( ...
: \frac of this distribution lies between 0 and 0.31. The lower limit is approached when the normal component dominates, and the upper when the exponential component dominates.


Occurrence

The distribution is used as a theoretical model for the shape of
chromatographic In chemical analysis, chromatography is a laboratory technique for the separation of a mixture into its components. The mixture is dissolved in a fluid solvent (gas or liquid) called the ''mobile phase'', which carries it through a system (a ...
peaks. It has been proposed as a statistical model of intermitotic time in dividing cells. It is also used in modelling cluster ion beams. It is commonly used in psychology and other brain sciences in the study of response times. In a slight variant where the mean of the Normal component is set to zero, it is also used in
Stochastic Frontier Analysis Stochastic frontier analysis (SFA) is a method of economic modeling. It has its starting point in the stochastic production frontier models simultaneously introduced by Aigner, Lovell and Schmidt (1977) and Meeusen and Van den Broeck (1977). The ...
, as one of the distributional specifications for the composed error term that models inefficiency. In signal processing, EMGs have been extended to the multimodal case with an optional oscillation term to represent digitized sound signals.


Related distributions

This family of distributions is a special or limiting case of the
normal-exponential-gamma distribution In probability theory and statistics, the normal-exponential-gamma distribution (sometimes called the NEG distribution) is a three-parameter family of continuous probability distributions. It has a location parameter \mu, scale parameter \theta a ...
. This can also be seen as a three-parameter generalization of a normal distribution to add skew; another distribution like that is the
skew normal distribution In probability theory and statistics, the skew normal distribution is a continuous probability distribution that generalises the normal distribution In statistics, a normal distribution or Gaussian distribution is a type of continuous proba ...
, which has thinner tails. The distribution is a
compound probability distribution In probability and statistics, a compound probability distribution (also known as a mixture distribution or contagious distribution) is the probability distribution that results from assuming that a random variable is distributed according to some p ...
in which the mean of a
normal distribution In statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is : f(x) = \frac e^ The parameter \mu ...
varies randomly as a shifted
exponential distribution In probability theory and statistics, the exponential distribution is the probability distribution of the time between events in a Poisson point process, i.e., a process in which events occur continuously and independently at a constant average ...
. A ''Gaussian minus exponential'' distribution has been suggested for modelling option prices.Peter Carr and Dilip B. Madan, Saddlepoint Methods for Option Pricing, The Journal of Computational Finance (49–61) Volume 13/Number 1, Fall 2009 If such a random variable ''Y'' has parameters ''μ'', ''σ'', ''λ'', then its negative ''-Y'' has an exponentially modified Gaussian distribution with parameters ''-μ'', ''σ'', ''λ'', and thus ''Y'' has mean \mu - \tfrac and variance \sigma^2 + \tfrac.


References

{{ProbDistributions, continuous Continuous distributions Compound probability distributions