In
probability
Probability is the branch of mathematics concerning numerical descriptions of how likely an Event (probability theory), event is to occur, or how likely it is that a proposition is true. The probability of an event is a number between 0 and ...
and
statistics, an elliptical distribution is any member of a broad family of
probability distribution
In probability theory and statistics, a probability distribution is the mathematical function that gives the probabilities of occurrence of different possible outcomes for an experiment. It is a mathematical description of a random phenomeno ...
s that generalize the
multivariate normal distribution
In probability theory and statistics, the multivariate normal distribution, multivariate Gaussian distribution, or joint normal distribution is a generalization of the one-dimensional ( univariate) normal distribution to higher dimensions. One ...
. Intuitively, in the simplified two and three dimensional case, the joint distribution forms an
ellipse and an
ellipsoid
An ellipsoid is a surface that may be obtained from a sphere by deforming it by means of directional scalings, or more generally, of an affine transformation.
An ellipsoid is a quadric surface; that is, a surface that may be defined as the ...
, respectively, in iso-density plots.
In
statistics, the normal distribution is used in ''classical''
multivariate analysis
Multivariate statistics is a subdivision of statistics encompassing the simultaneous observation and analysis of more than one outcome variable.
Multivariate statistics concerns understanding the different aims and background of each of the dif ...
, while elliptical distributions are used in ''generalized'' multivariate analysis, for the study of symmetric distributions with tails that are
heavy
Heavy may refer to:
Measures
* Heavy (aeronautics), a term used by pilots and air traffic controllers to refer to aircraft capable of 300,000 lbs or more takeoff weight
* Heavy, a characterization of objects with substantial weight
* Heavy, ...
, like the
multivariate t-distribution
In statistics, the multivariate ''t''-distribution (or multivariate Student distribution) is a multivariate probability distribution. It is a generalization to random vectors of the Student's ''t''-distribution, which is a distribution applic ...
, or light (in comparison with the normal distribution). Some statistical methods that were originally motivated by the study of the normal distribution have good performance for general elliptical distributions (with finite variance), particularly for spherical distributions (which are defined below). Elliptical distributions are also used in
robust statistics
Robust statistics are statistics with good performance for data drawn from a wide range of probability distributions, especially for distributions that are not normal. Robust statistical methods have been developed for many common problems, suc ...
to evaluate proposed multivariate-statistical procedures.
Definition
Elliptical distributions are defined in terms of the
characteristic function In mathematics, the term "characteristic function" can refer to any of several distinct concepts:
* The indicator function of a subset, that is the function
::\mathbf_A\colon X \to \,
:which for a given subset ''A'' of ''X'', has value 1 at point ...
of probability theory. A random vector
on a
Euclidean space
Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, that is, in Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics there are Euclidean sp ...
has an ''elliptical distribution'' if its characteristic function
satisfies the following
functional equation
In mathematics, a functional equation
is, in the broadest meaning, an equation in which one or several functions appear as unknowns. So, differential equations and integral equations are functional equations. However, a more restricted mea ...
(for every column-vector
)
:
for some
location parameter
In geography, location or place are used to denote a region (point, line, or area) on Earth's surface or elsewhere. The term ''location'' generally implies a higher degree of certainty than ''place'', the latter often indicating an entity with an ...
, some
nonnegative-definite matrix
In mathematics, a symmetric matrix M with real entries is positive-definite if the real number z^\textsfMz is positive for every nonzero real column vector z, where z^\textsf is the transpose of More generally, a Hermitian matrix (that is, ...
and some scalar function
.
The definition of elliptical distributions for ''real'' random-vectors has been extended to accommodate random vectors in Euclidean spaces over the
field of
complex number
In mathematics, a complex number is an element of a number system that extends the real numbers with a specific element denoted , called the imaginary unit and satisfying the equation i^= -1; every complex number can be expressed in the for ...
s, so facilitating applications in
time-series analysis. Computational methods are available for generating
pseudo-random vectors from elliptical distributions, for use in
Monte Carlo
Monte Carlo (; ; french: Monte-Carlo , or colloquially ''Monte-Carl'' ; lij, Munte Carlu ; ) is officially an administrative area of the Principality of Monaco, specifically the ward of Monte Carlo/Spélugues, where the Monte Carlo Casino i ...
simulation
A simulation is the imitation of the operation of a real-world process or system over time. Simulations require the use of models; the model represents the key characteristics or behaviors of the selected system or process, whereas the ...
s for example.
Some elliptical distributions are alternatively defined in terms of their
density function
In probability theory, a probability density function (PDF), or density of a continuous random variable, is a function whose value at any given sample (or point) in the sample space (the set of possible values taken by the random variable) can ...
s. An elliptical distribution with a density function ''f'' has the form:
:
where
is the
normalizing constant
The concept of a normalizing constant arises in probability theory and a variety of other areas of mathematics. The normalizing constant is used to reduce any probability function to a probability density function with total probability of one.
...
,
is an
-dimensional
random vector
In probability, and statistics, a multivariate random variable or random vector is a list of mathematical variables each of whose value is unknown, either because the value has not yet occurred or because there is imperfect knowledge of its valu ...
with
median vector (which is also the mean vector if the latter exists), and
is a
positive definite matrix
In mathematics, a symmetric matrix M with real entries is positive-definite if the real number z^\textsfMz is positive for every nonzero real column vector z, where z^\textsf is the transpose of More generally, a Hermitian matrix (that is, a ...
which is proportional to the
covariance matrix
In probability theory and statistics, a covariance matrix (also known as auto-covariance matrix, dispersion matrix, variance matrix, or variance–covariance matrix) is a square matrix giving the covariance between each pair of elements o ...
if the latter exists.
Examples
Examples include the following multivariate probability distributions:
*
Multivariate normal distribution
In probability theory and statistics, the multivariate normal distribution, multivariate Gaussian distribution, or joint normal distribution is a generalization of the one-dimensional ( univariate) normal distribution to higher dimensions. One ...
*
Multivariate ''t''-distribution
*
Symmetric multivariate stable distribution
*
Symmetric multivariate Laplace distribution
*
Multivariate logistic distribution
* Multivariate symmetric general
hyperbolic distribution[
]
Properties
In the 2-dimensional case, if the density exists, each iso-density locus (the set of ''x''1,''x''2 pairs all giving a particular value of ) is an ellipse or a union of ellipses (hence the name elliptical distribution). More generally, for arbitrary ''n'', the iso-density loci are unions of ellipsoid
An ellipsoid is a surface that may be obtained from a sphere by deforming it by means of directional scalings, or more generally, of an affine transformation.
An ellipsoid is a quadric surface; that is, a surface that may be defined as the ...
s. All these ellipsoids or ellipses have the common center μ and are scaled copies (homothets) of each other.
The multivariate normal distribution
In probability theory and statistics, the multivariate normal distribution, multivariate Gaussian distribution, or joint normal distribution is a generalization of the one-dimensional ( univariate) normal distribution to higher dimensions. One ...
is the special case in which . While the multivariate normal is unbounded (each element of can take on arbitrarily large positive or negative values with non-zero probability, because for all non-negative ), in general elliptical distributions can be bounded or unbounded—such a distribution is bounded if for all greater than some value.
There exist elliptical distributions that have undefined mean
There are several kinds of mean in mathematics, especially in statistics. Each mean serves to summarize a given group of data, often to better understand the overall value ( magnitude and sign) of a given data set.
For a data set, the '' ari ...
, such as the Cauchy distribution
The Cauchy distribution, named after Augustin Cauchy, is a continuous probability distribution. It is also known, especially among physicists, as the Lorentz distribution (after Hendrik Lorentz), Cauchy–Lorentz distribution, Lorentz(ian) fu ...
(even in the univariate case). Because the variable ''x'' enters the density function quadratically, all elliptical distributions are symmetric
Symmetry (from grc, συμμετρία "agreement in dimensions, due proportion, arrangement") in everyday language refers to a sense of harmonious and beautiful proportion and balance. In mathematics, "symmetry" has a more precise definit ...
about
If two subsets of a jointly elliptical random vector are uncorrelated
In probability theory and statistics, two real-valued random variables, X, Y, are said to be uncorrelated if their covariance, \operatorname ,Y= \operatorname Y- \operatorname \operatorname /math>, is zero. If two variables are uncorrelated, there ...
, then if their means exist they are mean independent In probability theory, a random variable Y is said to be mean independent of random variable X if and only if its conditional mean E(Y , X = x) equals its (unconditional) mean E(Y) for all x such that the probability density/mass of X at x, f_X ...
of each other (the mean of each subvector conditional on the value of the other subvector equals the unconditional mean).
If random vector ''X'' is elliptically distributed, then so is ''DX'' for any matrix ''D'' with full row rank. Thus any linear combination of the components of ''X'' is elliptical (though not necessarily with the same elliptical distribution), and any subset of ''X'' is elliptical.
Applications
Elliptical distributions are used in statistics and in economics.
In mathematical economics, elliptical distributions have been used to describe portfolio
Portfolio may refer to:
Objects
* Portfolio (briefcase), a type of briefcase
Collections
* Portfolio (finance), a collection of assets held by an institution or a private individual
* Artist's portfolio, a sample of an artist's work or a ...
s in mathematical finance
Mathematical finance, also known as quantitative finance and financial mathematics, is a field of applied mathematics, concerned with mathematical modeling of financial markets.
In general, there exist two separate branches of finance that requir ...
.
Statistics: Generalized multivariate analysis
In statistics, the multivariate ''normal'' distribution (of Gauss) is used in ''classical''