In
probability theory and
statistics
Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
, a copula is a multivariate
cumulative distribution function
In probability theory and statistics, the cumulative distribution function (CDF) of a real-valued random variable X, or just distribution function of X, evaluated at x, is the probability that X will take a value less than or equal to x.
Ev ...
for which the
marginal probability
In probability theory and statistics, the marginal distribution of a subset of a collection of random variables is the probability distribution of the variables contained in the subset. It gives the probabilities of various values of the variables ...
distribution of each variable is
uniform on the interval
, 1 Copulas are used to describe/model the
dependence (inter-correlation) between
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...
s. Their name, introduced by applied mathematician
Abe Sklar in 1959, comes from the Latin for "link" or "tie", similar but unrelated to grammatical
copulas in
linguistics. Copulas have been used widely in
quantitative finance
Mathematical finance, also known as quantitative finance and financial mathematics, is a field of applied mathematics, concerned with mathematical modeling of financial markets.
In general, there exist two separate branches of finance that require ...
to model and minimize tail risk
and
portfolio-optimization applications.
Sklar's theorem states that any multivariate
joint distribution can be written in terms of univariate
marginal distribution functions and a copula which describes the dependence structure between the variables.
Copulas are popular in high-dimensional statistical applications as they allow one to easily model and estimate the distribution of random vectors by estimating marginals and copulae separately. There are many parametric copula families available, which usually have parameters that control the strength of dependence. Some popular parametric copula models are outlined below.
Two-dimensional copulas are known in some other areas of mathematics under the name ''permutons'' and ''doubly-stochastic measures''.
Mathematical definition
Consider a random vector
. Suppose its marginals are continuous, i.e. the marginal
CDFs
CDfs is a virtual file system for Unix-like operating systems; it provides access to data and audio tracks on Compact Discs. When the CDfs driver mounts a Compact Disc, it represents each track as a file. This is consistent with the Unix conve ...
are
continuous function
In mathematics, a continuous function is a function such that a continuous variation (that is a change without jump) of the argument induces a continuous variation of the value of the function. This means that there are no abrupt changes in value ...
s. By applying the
probability integral transform to each component, the random vector
:
has marginals that are
uniformly distributed on the interval
, 1
The copula of
is defined as the
joint cumulative distribution function of
:
:
The copula ''C'' contains all information on the dependence structure between the components of
whereas the marginal cumulative distribution functions
contain all information on the marginal distributions of
.
The reverse of these steps can be used to generate
pseudo-random samples from general classes of
multivariate probability distributions. That is, given a procedure to generate a sample
from the copula function, the required sample can be constructed as
:
The inverses
are unproblematic
almost surely, since the
were assumed to be continuous. Furthermore, the above formula for the copula function can be rewritten as:
:
Definition
In
probabilistic
Probability is the branch of mathematics concerning numerical descriptions of how likely an Event (probability theory), event is to occur, or how likely it is that a proposition is true. The probability of an event is a number between 0 and ...
terms,