In
probability theory
Probability theory or probability calculus is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expre ...
and
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, the covariance function describes how much two
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
s change together (their ''
covariance
In probability theory and statistics, covariance is a measure of the joint variability of two random variables.
The sign of the covariance, therefore, shows the tendency in the linear relationship between the variables. If greater values of one ...
'') with varying spatial or temporal separation. For a
random field
In physics and mathematics, a random field is a random function over an arbitrary domain (usually a multi-dimensional space such as \mathbb^n). That is, it is a function f(x) that takes on a random value at each point x \in \mathbb^n(or some other ...
or
stochastic process
In probability theory and related fields, a stochastic () or random process is a mathematical object usually defined as a family of random variables in a probability space, where the index of the family often has the interpretation of time. Sto ...
''Z''(''x'') on a domain ''D'', a covariance function ''C''(''x'', ''y'') gives the covariance of the values of the random field at the two locations ''x'' and ''y'':
The same ''C''(''x'', ''y'') is called the autocovariance function in two instances: in time series (to denote exactly the same concept except that ''x'' and ''y'' refer to locations in time rather than in space), and in multivariate random fields (to refer to the covariance of a variable with itself, as opposed to the
cross covariance between two different variables at different locations, Cov(''Z''(''x''
1), ''Y''(''x''
2))).
Admissibility
For locations ''x''
1, ''x''
2, ..., ''x''
''N'' ∈ ''D'' the variance of every linear combination
:
can be computed as
:
A function is a valid covariance function if and only if
this variance is non-negative for all possible choices of ''N'' and weights ''w''
1, ..., ''w''
''N''. A function with this property is called
positive semidefinite.
Simplifications with stationarity
In case of a weakly
stationary random field
In physics and mathematics, a random field is a random function over an arbitrary domain (usually a multi-dimensional space such as \mathbb^n). That is, it is a function f(x) that takes on a random value at each point x \in \mathbb^n(or some other ...
, where
:
for any lag ''h'', the covariance function can be represented by a one-parameter function
:
which is called a ''covariogram'' and also a ''covariance function''. Implicitly the ''C''(''x''
''i'', ''x''
''j'') can be computed from ''C''
''s''(''h'') by:
:
The
positive definiteness In mathematics, positive definiteness is a property of any object to which a bilinear form or a sesquilinear form may be naturally associated, which is positive-definite. See, in particular:
* Positive-definite bilinear form
* Positive-definite f ...
of this single-argument version of the covariance function can be checked by
Bochner's theorem
In mathematics, Bochner's theorem (named for Salomon Bochner) characterizes the Fourier transform of a positive finite Borel measure on the real line. More generally in harmonic analysis, Bochner's theorem asserts that under Fourier transform a c ...
.
Parametric families of covariance functions
For a given
variance
In probability theory and statistics, variance is the expected value of the squared deviation from the mean of a random variable. The standard deviation (SD) is obtained as the square root of the variance. Variance is a measure of dispersion ...
, a simple stationary parametric covariance function is the "exponential covariance function"
:
where ''V'' is a scaling parameter (correlation length), and ''d'' = ''d''(''x'',''y'') is the distance between two points. Sample paths of a
Gaussian process
In probability theory and statistics, a Gaussian process is a stochastic process (a collection of random variables indexed by time or space), such that every finite collection of those random variables has a multivariate normal distribution. The di ...
with the exponential covariance function are not smooth. The "squared exponential" (or "
Gaussian") covariance function:
:
is a stationary covariance function with smooth sample paths.
The
Matérn covariance function and
rational quadratic covariance function are two parametric families of stationary covariance functions. The Matérn family includes the exponential and squared exponential covariance functions as special cases.
See also
*
Autocorrelation function
Autocorrelation, sometimes known as serial correlation in the discrete time case, measures the correlation of a signal with a delayed copy of itself. Essentially, it quantifies the similarity between observations of a random variable at differe ...
*
Correlation function
*
Covariance matrix
In probability theory and statistics, a covariance matrix (also known as auto-covariance matrix, dispersion matrix, variance matrix, or variance–covariance matrix) is a square matrix giving the covariance between each pair of elements of ...
* {{annotated link, Covariance operator
*
Kriging
In statistics, originally in geostatistics, kriging or Kriging (), also known as Gaussian process regression, is a method of interpolation based on Gaussian process governed by prior covariances. Under suitable assumptions of the prior, kriging g ...
*
Positive-definite kernel
In operator theory, a branch of mathematics, a positive-definite kernel is a generalization of a positive-definite function or a positive-definite matrix. It was first introduced by James Mercer in the early 20th century, in the context of solvi ...
*
Random field
In physics and mathematics, a random field is a random function over an arbitrary domain (usually a multi-dimensional space such as \mathbb^n). That is, it is a function f(x) that takes on a random value at each point x \in \mathbb^n(or some other ...
*
Stochastic process
In probability theory and related fields, a stochastic () or random process is a mathematical object usually defined as a family of random variables in a probability space, where the index of the family often has the interpretation of time. Sto ...
*
Variogram
In spatial statistics the theoretical variogram, denoted 2\gamma(\mathbf_1,\mathbf_2), is a function describing the degree of spatial dependence of a spatial random field or stochastic process Z(\mathbf). The semivariogram \gamma(\mathbf_1,\ma ...
References
Geostatistics
Spatial analysis
Covariance and correlation