Directional statistics (also circular statistics or spherical statistics) is the subdiscipline of
statistics
Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
that deals with
directions (
unit vector
In mathematics, a unit vector in a normed vector space is a vector (often a spatial vector) of length 1. A unit vector is often denoted by a lowercase letter with a circumflex, or "hat", as in \hat (pronounced "v-hat").
The term ''direction vecto ...
s in
Euclidean space
Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, that is, in Euclid's Elements, Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics ther ...
, R
''n''),
axes
Axes, plural of ''axe'' and of ''axis'', may refer to
* ''Axes'' (album), a 2005 rock album by the British band Electrelane
* a possibly still empty plot (graphics)
See also
*Axess (disambiguation)
*Axxess (disambiguation) Axxess may refer to:
...
(
lines
Line most often refers to:
* Line (geometry), object with zero thickness and curvature that stretches to infinity
* Telephone line, a single-user circuit on a telephone communication system
Line, lines, The Line, or LINE may also refer to:
Arts ...
through the origin in R
''n'') or
rotation
Rotation, or spin, is the circular movement of an object around a '' central axis''. A two-dimensional rotating object has only one possible central axis and can rotate in either a clockwise or counterclockwise direction. A three-dimensional ...
s in R
''n''. More generally, directional statistics deals with observations on compact
Riemannian manifold
In differential geometry, a Riemannian manifold or Riemannian space , so called after the German mathematician Bernhard Riemann, is a real manifold, real, smooth manifold ''M'' equipped with a positive-definite Inner product space, inner product ...
s including the
Stiefel manifold.
The fact that 0
degrees and 360 degrees are identical
angle
In Euclidean geometry, an angle is the figure formed by two Ray (geometry), rays, called the ''Side (plane geometry), sides'' of the angle, sharing a common endpoint, called the ''vertex (geometry), vertex'' of the angle.
Angles formed by two ...
s, so that for example 180 degrees is not a sensible
mean
There are several kinds of mean in mathematics, especially in statistics. Each mean serves to summarize a given group of data, often to better understand the overall value (magnitude and sign) of a given data set.
For a data set, the ''arithme ...
of 2 degrees and 358 degrees, provides one illustration that special statistical methods are required for the analysis of some types of data (in this case, angular data). Other examples of data that may be regarded as directional include statistics involving temporal periods (e.g. time of day, week, month, year, etc.), compass directions,
dihedral angle
A dihedral angle is the angle between two intersecting planes or half-planes. In chemistry, it is the clockwise angle between half-planes through two sets of three atoms, having two atoms in common. In solid geometry, it is defined as the uni ...
s in molecules, orientations, rotations and so on.
Circular distributions
Any
probability density function
In probability theory, a probability density function (PDF), or density of a continuous random variable, is a function whose value at any given sample (or point) in the sample space (the set of possible values taken by the random variable) can ...
(pdf)
on the line can be
"wrapped" around the circumference of a circle of unit radius. That is, the pdf of the wrapped variable
is
This concept can be extended to the multivariate context by an extension of the simple sum to a number of
sums that cover all dimensions in the feature space:
where
is the
-th Euclidean basis vector.
The following sections show some relevant circular distributions.
von Mises circular distribution
The ''von Mises distribution'' is a circular distribution which, like any other circular distribution, may be thought of as a wrapping of a certain linear probability distribution around the circle. The underlying linear probability distribution for the von Mises distribution is mathematically intractable; however, for statistical purposes, there is no need to deal with the underlying linear distribution. The usefulness of the von Mises distribution is twofold: it is the most mathematically tractable of all circular distributions, allowing simpler statistical analysis, and it is a close approximation to the
wrapped normal distribution, which, analogously to the linear normal distribution, is important because it is the limiting case for the sum of a large number of small angular deviations. In fact, the von Mises distribution is often known as the "circular normal" distribution because of its ease of use and its close relationship to the wrapped normal distribution (Fisher, 1993).
The pdf of the von Mises distribution is:
where
is the modified
Bessel function
Bessel functions, first defined by the mathematician Daniel Bernoulli and then generalized by Friedrich Bessel, are canonical solutions of Bessel's differential equation
x^2 \frac + x \frac + \left(x^2 - \alpha^2 \right)y = 0
for an arbitrary ...
of order 0.
Circular uniform distribution
The probability density function (pdf) of the ''circular uniform distribution'' is given by
It can also be thought of as
of the von Mises above.
Wrapped normal distribution
The pdf of the ''wrapped normal distribution'' (WN) is:
where μ and σ are the mean and standard deviation of the unwrapped distribution, respectively and
is the
Jacobi theta function
In mathematics, theta functions are special functions of several complex variables. They show up in many topics, including Abelian varieties, moduli spaces, quadratic forms, and solitons. As Grassmann algebras, they appear in quantum field theo ...
:
where
and
Wrapped Cauchy distribution
The pdf of the ''wrapped Cauchy distribution'' (WC) is:
where
is the scale factor and
is the peak position.
Wrapped Lévy distribution
The pdf of the ''wrapped Lévy distribution'' (WL) is:
where the value of the summand is taken to be zero when
,
is the scale factor and
is the location parameter.
Distributions on higher-dimensional manifolds
There also exist distributions on the
two-dimensional sphere
A sphere () is a geometrical object that is a three-dimensional analogue to a two-dimensional circle. A sphere is the set of points that are all at the same distance from a given point in three-dimensional space.. That given point is the ...
(such as the
Kent distribution
In directional statistics, the Kent distribution, also known as the 5-parameter Fisher–Bingham distribution (named after John T. Kent, Ronald Fisher, and Christopher Bingham), is a probability distribution on the unit sphere (2-sphere ''S''2 in ...
), the
''N''-dimensional sphere (the
von Mises–Fisher distribution
In directional statistics, the von Mises–Fisher distribution (named after Richard von Mises and Ronald Fisher), is a probability distribution on the (p-1)-sphere in \mathbb^. If p=2
the distribution reduces to the von Mises distribution on the ci ...
) or the
torus
In geometry, a torus (plural tori, colloquially donut or doughnut) is a surface of revolution generated by revolving a circle in three-dimensional space about an axis that is coplanar with the circle.
If the axis of revolution does not tou ...
(the
bivariate von Mises distribution
In probability theory and statistics, the bivariate von Mises distribution is a probability distribution describing values on a torus. It may be thought of as an analogue on the torus of the bivariate normal distribution. The distribution belongs ...
).
The
matrix von Mises–Fisher distribution
Matrix most commonly refers to:
* ''The Matrix'' (franchise), an American media franchise
** ''The Matrix'', a 1999 science-fiction action film
** "The Matrix", a fictional setting, a virtual reality environment, within ''The Matrix'' (franchis ...
is a distribution on the
Stiefel manifold, and can be used to construct probability distributions over
rotation matrices.
The
Bingham distribution In statistics, the Bingham distribution, named after Christopher Bingham, is an antipodally symmetric probability distribution on the ''n''-sphere. It is a generalization of the Watson distribution and a special case of the Kent and Fisher-Bing ...
is a distribution over axes in ''N'' dimensions, or equivalently, over points on the (''N'' − 1)-dimensional sphere with the antipodes identified. For example, if ''N'' = 2, the axes are undirected lines through the origin in the plane. In this case, each axis cuts the unit circle in the plane (which is the one-dimensional sphere) at two points that are each other's antipodes. For ''N'' = 4, the Bingham distribution is a distribution over the space of unit
quaternions
In mathematics, the quaternion number system extends the complex numbers. Quaternions were first described by the Irish mathematician William Rowan Hamilton in 1843 and applied to mechanics in three-dimensional space. Hamilton defined a quatern ...
(
versor
In mathematics, a versor is a quaternion of norm one (a ''unit quaternion''). The word is derived from Latin ''versare'' = "to turn" with the suffix ''-or'' forming a noun from the verb (i.e. ''versor'' = "the turner"). It was introduced by Will ...
s). Since a versor corresponds to a rotation matrix, the Bingham distribution for ''N'' = 4 can be used to construct probability distributions over the space of rotations, just like the Matrix-von Mises–Fisher distribution.
These distributions are for example used in
geology
Geology () is a branch of natural science concerned with Earth and other astronomical objects, the features or rocks of which it is composed, and the processes by which they change over time. Modern geology significantly overlaps all other Ear ...
,
crystallography
Crystallography is the experimental science of determining the arrangement of atoms in crystalline solids. Crystallography is a fundamental subject in the fields of materials science and solid-state physics (condensed matter physics). The wor ...
and
bioinformatics
Bioinformatics () is an interdisciplinary field that develops methods and software tools for understanding biological data, in particular when the data sets are large and complex. As an interdisciplinary field of science, bioinformatics combi ...
.
Moments
The raw vector (or trigonometric) moments of a circular distribution are defined as
:
where
is any interval of length
,
is the PDF of the circular distribution, and
. Since the integral
is unity, and the integration interval is finite, it follows that the moments of any circular distribution are always finite and well defined.
Sample moments are analogously defined:
:
The population resultant vector, length, and mean angle are defined in analogy with the corresponding sample parameters.
:
:
:
In addition, the lengths of the higher moments are defined as:
:
while the angular parts of the higher moments are just
. The lengths of all moments will lie between 0 and 1.
Measures of location and spread
Various measures of
central tendency
In statistics, a central tendency (or measure of central tendency) is a central or typical value for a probability distribution.Weisberg H.F (1992) ''Central Tendency and Variability'', Sage University Paper Series on Quantitative Applications ...
and
statistical dispersion
In statistics, dispersion (also called variability, scatter, or spread) is the extent to which a Probability distribution, distribution is stretched or squeezed. Common examples of measures of statistical dispersion are the variance, standard de ...
may be defined for both the population and a sample drawn from that population.
[Fisher, NI., ''Statistical Analysis of Circular Data'', Cambridge University Press, 1993. ]
Central tendency
The most common measure of location is the circular mean. The population circular mean is simply the first moment of the distribution while the sample mean is the first moment of the sample. The sample mean will serve as an unbiased estimator of the population mean.
When data is concentrated, the median and mode may be defined by analogy to the linear case, but for more dispersed or multi-modal data, these concepts are not useful.
Dispersion
The most common measures of circular spread are:
* The . For the sample the circular variance is defined as:
and for the population
Both will have values between 0 and 1.
* The
with values between 0 and infinity. This definition of the standard deviation (rather than the square root of the variance) is useful because for a wrapped normal distribution, it is an estimator of the standard deviation of the underlying normal distribution. It will therefore allow the circular distribution to be standardized as in the linear case, for small values of the standard deviation. This also applies to the von Mises distribution which closely approximates the wrapped normal distribution. Note that for small
, we have
.
* The
with values between 0 and infinity. This measure of spread is found useful in the statistical analysis of variance.
Distribution of the mean
Given a set of ''N'' measurements
the mean value of ''z'' is defined as:
:
which may be expressed as
:
where
:
or, alternatively as:
:
where
:
The distribution of the mean angle (
) for a circular pdf ''P''(''θ'') will be given by:
:
where
is over any interval of length
and the integral is subject to the constraint that
and
are constant, or, alternatively, that
and
are constant.
The calculation of the distribution of the mean for most circular distributions is not analytically possible, and in order to carry out an analysis of variance, numerical or mathematical approximations are needed.
The
central limit theorem
In probability theory, the central limit theorem (CLT) establishes that, in many situations, when independent random variables are summed up, their properly normalized sum tends toward a normal distribution even if the original variables themselv ...
may be applied to the distribution of the sample means. (main article:
Central limit theorem for directional statistics In probability theory, the central limit theorem states conditions under which the average of a sufficiently large number of independent random variables, each with finite mean and variance, will be approximately normally distributed.
Directiona ...
). It can be shown
that the distribution of