In
mathematics
Mathematics is an area of knowledge that includes the topics of numbers, formulas and related structures, shapes and the spaces in which they are contained, and quantities and their changes. These topics are represented in modern mathematics ...
and
statistics
Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
, the quasi-arithmetic mean or generalised ''f''-mean or Kolmogorov-Nagumo-de Finetti mean is one generalisation of the more familiar
mean
There are several kinds of mean in mathematics, especially in statistics. Each mean serves to summarize a given group of data, often to better understand the overall value (magnitude and sign) of a given data set.
For a data set, the ''arithme ...
s such as the
arithmetic mean
In mathematics and statistics, the arithmetic mean ( ) or arithmetic average, or just the ''mean'' or the ''average'' (when the context is clear), is the sum of a collection of numbers divided by the count of numbers in the collection. The colle ...
and the
geometric mean
In mathematics, the geometric mean is a mean or average which indicates a central tendency of a set of numbers by using the product of their values (as opposed to the arithmetic mean which uses their sum). The geometric mean is defined as the ...
, using a function
. It is also called Kolmogorov mean after Soviet mathematician
Andrey Kolmogorov
Andrey Nikolaevich Kolmogorov ( rus, Андре́й Никола́евич Колмого́ров, p=ɐnˈdrʲej nʲɪkɐˈlajɪvʲɪtɕ kəlmɐˈɡorəf, a=Ru-Andrey Nikolaevich Kolmogorov.ogg, 25 April 1903 – 20 October 1987) was a Sovi ...
. It is a broader generalization than the regular
generalized mean
In mathematics, generalized means (or power mean or Hölder mean from Otto Hölder) are a family of functions for aggregating sets of numbers. These include as special cases the Pythagorean means (arithmetic, geometric, and harmonic means).
De ...
.
Definition
If ''f'' is a function which maps an interval
of the real line to the
real number
In mathematics, a real number is a number that can be used to measure a ''continuous'' one-dimensional quantity such as a distance, duration or temperature. Here, ''continuous'' means that values can have arbitrarily small variations. Every real ...
s, and is both
continuous
Continuity or continuous may refer to:
Mathematics
* Continuity (mathematics), the opposing concept to discreteness; common examples include
** Continuous probability distribution or random variable in probability and statistics
** Continuous ...
and
injective
In mathematics, an injective function (also known as injection, or one-to-one function) is a function that maps distinct elements of its domain to distinct elements; that is, implies . (Equivalently, implies in the equivalent contrapositiv ...
, the ''f''-mean of
numbers
is defined as
, which can also be written
:
We require ''f'' to be injective in order for the
inverse function
In mathematics, the inverse function of a function (also called the inverse of ) is a function that undoes the operation of . The inverse of exists if and only if is bijective, and if it exists, is denoted by f^ .
For a function f\colon X\t ...
to exist. Since
is defined over an interval,
lies within the domain of
.
Since ''f'' is injective and continuous, it follows that ''f'' is a strictly
monotonic function
In mathematics, a monotonic function (or monotone function) is a function between ordered sets that preserves or reverses the given order. This concept first arose in calculus, and was later generalized to the more abstract setting of order ...
, and therefore that the ''f''-mean is neither larger than the largest number of the tuple
nor smaller than the smallest number in
.
Examples
* If
= ℝ, the
real line
In elementary mathematics, a number line is a picture of a graduated straight line (geometry), line that serves as visual representation of the real numbers. Every point of a number line is assumed to correspond to a real number, and every real ...
, and
, (or indeed any linear function
,
not equal to 0) then the ''f''-mean corresponds to the
arithmetic mean
In mathematics and statistics, the arithmetic mean ( ) or arithmetic average, or just the ''mean'' or the ''average'' (when the context is clear), is the sum of a collection of numbers divided by the count of numbers in the collection. The colle ...
.
* If
= ℝ
+, the
positive real numbers
In mathematics, the set of positive real numbers, \R_ = \left\, is the subset of those real numbers that are greater than zero. The non-negative real numbers, \R_ = \left\, also include zero. Although the symbols \R_ and \R^ are ambiguously used fo ...
and
, then the ''f''-mean corresponds to the
geometric mean
In mathematics, the geometric mean is a mean or average which indicates a central tendency of a set of numbers by using the product of their values (as opposed to the arithmetic mean which uses their sum). The geometric mean is defined as the ...
. According to the ''f''-mean properties, the result does not depend on the base of the
logarithm
In mathematics, the logarithm is the inverse function to exponentiation. That means the logarithm of a number to the base is the exponent to which must be raised, to produce . For example, since , the ''logarithm base'' 10 o ...
as long as it is positive and not 1.
* If
= ℝ
+ and
, then the ''f''-mean corresponds to the
harmonic mean
In mathematics, the harmonic mean is one of several kinds of average, and in particular, one of the Pythagorean means. It is sometimes appropriate for situations when the average rate is desired.
The harmonic mean can be expressed as the recipro ...
.
* If
= ℝ
+ and
, then the ''f''-mean corresponds to the
power mean
Power most often refers to:
* Power (physics), meaning "rate of doing work"
** Engine power, the power put out by an engine
** Electric power
* Power (social and political), the ability to influence people or events
** Abusive power
Power may ...
with exponent
.
* If
= ℝ and
, then the ''f''-mean is the mean in the
log semiring
In mathematics, in the field of tropical analysis, the log semiring is the semiring structure on the logarithmic scale, obtained by considering the extended real numbers as logarithms. That is, the operations of addition and multiplication are defi ...
, which is a constant shifted version of the
LogSumExp
The LogSumExp (LSE) (also called RealSoftMax or multivariable softplus) function is a smooth maximum – a smooth approximation to the maximum function, mainly used by machine learning algorithms. It is defined as the logarithm of the sum of t ...
(LSE) function (which is the logarithmic sum),
. The
corresponds to dividing by , since logarithmic division is linear subtraction. The LogSumExp function is a
smooth maximum
In mathematics, a smooth maximum of an indexed family ''x''1, ..., ''x'n'' of numbers is a smooth approximation to the maximum function \max(x_1,\ldots,x_n), meaning a parametric family of functions m_\alpha(x_1,\ldots,x_n) such that ...
: a smooth approximation to the maximum function.
Properties
The following properties hold for
for any single function
:
Symmetry: The value of
is unchanged if its arguments are permuted.
Idempotency: for all ''x'',
.
Monotonicity:
is monotonic in each of its arguments (since
is
monotonic
In mathematics, a monotonic function (or monotone function) is a function between ordered sets that preserves or reverses the given order. This concept first arose in calculus, and was later generalized to the more abstract setting of order ...
).
Continuity:
is continuous in each of its arguments (since
is continuous).
Replacement: Subsets of elements can be averaged a priori, without altering the mean, given that the multiplicity of elements is maintained. With
it holds:
:
Partitioning: The computation of the mean can be split into computations of equal sized sub-blocks:
Self-distributivity: For any quasi-arithmetic mean
of two variables:
.
Mediality: For any quasi-arithmetic mean
of two variables:
.
Balancing: For any quasi-arithmetic mean
of two variables:
.
Central limit theorem
In probability theory, the central limit theorem (CLT) establishes that, in many situations, when independent random variables are summed up, their properly normalized sum tends toward a normal distribution even if the original variables themselv ...
: Under regularity conditions, for a sufficiently large sample,
is approximately normal.
A similar result is available for Bajraktarević means, which are generalizations of quasi-arithmetic means.
Scale-invariance: The quasi-arithmetic mean is invariant with respect to offsets and scaling of
:
.
Characterization
There are several different sets of properties that characterize the quasi-arithmetic mean (i.e., each function that satisfies these properties is an ''f''-mean for some function ''f'').
* Mediality is essentially sufficient to characterize quasi-arithmetic means.
* Self-distributivity is essentially sufficient to characterize quasi-arithmetic means.
* Replacement: Kolmogorov proved that the five properties of symmetry, fixed-point, monotonicity, continuity, and replacement fully characterize the quasi-arithmetic means.
* Balancing: An interesting problem is whether this condition (together with symmetry, fixed-point, monotonicity and continuity properties) implies that the mean is quasi-arithmetic.
Georg Aumann
Georg Aumann (11 November 1906, Munich, Germany – 4 August 1980), was a German mathematician. He was known for his work in general topology and regulated functions. During World War II, he worked as part of a group of five mathematicians, rec ...
showed in the 1930s that the answer is no in general, but that if one additionally assumes
to be an
analytic function
In mathematics, an analytic function is a function that is locally given by a convergent power series. There exist both real analytic functions and complex analytic functions. Functions of each type are infinitely differentiable, but complex an ...
then the answer is positive.
Homogeneity
Mean
There are several kinds of mean in mathematics, especially in statistics. Each mean serves to summarize a given group of data, often to better understand the overall value (magnitude and sign) of a given data set.
For a data set, the ''arithme ...
s are usually
homogeneous
Homogeneity and heterogeneity are concepts often used in the sciences and statistics relating to the uniformity of a substance or organism. A material or image that is homogeneous is uniform in composition or character (i.e. color, shape, siz ...
, but for most functions
, the ''f''-mean is not.
Indeed, the only homogeneous quasi-arithmetic means are the
power mean
Power most often refers to:
* Power (physics), meaning "rate of doing work"
** Engine power, the power put out by an engine
** Electric power
* Power (social and political), the ability to influence people or events
** Abusive power
Power may ...
s (including the
geometric mean
In mathematics, the geometric mean is a mean or average which indicates a central tendency of a set of numbers by using the product of their values (as opposed to the arithmetic mean which uses their sum). The geometric mean is defined as the ...
); see Hardy–Littlewood–Pólya, page 68.
The homogeneity property can be achieved by normalizing the input values by some (homogeneous) mean
.
:
However this modification may violate
monotonicity
In mathematics, a monotonic function (or monotone function) is a function between ordered sets that preserves or reverses the given order. This concept first arose in calculus, and was later generalized to the more abstract setting of orde ...
and the partitioning property of the mean.
See also
*
Generalized mean
In mathematics, generalized means (or power mean or Hölder mean from Otto Hölder) are a family of functions for aggregating sets of numbers. These include as special cases the Pythagorean means (arithmetic, geometric, and harmonic means).
De ...
*
Jensen's inequality
In mathematics, Jensen's inequality, named after the Danish mathematician Johan Jensen, relates the value of a convex function of an integral to the integral of the convex function. It was proved by Jensen in 1906, building on an earlier pr ...
References
* Andrey Kolmogorov (1930) “On the Notion of Mean”, in “Mathematics and Mechanics” (Kluwer 1991) — pp. 144–146.
* Andrey Kolmogorov (1930) Sur la notion de la moyenne. Atti Accad. Naz. Lincei 12, pp. 388–391.
* John Bibby (1974) “Axiomatisations of the average and a further generalisation of monotonic sequences,” Glasgow Mathematical Journal, vol. 15, pp. 63–65.
* Hardy, G. H.; Littlewood, J. E.; Pólya, G. (1952) Inequalities. 2nd ed. Cambridge Univ. Press, Cambridge, 1952.
* B. De Finetti
“Sul concetto di media” vol. 3, p. 36996, 1931, istituto italiano degli attuari.
{{DEFAULTSORT:Quasi-Arithmetic Mean
Means