Geometric mean
   HOME

TheInfoList



OR:

In mathematics, the geometric mean is a
mean There are several kinds of mean in mathematics, especially in statistics. Each mean serves to summarize a given group of data, often to better understand the overall value ( magnitude and sign) of a given data set. For a data set, the '' ar ...
or
average In ordinary language, an average is a single number taken as representative of a list of numbers, usually the sum of the numbers divided by how many numbers are in the list (the arithmetic mean). For example, the average of the numbers 2, 3, 4, 7 ...
which indicates a central tendency of a set of numbers by using the product of their values (as opposed to the
arithmetic mean In mathematics and statistics, the arithmetic mean ( ) or arithmetic average, or just the '' mean'' or the ''average'' (when the context is clear), is the sum of a collection of numbers divided by the count of numbers in the collection. The co ...
which uses their sum). The geometric mean is defined as the th root of the product of numbers, i.e., for a set of numbers , the geometric mean is defined as :\left(\prod_^n a_i\right)^\frac = \sqrt /math> or, equivalently, as the
arithmetic mean In mathematics and statistics, the arithmetic mean ( ) or arithmetic average, or just the '' mean'' or the ''average'' (when the context is clear), is the sum of a collection of numbers divided by the count of numbers in the collection. The co ...
in logscale: :\exp For instance, the geometric mean of two numbers, say 2 and 8, is just the
square root In mathematics, a square root of a number is a number such that ; in other words, a number whose '' square'' (the result of multiplying the number by itself, or  ⋅ ) is . For example, 4 and −4 are square roots of 16, because . ...
of their product, that is, \sqrt = 4. As another example, the geometric mean of the three numbers 4, 1, and 1/32 is the cube root of their product (1/8), which is 1/2, that is, \sqrt = 1/2. The geometric mean applies only to positive numbers. The geometric mean is often used for a set of numbers whose values are meant to be multiplied together or are exponential in nature, such as a set of growth figures: values of the human population or interest rates of a financial investment over time. It also applies to benchmarking, where it is particularly useful for computing means of speedup ratios: since the mean of 0.5x (half as fast) and 2x (twice as fast) will be 1 (i.e., no speedup overall). The geometric mean can be understood in terms of
geometry Geometry (; ) is, with arithmetic, one of the oldest branches of mathematics. It is concerned with properties of space such as the distance, shape, size, and relative position of figures. A mathematician who works in the field of geometry is c ...
. The geometric mean of two numbers, a and b, is the length of one side of a square whose area is equal to the area of a
rectangle In Euclidean plane geometry, a rectangle is a quadrilateral with four right angles. It can also be defined as: an equiangular quadrilateral, since equiangular means that all of its angles are equal (360°/4 = 90°); or a parallelogram contain ...
with sides of lengths a and b. Similarly, the geometric mean of three numbers, a, b, and c, is the length of one edge of a
cube In geometry, a cube is a three-dimensional solid object bounded by six square faces, facets or sides, with three meeting at each vertex. Viewed from a corner it is a hexagon and its net is usually depicted as a cross. The cube is the only ...
whose volume is the same as that of a
cuboid In geometry, a cuboid is a hexahedron, a six-faced solid. Its faces are quadrilaterals. Cuboid means "like a cube", in the sense that by adjusting the length of the edges or the angles between edges and faces a cuboid can be transformed into a c ...
with sides whose lengths are equal to the three given numbers. The geometric mean is one of the three classical Pythagorean means, together with the arithmetic mean and the
harmonic mean In mathematics, the harmonic mean is one of several kinds of average, and in particular, one of the Pythagorean means. It is sometimes appropriate for situations when the average rate is desired. The harmonic mean can be expressed as the recipro ...
. For all positive data sets containing at least one pair of unequal values, the harmonic mean is always the least of the three means, while the arithmetic mean is always the greatest of the three and the geometric mean is always in between (see Inequality of arithmetic and geometric means.)


Calculation

The geometric mean of a data set \left\ is given by: :\left(\prod_^n a_i \right)^\frac = \sqrt The above figure uses capital pi notation to show a series of multiplications. Each side of the equal sign shows that a set of values is multiplied in succession (the number of values is represented by "n") to give a total product of the set, and then the ''n''th root of the total product is taken to give the geometric mean of the original set. For example, in a set of four numbers \, the product of 1 \times 2 \times 3 \times 4 is 24, and the geometric mean is the fourth root of 24, or ~ 2.213. The exponent \frac on the left side is equivalent to the taking ''n''th root. For example, 24^\frac = \sqrt /math>.


Iterative means

The geometric mean of a data set is less than the data set's
arithmetic mean In mathematics and statistics, the arithmetic mean ( ) or arithmetic average, or just the '' mean'' or the ''average'' (when the context is clear), is the sum of a collection of numbers divided by the count of numbers in the collection. The co ...
unless all members of the data set are equal, in which case the geometric and arithmetic means are equal. This allows the definition of the arithmetic-geometric mean, an intersection of the two which always lies in between. The geometric mean is also the arithmetic-harmonic mean in the sense that if two
sequence In mathematics, a sequence is an enumerated collection of objects in which repetitions are allowed and order matters. Like a set, it contains members (also called ''elements'', or ''terms''). The number of elements (possibly infinite) is called ...
s (a_n) and (h_n) are defined: :a_ = \frac, \quad a_0 = x and :h_ = \frac, \quad h_0 = y where h_ is the
harmonic mean In mathematics, the harmonic mean is one of several kinds of average, and in particular, one of the Pythagorean means. It is sometimes appropriate for situations when the average rate is desired. The harmonic mean can be expressed as the recipro ...
of the previous values of the two sequences, then a_n and h_n will converge to the geometric mean of x and y. The sequences converge to a common limit, and the geometric mean is preserved: :\sqrt = \sqrt = \sqrt = \sqrt Replacing the arithmetic and harmonic mean by a pair of generalized means of opposite, finite exponents yields the same result.


Relationship with logarithms

The geometric mean can also be expressed as the exponential of the arithmetic mean of logarithms. By using logarithmic identities to transform the formula, the multiplications can be expressed as a sum and the power as a multiplication: When a_1, a_2, \dots, a_n > 0 : \left( \prod_^n a_i \right)^\frac = \exp\left frac \sum_^n \ln a_i\right :As: :\begin \left ( \prod_^a_i \right )^&= \sqrt \ &=e ^\\ &=e ^\\ &=e ^\\ \texta\text&=e^ \end :alternatively, use any positive real number base, for both the logarithms and the number you are raising to the power of the arithmetic mean of the individual logarithms at that same base. additionally, if negative values of the a_i are allowed, : \left( \prod_^n a_i \right)^\frac = \left(\left(-1\right)^m\right)^\frac \exp\left a_i\ \right where is the number of negative numbers. This is sometimes called the log-average (not to be confused with the
logarithmic average In mathematics, the logarithmic mean is a function of two non-negative numbers which is equal to their difference divided by the logarithm of their quotient. This calculation is applicable in engineering problems involving heat and mas ...
). It is simply computing the
arithmetic mean In mathematics and statistics, the arithmetic mean ( ) or arithmetic average, or just the '' mean'' or the ''average'' (when the context is clear), is the sum of a collection of numbers divided by the count of numbers in the collection. The co ...
of the logarithm-transformed values of a_i (i.e., the arithmetic mean on the log scale) and then using the exponentiation to return the computation to the original scale, i.e., it is the generalised f-mean with f(x) = \log x. For example, the geometric mean of 2 and 8 can be calculated as the following, where b is any base of a
logarithm In mathematics, the logarithm is the inverse function to exponentiation. That means the logarithm of a number  to the base  is the exponent to which must be raised, to produce . For example, since , the ''logarithm base'' 10 ...
(commonly 2, e or 10): :b^ = 4 Related to the above, it can be seen that for a given sample of points a_1, \ldots, a_n, the geometric mean is the minimizer of :f(a) = \sum_^n (\log(a_i) - \log(a))^2 = \sum_^n (\log(a_i/a) )^2, whereas the arithmetic mean is the minimizer of :f(a) = \sum_^n (a_i - a)^2. Thus, the geometric mean provides a summary of the samples whose exponent best matches the exponents of the samples (in the least squares sense). The log form of the geometric mean is generally the preferred alternative for implementation in computer languages because calculating the product of many numbers can lead to an arithmetic overflow or arithmetic underflow. This is less likely to occur with the sum of the logarithms for each number.


Comparison to arithmetic mean

The geometric mean of a non-empty data set of (positive) numbers is always at most their arithmetic mean. Equality is only obtained when all numbers in the data set are equal; otherwise, the geometric mean is smaller. For example, the geometric mean of 2 and 3 is 2.45, while their arithmetic mean is 2.5. In particular, this means that when a set of non-identical numbers is subjected to a
mean-preserving spread In probability and statistics, a mean-preserving spread (MPS) is a change from one probability distribution A to another probability distribution B, where B is formed by spreading out one or more portions of A's probability density function or pr ...
— that is, the elements of the set are "spread apart" more from each other while leaving the arithmetic mean unchanged — their geometric mean decreases.


Average growth rate

In many cases the geometric mean is the best measure to determine the average growth rate of some quantity. (For example, if in one year sales increases by 80% and the next year by 25%, the end result is the same as that of a constant growth rate of 50%, since the geometric mean of 1.80 and 1.25 is 1.50.) In order to determine the average growth rate, it is not necessary to take the product of the measured growth rates at every step. Let the quantity be given as the sequence a_0, a_1,..., a_n, where n is the number of steps from the initial to final state. The growth rate between successive measurements a_k and a_ is a_/a_k. The geometric mean of these growth rates is then just: :\left( \frac \frac \cdots \frac \right)^\frac = \left(\frac\right)^\frac.


Application to normalized values

The fundamental property of the geometric mean, which does not hold for any other mean, is that for two sequences X and Y of equal length, : \operatorname\left(\frac\right) = \frac This makes the geometric mean the only correct mean when averaging ''normalized'' results; that is, results that are presented as ratios to reference values. This is the case when presenting computer performance with respect to a reference computer, or when computing a single average index from several heterogeneous sources (for example, life expectancy, education years, and infant mortality). In this scenario, using the arithmetic or harmonic mean would change the ranking of the results depending on what is used as a reference. For example, take the following comparison of execution time of computer programs: Table 1 The arithmetic and geometric means "agree" that computer C is the fastest. However, by presenting appropriately normalized values ''and'' using the arithmetic mean, we can show either of the other two computers to be the fastest. Normalizing by A's result gives A as the fastest computer according to the arithmetic mean: Table 2 while normalizing by B's result gives B as the fastest computer according to the arithmetic mean but A as the fastest according to the harmonic mean: Table 3 and normalizing by C's result gives C as the fastest computer according to the arithmetic mean but A as the fastest according to the harmonic mean: Table 4 In all cases, the ranking given by the geometric mean stays the same as the one obtained with unnormalized values. However, this reasoning has been questioned. Giving consistent results is not always equal to giving the correct results. In general, it is more rigorous to assign weights to each of the programs, calculate the average weighted execution time (using the arithmetic mean), and then normalize that result to one of the computers. The three tables above just give a different weight to each of the programs, explaining the inconsistent results of the arithmetic and harmonic means (Table 4 gives equal weight to both programs, the Table 2 gives a weight of 1/1000 to the second program, and the Table 3 gives a weight of 1/100 to the second program and 1/10 to the first one). The use of the geometric mean for aggregating performance numbers should be avoided if possible, because multiplying execution times has no physical meaning, in contrast to adding times as in the arithmetic mean. Metrics that are inversely proportional to time (speedup, IPC) should be averaged using the harmonic mean. The geometric mean can be derived from the generalized mean as its limit as p goes to zero. Similarly, this is possible for the weighted geometric mean.


Geometric mean of a continuous function

If f: ,bto(0, \infty) is a positive continuous real-valued function, its geometric mean over this interval is :\text = \exp\left(\frac\int_a^b\ln f(x)dx\right) For instance, taking the identity function f(x) = x over the unit interval shows that the geometric mean of the positive numbers between 0 and 1 is equal to \frac.


Applications


Proportional growth

The geometric mean is more appropriate than the
arithmetic mean In mathematics and statistics, the arithmetic mean ( ) or arithmetic average, or just the '' mean'' or the ''average'' (when the context is clear), is the sum of a collection of numbers divided by the count of numbers in the collection. The co ...
for describing proportional growth, both
exponential growth Exponential growth is a process that increases quantity over time. It occurs when the instantaneous rate of change (that is, the derivative) of a quantity with respect to time is proportional to the quantity itself. Described as a function, a ...
(constant proportional growth) and varying growth; in business the geometric mean of growth rates is known as the compound annual growth rate (CAGR). The geometric mean of growth over periods yields the equivalent constant growth rate that would yield the same final amount. Suppose an orange tree yields 100 oranges one year and then 180, 210 and 300 the following years, so the growth is 80%, 16.6666% and 42.8571% for each year respectively. Using the
arithmetic mean In mathematics and statistics, the arithmetic mean ( ) or arithmetic average, or just the '' mean'' or the ''average'' (when the context is clear), is the sum of a collection of numbers divided by the count of numbers in the collection. The co ...
calculates a (linear) average growth of 46.5079% (80% + 16.6666% + 42.8571%, that sum then divided by 3). However, if we start with 100 oranges and let it grow 46.5079% each year, the result is 314 oranges, not 300, so the linear average ''over''-states the year-on-year growth. Instead, we can use the geometric mean. Growing with 80% corresponds to multiplying with 1.80, so we take the geometric mean of 1.80, 1.166666 and 1.428571, i.e. \sqrt \approx 1.442249; thus the "average" growth per year is 44.2249%. If we start with 100 oranges and let the number grow with 44.2249% each year, the result is 300 oranges.


Financial

The geometric mean has from time to time been used to calculate financial indices (the averaging is over the components of the index). For example, in the past the
FT 30 The FT 30 (''FT Ordinary Index'' or ''FTOI'', not "FTSE 30") is a now rarely used index that is similar to the Dow Jones Industrial Average. As an index of stocks to represent the real trends on the market, the FT 30 has been superseded by the FTS ...
index used a geometric mean. It is also used in the recently introduced " RPIJ" measure of inflation in the United Kingdom and in the European Union. This has the effect of understating movements in the index compared to using the arithmetic mean.


Applications in the social sciences

Although the geometric mean has been relatively rare in computing social statistics, starting from 2010 the United Nations Human Development Index did switch to this mode of calculation, on the grounds that it better reflected the non-substitutable nature of the statistics being compiled and compared: : The geometric mean decreases the level of substitutability between dimensions eing comparedand at the same time ensures that a 1 percent decline in say life expectancy at birth has the same impact on the HDI as a 1 percent decline in education or income. Thus, as a basis for comparisons of achievements, this method is also more respectful of the intrinsic differences across the dimensions than a simple average. Not all values used to compute the HDI (Human Development Index) are normalized; some of them instead have the form \left(X - X_\text\right) / \left(X_\text - X_\text\right). This makes the choice of the geometric mean less obvious than one would expect from the "Properties" section above. The equally distributed welfare equivalent income associated with an Atkinson Index with an inequality aversion parameter of 1.0 is simply the geometric mean of incomes. For values other than one, the equivalent value is an Lp norm divided by the number of elements, with p equal to one minus the inequality aversion parameter.


Geometry

In the case of a right triangle, its altitude is the length of a line extending perpendicularly from the hypotenuse to its 90° vertex. Imagining that this line splits the hypotenuse into two segments, the geometric mean of these segment lengths is the length of the altitude. This property is known as the
geometric mean theorem The right triangle altitude theorem or geometric mean theorem is a result in elementary geometry that describes a relation between the altitude on the hypotenuse in a right triangle and the two line segments it creates on the hypotenuse. It states ...
. In an
ellipse In mathematics, an ellipse is a plane curve surrounding two focal points, such that for all points on the curve, the sum of the two distances to the focal points is a constant. It generalizes a circle, which is the special type of ellipse in ...
, the semi-minor axis is the geometric mean of the maximum and minimum distances of the ellipse from a focus; it is also the geometric mean of the
semi-major axis In geometry, the major axis of an ellipse is its longest diameter: a line segment that runs through the center and both foci, with ends at the two most widely separated points of the perimeter. The semi-major axis (major semiaxis) is the lon ...
and the
semi-latus rectum In mathematics, a conic section, quadratic curve or conic is a curve obtained as the intersection of the surface of a cone with a plane. The three types of conic section are the hyperbola, the parabola, and the ellipse; the circle is a sp ...
. The
semi-major axis In geometry, the major axis of an ellipse is its longest diameter: a line segment that runs through the center and both foci, with ends at the two most widely separated points of the perimeter. The semi-major axis (major semiaxis) is the lon ...
of an ellipse is the geometric mean of the distance from the center to either focus and the distance from the center to either directrix. Another way to think about it is as follows: Consider a circle with radius r. Now take two diametrically opposite points on the circle and apply pressure from both ends to deform it into an ellipse with semi-major and semi-minor axes of lengths a and b. Since the area of the circle and the ellipse stays the same, we have: \begin \pi r^2 &= \pi a b \\ r^2 &= a b \\ r &= \sqrt \end The radius of the circle is the geometric mean of the semi-major and the semi-minor axes of the ellipse formed by deforming the circle. Distance to the
horizon The horizon is the apparent line that separates the surface of a celestial body from its sky when viewed from the perspective of an observer on or near the surface of the relevant body. This line divides all viewing directions based on whether i ...
of a
sphere A sphere () is a geometrical object that is a three-dimensional analogue to a two-dimensional circle. A sphere is the set of points that are all at the same distance from a given point in three-dimensional space.. That given point is the c ...
is approximately equal to the geometric mean of the distance to the closest point of the sphere and the distance to the farthest point of the sphere when the distance to the closest point of the sphere is small. Both in the approximation of squaring the circle according to S.A. Ramanujan (1914) and in the construction of the Heptadecagon according to ''"sent by T. P. Stowell, credited to Leybourn's Math. Repository, 1818"'', the geometric mean is employed.


Aspect ratios

The geometric mean has been used in choosing a compromise aspect ratio in film and video: given two aspect ratios, the geometric mean of them provides a compromise between them, distorting or cropping both in some sense equally. Concretely, two equal area rectangles (with the same center and parallel sides) of different aspect ratios intersect in a rectangle whose aspect ratio is the geometric mean, and their hull (smallest rectangle which contains both of them) likewise has the aspect ratio of their geometric mean. In the choice of 16:9 aspect ratio by the SMPTE, balancing 2.35 and 4:3, the geometric mean is \sqrt \approx 1.7701, and thus 16:9 = 1.77\overline... was chosen. This was discovered empirically by Kerns Powers, who cut out rectangles with equal areas and shaped them to match each of the popular aspect ratios. When overlapped with their center points aligned, he found that all of those aspect ratio rectangles fit within an outer rectangle with an aspect ratio of 1.77:1 and all of them also covered a smaller common inner rectangle with the same aspect ratio 1.77:1. The value found by Powers is exactly the geometric mean of the extreme aspect ratios, 4:3(1.33:1) and
CinemaScope CinemaScope is an anamorphic lens series used, from 1953 to 1967, and less often later, for shooting widescreen films that, crucially, could be screened in theatres using existing equipment, albeit with a lens adapter. Its creation in 1953 by ...
(2.35:1), which is coincidentally close to 16:9 (1.77\overline:1). The intermediate ratios have no effect on the result, only the two extreme ratios. Applying the same geometric mean technique to 16:9 and 4:3 approximately yields the 14:9 (1.55\overline...) aspect ratio, which is likewise used as a compromise between these ratios. In this case 14:9 is exactly the ''
arithmetic mean In mathematics and statistics, the arithmetic mean ( ) or arithmetic average, or just the '' mean'' or the ''average'' (when the context is clear), is the sum of a collection of numbers divided by the count of numbers in the collection. The co ...
'' of 16:9 and 4:3 = 12:9, since 14 is the average of 16 and 12, while the precise ''geometric mean'' is \sqrt \approx 1.5396 \approx 13.8:9, but the two different ''means'', arithmetic and geometric, are approximately equal because both numbers are sufficiently close to each other (a difference of less than 2%).


Paper formats

The geometric mean is also used to calculate B and C series paper formats. The B_n format has an area which is the geometric mean of the areas of A_n and A_. For example, the area of a B1 paper is \frac\mathrm m^2, because it is the geometric mean of the areas of an A0 (1\mathrm m^2) and an A1 (\frac\mathrm m^2) paper (\sqrt=\sqrt=\frac\mathrm m^2= \frac\mathrm m^2). The same principle applies with the C series, whose area is the geometric mean of the A and B series. For example, the C4 format has an area which is the geometric mean of the areas of A4 and B4. An advantage that comes from this relationship is that an A4 paper fits inside a C4 envelope, and both fit inside a B4 envelope.


Other applications

*''Spectral flatness'': in
signal processing Signal processing is an electrical engineering subfield that focuses on analyzing, modifying and synthesizing '' signals'', such as sound, images, and scientific measurements. Signal processing techniques are used to optimize transmissions, ...
,
spectral flatness Spectral flatness or tonality coefficient, also known as Wiener entropy, is a measure used in digital signal processing to characterize an audio spectrum. Spectral flatness is typically measured in decibels, and provides a way to quantify how m ...
, a measure of how flat or spiky a spectrum is, is defined as the ratio of the geometric mean of the power spectrum to its arithmetic mean. *''Anti-reflective coatings'': In optical coatings, where reflection needs to be minimised between two media of refractive indices ''n''0 and ''n''2, the optimum refractive index ''n''1 of the
anti-reflective coating An antireflective, antiglare or anti-reflection (AR) coating is a type of optical coating applied to the surface of lenses, other optical elements, and photovoltaic cells to reduce reflection. In typical imaging systems, this improves the ef ...
is given by the geometric mean: n_1 = \sqrt. *''Subtractive color mixing'': The spectral reflectance curve for paint mixtures (of equal tinting strength, opacity and
dilution Dilution may refer to: * Reducing the concentration of a chemical * Serial dilution, a common way of going about this reduction of concentration * Homeopathic dilution * Dilution (equation), an equation to calculate the rate a gas dilutes * Trad ...
) is approximately the geometric mean of the paints' individual reflectance curves computed at each wavelength of their spectra. *''Image processing'': The geometric mean filter is used as a noise filter in
image processing An image is a visual representation of something. It can be two-dimensional, three-dimensional, or somehow otherwise feed into the visual system to convey information. An image can be an artifact, such as a photograph or other two-dimensio ...
.


See also

* Arithmetic-geometric mean * Generalized mean *
Geometric mean theorem The right triangle altitude theorem or geometric mean theorem is a result in elementary geometry that describes a relation between the altitude on the hypotenuse in a right triangle and the two line segments it creates on the hypotenuse. It states ...
*
Geometric standard deviation In probability theory and statistics, the geometric standard deviation (GSD) describes how spread out are a set of numbers whose preferred average is the geometric mean. For such data, it may be preferred to the more usual standard deviation. Note ...
*
Harmonic mean In mathematics, the harmonic mean is one of several kinds of average, and in particular, one of the Pythagorean means. It is sometimes appropriate for situations when the average rate is desired. The harmonic mean can be expressed as the recipro ...
* Heronian mean *
Heteroscedasticity In statistics, a sequence (or a vector) of random variables is homoscedastic () if all its random variables have the same finite variance. This is also known as homogeneity of variance. The complementary notion is called heteroscedasticity. The ...
* Hyperbolic coordinates *
Log-normal distribution In probability theory, a log-normal (or lognormal) distribution is a continuous probability distribution of a random variable whose logarithm is normally distributed. Thus, if the random variable is log-normally distributed, then has a norma ...
*
Muirhead's inequality In mathematics, Muirhead's inequality, named after Robert Franklin Muirhead, also known as the "bunching" method, generalizes the inequality of arithmetic and geometric means. Preliminary definitions ''a''-mean For any real vector :a=(a_1,\d ...
* Product * Pythagorean means *
Quadratic mean In mathematics and its applications, the root mean square of a set of numbers x_i (abbreviated as RMS, or rms and denoted in formulas as either x_\mathrm or \mathrm_x) is defined as the square root of the mean square (the arithmetic mean of the ...
* Quadrature (mathematics) * Quasi-arithmetic mean ( generalized f-mean) * Rate of return * Weighted geometric mean


Notes and references


External links


Calculation of the geometric mean of two numbers in comparison to the arithmetic solutionArithmetic and geometric means
* ttp://mathworld.wolfram.com/GeometricMean.html Geometric Mean on MathWorldbr>Geometric Meaning of the Geometric MeanGeometric Mean Calculator for larger data sets
using Geometric Mean ]
Non-Newtonian calculus websiteGeometric Mean Definition and Formula

The Distribution of the Geometric Mean

The geometric mean?
{{DEFAULTSORT:Geometric Mean Means Non-Newtonian calculus