HOME

TheInfoList



OR:

In
mathematics Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...
and
statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
, the arithmetic mean ( ), arithmetic average, or just the ''
mean A mean is a quantity representing the "center" of a collection of numbers and is intermediate to the extreme values of the set of numbers. There are several kinds of means (or "measures of central tendency") in mathematics, especially in statist ...
'' or ''
average In colloquial, ordinary language, an average is a single number or value that best represents a set of data. The type of average taken as most typically representative of a list of numbers is the arithmetic mean the sum of the numbers divided by ...
'' is the sum of a collection of numbers divided by the count of numbers in the collection. The collection is often a set of results from an
experiment An experiment is a procedure carried out to support or refute a hypothesis, or determine the efficacy or likelihood of something previously untried. Experiments provide insight into cause-and-effect by demonstrating what outcome occurs whe ...
, an
observational study In fields such as epidemiology, social sciences, psychology and statistics, an observational study draws inferences from a sample (statistics), sample to a statistical population, population where the dependent and independent variables, independ ...
, or a survey. The term "arithmetic mean" is preferred in some contexts in mathematics and statistics because it helps to distinguish it from other types of means, such as geometric and
harmonic In physics, acoustics, and telecommunications, a harmonic is a sinusoidal wave with a frequency that is a positive integer multiple of the ''fundamental frequency'' of a periodic signal. The fundamental frequency is also called the ''1st har ...
. Arithmetic means are also frequently used in
economics Economics () is a behavioral science that studies the Production (economics), production, distribution (economics), distribution, and Consumption (economics), consumption of goods and services. Economics focuses on the behaviour and interac ...
,
anthropology Anthropology is the scientific study of humanity, concerned with human behavior, human biology, cultures, society, societies, and linguistics, in both the present and past, including archaic humans. Social anthropology studies patterns of behav ...
,
history History is the systematic study of the past, focusing primarily on the Human history, human past. As an academic discipline, it analyses and interprets evidence to construct narratives about what happened and explain why it happened. Some t ...
, and almost every other
academic field An academic discipline or academic field is a subdivision of knowledge that is taught and researched at the college or university level. Disciplines are defined (in part) and recognized by the academic journals in which research is published, a ...
to some extent. For example,
per capita income Per capita income (PCI) or average income measures the average income earned per person in a given area (city, region, country, etc.) in a specified year. In many countries, per capita income is determined using regular population surveys, such ...
is the arithmetic average of the
income Income is the consumption and saving opportunity gained by an entity within a specified timeframe, which is generally expressed in monetary terms. Income is difficult to define conceptually and the definition may be different across fields. F ...
of a
nation A nation is a type of social organization where a collective Identity (social science), identity, a national identity, has emerged from a combination of shared features across a given population, such as language, history, ethnicity, culture, t ...
's
population Population is a set of humans or other organisms in a given region or area. Governments conduct a census to quantify the resident population size within a given jurisdiction. The term is also applied to non-human animals, microorganisms, and pl ...
. While the arithmetic mean is often used to report central tendencies, it is not a robust statistic: it is greatly influenced by
outlier In statistics, an outlier is a data point that differs significantly from other observations. An outlier may be due to a variability in the measurement, an indication of novel data, or it may be the result of experimental error; the latter are ...
s (
values In ethics and social sciences, value denotes the degree of importance of some thing or action, with the aim of determining which actions are best to do or what way is best to live ( normative ethics), or to describe the significance of different a ...
much larger or smaller than most others). For
skewed distribution In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real number, real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined. For ...
s, such as the distribution of income for which a few people's incomes are substantially higher than most people's, the arithmetic mean may not coincide with one's notion of "middle". In that case, robust statistics, such as the
median The median of a set of numbers is the value separating the higher half from the lower half of a Sample (statistics), data sample, a statistical population, population, or a probability distribution. For a data set, it may be thought of as the “ ...
, may provide a better description of central tendency.


Definition

The arithmetic mean of a set of observed data is equal to the sum of the numerical values of each observation, divided by the total number of observations. Symbolically, for a data set consisting of the values x_1,\dots,x_n, the arithmetic mean is defined by the formula: :\bar=\frac\left (\sum_^n\right) =\frac In simpler terms, the formula for the arithmetic mean is: \frac For example, if the monthly salaries of 5 employees are \, then the arithmetic mean is: :\frac=2520 If the data set is a
statistical population In statistics, a population is a set of similar items or events which is of interest for some question or experiment. A statistical population can be a group of existing objects (e.g. the set of all stars within the Milky Way galaxy) or a hyp ...
(i.e. consists of every possible observation and not just a
subset In mathematics, a Set (mathematics), set ''A'' is a subset of a set ''B'' if all Element (mathematics), elements of ''A'' are also elements of ''B''; ''B'' is then a superset of ''A''. It is possible for ''A'' and ''B'' to be equal; if they a ...
of them), then the mean of that population is called the ''
population mean In statistics, a population is a set of similar items or events which is of interest for some question or experiment. A statistical population can be a group of existing objects (e.g. the set of all stars within the Milky Way galaxy) or a hyp ...
'' and denoted by the
Greek letter The Greek alphabet has been used to write the Greek language since the late 9th or early 8th century BC. It was derived from the earlier Phoenician alphabet, and is the earliest known alphabetic script to systematically write vowels as wel ...
\mu. If the data set is a
statistical sample In this statistics, quality assurance, and survey methodology, sampling is the selection of a subset or a statistical sample (termed sample for short) of individuals from within a statistical population to estimate characteristics of the whole ...
(a subset of the population), it is called the '' sample mean'' (which for a data set X is denoted as \overline). The arithmetic mean can be similarly defined for vectors in multiple dimensions, not only scalar values; this is often referred to as a
centroid In mathematics and physics, the centroid, also known as geometric center or center of figure, of a plane figure or solid figure is the arithmetic mean position of all the points in the figure. The same definition extends to any object in n-d ...
. More generally, because the arithmetic mean is a convex combination (meaning its coefficients sum to 1), it can be defined on a convex space, not only a
vector space In mathematics and physics, a vector space (also called a linear space) is a set (mathematics), set whose elements, often called vector (mathematics and physics), ''vectors'', can be added together and multiplied ("scaled") by numbers called sc ...
.


History

Statistician Churchill Eisenhart, senior researcher fellow at the U. S. National Bureau of Standards, traced the history of the arithmetic mean in detail. In the modern age, it started to be used as a way of combining various observations that should be identical, but were not such as estimates of the direction of
magnetic north The north magnetic pole, also known as the magnetic north pole, is a point on the surface of Earth's Northern Hemisphere at which the planet's magnetic field points vertically downward (in other words, if a magnetic compass needle is allowed t ...
. In , mathematician Henry Gellibrand described as "meane" the
midpoint In geometry, the midpoint is the middle point of a line segment. It is equidistant from both endpoints, and it is the centroid both of the segment and of the endpoints. It bisects the segment. Formula The midpoint of a segment in ''n''-dim ...
of a lowest and highest number, not quite the arithmetic mean. In , a person known as "D. B." was quoted in the Transactions of the Royal Society describing "taking the mean" of five values:


Motivating properties

The arithmetic mean has several properties that make it interesting, especially as a measure of central tendency. These include: *If numbers x_1,\dotsc,x_n have a mean \bar, then (x_1-\bar)+\dotsb+(x_n-\bar)=0. Since x_i-\bar is the distance from a given number to the mean, one way to interpret this property is by saying that the numbers to the left of the mean are balanced by the numbers to the right. The mean is the only number for which the residuals (deviations from the estimate) sum to zero. This can also be interpreted as saying that the mean is translationally invariant in the sense that for any real number a, \overline = \bar + a. *If it is required to use a single number as a "typical" value for a set of known numbers x_1,\dotsc,x_n, then the arithmetic mean of the numbers does this best since it minimizes the sum of squared deviations from the typical value: the sum of (x_i-\bar)^2. The sample mean is also the best single predictor because it has the lowest root mean squared error. If the arithmetic mean of a population of numbers is desired, then the estimate of it that is
unbiased Bias is a disproportionate weight ''in favor of'' or ''against'' an idea or thing, usually in a way that is inaccurate, closed-minded, prejudicial, or unfair. Biases can be innate or learned. People may develop biases for or against an individ ...
is the arithmetic mean of a sample drawn from the population. *The arithmetic mean is independent of scale of the units of measurement, in the sense that \text(ca_,\cdots,ca_)=c\cdot\text(a_,\cdots,a_). So, for example, calculating a mean of liters and then converting to gallons is the same as converting to gallons first and then calculating the mean. This is also called first order homogeneity.


Additional properties

*The arithmetic mean of a sample is always between the largest and smallest values in that sample. *The arithmetic mean of any amount of equal-sized number groups together is the arithmetic mean of the arithmetic means of each group.


Contrast with median

The arithmetic mean may be contrasted with the
median The median of a set of numbers is the value separating the higher half from the lower half of a Sample (statistics), data sample, a statistical population, population, or a probability distribution. For a data set, it may be thought of as the “ ...
. The median is defined such that no more than half the values are larger, and no more than half are smaller than it. If elements in the data increase arithmetically when placed in some order, then the median and arithmetic average are equal. For example, consider the data sample \. The mean is 2.5, as is the median. However, when we consider a sample that cannot be arranged to increase arithmetically, such as \, the median and arithmetic average can differ significantly. In this case, the arithmetic average is 6.2, while the median is 4. The average value can vary considerably from most values in the sample and can be larger or smaller than most. There are applications of this phenomenon in many fields. For example, since the 1980s, the
median income The median income is the income amount that divides a population into two groups, half having an income above that amount, and half having an income below that amount. It may differ from the mean (or average) income. Both of these are ways of unde ...
in the
United States The United States of America (USA), also known as the United States (U.S.) or America, is a country primarily located in North America. It is a federal republic of 50 U.S. state, states and a federal capital district, Washington, D.C. The 48 ...
has increased more slowly than the arithmetic average of income.


Generalizations


Weighted average

A weighted average, or weighted mean, is an average in which some data points count more heavily than others in that they are given more weight in the calculation. For example, the arithmetic mean of 3 and 5 is \frac=4, or equivalently 3 \cdot \frac+5 \cdot \frac=4. In contrast, a ''weighted'' mean in which the first number receives, for example, twice as much weight as the second (perhaps because it is assumed to appear twice as often in the general population from which these numbers were sampled) would be calculated as 3 \cdot \frac+5 \cdot \frac=\frac. Here the weights, which necessarily sum to one, are \frac and \frac, the former being twice the latter. The arithmetic mean (sometimes called the "unweighted average" or "equally weighted average") can be interpreted as a special case of a weighted average in which all weights are equal to the same number (\frac in the above example and \frac in a situation with n numbers being averaged).


Functions


Continuous probability distributions

If a numerical property, and any sample of data from it, can take on any value from a continuous range instead of, for example, just integers, then the
probability Probability is a branch of mathematics and statistics concerning events and numerical descriptions of how likely they are to occur. The probability of an event is a number between 0 and 1; the larger the probability, the more likely an e ...
of a number falling into some range of possible values can be described by integrating a
continuous probability distribution In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
across this range, even when the naive probability for a sample number taking one certain value from infinitely many is zero. In this context, the analog of a weighted average, in which there are infinitely many possibilities for the precise value of the variable in each range, is called the ''mean of the
probability distribution In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
''. The most widely encountered probability distribution is called the
normal distribution In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is f(x) = \frac ...
; it has the property that all measures of its central tendency, including not just the mean but also the median mentioned above and the mode (the three Ms), are equal. This equality does not hold for other probability distributions, as illustrated for the log-normal distribution here.


Angles

Particular care is needed when using cyclic data, such as phases or
angle In Euclidean geometry, an angle can refer to a number of concepts relating to the intersection of two straight Line (geometry), lines at a Point (geometry), point. Formally, an angle is a figure lying in a Euclidean plane, plane formed by two R ...
s. Taking the arithmetic mean of 1 ° and 359° yields a result of 180°. This is incorrect for two reasons: #Angle measurements are only defined up to an additive constant of 360° (2\pi or \tau, if measuring in
radian The radian, denoted by the symbol rad, is the unit of angle in the International System of Units (SI) and is the standard unit of angular measure used in many areas of mathematics. It is defined such that one radian is the angle subtended at ...
s). Thus, these could easily be called 1° and -1°, or 361° and 719°, since each one of them produces a different average. #In this situation, 0° (or 360°) is geometrically a better ''average'' value: there is lower dispersion about it (the points are both 1° from it and 179° from 180°, the putative average). In general application, such an oversight will lead to the average value artificially moving towards the middle of the numerical range. A solution to this problem is to use the optimization formulation (that is, define the mean as the central point: the point about which one has the lowest dispersion) and redefine the difference as a modular distance (i.e. the distance on the circle: so the modular distance between 1° and 359° is 2°, not 358°).


Symbols and encoding

The arithmetic mean is often denoted by a bar ( vinculum or macron), as in \bar.


See also

* Fréchet mean *
Generalized mean In mathematics, generalised means (or power mean or Hölder mean from Otto Hölder) are a family of functions for aggregating sets of numbers. These include as special cases the Pythagorean means (arithmetic mean, arithmetic, geometric mean, ge ...
*
Inequality of arithmetic and geometric means Inequality may refer to: * Inequality (mathematics), a relation between two quantities when they are different. * Economic inequality, difference in economic well-being between population groups ** Income inequality, an unequal distribution of in ...
* Sample mean and covariance *
Standard deviation In statistics, the standard deviation is a measure of the amount of variation of the values of a variable about its Expected value, mean. A low standard Deviation (statistics), deviation indicates that the values tend to be close to the mean ( ...
* Standard error of the mean *
Summary statistics In descriptive statistics, summary statistics are used to summarize a set of observations, in order to communicate the largest amount of information as simply as possible. Statisticians commonly try to describe the observations in * a measure of ...


Notes


References


Further reading

*


External links


Calculations and comparisons between arithmetic mean and geometric mean of two numbersCalculate the arithmetic mean of a series of numbers and compare it with median and mode
{{DEFAULTSORT:Arithmetic Mean Means