Mean (average)
   HOME

TheInfoList



OR:

In
mathematics Mathematics is an area of knowledge that includes the topics of numbers, formulas and related structures, shapes and the spaces in which they are contained, and quantities and their changes. These topics are represented in modern mathematics ...
and
statistics Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
, the arithmetic mean ( ) or arithmetic average, or just the ''
mean There are several kinds of mean in mathematics, especially in statistics. Each mean serves to summarize a given group of data, often to better understand the overall value (magnitude and sign) of a given data set. For a data set, the ''arithme ...
'' or the ''
average In ordinary language, an average is a single number taken as representative of a list of numbers, usually the sum of the numbers divided by how many numbers are in the list (the arithmetic mean). For example, the average of the numbers 2, 3, 4, 7, ...
'' (when the context is clear), is the sum of a collection of numbers divided by the count of numbers in the collection. The collection is often a set of results of an
experiment An experiment is a procedure carried out to support or refute a hypothesis, or determine the efficacy or likelihood of something previously untried. Experiments provide insight into Causality, cause-and-effect by demonstrating what outcome oc ...
or an
observational study In fields such as epidemiology, social sciences, psychology and statistics, an observational study draws inferences from a sample (statistics), sample to a statistical population, population where the dependent and independent variables, independ ...
, or frequently a set of results from a
survey Survey may refer to: Statistics and human research * Statistical survey, a method for collecting quantitative information about items in a population * Survey (human research), including opinion polls Spatial measurement * Surveying, the techniq ...
. The term "arithmetic mean" is preferred in some contexts in mathematics and statistics, because it helps distinguish it from other
mean There are several kinds of mean in mathematics, especially in statistics. Each mean serves to summarize a given group of data, often to better understand the overall value (magnitude and sign) of a given data set. For a data set, the ''arithme ...
s, such as the geometric mean and the harmonic mean. In addition to mathematics and statistics, the arithmetic mean is used frequently in many diverse fields such as
economics Economics () is the social science that studies the Production (economics), production, distribution (economics), distribution, and Consumption (economics), consumption of goods and services. Economics focuses on the behaviour and intera ...
,
anthropology Anthropology is the scientific study of humanity, concerned with human behavior, human biology, cultures, societies, and linguistics, in both the present and past, including past human species. Social anthropology studies patterns of behavi ...
and
history History (derived ) is the systematic study and the documentation of the human activity. The time period of event before the History of writing#Inventions of writing, invention of writing systems is considered prehistory. "History" is an umbr ...
, and it is used in almost every academic field to some extent. For example,
per capita income Per capita income (PCI) or total income measures the average income earned per person in a given area (city, region, country, etc.) in a specified year. It is calculated by dividing the area's total income by its total population. Per capita i ...
is the arithmetic average income of a nation's population. While the arithmetic mean is often used to report central tendencies, it is not a
robust statistic Robust statistics are statistics with good performance for data drawn from a wide range of probability distributions, especially for distributions that are not normal. Robust statistical methods have been developed for many common problems, su ...
, meaning that it is greatly influenced by outliers (values that are very much larger or smaller than most of the values). For
skewed distribution In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive, zero, negative, or undefined. For a unimoda ...
s, such as the
distribution of income In economics, income distribution covers how a country's total GDP is distributed amongst its population. Economic theory and economic policy have long seen income and its distribution as a central concern. Unequal distribution of income causes ec ...
for which a few people's incomes are substantially greater than most people's, the arithmetic mean may not coincide with one's notion of "middle", and robust statistics, such as the
median In statistics and probability theory, the median is the value separating the higher half from the lower half of a data sample, a population, or a probability distribution. For a data set, it may be thought of as "the middle" value. The basic fe ...
, may provide better description of central tendency.


Definition

Given a
data set A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record of the ...
X = \, the ''arithmetic mean'' (or ''mean'' or ''average''), denoted \bar (read x ''bar''), is the mean of the n values x_1,x_2,\ldots,x_n. The arithmetic mean is the most commonly used and readily understood measure of central tendency in a data set. In statistics, the term
average In ordinary language, an average is a single number taken as representative of a list of numbers, usually the sum of the numbers divided by how many numbers are in the list (the arithmetic mean). For example, the average of the numbers 2, 3, 4, 7, ...
refers to any of the measures of central tendency. The arithmetic mean of a set of observed data is defined as being equal to the sum of the numerical values of each and every observation, divided by the total number of observations. Symbolically, if we have a data set consisting of the values a_1, a_2, \ldots, a_n, then the arithmetic mean A is defined by the formula: :A=\frac\sum_^n a_i=\frac (for an explanation of the
summation operator In mathematics, summation is the addition of a sequence of any kind of numbers, called ''addends'' or ''summands''; the result is their ''sum'' or ''total''. Beside numbers, other types of values can be summed as well: functions, vectors, ma ...
, see
summation In mathematics, summation is the addition of a sequence of any kind of numbers, called ''addends'' or ''summands''; the result is their ''sum'' or ''total''. Beside numbers, other types of values can be summed as well: functions, vectors, mat ...
.) For example, if the monthly salaries of 10 employees of a firm are: 2500, 2700, 2400, 2300, 2550, 2650, 2750, 2450, 2600, 2400, then the arithmetic mean is : \frac=2530. If the data set is a
statistical population In statistics, a population is a set of similar items or events which is of interest for some question or experiment. A statistical population can be a group of existing objects (e.g. the set of all stars within the Milky Way galaxy) or a hypoth ...
(i.e., consists of every possible observation and not just a subset of them), then the mean of that population is called the ''
population mean In statistics, a population is a set of similar items or events which is of interest for some question or experiment. A statistical population can be a group of existing objects (e.g. the set of all stars within the Milky Way galaxy) or a hypothe ...
'', and denoted by the
Greek letter The Greek alphabet has been used to write the Greek language since the late 9th or early 8th century BCE. It is derived from the earlier Phoenician alphabet, and was the earliest known alphabetic script to have distinct letters for vowels as w ...
\mu. If the data set is a statistical sample (a subset of the population), it is called the ''
sample mean The sample mean (or "empirical mean") and the sample covariance are statistics computed from a Sample (statistics), sample of data on one or more random variables. The sample mean is the average value (or mean, mean value) of a sample (statistic ...
'' (which for a data set X is denoted as \overline). The arithmetic mean can be similarly defined for vectors in multiple dimension, not only
scalar Scalar may refer to: *Scalar (mathematics), an element of a field, which is used to define a vector space, usually the field of real numbers * Scalar (physics), a physical quantity that can be described by a single element of a number field such ...
values; this is often referred to as a
centroid In mathematics and physics, the centroid, also known as geometric center or center of figure, of a plane figure or solid figure is the arithmetic mean position of all the points in the surface of the figure. The same definition extends to any ...
. More generally, because the arithmetic mean is a
convex combination In convex geometry and vector algebra, a convex combination is a linear combination of points (which can be vectors, scalars, or more generally points in an affine space) where all coefficients are non-negative and sum to 1. In other w ...
(coefficients sum to 1), it can be defined on a convex space, not only a vector space.


Motivating properties

The arithmetic mean has several properties that make it useful, especially as a measure of central tendency. These include: * If numbers x_1,\dotsc,x_n have mean \bar, then (x_1-\bar) + \dotsb + (x_n-\bar) = 0. Since x_i-\bar is the distance from a given number to the mean, one way to interpret this property is as saying that the numbers to the left of the mean are balanced by the numbers to the right of the mean. The mean is the only single number for which the residuals (deviations from the estimate) sum to zero. * If it is required to use a single number as a "typical" value for a set of known numbers x_1,\dotsc,x_n, then the arithmetic mean of the numbers does this best, in the sense of minimizing the sum of squared deviations from the typical value: the sum of (x_i-\bar)^2. (It follows that the sample mean is also the best single predictor in the sense of having the lowest
root mean squared error The root-mean-square deviation (RMSD) or root-mean-square error (RMSE) is a frequently used measure of the differences between values (sample or population values) predicted by a model or an estimator and the values observed. The RMSD represents ...
.) If the arithmetic mean of a population of numbers is desired, then the estimate of it that is
unbiased Bias is a disproportionate weight ''in favor of'' or ''against'' an idea or thing, usually in a way that is closed-minded, prejudicial, or unfair. Biases can be innate or learned. People may develop biases for or against an individual, a group, ...
is the arithmetic mean of a sample drawn from the population.


Additional properties

* \text(c \cdot a_, c \cdot a_...c \cdot a_) = c \cdot \text(a_, a_...a_) * The arithmetic mean of any amount of equal-sized number groups together is the arithmetic mean of the arithmetic means of each group.


Contrast with median

The arithmetic mean may be contrasted with the
median In statistics and probability theory, the median is the value separating the higher half from the lower half of a data sample, a population, or a probability distribution. For a data set, it may be thought of as "the middle" value. The basic fe ...
. The median is defined such that no more than half the values are larger than, and no more than half are smaller than, the median. If elements in the data increase arithmetically, when placed in some order, then the median and arithmetic average are equal. For example, consider the data sample . The average is 2.5, as is the median. However, when we consider a sample that cannot be arranged so as to increase arithmetically, such as , the median and arithmetic average can differ significantly. In this case, the arithmetic average is 6.2, while the median is 4. In general, the average value can vary significantly from most values in the sample, and can be larger or smaller than most of them. There are applications of this phenomenon in many fields. For example, since the 1980s, the median income in the United States has increased more slowly than the arithmetic average of income.


Generalizations


Weighted average

A weighted average, or weighted mean, is an average in which some data points count more heavily than others, in that they are given more weight in the calculation. For example, the arithmetic mean of 3 and 5 is \frac{(3+5)}{2} = 4, or equivalently \left( \frac{1}{2} \cdot 3\right) + \left( \frac{1}{2} \cdot 5\right) = 4. In contrast, a ''weighted'' mean in which the first number receives, for example, twice as much weight as the second (perhaps because it is assumed to appear twice as often in the general population from which these numbers were sampled) would be calculated as \left( \frac{2}{3} \cdot 3\right) + \left(\frac{1}{3} \cdot 5\right) = \frac{11}{3}. Here the weights, which necessarily sum to the value one, are (2/3) and (1/3), the former being twice the latter. The arithmetic mean (sometimes called the "unweighted average" or "equally weighted average") can be interpreted as a special case of a weighted average in which all the weights are equal to each other (equal to \frac{1}{2} in the above example, and equal to \frac{1}{n} in a situation with n numbers being averaged).


Continuous probability distributions

If a numerical property, and any sample of data from it, could take on any value from a continuous range, instead of, for example, just integers, then the
probability Probability is the branch of mathematics concerning numerical descriptions of how likely an Event (probability theory), event is to occur, or how likely it is that a proposition is true. The probability of an event is a number between 0 and ...
of a number falling into some range of possible values can be described by integrating a
continuous probability distribution In probability theory and statistics, a probability distribution is the mathematical function that gives the probabilities of occurrence of different possible outcomes for an experiment. It is a mathematical description of a random phenomenon ...
across this range, even when the naive probability for a sample number taking one certain value from infinitely many is zero. The analog of a weighted average in this context, in which there are an infinite number of possibilities for the precise value of the variable in each range, is called the ''mean of the probability distribution''. A most widely encountered probability distribution is called the
normal distribution In statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is : f(x) = \frac e^ The parameter \mu ...
; it has the property that all measures of its central tendency, including not just the mean but also the aforementioned median and the
mode Mode ( la, modus meaning "manner, tune, measure, due measure, rhythm, melody") may refer to: Arts and entertainment * '' MO''D''E (magazine)'', a defunct U.S. women's fashion magazine * ''Mode'' magazine, a fictional fashion magazine which is ...
(the three Ms{{cite web, url=https://www.visualthesaurus.com/cm/lessons/the-three-ms-of-statistics-mode-median-mean/ , title=The Three M's of Statistics: Mode, Median, Mean June 30, 2010 , website=www.visualthesaurus.com, author= Thinkmap Visual Thesaurus , date=2010-06-30 , access-date=2018-12-03), are equal to each other. This equality does not hold for other probability distributions, as illustrated for the
log-normal distribution In probability theory, a log-normal (or lognormal) distribution is a continuous probability distribution of a random variable whose logarithm is normally distributed. Thus, if the random variable is log-normally distributed, then has a norma ...
here.


Angles

{{Main, Mean of circular quantities Particular care is needed when using cyclic data, such as phases or
angle In Euclidean geometry, an angle is the figure formed by two Ray (geometry), rays, called the ''Side (plane geometry), sides'' of the angle, sharing a common endpoint, called the ''vertex (geometry), vertex'' of the angle. Angles formed by two ...
s. Naively taking the arithmetic mean of 1° and 359° yields a result of 180°. This is incorrect for two reasons: * Firstly, angle measurements are only defined up to an additive constant of 360° (or 2π, if measuring in
radian The radian, denoted by the symbol rad, is the unit of angle in the International System of Units (SI) and is the standard unit of angular measure used in many areas of mathematics. The unit was formerly an SI supplementary unit (before that c ...
s). Thus, these could easily be called 1° and −1°, or 361° and 719°, since each one of them gives a different average. * Secondly, in this situation, 0° (equivalently, 360°) is geometrically a better ''average'' value: there is lower
dispersion Dispersion may refer to: Economics and finance * Dispersion (finance), a measure for the statistical distribution of portfolio returns * Price dispersion, a variation in prices across sellers of the same item *Wage dispersion, the amount of variat ...
about it (the points are both 1° from it, and 179° from 180°, the putative average). In general application, such an oversight will lead to the average value artificially moving towards the middle of the numerical range. A solution to this problem is to use the optimization formulation (
viz. The abbreviation ''viz.'' (or ''viz'' without a full stop) is short for the Latin , which itself is a contraction of the Latin phrase ''videre licet'', meaning "it is permitted to see". It is used as a synonym for "namely", "that is to say", "to ...
, define the mean as the central point: the point about which one has the lowest dispersion), and redefine the difference as a modular distance (i.e., the distance on the circle: so the modular distance between 1° and 359° is 2°, not 358°). {{AM_GM_inequality_visual_proof.svg


Symbols and encoding

The arithmetic mean is often denoted by a bar, (a.k.a. vinculum or macron), for example as in \bar{x} (read x ''bar''). Some software ( text processors,
web browser A web browser is application software for accessing websites. When a user requests a web page from a particular website, the browser retrieves its files from a web server and then displays the page on the user's screen. Browsers are used on ...
s) may not display the x̄ symbol properly. For example, the x̄ symbol in
HTML The HyperText Markup Language or HTML is the standard markup language for documents designed to be displayed in a web browser. It can be assisted by technologies such as Cascading Style Sheets (CSS) and scripting languages such as JavaScri ...
is actually a combination of two codes - the base letter x plus a code for the line above (̄ or ¯).{{Cite web, url=http://www.personal.psu.edu/ejp10/psu/gotunicode/statsym.html, title=Notes on Unicode for Stat Symbols, website=www.personal.psu.edu, access-date=2018-10-14 In some texts, such as pdfs, the x̄ symbol may be replaced by a cent (¢) symbol (
Unicode Unicode, formally The Unicode Standard,The formal version reference is is an information technology Technical standard, standard for the consistent character encoding, encoding, representation, and handling of Character (computing), text expre ...
¢), when copied to text processor such as
Microsoft Word Microsoft Word is a word processing software developed by Microsoft. It was first released on October 25, 1983, under the name ''Multi-Tool Word'' for Xenix systems. Subsequent versions were later written for several other platforms includin ...
.


See also

{{QM_AM_GM_HM_inequality_visual_proof.svg *
Fréchet mean In mathematics and statistics, the Fréchet mean is a generalization of centroids to metric spaces, giving a single representative point or central tendency for a cluster of points. It is named after Maurice Fréchet. Karcher mean is the renaming o ...
*
Generalized mean In mathematics, generalized means (or power mean or Hölder mean from Otto Hölder) are a family of functions for aggregating sets of numbers. These include as special cases the Pythagorean means (arithmetic, geometric, and harmonic means). D ...
* Geometric mean * Harmonic mean *
Inequality of arithmetic and geometric means In mathematics, the inequality of arithmetic and geometric means, or more briefly the AM–GM inequality, states that the arithmetic mean of a list of non-negative real numbers is greater than or equal to the geometric mean of the same list; and ...
*
Sample mean and covariance The sample mean (or "empirical mean") and the sample covariance are statistics computed from a sample of data on one or more random variables. The sample mean is the average value (or mean value) of a sample of numbers taken from a larger po ...
*
Standard deviation In statistics, the standard deviation is a measure of the amount of variation or dispersion of a set of values. A low standard deviation indicates that the values tend to be close to the mean (also called the expected value) of the set, while ...
*
Standard error of the mean The standard error (SE) of a statistic (usually an estimate of a parameter) is the standard deviation of its sampling distribution or an estimate of that standard deviation. If the statistic is the sample mean, it is called the standard error of ...
*
Summary statistics In descriptive statistics, summary statistics are used to summarize a set of observations, in order to communicate the largest amount of information as simply as possible. Statisticians commonly try to describe the observations in * a measure of ...


References

{{Reflist


Further reading

* {{cite book, last = Huff, first = Darrell, title = How to Lie with Statistics, year = 1993, publisher = W. W. Norton, isbn = 978-0-393-31072-6, url-access = registration, url = https://archive.org/details/howtoliewithstat00huff


External links


Calculations and comparisons between arithmetic mean and geometric mean of two numbers

Calculate the arithmetic mean of a series of numbers on fxSolver
{{Statistics, descriptive {{Portal bar, Mathematics {{Authority control {{DEFAULTSORT:Arithmetic Mean Means