statistics Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...

, the sample maximum and sample minimum, also called the largest observation and smallest observation, are the values of the greatest and least elements of a sample. They are basic

summary statistics In descriptive statistics, summary statistics are used to summarize a set of observations, in order to communicate the largest amount of information as simply as possible. Statisticians commonly try to describe the observations in * a measure of ...

, used in

descriptive statistics A descriptive statistic (in the count noun sense) is a summary statistic that quantitatively describes or summarizes features from a collection of information, while descriptive statistics (in the mass noun sense) is the process of using and an ...

such as the

five-number summary The five-number summary is a set of descriptive statistics that provides information about a dataset. It consists of the five most important sample percentiles: # the sample minimum ''(smallest observation)'' # the lower quartile or ''first quar ...

and Bowley's seven-figure summary and the associated

box plot In descriptive statistics, a box plot or boxplot is a method for demonstrating graphically the locality, spread and skewness groups of numerical data through their quartiles. In addition to the box on a box plot, there can be lines (which are ca ...

. The minimum and the maximum value are the first and last

order statistic In statistics, the ''k''th order statistic of a statistical sample is equal to its ''k''th-smallest value. Together with Ranking (statistics), rank statistics, order statistics are among the most fundamental tools in non-parametric statistics and ...

s (often denoted ''X''₍₁₎ and ''X''_(''n'') respectively, for a sample size of ''n''). If the sample has

outliers In statistics, an outlier is a data point that differs significantly from other observations. An outlier may be due to a variability in the measurement, an indication of novel data, or it may be the result of experimental error; the latter ar ...

, they necessarily include the sample maximum or sample minimum, or both, depending on whether they are extremely high or low. However, the sample maximum and minimum need not be outliers, if they are not unusually far from other observations.

Robustness

The sample maximum and minimum are the ''least''

robust statistics Robust statistics are statistics that maintain their properties even if the underlying distributional assumptions are incorrect. Robust Statistics, statistical methods have been developed for many common problems, such as estimating location parame ...

: they are maximally sensitive to outliers. This can either be an advantage or a drawback: if extreme values are real (not measurement errors), and of real consequence, as in applications of

extreme value theory Extreme value theory or extreme value analysis (EVA) is the study of extremes in statistical distributions. It is widely used in many disciplines, such as structural engineering, finance, economics, earth sciences, traffic prediction, and Engin ...

such as building dikes or financial loss, then outliers (as reflected in sample extrema) are important. On the other hand, if outliers have little or no impact on actual outcomes, then using non-robust statistics such as the sample extrema simply clouds the statistics, and robust alternatives should be used, such as other quantiles: the 10th and 90th

percentiles In statistics, a ''k''-th percentile, also known as percentile score or centile, is a score (e.g., a data point) a given percentage ''k'' of all scores in its frequency distribution exists ("exclusive" definition) or a score a given percentage ...

(first and last decile) are more robust alternatives.

Derived statistics

In addition to being a component of every statistic that uses all elements of the sample, the sample extrema are important parts of the

range Range may refer to: Geography * Range (geographic), a chain of hills or mountains; a somewhat linear, complex mountainous or hilly area (cordillera, sierra) ** Mountain range, a group of mountains bordered by lowlands * Range, a term used to i ...

, a measure of dispersion, and

mid-range In statistics, the mid-range or mid-extreme is a measure of central tendency of a sample defined as the arithmetic mean of the maximum and minimum values of the data set: :M=\frac. The mid-range is closely related to the range, a measure of ...

, a measure of location. They also realize the

maximum absolute deviation The average absolute deviation (AAD) of a data set is the average of the absolute deviations from a central point. It is a summary statistic of statistical dispersion or variability. In the general form, the central point can be a mean, median, ...

: one of them is the ''furthest'' point from any given point, particularly a measure of center such as the median or mean.

Applications

Smooth maximum

For a sample set, the maximum function is non-smooth and thus non-differentiable. For optimization problems that occur in statistics it often needs to be approximated by a smooth function that is close to the maximum of the set. A smooth maximum, for example, : ''g''(''x''₁, ''x''₂, …, ''x''_''n'') = log( exp(''x''₁) + exp(''x''₂) + … + exp(''x''_''n'') ) is a good approximation of the sample maximum.

Summary statistics

The sample maximum and minimum are basic

, showing the most extreme observations, and are used in the

and a version of the

seven-number summary In descriptive statistics, the seven-number summary is a collection of seven summary statistics, and is an extension of the five-number summary. There are three similar, common forms. As with the five-number summary, it can be represented by a m ...

and the associated

Prediction interval

The sample maximum and minimum provide a non-parametric

prediction interval In statistical inference, specifically predictive inference, a prediction interval is an estimate of an interval (statistics), interval in which a future observation will fall, with a certain probability, given what has already been observed. Pr ...

: in a sample from a population, or more generally an exchangeable sequence of random variables, each observation is equally likely to be the maximum or minimum. Thus if one has a sample

\,

and one picks another observation

X_,

then this has

1/(n+1)

probability of being the largest value seen so far,

1/(n+1)

probability of being the smallest value seen so far, and thus the other

(n-1)/(n+1)

of the time,

X_

falls between the sample maximum and sample minimum of

\.

Thus, denoting the sample maximum and minimum by ''M'' and ''m,'' this yields an

(n-1)/(n+1)

prediction interval of 'm'',''M'' For example, if ''n'' = 19, then 'm'',''M''gives an 18/20 = 90% prediction interval – 90% of the time, the 20th observation falls between the smallest and largest observation seen heretofore. Likewise, ''n'' = 39 gives a 95% prediction interval, and ''n'' = 199 gives a 99% prediction interval.

Estimation

Due to their sensitivity to outliers, the sample extrema cannot reliably be used as

estimators In statistics, an estimator is a rule for calculating an estimate of a given quantity based on observed data: thus the rule (the estimator), the quantity of interest (the estimand) and its result (the estimate) are distinguished. For example, the ...

unless data is clean – robust alternatives include the first and last deciles. However, with clean data or in theoretical settings, they can sometimes prove very good estimators, particularly for

platykurtic In probability theory and statistics, kurtosis (from , ''kyrtos'' or ''kurtos'', meaning "curved, arching") refers to the degree of “tailedness” in the probability distribution of a real-valued random variable. Similar to skewness, kurtosis ...

distributions, where for small data sets the

is the most efficient estimator. They are inefficient estimators of location for mesokurtic distributions, such as the

normal distribution In probability theory and statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is f(x) = \frac ...

, and leptokurtic distributions, however.

Uniform distribution

For sampling without replacement from a uniform distribution with one or two unknown endpoints (so

1,2,\dots,N

with ''N'' unknown, or

M,M+1,\dots,N

with both ''M'' and ''N'' unknown), the sample maximum, or respectively the sample maximum and sample minimum, are sufficient and complete statistics for the unknown endpoints; thus an unbiased estimator derived from these will be

UMVU In statistics a minimum-variance unbiased estimator (MVUE) or uniformly minimum-variance unbiased estimator (UMVUE) is an unbiased estimator that has lower variance than any other unbiased estimator for all possible values of the parameter. For pra ...

estimator. If only the top endpoint is unknown, the sample maximum is a biased estimator for the population maximum, but the unbiased estimator

\fracm - 1

(where ''m'' is the sample maximum and ''k'' is the sample size) is the UMVU estimator; see

German tank problem German(s) may refer to: * Germany, the country of the Germans and German things **Germania (Roman era) * Germans, citizens of Germany, people of German ancestry, or native speakers of the German language ** For citizenship in Germany, see also Ge ...

for details. If both endpoints are unknown, then the sample range is a biased estimator for the population range, but correcting as for maximum above yields the UMVU estimator. If both endpoints are unknown, then the

is an unbiased (and hence UMVU) estimator of the midpoint of the interval (here equivalently the population median, average, or mid-range). The reason the sample extrema are sufficient statistics is that the conditional distribution of the non-extreme samples is just the distribution for the uniform interval between the sample maximum and minimum – once the endpoints are fixed, the values of the interior points add no additional information.

Normality testing

The sample extrema can be used for a simple normality test, specifically of kurtosis: one computes the

t-statistic In statistics, the ''t''-statistic is the ratio of the difference in a number’s estimated value from its assumed value to its standard error. It is used in hypothesis testing via Student's ''t''-test. The ''t''-statistic is used in a ''t''-t ...

of the sample maximum and minimum (subtracts

sample mean The sample mean (sample average) or empirical mean (empirical average), and the sample covariance or empirical covariance are statistics computed from a sample of data on one or more random variables. The sample mean is the average value (or me ...

and divides by the

sample standard deviation In statistics, the standard deviation is a measure of the amount of variation of the values of a variable about its mean. A low standard deviation indicates that the values tend to be close to the mean (also called the expected value) of the ...

), and if they are unusually large for the sample size (as per the three sigma rule and table therein, or more precisely a

Student's t-distribution In probability theory and statistics, Student's distribution (or simply the distribution) t_\nu is a continuous probability distribution that generalizes the Normal distribution#Standard normal distribution, standard normal distribu ...

), then the kurtosis of the sample distribution deviates significantly from that of the normal distribution. For instance, a daily process should expect a 3σ event once per year (of calendar days; once every year and a half of business days), while a 4σ event happens on average every 40 years of calendar days, 60 years of business days (once in a lifetime), 5σ events happen every 5,000 years (once in recorded history), and 6σ events happen every 1.5 million years (essentially never). Thus if the sample extrema are 6 sigmas from the mean, one has a significant failure of normality. Further, this test is very easy to communicate without involved statistics. These tests of normality can be applied if one faces

kurtosis risk In statistics and decision theory, kurtosis risk is the risk that results when a statistical model assumes the normal distribution, but is applied to observations that have a tendency to occasionally be much farther (in terms of number of standar ...

, for instance.

Extreme value theory

Sample extrema play two main roles in

: * first, they give a lower bound on extreme events – events can be at least this extreme, and for this size sample; * second, they can sometimes be used in estimators of probability of more extreme events. However, caution must be used in using sample extrema as guidelines: in

heavy-tailed distribution In probability theory, heavy-tailed distributions are probability distributions whose tails are not exponentially bounded: that is, they have heavier tails than the exponential distribution. Roughly speaking, “heavy-tailed” means the distribu ...

s or for non-stationary processes, extreme events can be significantly more extreme than any previously observed event. This is elaborated in

black swan theory The black swan theory or theory of black swan events is a metaphor that describes an event that comes as a surprise, has a major effect, and is often inappropriately rationalized after the fact with the benefit of hindsight. The term arose from ...

References

{{DEFAULTSORT:Sample Maximum And Minimum Summary statistics