statistics Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...

, the sample maximum and sample minimum, also called the largest observation and smallest observation, are the values of the greatest and least elements of a

sample Sample or samples may refer to: Base meaning * Sample (statistics), a subset of a population – complete data set * Sample (signal), a digital discrete sample of a continuous analog signal * Sample (material), a specimen or small quantity of s ...

. They are basic

summary statistics In descriptive statistics, summary statistics are used to summarize a set of observations, in order to communicate the largest amount of information as simply as possible. Statisticians commonly try to describe the observations in * a measure of ...

, used in

descriptive statistics A descriptive statistic (in the count noun sense) is a summary statistic that quantitatively describes or summarizes features from a collection of information, while descriptive statistics (in the mass noun sense) is the process of using and an ...

such as the

five-number summary The five-number summary is a set of descriptive statistics that provides information about a dataset. It consists of the five most important sample percentiles: # the sample minimum ''(smallest observation)'' # the lower quartile or ''first quart ...

and Bowley's seven-figure summary and the associated

box plot In descriptive statistics, a box plot or boxplot is a method for graphically demonstrating the locality, spread and skewness groups of numerical data through their quartiles. In addition to the box on a box plot, there can be lines (which are ca ...

. The minimum and the maximum value are the first and last

order statistic In statistics, the ''k''th order statistic of a statistical sample is equal to its ''k''th-smallest value. Together with rank statistics, order statistics are among the most fundamental tools in non-parametric statistics and inference. Import ...

s (often denoted ''X''₍₁₎ and ''X''_(''n'') respectively, for a sample size of ''n''). If the sample has

outliers In statistics, an outlier is a data point that differs significantly from other observations. An outlier may be due to a variability in the measurement, an indication of novel data, or it may be the result of experimental error; the latter are ...

, they necessarily include the sample maximum or sample minimum, or both, depending on whether they are extremely high or low. However, the sample maximum and minimum need not be outliers, if they are not unusually far from other observations.

Robustness

The sample maximum and minimum are the ''least''

robust statistics Robust statistics are statistics with good performance for data drawn from a wide range of probability distributions, especially for distributions that are not normal. Robust statistical methods have been developed for many common problems, suc ...

: they are maximally sensitive to outliers. This can either be an advantage or a drawback: if extreme values are real (not measurement errors), and of real consequence, as in applications of

extreme value theory Extreme value theory or extreme value analysis (EVA) is a branch of statistics dealing with the extreme deviations from the median of probability distributions. It seeks to assess, from a given ordered sample of a given random variable, the pr ...

such as building dikes or financial loss, then outliers (as reflected in sample extrema) are important. On the other hand, if outliers have little or no impact on actual outcomes, then using non-robust statistics such as the sample extrema simply cloud the statistics, and robust alternatives should be used, such as other quantiles: the 10th and 90th

percentiles In statistics, a ''k''-th percentile (percentile score or centile) is a score ''below which'' a given percentage ''k'' of scores in its frequency distribution falls (exclusive definition) or a score ''at or below which'' a given percentage falls ...

(first and last

decile In descriptive statistics, a decile is any of the nine values that divide the sorted data into ten equal parts, so that each part represents 1/10 of the sample or population. A decile is one possible form of a quantile; others include the quartile ...

) are more robust alternatives.

Derived statistics

In addition to being a component of every statistic that uses all elements of the sample, the sample extrema are important parts of the

range Range may refer to: Geography * Range (geographic), a chain of hills or mountains; a somewhat linear, complex mountainous or hilly area (cordillera, sierra) ** Mountain range, a group of mountains bordered by lowlands * Range, a term used to i ...

, a measure of dispersion, and

mid-range In statistics, the mid-range or mid-extreme is a measure of central tendency of a sample defined as the arithmetic mean of the maximum and minimum values of the data set: :M=\frac. The mid-range is closely related to the range, a measure of ...

, a measure of location. They also realize the

maximum absolute deviation The average absolute deviation (AAD) of a data set is the average of the absolute deviations from a central point. It is a summary statistic of statistical dispersion or variability. In the general form, the central point can be a mean, median, ...

: one of them is the ''furthest'' point from any given point, particularly a measure of center such as the median or mean.

Applications

Smooth maximum

For a sample set, the maximum function is non-smooth and thus non-differentiable. For optimization problems that occur in statistics it often needs to be approximated by a smooth function that is close to the maximum of the set. A

smooth maximum In mathematics, a smooth maximum of an indexed family ''x''1, ..., ''x'n'' of numbers is a smooth approximation to the maximum function \max(x_1,\ldots,x_n), meaning a parametric family of functions m_\alpha(x_1,\ldots,x_n) such that ...

, for example, : ''g''(''x''₁, ''x''₂, …, ''x''_''n'') = log( exp(''x''₁) + exp(''x''₂) + … + exp(''x''_''n'') ) is a good approximation of the sample maximum.

Summary statistics

The sample maximum and minimum are basic

, showing the most extreme observations, and are used in the

and a version of the

seven-number summary In descriptive statistics, the seven-number summary is a collection of seven summary statistics, and is an extension of the five-number summary. There are three similar, common forms. As with the five-number summary, it can be represented by a modi ...

and the associated

Prediction interval

The sample maximum and minimum provide a non-parametric

prediction interval In statistical inference, specifically predictive inference, a prediction interval is an estimate of an interval in which a future observation will fall, with a certain probability, given what has already been observed. Prediction intervals are o ...

: in a sample from a population, or more generally an exchangeable sequence of random variables, each observation is equally likely to be the maximum or minimum. Thus if one has a sample

\,

and one picks another observation

X_,

then this has

1/(n+1)

probability of being the largest value seen so far,

1/(n+1)

probability of being the smallest value seen so far, and thus the other

(n-1)/(n+1)

of the time,

X_

falls between the sample maximum and sample minimum of

\.

Thus, denoting the sample maximum and minimum by ''M'' and ''m,'' this yields an

(n-1)/(n+1)

prediction interval of 'm'',''M'' For example, if ''n'' = 19, then 'm'',''M''gives an 18/20 = 90% prediction interval – 90% of the time, the 20th observation falls between the smallest and largest observation seen heretofore. Likewise, ''n'' = 39 gives a 95% prediction interval, and ''n'' = 199 gives a 99% prediction interval.

Estimation

Due to their sensitivity to outliers, the sample extrema cannot reliably be used as

estimators In statistics, an estimator is a rule for calculating an estimate of a given quantity based on observed data: thus the rule (the estimator), the quantity of interest (the estimand) and its result (the estimate) are distinguished. For example, the ...

unless data is clean – robust alternatives include the first and last deciles. However, with clean data or in theoretical settings, they can sometimes prove very good estimators, particularly for

platykurtic In probability theory and statistics, kurtosis (from el, κυρτός, ''kyrtos'' or ''kurtos'', meaning "curved, arching") is a measure of the "tailedness" of the probability distribution of a real number, real-valued random variable. Like skew ...

distributions, where for small data sets the

is the most efficient estimator. They are inefficient estimators of location for mesokurtic distributions, such as the

normal distribution In statistics, a normal distribution or Gaussian distribution is a type of continuous probability distribution for a real-valued random variable. The general form of its probability density function is : f(x) = \frac e^ The parameter \mu ...

, and leptokurtic distributions, however.

Uniform distribution

For sampling without replacement from a uniform distribution with one or two unknown endpoints (so

1,2,\dots,N

with ''N'' unknown, or

M,M+1,\dots,N

with both ''M'' and ''N'' unknown), the sample maximum, or respectively the sample maximum and sample minimum, are

sufficient In logic and mathematics, necessity and sufficiency are terms used to describe a conditional or implicational relationship between two statements. For example, in the conditional statement: "If then ", is necessary for , because the truth of ...

and

complete Complete may refer to: Logic * Completeness (logic) * Completeness of a theory, the property of a theory that every formula in the theory's language or its negation is provable Mathematics * The completeness of the real numbers, which implies t ...

statistics for the unknown endpoints; thus an unbiased estimator derived from these will be UMVU estimator. If only the top endpoint is unknown, the sample maximum is a biased estimator for the population maximum, but the unbiased estimator

\fracm - 1

(where ''m'' is the sample maximum and ''k'' is the sample size) is the UMVU estimator; see

German tank problem In the statistical theory of estimation theory, estimation, the German tank problem consists of estimating the maximum of a discrete uniform distribution from sampling without replacement. In simple terms, suppose there exists an unknown number o ...

for details. If both endpoints are unknown, then the sample range is a biased estimator for the population range, but correcting as for maximum above yields the UMVU estimator. If both endpoints are unknown, then the

is an unbiased (and hence UMVU) estimator of the midpoint of the interval (here equivalently the population median, average, or mid-range). The reason the sample extrema are sufficient statistics is that the conditional distribution of the non-extreme samples is just the distribution for the uniform interval between the sample maximum and minimum – once the endpoints are fixed, the values of the interior points add no additional information.

Normality testing

The sample extrema can be used for a simple

normality test In statistics, normality tests are used to determine if a data set is well-modeled by a normal distribution and to compute how likely it is for a random variable underlying the data set to be normally distributed. More precisely, the tests are a fo ...

, specifically of kurtosis: one computes the

t-statistic In statistics, the ''t''-statistic is the ratio of the departure of the estimated value of a parameter from its hypothesized value to its standard error. It is used in hypothesis testing via Student's ''t''-test. The ''t''-statistic is used in a ...

of the sample maximum and minimum (subtracts

sample mean The sample mean (or "empirical mean") and the sample covariance are statistics computed from a Sample (statistics), sample of data on one or more random variables. The sample mean is the average value (or mean, mean value) of a sample (statistic ...

and divides by the

sample standard deviation In statistics, the standard deviation is a measure of the amount of variation or dispersion of a set of values. A low standard deviation indicates that the values tend to be close to the mean (also called the expected value) of the set, while ...

), and if they are unusually large for the sample size (as per the

three sigma rule 3 is a number, numeral, and glyph. 3, three, or III may also refer to: * AD 3, the third year of the AD era * 3 BC, the third year before the AD era * March, the third month Books * '' Three of Them'' (Russian: ', literally, "three"), a 190 ...

and table therein, or more precisely a

Student's t-distribution In probability and statistics, Student's ''t''-distribution (or simply the ''t''-distribution) is any member of a family of continuous probability distributions that arise when estimating the mean of a normally distributed population in sit ...

), then the kurtosis of the sample distribution deviates significantly from that of the normal distribution. For instance, a daily process should expect a 3σ event once per year (of calendar days; once every year and a half of business days), while a 4σ event happens on average every 40 years of calendar days, 60 years of business days (once in a lifetime), 5σ events happen every 5,000 years (once in recorded history), and 6σ events happen every 1.5 million years (essentially never). Thus if the sample extrema are 6 sigmas from the mean, one has a significant failure of normality. Further, this test is very easy to communicate without involved statistics. These tests of normality can be applied if one faces

kurtosis risk In statistics and decision theory, kurtosis risk is the risk that results when a statistical model assumes the normal distribution, but is applied to observations that have a tendency to occasionally be much farther (in terms of number of standar ...

, for instance.

Extreme value theory

Sample extrema play two main roles in

: * firstly, they give a lower bound on extreme events – events can be at least this extreme, and for this size sample; * secondly, they can sometimes be used in estimators of probability of more extreme events. However, caution must be used in using sample extrema as guidelines: in

heavy-tailed distribution In probability theory, heavy-tailed distributions are probability distributions whose tails are not exponentially bounded: that is, they have heavier tails than the exponential distribution. In many applications it is the right tail of the distrib ...

s or for

non-stationary In mathematics and statistics, a stationary process (or a strict/strictly stationary process or strong/strongly stationary process) is a stochastic process whose unconditional joint probability distribution does not change when shifted in time. Con ...

processes, extreme events can be significantly more extreme than any previously observed event. This is elaborated in

black swan theory The black swan theory or theory of black swan events is a metaphor that describes an event that comes as a surprise, has a major effect, and is often inappropriately rationalized after the fact with the benefit of hindsight. The term is based on ...