Although the term well-behaved statistic often seems to be used in the scientific literature in somewhat the same way as is
well-behaved
In mathematics, when a mathematical phenomenon runs counter to some intuition, then the phenomenon is sometimes called pathological. On the other hand, if a phenomenon does not run counter to intuition, it is sometimes called well-behaved or n ...
in
mathematics
Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...
(that is, to mean "non-
pathological") it can also be assigned precise mathematical meaning, and in more than one way. In the former case, the meaning of this term will vary from context to context. In the latter case, the mathematical conditions can be used to derive classes of combinations of distributions with statistics which are ''well-behaved'' in each sense.
First Definition: The
variance
In probability theory and statistics, variance is the expected value of the squared deviation from the mean of a random variable. The standard deviation (SD) is obtained as the square root of the variance. Variance is a measure of dispersion ...
of a well-behaved
statistic
A statistic (singular) or sample statistic is any quantity computed from values in a sample which is considered for a statistical purpose. Statistical purposes include estimating a population parameter, describing a sample, or evaluating a hypot ...
al
estimator
In statistics, an estimator is a rule for calculating an estimate of a given quantity based on Sample (statistics), observed data: thus the rule (the estimator), the quantity of interest (the estimand) and its result (the estimate) are distinguish ...
is finite and one condition on its
mean
A mean is a quantity representing the "center" of a collection of numbers and is intermediate to the extreme values of the set of numbers. There are several kinds of means (or "measures of central tendency") in mathematics, especially in statist ...
is that it is
differentiable
In mathematics, a differentiable function of one real variable is a function whose derivative exists at each point in its domain. In other words, the graph of a differentiable function has a non- vertical tangent line at each interior point in ...
in the parameter being estimated.
Second Definition: The statistic is monotonic, well-defined, and locally sufficient.
Conditions for a Well-Behaved Statistic: First Definition
More formally the conditions can be expressed in this way.
is a statistic for
that is a function of the sample,
. For
to be ''well-behaved'' we require:
: Condition 1
differentiable in
, and the derivative satisfies:
: Condition 2
Conditions for a Well-Behaved Statistic: Second Definition
In order to derive the distribution law of the parameter ''T'', compatible with
, the statistic must obey some technical properties. Namely, a statistic ''s'' is said to be well-behaved if it satisfies the following three statements:
# monotonicity. A uniformly monotone relation exists between ''s'' and ? for any fixed seed
– so as to have a unique solution of (1);
# well-defined. On each observed ''s'' the statistic is well defined for every value of ?, i.e. any sample specification
such that
has a probability density different from 0 – so as to avoid considering a non-surjective mapping from
to
, i.e. associating via
to a sample
a ? that could not generate the sample itself;
# local sufficiency.
constitutes a true T sample for the observed ''s'', so that the same probability distribution can be attributed to each sampled value. Now,
is a solution of (1) with the seed
. Since the seeds are equally distributed, the sole caveat comes from their independence or, conversely from their dependence on ? itself. This check can be restricted to seeds involved by ''s'', i.e. this drawback can be avoided by requiring that the distribution of
is independent of ?. An easy way to check this property is by mapping seed specifications into
s specifications. The mapping of course depends on ?, but the distribution of
will not depend on ?, if the above seed independence holds – a condition that looks like a ''local
sufficiency'' of the statistic ''S''.
The remainder of the present article is mainly concerned with the context of
data mining
Data mining is the process of extracting and finding patterns in massive data sets involving methods at the intersection of machine learning, statistics, and database systems. Data mining is an interdisciplinary subfield of computer science and ...
procedures applied to
statistical inference
Statistical inference is the process of using data analysis to infer properties of an underlying probability distribution.Upton, G., Cook, I. (2008) ''Oxford Dictionary of Statistics'', OUP. . Inferential statistical analysis infers properties of ...
and, in particular, to the group of computationally intensive procedure that have been called
algorithmic inference.
Algorithmic inference
In
algorithmic inference, the property of a statistic that is of most relevance is the pivoting step which allows to transference of probability-considerations from the sample distribution to the distribution of the parameters representing the population distribution in such a way that the conclusion of this
statistical inference
Statistical inference is the process of using data analysis to infer properties of an underlying probability distribution.Upton, G., Cook, I. (2008) ''Oxford Dictionary of Statistics'', OUP. . Inferential statistical analysis infers properties of ...
step is compatible with the sample actually observed.
By default, capital letters (such as ''U'', ''X'') will denote random variables and small letters (''u'', ''x'') their corresponding realizations and with gothic letters (such as
) the domain where the variable takes specifications. Facing a sample
, given a
sampling mechanism , with
scalar, for the random variable ''X'', we have
:
The sampling mechanism
, of the statistic ''s'', as a function ? of
with specifications in
, has an explaining function defined by the master equation:
:
for suitable seeds
and parameter ?
Example
For instance, for both the
Bernoulli distribution
In probability theory and statistics, the Bernoulli distribution, named after Swiss mathematician Jacob Bernoulli, is the discrete probability distribution of a random variable which takes the value 1 with probability p and the value 0 with pro ...
with parameter ''p'' and the
exponential distribution
In probability theory and statistics, the exponential distribution or negative exponential distribution is the probability distribution of the distance between events in a Poisson point process, i.e., a process in which events occur continuousl ...
with parameter ? the statistic
is well-behaved. The satisfaction of the above three properties is straightforward when looking at both explaining functions:
if
, 0 otherwise in the case of the Bernoulli random variable, and
for the Exponential random variable, giving rise to statistics
:
and
:
''Vice versa'', in the case of ''X'' following a
continuous uniform distribution
In probability theory and statistics, the continuous uniform distributions or rectangular distributions are a family of symmetric probability distributions. Such a distribution describes an experiment where there is an arbitrary outcome that li ...
on