In statistics, a truncated distribution is a conditional distribution that results from restricting the domain of some other probability distribution. Truncated distributions arise in practical statistics in cases where the ability to record, or even to know about, occurrences is limited to values which lie above or below a given threshold or within a specified range. For example, if the dates of birth of children in a school are examined, these would typically be subject to truncation relative to those of all children in the area, given that the school accepts only children in a given age range on a specific date. If the school were the only source of information, nothing would be known about how many children in the locality had dates of birth before or after the school's cutoff dates. Where sampling is such as to retain knowledge of items that fall outside the required range, without recording the actual values, this is known as censoring, as opposed to the truncation here.

Definition

The following discussion is in terms of a random variable having a continuous distribution, although the same ideas apply to discrete distributions. Similarly, the discussion assumes that truncation is to a semi-open interval ''y'' ∈ (''a'', ''b''], but other possibilities can be handled straightforwardly.

Suppose we have a random variable ''X'' that is distributed according to some probability density function f(x), with cumulative distribution function F(x), both of which have infinite support. Suppose we wish to know the probability density of the random variable after restricting the support to lie between two constants, so that the support is ''y'' = (''a'', ''b'']. That is to say, suppose we wish to know how ''X'' is distributed given ''a'' < ''X'' ≤ ''b'':

: f(x \mid a < X \leq b) = \frac{g(x)}{F(b) - F(a)} = \frac{f(x) \cdot I(\{a < x \leq b\})}{F(b) - F(a)} \propto_x f(x) \cdot I(\{a < x \leq b\})

where g(x) = f(x) for all ''a'' < ''x'' ≤ ''b'' and g(x) = 0 everywhere else. That is, g(x) = f(x) \cdot I(\{a < x \leq b\}), where ''I'' is the indicator function. Note that the denominator in the truncated distribution is constant with respect to ''x''.

Notice that f(x \mid a < X \leq b) is in fact a density:

: \int_{a}^{b} f(x \mid a < X \leq b) \, dx = \frac{1}{F(b) - F(a)} \int_{a}^{b} g(x) \, dx = 1 .
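The normalization above can be checked numerically. The following is a minimal sketch, assuming a standard normal parent distribution and the arbitrary interval (-1, 2]; the helper names `normal_pdf`, `normal_cdf`, `truncated_pdf` and `integrate` are illustrative and not part of the original text.

```python
import math


def normal_pdf(x):
    """Density f(x) of the standard normal distribution (assumed parent)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def normal_cdf(x):
    """CDF F(x) of the standard normal distribution, via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def truncated_pdf(x, a, b, pdf=normal_pdf, cdf=normal_cdf):
    """f(x | a < X <= b) = f(x) * I(a < x <= b) / (F(b) - F(a))."""
    if not (a < x <= b):
        return 0.0  # the indicator function is zero outside (a, b]
    return pdf(x) / (cdf(b) - cdf(a))


def integrate(func, lo, hi, n=20_000):
    """Midpoint-rule approximation of the integral of func over [lo, hi]."""
    h = (hi - lo) / n
    return h * sum(func(lo + (i + 0.5) * h) for i in range(n))


a, b = -1.0, 2.0  # illustrative truncation limits
total = integrate(lambda x: truncated_pdf(x, a, b), a, b)
print(f"integral of the truncated density over (a, b]: {total:.6f}")
```

The printed value should be very close to 1, and inside the interval the truncated density is larger than the parent density because it is divided by F(b) - F(a) < 1.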

Truncated distributions need not have parts removed from the top and bottom. A truncated distribution where just the bottom of the distribution has been removed is as follows:

: f(x \mid X > y) = \frac{g(x)}{1 - F(y)}

where g(x) = f(x) for all ''y'' < ''x'' and g(x) = 0 everywhere else, and F(x) is the cumulative distribution function.

A truncated distribution where the top of the distribution has been removed is as follows:

: f(x \mid X \leq y) = \frac{g(x)}{F(y)}

where g(x) = f(x) for all ''x'' ≤ ''y'' and g(x) = 0 everywhere else, and F(x) is the cumulative distribution function.

Expectation of truncated random variable

Suppose we wish to find the expected value of a random variable ''X'' distributed according to the density f(x), with cumulative distribution function F(x), given that ''X'' is greater than some known value ''y''. The expectation of the truncated random variable is thus:

: E(X \mid X > y) = \frac{\int_{y}^{\infty} x \, g(x) \, dx}{1 - F(y)}

where again g(x) = f(x) for all ''x'' > ''y'' and g(x) = 0 everywhere else.

Letting ''a'' and ''b'' be the lower and upper limits respectively of the support of the original density function ''f'' (which we assume is continuous), properties of E(u(X) \mid X > y), where ''u'' is some continuous function with a continuous derivative, include:

# \lim_{y \to a} E(u(X) \mid X > y) = E(u(X))
# \lim_{y \to b} E(u(X) \mid X > y) = u(b)
# \frac{\partial}{\partial y} [E(u(X) \mid X > y)] = \frac{f(y)}{1 - F(y)} [E(u(X) \mid X > y) - u(y)]
#: and \frac{\partial}{\partial y} [E(u(X) \mid X < y)] = \frac{f(y)}{F(y)} [u(y) - E(u(X) \mid X < y)]
# \lim_{y \to a} \frac{\partial}{\partial y} [E(u(X) \mid X > y)] = f(a) [E(u(X)) - u(a)]
# \lim_{y \to b} \frac{\partial}{\partial y} [E(u(X) \mid X > y)] = \frac{1}{2} u'(b)

provided that the limits exist, that is, \lim_{y \to c} u'(y) = u'(c), \lim_{y \to c} u(y) = u(c) and \lim_{y \to c} f(y) = f(c), where ''c'' represents either ''a'' or ''b''.
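The expectation formula and the derivative property for the lower-truncated case can be checked numerically. The sketch below assumes an exponential parent distribution with a hypothetical rate of 2, chosen only because its truncated mean has the known closed form y + 1/rate; the names `RATE`, `truncated_expectation` and `integrate`, and the finite upper limit standing in for infinity, are illustrative assumptions.

```python
import math

RATE = 2.0  # hypothetical rate of the exponential parent distribution


def pdf(x):
    """Exponential density f(x) = RATE * exp(-RATE * x) for x >= 0."""
    return RATE * math.exp(-RATE * x)


def cdf(x):
    """Exponential CDF F(x) = 1 - exp(-RATE * x) for x >= 0."""
    return 1.0 - math.exp(-RATE * x)


def integrate(func, lo, hi, n=100_000):
    """Midpoint-rule approximation of the integral of func over [lo, hi]."""
    h = (hi - lo) / n
    return h * sum(func(lo + (i + 0.5) * h) for i in range(n))


def truncated_expectation(u, y, upper=30.0):
    """E(u(X) | X > y) = int_y^inf u(x) f(x) dx / (1 - F(y)).

    `upper` replaces infinity; the neglected tail is tiny for this density.
    """
    numerator = integrate(lambda x: u(x) * pdf(x), y, upper)
    return numerator / (1.0 - cdf(y))


def u(x):
    """Identity function, so that E(u(X) | X > y) is the truncated mean."""
    return x


y = 1.0
expectation = truncated_expectation(u, y)

# Left-hand side of the derivative property: central finite difference in y.
step = 1e-3
lhs = (truncated_expectation(u, y + step)
       - truncated_expectation(u, y - step)) / (2.0 * step)

# Right-hand side: f(y) / (1 - F(y)) * (E(u(X) | X > y) - u(y)).
rhs = pdf(y) / (1.0 - cdf(y)) * (expectation - u(y))

print(f"E(X | X > y) = {expectation:.6f}, closed form = {y + 1.0 / RATE:.6f}")
print(f"derivative (finite difference) = {lhs:.6f}, formula = {rhs:.6f}")
```

For the exponential distribution both sides of the derivative identity should be close to 1, since the truncated mean y + 1/RATE grows linearly in y.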

Examples

The truncated normal distribution is an important example; see Johnson, N.L., Kotz, S., Balakrishnan, N. (1994) ''Continuous Univariate Distributions, Volume 1'', Wiley (Section 10.1). The Tobit model employs truncated distributions. Other examples include the binomial distribution truncated at x = 0 and the Poisson distribution truncated at x = 0.
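As an illustration of the discrete case, the sketch below assumes that "truncated at x = 0" means the zero outcome is removed, so that only counts of 1 or more can be observed. The function names and the value of `lam` are illustrative choices, not taken from the original text.

```python
import math


def poisson_pmf(k, lam):
    """P(X = k) for a Poisson random variable with mean lam."""
    return math.exp(-lam) * lam ** k / math.factorial(k)


def zero_truncated_poisson_pmf(k, lam):
    """P(X = k | X > 0) = P(X = k) / (1 - P(X = 0)) for k >= 1, else 0."""
    if k < 1:
        return 0.0  # the value 0 has been truncated away
    return poisson_pmf(k, lam) / (1.0 - poisson_pmf(0, lam))


lam = 1.5  # illustrative Poisson mean
# Terms beyond k = 59 are negligible for this value of lam.
probs = [zero_truncated_poisson_pmf(k, lam) for k in range(60)]
total = sum(probs)
mean = sum(k * p for k, p in enumerate(probs))

print(f"total probability: {total:.6f}")
print(f"truncated mean: {mean:.6f}")
print(f"lam / (1 - exp(-lam)): {lam / (1.0 - math.exp(-lam)):.6f}")
```

The renormalized probabilities should sum to 1, and the truncated mean should agree with lam / (1 - exp(-lam)), which exceeds the untruncated mean because the zero outcome is excluded.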

Random truncation

Suppose we have the following set-up: a truncation value ''t'' is selected at random from a density g(t), but this value is not observed. Then a value ''x'' is selected at random from the truncated distribution f(x \mid t) = Tr(x). Suppose we observe ''x'' and wish to update our belief about the density of ''t'' given the observation.

First, by definition:

: f(x) = \int_{x}^{\infty} f(x \mid t) g(t) \, dt ,

and

: F(a) = \int_{-\infty}^{a} \left[ \int_{x}^{\infty} f(x \mid t) g(t) \, dt \right] dx .

Notice that ''t'' must be greater than ''x''; hence, when we integrate over ''t'', we set a lower bound of ''x''. The functions f(x) and F(x) are the unconditional density and unconditional cumulative distribution function, respectively.

By Bayes' rule,

: g(t \mid x) = \frac{f(x \mid t) g(t)}{f(x)} ,

which expands to

: g(t \mid x) = \frac{f(x \mid t) g(t)}{\int_{x}^{\infty} f(x \mid t) g(t) \, dt} .

Two uniform distributions (example)

Suppose we know that ''t'' is uniformly distributed on [0, ''T''] and that ''x'' | ''t'' is uniformly distributed on [0, ''t'']. Let g(t) and f(x \mid t) be the densities that describe ''t'' and ''x'' respectively. Suppose we observe a value of ''x'' and wish to know the distribution of ''t'' given that value of ''x''. Here g(t) = 1/T and f(x \mid t) = 1/t, so f(x) = \int_{x}^{T} \frac{1}{tT} \, dt = \frac{\ln(T) - \ln(x)}{T}, and therefore

: g(t \mid x) = \frac{f(x \mid t) g(t)}{f(x)} = \frac{1}{t (\ln(T) - \ln(x))} \quad \text{for all } x < t \leq T .
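The closed form for the two-uniform example can be compared with a direct numerical application of Bayes' rule. This is a minimal sketch, assuming ''t'' is uniform on [0, T] and ''x'' given ''t'' is uniform on [0, t]; the values T = 10 and x = 2 and all function names are arbitrary illustrative choices.

```python
import math


def posterior_density(t, x, T):
    """Closed form g(t | x) = 1 / (t * (ln T - ln x)) for x < t <= T, else 0."""
    if not (x < t <= T):
        return 0.0
    return 1.0 / (t * (math.log(T) - math.log(x)))


def integrate(func, lo, hi, n=100_000):
    """Midpoint-rule approximation of the integral of func over [lo, hi]."""
    h = (hi - lo) / n
    return h * sum(func(lo + (i + 0.5) * h) for i in range(n))


T, x = 10.0, 2.0  # illustrative upper limit and observed value


def joint(t):
    """f(x | t) * g(t) = (1 / t) * (1 / T) for x < t <= T."""
    return (1.0 / t) * (1.0 / T)


# Unconditional density f(x): integrate the joint density over t from x to T.
marginal = integrate(joint, x, T)

t0 = 5.0  # point at which the two expressions are compared
bayes_value = joint(t0) / marginal
closed_form = posterior_density(t0, x, T)

# The posterior should integrate to 1 over (x, T].
total = integrate(lambda t: posterior_density(t, x, T), x, T)

print(f"Bayes' rule at t0: {bayes_value:.8f}")
print(f"closed form at t0: {closed_form:.8f}")
print(f"integral of posterior: {total:.8f}")
```

The two values at `t0` should agree to within the integration error, and the posterior should integrate to approximately 1 over (x, T].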
