Beta Negative Binomial Distribution
   HOME

TheInfoList



OR:

In
probability theory Probability theory is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expressing it through a set ...
, a beta negative binomial distribution is the
probability distribution In probability theory and statistics, a probability distribution is the mathematical function that gives the probabilities of occurrence of different possible outcomes for an experiment. It is a mathematical description of a random phenomenon ...
of a discrete
random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...
 X equal to the number of failures needed to get r successes in a sequence of
independent Independent or Independents may refer to: Arts, entertainment, and media Artist groups * Independents (artist group), a group of modernist painters based in the New Hope, Pennsylvania, area of the United States during the early 1930s * Independe ...
Bernoulli trial In the theory of probability and statistics, a Bernoulli trial (or binomial trial) is a random experiment with exactly two possible outcomes, "success" and "failure", in which the probability of success is the same every time the experiment is c ...
s. The probability p of success on each trial stays constant within any given experiment but varies across different experiments following a
beta distribution In probability theory and statistics, the beta distribution is a family of continuous probability distributions defined on the interval , 1in terms of two positive parameters, denoted by ''alpha'' (''α'') and ''beta'' (''β''), that appear as ...
. Thus the distribution is a
compound probability distribution In probability and statistics, a compound probability distribution (also known as a mixture distribution or contagious distribution) is the probability distribution that results from assuming that a random variable is distributed according to som ...
. This distribution has also been called both the inverse Markov-Pólya distribution and the generalized Waring distributionJohnson et al. (1993) or simply abbreviated as the BNB distribution. A shifted form of the distribution has been called the beta-Pascal distribution. If parameters of the beta distribution are \alpha and \beta, and if : X \mid p \sim \mathrm(r,p), where : p \sim \textrm(\alpha,\beta), then the marginal distribution of X is a beta negative binomial distribution: : X \sim \mathrm(r,\alpha,\beta). In the above, \mathrm(r,p) is the
negative binomial distribution In probability theory and statistics, the negative binomial distribution is a discrete probability distribution that models the number of failures in a sequence of independent and identically distributed Bernoulli trials before a specified (non-r ...
and \textrm(\alpha,\beta) is the
beta distribution In probability theory and statistics, the beta distribution is a family of continuous probability distributions defined on the interval , 1in terms of two positive parameters, denoted by ''alpha'' (''α'') and ''beta'' (''β''), that appear as ...
.


Definition and derivation

Denoting f_(k, q), f_(q, \alpha,\beta) the densities of the negative binomial and beta distributions respectively, we obtain the PMF f(k, \alpha,\beta,r) of the BNB distribution by marginalization: :f(k, \alpha,\beta,r) = \int_0^1 f_(k, r,q) \cdot f_(q, \alpha,\beta) \mathrm q = \int_0^1 \binom (1-q)^k q^r \cdot \frac \mathrm q = \frac \binom \int_0^1 q^(1-q)^ \mathrm q Noting that the integral evaluates to: : \int_0^1 q^(1-q)^ \mathrm q = \frac we can arrive at the following formulas by relatively simple manipulations. If r is an integer, then the PMF can be written in terms of the
beta function In mathematics, the beta function, also called the Euler integral of the first kind, is a special function that is closely related to the gamma function and to binomial coefficients. It is defined by the integral : \Beta(z_1,z_2) = \int_0^1 t^(1 ...
,: :f(k, \alpha,\beta,r)=\binomk\frac. More generally, the PMF can be written :f(k, \alpha,\beta,r)=\frac\frac or :f(k, \alpha,\beta,r)=\frac\frac.


PMF expressed with Gamma

Using the properties of the
Beta function In mathematics, the beta function, also called the Euler integral of the first kind, is a special function that is closely related to the gamma function and to binomial coefficients. It is defined by the integral : \Beta(z_1,z_2) = \int_0^1 t^(1 ...
, the PMF with integer r can be rewritten as: :f(k, \alpha,\beta,r)=\binomk\frac. More generally, the PMF can be written as :f(k, \alpha,\beta,r)=\frac\frac.


PMF expressed with the rising Pochammer symbol

The PMF is often also presented in terms of the Pochammer symbol for integer r :f(k, \alpha,\beta,r)=\frac


Properties


Non-identifiable

The beta negative binomial is non-identifiable which can be seen easily by simply swapping r and \beta in the above density or
characteristic function In mathematics, the term "characteristic function" can refer to any of several distinct concepts: * The indicator function of a subset, that is the function ::\mathbf_A\colon X \to \, :which for a given subset ''A'' of ''X'', has value 1 at points ...
and noting that it is unchanged. Thus
estimation Estimation (or estimating) is the process of finding an estimate or approximation, which is a value that is usable for some purpose even if input data may be incomplete, uncertain, or unstable. The value is nonetheless usable because it is der ...
demands that a constraint be placed on r, \beta or both.


Relation to other distributions

The beta negative binomial distribution contains the beta geometric distribution as a special case when either r=1 or \beta=1. It can therefore approximate the
geometric distribution In probability theory and statistics, the geometric distribution is either one of two discrete probability distributions: * The probability distribution of the number ''X'' of Bernoulli trials needed to get one success, supported on the set \; * ...
arbitrarily well. It also approximates the negative binomial distribution arbitrary well for large \alpha. It can therefore approximate the
Poisson distribution In probability theory and statistics, the Poisson distribution is a discrete probability distribution that expresses the probability of a given number of events occurring in a fixed interval of time or space if these events occur with a known co ...
arbitrarily well for large \alpha, \beta and r.


Heavy tailed

By
Stirling's approximation In mathematics, Stirling's approximation (or Stirling's formula) is an approximation for factorials. It is a good approximation, leading to accurate results even for small values of n. It is named after James Stirling, though a related but less p ...
to the beta function, it can be easily shown that for large k :f(k, \alpha,\beta,r) \sim \frac\frac which implies that the beta negative binomial distribution is
heavy tailed In probability theory, heavy-tailed distributions are probability distributions whose tails are not exponentially bounded: that is, they have heavier tails than the exponential distribution. In many applications it is the right tail of the distrib ...
and that moments less than or equal to \alpha do not exist.


Beta geometric distribution

The beta geometric distribution is an important special case of the beta negative binomial distribution occurring for r=1 . In this case the pmf simplifies to :f(k, \alpha,\beta)=\frac . This distribution is used in some
Buy Till you Die The Buy Till You Die (BTYD) class of statistical models are designed to capture the behavioral characteristics of non-contractual customers, or when the company is not able to directly observe when a customer stops being a customer of a brand. The g ...
(BTYD) models. Further, when \beta=1 the beta geometric reduces to the
Yule–Simon distribution In probability and statistics, the Yule–Simon distribution is a discrete probability distribution named after Udny Yule and Herbert A. Simon. Simon originally called it the ''Yule distribution''. The probability mass function (pmf) of the Yu ...
. However, it is more common to define the Yule-Simon distribution in terms of a shifted version of the beta geometric. In particular, if X \sim BG(\alpha,1) then X+1 \sim YS(\alpha).


Beta negative binomial as a Pólya urn model

In the case when the 3 parameters r, \alpha and \beta are positive integers, the Beta negative binomial can also be motivated by an
urn model In probability and statistics, an urn problem is an idealized mental exercise in which some objects of real interest (such as atoms, people, cars, etc.) are represented as colored balls in an urn or other container. One pretends to remove one or m ...
- or more specifically a basic
Pólya urn model In statistics, a Pólya urn model (also known as a Pólya urn scheme or simply as Pólya's urn), named after George Pólya, is a type of statistical model used as an idealized mental exercise framework, unifying many treatments. In an urn model, ob ...
. Consider an urn initially containing \alpha red balls (the stopping color) and \beta blue balls. At each step of the model, a ball is drawn at random from the urn and replaced, along with one additional ball of the same color. The process is repeated over and over, until r red colored balls are drawn. The random variable X of observed draws of blue balls are distributed according to a \mathrm(r, \alpha, \beta). Note, at the end of the experiment, the urn always contains the fixed number r+\alpha of red balls while containing the random number X+\beta blue balls. By the non-identifiability property, X can be equivalently generated with the urn initially containing \alpha red balls (the stopping color) and r blue balls and stopping when \beta red balls are observed.


See also

*
Negative binomial distribution In probability theory and statistics, the negative binomial distribution is a discrete probability distribution that models the number of failures in a sequence of independent and identically distributed Bernoulli trials before a specified (non-r ...
*
Dirichlet negative multinomial distribution In probability theory and statistics, the Dirichlet negative multinomial distribution is a multivariate distribution on the non-negative integers. It is a multivariate extension of the beta negative binomial distribution. It is also a generaliza ...


Notes


References

*Johnson, N.L.; Kotz, S.; Kemp, A.W. (1993) ''Univariate Discrete Distributions'', 2nd edition, Wiley (Section 6.2.3) *Kemp, C.D.; Kemp, A.W. (1956) "Generalized hypergeometric distributions'', ''
Journal of the Royal Statistical Society The ''Journal of the Royal Statistical Society'' is a peer-reviewed scientific journal of statistics. It comprises three series and is published by Wiley for the Royal Statistical Society. History The Statistical Society of London was founded ...
'', Series B, 18, 202–211 *Wang, Zhaoliang (2011) "One mixed negative binomial distribution with application", ''Journal of Statistical Planning and Inference'', 141 (3), 1153-1160


External links

* Interactive graphic
Univariate Distribution Relationships
{{ProbDistributions, discrete-infinite Discrete distributions Compound probability distributions Factorial and binomial topics