Pólya urn model
   HOME

TheInfoList



OR:

In
statistics Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
, a Pólya urn model (also known as a Pólya urn scheme or simply as Pólya's urn), named after
George Pólya George Pólya (; hu, Pólya György, ; December 13, 1887 – September 7, 1985) was a Hungarian mathematician. He was a professor of mathematics from 1914 to 1940 at ETH Zürich and from 1940 to 1953 at Stanford University. He made fundamental ...
, is a type of
statistical model A statistical model is a mathematical model that embodies a set of statistical assumptions concerning the generation of Sample (statistics), sample data (and similar data from a larger Statistical population, population). A statistical model repres ...
used as an idealized
mental exercise Brain training (also called cognitive training) is a program of regular activities purported to maintain or improve one's cognitive abilities. The phrase “cognitive ability” usually refers to components of fluid intelligence such as executive ...
framework, unifying many treatments. In an
urn model In probability and statistics, an urn problem is an idealized mental exercise in which some objects of real interest (such as atoms, people, cars, etc.) are represented as colored balls in an urn or other container. One pretends to remove one or ...
, objects of real interest (such as atoms, people, cars, etc.) are represented as colored balls in an
urn An urn is a vase, often with a cover, with a typically narrowed neck above a rounded body and a footed pedestal. Describing a vessel as an "urn", as opposed to a vase or other terms, generally reflects its use rather than any particular shape or ...
or other container. In the basic Pólya urn model, the urn contains ''x'' white and ''y'' black balls; one ball is drawn randomly from the urn and its color observed; it is then returned in the urn, and an additional ball of the same color is added to the urn, and the selection process is repeated. Questions of interest are the evolution of the urn population and the sequence of colors of the balls drawn out. This endows the urn with a self-reinforcing property sometimes expressed as ''
the rich get richer "The rich get richer and the poor get poorer" is an aphorism due to Percy Bysshe Shelley. In ''A Defence of Poetry'' (1821, not published until 1840) Shelley remarked that Utilitarianism, the promoters of utility had exemplified the saying, "To ...
''. Note that in some sense, the Pólya urn model is the "opposite" of the model of
sampling without replacement In statistics, a simple random sample (or SRS) is a subset of individuals (a sample) chosen from a larger set (a population) in which a subset of individuals are chosen randomly, all with the same probability. It is a process of selecting a sample ...
, where every time a particular value is observed, it is less likely to be observed again, whereas in a Pólya urn model, an observed value is ''more'' likely to be observed again. In both of these models, the act of measurement has an effect on the outcome of future measurements. (For comparison, when
sampling with replacement In statistics, a simple random sample (or SRS) is a subset of individuals (a sample) chosen from a larger set (a population) in which a subset of individuals are chosen randomly, all with the same probability. It is a process of selecting a sample ...
, observation of a particular value has no effect on how likely it is to observe that value again.) In a Pólya urn model, successive acts of measurement over time have less and less effect on future measurements, whereas in sampling without replacement, the opposite is true: After a certain number of measurements of a particular value, that value will never be seen again. One of the reasons for interest in ''this particular'' rather elaborate urn model (i.e. with duplication and then replacement of each ball drawn) is that it provides an example in which the count (initially ''x'' black and ''y'' white) of balls in the urn is ''not'' concealed, which is able to approximate the correct updating of ''subjective'' probabilities appropriate to a ''different'' case in which the original urn content ''is'' concealed while ordinary sampling with replacement is conducted (without the Pólya ball-duplication). Because of the simple "sampling with replacement" scheme in this second case, the urn content is now ''static'', but this greater simplicity is compensated for by the assumption that the urn content is now ''unknown'' to an observer. A
Bayesian analysis Bayesian inference is a method of statistical inference in which Bayes' theorem is used to update the probability for a hypothesis as more evidence or information becomes available. Bayesian inference is an important technique in statistics, and e ...
of the observer's uncertainty about the urn's initial content can be made, using a ''particular choice'' of (conjugate) prior distribution. Specifically, suppose that an observer knows that the urn contains only identical balls, each coloured either black or white, but he does not know the absolute number of balls present, nor the proportion that are of each colour. Suppose that he holds prior beliefs about these unknowns: for him the probability distribution of the urn content is well approximated by some prior distribution for the total number of balls in the urn, and a beta prior distribution with parameters ''(x,y)'' for the initial proportion of these which are black, this proportion being (for him) considered approximately independent of the total number. Then the process of outcomes of a succession of draws from the urn (with replacement but without the duplication) has ''approximately the same probability law'' as does the above Pólya scheme in which the actual urn content was not hidden from him. The approximation error here relates to the fact that an urn containing a known finite number ''m'' of balls of course cannot have an ''exactly'' beta-distributed unknown proportion of black balls, since the domain of possible values for that proportion are confined to being multiples of 1/m, rather than having the full freedom to assume any value in the continuous unit interval, as would an ''exactly'' beta distributed proportion. This slightly informal account is provided for reason of motivation, and can be made more mathematically precise. This basic Pólya urn model has been enriched and generalized in many ways.


Distributions related to the Pólya urn

*
beta-binomial distribution In probability theory and statistics, the beta-binomial distribution is a family of discrete probability distributions on a finite support of non-negative integers arising when the probability of success in each of a fixed or known number of B ...
: The distribution of the number of successful draws (trials), e.g. number of extractions of white ball, given n draws from a Pólya urn. *
Beta negative binomial distribution In probability theory, a beta negative binomial distribution is the probability distribution of a discrete random variable X equal to the number of failures needed to get r successes in a sequence of independent Bernoulli trials. The probabi ...
: The distribution of number of white balls observed until a fixed number black balls are observed. *
Dirichlet-multinomial distribution In probability theory and statistics, the Dirichlet-multinomial distribution is a family of discrete multivariate probability distributions on a finite support of non-negative integers. It is also called the Dirichlet compound multinomial distribut ...
(also known as the ''multivariate Pólya distribution''): The distribution over the number of balls of each color, given n draws from a Pólya urn where there are k different colors instead of only two. *
Dirichlet negative multinomial distribution In probability theory and statistics, the Dirichlet negative multinomial distribution is a multivariate distribution on the non-negative integers. It is a multivariate extension of the beta negative binomial distribution. It is also a generaliz ...
: The distribution over the number of balls of each color until a fixed number of stopping colored balls are observed. * martingales, the
Beta-binomial distribution In probability theory and statistics, the beta-binomial distribution is a family of discrete probability distributions on a finite support of non-negative integers arising when the probability of success in each of a fixed or known number of B ...
and the
beta distribution In probability theory and statistics, the beta distribution is a family of continuous probability distributions defined on the interval , 1in terms of two positive parameters, denoted by ''alpha'' (''α'') and ''beta'' (''β''), that appear as ...
: Let ''w'' and ''b'' be the number of white and black balls initially in the urn, and w+n_w the number of white balls currently in the urn after ''n'' draws. Then the sequence of values \frac for n=1,2,3,\dots is a normalized version of the
Beta-binomial distribution In probability theory and statistics, the beta-binomial distribution is a family of discrete probability distributions on a finite support of non-negative integers arising when the probability of success in each of a fixed or known number of B ...
. It is a martingale and converges to the
beta distribution In probability theory and statistics, the beta distribution is a family of continuous probability distributions defined on the interval , 1in terms of two positive parameters, denoted by ''alpha'' (''α'') and ''beta'' (''β''), that appear as ...
when ''n'' → ∞. *
Dirichlet process In probability theory, Dirichlet processes (after the distribution associated with Peter Gustav Lejeune Dirichlet) are a family of stochastic processes whose realizations are probability distributions. In other words, a Dirichlet process is a p ...
,
Chinese restaurant process In probability theory, the Chinese restaurant process is a discrete-time stochastic process, analogous to seating customers at tables in a restaurant. Imagine a restaurant with an infinite number of circular tables, each with infinite capacity. C ...
, Hoppe urn: Imagine a modified Pólya urn scheme as follows. We start with an urn with \alpha black balls. When drawing a ball from the urn, if we draw a black ball, put the ball back along with a new ball of a new non-black color randomly generated from a uniform distribution over an infinite set of available colours, and consider the newly generated color to be the "value" of the draw. Otherwise, put the ball back along with another ball of the same color, as for the standard Pólya urn scheme. The colors of an infinite sequence of draws from this modified Pólya urn scheme follow a
Chinese restaurant process In probability theory, the Chinese restaurant process is a discrete-time stochastic process, analogous to seating customers at tables in a restaurant. Imagine a restaurant with an infinite number of circular tables, each with infinite capacity. C ...
. If, instead of generating a new color, we draw a random value from a given base distribution and use that value to label the ball, the labels of an infinite sequence of draws follow a
Dirichlet process In probability theory, Dirichlet processes (after the distribution associated with Peter Gustav Lejeune Dirichlet) are a family of stochastic processes whose realizations are probability distributions. In other words, a Dirichlet process is a p ...
. * Moran model: An urn model used to model
genetic drift Genetic drift, also known as allelic drift or the Wright effect, is the change in the frequency of an existing gene variant (allele) in a population due to random chance. Genetic drift may cause gene variants to disappear completely and there ...
in theoretical
population genetics Population genetics is a subfield of genetics that deals with genetic differences within and between populations, and is a part of evolutionary biology. Studies in this branch of biology examine such phenomena as adaptation, speciation, and pop ...
. This is closely similar to the Pólya urn model except that, in addition to adding a new ball of the same color, a randomly drawn ball is removed from the urn. The number of balls in the urn thus remains constant. Continued sampling then leads ultimately to an urn with all balls of one color, the probability of each color being the proportion of that color in the original urn. There are variants of the Moran model that insist that the ball removed from the urn be a different ball from one originally sampled in that step, and variants that do the removal of a ball immediately after the new ball is placed in the urn, so that the new ball is one of the balls available to be removed. This makes a small difference in the time taken to reach the state in which all balls are the same color. The Moran process models genetic drift in a population with overlapping generations.


Mathematical Derivation

Suppose we have an urn containing \gamma white balls and \alpha black balls. Each time we pick a random ball from the urn, record its color and return the ball to the urn with another ball of the same color. Every time we draw a new ball from the urn, we define a random variable called X_i for the color of the ball. X_i = 1 if the color of the ball is black and X_i = 0 otherwise. The more random variables prior to X_i that are one, the more likely that X_i will also be one, because more black balls have been added to the urn. Therefore, these variables are not independent of each other, but as shown below, they are interchangeable. Assume that n balled are picked from the urn, and out of this n balls k balls are back and n-k are white. The first time the number of balls in the urn is \gamma+\alpha, the second time it is \gamma+\alpha+1 and so on. So in the i-th time, the number of balls will be \gamma+\alpha+i-1. Now suppose that we have seen all the black balls before the white balls, the probability of this event is simply:\frac\times \frac\times \cdots \times \frac \times \frac\times \frac\cdots \frac Now we have to prove that if the order of black and white balls is changed arbitrarily, there will be no change in the final probability. As we can see in the line above, the denominators of the fractions will not change by changing the order of observing the balls, the ith denominator will always be \gamma+\alpha+i-1, because this is the number of balls in the urn at that round. If we see j-th black ball in round t, the probability X_t = 1 will be equal to \frac, i.e. the numerator will be equal to \alpha+j-1. With a similar argument, we can also calculate the probability for white balls. Therefore, the final probability will be equal to the following expression: \begin p(X_1=x_1,X_2=x_2,...,X_n=x_n) &= \frac \\ &= \frac \end This probability is not related to the order of seeing black and white balls and only depends on the total number of white balls and the total number of black balls. According to the
De Finetti's theorem In probability theory, de Finetti's theorem states that exchangeable observations are conditionally independent relative to some latent variable. An epistemic probability distribution could then be assigned to this variable. It is named in hono ...
, there must be a unique prior distribution such that the joint distribution of observing the sequence is a Bayesian mixture of the Bernoulli probabilities. It can be shown that this prior distribution is a
beta distribution In probability theory and statistics, the beta distribution is a family of continuous probability distributions defined on the interval , 1in terms of two positive parameters, denoted by ''alpha'' (''α'') and ''beta'' (''β''), that appear as ...
with parameters \beta\left(\cdot; \,\gamma,\, \alpha\right). In the De Finetti's theorem, if \pi(\cdot) with \beta\left(\cdot; \,\gamma,\, \alpha\right) we get the previous equation: \begin p(X_1=x_1,X_2=x_2,...,X_n=x_n) &= \int \theta^\left(\right)\times \left(1-\theta\right)^\left(n - \right)\,\beta\left(\theta; \alpha,\, \gamma\right)d\left(\theta\right)\\ &= \int \theta^\left(\right)\times \left(1-\theta\right)^\left(n - \right)\,\dfrac\theta^(1-\theta)^d\left(\theta\right)\\ &= \int \theta^\left(\right)\times \left(1-\theta\right)^\left(n + \alpha -1 -\right)\,\dfracd\left(\theta\right)\\ &= \int \theta^\left(\right)\times \left(1-\theta\right)^\left(n -k - 1+ \alpha\right)\,\dfracd\left(\theta\right)\\ &= \dfrac \int \theta^\left(\right)\times \left(1-\theta\right)^\left(n-k+\alpha- 1\right)\,d\left(\theta\right)\\ &= \dfrac \dfrac\\ &= \dfrac \end In this equation k = \sum_^n x_i.


See also

*
Pitman–Yor process In probability theory, a Pitman–Yor process denoted PY(''d'', ''θ'', ''G''0), is a stochastic process whose sample path is a probability distribution. A random sample from this process is an infinite discrete probability distribution ...
* Moran process *
Yule process A preferential attachment process is any of a class of processes in which some quantity, typically some form of wealth or credit, is distributed among a number of individuals or objects according to how much they already have, so that those who ...
*
De Finetti's theorem In probability theory, de Finetti's theorem states that exchangeable observations are conditionally independent relative to some latent variable. An epistemic probability distribution could then be assigned to this variable. It is named in hono ...
*
Chinese restaurant process In probability theory, the Chinese restaurant process is a discrete-time stochastic process, analogous to seating customers at tables in a restaurant. Imagine a restaurant with an infinite number of circular tables, each with infinite capacity. C ...


References


Further reading

* F. Alajaji and T. Fuja, "A Communication Channel Modeled on Contagion," IEEE Transactions on Information Theory, Vol. 40, pp. 2035–2041, November 1994. * A. Banerjee, P. Burlina and F. Alajaji, "Image Segmentation and Labeling Using the Pólya Urn Model," IEEE Transactions on Image Processing, Vol. 8, No. 9, pp. 1243–1253, September 1999.


Bibliography

* N.L. Johnson and S.Kotz, (1977) "Urn Models and Their Application." John Wiley. * Hosam Mahmoud, (2008) "Pólya Urn Models." Chapman and Hall/CRC. . {{DEFAULTSORT:Polya urn model Probabilistic models