Vandermonde's Identity
   HOME

TheInfoList



OR:

In combinatorics, Vandermonde's identity (or Vandermonde's convolution) is the following identity for binomial coefficients: :=\sum_^r for any nonnegative
integer An integer is the number zero (), a positive natural number (, , , etc.) or a negative integer with a minus sign ( −1, −2, −3, etc.). The negative numbers are the additive inverses of the corresponding positive numbers. In the languag ...
s ''r'', ''m'', ''n''. The identity is named after
Alexandre-Théophile Vandermonde Alexandre-Théophile Vandermonde (28 February 1735 – 1 January 1796) was a French mathematician, musician and chemist who worked with Bézout and Lavoisier; his name is now principally associated with determinant theory in mathematics. He was b ...
(1772), although it was already known in 1303 by the Chinese mathematician
Zhu Shijie Zhu Shijie (, 1249–1314), courtesy name Hanqing (), pseudonym Songting (), was a Chinese mathematician and writer. He was a Chinese mathematician during the Yuan Dynasty. Zhu was born close to today's Beijing. Two of his mathematical works ha ...
.See for the history. There is a ''q''-analog to this theorem called the ''q''-Vandermonde identity. Vandermonde's identity can be generalized in numerous ways, including to the identity : = \sum_ \cdots .


Proofs


Algebraic proof

In general, the product of two
polynomial In mathematics, a polynomial is an expression consisting of indeterminates (also called variables) and coefficients, that involves only the operations of addition, subtraction, multiplication, and positive-integer powers of variables. An example ...
s with degrees ''m'' and ''n'', respectively, is given by :\biggl(\sum_^m a_ix^i\biggr) \biggl(\sum_^n b_jx^j\biggr) = \sum_^\biggl(\sum_^r a_k b_\biggr) x^r, where we use the convention that ''ai'' = 0 for all integers ''i'' > ''m'' and ''bj'' = 0 for all integers ''j'' > ''n''. By the
binomial theorem In elementary algebra, the binomial theorem (or binomial expansion) describes the algebraic expansion of powers of a binomial. According to the theorem, it is possible to expand the polynomial into a sum involving terms of the form , where the ...
, :(1+x)^ = \sum_^ x^r. Using the binomial theorem also for the exponents ''m'' and ''n'', and then the above formula for the product of polynomials, we obtain :\begin \sum_^ x^r &= (1+x)^\\ &= (1+x)^m (1+x)^n \\ &= \biggl(\sum_^m x^i\biggr) \biggl(\sum_^n x^j\biggr)\\ &=\sum_^\biggl(\sum_^r \biggr) x^r, \end where the above convention for the coefficients of the polynomials agrees with the definition of the binomial coefficients, because both give zero for all ''i'' > ''m'' and ''j'' > ''n'', respectively. By comparing coefficients of ''x''''r'', Vandermonde's identity follows for all integers ''r'' with 0 ≤ ''r'' ≤ ''m'' + ''n''. For larger integers ''r'', both sides of Vandermonde's identity are zero due to the definition of binomial coefficients.


Combinatorial proof

Vandermonde's identity also admits a combinatorial double counting proof, as follows. Suppose a committee consists of ''m'' men and ''n'' women. In how many ways can a subcommittee of ''r'' members be formed? The answer is :. The answer is also the sum over all possible values of ''k'', of the number of subcommittees consisting of ''k'' men and ''r'' − ''k'' women: :\sum_^r.


Geometrical proof

Take a rectangular grid of ''r'' x (''m''+''n''−''r'') squares. There are : \binom=\binom paths that start on the bottom left vertex and, moving only upwards or rightwards, end at the top right vertex (this is because ''r'' right moves and ''m''+''n''-''r'' up moves must be made (or vice versa) in any order, and the total path length is ''m'' + ''n''). Call the bottom left vertex (0, 0). There are \binom paths starting at (0, 0) that end at (''k'', ''m''−''k''), as ''k'' right moves and ''m''−''k'' upward moves must be made (and the path length is ''m''). Similarly, there are \binom paths starting at (''k'', ''m''−''k'') that end at (''r'', ''m''+''n''−''r''), as a total of ''r''−''k'' right moves and (''m''+''n''−''r'') − (''m''−''k'') upward moves must be made and the path length must be ''r''−''k'' + (''m''+''n''−''r'') − (''m''−''k'') = ''n''. Thus there are : \binom\binom paths that start at (0, 0), end at (''r'', ''m''+''n''−''r''), and go through (''k'', ''m''−''k''). This is a subset of all paths that start at (0, 0) and end at (''r'', ''m''+''n''−''r''), so sum from ''k'' = 0 to ''k'' = ''r'' (as the point (''k'', ''m''−''k'') is confined to be within the square) to obtain the total number of paths that start at (0, 0) and end at (''r'', ''m''+''n''−''r'').


Generalizations


Generalized Vandermonde's identity

One can generalize Vandermonde's identity as follows: : \sum_ \cdots = . This identity can be obtained through the algebraic derivation above when more than two polynomials are used, or through a simple double counting argument. On the one hand, one chooses \textstyle k_1 elements out of a first set of \textstyle n_1 elements; then \textstyle k_2 out of another set, and so on, through \textstyle p such sets, until a total of \textstyle m elements have been chosen from the \textstyle p sets. One therefore chooses \textstyle m elements out of \textstyle n_1+\dots +n_p in the left-hand side, which is also exactly what is done in the right-hand side. Vandermonde's identity also follows from the following identity by setting t=0. Given p,q,s\leq p+q. Then for t\leq s: : = \sum_ -\sum_ .


Chu–Vandermonde identity

The identity generalizes to non-integer arguments. In this case, it is known as the Chu–Vandermonde identity (see Askey 1975, pp. 59–60) and takes the form :=\sum_^n for general complex-valued ''s'' and ''t'' and any non-negative integer ''n''. It can be proved along the lines of the algebraic proof above by multiplying the
binomial series In mathematics, the binomial series is a generalization of the polynomial that comes from a binomial formula expression like (1+x)^n for a nonnegative integer n. Specifically, the binomial series is the Taylor series for the function f(x)=(1+x ...
for (1+x)^s and (1+x)^t and comparing terms with the binomial series for (1+x)^. This identity may be rewritten in terms of the falling
Pochhammer symbol In mathematics, the falling factorial (sometimes called the descending factorial, falling sequential product, or lower factorial) is defined as the polynomial :\begin (x)_n = x^\underline &= \overbrace^ \\ &= \prod_^n(x-k+1) = \prod_^(x-k) \,. \e ...
s as :(s+t)_n = \sum_^n (s)_k (t)_ in which form it is clearly recognizable as an umbral variant of the
binomial theorem In elementary algebra, the binomial theorem (or binomial expansion) describes the algebraic expansion of powers of a binomial. According to the theorem, it is possible to expand the polynomial into a sum involving terms of the form , where the ...
(for more on umbral variants of the binomial theorem, see
binomial type In mathematics, a polynomial sequence, i.e., a sequence of polynomials indexed by non-negative integers \left\ in which the index of each polynomial equals its degree, is said to be of binomial type if it satisfies the sequence of identities :p_ ...
). The Chu–Vandermonde identity can also be seen to be a special case of Gauss's hypergeometric theorem, which states that :\;_2F_1(a,b;c;1) = \frac where \;_2F_1 is the hypergeometric function and \Gamma(n+1)=n! is the
gamma function In mathematics, the gamma function (represented by , the capital letter gamma from the Greek alphabet) is one commonly used extension of the factorial function to complex numbers. The gamma function is defined for all complex numbers except ...
. One regains the Chu–Vandermonde identity by taking ''a'' = −''n'' and applying the identity : = (-1)^k liberally. The Rothe–Hagen identity is a further generalization of this identity.


The hypergeometric probability distribution

When both sides have been divided by the expression on the left, so that the sum is 1, then the terms of the sum may be interpreted as probabilities. The resulting probability distribution is the
hypergeometric distribution In probability theory and statistics, the hypergeometric distribution is a discrete probability distribution that describes the probability of k successes (random draws for which the object drawn has a specified feature) in n draws, ''without'' ...
. That is the probability distribution of the number of red marbles in ''r'' draws ''without replacement'' from an urn containing ''n'' red and ''m'' blue marbles.


See also

* Pascal's identity * Hockey-stick identity * Rothe–Hagen identity


References

{{Reflist Factorial and binomial topics Mathematical identities Articles containing proofs