The partition function or configuration integral, as used in
probability theory
Probability theory or probability calculus is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expre ...
,
information theory
Information theory is the mathematical study of the quantification (science), quantification, Data storage, storage, and telecommunications, communication of information. The field was established and formalized by Claude Shannon in the 1940s, ...
and
dynamical systems
In mathematics, a dynamical system is a system in which a Function (mathematics), function describes the time dependence of a Point (geometry), point in an ambient space, such as in a parametric curve. Examples include the mathematical models ...
, is a generalization of the definition of a
partition function in statistical mechanics. It is a special case of a
normalizing constant in probability theory, for the
Boltzmann distribution. The partition function occurs in many problems of probability theory because, in situations where there is a natural symmetry, its associated
probability measure
In mathematics, a probability measure is a real-valued function defined on a set of events in a σ-algebra that satisfies Measure (mathematics), measure properties such as ''countable additivity''. The difference between a probability measure an ...
, the
Gibbs measure, has the
Markov property. This means that the partition function occurs not only in physical systems with translation symmetry, but also in such varied settings as neural networks (the
Hopfield network), and applications such as
genomics,
corpus linguistics
Corpus linguistics is an empirical method for the study of language by way of a text corpus (plural ''corpora''). Corpora are balanced, often stratified collections of authentic, "real world", text of speech or writing that aim to represent a giv ...
and
artificial intelligence
Artificial intelligence (AI) is the capability of computer, computational systems to perform tasks typically associated with human intelligence, such as learning, reasoning, problem-solving, perception, and decision-making. It is a field of re ...
, which employ
Markov networks, and
Markov logic networks. The Gibbs measure is also the unique measure that has the property of maximizing the
entropy for a fixed expectation value of the energy; this underlies the appearance of the partition function in
maximum entropy methods and the algorithms derived therefrom.
The partition function ties together many different concepts, and thus offers a general framework in which many different kinds of quantities may be calculated. In particular, it shows how to calculate
expectation values and
Green's functions, forming a bridge to
Fredholm theory. It also provides a natural setting for the
information geometry
Information geometry is an interdisciplinary field that applies the techniques of differential geometry to study probability theory and statistics. It studies statistical manifolds, which are Riemannian manifolds whose points correspond to proba ...
approach to information theory, where the
Fisher information metric
In information geometry, the Fisher information metric is a particular Riemannian metric which can be defined on a smooth statistical manifold, ''i.e.'', a smooth manifold whose points are probability distributions. It can be used to calculate the ...
can be understood to be a
correlation function derived from the partition function; it happens to define a
Riemannian manifold
In differential geometry, a Riemannian manifold is a geometric space on which many geometric notions such as distance, angles, length, volume, and curvature are defined. Euclidean space, the N-sphere, n-sphere, hyperbolic space, and smooth surf ...
.
When the setting for random variables is on
complex projective space or
projective Hilbert space, geometrized with the
Fubini–Study metric, the theory of
quantum mechanics
Quantum mechanics is the fundamental physical Scientific theory, theory that describes the behavior of matter and of light; its unusual characteristics typically occur at and below the scale of atoms. Reprinted, Addison-Wesley, 1989, It is ...
and more generally
quantum field theory
In theoretical physics, quantum field theory (QFT) is a theoretical framework that combines Field theory (physics), field theory and the principle of relativity with ideas behind quantum mechanics. QFT is used in particle physics to construct phy ...
results. In these theories, the partition function is heavily exploited in the
path integral formulation, with great success, leading to many formulas nearly identical to those reviewed here. However, because the underlying measure space is complex-valued, as opposed to the real-valued
simplex of probability theory, an extra factor of ''i'' appears in many formulas. Tracking this factor is troublesome, and is not done here. This article focuses primarily on classical probability theory, where the sum of probabilities total to one.
Definition
Given a set of
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
s
taking on values
, and some sort of
potential function or
Hamiltonian
Hamiltonian may refer to:
* Hamiltonian mechanics, a function that represents the total energy of a system
* Hamiltonian (quantum mechanics), an operator corresponding to the total energy of that system
** Dyall Hamiltonian, a modified Hamiltonian ...
, the partition function is defined as
The function ''H'' is understood to be a real-valued function on the space of states
, while
is a real-valued free parameter (conventionally, the
inverse temperature). The sum over the
is understood to be a sum over all possible values that each of the random variables
may take. Thus, the sum is to be replaced by an
integral
In mathematics, an integral is the continuous analog of a Summation, sum, which is used to calculate area, areas, volume, volumes, and their generalizations. Integration, the process of computing an integral, is one of the two fundamental oper ...
when the
are continuous, rather than discrete. Thus, one writes
for the case of continuously-varying
.
When ''H'' is an
observable, such as a finite-dimensional
matrix or an infinite-dimensional
Hilbert space operator or element of a
C-star algebra, it is common to express the summation as a
trace, so that
When ''H'' is infinite-dimensional, then, for the above notation to be valid, the argument must be
trace class, that is, of a form such that the summation exists and is bounded.
The number of variables
need not be
countable
In mathematics, a Set (mathematics), set is countable if either it is finite set, finite or it can be made in one to one correspondence with the set of natural numbers. Equivalently, a set is ''countable'' if there exists an injective function fro ...
, in which case the sums are to be replaced by
functional integrals. Although there are many notations for functional integrals, a common one would be
Such is the case for the
partition function in quantum field theory.
A common, useful modification to the partition function is to introduce auxiliary functions. This allows, for example, the partition function to be used as a
generating function for
correlation functions. This is discussed in greater detail below.
The parameter ''β''
The role or meaning of the parameter
can be understood in a variety of different ways. In classical thermodynamics, it is an
inverse temperature. More generally, one would say that it is the variable that is
conjugate to some (arbitrary) function
of the random variables
. The word ''conjugate'' here is used in the sense of conjugate
generalized coordinates in
Lagrangian mechanics
In physics, Lagrangian mechanics is a formulation of classical mechanics founded on the d'Alembert principle of virtual work. It was introduced by the Italian-French mathematician and astronomer Joseph-Louis Lagrange in his presentation to the ...
, thus, properly
is a
Lagrange multiplier. It is not uncommonly called the
generalized force. All of these concepts have in common the idea that one value is meant to be kept fixed, as others, interconnected in some complicated way, are allowed to vary. In the current case, the value to be kept fixed is the
expectation value of
, even as many different
probability distribution
In probability theory and statistics, a probability distribution is a Function (mathematics), function that gives the probabilities of occurrence of possible events for an Experiment (probability theory), experiment. It is a mathematical descri ...
s can give rise to exactly this same (fixed) value.
For the general case, one considers a set of functions
that each depend on the random variables
. These functions are chosen because one wants to hold their expectation values constant, for one reason or another. To constrain the expectation values in this way, one applies the method of
Lagrange multipliers. In the general case,
maximum entropy methods illustrate the manner in which this is done.
Some specific examples are in order. In basic thermodynamics problems, when using the
canonical ensemble, the use of just one parameter
reflects the fact that there is only one expectation value that must be held constant: the
free energy (due to
conservation of energy). For chemistry problems involving chemical reactions, the
grand canonical ensemble provides the appropriate foundation, and there are two Lagrange multipliers. One is to hold the energy constant, and another, the
fugacity, is to hold the particle count constant (as chemical reactions involve the recombination of a fixed number of atoms).
For the general case, one has
with
a point in a space.
For a collection of observables
, one would write