In mathematics, the Fortuin–Kasteleyn–Ginibre (FKG) inequality is a

correlation In statistics, correlation or dependence is any statistical relationship, whether causal or not, between two random variables or bivariate data. Although in the broadest sense, "correlation" may indicate any type of association, in statistics ...

inequality, a fundamental tool in

statistical mechanics In physics, statistical mechanics is a mathematical framework that applies statistical methods and probability theory to large assemblies of microscopic entities. It does not assume or postulate any natural laws, but explains the macroscopic be ...

and

probabilistic combinatorics The probabilistic method is a nonconstructive method, primarily used in combinatorics and pioneered by Paul Erdős, for proving the existence of a prescribed kind of mathematical object. It works by showing that if one randomly chooses objects ...

(especially

random graph In mathematics, random graph is the general term to refer to probability distributions over graphs. Random graphs may be described simply by a probability distribution, or by a random process which generates them. The theory of random graphs li ...

s and the

probabilistic method The probabilistic method is a nonconstructive method, primarily used in combinatorics and pioneered by Paul Erdős, for proving the existence of a prescribed kind of mathematical object. It works by showing that if one randomly chooses objects fr ...

), due to . Informally, it says that in many random systems, increasing events are positively correlated, while an increasing and a decreasing event are negatively correlated. It was obtained by studying the

random cluster model In statistical mechanics, probability theory, graph theory, etc. the random cluster model is a random graph that generalizes and unifies the Ising model, Potts model, and percolation model. It is used to study random combinatorial structures, elec ...

. An earlier version, for the special case of

i.i.d. In probability theory and statistics, a collection of random variables is independent and identically distributed if each random variable has the same probability distribution as the others and all are mutually independent. This property is us ...

variables, called Harris inequality, is due to , see below. One generalization of the FKG inequality is the Holley inequality (1974) below, and an even further generalization is the Ahlswede–Daykin "four functions" theorem (1978). Furthermore, it has the same conclusion as the Griffiths inequalities, but the hypotheses are different.

The inequality

Let

X

be a finite

distributive lattice In mathematics, a distributive lattice is a lattice in which the operations of join and meet distribute over each other. The prototypical examples of such structures are collections of sets for which the lattice operations can be given by set uni ...

, and ''μ'' a nonnegative function on it, that is assumed to satisfy the (FKG) lattice condition (sometimes a function satisfying this condition is called log supermodular) i.e., :

\mu(x\wedge y)\mu(x\vee y) \ge \mu(x)\mu(y)

for all ''x'', ''y'' in the lattice

X

. The FKG inequality then says that for any two monotonically increasing functions ''ƒ'' and ''g'' on

X

, the following positive correlation inequality holds: :

\left(\sum _f(x)g(x)\mu(x)\right)\left(\sum _\mu(x)\right) \ge \left(\sum _f(x)\mu(x)\right)\left(\sum _g(x)\mu(x)\right).

The same inequality (positive correlation) is true when both ''ƒ'' and ''g'' are decreasing. If one is increasing and the other is decreasing, then they are negatively correlated and the above inequality is reversed. Similar statements hold more generally, when

X

is not necessarily finite, not even countable. In that case, ''μ'' has to be a finite measure, and the lattice condition has to be defined using

cylinder A cylinder (from ) has traditionally been a three-dimensional solid, one of the most basic of curvilinear geometric shapes. In elementary geometry, it is considered a prism with a circle as its base. A cylinder may also be defined as an infin ...

events; see, e.g., Section 2.2 of . For proofs, see or the Ahlswede–Daykin inequality (1978). Also, a rough sketch is given below, due to , using a

Markov chain A Markov chain or Markov process is a stochastic model describing a sequence of possible events in which the probability of each event depends only on the state attained in the previous event. Informally, this may be thought of as, "What happe ...

coupling A coupling is a device used to connect two shafts together at their ends for the purpose of transmitting power. The primary purpose of couplings is to join two pieces of rotating equipment while permitting some degree of misalignment or end mov ...

argument.

Variations on terminology

The lattice condition for ''μ'' is also called multivariate total positivity, and sometimes the strong FKG condition; the term (multiplicative) FKG condition is also used in older literature. The property of ''μ'' that increasing functions are positively correlated is also called having positive associations, or the weak FKG condition. Thus, the FKG theorem can be rephrased as "the strong FKG condition implies the weak FKG condition".

A special case: the Harris inequality

If the lattice

X

totally ordered In mathematics, a total or linear order is a partial order in which any two elements are comparable. That is, a total order is a binary relation \leq on some set X, which satisfies the following for all a, b and c in X: # a \leq a ( reflexive ...

, then the lattice condition is satisfied trivially for any measure ''μ''. In case the measure ''μ'' is uniform, the FKG inequality is

Chebyshev's sum inequality In mathematics, Chebyshev's sum inequality, named after Pafnuty Chebyshev, states that if :a_1 \geq a_2 \geq \cdots \geq a_n \quad and \quad b_1 \geq b_2 \geq \cdots \geq b_n, then : \sum_^n a_k b_k \geq \left(\sum_^n a_k\right)\!\!\left(\sum_^n ...

: if the two increasing functions take on values

a_1\leq a_2 \leq \cdots \leq a_n

and

b_1\leq b_2 \leq \cdots \leq b_n

, then :

\frac \geq \frac \; \frac.

More generally, for any probability measure ''μ'' on

\R

and increasing functions ''ƒ'' and ''g'', :

\int_\R f(x)g(x) \,d\mu(x) \geq \int_\R f(x)\,d\mu(x) \, \int_\R g(x)\,d\mu(x),

which follows immediately from :

g(x)-g(y)]\,d\mu(x)\,d\mu(y) \geq 0.

The lattice condition is trivially satisfied also when the lattice is the product of totally ordered lattices,

X=X_1\times\cdots\times X_n

, and

\mu=\mu_1\otimes\cdots\otimes\mu_n

is a product measure. Often all the factors (both the lattices and the measures) are identical, i.e., ''μ'' is the probability distribution of

random variables. The FKG inequality for the case of a product measure is known also as the Harris inequality after Ted Harris (mathematician), Harris , who found and used it in his study of

percolation Percolation (from Latin ''percolare'', "to filter" or "trickle through"), in physics, chemistry and materials science, refers to the movement and filtering of fluids through porous materials. It is described by Darcy's law. Broader applicatio ...

in the plane. A proof of the Harris inequality that uses the above double integral trick on

\R

can be found, e.g., in Section 2.2 of .

Simple examples

A typical example is the following. Color each hexagon of the infinite

honeycomb lattice The hexagonal lattice or triangular lattice is one of the five two-dimensional Bravais lattice types. The symmetry category of the lattice is wallpaper group p6m. The primitive translation vectors of the hexagonal lattice form an angle of 120° ...

black with probability

p

and white with probability

1-p

, independently of each other. Let ''a, b, c, d'' be four hexagons, not necessarily distinct. Let

a \leftrightarrow b

and

c\leftrightarrow d

be the events that there is a black path from ''a'' to ''b'', and a black path from ''c'' to ''d'', respectively. Then the Harris inequality says that these events are positively correlated:

\Pr(a \leftrightarrow b,\ c\leftrightarrow d) \geq \Pr(a \leftrightarrow b)\Pr(c\leftrightarrow d)

. In other words, assuming the presence of one path can only increase the probability of the other. Similarly, if we randomly color the hexagons inside an

n\times n

rhombus-shaped hex board, then the events that there is black crossing from the left side of the board to the right side is positively correlated with having a black crossing from the top side to the bottom. On the other hand, having a left-to-right black crossing is negatively correlated with having a top-to-bottom white crossing, since the first is an increasing event (in the amount of blackness), while the second is decreasing. In fact, in any coloring of the hex board exactly one of these two events happen — this is why hex is a well-defined game. In the Erdős–Rényi random graph, the existence of a

Hamiltonian cycle In the mathematical field of graph theory, a Hamiltonian path (or traceable path) is a path in an undirected or directed graph that visits each vertex exactly once. A Hamiltonian cycle (or Hamiltonian circuit) is a cycle that visits each vertex ...

is negatively correlated with the 3-colorability of the graph, since the first is an increasing event, while the latter is decreasing.

Examples from statistical mechanics

In statistical mechanics, the usual source of measures that satisfy the lattice condition (and hence the FKG inequality) is the following: If

S

is an ordered set (such as

\

), and

\Gamma

is a finite or infinite

graph Graph may refer to: Mathematics *Graph (discrete mathematics), a structure made of vertices and edges **Graph theory, the study of such graphs and their properties *Graph (topology), a topological space resembling a graph in the sense of discre ...

, then the set

S^\Gamma

S

-valued configurations is a

poset In mathematics, especially order theory, a partially ordered set (also poset) formalizes and generalizes the intuitive concept of an ordering, sequencing, or arrangement of the elements of a Set (mathematics), set. A poset consists of a set toget ...

that is a distributive lattice. Now, if

\Phi

is a submodular

potential Potential generally refers to a currently unrealized ability. The term is used in a wide variety of fields, from physics to the social sciences to indicate things that are in a state where they are able to change in ways ranging from the simple re ...

(i.e., a family of functions :

\Phi_\Lambda: S^\Lambda \longrightarrow \R\cup\,

one for each finite

\Lambda \subset \Gamma

, such that each

\Phi_\Lambda

is submodular), then one defines the corresponding

Hamiltonian Hamiltonian may refer to: * Hamiltonian mechanics, a function that represents the total energy of a system * Hamiltonian (quantum mechanics), an operator corresponding to the total energy of that system ** Dyall Hamiltonian, a modified Hamiltonian ...

s as :

H_\Lambda(\varphi):=\sum_ \Phi_\Delta(\varphi).

If ''μ'' is an extremal Gibbs measure for this Hamiltonian on the set of configurations

\varphi

, then it is easy to show that ''μ'' satisfies the lattice condition, see . A key example is the

Ising model The Ising model () (or Lenz-Ising model or Ising-Lenz model), named after the physicists Ernst Ising and Wilhelm Lenz, is a mathematical model of ferromagnetism in statistical mechanics. The model consists of discrete variables that represent ...

on a graph

\Gamma

. Let

S=\

, called spins, and

\beta\in,\infty /math>. Take the following potential:

: \Phi_\Lambda(\varphi)=\begin 
\beta 1_ & \text\Lambda=\\text\Gamma;\\
0 & \text\end Submodularity is easy to check; intuitively, taking the min or the max of two configurations tends to decrease the number of disagreeing spins. Then, depending on the graph \Gamma and the value of \beta, there could be one or more extremal Gibbs measures, see, e.g.,  and .

A generalization: the Holley inequality

The Holley inequality, due to , states that the expectations :

\langle f\rangle_i = \frac

of a monotonically increasing function ''ƒ'' on a finite

X

with respect to two positive functions ''μ''₁, ''μ''₂ on the lattice satisfy the condition :

\langle f\rangle_1 \ge \langle f\rangle_2,

provided the functions satisfy the Holley condition (criterion) :

\mu_2(x\wedge y)\mu_1(x\vee y) \ge \mu_1(x)\mu_2(y)

for all ''x'', ''y'' in the lattice. To recover the FKG inequality: If ''μ'' satisfies the lattice condition and ''ƒ'' and ''g'' are increasing functions on

X

, then ''μ''₁(''x'') = ''g''(''x'')''μ''(''x'') and ''μ''₂(''x'') = ''μ''(''x'') will satisfy the lattice-type condition of the Holley inequality. Then the Holley inequality states that :

\frac = \langle f\rangle_1 \ge \langle f\rangle_2 =\langle f\rangle_\mu,

which is just the FKG inequality. As for FKG, the Holley inequality follows from the

Ahlswede–Daykin inequality The Ahlswede–Daykin inequality , also known as the four functions theorem (or inequality), is a correlation-type inequality for four functions on a finite distributive lattice. It is a fundamental tool in statistical mechanics and probabilis ...

Weakening the lattice condition: monotonicity

Consider the usual case of

X

being a product

\R^V

for some finite set

V

. The lattice condition on ''μ'' is easily seen to imply the following monotonicity, which has the virtue that it is often easier to check than the lattice condition: Whenever one fixes a vertex

v \in V

and two configurations ''φ'' and ''ψ'' outside ''v'' such that

\varphi(w) \geq \psi(w)

for all

w\not=v

, the ''μ''-conditional distribution of ''φ''(''v'') given

\

stochastically dominates the ''μ''-conditional distribution of ''ψ''(''v'') given

\

. Now, if ''μ'' satisfies this monotonicity property, that is already enough for the FKG inequality (positive associations) to hold. Here is a rough sketch of the proof, due to : starting from any initial configuration on

V

, one can run a simple

(the

Metropolis algorithm A metropolis () is a large city or conurbation which is a significant economic, political, and cultural center for a country or region, and an important hub for regional or international connections, commerce, and communications. A big c ...

) that uses independent Uniform ,1random variables to update the configuration in each step, such that the chain has a unique stationary measure, the given ''μ''. The monotonicity of ''μ'' implies that the configuration at each step is a monotone function of independent variables, hence the product measure version of Harris implies that it has positive associations. Therefore, the limiting stationary measure ''μ'' also has this property. The monotonicity property has a natural version for two measures, saying that ''μ''₁ conditionally pointwise dominates ''μ''₂. It is again easy to see that if ''μ''₁ and ''μ''₂ satisfy the lattice-type condition of the Holley inequality, then ''μ''₁ conditionally pointwise dominates ''μ''₂. On the other hand, a Markov chain

argument similar to the above, but now without invoking the Harris inequality, shows that conditional pointwise domination, in fact, implies stochastically domination. Stochastic domination is equivalent to saying that

\langle f\rangle_1 \ge \langle f\rangle_2

for all increasing ''ƒ'', thus we get a proof of the Holley inequality. (And thus also a proof of the FKG inequality, without using the Harris inequality.) See and for details.

References

* * * * * * * * * * {{DEFAULTSORT:Fkg Inequality Inequalities Statistical mechanics Covariance and correlation