A factor graph is a

bipartite graph In the mathematics, mathematical field of graph theory, a bipartite graph (or bigraph) is a Graph (discrete mathematics), graph whose vertex (graph theory), vertices can be divided into two disjoint sets, disjoint and Independent set (graph theo ...

representing the

factorization In mathematics, factorization (or factorisation, see American and British English spelling differences#-ise, -ize (-isation, -ization), English spelling differences) or factoring consists of writing a number or another mathematical object as a p ...

of a

function Function or functionality may refer to: Computing * Function key, a type of key on computer keyboards * Function model, a structured representation of processes in a system * Function object or functor or functionoid, a concept of object-orie ...

. In

probability theory Probability theory or probability calculus is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expre ...

and its applications, factor graphs are used to represent factorization of a probability distribution function, enabling efficient computations, such as the computation of

marginal distribution In probability theory and statistics, the marginal distribution of a subset of a collection of random variables is the probability distribution of the variables contained in the subset. It gives the probabilities of various values of the variable ...

s through the sum–product algorithm. One of the important success stories of factor graphs and the sum–product algorithm is the decoding of capacity-approaching

error-correcting code In computing, telecommunication, information theory, and coding theory, forward error correction (FEC) or channel coding is a technique used for controlling errors in data transmission over unreliable or noisy communication channels. The centra ...

s, such as

LDPC Low-density parity-check (LDPC) codes are a class of error correction codes which (together with the closely-related turbo codes) have gained prominence in coding theory and information theory since the late 1990s. The codes today are widely ...

and turbo codes. Factor graphs generalize constraint graphs. A factor whose value is either 0 or 1 is called a constraint. A constraint graph is a factor graph where all factors are constraints. The max-product algorithm for factor graphs can be viewed as a generalization of the arc-consistency algorithm for constraint processing.

Definition

A factor graph is a

representing the

of a function. Given a factorization of a function

g(X_1,X_2,\dots,X_n)

, :

g(X_1,X_2,\dots,X_n) = \prod_^m f_j(S_j),

where

S_j \subseteq \

, the corresponding factor graph

G=(X,F,E)

consists of variable vertices

X=\

, factor vertices

F=\

, and edges

E

. The edges depend on the factorization as follows: there is an undirected edge between factor vertex

f_j

and variable vertex

X_k

X_k \in S_j

. The function is tacitly assumed to be

real-valued In mathematics, value may refer to several, strongly related notions. In general, a mathematical value may be any definite mathematical object. In elementary mathematics, this is most often a number – for example, a real number such as or an ...

g(X_1,X_2,\dots,X_n) \in \mathbb

. Factor graphs can be combined with message passing algorithms to efficiently compute certain characteristics of the function

g(X_1,X_2,\dots,X_n)

, such as the

Examples

Consider a function that factorizes as follows: :

g(X_1,X_2,X_3) = f_1(X_1)f_2(X_1,X_2)f_3(X_1,X_2)f_4(X_2,X_3)

, with a corresponding factor graph shown on the right. Observe that the factor graph has a cycle. If we merge

f_2(X_1,X_2)f_3(X_1,X_2)

into a single factor, the resulting factor graph will be a

tree In botany, a tree is a perennial plant with an elongated stem, or trunk, usually supporting branches and leaves. In some usages, the definition of a tree may be narrower, e.g., including only woody plants with secondary growth, only ...

. This is an important distinction, as message passing algorithms are usually exact for trees, but only approximate for graphs with cycles.

Message passing on factor graphs

A popular message passing algorithm on factor graphs is the sum–product algorithm, which efficiently computes all the marginals of the individual variables of the function. In particular, the marginal of variable

X_k

is defined as :

g_k(X_k) = \sum_ g(X_1,X_2,\dots,X_n)

where the notation

X_

means that the summation goes over all the variables, ''except''

X_k

. The messages of the sum–product algorithm are conceptually computed in the vertices and passed along the edges. A message from or to a variable vertex is always a

of that particular variable. For instance, when a variable is binary, the messages over the edges incident to the corresponding vertex can be represented as vectors of length 2: the first entry is the message evaluated in 0, the second entry is the message evaluated in 1. When a variable belongs to the field of

real numbers In mathematics, a real number is a number that can be used to measurement, measure a continuous variable, continuous one-dimensional quantity such as a time, duration or temperature. Here, ''continuous'' means that pairs of values can have arbi ...

, messages can be arbitrary functions, and special care needs to be taken in their representation. In practice, the sum–product algorithm is used for

statistical inference Statistical inference is the process of using data analysis to infer properties of an underlying probability distribution.Upton, G., Cook, I. (2008) ''Oxford Dictionary of Statistics'', OUP. . Inferential statistical analysis infers properties of ...

, whereby

g(X_1,X_2,\dots,X_n)

is a joint

distribution Distribution may refer to: Mathematics *Distribution (mathematics), generalized functions used to formulate solutions of partial differential equations *Probability distribution, the probability of a particular value or value range of a varia ...

or a joint

likelihood function A likelihood function (often simply called the likelihood) measures how well a statistical model explains observed data by calculating the probability of seeing that data under different parameter values of the model. It is constructed from the ...

, and the factorization depends on the conditional independencies among the variables. The

Hammersley–Clifford theorem The Hammersley–Clifford theorem is a result in probability theory, mathematical statistics and statistical mechanics that gives necessary and sufficient conditions under which a strictly positive probability distribution can be represented as even ...

shows that other probabilistic models such as

Bayesian network A Bayesian network (also known as a Bayes network, Bayes net, belief network, or decision network) is a probabilistic graphical model that represents a set of variables and their conditional dependencies via a directed acyclic graph (DAG). Whi ...

s and

Markov network In the domain of physics and probability, a Markov random field (MRF), Markov network or undirected graphical model is a set of random variables having a Markov property described by an undirected graph. In other words, a random field is said to ...

s can be represented as factor graphs; the latter representation is frequently used when performing inference over such networks using

belief propagation Belief propagation, also known as sum–product message passing, is a message-passing algorithm for performing inference on graphical models, such as Bayesian networks and Markov random fields. It calculates the marginal distribution for ea ...

. On the other hand, Bayesian networks are more naturally suited for

generative model In statistical classification, two main approaches are called the generative approach and the discriminative approach. These compute classifiers by different approaches, differing in the degree of statistical modelling. Terminology is inconsiste ...

s, as they can directly represent the causalities of the model.

External links

*
dimple
an open-source tool for building and solving factor graphs in MATLAB. *

References

* * * * {{DEFAULTSORT:Factor Graph Graphical models Markov networks Application-specific graphs

Definition

Examples

Message passing on factor graphs

See also

External links

References