Graphoid
   HOME

TheInfoList



OR:

A graphoid is a set of statements of the form, "''X'' is irrelevant to ''Y'' given that we know ''Z''" where ''X'', ''Y'' and ''Z'' are sets of variables. The notion of "irrelevance" and "given that we know" may obtain different interpretations, including
probabilistic Probability is the branch of mathematics concerning numerical descriptions of how likely an Event (probability theory), event is to occur, or how likely it is that a proposition is true. The probability of an event is a number between 0 and ...
, relational and correlational, depending on the application. These interpretations share common properties that can be captured by paths in graphs (hence the name "graphoid"). The theory of graphoids characterizes these properties in a finite set of
axioms An axiom, postulate, or assumption is a statement that is taken to be true, to serve as a premise or starting point for further reasoning and arguments. The word comes from the Ancient Greek word (), meaning 'that which is thought worthy or f ...
that are common to informational irrelevance and its graphical representations.


History

Judea Pearl Judea Pearl (born September 4, 1936) is an Israeli-American computer scientist and philosopher, best known for championing the probabilistic approach to artificial intelligence and the development of Bayesian networks (see the article on belief ...
and Azaria Paz coined the term "graphoids" after discovering that a set of axioms that govern
conditional independence In probability theory, conditional independence describes situations wherein an observation is irrelevant or redundant when evaluating the certainty of a hypothesis. Conditional independence is usually formulated in terms of conditional probabil ...
in
probability theory Probability theory is the branch of mathematics concerned with probability. Although there are several different probability interpretations, probability theory treats the concept in a rigorous mathematical manner by expressing it through a set o ...
is shared by
undirected graph In discrete mathematics, and more specifically in graph theory, a graph is a structure amounting to a set of objects in which some pairs of the objects are in some sense "related". The objects correspond to mathematical abstractions called '' v ...
s. Variables are represented as nodes in a graph in such a way that variable sets ''X'' and ''Y'' are independent conditioned on ''Z'' in the distribution whenever node set ''Z'' separates ''X'' from ''Y'' in the graph. Axioms for conditional independence in probability were derived earlier by A. Philip Dawid and
Wolfgang Spohn Wolfgang Konrad Spohn (born 20 March 1950, in Tübingen) is a German philosopher. He is professor of philosophy and philosophy of science at the University of Konstanz. Biography Wolfgang Spohn studied philosophy, logic and philosophy of scie ...
. The correspondence between dependence and graphs was later extended to
directed acyclic graph In mathematics, particularly graph theory, and computer science, a directed acyclic graph (DAG) is a directed graph with no directed cycles. That is, it consists of vertices and edges (also called ''arcs''), with each edge directed from one ve ...
s (DAGs) and to other models of dependency.


Definition

A dependency model ''M'' is a subset of triplets (''X'',''Z'',''Y'') for which the predicate ''I''(''X'',''Z'',''Y''): ''X'' is independent of ''Y'' given ''Z'', is true. A graphoid is defined as a dependency model that is closed under the following five axioms: #Symmetry: I(X,Z,Y) \Leftrightarrow I(Y,Z,X) #Decomposition: I(X,Z,Y\cup W) \Rightarrow I(X,Z,Y)~\&~I(X,Z,W) #Weak Union: I(X,Z,Y\cup W) \Rightarrow I(X,Z\cup W,Y)~\&~I(X,Z\cup Y,W) #Contraction: I(X,Z,Y)~\&~I(X,Z\cup Y,W) \Rightarrow I(X,Z,Y\cup W) #Intersection: I(X,Z\cup W,Y)~\&~I(X,Z\cup Y,W) \Rightarrow I(X,Z,Y\cup W) A semi-graphoid is a dependency model closed under 1–4. These five axioms together are known as the graphoid axioms. Intuitively, the weak union and contraction properties mean that irrelevant information should not alter the relevance status of other propositions in the system; what was relevant remains relevant and what was irrelevant remains irrelevant.


Types of graphoids


Probabilistic graphoids

Conditional independence, defined as : I(X,Z,Y) \Leftrightarrow P(X\mid Y, Z) = P(X\mid Z) is a semi-graphoid which becomes a full graphoid when ''P'' is strictly positive.


Correlational graphoids

A dependency model is a correlational graphoid if in some probability function we have, : I_c(X,Y,Z) \Leftrightarrow \rho_=0\textx \in X\texty \in Y where \rho_ is the
partial correlation In probability theory and statistics, partial correlation measures the degree of association between two random variables, with the effect of a set of controlling random variables removed. When determining the numerical relationship between two va ...
between ''x'' and ''y'' given set ''Z''. In other words, the linear estimation error of the variables in ''X'' using measurements on ''Z'' would not be reduced by adding measurements of the variables in ''Y'', thus making ''Y'' irrelevant to the estimation of ''X''. Correlational and probabilistic dependency models coincide for normal distributions.


Relational graphoids

A dependency model is a relational graphoid if it satisfies : P(X,Z)>0~\&~P(Y,Z)>0 \implies P(X,Y,Z)>0. In words, the range of values permitted for ''X'' is not restricted by the choice of ''Y'', once ''Z'' is fixed. Independence statements belonging to this model are similar to embedded multi-valued dependencies (EMVD s) in databases.


Graph-induced graphoids

If there exists an undirected graph ''G'' such that, : I(X,Z,Y) \Leftrightarrow \langle X,Z,Y\rangle_G, then the graphoid is called graph-induced. In other words, there exists an undirected graph ''G'' such that every independence statement in ''M'' is reflected as a vertex separation in ''G'' and vice versa. A necessary and sufficient condition for a dependency model to be a graph-induced graphoid is that it satisfies the following axioms: symmetry, decomposition, intersection, strong union and transitivity. Strong union states that : I(X,Z,Y) \implies I(X,Z\cup W,Y) Transitivity states that : I(X,Z,Y) \implies \left(\forall~\gamma \notin X \cup Y \cup Z,~~I(X,Z,\gamma) \text I(\gamma, Z,Y)\right) The axioms symmetry, decomposition, intersection, strong union and transitivity constitute a complete characterization of undirected graphs.A. Paz, J. Pearl, and S. Ur, "A New Characterization of Graphs Based on Interception Relations" Journal of Graph Theory, Vol. 22, No. 2, 125-136, 1996.


DAG-induced graphoids

A graphoid is termed DAG-induced if there exists a directed acyclic graph ''D'' such that I(X,Z,Y) \Leftrightarrow \langle X,Z,Y\rangle_D where \langle X,Z,Y\rangle_D stands for ''d''-separation in ''D''. ''d''-separation (''d''-connotes "directional") extends the notion of vertex separation from undirected graphs to directed acyclic graphs. It permits the reading of conditional independencies from the structure of
Bayesian network A Bayesian network (also known as a Bayes network, Bayes net, belief network, or decision network) is a probabilistic graphical model that represents a set of variables and their conditional dependencies via a directed acyclic graph (DAG). Bay ...
s. However, conditional independencies in a DAG cannot be completely characterized by a finite set of axioms.


Inclusion and construction

Graph-induced and DAG-induced graphoids are both contained in probabilistic graphoids. This means that for every graph ''G'' there exists a probability distribution ''P'' such that every conditional independence in ''P'' is represented in ''G'', and vice versa. The same is true for DAGs. However, there are probabilistic distributions that are not graphoids and, moreover, there is no finite axiomatization for probabilistic conditional dependencies. Thomas Verma showed that every semi-graphoid has a recursive way of constructing a DAG in which every ''d''-separation is valid. The construction is similar to that used in
Bayes networks A Bayesian network (also known as a Bayes network, Bayes net, belief network, or decision network) is a probabilistic graphical model that represents a set of variables and their conditional dependencies via a directed acyclic graph (DAG). Bay ...
and goes as follows: #Arrange the variables in some arbitrary order 1, 2,...,i,...,''N'' and, starting with ''i'' = 1, #choose for each node ''i'' a set of nodes ''PA''''i'' such that ''i'' is independent on all its predecessors, 1, 2,...,''i'' − 1, conditioned on ''PA''''i''. #Draw arrows from ''PA''''i'' to ''i'' and continue. The DAG created by this construction will represent all the conditional independencies that follow from those used in the construction. Furthermore, every ''d''-separation shown in the DAG will be a valid conditional independence in the graphoid used in the construction.


References

{{Reflist Logic Probability theory