HOME

TheInfoList



OR:

In
statistics Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...
and
machine learning Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence. Machine ...
, when one wants to infer a random variable with a set of variables, usually a subset is enough, and other variables are useless. Such a subset that contains all the useful information is called a Markov blanket. If a Markov blanket is minimal, meaning that it cannot drop any variable without losing information, it is called a Markov boundary. Identifying a Markov blanket or a Markov boundary helps to extract useful features. The terms of Markov blanket and Markov boundary were coined by
Judea Pearl Judea Pearl (born September 4, 1936) is an Israeli-American computer scientist and philosopher, best known for championing the probabilistic approach to artificial intelligence and the development of Bayesian networks (see the article on beli ...
in 1988.


Markov blanket

A Markov blanket of a random variable Y in a random variable set \mathcal=\ is any subset \mathcal_1 of \mathcal, conditioned on which other variables are independent with Y: :Y\perp \!\!\! \perp\mathcal\backslash\mathcal_1 \mid \mathcal_1. It means that \mathcal_1 contains at least all the information one needs to infer Y, where the variables in \mathcal\backslash\mathcal_1 are redundant. In general, a given Markov blanket is not unique. Any set in \mathcal that contains a Markov blanket is also a Markov blanket itself. Specifically, \mathcal is a Markov blanket of Y in \mathcal.


Markov boundary

A Markov boundary of Y in \mathcal is a subset \mathcal_2 of \mathcal, that \mathcal_2 itself is a Markov blanket of Y, but any proper subset of \mathcal_2 is not a Markov blanket of Y. In other words, a Markov boundary is a minimal Markov blanket. The Markov boundary of a
node In general, a node is a localized swelling (a "knot") or a point of intersection (a vertex). Node may refer to: In mathematics * Vertex (graph theory), a vertex in a mathematical graph *Vertex (geometry), a point where two or more curves, lines ...
A in a
Bayesian network A Bayesian network (also known as a Bayes network, Bayes net, belief network, or decision network) is a probabilistic graphical model that represents a set of variables and their conditional dependencies via a directed acyclic graph (DAG). Bay ...
is the set of nodes composed of A's parents, A's children, and A's children's other parents. In a
Markov random field In the domain of physics and probability, a Markov random field (MRF), Markov network or undirected graphical model is a set of random variables having a Markov property described by an undirected graph. In other words, a random field is said to b ...
, the Markov boundary for a node is the set of its neighboring nodes. In a dependency network, the Markov boundary for a node is the set of its parents.


Uniqueness of Markov boundary

The Markov boundary always exists. Under some mild conditions, the Markov boundary is unique. However, for most practical and theoretical scenarios multiple Markov boundaries may provide alternative solutions. When there are multiple Markov boundaries, quantities measuring causal effect could fail.{{cite journal , last1=Wang , first1=Yue , last2=Wang , first2=Linbo , title=Causal inference in degenerate systems: An impossibility result , journal=Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics , date=2020 , pages=3383-3392 , url=http://proceedings.mlr.press/v108/wang20i.html


See also

* Andrey Markov * Free energy minimisation *
Moral graph In graph theory, a moral graph is used to find the equivalent undirected form of a directed acyclic graph. It is a key step of the junction tree algorithm, used in belief propagation on graphical models. The moralized counterpart of a directed a ...
*
Separation of concerns In computer science, separation of concerns is a design principle for separating a computer program into distinct sections. Each section addresses a separate '' concern'', a set of information that affects the code of a computer program. A concern ...
*
Causality Causality (also referred to as causation, or cause and effect) is influence by which one event, process, state, or object (''a'' ''cause'') contributes to the production of another event, process, state, or object (an ''effect'') where the cau ...
* Causal inference


Notes

Bayesian networks Markov networks