In
statistics
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
and
machine learning
Machine learning (ML) is a field of study in artificial intelligence concerned with the development and study of Computational statistics, statistical algorithms that can learn from data and generalise to unseen data, and thus perform Task ( ...
, a Markov blanket of a
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...
is a minimal
set
Set, The Set, SET or SETS may refer to:
Science, technology, and mathematics Mathematics
*Set (mathematics), a collection of elements
*Category of sets, the category whose objects and morphisms are sets and total functions, respectively
Electro ...
of variables that renders the variable
conditionally independent of all other variables in the system. This concept is central in
probabilistic graphical models and
feature selection
In machine learning, feature selection is the process of selecting a subset of relevant Feature (machine learning), features (variables, predictors) for use in model construction. Feature selection techniques are used for several reasons:
* sim ...
. If a Markov blanket is minimal—meaning that no variable in it can be removed without losing this conditional independence—it is called a Markov boundary. Identifying a Markov blanket or boundary allows for efficient
inference
Inferences are steps in logical reasoning, moving from premises to logical consequences; etymologically, the word '' infer'' means to "carry forward". Inference is theoretically traditionally divided into deduction and induction, a distinct ...
and helps isolate relevant variables for prediction or causal reasoning. The terms of Markov blanket and Markov boundary were coined by
Judea Pearl in 1988.
A Markov blanket may be derived from the structure of a probabilistic graphical model such as a
Bayesian network or
Markov random field.
Markov blanket
A Markov blanket of a random variable
in a random variable set
is any subset
of
, conditioned on which other variables are independent with
:
It means that
contains at least all the information one needs to infer
, where the variables in
are redundant.
In general, a given Markov blanket is not unique. Any set in
that contains a Markov blanket is also a Markov blanket itself. Specifically,
is a Markov blanket of
in
.
Example
In a
Bayesian network, the Markov blanket of a
node
In general, a node is a localized swelling (a "knot") or a point of intersection (a vertex).
Node may refer to:
In mathematics
* Vertex (graph theory), a vertex in a mathematical graph
*Vertex (geometry), a point where two or more curves, lines ...
consists of its parents, its children, and its children's other parents (i.e., co-parents). Knowing the values of these nodes makes the target node
conditionally independent of the rest of the network. In a
Markov random field, the Markov blanket of a node is simply its immediate neighbors.
Markov condition
The concept of a Markov blanket is rooted in the Markov condition, which states that in a probabilistic graphical model, each variable is conditionally independent of its non-descendants given its parents.
This condition implies the existence of a minimal separating set — the Markov blanket — that shields a variable from the rest of the network.
Markov boundary
A Markov boundary of
in
is a subset
of
, such that
itself is a Markov blanket of
, but any proper subset of
is not a Markov blanket of
. In other words, a Markov boundary is a minimal Markov blanket.
The Markov boundary of a
node
In general, a node is a localized swelling (a "knot") or a point of intersection (a vertex).
Node may refer to:
In mathematics
* Vertex (graph theory), a vertex in a mathematical graph
*Vertex (geometry), a point where two or more curves, lines ...
in a
Bayesian network is the set of nodes composed of
's parents,
's children, and
's children's other parents. In a
Markov random field, the Markov boundary for a node is the set of its neighboring nodes. In a
dependency network, the Markov boundary for a node is the set of its parents.
Uniqueness of Markov boundary
The Markov boundary always exists. Under some mild conditions, the Markov boundary is unique. However, for most practical and theoretical scenarios multiple Markov boundaries may provide alternative solutions. When there are multiple Markov boundaries, quantities measuring causal effect could fail.
[{{cite journal , last1=Wang , first1=Yue , last2=Wang , first2=Linbo , title=Causal inference in degenerate systems: An impossibility result , journal=Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics , date=2020 , pages=3383-3392 , url=http://proceedings.mlr.press/v108/wang20i.html]
See also
*
Andrey Markov
Andrey Andreyevich Markov (14 June 1856 – 20 July 1922) was a Russian mathematician best known for his work on stochastic processes. A primary subject of his research later became known as the Markov chain. He was also a strong, close to mas ...
*
Free energy minimisation
*
Moral graph
In graph theory, a moral graph is used to find the equivalent undirected form of a directed acyclic graph. It is a key step of the junction tree algorithm, used in belief propagation on graphical models.
The moralized counterpart of a directed ac ...
*
Separation of concerns
In computer science, separation of concerns (sometimes abbreviated as SoC) is a design principle for separating a computer program into distinct sections. Each section addresses a separate '' concern'', a set of information that affects the code o ...
*
Causality
*
Causal inference
Notes
Bayesian networks
Markov networks