In statistics, ignorability is a feature of an experiment design whereby the method of data collection (and the nature of missing data) does not depend on the missing data. A missing data mechanism such as a treatment assignment or survey sampling strategy is "ignorable" if the missing data matrix, which indicates which variables are observed or missing, is independent of the missing data conditional on the observed data. This idea is part of the Rubin Causal Inference Model, developed by Donald Rubin in collaboration with Paul Rosenbaum in the early 1970s. The exact definition differs between their articles from that period. In a 1978 article, Rubin discusses ''ignorable assignment mechanisms'', which can be understood to mean that the way individuals are assigned to treatment groups is irrelevant for the data analysis, given everything that is recorded about each individual. Later, in 1983, Rubin and Rosenbaum instead define ''strongly ignorable treatment assignment'', a stronger condition, mathematically formulated as

$$(r_1, r_0) \perp\!\!\!\perp z \mid v, \quad 0 < \operatorname{pr}(z = 1 \mid v) < 1 \quad \forall v,$$

where $r_t$ is the potential outcome under treatment $t$, $v$ is a vector of covariates, and $z$
is the actual treatment. Pearl devised a simple graphical criterion, called ''back-door'', that entails ignorability and identifies sets of covariates that achieve this condition.

Ignorability means we can ignore how one ended up in one group versus the other (‘treated’, $Tx = 1$, or ‘control’, $Tx = 0$) when it comes to the potential outcome (say $Y$). It has also been called unconfoundedness, selection on the observables, or no omitted variable bias. Formally it has been written as $(Y_i^1, Y_i^0) \perp Tx_i$, or in words: the potential outcome $Y$ of person $i$, had they been treated or not, does not depend on whether they have actually (observably) been treated or not. In other words, we can ignore how people ended up in one condition versus the other, and treat their potential outcomes as exchangeable. While this seems dense, it becomes clear if we add subscripts for the ‘realized’ and superscripts for the ‘ideal’ (potential) worlds (notation suggested by David Freedman).
So: $Y_1^1$ / $^*Y_0^1$ are the potential $Y$ outcomes had the person been treated (superscript $^1$), when in reality they have actually been treated ($Y_1^1$, subscript $_1$) or not ($^*Y_0^1$; the $^*$ signals that this quantity can never be realized or observed, or is ''fully'' contrary-to-fact or counterfactual, CF). Similarly, $^*Y_1^0$ / $Y_0^0$ are the potential $Y$ outcomes had the person not been treated (superscript $^0$), when in reality they have been treated ($^*Y_1^0$, subscript $_1$) or not ($Y_0^0$).

Only one of each potential outcome (PO) can be realized, the other cannot, for the same assignment to condition, so when we try to estimate treatment effects, we need something to replace the fully contrary-to-fact ones with observables (or estimate them). When ignorability/exogeneity holds, as when people are randomized to be treated or not, we can ‘replace’ $^*Y_0^1$ with its observable counterpart $Y_1^1$, and $^*Y_1^0$ with its observable counterpart $Y_0^0$, not at the level of the individual $Y_i$, but when it comes to averages like $E[Y_i^1 - Y_i^0]$, which is exactly the causal treatment effect (TE) one tries to recover.

Because of the ‘consistency rule’, the potential outcomes are the values actually realized, so we can write $Y_i^0 = Y_{i0}^0$ and $Y_i^1 = Y_{i1}^1$ (“the consistency rule states that an individual’s potential outcome under a hypothetical condition that happened to materialize is precisely the outcome experienced by that individual”, p. 872). Hence

$$TE = E[Y_i^1 - Y_i^0] = E[Y_{i1}^1 - Y_{i0}^0].$$

Now, by simply adding and subtracting the same fully counterfactual quantity ${}^*Y_{i1}^0$ we get

$$E[Y_{i1}^1 - Y_{i0}^0] = E[Y_{i1}^1 - {}^*Y_{i1}^0 + {}^*Y_{i1}^0 - Y_{i0}^0] = E[Y_{i1}^1 - {}^*Y_{i1}^0] + E[{}^*Y_{i1}^0 - Y_{i0}^0] = ATT + \text{selection bias},$$

where ATT is the average treatment effect on the treated and the second term is the bias introduced when people have the choice to belong to either the ‘treated’ or the ‘control’ group. Ignorability, either plain or conditional on some other variables, implies that such selection bias can be ignored, so one can recover (or estimate) the causal effect.
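The decomposition of the naive treated-versus-control comparison into ATT plus selection bias can be checked numerically. The following is a minimal sketch, not taken from the source: the data-generating process, the constant treatment effect of 2, the self-selection rule, and the function name `simulate` are all illustrative assumptions. Because it is a simulation, both potential outcomes are known for every unit, including the fully counterfactual one that could never be observed in real data.

```python
import random
from statistics import fmean


def simulate(n=20000, randomized=False, seed=0):
    """Return (naive difference, ATT, selection bias) for a toy population.

    Hypothetical setup: an unobserved trait u raises the untreated outcome.
    If randomized is False, units with u > 0 select into treatment, so
    assignment depends on the potential outcomes and ignorability fails.
    """
    rng = random.Random(seed)
    treated_y1, treated_y0, control_y0 = [], [], []
    for _ in range(n):
        u = rng.gauss(0, 1)            # unobserved trait
        y0 = u + rng.gauss(0, 1)       # potential outcome without treatment
        y1 = y0 + 2.0                  # potential outcome with treatment
        if randomized:
            treated = rng.random() < 0.5   # coin flip, independent of (y1, y0)
        else:
            treated = u > 0                # self-selection on the trait
        if treated:
            treated_y1.append(y1)      # realized outcome of a treated unit
            treated_y0.append(y0)      # counterfactual, known only in simulation
        else:
            control_y0.append(y0)      # realized outcome of a control unit
    naive = fmean(treated_y1) - fmean(control_y0)   # observable comparison
    att = fmean(treated_y1) - fmean(treated_y0)     # effect on the treated
    bias = fmean(treated_y0) - fmean(control_y0)    # selection bias term
    return naive, att, bias


if __name__ == "__main__":
    for flag in (False, True):
        naive, att, bias = simulate(randomized=flag)
        print(f"randomized={flag}: naive={naive:.3f} ATT={att:.3f} bias={bias:.3f}")
```

Under self-selection the naive difference overstates the effect by the bias term, while under randomized assignment the bias term is close to zero and the naive difference approximately recovers the effect, which is what ignorability is meant to guarantee.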
