statistics Statistics (from German language, German: ''wikt:Statistik#German, Statistik'', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of ...

, the Cochran–Mantel–Haenszel test (CMH) is a test used in the analysis of

stratified Stratification may refer to: Mathematics * Stratification (mathematics), any consistent assignment of numbers to predicate symbols * Data stratification in statistics Earth sciences * Stable and unstable stratification * Stratification, or st ...

or matched categorical data. It allows an investigator to test the association between a binary predictor or treatment and a binary outcome such as case or control status while taking into account the stratification. Unlike the

McNemar test In statistics, McNemar's test is a statistical test used on paired nominal data. It is applied to 2 × 2 contingency tables with a dichotomous trait, with matched pairs of subjects, to determine whether the row and column marginal frequ ...

which can only handle pairs, the CMH test handles arbitrary strata size. It is named after

William G. Cochran William Gemmell Cochran (15 July 1909 – 29 March 1980) was a prominent statistician. He was born in Scotland but spent most of his life in the United States. Cochran studied mathematics at the University of Glasgow and the University of Cam ...

Nathan Mantel Nathan Mantel (February 16, 1919 – May 25, 2002) was an American biostatistician best known for his work with William Haenszel, which led to the Mantel–Haenszel test and its associated estimate, the Mantel–Haenszel odds ratio. The Mantel–H ...

and William Haenszel. Extensions of this test to a categorical response and/or to several groups are commonly called Cochran–Mantel–Haenszel statistics. It is often used in observational studies where random assignment of subjects to different treatments cannot be controlled, but

confounding In statistics, a confounder (also confounding variable, confounding factor, extraneous determinant or lurking variable) is a variable that influences both the dependent variable and independent variable, causing a spurious association. Con ...

covariates can be measured.

Definition

We consider a binary outcome variable such as case status (e.g. lung cancer) and a binary predictor such as treatment status (e.g. smoking). The observations are grouped in strata. The stratified data are summarized in a series of 2 × 2 contingency tables, one for each stratum. The ''i''-th such contingency table is: The common odds-ratio of the K contingency tables is defined as: :

R = ,

The null hypothesis is that there is no association between the treatment and the outcome. More precisely, the null hypothesis is

H_0: R=1

and the alternative hypothesis is

H_1: R\ne 1

. The test statistic is: :

\xi_ = .

It follows a

\chi^2

distribution asymptotically with 1 df under the null hypothesis.

Subset stability

The standard odds- or

risk ratio The relative risk (RR) or risk ratio is the ratio of the probability of an outcome in an exposed group to the probability of an outcome in an unexposed group. Together with risk difference and odds ratio, relative risk measures the association bet ...

of all strata could be calculated, giving risk ratios

r_1, r_2, \dots, r_n

, where

n

is the number of strata. If the stratification were removed, there would be one aggregate risk ratio of the collapsed table; let this be

R

. One generally expects the risk of an event unconditional on the stratification to be bounded between the highest and lowest risk within the strata (or identically with odds ratios). It is easy to construct examples where this is not the case, and

R

is larger or smaller than all of

r_i

for

i\in 1,\dots, n

. This is comparable but not identical to

Simpson's paradox Simpson's paradox is a phenomenon in probability and statistics in which a trend appears in several groups of data but disappears or reverses when the groups are combined. This result is often encountered in social-science and medical-science st ...

, and as with Simpson's paradox, it is difficult to interpret the statistic and decide policy based upon it. Klemens defines a statistic to be ''subset stable'' iff

R

is bounded between

\min(r_i)

and

\max(r_i)

, and a ''well-behaved'' statistic as being infinitely differentiable and not dependent on the order of the strata. Then the CMH statistic is the unique well-behaved statistic satisfying subset stability.

Related tests

* The

can only handle pairs. The CMH test is a generalization of the

as their test statistics are identical when each stratum shows a pair. * Conditional logistic regression is more general than the CMH test as it can handle continuous variable and perform multivariate analysis. When the CMH test can be applied, the CMH test statistic and the

score test In statistics, the score test assesses constraints on statistical parameters based on the gradient of the likelihood function—known as the ''score''—evaluated at the hypothesized parameter value under the null hypothesis. Intuitively, if the ...

statistic of the conditional logistic regression are identical. * Breslow-Day test for homogeneous association. The CMH test supposes that the effect of the treatment is homogeneous in all strata. The Breslow-Day test allows to test this assumption. This is not a concern if the strata are small e.g. pairs.

Notes

External links

Introduction to the Cochran-Mantel-Haenszel Test
{{DEFAULTSORT:Cochran-Mantel-Haenszel statistics Statistical tests for contingency tables Epidemiology Medical statistics Bayesian statistics Summary statistics for contingency tables