In
experiment
An experiment is a procedure carried out to support or refute a hypothesis, or determine the efficacy or likelihood of something previously untried. Experiments provide insight into cause-and-effect by demonstrating what outcome occurs whe ...
s, a spillover is an indirect effect on a subject not directly treated by the experiment. These effects are useful for
policy analysis
Policy analysis or public policy analysis is a technique used in the public administration sub-field of political science to enable civil servants, nonprofit organizations, and others to examine and evaluate the available options to implement th ...
but complicate the
statistical analysis
Statistical inference is the process of using data analysis to infer properties of an underlying probability distribution.Upton, G., Cook, I. (2008) ''Oxford Dictionary of Statistics'', OUP. . Inferential statistical analysis infers properties of ...
of experiments.
Analysis of spillover effects involves relaxing the non-interference assumption, or
SUTVA (Stable Unit Treatment Value Assumption). This assumption requires that subject ''i''
's revelation of its
potential outcomes depends only on that subject ''i''
's own treatment status, and is unaffected by another subject ''j''
's treatment status. In ordinary settings where the researcher seeks to estimate the
average treatment effect
The average treatment effect (ATE) is a measure used to compare treatments (or interventions) in randomized experiments, evaluation of policy interventions, and medical trials. The ATE measures the difference in mean (average) outcomes between unit ...
(
), violation of the non-interference assumption means that traditional estimators for the ATE, such as difference-in-means, may be
biased. However, there are many real-world instances where a unit's revelation of potential outcomes depend on another unit's treatment assignment, and analyzing these effects may be just as important as analyzing the direct effect of treatment.
One solution to this problem is to redefine the causal
estimand An estimand is a quantity that is to be estimated in a statistical analysis. The term is used to distinguish the target of inference from the method used to obtain an approximation of this target (i.e., the estimator) and the specific value obtain ...
of interest by redefining a subject's potential outcomes in terms of one's own treatment status and related subjects' treatment status. The researcher can then analyze various estimands of interest separately. One important assumption here is that this process captures all patterns of
spillovers, and that there are no unmodeled spillovers remaining (ex. spillovers occur within a two-person household but not beyond).
Once the potential outcomes are redefined, the rest of the
statistical analysis
Statistical inference is the process of using data analysis to infer properties of an underlying probability distribution.Upton, G., Cook, I. (2008) ''Oxford Dictionary of Statistics'', OUP. . Inferential statistical analysis infers properties of ...
involves modeling the
probabilities
Probability is a branch of mathematics and statistics concerning Event (probability theory), events and numerical descriptions of how likely they are to occur. The probability of an event is a number between 0 and 1; the larger the probab ...
of being exposed to treatment given some schedule of treatment assignment, and using
inverse probability weighting
Inverse probability weighting is a statistical technique for estimating quantities related to a population other than the one from which the data was collected. Study designs with a disparate sampling population and population of target inference ( ...
(IPW) to produce unbiased (or
asymptotically
In analytic geometry, an asymptote () of a curve is a line such that the distance between the curve and the line approaches zero as one or both of the ''x'' or ''y'' coordinates tends to infinity. In projective geometry and related contexts, ...
unbiased) estimates of the estimand of interest.
Examples of spillover effects
Spillover effects can occur in a variety of different ways. Common applications include the analysis of social network spillovers and geographic spillovers. Examples include the following:
*
Communication
Communication is commonly defined as the transmission of information. Its precise definition is disputed and there are disagreements about whether Intention, unintentional or failed transmissions are included and whether communication not onl ...
: An intervention that conveys information about a
technology
Technology is the application of Conceptual model, conceptual knowledge to achieve practical goals, especially in a reproducible way. The word ''technology'' can also mean the products resulting from such efforts, including both tangible too ...
or product can influence the take-up decisions of others in their
network
Network, networking and networked may refer to:
Science and technology
* Network theory, the study of graphs as a representation of relations between discrete objects
* Network science, an academic field that studies complex networks
Mathematics
...
if it diffuses beyond the initial user.
*
Competition
Competition is a rivalry where two or more parties strive for a common goal which cannot be shared: where one's gain is the other's loss (an example of which is a zero-sum game). Competition can arise between entities such as organisms, indi ...
: Job placement assistance for young job seekers may influence the job market prospects of individuals who did not receive the training but are competing for the same jobs.
* Contagion: Receiving deworming drugs can decrease other's likelihood of contracting the disease.
*
Deterrence
Deterrence may refer to:
* Deterrence theory, a theory of war, especially regarding nuclear weapons
* Deterrence (penology), a theory of justice
* Deterrence (psychology)
Deterrence in relation to criminal offending is the idea or penology, t ...
: Information about government audits in specific municipalities can spread to nearby municipalities.
*
Displacement
Displacement may refer to:
Physical sciences
Mathematics and physics
*Displacement (geometry), is the difference between the final and initial position of a point trajectory (for instance, the center of mass of a moving object). The actual path ...
: A hotspot policing intervention that increases policing presence on a given street can lead to the displacement of crime onto nearby untreated streets.
*
Reallocation of resources: A hotspot policing intervention that increases policing presence on a given street can decrease police presence on nearby streets.
*
Social comparison: A program that randomizes individuals to receive a voucher to move to a new neighborhood can additionally influence the control group's beliefs about their housing conditions.
In such examples, treatment in a
randomized-control trial can have a direct effect on those who receive the intervention and also a spillover effect on those who were not directly treated.
Statistical issues
Estimating
spillover effect
In economics, a spillover is a positive or a negative, but more often negative, impact experienced in one region or across the world due to an independent event occurring from an unrelated environment.
For example, externalities of economic act ...
s in
experiment
An experiment is a procedure carried out to support or refute a hypothesis, or determine the efficacy or likelihood of something previously untried. Experiments provide insight into cause-and-effect by demonstrating what outcome occurs whe ...
s introduces three
statistical
Statistics (from German language, German: ', "description of a State (polity), state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a s ...
issues that researchers must take into account.
Relaxing the non-interference assumption
One key assumption for
unbiased
Bias is a disproportionate weight ''in favor of'' or ''against'' an idea or thing, usually in a way that is inaccurate, closed-minded, prejudicial, or unfair. Biases can be innate or learned. People may develop biases for or against an individ ...
inference is the non-interference assumption, which posits that an individual's potential outcomes are only revealed by their own treatment assignment and not the treatment assignment of others. This assumption has also been called the Individualistic Treatment Response or the
stable unit treatment value assumption. Non-interference is violated when subjects can
communicate
Communication is commonly defined as the transmission of information. Its precise definition is disputed and there are disagreements about whether unintentional or failed transmissions are included and whether communication not only transmit ...
with each other about their treatments, decisions, or experiences, thereby
influencing each other's potential outcomes. If the non-interference assumption does not hold, units no longer have just two potential outcomes (treated and control), but a variety of other potential outcomes that depend on other units’ treatment assignments, which complicates the
estimation
Estimation (or estimating) is the process of finding an estimate or approximation, which is a value that is usable for some purpose even if input data may be incomplete, uncertain, or unstable. The value is nonetheless usable because it is d ...
of the
average treatment effect
The average treatment effect (ATE) is a measure used to compare treatments (or interventions) in randomized experiments, evaluation of policy interventions, and medical trials. The ATE measures the difference in mean (average) outcomes between unit ...
.
Estimating spillover effects requires relaxing the non-interference assumption. This is because a unit's outcomes depend not only on its treatment assignment but also on the treatment assignment of its neighbors. The researcher must posit a set of potential outcomes that limit the type of interference. As an example, consider an
experiment
An experiment is a procedure carried out to support or refute a hypothesis, or determine the efficacy or likelihood of something previously untried. Experiments provide insight into cause-and-effect by demonstrating what outcome occurs whe ...
that sends out political information to undergraduate students to increase their political participation. If the
study population consists of all students living with a roommate in a college dormitory, one can imagine four sets of potential outcomes, depending on whether the student or their partner received the information (assume no spillover outside of each two-person room):
* ''Y''
0,0 refers to an individual's potential outcomes when they are not treated (0) and neither was their roommate (0).
* ''Y''
0,1 refers to an individual's potential outcome when they are not treated (0) but their roommate was treated (1).
* ''Y''
1,0 refers to an individual's potential outcome when they are treated (1) but their roommate was not treated (0).
* ''Y''
1,1 refers to an individual's potential outcome when they are treated (1) and their roommate was treated (1).
Now an individual's outcomes are influenced by both whether they received the treatment and whether their roommate received the treatment. We can estimate one type of
spillover effect
In economics, a spillover is a positive or a negative, but more often negative, impact experienced in one region or across the world due to an independent event occurring from an unrelated environment.
For example, externalities of economic act ...
by looking at how one's outcomes change depending on whether their roommate received the treatment or not, given the individual did not receive treatment directly. This would be captured by the difference Y
0,1- Y
0,0. Similarly, we can measure how ones’ outcomes change depending on their roommate's treatment status, when the individual themselves are treated. This amounts to taking the difference Y
1,1- Y
1,0.
While researchers typically embrace
experiment
An experiment is a procedure carried out to support or refute a hypothesis, or determine the efficacy or likelihood of something previously untried. Experiments provide insight into cause-and-effect by demonstrating what outcome occurs whe ...
s because they require less demanding assumptions,
spillovers can be “unlimited in extent and impossible to specify in form.” The researcher must make specific assumptions about which types of spillovers are operative. One can relax the non-interference assumption in various ways depending on how spillovers are thought to occur in a given setting. One way to model spillover effects is a
binary
Binary may refer to:
Science and technology Mathematics
* Binary number, a representation of numbers using only two values (0 and 1) for each digit
* Binary function, a function that takes two arguments
* Binary operation, a mathematical op ...
indicator for whether an immediate neighbor was also treated, as in the example above. One can also posit spillover effects that depend on the number of immediate neighbors that were also treated, also known as k-level effects.
Using randomization inference for hypothesis testing
In experimental settings where treatment is randomized, we can use
randomization inference to test for the existence of spillover effects. The key advantage of this approach is that randomization inference is finite-sample valid, without requiring correct model specification or normal asymptotics. To be specific, consider the aforementioned example experiment in college dorm rooms, and suppose we want to test:
This hypothesis posits that there is no spillover effect on students who don't receive the information (i.e., students who are in control in the experiment). Rejecting this hypothesis implies that even when students don't receive the information message directly, they still may receive it indirectly from treated roommates; hence, there is a spillover effect.
To test a hypothesis like
we can apply a conditional Fisher randomization test. Let
be an indicator denoting that students
are roommates, where we assumed for simplicity that each student has exactly one roommate. Suppose this is a
completely randomized design In the design of experiments, completely randomized designs are for studying the effects of one primary factor without the need to take other nuisance variables into account. This article describes completely randomized designs that have one primary ...
and let
denote the binary treatment of student
. Then:
# Define
and
.
# Calculate an estimate of the spillover effect:
. This is the
test statistic
Test statistic is a quantity derived from the sample for statistical hypothesis testing.Berger, R. L.; Casella, G. (2001). ''Statistical Inference'', Duxbury Press, Second Edition (p.374) A hypothesis test is typically specified in terms of a tes ...
.
# For
## Randomly shuffle units between
producing new randomized sets
akin to the
permutation test
A permutation test (also called re-randomization test or shuffle test) is an exact statistical hypothesis test.
A permutation test involves two or more samples. The (possibly counterfactual) null hypothesis is that all samples come from the same ...
.
## Recalculate the test statistic
.
# Calculate the randomization p-value:
To explain this procedure, in Step 1, we define the sub-populations of interest:
is the set of students who are in control but their roommate is treated,
and
are the students in control with their roommates also in control. These are known as "focal units".
In Step 2, we define an estimate of the spillover effect as
, the difference in outcomes between populations
.
Crucially, in randomization inference, we don't need to derive the
sampling distribution
In statistics, a sampling distribution or finite-sample distribution is the probability distribution of a given random-sample-based statistic. For an arbitrarily large number of samples where each sample, involving multiple observations (data poi ...
of this estimator. The validity of the procedure stems from Step 3 where we resample treatment according to the true experimental variation (here, simply permuting the "exposures" 01 and 00) while keeping the outcomes fixed under the null. Finally, in Step 4 we calculate the randomization
p-value
In null-hypothesis significance testing, the ''p''-value is the probability of obtaining test results at least as extreme as the result actually observed, under the assumption that the null hypothesis is correct. A very small ''p''-value means ...
. The
term is a finite-sample correction to avoid issues with repeated test statistic values.
As mentioned before, the randomization p-value is valid for any finite sample size and does not rely on correct model specification.
This randomization procedure can be extended to arbitrary designs and more general definitions of spillover effects, although care must be taken to properly account for the interference structure between all pairs of units.
The above procedure can also be used to obtain an interval estimate of a constant spillover effect through test inversion. Moreover, the same procedure could be modified for testing whether the "average" spillover effect is zero by using an appropriately studentized test statistic in Step 2.
Exposure mappings

The next step after redefining the causal estimand of interest is to characterize the probability of spillover exposure for each subject in the analysis, given some vector of treatment assignment. Aronow and Samii (2017) present a method for obtaining a matrix of exposure probabilities for each unit in the analysis.
First, define a diagonal matrix with a vector of treatment assignment probabilities
Second, define an indicator matrix
of whether the unit is exposed to spillover or not. This is done by using an
adjacency matrix
In graph theory and computer science, an adjacency matrix is a square matrix used to represent a finite graph (discrete mathematics), graph. The elements of the matrix (mathematics), matrix indicate whether pairs of Vertex (graph theory), vertices ...
as shown on the right, where information regarding a network can be transformed into an indicator matrix. This resulting indicator matrix will contain values of
, the realized values of a random binary variable
, indicating whether that unit has been exposed to spillover or not.
Third, obtain the
sandwich product , an ''N'' × ''N'' matrix which contains two elements: the individual probability of exposure
on the diagonal, and the joint exposure probabilities
on the off diagonals:
: