In the
mathematical
Mathematics is an area of knowledge that includes the topics of numbers, formulas and related structures, shapes and the spaces in which they are contained, and quantities and their changes. These topics are represented in modern mathematics ...
discipline of
graph theory
In mathematics, graph theory is the study of '' graphs'', which are mathematical structures used to model pairwise relations between objects. A graph in this context is made up of '' vertices'' (also called ''nodes'' or ''points'') which are conn ...
, the expander walk sampling theorem intuitively states that
sampling vertices in an
expander graph
In graph theory, an expander graph is a sparse graph that has strong connectivity properties, quantified using vertex, edge or spectral expansion. Expander constructions have spawned research in pure and applied mathematics, with several appli ...
by doing relatively short
random walk
In mathematics, a random walk is a random process that describes a path that consists of a succession of random steps on some mathematical space.
An elementary example of a random walk is the random walk on the integer number line \mathbb ...
can simulate sampling the vertices
independently from a
uniform distribution.
The earliest version of this theorem is due to , and the more general version is typically attributed to .
Statement
Let
be an n-vertex
expander graph
In graph theory, an expander graph is a sparse graph that has strong connectivity properties, quantified using vertex, edge or spectral expansion. Expander constructions have spawned research in pure and applied mathematics, with several appli ...
with positively weighted edges, and let
. Let
denote the
stochastic matrix
In mathematics, a stochastic matrix is a square matrix used to describe the transitions of a Markov chain. Each of its entries is a nonnegative real number representing a probability. It is also called a probability matrix, transition matrix, ...
of the graph, and let
be the second largest
eigenvalue
In linear algebra, an eigenvector () or characteristic vector of a linear transformation is a nonzero vector that changes at most by a scalar factor when that linear transformation is applied to it. The corresponding eigenvalue, often denot ...
of
. Let
denote the vertices encountered in a
-step random walk on
starting at vertex
, and let
. Where
(It is well known
that almost all trajectories
converges to some limiting point,
, as
.)
The theorem states that for a weighted graph
and a random walk
where
is chosen by an initial distribution
, for all
, we have the following bound:
:
Where
is dependant on
and
.
The theorem gives a bound for the rate of convergence to
with respect to the length of the random walk, hence giving a more efficient method to estimate
compared to independent sampling the vertices of
.
Proof
In order to prove the theorem, we provide a few definitions followed by three lemmas.
Let
be the weight of the edge
and let
Denote by
. Let
be the matrix with entries
, and let
.
Let
and
. Let
where
is the stochastic matrix,
and
. Then:
:
Where
. As
and
are symmetric, they have real eigenvalues. Therefore, as the eigenvalues of
and
are equal, the eigenvalues of
are real. Let
and
be the first and second largest eigenvalue of
respectively.
For convenience of notation, let
,
,
, and let
be the all-1 vector.
Lemma 1
Proof:
By
Markov’s inequality,
Where
is the expectation of
chosen according to the probability distribution
. As this can be interpreted by summing over all possible trajectories
, hence:
Combining the two results proves the lemma.
Lemma 2
For
,
:
Proof:
As eigenvalues of
and
are equal,
Lemma 3
If
is a real number such that
,
:
Proof summary:
We Taylor expand
about point
to get:
:
Where
are first and second derivatives of
at
. We show that
We then prove that (i)
by matrix manipulation, and then prove (ii)
using (i) and Cauchy’s estimate from complex analysis.
The results combine to show that
:
:A line to line proof can be found in Gilman (199
Proof of theorem
Combining lemma 2 and lemma 3, we get that
:
Interpreting the exponent on the right hand side of the inequality as a quadratic in
and minimising the expression, we see that
:
A similar bound
:
holds, hence setting
gives the desired result.
Uses
This theorem is useful in randomness reduction in the study of
derandomization
A randomized algorithm is an algorithm that employs a degree of randomness as part of its logic or procedure. The algorithm typically uses uniformly random bits as an auxiliary input to guide its behavior, in the hope of achieving good performan ...
. Sampling from an expander walk is an example of a randomness-efficient
sampler. Note that the number of
bit
The bit is the most basic unit of information in computing and digital communications. The name is a portmanteau of binary digit. The bit represents a logical state with one of two possible values. These values are most commonly represented a ...
s used in sampling
independent samples from
is
, whereas if we sample from an infinite family of constant-degree expanders this costs only
. Such families exist and are efficiently constructible, e.g. the
Ramanujan graph In the mathematical field of spectral graph theory, a Ramanujan graph is a regular graph whose spectral gap is almost as large as possible (see extremal graph theory). Such graphs are excellent spectral expanders. AMurty's survey papernotes, Ramanu ...
s of
Lubotzky-Phillips-
Sarnak.
References
*
*
*
{{refend
Sampling (statistics)