Pseudorandom Graph
   HOME

TheInfoList



OR:

In
graph theory In mathematics, graph theory is the study of ''graphs'', which are mathematical structures used to model pairwise relations between objects. A graph in this context is made up of '' vertices'' (also called ''nodes'' or ''points'') which are conne ...
, a graph is said to be a pseudorandom graph if it obeys certain properties that
random graphs In mathematics, random graph is the general term to refer to probability distributions over graphs. Random graphs may be described simply by a probability distribution, or by a random process which generates them. The theory of random graphs li ...
obey
with high probability In mathematics, an event that occurs with high probability (often shortened to w.h.p. or WHP) is one whose probability depends on a certain number ''n'' and goes to 1 as ''n'' goes to infinity, i.e. the probability of the event occurring can be ma ...
. There is no concrete definition of graph pseudorandomness, but there are many reasonable characterizations of pseudorandomness one can consider. Pseudorandom properties were first formally considered by Andrew Thomason in 1987. He defined a condition called "jumbledness": a graph G=(V,E) is said to be (p,\alpha)-''jumbled'' for real p and \alpha with 0 if :\left, e(U)-p\binom\\leq \alpha, U, for every subset U of the vertex set V, where e(U) is the number of edges among U (equivalently, the number of edges in the subgraph induced by the vertex set U). It can be shown that the Erdős–Rényi random graph G(n,p) is
almost surely In probability theory, an event is said to happen almost surely (sometimes abbreviated as a.s.) if it happens with probability 1 (or Lebesgue measure 1). In other words, the set of possible exceptions may be non-empty, but it has probability 0. ...
(p,O(\sqrt))-jumbled. However, graphs with less uniformly distributed edges, for example a graph on 2n vertices consisting of an n-vertex
complete graph In the mathematical field of graph theory, a complete graph is a simple undirected graph in which every pair of distinct vertices is connected by a unique edge. A complete digraph is a directed graph in which every pair of distinct vertices is c ...
and n completely independent vertices, are not (p,\alpha)-jumbled for any small \alpha, making jumbledness a reasonable quantifier for "random-like" properties of a graph's edge distribution.


Connection to local conditions

Thomason showed that the "jumbled" condition is implied by a simpler-to-check condition, only depending on the codegree of two vertices and not every subset of the vertex set of the graph. Letting \operatorname(u,v) be the number of common neighbors of two vertices u and v, Thomason showed that, given a graph G on n vertices with minimum degree np, if \operatorname(u,v)\leq np^2+\ell for every u and v, then G is \left( p,\sqrt\,\right) -jumbled. This result shows how to check the jumbledness condition algorithmically in
polynomial time In computer science, the time complexity is the computational complexity that describes the amount of computer time it takes to run an algorithm. Time complexity is commonly estimated by counting the number of elementary operations performed by ...
in the number of vertices, and can be used to show pseudorandomness of specific graphs.


Chung–Graham–Wilson theorem

In the spirit of the conditions considered by Thomason and their alternately global and local nature, several weaker conditions were considered by Chung, Graham, and Wilson in 1989: a graph G on n vertices with edge density p and some \varepsilon>0 can satisfy each of these conditions if * Discrepancy: for any subsets X,Y of the vertex set V=V(G), the number of edges between X and Y is within \varepsilon n^2 of p, X, , Y, . * Discrepancy on individual sets: for any subset X of V, the number of edges among X is within \varepsilon n^2 of p\binom. * Subgraph counting: for every graph H, the number of labeled copies of H among the subgraphs of G is within \varepsilon n^ of p^n^. * 4-cycle counting: the number of labeled 4-cycles among the subgraphs of G is within \varepsilon n^4 of p^4n^4. * Codegree: letting \operatorname(u,v) be the number of common neighbors of two vertices u and v, :\sum_\big, \operatorname(u,v)-p^2 n\big, \leq \varepsilon n^3. * Eigenvalue bounding: If \lambda_1\geq \lambda_2\geq \cdots \geq \lambda_n are the eigenvalues of the
adjacency matrix In graph theory and computer science, an adjacency matrix is a square matrix used to represent a finite graph. The elements of the matrix indicate whether pairs of vertices are adjacent or not in the graph. In the special case of a finite simp ...
of G, then \lambda_1 is within \varepsilon n of pn and \max\left(\left, \lambda_2\,\left, \lambda_n\\right)\leq \varepsilon n. These conditions may all be stated in terms of a sequence of graphs \ where G_n is on n vertices with (p+o(1))\binom edges. For example, the 4-cycle counting condition becomes that the number of copies of any graph H in G_n is \left(p^+o(1)\right)e^ as n\to\infty, and the discrepancy condition becomes that \left, e(X,Y)-p, X, , Y, \=o(n^2), using
little-o notation Big ''O'' notation is a mathematical notation that describes the limiting behavior of a function when the argument tends towards a particular value or infinity. Big O is a member of a family of notations invented by Paul Bachmann, Edmund Land ...
. A pivotal result about graph pseudorandomness is the Chung–Graham–Wilson theorem, which states that many of the above conditions are equivalent, up to polynomial changes in \varepsilon. A sequence of graphs which satisfies those conditions is called quasi-random. It is considered particularly surprising that the weak condition of having the "correct" 4-cycle density implies the other seemingly much stronger pseudorandomness conditions. Graphs such as the 4-cycle, the density of which in a sequence of graphs is sufficient to test the quasi-randomness of the sequence, are known as forcing graphs. Some implications in the Chung–Graham–Wilson theorem are clear by the definitions of the conditions: the discrepancy on individual sets condition is simply the special case of the discrepancy condition for Y=X, and 4-cycle counting is a special case of subgraph counting. In addition, the graph counting lemma, a straightforward generalization of the triangle counting lemma, implies that the discrepancy condition implies subgraph counting. The fact that 4-cycle counting implies the codegree condition can be proven by a technique similar to the second-moment method. Firstly, the sum of codegrees can be upper-bounded: :\sum_ \operatorname(u,v)=\sum_ \deg(x)^2\ge n\left(\frac\right)^2=\left(p^2+o(1)\right)n^3. Given 4-cycles, the sum of squares of codegrees is bounded: : \sum_ \operatorname(u,v)^2=\textC_4 + o(n^4)\le \left(p^4+o(1)\right)n^4. Therefore, the
Cauchy–Schwarz inequality The Cauchy–Schwarz inequality (also called Cauchy–Bunyakovsky–Schwarz inequality) is considered one of the most important and widely used inequalities in mathematics. The inequality for sums was published by . The corresponding inequality fo ...
gives :\sum_, \operatorname(u,v)-p^2n, \le n\left(\sum_ \left(\operatorname(u,v)-p^2n\right)^2\right)^, which can be expanded out using our bounds on the first and second moments of \operatorname to give the desired bound. A proof that the codegree condition implies the discrepancy condition can be done by a similar, albeit trickier, computation involving the Cauchy–Schwarz inequality. The eigenvalue condition and the 4-cycle condition can be related by noting that the number of labeled 4-cycles in G is, up to o(1) stemming from degenerate 4-cycles, \operatorname\left(A_G^4\right), where A_G is the adjacency matrix of G. The two conditions can then be shown to be equivalent by invocation of the Courant–Fischer theorem.


Connections to graph regularity

The concept of graphs that act like random graphs connects strongly to the concept of graph regularity used in the
Szemerédi regularity lemma Szemerédi's regularity lemma is one of the most powerful tools in extremal graph theory, particularly in the study of large dense graphs. It states that the vertices of every large enough graph can be partitioned into a bounded number of parts so ...
. For \varepsilon>0, a pair of vertex sets X,Y is called \varepsilon-regular, if for all subsets A\subset X,B\subset Y satisfying , A, \geq\varepsilon, X, ,, B, \geq\varepsilon, Y, , it holds that :\left, d(X,Y) - d(A,B) \ \le \varepsilon, where d(X,Y) denotes the ''edge density'' between X and Y: the number of edges between X and Y divided by , X, , Y, . This condition implies a bipartite analogue of the discrepancy condition, and essentially states that the edges between A and B behave in a "random-like" fashion. In addition, it was shown by
Miklós Simonovits Miklós Simonovits (4 September 1943 in Budapest) is a Hungarian mathematician who currently works at the Rényi Institute of Mathematics in Budapest and is a member of the Hungarian Academy of Sciences. He is on the advisory board of the journ ...
and
Vera T. Sós Vera T. Sós (born September 11, 1930) is a Hungarian mathematician, specializing in number theory and combinatorics. She was a student and close collaborator of both Paul Erdős and Alfréd Rényi. She also collaborated frequently with her h ...
in 1991 that a graph satisfies the above weak pseudorandomness conditions used in the Chung–Graham–Wilson theorem if and only if it possesses a Szemerédi partition where nearly all densities are close to the edge density of the whole graph.


Sparse pseudorandomness


Chung–Graham–Wilson theorem analogues

The Chung–Graham–Wilson theorem, specifically the implication of subgraph counting from discrepancy, does not follow for sequences of graphs with edge density approaching 0, or, for example, the common case of d- regular graphs on n vertices as n\to\infty. The following sparse analogues of the discrepancy and eigenvalue bounding conditions are commonly considered: * Sparse discrepancy: for any subsets X,Y of the vertex set V=V(G), the number of edges between X and Y is within \varepsilon dn of \frac, X, , Y, . * Sparse eigenvalue bounding: If \lambda_1\geq \lambda_2\geq \cdots \geq \lambda_n are the eigenvalues of the
adjacency matrix In graph theory and computer science, an adjacency matrix is a square matrix used to represent a finite graph. The elements of the matrix indicate whether pairs of vertices are adjacent or not in the graph. In the special case of a finite simp ...
of G, then \max\left(\left, \lambda_2\,\left, \lambda_n\\right)\leq \varepsilon d. It is generally true that this eigenvalue condition implies the corresponding discrepancy condition, but the reverse is not true: the disjoint union of a random large d-regular graph and a d+1-vertex complete graph has two eigenvalues of exactly d but is likely to satisfy the discrepancy property. However, as proven by David Conlon and Yufei Zhao in 2017, slight variants of the discrepancy and eigenvalue conditions for d-regular
Cayley graph In mathematics, a Cayley graph, also known as a Cayley color graph, Cayley diagram, group diagram, or color group is a graph that encodes the abstract structure of a group. Its definition is suggested by Cayley's theorem (named after Arthur Cayle ...
s are equivalent up to linear scaling in \varepsilon. One direction of this follows from the
expander mixing lemma The expander mixing lemma intuitively states that the edges of certain d-regular graphs are evenly distributed throughout the graph. In particular, the number of edges between two vertex subsets S and T is always close to the expected number of edge ...
, while the other requires the assumption that the graph is a Cayley graph and uses the
Grothendieck inequality In mathematics, the Grothendieck inequality states that there is a universal constant K_G with the following property. If ''M'ij'' is an ''n'' × ''n'' (real or complex) matrix with : \Big, \sum_ M_ s_i t_j \Big, \le 1 for all (real ...
.


Consequences of eigenvalue bounding

A d-regular graph G on n vertices is called an ''(n,d,\lambda)-graph'' if, letting the eigenvalues of the adjacency matrix of G be d=\lambda_1\geq \lambda_2\geq \cdots \geq \lambda_n, \max\left(\left, \lambda_2\,\left, \lambda_n\\right)\leq \lambda. The Alon-Boppana bound gives that \max\left(\left, \lambda_2\,\left, \lambda_n\\right)\geq 2\sqrt-o(1) (where the o(1) term is as n\to\infty), and Joel Friedman proved that a random d-regular graph on n vertices is (n,d,\lambda) for \lambda=2\sqrt+o(1). In this sense, how much \lambda exceeds 2\sqrt is a general measure of the non-randomness of a graph. There are graphs with \lambda\leq 2\sqrt, which are termed
Ramanujan graphs In the mathematical field of spectral graph theory, a Ramanujan graph is a regular graph whose spectral gap is almost as large as possible (see extremal graph theory). Such graphs are excellent spectral expanders. AMurty's survey papernotes, Raman ...
. They have been studied extensively and there are a number of open problems relating to their existence and commonness. Given an (n,d,\lambda) graph for small \lambda, many standard graph-theoretic quantities can be bounded to near what one would expect from a random graph. In particular, the size of \lambda has a direct effect on subset edge density discrepancies via the expander mixing lemma. Other examples are as follows, letting G be an (n,d,\lambda) graph: * If d\leq \frac, the vertex-connectivity \kappa(G) of G satisfies \kappa(G)\geq d-\frac. * If \lambda\leq d-2, G is d edge-connected. If n is even, G contains a perfect matching. * The
maximum cut For a graph, a maximum cut is a cut whose size is at least the size of any other cut. That is, it is a partition of the graph's vertices into two complementary sets and , such that the number of edges between and is as large as possible. Fin ...
of G is at most \frac. * The largest independent subset of a subset U\subset V(G) in G is of size at least \frac\ln\left(\frac+1\right). * The
chromatic number In graph theory, graph coloring is a special case of graph labeling; it is an assignment of labels traditionally called "colors" to elements of a graph subject to certain constraints. In its simplest form, it is a way of coloring the vertices o ...
of G is at most \frac.


Connections to the Green–Tao theorem

Pseudorandom graphs factor prominently in the proof of the
Green–Tao theorem In number theory, the Green–Tao theorem, proved by Ben Green and Terence Tao in 2004, states that the sequence of prime numbers contains arbitrarily long arithmetic progressions. In other words, for every natural number ''k'', there exist arith ...
. The theorem is proven by transferring
Szemerédi's theorem In arithmetic combinatorics, Szemerédi's theorem is a result concerning arithmetic progressions in subsets of the integers. In 1936, Erdős and Turán conjectured that every set of integers ''A'' with positive natural density contains a ''k''-ter ...
, the statement that a set of positive integers with positive
natural density In number theory, natural density (also referred to as asymptotic density or arithmetic density) is one method to measure how "large" a subset of the set of natural numbers is. It relies chiefly on the probability of encountering members of the de ...
contains arbitrarily long arithmetic progressions, to the sparse setting (as the primes have natural density 0 in the integers). The transference to sparse sets requires that the sets behave pseudorandomly, in the sense that corresponding graphs and hypergraphs have the correct subgraph densities for some fixed set of small (hyper)subgraphs. It is then shown that a suitable superset of the prime numbers, called pseudoprimes, in which the primes are dense obeys these pseudorandomness conditions, completing the proof.


References

{{Reflist Graph theory