Discrepancy Of Hypergraphs
   HOME

TheInfoList



OR:

Discrepancy of hypergraphs is an area of
discrepancy theory In mathematics, discrepancy theory describes the deviation of a situation from the state one would like it to be in. It is also called the theory of irregularities of distribution. This refers to the theme of ''classical'' discrepancy theory, name ...
.


Definitions

In the classical setting, we aim at partitioning the vertices of a
hypergraph In mathematics, a hypergraph is a generalization of a graph in which an edge can join any number of vertices. In contrast, in an ordinary graph, an edge connects exactly two vertices. Formally, an undirected hypergraph H is a pair H = (X,E) w ...
\mathcal=(V, \mathcal) into two classes in such a way that ideally each hyperedge contains the same number of vertices in both classes. A partition into two classes can be represented by a coloring \chi \colon V \rightarrow \. We call −1 and +1 ''colors''. The color-classes \chi^(-1) and \chi^(+1) form the corresponding partition. For a hyperedge E \in \mathcal, set :\chi(E) := \sum_ \chi(v). The ''discrepancy of \mathcal with respect to \chi'' and the ''discrepancy of \mathcal'' are defined by :\operatorname(\mathcal,\chi) := \; \max_ , \chi(E), , :\operatorname(\mathcal) := \min_ \operatorname(\mathcal, \chi). These notions as well as the term 'discrepancy' seem to have appeared for the first time in a paper of Beck.J. Beck: "Roth's estimate of the discrepancy of integer sequences is nearly sharp", page 319-325.
Combinatorica ''Combinatorica'' is an international journal of mathematics, publishing papers in the fields of combinatorics and computer science. It started in 1981, with László Babai and László Lovász as the editors-in-chief with Paul Erdős as honora ...
, 1, 1981
Earlier results on this problem include the famous lower bound on the discrepancy of arithmetic progressions by Roth and upper bounds for this problem and other results by Erdős and Spencer and Sárközi (described on p. 39). At that time, discrepancy problems were called quasi-
Ramsey Ramsey may refer to: Geography British Isles * Ramsey, Cambridgeshire, a small market town in England * Ramsey, Essex, a village near Harwich, England ** Ramsey and Parkeston, a civil parish formerly called just "Ramsey" * Ramsey, Isle of Man, t ...
problems.


Examples

To get some intuition for this concept, let's have a look at a few examples. * If all edges of \mathcal intersect trivially, i.e. E_1 \cap E_2 = \varnothing for any two distinct edges E_1, E_2 \in \mathcal, then the discrepancy is zero, if all edges have even cardinality, and one, if there is an odd cardinality edge. * The other extreme is marked by the ''complete hypergraph'' (V, 2^V). In this case the discrepancy is \lceil \frac , V, \rceil. Any 2-coloring will have a color class of at least this size, and this set is also an edge. On the other hand, any coloring \chi with color classes of size \lceil \frac , V, \rceil and \lfloor \frac , V, \rfloor proves that the discrepancy is not larger than \lceil \frac , V, \rceil. It seems that the discrepancy reflects how chaotic the hyperedges of \mathcal intersect. Things are not that easy, however, as the following example shows. * Set n=4k, k \in \mathcal and \mathcal_n = ( \). In words, \mathcal_n is the hypergraph on 4''k'' vertices , whose edges are all subsets that have the same number of elements in as in . Now \mathcal_n has many (more than \binom^2 = \Theta(\frac 1 n 2^n)) complicatedly intersecting edges. However, its discrepancy is zero, since we can color in one color and in another color. The last example shows that we cannot expect to determine the discrepancy by looking at a single parameter like the number of hyperedges. Still, the size of the hypergraph yields first upper bounds.


Bounds on the discrepancy


General hypergraphs

1. For any hypergraph ''\mathcal '' with ''n'' vertices and ''m'' edges: *\operatorname(\mathcal) \leq \sqrt. The proof is a simple application of the probabilistic method: Let \chi:V \rightarrow \ be a random coloring, i.e. we have :\Pr(\chi(v) = -1) = \Pr(\chi(v) = 1) = \frac independently for all v \in V. Since \chi(E) = \sum_ \chi(v) is a sum of independent −1, 1 random variables. So we have \Pr(, \chi(E), >\lambda)<2 \exp(-\lambda^2/(2n)) for all E \subseteq V and \lambda \geq 0. Put \lambda = \sqrt for convenience. Then :\Pr(\operatorname(\mathcal,\chi)> \lambda) \leq \sum_ \Pr(, \chi(E), > \lambda) < 1. Since a random coloring with positive probability has discrepancy at most \lambda, in particular, there are colorings that have discrepancy at most \lambda. Hence \operatorname(\mathcal) \leq \lambda. \ \Box 2. For any hypergraph ''\mathcal ''with ''n'' vertices and ''m'' edges ''such that m \geq n:'' * ''\operatorname(\mathcal) \in O(\sqrt).'' To prove this, a much more sophisticated approach using the entropy function was necessary. Of course this is particularly interesting for m = O(n). In the case m=n, \operatorname(\mathcal) \leq 6 \sqrt can be shown for n large enough. Therefore, this result is usually known to as 'Six Standard Deviations Suffice'. It is considered to be one of the milestones of discrepancy theory. The entropy method has seen numerous other applications, e.g. in the proof of the tight upper bound for the arithmetic progressions of Matoušek and Spencer or the upper bound in terms of the primal shatter function due to Matoušek.


Hypergraphs of bounded degree

If each vertex of \mathcal is contained in at most ''t'' edges, then :\operatorname(\mathcal) < 2t. This result, the Beck–Fiala theorem, is due to Beck and Fiala. They bound the discrepancy by the maximum degree of \mathcal. It is a famous open problem whether this bound can be improved asymptotically (modified versions of the original proof give 2''t''−1 or even 2''t''−3). Beck and Fiala conjectured that \operatorname(\mathcal) = O(\sqrt t), but little progress has been made so far in this direction. Bednarchak and Helm and Helm improved the Beck-Fiala bound in tiny steps to \operatorname(\mathcal) \leq 2t - 3 (for a slightly restricted situation, i.e. t \geq 3 ). Bukh improved this in 2016 to 2t - \log^* t, where \log^* t denotes the
iterated logarithm In computer science, the iterated logarithm of n, written  n (usually read "log star"), is the number of times the logarithm function must be iteratively applied before the result is less than or equal to 1. The simplest formal definition i ...
. A corollary of Beck's paper – the first time the notion of discrepancy explicitly appeared – shows \operatorname(\mathcal) \leq C \sqrt \log n for some constant C. The latest improvement in this direction is due to Banaszczyk: \operatorname(\mathcal) = O(\sqrt).


Permutations hypergraphs

Suppose ''p''1, ...,''pm'' are permutations of 'n'''.'' Suppose \mathcal is the hypergraph on 'n''whose edges are all the intervals of every permutation. For example, if one of the permutations is (1,2,3,4), then the hypergraph \mathcal contains e.g. the edges (1,2), (1,2,3), (2,3), (2,3,4), etc. The discrepancy of \mathcal is the minimum over all red-blue colorings of the integers in 'n'' of the maximum over all intervals, of the difference between the number of red and blue integers in the interval. Then: * For any two permutations, \operatorname(\mathcal)= 2. * For any ''m'' permutations, \operatorname(\mathcal) \leq 8 m\log , and such a coloring can be computed efficiently. *For any three permutations, Beck conjectures that \operatorname(\mathcal)= \text. However, this conjecture was refuted: for any ''n'' which is a power of 3, there exist 3 permutations whose discrepancy is \lceil (\log_3)/3 + 1 \rceil. More precisely, for any coloring, if the sum of all colors is ''d'', then there exists some integer ''q'' such that, in all three permutations, the sum of the first ''q'' colors is at most (-\log_3 + 2 d - 2)/3. This has implications for the
bin packing problem The bin packing problem is an optimization problem, in which items of different sizes must be packed into a finite number of bins or containers, each of a fixed given capacity, in a way that minimizes the number of bins used. The problem has ma ...
.


Other classic theorems

* Axis-parallel rectangles in the plane (
Roth Roth may refer to: Places Germany * Roth (district), in Bavaria, Germany ** Roth, Bavaria, capital of that district ** Roth (electoral district), a federal electoral district * Rhineland-Palatinate, Germany: ** Roth an der Our, in the district B ...
, Schmidt) * Discrepancy of half-planes (Alexander, Matoušek) * Arithmetic progressions (Roth, Sárközy, Beck, Matoušek & Spencer) * Six Standard Deviations Suffice (Spencer)


Major open problems

* Axis-parallel rectangles in dimensions three and higher (Folklore) * Komlós Conjecture


Applications

* Numerical Integration: Monte Carlo methods in high dimensions. * Computational Geometry:
Divide and conquer algorithms In computer science, divide and conquer is an algorithm design paradigm. A divide-and-conquer algorithm recursively breaks down a problem into two or more sub-problems of the same or related type, until these become simple enough to be solved direc ...
. * Image Processing: Halftoning


Notes


References

* * * * {{DEFAULTSORT:Discrepancy Of Hypergraphs Diophantine approximation Unsolved problems in mathematics Discrepancy theory Hypergraphs