Roth's theorem on arithmetic progressions is a result in

additive combinatorics Additive combinatorics is an area of combinatorics in mathematics. One major area of study in additive combinatorics are ''inverse problems'': given the size of the sumset is small, what can we say about the structures of and ? In the case of th ...

concerning the existence of

arithmetic progression An arithmetic progression or arithmetic sequence is a sequence of numbers such that the difference from any succeeding term to its preceding term remains constant throughout the sequence. The constant difference is called common difference of that ...

s in subsets of the

natural number In mathematics, the natural numbers are the numbers 0, 1, 2, 3, and so on, possibly excluding 0. Some start counting with 0, defining the natural numbers as the non-negative integers , while others start with 1, defining them as the positive in ...

s. It was first

proven Proven is a rural village in the Belgian province of West Flanders, and a "deelgemeente" of the municipality Poperinge. The village has about 1400 inhabitants. The church and parish A parish is a territorial entity in many Christianity, Chr ...

Klaus Roth Klaus Friedrich Roth (29 October 1925 – 10 November 2015) was a German-born British mathematician who won the Fields Medal for proving Roth's theorem on the Diophantine approximation of algebraic numbers. He was also a winner of the De ...

in 1953. Roth's theorem is a special case of

Szemerédi's theorem In arithmetic combinatorics, Szemerédi's theorem is a result concerning arithmetic progressions in subsets of the integers. In 1936, Erdős and Turán conjectured that every set of integers ''A'' with positive natural density contains a ''k''- ...

for the case

k = 3

Statement

A subset ''A'' of the natural numbers is said to have positive upper density if :

\limsup_\frac > 0

Roth's theorem on arithmetic progressions (infinite version): A subset of the natural numbers with positive upper density contains a arithmetic progression.

An alternate, more qualitative, formulation of the theorem is concerned with the maximum size of a

Salem–Spencer set In mathematics, and in particular in arithmetic combinatorics, a Salem-Spencer set is a set of numbers no three of which form an arithmetic progression. Salem–Spencer sets are also called 3-AP-free sequences or progression-free sets. They have ...

which is a subset of

= \

. Let

r_3(

be the size of the largest subset of

/math> which contains no  arithmetic progression.

Roth's theorem on arithmetic progressions (finitary version): $r_3(= o(N).$

Improving upper and lower bounds on

r_3(

is still an open research problem.

History

The first result in this direction was

Van der Waerden's theorem Van der Waerden's theorem is a theorem in the branch of mathematics called Ramsey theory. Van der Waerden's theorem states that for any given positive integers ''r'' and ''k'', there is some number ''N'' such that if the integers are colored, eac ...

in 1927, which states that for sufficiently large N, coloring the integers

1, \dots, n

with

r

colors will result in a

k

term arithmetic progression. Later on in 1936 Erdős and Turán

conjecture In mathematics, a conjecture is a conclusion or a proposition that is proffered on a tentative basis without proof. Some conjectures, such as the Riemann hypothesis or Fermat's conjecture (now a theorem, proven in 1995 by Andrew Wiles), ha ...

d a much stronger result that any subset of the integers with positive density contains arbitrarily long arithmetic progressions. In 1942,

Raphaël Salem Raphaël Salem (Greek: Ραφαέλ Σαλέμ; November 7, 1898 in Salonika, Ottoman Empire (now Thessaloniki, Greece) – June 20, 1963 in Paris, France) was a Greek mathematician after whom are named the Salem numbers and Salem–Spencer sets, ...

and Donald C. Spencer provided a construction of a 3-AP-free set (i.e. a set with no arithmetic progressions) of size

\frac

, disproving an additional conjecture of Erdős and Turán that

r_3(= N^

for some

\delta > 0

. In 1953, Roth partially resolved the initial conjecture by proving they must contain an arithmetic progression of length 3 using Fourier analytic methods. Eventually, in 1975, Szemerédi proved

using combinatorial techniques, resolving the original conjecture in full.

Proof techniques

The original proof given by Roth used Fourier analytic methods. Later on another proof was given using Szemerédi's regularity lemma.

Proof sketch via Fourier analysis

In 1953, Roth used

Fourier analysis In mathematics, Fourier analysis () is the study of the way general functions may be represented or approximated by sums of simpler trigonometric functions. Fourier analysis grew from the study of Fourier series, and is named after Joseph Fo ...

to prove an upper bound of

= O\left(\frac\right)

. Below is a sketch of this proof. Define the

Fourier transform In mathematics, the Fourier transform (FT) is an integral transform that takes a function as input then outputs another function that describes the extent to which various frequencies are present in the original function. The output of the tr ...

of a function

f : \mathbb \rightarrow \mathbb

to be the function

\widehat : [0, 1) \rightarrow \mathbb

satisfying :

\widehat(\theta) = \sum_f(x)e(-x\theta)

, where

e(t) = e^

. Let

A

be a 3-AP-free subset of

\

. The proof proceeds in 3 steps. # Show that a

A

admits a large Fourier coefficient. # Deduce that there exists a sub-progression of

\

such that

A

has a density increment when restricted to this subprogression. # Iterate Step 2 to obtain an upper bound on

, A,

Step 1

For functions,

f, g, h : \mathbb \rightarrow \mathbb,

define :

\Lambda(f, g, h) = \sum_ f(x)g(x + y)h(x + 2y)

Counting Lemma Let $f, g : \mathbb \rightarrow \mathbb$ satisfy $\sum_, f(n), ^2, \sum_, g(n), ^2 \le M$ . Define $\Lambda_3(f) = \Lambda(f,f,f)$ . Then $, \Lambda_3(f) - \Lambda_3(g), \le 3M\, \widehat\, _\infty$ .

The counting lemma tells us that if the Fourier Transforms of

f

and

g

are "close", then the number of arithmetic progressions between the two should also be "close." Let

\alpha = , A, /N

be the density of

A

. Define the functions

f = \mathbf_

(i.e the indicator function of

A

), and

g = \alpha \cdot \mathbf_

. Step 1 can then be deduced by applying the Counting Lemma to

f

and

g

, which tells us that there exists some

\theta

such that :

\left, \sum_^N (1_A - \alpha)(n)e(\theta n) \ \ge \fracN

Step 2

Given the

\theta

from step 1, we first show that it's possible to split up

/math> into relatively large subprogressions such that the character x \mapsto e(x\theta) is roughly constant on each subprogression.

Lemma 1: Let $0 < \eta < 1, \theta \in \mathbb$ . Assume that $N > C\eta^$ for a universal constant $C$ . Then it is possible to partition $/math> into arithmetic progressions P_i with length N^ \le , P_i, \le 2N^such that \sup_, e(x\theta) - e(y\theta), < \eta for all i .$

Next, we apply Lemma 1 to obtain a partition into subprogressions. We then use the fact that

\theta

produced a large coefficient in step 1 to show that one of these subprogressions must have a density increment:

Lemma 2: Let $A$ be a 3-AP-free subset of $/math>, with, A, = \alpha N and N > C\alpha^. Then, there exists a sub progression P \subset /math> such that, P, \ge N^and, A \cap P, \ge (\alpha + \alpha^2/40), P, .$

Step 3

We now iterate step 2. Let

a_t

be the density of

A

after the

t

th iteration. We have that

\alpha_0 = \alpha,

and

\alpha_ \ge \alpha + \alpha^2/40.

First, see that

\alpha

doubles (i.e. reach

T

such that

\alpha_T \ge 2\alpha_0

) after at most

40/\alpha + 1

steps. We double

\alpha

again (i.e reach

\alpha_T \ge 4\alpha_0

) after at most

20/\alpha + 1

steps. Since

\alpha \le 1

, this process must terminate after at most

O(1/\alpha)

steps. Let

N_t

be the size of our current progression after

t

iterations. By Lemma 2, we can always continue the process whenever

N_t \ge C\alpha_t^,

and thus when the process terminates we have that

N_t \le C\alpha_t^ \le C\alpha^.

Also, note that when we pass to a subprogression, the size of our set decreases by a cube root. Therefore :

N \le N_t^ \le (C\alpha^)^ = e^.

Therefore

\alpha = O(1/\log \log N),

, A,  = O \left(\frac\right),

as desired.

\blacksquare

Unfortunately, this technique does not generalize directly to larger arithmetic progressions to prove Szemerédi's theorem. An extension of this proof eluded mathematicians for decades until 1998, when Timothy Gowers developed the field of higher-order Fourier analysis specifically to generalize the above proof to prove Szemerédi's theorem.

Proof sketch via graph regularity

Below is an outline of a proof using the

Szemerédi regularity lemma In extremal graph theory, Szemerédi’s regularity lemma states that a graph can be partitioned into a bounded number of parts so that the edges between parts are regular (in the sense defined below). The lemma shows that certain properties of r ...

. Let

G

be a graph and

X,Y\subseteq V(G)

. We call

(X,Y)

\epsilon

-regular pair if for all

A\subset X,B\subset Y

with

, A, \geq\epsilon, X, ,, B, \geq\epsilon, Y,

, one has

, d(A,B)-d(X,Y), \leq\epsilon

. A partition

\mathcal=\

V(G)

is an

\epsilon

-regular partition if :

\sum_ , V_i, , V_j, \leq\epsilon, V(G), ^2

. Then the Szemerédi regularity lemma says that for every

\epsilon>0

, there exists a constant

M

such that every graph has an

\epsilon

-regular partition into at most

M

parts. We can also prove that triangles between

\epsilon

-regular sets of vertices must come along with many other triangles. This is known as the triangle counting lemma.

Triangle Counting Lemma: Let $G$ be a graph and $X, Y, Z$ be subsets of the vertices of $G$ such that $(X,Y), (Y,Z), (Z,X)$ are all $\epsilon$ -regular pairs for some $\epsilon > 0$ . Let $d_, d_, d_$ denote the edge densities $d(X,Y), d(X,Z), d(Y,Z)$ respectively. If $d_, d_, d_ \ge 2\epsilon$ , then the number of triples $(x,y,z)\in X\times Y\times Z$ such that $x,y,z$ form a triangle in $G$ is at least : $(1-2\epsilon)(d_ - \epsilon)(d_ - \epsilon)(d_ - \epsilon)\cdot , X, , Y, , Z,$ .

Using the triangle counting lemma and the Szemerédi regularity lemma, we can prove the triangle removal lemma, a special case of the

graph removal lemma In graph theory, the graph removal lemma states that when a graph contains few copies of a given Subgraph (graph theory), subgraph, then all of the copies can be eliminated by removing a small number of edges. The special case in which the subgraph ...

Triangle Removal Lemma: For all $\epsilon > 0$ , there exists $\delta > 0$ such that any graph on $n$ vertices with less than or equal to $\delta n^3$ triangles can be made triangle-free by removing at most $\epsilon n^2$ edges.

This has an interesting corollary pertaining to graphs

G

N

vertices where every edge of

G

lies in a unique triangle. In specific, all of these graphs must have

o(N^2)

edges. Take a set

A

with no arithmetic progressions. Now, construct a tripartite graph

G

whose parts

X, Y, Z

are all copies of

\mathbb/(2N+1)\mathbb

. Connect a vertex

x\in X

to a vertex

y\in Y

y-x\in A

. Similarly, connect

z\in Z

with

y\in Y

z-y\in A

. Finally, connect

x\in X

with

z\in Z

(z-x)/2\in A

. This construction is set up so that if

x,y,z

form a triangle, then we get elements

y-x, \frac, z-y

that all belong to

A

. These numbers form an arithmetic progression in the listed order. The assumption on

A

then tells us this progression must be trivial: the elements listed above are all equal. But this condition is equivalent to the assertion that

x,y,z

is an arithmetic progression in

\mathbb/(2N+1)\mathbb

. Consequently, every edge of

G

lies in exactly one triangle. The desired conclusion follows.

\blacksquare

Extensions and generalizations

Szemerédi's theorem resolved the original conjecture and generalized Roth's theorem to arithmetic progressions of arbitrary length. Since then it has been extended in multiple fashions to create new and interesting results. Furstenberg and Katznelson used

ergodic theory Ergodic theory is a branch of mathematics that studies statistical properties of deterministic dynamical systems; it is the study of ergodicity. In this context, "statistical properties" refers to properties which are expressed through the behav ...

to prove a multidimensional version and Leibman and Bergelson extended it to polynomial progressions as well. Most recently,

Green Green is the color between cyan and yellow on the visible spectrum. It is evoked by light which has a dominant wavelength of roughly 495570 nm. In subtractive color systems, used in painting and color printing, it is created by a com ...

and

Tao The Tao or Dao is the natural way of the universe, primarily as conceived in East Asian philosophy and religion. This seeing of life cannot be grasped as a concept. Rather, it is seen through actual living experience of one's everyday being. T ...

proved the

Green–Tao theorem In number theory, the Green–Tao theorem, proven by Ben Green and Terence Tao in 2004, states that the sequence of prime numbers contains arbitrarily long arithmetic progressions. In other words, for every natural number k, there exist arithme ...

which says that the prime numbers contain arbitrarily long arithmetic progressions. Since the prime numbers are a subset of density 0, they introduced a "relative" Szemerédi theorem which applies to subsets with density 0 that satisfy certain

pseudorandomness A pseudorandom sequence of numbers is one that appears to be statistically random, despite having been produced by a completely deterministic and repeatable process. Pseudorandom number generators are often used in computer programming, as tradi ...

conditions. Later on Conlon,

Fox Foxes are small-to-medium-sized omnivorous mammals belonging to several genera of the family Canidae. They have a flattened skull; upright, triangular ears; a pointed, slightly upturned snout; and a long, bushy tail ("brush"). Twelve species ...

, and Zhao strengthened this theorem by weakening the necessary pseudorandomness condition. In 2020, Bloom and SisaskThomas F. Bloom, Olof Sisask, ''Breaking the logarithmic barrier in Roth's theorem on arithmetic progressions'', arXiv:2007.03528, 2020 proved that any set

A

such that

\sum_ \frac

diverges must contain arithmetic progressions of length 3; this is the first non-trivial case of another

Erdős Erdős, Erdos, or Erdoes is a Hungarian surname. Paul Erdős (1913–1996), Hungarian mathematician Other people with the surname * Ágnes Erdős (1950–2021), Hungarian politician * Brad Erdos (born 1990), Canadian football player * Éva Erd� ...

postulating that any such set must in fact contain arbitrarily long arithmetic progressions.

Improving bounds

There has also been work done on improving the bound in Roth's theorem. The bound from the original proof of Roth's theorem showed that :

\leq c\cdot\frac

for some constant

c

. Over the years this bound has been continually lowered by Szemerédi, Heath-Brown, Bourgain, and Sanders. The current (July 2020) best bound is due to Bloom and Sisask who have shown the existence of an absolute constant c>0 such that :

\leq \frac.

In February 2023 a preprint (later published) by Kelley and Meka gave a new bound of:

\leq 2^ \cdot N

. Four days later, Bloom and Sisask published a preprint giving an exposition of the result (later published), simplifying the argument and yielding some additional applications. Several months later, Bloom and Sisask obtained a further improvement to

\leq \exp(-c(\log N)^)N

, and stated (without proof) that their techniques can be used to show

\leq \exp(-c(\log N)^)N

. There has also been work done on the other end, constructing the largest set with no arithmetic progressions. The best construction has barely been improved since 1946 when Behrend improved on the initial construction by Salem and Spencer and proved :

\geq N\exp(-c\sqrt)

. Due to no improvements in over 70 years, it is conjectured that Behrend's set is asymptotically very close in size to the largest possible set with no progressions. If correct, the Kelley-Meka bound will prove this conjecture.

Roth's theorem in finite fields

As a variation, we can consider the analogous problem over

finite field In mathematics, a finite field or Galois field (so-named in honor of Évariste Galois) is a field (mathematics), field that contains a finite number of Element (mathematics), elements. As with any field, a finite field is a Set (mathematics), s ...

s. Consider the finite field

\mathbb_3^n

, and let

r_3(\mathbb_3^n)

be the size of the largest subset of

\mathbb_3^n

which contains no arithmetic progression. This problem is actually equivalent to the

cap set In affine geometry, a cap set is a subset of the affine space \mathbb_3^n (the n-dimensional affine space over the three-element field) where no three elements sum to the zero vector. The cap set problem is the problem of finding the size of the ...

problem, which asks for the largest subset of

\mathbb_3^n

such that no 3 points lie on a line. The cap set problem can be seen as a generalization of the card game

Set Set, The Set, SET or SETS may refer to: Science, technology, and mathematics Mathematics *Set (mathematics), a collection of elements *Category of sets, the category whose objects and morphisms are sets and total functions, respectively Electro ...

. In 1982, Brown and Buhler were the first to show that

r_3(\mathbb_3^n) = o(3^n).

In 1995, Roy Mesuhlam used a similar technique to the Fourier-analytic proof of Roth's theorem to show that

r_3(\mathbb_3^n) = O\left(\frac\right).

This bound was improved to

O(3^n/n^)

in 2012 by Bateman and Katz. In 2016, Ernie Croot, Vsevolod Lev, Péter Pál Pach,

Jordan Ellenberg Jordan Stuart Ellenberg (born October 30, 1971) is an American mathematician who is a professor of mathematics at the University of Wisconsin–Madison. His research involves arithmetic geometry. He is also an author of both fiction and non-fict ...

and Dion Gijswijt developed a new technique based on the polynomial method to prove that

r_3(\mathbb_3^n) = O(2.756^n)

. The best known lower bound is

2.2202^

, discovered in December 2023 by Google DeepMind researchers using a

large language model A large language model (LLM) is a language model trained with self-supervised machine learning on a vast amount of text, designed for natural language processing tasks, especially language generation. The largest and most capable LLMs are g ...

(LLM).

Roth's theorem with popular differences

Another generalization of Roth's theorem shows that for positive density subsets, there not only exists a arithmetic progression, but that there exist many 3-APs all with the same common difference.

Roth's theorem with popular differences: For all $\epsilon > 0$ , there exists some $n_0 = n_0(\epsilon)$ such that for every $n > n_0$ and $A \subset \mathbb_3^n$ with $, A, = \alpha3^n,$ there exists some $y \neq 0$ such that $, \, \ge (\alpha^3 - \epsilon)3^n.$

A

is chosen randomly from

\mathbb_3^n,

then we would expect there to be

\alpha^33^n

progressions for each value of

y

. The popular differences theorem thus states that for each

, A,

with positive density, there is some

y

such that the number of 3-APs with common difference

y

is close to what we would expect. This theorem was first proven by Green in 2005, who gave a bound of

n_0 = \text((1/\epsilon)^),

where

\text

is the tower function. In 2019, Fox and Pham recently improved the bound to

n_0 = \text(O(\log\frac)).

A corresponding statement is also true in

\mathbb

for both 3-APs and 4-APs. However, the claim has been shown to be false for 5-APs.

References

{{reflist, colwidth=30em

External links

* Edmonds, Chelsea; Koutsoukou-Argyraki, Angeliki; Paulson, Lawrence C.br>Roth's Theorem on Arithmetic Progressions (Formal proof development in Isabelle/HOL, Archive of Formal Proofs)
Theorems in number theory