Dejean's theorem (formerly Dejean's conjecture) is a statement about repetitions in infinite
strings of symbols. It belongs to the field of
combinatorics on words
Combinatorics on words is a fairly new field of mathematics, branching from combinatorics, which focuses on the study of words and formal languages. The subject looks at letters or symbols, and the sequences they form. Combinatorics on words ...
; it was conjectured in 1972 by Françoise Dejean and proven in 2009 by Currie and Rampersad and, independently, by Rao.
Context
In the study of strings,
concatenation
In formal language theory and computer programming, string concatenation is the operation of joining character strings end-to-end. For example, the concatenation of "snow" and "ball" is "snowball". In certain formalizations of concatenati ...
is seen as analogous to multiplication of numbers. For instance, if
is any string,
then the concatenation
of two copies of
is called the square of
, and denoted
. This exponential notation may also be extended to fractional powers: if
has length
, and
is a non-negative rational number of the form
, then
denotes the string formed by the first
characters of the infinite repetition
.
A
square-free word is a string that does not contain any square as a substring. In particular, it avoids repeating the same symbol consecutively, repeating the same pair of symbols, etc.
Axel Thue
Axel Thue (; 19 February 1863 – 7 March 1922) was a Norwegian mathematician, known for his original work in diophantine approximation and combinatorics.
Work
Thue published his first important paper in 1909.
He stated in 1914 the so-called w ...
showed that there exists an infinite square-free word using a three-symbol alphabet, the sequence of differences between consecutive elements of the
Thue–Morse sequence
In mathematics, the Thue–Morse or Prouhet–Thue–Morse sequence is the binary sequence (an infinite sequence of 0s and 1s) that can be obtained by starting with 0 and successively appending the Boolean complement of the sequence obtained thus ...
. However, it is not possible for an infinite two-symbol word (or even a two-symbol word of length greater than three) to be square-free.
For alphabets of two symbols, however, there do exist infinite cube-free words,
words with no substring of the form
. One such example is the
Thue–Morse sequence
In mathematics, the Thue–Morse or Prouhet–Thue–Morse sequence is the binary sequence (an infinite sequence of 0s and 1s) that can be obtained by starting with 0 and successively appending the Boolean complement of the sequence obtained thus ...
itself; another is the
Kolakoski sequence
In mathematics, the Kolakoski sequence, sometimes also known as the Oldenburger–Kolakoski sequence, is an infinite sequence of symbols that is the sequence of run lengths in its own run-length encoding. It is named after the recreational math ...
. More strongly, the Thue–Morse sequence contains no substring that is a power strictly greater than two.
In 1972, Dejean investigated the problem of determining, for each possible alphabet size, the threshold between exponents
for which there exists an infinite
-power-free word, and the exponents for which no such word exists. The problem was solved for two-symbol alphabets by the Thue–Morse sequence, and Dejean solved it as well for three-symbol alphabets.
She conjectured a precise formula for the threshold exponent for every larger alphabet size; this formula is Dejean's conjecture, now a theorem.
Statement
Let
be the number of symbols in an alphabet.
For every
, define
, the ''repeat threshold'', to be the
infimum
In mathematics, the infimum (abbreviated inf; : infima) of a subset S of a partially ordered set P is the greatest element in P that is less than or equal to each element of S, if such an element exists. If the infimum of S exists, it is unique ...
of exponents
such that there exists an infinite
-power-free word on a
-symbol alphabet. Thus, for instance, the Thue–Morse sequence shows that
, and an argument based on the
Lovász local lemma
In probability theory, if a large number of events are all independent of one another and each has probability less than 1, then there is a positive (possibly small) probability that none of the events will occur. The Lovász local lemma allows a s ...
can be used to show that
is finite for all
.
Then Dejean's conjecture is that the repeat threshold can be calculated by the simple formula
:
except in two exceptional cases:
:
and
:
Progress and proof
Dejean herself proved the conjecture for
.
The case
was proven by Jean-Jacques Pansiot in 1984.
The next progress was by Moulin Ollagnier in 1992, who proved the conjecture for all alphabet sizes up to
.
This analysis was extended up to
in 2007 by Mohammad-Noori and Currie.
In the other direction, also in 2007, Arturo Carpi showed the conjecture to be true for large alphabets, with
. This reduced the problem to a finite number of remaining cases, which were solved in 2009 and published in 2011 by Currie and Rampersad and independently by Rao.
Dejean words
An infinite string that meets Dejean's formula (having no repetitions of exponent above the repetition threshold) is called a Dejean word.
Thus, for instance, the Thue–Morse sequence is a Dejean word.
References
{{reflist, refs=
[{{citation
, last = Carpi , first = Arturo
, doi = 10.1016/j.tcs.2007.06.001
, issue = 1–3
, journal = ]Theoretical Computer Science
Theoretical computer science is a subfield of computer science and mathematics that focuses on the Abstraction, abstract and mathematical foundations of computation.
It is difficult to circumscribe the theoretical areas precisely. The Associati ...
, mr = 2356248
, pages = 137–151
, title = On Dejean's conjecture over large alphabets
, volume = 385
, year = 2007, doi-access =
[{{citation
, last1 = Currie , first1 = James
, last2 = Rampersad , first2 = Narad
, doi = 10.1090/S0025-5718-2010-02407-X
, issue = 274
, journal = ]Mathematics of Computation
''Mathematics of Computation'' is a bimonthly mathematics journal focused on computational mathematics. It was established in 1943 as ''Mathematical Tables and Other Aids to Computation'', obtaining its current name in 1960. Articles older than f ...
, mr = 2772111
, pages = 1063–1070
, title = A proof of Dejean's conjecture
, volume = 80
, year = 2011, arxiv = 0905.1129
[{{citation
, last = Dejean , first = Françoise
, doi = 10.1016/0097-3165(72)90011-8
, journal = ]Journal of Combinatorial Theory
The ''Journal of Combinatorial Theory'', Series A and Series B, are mathematical journals specializing in combinatorics and related areas. They are published by Elsevier. ''Series A'' is concerned primarily with structures, designs, and applicati ...
, mr = 0300959
, pages = 90–99
, series = Series A
, title = Sur un théorème de Thue
, volume = 13
, year = 1972, doi-access =
[{{citation
, last1 = Mohammad-Noori , first1 = M.
, last2 = Currie , first2 = James D.
, doi = 10.1016/j.ejc.2005.11.005
, issue = 3
, journal = European Journal of Combinatorics
, mr = 2300768
, pages = 876–890
, title = Dejean's conjecture and Sturmian words
, volume = 28
, year = 2007, doi-access = free
]
[{{citation
, last = Moulin Ollagnier , first = Jean
, doi = 10.1016/0304-3975(92)90264-G
, issue = 2
, journal = ]Theoretical Computer Science
Theoretical computer science is a subfield of computer science and mathematics that focuses on the Abstraction, abstract and mathematical foundations of computation.
It is difficult to circumscribe the theoretical areas precisely. The Associati ...
, mr = 1156042
, pages = 187–205
, title = Proof of Dejean's conjecture for alphabets with 5, 6, 7, 8, 9, 10 and 11 letters
, volume = 95
, year = 1992, doi-access =
[{{citation
, last = Pansiot , first = Jean-Jacques
, issue = 3
, journal = ]Discrete Applied Mathematics
''Discrete Applied Mathematics'' is a peer-reviewed scientific journal covering algorithmic and applied areas of discrete mathematics. It is published by Elsevier and the editor-in-chief is Endre Boros (Rutgers University). The journal was split ...
, mr = 736893
, pages = 297–311
, title = À propos d'une conjecture de F. Dejean sur les répétitions dans les mots
, volume = 7
, year = 1984
, doi=10.1016/0166-218x(84)90006-4, doi-access = free
[{{citation
, last = Rao , first = Michaël
, doi = 10.1016/j.tcs.2010.06.020
, issue = 27
, journal = Theoretical Computer Science
, mr = 2830264
, pages = 3010–3018
, title = Last cases of Dejean's conjecture
, volume = 412
, year = 2011, doi-access =
]
[{{citation
, last1 = Rampersad , first1 = Narad
, last2 = Shallit , first2 = Jeffrey , author2-link = Jeffrey Shallit
, contribution = Repetitions in words
, mr = 3525483
, pages = 101–150
, publisher = Cambridge Univ. Press, Cambridge
, series = Encyclopedia Math. Appl.
, title = Combinatorics, words and symbolic dynamics
, volume = 159
, year = 2016]
Combinatorics on words
Theorems in combinatorics
Conjectures that have been proved