HOME

TheInfoList



OR:

In
mathematics Mathematics is an area of knowledge that includes the topics of numbers, formulas and related structures, shapes and the spaces in which they are contained, and quantities and their changes. These topics are represented in modern mathematics ...
, a Sturmian word (Sturmian sequence or billiard sequence), named after
Jacques Charles François Sturm Jacques Charles François Sturm (29 September 1803 – 15 December 1855) was a French mathematician. Life and work Sturm was born in Geneva (then part of France) in 1803. The family of his father, Jean-Henri Sturm, had emigrated from Strasbourg ...
, is a certain kind of infinitely long sequence of characters. Such a sequence can be generated by considering a game of
English billiards English billiards, called simply billiards in the United Kingdom and in many former British colonies, is a cue sport that combines the aspects of carom billiards and pool. Two (one white and one yellow) and a red are used. Each player or team us ...
on a square table. The struck ball will successively hit the vertical and horizontal edges labelled 0 and 1 generating a sequence of letters. This sequence is a Sturmian word.


Definition

Sturmian sequences can be defined strictly in terms of their combinatoric properties or geometrically as
cutting sequence In digital geometry, a cutting sequence is a sequence of symbols whose elements correspond to the individual grid lines crossed ("cut") as a curve crosses a square grid. Sturmian words are a special case of cutting sequences where the curves ar ...
s for lines of irrational slope or codings for
irrational rotation In the mathematical theory of dynamical systems, an irrational rotation is a map : T_\theta : ,1\rightarrow ,1\quad T_\theta(x) \triangleq x + \theta \mod 1 where ''θ'' is an irrational number. Under the identification of a circle wit ...
s. They are traditionally taken to be infinite sequences on the alphabet of the two symbols 0 and 1.


Combinatorial definitions


Sequences of low complexity

For an infinite sequence of symbols ''w'', let ''σ''(''n'') be the complexity function of ''w''; i.e., ''σ''(''n'') = the number of distinct contiguous subwords (factors) in ''w'' of length ''n''. Then ''w'' is Sturmian if ''σ''(''n'') = ''n'' + 1 for all ''n''.


Balanced sequences

A set ''X'' of binary strings is called ''balanced'' if the
Hamming weight The Hamming weight of a string is the number of symbols that are different from the zero-symbol of the alphabet used. It is thus equivalent to the Hamming distance from the all-zero string of the same length. For the most typical case, a string o ...
of elements of ''X'' takes at most two distinct values. That is, for any s\in X , ''s'', 1 = ''k'' or , ''s'', 1 = ''k where , ''s'', 1 is the number of 1s in ''s''. Let ''w'' be an infinite sequence of 0s and 1s and let \mathcal L_n(w) denote the set of all length-''n'' subwords of ''w''. The sequence ''w'' is Sturmian if \mathcal L_n(w) is balanced for all ''n'' and ''w'' is not eventually periodic.


Geometric definitions


Cutting sequence of irrational

Let ''w'' be an infinite sequence of 0s and 1s. The sequence ''w'' is Sturmian if for some x\in[0,1) and some irrational \theta\in(0,\infty), ''w'' is realized as the
cutting sequence In digital geometry, a cutting sequence is a sequence of symbols whose elements correspond to the individual grid lines crossed ("cut") as a curve crosses a square grid. Sturmian words are a special case of cutting sequences where the curves ar ...
of the line f(t)=\theta t+x.


Difference of Beatty sequences

Let ''w'' = (''w''''n'') be an infinite sequence of 0s and 1s. The sequence ''w'' is Sturmian if it is the difference of non-homogeneous
Beatty sequence In mathematics, a Beatty sequence (or homogeneous Beatty sequence) is the sequence of integers found by taking the floor of the positive multiples of a positive irrational number. Beatty sequences are named after Samuel Beatty, who wrote about th ...
s, that is, for some x\in[0,1) and some irrational \theta\in(0,1) :w_n = \lfloor n\theta + x\rfloor - \lfloor (n-1)\theta + x \rfloor for all n or :w_n = \lceil n\theta + x\rceil - \lceil (n-1)\theta + x \rceil for all n.


Coding of irrational rotation

For \theta\in [0,1), define T_\theta:[0,1)\to[0,1) by t\mapsto t+\theta\bmod 1. For x\in[0,1) define the ''θ''-coding of ''x'' to be the sequence (''x''''n'') where :x_n= \begin 1 & \text T_\theta^n(x)\in [0,\theta), \\ 0 & \text. \end Let ''w'' be an infinite sequence of 0s and 1s. The sequence ''w'' is Sturmian if for some x\in[0,1) and some irrational \theta\in(0,\infty), ''w'' is the ''θ''-coding of ''x''.


Discussion


Example

A famous example of a (standard) Sturmian word is the Fibonacci word; its slope is 1/\phi, where \phi is the golden ratio.


Balanced aperiodic sequences

A set ''S'' of finite binary words is ''balanced'' if for each ''n'' the subset ''S''''n'' of words of length ''n'' has the property that the
Hamming weight The Hamming weight of a string is the number of symbols that are different from the zero-symbol of the alphabet used. It is thus equivalent to the Hamming distance from the all-zero string of the same length. For the most typical case, a string o ...
of the words in ''S''''n'' takes at most two distinct values. A balanced sequence is one for which the set of factors is balanced. A balanced sequence has at most ''n''+1 distinct factors of length ''n''. An aperiodic sequence is one which does not consist of a finite sequence followed by a finite cycle. An aperiodic sequence has at least ''n'' + 1 distinct factors of length ''n''. A sequence is Sturmian if and only if it is balanced and aperiodic.


Slope and intercept

A
sequence In mathematics, a sequence is an enumerated collection of objects in which repetitions are allowed and order matters. Like a set, it contains members (also called ''elements'', or ''terms''). The number of elements (possibly infinite) is calle ...
(a_n)_ over is a Sturmian word if and only if there exist two
real number In mathematics, a real number is a number that can be used to measure a ''continuous'' one-dimensional quantity such as a distance, duration or temperature. Here, ''continuous'' means that values can have arbitrarily small variations. Every real ...
s, the ''slope'' \alpha and the ''intercept'' \rho, with \alpha
irrational Irrationality is cognition, thinking, talking, or acting without inclusion of rationality. It is more specifically described as an action or opinion given through inadequate use of reason, or through emotional distress or cognitive deficiency. T ...
, such that :a_n=\lfloor\alpha(n+1)+\rho\rfloor -\lfloor\alpha n+\rho\rfloor-\lfloor\alpha\rfloor for all n. Thus a Sturmian word provides a
discretization In applied mathematics, discretization is the process of transferring continuous functions, models, variables, and equations into discrete counterparts. This process is usually carried out as a first step toward making them suitable for numerical ...
of the straight line with slope \alpha and intercept ''ρ''. Without loss of generality, we can always assume 0<\alpha<1, because for any integer ''k'' we have : \lfloor(\alpha + k)(n + 1) + \rho\rfloor - \lfloor(\alpha+k)n + \rho\rfloor - \lfloor\alpha + k\rfloor = a_n. All the Sturmian words corresponding to the same slope \alpha have the same set of factors; the word c_\alpha corresponding to the intercept \rho=0 is the standard word or characteristic word of slope \alpha. Hence, if 0<\alpha<1, the characteristic word c_\alpha is the
first difference In mathematics, a recurrence relation is an equation according to which the nth term of a sequence of numbers is equal to some combination of the previous terms. Often, only k previous terms of the sequence appear in the equation, for a paramete ...
of the
Beatty sequence In mathematics, a Beatty sequence (or homogeneous Beatty sequence) is the sequence of integers found by taking the floor of the positive multiples of a positive irrational number. Beatty sequences are named after Samuel Beatty, who wrote about th ...
corresponding to the irrational number \alpha. The standard word c_\alpha is also the limit of a sequence of words (s_n)_ defined recursively as follows: Let ; d_1+1, d_2, \ldots, d_n, \ldots/math> be the
continued fraction In mathematics, a continued fraction is an expression (mathematics), expression obtained through an iterative process of representing a number as the sum of its integer part and the multiplicative inverse, reciprocal of another number, then writ ...
expansion of \alpha, and define * s_0=1 * s_1=0 * s_=s_n^s_\textn>0 where the product between words is just their
concatenation In formal language, formal language theory and computer programming, string concatenation is the operation of joining character string (computer science), character strings wikt:end-to-end, end-to-end. For example, the concatenation of "sno ...
. Every word in the sequence (s_n)_ is a
prefix A prefix is an affix which is placed before the Word stem, stem of a word. Adding it to the beginning of one word changes it into another word. For example, when the prefix ''un-'' is added to the word ''happy'', it creates the word ''unhappy'' ...
of the next ones, so that the sequence itself converges to an infinite word, which is c_\alpha. The infinite sequence of words (s_n)_ defined by the above recursion is called the standard sequence for the standard word c_\alpha, and the infinite sequence ''d'' = (''d''1, ''d''2, ''d''3, ...) of nonnegative integers, with ''d''1 ≥ 0 and ''d''''n'' > 0 (''n'' ≥ 2), is called its directive sequence. A Sturmian word ''w'' over is characteristic if and only if both 0''w'' and 1''w'' are Sturmian.


Frequencies

If ''s'' is an infinite sequence word and ''w'' is a finite word, let μ''N''(''w'') denote the number of occurrences of ''w'' as a factor in the prefix of ''s'' of length ''N'' + , ''w'',  − 1. If ''μ''''N''(''w'') has a limit as ''N''→∞, we call this the frequency of ''w'', denoted by ''μ''(''w''). For a Sturmian word ''s'', every finite factor has a frequency. The
three-gap theorem In mathematics, the three-gap theorem, three-distance theorem, or Steinhaus conjecture states that if one places points on a circle, at angles of , , , ... from the starting point, then there will be at most three distinct distances between pairs ...
implies that the factors of fixed length ''n'' have at most three distinct frequencies, and if there are three values then one is the sum of the other two.


Non-binary words

For words over an alphabet of size ''k'' greater than 2, we define a Sturmian word to be one with complexity function ''n'' + ''k'' − 1. They can be described in terms of cutting sequences for ''k''-dimensional space. An alternative definition is as words of minimal complexity subject to not being ultimately periodic.


Associated real numbers

A real number for which the digits with respect to some fixed base form a Sturmian word is a
transcendental number In mathematics, a transcendental number is a number that is not algebraic—that is, not the root of a non-zero polynomial of finite degree with rational coefficients. The best known transcendental numbers are and . Though only a few classes ...
.


Sturmian endomorphisms

An endomorphism of the
free monoid In abstract algebra, the free monoid on a set is the monoid whose elements are all the finite sequences (or strings) of zero or more elements from that set, with string concatenation as the monoid operation and with the unique sequence of zero eleme ...
''B'' on a 2-letter alphabet ''B'' is Sturmian if it maps every Sturmian word to a Sturmian word and locally Sturmian if it maps some Sturmian word to a Sturmian word. The Sturmian endomorphisms form a submonoid of the monoid of endomorphisms of ''B''. Define endomorphisms φ and ψ of ''B'', where ''B'' = , by φ(0) = 01, φ(1) = 0 and ψ(0) = 10, ψ(1) = 0. Then ''I'', φ and ψ are Sturmian, and the Sturmian endomorphisms of ''B'' are precisely those endomorphisms in the submonoid of the endomorphism monoid generated by . A morphism is Sturmian if and only if the image of the word 10010010100101 is a
balanced sequence In computer science, the complexity function of a ''word'' or '' string'' (a finite or infinite sequence of symbols from some alphabet) is the function that counts the number of distinct ''factors'' (substrings of consecutive symbols) of that strin ...
; that is, for each ''n'', the
Hamming weight The Hamming weight of a string is the number of symbols that are different from the zero-symbol of the alphabet used. It is thus equivalent to the Hamming distance from the all-zero string of the same length. For the most typical case, a string o ...
s of the subwords of length ''n'' take at most two distinct values.


History

Although the study of Sturmian words dates back to
Johann III Bernoulli Johann III Bernoulli (also known as Jean; 4 November 1744, Basel – 13 July 1807, Berlin), grandson of Johann Bernoulli and son of Johann II Bernoulli, was known around the world as a child prodigy. Biography He studied at Basel and at Neu ...
(1772), it was
Gustav A. Hedlund Gustav Arnold Hedlund (May 7, 1904 – March 15, 1993), an American mathematician, was one of the founders of symbolic dynamics, symbolic and topological dynamics. Biography Hedlund was born May 7, 1904, in Somerville, Massachusetts. He did his ...
and
Marston Morse Harold Calvin Marston Morse (March 24, 1892 – June 22, 1977) was an American mathematician best known for his work on the ''calculus of variations in the large'', a subject where he introduced the technique of differential topology now known a ...
in 1940 who coined the term ''Sturmian'' to refer to such sequences, in honor of the mathematician
Jacques Charles François Sturm Jacques Charles François Sturm (29 September 1803 – 15 December 1855) was a French mathematician. Life and work Sturm was born in Geneva (then part of France) in 1803. The family of his father, Jean-Henri Sturm, had emigrated from Strasbourg ...
due to the relation with the
Sturm comparison theorem Sturm (German for storm) may refer to: People * Sturm (surname), surname (includes a list) * Saint Sturm (died 779), 8th-century monk Food * Federweisser, known as ''Sturm'' in Austria, wine in the fermentation stage * Sturm Foods, an American d ...
.


See also

*
Cutting sequence In digital geometry, a cutting sequence is a sequence of symbols whose elements correspond to the individual grid lines crossed ("cut") as a curve crosses a square grid. Sturmian words are a special case of cutting sequences where the curves ar ...
*
Word (group theory) In group theory, a word is any written product of group elements and their inverses. For example, if ''x'', ''y'' and ''z'' are elements of a group ''G'', then ''xy'', ''z''−1''xzz'' and ''y''−1''zxx''−1''yz''−1 are words in the set . Two ...
*
Morphic word In mathematics and computer science, a morphic word or substitutive word is an infinite sequence of symbols which is constructed from a particular class of endomorphism of a free monoid. Every automatic sequence is morphic. Definition Let ''f'' ...
*
Lyndon word In mathematics, in the areas of combinatorics and computer science, a Lyndon word is a nonempty string that is strictly smaller in lexicographic order than all of its rotations. Lyndon words are named after mathematician Roger Lyndon, who investi ...


References


Further reading

* * {{cite book , last=Lothaire , first=M. , author-link=M. Lothaire , title=Algebraic combinatorics on words , others=With preface by Jean Berstel and Dominique Perrin , edition=Reprint of the 2002 hardback , series=Encyclopedia of Mathematics and Its Applications , volume=90, publisher=
Cambridge University Press Cambridge University Press is the university press of the University of Cambridge. Granted letters patent by Henry VIII of England, King Henry VIII in 1534, it is the oldest university press A university press is an academic publishing hou ...
, year=2011 , isbn=978-0-521-18071-9 , zbl=1221.68183 Combinatorics on words Sequences and series