In
mathematics
Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...
, a
real number
In mathematics, a real number is a number that can be used to measure a continuous one- dimensional quantity such as a duration or temperature. Here, ''continuous'' means that pairs of values can have arbitrarily small differences. Every re ...
is said to be simply normal in an
integer
An integer is the number zero (0), a positive natural number (1, 2, 3, ...), or the negation of a positive natural number (−1, −2, −3, ...). The negations or additive inverses of the positive natural numbers are referred to as negative in ...
base b if its infinite sequence of
digits is distributed uniformly in the sense that each of the
b digit values has the same
natural density 1/
b. A number is said to be normal in base
b if, for every positive integer
n, all possible strings
n digits long have density
b−''n''.
Intuitively, a number being simply normal means that no digit occurs more frequently than any other. If a number is normal, no finite combination of digits of a given length occurs more frequently than any other combination of the same length. A normal number can be thought of as an infinite sequence of coin flips (
binary) or rolls of a die (
base 6). Even though there ''will'' be sequences such as 10, 100, or more consecutive tails (binary) or fives (base 6) or even 10, 100, or more repetitions of a sequence such as tail-head (two consecutive coin flips) or 6-1 (two consecutive rolls of a die), there will also be equally many of any other sequence of equal length. No digit or sequence is "favored".
A number is said to be normal (sometimes called absolutely normal) if it is normal in all integer bases greater than or equal to 2.
While a general proof can be given that
almost all
In mathematics, the term "almost all" means "all but a negligible quantity". More precisely, if X is a set (mathematics), set, "almost all elements of X" means "all elements of X but those in a negligible set, negligible subset of X". The meaning o ...
real numbers are normal (meaning that the
set of non-normal numbers has
Lebesgue measure
In measure theory, a branch of mathematics, the Lebesgue measure, named after French mathematician Henri Lebesgue, is the standard way of assigning a measure to subsets of higher dimensional Euclidean '-spaces. For lower dimensions or , it c ...
zero), this proof is not
constructive, and only a few specific numbers have been shown to be normal. For example, any
Chaitin's constant is normal (and
uncomputable). It is widely believed that the (computable) numbers
,
, and ''
e'' are normal, but a proof remains elusive.
Definitions
Let be a finite
alphabet
An alphabet is a standard set of letter (alphabet), letters written to represent particular sounds in a spoken language. Specifically, letters largely correspond to phonemes as the smallest sound segments that can distinguish one word from a ...
of -digits, the set of all infinite
sequence
In mathematics, a sequence is an enumerated collection of objects in which repetitions are allowed and order matters. Like a set, it contains members (also called ''elements'', or ''terms''). The number of elements (possibly infinite) is cal ...
s that may be drawn from that alphabet, and the set of finite sequences, or
strings. Let be such a sequence. For each in let denote the number of times the digit appears in the first digits of the sequence . We say that is simply normal if the
limit
for each . Now let be any finite string in and let be the number of times the string appears as a
substring in the first digits of the sequence . (For instance, if , then .) is normal if, for all finite strings ,
where denotes the length of the string . In other words, is normal if all strings of equal length occur with equal
asymptotic
In analytic geometry, an asymptote () of a curve is a line such that the distance between the curve and the line approaches zero as one or both of the ''x'' or ''y'' coordinates Limit of a function#Limits at infinity, tends to infinity. In pro ...
frequency. For example, in a normal binary sequence (a sequence over the alphabet ), and each occur with frequency ; , , , and each occur with frequency ; , , , , , , , and each occur with frequency ; etc. Roughly speaking, the
probability
Probability is a branch of mathematics and statistics concerning events and numerical descriptions of how likely they are to occur. The probability of an event is a number between 0 and 1; the larger the probability, the more likely an e ...
of finding the string in any given position in is precisely that expected if the sequence had been produced at
random.
Suppose now that is an
integer
An integer is the number zero (0), a positive natural number (1, 2, 3, ...), or the negation of a positive natural number (−1, −2, −3, ...). The negations or additive inverses of the positive natural numbers are referred to as negative in ...
greater than 1 and is a
real number
In mathematics, a real number is a number that can be used to measure a continuous one- dimensional quantity such as a duration or temperature. Here, ''continuous'' means that pairs of values can have arbitrarily small differences. Every re ...
. Consider the infinite digit sequence expansion of in the base
positional number system (we ignore the decimal point). We say that is simply normal in base if the sequence is simply normal and that is normal in base if the sequence is normal. The number is called a normal number (or sometimes an absolutely normal number) if it is normal in base for every integer greater than 1.
A given infinite sequence is either normal or not normal, whereas a real number, having a different base- expansion for each integer , may be normal in one base but not in another (in which case it is not a normal number). For bases and with
rational (so that and ) every number normal in base is normal in base . For bases and with irrational, there are uncountably many numbers normal in each base but not the other.
A
disjunctive sequence is a sequence in which every finite string appears. A normal sequence is disjunctive, but a disjunctive sequence need not be normal. A ''
rich number'' in base is one whose expansion in base is disjunctive: one that is disjunctive to every base is called ''absolutely disjunctive'' or is said to be a ''
lexicon
A lexicon (plural: lexicons, rarely lexica) is the vocabulary of a language or branch of knowledge (such as nautical or medical). In linguistics, a lexicon is a language's inventory of lexemes. The word ''lexicon'' derives from Greek word () ...
''. A number normal in base is rich in base , but not necessarily conversely. The real number is rich in base if and only if the set is
dense in the
unit interval.
[ denotes the fractional part of .]
We defined a number to be simply normal in base if each individual digit appears with frequency . For a given base , a number can be simply normal (but not normal or rich), rich (but not simply normal or normal), normal (and thus simply normal and rich), or none of these. A number is absolutely non-normal or absolutely abnormal if it is not simply normal in any base.
Properties and examples
The concept of a normal number was introduced by . Using the
Borel–Cantelli lemma, he proved that
almost all
In mathematics, the term "almost all" means "all but a negligible quantity". More precisely, if X is a set (mathematics), set, "almost all elements of X" means "all elements of X but those in a negligible set, negligible subset of X". The meaning o ...
real numbers are normal, establishing the existence of normal numbers. showed that it is possible to specify a particular such number. proved that there is a
computable absolutely normal number.
Although this construction does not directly give the digits of the numbers constructed, it shows that it is possible in principle to enumerate each digit of a particular normal number.
The set of non-normal numbers, despite being "large" in the sense of being
uncountable, is also a
null set (as its Lebesgue measure as a subset of the real numbers is zero, so it essentially takes up no space within the real numbers). Also, the non-normal numbers (as well as the normal numbers) are dense in the reals: the set of non-normal numbers between two distinct real numbers is non-empty since it contains
every rational number (in fact, it is uncountably infinite and even
comeagre). For instance, there are uncountably many numbers whose decimal expansions (in base 3 or higher) do not contain the digit 1, and none of those numbers are normal.
Champernowne's constant
obtained by concatenating the decimal representations of the
natural number
In mathematics, the natural numbers are the numbers 0, 1, 2, 3, and so on, possibly excluding 0. Some start counting with 0, defining the natural numbers as the non-negative integers , while others start with 1, defining them as the positive in ...
s in order, is normal in base 10. Likewise, the different variants of Champernowne's constant (done by performing the same concatenation in other bases) are normal in their respective bases (for example, the base-2 Champernowne constant is normal in base 2), but they have not been proven to be normal in other bases.
The
Copeland–Erdős constant
obtained by concatenating the
prime number
A prime number (or a prime) is a natural number greater than 1 that is not a Product (mathematics), product of two smaller natural numbers. A natural number greater than 1 that is not prime is called a composite number. For example, 5 is prime ...
s in base 10, is normal in base 10, as proved by . More generally, the latter authors proved that the real number represented in base ''b'' by the concatenation
where ''f''(''n'') is the ''n''
th prime expressed in base ''b'', is normal in base ''b''. proved that the number represented by the same expression, with ''f''(''n'') = ''n''
2,
obtained by concatenating the
square numbers in base 10, is normal in base 10. proved that the number represented by the same expression, with ''f'' being any non-constant polynomial whose values on the positive integers are positive integers, expressed in base 10, is normal in base 10.
proved that if ''f''(''x'') is any non-constant
polynomial
In mathematics, a polynomial is a Expression (mathematics), mathematical expression consisting of indeterminate (variable), indeterminates (also called variable (mathematics), variables) and coefficients, that involves only the operations of addit ...
with real coefficients such that ''f''(''x'') > 0 for all ''x'' > 0, then the real number represented by the concatenation
where
'f''(''n'')is the
integer part of ''f''(''n'') expressed in base ''b'', is normal in base ''b''. (This result includes as special cases all of the above-mentioned results of Champernowne, Besicovitch, and Davenport & Erdős.) The authors also show that the same result holds even more generally when ''f'' is any function of the form
where the αs and βs are real numbers with β > β
1 > β
2 > ... > β
''d'' ≥ 0, and ''f''(''x'') > 0 for all ''x'' > 0.
show an explicit
uncountably infinite class of ''b''-normal numbers by perturbing
Stoneham numbers.
It has been an elusive goal to prove the normality of numbers that are not artificially constructed. While
,
π,
ln(2), and
e are strongly conjectured to be normal, it is still not known whether they are normal or not. It has not even been proven that all digits actually occur infinitely many times in the decimal expansions of those constants (for example, in the case of π, the popular claim "every string of numbers eventually occurs in π" is not known to be true). It has also been conjectured that every
irrational algebraic number is absolutely normal (which would imply that is normal), and no counterexamples are known in any base. However, no irrational algebraic number has been proven to be normal in any base.
Non-normal numbers
No
rational number
In mathematics, a rational number is a number that can be expressed as the quotient or fraction of two integers, a numerator and a non-zero denominator . For example, is a rational number, as is every integer (for example,
The set of all ...
is normal in any base, since the digit sequences of rational numbers are
eventually periodic.
gives an example of an irrational number that is absolutely abnormal. Let
Then α is a Liouville number and is absolutely abnormal.
Properties
Additional properties of normal numbers include:
* Every non-zero real number is the product of two normal numbers. This follows from the general fact that every number is the product of two numbers from a set
if the complement of ''X'' has measure 0.
* If ''x'' is normal in base ''b'' and ''a'' ≠ 0 is a rational number, then
is also normal in base ''b''.
* If
is ''dense'' (for every
and for all sufficiently large ''n'',
) and
are the base-''b'' expansions of the elements of ''A'', then the number
, formed by concatenating the elements of ''A'', is normal in base ''b'' (Copeland and Erdős 1946). From this it follows that Champernowne's number is normal in base 10 (since the set of all positive integers is obviously dense) and that the Copeland–Erdős constant is normal in base 10 (since the
prime number theorem
In mathematics, the prime number theorem (PNT) describes the asymptotic analysis, asymptotic distribution of the prime numbers among the positive integers. It formalizes the intuitive idea that primes become less common as they become larger by p ...
implies that the set of primes is dense).
* A sequence is normal
if and only if
In logic and related fields such as mathematics and philosophy, "if and only if" (often shortened as "iff") is paraphrased by the biconditional, a logical connective between statements. The biconditional is true in two cases, where either bo ...
every ''block'' of equal length appears with equal frequency. (A block of length ''k'' is a substring of length ''k'' appearing at a position in the sequence that is a multiple of ''k'': e.g. the first length-''k'' block in ''S'' is ''S''
..''k'' the second length-''k'' block is ''S''
'k''+1..2''k'' etc.) This was implicit in the work of and made explicit in the work of .
* A number is normal in base ''b'' if and only if it is simply normal in base ''b
k'' for all
. This follows from the previous block characterization of normality: Since the ''n''
th block of length ''k'' in its base ''b'' expansion corresponds to the ''n''
th digit in its base ''b
k'' expansion, a number is simply normal in base ''b
k'' if and only if blocks of length ''k'' appear in its base ''b'' expansion with equal frequency.
* A number is normal if and only if it is simply normal in every base. This follows from the previous characterization of base ''b'' normality.
* A number is ''b''-normal if and only if there exists a set of positive integers