In the

computer science Computer science is the study of computation, information, and automation. Computer science spans Theoretical computer science, theoretical disciplines (such as algorithms, theory of computation, and information theory) to Applied science, ...

subfield of algorithmic information theory, a Chaitin constant (Chaitin omega number) or halting probability is a

real number In mathematics, a real number is a number that can be used to measure a continuous one- dimensional quantity such as a duration or temperature. Here, ''continuous'' means that pairs of values can have arbitrarily small differences. Every re ...

that, informally speaking, represents the

probability Probability is a branch of mathematics and statistics concerning events and numerical descriptions of how likely they are to occur. The probability of an event is a number between 0 and 1; the larger the probability, the more likely an e ...

that a randomly constructed program will halt. These numbers are formed from a construction due to

Gregory Chaitin Gregory John Chaitin ( ; born 25 June 1947) is an Argentina, Argentine-United States, American mathematician and computer scientist. Beginning in the late 1960s, Chaitin made contributions to algorithmic information theory and metamathematics, ...

. Although there are infinitely many halting probabilities, one for each (universal, see below) method of encoding programs, it is common to use the letter to refer to them as if there were only one. Because depends on the program encoding used, it is sometimes called Chaitin's construction when not referring to any specific encoding. Each halting probability is a normal and transcendental real number that is not

computable Computability is the ability to solve a problem by an effective procedure. It is a key topic of the field of computability theory within mathematical logic and the theory of computation within computer science. The computability of a problem is cl ...

, which means that there is no

algorithm In mathematics and computer science, an algorithm () is a finite sequence of Rigour#Mathematics, mathematically rigorous instructions, typically used to solve a class of specific Computational problem, problems or to perform a computation. Algo ...

to compute its digits. Each halting probability is Martin-Löf random, meaning there is not even any algorithm which can reliably guess its digits.

Background

The definition of a halting probability relies on the existence of a prefix-free universal computable function. Such a function, intuitively, represents a program in a programming language with the property that no valid program can be obtained as a proper extension of another valid program. Suppose that is a

partial function In mathematics, a partial function from a set to a set is a function from a subset of (possibly the whole itself) to . The subset , that is, the '' domain'' of viewed as a function, is called the domain of definition or natural domain ...

that takes one argument, a finite binary string, and possibly returns a single binary string as output. The function is called

if there is a

Turing machine A Turing machine is a mathematical model of computation describing an abstract machine that manipulates symbols on a strip of tape according to a table of rules. Despite the model's simplicity, it is capable of implementing any computer algori ...

that computes it, in the sense that for any finite binary strings and , if and only if the Turing machine halts with on its tape when given the input . The function is called universal if for every computable function of a single variable there is a string such that for all , ; here represents the

concatenation In formal language theory and computer programming, string concatenation is the operation of joining character strings end-to-end. For example, the concatenation of "snow" and "ball" is "snowball". In certain formalizations of concatenati ...

of the two strings and . This means that can be used to simulate any computable function of one variable. Informally, represents a "script" for the computable function , and represents an "interpreter" that parses the script as a prefix of its input and then executes it on the remainder of input. The domain of is the set of all inputs on which it is defined. For that are universal, such a can generally be seen both as the concatenation of a program part and a data part, and as a single program for the function . The function is called prefix-free if there are no two elements , in its domain such that is a proper extension of . This can be rephrased as: the domain of is a

prefix-free code A prefix code is a type of code system distinguished by its possession of the prefix property, which requires that there is no whole code word in the system that is a prefix (initial segment) of any other code word in the system. It is trivially t ...

(instantaneous code) on the set of finite binary strings. A simple way to enforce prefix-free-ness is to use machines whose means of input is a binary stream from which bits can be read one at a time. There is no end-of-stream marker; the end of input is determined by when the universal machine decides to stop reading more bits, and the remaining bits are not considered part of the accepted string. Here, the difference between the two notions of program mentioned in the last paragraph becomes clear: one is easily recognized by some grammar, while the other requires arbitrary computation to recognize. The domain of any universal computable function is a

computably enumerable set In computability theory, a set ''S'' of natural numbers is called computably enumerable (c.e.), recursively enumerable (r.e.), semidecidable, partially decidable, listable, provable or Turing-recognizable if: *There is an algorithm such that the ...

but never a

computable set In computability theory, a set of natural numbers is computable (or decidable or recursive) if there is an algorithm that computes the membership of every natural number in a finite number of steps. A set is noncomputable (or undecidable) if it i ...

. The domain is always Turing equivalent to the

halting problem In computability theory (computer science), computability theory, the halting problem is the problem of determining, from a description of an arbitrary computer program and an input, whether the program will finish running, or continue to run for ...

Definition

Let be the domain of a prefix-free universal computable function . The constant is then defined as

\Omega_F = \sum_ 2^,

where denotes the length of a string . This is an

infinite sum In mathematics, a series is, roughly speaking, an addition of infinitely many terms, one after the other. The study of series is a major part of calculus and its generalization, mathematical analysis. Series are used in most areas of mathemati ...

which has one summand for every in the domain of . The requirement that the domain be prefix-free, together with Kraft's inequality, ensures that this sum converges to a

between 0 and 1. If is clear from context then may be denoted simply , although different prefix-free universal computable functions lead to different values of .

Relationship to the halting problem

Knowing the first bits of , one could calculate the

for all programs of a size up to . Let the program for which the halting problem is to be solved be bits long. In dovetailing fashion, all programs of all lengths are run, until enough have halted to jointly contribute enough probability to match these first bits. If the program has not halted yet, then it never will, since its contribution to the halting probability would affect the first bits. Thus, the halting problem would be solved for . Because many outstanding problems in number theory, such as

Goldbach's conjecture Goldbach's conjecture is one of the oldest and best-known list of unsolved problems in mathematics, unsolved problems in number theory and all of mathematics. It states that every even and odd numbers, even natural number greater than 2 is the ...

, are equivalent to solving the halting problem for special programs (which would basically search for counter-examples and halt if one is found), knowing enough bits of Chaitin's constant would also imply knowing the answer to these problems. But as the halting problem is not generally solvable, calculating any but the first few bits of Chaitin's constant is not possible for a universal language. This reduces hard problems to impossible ones, much like trying to build an oracle machine for the halting problem would be.

Interpretation as a probability

The Cantor space is the collection of all infinite sequences of 0s and 1s. A halting probability can be interpreted as the measure of a certain subset of Cantor space under the usual

probability measure In mathematics, a probability measure is a real-valued function defined on a set of events in a σ-algebra that satisfies Measure (mathematics), measure properties such as ''countable additivity''. The difference between a probability measure an ...

on Cantor space. It is from this interpretation that halting probabilities take their name. The probability measure on Cantor space, sometimes called the fair-coin measure, is defined so that for any binary string the set of sequences that begin with has measure . This implies that for each natural number , the set of sequences in Cantor space such that = 1 has measure , and the set of sequences whose th element is 0 also has measure . Let be a prefix-free universal computable function. The domain of consists of an infinite set of binary strings

P = \.

Each of these strings determines a subset of Cantor space; the set contains all sequences in cantor space that begin with . These sets are disjoint because is a prefix-free set. The sum

\sum_ 2^

represents the measure of the set

\bigcup_ S_i.

In this way, represents the probability that a randomly selected infinite sequence of 0s and 1s begins with a bit string (of some finite length) that is in the domain of . It is for this reason that is called a halting probability.

Properties

Each Chaitin constant has the following properties: * It is algorithmically random (also known as Martin-Löf random or 1-random). This means that the shortest program to output the first bits of must be of size at least . This is because, as in the Goldbach example, those bits enable us to find out exactly which programs halt among all those of length at most . * As a consequence, it is a

normal number In mathematics, a real number is said to be simply normal in an integer base b if its infinite sequence of digits is distributed uniformly in the sense that each of the b digit values has the same natural density 1/b. A number is said to ...

, which means that its digits are equidistributed as if they were generated by tossing a fair coin. * It is not a computable number; there is no computable function that enumerates its binary expansion, as discussed below. * The set of

rational number In mathematics, a rational number is a number that can be expressed as the quotient or fraction of two integers, a numerator and a non-zero denominator . For example, is a rational number, as is every integer (for example, The set of all ...

s such that is

computably enumerable In computability theory, a set ''S'' of natural numbers is called computably enumerable (c.e.), recursively enumerable (r.e.), semidecidable, partially decidable, listable, provable or Turing-recognizable if: *There is an algorithm such that the ...

; a real number with such a property is called a left-c.e. real number in

recursion theory Computability theory, also known as recursion theory, is a branch of mathematical logic, computer science, and the theory of computation that originated in the 1930s with the study of computable functions and Turing degrees. The field has since ex ...

. * The set of rational numbers such that is not computably enumerable. (Reason: every left-c.e. real with this property is computable, which is not.) * It is an arithmetical number. * It is Turing equivalent to the

and thus at level of the

arithmetical hierarchy In mathematical logic, the arithmetical hierarchy, arithmetic hierarchy or Kleene–Mostowski hierarchy (after mathematicians Stephen Cole Kleene and Andrzej Mostowski) classifies certain sets based on the complexity of formulas that define th ...

. Not every set that is Turing equivalent to the halting problem is a halting probability. A finer equivalence relation, Solovay equivalence, can be used to characterize the halting probabilities among the left-c.e. reals. One can show that a real number in is a Chaitin constant (i.e. the halting probability of some prefix-free universal computable function) if and only if it is left-c.e. and algorithmically random. is among the few definable algorithmically random numbers and is the best-known algorithmically random number, but it is not at all typical of all algorithmically random numbers.

Uncomputability

A real number is called computable if there is an algorithm which, given , returns the first digits of the number. This is equivalent to the existence of a program that enumerates the digits of the real number. No halting probability is computable. The proof of this fact relies on an algorithm which, given the first digits of , solves Turing's

for programs of length up to . Since the halting problem is undecidable, cannot be computed. The algorithm proceeds as follows. Given the first digits of and a , the algorithm enumerates the domain of until enough elements of the domain have been found so that the probability they represent is within of . After this point, no additional program of length can be in the domain, because each of these would add to the measure, which is impossible. Thus the set of strings of length in the domain is exactly the set of such strings already enumerated.

Algorithmic randomness

A real number is random if the binary sequence representing the real number is an algorithmically random sequence. Calude, Hertling, Khoussainov, and Wang showed that a recursively enumerable real number is an algorithmically random sequence if and only if it is a Chaitin's number.

Incompleteness theorem for halting probabilities

For each specific consistent effectively represented

axiomatic system In mathematics and logic, an axiomatic system is a set of formal statements (i.e. axioms) used to logically derive other statements such as lemmas or theorems. A proof within an axiom system is a sequence of deductive steps that establishes ...

for the

natural numbers In mathematics, the natural numbers are the numbers 0, 1, 2, 3, and so on, possibly excluding 0. Some start counting with 0, defining the natural numbers as the non-negative integers , while others start with 1, defining them as the positiv ...

, such as

Peano arithmetic In mathematical logic, the Peano axioms (, ), also known as the Dedekind–Peano axioms or the Peano postulates, are axioms for the natural numbers presented by the 19th-century Italian mathematician Giuseppe Peano. These axioms have been used nea ...

, there exists a constant such that no bit of after the th can be proven to be 1 or 0 within that system. The constant depends on how the

formal system A formal system is an abstract structure and formalization of an axiomatic system used for deducing, using rules of inference, theorems from axioms. In 1921, David Hilbert proposed to use formal systems as the foundation of knowledge in ma ...

is effectively represented, and thus does not directly reflect the complexity of the axiomatic system. This incompleteness result is similar to Gödel's incompleteness theorem in that it shows that no consistent formal theory for arithmetic can be complete.

Super Omega

The first bits of

's constant are random or incompressible in the sense that they cannot be computed by a halting algorithm with fewer than bits. However, consider the short but never halting algorithm which systematically lists and runs all possible programs; whenever one of them halts its probability gets added to the output (initialized by zero). After finite time the first bits of the output will never change any more (it does not matter that this time itself is not computable by a halting program). So there is a short non-halting algorithm whose output converges (after finite time) onto the first bits of . In other words, the

enumerable An enumeration is a complete, ordered listing of all the items in a collection. The term is commonly used in mathematics and computer science to refer to a listing of all of the element (mathematics), elements of a Set (mathematics), set. The pre ...

first bits of are highly compressible in the sense that they are limit-computable by a very short algorithm; they are not

random In common usage, randomness is the apparent or actual lack of definite pattern or predictability in information. A random sequence of events, symbols or steps often has no order and does not follow an intelligible pattern or combination. ...

with respect to the set of enumerating algorithms. Jürgen Schmidhuber constructed a limit-computable "Super " which in a sense is much more random than the original limit-computable , as one cannot significantly compress the Super by any enumerating non-halting algorithm. For an alternative "Super ", the universality probability of a prefix-free

universal Turing machine In computer science, a universal Turing machine (UTM) is a Turing machine capable of computing any computable sequence, as described by Alan Turing in his seminal paper "On Computable Numbers, with an Application to the Entscheidungsproblem". Co ...

(UTM) namely, the probability that it remains universal even when every input of it (as a binary string) is prefixed by a random binary string can be seen as the non-halting probability of a machine with oracle the third iteration of the

(i.e., using

Turing jump In computability theory, the Turing jump or Turing jump operator, named for Alan Turing, is an operation that assigns to each decision problem a successively harder decision problem with the property that is not decidable by an oracle machine ...

notation).

References

Works cited

* * *

External links

Aspects of Chaitin's Omega
Survey article discussing recent advances in the study of Chaitin's .

article based on one written by

which appeared in the August 2004 edition of ''Mathematics Today'', on the occasion of the 50th anniversary of Alan Turing's death.
''The Limits of Reason''
Gregory Chaitin, originally appeared in ''Scientific American'', March 2006.

and generalizations of algorithmic information, by Jürgen Schmidhuber {{Irrational number Algorithmic information theory Theory of computation Real transcendental numbers