computability theory Computability theory, also known as recursion theory, is a branch of mathematical logic, computer science, and the theory of computation that originated in the 1930s with the study of computable functions and Turing degrees. The field has since ex ...

, the Ackermann function, named after

Wilhelm Ackermann Wilhelm Friedrich Ackermann (; ; 29 March 1896 – 24 December 1962) was a German mathematician and logician best known for his work in mathematical logic and the Ackermann function, an important example in the theory of computation. Biograph ...

, is one of the simplest and earliest-discovered examples of a total

computable function Computable functions are the basic objects of study in computability theory. Informally, a function is ''computable'' if there is an algorithm that computes the value of the function for every value of its argument. Because of the lack of a precis ...

that is not primitive recursive. All primitive recursive functions are total and computable, but the Ackermann function illustrates that not all total computable functions are primitive recursive. After Ackermann's publication of his function (which had three non-negative integer arguments), many authors modified it to suit various purposes, so that today "the Ackermann function" may refer to any of numerous variants of the original function. One common version is the two-argument Ackermann–Péter function developed by

Rózsa Péter Rózsa Péter, until January 1934 Rózsa Politzer, (17 February 1905 – 16 February 1977) was a Hungarian mathematician and logician. She is best known as the "founding mother of recursion theory". Early life and education Péter was bor ...

and Raphael Robinson. This function is defined from the

recurrence relation In mathematics, a recurrence relation is an equation according to which the nth term of a sequence of numbers is equal to some combination of the previous terms. Often, only k previous terms of the sequence appear in the equation, for a parameter ...

\operatorname(m+1, n+1) = \operatorname(m, \operatorname(m+1, n))

with appropriate base cases. Its value grows very rapidly; for example,

\operatorname(4, 2)

results in

2^ - 3

, an integer with 19,729 decimal digits.

History

In the late 1920s, the mathematicians Gabriel Sudan and

, students of

David Hilbert David Hilbert (; ; 23 January 1862 – 14 February 1943) was a German mathematician and philosopher of mathematics and one of the most influential mathematicians of his time. Hilbert discovered and developed a broad range of fundamental idea ...

, were studying the foundations of computation. Both Sudan and Ackermann are credited with discovering total

s (termed simply "recursive" in some references) that are not primitive recursive. Sudan published the lesser-known Sudan function, then shortly afterwards and independently, in 1928, Ackermann published his function

\varphi

(from Greek, the letter ''

phi Phi ( ; uppercase Φ, lowercase φ or ϕ; ''pheî'' ; Modern Greek: ''fi'' ) is the twenty-first letter of the Greek alphabet. In Archaic and Classical Greek (c. 9th to 4th century BC), it represented an aspirated voiceless bilabial plos ...

''). Ackermann's three-argument function,

\varphi(m, n, p)

, is defined such that for

p=0,1,2

, it reproduces the basic operations of

addition Addition (usually signified by the Plus and minus signs#Plus sign, plus symbol, +) is one of the four basic Operation (mathematics), operations of arithmetic, the other three being subtraction, multiplication, and Division (mathematics), divis ...

multiplication Multiplication is one of the four elementary mathematical operations of arithmetic, with the other ones being addition, subtraction, and division (mathematics), division. The result of a multiplication operation is called a ''Product (mathem ...

, and

exponentiation In mathematics, exponentiation, denoted , is an operation (mathematics), operation involving two numbers: the ''base'', , and the ''exponent'' or ''power'', . When is a positive integer, exponentiation corresponds to repeated multiplication ...

\begin
\varphi(m, n, 0) &= m+n \\
\varphi(m, n, 1) &= m\times n \\
\varphi(m, n, 2) &= m^n 
\end

and for

p > 2

it extends these basic operations in a way that can be compared to the

hyperoperation In mathematics, the hyperoperation sequence is an infinite sequence of arithmetic operations (called ''hyperoperations'' in this context) that starts with a unary operation (the successor function with ''n'' = 0). The sequence continues with th ...

n+1) && \text p > 3 \end

(Aside from its historic role as a total-computable-but-not-primitive-recursive function, Ackermann's original function is seen to extend the basic arithmetic operations beyond exponentiation, although not as seamlessly as do variants of Ackermann's function that are specifically designed for that purpose—such as Goodstein's

sequence.) In ''On the Infinite'', David Hilbert hypothesized that the Ackermann function was not primitive recursive, but it was Ackermann, Hilbert's personal secretary and former student, who actually proved the hypothesis in his paper ''On Hilbert's Construction of the Real Numbers''. Rózsa Péter and Raphael Robinson later developed a two-variable version of the Ackermann function that became preferred by almost all authors. The generalized hyperoperation sequence, e.g.

G(m, a, b) = a

, is a version of the Ackermann function as well. In 1963 R.C. Buck based an intuitive two-variable with parameter order reversed variant

\operatorname

on the hyperoperation sequence:

\operatorname(m,n) = 2 .

Compared to most other versions, Buck's function has no unessential offsets:

= 2^ \\ &\quad\vdots \end

Many other versions of Ackermann function have been investigated.

Definition

Definition: as m-ary function

Ackermann's original three-argument function

\varphi(m, n, p)

is defined recursively as follows for nonnegative integers

m,n,

and

p

\begin
\varphi(m, n, 0) &= m + n \\
\varphi(m, 0, 1) &= 0 \\
\varphi(m, 0, 2) &= 1 \\
\varphi(m, 0, p) &= m && \text p > 2 \\
\varphi(m, n, p) &= \varphi(m, \varphi(m, n-1, p), p - 1) && \text n, p > 0
\end

Of the various two-argument versions, the one developed by Péter and Robinson (called "the" Ackermann function by most authors) is defined for nonnegative integers

m

and

n

as follows:

\begin
\operatorname(0, n) & = & n + 1 \\
\operatorname(m+1, 0) & = & \operatorname(m, 1) \\
\operatorname(m+1, n+1) & = & \operatorname(m, \operatorname(m+1, n))
\end

The Ackermann function has also been expressed in relation to the hyperoperation sequence:

n+3)-3 & m>0 \\ \end

or, written in

Knuth's up-arrow notation In mathematics, Knuth's up-arrow notation is a method of notation for very large integers, introduced by Donald Knuth in 1976. In his 1947 paper, R. L. Goodstein introduced the specific sequence of operations that are now called ''hyperoperatio ...

(extended to integer indices

\geq -2

A(m,n) = \begin
 n+1 & m=0 \\
 2\uparrow^ (n+3) - 3 & m>0 \\
 \end

or, equivalently, in terms of Buck's function F:

A(m,n) = \begin
 n+1 & m=0 \\
 F(m,n+3) - 3 & m>0 \\
\end

Definition: as iterated 1-ary function

Define

f^

as the ''n''-th iterate of

f

\begin
f^(x) & = & x \\
f^(x) & = & f(f^(x))
\end

Iteration Iteration is the repetition of a process in order to generate a (possibly unbounded) sequence of outcomes. Each repetition of the process is a single iteration, and the outcome of each iteration is then the starting point of the next iteration. ...

is the process of composing a function with itself a certain number of times.

Function composition In mathematics, the composition operator \circ takes two function (mathematics), functions, f and g, and returns a new function h(x) := (g \circ f) (x) = g(f(x)). Thus, the function is function application, applied after applying to . (g \c ...

is an

associative In mathematics, the associative property is a property of some binary operations that rearranging the parentheses in an expression will not change the result. In propositional logic, associativity is a valid rule of replacement for express ...

operation, so

f(f^(x)) = f^(f(x))

. Conceiving the Ackermann function as a sequence of unary functions, one can set

\operatorname_(n) = \operatorname(m,n)

. The function then becomes a sequence

\operatorname_0, \operatorname_1, \operatorname_2, ...

of unary' curried' functions, defined from

iteration Iteration is the repetition of a process in order to generate a (possibly unbounded) sequence of outcomes. Each repetition of the process is a single iteration, and the outcome of each iteration is then the starting point of the next iteration. ...

\begin
\operatorname_(n) & = & n+1 \\
\operatorname_(n) & = & \operatorname_^(1) \\
\end

Computation

The recursive definition of the Ackermann function can naturally be transposed to a term rewriting system (TRS).

TRS, based on 2-ary function

The definition of the 2-ary Ackermann function leads to the obvious reduction rules

\begin
\text & A(0,n)       & \rightarrow & S(n) \\
\text & A(S(m),0)    & \rightarrow & A(m,S(0)) \\
\text & A(S(m),S(n)) & \rightarrow & A(m,A(S(m),n))
\end

Example Compute

A(1,2) \rightarrow_ 4

The reduction sequence is In each ''step'' the underlined ''redex'' is rewritten. To compute

\operatorname(m, n)

one can use a

stack Stack may refer to: Places * Stack Island, an island game reserve in Bass Strait, south-eastern Australia, in Tasmania’s Hunter Island Group * Blue Stack Mountains, in Co. Donegal, Ireland People * Stack (surname) (including a list of people ...

, which initially contains the elements

\langle m,n \rangle

. Then repeatedly the two top elements are replaced according to the ruleshere: leftmost-innermost strategy!

\begin
\text & 0     &,& n     & \rightarrow & (n+1) \\
\text & (m+1) &,& 0     & \rightarrow & m &,& 1 \\
\text & (m+1) &,& (n+1) & \rightarrow & m &,& (m+1) &,& n
\end

Schematically, starting from

\langle m,n \rangle

: WHILE stackLength <> 1 The

pseudocode In computer science, pseudocode is a description of the steps in an algorithm using a mix of conventions of programming languages (like assignment operator, conditional operator, loop) with informal, usually self-explanatory, notation of actio ...

is published in . For example, on input

\langle 2,1 \rangle

, Remarks *The leftmost-innermost strategy is implemented in 225 computer languages on

Rosetta Code Rosetta Code is a wiki-based programming chrestomathy website with implementations of common algorithms and solutions to various computer programming, programming problems in many different programming languages. It is named for the Rosetta Stone ...

. *For all

m,n

the computation of

A(m,n)

takes no more than

(A(m,n) + 1)^m

steps. * pointed out that in the computation of

\operatorname(m,n)

the maximum length of the stack is

\operatorname(m,n)

, as long as

m>0

Their own algorithm, inherently iterative, computes $\operatorname(m,n)$ within $\mathcal(m \operatorname(m,n))$ time and within $\mathcal(m)$ space.

TRS, based on iterated 1-ary function

The definition of the iterated 1-ary Ackermann functions leads to different reduction rules

\begin
\text & A(S(0),0,n)    & \rightarrow & S(n) \\
\text & A(S(0),S(m),n) & \rightarrow & A(S(n),m,S(0)) \\
\text & A(S(S(x)),m,n) & \rightarrow & A(S(0),m,A(S(x),m,n))
\end

As function composition is associative, instead of rule r6 one can define

\begin
\text & A(S(S(x)),m,n) & \rightarrow & A(S(x),m,A(S(0),m,n))
\end

Like in the previous section the computation of

\operatorname^1_m(n)

can be implemented with a stack. Initially the stack contains the three elements

\langle 1,m,n \rangle

. Then repeatedly the three top elements are replaced according to the rules

\begin
\text & 1   &, 0   &, n & \rightarrow & (n+1) \\
\text & 1   &, (m+1) &, n & \rightarrow & (n+1) &, m &, 1 \\
\text & (x+2) &, m   &, n & \rightarrow & 1 &, m &, (x+1) &, m &, n \\
\end

Schematically, starting from

\langle 1, m,n \rangle

: WHILE stackLength <> 1 Example On input

\langle 1,2,1 \rangle

the successive stack configurations are

\begin
& \underline
  \rightarrow_ \underline 
  \rightarrow_ 1,1,\underline
  \rightarrow_ 1,1,\underline
  \rightarrow_ 1,1,1,0,\underline \\
& \rightarrow_ 1,1,\underline
  \rightarrow_ \underline
  \rightarrow_ \underline
  \rightarrow_ 1,0,\underline
  \rightarrow_ 1,0,1,0,\underline \\
& \rightarrow_ 1,0,1,0,1,0,\underline
  \rightarrow_ 1,0,1,0,\underline
  \rightarrow_ 1,0,\underline
  \rightarrow_ \underline
  \rightarrow_ 5
\end

The corresponding equalities are

\begin
& A_2(1)
  = A^2_1(1)
  = A_1(A_1(1))
  = A_1(A^2_0(1))
  = A_1(A_0(A_0(1))) \\
& = A_1(A_0(2))
  = A_1(3)
  = A^4_0(1)
  = A_0(A^3_0(1))
  = A_0(A_0(A^2_0(1))) \\
& = A_0(A_0(A_0(A_0(1))))
  = A_0(A_0(A_0(2)))
  = A_0(A_0(3))
  = A_0(4)
  = 5
\end

When reduction rule r7 is used instead of rule r6, the replacements in the stack will follow

\begin
\text & (x+2) &, m   &, n & \rightarrow & (x+1) &, m &, 1 &, m &, n
\end

The successive stack configurations will then be

\begin
& \underline
  \rightarrow_ \underline 
  \rightarrow_ 1,1,\underline
  \rightarrow_ 1,1,\underline
  \rightarrow_ 1,1,1,0,\underline \\
& \rightarrow_ 1,1,\underline
  \rightarrow_ \underline
  \rightarrow_ \underline
  \rightarrow_ 3,0,\underline
  \rightarrow_ \underline \\
& \rightarrow_ 2,0,\underline
  \rightarrow_ \underline
  \rightarrow_ 1,0,\underline
  \rightarrow_ \underline
  \rightarrow_ 5
\end

The corresponding equalities are

\begin
& A_2(1)
  = A^2_1(1)
  = A_1(A_1(1))
  = A_1(A^2_0(1))
  = A_1(A_0(A_0(1))) \\
& = A_1(A_0(2)) = A_1(3) 
  = A^4_0(1)
  = A^3_0(A_0(1))
  = A^3_0(2) \\
& = A^2_0(A_0(2))
  = A^2_0(3)
  = A_0(A_0(3))
  = A_0(4) = 5
\end

Remarks *On any given input the TRSs presented so far converge in the same number of steps. They also use the same reduction rules (in this comparison the rules r1, r2, r3 are considered "the same as" the rules r4, r5, r6/r7 respectively). For example, the reduction of

A(2,1)

converges in 14 steps: 6 × r1, 3 × r2, 5 × r3. The reduction of

A_2(1)

converges in the same 14 steps: 6 × r4, 3 × r5, 5 × r6/r7. The TRSs differ in the order in which the reduction rules are applied. *When

A_(n)

is computed following the rules , the maximum length of the stack stays below

2 \times A(i,n)

. When reduction rule r7 is used instead of rule r6, the maximum length of the stack is only

2(i+2)

. The length of the stack reflects the recursion depth. As the reduction according to the rules involves a smaller maximum depth of recursion,The maximum depth of recursion refers to the number of levels of activation of a procedure which exist during the deepest call of the procedure. this computation is more efficient in that respect.

TRS, based on hyperoperators

As — or — showed explicitly, the Ackermann function can be expressed in terms of the

sequence:

n+3) - 3 & m>0 \\ \end

or, after removal of the constant 2 from the parameter list, in terms of Buck's function

A(m,n) = \begin
 n+1 & m=0 \\
 F(m,n+3) - 3 & m>0 \\
\end

Buck's function

\operatorname(m,n) = 2

, a variant of Ackermann function by itself, can be computed with the following reduction rules:

\begin
\text & F(S(0),0,n)          & \rightarrow & S(n) \\
\text & F(S(0),S(0),0)       & \rightarrow & S(S(0)) \\
\text & F(S(0),S(S(0)),0)    & \rightarrow & 0 \\
\text & F(S(0),S(S(S(m))),0) & \rightarrow & S(0) \\
\text & F(S(0),S(m),S(n))    & \rightarrow & F(S(n),m,F(S(0),S(m),0)) \\
\text & F(S(S(x)),m,n)       & \rightarrow & F(S(0),m,F(S(x),m,n))
\end

Instead of rule b6 one can define the rule

\begin
\text & F(S(S(x)),m,n)       & \rightarrow & F(S(x),m,F(S(0),m,n))
\end

To compute the Ackermann function it suffices to add three reduction rules

\begin
\text  & A(0,n)	     & \rightarrow & S(n) \\
\text  & A(S(m),n)     & \rightarrow & P(F(S(0),S(m),S(S(S(n))))) \\
\text & P(S(S(S(m)))) & \rightarrow & m \\
\end

These rules take care of the base case A(0,n), the alignment (n+3) and the fudge (-3). Example Compute

A(2,1) \rightarrow_ 5

The matching equalities are *when the TRS with the reduction rule

\text

is applied:

\begin
& A(2,1) +3
= F(2,4)
= \dots
= F^6(0,2)
= F(0,F^5(0,2))
= F(0,F(0,F^4(0,2))) \\
& = F(0,F(0,F(0,F^3(0,2))))
= F(0,F(0,F(0,F(0,F^2(0,2)))))
= F(0,F(0,F(0,F(0,F(0,F(0,2)))))) \\
& = F(0,F(0,F(0,F(0,F(0,3)))))
= F(0,F(0,F(0,F(0,4))))
= F(0,F(0,F(0,5)))
= F(0,F(0,6))
= F(0,7)
= 8
\end

*when the TRS with the reduction rule

\text

is applied:

\begin
& A(2,1) +3
= F(2,4)
= \dots
= F^6(0,2)
= F^5(0,F(0,2))
= F^5(0,3)
= F^4(0,F(0,3))
= F^4(0,4) \\
& = F^3(0,F(0,4))
= F^3(0,5)
= F^2(0,F(0,5))
= F^2(0,6)
= F(0,F(0,6))
= F(0,7)
= 8
\end

Remarks *The computation of

\operatorname_(n)

according to the rules is deeply recursive. The maximum depth of nested

F

s is

A(i,n)+1

. The culprit is the order in which iteration is executed:

F^(x) = F(F^(x))

. The first

F

disappears only after the whole sequence is unfolded. *The computation according to the rules is more efficient in that respect. The iteration

F^(x) = F^(F(x))

simulates the repeated loop over a block of code.LOOP n+1 TIMES DO F The nesting is limited to

(i+1)

, one recursion level per iterated function. showed this correspondence. *These considerations concern the recursion depth only. Either way of iterating leads to the same number of reduction steps, involving the same rules (when the rules b6 and b7 are considered "the same"). The reduction of

A(2,1)

for instance converges in 35 steps: 12 × b1, 4 × b2, 1 × b3, 4 × b5, 12 × b6/b7, 1 × r9, 1 × r10. The ''modus iterandi'' only affects the order in which the reduction rules are applied. *A real gain of execution time can only be achieved by not recalculating subresults over and over again.

Memoization In computing, memoization or memoisation is an optimization technique used primarily to speed up computer programs by storing the results of expensive function calls to pure functions and returning the cached result when the same inputs occur ag ...

is an optimization technique where the results of function calls are cached and returned when the same inputs occur again. See for instance . published a cunning algorithm which computes

A(i,n)

within

\mathcal(i A(i,n))

time and within

\mathcal(i)

space.

Huge numbers

To demonstrate how the computation of

A(4, 3)

results in many steps and in a large number:

\begin
A(4, 3) & \rightarrow A(3, A(4, 2)) \\
& \rightarrow A(3, A(3, A(4, 1))) \\
& \rightarrow A(3, A(3, A(3, A(4, 0)))) \\
& \rightarrow A(3, A(3, A(3, A(3, 1)))) \\
& \rightarrow A(3, A(3, A(3, A(2, A(3, 0))))) \\
& \rightarrow A(3, A(3, A(3, A(2, A(2, 1))))) \\
& \rightarrow A(3, A(3, A(3, A(2, A(1, A(2, 0)))))) \\
& \rightarrow A(3, A(3, A(3, A(2, A(1, A(1, 1)))))) \\
& \rightarrow A(3, A(3, A(3, A(2, A(1, A(0, A(1, 0))))))) \\
& \rightarrow A(3, A(3, A(3, A(2, A(1, A(0, A(0, 1))))))) \\
& \rightarrow A(3, A(3, A(3, A(2, A(1, A(0, 2)))))) \\
& \rightarrow A(3, A(3, A(3, A(2, A(1, 3))))) \\
& \rightarrow A(3, A(3, A(3, A(2, A(0, A(1, 2)))))) \\
& \rightarrow A(3, A(3, A(3, A(2, A(0, A(0, A(1, 1))))))) \\
& \rightarrow A(3, A(3, A(3, A(2, A(0, A(0, A(0, A(1, 0)))))))) \\
& \rightarrow A(3, A(3, A(3, A(2, A(0, A(0, A(0, A(0, 1)))))))) \\
& \rightarrow A(3, A(3, A(3, A(2, A(0, A(0, A(0, 2)) )) )) ) \\
& \rightarrow A(3, A(3, A(3, A(2, A(0, A(0, 3)))))) \\
& \rightarrow A(3, A(3, A(3, A(2, A(0, 4))))) \\
& \rightarrow A(3, A(3, A(3, A(2, 5)))) \\
& \qquad\vdots \\
& \rightarrow A(3, A(3, A(3, 13))) \\
& \qquad\vdots \\
& \rightarrow A(3, A(3, 65533)) \\
&\qquad\vdots \\
& \rightarrow A(3, 2^ - 3) \\
&\qquad\vdots \\
& \rightarrow 2^ - 3. \\
\end

Table of values

Computing the Ackermann function can be restated in terms of an infinite table. First, place the natural numbers along the top row. To determine a number in the table, take the number immediately to the left. Then use that number to look up the required number in the column given by that number and one row up. If there is no number to its left, simply look at the column headed "1" in the previous row. Here is a small upper-left portion of the table: The numbers here which are only expressed with recursive exponentiation or Knuth arrows are very large and would take up too much space to notate in plain decimal digits. Despite the large values occurring in this early section of the table, some even larger numbers have been defined, such as

Graham's number Graham's number is an Large numbers, immense number that arose as an upper bound on the answer of a problem in the mathematical field of Ramsey theory. It is much larger than many other large numbers such as Skewes's number and Moser's number, bot ...

, which cannot be written with any small number of Knuth arrows. This number is constructed with a technique similar to applying the Ackermann function to itself recursively. This is a repeat of the above table, but with the values replaced by the relevant expression from the function definition to show the pattern clearly:

Properties

General remarks

*It may not be immediately obvious that the evaluation of

A(m, n)

always terminates. However, the recursion is bounded because in each recursive application either

m

decreases, or

m

remains the same and

n

decreases. Each time that

n

reaches zero,

m

decreases, so

m

eventually reaches zero as well. (Expressed more technically, in each case the pair

(m,n)

decreases in the

lexicographic order In mathematics, the lexicographic or lexicographical order (also known as lexical order, or dictionary order) is a generalization of the alphabetical order of the dictionaries to sequences of ordered symbols or, more generally, of elements of a ...

on pairs, which is a

well-order In mathematics, a well-order (or well-ordering or well-order relation) on a set is a total ordering on with the property that every non-empty subset of has a least element in this ordering. The set together with the ordering is then calle ...

ing, just like the ordering of single non-negative integers; this means one cannot go down in the ordering infinitely many times in succession.) However, when

m

decreases there is no upper bound on how much

n

can increase — and it will often increase greatly. *For small values of ''m'' like 1, 2, or 3, the Ackermann function grows relatively slowly with respect to ''n'' (at most exponentially). For

m\geq 4

, however, it grows much more quickly; even

A(4,2)

is about 2.00353, and the decimal expansion of

A(4, 3)

is very large by any typical measure, about 2.12004. *An interesting aspect is that the only arithmetic operation it ever uses is addition of 1. Its fast growing power is based solely on nested recursion. This also implies that its running time is at least proportional to its output, and so is also extremely huge. In actuality, for most cases the running time is far larger than the output; see above. *A single-argument version

f(n)=A(n,n)

that increases both

m

and

n

at the same time dwarfs every primitive recursive function, including very fast-growing functions such as the exponential function, the factorial function, multi- and superfactorial functions, and even functions defined using Knuth's up-arrow notation (except when the indexed up-arrow is used). It can be seen that

f(n)

is roughly comparable to

f_(n)

in the fast-growing hierarchy. This extreme growth can be exploited to show that

f

which is obviously computable on a machine with infinite memory such as a

Turing machine A Turing machine is a mathematical model of computation describing an abstract machine that manipulates symbols on a strip of tape according to a table of rules. Despite the model's simplicity, it is capable of implementing any computer algori ...

and so is a

, grows faster than any primitive recursive function and is therefore not primitive recursive.

Not primitive recursive

The Ackermann function grows faster than any

primitive recursive function In computability theory, a primitive recursive function is, roughly speaking, a function that can be computed by a computer program whose loops are all "for" loops (that is, an upper bound of the number of iterations of every loop is fixed befor ...

and therefore is not itself primitive recursive. Proof sketch: primitive recursive function defined using up to k recursions must grow slower than

f_(n)

, the (k+1)-th function in the fast-growing hierarchy, but the Ackermann function grows at least as fast as

f_\omega(n)

. Specifically, one shows that, for every primitive recursive function

f(x_1,\ldots,x_n)

, there exists a non-negative integer

t

, such that for all non-negative integers

x_1,\ldots,x_n

f(x_1,\ldots,x_n) Once this is established, it follows that A itself is not primitive recursive, since otherwise putting x_1=x_2=t would lead to the contradiction A(t,t) The proof proceeds as follows: define the class \mathcal of all functions that grow slower than the Ackermann function \mathcal=\left\ and show that \mathcal contains all primitive recursive functions. The latter is achieved by showing that \mathcal contains the constant functions, the successor function, the projection functions and that it is closed under the operations of function composition and primitive recursion.

Inverse

Since the function considered above grows very rapidly, its

inverse function In mathematics, the inverse function of a function (also called the inverse of ) is a function that undoes the operation of . The inverse of exists if and only if is bijective, and if it exists, is denoted by f^ . For a function f\colon ...

, ''f'', grows very slowly. This inverse Ackermann function ''f''⁻¹ is usually denoted by ''α''. In fact, ''α''(''n'') is less than 5 for any practical input size ''n'', since is on the order of

2^

. This inverse appears in the time complexity of some algorithms, such as the

disjoint-set data structure In computer science, a disjoint-set data structure, also called a union–find data structure or merge–find set, is a data structure that stores a collection of Disjoint sets, disjoint (non-overlapping) Set (mathematics), sets. Equivalently, it ...

and Chazelle's algorithm for

minimum spanning tree A minimum spanning tree (MST) or minimum weight spanning tree is a subset of the edges of a connected, edge-weighted undirected graph that connects all the vertices together, without any cycles and with the minimum possible total edge weight. ...

s. Sometimes Ackermann's original function or other variations are used in these settings, but they all grow at similarly high rates. In particular, some modified functions simplify the expression by eliminating the −3 and similar terms. A two-parameter variation of the inverse Ackermann function can be defined as follows, where

\lfloor x \rfloor

is the

floor function In mathematics, the floor function is the function that takes as input a real number , and gives as output the greatest integer less than or equal to , denoted or . Similarly, the ceiling function maps to the least integer greater than or eq ...

\alpha(m,n) = \min\.

This function arises in more precise analyses of the algorithms mentioned above, and gives a more refined time bound. In the disjoint-set data structure, ''m'' represents the number of operations while ''n'' represents the number of elements; in the minimum spanning tree algorithm, ''m'' represents the number of edges while ''n'' represents the number of vertices. Several slightly different definitions of exist; for example, is sometimes replaced by ''n'', and the floor function is sometimes replaced by a

ceiling A ceiling is an overhead interior roof that covers the upper limits of a room. It is not generally considered a structural element, but a finished surface concealing the underside of the roof structure or the floor of a story above. Ceilings can ...

. Other studies might define an inverse function of one where m is set to a constant, such that the inverse applies to a particular row. The inverse of the Ackermann function is primitive recursive, since it is graph primitive recursive, and it is upper bounded by a primitive recursive function.

Usage

In computational complexity

The Ackermann function appears in the time

complexity Complexity characterizes the behavior of a system or model whose components interact in multiple ways and follow local rules, leading to non-linearity, randomness, collective dynamics, hierarchy, and emergence. The term is generally used to c ...

of some

algorithm In mathematics and computer science, an algorithm () is a finite sequence of Rigour#Mathematics, mathematically rigorous instructions, typically used to solve a class of specific Computational problem, problems or to perform a computation. Algo ...

s, such as vector addition systems and Petri net reachability, thus showing they are computationally infeasible for large instances. The inverse of the Ackermann function appears in some time complexity results. For instance, the

takes

amortized time In computer science, amortized analysis is a method for analyzing a given algorithm's complexity, or how much of a resource, especially time or memory, it takes to execute. The motivation for amortized analysis is that looking at the worst-case ...

per operation proportional to the inverse Ackermann function, and cannot be made faster within the cell-probe model of computational complexity.

In discrete geometry

Certain problems in

discrete geometry Discrete geometry and combinatorial geometry are branches of geometry that study combinatorial properties and constructive methods of discrete geometric objects. Most questions in discrete geometry involve finite or discrete sets of basic geom ...

related to

Davenport–Schinzel sequence In combinatorics, a Davenport–Schinzel sequence is a sequence of symbols in which the number of times any two symbols may appear in alternation is limited. The maximum possible length of a Davenport–Schinzel sequence is bounded by the number of ...

s have complexity bounds in which the inverse Ackermann function

\alpha(n)

appears. For instance, for

n

line segments in the plane, the unbounded face of the

arrangement In music, an arrangement is a musical adaptation of an existing composition. Differences from the original composition may include reharmonization, melodic paraphrasing, orchestration, or formal development. Arranging differs from orchestr ...

of the segments has complexity

O(n\alpha(n))

, and some systems of

n

line segments have an unbounded face of complexity

\Omega(n\alpha(n))

As a benchmark

The Ackermann function, due to its definition in terms of extremely deep recursion, can be used as a benchmark of a

compiler In computing, a compiler is a computer program that Translator (computing), translates computer code written in one programming language (the ''source'' language) into another language (the ''target'' language). The name "compiler" is primaril ...

's ability to optimize recursion. The first published use of Ackermann's function in this way was in 1970 by Dragoș Vaida and, almost simultaneously, in 1971, by Yngve Sundblad. Sundblad's seminal paper was taken up by Brian Wichmann (co-author of the Whetstone benchmark) in a trilogy of papers written between 1975 and 1982.

Notes

References

Bibliography

* * * * * * * * * * * * * * * * * * * * * Original version 1980, published in ''ACM SIGACT News'', modified on 20 October 2012 and 23 January 2016 (working paper) * * * * * * * * *

External links

* * *
An animated Ackermann function calculator
*

Includes a table of some values. * * describes several variations on the definition of ''A''. * *
The Ackermann function written in different programming languages
(on

) * ) Some study and programming. * {{Authority control Arithmetic Large integers Special functions Theory of computation Computability theory