Collapsing Function
   HOME

TheInfoList



OR:

In
mathematical logic Mathematical logic is the study of logic, formal logic within mathematics. Major subareas include model theory, proof theory, set theory, and recursion theory. Research in mathematical logic commonly addresses the mathematical properties of for ...
and
set theory Set theory is the branch of mathematical logic that studies sets, which can be informally described as collections of objects. Although objects of any kind can be collected into a set, set theory, as a branch of mathematics, is mostly conce ...
, an ordinal collapsing function (or projection function) is a technique for defining (
notations ''Notations'' is a book that was edited and compiled by American avant-garde composer John Cage (1912–1992) with Alison Knowles and first published in 1969 by Something Else Press. The book is made up of a large collection of graphical scores, ...
for) certain
recursive Recursion (adjective: ''recursive'') occurs when a thing is defined in terms of itself or of its type. Recursion is used in a variety of disciplines ranging from linguistics to logic. The most common application of recursion is in mathematics ...
large countable ordinal In the mathematical discipline of set theory, there are many ways of describing specific countable ordinals. The smallest ones can be usefully and non-circularly expressed in terms of their Cantor normal forms. Beyond that, many ordinals of releva ...
s, whose principle is to give names to certain ordinals much larger than the one being defined, perhaps even
large cardinals In the mathematical field of set theory, a large cardinal property is a certain kind of property of Transfinite number, transfinite cardinal numbers. Cardinals with such properties are, as the name suggests, generally very "large" (for example, big ...
(though they can be replaced with recursively large ordinals at the cost of extra technical difficulty), and then "collapse" them down to a system of notations for the sought-after ordinal. For this reason, ordinal collapsing functions are described as an
impredicative In mathematics, logic and philosophy of mathematics, something that is impredicative is a self-referencing definition. Roughly speaking, a definition is impredicative if it invokes (mentions or quantifies over) the set being defined, or (more com ...
manner of naming ordinals. The details of the definition of ordinal collapsing functions vary, and get more complicated as greater ordinals are being defined, but the typical idea is that whenever the notation system "runs out of fuel" and cannot name a certain ordinal, a much larger ordinal is brought "from above" to give a name to that critical point. An example of how this works will be detailed below, for an ordinal collapsing function defining the
Bachmann–Howard ordinal In mathematics, the Bachmann–Howard ordinal (also known as the Howard ordinal, or Howard-Bachmann ordinal) is a large countable ordinal. It is the ordinal analysis, proof-theoretic ordinal of several mathematical theory (logic), theories, such as ...
(i.e., defining a system of notations up to the Bachmann–Howard ordinal). The use and definition of ordinal collapsing functions is inextricably intertwined with the theory of
ordinal analysis In proof theory, ordinal analysis assigns ordinals (often large countable ordinals) to mathematical theories as a measure of their strength. If theories have the same proof-theoretic ordinal they are often equiconsistent, and if one theory has ...
, since the large countable ordinals defined and denoted by a given collapse are used to describe the ordinal-theoretic strength of certain
formal system A formal system is an abstract structure used for inferring theorems from axioms according to a set of rules. These rules, which are used for carrying out the inference of theorems from axioms, are the logical calculus of the formal system. A form ...
s, typicallyRathjen, 1995 (Bull. Symbolic Logic)Kahle, 2002 (Synthese) subsystems of
analysis Analysis ( : analyses) is the process of breaking a complex topic or substance into smaller parts in order to gain a better understanding of it. The technique has been applied in the study of mathematics and logic since before Aristotle (38 ...
(such as those seen in the light of
reverse mathematics Reverse mathematics is a program in mathematical logic that seeks to determine which axioms are required to prove theorems of mathematics. Its defining method can briefly be described as "going backwards from the theorems to the axioms", in cont ...
), extensions of
Kripke–Platek set theory The Kripke–Platek set theory (KP), pronounced , is an axiomatic set theory developed by Saul Kripke and Richard Platek. The theory can be thought of as roughly the predicative part of ZFC and is considerably weaker than it. Axioms In its fo ...
,
Bishop A bishop is an ordained clergy member who is entrusted with a position of authority and oversight in a religious institution. In Christianity, bishops are normally responsible for the governance of dioceses. The role or office of bishop is ca ...
-style systems of
constructive mathematics In the philosophy of mathematics, constructivism asserts that it is necessary to find (or "construct") a specific example of a mathematical object in order to prove that an example exists. Contrastingly, in classical mathematics, one can prove th ...
or Martin-Löf-style systems of
intuitionistic type theory Intuitionistic type theory (also known as constructive type theory, or Martin-Löf type theory) is a type theory and an alternative foundation of mathematics. Intuitionistic type theory was created by Per Martin-Löf, a Swedish mathematician and ph ...
. (
psi Psi, PSI or Ψ may refer to: Alphabetic letters * Psi (Greek) (Ψ, ψ), the 23rd letter of the Greek alphabet * Psi (Cyrillic) (Ѱ, ѱ), letter of the early Cyrillic alphabet, adopted from Greek Arts and entertainment * "Psi" as an abbreviation ...
)(1) Ordinal collapsing functions are typically denoted using some variation of either the Greek letter \psi (
psi Psi, PSI or Ψ may refer to: Alphabetic letters * Psi (Greek) (Ψ, ψ), the 23rd letter of the Greek alphabet * Psi (Cyrillic) (Ѱ, ѱ), letter of the early Cyrillic alphabet, adopted from Greek Arts and entertainment * "Psi" as an abbreviation ...
) or \theta (
theta Theta (, ; uppercase: Θ or ; lowercase: θ or ; grc, ''thē̂ta'' ; Modern: ''thī́ta'' ) is the eighth letter of the Greek alphabet, derived from the Phoenician letter Teth . In the system of Greek numerals, it has a value of 9. Gr ...
).


An example leading up to the Bachmann–Howard ordinal

The choice of the ordinal collapsing function given as example below imitates greatly the system introduced by BuchholzBuchholz, 1986 (Ann. Pure Appl. Logic) but is limited to collapsing one cardinal for clarity of exposition. More on the relation between this example and Buchholz's system will be said
below Below may refer to: *Earth *Ground (disambiguation) *Soil *Floor *Bottom (disambiguation) Bottom may refer to: Anatomy and sex * Bottom (BDSM), the partner in a BDSM who takes the passive, receiving, or obedient role, to that of the top or ...
.


Definition

Let \Omega stand for the
first uncountable ordinal In mathematics, the first uncountable ordinal, traditionally denoted by \omega_1 or sometimes by \Omega, is the smallest ordinal number that, considered as a set, is uncountable. It is the supremum (least upper bound) of all countable ordinals. Whe ...
\omega_1, or, in fact, any ordinal which is an \varepsilon-number and guaranteed to be greater than all the
countable ordinal In set theory, an ordinal number, or ordinal, is a generalization of ordinal numerals (first, second, th, etc.) aimed to extend enumeration to infinite sets. A finite set can be enumerated by successively labeling each element with the least n ...
s which will be constructed (for example, the
Church–Kleene ordinal In mathematics, particularly set theory, non-recursive ordinals are large countable ordinals greater than all the recursive ordinals, and therefore can not be expressed using ordinal collapsing functions. The Church–Kleene ordinal and variant ...
is adequate for our purposes; but we will work with \omega_1 because it allows the convenient use of the word ''countable'' in the definitions). We define a function \psi (which will be non-decreasing and
continuous Continuity or continuous may refer to: Mathematics * Continuity (mathematics), the opposing concept to discreteness; common examples include ** Continuous probability distribution or random variable in probability and statistics ** Continuous ...
), taking an arbitrary ordinal \alpha to a countable ordinal \psi(\alpha), recursively on \alpha, as follows: :Assume \psi(\beta) has been defined for all \beta<\alpha, and we wish to define \psi(\alpha). :Let C(\alpha) be the set of ordinals generated starting from 0, 1, \omega and \Omega by recursively applying the following functions: ordinal addition, multiplication and exponentiation and the function \psi\upharpoonright_\alpha, i.e., the restriction of \psi to ordinals \beta<\alpha. (Formally, we define C(\alpha)_0 = \ and inductively C(\alpha)_ = C(\alpha)_n \cup \ \cup \ for all natural numbers n and we let C(\alpha) be the union of the C(\alpha)_n for all n.) :Then \psi(\alpha) is defined as the smallest ordinal not belonging to C(\alpha). In a more concise (although more obscure) way: :\psi(\alpha) is the smallest ordinal which cannot be expressed from 0, 1, \omega and \Omega using sums, products, exponentials, and the \psi function itself (to previously constructed ordinals less than \alpha). Here is an attempt to explain the motivation for the definition of \psi in intuitive terms: since the usual operations of addition, multiplication and exponentiation are not sufficient to designate ordinals very far, we attempt to systematically create new names for ordinals by taking the first one which does not have a name yet, and whenever we run out of names, rather than invent them in an ''ad hoc'' fashion or using diagonal schemes, we seek them in the ordinals far beyond the ones we are constructing (beyond \Omega, that is); so we give names to uncountable ordinals and, since in the end the list of names is necessarily countable, \psi will "collapse" them to countable ordinals.


Computation of values of ''ψ''

To clarify how the function \psi is able to produce notations for certain ordinals, we now compute its first values.


Predicative start

First consider C(0). It contains ordinals 0, 1, 2, 3, \omega, \omega+1, \omega+2, \omega\cdot 2, \omega\cdot 3, \omega^2, \omega^3, \omega^\omega, \omega^ and so on. It also contains such ordinals as \Omega, \Omega+1, \omega^, \Omega^\Omega. The first ordinal which it does not contain is \varepsilon_0 (which is the limit of \omega, \omega^\omega, \omega^ and so on — less than \Omega by assumption). The upper bound of the ordinals it contains is \varepsilon_ (the limit of \Omega, \Omega^\Omega, \Omega^ and so on), but that is not so important. This shows that \psi(0) = \varepsilon_0. Similarly, C(1) contains the ordinals which can be formed from 0, 1, \omega, \Omega and this time also \varepsilon_0, using addition, multiplication and exponentiation. This contains all the ordinals up to \varepsilon_1 but not the latter, so \psi(1) = \varepsilon_1. In this manner, we prove that \psi(\alpha) = \varepsilon_\alpha inductively on \alpha: the proof works, however, only as long as \alpha<\varepsilon_\alpha. We therefore have: :\psi(\alpha) = \varepsilon_\alpha = \phi_1(\alpha) for all \alpha\leq\zeta_0, where \zeta_0 = \varphi_2(0) is the smallest fixed point of \alpha \mapsto \varepsilon_\alpha. (Here, the \varphi functions are the
Veblen function In mathematics, the Veblen functions are a hierarchy of normal functions ( continuous strictly increasing functions from ordinals to ordinals), introduced by Oswald Veblen in . If φ0 is any normal function, then for any non-zero ordinal α, φ ...
s defined starting with \phi_1(\alpha) = \varepsilon_\alpha.) Now \psi(\zeta_0) = \zeta_0 but \psi(\zeta_0+1) is no larger, since \zeta_0 cannot be constructed using finite applications of \varphi_1\colon \alpha\mapsto\varepsilon_\alpha and thus never belongs to a C(\alpha) set for \alpha\leq\Omega, and the function \psi remains "stuck" at \zeta_0 for some time: :\psi(\alpha) = \zeta_0 for all \zeta_0 \leq \alpha \leq \Omega.


First impredicative values

Again, \psi(\Omega) = \zeta_0. However, when we come to computing \psi(\Omega+1), something has changed: since \Omega was ("artificially") added to all the C(\alpha), we are permitted to take the value \psi(\Omega) = \zeta_0 in the process. So C(\Omega+1) contains all ordinals which can be built from 0, 1, \omega, \Omega, the \varphi_1\colon\alpha\mapsto\varepsilon_\alpha function ''up to \zeta_0'' and this time also \zeta_0 itself, using addition, multiplication and exponentiation. The smallest ordinal not in C(\Omega+1) is \varepsilon_ (the smallest \varepsilon-number after \zeta_0). We say that the definition \psi(\Omega) = \zeta_0 and the next values of the function \psi such as \psi(\Omega+1) = \varepsilon_ are
impredicative In mathematics, logic and philosophy of mathematics, something that is impredicative is a self-referencing definition. Roughly speaking, a definition is impredicative if it invokes (mentions or quantifies over) the set being defined, or (more com ...
because they use ordinals (here, \Omega) greater than the ones which are being defined (here, \zeta_0).


Values of ''ψ'' up to the Feferman–Schütte ordinal

The fact that \psi(\Omega+\alpha) = \varepsilon_ remains true for all \alpha \leq \zeta_1 = \varphi_2(1) (note, in particular, that \psi(\Omega+\zeta_0) = \varepsilon_: but since now the ordinal \zeta_0 has been constructed there is nothing to prevent from going beyond this). However, at \zeta_1 = \phi_2(1) (the first fixed point of \alpha\mapsto \varepsilon_\alpha beyond \zeta_0), the construction stops again, because \zeta_1 cannot be constructed from smaller ordinals and \zeta_0 by finitely applying the \varepsilon function. So we have \psi(\Omega 2) = \zeta_1. The same reasoning shows that \psi(\Omega(1+\alpha)) = \phi_2(\alpha) for all \alpha\leq\phi_3(0), where \phi_2 enumerates the fixed points of \phi_1\colon\alpha\mapsto\varepsilon_\alpha and \phi_3(0) is the first fixed point of \phi_2. We then have \psi(\Omega^2) = \phi_3(0). Again, we can see that \psi(\Omega^\alpha) = \varphi_(0) for some time: this remains true until the first fixed point \Gamma_0 of \alpha \mapsto \phi_\alpha(0), which is the
Feferman–Schütte ordinal In mathematics, the Feferman–Schütte ordinal Γ0 is a large countable ordinal. It is the proof-theoretic ordinal of several mathematical theories, such as arithmetical transfinite recursion. It is named after Solomon Feferman and Kurt Schütte, ...
. Thus, \psi(\Omega^\Omega) = \Gamma_0 is the Feferman–Schütte ordinal.


Beyond the Feferman–Schütte ordinal

We have \psi(\Omega^\Omega+\Omega^\alpha) = \varphi_(0) for all \alpha\leq\Gamma_1 where \Gamma_1 is the next fixed point of \alpha \mapsto \varphi_\alpha(0). So, if \alpha\mapsto\Gamma_\alpha enumerates the fixed points in question (which can also be noted \varphi(1,0,\alpha) using the many-valued Veblen functions) we have \psi(\Omega^\Omega(1+\alpha)) = \Gamma_\alpha, until the first fixed point \varphi(1,1,0) of the \alpha\mapsto\Gamma_\alpha itself, which will be \psi(\Omega^) (and the first fixed point \varphi(2,0,0) of the \alpha \mapsto \varphi(1,\alpha,0) functions will be \psi(\Omega^)). In this manner: * \psi(\Omega^) is the
Ackermann ordinal In mathematics, the Ackermann ordinal is a certain large countable ordinal, named after Wilhelm Ackermann. The term "Ackermann ordinal" is also occasionally used for the small Veblen ordinal, a somewhat larger ordinal. Unfortunately there is ...
(the range of the notation \varphi(\alpha,\beta,\gamma) defined predicatively), * \psi(\Omega^) is the "small" Veblen ordinal (the range of the notations \varphi(\cdot) predicatively using finitely many variables), * \psi(\Omega^) is the "large" Veblen ordinal (the range of the notations \varphi(\cdot) predicatively using transfinitely-but-predicatively-many variables), * the limit \psi(\varepsilon_) of \psi(\Omega), \psi(\Omega^\Omega), \psi(\Omega^), etc., is the
Bachmann–Howard ordinal In mathematics, the Bachmann–Howard ordinal (also known as the Howard ordinal, or Howard-Bachmann ordinal) is a large countable ordinal. It is the ordinal analysis, proof-theoretic ordinal of several mathematical theory (logic), theories, such as ...
: after this our function \psi is constant, and we can go no further with the definition we have given.


Ordinal notations up to the Bachmann–Howard ordinal

We now explain more systematically how the \psi function defines notations for ordinals up to the Bachmann–Howard ordinal.


A note about base representations

Recall that if \delta is an ordinal which is a power of \omega (for example \omega itself, or \varepsilon_0, or \Omega), any ordinal \alpha can be uniquely expressed in the form \delta^\gamma_1 + \ldots + \delta^\gamma_k, where k is a natural number, \gamma_1,\ldots,\gamma_k are non-zero ordinals less than \delta, and \beta_1 > \beta_2 > \cdots > \beta_k are ordinal numbers (we allow \beta_k=0). This "base \delta representation" is an obvious generalization of the
Cantor normal form In the mathematical field of set theory, ordinal arithmetic describes the three usual operations on ordinal numbers: addition, multiplication, and exponentiation. Each can be defined in essentially two different ways: either by constructing an e ...
(which is the case \delta=\omega). Of course, it may quite well be that the expression is uninteresting, i.e., \alpha = \delta^\alpha, but in any other case the \beta_i must all be less than \alpha; it may also be the case that the expression is trivial (i.e., \alpha<\delta, in which case k\leq 1 and \gamma_1 = \alpha). If \alpha is an ordinal less than \varepsilon_, then its base \Omega representation has coefficients \gamma_i<\Omega (by definition) and exponents \beta_i<\alpha (because of the assumption \alpha < \varepsilon_): hence one can rewrite these exponents in base \Omega and repeat the operation until the process terminates (any decreasing sequence of ordinals is finite). We call the resulting expression the ''iterated base \Omega representation'' of \alpha and the various coefficients involved (including as exponents) the ''pieces'' of the representation (they are all <\Omega), or, for short, the \Omega-pieces of \alpha.


Some properties of ''ψ''

* The function \psi is non-decreasing and continuous (this is more or less obvious from its definition). * If \psi(\alpha) = \psi(\beta) with \beta<\alpha then necessarily C(\alpha) = C(\beta). Indeed, no ordinal \beta' with \beta\leq\beta'<\alpha can belong to C(\alpha) (otherwise its image by \psi, which is \psi(\alpha) would belong to C(\alpha) — impossible); so C(\beta) is closed by everything under which C(\alpha) is the closure, so they are equal. * Any value \gamma=\psi(\alpha) taken by \psi is an \varepsilon-number (i.e., a fixed point of \beta\mapsto\omega^\beta). Indeed, if it were not, then by writing it in
Cantor normal form In the mathematical field of set theory, ordinal arithmetic describes the three usual operations on ordinal numbers: addition, multiplication, and exponentiation. Each can be defined in essentially two different ways: either by constructing an e ...
, it could be expressed using sums, products and exponentiation from elements less than it, hence in C(\alpha), so it would be in C(\alpha), a contradiction. * Lemma: Assume \delta is an \varepsilon-number and \alpha an ordinal such that \psi(\beta)<\delta for all \beta<\alpha: then the \Omega-pieces (defined above) of any element of C(\alpha) are less than \delta. Indeed, let C' be the set of ordinals all of whose \Omega-pieces are less than \delta. Then C' is closed under addition, multiplication and exponentiation (because \delta is an \varepsilon-number, so ordinals less than it are closed under addition, multiplication and exponentiation). And C' also contains every \psi(\beta) for \beta<\alpha by assumption, and it contains 0, 1, \omega, \Omega. So C'\supseteq C(\alpha), which was to be shown. * Under the hypothesis of the previous lemma, \psi(\alpha) \leq \delta (indeed, the lemma shows that \delta \not\in C(\alpha)). * Any \varepsilon-number less than some element in the range of \psi is itself in the range of \psi (that is, \psi omits no \varepsilon-number). Indeed: if \delta is an \varepsilon-number not greater than the range of \psi, let \alpha be the least upper bound of the \beta such that \psi(\beta)<\delta: then by the above we have \psi(\alpha)\leq\delta, but \psi(\alpha)<\delta would contradict the fact that \alpha is the ''least'' upper bound — so \psi(\alpha)=\delta. * Whenever \psi(\alpha) = \delta, the set C(\alpha) consists exactly of those ordinals \gamma (less than \varepsilon_) all of whose \Omega-pieces are less than \delta. Indeed, we know that all ordinals less than \delta, hence all ordinals (less than \varepsilon_) whose \Omega-pieces are less than \delta, are in C(\alpha). Conversely, if we assume \psi(\beta) < \delta for all \beta<\alpha (in other words if \alpha is the least possible with \psi(\alpha)=\delta), the lemma gives the desired property. On the other hand, if \psi(\alpha) = \psi(\beta) for some \beta<\alpha, then we have already remarked C(\alpha) = C(\beta) and we can replace \alpha by the least possible with \psi(\alpha)=\delta.


The ordinal notation

Using the facts above, we can define a (canonical) ordinal notation for every \gamma less than the Bachmann–Howard ordinal. We do this by induction on \gamma. If \gamma is less than \varepsilon_0, we use the iterated Cantor normal form of \gamma. Otherwise, there exists a largest \varepsilon-number \delta less or equal to \gamma (this is because the set of \varepsilon-numbers is closed): if \delta<\gamma then by induction we have defined a notation for \delta and the base \delta representation of \gamma gives one for \gamma, so we are finished. It remains to deal with the case where \gamma=\delta is an \varepsilon-number: we have argued that, in this case, we can write \delta = \psi(\alpha) for some (possibly uncountable) ordinal \alpha<\varepsilon_: let \alpha be the ''greatest'' possible such ordinal (which exists since \psi is continuous). We use the iterated base \Omega representation of \alpha: it remains to show that every piece of this representation is less than \delta (so we have already defined a notation for it). If this is ''not'' the case then, by the properties we have shown, C(\alpha) does not contain \alpha; but then C(\alpha+1)=C(\alpha) (they are closed under the same operations, since the value of \psi at \alpha can never be taken), so \psi(\alpha+1)=\psi(\alpha)=\delta, contradicting the maximality of \alpha. Note: Actually, we have defined canonical notations not just for ordinals below the Bachmann–Howard ordinal but also for certain uncountable ordinals, namely those whose \Omega-pieces are less than the Bachmann–Howard ordinal (viz.: write them in iterated base \Omega representation and use the canonical representation for every piece). This canonical notation is used for arguments of the \psi function (which may be uncountable).


Examples

For ordinals less than \varepsilon_0 = \psi(0), the canonical ordinal notation defined coincides with the iterated Cantor normal form (by definition). For ordinals less than \varepsilon_1 = \psi(1), the notation coincides with iterated base \varepsilon_0 notation (the pieces being themselves written in iterated Cantor normal form): e.g., \omega^ will be written ^, or, more accurately, \psi(0)^. For ordinals less than \varepsilon_2 = \psi(2), we similarly write in iterated base \varepsilon_1 and then write the pieces in iterated base \varepsilon_0 (and write the pieces of ''that'' in iterated Cantor normal form): so \omega^ is written ^, or, more accurately, \psi(1)^. Thus, up to \zeta_0 = \psi(\Omega), we always use the largest possible \varepsilon-number base which gives a non-trivial representation. Beyond this, we may need to express ordinals beyond \Omega: this is always done in iterated \Omega-base, and the pieces themselves need to be expressed using the largest possible \varepsilon-number base which gives a non-trivial representation. Note that while \psi(\varepsilon_) is equal to the Bachmann–Howard ordinal, this is not a "canonical notation" in the sense we have defined (canonical notations are defined only for ordinals ''less'' than the Bachmann–Howard ordinal).


Conditions for canonicalness

The notations thus defined have the property that whenever they nest \psi functions, the arguments of the "inner" \psi function are always less than those of the "outer" one (this is a consequence of the fact that the \Omega-pieces of \alpha, where \alpha is the largest possible such that \psi(\alpha)=\delta for some \varepsilon-number \delta, are all less than \delta, as we have shown above). For example, \psi(\psi(\Omega)+1) does not occur as a notation: it is a well-defined expression (and it is equal to \psi(\Omega) = \zeta_0 since \psi is constant between \zeta_0 and \Omega), but it is not a notation produced by the inductive algorithm we have outlined. Canonicalness can be checked recursively: an expression is canonical if and only if it is either the iterated Cantor normal form of an ordinal less than \varepsilon_0, or an iterated base \delta representation all of whose pieces are canonical, for some \delta=\psi(\alpha) where \alpha is itself written in iterated base \Omega representation all of whose pieces are canonical and less than \delta. The order is checked by lexicographic verification at all levels (keeping in mind that \Omega is greater than any expression obtained by \psi, and for canonical values the greater \psi always trumps the lesser or even arbitrary sums, products and exponentials of the lesser). For example, \psi(\Omega^\,\psi(\Omega) + \psi(\Omega^\omega)^42)^ is a canonical notation for an ordinal which is less than the Feferman–Schütte ordinal: it can be written using the Veblen functions as \varphi_1(\varphi_(\varphi_2(0)) + \varphi_\omega(0)^42)^. Concerning the order, one might point out that \psi(\Omega^\Omega) (the Feferman–Schütte ordinal) is much more than \psi(\Omega^) = \varphi_(0) (because \Omega is greater than \psi of anything), and \psi(\Omega^) = \varphi_(0) is itself much more than \psi(\Omega)^ = \varphi_2(0)^ (because \Omega^ is greater than \Omega, so any sum-product-or-exponential expression involving \psi(\Omega) and smaller value will remain less than \psi(\Omega^\Omega)). In fact, \psi(\Omega)^ is already less than \psi(\Omega+1).


Standard sequences for ordinal notations

To witness the fact that we have defined notations for ordinals below the Bachmann–Howard ordinal (which are all of countable
cofinality In mathematics, especially in order theory, the cofinality cf(''A'') of a partially ordered set ''A'' is the least of the cardinalities of the cofinal subsets of ''A''. This definition of cofinality relies on the axiom of choice, as it uses the ...
), we might define standard sequences converging to any one of them (provided it is a limit ordinal, of course). Actually we will define canonical sequences for certain uncountable ordinals, too, namely the uncountable ordinals of ''countable'' cofinality (if we are to hope to define a sequence converging to them...) which are representable (that is, all of whose \Omega-pieces are less than the Bachmann–Howard ordinal). The following rules are more or less obvious, except for the last: * First, get rid of the (iterated) base \delta representations: to define a standard sequence converging to \alpha = \delta^\gamma_1 + \cdots + \delta^\gamma_k, where \delta is either \omega or \psi(\cdots) (or \Omega, but see below): ** if k is zero then \alpha=0 and there is nothing to be done; ** if \beta_k is zero and \gamma_k is successor, then \alpha is successor and there is nothing to be done; ** if \gamma_k is limit, take the standard sequence converging to \gamma_k and replace \gamma_k in the expression by the elements of that sequence; ** if \gamma_k is successor and \beta_k is limit, rewrite the last term \delta^\gamma_k as \delta^(\gamma_k-1) + \delta^ and replace the exponent \beta_k in the last term by the elements of the fundamental sequence converging to it; ** if \gamma_k is successor and \beta_k is also, rewrite the last term \delta^\gamma_k as \delta^(\gamma_k-1) + \delta^\delta and replace the last \delta in this expression by the elements of the fundamental sequence converging to it. * If \delta is \omega, then take the obvious 0,1,2,3, \ldots as the fundamental sequence for \delta. * If \delta = \psi(0) then take as fundamental sequence for \delta the sequence \omega, \omega^\omega, \omega^, \ldots * If \delta = \psi(\alpha+1) then take as fundamental sequence for \delta the sequence \psi(\alpha), \psi(\alpha)^, \psi(\alpha)^,\ldots * If \delta = \psi(\alpha) where \alpha is a limit ordinal of ''countable'' cofinality, define the standard sequence for \delta to be obtained by applying \psi to the standard sequence for \alpha (recall that \psi is continuous and increasing, here). * It remains to handle the case where \delta = \psi(\alpha) with \alpha an ordinal of ''uncountable'' cofinality (e.g., \Omega itself). Obviously it doesn't make sense to define a sequence converging to \alpha in this case; however, what we can define is a sequence converging to some \rho<\alpha with countable cofinality and such that \psi is constant between \rho and \alpha. This \rho will be the first fixed point of a certain (continuous and non-decreasing) function \xi\mapsto h(\psi(\xi)). To find it, apply the same rules (from the base \Omega representation of \alpha) as to find the canonical sequence of \alpha, except that whenever a sequence converging to \Omega is called for (something which cannot exist), replace the \Omega in question, in the expression of \alpha = h(\Omega), by a \psi(\xi) (where \xi is a variable) and perform a repeated iteration (starting from 0, say) of the function \xi\mapsto h(\psi(\xi)): this gives a sequence 0, h(\psi(0)), h(\psi(h(\psi(0)))), \ldots tending to \rho, and the canonical sequence for \psi(\alpha) = \psi(\rho) is \psi(0), \psi(h(\psi(0))), \psi(h(\psi(h(\psi(0)))))... If we let the nth element (starting at 0) of the fundamental sequence for \delta be denoted as \delta /math>, then we can state this more clearly using recursion. Using this notation, we can see that \delta = \psi(0) quite easily. We can define the rest of the sequence using recursion: \delta = \psi(h(\delta -1). (The examples below should make this clearer.) Here are some examples for the last (and most interesting) case: * The canonical sequence for \psi(\Omega) is: \psi(0), \psi(\psi(0)), \psi(\psi(\psi(0)))... This indeed converges to \rho = \psi(\Omega) = \zeta_0 after which \psi is constant until \Omega. * The canonical sequence for \psi(\Omega 2) is: \psi(0), \psi(\Omega+\psi(0)), \psi(\Omega+\psi(\Omega+\psi(0))),\ldots This indeed converges to the value of \psi at \rho = \Omega + \psi(\Omega 2) = \Omega + \zeta_1 after which \psi is constant until \Omega 2. * The canonical sequence for \psi(\Omega^2) is: \psi(0), \psi(\Omega\psi(0)), \psi(\Omega\psi(\Omega\psi(0))), \ldots This converges to the value of \psi at \rho = \Omega \psi(\Omega^2). * The canonical sequence for \psi(\Omega^2 3 + \Omega) is \psi(0), \psi(\Omega^2 3 + \psi(0)), \psi(\Omega^2 3 + \psi(\Omega^2 3 + \psi(0))), \ldots This converges to the value of \psi at \rho = \Omega^2 3 + \psi(\Omega^2 3 + \Omega). * The canonical sequence for \psi(\Omega^\Omega) is: \psi(0), \psi(\Omega^), \psi(\Omega^), \ldots This converges to the value of \psi at \rho = \Omega^. * The canonical sequence for \psi(\Omega^\Omega 3) is: \psi(0), \psi(\Omega^\Omega 2+\Omega^), \psi(\Omega^\Omega 2+\Omega^), \ldots This converges to the value of \psi at \rho = \Omega^\Omega 2 + \Omega^. * The canonical sequence for \psi(\Omega^) is: \psi(0), \psi(\Omega^\Omega \psi(0)), \psi(\Omega^\Omega \psi(\Omega^\Omega \psi(0))),\ldots This converges to the value of \psi at \rho = \Omega^\Omega \psi(\Omega^). * The canonical sequence for \psi(\Omega^) is: \psi(0), \psi(\Omega^), \psi(\Omega^),\ldots Here are some examples of the other cases: * The canonical sequence for \omega^2 is: 0, \omega, \omega 2, \omega 3... * The canonical sequence for \psi(\omega^\omega) is: \psi(1), \psi(\omega), \psi(\omega^2), \psi(\omega^3)... * The canonical sequence for \psi(\Omega)^\omega is: 1, \psi(\Omega), \psi(\Omega)^2, \psi(\Omega)^3... * The canonical sequence for \psi(\Omega+1) is: \psi(\Omega), \psi(\Omega)^, \psi(\Omega)^... * The canonical sequence for \psi(\Omega+\omega) is: \psi(\Omega), \psi(\Omega+1), \psi(\Omega+2), \psi(\Omega+3)... * The canonical sequence for \psi(\Omega\omega) is: \psi(0), \psi(\Omega), \psi(\Omega 2), \psi(\Omega 3)... * The canonical sequence for \psi(\Omega^\omega) is: \psi(1), \psi(\Omega), \psi(\Omega^2), \psi(\Omega^3)... * The canonical sequence for \psi(\Omega^) is: \psi(\Omega^\omega), \psi(\Omega^), \psi(\Omega^)... (this is derived from the fundamental sequence for \psi(0)). * The canonical sequence for \psi(\Omega^) is: \psi(\Omega^), \psi(\Omega^), \psi(\Omega^)... (this is derived from the fundamental sequence for \psi(\Omega), which was given above). Even though the Bachmann–Howard ordinal \psi(\varepsilon_) itself has no canonical notation, it is also useful to define a canonical sequence for it: this is \psi(\Omega), \psi(\Omega^\Omega), \psi(\Omega^)...


A terminating process

Start with any ordinal less than or equal to the Bachmann–Howard ordinal, and repeat the following process so long as it is not zero: * if the ordinal is a successor, subtract one (that is, replace it with its predecessor), * if it is a limit, replace it by some element of the canonical sequence defined for it. Then it is true that this process always terminates (as any decreasing sequence of ordinals is finite); however, like (but even more so than for) the
hydra game In mathematics, specifically in graph theory and number theory, a hydra game is a single-player iterative mathematical game played on a tree (graph theory), mathematical tree called a ''hydra'' where, usually, the goal is to cut off the hydra's "h ...
: # it can take a ''very'' long time to terminate, # the proof of termination may be out of reach of certain weak systems of arithmetic. To give some flavor of what the process feels like, here are some steps of it: starting from \psi(\Omega^) (the small Veblen ordinal), we might go down to \psi(\Omega^), from there down to \psi(\Omega^), then \psi(\Omega^) then \psi(\Omega^) then \psi(\Omega^) then \psi(\Omega^) then \psi(\Omega^) then \psi(\Omega^) then \psi(\Omega^) and so on. It appears as though the expressions are getting more and more complicated whereas, in fact, the ordinals always decrease. Concerning the first statement, one could introduce, for any ordinal \alpha less or equal to the Bachmann–Howard ordinal \psi(\varepsilon_), the integer function f_\alpha(n) which counts the number of steps of the process before termination if one always selects the n'th element from the canonical sequence (this function satisfies the identity f_\alpha(n) = f_(n) + 1). Then f_\alpha can be a very fast growing function: already f_(n) is essentially n^n, the function f_(n) is comparable with the
Ackermann function In computability theory, the Ackermann function, named after Wilhelm Ackermann, is one of the simplest and earliest-discovered examples of a total computable function that is not primitive recursive. All primitive recursive functions are total ...
A(n,n), and f_(n) is comparable with the
Goodstein function In mathematical logic, Goodstein's theorem is a statement about the natural numbers, proved by Reuben Goodstein in 1944, which states that every ''Goodstein sequence'' eventually terminates at 0. Kirby and Paris showed that it is unprovable in Pe ...
. If we instead make a function that satisfies the identity g_\alpha(n) = g_(n+1) + 1, so the index of the function increases it is applied, then we create a much faster growing function: g_(n) is already comparable to the Goodstein function, and g_(n) is comparable to the
TREE In botany, a tree is a perennial plant with an elongated stem, or trunk, usually supporting branches and leaves. In some usages, the definition of a tree may be narrower, including only woody plants with secondary growth, plants that are ...
function. Concerning the second statement, a precise version is given by
ordinal analysis In proof theory, ordinal analysis assigns ordinals (often large countable ordinals) to mathematical theories as a measure of their strength. If theories have the same proof-theoretic ordinal they are often equiconsistent, and if one theory has ...
: for example,
Kripke–Platek set theory The Kripke–Platek set theory (KP), pronounced , is an axiomatic set theory developed by Saul Kripke and Richard Platek. The theory can be thought of as roughly the predicative part of ZFC and is considerably weaker than it. Axioms In its fo ...
can proveRathjen, 2005 (Fischbachau slides) that the process terminates for any given \alpha less than the Bachmann–Howard ordinal, but it cannot do this uniformly, i.e., it cannot prove the termination starting from the Bachmann–Howard ordinal. Some theories like
Peano arithmetic In mathematical logic, the Peano axioms, also known as the Dedekind–Peano axioms or the Peano postulates, are axioms for the natural numbers presented by the 19th century Italian mathematician Giuseppe Peano. These axioms have been used nearly u ...
are limited by much smaller ordinals (\varepsilon_0 in the case of Peano arithmetic).


Variations on the example


Making the function ''less'' powerful

It is instructive (although not exactly useful) to make \psi less powerful. If we alter the definition of \psi above to omit exponentiation from the repertoire from which C(\alpha) is constructed, then we get \psi(0) = \omega^\omega (as this is the smallest ordinal which cannot be constructed from 0, 1 and \omega using addition and multiplication only), then \psi(1) = \omega^ and similarly \psi(\omega) = \omega^, \psi(\psi(0)) = \omega^ until we come to a fixed point which is then our \psi(\Omega) = \varepsilon_0. We then have \psi(\Omega+1) = ^\omega and so on until \psi(\Omega 2) = \varepsilon_1. Since multiplication of \Omega's is permitted, we can still form \psi(\Omega^2) = \varphi_2(0) and \psi(\Omega^3) = \varphi_3(0) and so on, but our construction ends there as there is no way to get at or beyond \Omega^\omega: so the range of this weakened system of notation is \psi(\Omega^\omega) = \phi_\omega(0) (the value of \psi(\Omega^\omega) is the same in our weaker system as in our original system, except that now we cannot go beyond it). This does not even go as far as the Feferman–Schütte ordinal. If we alter the definition of \psi yet some more to allow only addition as a primitive for construction, we get \psi(0) = \omega^2 and \psi(1) = \omega^3 and so on until \psi(\psi(0)) = \omega^ and still \psi(\Omega) = \varepsilon_0. This time, \psi(\Omega+1) = \varepsilon_0 \omega and so on until \psi(\Omega 2) = \varepsilon_1 and similarly \psi(\Omega 3) = \varepsilon_2. But this time we can go no further: since we can only add \Omega's, the range of our system is \psi(\Omega\omega) = \varepsilon_\omega = \varphi_1(\omega). In both cases, we find that the limitation on the weakened \psi function comes not so much from the operations allowed on the ''countable'' ordinals as on the ''uncountable'' ordinals we allow ourselves to denote.


Going beyond the Bachmann–Howard ordinal

We know that \psi(\varepsilon_) is the Bachmann–Howard ordinal. The reason why \psi(\varepsilon_+1) is no larger, with our definitions, is that there is no notation for \varepsilon_ (it does not belong to C(\alpha) for any \alpha, it is always the least upper bound of it). One could try to add the \varepsilon function (or the Veblen functions of so-many-variables) to the allowed primitives beyond addition, multiplication and exponentiation, but that does not get us very far. To create more systematic notations for countable ordinals, we need more systematic notations for uncountable ordinals: we cannot use the \psi function itself because it only yields countable ordinals (e.g., \psi(\Omega+1) is, \varepsilon_, certainly not \varepsilon_), so the idea is to mimic its definition as follows: :Let \psi_1(\alpha) be the smallest ordinal which cannot be expressed from all countable ordinals and \Omega_2 using sums, products, exponentials, and the \psi_1 function itself (to previously constructed ordinals less than \alpha). Here, \Omega_2 is a new ordinal guaranteed to be greater than all the ordinals which will be constructed using \psi_1: again, letting \Omega = \omega_1 and \Omega_2 = \omega_2 works. For example, \psi_1(0) = \Omega, and more generally \psi_1(\alpha) = \varepsilon_ for all countable ordinals and even beyond (\psi_1(\Omega) = \psi_1(\psi_1(0)) = \varepsilon_ and \psi_1(\psi_1(1)) = \varepsilon_): this holds up to the first fixed point \zeta_ of the function \xi\mapsto\varepsilon_\xi beyond \Omega, which is the limit of \psi_1(0), \psi_1(\psi_1(0)) and so forth. Beyond this, we have \psi_1(\alpha) = \zeta_ and this remains true until \Omega_2: exactly as was the case for \psi(\Omega), we have \psi_1(\Omega_2) = \zeta_ and \psi_1(\Omega_2+1) = \varepsilon_. The \psi_1 function gives us a system of notations (''assuming'' we can somehow write down all countable ordinals!) for the uncountable ordinals below \psi_1(\varepsilon_), which is the limit of \psi_1(\Omega_2), \psi_1(^) and so forth. Now we can reinject these notations in the original \psi function, modified as follows: :\psi(\alpha) is the smallest ordinal which cannot be expressed from 0, 1, \omega, \Omega and \Omega_2 using sums, products, exponentials, the \psi_1 function, and the \psi function itself (to previously constructed ordinals less than \alpha). This modified function \psi coincides with the previous one up to (and including) \psi(\psi_1(1)) — which is the Bachmann–Howard ordinal. But now we can get beyond this, and \psi(\psi_1(1)+1) is \varepsilon_ (the next \varepsilon-number after the Bachmann–Howard ordinal). We have made our system ''doubly'' impredicative: to create notations for countable ordinals we use notations for certain ordinals between \Omega and \Omega_2 which are themselves defined using certain ordinals beyond \Omega_2. A variation on this scheme, which makes little difference when using just two (or finitely many) collapsing functions, but becomes important for infinitely many of them, is to define :\psi(\alpha) is the smallest ordinal which cannot be expressed from 0, 1, \omega, \Omega and \Omega_2 using sums, products, exponentials, and the \psi_1 and \psi function (to previously constructed ordinals less than \alpha). i.e., allow the use of \psi_1 only for arguments less than \alpha itself. With this definition, we must write \psi(\Omega_2) instead of \psi(\psi_1(\Omega_2)) (although it is still also equal to \psi(\psi_1(\Omega_2)) = \psi(\zeta_), of course, but it is now constant until \Omega_2). This change is inessential because, intuitively speaking, the \psi_1 function collapses the nameable ordinals beyond \Omega_2 below the latter so it matters little whether \psi is invoked directly on the ordinals beyond \Omega_2 or on their image by \psi_1. But it makes it possible to define \psi and \psi_1 by ''simultaneous'' (rather than "downward") induction, and this is important if we are to use infinitely many collapsing functions. Indeed, there is no reason to stop at two levels: using \omega+1 new cardinals in this way, \Omega_1,\Omega_2,\ldots,\Omega_\omega, we get a system essentially equivalent to that introduced by Buchholz, the inessential difference being that since Buchholz uses \omega+1 ordinals from the start, he does not need to allow multiplication or exponentiation; also, Buchholz does not introduce the numbers 1 or \omega in the system as they will also be produced by the \psi functions: this makes the entire scheme much more elegant and more concise to define, albeit more difficult to understand. This system is also sensibly equivalent to the earlier (and much more difficult to grasp) "ordinal diagrams" of Takeuti and \theta functions of Feferman: their range is the same (\psi_0(\varepsilon_), which could be called the Takeuti-Feferman–Buchholz ordinal, and which describes the
strength Strength may refer to: Physical strength *Physical strength, as in people or animals *Hysterical strength, extreme strength occurring when people are in life-and-death situations *Superhuman strength, great physical strength far above human ca ...
of \Pi^1_1-comprehension plus
bar induction Bar induction is a reasoning principle used in intuitionistic mathematics, introduced by L. E. J. Brouwer. Bar induction's main use is the intuitionistic derivation of the fan theorem, a key result used in the derivation of the uniform continuity ...
).


A "normal" variant

Most definitions of ordinal collapsing functions found in the recent literature differ from the ones we have given in one technical but important way which makes them technically more convenient although intuitively less transparent. We now explain this. The following definition (by induction on \alpha) is completely equivalent to that of the function \psi above: :Let C(\alpha,\beta) be the set of ordinals generated starting from 0, 1, \omega, \Omega and all ordinals less than \beta by recursively applying the following functions: ordinal addition, multiplication and exponentiation, and the function \psi\upharpoonright_\alpha. Then \psi(\alpha) is defined as the smallest ordinal \rho such that C(\alpha,\rho) \cap \Omega = \rho. (This is equivalent, because if \sigma is the smallest ordinal not in C(\alpha,0), which is how we originally defined \psi(\alpha), then it is also the smallest ordinal not in C(\alpha,0) = C(\alpha,\sigma), and furthermore the properties we described of \psi imply that no ordinal between \sigma inclusive and \Omega exclusive belongs to C(\alpha,\sigma).) We can now make a change to the definition which makes it subtly different: :Let \tilde C(\alpha,\beta) be the set of ordinals generated starting from 0, 1, \omega, \Omega and all ordinals less than \beta by recursively applying the following functions: ordinal addition, multiplication and exponentiation, and the function \tilde\psi\upharpoonright_\alpha. Then \tilde\psi(\alpha) is defined as the smallest ordinal \rho such that \tilde C(\alpha,\rho) \cap \Omega = \rho and \alpha \in \tilde C(\alpha,\rho). The first values of \tilde\psi coincide with those of \psi: namely, for all \alpha<\zeta_0 where \zeta_0 = \varphi_2(0), we have \tilde\psi(\alpha) = \psi(\alpha) because the additional clause \alpha \in \tilde C(\alpha,\rho) is always satisfied. But at this point the functions start to differ: while the function \psi gets "stuck" at \zeta_0 for all \zeta_0 \leq \alpha \leq \Omega, the function \tilde\psi satisfies \tilde\psi(\zeta_0) = \varepsilon_ because the new condition \alpha \in \tilde C(\alpha,\rho) imposes \tilde\psi(\zeta_0) > \zeta_0. On the other hand, we still have \tilde\psi(\Omega) = \zeta_0 (because \Omega \in C(\alpha,\rho) for all \rho so the extra condition does not come in play). Note in particular that \tilde\psi, unlike \psi, is not monotonic, nor is it continuous. Despite these changes, the \tilde\psi function also defines a system of ordinal notations up to the Bachmann–Howard ordinal: the notations, and the conditions for canonicity, are slightly different (for example, \psi(\Omega+1+\alpha) = \tilde\psi(\tilde\psi(\Omega)+\alpha) for all \alpha less than the common value \psi(\Omega2) = \tilde\psi(\Omega+1)).


Other similar OCFs


Arai's ''ψ''

Arai's ''ψ'' function is an ordinal collapsing function introduced by Toshiyasu Arai (husband of
Noriko H. Arai Noriko H. Arai ( ja, 新井紀子, born 1962) is a Japanese researcher in mathematical logic and artificial intelligence, known for her work on a project to develop robots that can pass the entrance examinations for the University of Tokyo. She is ...
) in his paper: ''A simplified ordinal analysis of first-order reflection''. \psi_\Omega(\alpha) is a collapsing function such that \psi_\Omega(\alpha) < \Omega, where \Omega represents the
first uncountable ordinal In mathematics, the first uncountable ordinal, traditionally denoted by \omega_1 or sometimes by \Omega, is the smallest ordinal number that, considered as a set, is uncountable. It is the supremum (least upper bound) of all countable ordinals. Whe ...
(it can be replaced by the
Church–Kleene ordinal In mathematics, particularly set theory, non-recursive ordinals are large countable ordinals greater than all the recursive ordinals, and therefore can not be expressed using ordinal collapsing functions. The Church–Kleene ordinal and variant ...
at the cost of extra technical difficulty). Throughout the course of this article, \mathsf represents
Kripke–Platek set theory The Kripke–Platek set theory (KP), pronounced , is an axiomatic set theory developed by Saul Kripke and Richard Platek. The theory can be thought of as roughly the predicative part of ZFC and is considerably weaker than it. Axioms In its fo ...
for a \mathsf-reflecting universe, \mathbb_N is the smallest \mathsf-reflecting ordinal, N is a natural number > 2, and \Omega_0 = 0. Suppose \mathsf for a \mathsf (\Omega)-sentence \mathsf. Then,
there exists In predicate logic, an existential quantification is a type of quantifier, a logical constant which is interpreted as "there exists", "there is at least one", or "for some". It is usually denoted by the logical operator symbol ∃, which, wh ...
a finite n such that for \alpha = \psi_\Omega(\Omega_n(\mathbb_N + 1)), L_\alpha \models \theta. It can also be proven that \mathsf proves that each initial segment \; n = 1, 2, \ldots is
well-founded In mathematics, a binary relation ''R'' is called well-founded (or wellfounded) on a class ''X'' if every non-empty subset ''S'' ⊆ ''X'' has a minimal element with respect to ''R'', that is, an element ''m'' not related by ''s& ...
, and therefore, the
proof-theoretic ordinal In proof theory, ordinal analysis assigns ordinals (often large countable ordinals) to mathematical theories as a measure of their strength. If theories have the same proof-theoretic ordinal they are often equiconsistent, and if one theory has ...
of \psi_\Omega(\varepsilon_) is the proof-theoretic ordinal of \mathsf. Using this, \psi_\Omega(\varepsilon_) = \min(\). One can then make the following conversions: * \psi_\Omega(A) = \psi_0(\Omega) = , \mathsf, = \varphi(1, 0), where A is the least admissible ordinal, \mathsf is
Peano arithmetic In mathematical logic, the Peano axioms, also known as the Dedekind–Peano axioms or the Peano postulates, are axioms for the natural numbers presented by the 19th century Italian mathematician Giuseppe Peano. These axioms have been used nearly u ...
and \varphi is the
Veblen hierarchy In mathematics, the Veblen functions are a hierarchy of normal functions (continuous function (set theory), continuous strictly increasing function, strictly increasing function (mathematics), functions from ordinal number, ordinals to ordinals), i ...
. * \psi_\Omega(\varepsilon_) = \psi_0(\varepsilon_) = , \mathsf, = \mathsf, where A is the least admissible ordinal, \mathsf is
Kripke–Platek set theory The Kripke–Platek set theory (KP), pronounced , is an axiomatic set theory developed by Saul Kripke and Richard Platek. The theory can be thought of as roughly the predicative part of ZFC and is considerably weaker than it. Axioms In its fo ...
and \mathsf is the
Bachmann–Howard ordinal In mathematics, the Bachmann–Howard ordinal (also known as the Howard ordinal, or Howard-Bachmann ordinal) is a large countable ordinal. It is the ordinal analysis, proof-theoretic ordinal of several mathematical theory (logic), theories, such as ...
. * \psi_\Omega(I) = \psi_0(\Omega_\omega) = , \mathsf, = \mathsf, where I is the least recursively inaccessible ordinal and \mathsf is Buchholz's ordinal. * \psi_\Omega(\varepsilon_) = \psi_0(\varepsilon_) = , \mathsf, = \mathsf, where I is the least recursively inaccessible ordinal, \mathsf is Kripke–Platek set theory with a recursively inaccessible universe and \mathsf is the
Takeuti–Feferman–Buchholz ordinal In the mathematical fields of set theory and proof theory, the Takeuti–Feferman–Buchholz ordinal (TFBO) is a large countable ordinal, which acts as the limit of the range of Buchholz's psi function and Feferman's theta function. It was named by ...
.


Bachmann's ''ψ''

The first true OCF, Bachmann's \psi was invented by Heinz Bachmann, somewhat cumbersome as it depends on fundamental sequences for all limit ordinals; and the original definition is complicated. Michael Rathjen has suggested a "recast" of the system, which goes like so: * Let \Omega represent an uncountable ordinal such as \omega_1; * Then define C^(\alpha, \beta) as the closure of \beta \cup \ under addition, (\xi \rightarrow \omega^\xi) and (\xi \rightarrow \psi_\Omega(\xi)) for \xi < \alpha. * \psi_\Omega(\alpha) is the smallest countable ordinal ρ such that C^\Omega(\alpha, \rho) \cap \Omega= \rho \psi_\Omega(\varepsilon_) is the Bachmann–Howard ordinal, the proof-theoretic ordinal of Kripke–Platek set theory with the axiom of infinity (KP).


Buchholz's ''ψ''

Buchholz's \psi is a hierarchy of single-argument functions \psi_\nu: \mathsf \rightarrow \mathsf, with \psi_\nu(\alpha) occasionally abbreviated as \psi_\nu\alpha. This function is likely the most well known out of all OCFs. The definition is so: * Define \Omega_0 = 1 and \Omega_\nu = \aleph_\nu for \nu > 0. * Let P(\alpha) be the set of distinct terms in the Cantor normal form of \alpha (with each term of the form \omega^\xi for \xi \in \mathsf, see
Cantor normal form theorem In the mathematical field of set theory, ordinal arithmetic describes the three usual operations on ordinal numbers: addition, multiplication, and exponentiation. Each can be defined in essentially two different ways: either by constructing an expl ...
) * C^0_\nu(\alpha) = \Omega_\nu * C^_\nu(\alpha) = C^_\nu(\alpha) \cup \ * C_\nu(\alpha) = \bigcup\limits_ C^n_\nu(\alpha) * \psi_\nu(\alpha) = \min(\) The limit of this system is \psi_0(\varepsilon_), the
Takeuti–Feferman–Buchholz ordinal In the mathematical fields of set theory and proof theory, the Takeuti–Feferman–Buchholz ordinal (TFBO) is a large countable ordinal, which acts as the limit of the range of Buchholz's psi function and Feferman's theta function. It was named by ...
.


Extended Buchholz's ''ψ''

This OCF is a sophisticated extension of Buchholz's \psi by mathematician Denis Maksudov. The limit of this system is much greater, equal to \psi_0(\Omega_) where \Omega_ denotes the first omega fixed point, sometimes referred to as Extended Buchholz's ordinal. The function is defined as follows: * Define \Omega_0 = 1 and \Omega_\nu = \aleph_\nu for \nu > 0. * C^0_\nu(\alpha) = \ * C^_\nu(\alpha) = \ * C_\nu(\alpha) = \bigcup\limits_ C^n_\nu(\alpha) * \psi_\nu(\alpha) = \min(\)


Madore's ''ψ''

This OCF was the same as the ''ψ'' function previously used throughout this article; it is a simpler, more efficient version of Buchholz's ''ψ'' function defined by David Madore. Its use in this article lead to widespread use of the function. * C_0(\alpha) = \ * C_(\alpha) = \ * C(\alpha) = \bigcup\limits_ C_n(\alpha) * \psi(\alpha) = \min(\) This function was used by Chris Bird, who also invented the next OCF.


Bird's θ

Chris Bird devised the following shorthand for the extended Veblen function \varphi: * \theta(\Omega^a_ + \cdots + \Omega^2a_2 + \Omega a_1 + a_0, b) = \varphi(a_, \ldots , a_2, a_1, a_0, b) * \theta(\alpha, 0) is abbreviated \theta(\alpha) This function is only defined for arguments less than \Omega^\omega, and its outputs are limited by the small Veblen ordinal.


Jäger's ''ψ''

Jäger's ''ψ'' is a hierarchy of single-argument ordinal functions ''ψ''''κ'' indexed by uncountable regular cardinals ''κ'' smaller than the least weakly Mahlo cardinal ''M''0 introduced by German mathematician Gerhard Jäger in 1984. It was developed on the base of Buchholz's approach. * If \kappa = I_\alpha(0) for some ''α'' < ''κ'', \kappa^ = 0. * If \kappa = I_\alpha(\beta + 1) for some ''α'', ''β'' < ''κ'', \kappa^ = I_\alpha(\beta). * C^0_\kappa(\alpha) = \ \cup \kappa^ * For every finite ''n'', C^_\kappa(\alpha) \subset M_0 is the smallest set satisfying the following: ** The sum of any finitely many ordinals in C^n_\kappa(\alpha) \subset M_0 belongs to C^_\kappa(\alpha) \subset M_0. ** For any \beta, \gamma \in C^_\kappa(\alpha), \varphi_\beta(\gamma) \in C^_\kappa(\alpha). ** For any \beta, \gamma \in C^_\kappa(\alpha), I_\beta(\gamma) \in C^_\kappa(\alpha). ** For any ordinal γ and uncountable regular cardinal \pi \in C^n_\kappa(\alpha), \gamma < \pi < \kappa \Rightarrow \gamma \in C^_\kappa(\alpha). ** For any \gamma \in \alpha \cap C^n_\kappa(\alpha) and uncountable regular cardinal \pi \in C^n_\kappa(\alpha), \gamma \in C_\pi(\gamma) \Rightarrow \psi_\pi(\gamma) \in C^_\kappa(\alpha). * C_\kappa(\alpha) = \bigcup\limits_ C^n_\kappa(\alpha) * \psi_\kappa(\alpha) = \min(\)


Simplified Jäger's ''ψ''

This is a sophisticated simplification of Jäger's ''ψ'' created by Denis Maksudov. An ordinal is ''α''-weakly inaccessible if it is uncountable, regular and it is a limit of ''γ''-weakly inaccessible cardinals for ''γ'' < ''α''. Let ''I''(''α'', 0) be the first α-weakly inaccessible cardinal, ''I''(''α'', ''β'' + 1) be the first ''α''-weakly inaccessible cardinal after ''I''(''α'', ''β'') and ''I''(''α'', ''β'') = sup(\) for limit ''β''. Restrict ''ρ'' and to uncountable regular ordinals of the form ''I''(''α'', 0) or ''I''(''α'', ''β'' + 1). Then, * C_0(\alpha, \beta) = \beta \cup \ * C_(\alpha, \beta) = \ \cup \ \cup \ * C(\alpha, \beta) = \bigcup\limits_ C_n(\alpha, \beta) * \psi_\pi(\alpha) = \min(\) *


Rathjen's ''Ψ''

Rathjen's ''Ψ'' function is based on the least weakly compact cardinal to create large countable ordinals. For a weakly compact cardinal K, the functions M^\alpha, C(\alpha, \pi), \Xi(\alpha), and \Psi^\xi_\pi(\alpha) are defined in mutual recursion in the following way: * M0 = K \cap \mathsf, where Lim denotes the class of limit ordinals. * For α > 0, Mα is the set \ * C(\alpha, \beta) is the closure of \beta \cup \ under addition, (\xi, \eta) \rightarrow \varphi(\xi, \eta) , \xi \rightarrow \Omega_\xi given ξ < K, \xi \rightarrow \Xi(\xi) given ξ < α, and (\xi, \pi, \delta) \rightarrow \Psi^\xi_\pi(\delta) given \xi \leq \delta < \alpha . * \Xi(\alpha) = \min(M^\alpha \cup \) . * For \xi \leq \alpha , \Psi^\xi_\pi(\alpha) = \min(\ \cup \) .


Collapsing large cardinals

As noted in the introduction, the use and definition of ordinal collapsing functions is strongly connected with the theory of
ordinal analysis In proof theory, ordinal analysis assigns ordinals (often large countable ordinals) to mathematical theories as a measure of their strength. If theories have the same proof-theoretic ordinal they are often equiconsistent, and if one theory has ...
, so the collapse of this or that large cardinal must be mentioned simultaneously with the theory for which it provides a proof-theoretic analysis. * Gerhard Jäger and Wolfram Pohlers described the collapse of an
inaccessible cardinal In set theory, an uncountable cardinal is inaccessible if it cannot be obtained from smaller cardinals by the usual operations of cardinal arithmetic. More precisely, a cardinal is strongly inaccessible if it is uncountable, it is not a sum of ...
to describe the ordinal-theoretic strength of Kripke–Platek set theory augmented by the recursive inaccessibility of the class of ordinals (KPi), which is also proof-theoretically equivalent to \Delta^1_2-comprehension plus
bar induction Bar induction is a reasoning principle used in intuitionistic mathematics, introduced by L. E. J. Brouwer. Bar induction's main use is the intuitionistic derivation of the fan theorem, a key result used in the derivation of the uniform continuity ...
. Roughly speaking, this collapse can be obtained by adding the \alpha \mapsto \Omega_\alpha function itself to the list of constructions to which the C(\cdot) collapsing system applies. * Michael Rathjen then described the collapse of a
Mahlo cardinal In mathematics, a Mahlo cardinal is a certain kind of large cardinal number. Mahlo cardinals were first described by . As with all large cardinals, none of these varieties of Mahlo cardinals can be proven to exist by ZFC (assuming ZFC is consist ...
to describe the ordinal-theoretic strength of Kripke–Platek set theory augmented by the recursive Mahloness of the class of ordinals (KPM). * Rathjen later described the collapse of a
weakly compact cardinal In mathematics, a weakly compact cardinal is a certain kind of cardinal number introduced by ; weakly compact cardinals are large cardinals, meaning that their existence cannot be proven from the ZFC, standard axioms of set theory. (Tarski original ...
to describe the ordinal-theoretic strength of Kripke–Platek set theory augmented by certain
reflection principle In set theory, a branch of mathematics, a reflection principle says that it is possible to find sets that resemble the class of all sets. There are several different forms of the reflection principle depending on exactly what is meant by "resemble ...
s (concentrating on the case of \Pi_3-reflection). Very roughly speaking, this proceeds by introducing the first cardinal \Xi(\alpha) which is \alpha-hyper-Mahlo and adding the \alpha \mapsto \Xi(\alpha) function itself to the collapsing system. * In a 2015 paper, Toshyasu Arai has created ordinal collapsing functions \psi^_\pi for a vector of ordinals \xi, which collapse \Pi_n^1- indescribable cardinals for n>0. These are used to carry out
ordinal analysis In proof theory, ordinal analysis assigns ordinals (often large countable ordinals) to mathematical theories as a measure of their strength. If theories have the same proof-theoretic ordinal they are often equiconsistent, and if one theory has ...
of Kripke–Platek set theory augmented by \Pi_-reflection principles. * Rathjen has begunRathjen, 2005 (Arch. Math. Logic) the investigation of the collapse of yet larger cardinals, with the ultimate goal of achieving an ordinal analysis of \Pi^1_2-comprehension (which is proof-theoretically equivalent to the augmentation of Kripke–Platek by \Sigma_1-separation).


Notes


References

* * * * * * * * * * (slides of a talk given at Fischbachau) {{countable ordinals Ordinal numbers