Constructive set theory is an approach to
mathematical constructivism
In the philosophy of mathematics, constructivism asserts that it is necessary to find (or "construct") a specific example of a mathematical object in order to prove that an example exists. Contrastingly, in classical mathematics, one can prove th ...
following the program of
axiomatic set theory
Set theory is the branch of mathematical logic that studies Set (mathematics), sets, which can be informally described as collections of objects. Although objects of any kind can be collected into a set, set theory, as a branch of mathematics, ...
.
The same
first-order
In mathematics and other formal sciences, first-order or first order most often means either:
* "linear" (a polynomial of degree at most one), as in first-order approximation and other calculus uses, where it is contrasted with "polynomials of high ...
language with "
" and "
" of classical set theory is usually used, so this is not to be confused with a
constructive types approach.
On the other hand, some constructive theories are indeed motivated by their interpretability in
type theories.
In addition to rejecting the
principle of excluded middle (
), constructive set theories often require some logical quantifiers in their axioms to be
bounded
Boundedness or bounded may refer to:
Economics
* Bounded rationality, the idea that human rationality in decision-making is bounded by the available information, the cognitive limitations, and the time available to make the decision
* Bounded e ...
, motivated by results tied to
impredicativity
In mathematics, logic and philosophy of mathematics, something that is impredicative is a self-referencing definition. Roughly speaking, a definition is impredicative if it invokes (mentions or quantifies over) the set being defined, or (more com ...
.
Introduction
Constructive outlook
Use of intuitionistic logic
The logic of the set theories discussed here is
constructive
Although the general English usage of the adjective constructive is "helping to develop or improve something; helpful to someone, instead of upsetting and negative," as in the phrase "constructive criticism," in legal writing ''constructive'' has ...
in that it rejects
, i.e. that the
disjunction
In logic, disjunction is a logical connective typically notated as \lor and read aloud as "or". For instance, the English language sentence "it is raining or it is snowing" can be represented in logic using the disjunctive formula R \lor S ...
automatically holds for all propositions. As a rule, to prove the excluded middle for a proposition
, i.e. to prove the particular disjunction
, either
or
needs to be explicitly proven. When either such proof is established, one says the proposition is decidable, and this then logically implies the disjunction holds. Similarly, a predicate
on a domain
is said to be decidable when the more intricate statement
is provable. Non-constructive axioms may enable proofs that ''formally'' decide such
(and/or
) in the sense that they prove excluded middle for
(resp. the statement using the quantifier above) without demonstrating the truth of any of the disjuncts, as is often the case in classical logic. In contrast, constructive theories tend to not permit many classical proofs of properties that are provenly computationally
undecidable. Similarly, a counter-example existence claim
is generally constructively stronger than a rejection claim
.
The
intuitionistic logic
Intuitionistic logic, sometimes more generally called constructive logic, refers to systems of symbolic logic that differ from the systems used for classical logic by more closely mirroring the notion of constructive proof. In particular, systems ...
underlying the set theories discussed here, unlike the more conservative
minimal logic
Minimal logic, or minimal calculus, is a symbolic logic system originally developed by Ingebrigt Johansson. It is an intuitionistic and paraconsistent logic, that rejects both the law of the excluded middle as well as the principle of explosion (' ...
, still permits
double negation elimination
In propositional logic, double negation is the theorem that states that "If a statement is true, then it is not the case that the statement is not true." This is expressed by saying that a proposition ''A'' is logically equivalent to ''not (not ...
for individual propositions for which excluded middle holds, and in turn the theorem formulations regarding finite objects tends to not differ from their classical counterparts. Given a model of all numbers, the equivalent for predicates, namely
Markov's principle
Markov's principle, named after Andrey Markov Jr, is a conditional existence statement for which there are many equivalent formulations, as discussed below.
The principle is logically valid classically, but not in intuitionistic constructive m ...
, does not automatically hold, but may be considered as an additional principle.
Expressing the instance for
of the valid
law of noncontradiction
In logic, the law of non-contradiction (LNC) (also known as the law of contradiction, principle of non-contradiction (PNC), or the principle of contradiction) states that contradictory propositions cannot both be true in the same sense at the sa ...
and using a valid
De Morgan's law
In propositional logic and Boolean algebra, De Morgan's laws, also known as De Morgan's theorem, are a pair of transformation rules that are both valid rules of inference. They are named after Augustus De Morgan, a 19th-century British mathem ...
, already minimal logic does prove
for any given proposition
. So in words, intuitionistic logic still posits: It is impossible to rule out a proposition and rule out its negation both at once, and thus the rejection of any instantiated excluded middle statement for an individual proposition is inconsistent. More generally, constructive mathematical theories tend to prove
classically equivalent reformulations of classical theorems. For example, in
constructive analysis
In mathematics, constructive analysis is mathematical analysis done according to some principles of constructive mathematics.
This contrasts with ''classical analysis'', which (in this context) simply means analysis done according to the (more comm ...
, one cannot prove the
intermediate value theorem
In mathematical analysis, the intermediate value theorem states that if f is a continuous function whose domain contains the interval , then it takes on any given value between f(a) and f(b) at some point within the interval.
This has two import ...
in its textbook formulation, but one can prove theorems with algorithmic content that, as soon as double negation elimination and its consequences are assumed legal, are at once classically equivalent to the classical statement. The difference is that the constructive proofs are harder to find.
Imposed restrictions
A restriction to the constructive reading of existence apriori leads to stricter requirements regarding which characterizations of a set
involving unbounded collections constitute a (mathematical, and so always implying
total
Total may refer to:
Mathematics
* Total, the summation of a set of numbers
* Total order, a partial order without incomparable pairs
* Total relation, which may also mean
** connected relation (a binary relation in which any two elements are comp ...
) function. This is often because the predicate in a case-wise would-be definition may not be decidable. Compared to the classical counterpart, one is generally less likely to prove the existence of relations that cannot be realized. Adopting the standard definition of set equality via extentionality, the full
Axiom of Choice
In mathematics, the axiom of choice, or AC, is an axiom of set theory equivalent to the statement that ''a Cartesian product of a collection of non-empty sets is non-empty''. Informally put, the axiom of choice says that given any collectio ...
is such a non-constructive principle that implies
for the formulas permitted in one's adopted Separation schema, by
Diaconescu's theorem In mathematical logic, Diaconescu's theorem, or the Goodman–Myhill theorem, states that the full axiom of choice is sufficient to derive the law of the excluded middle, or restricted forms of it, in constructive set theory. It was discovered in 19 ...
. Similar results hold for the
Axiom of Regularity
In mathematics, the axiom of regularity (also known as the axiom of foundation) is an axiom of Zermelo–Fraenkel set theory that states that every non-empty set ''A'' contains an element that is disjoint from ''A''. In first-order logic, the axi ...
in its standard form, as shown below. So at the very least, the development of constructive set theory requires rejection of strong choice principles and the rewording of some standard axioms to classically equivalent ones. Undecidability of disjunctions also affects the claims about total orders such as that of all
ordinal numbers
In set theory, an ordinal number, or ordinal, is a generalization of ordinal numerals (first, second, th, etc.) aimed to extend enumeration to infinite sets.
A finite set can be enumerated by successively labeling each element with the least n ...
, expressed by the provability and rejection of the clauses in the order defining disjunction
. This determines whether the relation is
trichotomous. And this in turn affects the proof theoretic strength defined in
ordinal analysis
In proof theory, ordinal analysis assigns ordinals (often large countable ordinals) to mathematical theories as a measure of their strength.
If theories have the same proof-theoretic ordinal they are often equiconsistent, and if one theory has ...
.
Metalogic
As in the study of constructive arithmetic theories, constructive set theories can exhibit attractive
disjunction and existence properties
In mathematical logic, the disjunction and existence properties are the "hallmarks" of constructive theories such as Heyting arithmetic and constructive set theories (Rathjen 2005).
Disjunction property
The disjunction property is satisfi ...
. These are features of a fixed theory which relate metalogical judgements and propositions provable in the theory. Particularly well-studied are those such features that can be expressed in
Heyting arithmetic In mathematical logic, Heyting arithmetic is an axiomatization of arithmetic in accordance with the philosophy of intuitionism.Troelstra 1973:18 It is named after Arend Heyting, who first proposed it.
Axiomatization
As with first-order Peano a ...
, with quantifiers over numbers and which can often be
realized by numerals, as formalized in
proof theory
Proof theory is a major branchAccording to Wang (1981), pp. 3–4, proof theory is one of four domains mathematical logic, together with model theory, axiomatic set theory, and recursion theory. Jon Barwise, Barwise (1978) consists of four correspo ...
. In particular, those are the numerical existence property and the closely related disjunctive property, as well as being closed under
Church's rule, witnessing any given function to be
computable
Computability is the ability to solve a problem in an effective manner. It is a key topic of the field of computability theory within mathematical logic and the theory of computation within computer science. The computability of a problem is close ...
.
A set theory does not only express theorems about numbers. Furthermore, one may consider a more general strong existence property that is harder to come by, as will be discussed. The theory has the property if the following can be established: For any property
, if the theory proves that a set exist that has that property, i.e. if the theory claims the existence statement, then there is also a property
that
uniquely describes such a set instance. More formally, for any predicate
there is a predicate
so that
:
The analogous role of the realized numeral is played by defined sets proven to exist according to the theory, and so this is a subtle question concerning term construction and the theories strength. While many theories discussed tend have all the various numerical properties, the existence property can easily be spoiled, as will be discussed. Weaker forms of existence properties have been formulated.
Some classical theories can in fact also be constrained to exhibit the strong existence property.
Zermelo–Fraenkel set theory
In set theory, Zermelo–Fraenkel set theory, named after mathematicians Ernst Zermelo and Abraham Fraenkel, is an axiomatic system that was proposed in the early twentieth century in order to formulate a theory of sets free of paradoxes such as ...
with the
constructible universe
In mathematics, in set theory, the constructible universe (or Gödel's constructible universe), denoted by , is a particular class of sets that can be described entirely in terms of simpler sets. is the union of the constructible hierarchy . It w ...
postulate,
, or
with sets all taken to be
ordinal-definable,
, do have the existence property. For contrast, consider the theory
given by
plus the full
axiom of choice
In mathematics, the axiom of choice, or AC, is an axiom of set theory equivalent to the statement that ''a Cartesian product of a collection of non-empty sets is non-empty''. Informally put, the axiom of choice says that given any collectio ...
''existence postulate''. Recall that this set of axioms implies the
well-ordering theorem
In mathematics, the well-ordering theorem, also known as Zermelo's theorem, states that every set can be well-ordered. A set ''X'' is ''well-ordered'' by a strict total order if every non-empty subset of ''X'' has a least element under the orde ...
, which in particular means that relations for
that establish its well-ordering are formally proven to exist (and claim existence of a least element for all subsets of
with respect to those relations). This is despite that fact that definability of such an ordering is
known
Knowledge can be defined as awareness of facts or as practical skills, and may also refer to familiarity with objects or situations. Knowledge of facts, also called propositional knowledge, is often defined as true belief that is distinc ...
to be
independent
Independent or Independents may refer to:
Arts, entertainment, and media Artist groups
* Independents (artist group), a group of modernist painters based in the New Hope, Pennsylvania, area of the United States during the early 1930s
* Independ ...
of
. The latter implies that for no particular formula
in the language of the theory does the theory prove that the corresponding set is a well-ordering relation of the reals. So
formally ''proves the existence'' of a subset
with the property of being a well-ordering relation, but at the same time no particular set
for which the property could be validated can possibly be defined.
Anti-classical principles
A situation commonly studied is that of a fixed theory
exhibiting the following meta-theoretical property: For an instance from some collection of formulas, here captured via
and
, one established the existence of a numeral constant
so that
. When adopting such a schema as an inference rule and nothing new can be proven, one says the theory
is closed under that rule. Adjoining excluded middle
to
, the new theory may then prove new, strictly classical statements for
and this may spoil the meta-theoretical property that was previously established for
.
One may instead consider adjoining the rule corresponding to the meta-theoretical property as an implication to
, as a
schema
The word schema comes from the Greek word ('), which means ''shape'', or more generally, ''plan''. The plural is ('). In English, both ''schemas'' and ''schemata'' are used as plural forms.
Schema may refer to:
Science and technology
* SCHEMA ...
or in quantified form. That is to say, to postulate that any such
implies
such that
holds, where the bound
is a number variable in language of the theory. The new theory with the principle added might be anti-classical, in that it may not be consistent anymore to also adopt
.
For example, Church's rule is an
admissible rule
In logic, a rule of inference is admissible in a formal system if the set of theorems of the system does not change when that rule is added to the existing rules of the system. In other words, every formula that can be derived using that rule is ...
in
Heyting arithmetic In mathematical logic, Heyting arithmetic is an axiomatization of arithmetic in accordance with the philosophy of intuitionism.Troelstra 1973:18 It is named after Arend Heyting, who first proposed it.
Axiomatization
As with first-order Peano a ...
and the
corresponding Church's thesis principle may be adopted, but the same is not possible in
, also known as
Peano arithmetic
In mathematical logic, the Peano axioms, also known as the Dedekind–Peano axioms or the Peano postulates, are axioms for the natural numbers presented by the 19th century Italian mathematician Giuseppe Peano. These axioms have been used nearly u ...
.
Now for a context of set theories with quantification over a fully formal notion of an infinite sequences space, i.e. function spaces, as will be defined below. A translation of Church's ''rule'' into the language of the theory itself may then read
:
Kleene's T predicate In computability theory, the T predicate, first studied by mathematician Stephen Cole Kleene, is a particular set of triples of natural numbers that is used to represent computable functions within formal theories of arithmetic. Informally, the ''T ...
together with the result extraction expresses that any input number
being mapped to the number
is, through
, witnessed to be a computable mapping. Here
now denotes a set theory model of the standard natural numbers and
is an index with respect to a fixed program enumeration. Stronger variants have been used, which extend this principle to functions
defined on domains
of low complexity. The principle rejects decidability for the predicate
defined as
, expressing that
is the index of a computable function
halting on its own index. Weaker, double negated forms of the principle may be considered too, which do not require the existence of a recursive implementation for every
, but which still make principles inconsistent that claim the existence of functions which provenly have no recursive realization. Some forms of a Church's thesis as principle are even consistent with the
weak classical second order arithmetic .
The collection of total, computable functions is classically subcountable, which classically is the same as being countable. But classical set theories will generally claim that
holds also other functions than the computable ones. For example there is a proof in
that total functions do exist that cannot be captured by a
Turing machine
A Turing machine is a mathematical model of computation describing an abstract machine that manipulates symbols on a strip of tape according to a table of rules. Despite the model's simplicity, it is capable of implementing any computer algori ...
.
Taking the computable world seriously as ontology, a prime example of an anti-classical conception related the Markovian school is the permitted
subcountability of various uncountable collections. Adopting the subcountability of the collection of all unending sequences of natural numbers (
) as an axiom, the "smallness" (in classical terms) of this collection in some realizations of a set theory is then already captured by that theory.
A constructive theory may also adopt neither classical nor anti-classical axioms and so stay agnostic towards either possibility.
Constructive principles already prove
and so for any given element
of
, the corresponding excluded middle statement for the proposition cannot be negated. Indeed, for any given
, by noncontradiction it is impossible to rule out
and rule out its negation both at once. But a theory may in some instances also permit the rejection claim
. Adopting this does not necessitate providing a particular
witnessing the failure of excluded middle for the particular proposition
, i.e. witnessing the inconsistent
. One may reject the possibility of decidability of some predicate
on an infinite domain
without making any existence claim in
.
As another example, such a situation is enforced in
Brouwerian intuitionistic analysis, in a case where the quantifier ranges over infinitely many
unending binary sequences and
states that a sequence
is everywhere zero. Concerning this property, of being conclusively identified as the sequence which is forever constant, adopting Brouwer's continuity principle rules out that this could be proven decidable for all the sequences.
So in a constructive context with a so-called
non-classical logic Non-classical logics (and sometimes alternative logics) are formal systems that differ in a significant way from standard logical systems such as propositional and predicate logic. There are several ways in which this is done, including by way of ...
as used here, one may consistently adopt axioms which are both in contradiction to quantified forms of excluded middle, but also non-constructive in the computable sense or as gauged by meta-logical existence properties discussed previously. In that way, a constructive set theory can also provide the framework to study non-classical theories.
History and overview
Historically, the subject of constructive set theory (often also "
") begun with
John Myhill
John R. Myhill Sr. (11 August 1923 – 15 February 1987) was a British mathematician.
Education
Myhill received his Ph.D. from Harvard University under Willard Van Orman Quine in 1949. He was professor at SUNY Buffalo from 1966 until his death ...
's work on the theories also called
and
.
In 1973, he had proposed the former as a first-order set theory based on intuitionistic logic, taking the most common foundation
and throwing out the Axiom of choice as well as the principle of the excluded middle, initially leaving everything else as is. However, different forms of some of the
axioms which are equivalent in the classical setting are inequivalent in the constructive setting, and some forms imply
, as will be demonstrated. In those cases, the intuitionistically weaker formulations were consequently adopted. The far more conservative system
is also a first-order theory, but of several sorts and bounded quantification, aiming to provide a formal foundation for
Errett Bishop
Errett Albert Bishop (July 14, 1928 – April 14, 1983) was an Americans, American mathematician known for his work on analysis. He expanded constructive analysis in his 1967 ''Foundations of Constructive Analysis'', where he Mathematical proof, p ...
's program of constructive mathematics.
The main discussion presents sequence of theories in the same language as
, leading up to
Peter Aczel
Peter Henry George Aczel (; born 31 October 1941) is a British mathematician, logician and Emeritus joint Professor in the Department of Computer Science and the School of Mathematics at the University of Manchester. He is known for his work in n ...
's well studied ''
'', and beyond. Many modern results trace back to Rathjen and his students.
is also characterized by the two features present also in Myhill's theory:
On the one hand, it is using the
Predicative Separation instead of the full, unbounded Separation schema, see also
Lévy hierarchy In set theory and mathematical logic, the Lévy hierarchy, introduced by Azriel Lévy in 1965, is a hierarchy of formulas in the formal language of the Zermelo–Fraenkel set theory, which is typically called just the language of set theory. This i ...
.
Boundedness can be handled as a syntactic property or, alternatively, the theories can be conservatively extended with a higher boundedness predicate and its axioms. Secondly, the
impredicative
In mathematics, logic and philosophy of mathematics, something that is impredicative is a self-referencing definition. Roughly speaking, a definition is impredicative if it invokes (mentions or quantifies over) the set being defined, or (more com ...
Powerset axiom is discarded, generally in favor of related but weaker axioms. The strong form is very casually used in
classical general topology
In mathematics, general topology is the branch of topology that deals with the basic set-theoretic definitions and constructions used in topology. It is the foundation of most other branches of topology, including differential topology, geomet ...
.
Adding
to a theory even weaker than
recovers
, as detailed below.
The system, which has come to be known as Intuitionistic Zermelo–Fraenkel set theory (
), is a strong set theory without
. It is similar to
, but less conservative or
predicative.
The theory denoted
is the constructive version of
, the classical
Kripke–Platek set theory
The Kripke–Platek set theory (KP), pronounced , is an axiomatic set theory developed by Saul Kripke and Richard Platek.
The theory can be thought of as roughly the predicative part of ZFC and is considerably weaker than it.
Axioms
In its fo ...
without a form of Powerset and where even the Axiom of Collection is bounded.
Models
As far as constructive realizations go there is a relevant
realizability In mathematical logic, realizability is a collection of methods in proof theory used to study constructive proofs and extract additional information from them. Formulas from a formal theory are "realized" by objects, known as "realizers", in a way t ...
theory. Relatedly, Aczel's theory constructive Zermelo-Fraenkel
has been interpreted in a
Martin-Löf type theories, as described below. In this way, set theory theorems provable in
and weaker theories are candidates for a computer realization.
For a set theory context without infinite sets, constructive first-order arithmetic can also be taken as an apology for most axioms adopted in
: The arithmetic theory
is
bi-interpretable with a weak constructive set theory, as described in the article on
Heyting arithmetic In mathematical logic, Heyting arithmetic is an axiomatization of arithmetic in accordance with the philosophy of intuitionism.Troelstra 1973:18 It is named after Arend Heyting, who first proposed it.
Axiomatization
As with first-order Peano a ...
. One may arithmetically characterize a membership relation "
" and with it prove - instead of the existence of a set of natural numbers
- that all sets in its theory are in bijection with a (finite)
von Neumann natural, a principle denoted
. This context further validates Extensionality, Pairing, Union, Binary Intersection (which is related to the
Axiom schema of predicative separation
In axiomatic set theory, the axiom schema of predicative separation, or of restricted, or Δ0 separation, is a schema of axioms which is a restriction of the usual axiom schema of separation in Zermelo–Fraenkel set theory.
This name &Del ...
) and the
Set Induction schema. Taken as axioms, the aforementioned principles constitute a set theory that is already identical with the theory given by
minus the existence of
but plus
as axiom. All those axioms are discussed in detail below.
Relatedly,
also proves that the
hereditarily finite sets fulfill all the previous axioms. This is a result which persists when passing on to
and
minus Infinity.
Many theories studied in constructive set theory are mere restrictions of
Zermelo–Fraenkel set theory () with respect to their axiom as well as their underlying logic. Such theories can then also be interpreted in any model of
.
More recently,
presheaf
In mathematics, a sheaf is a tool for systematically tracking data (such as sets, abelian groups, rings) attached to the open sets of a topological space and defined locally with regard to them. For example, for each open set, the data could ...
models for constructive set theories have been introduced. These are analogous to presheaf models for intuitionistic set theory developed by
Dana Scott
Dana Stewart Scott (born October 11, 1932) is an American logician who is the emeritus Hillman University Professor of Computer Science, Philosophy, and Mathematical Logic at Carnegie Mellon University; he is now retired and lives in Berkeley, Ca ...
in the 1980s.
Subtheories of ZF
Notation
Language
The
propositional connective symbols used to form syntactic formulas are standard. The axioms of set theory give a means to prove equality "
" of sets and that symbol may, by
abuse of notation
In mathematics, abuse of notation occurs when an author uses a mathematical notation in a way that is not entirely formally correct, but which might help simplify the exposition or suggest the correct intuition (while possibly minimizing errors an ...
, be used for classes. Negation "
" of elementhood "
" is often written "
", and usually the same goes for non-equality "
", although in constructive mathematics the latter symbol is also used in the context with
apartness relations.
Variables
Below the Greek
denotes a predicate variable in
axiom schema In mathematical logic, an axiom schema (plural: axiom schemata or axiom schemas) generalizes the notion of axiom.
Formal definition
An axiom schema is a formula in the metalanguage of an axiomatic system, in which one or more schematic variables ap ...
s and
or
is used for particular predicates. The word "predicate" is often used interchangeably with "formulas" as well, even in the
unary case.
Quantifiers range over sets and those are denoted by lower case letters.
As is common, one may use argument brackets to express predicates, for the sake of highlighting particular free variables in their syntactic expression, as in "
".
One abbreviates
by
and
by
. The syntactic notion of bounded quantification in this sense can play a role in the formulation of axiom schemas, as seen below.
Express the subclass claim
, i.e.
, by
.
The similar notion of subset-bounded quantifiers, as in
, has been used in set theoretical investigation as well, but will not be further highlighted here.
Unique existence here means
.
Classes
As is also common in the study of
set theories, one makes use set builder notation for
classes, which, in most contexts, are not part of the object language but used for concise discussion. In particular, one may introduce notation declarations of the corresponding class via "
", for the purpose of expressing
as
. Logically equivalent predicates can be used to introduce the same class. One also writes
as shorthand for
.
For a property
, trivially
. And so follows that
.
Equality
Denote by
the statement expressing that two classes have exactly the same elements, i.e.
, or equivalently
. This is not to be conflated with the concept of
equinumerosity
In mathematics, two sets or classes ''A'' and ''B'' are equinumerous if there exists a one-to-one correspondence (or bijection) between them, that is, if there exists a function from ''A'' to ''B'' such that for every element ''y'' of ''B'', ther ...
.
The following axiom gives a means to prove equality "
" of two ''sets'', so that through substitution, any predicate about
translates to one of
.
By the logical properties of equality, the converse direction holds automatically.
In a constructive interpretation, the elements of a subclass
of
may come equipped with more information than those of
, in the sense that being able to judge
is being able to judge
. And (unless the whole disjunction follows from axioms) in the
Brouwer–Heyting–Kolmogorov interpretation In mathematical logic, the Brouwer–Heyting–Kolmogorov interpretation, or BHK interpretation, of intuitionistic logic was proposed by L. E. J. Brouwer and Arend Heyting, and independently by Andrey Kolmogorov. It is also sometimes called the real ...
, this means to have proven
or having rejected it.
As
may be not decidable for all elements in
, the two classes must a priori be distinguished.
Consider a property
that provenly holds for all elements of a set
, so that
, and assume that the class on the left hand side is established to be a set. Note that, even if this set on the left informally also ties to proof-relevant information about the validity of
for all the elements, the Extensionality axiom postulates that, in our set theory, the set on the left hand side is judged equal to the one on the right hand side. While often adopted, this axiom has been criticized in constructive thought, as it effectively collapses differently defined properties, or at least the sets viewed as the extension of these properties, a
Fregian notion.
Modern type theories may instead aim at defining the demanded equivalence "
" in terms of functions, see e.g.
type equivalence. The related concept of function
extensionality
In logic, extensionality, or extensional equality, refers to principles that judge objects to be equal if they have the same external properties. It stands in contrast to the concept of intensionality, which is concerned with whether the internal ...
is often not adopted in type theory.
Other frameworks for constructive mathematics might instead demand a particular rule for equality or
apartness come for the elements
of each and every set
discussed. Even then, the above definition can be used to characterize equality of
subsets
In mathematics, set ''A'' is a subset of a set ''B'' if all elements of ''A'' are also elements of ''B''; ''B'' is then a superset of ''A''. It is possible for ''A'' and ''B'' to be equal; if they are unequal, then ''A'' is a proper subset of ...
and
.
Note that adopting "
" as a symbol in a predicate logic theory makes equality of two terms a quantifier-free expression.
Merging sets
Two other basic axioms are as follows. Firstly,
saying that for any two sets
and
, there is at least one set
, which hold at least those two sets (
).
And then,
saying that for any set
, there is at least one set
, which holds all members
, of
's members
.
The two axioms may also be formulated stronger in terms of "
". In the context of
with Separation, this is not necessary.
Together, the two previous axioms imply the existence of the binary union of two classes
and
when they have been established to be sets, and this is denoted by
or
. For a fixed set
, to validate membership
in the union of two given sets
and
, one needs to validate the
part of the axiom, which can be done by validating the disjunction of the predicates defining the sets
and
, for
.
Define class notation for a few given elements via disjunctions, e.g.
says
.
Denote by
the standard
ordered pair
In mathematics, an ordered pair (''a'', ''b'') is a pair of objects. The order in which the objects appear in the pair is significant: the ordered pair (''a'', ''b'') is different from the ordered pair (''b'', ''a'') unless ''a'' = ''b''. (In con ...
model
.
Set existence
The property that is false for any set corresponds to the empty class, which is denoted by
or zero, 0. That the empty class is a set readily follows from other axioms, such as the Axiom of Infinity below. But if, e.g., one is explicitly interested in excluding infinite sets in one's study, one may at this point adopt the
This axiom would also readily be accepted, but is not relevant in the context of stronger axioms below. Introduction of the symbol
(as abbreviating notation for expressions in involving characterizing properties) is justified as uniqueness for this set can be proven.
If there provenly ''exists'' a set
inside a set
, then we call
''inhabited'' and it is then provenly not the empty set. While classically equivalent, constructively ''non-empty'' is a weaker notion with two negations. Unfortunately, the word for the more useful notion of 'inhabited' is rarely used in classical mathematics.
For a set
, define the
successor set as
, for which
.
A sort of blend between pairing and union, an axiom more readily related to the successor is the
Axiom of adjunction In mathematical set theory, the axiom of adjunction states that for any two sets ''x'', ''y'' there is a set ''w'' = ''x'' ∪ given by "adjoining" the set ''y'' to the set ''x''.
: \forall x \,\forall y \,\exists w \,\forall z ...
. It is relevant for the standard modeling of individual
Neumann ordinals.
A simple and provenly false proposition then is, for example,
, corresponding to
in the standard arithmetic model.
Again, here symbols such as
are treated as convenient notation and any proposition really translates to an expression using only "
" and logical symbols, including quantifiers. Accompanied by a metamathematical analysis that the capabilities of the new theories are equivalent in an effective manner,
formal extensions by symbols such as
may also be considered.
BCST
The following makes use of
axiom schema In mathematical logic, an axiom schema (plural: axiom schemata or axiom schemas) generalizes the notion of axiom.
Formal definition
An axiom schema is a formula in the metalanguage of an axiomatic system, in which one or more schematic variables ap ...
s, i.e. axioms for some collection of predicates. Note that some of the stated axiom schemas are often presented with set parameters
as well, i.e. variants with extra universal closures
such that the
's may depend on the parameters.
Separation
Basic constructive set theory
consists of several axioms also part of standard set theory, except the
Separation axiom is weakened.
Beyond the four axioms above, it the Predicative Separation as well as the Replacement schema.
This axiom amounts to postulating the existence of a set
obtained by the intersection of any set
and any predicatively described class
.
When the predicate is taken as
for
proven to be a set, one obtains the binary intersection of sets and writes
. Intersection corresponds to conjunction in an analog way to how union corresponds to disjunction.
As noted, from Separation and the existence of at least one set (e.g. Infinity below) and a predicate that is false of any set, like
, will follow the existence of the empty set.
Within this conservative context of
, the Bounded Separation schema is actually equivalent to Empty Set plus the existence of the binary intersection for any two sets. The latter variant of axiomatization does not make use of a formula schema.
The axiom schema is also called Bounded Separation, as in Separation for set-
bounded quantifiers
In the study of formal theories in mathematical logic, bounded quantifiers (a.k.a. restricted quantifiers) are often included in a formal language in addition to the standard quantifiers "∀" and "∃". Bounded quantifiers differ from "∀" and " ...
only. It is a schema that takes into account syntactic aspects of predicates. The scope of legal formulas is enriched by further set existence postulates. The bounded formulas are also denoted by
in the set theoretical Lévy hierarchy, in analogy to
in the
arithmetical hierarchy
In mathematical logic, the arithmetical hierarchy, arithmetic hierarchy or Kleene–Mostowski hierarchy (after mathematicians Stephen Cole Kleene and Andrzej Mostowski) classifies certain sets based on the complexity of formulas that define th ...
. (Note however that the arithmetic classification is sometimes expressed not syntactically but in terms of subclasses of the naturals. Also, the bottom level of the arithmetical hierarchy has several common definitions, some not allowing the use of some total functions. The distinction is not relevant on the level
or higher.) The schema is also the way in which
Mac Lane
Saunders Mac Lane (4 August 1909 – 14 April 2005) was an American mathematician who co-founded category theory with Samuel Eilenberg.
Early life and education
Mac Lane was born in Norwich, Connecticut, near where his family lived in Taftville ...
weakens a system close to
Zermelo set theory
Zermelo set theory (sometimes denoted by Z-), as set out in a seminal paper in 1908 by Ernst Zermelo, is the ancestor of modern Zermelo–Fraenkel set theory (ZF) and its extensions, such as von Neumann–Bernays–Gödel set theory (NBG). It be ...
, for mathematical foundations related to
topos theory
In mathematics, a topos (, ; plural topoi or , or toposes) is a category that behaves like the category of sheaves of sets on a topological space (or more generally: on a site). Topoi behave much like the category of sets and possess a notion ...
.
No universal set
The following holds for any relation
, giving a purely logical condition by which two terms
and
non-relatable
:
The expression
is a bounded one and thus allowed in separation. By virtue of the rejection of the final disjunct above,
,
Russel's construction shows that
. So for any set
, Predicative Separation alone implies that there exists a set which is not a member of
. In particular, no
universal set
In set theory, a universal set is a set which contains all objects, including itself. In set theory as usually formulated, it can be proven in multiple ways that a universal set does not exist. However, some non-standard variants of set theory inc ...
can exist in this theory.
In a theory with the
axiom of regularity
In mathematics, the axiom of regularity (also known as the axiom of foundation) is an axiom of Zermelo–Fraenkel set theory that states that every non-empty set ''A'' contains an element that is disjoint from ''A''. In first-order logic, the axi ...
, like
, of course that subset
can be proven to be equal to
itself. As an aside, in a theory with
stratification
Stratification may refer to:
Mathematics
* Stratification (mathematics), any consistent assignment of numbers to predicate symbols
* Data stratification in statistics
Earth sciences
* Stable and unstable stratification
* Stratification, or str ...
, a universal set may exist because use of the syntactic expression
may be disallowed in proofs of existence by, essentially, separation.
Predicativity
The restriction in the axiom is also gatekeeping
impredicative
In mathematics, logic and philosophy of mathematics, something that is impredicative is a self-referencing definition. Roughly speaking, a definition is impredicative if it invokes (mentions or quantifies over) the set being defined, or (more com ...
definitions: Existence should at best not be claimed for objects that are not explicitly describable, or whose definition involves themselves or reference to a proper class, such as when a property to be checked involves a universal quantifier. So in a constructive theory without
Axiom of power set
In mathematics, the axiom of power set is one of the Zermelo–Fraenkel axioms of axiomatic set theory.
In the formal language of the Zermelo–Fraenkel axioms, the axiom reads:
:\forall x \, \exists y \, \forall z \, \in y \iff \forall w \ ...
, when
denotes some 2-ary predicate, one should not generally expect a subclass
of
to be a set, in case that it is defined, for example, as in
:
,
or via a similar definitions involving any quantification over the sets
. Note that if this subclass
of
is provenly a set, then this subset itself is also in the unbounded scope of set variable
. In other words, as the subclass property
is fulfilled, this exact set
, defined using the expression
, would play a role in its own characterization.
While predicative Separation leads to fewer given class definitions being sets, it must be emphasized that many class definitions that are classically equivalent are not so when restricting oneself to constructive logic. So in this way, one gets a broader theory, constructively.
Due to the potential
undecidability of general predicates, the notion of subset and subclass is more elaborate in constructive set theories than in classical ones. This remains true if full Separation is adopted, as in the theory
, which however spoils the
existence property
In mathematical logic, the disjunction and existence properties are the "hallmarks" of constructive theories such as Heyting arithmetic and constructive set theories (Rathjen 2005).
Disjunction property
The disjunction property is satisfi ...
as well as the standard type theoretical interpretations, and in this way spoils a bottom-up view of constructive sets. As an aside, as
subtyping
In programming language theory, subtyping (also subtype polymorphism or inclusion polymorphism) is a form of type polymorphism in which a subtype is a datatype that is related to another datatype (the supertype) by some notion of substitutabilit ...
is not a necessary feature of
constructive type theory
Intuitionistic type theory (also known as constructive type theory, or Martin-Löf type theory) is a type theory and an alternative foundation of mathematics.
Intuitionistic type theory was created by Per Martin-Löf, a Swedish mathematician and ph ...
, constructive set theory can be said to quite differ from that framework.
Replacement
Next consider the
It is granting existence, as sets, of the range of function-like predicates, obtained via their domains. In the above formulation, the predicate is not restricted akin to the Separation schema, but this axiom already involves an existential quantifier in the antecedent. Of course, weaker schemas could be considered as well.
With the Replacement schema, this theory proves that the
equivalence classes
In mathematics, when the elements of some set S have a notion of equivalence (formalized as an equivalence relation), then one may naturally split the set S into equivalence classes. These equivalence classes are constructed so that elements ...
or
indexed sums are sets. In particular, the
Cartesian product
In mathematics, specifically set theory, the Cartesian product of two sets ''A'' and ''B'', denoted ''A''×''B'', is the set of all ordered pairs where ''a'' is in ''A'' and ''b'' is in ''B''. In terms of set-builder notation, that is
: A\ti ...
, holding all pairs of elements of two sets, is a set.
Equality of elements inside a set
is decidable if the corresponding relation as a subset of
is decidable.
Replacement is not necessary in the design of a weak constructive set theory that is
bi-interpretable with
Heyting arithmetic In mathematical logic, Heyting arithmetic is an axiomatization of arithmetic in accordance with the philosophy of intuitionism.Troelstra 1973:18 It is named after Arend Heyting, who first proposed it.
Axiomatization
As with first-order Peano a ...
. However, some form of induction is. Replacement together with
Set Induction (introduced below) also suffices to axiomize
hereditarily finite sets constructively and that theory is also studied without Infinity. For comparison, consider the very weak classical theory called
General set theory
General set theory (GST) is George Boolos's (1998) name for a fragment of the axiomatic set theory Z. GST is sufficient for all mathematics not requiring infinite sets, and is the weakest known set theory whose theorems include the Peano axioms.
...
that interprets the class of natural numbers and their arithmetic via just Extensionality, Adjunction and full Separation.
Replacement can be seen as a form of comprehension. Only when assuming
does Replacement already imply full Separation. In
, Replacement is mostly important to prove the existence of sets of high
rank
Rank is the relative position, value, worth, complexity, power, importance, authority, level, etc. of a person or object within a ranking, such as:
Level or position in a hierarchical organization
* Academic rank
* Diplomatic rank
* Hierarchy
* H ...
, namely via instances of the axiom schema where
relates relatively small set
to bigger ones,
.
Constructive set theories commonly have Axiom schema of Replacement, sometimes restricted to bounded formulas. However, when other axioms are dropped, this schema is actually often strengthened - not beyond
, but instead merely to gain back some provability strength. Such stronger axioms exist that do not spoil the strong
existence properties of a theory, as discussed further below.
The discussion now proceeds with axioms granting existence of objects also found in
dependent type theory
In computer science and logic, a dependent type is a type whose definition depends on a value. It is an overlapping feature of type theory and type systems. In intuitionistic type theory, dependent types are used to encode logic's quantifiers lik ...
, namely natural numbers and
products
Product may refer to:
Business
* Product (business), an item that serves as a solution to a specific consumer problem.
* Product (project management), a deliverable or set of deliverables that contribute to a business solution
Mathematics
* Produ ...
.
ECST
Denote by
the inductive property,
. Here
denotes a generic set variable.
In terms of a predicate
underlying the class so that
, this translates to
.
For some fixed predicate
, the statement
expresses that
is the smallest (in the sense of "
") set among all sets
for which
holds true.
The elementary constructive Set Theory
has the axiom of
as well as the postulate of the existence of a smallest inductive set
Write
for
, the general intersection.
Define a class
, the intersection of all inductive sets.
With the above axiom,
is a uniquely characterized set, the smallest infinite
von Neumann ordinal
In set theory, an ordinal number, or ordinal, is a generalization of ordinal numerals (first, second, th, etc.) aimed to extend enumeration to infinite sets.
A finite set can be enumerated by successively labeling each element with the least n ...
. Its elements include the empty set and, for each set in
, another set in
that contains one element more. Symbols called zero and successor are in the
signature
A signature (; from la, signare, "to sign") is a handwritten (and often stylized) depiction of someone's name, nickname, or even a simple "X" or other mark that a person writes on documents as a proof of identity and intent. The writer of a ...
of the theory of
Peano
Giuseppe Peano (; ; 27 August 1858 – 20 April 1932) was an Italian mathematician and glottologist. The author of over 200 books and papers, he was a founder of mathematical logic and set theory, to which he contributed much notation. The stand ...
. In
, the above defined successor of any number also being in the class
follow directly from the characterization of the natural naturals by our von Neumann model. Since the successor of such a set contains itself, one also finds that no successor equals zero. So two of the
Peano axioms
In mathematical logic, the Peano axioms, also known as the Dedekind–Peano axioms or the Peano postulates, are axioms for the natural numbers presented by the 19th century Italian mathematician Giuseppe Peano. These axioms have been used nearly ...
regarding the symbols zero and the one regarding closedness of
come easily. Fourthly, in
, where
is a set,
can be proven to be an injective operation.
The pairwise order "
" on the naturals is captured by their membership relation "
". It is important to note that the theory proves the order as well as the equality relation on this set to be decidable. The value of formulating the axiom using the inductive property is explained in the discussion on arithmetic.
Weak forms of axioms of infinity can be formulated, all postulating that some set containing elements with the common natural number properties exist. Then full Separation may be used to obtain the "sparse" such set, the set of natural numbers.
In the context of otherwise weaker axiom systems, an axiom of infinity should be strengthened so as to imply existence of such a sparse set on its own. One weaker form of Infinity is
:
,
where
captures the predecessor membership relation in the von Neumann model
:
.
This weaker axioms characterizes the infinite set by expressing that all elements
of it are either equal to
or themselves hold a predecessor set which shares all other members with
.
This form can also be written more concisely using the successor notation
.
Number bounds
For a class of numbers
, the statement
: