Constructive set theory

Axiomatic constructive set theory is an approach to mathematical constructivism following the program of axiomatic set theory. The same first-order language with "" and "" of classical set theory is usually used, so this is not to be confused with a constructive types approach. On the other hand, some constructive theories are indeed motivated by their interpretability in type theories.

In addition to rejecting the principle of excluded middle (), constructive set theories often require some logical quantifiers in their axioms to be set bounded, motivated by results tied to impredicativity.

Introduction[edit]

Constructive outlook[edit]

Preliminary on the use of intuitionistic logic[edit]

The logic of the set theories discussed here is constructive in that it rejects the principle of excluded middle , i.e. that the disjunction automatically holds for all propositions . As a rule, to prove the excluded middle for a proposition , i.e. to prove the particular disjunction , either or needs to be explicitly proven. When either such proof is established, one says the proposition is decidable, and this then logically implies the disjunction holds. Similarly and more commonly, a predicate for in a domain is said to be decidable when the more intricate statement is provable. Non-constructive axioms may enable proofs that formally claim decidability of such (and/or ) in the sense that they prove excluded middle for (resp. the statement using the quantifier above) without demonstrating the truth of either side of the disjunction(s). This is often the case in classical logic. In contrast, axiomatic theories deemed constructive tend to not permit many classical proofs of statements involving properties that are provenly computationally undecidable.

The law of noncontradiction is a special case of the propositional form of modus ponens. Using the former with any negated statement , one valid De Morgan's law thus implies already in the more conservative minimal logic. In words, intuitionistic logic still posits: It is impossible to rule out a proposition and rule out its negation both at once, and thus the rejection of any instantiated excluded middle statement for an individual proposition is inconsistent. Here the double-negation captures that the disjunction statement now provenly can never be ruled out or rejected, even in cases where the disjunction may not be provable (for example, by demonstrating one of the disjuncts, thus deciding ) from the assumed axioms.

More generally, constructive mathematical theories tend to prove classically equivalent reformulations of classical theorems. For example, in constructive analysis, one cannot prove the intermediate value theorem in its textbook formulation, but one can prove theorems with algorithmic content that, as soon as double negation elimination and its consequences are assumed legal, are at once classically equivalent to the classical statement. The difference is that the constructive proofs are harder to find.

The intuitionistic logic underlying the set theories discussed here, unlike minimal logic, still permits double negation elimination for individual propositions for which excluded middle holds. In turn the theorem formulations regarding finite objects tends to not differ from their classical counterparts. Given a model of all natural numbers, the equivalent for predicates, namely Markov's principle, does not automatically hold, but may be considered as an additional principle.

In an inhabited domain and using explosion, the disjunction implies the existence claim , which in turn implies . Classically, these implications are always reversible. If one of the former is classically valid, it can be worth trying to establish it in the latter form. For the special case where is rejected, one deals with a counter-example existence claim , which is generally constructively stronger than a rejection claim : Exemplifying a such that is contradictory of course means it is not the case that holds for all possible . But one may also demonstrate that holding for all would logically lead to a contradiction without the aid of a specific counter-example, and even while not being able to construct one. In the latter case, constructively, here one does not stipulate an existence claim.

Imposed restrictions on a set theory[edit]

Compared to the classical counterpart, one is generally less likely to prove the existence of relations that cannot be realized. A restriction to the constructive reading of existence apriori leads to stricter requirements regarding which characterizations of a set involving unbounded collections constitute a (mathematical, and so always meaning total) function. This is often because the predicate in a case-wise would-be definition may not be decidable. Adopting the standard definition of set equality via extensionality, the full Axiom of Choice is such a non-constructive principle that implies for the formulas permitted in one's adopted Separation schema, by Diaconescu's theorem. Similar results hold for the Axiom of Regularity existence claim, as shown below. The latter has a classically equivalent inductive substitute. So a genuinely intuitionistic development of set theory requires the rewording of some standard axioms to classically equivalent ones. Apart from demands for computability and reservations regrading of impredicativity, technical question regarding which non-logical axioms effectively extend the underlying logic of a theory is also a research subject in its own right.

Metalogic[edit]

With computably undecidable propositions already arising in Robinson arithmetic, even just Predicative separation lets one define elusive subsets easily. In stark contrast to the classical framework, constructive set theories may be closed under the rule that any property that is decidable for all sets is already equivalent to one of the two trivial ones, or . Also the real line may be taken to be indecomposable in this sense. Undecidability of disjunctions also affects the claims about total orders such as that of all ordinal numbers, expressed by the provability and rejection of the clauses in the order defining disjunction . This determines whether the relation is trichotomous. A weakened theory of ordinals in turn affects the proof theoretic strength defined in ordinal analysis.

In exchange, constructive set theories can exhibit attractive disjunction and existence properties, as is familiar from the study of constructive arithmetic theories. These are features of a fixed theory which metalogically relate judgements of propositions provable in the theory. Particularly well-studied are those such features that can be expressed in Heyting arithmetic, with quantifiers over numbers and which can often be realized by numbers, as formalized in proof theory. In particular, those are the numerical existence property and the closely related disjunctive property, as well as being closed under Church's rule, witnessing any given function to be computable.[1]

A set theory does not only express theorems about numbers, and so one may consider a more general so-called strong existence property that is harder to come by, as will be discussed. A theory has this property if the following can be established: For any property , if the theory proves that a set exist that has that property, i.e. if the theory claims the existence statement, then there is also a property that uniquely describes such a set instance. More formally, for any predicate there is a predicate so that

The role analogous to that of realized numbers in arithmetic is played here by defined sets proven to exist by (or according to) the theory. Questions concerning the axiomatic set theory's strength and its relation to term construction are subtle. While many theories discussed tend have all the various numerical properties, the existence property can easily be spoiled, as will be discussed. Weaker forms of existence properties have been formulated.

Some theories with a classical reading of existence can in fact also be constrained so as to exhibit the strong existence property. In Zermelo–Fraenkel set theory with sets all taken to be ordinal-definable, a theory denoted , no sets without such definability exist. The property is also enforced via the constructible universe postulate in . For contrast, consider the theory given by plus the full axiom of choice existence postulate: Recall that this collection of axioms proves the well-ordering theorem, implying well-orderings exists for any set. In particular, this means that relations formally exist that establish the well-ordering of (i.e. the theory claims the existence of a least element for all subsets of with respect to those relations). This is despite the fact that definability of such an ordering is known to be independent of . The latter implies that for no particular formula in the language of the theory does the theory prove that the corresponding set is a well-ordering relation of the reals. So formally proves the existence of a subset with the property of being a well-ordering relation, but at the same time no particular set for which the property could be validated can possibly be defined.

Anti-classical principles[edit]

As mentioned above, a constructive theory may exhibit the numerical existence property, , for some number and where denotes the corresponding numeral in the formal theory. Here one must carefully distinguish between provable implications between two propositions, , and a theory's properties of the form . When adopting a metalogically established schema of the latter type as an inference rule of ones proof calculus and nothing new can be proven, one says the theory is closed under that rule.

One may instead consider adjoining the rule corresponding to the meta-theoretical property as an implication (in the sense of "") to , as an axiom schema or in quantified form. A situation commonly studied is that of a fixed exhibiting the meta-theoretical property of the following type: For an instance from some collection of formulas of a particular form, here captured via and , one established the existence of a number so that . Here one may then postulate , where the bound is a number variable in language of the theory. For example, Church's rule is an admissible rule in first-order Heyting arithmetic and, furthermore, the corresponding Church's thesis principle may consistently be adopted as an axiom. The new theory with the principle added is anti-classical, in that it may not be consistent anymore to also adopt . Similarly, adjoining the excluded middle principle to some theory , the theory thus obtained may prove new, strictly classical statements, and this may spoil some of the meta-theoretical properties that were previously established for . In such a fashion, may not be adopted in , also known as Peano arithmetic .

The focus in this subsection shall be on set theories with quantification over a fully formal notion of an infinite sequences space, i.e. function space, as it will be introduced further below. A translation of Church's rule into the language of the theory itself may here read

Kleene's T predicate together with the result extraction expresses that any input number being mapped to the number is, through , witnessed to be a computable mapping. Here now denotes a set theory model of the standard natural numbers and is an index with respect to a fixed program enumeration. Stronger variants have been used, which extend this principle to functions defined on domains of low complexity. The principle rejects decidability for the predicate defined as , expressing that is the index of a computable function halting on its own index. Weaker, double negated forms of the principle may be considered too, which do not require the existence of a recursive implementation for every , but which still make principles inconsistent that claim the existence of functions which provenly have no recursive realization. Some forms of a Church's thesis as principle are even consistent with the classical, weak so called second-order arithmetic theory , a subsystem of the two-sorted first-order theory .

The collection of computable functions is classically subcountable, which classically is the same as being countable. But classical set theories will generally claim that holds also other functions than the computable ones. For example there is a proof in that total functions (in the set theory sense) do exist that cannot be captured by a Turing machine. Taking the computable world seriously as ontology, a prime example of an anti-classical conception related the Markovian school is the permitted subcountability of various uncountable collections. When adopting the subcountability of the collection of all unending sequences of natural numbers () as an axiom in a constructive theory, the "smallness" (in classical terms) of this collection, in some set theoretical realizations, is then already captured by the theory itself. A constructive theory may also adopt neither classical nor anti-classical axioms and so stay agnostic towards either possibility.

Constructive principles already prove for any . And so for any given element of , the corresponding excluded middle statement for the proposition cannot be negated. Indeed, for any given , by noncontradiction it is impossible to rule out and rule out its negation both at once, and the relevant De Morgan's rule applies as above. But a theory may in some instances also permit the rejection claim . Adopting this does not necessitate providing a particular witnessing the failure of excluded middle for the particular proposition , i.e. witnessing the inconsistent . Predicates on an infinite domain correspond to decision problems. Motivated by provenly computably undecidable problems, one may reject the possibility of decidability of a predicate without also making any existence claim in . As another example, such a situation is enforced in Brouwerian intuitionistic analysis, in a case where the quantifier ranges over infinitely many unending binary sequences and states that a sequence is everywhere zero. Concerning this property, of being conclusively identified as the sequence which is forever constant, adopting Brouwer's continuity principle strictly rules out that this could be proven decidable for all the sequences.

So in a constructive context with a so-called non-classical logic as used here, one may consistently adopt axioms which are both in contradiction to quantified forms of excluded middle, but also non-constructive in the computable sense or as gauged by meta-logical existence properties discussed previously. In that way, a constructive set theory can also provide the framework to study non-classical theories, say rings modeling smooth infinitesimal analysis.

History and overview[edit]

Historically, the subject of constructive set theory (often also "") begun with John Myhill's work on the theories also called and .[2][3] In 1973, he had proposed the former as a first-order set theory based on intuitionistic logic, taking the most common foundation and throwing out the Axiom of choice as well as the principle of the excluded middle, initially leaving everything else as is. However, different forms of some of the axioms which are equivalent in the classical setting are inequivalent in the constructive setting, and some forms imply , as will be demonstrated. In those cases, the intuitionistically weaker formulations were consequently adopted. The far more conservative system is also a first-order theory, but of several sorts and bounded quantification, aiming to provide a formal foundation for Errett Bishop's program of constructive mathematics.

The main discussion presents a sequence of theories in the same language as , leading up to Peter Aczel's well studied ,[4] and beyond. Many modern results trace back to Rathjen and his students. is also characterized by the two features present also in Myhill's theory: On the one hand, it is using the Predicative Separation instead of the full, unbounded Separation schema, see also Lévy hierarchy. Boundedness can be handled as a syntactic property or, alternatively, the theories can be conservatively extended with a higher boundedness predicate and its axioms. Secondly, the impredicative Powerset axiom is discarded, generally in favor of related but weaker axioms. The strong form is very casually used in classical general topology. Adding to a theory even weaker than recovers , as detailed below.[5] The system, which has come to be known as Intuitionistic Zermelo–Fraenkel set theory (), is a strong set theory without . It is similar to , but less conservative or predicative. The theory denoted is the constructive version of , the classical Kripke–Platek set theory without a form of Powerset and where even the Axiom of Collection is bounded.

Models[edit]

Many theories studied in constructive set theory are mere restrictions of Zermelo–Fraenkel set theory () with respect to their axiom as well as their underlying logic. Such theories can then also be interpreted in any model of .

Peano arithmetic is bi-interpretable with the theory given by minus Infinity but plus the existence of all transitive closures. (The latter is also implied after promoting Regularity to Set Induction schema, which is discussed below.) Likewise, constructive arithmetic can also be taken as an apology for most axioms adopted in : Heyting arithmetic is bi-interpretable with a weak constructive set theory,[6] as also described in the article on . One may arithmetically characterize a membership relation "" and with it prove - instead of the existence of a set of natural numbers - that all sets in its theory are in bijection with a (finite) von Neumann natural, a principle denoted . This context further validates Extensionality, Pairing, Union, Binary Intersection (which is related to the Axiom schema of predicative separation) and the Set Induction schema. Taken as axioms, the aforementioned principles constitute a set theory that is already identical with the theory given by minus the existence of but plus as axiom. All those axioms are discussed in detail below. Relatedly, also proves that the hereditarily finite sets fulfill all the previous axioms. This is a result which persists when passing on to and minus Infinity.

As far as constructive realizations go there is a relevant realizability theory. Relatedly, Aczel's theory constructive Zermelo-Fraenkel has been interpreted in a Martin-Löf type theories, as sketched in the section on . In this way, theorems provable in this and weaker set theories are candidates for a computer realization.

Presheaf models for constructive set theories have also been introduced. These are analogous to presheaf models for intuitionistic set theory developed by Dana Scott in the 1980s.[7][8] Realizability models of within the effective topos have been identified, which, say, at once validate full Separation, relativized dependent choice , independence of premise for sets, but also the subcountability of all sets, Markov's principle and Church's thesis in the formulation for all predicates.[9]

Notation[edit]

In an axiomatic set theory, sets are the entities exhibiting properties. But there is then a more intricate relation between the set concept and logic. For example, the property of being a natural number smaller than 100 may be reformulated as being a member of the set of numbers with that property. The set theory axioms govern set existence and thus govern which predicates can be materialized as entity in itself, in this sense. Specification is also directly governed by the axioms, as discussed below. For a practical consideration, consider the property of being a sequence of coin flip outcomes that overall show more heads than tails. This property may be used to separate out a corresponding subset of any set of finite sequences of coin flips. Relatedly, the measure theoretic formalization of a probabilistic event is explicitly based around sets and provides many more examples.

This section introduces the object language and auxiliary notions used to formalize this materialization.

Language[edit]

The propositional connective symbols used to form syntactic formulas are standard. The axioms of set theory give a means to prove equality "" of sets and that symbol may, by abuse of notation, be used for classes. Negation "" of equality is sometimes called the denial of equality, and is commonly written "". However, in a context with apartness relations, for example when dealing with sequences, the latter symbol is also sometimes used for something different.

The common treatment, as also adopted here, formally only extends the underlying logic by one primitive binary predicate of set theory, "". As with equality, negation of elementhood "" is often written "".

Variables[edit]

Below the Greek denotes a proposition or predicate variable in axiom schemas and or is used for particular such predicates. The word "predicate" is sometimes used interchangeably with "formulas" as well, even in the unary case.

Quantifiers only ever range over sets and those are denoted by lower case letters. As is common, one may use argument brackets to express predicates, for the sake of highlighting particular free variables in their syntactic expression, as in "". Unique existence here means .

Classes[edit]

As is also common, one makes use set builder notation for classes, which, in most contexts, are not part of the object language but used for concise discussion. In particular, one may introduce notation declarations of the corresponding class via "", for the purpose of expressing any as . Logically equivalent predicates can be used to introduce the same class. One also writes as shorthand for . For example, one may consider and this is also denoted .

One abbreviates by and by . The syntactic notion of bounded quantification in this sense can play a role in the formulation of axiom schemas, as seen below.

If there provenly exists a set inside a class, meaning , then one calls it inhabited. One may also use quantification in to express this as . The class is then provenly not the empty set, introduced below. While classically equivalent, constructively non-empty is a weaker notion with two negations and ought to be called not uninhabited. Unfortunately, the word for the more useful notion of 'inhabited' is rarely used in classical mathematics.

Two ways to express disjointness capture many of the intuitionistically valid negation rules: . Using the above notation, this is a purely logical equivalence and below the proposition will furthermore be expressible as .

Express the subclass claim , i.e. , by . The similar notion of subset-bounded quantifiers, as in , has been used in set theoretical investigation as well, but will not be further highlighted here. For a predicate , trivially . And so follows that .

Extensional equivalence[edit]

Denote by the statement expressing that two classes have exactly the same elements, i.e. , or equivalently . This is not to be conflated with the concept of equinumerosity also used below.

With standing for , the convenient notational relation between and , axioms of the form postulate that the class of all sets for which holds actually forms a set. Less formally, this may be expressed as . Likewise, the proposition conveys " when is among the theory's sets." For the case where is the trivially false predicate, the proposition is equivalent to the negation of the former existence claim, expressing the non-existence of as a set.

Further extensions of class comprehension notation as above are in common used in set theory, giving meaning to statements such as "", and so on.

Syntactically more general, a set may also be characterized using another 2-ary predicate trough , where the right hand side may depend on the actual variable , and possibly even on membership in itself.

Subtheories of ZF[edit]

Here a series of familiar axioms is presented, or the relevant slight reformulations thereof. It is emphasized how the absence of in the logic affects what is provable and it is highlighted which non-classical axioms are, in turn, consistent.

Equality[edit]

Using the notation introduced above, the following axiom gives a means to prove equality "" of two sets, so that through substitution, any predicate about translates to one of . By the logical properties of equality, the converse direction of the postulated implication holds automatically.

Extensionality

In a constructive interpretation, the elements of a subclass of may come equipped with more information than those of , in the sense that being able to judge is being able to judge . And (unless the whole disjunction follows from axioms) in the Brouwer–Heyting–Kolmogorov interpretation, this means to have proven or having rejected it. As may be not decidable for all elements in , the two classes must a priori be distinguished.

Consider a predicate that provenly holds for all elements of a set , so that , and assume that the class on the right hand side is established to be a set. Note that, even if this set on the right informally also ties to proof-relevant information about the validity of for all the elements, the Extensionality axiom postulates that, in our set theory, the set on the right hand side is judged equal to the one on the left hand side. This above analysis also shows that a statement of the form , which in informal class notation may be expressed as , is then equivalently expressed as . This means that establishing such -theorems, e.g. the ones provable from full mathematical induction, enables treating the subclass of on the right hand side of the equality to be used wherever is used.

Alternative approaches[edit]

While often adopted, this axiom has been criticized in constructive thought, as it effectively collapses differently defined properties, or at least the sets viewed as the extension of these properties, a Fregian notion.

Modern type theories may instead aim at defining the demanded equivalence "" in terms of functions, see e.g. type equivalence. The related concept of function extensionality is often not adopted in type theory.

Other frameworks for constructive mathematics might instead demand a particular rule for equality or apartness come for the elements of each and every set discussed. Even then, the above definition can be used to characterize equality of subsets and . Note that adopting "" as a symbol in a predicate logic theory makes equality of two terms a quantifier-free expression.

Merging sets[edit]

Define class notation for the pairing of a few given elements via disjunctions. E.g. is the quantifier-free statement , and likewise says , and so on.

Two other basic existence postulates given some other sets are as follows. Firstly,

Pairing

Given the definitions above, expands to , so this is making use of equality and a disjunction. The axiom says that for any two sets and , there is at least one set , which hold at least those two sets.

With bounded Separation below, also the class exists as a set. Denote by the standard ordered pair model , so that e.g. denotes another bounded formula in the formal language of the theory.

And then, using existential quantification and a conjunction,

Union

saying that for any set , there is at least one set , which holds all the members , of 's members . The minimal such set is the union.

The two axioms are commonly formulated stronger, in terms of "" instead of just "", although this is technically redundant in the context of : As the Separation axiom below is formulated with "", for statements the equivalence can be derived, given the theory allows for separation using . In cases where is an existential statement, like here in the union axiom, there is also another formulation using a universal quantifier.

Also using bounded Separation, the two axioms just stated together imply the existence of a binary union of two classes and , when they have been established to be sets, denoted by or . For a fixed set , to validate membership in the union of two given sets and , one needs to validate the part of the axiom, which can be done by validating the disjunction of the predicates defining the sets and , for . In terms of the associated sets, it is done by validating the disjunction .

The union and other set forming notations are also used for classes. For instance, the proposition is written . Let now . Given , the decidability of membership in , i.e. the potentially independent statement , can also be expressed as . But, as for any excluded middle statement, the double-negation of the latter holds: That union isn't not inhabited by . This goes to show that partitioning is also a more involved notion, constructively.

Set existence[edit]

The property that is false for any set corresponds to the empty class, which is denoted by or zero, . That the empty class is a set readily follows from other existence axioms, such as the Axiom of Infinity below. But if, e.g., one is explicitly interested in excluding infinite sets in one's study, one may at this point adopt the

Axiom of empty set:

Introduction of the symbol (as abbreviating notation for expressions in involving characterizing properties) is justified as uniqueness for this set can be proven. As is false for any , the axiom then reads .

Write for , which equals , i.e. . Likewise, write for , which equals , i.e. . A simple and provenly false proposition then is, for example, , corresponding to in the standard arithmetic model. Again, here symbols such as are treated as convenient notation and any proposition really translates to an expression using only "" and logical symbols, including quantifiers. Accompanied by a metamathematical analysis that the capabilities of the new theories are equivalent in an effective manner, formal extensions by symbols such as may also be considered.

More generally, for a set , define the successor set as . The interplay of the successor operation with the membership relation has a recursive clause, in the sense that . By reflexivity of equality, , and in particular is always inhabited.

BCST[edit]

The following makes use of axiom schemas, i.e. axioms for some collection of predicates. Some of the stated axiom schemas shall allow for any collection of set parameters as well (meaning any particular named variables ). That is, instantiations of the schema are permitted in which the predicate (some particular ) also depends on a number of further set variables and the statement of the axiom is understood with corresponding extra outer universal closures (as in ).

Separation[edit]

Basic constructive set theory consists of several axioms also part of standard set theory, except the Separation axiom is weakened. Beyond the four axioms above, it postulates Predicative Separation as well as the Replacement schema.

Axiom schema of predicative separation: For any bounded predicate , with parameters and with set variable not free in it,

This axiom amounts to postulating the existence of a set obtained by the intersection of any set and any predicatively described class . For any proven to be a set, when the predicate is taken as , one obtains the binary intersection of sets and writes . Intersection corresponds to conjunction in an analog way to how union corresponds to disjunction.

When the predicate is taken as the negation , one obtains the difference principle, granting existence of any set . Note that sets like or are always empty. So, as noted, from Separation and the existence of at least one set (e.g. Infinity below) will follow the existence of the empty set (also denoted ). Within this conservative context of , the Bounded Separation schema is actually equivalent to Empty Set plus the existence of the binary intersection for any two sets. The latter variant of axiomatization does not make use of a formula schema.

For a proposition , a recurring trope in the constructive analysis of set theory is to view the predicate as the subclass of the second ordinal . If it is provable that holds, or , or , then is inhabited, or empty (uninhabited), or non-empty (not uninhabited), respectively. Clearly, is equivalent to both the proposition , and also . Likewise, is equivalent to and also, equivalently, also . In the model of the naturals, if is a number, expresses that is smaller than . The union that is part of the successor operation definition above may be used to express the excluded middle statement as . In words, is decidable if and only if the successor of is larger than the smallest ordinal . The proposition is decided either way through establishing how is smaller: By already being smaller than , or by being 's direct predecessor. Another way to express excluded middle for is as the existence of a least number member of the inhabited class . If ones separation axiom allows for separation with , then is a subset, which may be called the truth value associated with . Two truth values can be proven equal, as sets, by proving an equivalence. In terms of this terminology, the collection of proof values can a priori be understood to be rich. Unsurprisingly, decidable propositions have one of a binary set of truth values. The excluded middle disjunction for that is then also implied by .

The axiom schema of Predicative Separation is also called Bounded Separation, as in Separation for set-bounded quantifiers only. The scope of specified subsets that can be proven to exist is enriched with further set existence postulate. Bounded Separation is a schema that takes into account syntactic aspects of set defining predicates, up to provable equivalence. The permitted formulas are denoted by in the set theoretical Lévy hierarchy, in analogy to in the arithmetical hierarchy. (Note however that the arithmetic classification is sometimes expressed not syntactically but in terms of subclasses of the naturals. Also, the bottom level of the arithmetical hierarchy has several common definitions, some not allowing the use of some total functions. The distinction is not relevant on the level or higher. Finally note that a classification of a formula may be expressed up to equivalence in the theory.)

The schema is also the way in which Mac Lane weakens a system close to Zermelo set theory , for mathematical foundations related to topos theory. See also Kripke-Platek set theory.

No universal set[edit]

By a remark in the section on merging sets, a set cannot consistently ruled out to be a member of a class of the form . A constructive proof that it is in that class contains information. Now if is a set, then the class on the right is not a set. The following demonstrates this in the special case when is empty, i.e. when the right side is the universal class.

The following holds for any relation . It gives a purely logical condition such that two terms and cannot be -related to one another.

Most important here is the rejection of the final disjunct . The expression is a bounded one and thus allowed in separation. Russel's construction in turn shows that . So for any set , Predicative Separation alone implies that there exists a set which is not a member of . In particular, no universal set can exist in this theory.

In a theory with the axiom of regularity, like , of course that subset can be proven to be equal to itself. As an aside, in a theory with stratification like Intuitionistic New Foundations, a universal set may exist because use of the syntactic expression may be disallowed in proofs of existence by, essentially, separation.

Already the special case implies that the subclass of the universal class is proper as well.

Predicativity[edit]

The restriction in the axiom is also gatekeeping impredicative definitions: Existence should at best not be claimed for objects that are not explicitly describable, or whose definition involves themselves or reference to a proper class, such as when a property to be checked involves a universal quantifier. So in a constructive theory without Axiom of power set, when denotes some 2-ary predicate, one should not generally expect a subclass of to be a set, in case that it is defined, for example, as in

,

or via a similar definitions involving any quantification over the sets . Note that if this subclass of is provenly a set, then this subset itself is also in the unbounded scope of set variable . In other words, as the subclass property is fulfilled, this exact set , defined using the expression , would play a role in its own characterization.

While predicative Separation leads to fewer given class definitions being sets, it must be emphasized that many class definitions that are classically equivalent are not so when restricting oneself to constructive logic. So in this way, one gets a broader theory, constructively. Due to the potential undecidability of general predicates, the notion of subset and subclass is more elaborate in constructive set theories than in classical ones. This remains true if full Separation is adopted, as in the theory , which however spoils the existence property as well as the standard type theoretical interpretations, and in this way spoils a bottom-up view of constructive sets. As an aside, as subtyping is not a necessary feature of constructive type theory, constructive set theory can be said to quite differ from that framework.

Replacement[edit]

Next consider the

Axiom schema of Replacement: For any predicate with set variable not free in it,

It is granting existence, as sets, of the range of function-like predicates, obtained via their domains. In the above formulation, the predicate is not restricted akin to the Separation schema, but this axiom already involves an existential quantifier in the antecedent. Of course, weaker schemas could be considered as well.

Via Replacement, the existence of any pair also follows from that of any other discrete pair, such as . But as the binary union used in already made use of the Pairing axiom, this approach then necessitates postulating the existence of over that of . In a theory with the impredicative Powerset axiom, the existence of can also be demonstrated using Separation.

With the Replacement schema, the theory outlined thus far proves that the equivalence classes or indexed sums are sets. In particular, the Cartesian product, holding all pairs of elements of two sets, is a set. In turn, for any fixed number (in the metatheory), the corresponding product expression, say , can be constructed as a set. The axiomatic requirements for sets recursively defined in the language are discussed further below. Equality of elements inside a set is decidable if the corresponding relation as a subset of is decidable, in which case is sometimes called discrete.

Replacement is relevant for function comprehension and can be seen as a form of comprehension more generally. Only when assuming does Replacement already imply full Separation. In , Replacement is mostly important to prove the existence of sets of high rank, namely via instances of the axiom schema where relates relatively small set to bigger ones, .

Constructive set theories commonly have Axiom schema of Replacement, sometimes restricted to bounded formulas. However, when other axioms are dropped, this schema is actually often strengthened - not beyond , but instead merely to gain back some provability strength. Such stronger axioms exist that do not spoil the strong existence properties of a theory, as discussed further below.

Hereditarily finite sets[edit]

Pendants of the elements of the class of hereditarily finite sets can be implemented in any common programming language. The axioms discussed above abstract from common operations on the set data type: Union, Separation and Replacement are related to flattening, filtering and comprehension. Replacement together with Set Induction (introduced below) suffices to axiomize constructively and that theory is also studied without Infinity.

A sort of blend between pairing and union, an axiom more readily related to the successor is the Axiom of adjunction.[10][11] Such principles are relevant for the standard modeling of individual Neumann ordinals. While postulating Replacement is not a necessity in the design of a weak constructive set theory that is bi-interpretable with Heyting arithmetic , some form of induction is. For comparison, consider the very weak classical theory called General set theory that interprets the class of natural numbers and their arithmetic via just Extensionality, Adjunction and full Separation.

The discussion now proceeds with axioms granting existence of objects which, in different but related form, are also found in dependent type theories, namely products and the collection of natural numbers as a completed set. Infinite sets are particularly handy to reason about operations applied to sequences defined on unbounded index domains, say the formal differentiation of a generating function or the addition of two Cauchy sequences.

ECST[edit]

For some fixed predicate and a set , the statement expresses that is the smallest (in the sense of "") among all sets for which holds true, and that it is always a subset of such . The aim of the axiom of infinity is to eventually obtain unique smallest inductive set.

In the context of common set theory axioms, one statement of infinitude is to state that a class is inhabited and also includes a chain of membership (or alternatively a chain of supersets). That is,

.

More concretely, denote by the inductive property,

.

In terms of a predicate underlying the class so that , the latter translates to .

Write for the general intersection . (A variant of this definition may be considered which requires , but we only use this notion for the following auxiliary definition.)

One commonly defines a class , the intersection of all inductive sets. (Variants of this treatment may work in terms of a formula that depends on a set parameter so that .) The class exactly holds all fulfilling the unbounded property . The intention is that if inductive sets exist at all, then the class shares each common natural number with them, and then the proposition , by definition of "", implies that holds for each of these naturals. While bounded separation does not suffice to prove to be the desired set, the language here forms the basis for the following axiom, granting natural number induction for predicates that constitute a set.

The elementary constructive Set Theory has the axiom of as well as the postulate

Strong Infinity

Going on, one takes the symbol to denote the now unique smallest inductive set, an unbounded von Neumann ordinal. It contains the empty set and, for each set in , another set in that contains one element more.

Symbols called zero and successor are in the signature of the theory of Peano. In , the above defined successor of any number also being in the class follow directly from the characterization of the natural naturals by our von Neumann model. Since the successor of such a set contains itself, one also finds that no successor equals zero. So two of the Peano axioms regarding the symbols zero and the one regarding closedness of come easily. Fourthly, in , where is a set, can be proven to be an injective operation.

For some predicate of sets , the statement claims it for all subsets of the set of naturals, and the axiom now proves such sets do exist. Such quantification is also possible in second-order arithmetic.

The pairwise order "" on the naturals is captured by their membership relation "". It is important to note that the theory proves the order as well as the equality relation on this set to be decidable. Not only is no number smaller than , but induction implies that among subsets of , it is exactly the empty set which has no least member. The contrapositive of this proves the double-negated least number existence for all non-empty subsets of . Another valid principle also classically equivalent to it is least number existence for all inhabited decidable subsets. That said, the bare existence claim for the inhabited subset of is equivalent to excluded middle for , and a constructive theory will therefore not prove to be well-ordered.

Weaker formulations of infinity[edit]

Should it need motivation, the handiness of postulating an unbounded set of numbers in relation to other inductive properties becomes clear in the discussion of arithmetic in set theory further below. But as is familiar from classical set theory, also weak forms of Infinity can be formulated. For example, one may just postulate the existence of some inductive set, - such an existence postulate suffices when full Separation may then be used to carve out the inductive subset of natural numbers, the shared subset of all inductive classes. Alternatively, more specific mere existence postulates may be adopted. Either which way, the inductive set then fulfills the following predecessor existence property in the sense of the von Neumann model:

Without making use of the notation for the previously defined successor notation, the extensional equality to a successor is captured by . This expresses that all elements are either equal to or themselves hold a predecessor set which shares all other members with .

Observe that through the expression "" on the right hand side, the property characterizing by its members here syntactically again contains the symbol itself. Due to the bottom-up nature of the natural numbers, this is tame here. Assuming -set induction on top of , no two different sets have this property. Also note that there are also longer formulations of this property, avoiding "" in favor unbounded quantifiers.

Number bounds[edit]

Adopting an Axiom of Infinity, the set-bounded quantification legal in predicates used in -Separation then explicitly permits numerically unbounded quantifiers - the two meanings of "bounded" should not be confused. With at hand, call a class of numbers bounded if the following existence statement holds

This is a statements of finiteness, also equivalently formulated via