Implementation of mathematics in set theory

From Wikipedia, the free encyclopedia

This article examines the implementation of mathematical concepts in set theory. The implementation of a number of basic mathematical concepts is carried out in parallel in ZFC (the dominant set theory) and in NFU, the version of Quine's New Foundations shown to be consistent by R. B. Jensen in 1969 (here understood to include at least axioms of Infinity and Choice). For details of these two systems, consult their main articles.

What is said here applies also to two families of set theories: on the one hand, a range of theories including Zermelo set theory near the lower end of the scale and going up to ZFC extended with large cardinal hypotheses such as "there is a measurable cardinal"; and on the other hand a hierarchy of extensions of NFU which is surveyed in the New Foundations article. These correspond to different general views of what the set-theoretical universe is like, and it is the approaches to implementation of mathematical concepts under these two general views that are being compared and contrasted.

It is not the primary aim of this article to say anything about the relative merits of these theories as foundations for mathematics. The reason for the use of two different set theories is to illustrate that multiple approaches to the implementation of mathematics are feasible. Precisely because of this approach, this article is not a source of "official" definitions for any mathematical concept.

[edit] Preliminaries

Metaphorically, we are inclined to say that in the following sections we are carrying out certain constructions in the two theories ZFC and NFU and comparing the resulting implementations of certain usual mathematical structures (such as the natural numbers).

In a mathematical theory, what we do is prove theorems (and that is all we do). So when we say that a theory allows us to construct a certain object, what we are actually saying is that it is a theorem of that theory that that object exists. This is a statement about a definition of the form "the x such that $φ$ exists", where $φ$ is a formula of our language: we say that our theory proves the existence of "the x such that $φ$ " just in case it is a theorem that "there is one and only one x such that $φ$ ". (See Bertrand Russell's theory of descriptions.) We may loosely say that the theory "defines" or "constructs" this object in this case. If the statement is not a theorem, we say that the theory cannot show that the object exists; if the statement is provably false in the theory, we say that the theory proves that the object cannot exist, and may loosely say that we "cannot construct" or "cannot define" this object.

ZFC and NFU share the language of set theory, so the same formal definitions "the x such that $φ$ " can be contemplated in the two theories. A specific form of definition in the language of set theory is set-builder notation: $\{x \mid \phi\}$ means "the set A such that for all x, $x \in A \leftrightarrow \phi$ " (A cannot be free in $φ$ ). This notation admits certain conventional extensions: $\{x \in B \mid \phi\}$ is synonymous with $\{x \mid x \in B \wedge \phi\}$ ; $\{f(x_1,\ldots,x_n) \mid \phi\}$ is defined as $\{z \mid (\exists x_1,\ldots,x_n.z=f(x_1,...,x_n) \wedge \phi\}$ , where $f(x_1,\ldots,x_n)$ is an expression already defined.

Expressions definable in set-builder notation make sense in both ZFC and NFU: it may be that both theories prove that a given definition succeeds, or that neither do (the expression $\{x \mid x\not\in x\}$ fails to refer to anything in any set theory with classical logic; in class theories like NBG this notation does refer to a class, but it is defined differently), or that one does and the other doesn't. Further, an object defined in the same way in ZFC and NFU may turn out to have different properties in the two theories (or there may be a difference in what we can prove where there is no provable difference between their properties).

Further, we import concepts from other branches of mathematics (in intention, all branches of mathematics) into set theory. In some cases, we do not import the concepts into ZFC and NFU in the same way. For example, the usual definition of the first infinite ordinal $ω$ in ZFC is not suitable for NFU because the object (defined in purely set theoretical language as the set of all finite von Neumann ordinals) cannot be shown to exist in NFU. The usual definition of $ω$ in NFU is (in purely set theoretical language) as the set of all infinite well-orderings all of whose proper initial segments are finite, an object which can be shown not to exist in ZFC. In the case of such imported objects, we may in some cases give two different definitions, one for use in ZFC and related theories, and one for use in NFU and related theories. For such "implementations" of imported mathematical concepts to make sense, it is necessary to be able to show that the two parallel interpretations have the expected properties: for example, the implementations of the natural numbers in ZFC and NFU are different, but we can say that both are implementations of the same mathematical structure, because both include definitions for all the primitives of Peano arithmetic and satisfy the (translations of) the Peano axioms. It is then possible to compare what happens in the two theories as when only set theoretical language is in use, as long as the definitions appropriate to ZFC are understood to be used in the ZFC context and the definitions appropriate to NFU are understood to be used in the NFU context.

Whatever we prove to exist in a theory clearly provably exists in any extension of that theory; moreover, analysis of the proof that an object exists in a given theory may show that it exists in weaker versions of that theory (one may consider Zermelo set theory instead of ZFC for much of what is done in this article, for example). This is important to note because we are actually considering two hierarchies of related theories here.

[edit] Empty set, singleton, unordered pairs and tuples

These constructions appear first because they are the simplest constructions in set theory, not because they are the first constructions that come to mind in mathematics (though the notion of finite set is certainly fundamental!)

$\emptyset =_{\mathrm{def}} \{x \mid x \neq x\}$

The empty set is the unique set with no members. In NFU, there are also urelements with no members.

$\displaystyle\{x\} =_{\mathrm{def}} \{y \mid y = x\}$

For each object x, there is a set ${x}$ with x as its only element.

$\{x,y\} =_{\mathrm{def}} \{z \mid z=x \vee z=y\}$

For objects x and y, there is a set ${x, y}$ containing x and y as its only elements.

$x \cup y =_{\mathrm{def}} \{z \mid z \in x \vee z \in y\}$

The union of two sets is defined in the usual way.

$\{x_1,\ldots,x_n,x_{n+1}\} =_{\mathrm{def}} \{x_1,\ldots,x_n\} \cup \{x_{n+1}\}$

This is a recursive definition of unordered n-tuples for any concrete n (finite sets given as lists of their elements).

In NFU, all the set definitions given work by stratified comprehension; in ZFC, the existence of the unordered pair is given by the axiom of Pairing, the existence of the empty set follows by Separation from the existence of any set, and the boolean union of two sets exists by the axioms of Pairing and Union ( $x \cup y = \bigcup\{x,y\}$ ).

[edit] Ordered pair

The first substantial mathematical construction we consider is the ordered pair. The reason that this comes first is technical: we will need it to implement notions of relation and function which are needed for implementations of other concepts which may seem to be prior.

The first definition of the ordered pair was the definition $(x,y) =_{\mathrm{def}} \{\{\{x\},\emptyset\},\{\{y\}\}\}$ proposed by Norbert Wiener in 1914 in the context of the type theory of Principia Mathematica. Wiener observed that this allowed the elimination of types of n-ary relations for $n > 1$ from the system of that work.

It is more usual now to use the definition $(x, y) = def {{x},{x, y}}$ , due to Kuratowski.

Either of these definitions works in either ZFC or NFU. In NFU, the Kuratowski pair has a technical disadvantage: it is two types higher than its projections. It is common to postulate the existence of a type-level ordered pair (a pair $(x, y)$ which is the same type as its projections) in NFU. For the moment, we will use the Kuratowski pair in both systems, until we can give a formal justification for the introduction of the type-level pair.

The internal details of these definitions have nothing to do with their actual mathematical function. For any notion $(x, y)$ of ordered pair, the things that matter are that it satisfy the defining condition

$(x,y)=(z,w) \equiv x=z \wedge y=w$

and that it be reasonably easy to collect ordered pairs into sets.

[edit] Relations

Relations are sets whose members are all ordered pairs. Where possible, a relation R (understood as a binary predicate) is implemented as $\{(x,y) \mid x R y\}$ (which may be written as $\{z \mid \pi_1(z) R \pi_2(z)\}$ ). Where R is a set of ordered pairs, we read xRy as (x,y)∈R.

In ZFC, some relations (such as the general equality relation or subset relation on sets) are "too large" to be sets (but may be harmlessly reified as proper classes). In NFU, some relations (such as the membership relation) are not sets because their definitions are not stratified: in $\{(x,y) \mid x \in y\}$ x and y would need to have the same type (because they appear as projections of the same pair), but also successive types (because x is considered as an element of y).

[edit] Related definitions

For the time being, we do not consider relations with arity greater than 2: all our relations are binary. Let R and S be given binary relations. Then the following concepts are useful:

The converse of R is the relation $\{(y,x) \mid x R y\}$ .

The domain of R is the set $\{x \mid (\exists y.x R y)\}$ .

The range of R is the domain of the converse of R.

The field of R is the union of the domain and range of R.

The preimage of a member x of the field of R is the set $\{y \mid y Rx\}$ (used in the definition of "well-founded" below).

The downward closure of a member x of the field of R is the smallest set D containing x, and containing each zRy for each y∈D (i.e., including the preimage of each of its elements with respect to R as a subset).

The relative product of R and S, R|S, is the relation $\{(x,z) \mid (\exists y.xRy \wedge ySz)\}$ .

In ZFC, proving that these notions are all sets follows from Union, Separation, and Power Set. In NFU, it is easy to check that these definitions give rise to stratified formulas.

Notice that the range and codomain of a relation are not distinguished: this could be done by representing a relation R with codomain B as (R,B), but our development will not require this.

In ZFC, any relation whose domain is a subset of a set A and whose range is a subset of a set B will be a set, since the cartesian product $A \times B = \{(a,b) \mid a \in A \wedge b \in B\}$ is a set (being a subclass of $P(P(A \cup B))$ ), and Separation provides for the existence of $\{(x,y)\in A \times B \mid x R y\}$ . In NFU, some relations with global scope (such as equality and subset) can be implemented as sets. In NFU, we need to bear in mind that x and y are three types lower than R in $x R y$ (one type lower if a type-level ordered pair is used).

[edit] Properties and kinds of relations

Let R be some binary relation. R is:

Reflexive if $\forall x \exist y . (xRy \vee yRx) \leftrightarrow xRx$ .

Symmetric if $\forall xy . xRy \leftrightarrow yRx$ .

Transitive if $\forall xyz . xRy \wedge yRz \rightarrow xRz$ .

Antisymmetric if $\forall xy . xRy \wedge yRx \rightarrow x=y$ .

Well-founded if for every set S which meets the field of R, $\ \exists x \in S$ whose preimage under R does not meet S.

Extensional if for every x,y in the field of R, x=y if and only if x and y have the same preimage under R.

Relations having certain combinations of the above properties have standard names. R is:

An equivalence relation if and only if R is reflexive, symmetric, and transitive.

A partial order if and only if R is reflexive, antisymmetric, and transitive.

A linear order if and only if R is a partial order and for every x,y in the field of R, either xRy or yRx.

A well-ordering if and only if R is a linear order and well-founded.

A set picture if and only if R is well-founded and extensional, and the field of R either equals the downward closure of one of its members (called its top element), or is empty.

[edit] Functions

A functional relation is a binary predicate F such that $(\forall xyz.x F y \wedge x F z \rightarrow y=z)$ . Such a relation (predicate) is implemented as a relation (set) exactly as described in the previous section. So the predicate F is implemented by the set $\{(x,y) \mid x F y\}$ . A set of ordered pairs F is a function if and only if $(\forall xyz.(x,y) \in F \wedge (x,z) \in F \rightarrow y=z)$ . We define F(x) as the unique object y such that xFy (if there is one) or as the unique object y such that (x,y) ∈ F. The presence in both theories of functional predicates which are not sets makes it useful to allow the notation F(x) both for sets F and for important functional predicates. As long as one does not quantify over functions in the latter sense, all such uses are in principle eliminable.

In NFU, x has the same type as F(x), and F is three types higher than F(x) (one type higher, if a type-level ordered pair is used). For any set A, we define $F [A]$ as $\{y \mid (\exists x.x \in A \wedge y=F(x))\}$ , more conveniently written as $\{F(x) \mid x \in A\}$ . If A is a set and F is any functional relation, Replacement assures that $F [A]$ is a set in ZFC. In NFU, $F [A]$ and A have the same type, and F is two types higher than $F [A]$ (the same type, if a type-level ordered pair is used).

The function I(x) = x is not a set in ZFC because it is "too large". I(x) is however, a set in NFU. The function (predicate) $S (x) = {x}$ is not a function (set) in either theory; in ZFC because it is too large; in NFU because its definition is unstratified. Moreover, S(x) can be proved not to exist in NFU: see the resolution of Cantor's paradox in New Foundations.

[edit] Operations on functions

Let f and g be arbitrary functions. The composition of f and g, $g \circ f$ , is defined as the relative product $f | g$ , if this results in a function: $g \circ f$ is a function, with $(g \circ f)(x) = g(f(x))$ , if the range of f is a subset of the domain of g. The inverse of f, f^-1, is defined as the converse of f (if this is a function). Given any set A, the identity function $i A$ is the set $\{(x,x)\mid x \in A\}$ : this is a set in both ZFC and NF, but for different reasons.

[edit] Special kinds of function

A function is an injection and one-to-one if it has an inverse.

If f is a function from set A to set B, f is a:

Injection if the domain of f is A and the range of f is a subset of B. In other words, the images under f of distinct members of A are distinct members of B.
Surjection if f has domain A and range B.
Bijection if f is both an injection and a surjection.

This terminology adjusts for the fact that a function, as defined above, does not determine its codomain.

[edit] Size of sets

In both ZFC and NFU, we say that two sets A and B are the same size (or are equinumerous) if and only if there is a bijection f from A to B. We can write this $| A | = | B |$ as long as we note that for the moment this expresses a relation between A and B rather than a relation between objects $| A |$ and $| B |$ which have not yet been defined. We also provide notation $A \sim B$ for this relation to be used in contexts such as the actual definition of the cardinals where even the appearance of presupposing abstract cardinals should be avoided.

Similarly, we can define $|A| \leq |B|$ as holding if and only if there is an injection from A to B.

It is straightforward to show that the relation of equinumerousness is an equivalence relation: equinumerousness of A with A is witnessed by $i A$ ; if f witnesses $| A | = | B |$ , then $f - 1$ witnesses $| B | = | A |$ ; if f witnesses $| A | = | B |$ and g witnesses $| B | = | C |$ , then $g\circ f$ witnesses $| A | = | C |$ .

We can show that $|A| \leq |B|$ is a linear order -- on abstract cardinals, but not on sets. Reflexivity is obvious and transitivity is proven just as for equinumerousness. The Schroder-Bernstein theorem, provable in either ZFC or NFU in an entirely standard way, establishes that

$|A| \leq |B| \wedge |B| \leq |A| \rightarrow |A| = |B|$

(this establishes that we have antisymmetry on cardinals (not yet defined), but we are now considering a relation on sets), and

$|A| \leq |B| \vee |B| \leq |A|$

follows in a standard way in either theory from the Axiom of Choice.

[edit] Finite sets and natural numbers

Natural numbers can be considered either as finite ordinals or finite cardinals. Here we consider them as finite cardinal numbers. This is the first place where a major difference between the implementations in ZFC and NFU becomes evident.

The Axiom of Infinity of ZFC tells us that there is a set A which contains $\emptyset$ and contains $y \cup \{y\}$ for each $y \in A$ . This set A is not uniquely determined (it can be made larger while preserving this closure property): the set N of natural numbers is

$\{x \in A \mid (\forall B.(\emptyset \in B \wedge (\forall y.y \in B \rightarrow y \cup \{y\} \in B) \rightarrow x \in B)\}$

which is the intersection of all sets which contain the empty set and are closed under the "successor" operation $y \mapsto y \cup \{y\}$ .

In ZFC, we say that a set $A$ is finite if and only if there is $n \in N$ such that $| n | = | A |$ : further, we define $| A |$ as this n for finite A. (It can be proved that no two distinct natural numbers are the same size).

The usual operations of arithmetic can be defined recursively and in a style very similar to that in which the set of natural numbers itself is defined. For example, + (the addition operation on natural numbers) can be defined as the smallest set which contains $((\emptyset,\emptyset),\emptyset)$ and contains $((x,y \cup \{y\}),z \cup \{z\})$ whenever it contains $((x, y), z)$ .

In NFU, it is not obvious that this approach can be used, since the successor operation $y \cup \{y\}$ is unstratified and so the set N as defined above cannot be shown to exist in NFU (it is interesting to note that it is consistent for the set of finite von Neumann ordinals to exist in NFU, but this strengthens the theory, as the existence of this set implies the Axiom of Counting (for which see below or the New Foundations article)).

The standard definition of the natural numbers, which is actually the oldest set-theoretic definition of natural numbers, is as equivalence classes of finite sets under equinumerousness. We here present the definition of N appropriate to NFU in exactly this way (this is not the usual way to do it, but the results are the same): define Fin, the set of finite sets, as

$\{A \mid (\forall F.(\emptyset \in F \wedge (\forall xy.x \in F \rightarrow x \cup \{y\} \in F)) \rightarrow A \in F)\}$

For any set $A \in Fin$ , define $| A |$ as $\{B \mid A \sim B\}$ . Define N as the set $\{|A| \mid A \in Fin\}$ .

The Axiom of Infinity of NFU can be expressed as $V \not\in Fin$ : this is enough to establish that each natural number has a nonempty successor (the successor of $| A |$ being $|A \cup \{x\}|$ for any $x \not\in A$ ) which is the hard part of showing that the Peano axioms of arithmetic are satisfied.

The operations of arithmetic can be defined in a style similar to the style given above (using the definition of successor just given). They can also be defined in a natural set theoretical way: if A and B are disjoint finite sets, we can define |A|+|B| as $|A \cup B|$ . More formally, define m+n for m and n in N as

$\{A \mid (\exists BC.B \in m \wedge C \in n \wedge B \cap C = \emptyset \wedge A = B \cup C\}$

(But note that this style of definition is feasible for the ZFC numerals as well, but more circuitous: the form of the NFU definition facilitates set manipulations while the form of the ZFC definition facilitates recursive definitions, but either theory supports either style of definition).

The two implementations are quite different. In ZFC, we choose a representative of each finite cardinality to represent that cardinality (the equivalence classes themselves are too large to be sets); in NFU the equivalence classes themselves are sets, and are thus an obvious choice for objects to stand in for the cardinalities. However, the arithmetic of the two theories is identical: the same abstraction is implemented by these two superficially different approaches.

[edit] Equivalence relations and partitions

A general technique for implementing abstractions in set theory is the use of equivalence classes. If an equivalence relation R tells us that elements of its field A are alike in some particular respect, then for any set x we can regard the set $[x]_R=\{y \in A \mid x R y\}$ as representing an abstraction from the set x respecting just those features (we identify elements of A up to R).

For any set A, we say that a set $P$ is a partition of A if all elements of P are nonempty, any two distinct elements of P are disjoint, and $A=\bigcup P$ .

For every equivalence relation R with field A, $\{[x]_R \mid x \in A\}$ is a partition of A. Moreover, each partition P of A determines an equivalence relation $\{(x,y) \mid (\exists A \in P.x \in A \wedge y \in A)\}$ .

This technique has limitations in both ZFC and NFU. In ZFC, since the universe is not a set, it seems possible to abstract features only from elements of small domains. This can be circumvented using a trick due to Dana Scott: if R is an equivalence relation on the universe, define $[x] R$ as the set of all y such that $y R x$ and the rank of y is less than or equal to the rank of any $z R x$ . This works because the ranks are sets. Of course, there still may be a proper class of $[x] R$ 's. In NFU, the main difficulty is that $[x] R$ is one type higher than x, so for example the "map" $x \mapsto [x]_R$ is not in general a (set) function (though $\{x\} \mapsto [x]_R$ is a set). This can be circumvented by the use of the Axiom of Choice to select a representative from each equivalence class to replace $[x] R$ , which will be at the same type as x, or by choosing a canonical representative if there is a way to do this without invoking Choice (the use of representatives is hardly unknown in ZFC, either). In NFU, the use of equivalence class constructions to abstract properties of general sets is more common, as for example in the definitions of cardinal and ordinal number below.

[edit] Ordinal numbers

We say that two well-orderings $W 1$ and $W 2$ are similar and write $W_1 \sim W_2$ just in case there is a bijection f from the field of $W 1$ to the field of $W 2$ such that $x W_1 y \leftrightarrow f(x)W_2f(y)$ for all x and y.

Similarity is shown to be an equivalence relation in much the same way that equinumerousness was shown to be an equivalence relation above.

In NFU we define the order type of a well-ordering W as the set of all well-orderings which are similar to W. We define the set of ordinal numbers as the set of all order types of well-orderings.

In ZFC we cannot do this, because the equivalence classes are too large. It would be formally possible to use Scott's trick to define the ordinals in essentially this way nonetheless, but instead we use a device of von Neumann.

For any partial order $\leq$ , the corresponding strict partial order < is defined as $\{(x,y) \mid x \leq y \wedge x \neq y\}$ . Strict linear orders and strict well-orderings are defined similarly.

A set A is said to be transitive if $\bigcup A \subseteq A$ : each element of an element of A is also an element of A. A (von Neumann) ordinal is a transitive set on which membership is a strict well-ordering.

In ZFC, the order type of a well-ordering W is then defined as the unique von Neumann ordinal which is equinumerous with the field of W and membership on which is isomorphic to the strict well-ordering associated with W. (the equinumerousness condition distinguishes between well-orderings with fields of size 0 and 1, whose associated strict well-orderings are indistinguishable).

In ZFC there cannot be a set of all ordinals. In fact, the von Neumann ordinals are an inconsistent totality in any set theory: it can be shown with modest set theoretical assumptions that every element of a von Neumann ordinal is a von Neumann ordinal and the von Neumann ordinals are strictly well-ordered by membership. It follows that the class of von Neumann ordinals would be a von Neumann ordinal if it were a set: but it would then be an element of itself, which contradicts that fact that membership is a strict well-ordering of the von Neumann ordinals.

It is worth noting that the existence of order types for all well-orderings is not a theorem of Zermelo set theory: it requires the Axiom of replacement. Even Scott's trick cannot be used in Zermelo set theory without an additional assumption (such as the assumption that every set belongs to a rank which is a set, which does not essentially strengthen Zermelo set theory but is not a theorem of that theory).

In NFU, the collection of all ordinals is a set by stratified comprehension. The Burali-Forti paradox is evaded in an unexpected way. There is a natural order on the ordinals defined by $\alpha\leq \beta$ if and only if some (and so any) $W_1 \in \alpha$ is similar to an initial segment of some (and so any) $W_2\in \beta$ . Further, it can be shown that this natural order is a well-ordering of the ordinals and so must have an order type $Ω$ . It would seem that the order type of the ordinals less than $Ω$ with the natural order would be $Ω$ , contradicting the fact that $Ω$ is the order type of the entire natural order on the ordinals (and so not of any of its proper initial segments). But this relies on our intuition (correct in ZFC) that the order type of the natural order on the ordinals less than $α$ is $α$ for any ordinal $α$ . This assertion is unstratified, because the type of the second $α$ is four higher than the type of the first (two higher if a type level pair is used). The assertion which is true and provable in NFU is that the order type of the natural order on the ordinals less than $α$ is $T 4 (α)$ for any ordinal $α$ , where $T (α)$ is the order type of $W^{\iota}=\{(\{x\},\{y\})\mid xWy\}$ for any $W \in \alpha$ (it is easy to show that this does not depend on the choice of W; note that T raises type by one). Thus the order type of the ordinals less than $Ω$ with the natural order is $T 4 (Ω)$ , and $T 4 (Ω) < Ω$ . All uses of $T 4$ here can be replaced with $T 2$ if a type-level pair is used.

This shows that the T operation is nontrivial, which has a number of interesting consequences. It follows immediately that the singleton map $x \mapsto \{x\}$ is not a set, as otherwise restrictions of this map would establish the similarity of W and $W ι$ for any well-ordering W. T is (externally) bijective and order-preserving. Because of this, the fact $T 4 (Ω) < Ω$ establishes that $\Omega > T(\Omega) > T^2(\Omega) \ldots$ is a "descending sequence" in the ordinals which cannot be a set.

Ordinals fixed by T are called cantorian ordinals, and ordinals which dominate only cantorian ordinals (which are easily shown to be cantorian themselves) are said to be strongly cantorian. The motivation for these definitions will be clearer in the next section. It is important to note that there can be no set of cantorian ordinals or set of strongly cantorian ordinals.

[edit] Digression: von Neumann ordinals in NFU

It is possible to reason about von Neumann ordinals in NFU. Recall that a von Neumann ordinal is a transitive set A such that the restriction of membership to A is a strict well-ordering. This is quite a strong condition in the NFU context, since the membership relation involves a difference of type. A von Neumann ordinal A is not an ordinal in the sense of NFU, but $\in\lceil A$ belongs to an ordinal $α$ which may be termed the order type of (membership on) A. It is easy to show that the order type of a von Neumann ordinal A is cantorian: for any well-ordering W of order type $α$ , the induced well-ordering of initial segments of W by inclusion has order type $T (α)$ (it is one type higher, thus the application of T): but the order types of the well-ordering of a von Neumann ordinal A by membership and the well-ordering of its initial segments by inclusion are clearly the same because the two well-orderings are actually the same relation, so the order type of A is fixed under T. Moreover, the same argument applies to any smaller ordinal (which will be the order type of an initial segment of A, also a von Neumann ordinal) so the order type of any von Neumann ordinal is strongly cantorian!

The only von Neumann ordinals which can be shown to exist in NFU without additional assumptions are the concrete finite ones. However, the application of a permutation method can convert any model of NFU to a model in which every strongly cantorian ordinal is the order type of a von Neumann ordinal. This suggests that the concept "strongly cantorian ordinal of NFU" might be a better analogue to "ordinal of ZFC" than is the apparent analogue "ordinal of NFU".

[edit] Cardinal numbers

Cardinal numbers are defined in NFU in a way which generalizes the definition of natural number: for any set A, $|A| =_{\mathrm{def}} \{B \mid B \sim A\}$ .

In ZFC, these equivalence classes are too large as usual. Scott's trick could be used (and indeed is used in ZF without Choice), but we usually define $| A |$ as the smallest order type (here a von Neumann ordinal) of a well-ordering of A (that every set can be well-ordered follows from the Axiom of Choice in the usual way in both theories).

The natural order on cardinal numbers is seen to be a well-ordering: that it is reflexive, antisymmetric (on abstract cardinals, which are now available) and transitive has been shown above. That it is a linear order follows from the Axiom of Choice: well-order two sets and an initial segment of one well-ordering will be isomorphic to the other, so one set will have cardinality smaller than that of the other. That it is a well-ordering follows from the Axiom of Choice in a similar way.

With each infinite cardinal, many order types are associated for the usual reasons (in either set theory).

Cantor's theorem shows (in both theories) that there are nontrivial distinctions between infinite cardinal numbers. In ZFC, one proves $| A | < | P (A) | .$ In NFU, the usual form of Cantor's theorem is false (consider the case A=V), but Cantor's theorem is an ill-typed statement. The correct form of the theorem in NFU is $| P 1 (A) | < | P (A) |$ , where $P 1 (A)$ is the set of one-element subsets of A. $| P 1 (V) | < | P (V) |$ shows that there are "fewer" singletons than sets (the obvious bijection $x \mapsto \{x\}$ from $P 1 (V)$ to V has already been seen not to be a set). It is actually provable in NFU + Choice that $| P 1 (V) | < | P (V) | < < | V |$ (where << signals the existence of many intervening cardinals; there are many, many urelements!). We define a type-raising T operation on cardinals analogous to the T operation on ordinals: $T ( | A | ) = | P 1 (A) |$ ; this is an external endomorphism of the cardinals just as the T operation on ordinals is an external endomorphism of the ordinals.

A set A is said to be cantorian just in case $| A | = | P 1 (A) | = T ( | A | )$ ; the cardinal $| A |$ is also said to be a cantorian cardinal. A set A is said to be strongly cantorian (and its cardinal to be strongly cantorian as well) just in case the restriction of the singleton map to A ( $(x \mapsto \{x\})\lceil A$ ) is a set. Well-orderings of strongly cantorian sets are always strongly cantorian ordinals; this is not always true of well-orderings of cantorian sets (though the shortest well-ordering of a cantorian set will be cantorian). A cantorian set is a set which satisfies the usual form of Cantor's theorem.

The operations of cardinal arithmetic are defined in a set-theoretically motivated way in both theories. $|A| + |B| = \{C \cup D \mid C \sim A \wedge D \sim B \wedge C \cap D = \emptyset\}$ . One would like to define $|A|\cdot|B|$ as $|A \times B|$ , and one does this in ZFC, but there is an obstruction in NFU when using the Kuratowski pair: one defines $|A|\cdot|B|$ as $T^{-2}(|A \times B|)$ because of the type displacement of 2 between the pair and its projections, which implies a type displacement of two between a cartesian product and its factors. It is straightforward to prove that the product always exists (but requires attention because the inverse of T is not total).

Defining the exponential operation on cardinals requires T in an essential way: if we define $B A$ as the collection of functions from A to B, we note that this is three types higher than A or B, so it is reasonable to define $| B | | A |$ as $T - 3 ( | B A | )$ so that it is the same type as A or B ( $T - 1$ replaces $T - 3$ if we use a type-level pair). An effect of this is that the exponential operation is partial: for example, $2 | V |$ is undefined. In ZFC one defines $| B | | A |$ as $| B A |$ without difficulty.

The exponential operation is total and behaves exactly as we expect on cantorian cardinals, since T fixes such cardinals and it is easy to show that a function space between cantorian sets is cantorian (as are power sets, cartesian products, and other usual type constructors). This offers further encouragement to the view that the "standard" cardinalities in NFU are the cantorian (indeed, the strongly cantorian) cardinalities, just as the "standard" ordinals seem to be the strongly cantorian ordinals.

Now the usual theorems of cardinal arithmetic with the axiom of choice can be proved, including $\kappa \cdot \kappa = \kappa$ . From the case $|V|\cdot |V| = |V|$ we can derive the existence of a type level ordered pair: $|V| \cdot |V| = T^{-2}(|V \times V|)$ is equal to $| V |$ just in case $|V \times V| = T^2(|V|) = |P_1^2(V)|$ , which would be witnessed by a one-to-one correspondence between Kuratowski pairs $(a, b)$ and double singletons ${{c}}$ : redefine $(a, b)$ as the c such that ${{c}}$ is associated with the Kuratowski $(a, b)$ : this is a type-level notion of ordered pair.

[edit] The Axiom of Counting and subversion of stratification

We now have two different implementations of the natural numbers in NFU (though they are the same in ZFC): finite ordinals and finite cardinals. Each of these supports a T operation in NFU (basically the same operation). It is easy to prove that $T (n)$ is a natural number if n is a natural number in NFU + Infinity + Choice (and so $| N |$ and the first infinite ordinal $ω$ are cantorian) but it is not possible to prove in this theory that $T (n) = n$ . However, common sense indicates that this should be true, and we adopt this as an axiom:

Rosser's Axiom of Counting: For each natural number n, $T (n) = n$ .

One natural consequence of this axiom (and indeed its original formulation) is

$|\{1,\ldots,n\}| = n$ for each natural number n.

All that can be proved in NFU without Counting is $|\{1,\ldots,n\}| = T^2(n)$ .

A consequence of Counting is that N is a strongly cantorian set (again, this is an equivalent assertion).

Strongly cantorian sets have important properties which we introduce here. The type of any variable restricted to a strongly cantorian set A can be raised or lowered as desired by replacing references to $a \in A$ with references to $\bigcup f(a)$ (type of a raised; this presupposes that it is known that a is a set; we otherwise have to say "the element of $f (a)$ " to get this effect) or $f - 1 ({a})$ (type of a lowered) where $f (a) = {a}$ for all $a \in A$ , so it is not necessary to assign types to such variables for purposes of stratification.

Any subset of a strongly cantorian set is strongly cantorian. The power set of a strongly cantorian set is strongly cantorian. The cartesian product of two strongly cantorian sets is strongly cantorian.

Introducing the Axiom of Counting means that we do not need to assign types to variables restricted to N or to P(N), R (the set of reals) or indeed any set ever considered in classical mathematics outside of set theory.

There are no analogous phenomena in ZFC. See the main New Foundations article for stronger axioms that can be adjoined to NFU to enforce "standard" behavior of familiar mathematical objects.

[edit] Familiar number systems: positive rationals, magnitudes, and reals

Here we construct the system of real numbers.

We represent positive fractions as pairs of positive natural numbers (0 is excluded): $\frac pq$ is represented by the pair $(p, q)$ . $\frac pq = \frac rs \leftrightarrow ps=qr$ is what we expect: to implement this, introduce the relation $\sim$ defined by $(p,q)\sim (r,s) \leftrightarrow ps=qr$ . It is provable that this is an equivalence relation: define positive rational numbers as equivalence classes of pairs of positive natural numbers under this relation. Arithmetic operations on positive rational numbers and the order relation on positive rationals are defined just as in elementary school and proved (with some effort) to have the expected properties.

We represent magnitudes (positive reals) as nonempty proper initial segments of the positive rationals with no largest element. The operations of addition and multiplication on magnitudes are implemented by elementwise addition of the positive rational elements of the magnitudes. Order is implemented as set inclusion.

We represent real numbers as formal differences $m - n$ of magnitudes: formally speaking, a real number is an equivalence class of pairs $(m, n)$ of magnitudes under the equivalence relation $\sim$ defined by $(m,n) \sim (r,s) \leftrightarrow m+s = n+r$ . The operations of addition and multiplication on real numbers are defined just as one would expect from the algebraic rules for adding and multiplying differences. The treatment of order is also as in elementary algebra.

This is the briefest sketch of the constructions. The most interesting thing to note is that the constructions are exactly the same in ZFC and in NFU, except for the difference in the constructions of the natural numbers: since all variables are restricted to strongly cantorian sets, there is no need to worry about stratification restrictions. Without the Axiom of Counting, it might be necessary to introduce some applications of T in a full discussion of these constructions.

[edit] Operations on indexed families of sets

Here we discuss a class of constructions often found in mathematics in which it appears that ZFC has a clear advantage over NFU (though the constructions are clearly feasible in NFU, they are more complicated than in ZFC for reasons having to do with stratification). It is important to note that throughout this section we assume a type-level ordered pair.

We first discuss general n-tuples. We define $(x, y)$ using the type level pair indicated above. We define $(x_1,\ldots,x_n,x_{n+1})$ as $(x_1,(x_2,\ldots,x_n))$ . The definition of the general n-tuple using the Kuratowski pair is trickier, as one needs to keep the types of all the projections the same, and the type displacement between the n-tuple and its projections increases as n increases. Here, the n-tuple has the same type as each of its projections.

We can define general cartesian products similarly: $A_1 \times A_2 \times \ldots \times A_{n} = A_1 \times (A_2 \times \ldots \times A_{n})$

The definitions are the same in ZFC but without any worries about stratification (the grouping given here is opposite to that more usually used, but this is easily corrected for).

Now consider the infinite cartesian product $\Pi_{i \in I}A_i$ . In ZFC, we define this product as the set of all functions f with domain I such that $f(i) \in A_i$ (where A is implicitly understood as a function taking each i to $A i$ ).

In NFU, this is certainly manageable but requires attention to type. Given a set I and set valued function A whose value at ${i}$ in $P 1 (I)$ is written $A i$ , we define $\Pi_{i \in I}A_i$ as the set of all functions f with domain I such that $f(i) \in A_i$ : notice that $f(i) \in A_i = A(\{i\})$ is stratified because of our convention that A is a function with values at singletons of the indices. Note that the very largest families of sets (which cannot be indexed by sets of singletons) will not have cartesian products under this definition. Note further that the sets $A i$ are at the same type as the index set I (since one type higher than its elements); the product, as a set of functions with domain I (so at the same type as I), is one type higher (assuming a type-level ordered pair).

Now consider the product $\Pi_{i \in I}|A_i|$ of the cardinals of these sets. The cardinality | $\Pi_{i \in I}A_i$ | is one type higher than the cardinals $| A i |$ , so the correct definition of the infinite product of cardinals is $T^{-1}(|\Pi_{i \in I}A_i|)$ (because the inverse of T is not total, it is possible that this may not exist).

We repeat this for disjoint unions of families of sets and sums of families of cardinals. Again, let A be a set-valued function with domain $P 1 (I)$ : we write $A i$ for $A ({i})$ . The disjoint union $\Sigma_{i \in I}A_i$ is the set $\{(i,a) \mid a \in A_i\}$ , which the reader may check for stratification. This set is at the same type as the sets $A i$ .

The correct definition of the sum $\Sigma_{i \in I}|A_i|$ is thus $|\Sigma_{i \in I}A_i|$ , since there is no type displacement.

It is possible to extend these definitions to handle index sets which are not sets of singletons, but this introduces an additional type level and is not needed for most purposes.

In ZFC we can define the disjoint union $\Sigma_{i \in I}A_i$ as $\{(i,a) \mid a \in A_i\}$ , where $A i$ abbreviates $A (i)$ .

Permutation methods can be used to show relative consistency with NFU of the assertion that for every strongly cantorian set A there is a set I of the same size whose elements are self-singletons: $i = {i}$ for each i in I. The definitions above would be simpler with index sets of self-singletons if we were working with indexed families of strongly cantorian size.

[edit] The cumulative hierarchy

In ZFC, we define the cumulative hierarchy as the ordinal-indexed sequence of sets satisfying the following conditions: $V_0 = \emptyset$ ; $V α + 1 = P (V α)$ ; $V_{\lambda} = \bigcup\{V_{\beta} \mid \beta<\lambda\}$ for limit ordinals $λ$ . This is an example of a construction by transfinite recursion. The rank of a set A is said to be $α$ if and only if $A \in V_{\alpha+1}-V_{\alpha}$ . The existence of the ranks as sets depends on the axiom of replacement at each limit step (the hierarchy cannot be constructed in Zermelo set theory); by the axiom of foundation, every set belongs to some rank.

The cardinal $| P (V ω + α) |$ is called $\beth_{\alpha}$ .

This construction cannot be carried out in NFU because the power set operation is not a set function in NFU ( $P (A)$ is one type higher than A for purposes of stratification).

The sequence of cardinals $\beth_{\alpha}$ can be implemented in NFU. Recall that $2 | A |$ is defined as $T - 1 ( | {0,1} A | )$ , where ${0,1}$ is a convenient set of size 2, and $| {0,1} A | = | P (A) |$ because we are using a type level pair. Let $\beth$ be the smallest set of cardinals which contains $| N |$ (the cardinality of the set of natural numbers), contains the cardinal $2 | A |$ whenever it contains $| A |$ , and which is closed under suprema of sets of cardinals.

Now we introduce a convention for ordinal indexing of any well-ordering $W$ : $W α$ is defined as the element x of the field of $W$ such that the order type of the restriction of $W$ to $\{y \mid y W x\}$ is $α$ ; we then define $\beth_{\alpha}$ as the element with index $α$ in the natural order on the elements of $\beth$ . The cardinal $\aleph_{\alpha}$ is the element with index $α$ in the natural order on all infinite cardinals (which we have seen above to be a well-ordering): note that $\aleph_0 = |N|$ follows immediately from this definition. In all these constructions, notice that the type of the index $α$ is two higher (with type-level ordered pair) than the type of $W α$ .

Now we indicate how the cumulative hierarchy itself (as a mathematical structure) can be studied in NFU. Each set A of ZFC has a transitive closure $T C (A)$ (the intersection of all transitive sets which contains A). By the axiom of foundation, the restriction of the membership relation to the transitive closure of A is a well-founded relation. The relation $\in \lceil TC(A)$ is either empty or has A as its top element, so this relation is what we called a set picture above. It can be proved in ZFC that every set picture is isomorphic to some $\in \lceil TC(A)$ .

This suggests that we can study (an initial segment of) the cumulative hierarchy by considering the isomorphism classes of set pictures. These isomorphism classes are sets and make up a set in NFU. There is a natural set relation analogous to membership on isomorphism classes of set pictures: if $x$ is a set picture, we write $[x]$ for its isomorphism class, and we define $[x] E [y]$ as holding if $[x]$ is the isomorphism class of the restriction of y to the downward closure of one of the elements of the preimage under y of the top element of y. The relation E is a set relation, and it is straightforward to prove that it is well-founded and extensional. If the definition of E is found confusing, it can be deduced from the observation that it is induced by precisely the relationship which holds between the set picture associated with A and the set picture associated with B when $A \in B$ in the usual set theory.

There is a T operation on isomorphism classes of set pictures analogous to the T operation on ordinals: if x is a set picture, so is $x^{\iota} = \{(\{a\},\{b\})\mid (a,b) \in x\}$ , and we define $T ([x])$ as $[x ι]$ . It is easy to see that $[x]E[y] \leftrightarrow T([x])=T([y])$ .

Because E is extensional, we have an axiom of extensionality for this simulated set theory. Because E is well-founded, we have an axiom of foundation. There remains the question of what comprehension axiom we may have for E. Consider any collection of set pictures $\{x^{\iota}\mid x \in S\}$ (collection of set pictures whose fields are made up entirely of singletons). Since each $x ι$ is one type higher than x and we have a type level ordered pair, we can replace each element ${a}$ of the field of each $x ι$ in the collection with $(x,{a})$ , obtaining a collection of set pictures isomorphic to the original collection but with their fields disjoint. We can take the union of these set pictures and add a new top element to obtain a set picture whose isomorphism type will have as its preimages under E exactly the elements of the original collection. That is, for any collection of isomorphism types $[x ι] = T ([x])$ , there is an isomorphism type $[y]$ whose preimage under E is exactly this collection.

In particular, there will be an isomorphism type [v] whose preimage under E is the collection of all T[x]'s (including T[v]). Since T[v] E v and E is well-founded, we have $T[v] \neq v$ . This resembles the resolution of the Burali-Forti paradox discussed above and in the New Foundations article, and is in fact the local resolution of Mirimanoff's paradox of the set of all well-founded sets.

There are ranks of isomorphism classes of set pictures just as there are ranks of sets in the usual set theory. For any collection of set pictures A, define S(A) as the set of all isomorphism classes of set pictures whose preimage under E is a subset of A; call A a "complete" set if every subset of A is a preimage under E. The collection of "ranks" is the smallest collection containing the empty set and closed under the S operation (which is a kind of power set construction) and under unions of its subcollections. It is straightforward to prove (much as in the usual set theory) that the ranks are well-ordered by inclusion, and so the ranks have an index in this well-order: we refer to the rank with index $α$ as $R α$ . It is provable that $|R_{\alpha}|=\beth_{\alpha}$ for complete ranks $R α$ . The union of the complete ranks (which will be the first incomplete rank) with the relation E looks like an initial segment of the universe of Zermelo-style set theory (not necessarily like the full universe of ZFC because it may not be large enough). It is provable that if $R α$ is the first incomplete rank, then $R T (α)$ is a complete rank and thus $T (α) < α$ . So we have a "rank of the cumulative hierarchy" with an "external automorphism" T moving the rank downward: this is exactly the condition on a nonstandard model of a rank in the cumulative hierarchy under which we construct a model of NFU in the New Foundations article. There are technical details to verify, but there is an interpretation not only of a fragment of ZFC but of NFU itself in this structure, with $[x]\in_{NFU}[y]$ defined as $T([x]) E [y] \wedge [y] \in R_{T(\alpha)+1}$ : this "relation" $E N F U$ is not a set relation but has the same type displacement between its arguments as the usual membership relation $\in$ .

So there is a natural construction inside NFU of the cumulative hierarchy of sets (as much of it as we can see) which internalizes the natural construction of a model of NFU in Zermelo-style set theory.

Under the Axiom of Cantorian Sets described in the New Foundations article, the strongly cantorian part of the set of isomorphism classes of set pictures with the E relation as membership becomes a (proper class) model of ZFC (in which there are n-Mahlo cardinals for each n; this extension of NFU is strictly stronger than ZFC). This is a proper class model because the strongly cantorian isomorphism classes do not make up a set.

Permutation methods can be used to create from any model of NFU a model in which every strongly cantorian isomorphism type of set pictures is actually realized as the restriction of the true membership relation to the transitive closure of a set.

[edit] See also

[edit] References

Keith Devlin, 1994. The Joy of Sets, 2nd ed. Springer-Verlag.
Holmes, Randall, 1998. Elementary Set Theory with a Universal Set. Academia-Bruylant. The publisher has graciously consented to permit diffusion of this introduction to NFU via the web. Copyright is reserved.
Potter, Michael, 2004. Set Theory and its Philosophy, 2nd ed. Oxford Univ. Press.
Suppes, Patrick, 1972. Axiomatic Set Theory. Dover.
Tourlakis, George, 2003. Lectures in Logic and Set Theory, Vol. 2. Cambridge Univ. Press.

[edit] External links

Metamath: A web site devoted to an ongoing derivation of mathematics from the axioms of ZFC and first-order logic. Principia Mathematica done right.
Stanford Encyclopedia of Philosophy:
- Quine's New Foundations -- by Thomas Forster.
- Alternative axiomatic set theories -- by Randall Holmes.
Randall Holmes: New Foundations Home Page.

Categories: Systems of set theory