Lesson 14: Infinity Infinities

In lesson 13 we established that there are at least two different kinds of infinities.  Now, we go on to see that there are in fact infinitely many different kinds of infinities.  If you’ve studied and understood the method of proof that we used in the last lesson, then the methods that we use in this lesson will be a breeze.  We will again use the method of proof by contradiction, and we’ll alsou use an idea very similar to the “diagonal slash” argument in the previous lesson.

Recall that this diagonal method (which is so famous that it is actually referred to in the biz as “Cantor’s Diagonal Slash”) involved supposing that there is a bijective correspondence between two infinite sets, and then finding an element in one of the sets that isn’t “hit” by the correspondence.  We found this element by going down the “diagonal” and defining our element in a way that ensures that it doesn’t show up anywhere in the correspondence.

So how do we show that there are infinitely many infinities?  Well, this is actually the same question as how one shows that there are infinitely many numbers.  We all know that there are infinitely many numbers because if the enemy hands me a number (and perhaps she claims it’s “the biggest number”), I know that I can always add 1 to it.  (In fact, I can add a million to it if I wanted to, but adding 1 is sufficient.)  In a similar way, I’ll now show that if the enemy hands me an infinite set, I can define a new infinite set whose cardinality is fundamentally bigger in the sense that we’ve been studying.  Since this new set will obviously also be infinite, I’ll be able to define another new, infinite, and larger set.  I then repeat, infinitely many times.  This is the analogue for “infinite numbers” of “adding 1”.  Let us see how this works.

We begin with the “smallest” infinity, which is infinity type 1.  (This is the “smallest” in the sense that any other infinite set can have a surjective function defined from it to the infinity type 1 set, but the converse is not necessarily true.  I.e., given two infinite sets A and B, where A has cardinality infinity type 1, and B’s cardinality is unknown (other than it being infinite), then we can always define a surjective function from B to A, but not necessarily from A to B.  This was precisely what we showed last lesson where A=\{1, 2, 3, ...\} and B=\{\mathrm{real\ numbers}\}.)  What is the analoge of “adding 1” to an infinite number, so that we get a brand new infinity that is truly larger than the previous one?  We’ve already seen the construction in lesson 3, and we defined it more rigorously in lesson 4—it is the construction of the power set.

Recall that if we have a set A, we can form its power set, which we denote by P(A), whose elements are subsets of A.  We saw in lesson 3 that if A is a set with N elements, then P(A) is a set with 2^N elements (see lesson 3 for a description of the notation “2^N”).  But regardless of whether or not A has finitely many elements, we can always define its power set.  Clearly, if A has infinitely many elements, so does P(A).  This is easily seen by realizing that each element in A (of which there are infinitely many) defines a perfectly good 1-element subset of A, and therefore defines a perfectly good element in P(A).  What’s important to notice is that there are way more subsets than that, though!  We also have all of the two element subsets, and three element subsets, and four element subsets, and even all of the infinite element subsets!

Let’s now focus on the case A=\{1, 2, 3, ...\}.  There are the finite element subsets like \{1, 123526642, 235463, 4,6\}, and there are infinite element subsets like \{3, 6, 9, 12, 15, 18, ...\}.  Clearly, there are tons and tons and tons of distinct subsets.  But there were also tons and tons of fractions in FRAC, and we showed in lesson 12 that the cardinality of FRAC and A were the same.  So is that the case here?  The answer is no, and let us see why.

Again, we suppose that there is a bijective correspondence, call it F, between A and P(A).  I.e., F is a bijective function from A to P(A).  Now, if this were the case, then this would mean that F maps each positive whole number to some subset of the positive whole numbers.  We’d get infinitely many expressions of the form F(1)=S_1, F(2)=S_2, F(3)=S_3, ..., where S_1 is an entire subset of positive whole numbers (because it is an element in the power set of the positive whole numbers, whose elements are sets), as is S_2, S_3, and so on.

Using this infinitely long list of expressions of the form F(i)=S_i, we can construct a subset of A that is not in the list.  We do this as follows.  We’ll call this new set T, and we define it similarly to our diagonal slash from last lesson.  Namely, if 1 is in S_1 (i.e, if 1 is in the subset which it labels under the mapping F), then we don’t put 1 in the set T, and if 1 is not in S1, then we do put 1 in T.  Then we move on to 2.  If 2 is in S_2, then we don’t put it in T, and if 2 is not in S_2, we do put it in T.  We then do this for all of the sets S_i..  This defines a perfectly good subset of A, and is thus an element in P(A). But is T in our correspondence?  Clearly it isn’t, because it differs from S(1) by either having or not having the element 1 (according to whether or not S(1) does), and it differs from S(2) in the element 2, and from S(3) in the element 3, and so on, all the way down the list.  Since the list was supposed to have every element in P(A) (by assumption of the fact that F was bijective), we clearly have a contradiction!

Thus no bijective function can be defined from A to P(A), and thus P(A) must have a fundamentally different cardinality.  Namely, it must be a different (and larger, since we can certainly define a surjective function from P(A) to A) infinity!

But now let’s suppose that A is any infinite set.  Any infinite set at all.  Then can we put A and P(A) (the power set of whatever this new A is) into a bijective correspondence?  The answer is no (again, for the experts, heavily relying on the Axiom of Choice), and the reason can be seen quite easily after the above argument is understood.  Namely, we suppose there is a bijective function F from A to P(A).  Then we define a new set T—a subset of A—as follows.  For each element “a” in A, we put “a” into T if and only if “a” is not in F(a), which is the subset of A that “a” is mapped to.  By the same logic as above, this new subset T cannot possibly be mapped to by any element in A because we’ve defined T to differ by every F(a), for any “a” in A (there is actually a cool parallel between this logic and that used in the paradox of lesson 5 for which the details are in the exercise below).  Thus no bijective function from A to P(A) can be defined, and so P(A) must be a different (and larger) type of infinity than A.

Now here’s the catch—the punchline, if you will.  We’ve just shown that if A is an infinite set, then P(A) is an infinite set of a fundamentally larger cardinality.  But what’s even more important is that P(A) is infinite (duh).  Thus, the same logic that was used above can be used on P(A) itself!  I.e., the power set of {P(A)}, which we obviously denote as P(P(A)), (and which is read aloud as “the power set of the power set of A”) must have a fundamentally larger cardinality than P(A) since we can’t define a bijective function between the two!  Similarly, P(P(P(A))) must be fundamentally “more infinite” than P(P(A)), and P(P(P(P(A)))) must be fundamentally “more infinite” than P(P(P(A)))!  We can hit each successive power set with another “P()” operation, thus defining another new level of infinity.  Clearly we can do this as many times if like. Namely, if you hand me some set and say it’s “the most infinite” set out there, then I can just hit it with a P() and obtain a fundamentally more infinite one.  Sound familiar?  It should!  We’ve just found a way to literally “add 1” to infinity in order to get different, larger infinities.  In fact, we get infinitely many of them!

Before ending this lesson and moving on to wildly different concepts for a while, let us introduce some of the ideas and unsolved problems that exist out there in the world of math. We will discuss these more in future lessons but it doesn’t hurt to introduce them now. And since this is just an introduction, we will not discuss these ideas in depth and so some confusion may arise. This is okay, though, as it will encourage us to keep moving forward into this amazing world of mathematics.

In the last this lesson we saw a way of “adding 1” to infinity, in order to get another, larger infinity. We compared the procedure of taking the power set of a finite set to the procedure of adding 1 to a normal, finite number. The natural next question to ask, then, is whether or not there is an analogue to adding one half to a normal, finite number. Namely, we know that we can add 1 to 1 to get 2, but we also know that if we allow for fractions then we can add \frac{1}{2} to 1 to get \frac{3}{2}. The result of adding one \frac{1}{2} to a number is to get a number that is truly in between the number itself, and the number that is obtained by adding 1. Namely, \frac{3}{2} is truly in between 1 and 2, because it is greater than 1 and less than 2. Thus, we might ask whether or not there are any infinite sets that have a truly larger cardinality than \mathbb{N} and a truly smaller cardinality than P(\mathbb{N}).

For this to be the case, we would need a set A such that there is no bijective function from A to either \mathbb{N} or P(\mathbb{N}), so that the cardinality of A does not equal the cardinality of either \mathbb{N} or P(\mathbb{N}). Additionally, this set A would need to be such that there was a surjective function from A to \mathbb{N} (so that the cardinality of A is truly larger than that of \mathbb{N}), as well as a surjective from P(\mathbb{N}) to A (so that the cardinality of A is truly smaller than that of P(\mathbb{N})). If we could find such a set, then we will have found an infinity that is “between” the infinity of \mathbb{N} and that of P(\mathbb{N}). One natural guess for such a set would be \mathbb{R}, since we already know that it has a larger cardinality than \mathbb{N}. Therefore, if we could show that P(\mathbb{N}) has a larger cardinality than \mathbb{R} we will have solved the problem, and the answer would be that there do exist sets whose cardinality is between \mathbb{N} and P(\mathbb{N}). It turns out, however, that the cardinality of \mathbb{R} is the same as that of P(\mathbb{N}), though we will not provide the proof of this statement here. This means, however, that we haven’t yet found a set whose cardinality is between that of \mathbb{N} and P(\mathbb{N}). Let us therefore ask: Is there such a set at all?

This is a perfectly reasonable question to ask, but it turns out that no one knows the answer to it. Georg Cantor — the brilliant mathematician who developed this entire formalism regarding infinity — died before he could figure it out, and every other mathematician that has tried to figure it out is either still struggling over it or has also died before succeeding. Namely, there is a statement called the Continuum Hypothesis which states that there is not a set whose cardinality is between that of \mathbb{N} and that of P(\mathbb{N}), but no mathematician has been able to prove that this is a true statement. Indeed, the Continuum hypothesis is unlike many other unsolved problems in math — the problem itself has a problem. It has in fact been shown that, using our standard system of mathematical axioms, the Continuum Hypothesis is actually unsolvable. What we mean by “unsolvable” is that even if it is true, we can’t prove that it is so. The possibility of fundamentally not being able to prove something within a given set of axioms comes from the remarkable result uncovered by the mathematician Kurt Godel, who proved what’s known as the Incompleteness Theorem (we will have more to say about the Incompleteness Theorem and its profound impact on math in a future lesson). The Incompleteness Theorem effectively tells us that in order to know whether or not the statement “there is no set whose cardinality is strictly larger than that of \mathbb{N} and strictly smaller than that of P(\mathbb{N})” is true, we’ll have to change the very foundations of mathematics! This is most certainly not a small task to ask for, but the difficulty of a problem has not stopped mathematicians in the past!

Before ending this lesson we should note how all of this discussion about infinity is only the tip of a very large iceberg. To see this, let us recall yet again how we have constantly used various properties of the finite numbers to motivate various definitions in the infinite case. We saw that this analogy allowed us to define some very abstract notions of counting, and these definitions allowed us to uncover a remarkable organization and order within the realm of the infinite. It turns out that we can still do a lot more. Namely, there is a beautiful and abstract way to define addition, subtraction, multiplication, division, and exponentiation in such a way as to simultaneously incorporate the finite, as well as the infinite numbers. In doing so, we will have put all of the various infinities on essentially equal footing with all of the various finite numbers, thus extending our very notion of what a number is to a much larger class of numbers — a class of numbers which includes the previously mysterious notion of infinity.

In order to see all of this — i.e., the details of the Continuum Hypothesis, the implications of the Incompleteness Theorem, and the ways in which we can add, subtract, multiply, and divide infinite numbers just like finite ones — we must develop some more mathematical machinery. Along the way, we will see lots of other forms of mathematical beauty over a wide range of topics, so let us temporarily leave the world of the infinite behind us and move forward to develop more mathematical weaponry.

Next Lesson

Previous Lesson

Advertisement

2 Responses to Lesson 14: Infinity Infinities

  1. Tuomas says:

    The proof that a power set has larger cardinality that the original set does NOT require axiom of choice. This is a common misunderstanding about AC: you don’t need it to make infinite number of choices, just for making infinite number of arbitrary choices. Here you have a clear rule to follow for each choice, hence you don’t need the AC. See AC article on Wikipedia for more info. Source for the claim itself: http://caicedoteaching.files.wordpress.com/2009/04/580-choiceless.pdf

    • Very interesting! I never really took the time to think about it on my own, I think I just vaguely remember a professor saying that was the case but clearly either he or I (probably the latter) was wrong. Thanks for the clarification, I’ve made the necessary adjustment/deletion 🙂

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s