Mathematical Induction
Contents
This post is a much needed break from my steady stream of posts concerning topology. Before my next post on bases for topologies, I need to introduce a proof technique that I haven't used so far.
In mathematics, we come across patterns all the time. For instance, here's a pattern you may have noticed before:
It turns out that for any natural number
From this image, it is evident that adding together the first
The axiom of choice and well-ordering principle
Before we start, I should mention that you cannot do a proof by induction unless you accept the axiom of choice. The axiom of choice is an extra axiom of set theory which is somewhat controversial (among mathematicians). This is because at first glance it seems obviously true, but it actually leads to some bizarre and unexpected results.
Axiom of Choice. Suppose
is an indexing set and is a collection of nonempty sets for each . Then there exists a function such that for every .
In simpler words, the axiom of choice ensures that, given any number of nonempty sets, it is possible to choose precisely one element from each set. This is probably something you never would have dreamt was controversial, and it certainly seems like a natural assumption, but as I said before it allows for some strange results.
For instance, it leads to the Banach-Tarski Paradox, that is, the fact that given a solid ball in three dimensions (
I'm always going to assume the axiom of choice, mostly because I like it and it makes a lot of proofs considerably easier. Sometimes it is the only thing that makes proofs possible at all, like the proof of the principle of mathematical induction. Which I promise I'm getting to.
For this proof, we're going to need the axiom of choice in a slightly different form. This form is actually so commonly referred to that it even has a special name. The axiom of choice is equivalent to the statement that any set can be well-ordered. I'm not going to go into orderings right now, but I will state what this means for the natural numbers because we'll need to use it in our proof.
Well-Ordering Principle. Any nonempty set of natural numbers contains a least element.
This assumes the ordering you're familiar with for the natural numbers. You know, the one where
The well-ordering principle should hopefully seem very obvious to you as well, and I'm not going to show how it implies the principle of mathematical induction.
The principle of mathematical induction
I'll stop beating around the bush and just do the thing already.
Principle of Mathematical Induction. Suppose P(n) is a (true/false) statement about a natural number
, and that the following two criteria hold:
- Base Case:
is true. - Inductive Step: If
is true for some then so is . Then
is true for every . Proof. We give a proof by contradiction. That is, we're going to assume that the statement is false and show that this leads to an absurd result. We will then be forced to accept the principle of mathematical induction because we will have shown that it cannot be false.
Suppose then that both criteria hold, but that
is not true for at least one natural number. Let denote the set of natural numbers for which is false. By assumption, is not empty, so the well-ordering theorem tells us that contains a least element. Call this element . Then since is the smallest element of . That is, is false but is true. However, the second criterion tells us that must be true. This is a contradiction, since cannot be simultaneously true and false. It follows that is true for every natural number .
The proof is not terribly important, so don't spend too much time dissecting it. What really matters to us is the statement itself, which might seem daunting at first, so let's break it down piece by piece.
We start off with a statement which is either true or false depending on what number we feed it. The following are examples of such statements[1]:
is a prime number. sandwiches are on fire. . the th domino will fall over.
If we want to show that such a statement is true for all natural numbers, we must first verify the base case, which means showing that if we set
The last thing we have to do is show that if the statement holds for an arbitrary natural number
If we can do these two things, then we have shown that the statement is true for every natural number! Just remember: base case, then inductive step. Now let's look more closely at the example statements from above:
By definition,
On the other hand,
For now, let's focus on
What are the base case and inductive step for this example? The base case would be
So in other words, if we know for certain that the first domino will fall, this means that every domino will fall because we have verified both the base case and the inductive step.
The argument we have just given shows an infinite number of things to be true! This is part of the power of inductive arguments.
Examples
Let's now prove that the pattern I introduced at beginning of this post always holds.
Theorem. For any
,
Proof. We give a proof by induction on
. For our base case, we need to show that the statement is true when we let
. This is simple enough, because is certainly the sum of the first odd numbers, and it is equal to its square. That is, , so we have shown that our base case is true. Now for the inductive step. We will assume that
and use this to show that
This requires a tiny bit of algebraic manipulation, but isn't at all difficult:
In the first step above, we used our assumption that the sum of the first
odd numbers is and substituted this result into the equation. In the second step, we factored our resulting quadratic, and subsequently showed that our inductive step holds. It follows from the principle of mathematical induction that our assertion is true.
This proof is a good indicator of a trend that such proofs tend to follow. If we can show that the base case is true, we can frequently break down the statement for higher numbers into smaller pieces and then use our base case to glue everything back together.
In my first post on set theory, I stated without proof that De Morgan's Laws extend to arbitrary collections of sets. I still won't prove that in full, but I'll prove something that's nearly as powerful.
Theorem. Let
. Then
Proof. We proceed by induction on
. For our base case, we choose . That is, we need to show that
But this is just one of De Morgan's Laws, which we already know to be true! So we don't actually have to do any work to establish that the first criterion holds.
Next, let
be a natural number and suppose that
It is easy to see that
This completes the proof.
Let's look at another example. This is one of my favorites, and it lends itself nicely to a visual representation.
Theorem. A
checkerboard can be covered by L-shaped tiles, with the exception of one arbitrary square. Proof. Let's break down this statement a little bit first. When we talk about a
checkerboard, we are talking about a square board with sides of length . So
corresponds to the board, corresponds to the board, corresponds to the board, etc. For instance, here is the
checkerboard:
By 'L-shaped tile,' we mean a tile comprised of three squares which is shaped sort of like the letter L:
Lastly, when we say that a board can be covered by these tiles with the exception of one arbitrary square, we mean something like this:
It's important to point out that we could have left out any one square on the board above and still been able to tile the rest of the board. It does not matter which square we choose to exclude; we should still be able to cover the rest of the board. Try playing around with boards of different sizes, leaving one random square empty, and trying to come up with such a tiling. For larger boards, this quickly becomes no easy task.
Now that we have a better understanding of the problem, it is our goal to show that we can cover any
board in this manner, and we will do so by induction on . For our base case (
), we must show that the board can be tiled by L-shaped tiles, no matter which square we choose to omit. Since there are only four squares on this board, we can 'brute-force' the base case by demonstrating that with any square ommitted, the rest of the board can be tiled quite trivially using a single L-shaped tile:
We have shown that any
board can be covered in the desired manner, so the base case has been verified. Next we argue the inductive step. We begin with the assumption that for an arbitrary
, we can cover the checkerboard (with any one arbitrary square omitted) as desired. We must use this information to show that we can then tile the board in the same manner. Notice that if we are given a
board, the missing tile will necessarily lie in precisely one of its four quadrants:
But each of these quadrants is actually a
board, which we already know can be covered! So one of our quadrants has a missing tile already chosen for us, and the other three quadrants we can cover however we choose. So we do so in the following crafty way:
We've now covered the entire
board with the exception of our arbitrary missing square and a nice L-shaped hole in the middle, which we can fill in with a single L-shaped tile, leaving only the chosen square uncovered. We have thus shown that the inductive step holds, completing the proof.
I think that's more than enough for one post, so I'm going to leave it at that. We will be using induction for many of our future proofs, though, so this won't be the last time you'll see it!
The symbol
means 'is defined as.' ↩︎