Walton's JMU Math Blog: 2011

Wednesday, September 14, 2011

Proof by Induction and Summations

The two previous blog entries introduced the idea of the Principle of Mathematical Induction followed by a discussion of a typical example of a proof by induction. Be sure you read those two entries before you look at this one. This blog post extends these ideas by talking about how proof by induction applies to summations.

(I apologize right now about the formatting of summation notation, or sigma notation. I do not know how to get the math to look like math in a blog setting.)

Recall that the second condition of the PMI is that an arbitrary statement S(n) in the chain of statements being proved will guarantee that the next statement S(n+1) in the chain is also true. In a proof by induction, this step always involves using a recursive relation between something in the sentence S(n) and a corresponding object in the sentence S(n+1).

For summations, this recursive relation is always that a summation for statement S(n+1) exactly corresponds to the summation appearing in statement S(n) with some additional term(s).

For example, think back to our motivating example of a chain of statements:
S(1):    1 = 1(2)/2
S(2):    1+2 = 2(3)/2
S(3):    1+2+3 = 3(4)/2
S(4):    1+2+3+4 = 4(5)/2
S(5):    1+2+3+4+5 = 5(6)/2
...
Notice that the sum appearing on the left hand side of any given sentence appears in the sum on the very next sentence, but one more term has been added. Using parentheses to emphasize where the sum from the previous sentence appears in the new sentence, here is the same chain:

S(1):    1 = 1(2)/2
S(2):    (1)+2 = 2(3)/2
S(3):    (1+2)+3 = 3(4)/2
S(4):    (1+2+3)+4 = 4(5)/2
S(5):    (1+2+3+4)+5 = 5(6)/2
...

Summation (Sigma) notation gives us a handy way to describe sequences of sums like we see in the chain above. The terms of the sum follow a simple pattern. In this example, the pattern is the sequence of terms (1, 2, 3, 4, 5, ...). This sequence can be explicitly described as a_k = k for k=1, 2, 3, .... In the symbolism (notation) of summation notation, we write:

Σ¹_k=1(k) = 1
Σ²_k=1(k) = 1+2
Σ³_k=1(k) = 1+2+3
or
Σⁿ_k=1(k) = 1+2+3+...+n
(Notice that the formula in the parentheses (k) is the explicit formula of the sequence of terms in terms of the index which is listed at the bottom of Σ along with the first index of the sequence used in the sum. The index of the last term in the sum appears at the top of Σ.)

The pattern that we described above to illustrate the recursive relation between the sums of consecutive sentences in the chain of statements we wish to prove can also be written in terms of summation notation:
Σⁿ⁺¹_k=1(k) = Σⁿ_k=1(k) + (n+1)
When writing a proof by induction, we use this recursive relation.
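
As a quick numerical illustration (a sketch only, and not part of any proof), here is how the recursive relation builds each sum from the previous one. The function name `partial_sums` is made up for this example.

```python
def partial_sums(n_max):
    """Return the sums 1, 1+2, ..., 1+2+...+n_max, built with the
    recursion: sum up to n+1 = (sum up to n) + (n+1)."""
    sums = []
    total = 0
    for n in range(1, n_max + 1):
        total = total + n  # previous sum plus one additional term
        sums.append(total)
    return sums


sums = partial_sums(5)
print(sums)  # [1, 3, 6, 10, 15]
```

Each entry is the previous entry plus one new term, which is exactly the relationship the proof below relies on.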

Example: Prove that Σⁿ_k=1(k) = n(n+1)/2 for n=1, 2, 3, ...

In the proof, we will write S(n) to represent the sentence: Σⁿ_k=1(k) = n(n+1)/2.

Proof:
(1) We first need to think about the sentence where n is replaced by 1. That is, we need to prove that Σ¹_k=1(k) = 1(1+1)/2.
Σ¹_k=1(k) = 1 (Interpretation of summation notation)
(We are creating a true statement involving the summation symbol that appears on the left side of the equation we are proving. The sequence of terms is 1, 2, 3, 4, 5, ..., and the sum starts and ends with the 1st term.)
1(1+1)/2 = 1(2)/2 = 1 (Algebra)
(We now create an equation involving the formula on the right side of the equation we are proving, and we just showed it has the same value as the summation symbol.)

So Σ¹_k=1(k) = 1(1+1)/2.
(We obtained evidence above that the summation and the formula both represented the same value, so we conclude that they are equal.)

(2) We now need to show that S(n) implies S(n+1) for n=1, 2, 3, .... We will assume S(n), i.e. Σⁿ_k=1(k) = n(n+1)/2. Using the recursion connecting the sums, we will show S(n+1), i.e., Σⁿ⁺¹_k=1(k) = (n+1)((n+1)+1)/2.

Assume    Σⁿ_k=1(k) = n(n+1)/2    for some n=1, 2, 3, ...
Σⁿ⁺¹_k=1(k) = Σⁿ_k=1(k) + (n+1). (Recursive relation on summation of terms a_k = k)
Σⁿ⁺¹_k=1(k) = n(n+1)/2 + (n+1)     (Substitution: summation replaced by formula)
Σⁿ⁺¹_k=1(k) = n(n+1)/2 + 2(n+1)/2     (Find common denominator)
Σⁿ⁺¹_k=1(k) = (n²+n+2n+2)/2     (Distribution and adding fractions)
Σⁿ⁺¹_k=1(k) = (n²+3n+2)/2     (Combining terms)
(Notice that each equation was based on the previous equation. The key step was when we replaced Σⁿ_k=1(k) by the formula n(n+1)/2 that was provided by the assumed hypothesis. At this stage of the proof, we have a formula representing the value of the summation symbol with n+1 that is the left hand side of the equation in the sentence S(n+1). We now need to check the formula that appears in the right hand side of our sentence.)
(n+1)((n+1)+1)/2 = (n+1)(n+2)/2    (Arithmetic: 1+1=2)
(n+1)((n+1)+1)/2 = [n(n+2)+1(n+2)]/2     (Distributive law)
(n+1)((n+1)+1)/2 = [n²+2n + n+2]/2     (Distributive law again.)
(n+1)((n+1)+1)/2 = [n²+3n+2]/2     (Combining terms)
(Notice that each equation was based on the previous equation. Notice that the second and third steps (distributive law) are the formal steps in the idea commonly taught as FOIL when multiplying two binomial expressions together. The key is that we took the formula with n replaced by n+1 of the right hand side and discovered that it equals the same formula that we found for the summation above.)
So we have Σⁿ⁺¹_k=1(k) = (n+1)((n+1)+1)/2.
Consequently, S(n) implies S(n+1) for n=1, 2, 3, ....

By the PMI (since S(1) is true and S(n) implies S(n+1)), we have S(n) is true for n=1, 2, 3, ...
♦
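
For readers who prefer typeset mathematics, here is one way the inductive step could be written in LaTeX. This is a sketch that assumes the `amsmath` package is loaded. It chains the equalities into a single display, whereas the proof above works on each side separately and then compares the results.

```latex
\begin{align*}
\sum_{k=1}^{n+1} k
  &= \sum_{k=1}^{n} k + (n+1)
     && \text{(recursive relation)} \\
  &= \frac{n(n+1)}{2} + (n+1)
     && \text{(assumed hypothesis $S(n)$)} \\
  &= \frac{n(n+1)}{2} + \frac{2(n+1)}{2}
     && \text{(common denominator)} \\
  &= \frac{n^2+3n+2}{2}
     && \text{(distribute and combine terms)} \\
  &= \frac{(n+1)(n+2)}{2}
     && \text{(factor)} \\
  &= \frac{(n+1)\bigl((n+1)+1\bigr)}{2}
     && \text{(since $n+2 = (n+1)+1$)}
\end{align*}
```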

Mathematical Induction (part 2)

My previous blog post introduced the basic idea of what mathematical induction is about. This post focuses more on the mechanics of writing a proof.

A proof is the mathematical form of argument or persuasion. A proof consists of a sequence of logical statements, each of which is shown to be a true sentence based only on information previous to that sentence. Examples of sentences that might appear in a proof are equations or inequalities for which there is a clear reason that they are true. They are never based on what we hope is true or will later show is true.

For example, suppose I needed to prove (x+1)²=x²+2x+1. In a proof, I cannot write down this equation as the first statement because it is not something I know (yet). Instead, I can write down equations known to be true based on basic principles:

(x+1)² = (x+1)(x+1) (meaning of power 2)
(x+1)(x+1) = x(x+1) + 1(x+1) (distributive law)
x(x+1) + 1(x+1) = x²+x + x+1 (distributive law again)
x²+x + x+1 = x²+2x+1 (collecting like terms)

Notice that each of the sentences is a statement of equality. Although they seem to suggest that the "=" sign is being used in a computational sense (i.e., it looks like it is being used to say "apply a rule"), we really are seeing these sentences as declaring that the result of applying a rule demonstrates that the sentence is actually true. (This is a subtle distinction that you need to wrestle with until it sinks in.) Technically, our proof is not yet complete. Each sentence on its own is a complete, true sentence. However, we need to end by stating that the sentence we were trying to prove is actually true:

(x+1)²=x²+2x+1 (equivalence of equal quantities, or substitution)
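
If you want to check the expansion mechanically, here is a small Python sketch. It represents a polynomial as a list of coefficients (lowest power first) and multiplies term by term, which is just the distributive law applied repeatedly. The helper `poly_mul` is a name I made up for this illustration.

```python
def poly_mul(p, q):
    """Multiply two polynomials given as coefficient lists, lowest degree first."""
    result = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            result[i + j] += a * b  # distributive law, one pair of terms at a time
    return result


x_plus_1 = [1, 1]  # represents 1 + x
print(poly_mul(x_plus_1, x_plus_1))  # [1, 2, 1], i.e. 1 + 2x + x^2
```

A computation like this agrees with the proof, but it does not replace it: the proof is the list of justified sentences above.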

The paragraphs above illustrate the basic form of a proof: sentences (equations) are listed as statements, each of which is demonstrated to be true. But so far, we have not dealt with the idea of implication.

Recall that the Principle of Mathematical Induction (PMI) involves verifying the two conditions:

(1) Show that the first statement in the chain, which we call S(1), is true.
(2) Show that if any single statement in the chain, which we call S(n), is true, then this implies the next statement, which will be S(n+1), is also true.

So the structure of every proof by induction involves two subproofs (showing S(1) is true; showing S(n) implies S(n+1)) followed by an application of the PMI. The subproofs are the method by which we verify that the two conditions of the PMI have been satisfied.

Here is what to look for in the subproofs.

(1) S(1) is almost always very easy to show. However, you still need to be clear that you follow the pattern of a proof described above.
(2) Showing S(n) implies S(n+1) will consist of assuming that S(n) is true for some n (don't forget that this is a specific but unspecified value, so you must leave n or k, depending on the label being used, as a symbol and not an actual number). Then the proof will almost always rely on some type of recursive relation between a quantity involved in the statement S(n) and a similar quantity in the statement S(n+1).
(3) After the subproofs are complete, you must invoke the PMI and then declare that the statements are true for every n=1, 2, 3, ....

Example: Given a sequence defined by x₁=3 and the recursive relation x_{k+1}=x_k+2 for k=1, 2, 3, ..., prove that x_k=1+2k for k=1, 2, 3, ....

Notice what the chain of statements we are trying to prove will be. The sentence S(k) is the statement x_k=1+2k, where x_k is the value of the sequence defined recursively, and 1+2k is just a formula involving k. It is very important to remember that the "=" sign in this equation is not defining the value of the sequence, but is simply stating that the two quantities (the sequence value and the formula value) happen to always be equal.

Everything in italics (here, the parenthetical remarks that sit on their own lines) is not part of the proof.

Proof:
(1) We first write a subproof that S(1) is true: x₁=1+2(1). To do this, we need to create a sequence of equations that we know are true based on the symbols themselves, and not the equation above.

x₁=3 (Given)
(We looked at the given information and saw this was provided.)
1+2(1) = 1+2 = 3 (Arithmetic)
(We needed an equation involving 1+2(1) so we wrote down an equation that was based on the rules of evaluating this formula.)
So x₁=1+2(1).   (Equivalence)
(Earlier, we showed x₁=3 and then we showed 1+2(1)=3, so this means the two quantities are equal. This ends the subproof since we just finished writing the statement corresponding to S(1) being true.)

(2) We next write a subproof that S(k) implies S(k+1) for k=1,2,3,... To do this, we will start by assuming S(k) is true for some unspecified k. Using the recursion to compute x_{k+1}, we will demonstrate that S(k+1) is also true. Note: S(k+1) is the statement x_{k+1}=1+2(k+1).

Assume x_k=1+2k is true for some k=1, 2, 3, ....
x_{k+1}=x_k+2 (Given recursive definition of sequence)
(The statement we are trying to prove involves the symbol x_{k+1}, so we need to create equations based on that symbol.)
x_{k+1}=(1+2k)+2 (Substitution: x_k=1+2k from assumed hypothesis)
(This is the key step: we use the recursive connection between x_{k+1} and x_k to establish a formula for x_{k+1}.)
x_{k+1}=3+2k    (Algebra)
(We now have a simple formula for x_{k+1}, and now we turn our attention to the other half of the statement S(k+1), namely the formula 1+2(k+1).)
1+2(k+1) = 1 + 2k + 2   (Distributive law)
(We are creating a true equation involving the formula by considering the results of applying mathematical laws.)
1+2(k+1) = 3+2k    (Algebra from above: 1+2=3)
(Our target should always be that the formula with k+1 in place of k will match whatever we found by the recursion to compute a value for x_{k+1}.)
So x_{k+1}=1+2(k+1)    (Equivalence or Substitution)
That is, S(k) implies S(k+1) for k=1, 2, 3,....
(This ends the second subproof because we just finished writing the statement we were trying to prove in this part. We are now ready to invoke the PMI.)

Since S(1) is true and S(k) implies S(k+1) for k=1, 2, 3, ..., the Principle of Mathematical Induction guarantees S(k) is true for every k=1, 2, 3, .... That is, x_k=1+2k for k=1, 2, 3, ...
♦
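
To see the claim in action, here is a short Python sketch that generates the sequence from its recursive definition (first term 3, each later term equal to the previous term plus 2) and compares each value with the formula 1+2k. The function name `sequence_values` is my own, and the sketch assumes k_max is at least 1. Checking a few terms like this is evidence, not a proof; the proof is the induction argument above.

```python
def sequence_values(k_max):
    """Return [x_1, ..., x_{k_max}] using x_1 = 3 and x_{k+1} = x_k + 2."""
    values = [3]  # x_1 is given
    while len(values) < k_max:
        values.append(values[-1] + 2)  # the recursive relation
    return values


values = sequence_values(6)
print(values)  # [3, 5, 7, 9, 11, 13]

# Compare each recursively computed value with the formula 1 + 2k.
print(all(x == 1 + 2 * k for k, x in enumerate(values, start=1)))  # True
```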

Mathematical Induction

I actually posted on this very topic a few years ago. However, I have refined how I think about doing proofs by mathematical induction. And so I am writing one more time.

First, you might find it interesting to look at the Wikipedia article on mathematical induction.

The Principle of Mathematical Induction (PMI) is an axiom that describes the set of natural numbers, which is N = {1, 2, 3, 4, ...}. From one point of view, the PMI says that if S is a set that (1) contains the number 1 and (2) guarantees that whenever any number n is in the set, then n+1 is also in the set, then the set S contains all of N. From the perspective of proofs, however, we are really interested in showing that an infinite chain of logical statements is true.

Before we proceed with the idea of the PMI, let me be clear about a logical statement. A logical statement is a well-constructed sentence that is definitively true or false. Two well-known but easily misunderstood examples of logical statements are equations and inequalities. An equation "A=B" is a logical statement that declares two quantities (A and B) are equal (have the same value). An inequality "A<B" is a logical statement that declares one quantity (A) is less than another (B). Such a statement may be false, but the statement itself is logically complete. (Here logical does not mean 'makes sense' but it means 'can be decided between True or False'.)

Related to this issue is a common misunderstanding by students that "=" is like an operation that means "has the value" or "find the answer" as it is often used on a calculator. This is especially important in a proof, where each statement (usually an equation) must be clearly true rather than a statement of what you hope or what you are trying to calculate.

Okay, back to the Principle of Mathematical Induction. This principle is about dealing with an infinite chain of logical sentences. For example, consider the following chain of statements. S(1) is the label for the first sentence, S(2) is the label for the second sentence, and so on:

S(1):     1 = 1(2)/2
S(2):     1+2 = 2(3)/2
S(3):     1+2+3 = 3(4)/2
S(4):     1+2+3+4 = 4(5)/2
S(5):     1+2+3+4+5 = 5(6)/2
S(6):     1+2+3+4+5+6 = 6(7)/2
...
(the pattern continues so that if n is any number n=1,2,3,..., we have):
S(n):     1+2+3+...+n = n(n+1)/2

On the left of each sentence is a summation. On the right of each sentence is a simple formula involving n. Using summation notation, we would have written this sentence:
S(n):    Σⁿ_k=1 k = n(n+1)/2

The pattern described by S(n) looks like a single sentence, but it is important to remember that it is describing the entire infinitely long chain of logical sentences. If you add up the values on the left side of any single sentence and compare it to the value of the formula on the right, you will see that the answers are the same. But this only verifies the formulas that you actually check.
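
Here is a minimal Python sketch of that kind of checking. The function `statement_holds` (a name I made up) evaluates the single sentence S(n) by computing both sides and comparing them.

```python
def statement_holds(n):
    """Evaluate the sentence S(n): 1 + 2 + ... + n = n(n+1)/2."""
    left = sum(range(1, n + 1))
    right = n * (n + 1) // 2  # n(n+1) is always even, so integer division is exact
    return left == right


checked = [n for n in range(1, 101) if statement_holds(n)]
print(len(checked))  # 100
```

All 100 sentences check out, but this says nothing about S(101) or anything beyond it. No finite amount of checking covers the whole chain, which is why we need a different tool.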

The Principle of Mathematical Induction is a tool to prove that every sentence in the entire chain is true. The PMI is much like an infinite chain of dominoes (see the earlier linked Wikipedia article). If you try to knock the dominoes down one at a time, you'll never finish. But if you show that knocking any single domino down will guarantee the next domino also falls and you show that you can knock down the first domino, then this is enough to guarantee that the entire chain will fall down.

So here is the PMI:

Suppose S(n), n=1, 2, 3, ..., is a chain of logical statements. If (1) S(1) is true and (2) S(n) implies S(n+1) for any n=1, 2, 3, ..., then S(n) is true for every n=1, 2, 3, ....

Notice that there are two conditions to use PMI:

(1) We must verify that S(1) (the first sentence) is true.
(2) We must show that the sentence S(n+1) is true whenever the previous sentence S(n) is true.

A proof by induction is the process whereby we verify these two conditions and then apply the PMI to conclude that the entire chain is true.

To learn more about writing the proofs and an example with commentary, please continue by reading the next blog post.

Tuesday, March 8, 2011

How We Learn Mathematics

I was reading the following paper: D. Breidenbach, E. Dubinsky, J. Hawks, and D. Nichols, "Development of the Process Conception of Function," Educational Studies in Mathematics, 23: 247-285, 1992.

Quote Dubinsky (1989): "A person's mathematical knowledge is her or his tendency to respond to certain kinds of perceived problem situations by constructing, reconstructing and organizing mental processes and objects to use in dealing with the situations."

"Applying this point of view to mathematics (or any other subject) consists of determining the nature of the specific processes and objects that are constructed and how they are organized when one studies mathematics"

Ways of thinking about functions:

prefunction - does not understand any real ways of using function concepts
action - repeatable mental or physical manipulation (e.g., plug in numbers and calculate); static; one step at a time
process - think of function as a single dynamic transformation

I then found another article: A. Sfard and L. Linchevski, "The Gains and the Pitfalls of Reification: The Case of Algebra," Educational Studies in Mathematics, 26 (2/3), 191-228, 1994 [Learning Mathematics: Constructivist and Interactionist Theories of Mathematical Development]

This article proceeds with the view that mathematical constructs have a dual nature, as a process or as an object. That is, we can conceive of things operationally (process) or structurally (object). Historical examples include the expansion of number systems: positive to negative (operational: subtraction as adding a negative to structural: negative numbers as objects), and real to complex (i=sqrt(-1) as an operational convenience to an actual object).

An included reference suggests finding another article: Kieran, C.: 1992, 'The learning and teaching of school algebra', in D. A. Grouws (ed.), The Handbook of Research on Mathematics Teaching and Learning, Macmillan, New York, pp. 390-419. I'll have to see if I can find this one, as it is cited for the sentence, "[reification] was also used to introduce some order into the quickly growing bulk of findings about algebraic thinking."

Interesting phrase: "the ability to grasp the structural aspect is not easy to achieve" and "those crucial junctions in the development of mathematics where a transition from one level to another takes place are the most problematic."

Another interesting way to think about how mathematics is organized: (1) Logical, or the way it fits together; (2) Historical, or the way in which it was developed; and (3) Cognitive, or the processes in which people learn.

Modes of Algebra
1.1) Algebra as Generalized Arithmetic: The Operational Phase
-- solve for the unknown, but not using symbols (grade school algebraic thinking)
-- rhetoric algebra
-- principally reversing processes
1.2) Algebra as Generalized Arithmetic: The Structural Phase
1.2.1) algebra of a fixed value (unknown)
-- Notational convenience, but treat variable as a fixed value
:::: becomes a mental challenge to think of formula as both a process and result
:::: example given: 2+3 represents process, 5 represents result. But x+3 represents both, no separate "result"
:::: compare to the challenges of new number types required to think about division, subtraction, and extracting square roots
** Nice comment: "Once we manage to overcome this difficulty, it is quickly forgotten. ... Our eyes are easily blinded by habit and by our own ontological beliefs. Nevertheless, much evidence for the difficulty of reification may also be found in today's classroom, provided those who listen to the students are open-minded enough to grasp the ontological gap between themselves and the less experienced learners."
1.2.2) Functional algebra (of a variable)
:::: View formula as object
:::: Parameters represented as symbols not numbers.
2) Abstract Algebra

The authors give examples of interview questions. Students at early stages of thinking think about formulas as recipes for computations (process) but do not perceive them as valid objects. "The equality sign is interpreted as a 'do something signal' (Behr et al 1976; Kieran 1981)"

Here's something I see all the time in calculus classes: "It [the = symbol] serves here as a 'run' command. When treated in this way, the equality symbol looses [sic] the basic characteristics of an equivalence predicate: it stops being symmetrical or transitive. Indeed, young children seem to have no qualms about solving word problems with the help of a chain of non-transitive equalities. For instance, when asked 'How many marbles do you have after you win 4 marbles 3 times and 2 marbles 5 times?', the child would often write: 3*4=12+5*2=12+10=22."
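
To make the problem with that chain concrete, here is a small Python sketch (my own illustration, not from the article). It evaluates each expression in the child's chain and then tests whether neighboring expressions are really equal, which is what "=" is supposed to claim.

```python
# The four expressions in the chain 3*4 = 12+5*2 = 12+10 = 22, evaluated separately.
steps = [3 * 4, 12 + 5 * 2, 12 + 10, 22]
print(steps)  # [12, 22, 22, 22]

# Read as statements of equality, every neighboring pair would have to match.
chain_is_true = all(a == b for a, b in zip(steps, steps[1:]))
print(chain_is_true)  # False, because the first link claims 12 = 22

# The intended computation, written as one true equation: 3*4 + 5*2 = 22.
total = 3 * 4 + 5 * 2
print(total)  # 22
```

The final answer is right, but the first link in the chain is a false sentence.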

Equations of the form 2x-3 = 11 can be interpreted as a formula whose result is 11 (which can be solved by inverse operations); equations of the form 2x-3=5x-9 appear to be two different formulas, and inverse operations do not make sense.
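
Here is a rough Python sketch of the difference between the two kinds of equations. Both function names are hypothetical and chosen only for this illustration. The first undoes the operations in reverse order, which works when x appears on one side only. The second has to treat both sides as objects and collect the x terms before anything can be undone.

```python
from fractions import Fraction


def solve_by_inverse_operations(a, b, result):
    """Solve a*x + b = result by undoing "+ b" and then undoing "times a"."""
    return Fraction(result - b, a)


def solve_with_x_on_both_sides(a, b, c, d):
    """Solve a*x + b = c*x + d (with a != c) by first collecting x on one side."""
    return Fraction(d - b, a - c)


print(solve_by_inverse_operations(2, -3, 11))     # 7
print(solve_with_x_on_both_sides(2, -3, 5, -9))   # 2
```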
