Walton's JMU Math Blog

On the Path to Accessibility: Handwritten Solutions (part 1)

2025-05-02T08:57:00.000-07:00

The Challenge: Create Accessible Resources

I work at a state university in the Commonwealth of Virginia. [Note that this document represents my personal experiences and opinions and is not meant to imply any endorsement by my university or the state—I'm just stating that for context.] Last year, the US federal government published final rulings on the ADA requirements for accessibility for web content on government websites, which includes state universities.

As a mathematician, I am used to posting documents in PDF form. Sometimes these are scans of handwritten solutions. Or they could be documents prepared using LaTeX. It is my understanding that both of these are problematic.

In this post, I want to explore dealing with handwritten solutions. The traditional work is just images. This would not meet the Perceivable or Understandable requirements. Is it possible for an instructor to start with their handwritten work and use current tools to convert it to an electronic representation that meets compliance?

Step 1: Create an example

I wanted to do a hands-on exploration. So I created a typical algebra exercise solution—solving a quadratic equation using factoring—and scanned it to a single page PDF. I also converted that image to a PNG file.

Step 2: Try AI

How well will a GPT AI model do at extracting my content? ChatGPT would not process the PDF version; it complained that the file contained no text. When I attached the PNG image to my ChatGPT prompt, I got a usable result back. I repeated the exercise with Microsoft Copilot, a resource that the university has licensed (I don't know the details of what this license gives me).

ChatGPT

Here is the prompt I used for ChatGPT (GPT-4-turbo):

Convert this image to a markdown file, using latex math formulas for the math that appears in the document.

And here is the resulting markdown content:

1. Solve \( 3x^2 - 5x - 2 = 0 \)

Look for factors of \( (3)(-2) = -6 \) that add to -5.
\[
-6 + 1 = -5 \quad \text{and} \quad (-6)(1) = -6
\]

Rewrite \( 3x^2 - 5x - 2 = 3x^2 - 6x + 1x - 2 \) and factor by grouping:

\[
(3x^2 - 6x) + (x - 2) = 0 \\
3x(x - 2) + (x - 2) = 0 \\
(3x + 1)(x - 2) = 0
\]

Because this is now in the factored form \( AB = 0 \), we can solve each factor as an independent equation:

\[
3x + 1 = 0 \quad \Rightarrow \quad 3x = -1 \quad \Rightarrow \quad x = -\frac{1}{3}
\]

\[
x - 2 = 0 \quad \Rightarrow \quad x = 2
\]

The solutions are \( x = -\frac{1}{3} \) and \( x = 2 \).

I think ChatGPT did quite a good job. I did notice that it made some editorial decisions. In the first step of the solution, it interpreted the two statements about the sum and product of the factors as a single display equation with the word "and" separated as text. That seems reasonable.

Then at the last major step, I solved each factor equal to zero. On paper, I did this in two columns, with each step a line below. ChatGPT decided to interpret that as two separate display statements (one for each column) using \Rightarrow as a separator between the steps. Again, I think this is a good choice.

Copilot

Here is the prompt I gave Copilot (GPT-4):

The attached image is a scan of a solution to a math exercise. Convert the image to a markdown file using LaTeX representations for each formula that appears in the document.

And here is the resulting markdown:

1. Solve \(3x^2 - 5x - 2 = 0\)

Look for factors of \((3)(-2) = -6\) that add to \(-5\):
- \(6 + (-1) = 5\)
- \((-6)(1) = -6\)

Rewrite \(3x^2 - 5x - 2 = 3x^2 - 6x + 1x - 2\) and factor by grouping:
\[ (3x^2 - 6x) + (1x - 2) = 0 \]
\[ (3x(x-2)) + (1(x-2)) = 0 \]
\[ (3x+1)( x-2 )=0 \]

Because this is now in the factored form \(AB=0\), we can solve each factor as an independent equation:
\[
\begin{cases}
3x+1=0 \\
x-2=0
\end{cases}
\]

Solving these equations:
\[
\begin{cases}
x=-\frac{1}{3} \\
x=2
\end{cases}
\]

The solutions are \( x=-\frac{1}{3} \) and \( x=2 \).

I immediately saw two problems with this translation.

Where ChatGPT took that pair of equations dealing with factors of -6 and interpreted it as a single display equation, Copilot decided to reinterpret my work. The equation that should have said "-6+1 = -5" instead became a bullet item with a different equation, "6+(-1)=5".

Also, when solving the individual factors as equations, Copilot stated the original equations and then stated the answers. It skips the intermediate step that I had given "3x=-1" when solving "3x+1=0".

Conclusions

I think using GPT models to convert the images seems fairly promising, but it is clear that it will take proofreading to make sure that the AI didn't introduce changes to the solutions. I am also somewhat concerned about the tendency for the AI to try to restructure how the flow is presented.
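
Part of that proofreading can be mechanized. The following is a minimal sketch, assuming the instructor is willing to type the key equations from the paper copy as plain strings; the helper names `normalize` and `missing_steps` and the `EXPECTED` checklist are made up for this illustration. It strips whitespace and reports which expected steps never appear verbatim in a transcription, here an excerpt of the Copilot output quoted above.

```python
def normalize(text: str) -> str:
    """Remove all whitespace so '3x = -1' and '3x=-1' compare equal."""
    return "".join(text.split())


def missing_steps(transcription: str, expected_steps: list[str]) -> list[str]:
    """Return the expected steps that never appear verbatim in the transcription."""
    haystack = normalize(transcription)
    return [step for step in expected_steps if normalize(step) not in haystack]


# Steps typed by hand from the paper solution (hypothetical checklist).
EXPECTED = ["-6 + 1 = -5", "(-6)(1) = -6", "3x = -1", r"x = -\frac{1}{3}", "x = 2"]

# Excerpt of the Copilot markdown quoted earlier in this post.
COPILOT = r"""
- \(6 + (-1) = 5\)
- \((-6)(1) = -6\)
\[ (3x+1)( x-2 )=0 \]
\begin{cases}
3x+1=0 \\
x-2=0
\end{cases}
\begin{cases}
x=-\frac{1}{3} \\
x=2
\end{cases}
"""

if __name__ == "__main__":
    for step in missing_steps(COPILOT, EXPECTED):
        print("not found:", step)
```

This is a crude substring test: it can miss a step that was written in an equivalent form, and a short step such as "x = 2" could match inside a longer expression. It narrows down where to look; it does not replace reading the output.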

Step 3: Convert to PreTeXt

Markdown alone is not going to be accessible. But it should be possible to generate an HTML version of the document using the PreTeXt document system. Since I already had ChatGPT create reasonable markdown, I asked it to finish the process and convert the markdown to PreTeXt.

It did okay, but the LaTeX formatted equations were left in LaTeX style using \( and \) or \[ and \]. So I had to repeat my prompt, specifying that inline formulas should use <m> tags while display formulas should use <me> tags. This wasn't quite right either, so I next had to add that multi-line equations should use <md> tags with an <mrow> tag around each separate line.
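
The delimiter rules in those follow-up prompts are mechanical enough to script. The following is a rough sketch, not part of the workflow described here: the function name `latex_to_pretext` is made up, and it only handles the \( \), \[ \], and \\ conventions seen in the ChatGPT markdown above. It does not touch environments such as cases, and it does not replace characters like < or & inside math, which PreTeXt expects to be written with macros.

```python
import re


def latex_to_pretext(text: str) -> str:
    """Rewrite \\( \\) and \\[ \\] math delimiters as PreTeXt math tags."""

    def display(match: re.Match) -> str:
        # Rows of a display block are separated by a double backslash.
        rows = [row.strip() for row in match.group(1).split("\\\\") if row.strip()]
        if len(rows) == 1:
            return f"<me>{rows[0]}</me>"
        body = "\n".join(f"  <mrow>{row}</mrow>" for row in rows)
        return f"<md>\n{body}\n</md>"

    def inline(match: re.Match) -> str:
        return f"<m>{match.group(1).strip()}</m>"

    # Display math first, then inline math. Function replacements are used
    # so that backslashes in the LaTeX (e.g. \frac) are copied literally.
    text = re.sub(r"\\\[(.*?)\\\]", display, text, flags=re.DOTALL)
    return re.sub(r"\\\((.*?)\\\)", inline, text, flags=re.DOTALL)


if __name__ == "__main__":
    sample = "Solve \\( 3x^2 - 5x - 2 = 0 \\)\n\\[\n3x(x - 2) + (x - 2) = 0 \\\\\n(3x + 1)(x - 2) = 0\n\\]"
    print(latex_to_pretext(sample))
```

Doing this step with a script rather than a second AI prompt means the formulas themselves are copied character for character, so only the first transcription needs proofreading.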

It gave me back a PreTeXt markup of an exercise with a solution.

<exercise>
<title>Solve a Quadratic Equation by Factoring</title>
<statement>
Solve the equation <m>3x^2 - 5x - 2 = 0</m>.
</statement>
<solution>

Look for factors of <m>(3)(-2) = -6</m> that add to <m>-5</m>.

<ul>
<li><m>-6 + 1 = -5</m></li>
<li><m>(-6)(1) = -6</m></li>
</ul>


Rewrite <m>3x^2 - 5x - 2</m> as <m>3x^2 - 6x + x - 2</m>, and factor by grouping:


<md>
<mrow>(3x^2 - 6x) + (x - 2) = 0</mrow>
<mrow>3x(x - 2) + (x - 2) = 0</mrow>
<mrow>(3x + 1)(x - 2) = 0</mrow>
</md>


Because this is now in the factored form <m>AB = 0</m>, we can solve each factor as an independent equation:


<ul>
<li>
<me>3x + 1 = 0</me>
<me>3x = -1</me>
<me>x = -\frac{1}{3}</me>
</li>
<li>
<me>x - 2 = 0</me>
<me>x = 2</me>
</li>
</ul>


The solutions are <m>x = -\frac{1}{3}</m> and <m>x = 2</m>.

</solution>
</exercise>

This was a pretty good conversion. Notice, however, that the solution of the separate factors is no longer a single display equation but a series of display equations in a bullet list. Curiously, this was not done using the <md> tag.

When I asked ChatGPT to finish the work by wrapping the content in <handout>, it generated its own title and introduction to the exercise.

<handout xml:id="quadratic-factoring-example" xmlns="http://pretextbook.org">
<title>Solving a Quadratic Equation by Factoring</title>

<introduction>
This example demonstrates how to solve a quadratic equation using factoring and grouping techniques.
</introduction>

<exercise>
<title>Solve a Quadratic Equation by Factoring</title>
<statement>
Solve the equation <m>3x^2 - 5x - 2 = 0</m>.
</statement>
<solution>

Look for factors of <m>(3)(-2) = -6</m> that add to <m>-5</m>.

<ul>
<li><m>-6 + 1 = -5</m></li>
<li><m>(-6)(1) = -6</m></li>
</ul>


Rewrite <m>3x^2 - 5x - 2</m> as <m>3x^2 - 6x + x - 2</m>, and factor by grouping:


<md>
<mrow>(3x^2 - 6x) + (x - 2) = 0</mrow>
<mrow>3x(x - 2) + (x - 2) = 0</mrow>
<mrow>(3x + 1)(x - 2) = 0</mrow>
</md>


Because this is now in the factored form <m>AB = 0</m>, we can solve each factor as an independent equation:


<ul>
<li>
<me>3x + 1 = 0</me>
<me>3x = -1</me>
<me>x = -\frac{1}{3}</me>
</li>
<li>
<me>x - 2 = 0</me>
<me>x = 2</me>
</li>
</ul>


The solutions are <m>x = -\frac{1}{3}</m> and <m>x = 2</m>.

</solution>
</exercise>
</handout>

Will it compile?

No. PreTeXt requires a project, so this handout has to be embedded within a larger project. The opening <handout> tag should not include the xmlns namespace attribute. Also, the "\frac" commands had been mangled because "\f" was interpreted as a special character. Once I created a placeholder project and fixed the error with "\frac", the document compiled.
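
The "\f" problem is typical of what happens when LaTeX passes through a layer that treats backslash sequences as escapes. Where in this workflow that happened is not known; the Python sketch below only illustrates the mechanism and one possible repair, and the function name `restore_frac` is made up for the example.

```python
# In an ordinary Python string literal, "\f" is the form-feed character
# (U+000C), so "\frac" silently becomes form feed + "rac".
# A raw string (r"...") keeps the backslash.
mangled = "x = -\frac{1}{3}"
intact = r"x = -\frac{1}{3}"


def restore_frac(text: str) -> str:
    """Put back the backslash-f that was swallowed into a form feed."""
    return text.replace("\x0c", "\\f")


if __name__ == "__main__":
    print(repr(mangled))                    # the repr shows the \x0c form feed
    print(restore_frac(mangled) == intact)  # compares the repaired text to the original
```

The repair is only safe if the text contains no genuine form feeds, which is a reasonable assumption for a short math handout.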

Here is a screenshot of the final result. This is listed as "Chapter 3" because I put this in a default PreTeXt "course" project under a chapter for solutions.

I am now wondering how this workflow could be simplified and made more reliable.

Do you have other ideas? Leave them in the comments.

Mastery and Canvas

2020-02-02T18:23:00.001-08:00

Background

Last semester, I experimented with allowing test corrections on every exam. I appreciated the opportunity to allow students to come back from mistakes. At the same time, I did not necessarily feel that I was seeing the students really learn the material. During final exam week, I participated in an AIM workshop on "Interactive assessments in open source textbooks." Some of the discussion centered around how such resources could be leveraged in mastery-based teaching. Long story short, I decided to jump all in.

My Mastery Strategy

What is my strategy? I give a 25-minute mastery quiz once a week. It seems that we're getting about three or four new mastery outcomes for each week of class. The quiz includes questions for those outcomes as well as new versions of questions relating to outcomes assessed in earlier weeks. I adopted a four-level scoring system.

M = "mastered". The student provided a solution that both answered the question (essentially) correctly and conveyed a clearly communicated solution.
R = "resubmit". The student provided a solution that was nearly correct or nearly conveyed a clear solution, but there were some items that were unclear. I tend to think the student probably has the idea mastered but needs to correct a mistake or clarify their communication.
S = "shows progress". The student is making head-way but has at least one significant issue such that I'd like them to attempt a new version later.
L = "lacks evidence". Not enough work was shown to evaluate their understanding or the work shows a beginning development of understanding.

Once a student scores "M" on an objective, they do not need to repeat it again. If they score "R", then within a week they need to submit the corrections and I can choose whether the update convinces me they are either "M" or "S". For scores "S" or "L", they will need to try another question on a later week to demonstrate mastery.

Setting Up Mastery Tracking on Canvas

Our university uses the Canvas learning management system. Having been referred to a video by Sharona Krinsky about how to track progress in learning outcomes on Canvas, I decided to give that a try. The rest of this post is really about what I've learned so far in doing this.

First, it was a pain to set up the learning outcomes. I realized very quickly that I would not be doing it through the built-in system, because Canvas requires way too many clicks. Each outcome needs several different descriptions as well as a grading point scheme associated with my M/R/S/L system above. With several dozen planned learning outcomes, entering them through Canvas's interface would have been insane.

Canvas does allow you to import your objectives from a CSV file. However, you can't export your existing work as a CSV file, edit the file, and then reimport the new version. You have to follow some directions to figure out the required CSV format.
I found the instructions incomplete on the format of the columns for the mastery ratings and the points required to achieve mastery. If mastery can be achieved with multiple ratings, I did not find how to identify the minimum score for mastery. The instructions say there is a column before the first rating for this, but I only got errors when I tried. I must have been doing something else wrong.

One additional item that caught me was the vendor_guid field. The instructions make it seem that these can be any unique items in the .csv file, such as going through the alphabet. However, I learned that they perhaps need to be unique for the user, at least for the outcome groups. When I used the same vendor_guid entries of "a", "b", and "c", etc., for groups, I had an import error saying that the field had been used previously. I eventually decided on creating a short id for the groups and outcomes and then generated a vendor_guid field as a concatenation of the course id with the group and outcome ids.

Here is a portion of my import .csv file.

vendor_guid,object_type,title,description,display_name,calculation_method,calculation_int,workflow_state,parent_guids,ratings,,,,,,,,,
M234Sp20_D,group,G2: Derivatives,Derivatives and Differentiation,D,,,active,,,,,,,,,,,
M234Sp20_L,group,G3: Limits,Limits and Continuity,L,,,active,,,,,,,,,,,
M234Sp20_A,group,G4: Applications,Applications of Derivatives,A,,,active,,,,,,,,,,,
M234Sp20_T,group,G5: Trigonometry,Trigonometry,T,,,active,,,,,,,,,,,
M234Sp20_I,group,G6: Integration,Antiderivatives and Integration,I,,,active,,,,,,,,,,,
M234Sp20_D1,outcome,D1: Derivative from Graph,Find the derivative of a function defined by a graph by approximating the slope of a tangent line.,D1,n_mastery,1,active,M234Sp20_D,4,Mastery,3,Resubmit,2,Shows Progress,1,Lacks Evidence,0,No Work
M234Sp20_D2,outcome,D2: Numerical Derivative,Use the definition of the derivative to approximate the derivative of a function numerically.,D2,n_mastery,1,active,M234Sp20_D,4,Mastery,3,Resubmit,2,Shows Progress,1,Lacks Evidence,0,No Work
M234Sp20_D3a,outcome,D3a: Definition of Derivative: Polynomial,Apply the definition of the derivative to find the derivative of a quadratic polynomial.,D3a,n_mastery,1,active,M234Sp20_D,4,Mastery,3,Resubmit,2,Shows Progress,1,Lacks Evidence,0,No Work
M234Sp20_D3b,outcome,D3b: Definition of Derivative: Algebraic,Apply the definition of the derivative to find the derivative of a simple algebraic function.,D3b,n_mastery,1,active,M234Sp20_D,4,Mastery,3,Resubmit,2,Shows Progress,1,Lacks Evidence,0,No Work

I added sortable elements at the start of the title field because Canvas shows the entries sorted alphabetically in the Outcomes menu. This also plays a critical role later in parsing the exported data so that I can quickly identify outcomes in a script—the exported data uses the title and not the shorter display name. For example, my first differentiation outcome is "D1: Derivative from Graph" and the "D1:" is the prepended outcome code to force sort order.
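
Rows in this layout do not have to be typed by hand. The following is a minimal sketch of generating them, assuming the column layout of the sample file above; the helper names (`pad`, `group_row`, `outcome_row`, `build_csv`) are made up for this example, and it has not been checked against Canvas's importer beyond reproducing the sample rows.

```python
import csv
import io

COURSE_ID = "M234Sp20"  # short course id used as the vendor_guid prefix

# (points, label) pairs for the rating scale, highest first.
RATINGS = [(4, "Mastery"), (3, "Resubmit"), (2, "Shows Progress"),
           (1, "Lacks Evidence"), (0, "No Work")]

HEADER = ["vendor_guid", "object_type", "title", "description", "display_name",
          "calculation_method", "calculation_int", "workflow_state",
          "parent_guids", "ratings"]
WIDTH = len(HEADER) - 1 + 2 * len(RATINGS)  # 19 columns, as in the sample file


def pad(row):
    """Pad a row with empty cells so every line has the same column count."""
    return row + [""] * (WIDTH - len(row))


def group_row(group_id, title, description):
    return pad([f"{COURSE_ID}_{group_id}", "group", title, description,
                group_id, "", "", "active"])


def outcome_row(group_id, outcome_id, name, description):
    # vendor_guid = course id + outcome id; the title is prefixed with the
    # outcome code so that Canvas sorts it and a script can parse it later.
    row = [f"{COURSE_ID}_{outcome_id}", "outcome", f"{outcome_id}: {name}",
           description, outcome_id, "n_mastery", 1, "active",
           f"{COURSE_ID}_{group_id}"]
    for points, label in RATINGS:
        row += [points, label]
    return row


def build_csv(rows):
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(pad(list(HEADER)))
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    print(build_csv([
        group_row("D", "G2: Derivatives", "Derivatives and Differentiation"),
        outcome_row("D", "D1", "Derivative from Graph",
                    "Find the derivative of a function defined by a graph "
                    "by approximating the slope of a tangent line."),
    ]))
```

Keeping the course id in one constant makes it easy to produce fresh, non-colliding vendor_guid values for the next term.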

Scoring Assessments

Well, now I have outcomes coded in Canvas and have written an assessment quiz. After the students have completed their work, it is my turn to evaluate their solutions and record their mastery ratings.
I create an Assignment in Canvas for the week. I make the assessment worth 1 point so that a score of 1 means the student took the assessment and 0 means they were absent. I'm not actually using the points for any purpose other than as a simple flag.
Canvas has a SpeedGrader that would work really well if students uploaded their own work. Unfortunately, Canvas does not support instructors uploading student work and assigning it to the individual students. I wish it did, because that would make it so I could type comments and get feedback to students more quickly. So I am currently marking my feedback on the paper copies.
I do, however, still use SpeedGrader because I can turn on Rubrics. I create a rubric for the assignment and import all of the outcomes that appear on the quiz. For later quizzes, I can find that same rubric. You have to be sure that it comes from the same class so that it doesn't import another class's outcomes. Then for later weeks, I only need to add new outcomes for that week.
I do wish there were keyboard shortcuts for entering data on the rubric. You have to tab too many times to make keyboard entry efficient. I prefer clicking on the ratings, but wish I could just type the scores associated with the ratings and tab to the next outcome.

Viewing Student Progress

My biggest gripe is the nearly useless formatting of the student progress reports. The student view lets students see their progress by group, with outcomes sorted alphabetically.

The instructor view is not as nice. The Gradebook does have a Learning Mastery view. Unfortunately, you can only view 18 students at a time and I have no idea what order Canvas is using to sort the outcomes. You can move them around on the screen but it doesn't save your new order.

As I am including more and more outcomes as the weeks pass, I realized I needed a custom way to look at student progress. Canvas allows you to export the Learning Mastery report from the Gradebook as a .csv file. I didn't find the format to be much better than the original report.

Fortunately, once the data is free from Canvas, I can use other tools to parse the data and generate anything I want. I took some time to create a script in R that imports the data and then generates a custom progress report.

The CSV file has two columns per outcome and one row per student. There is not an obvious order to how students are sorted or how outcomes are listed. So part of the script's task is to sort students into a meaningful order and to arrange outcomes in a reasonable order. I wrote the script to identify outcome groups and outcomes within the groups. The script includes a structure so that I only include outcomes that have been assessed so far in the term and does not report about future outcomes.
Here is sample output for my Test Student, who has been marked with mastery on only two outcomes.

Test Student

Cross-Over CO1:
Cross-Over CO2a:
Cross-Over CO2b: Mastery Shown
Cross-Over CO3: Mastery Shown
Cross-Over CO4:

Derivatives D1:
Derivatives D2:
Derivatives D3a:
Derivatives D3b:

The script itself is available from my GitHub repository as mastery-progress.R. Please feel free to use and adapt the script in any way that you desire.
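
The R script is not reproduced here. As an illustration only, the ordering step can be sketched in Python: it relies on the code prepended to each outcome title ("D1:", "D3a:", "CO2b:") and assumes each exported column heading begins with that title. The names `CODE`, `outcome_key`, and `ordered` are made up for this sketch, and sorting groups alphabetically by their code letters is a simplification.

```python
import re

# Outcome codes such as "D1", "D3a", or "CO2b", followed by a colon.
CODE = re.compile(r"^([A-Z]+)(\d+)([a-z]?):")


def outcome_key(title: str) -> tuple[str, int, str]:
    """Split 'D3a: ...' into ('D', 3, 'a') so that D2 sorts before D10."""
    match = CODE.match(title)
    if match is None:
        raise ValueError(f"no outcome code at start of {title!r}")
    group, number, suffix = match.groups()
    return group, int(number), suffix


def ordered(titles):
    """Return the titles sorted by group, then number, then letter suffix."""
    return sorted(titles, key=outcome_key)


if __name__ == "__main__":
    for title in ordered(["D3b: Definition of Derivative: Algebraic",
                          "D1: Derivative from Graph",
                          "D3a: Definition of Derivative: Polynomial",
                          "D2: Numerical Derivative"]):
        print(title)
```

Parsing the number as an integer matters once a group has ten or more outcomes, because a plain alphabetical sort would put "D10" before "D2".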

Oh yeah! I've got infinity plus one!

2013-01-03T13:38:00.000-08:00

Earlier this week, my brother Chris sent me a Facebook question posed by my preschool-aged nephew: "What is infinity minus infinity? What is zero minus infinity?" I'm sure most of us at some point engaged in the one-upmanship game of making bigger numbers than our sibling or friend.

Chris: 20

Brian: 21

Chris: 100

Brian: 101

Chris: 1000

Brian: 1001

And then your brother makes the leap!

Chris: Infinity!

Brian: Infinity + 1!

Chris: That's just infinity. There's nothing bigger than infinity. I win!

No fair! What's up with this? Up to the point where we leap into the infinite, we are dealing with numbers. As it happens, infinity is not a number. Some people like to say, "Infinity is not a number, it's a concept." But this isn't very helpful at all. What good does it do to say it's a concept? What is a concept anyway? Or for that matter, what is a number?

Another of my Facebook friends relatively recently introduced me to a TED video that explores the cardinality idea of infinity. That is, cardinality is about sizes of sets, and natural numbers (positive integers) or counting numbers are exactly the type of numbers that measure cardinality. The idea of infinity in relation to cardinality corresponds to the idea that you have an unending number of elements in a set.

Bizarre things happen in the cardinality sense of infinity. For example, if you take all of the positive integers (1, 2, 3, 4, ...) and double each number (2, 4, 6, 8, ...), we still have the same number of elements in the set (since we just manipulated each object). But our new set happens to be a subset of the original. When dealing with finite sets, a proper subset always has fewer objects than the original set. But we just saw an example where a proper subset has an equal number of objects (infinitely many). So here, the phrase "number of objects" does not represent an actual number, but represents the concept of cardinality.

Alternatively, we could have started with all of the positive integers (1, 2, 3, 4, ...) and then just deleted every other number in the list. This also gives us (2, 4, 6, 8, ...) since the odd numbers were all removed. This is one way of showing that an infinite set taken away from an infinite set can still be infinite. This is an example where ∞ - ∞ = ∞. On the other hand, if we start with (1, 2, 3, 4, ...) and then take away the infinite collection of numbers (11, 12, 13, 14, ...), we are left with (1, 2, 3, 4, 5, 6, 7, 8, 9, 10). This would be an example where ∞ - ∞ = 10. In fact, by choosing how many numbers we want to leave and then just deleting all of the rest, it is easy to create examples where ∞ - ∞ = n for any non-negative integer n.
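
For readers who prefer symbols, the two removals above can be written as set differences. This is just a restatement of the examples in the notation of sets, with vertical bars standing for "number of elements"; the notation is not used elsewhere in this post.

```latex
\[
\{1, 2, 3, 4, \dots\} \setminus \{1, 3, 5, 7, \dots\} = \{2, 4, 6, 8, \dots\},
\qquad \text{infinitely many elements remain,}
\]
\[
\{1, 2, 3, 4, \dots\} \setminus \{11, 12, 13, 14, \dots\} = \{1, 2, \dots, 10\},
\qquad \bigl|\{1, 2, \dots, 10\}\bigr| = 10 .
\]
```

In both lines an infinite set is removed from an infinite set, yet the number of elements left over is different.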

The point here is that a cardinality interpretation of subtraction (removing elements from a set, like taking candy pieces out of a pile) reveals that the cardinality of ∞ (not a number) does not follow ordinary arithmetic rules. The infinite does that; it breaks the ordinary rules we are comfortable with when dealing with finite things.

In mathematics, we say ∞ - ∞ is indeterminate because the result actually depends on how the subtraction takes place.

A cardinality interpretation of 0 - ∞ does not actually make sense. Interpreting 0 - ∞ first requires an extension of the idea of numbers to negative numbers. For example, my children want to tell me that 2 - 5 = 0 because if you start with 2 candies and try to take 5, you don't have any left. But at first they don't realize that there are 3 candies that you never got to take.

We could introduce the idea of borrowing (loans). Suppose that I have 2 candies in my bowl and my daughter wants to eat 5 candies. If I want to give her 5 candies, then I can give her my 2 candies but then I'll need to get 3 more candies from someone else. This puts me in debt for 3 candies (-3) which I might represent by little paper IOUs. I have 3 IOUs. I can pay off the IOUs when I obtain candies. For each candy I receive, I pay off an IOU.

This is the idea of addition extended to all integers. -5 + 2 corresponds to having 5 IOUs and 2 candies. The 2 candies pay off 2 IOUs, leaving me the same as if I had only 3 IOUs to begin with. So -5 + 2 = -3. Subtraction is normally thought of as actually taking candies away. That is, 5 - 3 corresponds to having 5 candies and taking 3 away, leaving only 2. The extended idea of subtraction is to think of having 5 candies and using 3 IOUs: 5 - 3 = 5 + -3. Taking away the candies is equivalent to redeeming IOUs.

That is, once we start dealing with negative numbers, subtraction is really about adding negatives (redeeming IOUs). The value -∞ simply means that we have an infinite number (there's that cardinality idea again) of IOUs. So 0 - ∞ is really the idea 0 + -∞, which means we start with a pile of 0 candies and an infinite number of IOUs. Since we can't redeem any of the IOUs, we still have infinitely many. That is, 0 - ∞ = -∞. By the same argument, 5 - ∞ = -∞ and ∞ - 5 = ∞.

In addition, this gives us a way of extending our earlier idea of subtraction for ∞ - ∞ to end with any possible answer from -∞ to ∞. The extended cardinality approach to ∞ - ∞ would mean that we have infinitely many candies and infinitely many IOUs. If we put the candies and IOUs each in some order (an interesting philosophical question is if this is always possible), then we can make a choice on how we redeem our candies.

We might redeem every IOU but occasionally (or regularly) skip some of the candies. For example, the first IOU could take the 1st candy, the 2nd IOU takes the 3rd candy, the 3rd IOU takes the 5th candy, and so on, leaving candies 2, 4, 6, .... This would correspond to ∞ - ∞ = ∞. Or we could redeem only some of the IOUs but use all of the candies. For example, the 1st candy redeems the 1st IOU, the second candy redeems the 3rd IOU, and so on, leaving infinitely many IOUs unredeemed. This corresponds to ∞ - ∞ = -∞. Or we could leave the first 12 IOUs unredeemed and then redeem each subsequent IOU by the subsequent candies, corresponding to an example of ∞ - ∞ = -12.
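
These matchings can be tried out on a computer by looking only at the first several candies. The sketch below is an illustration under stated assumptions: the function name `unclaimed_candies` is made up, candies and IOUs are numbered 1, 2, 3, ..., and the rule that says which candy each IOU takes must be strictly increasing. It covers only the cases where every IOU is redeemed and candies are left over.

```python
from itertools import count


def unclaimed_candies(candy_for_iou, limit):
    """List the candies numbered 1..limit that no IOU claims.

    candy_for_iou(k) is the candy redeemed by the k-th IOU. It is assumed
    to be strictly increasing in k, so the loop can stop at the first IOU
    whose candy lies beyond the limit.
    """
    claimed = set()
    for k in count(1):
        candy = candy_for_iou(k)
        if candy > limit:
            break
        claimed.add(candy)
    return [c for c in range(1, limit + 1) if c not in claimed]


if __name__ == "__main__":
    # k-th IOU takes candy 2k - 1: the even-numbered candies are never taken.
    print(unclaimed_candies(lambda k: 2 * k - 1, 20))
    # k-th IOU takes candy k + 10: only candies 1 through 10 are left.
    print(unclaimed_candies(lambda k: k + 10, 1000))
    # k-th IOU takes candy k: nothing is left.
    print(unclaimed_candies(lambda k: k, 1000))
```

In the first rule the list of leftovers keeps growing as the limit grows, while in the second it stays at ten candies no matter how large the limit is. The same infinite piles give different answers depending only on the matching.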

It is essential to see that when we deal with arithmetic involving ∞ or -∞, we are not dealing with numbers. For cardinality, we are actually dealing with correspondences between sets. ∞ - ∞ does not adequately describe the correspondence. But 12 - 5 is adequate because every finite correspondence results in the same final answer. But for infinite correspondences, the nature of the correspondence itself makes a difference. Indeterminate really is indeterminate.

Edit: I had a colleague point out to me that there is a second sense of infinite using ordinal numbers. In this setting, it is possible to talk about "Infinity plus one" as having meaning that is actually distinct from "Infinity". For now, you might want to look at the Wikipedia page, but I'm working on my own response as well. (October 8, 2013)

Derivatives, Velocity, and Acceleration

2012-11-20T14:02:00.000-08:00

Calculus can be viewed as the study of rates of change of quantities. The most familiar rate of change in our ordinary experiences is velocity as the rate of change of position. As we ride our skateboards, bicycles, and cars, we understand that high velocities mean our position is changing quite rapidly; and we understand that when our velocity is zero, we are standing still.

So imagine the experience of a perfect rocket-car that experiences no friction and has no brakes. The only way to change its velocity is with rocket blasters that are installed on either end of the vehicle. Consider the following trajectory, illustrated as an animation, and repeating in a loop. A timer (16 seconds) is also shown to provide a measurable sense of time.

The following table describes when and which direction the rockets are firing.

Time Interval	Direction of Rocket	Magnitude
[0,1)	None	0
(1,3)	Right	Moderate
(3,5)	None	0
(5,7)	Left	Large
(7,8)	None	0
(8,12)	Right	Small
(12,16]	None	0

In addition, the rocket-car is driving along a track which has positions marked so that we can think of the position as a variable (in meters). That is, we can think of examining the relation between the variables of time and position. The following table provides the position of the rocket-car as recorded at each second.

time (s)	0	1	2	3	4	5	6	7
position (m)	0.0	0.0	1.0	4.0	8.0	12.0	14.0	12.0
time (s)	8	9	10	11	12	13	14	15
position (m)	8.0	4.5	2.0	0.5	0.0	0.0	0.0	0.0

A table is only useful for a coarse overview of the relation between the variables. For example, we can see that between t=2 s and t=3 s, the position went from x=1.0 m to x=4.0 m. We can compute an average velocity over this time interval:

average velocity = Δx/Δt = (4.0 m - 1.0 m)/(3 s - 2 s) = 3.0 m/s

So the average velocity on this interval was 3.0 m/s. However, this was during one of the intervals the rocket was firing. It would have been going slower at the beginning of the interval and faster at the end. The table does not give enough information for us to estimate the actual velocities.

Another representation of the relation between time and position is with a graph.

The derivative defines a new variable, dx/dt. Although this looks like a fraction, we should really think of the entire symbol as the name of our new variable, the derivative. This variable measures the rate of change of position x with respect to time t. On the graph, this corresponds to the slope of the graph at each point.

For example, if we consider the point at t=2 s and x=1 m, we might draw the tangent line and measure its slope. This slope is the derivative, which for this point corresponds to dx/dt=2.0 m/s. Notice that the derivative is a velocity, which is precisely what the derivative measures for position with respect to time. So we will use the variables v and dx/dt interchangeably.

We could compute the derivative for every point of the original relation. This new variable could be added to our table.

time (s)	0	1	2	3	4	5	6	7
position (m)	0.0	0.0	1.0	4.0	8.0	12.0	14.0	12.0
velocity (m/s)	0.0	0.0	2.0	4.0	4.0	4.0	0.0	-4.0
time (s)	8	9	10	11	12	13	14	15
position (m)	8.0	4.5	2.0	0.5	0.0	0.0	0.0	0.0
velocity (m/s)	-4.0	-3.0	-2.0	-1.0	0.0	0.0	0.0	0.0

But it would be better, just as with our original relation, to consider the relation as a graph. The figure below illustrates both graphs one above the other.

Notice that the velocity graph itself also has a slope at (nearly) every point. The slope of a velocity graph is also a rate of change, measuring the rate of change of velocity (m/s) with respect to time (s). Consequently, the derivative dv/dt, which is called acceleration, has units (m/s)/s, or more directly m/s². (If velocity were measured in miles per hour, then acceleration might be measured in mph/s.) We can use the variable name a=dv/dt.

Because our velocity is defined as piecewise linear, the acceleration will be piecewise constant. On intervals where the velocity was constant, the acceleration will be zero.
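
The tables above can be rebuilt from the acceleration alone. The sketch below is a check under stated assumptions: the numerical accelerations (2, -4, and 1 m/s² for the Moderate, Large, and Small rockets) are not given in the rocket table but are read off from the slopes in the velocity table, and the function names are made up. Because the acceleration is constant during each one-second interval, the usual constant-acceleration formulas apply exactly.

```python
def acceleration(t: int) -> float:
    """Acceleration in m/s^2 during the one-second interval starting at time t."""
    if 1 <= t < 3:
        return 2.0    # rockets push right, moderate (assumed value)
    if 5 <= t < 7:
        return -4.0   # rockets push left, large (assumed value)
    if 8 <= t < 12:
        return 1.0    # rockets push right, small (assumed value)
    return 0.0        # rockets off


def simulate(seconds: int = 15):
    """Return (time, position, velocity) at each whole second from 0 to seconds."""
    x, v = 0.0, 0.0
    rows = [(0, x, v)]
    for t in range(seconds):
        a = acceleration(t)
        x += v * 1.0 + 0.5 * a * 1.0 ** 2   # x + v*dt + (1/2)*a*dt^2 with dt = 1 s
        v += a * 1.0                        # v + a*dt
        rows.append((t + 1, x, v))
    return rows


if __name__ == "__main__":
    for t, x, v in simulate():
        print(f"t={t:2d} s   x={x:5.1f} m   v={v:5.1f} m/s")
```

The simulated positions and velocities agree with the table entries at every second, which confirms that the position, velocity, and rocket tables describe the same motion.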

Notice that the acceleration is directly related to our original discussion of the rocket-blasters. When the rockets are blasting to the right, the acceleration is positive; when the rockets are blasting to the left, the acceleration is negative. In fact, this is exactly the idea behind Newton's second law of mechanics, F=ma. The rocket's thrust corresponds to the force F. Newton's law simply states that the acceleration is proportional to the force. The mass, which measures inertia, provides the proportionality constant. If we had the exact same rockets and the car had twice the mass, then the acceleration would be cut in half.

Because our rocket-car does not have brakes, the only way to slow down is to turn on the rockets in the opposite direction of motion. In the language of derivatives, the acceleration must have the opposite sign from the velocity. If the acceleration is the same sign as the velocity, then the effect of acceleration is an increase in the speed (the magnitude of velocity).
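
The sign rule can be justified with a short calculation. Speed is the magnitude of v, and it grows exactly when v² grows, so it is enough to differentiate v² using the chain rule. This derivation is an added illustration in the notation v and a=dv/dt introduced above.

```latex
\[
\frac{d}{dt}\left(v^2\right) = 2v\,\frac{dv}{dt} = 2va .
\]
```

When v and a have the same sign, the product va is positive and the speed is increasing. When they have opposite signs, va is negative and the speed is decreasing. When a = 0, the speed is not changing.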

We close this reading by considering the effect of acceleration on the graph of position. Slowing down corresponds to making the slope closer to horizontal. Speeding up corresponds to making the slope steeper. So, let us look at the original graph of position as a function of time, but marking the graph with different colors, depending on how the car is accelerating.

When the velocity and acceleration are opposite, I have marked the graph in red. Notice that this is when the graph is becoming closer to horizontal (left-to-right). When the velocity and acceleration are the same direction, I have marked the graph in green. Notice that this is when the graph is becoming steeper (left-to-right). When there is no acceleration, I have marked the graph in purple (straight lines).

To visualize this relative to the original animation, I have color-coded the timer and have placed a colored-flag on the car. Try to connect the information in the graph to the visualization of the moving rocket-car.

Concavity is the word that describes how a graph bends. When the second derivative (acceleration) is positive, the graph will be concave up. On our graph, this corresponds to the first green segment, t ∈ (1,3), and the last red segment, t ∈ (8,12). When the second derivative is negative, the graph will be concave down. This corresponds to the middle red and green segments, t ∈ (5,7).

Great Circles and Non-Euclidean Geometry

2012-11-13T09:46:00.000-08:00

A few weeks ago, a few of my children were playing on the computer with Google Earth. They wanted to see the aerial photographs of some of our previous residences. Then one of them wanted to measure exactly how far away we moved, when we moved from Mountlake Terrace, Washington, to Harrisonburg, Virginia. So he drew a path connecting our previous address to our current address.

What jumped out at me was not the distance, but the heading: 90.4 degrees, or almost exactly due east. I was puzzled. Wasn't Virginia at a lower latitude than Washington? So how could Virginia be almost exactly east of Washington? Then I realized that Google Earth calculates distances using great circles.

Here is a picture of the path. Notice how the path bends.

A great circle is a path on the surface of a sphere such that the center of the sphere is the center of the circle. My first instinct when I read that the heading was 90 degrees was that the path followed a line of constant latitude. That is, I would follow the compass heading of "East", keeping the north pole always exactly to my left. The image (coming from Wikimedia Commons) below illustrates these lines as being "parallel" to the equator. It also illustrates lines of constant longitude, which go from pole to pole.

The equator and lines of constant longitude are examples of great circles. But the lines of constant latitude (other than the equator) are not, because their centers are shifted away from the center of the sphere.

How can you get other great circles? One way is to choose any point on the surface of the sphere. Then take a line and go through the center of the earth to the diametrically opposite point. (Etymology: The word "diametrically" comes from the word "diameter;" great circles always have the same diameter as the sphere on which they are constructed.) These two points will be on longitude lines exactly 180 degrees apart, which together form a great circle. You can then hold your two points constant and rotate that great circle around the axis you created by joining the two points.

Trivia (also known as a Cool Math Fact): Any great circle containing a point must include its diametrically opposite point.

Non-Euclidean Geometry

Now, let's talk about distance. If you take any two points on the surface of the sphere, we want to find the shortest path between the two points. In the geometry of the Greek mathematician and geometer Euclid, the shortest distance between two points is a line. But what happens when we are constrained to paths on the surface of a sphere?

Imagine that you actually want to measure distance. You might do this by taking a piece of string (on the earth, this would be a very long string) and laying it down along your path. On this string, you could mark units of measure (like a measuring tape). When you arrive at your destination, you can just read off the distance of your path. Wildly wandering paths would clearly involve a longer distance.

Given your path, you might want to see if you can improve it. You could do this by trying to pull your string tighter, and sliding it around on the surface to see if you can use any less string to reach your point. We have to imagine that the surface offers no friction, so that the string naturally slides toward a better path if it is available. A path that can not be improved corresponds to an optimal path.

If we did our experiment of minimal paths on a flat table, then the optimal paths would follow Euclid's prediction of what we think of as straight lines. But on a sphere, optimal paths always follow great circles. The idea of non-Euclidean geometry is to consider lines not by our usual sense of straightness (whatever we think that means) but in terms of minimal paths.

In geometry, points and lines are basic elements, meaning they can not be defined from more elementary objects. Euclid's geometry, which is our classical geometry based on what we think of as straight lines, is based on the premise that given any two points, you can use a straight edge (ruler) to draw a straight line. And given any finite line segment, you can extend this line indefinitely. But where does this infinite straight edge come from?

In non-Euclidean geometry, all we ever get are short straight edges. We can only extend a line a little bit at a time, although we mathematically assume that we can do this perfectly, without any errors in aligning the straight edge to the rest of the line.

In Euclidean geometry, using a short straight edge repeatedly is identical to using an infinite straight edge. But if we do our geometry on a curved surface (instead of a perfectly flat infinitely large plane), then using our short straight edge follows a path that locally looks straight. But it has to follow the curving of the surface so that it does not look anything like Euclid's sense of a straight line. On the surface of a sphere, a straight line that is formed by extending a short line segment with our short straight edge over and over again will be a great circle.

Where is this used?

Great circles are used to plan shortest paths for airline flights. Talking about distance "as a bird flies" corresponds to paths along a great circle.

But non-Euclidean geometry has an even more fundamental role in understanding physics and light. One of the basic principles of how light works is that it follows a path of least time (not distance). The idea of refraction (why a stick looks like it bends when immersed in water) is a direct consequence of the fact that light travels through different substances at different speeds (slower through water than through air). The amount of bending of the light's path can be exactly calculated by finding the path over which light takes the least time to traverse.
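The least-time idea can be tried numerically. The following is only a sketch under assumed values: light starts at height a above a flat water surface, ends at depth b below it a horizontal distance d away, and the speeds v_air and v_water are in made-up units. The function names are my own for illustration. It searches for the crossing point that minimizes travel time and then compares sin(angle)/speed on each side, which is the relation known as Snell's law.

```python
import math

def travel_time(x, a, b, d, v1, v2):
    """Time to go from (0, a) to the surface point (x, 0), then on to (d, -b)."""
    return math.hypot(x, a) / v1 + math.hypot(d - x, b) / v2

def least_time_crossing(a, b, d, v1, v2, iterations=200):
    """Ternary search for the crossing point; valid because the time is convex in x."""
    lo, hi = 0.0, d
    for _ in range(iterations):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if travel_time(m1, a, b, d, v1, v2) < travel_time(m2, a, b, d, v1, v2):
            hi = m2
        else:
            lo = m1
    return (lo + hi) / 2

# Hypothetical setup: arbitrary units, light slower in water than in air.
a, b, d = 1.0, 1.0, 2.0
v_air, v_water = 1.0, 0.75

x = least_time_crossing(a, b, d, v_air, v_water)
sin_air = x / math.hypot(x, a)            # sine of the angle from vertical, in air
sin_water = (d - x) / math.hypot(d - x, b)  # sine of the angle from vertical, in water
print(sin_air / v_air, sin_water / v_water)  # the two ratios should agree closely
```

The crossing point lands past the halfway mark, so the light spends more of its horizontal travel in the faster medium, which is the bending described above.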

Curiously, sometimes there are multiple paths over which the light minimizes its time. For example, the speed of light is faster in less dense air. If air is heated, it expands and decreases its density. So there are times when light can find a path with a greater physical distance but less time by choosing a path through warmer, less dense air. Ripples that form a mirage on a hot road in front of you are evidence of these multiple paths.

Thinking about how light travels actually led to Einstein's development of relativity. His theory of general relativity considers how light would need to travel if it was in a box that was accelerating. By imagining that such a box is indistinguishable from a box that is subject to gravity, Einstein realized that light's path needs to bend as it passes through a gravitational field.

But the best way to think about these paths is by considering that light is actually always traveling in a straight line, just that the space in which it travels has a constantly changing sense of what is straight. Further, Einstein realized that space and time are inseparably connected. That is, light is moving in what is called the space–time continuum. Gravity distorts this space–time continuum, and the way this is studied is through non-Euclidean geometry.

By the way, the answer to our original question: 2,257 miles.
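For readers who want to try this themselves, here is a rough sketch of the great-circle calculation. It assumes a perfectly spherical Earth with a mean radius of 3959 miles, and it uses approximate city-center coordinates that I looked up rather than the actual addresses, so it will not match Google Earth exactly. It uses the standard haversine formula for distance and the usual formula for the initial heading.

```python
import math

EARTH_RADIUS_MILES = 3959.0  # assumed mean radius; the Earth is treated as a sphere

def great_circle(lat1, lon1, lat2, lon2):
    """Return (distance in miles, initial heading in degrees clockwise from north)."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dlon = math.radians(lon2 - lon1)
    # Haversine formula for the central angle between the two points
    h = math.sin((p2 - p1) / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlon / 2) ** 2
    angle = 2 * math.asin(math.sqrt(h))
    # Initial heading when leaving the first point along the great circle
    y = math.sin(dlon) * math.cos(p2)
    x = math.cos(p1) * math.sin(p2) - math.sin(p1) * math.cos(p2) * math.cos(dlon)
    heading = math.degrees(math.atan2(y, x)) % 360
    return EARTH_RADIUS_MILES * angle, heading

# Approximate coordinates (assumptions, not the actual addresses)
mountlake_terrace = (47.79, -122.31)
harrisonburg = (38.45, -78.87)

distance, heading = great_circle(*mountlake_terrace, *harrisonburg)
print(f"{distance:.0f} miles, initial heading {heading:.1f} degrees")
```

With these rough inputs, the result should land close to the distance and the nearly due-east heading that Google Earth reported, even though Harrisonburg is more than 9 degrees of latitude south of Mountlake Terrace.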

Proof by Induction and Summations

2011-09-14T21:07:00.000-07:00

The two previous blog entries introduced the idea of the Principle of Mathematical Induction followed by a discussion of a typical example of a proof by induction. Be sure you read those two entries before you look at this one. This blog post extends these ideas by talking about how proof by induction applies to summations.

(I apologize right now about the formatting of summation notation, or sigma notation. I do not know how to get the math to look like math in a blog setting.)

Recall that the second condition of the PMI is that an arbitrary statement S(n) in the chain of statements being proved will guarantee that the next statement S(n+1) in the chain is also true. In a proof by induction, this step always involves using a recursive relation between something in the sentence S(n) with a corresponding object in the sentence S(n+1).

For summations, this recursive relation is always that a summation for statement S(n+1) exactly corresponds to the summation appearing in statement S(n) with some additional term(s).

For example, think back to our motivating example of a chain of statements:
S(1):    1 = 1(2)/2
S(2):    1+2 = 2(3)/2
S(3):    1+2+3 = 3(4)/2
S(4):    1+2+3+4 = 4(5)/2
S(5):    1+2+3+4+5 = 5(6)/2
...
Notice that the sum appearing on the left hand side of any given sentence appears in the sum on the very next sentence, but one more term has been added. Using parentheses to emphasize where the sum from the previous sentence appears in the new sentence, here is the same chain:

S(1):    1 = 1(2)/2
S(2):    (1)+2 = 2(3)/2
S(3):    (1+2)+3 = 3(4)/2
S(4):    (1+2+3)+4 = 4(5)/2
S(5):    (1+2+3+4)+5 = 5(6)/2
...

Summation (Sigma) notation gives us a handy way to describe sequences of sums like we see in the chain above. The terms of the sum follow a simple pattern. In this example, the pattern is the sequence of terms (1, 2, 3, 4, 5, ...). This sequence can be explicitly described as a_k = k for k=1, 2, 3, .... In the symbolism (notation) of summation notation, we write:

Σ¹_k=1(k) = 1
Σ²_k=1(k) = 1+2
Σ³_k=1(k) = 1+2+3
or
Σⁿ_k=1(k) = 1+2+3+...+n
(Notice that the formula in the parentheses (k) is the explicit formula of the sequence of terms in terms of the index which is listed at the bottom of Σ along with the first index of the sequence used in the sum. The index of the last term in the sum appears at the top of Σ.)

The pattern that we described above to illustrate the recursive relation between the sums of consecutive sentences in the chain of statements we wish to prove can also be written in terms of summation notation:
Σⁿ⁺¹_k=1(k) = Σⁿ_k=1(k) + (n+1)
When writing a proof by induction, we use this recursive relation.

Example: Prove that Σⁿ_k=1(k) = n(n+1)/2 for n=1, 2, 3, ...

In the proof, we will write S(n) to represent the sentence: Σⁿ_k=1(k) = n(n+1)/2.

Proof:
(1) We first need to think about the sentence where n is replaced by 1. That is, we need to prove that Σ¹_k=1(k) = 1(1+1)/2.
Σ¹_k=1(k) = 1 (Interpretation of summation notation)
(We are creating a true statement involving the summation symbol that appears on the left side of the equation we are proving. The sequence of terms is 1, 2, 3, 4, 5, ..., and the sum starts and ends with the 1st term.)
1(1+1)/2 = 1(2)/2 = 1 (Algebra)
(We now create an equation involving the formula on the right side of the equation we are proving, and we just showed it has the same value as the summation symbol.)

So Σ¹_k=1(k) = 1(1+1)/2.
(We obtained evidence above that the summation and the formula both represented the same value, so we conclude that they are equal.)

(2) We now need to show that S(n) implies S(n+1) for n=1, 2, 3, .... We will assume S(n), i.e. Σⁿ_k=1(k) = n(n+1)/2. Using the recursion connecting the sums, we will show S(n+1), i.e., Σⁿ⁺¹_k=1(k) = (n+1)((n+1)+1)/2.

Assume    Σⁿ_k=1(k) = n(n+1)/2    for some n=1, 2, 3, ...
Σⁿ⁺¹_k=1(k) = Σⁿ_k=1(k) + (n+1). (Recursive relation on summation of terms a_k = k)
Σⁿ⁺¹_k=1(k) = n(n+1)/2 + (n+1)     (Substitution: summation replaced by formula)
Σⁿ⁺¹_k=1(k) = n(n+1)/2 + 2(n+1)/2     (Find common denominator)
Σⁿ⁺¹_k=1(k) = (n²+n+2n+2)/2     (Distribution and adding fractions)
Σⁿ⁺¹_k=1(k) = (n²+3n+2)/2     (Combining terms)
(Notice that each equation was based on the previous equation. The key step was when we replaced Σⁿ_k=1(k) by the formula n(n+1)/2 that was provided by the assumed hypothesis. At this stage of the proof, we have a formula representing the value of the summation symbol with n+1 that is the left hand side of the equation in the sentence S(n+1). We now need to check the formula that appears in the right hand side of our sentence.)
(n+1)((n+1)+1)/2 = (n+1)(n+2)/2    (Arithmetic: 1+1=2)
(n+1)((n+1)+1)/2 = [n(n+2)+1(n+2)]/2     (Distributive law)
(n+1)((n+1)+1)/2 = [n²+2n + n+2]/2     (Distributive law again.)
(n+1)((n+1)+1)/2 = [n²+3n+2]/2     (Combining terms)
(Notice that each equation was based on the previous equation. Notice that the second and third steps (distributive law) are the formal steps in the idea commonly taught as FOIL when multiplying two binomial expressions together. The key is that we took the formula with n replaced by n+1 of the right hand side and discovered that it equals the same formula that we found for the summation above.)
So we have Σⁿ⁺¹_k=1(k) = (n+1)((n+1)+1)/2.
Consequently, S(n) implies S(n+1) for n=1, 2, 3, ....

By the PMI (since S(1) is true and S(n) implies S(n+1)), we have S(n) is true for n=1, 2, 3, ...
♦
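A computer check is not a proof, since it only verifies the sentences it actually tests, but it can be a useful sanity check on the algebra. The sketch below (function names are my own) builds each sum with the recursive relation and compares it with the formula, and it also checks the identity used in step (2) for the first 200 values of n.

```python
def partial_sum(n):
    """Compute 1 + 2 + ... + n by repeatedly adding the next term (the recursive relation)."""
    total = 0
    for k in range(1, n + 1):
        total = total + k
    return total

def formula(n):
    """The closed form n(n+1)/2; the product n(n+1) is always even, so // is exact."""
    return n * (n + 1) // 2

# S(n) checked directly for n = 1, ..., 200
checked = all(partial_sum(n) == formula(n) for n in range(1, 201))

# The algebra of step (2): n(n+1)/2 + (n+1) equals (n+1)((n+1)+1)/2
step_ok = all(formula(n) + (n + 1) == formula(n + 1) for n in range(1, 201))

print(checked, step_ok)
```

Only the proof by induction covers every n; the loop stops at 200 no matter how convincing the pattern looks.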

Mathematical Induction (part 2)

2011-09-14T20:17:00.000-07:00

My previous blog post introduced the basic idea of what mathematical induction is about. This post focuses more on the mechanics of writing a proof.

A proof is the mathematical form of argument or persuasion. A proof consists of a sequence of logical statements, each of which is shown to be a true sentence based only on information previous to that sentence. Examples of sentences that might appear in a proof are equations or inequalities for which there is clear reason that it is true, and never based on what we hope is true or will later show is true.

For example, suppose I needed to prove (x+1)²=x²+2x+1. In a proof, I can not write down this equation as the first statement because it is not something I know (yet). Instead, I can write down equations known to be true based on basic principles:

(x+1)² = (x+1)(x+1) (meaning of power 2)
(x+1)(x+1) = x(x+1) + 1(x+1) (distributive law)
x(x+1) + 1(x+1) = x²+x + x+1 (distributive law again)
x²+x + x+1 = x²+2x+1 (collecting like terms)

Notice that each of the sentences is a statement of equality. Although they seem to suggest that the "=" sign is being used in a computational sense (i.e., it looks like it is being used to say apply a rule), we really are seeing these sentences as declaring that the result of applying a rule demonstrates that the sentence is actually true. (This is a subtle distinction that you may need to wrestle with until it sinks in.) Technically, our proof is not yet complete. Each sentence on its own is a complete, true sentence. However, we need to end by stating that the sentence we were trying to prove is actually true:

(x+1)²=x²+2x+1 (equivalence of equal quantities, or substitution)

Now, you should read the above paragraphs as illustrating the ideas of a proof in that it illustrates how sentences (equations) are listed as statements that are demonstrated to be true. But so far, we have not dealt with the idea of implication.

Recall that the Principle of Mathematical Induction (PMI) involves verifying the two conditions:

Show that the first statement in the chain, which we call S(1), is true.
Show that if any single statement in the chain, which we call S(n), is true, then this implies the next statement, which will be S(n+1), is also true.

So the structure of every proof by induction involves two subproofs (showing S(1) is true; showing S(n) implies S(n+1)) followed by an application of the PMI. The subproofs are the method by which we verify that the two conditions of the PMI have been satisfied.

Here is what to look for in the subproofs.

S(1) is almost always very easy to show. However, you still need to be clear that you follow the pattern of a proof described above.
Showing S(n) implies S(n+1) will consist of assuming that S(n) is true for some n (don't forget that this is a specific but unspecified value, so you must leave n or k, depending on the label being used, as a symbol and not an actual number). Then the proof will almost always rely on some type of recursive relation between a quantity involved in the statement S(n) and a similar quantity in the statement S(n+1).
After the subproofs are complete, you must invoke the PMI and then declare that the statements are true for every n=1, 2, 3, ....

Example: Given a sequence defined by x₁=3 and the recursive relation x_{k+1}=x_k+2 for k=1, 2, 3, ..., prove that x_k=1+2k for k=1, 2, 3, ....

Notice what the chain of statements we are trying to prove will be. The sentence S(k) is the statement x_k=1+2k, where x_k is the value of the sequence defined recursively, and 1+2k is just a formula involving k. It is very important to remember that the "=" sign in this equation is not defining the value of the sequence, but is simply stating that the two quantities (the sequence value and the formula value) happen to always be equal.

Everything in italics is not part of the proof.

Proof:
(1) We first write a subproof that S(1) is true: x₁=1+2(1). To do this, we need to create a sequence of equations that we know are true based on the symbols themselves, and not the equation above.

x₁=3 (Given)
(We looked at the given information and saw this was provided.)
1+2(1) = 1+2 = 3 (Arithmetic)
(We needed an equation involving 1+2(1) so we wrote down an equation that was based on the rules of evaluating this formula.)
So x₁=1+2(1).   (Equivalence)
(Earlier, we showed x₁=3 and then we showed 1+2(1)=3, so this means the two quantities are equal. This ends the subproof since we just finished writing the statement corresponding to S(1) being true.)

(2) We next write a subproof that S(k) implies S(k+1) for k=1,2,3,... To do this, we will start by assuming S(k) is true for some unspecified k. Using the recursion to compute x_{k+1} we will demonstrate that S(k+1) is also true. Note: S(k+1) is the statement x_{k+1}=1+2(k+1).

Assume x_k=1+2k is true for some k=1, 2, 3, ....
x_{k+1}=x_k+2 (Given recursive definition of sequence)
(The statement we are trying to prove involves the symbol x_{k+1} so we need to create equations based on that symbol.)
x_{k+1}=(1+2k)+2 (Substitution: x_k=1+2k from assumed hypothesis)
(This is the key step: we use the recursive connection between x_{k+1} and x_k to establish a formula for x_{k+1}.)
x_{k+1}=3+2k    (Algebra)
(We now have a simple formula for x_{k+1} and now we turn our attention to the other half of the statement S(k+1), namely the formula 1+2(k+1).)
1+2(k+1) = 1 + 2k + 2   (Distributive law)
(We are creating a true equation involving the formula by considering the results of applying mathematical laws.)
1+2(k+1) = 3+2k    (Algebra from above: 1+2=3)
(Our target should always be that the formula with k+1 in place of k will match whatever we found by the recursion to compute a value for x_{k+1}.)
So x_{k+1}=1+2(k+1)    (Equivalence or Substitution)
That is, S(k) implies S(k+1) for k=1, 2, 3,....
(This ends the second subproof because we just finished writing the statement we were trying to prove in this part. We are now ready to invoke the PMI.)

Since S(1) is true and S(k) implies S(k+1) for k=1, 2, 3, ..., the Principle of Mathematical Induction guarantees S(k) is true for every k=1, 2, 3, .... That is, x_k=1+2k for k=1, 2, 3, ...
♦
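As a quick sanity check on the result (not a substitute for the proof), the sketch below computes the sequence directly from its recursive definition and compares it with the formula 1+2k for the first 100 values of k. The function names are my own.

```python
def x_recursive(k):
    """Follow the definition: start at the given first value 3, then add 2 for each later step."""
    value = 3
    for _ in range(1, k):
        value = value + 2
    return value

def x_formula(k):
    """The formula being proved: 1 + 2k."""
    return 1 + 2 * k

matches = all(x_recursive(k) == x_formula(k) for k in range(1, 101))
print(matches)
```

The two functions mirror the two sides of the sentence S(k): one is the sequence value, the other is just a formula in k, and the claim is that they always agree.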

Mathematical Induction

2011-09-14T19:14:00.000-07:00

I actually posted on this very topic a few years ago. However, I have refined how I think about doing proofs by mathematical induction. And so I am writing one more time.

First, you might find it interesting to look at the Wikipedia article on mathematical induction.

The Principle of Mathematical Induction (PMI) is an axiom that describes the set of natural numbers, which is N = {1, 2, 3, 4, ...}. From one point of view, the PMI says that if S is a set that (1) contains the number 1 and (2) guarantees that whenever any number n is in the set, then n+1 is also in the set, then the set S contains all of N. From the perspective of proofs, however, we are really interested in showing that an infinite chain of logical statements is true.

Before we proceed with the idea of the PMI, let me be clear about a logical statement. A logical statement is a well-constructed sentence that is definitively true or false. Two well-known but easily misunderstood examples of logical statements are equations and inequalities. An equation "A=B" is a logical statement that declares two quantities (A and B) are equal (have the same value). An inequality "A<B" is a logical statement that declares the quantity A is less than the quantity B. Such a statement may turn out to be false, but the statement itself is logically complete. (Here logical does not mean 'makes sense' but it means 'can be decided between True or False'.)

Related to this issue is a common misunderstanding by students that "=" is like an operation that means "has the value" or "find the answer" as it is often used on a calculator. This is especially important in a proof, where each statement (usually an equation) must be clearly true rather than a statement of what you hope or what you are trying to calculate.

Okay, back to the Principle of Mathematical Induction. This principle is about dealing with an infinite chain of logical sentences. For example, consider the following chain of statements. S(1) is the label for the first sentence, S(2) is the label for the second sentence, and so on:

S(1):     1 = 1(2)/2
S(2):     1+2 = 2(3)/2
S(3):     1+2+3 = 3(4)/2
S(4):     1+2+3+4 = 4(5)/2
S(5):     1+2+3+4+5 = 5(6)/2
S(6):     1+2+3+4+5+6 = 6(7)/2
...
(the pattern continues so that if n is any number n=1,2,3,..., we have):
S(n):     1+2+3+...+n = n(n+1)/2

On the left of each sentence is a summation. On the right of each sentence is a simple formula involving n. Using summation notation, we would have written this sentence:
S(n):    Σⁿ_k=1 k = n(n+1)/2

The pattern described by S(n) looks like a single sentence, but it is important to remember that it is describing the entire infinitely long chain of logical sentences. If you add up the values on the left side of any single sentence and compare it to the value of the formula on the right, you will see that the answers are the same. But this only verifies the formulas that you actually check.

The Principle of Mathematical Induction is a tool to prove that the entire chain of sentences are all true. The PMI is much like an infinite chain of dominoes (see the earlier linked Wikipedia article). If you try to knock over the chain one domino at a time, you'll never finish. But if you show that knocking any single domino down will guarantee the next domino also falls and you show that you can knock down the first domino, then this is enough to guarantee that the entire chain will fall down.

So here is the PMI:

Suppose S(n), n=1, 2, 3, ..., is a chain of logical statements and that (1) S(1) is true and (2) S(n) implies S(n+1) for any n=1, 2, 3, ..., then S(n) is true for every n=1, 2, 3, ....

Notice that there are two conditions to use PMI:

We must verify that S(1) (the first sentence) is true.
We must show that the sentence S(n+1) is true whenever the previous sentence S(n) is true.

A proof by induction is the process whereby we verify these two conditions and then apply the PMI to conclude that the entire chain is true.

To learn more about writing the proofs and an example with commentary, please continue by reading the next blog post.

How We Learn Mathematics

2011-03-08T09:44:00.000-08:00

I was reading the following paper: D. Breidenbach, E. Dubinsky, J. Hawks, and D. Nichols, "Development of the Process Conception of Function," Educational Studies in Mathematics, 23: 247-285, 1992.

Quote Dubinsky (1989): "A person's mathematical knowledge is her or his tendency to respond to certain kinds of perceived problem situations by constructing, reconstructing and organizing mental processes and objects to use in dealing with the situations."

"Applying this point of view to mathematics (or any other subject) consists of determining the nature of the specific processes and objects that are constructed and how they are organized when one studies mathematics"

Ways of thinking about functions:

prefunction - does not understand any real ways of using function concepts
action - repeatable mental or physical manipulation (e.g., plug in numbers and calculate); static; one step at a time
process - think of function as a single dynamic transformation

I then found another article: A. Sfard and L. Linchevski, "The Gains and the Pitfalls of Reification: The Case of Algebra," Educational Studies in Mathematics, 26 (2/3), 191-228, 1994 [Learning Mathematics: Constructivist and Interactionist Theories of Mathematical Development]

This article proceeds with the view that in mathematics, there is a duality in mathematical constructs being a process or an object. That is, conceive of things operationally (process) or structurally (object). Historical examples include the expansion of number systems: positive to negative (operational: subtraction as adding a negative to structural: negative numbers as objects), and real to complex (i=sqrt(-1) as an operational convenience to an actual object)

An included reference suggests finding another article: Kieran, C.: 1992, 'The learning and teaching of school algebra', in D. A. Grouws (ed.), The Handbook of Research on Mathematics Teaching and Learning, Macmillan, New York, pp. 390-419. I'll have to see if I can find this one, as it is cited for the sentence, "[reification] was also used to introduce some order into the quickly growing bulk of findings about algebraic thinking."

Interesting phrase: "the ability to grasp the structural aspect is not easy to achieve" and "those crucial junctions in the development of mathematics where a transition from one level to another takes place are the most problematic."

Another interesting way to think about how mathematics is organized: (1) Logical, or the way it fits together; (2) Historical, or the way in which it was developed; and (3) Cognitive, or the processes in which people learn.

Modes of Algebra
1.1) Algebra as Generalized Arithmetic: The Operational Phase
-- solve for the unknown, but not using symbols (grade school algebraic thinking)
-- rhetoric algebra
-- principally reversing processes
1.2) Algebra as Generalized Arithmetic: The Structural Phase
1.2.1) algebra of a fixed value (unknown)
-- Notational convenience, but treat variable as a fixed value
:::: becomes a mental challenge to think of formula as both a process and result
:::: example given: 2+3 represents process, 5 represents result. But x+3 represents both, no separate "result"
:::: compare to the challenges of new number types required to think about division, subtraction, and extracting square roots
** Nice comment: "Once we manage to overcome this difficulty, it is quickly forgotten. ... Our eyes are easily blinded by habit and by our own ontological beliefs. Nevertheless, much evidence for the difficulty of reification may also be found in today's classroom, provided those who listen to the students are open-minded enough to grasp the ontological gap between themselves and the less experienced learners."
1.2.2) Functional algebra (of a variable)
:::: View formula as object
:::: Parameters represented as symbols not numbers.
2) Abstract Algebra

Give examples of interview questions. Students at early stages of thinking think about formulas as recipes for computations (process) but do not perceive them as valid objects. "The equality sign is interpreted as a 'do something signal' (Behr et al 1976; Kieran 1981)"

Here's something I see all the time in calculus classes: "It [the = symbol] serves here as a 'run' command. When treated in this way, the equality symbol looses [sic] the basic characteristics of an equivalence predicate: it stops being symmetrical or transitive. Indeed, young children seem to have no qualms about solving word problems with the help of a chain of non-transitive equalities. For instance, when asked 'How many marbles do you have after you win 4 marbles 3 times and 2 marbles 5 times?', the child would often write: 3*4=12+5*2=12+10=22."
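To see the difference between "=" as a run command and "=" as a logical statement, it can help to hand the child's chain to something that reads every equality literally. This is just an illustrative sketch: Python's chained == treats the line as "each neighboring pair is equal", which is how a proof would read it.

```python
# The child's chain, read as logical statements: every "=" must hold.
chain_is_true = (3 * 4 == 12 + 5 * 2 == 12 + 10 == 22)

# The same computation written as separate statements, each one true on its own.
steps = [3 * 4 == 12, 5 * 2 == 10, 12 + 10 == 22]

print(chain_is_true, all(steps))
```

The chain fails at its very first link, since 3*4 is 12 while 12+5*2 is 22, even though the final answer of 22 marbles is correct.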

Equations of the form 2x-3 = 11 can be interpreted as a formula whose result is 11 (which can be solved by inverse operations); equations of the form 2x-3=5x-9 appear to be two different formulas, and inverse operations do not make sense.

Graph from a graph of f ' (x)

2010-04-28T19:57:00.001-07:00

First, pay attention: the graph provided on the assignment is the graph of the derivative f '(x) and not the graph of f. So you can't look at the picture and say that because the graph you are looking at is increasing that f '(x) is positive; if the graph is increasing, then that means f '(x) is increasing, and not f(x). (This is useful information, but you just need to think about what it does say.)

Second, the number line sign analysis summaries will help identify the shape of the graph. Imagine taking the unit circle and breaking it up according to quadrants. The signs of f '(x) and f ''(x) determine which of these four basic shapes the graph is most like.

f '(x) = + and f ''(x) = + means f(x) looks like Quadrant IV (incr, conc. up)
f '(x) = - and f ''(x) = + means f(x) looks like Quadrant III (decr, conc. up)
f '(x) = + and f ''(x) = - means f(x) looks like Quadrant II (incr, conc. down)
f '(x) = - and f ''(x) = - means f(x) looks like Quadrant I (decr, conc. down)

The graph is just formed by taking these shapes and putting them end-to-end. You wouldn't actually use the entire portion of the unit circle because we probably don't want vertical tangents like the unit circle has. The circle just helps us remember the basic shape. The points where we join the shapes together will probably be inflection points (concavity changes) or extreme values.
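If you want to convince yourself that the quadrant shapes really match the sign table above, here is a small numerical sketch. It takes one sample point in the middle of each quadrant's arc of the unit circle and estimates the signs of the first and second derivatives with finite differences. The function names and the sample points are my own choices.

```python
import math

def upper(x):
    """Top half of the unit circle (Quadrants I and II)."""
    return math.sqrt(1 - x * x)

def lower(x):
    """Bottom half of the unit circle (Quadrants III and IV)."""
    return -math.sqrt(1 - x * x)

# For each quadrant: the curve it lies on and a sample x-value inside that arc
arcs = {
    "I": (upper, 0.5),
    "II": (upper, -0.5),
    "III": (lower, -0.5),
    "IV": (lower, 0.5),
}

def signs(f, x, h=1e-4):
    """Signs of the estimated first and second derivatives of f at x."""
    slope = (f(x + h) - f(x - h)) / (2 * h)
    bend = (f(x + h) - 2 * f(x) + f(x - h)) / (h * h)
    return ("+" if slope > 0 else "-", "+" if bend > 0 else "-")

table = {name: signs(f, x) for name, (f, x) in arcs.items()}
for name, (first, second) in table.items():
    print(f"Quadrant {name}: f' is {first}, f'' is {second}")
```

Each pair of signs printed should agree with the corresponding line of the list above.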

However, sign analysis does not tell us the heights of any points. The problem gives only one point: f(0) = 1. The rest of the points of interest (especially the local extreme values) can be found by thinking about the information relating to the areas of the graph of f '(x). (Again, think about the Fundamental Theorem of Calculus).

Sums of Geometric Sequences

2010-04-28T19:47:00.001-07:00

The second problem on the project introduces a new closed form for a sum:

∑_k=1ⁿ [A ρ^k] = A ρ (ρⁿ-1)/(ρ-1).

Unfortunately, too many of you are still intimidated simply by the symbols that are used.

The formula for the geometric sequence, A ρ^k, is like an exponential, except the power is an integer variable rather than a continuous variable like x. For example, if A=2 and ρ=1/3, we have terms that are increasing powers of (1/3) times 2:
2/3 (k=1), 2/9 (k=2), 2/27 (k=3), 2/81 (k=4), etc.

The summation is simply the sum of these values:
∑_k=1ⁿ [2 (1/3)^k] = 2/3 + 2/9 + 2/27 + 2/81 + ... + 2/3ⁿ.
The closed form gives a formula answering the value of this sum:
2(1/3)[(1/3)ⁿ-1]/[(1/3) - 1] = (2/3)*[(1/3)ⁿ-1]/(-2/3) = 1 - (1/3)ⁿ.
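If the symbols are still intimidating, it may help to watch the two sides agree for specific values of n. The sketch below uses exact fractions so there is no rounding, and compares the term-by-term sum, the general closed form, and the simplified expression 1 - (1/3)ⁿ for the example A=2, ρ=1/3. The function names are my own.

```python
from fractions import Fraction

def geometric_sum(A, rho, n):
    """Add up the terms A*rho^k for k = 1, ..., n one at a time."""
    return sum(A * rho**k for k in range(1, n + 1))

def closed_form(A, rho, n):
    """The closed form A*rho*(rho^n - 1)/(rho - 1), valid when rho is not 1."""
    return A * rho * (rho**n - 1) / (rho - 1)

A, rho = Fraction(2), Fraction(1, 3)
for n in range(1, 6):
    print(n, geometric_sum(A, rho, n), closed_form(A, rho, n), 1 - rho**n)
```

For example, with n=2 all three columns show 8/9, which is 2/3 + 2/9.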

So when you write down the Riemann sum for the integral in question, you need to look at using the properties of exponentials so that the Riemann sum looks just like a sum of a geometric sequence. You should identify a factor that does not involve k, and this is A. You should identify the other factor as some number raised to the k power. Then you can use the closed form.

Populations, Birth Rates, and Death Rates

2010-04-28T19:25:00.000-07:00

This entry is a general assist for my class working on a project. Suppose you knew the rate at which births are occurring (call it a function of time, b(t)) and you knew the rate at which deaths are occurring (a function d(t)). If the only way the population changes is through births and deaths, and P(t) is the function describing the size of the population over time, then P'(t) = b(t) - d(t). (It is still your job to explain why this makes biological sense.)

Okay, now for the general principle. Anytime you know the rate of change of a quantity, you can always get back to the original quantity through a definite integral (assuming the rate of change is continuous, anyway). This is the heart of the 2nd Fundamental Theorem of Calculus. Not using P and t as variables (so that you have at least something to translate), here is the basic idea.

Suppose you know f '(x). Then A(x) = ∫₀^x f '(z) dz is an antiderivative of f '. But so is f(x) since that is where f '(x) comes from. That is f(x) = A(x) + C for some constant. In particular, A(0) = 0, so C=f(0). That is, f(x) = f(0) + ∫₀^x f '(z) dz.

This will always work, even if I don't start the integral at 0: f(x) = f(a) + ∫_a^x f '(z) dz. Written another way, it looks like the first Fundamental Theorem of Calculus: f(x) - f(a) = ∫_a^x f '(z) dz.
In other words, the 2nd FTC implies that every function is its starting value plus the integral of its rate of change.
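Here is a small numerical illustration of that last sentence. The function f below is an arbitrary example I picked (an assumption for illustration only), and the integral is approximated with a midpoint rule, so the two printed numbers should agree only up to a small approximation error.

```python
def integral(g, a, x, pieces=10_000):
    """Midpoint-rule approximation of the definite integral of g from a to x."""
    dz = (x - a) / pieces
    return sum(g(a + (i + 0.5) * dz) for i in range(pieces)) * dz

def f(x):
    """An example function chosen only for illustration."""
    return x**3 - 2 * x + 5

def f_prime(z):
    """Its rate of change, computed by hand."""
    return 3 * z**2 - 2

a, x = 1.0, 4.0
rebuilt = f(a) + integral(f_prime, a, x)  # starting value plus accumulated change
print(f(x), rebuilt)
```

Changing a to a different starting point changes both f(a) and the integral, but their total still recovers f(x).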

Now, for our population problem, we don't actually know the rate of change completely; we only know the value at specific points. So instead of computing an integral (to get an exact value), we will approximate the integral using a Riemann sum. We are restricted to using the table data, so Δt=2 is forced upon us. For example, ∫₀² b(t) dt can only be estimated with a single rectangle while ∫₀⁴ b(t) dt would involve two rectangles. The idea of the Riemann sum is that we choose b(t_k*) as one of our data points (either on the left or right).

More specifically, on the interval [0,2] (k=1), we can either use t₁*=0 so that b(t₁*)=100 or use t₁*=2 so that b(t₁*)=135. In the first case, the rectangle for k=1 contributes b(t₁*)Δt = 200, while the second case leads to a contribution of b(t₁*)Δt = 270. The average (midpoint) of these two values (200+270)/2 = 235 is the estimate that would come from using the trapezoid sum. We do this for each of the 8 intervals between our data points, for both the births and the deaths.
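
The arithmetic for a single interval can be written as a short Python sketch. The function name `riemann_estimates` is hypothetical. The only data used are the two values b(0)=100 and b(2)=135 quoted above; for the full table you would pass the whole list of values.

```python
def riemann_estimates(values, dt):
    """Return (left sum, right sum, trapezoid sum) for equally spaced samples."""
    left = sum(values[:-1]) * dt    # use the left endpoint of each interval
    right = sum(values[1:]) * dt    # use the right endpoint of each interval
    return left, right, (left + right) / 2

dt = 2                              # spacing forced by the table
births = [100, 135]                 # b(0) and b(2) from the example above
left, right, trapezoid = riemann_estimates(births, dt)
print(left, right, trapezoid)
```

For this one interval the three values are the 200, 270, and 235 computed by hand above.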

By considering our estimates for the number of births and deaths in each of the intervals ([0,2], [2,4], [4,6], etc.), we can produce an estimate of the new population at each of the times (2, 4, 6, 8, etc.). By thinking about what estimates lead to the largest predicted population, we get an upper limit (i.e. bound) for our estimate --- no population consistent with this data can ever go above that value. Similarly, we can choose those estimates to create a lower bound. The true population will be somewhere in between.

Exponential Functions

2010-02-05T05:32:00.000-08:00

I'm getting feedback that exponential functions are giving you extra trouble. I'd appreciate getting feedback to help know how I can clarify the concepts. Here is a summary of some of the key concepts that I'm wanting you to understand:

b^x is not just a formula "b to the power x" but is a new function, which I'm asking you to call exp_b(x).
The properties of exponents like b^(x+y) = b^x b^y and (b^x)^y = b^(xy) become properties of the exponential functions.
exp_b(x+y)=exp_b(x)*exp_b(y)
exp_b(xy)= [exp_b(x)]^y

Logarithms are the inverse functions of exponential functions.
exp_b(log_b(x))=x
log_b(exp_b(x))=x

In formula representation, these are written as follows:
b^(log_b(x)) = x
log_b(b^x) = x
Whenever you see a formula with an exponential, say b^(3x-2), you should be able to think in both formula and function modes interchangeably.
b^(3x-2) = b^(3x) b^(-2) = (b³)^x b^(-2)
b^(3x-2) = exp_b(3x-2)
The first mode allows us to recognize that b^(3x-2) is actually of the form Aq^x where A = b^(-2) and q = b³. The second mode is useful to remind us that we really have a composition when we need to compute a derivative or when dealing with inverse functions.
There is a special base e that is most important because the corresponding exponential function is its own derivative.
exp'_e(x)=exp_e(x)
d/dx[e^x]=e^x

The natural exponential is written without a base. The corresponding inverse function is called the natural logarithm, ln(x).
exp(ln(x))=x
ln(exp(x))=x

In formula representation, these are written as follows:
e^(ln(x)) = x
ln(e^x)=x
The x in these formulas, as always, is a placeholder. So any number or formula could be used in place of x.
Any exponential can be written in terms of the natural exponential. The key is to use the properties of exponents.
b = e^(ln(b))
b^x = [e^(ln(b))]^x = e^(ln(b) x)
Another way to think of this is using composition of inverse functions: exp(ln( ))
b^x = exp(ln(b^x))
ln(b^x) = x ln(b)
b^x = exp(x ln(b)) = e^{x ln(b)}

That is, by replacing b^x by e^(ln(b) x), we can write any exponential A b^x in the form A e^(kx) where k is the number ln(b).
Derivatives of exponentials use the basic property that exp'(x) = exp(x), or d/dx[e^x] = e^x. Usually, this also requires the chain rule: d/dx[e^u] = e^u u'.
d/dx[e^(2x)] = e^(2x) (2) = 2e^(2x) (u = 2x)
d/dx[e^(-3x)] = e^(-3x) (-3) = -3e^(-3x) (u = -3x)
d/dt[e^(t²-5t)] = e^(t²-5t) (2t-5) = (2t-5)e^(t²-5t) (u = t²-5t)
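
If you want to check rules like these numerically, the following Python sketch may help. It compares each chain-rule formula above with a central difference quotient and also checks the change of base b^x = exp(x ln(b)). The helper `numerical_derivative`, the base b = 3, and the sample points are arbitrary choices made for illustration.

```python
import math

def numerical_derivative(g, x, h=1e-6):
    """Central difference approximation to g'(x)."""
    return (g(x + h) - g(x - h)) / (2 * h)

# Change of base: b^x should equal exp(x ln(b)).
b, x = 3.0, 1.7
print(b ** x, math.exp(x * math.log(b)))

# Each pair is (function, derivative predicted by the chain rule).
examples = [
    (lambda x: math.exp(2 * x), lambda x: 2 * math.exp(2 * x)),
    (lambda x: math.exp(-3 * x), lambda x: -3 * math.exp(-3 * x)),
    (lambda t: math.exp(t**2 - 5*t), lambda t: (2*t - 5) * math.exp(t**2 - 5*t)),
]
for g, g_prime in examples:
    print(numerical_derivative(g, 0.4), g_prime(0.4))
```

Each printed pair should match to several decimal places. A mismatch usually means the factor u' was forgotten.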

So this was a moderately long list. Perhaps just re-reading it helped you understand something better. Or perhaps you realize something is still confusing. Please post comments explaining why you are finding problems challenging or perhaps explaining what helped you suddenly understand what you had missed earlier.

Concerns about Derivatives

2009-11-17T12:11:00.001-08:00

Okay, I have just finished grading HW #4. This assignment had some differentiation rules. But many of you are not comprehending the purpose of a derivative rule.

For example, we had the special rule for squares of functions:

d/dx[f²(x)] = 2 f(x) f '(x)

This means that any time there is a formula squared and you need to take its derivative, you can apply this rule.

(2x+3)² corresponds to the function f(x)=2x+3 being squared. So since f '(x) = 2:

d/dx[(2x+3)²] = 2 (2x+3) (2) = 4(2x+3)

Similarly, (x²-4x+5)² corresponds to the function f(x) = x²-4x+5 being squared, and f '(x)=2x-4

d/dx[(x²-4x+5)²] = 2 (x²-4x+5) (2x-4)
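
One way to convince yourself that the rule gives the right answer is to expand the first example and differentiate term by term. This check is an added illustration, not part of the homework:

```latex
\begin{align*}
(2x+3)^2 &= 4x^2 + 12x + 9 \\
\frac{d}{dx}\left[4x^2 + 12x + 9\right] &= 8x + 12 = 4(2x+3)
\end{align*}
```

The result agrees with what the rule for squares produced, without any expanding.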

You need to be an expert at identifying the form of an expression in order to apply appropriate rules of differentiation, and next semester, rules of integration (anti-differentiation).

Science and Mathematics

2009-09-03T08:31:00.000-07:00

The other day, I had my students respond to a question about how mathematics relates to science.

In class, I had pointed out that mathematical definitions are very precise while scientific measurements can be rather messy. A mathematician has a very precise meaning when they say that two variables are proportional or have a linear relation. But when we get real data, even if they do not satisfy these precise meanings, we still gain significant information about the relation and might even say that the measured variables are proportional or linear. Unfortunately, many students seemed to think I was looking for a repeated discussion of this point.

Science can be thought of as the study of the physical world through the scientific method. Essentially, we make observations on what happens in the world (whether that be physical, chemical or biological interactions) and want to understand why that is happening as well as to predict what will happen in the future. In order to do this, scientists propose various hypotheses based on their observations (and past accumulated scientific experience) and then test those hypotheses. Experiments quite often include quantitative measurements, and part of the prediction is often to propose relationships between independent variables (the variables in treatments and control) and the dependent variables (outcomes). Experience may support a hypothesis or falsify the hypothesis, but it never can prove a hypothesis.

Knowledge based on patterns that we predict will continue, but which we can support but never prove, is called inductive knowledge. Science is an example of inductive knowledge.

Mathematics can be thought of as the study of structures that satisfy very specific rules. We have properties of arithmetic, algebra, and calculus. We establish specific axioms that describe our basic assumptions about the structures and then use logical argument to deduce the behavior of more complicated constructions. We might look at examples to see what ideas might be true or false, and in this sense mathematics can also take advantage of inductive knowledge. However, the objective in mathematics is not just to suppose that a pattern will continue; the objective is to determine conclusively if it must continue. We seek proofs (that it is true) or counterexamples (that break the pattern).

Knowledge based on basic assumptions (axioms) and logical argument that determines conclusively what must follow from these assumptions is called deductive knowledge. Mathematics is an example of deductive knowledge.

Models form a connection between mathematics and science. Data often appear to follow a general trend, even in the presence of the noise of messy observation. A mathematical model takes that messiness and forms an abstract clean relationship that mathematics can work with. Based on the deductive approach of mathematics, we can often establish consequences of the assumed model form. We then apply those consequences as hypotheses in our scientific framework. The consequences derived by the deductive approach provide the predictions that can be used to falsify these hypotheses.

First Reading Assignment

2009-08-24T10:36:00.000-07:00

Since I am not sure when Blackboard is going to be available for the class (I procrastinated asking for the two sections to be merged into a single section), here is the reading assignment and preparation for classwork for Wednesday.

Reading Assignments:

Online Calculus Textbook: Read Sections 1.1 and 1.2 (link below). These sections emphasize the idea that variables (which represent physical quantities) can be related, as independent and dependent variables. We want to think about related quantities throughout this semester. This text specifically asks for the web-browser Firefox version 3.0 or later (this is to render formulas correctly).

Come to class having prepared answers for Problems 3, 4, 6, 7, 10 from Section 1.2 Problems. We will discuss these problems, but you will not turn them in.

How Students Learn: This book prepared by the National Academy of Sciences is actually written for teachers to focus on making courses better suited for students to learn. The first 12 pages introduce three concepts that students should be aware of in their own learning. The link below goes to the first page, and then follow the links to read through page 12.

How Students Learn, page 1 through page 12.

Why is math fun? Why is math hard?

2009-08-24T10:25:00.000-07:00

Today in class, I tried to help break the ice and reduce some of the anxiety related to taking a university mathematics course (Math 231). I asked students for examples of why they might find mathematics fun and why they might find mathematics hard. Here are some of the responses.

Why fun?

It's fun when you struggle with a concept and then it finally clicks and you understand.
It's fun to see mathematics actually being applied to a real problem.
It's fun when you are able to spot trends and make predictions based on data.
It's fun when you really understand why instead of just the "required" steps.
When you understand, it becomes easy.
It can be a lot like a game or solving a puzzle.
It's fun to develop things logically.

Why hard?

Later material builds on earlier material, so missing something early is permanent hardship.
It can be really hard when the teacher goes too fast.
It can be really hard when the teacher is unclear, especially if they can't give alternate ways of thinking about an idea.
It can be hard if the teaching style is very different from your learning style.
There are so many formulas, it can be overwhelming to try to memorize them.
The theorems, rules and definitions are full of little details.
It can be difficult to understand the many conceptual ideas that interact.
Learning related technology can be challenging.
Only one answer, so you can't fake it.
Very hard to cram for exams.
It can be really hard to find a little (stupid) mistake when proofing your work.
Lots of homework, and problems can take a lot of time.

I'd welcome more comments, including examples of when you found mathematics especially exciting or examples of how your relationship with mathematics soured. Feel free to post a comment.

A New Semester --- Two Classes

2009-01-13T12:55:00.001-08:00

This semester I am teaching a mathematical models in biology course (Math/Bio 342) as well as the first semester of Calculus with Functions (Math 231). So I expect to have entries for both of these courses showing up.

Values, Equations, and Theorems

2008-12-03T08:08:00.000-08:00

There is some confusion about theorems. For example, consider the Mean Value Theorem: If f is continuous on [a,b] and differentiable on (a,b), then there is some value c∈(a,b) so that f'(c)=(f(b)-f(a))/(b-a).

Some students think that the ratio (f(b)-f(a))/(b-a) is the Mean Value Theorem. But it is not; it is just a value that is called the average rate of change of f between a and b. You can compute this value as long as both f(a) and f(b) exist. It has nothing to do with derivatives or continuity.

Other students think that the equation f'(c)=(f(b)-f(a))/(b-a) is the Mean Value Theorem. This is closer to the truth, but still is incorrect. First of all, what is c? Second, this statement may not be true. For example, suppose that f(x)=-1 if x<0 and f(x)=1 if x>0. Suppose that a=-2 and b=+2. Then the ratio (f(b)-f(a))/(b-a) is equal to 1/2. But f'(x)=0 everywhere except at x=0, where f'(0) does not exist. So there is no value c where f'(c)=1/2.

Even closer is to say that f'(c)=(f(b)-f(a))/(b-a) for some c between a and b. This is actually the conclusion of the Mean Value Theorem. It requires the entire statement, particularly the statement that the equation is true for some c and that the value c must be between a and b.

But my example above provides an example where the conclusion of the Mean Value Theorem is false. That does not mean that the Mean Value Theorem itself is false. After all, it is a theorem, and that means that it has been proved to be true always. The part that is missing is the hypothesis for the theorem. The conclusion can only be guaranteed to be true using the theorem if the hypotheses are all satisfied. In this case, you must also check (or give a reason why) the function f is continuous and differentiable on the interval from a to b, including the endpoints for continuity.
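
The step-function example can be explored with a small Python sketch. The helper `slope_near`, the sample points, and the step size are assumptions made for illustration. The function itself is the one described above, with f(0) left undefined.

```python
def f(x):
    """Step function from the example; f(0) is deliberately left undefined."""
    if x < 0:
        return -1.0
    if x > 0:
        return 1.0
    raise ValueError("f is not defined at x = 0")

a, b = -2.0, 2.0
average_rate = (f(b) - f(a)) / (b - a)

def slope_near(x, h=1e-3):
    """Central difference estimate of f'(x); valid only when |x| > h."""
    return (f(x + h) - f(x - h)) / (2 * h)

sample_points = [-1.5, -0.5, 0.5, 1.5]
slopes = [slope_near(c) for c in sample_points]
print(average_rate, slopes)
```

The average rate of change is 1/2, yet every sampled slope is 0. The conclusion fails here because the hypotheses (continuity and differentiability on the interval) fail.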

Similarly, ∫_a^b f(x) dx/(b-a) computes the average value of a function f on an interval [a,b]. The value can be computed anytime the function is integrable over the interval [a,b]. The Mean Value Theorem for Integrals has nothing to do (in principle) with this calculation.

However, if f is continuous on [a,b], then
f(c)=∫_a^b f(x) dx/(b-a)
for some c∈(a,b). This entire statement comprises the Mean Value Theorem for Integrals. The hypothesis that must be verified to use the theorem is that f is continuous on [a,b]. The conclusion is that you are guaranteed that
f(c)=∫_a^b f(x) dx / (b-a)
for at least one value c between a and b.
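
As a concrete illustration (the function and interval are chosen here only as an example), take f(x) = x² on [0,3]. Since f is continuous, the theorem applies:

```latex
\begin{align*}
\frac{1}{3-0}\int_0^3 x^2\,dx &= \frac{1}{3}\cdot\frac{3^3}{3} = 3 \\
f(c) = c^2 = 3 \quad &\Longrightarrow \quad c = \sqrt{3} \approx 1.732 \in (0,3)
\end{align*}
```

The number 3 is the average value, which could be computed with or without the theorem. The theorem is what guarantees that some c in (0,3) actually achieves it.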

Limit or Function Evaluation?

2008-12-02T10:19:00.000-08:00

I've noticed that some students are perplexed about when they use a limit or function evaluation. I presume that the cause of this confusion is that students have learned that you evaluate a limit by plugging in a value. But this is only because nearly all functions that they work with are continuous.

You use a limit evaluation when you need to know what the value of the function should be by using information from the side of the point of interest. When using a limit, you must use limit notation: lim_{x → c} f(x). Then you use the appropriate rules of limits to evaluate (and hopefully, the function is continuous).

You use a function evaluation when you need to know the value of the function at an actual point. There is no limit involved, just function evaluation. You just use function notation, say f(c), and compute the value defined by the function.

For example, suppose you are calculating an instantaneous rate of change as the limit of an average rate of change. The average rate of change only makes sense when the interval of interest includes two points (endpoints of an interval). The instantaneous rate of change is found by seeing what the value of the average rate of change does when the two points move closer to each other, or more particularly, as the second point approaches the first point.

On the other hand, suppose that you know the derivative, which is itself a function. Then the instantaneous rate of change is calculated by function evaluation using the derivative function. (The limit was already used to create this new function.)

In particular, I have noticed this problem when dealing with finding extreme values of a function. When the interval of interest is an open interval, we are acting as though the domain does not include the endpoints. So, with this restricted domain, evaluation of the function is not possible (since the points are not in the domain). So we must use the information about the function immediately adjacent to the endpoints, and this is with a limit. In this context, the value found in the limit is not achieved at the endpoint, although it might be achieved somewhere else in the domain.

On the other hand, if the interval is a closed interval, then the endpoints are included in the restricted domain. If the function is continuous at these points, then evaluating the function directly is appropriate, since the point is in the domain.

A final remark: if the function is discontinuous at some point in the interval, you must also check the limits at that point for consideration when looking for extreme values.

Differential Equations Project Tips

2008-11-17T17:59:00.000-08:00

As some questions are asked more regularly, I thought I'd provide some general discussion here.

(1) Start with the proposed form of X(t). Compute X'(t) and X''(t) based on that form. Then use those calculations to discover when X'' + k/m X = 0.

(2) The question "mean physically about the mass on a spring" is not asking you to think about the mass (as in measurement) but is asking you to think about what the statement X(0)=1 means about the state of the mass at time t=0 and what the statement X'(0)=0 means about the state of the mass at time t=0.

(3) X(0) is a constant and has derivative of d/dt[X(0)] = 0. Recall that dX/dt > 0 implies that X is increasing, dX/dt < 0 implies that X is decreasing, and dX/dt = 0 implies that X is stationary (instantaneous rate = 0).

(4) An arbitrary quantity A is proportional to some other quantity B if it is always the case that A = k B for some constant value k (the constant of proportionality). Now interpret the statements to identify what pieces of the equation are proportional to what.

(5) Although you have studied e^x in precalculus, you will not use the logarithm at all in this work. Instead, I just want you to consider some function that has the special property that exp' = exp. (This sentence is analogous to sin'=cos and cos'=-sin.) However, you do need to think about the chain rule: X(t) = A exp(rt) (Since the argument is not simply t, you must use the chain rule.) This problem is exactly analogous to Step 1.

(6) You will get something like

dX/dt = "formula involving X and a, b, and m"

X is increasing when "formula" > 0, decreasing when "formula" < 0, and stationary when "formula" = 0. So use your skills with algebra (think sign analysis) to find conditions when these are the case.

(7) You need to understand the relationship between a rate of change and an actual change. To understand this as well as possible, see the section we skipped in Chapter 3 (last section). But what you essentially need is that we will follow the tangent line for the time increment Δt. How much change is there when the rate of change and the duration of time are both known?

(8) The new version of Excel has some unanticipated differences from what I had when I wrote the project. The labels are not assigned from a menu anymore. Instead of the 3-step process that is described, you just click in the label field in the header section of Excel and type in the new label and then hit enter.

The calculations you see in the first few lines should exactly match your hand calculations in part (7).

Do not print the spreadsheet (it takes WAY too many pages). That is why I ask you to submit your spreadsheet on Blackboard as part of the project.

(9) I must receive a printout of the graph -- hand drawn figures are not acceptable. Ideally, this entire project report would be typed (perhaps using Equation Editor for the equations), with the figures naturally fitting in.

(10) Make hypotheses and test your hypotheses.
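
The tangent-line idea in tip (7) can be sketched in a few lines of Python. The rate function dX/dt = -0.5 X, the starting value, and the step size are placeholders chosen only for illustration. They are not the project's equation or data.

```python
def euler_step(x, rate, dt):
    """Follow the tangent line for a time increment dt: change = rate * dt."""
    return x + rate(x) * dt

def rate(x):
    return -0.5 * x   # hypothetical dX/dt, NOT the project's formula

x, dt = 10.0, 0.1
values = [x]
for _ in range(3):
    x = euler_step(x, rate, dt)   # new value = old value + (rate)(duration)
    values.append(x)
print(values)
```

Each row of the spreadsheet in tip (8) plays the same role as one pass through this loop.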

Exponential Project Tips

2008-11-17T17:35:00.000-08:00

As some questions are asked more regularly, I thought I'd provide some general discussion here.

(1) exp is the name of the function, just as sin and cos are names of functions. From calculus, you learn that sin'=cos and cos'=-sin. This step shows that exp_b' = ln b * exp_b. (That is, it leaves the function alone except for a constant multiple.) (But be careful where the chain rule is needed!)

(2) You do not need to use the limit definition (epsilons and deltas). Instead, for perhaps the easiest solution, you should think about how to finish the statement:

lim b^x = lim [(b^x-1)/x ... ]

That is, if you start with (b^x-1)/x, what do you do to that expression to leave only b^x? Then use elementary limit rules to compute your resulting limit.

(3) One method is to use the method of substitution for limits (change of variables) and then use an identity for the function so that the result of Step 2 is applied --- this method mimics what is done to show that sin x is continuous everywhere. A second method is to use a general theorem that makes continuity an obvious conclusion of the results from Step 1.

(4) You must start with a statement like:

ln (1/b) = lim_{x → 0} [(1/b)^x - 1]/x

There are two easy approaches: (1) Find a common denominator to rewrite this as a simple fraction before continuing or (2) Think of (1/b)^x as b to some appropriate power and then use a limit substitution.

(5) Since you do not know the derivative of ln x, it is incorrect to use the Mean Value Theorem applied to the logarithm. Instead, you should apply the MVT to the function exp_b(x) on an interval so that b^a and b^b are incredibly easy and where it is clear which value is larger (so that you know if the average rate of change is positive or negative). You may use the fact that b^x is positive for all values of x.

(6) The function f_b(x) is a linear function. You should write it in slope-intercept form (e.g., mx+b).

(7) Do not attempt to solve the equation f_b(x) = exp_b(x). There is one obvious solution from the definition: x=0. But the formulas themselves do not explain why there would not be more solutions. Instead, you should define a function (perhaps g) so that

g(x) = exp_b(x) - f_b(x).

You know that g(0) = 0. You need to show that g(x)>0 for all x ≠ 0. My hint suggested Rolle's theorem, but I have since found that the Mean Value Theorem helps even more. Use the Mean Value Theorem to show that for x>0, the average rate of change between 0 and x must be positive. What about x<0? x=0?

(8) You may not use a limit form of the type b^∞. You may take a limit of the function f_b(x) because that is of a form we know how to work with. Then you should use the result of (7) to conclude what the limit of exp_b(x) must be.

(9) and (10) put all of the previous steps together to perform analysis similar to Sections 4.2 and 4.4 to understand the graph.
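
To build some intuition for the limit that appears in tips (2) and (4), here is a small Python sketch that tabulates (b^x - 1)/x for shrinking x. The helper name `difference_quotient`, the base b = 2, and the list of x values are assumptions for illustration. The library call math.log is used only as a comparison value, not as a substitute for the argument the project asks for.

```python
import math

def difference_quotient(b, x):
    """The expression (b^x - 1)/x whose limit as x -> 0 is studied in the project."""
    return (b ** x - 1) / x

b = 2.0
for x in [0.1, 0.01, 0.001, 0.0001]:
    print(x, difference_quotient(b, x), difference_quotient(1 / b, x))
print("math.log(b) =", math.log(b))
```

The second column should settle near ln 2 and the third near ln(1/2), which is consistent with the relationship that tip (4) asks you to establish.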

Spotlight: Math Games

2008-10-24T13:06:00.001-07:00

The other day before class, I introduced a little game called Sprouts. I found the rules summarized at the MAA website. There is also a nice discussion on the Science News website. I find it interesting that such a simple game can be analyzed using mathematical properties.

One of my favorite "math" games is a game called Eleusis. This game was invented by Robert Abbott as an analogy of the scientific method. So perhaps we should call this a "science" game. The game is played with a deck (or multiple decks) of playing cards. I turn over the first card and then think of a pattern that would start with that card. Now, the remaining players take turns attempting to choose a card from their hands that they believe would be a valid next card in the pattern. If they are correct, I leave it there. If they are incorrect, I move the card out of the sequence and below the card they tried to follow (for future reference).

The goal for the players is to eventually arrive at a hypothesis that they believe explains the pattern. By playing cards, they attempt to critically assess whether their hypothesis is a complete explanation of the pattern. This mimics the scientific method because we see patterns in how nature functions, and through experiment we attempt to see if controlled efforts are consistent with or contradict our acting hypotheses.

Try it out? Let me know how the game goes.

Mathematical Induction

2008-10-10T08:58:00.000-07:00

The principle of mathematical induction is a topic that our textbook unfortunately skips over. It is used when we want to prove a rule that applies to positive integers. Often, it is the argument that is needed when you want to say, "See! It works the same way for this case and that case, so the pattern will just keep repeating." But to say that a pattern keeps repeating is exactly what we should attempt to make more precise.

The natural numbers are the set of all positive integers: 1, 2, 3, 4, .... It is the dot-dot-dot that creates the problem. Using "..." attempts to tell us that the pattern continues. But what exactly is the pattern? For the natural numbers, the pattern is that you just add one to the previous number. So here is one way of describing the natural numbers, and it is what motivates the principle of mathematical induction.

1 is in the set
For every number that is in the set, call it n, we also have n+1 in the set.

We could restate this using an implication:

1 is in the set
If n is in the set, then (n+1) is in the set.

And that is what we do for all applications of mathematical induction. We provide a starting point (such as 1 is in the set). Then we establish an implication that if a statement is true for one value (n is in the set), then it must also be that the statement is true for the next value (n+1 is in the set).

Here is an example from our past that should have used induction.

Theorem: xⁿ is continuous for n=1, 2, 3, ....

Scratchwork:
Before proving this statement, we should think how we might attempt this without induction. Well, f(x) is really a product xⁿ= x x x ... x, where there are n factors of x. (See how the "..." allows us to hand-wave our notation?) Well, we know that the limit of each factor x will just go to the value c, so the limit must be lim_{x → c} f(x) = cⁿ. That use of "..." keeps us from clearly stating how we used the limit of a product, other than again referring to a pattern: "Use the limit of a product n-1 times." The use of induction makes this precise.

Proof:
We prove by induction.
1) First, we show that f(x) = x¹ = x is continuous. (This is the starting point)
But this is already known: lim_{x → c} x = c.
So f is continuous at any point c.
The statement is true when n=1.
2) Second, we assume that f(x) = xⁿ is continuous and now show that this implies that g(x) = xⁿ⁺¹ is also continuous. (This is the inductive step)
So assume that f is continuous.
g(x) = x f(x) (Relate the new function in terms of what is assumed)
So using the limit of a product:
lim_{x → c} g(x) = lim_{x → c} x f(x) = c f(c) = c cⁿ = cⁿ⁺¹ = g(c)
So xⁿ⁺¹ is continuous whenever xⁿ is continuous.

So by induction, since the statement is true for n=1, and whenever the statement is true for one value n it is also true for the next value n+1, the statement is true for all integers starting with 1.
(End of Proof)
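
The structure of the proof (a starting point plus a step from n to n+1) is the same structure as a recursive definition. As a loose analogy only, here is a Python sketch with a hypothetical function `power` that computes xⁿ the way the proof builds it: the case n=1 is handled directly, and every later case is reduced to the previous one through xⁿ⁺¹ = x xⁿ.

```python
def power(x, n):
    """Compute x**n for a positive integer n, mirroring the induction proof."""
    if n == 1:
        return x                    # starting point: x^1 = x
    return x * power(x, n - 1)      # inductive step: x^n = x * x^(n-1)

print(power(2, 5))
```

Running code only checks particular values of n. The induction argument is what covers all of them at once.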

Sometimes induction is compared to reaching different rungs on a ladder. The first statement is what allows you to climb onto the first step. The inductive implication is what says that if you have already reached one rung, then you can move to the next rung. Putting the two together, you first climb on the ladder's first rung. Then you know that you can climb from the first to the second rung, from the second rung to the third rung, from the third to the fourth, and so on forever. The implication, in one fell swoop, justifies climbing each step from the previous. The principle of mathematical induction replaces the uncertainty in "..."

Intermediate Value Theorem

2008-10-06T07:37:00.000-07:00

A theorem is a statement that is always true because it has been proved. Theorems are usually stated as implications. That is, they usually are stated as "If [something is true], then [something else is true]." However, this does not mean that the hypothesis (what appears as [something is true]) is actually true. Nor does it mean that the conclusion (the statement instead of [something else is true]) is true. It means that you are guaranteed to know that the conclusion is true whenever the hypothesis is true.

When applying a theorem, it is your task to establish that the hypothesis is true. Then, by stating the theorem, you are allowed to state that the conclusion is also true.

Here is an example using the Intermediate Value Theorem. Recall that the theorem states that if you have a function f that is continuous on a closed interval [a,b] (where a and b can be any numbers with a < b), then for any y-value C between the values f(a) and f(b), you are guaranteed to be able to find a value x such that a < x < b and f(x) = C.
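
When the hypothesis really does hold, the guaranteed value can be hunted down numerically by repeatedly halving the interval. The Python sketch below is an added illustration: the function name `find_value`, the sample function f(x) = x³, and the target C = 5 are assumptions, and the method is only trustworthy if f is continuous on [a,b].

```python
def find_value(f, a, b, target, tol=1e-9):
    """Bisection search for x in [a, b] with f(x) = target.

    Assumes f is continuous on [a, b] and target lies between f(a) and f(b).
    """
    def g(x):
        return f(x) - target

    if g(a) * g(b) > 0:
        raise ValueError("target is not between f(a) and f(b)")
    while b - a > tol:
        mid = (a + b) / 2
        if g(a) * g(mid) <= 0:
            b = mid       # a sign change (or zero) lies in [a, mid]
        else:
            a = mid       # otherwise it lies in [mid, b]
    return (a + b) / 2

def f(x):
    return x ** 3         # hypothetical continuous function

x = find_value(f, 0.0, 2.0, target=5.0)
print(x, f(x))
```

If f is not continuous, the loop still returns a number, but nothing guarantees that f takes the target value there.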

Here is a hypothetical situation. My car holds 12 gallons of gasoline. (That is not the hypothetical part -- I have actually filled the tank :-) I have installed an automated gas-tank tracking system that records the amount of gas as a function of the car's mileage. (Yep, that's the hypothetical part) If you ask me how much gas I had when the car was at 97,034 miles, then I can tell you it had exactly 5.93 gallons of gasoline.

Last week, I filled up my tank when the car was at 98,012 miles. This morning, I checked my car and it now records the tank as having 1.45 gallons and 98,143 miles. (All figures are also hypothetical, including mileage) So here is a question: will I actually be able to identify a mileage on the car between that last fill-up and today when the car contained exactly 4.7 gallons?

Hmm. Let's see. Imagine that we use the variable x to represent the mileage on the car. Also, let f be a function that measures the gallons in the car f(x) when the mileage is x. We know that f(98,012) = 12 and f(98,143) = 1.45. So C=4.7 is between f(98,012) and f(98,143). What does the Intermediate Value Theorem say?

Now, before you go on, I need to tell you a story. On Friday, I needed to mow the lawn. My backyard is pretty large, so it takes a while. Funny thing! I ran out of gas. I knew I had recently filled the car, so I found my gas siphon and pumped a gallon out of the car's tank and into my gas can. Phew! Glad that was available! Finished the lawn with nary a problem.

So what did you answer?

(Extra credit toward quiz grade if you answer correctly this week by e-mail: waltondb at jmu dot edu)