Here is the problem that we discussed today:
Problem. Let \(g(x,y)=x^{2}y+3y\) and let \(A=(3,1)\), \(B=(0,5)\). Let \( \mathbf{u}\) be the unit vector in the direction from \(A\) to \(B\). Think of the quantity \(g(x,y)\) as being the temperature at the point \((x,y)\) in the heated coordinate \(xy\)-plane.
Thus, we must have \[ 2 x^2 + 6 = 3 x y. \] Since we want our point \(P = (x,y)\) to be on the line segment \(\overline{AB}\) we must have \[ y = - \frac{4}{3} x + 5 \quad \text{with} \quad 0 \leq x \leq 3. \] Substituting the formula for \(y\) in \(2 x^2 + 6 = 3 x y\) we get \[ 2 x^2 + 6 = 3 x y = - 4 x^2 + 15 x. \] This is a quadratic equation \[ 2 x^2 -5 x + 2 = 0, \] whose solutions are \[ x = \frac{1}{2} \quad \text{and} \quad x = 2. \]
Hence, at the points \[ C = \Bigl( 2,\dfrac{7}{3} \Bigr) \quad \text{and} \quad D = \Bigl(\dfrac{1}{2}, \dfrac{13}{3}\Bigr) \] the directional derivative in direction \(\mathbf u\) is equal to \(0.\) Let us calculate the temperatures at these points, respectively, It is important to note that both points \(C\) and \(D\) belong to the line segment \(\overline{AB}.\)
Let us calculate the temperatures at these points: \[ g(C) = g\Bigl( 2,\dfrac{7}{3} \Bigr) = \frac{49}{3} = 16\tfrac{1}{3} \quad \text{and} \quad g(D) = g\Bigl(\dfrac{1}{2}, \dfrac{13}{3}\Bigr) = \frac{169}{12} = 14\tfrac{1}{12}. \]
Problem. Consider a differentiable function $f(x,y)$. The following information about $f$ is given:
Solution. We are given that \(f(2,3)=1\). We need to find \((\boldsymbol{\nabla} f)(2,3).\) Set \((\boldsymbol{\nabla} f)(2,3)=\langle a,b\rangle\) where \(a\) and \(b\) are unknowns.
What do we know about \((\boldsymbol{\nabla} f)(2,3)=\langle a,b\rangle\)?
We know the directional derivative from \((2,3)\) toward \((1,4)\). The vector determined by this oriented line segment is \[ \langle 1,4 \rangle - \langle 2, 3 \rangle =\langle -1,1 \rangle . \] The corresponding unit vector is \[ \Bigl\langle -\frac{\sqrt{2}}{2}, \frac{\sqrt{2}}{2} \Bigr\rangle. \] Since the directional derivative in this direction is given to be \(1,\) we have \begin{align*} 1 & = (\boldsymbol{\nabla} f)(2,3) \cdot \Bigl\langle -\frac{\sqrt{2}}{2}, \frac{\sqrt{2}}{2} \Bigr\rangle \\ & = \langle a,b\rangle \cdot \Bigl\langle -\frac{\sqrt{2}}{2}, \frac{\sqrt{2}}{2} \Bigr\rangle \\ & = -\frac{\sqrt{2}}{2} a + \frac{\sqrt{2}}{2} b. \end{align*} Therefore \[ - a + b = \sqrt{2}. \]
We know the directional derivative from \((2,3)\) toward \((0,2)\). The vector determined by this oriented line segment is \[ \langle 0,2 \rangle - \langle 2, 3 \rangle =\langle -2, -1 \rangle . \] The corresponding unit vector is \[ \Bigl\langle -\frac{2}{\sqrt{5}}, - \frac{1}{\sqrt{5}} \Bigr\rangle. \] Since the directional derivative in this direction is given to be \(\sqrt{10},\) we have \begin{align*} \sqrt{10} & = (\boldsymbol{\nabla} f)(2,3) \cdot \Bigl\langle -\frac{2}{\sqrt{5}}, - \frac{1}{\sqrt{5}} \Bigr\rangle \\ & = \langle a,b\rangle \cdot \Bigl\langle -\frac{2}{\sqrt{5}}, - \frac{1}{\sqrt{5}} \Bigr\rangle \\ & = -\frac{2}{\sqrt{5}} a - \frac{1}{\sqrt{5}} b. \end{align*} Therefore \[ 2 a + b = - 5 \sqrt{2}. \]
Now we solve \[ \begin{aligned} -a + b &= \sqrt{2},\\ 2a + b &= -5\sqrt{2}. \end{aligned} \] Subtracting the first from the second equation we get \(a,\) then we substitute \(a\) in the first equation to get \(b\) \[ a = -2 \sqrt{2}, \quad b = - \sqrt{2}. \] Therefore \[ (\boldsymbol{\nabla} f)(2,3) = - \sqrt{2} \, \bigl\langle 2, 1 \bigr\rangle. \]
To get the approximation we need the partial derivatives. Since \[ (\boldsymbol{\nabla} f)(2,3) = \bigl\langle f_x(2,3) , f_y(2,3) \bigr\rangle, \] we conclude \[ f_x(2,3) = - 2 \sqrt{2}, \quad f_y(2,3) = - \sqrt{2}. \]
The approximation for \(f(x,y)\) near \((2,3)\) is \begin{align*} f(x,y) & \approx f(2,3) + f_x(2,3) (x-2) + f_y(2,3) (y - 3) \\ & = 1 + \bigl( - 2 \sqrt{2} \bigr) (x-2) + \bigl( - \sqrt{2} \bigr) (y - 3). \end{align*} Therefore \begin{align*} f(2.1,2.9) & \approx 1 - 2 \sqrt{2} \ 0.1 - \sqrt{2} (-0.1) \\ & = 1 - 0.1 \sqrt{2} \approx 1 - \dfrac{1}{10} \dfrac{99}{70} \\ & = \dfrac{601}{700}. \end{align*} Thus \[ f(2.1,2.9) \approx \dfrac{601}{700}. \]
Today we did Section 4.6 Directional Derivative and the Gradient in OpenStax Calculus Volume 3. Do as many problems from 260 to 309 as you can. In particular: 263, 265, 267, 274, 281, 284, 290, 293, 294, 296, 298, 302, 303, 304.
Today we started Section 4.5 Chain Rule in OpenStax Calculus Volume 3. I do not see many interesting problems here. Do:227, 233, 248, 249.
A line is better than sine.
A plane is better than a wave.
This is a part of Section 4.4 Tangent Planes and Linear Approximations in OpenStax Calculus Volume 3. More linear approximation problems you can find among the problems in this sections: 163, 169, 170, 181, 182, 183, 187, 188, 189, 190, 194, 195, 207, 208, 211.
Clicking on the image will cycle through 8 different individual scenes of this movie with various values of $x_0 \in [0,\pi]$.
Today we finished by starting Section 4.4 Tangent Planes and Linear Approximations in OpenStax Calculus Volume 3. Do as many problems from 163 to 214 as you can. In particular: 163, 169, 170, 181, 182, 183, 187, 188, 189, 190, 194, 195, 207, 208, 211.
Today we started Section 4.3 Partial Derivatives in OpenStax Calculus Volume 3. Do as many problems from 112 to 162 as you can. In particular: 112, 115, 116, 117, 118, 119, 122, 127, 129, 137, 138, 139, 141, 145, 148, 149, 152, 153, 158, 159.
In Section 4.2, Limits and Continuity, the book presents the \(\require{ams}\epsilon\text{-}\delta\) definitions of limit and continuity. I regard these \(\epsilon\text{-}\delta\) definitions as one of humanity’s highest intellectual achievements. We tried to convey their spirit within the short time we devoted to this topic.
Here is a link to a song celebrating \(\epsilon\text{-}\delta\) definitions, Lyrics: Tom Lehrer — “There’s a Delta for Every Epsilon” or Tom Lehrer at YouTube
Today we started Section 4.2 Limits and Continuity in OpenStax Calculus Volume 3.
In this section we study limits of the form \[ \lim_{(x,y)\to (a,b)} f(x,y). \] Here \(f\) is a function of two variables with real numbers as values.
In all examples in this section, the function \(f(x,y)\) is given by an algebraic formula, that is, a formula built from constants, variables, and the operations of addition, subtraction, multiplication, division, and taking roots, or familiar exponential function, logarithms, or trigonometric functions.
There are two distinct cases:
The point \((a,b)\) is in the domain of the function \(f(x,y)\), that is we can evaluate the value \(f(a,b).\) In this case we have \[ \lim_{(x,y)\to (a,b)} f(x,y) = f(a,b). \] That is, functions given by algebraic formulas are continuous wherever they are defined. Thus, to find the limit of such a function at a point in its domain, you can simply substitute the point into the formula.
Example 4.8 and Exercises 60, 61, 63, 64, 66-75 all belong to this kind of problems.
The point \((a,b)\) is NOT in the domain of the function \(f(x,y)\), that is, the value \(f(a,b)\) is not defined. A limit \[ \lim_{(x,y)\to (a,b)} f(x,y) \] of this kind is more difficult to asses. In some examples the limit exists, and we have to figure out what it is and make the argument why that limit is what we claim it is. Sometimes the limit simply does not exist and we provide argument why.
Example 4.9 explains how to deal with limits of this kind. Exercises 86, 87, 88, 89 are similar. Exercises 65, 76, 77, 80, 81, 82, 83 belong to this kind of problems, some of these limits exist, some do not.
Three examples of limits of this kind that do exist are as follows: \[ \lim_{(x,y)\to (0,0)} \frac{x y }{|x| + |y|} = 0, \] \[ \lim_{(x,y)\to (0,0)} \frac{x^2 y }{x^2 + y^2} = 0, \] \[ \lim_{(x,y)\to (0,0)} \frac{x y }{\sqrt{x^2 + y^2}} = 0. \] And three that do not exist \[ \lim_{(x,y)\to (0,0)} \frac{x + y }{|x| + |y|}, \] \[ \lim_{(x,y)\to (0,0)} \frac{x y}{x^2 + y^2}, \] \[ \lim_{(x,y)\to (0,0)} \frac{x + y }{\sqrt{x^2 + y^2}}. \]
Place the cursor over an image to start the animation.
|
|
|
Place the cursor over an image to start the animation.
|
|
|
All illustrations on my website are produced with Wolfram Mathematica. Mathematica is a tool for doing mathematics on the computer. You can think of it as a super-powerful graphing calculator. It can do symbolic and numerical calculations, produce simple and sophisticated graphics, all in one environment.
You will definitely benefit from taking some interest in it. To get started with Mathematica, visit my Mathematica page. It contains links to several videos that will help you get started efficiently. Mathematica is available in the computer labs in BH 215, CF 312, HH 233, and BH 209.
Below is Wolfram Mathematica code to plot the function \(f(x,y) = y^3 + x y\).
Plot3D[y^3 + x y, {x, -3.5, 2.5}, {y, -2.5, 2.5},
PlotRange -> {{-3.5, 2.5}, {-2.5, 2.5}, {-4, 4}},
PlotPoints -> {201, 201}, PlotStyle -> Directive[Opacity[0.6]],
Mesh -> True, AxesLabel -> {"x", "y", "z"}, BoxRatios -> {1, 1, 1},
ClippingStyle -> None, ImageSize -> 600]
Below is Wolfram Mathematica code to plot the function \(f(x,y) = \sqrt{4 - x^2 - y^2}\).
Plot3D[Sqrt[4 - x^2 - y^2], {x, -2.5, 2.5}, {y, -2.5, 2.5},
PlotRange -> {{-2.5, 2.5}, {-2.5, 2.5}, {0, 3}},
PlotPoints -> {201, 201}, PlotStyle -> Directive[Opacity[0.8]],
Mesh -> True, AxesLabel -> {"x", "y", "z"}, BoxRatios -> {5, 5, 3},
ClippingStyle -> None, ImageSize -> 600]
You can copy and paste the above code directly into a Mathematica notebook. Then press , and Mathematica will display a plot of the function. You can modify the code to adjust the plot to your liking.
Large Language Models are very good at giving advice on how to do things in Wolfram Mathematica. I have used ChatGPT, Gemini, Copilot, and DeepSeek, and have had both extremely positive and negative experiences with each of them. Overall, the experience has been overwhelmingly positive. Using LLMs to improve your computer skills is a great way to learn through interaction. When you ask an LLM to provide code for a task, it is usually easy to assess whether the code works. If you try different LLMs for the same task, you may be surprised by how different their answers can be.
Before moving on to the new topic, I wanted to share with you the derivation of the parametric equations of the Viviani Curve, Problem 355 in Section 2.6. The Viviani Curve is the intersection of the sphere of radius \(2\) centered at the origin and the circular cylinder or radius \(1\) whose axis is the line \(x=1, y = 0, z = t\) where \(t \in \mathbb{R}.\)
In set notation the Viviani Curve is the following intersection \[ \Bigl\{ (x,y,z ) \in \mathbb{R}^3 \ \bigl| \bigr. \ x^2 + y^2 + z^2 = 4 \Bigr\} \, \bigcap \,\Bigl\{ (x,y,z) \in \mathbb{R}^3 \ \bigl| \bigr. \ (x-1)^2 + y^2 = 1 \Bigr\}. \]
I vividly remember being assigned the Viviani Curve in my calculus class in 1975 at the University of Sarajevo. Now, half a century later, this animation is my tribute to that beautiful mathematical concept.
The animation below illustrates how the curve's parametrization is obtained. To help you follow along, the light gray dashed lines represent vectors, and the white point is the origin.
The constant vector is \(\mathbf{i}\). The vector that is added to \(\mathbf{i}\) is the vector \[ (\cos \theta) \mathbf{i} + (\sin \theta) \mathbf{j}. \] Here, \(\theta\) can be any real number, although, later on we will realize that it is sufficient to take \(\theta\) such that \(0 \leq \theta \leq 4 \pi.\) Why are these two vectors important? When added together, we get \[ \mathbf{i} + (\cos \theta) \mathbf{i} + (\sin \theta) \mathbf{j} = \bigl(1+ \cos \theta \bigr) \mathbf{i} + (\sin \theta) \mathbf{j}. \] The head of this vector, the point \[ \bigl( 1+ \cos \theta, \sin \theta, 0 \bigr) \] is on the cylinder \[ \Bigl\{ (x,y,z) \in \mathbb{R}^3 \ \bigl| \bigr. \ (x-1)^2 + y^2 = 1 \Bigr\}. \] In fact, for any \(z \in \mathbb{R}\), the point \[ \bigl( 1+ \cos \theta, \sin \theta, z \bigr) \] is on that cylinder. Therefore, we just need to find \(z\) for which the point \[ \bigl( 1+ \cos \theta, \sin \theta, z \bigr) \] is on the sphere \[ \Bigl\{ (x,y,z ) \in \mathbb{R}^3 \ \bigl| \bigr. \ x^2 + y^2 + z^2 = 4 \Bigr\}. \] To calculate such \(z\) we substitute the point \[ \bigl( 1+ \cos \theta, \sin \theta, z \bigr) \] into \[ x^2 + y^2 + z^2 = 4 \] and solve for \(z\): \[ (1+ \cos \theta)^2 + (\sin\theta)^2 + z^2 = 4. \] Simplifying the last expression yields \[ z^2 = 2(1 - \cos\theta). \] As often, trigonometric identities are useful whenever we encounter trigonometric functions. Here, the half-angle formula for \(\cos\theta\) does the magic: \[ \cos\theta = \cos^2\Bigl(\frac{\theta}{2}\Bigr) - \sin^2\Bigl(\frac{\theta}{2}\Bigr) \] and \[ 1 = \cos^2\Bigl(\frac{\theta}{2}\Bigr) + \sin^2\Bigl(\frac{\theta}{2}\Bigr). \] Therefore, \[ 1 - \cos\theta = 2 \sin^2\Bigl(\frac{\theta}{2}\Bigr). \] Hence \[ z^2 = 4 \sin^2\Bigl(\frac{\theta}{2}\Bigr). \] It turns out that all the solutions for \(z\) are given by \[ z = 2 \sin\Bigl(\frac{\theta}{2}\Bigr) \quad \text{where} \quad 0 \leq \theta \leq 4 \pi. \] Thus the vertical dashed vector is \[ 2 \sin\Bigl(\frac{\theta}{2}\Bigr) \, \mathbf{k}. \] This vector starts at the blue dot on the blue circle and ends at the sphere.
This is how to reach a point on the Viviani Curve from the origin \[ \underbrace{\mathbf{i}}_{\text{the first dashed vector}} + \underbrace{(\cos \theta) \mathbf{i} + (\sin \theta) \mathbf{j}}_{\text{the second dashed vector}} + \underbrace{2 \sin\Bigl(\frac{\theta}{2}\Bigr) \, \mathbf{k}}_{\text{the third dashed vector}}. \]
Thus the parametric equations of the Viviani Curve are \[ x = 1 + \cos\theta, \quad y = \sin\theta, \quad z = 2 \sin\Bigl(\frac{\theta}{2}\Bigr), \quad \text{where} \quad 0 \leq \theta \leq 4 \pi. \]
Let us verify that these points are on the sphere. It is convenient to rewrite the parametric equations using the half-angle formulas again: \[ x = 2 \Bigl(\cos \frac{\theta}{2}\Bigr)^2, \quad y = 2\Bigl( \sin \frac{\theta}{2}\Bigr) \Bigl(\cos \frac{\theta}{2}\Bigr), \quad z = 2 \sin\Bigl(\frac{\theta}{2}\Bigr), \quad \text{where} \quad 0 \leq \theta \leq 4 \pi. \] Then \begin{align*} x^2 + y^2 + z^2 & = 4 \Bigl( \cos \frac{\theta}{2} \Bigr)^4 + 4 \Bigl( \sin \frac{\theta}{2} \Bigr)^2 \Bigl( \cos \frac{\theta}{2} \Bigr)^2 + 4 \Bigl(\sin \frac{\theta}{2} \Bigr)^2 \\ & = 4 \Bigl( \cos \frac{\theta}{2} \Bigr)^2 \biggl( \Bigl( \cos \frac{\theta}{2} \Bigr)^2 + \Bigl( \sin \frac{\theta}{2} \Bigr)^2 \biggr) + 4 \Bigl(\sin \frac{\theta}{2} \Bigr)^2 \\ & = 4 \Bigl( \cos \frac{\theta}{2} \Bigr)^2 + 4 \Bigl(\sin \frac{\theta}{2} \Bigr)^2 \\ & = 4. \end{align*}
Place the cursor over the image to start the animation.
The following table presents a derivation of the equation of a cardioid. For optimal viewing, use a wide screen.
Click through several helpful images. |
|
Place the cursor over the image to start the animation.
Place the cursor over the image to start the animation.
The key sentence is:
To my taste, the highlighted sentence is too terse. I would prefer to restate it as follows:
The inverse of \(f\) exists if and only if \(f\) is bijective. If \(f\) is bijective, that is if the inverse of \(f\) exists, then the inverse of \(f\) is denoted by \(\displaystyle f^{-1}.\)
Now, I see that some might complain that my highlighted paragraph above is somewhat repetitive; I phrase it so to be absolutely clear, and to emphasise the following important implication
From the preceding item, to understand the inverse of a function \(f\) we must understand what it means for a function \(f\) to be bijective. Again, Wikipedia is helpful, Bijection. Please read carefully the definition below.
Now we go back to the concept of an inverse function
\( a x^2 + b y^2 + c z^2 = d, \) where \(a, b, c, d\) are any of the numbers \(-1,0,1.\) For example, \[ x^2 + y^2 - z^2 = 1, \quad x^2 + y^2 - z^2 = 0, \quad x^2 + y^2 - z^2 = -1, \] or \[ x^2 - y^2 + z^2 = 1, \quad x^2 - y^2 + z^2 = 0, \quad x^2 - y^2 + z^2 = -1. \]
$ a x^2 + b y^2 + c z = d, $ where \(a, b, c, d\) are any of the numbers \(-1,0,1.\) For example, \[ z = x^2 + y^2, \quad z = x^2 - y^2, \quad z = x^2, \quad z = y^2, \] or, \[ x^2 + y^2 = 1, \quad x^2 - y^2 = 1, \quad x^2 = y^2. \]
$ a x^2 + b y + c z^2 = d, $ where \(a, b, c, d\) are any of the numbers \(-1,0,1.\) For example, \[ y = x^2 + z^2, \quad y = x^2 - z^2, \quad y = x^2, \quad y = z^2, \] or, \[ x^2 + z^2 = 1, \quad x^2 - z^2 = 1, \quad x^2 = z^2. \]
$ a x + b y^2 + c z^2 = d, $ where \(a, b, c, d\) are any of the numbers \(-1,0,1.\) For example, \[ x = y^2 + z^2, \quad x = y^2 - z^2, \quad x = y^2, \quad x = z^2, \] or, \[ y^2 + z^2 = 1, \quad y^2 - z^2 = 1, \quad y^2 = z^2. \]
Problem 356 is also interesting. Although, the picture provided looks inaccurate. The second surface is a cone and its vertex is at the origin. The provided picture does not show the vertex at the origin. But, I am not interested in these complicated equations. I would look at a simple hyperboloid and a simple cone:
\[
x^2 + y^2 - z^2 = 1, \qquad -x^2 + y^2 + z^2 = 0.
\]
Substituting \(x^2 = y^2 + z^2\) from the second equation, into the first, we get \(2 y^2 = 1\). Hence, the intersection of these two surfaces lies in two planes parallel to \(xz\)-coordinate plane. One such plane is \(y =\sqrt{2}/2\) and the other is \(y = -\sqrt{2}/2.\) Thus, the intersection is given by the equations:
\[
x^2 - z^2 = 1/2, \qquad y^2 = 1/2.
\]
We can replace cone in Problem 356 with a horizontal hyperboloid of one sheet and consider the intersection of the surfaces
\[
x^2 + y^2 - z^2 = 1, \qquad -x^2 + y^2 + z^2 = 1.
\]
Substituting \(x^2 = y^2 + z^2 - 1\) from the second equation into the first, we get \(2 y^2 = 2\). Hence, the intersection of these two surfaces lies in two planes parallel to \(xz\)-coordinate plane. One such plane is \(y = 1\) and the other is \(y = -1.\) Thus, the intersection is given by the equations:
\[
x^2 - z^2 = 0, \qquad y^2 = 1.
\]
Finally, we can replace cone in Problem 356 with a horizontal hyperboloid of two sheets and consider the intersection of the surfaces
\[
x^2 + y^2 - z^2 = 1, \qquad -x^2 + y^2 + z^2 = -1.
\]
Substituting \(x^2 = y^2 + z^2 + 1\) from the second equation into the first, we get \(2 y^2 = 0\). Hence, the intersection of these two surfaces lies in the \(xz\)-coordinate plane. Thus, the intersection is given by the equations:
\[
x^2 - z^2 = 1, \qquad y = 0.
\]
Place the cursor over the image to start the animation.
Five of the above level surfaces at different level of opacity.
My favorite application of vectors: COLORS. In fact, I love this application so much that I wrote a webpage to celebrate it: Color Cube.
One exercise in this context would be to ask you to find three colors which are between teal and yellow, one in the middle between teal and yellow and the other two in the middle between teal and the mid-color and in the middle of mid-color and yellow.
In the picture below, the red vector is \(\mathbf{a}\), the green vector is \(\mathbf{b}\), and the blue vector is \(\mathbf{c}.\) The black vector is \(\mathbf{a}\times \mathbf{b}\). The area of the parallelogram formed by \(\mathbf{a}\) and \(\mathbf{b}\) is \(\|\mathbf{a}\times \mathbf{b}\|.\) The height of the parallelepiped orthogonal to the parallelogram formed by \(\mathbf{a}\) and \(\mathbf{b}\) is \(\|\mathbf{c}\| \cos \theta.\) Therefore, the volume of the parallelepiped is \[ \|\mathbf{a}\times \mathbf{b}\| \mkern 2mu \bigl(\|\mathbf{c}\| \cos \theta \bigr) = (\mathbf{a}\times \mathbf{b})\cdot \mathbf{c}, \] if \(\theta\) is such that \(0 \leq \theta \lt \pi/2\) (like in the picture below) and it is \[ \|\mathbf{a}\times \mathbf{b}\| \mkern 2mu \bigl(\|\mathbf{c}\| \mkern 1mu \bigl| \cos \theta \bigr| \bigr) = \bigl| (\mathbf{a}\times \mathbf{b})\cdot \mathbf{c} \bigr| , \] if \(\theta\) is such that \(\pi/2 \leq \theta \leq \pi.\) This second case occurs if the vectors \(\mathbf{a},\) \(\mathbf{b},\) and \(\mathbf{c}\) do not satisfy the right-hand rule.
The volume of the parallelepiped formed by three linearly independent vectors \(\mathbf{a}\), \(\mathbf{b}\), \(\mathbf{c}\).
Recall that the coordinate definition of the cross product \(\mathbf a\times\mathbf b\) is \[ \mathbf a\times\mathbf b =\Big\langle a_2 b_3-a_3 b_2,\; -(a_1 b_3 - a_3 b_1),\; a_1 b_2-a_2 b_1 \Big\rangle. \] Therefore \begin{align*} (\mathbf{a}\times \mathbf{b})\cdot \mathbf{c} & = \Big\langle a_2 b_3-a_3 b_2,\; -(a_1 b_3 - a_3 b_1),\; a_1 b_2-a_2 b_1 \Big\rangle \cdot \bigl\langle c_1, c_2, c_3 \bigr\rangle \\ & = (a_2 b_3-a_3 b_2) c_1 - (a_1 b_3 - a_3 b_1) c_2 + (a_1 b_2-a_2 b_1) c_3 \\ & = a_1b_2c_3 - a_1b_3c_2 - a_2b_1c_3 + a_2b_3c_1 - a_3b_1c_2 - a_3b_2c_1 \\ & = \left| \begin{array}{ccc} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{array} \right|. \end{align*}
Thus, the volume of the parallelepiped formed by three vectors \(\mathbf{a}\), \(\mathbf{b}\), \(\mathbf{c}\) is the absolute value of the above expression.
Given two nonzero vectors \(\mathbf{a}\) and \(\mathbf{b},\) what is the angle formed by these vectors? By the angle formed by the vectors \(\mathbf{a}\) and \(\mathbf{b}\), I mean an angle between \(0\) and \(\pi\) including \(0\) and \(\pi.\) To answer this question, we use the dot product, and this is one of the prominent usages of the dot product.
From the geometric definition of the dot product we have \[ \mathbf{a} \cdot \mathbf{b} = \| \mathbf{a} \| \mkern 1mu \| \mathbf{b} \| \mkern 1mu \cos\theta, \] where \(\theta\) is the angle angle formed by \(\mathbf{a}\) and \(\mathbf{b}.\) From this formula we calculate \[ \cos\theta = \frac{\mathbf{a} \cdot \mathbf{b}}{ \| \mathbf{a} \| \mkern 1mu \| \mathbf{b} \|}. \] Therefore, \[ \theta = \arccos \left(\frac{\mathbf{a} \cdot \mathbf{b}}{ \| \mathbf{a} \| \mkern 1mu \| \mathbf{b} \|} \right). \]
Inspired by the the fact that we used the arccos function above, I briefly reviewed the concept of a function. I wrote more about Functions on this webpage: Functions
Theorem (a characterization of a rectangle) (often used in carpentry).
Let \(ABCD\) be a parallelogram with vertices \(A, B, C, D\) and the diagonals \(\overline{AC}\) and \(\overline{BD}\), as pictured below. Then \(ABCD\) is a rectangle if and only if the diagonals \(\overline{AC}\) and \(\overline{BD}\) have the same length.
Proof (using vectors).
Theorem (a characterization of a rhombus)
Let \(ABCD\) be a parallelogram with vertices \(A, B, C, D\) and the diagonals \(\overline{AC}\) and \(\overline{BD}\), as pictured above. Then \(ABCD\) is a rhombus if and only if the diagonals \(\overline{AC}\) and \(\overline{BD}\) perpendicular.
Proof (using vectors).
Theorem of Thales. If \(A\), \(B\), and \(C\) are points on a circle with diameter \(\overline{AC}\), then the angle at \(B\) is a right angle. Conversely, if the triangle \(ABC\) is a right triangle with the right angle at \(B\), then \(B\) lies on the circle having \(\overline{AC}\) as its diameter.
Proof (using vectors). Assume that \(A\), \(B\), and \(C\) are points on a circle with diameter \(\overline{AC}.\) Our goal is to prove that the line segments \(\overline{BA}\) and \(\overline{BC}\) are orthogonal. The dot product of vectors is an efficient tool for proving orthogonality. Therefore we use vectors in this proof.
Since \(\overline{AC}\) is a diameter of a circle, this circle's center is the midpoint of \(\overline{AC}.\) Denote this midpoint by \(O.\) Introduce the following vectors by their representative oriented line segments: \[ \mathbf{a}=\overrightarrow{BA}, \quad \mathbf{b}=\overrightarrow{BC}, \quad \mathbf{c}=\overrightarrow{OC}, \quad \text{and} \quad \mathbf{d}=\overrightarrow{BO}. \] Since \(O\) is the midpoint of \(\overline{AC},\) we have that the vector \(-\mathbf{c}\) is represented by \(\overrightarrow{OA}.\) The following algebraic relationships hold among the vectors that we introduced: \begin{align*} \mathbf{a} & = \mathbf{d} - \mathbf{c} \\ \mathbf{b} & = \mathbf{d} + \mathbf{c}. \end{align*} Since \(O\) is the center of a circle and the points (A\), \(B\), and \(C\) are on that circle, we have \[ \| \mathbf{c} \| = \| \mathbf{d} \|. \] Our goal is to prove that the line segments \(\overline{BA}\) and \(\overline{BC}\) are orthogonal. To prove this, we calculate \begin{align*} \mathbf{a} \cdot \mathbf{b} & = (\mathbf{d} - \mathbf{c})\cdot (\mathbf{d} + \mathbf{c}) \\ & = \mathbf{d} \cdot \mathbf{d} - \mathbf{c} \cdot \mathbf{d} + \mathbf{d} \cdot \mathbf{c} - \mathbf{c} \cdot \mathbf{c} \\ & = \mathbf{d} \cdot \mathbf{d} - \mathbf{d} \cdot \mathbf{c} + \mathbf{d} \cdot \mathbf{c} - \mathbf{c} \cdot \mathbf{c} \\ & = \mathbf{d} \cdot \mathbf{d} - \mathbf{c} \cdot \mathbf{c} \\ & = \| \mathbf{d} \|^2 - \| \mathbf{c} \|^2 \\ & = 0. \end{align*} Consequently, the vectors \(\mathbf{a}\) and \(\mathbf{b}\) are orthogonal. Therefore, the line segments \(\overline{BA}\) and \(\overline{BC}\) are orthogonal. QED
An important application of dot product is the calculation of the orthogonal projection.
Here is my presentation of the orthogonal projection of \(\mathbf{v}\) onto \(\mathbf{u}.\) Given two nonzero vectors \(\mathbf{u}\) and \(\mathbf{v}\), the orthogonal projection of \(\mathbf{v}\) onto \(\mathbf{u}\) is a special scaling of \(\mathbf{u}\), call it \[ \alpha \mkern1mu \mathbf{u}, \qquad \alpha \in \mathbb{R}, \] which has a very special property that the difference \[ \mathbf{v} - \alpha \mkern1mu \mathbf{u} \qquad \text{is orthogonal to} \qquad \mathbf{u}. \] Using the orthogonality stated on the line above, we can calculate \(\alpha\) in terms of \(\mathbf{u}\) and \(\mathbf{v}\): \[ \bigl( \mathbf{v} - \alpha \mkern1mu\mathbf{u}\bigr) \cdot \mathbf{u} = 0. \] Now we use the distributivity property of the dot product to get \[ \mathbf{v} \cdot \mathbf{u} - \alpha \mkern1mu (\mathbf{u} \cdot \mathbf{u}) = 0. \] The last equality we can solve for \(\alpha\): \[ \alpha = \frac{\mathbf{v} \cdot \mathbf{u}}{\mathbf{u} \cdot \mathbf{u}}. \] Now we have the formula for the projection vector of \(\mathbf{v}\) onto \(\mathbf{u}\): \[ \operatorname{proj}_{\mathbf{u}} (\mathbf{v}) = \frac{\mathbf{v} \cdot \mathbf{u}}{\mathbf{u} \cdot \mathbf{u}}\ \mathbf{u} = \frac{\mathbf{v} \cdot \mathbf{u}}{\|\mathbf{u}\|^2} \ \mathbf{u} \]
Now I can write a formula in which we see what is so special about the projection vector of \(\mathbf{v}\) onto \(\mathbf{u}\): \[ \mathbf{v} = \underbrace{\frac{\mathbf{v} \cdot \mathbf{u}}{\|\mathbf{u}\|^2}\,\mathbf{u}}_{\text{a multiple of } \mathbf{u}} + \underbrace{ \left(\mathbf{v} - \frac{\mathbf{v} \cdot \mathbf{u}}{\|\mathbf{u}\|^2}\,\mathbf{u} \right)}_{\text{orthogonal to } \mathbf{u}} \]
Orthogonal projections \(\operatorname{proj}_{\mathbf{u}}(\mathbf{v})\) and related vectors in color
The animation below illustrates the fact that the projection \(\operatorname{proj}_{\mathbf{u}}(\mathbf{v})\) does not change when \(\mathbf{u}\) is replaced by its nonzero scalar multiple.
The Definition of the projection vector on page 140 is confused and formula (2.7) is wrong.
Below is a snippet from the definition.
Reading just the orange words, one does not know whether the projection is in the same direction as \(\mathbf{u}\), or it can also be in the opposite direction of \(\mathbf{u}.\) In fact, if the angle between \(\mathbf{v}\) and \(\mathbf{u}\) is acute, then the projection of \(\mathbf{v}\) onto \(\mathbf{u}\) points in the same direction as \(\mathbf{u}\), and if the angle between \(\mathbf{v}\) and \(\mathbf{u}\) is obtuse, then the projection of \(\mathbf{v}\) onto \(\mathbf{u}\) points in the opposite direction of \(\mathbf{u}.\) The projection of \(\mathbf{v}\) onto \(\mathbf{u}\) is always colinear with \(\mathbf{u}.\) That is, it is a scalar multiple of \(\mathbf{u}.\)
Below is the snippet of (2.7) on page 140:
This is quite confusing. When you look at (2.7), on the left-hand side is a vector, while on the right-hand side is a scalar. They cannot be equal. Vector can never equal a scalar. I was not familiar with the concept scalar projection of \(\mathbf{v}\) onto \(\mathbf{u}.\) Based on my research, the common meaning of the scalar projection of \(\mathbf{v}\) onto \(\mathbf{u}\) is not the length of the projection, but something a little more complicated. One could say that it is the signed length; it is the length when the projection and \(\mathbf{u}\) point in the same direction and it is the opposite of the length when the projection and \(\mathbf{u}\) point in different directions. Or it is the scalar that we use to scale the unit vector of \(\mathbf{u}\) to get the projection.
So, the book is wrong for using the word "length," in the sentence with (2.7), but the formula for \(\operatorname{comp}_{\mathbf{u}} \mathbf{v}\) is correct when one removes \(\operatorname{proj}_{\mathbf{u}} \mathbf{v}.\)
Here is an explanation of \(\operatorname{comp}_{\mathbf{u}} \mathbf{v}\), starting from the formula for \(\operatorname{proj}_{\mathbf{u}} \mathbf{v}\): \[ \operatorname{proj}_{\mathbf{u}}\mkern-5mu \mathbf{v} = \frac{\mathbf{v}\cdot\mathbf{u}}{\|\mathbf{u}\|^2}\,\mathbf{u} = \frac{\mathbf{v}\cdot\mathbf{u}}{\|\mathbf{u}\|}\,\left(\frac{1}{\|\mathbf{u}\|} \mathbf{u}\right) = \underbrace{ \frac{\mathbf{v}\cdot\mathbf{u}}{\|\mathbf{u}\|}}_{ \operatorname{comp}_{\mathbf{u}}\mkern-5mu\mathbf{v}} \mkern 7mu \underbrace{\left(\frac{1}{\|\mathbf{u}\|} \mathbf{u}\right)}_{\begin{array}{c} \text{the unit } \\ \text{vector of} \ \mathbf{u} \end{array}}. \]
Theorem about the diagonals in a deltoid. (A common name for a deltoid is a kite.)
If \(ABCD\) is a deltoid with vertices \(A, B, C, D\), as shown below, then its diagonals \(\overline{AC}\) and \(\overline{BD}\) are perpendicular.
Definition of a deltoid. A quadrilateral \(ABCD\) is called a deltoid if and only if it has two pairs of adjacent sides of equal length: \(|\overline{AB}| = |\overline{AD}|\) and \(|\overline{CB}| = |\overline{CD}|\).
Proof (using vectors). Step 1. Introduce the following vectors by their representative line segments: \(\mathbf{a} = \overrightarrow{AD}\), \(\mathbf{b} = \overrightarrow{AB}\), \(\mathbf{c} = \overrightarrow{CB}\), and \(\mathbf{d} = \overrightarrow{CD}.\) Assume that \(|\overline{AB}| = |\overline{AD}|\) and \(|\overline{CB}| = |\overline{CD}|.\) With the vectors introduced, we assume that \[ \|\mathbf{a}\| = \|\mathbf{b}\| \quad \text{and} \quad \|\mathbf{c}\| = \|\mathbf{d}\|. \] Since \[ \mathbf{a} - \mathbf{d} + \mathbf{c} - \mathbf{b} = \mathbf{0}, \] we have that \[ \mathbf{a} - \mathbf{b} = \mathbf{d} - \mathbf{c}, \] and \[ \mathbf{a} - \mathbf{d} = \mathbf{b} - \mathbf{c}. \] The oriented line segment \(\overrightarrow{AC}\) is on one diagonal and it represents the vector \(\mathbf{a} - \mathbf{d} = \mathbf{b} - \mathbf{c}.\) The oriented line segment \(\overrightarrow{BD}\) is on the other diagonal and it represents the vector \(\mathbf{a} - \mathbf{b} = \mathbf{d} - \mathbf{c}.\) Next we will calculate the dot product \[ (\mathbf{a} - \mathbf{b}) \cdot (\mathbf{a} - \mathbf{d}). \]
Step 2. Before calculating this dot product, we establish the following useful equality: \begin{align*} \mathbf{a} - \mathbf{b} & = \frac{1}{2} (\mathbf{a} - \mathbf{b}) + \frac{1}{2} (\mathbf{a} - \mathbf{b}) \\ & = \frac{1}{2} (\mathbf{a} - \mathbf{b}) + \frac{1}{2} (\mathbf{d} - \mathbf{c}) \\ & = \frac{1}{2} (\mathbf{a} + \mathbf{d}) - \frac{1}{2} (\mathbf{b} + \mathbf{c}). \end{align*} Hence we have \[ \mathbf{a} - \mathbf{b} = \frac{1}{2} (\mathbf{a} + \mathbf{d}) - \frac{1}{2} (\mathbf{b} + \mathbf{c}). \]
Step 3. We use this equality to calculate \begin{align*} (\mathbf{a} - \mathbf{b}) \cdot (\mathbf{a} - \mathbf{d}) & = \left(\frac{1}{2} (\mathbf{a} + \mathbf{d}) - \frac{1}{2} (\mathbf{b} + \mathbf{c}) \right) \cdot (\mathbf{a} - \mathbf{d}) \\ & = \frac{1}{2} (\mathbf{a} + \mathbf{d}) \cdot (\mathbf{a} - \mathbf{d}) - \frac{1}{2} (\mathbf{b} + \mathbf{c}) \cdot (\mathbf{a} - \mathbf{d}) \\ & = \frac{1}{2} \bigl( \|\mathbf{a}\|^2 - \|\mathbf{d}\|^2 \bigr) - \frac{1}{2} (\mathbf{b} + \mathbf{c}) \cdot (\mathbf{b} - \mathbf{c}) \\ & = \frac{1}{2} \bigl( \|\mathbf{a}\|^2 - \|\mathbf{d}\|^2 \bigr) - \bigl( \|\mathbf{b}\|^2 - \|\mathbf{c}\|^2 \bigr) \\ & = \frac{1}{2} \bigl( \|\mathbf{a}\|^2 - \|\mathbf{b}\|^2 \bigr) + \frac{1}{2} \bigl( \|\mathbf{c}\|^2 - \|\mathbf{d}\|^2 \bigr) \\ \end{align*} Since we assume \[ \|\mathbf{a}\| = \|\mathbf{b}\| \quad \text{and} \quad \|\mathbf{c}\| = \|\mathbf{d}\|, \] we have \[ (\mathbf{a} - \mathbf{b}) \cdot (\mathbf{a} - \mathbf{d}) = \frac{1}{2} \bigl( \|\mathbf{a}\|^2 - \|\mathbf{b}\|^2 \bigr) + \frac{1}{2} \bigl( \|\mathbf{c}\|^2 - \|\mathbf{d}\|^2 \bigr) = 0. \] Thus, the vectors \(\mathbf{a} - \mathbf{b}\) and \(\mathbf{a} - \mathbf{d}\) are orthogonal. Therefore, the diagonals are orthogonal. QED.
Varignon's Theorem
Let \(ABCD\) be a quadrilateral with vertices \(A, B, C, D\) and denote by \(A'\) the midpoint of \(\overline{AB}\), by \(B'\) the midpoint of \(\overline{BC}\), by \(C'\) the midpoint of \(\overline{CD}\), and by \(D'\) the midpoint of \(\overline{DA}\), as pictured below. Then \(A'B'C'D'\) is a parallelogram.
Proof (using vectors). Step 1. Introduce the following vectors by their representative line segments: \(\mathbf{a} = \overrightarrow{AB}\), \(\mathbf{b} = \overrightarrow{BC}\), \(\mathbf{c} = \overrightarrow{CD}\), and \(\mathbf{d} = \overrightarrow{DA}.\) Since \[ \mathbf{a} + \mathbf{b} + \mathbf{c} + \mathbf{d} = \mathbf{0}, \] we have that \[ \mathbf{b} + \mathbf{c} = - (\mathbf{a} + \mathbf{d}), \] (this vector is represented by the oriented line segment \(\overrightarrow{BD}\)) and \[ \mathbf{a} + \mathbf{b} = -(\mathbf{c} + \mathbf{d}). \] (this vector is represented by the oriented line segment \(\overrightarrow{AC}\)).
Step 2. In triangle \(AA'D'\) we have that the oriented line segment \(\overrightarrow{A'D'}\) represents the vector \(-\frac{1}{2}(\mathbf{a} + \mathbf{d}).\) In triangle \(CB'C'\) we have that the oriented line segment \(\overrightarrow{B'C'}\) represents the vector \(\frac{1}{2}(\mathbf{b} + \mathbf{c}).\) Since we have \[ -\frac{1}{2}(\mathbf{a} + \mathbf{d}) = \frac{1}{2}(\mathbf{b} + \mathbf{c}), \] the oriented line segments \(\overrightarrow{A'D'}\) and \(\overrightarrow{B'C'}\) are equivalent. That is they are parallel and have equal length.
Step 3. In triangle \(BA'B'\) we have that the oriented line segment \(\overrightarrow{A'B'}\) represents the vector \(\frac{1}{2}(\mathbf{a} + \mathbf{b}).\) In triangle \(DD'C'\) we have that the oriented line segment \(\overrightarrow{D'C'}\) represents the vector \(-\frac{1}{2}(\mathbf{d} + \mathbf{c}).\) Since we have \[ -\frac{1}{2}(\mathbf{c} + \mathbf{d}) = \frac{1}{2}(\mathbf{a} + \mathbf{b}), \] the oriented line segments \(\overrightarrow{A'B'}\) and \(\overrightarrow{D'C'}\) are equivalent. That is they are parallel and have equal length.
Conclusion. We proved that the line segments \(\overline{A'D'}\) and \(\overline{B'C'}\) are parallel and have equal length and that the line segments \(\overline{A'B'}\) and \(\overline{D'C'}\) are parallel and have equal length. Therefore, \(A'B'C'D'\) is a parallelogram. QED.
A compass rose is a common tool for communicating direction in navigation. The compass rose below shows 32 distinct directions. Here are the names of all 32 directions:
| E - East | EbN - East-by-North | ENE - East-Northeast | NEbE - Northeast-by-East |
| NE - Northeast | NEbN - Northeast-by-North | NNE - North-Northeast | NbE - North-by-East |
| N - North | NbW - North-by-West | NNW - North-Northwest | NWbN - Northwest-by-North |
| NW - Northwest | NWbW - Northwest-by-West | WNW - West-Northwest | WbN - West-by-North |
| W - West | WbS - West-by-South | WSW - West-Southwest | SWbW - Southwest-by-West |
| SW - Southwest | SWbS - Southwest-by-South | SSW - South-Southwest | SbW - South-by-West |
| S - South | SbE - South-by-East | SSE - South-Southeast | SEbS - Southeast-by-South |
| SE - Southeast | SEbE - Southeast-by-East | ESE - East-Southeast | EbS - East-by-South |
The role played by the compass rose in navigation, when we deal with vectors, is played by the unit circle in a coordinate plane.
Below, I show the unit circle with 32 marked angles and their corresponding cosine and sine values. Note, however, that the angles on the compass rose are not the same as those on the unit circle: the compass rose uses angles that are multiples of \(\pi/16\) (11.25 degrees).
For example, if we are given a vector \(\bigl\langle -1, \sqrt{3} \bigr\rangle\), we calculate the corresponding unit vector in the same direction: \(\bigl\langle -1/2, \sqrt{3}/2 \bigr\rangle\); then calculate the angle \(\theta\) such that \(\cos\theta = -1/2, \sin\theta = \sqrt{3}/2\); that is the angle \(\theta = 2 \pi /3\) and we state that the direction of the vector \(\bigl\langle -1, \sqrt{3} \bigr\rangle\) makes the angle of \(2 \pi /3\) with the positive direction of the horizontal axes.
Although it is not done in mathematics classes, having calculated this angle \(2 \pi /3\) or 120 degrees, we can calculate: \(120/11.25 \approx 10.6667.\) This tells us that the vector \(\bigl\langle -1, \sqrt{3} \bigr\rangle\) points in the direction between NNW - North-Northwest and NWbN - Northwest-by-North, a little closer to Northwest-by-North.
Click here to get a pdf file.
Mouse-over the image for a simplified version.
Here are a compass rose and the unit circle next to each other:
I made some progress on calculating volume of an antiprism whose base is a regular \(n\)-gon and with a given height. For that I use the fact that such an antiprism can be split in \(2n\) congruent pieces. Then I just calculate the volume of one piece. It is like advising a birthday party how to cut a cake in the shape of an antiprism.
Hover your mouse over an image to see the animation.
| 2-dimensional object: | square | rectangle | parallelogram | circle | disk | ellipse |
| 3-dimensional object: | cube | cuboid | parallelepiped | sphere | ball | ellipsoid |
Most of these terms should be familiar. I would like to point out the less frequently used term the cuboid, which is the 3D version of a rectangle. You may know it as a "rectangular prism" or a box 📦. I like the term cuboid; the reason: so prominent object should have its own name, not share it with the family of prisms or a packaging item. The cuboid is a convenient, concise and precise name for a very common shape. As the picture below shows, the term cuboid is used in geometry instruction, even at the primary school level.