Section 11.9 Curvature and the Second Derivative Test
¶The argument presented here requires some familiarity with the properties of eigenvalues and eigenvectors of symmetric matrices.
Curvature.
Recall that the curvature of a parametric curve \(\rr(t)\) is given by
This expression is independent of the parameter used, so choose an arclength parameterization, in which case \(\vv\) is just the unit tangent vector \(\TT\) and \(|\vv|=1\text{,}\) so that
Parametric curve on a graph.
Now consider the graph of \(z=f(x,y)\text{.}\) The position vector from the origin to any point on this surface takes the form
We can obtain a curve on this surface by specifying a relationship between \(x\) and \(y\text{.}\) In particular, suppose that
so that
which leads to
Using the master formula, we have
so that
Taking another derivative yields
where the column vector \(v\) and the matrix \(J\) are defined by the last equality.
Curvature at a critical point.
Now suppose \((a,b)\) is a critical point, so that
Then the velocity reduces to
which is a unit vector. Furthermore, \(\vv\perp\aa\text{,}\) so that the curvature becomes simply
If we drop the absolute value sign, we obtained a notion of “signed curvature”, which allows us to distinguish concave up (positive) from concave down (negative).
Thus, at a critical point, \(v^T J v\) measures the concavity in the direction \(v\text{.}\)
Maximum and minimum concavity.
We can now ask for the maximum and minimum values of the concavity. Using the product rule, we have
But the last two terms are the same, since \(J\) is symmetric. Thus, at a max or min, we must have \(\frac{dv^T}{dt} Jv = 0\text{,}\) or equivalently \(\frac{dv}{dt}\perp Jv\text{.}\)
On the other hand, since \(v\) is a unit vector, \(v^T v=1\text{,}\) and a similar argument now shows that \(\frac{dv^T}{dt} v=0\text{,}\) or equivalently \(\frac{dv}{dt}\perp v\text{.}\)
Thus, both \(v\) and \(Jv\) are perpendicular to \(\frac{dv}{dt}\text{,}\) and in two dimensions this can only happen if \(Jv\parallel v\text{.}\) In other words, at a max or min of the concavity, \(v\) must be an eigenvector of \(J\text{!}\)
But \(J\) is a symmetric matrix, which implies first of all that \(J\) admits two real eigenvalues, \(\lambda_\pm\text{,}\) and second that the corresponding eigenvectors are perpendicular. When classifying a critical point, it is enough to consider these two directions. If the graph is concave down/up in both directions, the critical point corresponds to a local max/min, respectively. If it is concave up in one direction, but concave down in the other, then it is a saddle point. If one or both concavities are zero, anything can happen.
Thus, in order to classify a critical point, one needs to know the product of the concavities, or equivalently the product of the eigenvalues. But the product of the eigenvalues of a matrix is precisely its determinant.
Second Derivative Test.
This, finally, brings us to the second derivative test for classifying critical points. Determine the matrix \(J\) of partial derivatives, evaluated at the critical point. Compute the determinant \(D=\det(J)\) of this matrix.
If \(D>0\text{,}\) the critical point is either a local min or a local max, depending on the sign of the concavity in any direction.
If \(D\lt 0\text{,}\) the critical point is a saddle point.
If \(D=0\text{,}\) anything can happen.
In the first case, the standard method for distinguishing between a max and a min is to examine the sign of the \((1,1)\) element of \(J\text{,}\) which gives the concavity in the \(x\)-direction.