Parametric Curves and Differentiation
Parametric Curves and Differentiation
Corresponding material in the book: Section 10.1, 10.2. We are omitting the topic of surface area
mentioned at the end of Section 10.2 of the book.
What students should definitely get: Parametric descriptions of curves, relationship with functional
and relational descriptions, domain of parameters, parametric differentiation, arc length.
What students should hopefully get: Distinction between curve and parameterization, self-intersections,
going back and forth between parametric and relational descriptions, the subtletly behind higher derivatives.
Executive summary
Words ...
(1) A parametric description of a curve is one where both coordinates are expressed as functions of
of a parameter, typically denoted t. Parametric descriptions offer an alternative to functional and
implicit (relational) descriptions of curves. Here, t varies over some subset of the real numbers. In
symbols, we have something like x = f (t), y = g(t), where t varies over some subset D of the real
numbers.
(2) Descriptions where x is a function of y or y is a function of x are special cases of parametric
descriptions.
(3) The same curve may admit multiple parametrizations, and different parameterizations may corre-
spond to different speeds and different orderings of traversal of the point. The curve itself only
contains the information of what points were traversed, not the information of the sequence and pace
in which they were traversed.
(4) The curve traced by a parameterization depends not only on the coordinate functions but also the
domain for the parameter. The larger the domain, in general, the larger the curve traced. However,
in some cases, expanding the domain may not make the curve strictly larger. This happens in cases
where both coordinate functions are even or have commensurable periods.
(5) A parameterization of a curve may involve self-intersections, retracings (e.g., tracing back for even
function pairs), or even wrapping around itself (for periodic function pairs).
(6) Function composition allows us to switch between multiple parameterizations.
(7) In some cases, it is possible to move back and forth between parametric and relational descriptions.
(8) Parametric differentiation: if x = f (t) and y = g(t), then dy/dx = (dy/dt)/(dx/dt) = g 0 (t)/f 0 (t).
This can also be used to differentiate repeatedly. Note that the derivative is a function of t rather
than of (x, y), so to find the derivative given the point (x, y) we need to go back and determine t.
1
(9) Higher derivatives can be computed iteratively using parametric differentiation. But note that it is
not true that d2 y/dx2 = (d2 y/dt2 )/(d2 x/dt2 ). The actual formula/procedure is more complicated
(see lecture notes).
Rbp
(10) Arc length: The formula for arc length from t = a to t = b (with a < b) is a (dx/dt)2 + (dy/dt)2 dt.
Actions ...
(1) Parametric to relational: elimination of parameter: In many cases, it is possible to eliminate a pa-
rameter from a parametric description. The idea is to use some well known identities or manipulation
techniques to try to directly relate x and y by finding some equation between them that is true for
all t. However, this is not the full story. We next need to see if there are additional restrictions on
x and y deducible from the fact that they arose as function of t, also keeping in mind the domain
restrictions on t.
For instance, the parameterization x = t2 , y = t4 for t ∈ R can be rewritten as y = x2 , but we
need the additional condition that x ≥ 0.
See more examples in the lecture notes, quizzes, and homeworks.
(2) Relational to parametric: Here, we see a relation between x and y, and try to choose a parametric
description that would give rise to the relation. Again, the domain of choice for the parameter needs
to be chosen wisely.
See more examples in the lecture notes and quizzes.
(3) Parametric differentiation and geometric consequences: We use the formula (dy/dt)/(dx/dt). If
x = f (t) and y = g(t), then this becomes g 0 (t)/f 0 (t). This is valid for all t in the interior of the
domain of definition where both f 0 and g 0 are defined and f 0 6= 0. If f 0 (t) = 0 but g 0 (t) 6= 0, we have
a vertical tangent situation. If g 0 (t) = 0 but f 0 (t) 6= 0, we have a horizontal tangent situation.
1.2. A new approach: a parameter. In addition to functional and relational descriptions of curves, there
is a third kind of description called a parametric description. Here, we find functions f and g and a parameter
t and write:
x = f (t), y = g(t), t ∈ D
where D is a subset of R. The curve traced by this parametric description is precisely the set of points:
cos2 t + sin2 t = 1 ∀ t ∈ R
Thus, we get that x and y satisfy the relation:
x2 + y = 1
Note that in this case, we see that y is in fact a function of x:
y = 1 − x2
However, and this is the key point, there may well be a loss of information when we move to the relational
description. Specifically, the parameterization using t imposes the constraints that −1 ≤ x ≤ 1 and 0 ≤ y ≤ 1.
On the other hand, the relation x2 + y = 1 covers a larger swath of possibilities for x and y. Thus, we need
to explicitly include the restrictions based on the range of possible values for x and y, in this case, the
restriction that x ∈ [−1, 1]. Since y is determined based on x, the additional restriction on the value of y is
not necessary.
Here are some other examples:
• x = cosh t, y = sinh t, t ∈ R. We obtain that x2 = 1 + y 2 , with the additional
p restriction that x > 0
2
(and no restriction on y). Alternatively, we can express this as x = 1 + y . If we write it in this
form, the additional restriction is unnecessary.
4
• x = cos t, y = cos(2t), t ∈ R. We obtain that y = 2x2 − 1, with the additional restriction that
x ∈ [−1, 1].
• x = t3 , y = t7 . We can rewrite this as x7 = y 3 , with no additional restrictions on x and y. Note that
the situation becomes a little more tricky if we have even exponents instead of the odd numbers 3
and 7.
• x = t4 , y = t2 + 1, t ∈ R. We get x = (y − 1)2 , with the restriction that y ≥ 1.
• x = t2 , y = t3 , t ∈ [1, 2]. We get x3 = y 2 , with y ∈ [1, 8].
3.2. Proceeding from a relational to a parametric description. Converting a relational description
to a nice parametric description is tricky, and occasionally requires the invention of new branches of math-
ematics. For instance, trigonometry is the branch of mathematics that was developed to provide a nice
parametric description for the circle x2 + y 2 = 1. There are some simple cases where parametric descriptions
are easy to construct, but these are usually exceptions.
For instance, for x2 − y 2 = 1, x > 0, we could take the parametric description x = cosh t and y = sinh t
for t ∈ R or the parametric description x = sec t and y = tan t for t ∈ (−π/2, π/2).
sin(x + y) = xy
We differentiate both sides:
dy dy
cos(x + y) + cos(x + y) =x +y
dx dx
Now, we move stuff together to one side, to get:
dy
(cos(x + y) − x) = y − cos(x + y)
dx
And we now isolate dy/dx:
dy y − cos(x + y)
=
dx cos(x + y) − x
Notice first that the right side is a mixed expression in both x and y, rather than an expression purely in
terms of x. This is to be expected since y is not explicitly a function of x.
5
But something subtler is going on. When we do differentiation with the English letter “d” then that
notation makes sense only if y is (locally) a function of x. If it isn’t, then the answer above notwithstanding,
the derivative dy/dx does not exist.
Here’s another example: the circle of radius 1 centered at the origin. This is given by the equation
x2 + y 2 = 1. Note that in this case, y is not a function of x, because for many values of x, there √ are two
values√of y. For instance, for x = 0, we have y = 1 and y = −1. For x = 1/2, we have y = 3/2 and
y = − 3/2. So, y is not a function of x.
However, locally y is still a function of x, in the following sense. If you just restrict
√ yourself to the part
above the x-axis, then you do get y as a function of x. This is the function y := √ 1 − x2 for −1 ≤ x ≤ 1. If
we restrict ourselves to the part below the x-axis, we consider the function y := − 1 − x2 for −1 ≤ x ≤ 1.
Now, how do we calculate dy/dx? Well, it depends on whether we are interested in the√part above the
x-axis or in the part below the x-axis. For the part above the x-axis, we have the function 1 − x2 , and we
get that the derivative is:
√ √
d( 1 − x2 ) d( 1 − x2 ) d(1 − x2 ) 1 −x
= = √ · (2x) = √
dx d(1 − x2 ) dx 2 1−x 2 1 − x2
√
If we are interested in the lower side, we get x/ 1 − x2 .
Now, in this case, we have to split into two cases, and do a painful calculation involving differentiating a
square root via the chain rule.
Let’s now consider how to use implicit differentiation to do the same example.
We start with the original expression:
x2 + y 2 = 1
This is an identity, which means that it’s true for every point on the curve. When we have an equation
that is identically true, it is legitimate to differentiate both sides and still get an identity. Differentiating
both sides with respect to x, we get:
d(x2 ) d(y 2 )
+ =0
dx dx
Simplifying and using the chain rule, we get:
dy
2x + 2y =0
dx
We thus get:
dy −x
=
dx y
√
2
Notice
√ √ we get −x/y, which
that with this method, √ works in both cases. When y = 1 − x , we get
2 2 2
−x/ 1 − x , and when y = − 1 − x , we get x/ 1 − x . The method that we used is called implicit
differentiation.
So, with a functional description, we need to split into cases. With an implicit description, we can give
the general answer in one case, but it involves both variables x and y. With these examples in mind, let’s
consider the parametric situation.
4.2. Parametric differentiation. For the curve given parametrically by x = f (t), y = g(t), t ∈ D, where
f and g are both differentiable at some point t0 in the interior of D, the derivative dy/dx is given by:
dy dy/dt g 0 (t)
|t=t0 = |t=t0 = 0 |t=t0
dx dx/dt f (t)
In short:
dy g 0 (t)
= 0
dx f (t)
6
This is an application of the chain rule, or thinking of the derivative as the relative rate of change.
Basically, we are saying that the relative rate of change of y with respect to x at time t0 is the rate of change
of y with respect to time divided by the rate of change of x with respect to time. That makes sense.
Also note that the answer that we get is purely a function of t, and x and y do not appear in the description.
This is important at many levels. The first is that it takes account of self-intersections. Remember that
given a parametric description, the same point on the curve can be repeated at different times, with the
curve moving in different directions at these different times. Since the differentiation is with respect to t, we
can thus get different derivative values at different times for the same physical point location.
Second, if given a point (x, y), we first need to reverse engineer and find the time t that gave rise to the
point (x, y) and then plug in the formula to find the derivative value.
Let’s apply this to the circle x2 + y 2 = 1 using the parameterization x = cos t, y = sin t. We get:
dy sin0 t cos t
= = = − cot t
dx cos0 t − sin t
√
Now, suppose we want to find the derivative value at the point (1/2, 3/2). We first need to determine
that this point arises from t = 2nπ + π/3. Note that since the curve repeats periodically,
√ the choice of n is
irrelevant, so we can choose t = π/3. Then, the derivative value is − cot(π/3) = −1/ 3.
You can compare this to the derivative value obtained using implicit differentiation – −x/y – and see that
they are in fact the same.
4.3. Vertical/horizontal tangents and cusps. Here x = f (t), y = g(t), and t ∈ D. Even if f and g are
both differentiable functions everywhere on the domain D, it is still possible for there to be vertical/horizontal
tangents and cusps. Below are given some sufficient conditions for a horizontal/vertical tangent.
• Horizontal tangent: This ocurs when g 0 (t) = 0 and f 0 (t) 6= 0. Note that the horizontal tangent itself
could be of many kinds – a local extremum type or a point of inflection type, or something different.
• Vertical tangent: This occurs when f 0 (t) = 0 and g 0 (t) 6= 0. Again, there are many possibilities for
this.
If both f 0 (t) and g 0 (t) are zero, then we need to determine which is more zeroey, and also whether there
is a sign change, etc. to determine whether we have a horizontal/vertical tangent/cusp. We refrain from the
details here.
4.4. Repeated differentiation. Note that the second derivative d2 y/dx2 is the derivative of the first
derivative and the roles of x and y are not symmetric. In particular, the freshman chain rule does not work
for second derivatives – we need something more sophisticated. Some of you have worked out chain rule
equivalents for second derivatives, e.g., we have that (f ◦ g)00 = (f 00 ◦ g) · (g 0 )2 + (f 0 ◦ g) · (g 00 ) and also that
(f −1 )00 = −(f 00 ◦ f −1 )/(f 0 ◦ f −1 )3 . You don’t need to memorize these, since they can always be derived from
the rules for first derivatives. The key point is that the naive chain rules are false. In particular, d2 y/dx2 is
not the same as (d2 y/dt2 )/(d2 x/dt2 ).
So how do we differentiate twice in practice? We first compute the first derivative in parametric form,
and then differentiate that with respect to x, again using parametric differentiation. Specifically, if x = f (t)
and y = g(t), we have:
dy g 0 (t)
= 0
dx f (t)
Thus, we get:
g 0 (t)
d
d2 y dt f 0 (t)
=
dx2 f 0 (t)
where the denominator is just dx/dt. The same idea can be used to calculate higher derivatives. Note
that we could simplify the above as a general expression using the quotient rule for differentiation, and thus
get another version of the same formula:
7
d2 y f 0 (t)g 00 (t) − g 0 (t)f 00 (t)
=
dx 2 (f 0 (t))3
It’s up to you which form you choose to remember. I recommend that for better conceptual understanding
and generalizability, you definitely remember the first version, since the second version can easily be derived
from it on the spot.
4.5. Arc length, speed, and unit speed parameterization. This is new stuff, in the sense that you
have probably not seen this even in the context of functional descriptions. It turns out that the description
is easier to give in parametric form anyway. Basically, the arc length is given by:
Z p
(dx)2 + (dy)2
More concretely, if x = f (t), y = g(t), then the arc length from time t = a to t = b (with a < b) is given
by:
Z b p
(dx/dt)2 + (dy/dt)2 dt
a
or equivalently:
Z b p
(f 0 (t))2 + (g 0 (t))2 dt
a
The speed of a curve is defined as the speed with which it is traveling with respect to t, and is hence
simply:
p
(dx/dt)2 + (dy/dt)2
In other words, if x = f (t) and y = g(t), the speed as a function of time t is:
p
(f 0 (t))2 + (g 0 (t))2
We say that a particular parameterization of a curve is a unit speed parameterization if the speed at any
time t is 1. What this means is that the arc length from t = a to t = b is simply b − a.
An example of a unit speed
√ parameterization
√ is the parameterization x = cos t, y = sin t for a circle.
Another example is x = t/ 2, y = t/ 2 for the line y = x.
Note that for the particular case where y is a function of x, the arc length from x = a to x = b, with
a < b, becomes:
Z b p
1 + (dy/dx)2 dx
a
Note on the difficulty of arc length computations. Actually computing the arc length as a number
is not easy in most cases because the integration that we need to do is not possible within the world of
elementary integrations. In fact, whole new branches of mathematics (such as the theory of elliptic integrals)
were invented to be able to perform some of the integrations that arose in arc length computations.
Roughly, here are the kinds of curves for which arc lengths can be computed:
• Straight lines (yeah, and we didn’t even need calculus for those)
• Circles (again, no need for calculus here – the radian measure directly defines arc length).
• Parabolas such as x = y 2 and y = x2 . To compute the arc length, we need to do a trigonometric or
hyperbolic trigonometric substitution. The final answer we get involves something like integration
sec3 θ: For instance, consider y = x2 . We get:
Z p
1 + (2x)2 dx
• The graph of the hyperbolic cosine function (catenary): The derivative of cosh is sinh, so we get:
Z b q
1 + (sinh2 x) dx
a
This becomes:
Z b
cosh x dx
a
giving [sinh x]ba = sinh b − sinh a.
While this list above is by no means exhaustive, most of the computable examples you will see will be
variations on this theme.