9. The chain rule
The chain rule: and informal treatment
Earlier in these notes the idea of differentiating a function of the form
was discussed.
This function can be thought of as a function of a function, in the sense that there is a function, "f", such that
, and a function, "g", such that
.
The chain rule is an attempt to generalise this such that it is possible to differentiate a function in the form:
(where each function is differentiable).
The idea of the chain rule lies in the delegation of a statement to a letter.
Consider the example:
.
In this case a candidate might naturally think to let
, hence the problem becomes:
.
Notice the result which is known (from the previous notes), and the derivative of this new component.
It seems that if
,
.
(Note that if the derivatives were fractions this would be true [this is often a good memory technique], however, the derivatives should not be thought of as fractions [the notation is coincidental]).
Considering this process as rates of change can also be interesting.
The change in "y" with respect to "x" is
.
The change in "y" with respect to "u" is
.
The change in "u" with respect to "x" is
.
Immediately it seems that the aforementioned connection should be drawn.
To more formally state the definition:
.
Deriving the chain rule
The idea (met in Core 1) of
was the instantaneous change in "y" with respect to "x". The idea of
was used as a precursor; the average change in "y" with respect to "x", representing the gradient of a secant line. Upon limitation, the following is defined:
, as the closer the secant line gets to a point, the closer it gets to the same gradient as the point.
The treatment of the average (using delta) can be one of a fraction, as there is a division, hence:
.
This arises from a small increase in "x" resulting in a corresponding increase in "u", and finally an increase in "y".
Using limits the derivative can be found:
.
Two assumptions are made at this level;
, and
.
Hence:
Hence:
.
An application of the chain rule
The chain rule can be used to extend the already demonstrated (in Core 1) rule that
from
to
.
.
Hence:
.
Related rates of change
In many cases the relationship between area, volume and time can enable a candidate to use the chain rule to deduce the rate of change of a quantity.
Example
1. Ripples in a pond are such that they form perfect circles, the outer-most of which has a radius that changes at a rate of
; calculate the rate at which the areachange at the time when the radius is 100cm.
Consider that
, and
.
(As the radius is known).
Hence:
.
Be sure to use the correct units (and quote them in the answer).
The relation between
, and 
Sometimes it is useful to know what the rate of change of "x" with respect to "y". One example is when the candidate wishes to calculate the points at which the gradient of a curve is parallel to the y-axis, as this would imply an undefined rate of change of "y" with respect to "x", but a zero rate of change of "x" with respect to "y".
It is useful to consider a simple example:
Hence:
.
It therefore seems that the relationship is:
.
Consider the very basic technique of finding the gradient of a curve at a point, draw a tangent, and calculate the gradient of the straight line.
If there is a graph of
, the gradient of the tangent at a point is equal to the tangent of the angle that this tangent makes with the x-axis.
Consider a function of "y":
.
Following a convention, the "y" would be on the bottom axis (as it is independent), and the "x" would go upwards.
Similarly, the angle which the tangent at a point makes with the x-axis produces the gradient of the tangent when the tangent function is applied to it.
Assuming that the functions have been chosen such that
, the axes of the graph of
can be switched around to form what is the same graph as
.
The angles written of earlier now will add to make
.
Hence, the tangent of one of these (the gradient of the tangent at the point) is equal to the cotangent of the other. Therefore the relationship is true for the cases when
.