Derivative - inverse function – Serlo

In this article we investigate the conditions under which the inverse function of a bijective function is differentiable at a point. We also derive a formula that gives the derivative of the inverse function explicitly. The practical benefit of this formula is that it lets us determine the derivative at certain points even if we do not know the inverse function explicitly, or if it is very difficult to compute.

Motivation

Let us first consider a linear function as an example. For this it is very easy to determine the derivative of the inverse function. Non-constant linear functions are bijective and therefore invertible on $\mathbb {R}$ . In this case we can calculate the inverse function explicitly and differentiate it. Concretely we choose $f:\mathbb {R} \to \mathbb {R}$ with $f(x)=2x-1$ . The inverse function is

f^{-1}:\mathbb {R} \to \mathbb {R} :f^{-1}(y)={\tfrac {1}{2}}y+{\tfrac {1}{2}}

$f^{-1}$ is differentiable on $\mathbb {R}$ and $(f^{-1})'(y)={\frac {1}{2}}$ for all $y\in \mathbb {R}$ .

Let us next consider the function $f(x)=x^{2}$ . Here we have to be careful, because it is not injective on all of $\mathbb {R}$ and therefore not invertible. But if we restrict the domain of definition to $\mathbb {R} _{0}^{+}$ , then $f:\mathbb {R} _{0}^{+}\to \mathbb {R} _{0}^{+},f(x)=x^{2}$ is bijective. The inverse function is the square root function

f^{-1}:\mathbb {R} _{0}^{+}\to \mathbb {R} _{0}^{+}:f^{-1}(y)={\sqrt {y}}

For differentiability there is one more thing to consider: $f^{-1}$ is not differentiable at $y=0$ . We can show this by examining the difference quotient. Alternatively, we can argue as follows:

Since the root function $f^{-1}$ is the inverse function of the square function $f$ , we have $f^{-1}\circ f={\text{id}}$ . At zero this gives in particular

\underbrace {f^{-1}(f(0))} _{=f^{-1}(0)}={\text{id}}(0)

If $f^{-1}$ were differentiable at 0, then the chain rule would yield

\underbrace {(f^{-1})'(f(0))\cdot \overbrace {f'(0)} ^{=0}} _{=0}=\underbrace {{\text{id}}'(0)} _{=1}

This is a contradiction, so $f^{-1}$ cannot be differentiable at 0. However, on $\mathbb {R} ^{+}$ , the function $f^{-1}$ is differentiable, with

(f^{-1})'(y)={\tfrac {1}{2{\sqrt {y}}}}

This example shows that if $f'(x)=0$ , it may happen that $f^{-1}$ is not differentiable at $f(x)$ , even though $f$ is differentiable everywhere.

In the two examples it was relatively easy to determine the derivative of the inverse function directly, because the inverse was a linear function or a square root. But what about more complicated functions, for example $\ln$ as the inverse function of $\exp$ ? Here we cannot simply calculate the derivative of the inverse function if only the derivatives of exponentials and polynomials are known. It may even happen that a bijective function cannot be inverted explicitly. In these cases it would be good to have a general formula that determines the derivative of $f^{-1}$ from the derivative of $f$ . If we look again at the derivative from the second example, we notice the following:

(f^{-1})'(y)={\tfrac {1}{2{\sqrt {y}}}}={\tfrac {1}{2f^{-1}(y)}}={\tfrac {1}{f'(f^{-1}(y))}}

This holds because $f^{-1}(y)={\sqrt {y}}$ for all $y\in \mathbb {R} ^{+}$ and $f'(x)=2x$ for all $x\in \mathbb {R} ^{+}$ . In the first example (the linear function) we also have

(f^{-1})'(y)={\tfrac {1}{2}}={\tfrac {1}{f'(f^{-1}(y))}}
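The pattern in both examples can also be checked numerically. The following Python sketch is not part of the original argument: the helper `central_difference`, the step size `h` and the sample points are illustrative assumptions. It compares a finite-difference estimate of $(f^{-1})'(y)$ with the value ${\tfrac {1}{f'(f^{-1}(y))}}$ .

```python
import math

def central_difference(g, y, h=1e-6):
    """Approximate g'(y) by the symmetric difference quotient."""
    return (g(y + h) - g(y - h)) / (2 * h)

# Example 1: f(x) = 2x - 1 with inverse f^{-1}(y) = y/2 + 1/2 and f'(x) = 2.
def lin_inv(y):
    return 0.5 * y + 0.5

def lin_prime(x):
    return 2.0

# Example 2: f(x) = x^2 on the positive reals with inverse sqrt and f'(x) = 2x.
sq_inv = math.sqrt

def sq_prime(x):
    return 2.0 * x

# Sample points y are arbitrary choices inside the domain of each inverse.
for name, inv, prime, y in [("linear", lin_inv, lin_prime, 3.0),
                            ("square", sq_inv, sq_prime, 4.0)]:
    numeric = central_difference(inv, y)   # estimate of (f^{-1})'(y)
    formula = 1.0 / prime(inv(y))          # 1 / f'(f^{-1}(y))
    print(f"{name}: numeric {numeric:.6f}, formula {formula:.6f}")
```

Up to the accuracy of the finite-difference estimate, both columns should agree. Such a check is of course no proof, it only makes the conjecture plausible.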

Can this be a coincidence? Actually, it is not: the formula holds in general. Suppose $f:D\to W$ is bijective and differentiable at ${\tilde {x}}\in D$ , and its inverse $f^{-1}:W\to D$ is differentiable at ${\tilde {y}}=f({\tilde {x}})\in W$ . By definition of the inverse function,

y=f(f^{-1}(y))

for all $y\in W$ . Now we differentiate both sides at ${\tilde {y}}$ and obtain by the chain rule:

1=f'(f^{-1}({\tilde {y}}))\cdot (f^{-1})'({\tilde {y}})

Here we have used that $f$ is differentiable at $f^{-1}({\tilde {y}})={\tilde {x}}$ and $f^{-1}$ is differentiable at ${\tilde {y}}$ . Now we divide both sides by $f'(f^{-1}({\tilde {y}}))$ (note: this is only possible if the expression is not equal to zero) and get

(f^{-1})'({\tilde {y}})={\frac {1}{f'(f^{-1}({\tilde {y}}))}}

or equivalently

(f^{-1})'(f({\tilde {x}}))={\frac {1}{f'({\tilde {x}})}}
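To illustrate why this formula is useful when no closed form of the inverse is available, here is a small Python sketch. The function $f(x)=x+e^{x}$ , the bisection helper `inverse_by_bisection` and all tolerances are assumptions chosen only for illustration, they do not appear in the text above.

```python
import math

def f(x):
    return x + math.exp(x)          # strictly increasing, no elementary inverse

def f_prime(x):
    return 1.0 + math.exp(x)

def inverse_by_bisection(func, y, lo, hi, tol=1e-13):
    """Solve func(x) = y for a strictly increasing func with func(lo) <= y <= func(hi)."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if func(mid) < y:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

y_tilde = 1.0                                    # f(0) = 0 + e^0 = 1
x_tilde = inverse_by_bisection(f, y_tilde, -5.0, 5.0)
formula = 1.0 / f_prime(x_tilde)                 # 1 / f'(f^{-1}(y~))

# Independent check: symmetric difference quotient of the numerical inverse.
h = 1e-5
numeric = (inverse_by_bisection(f, y_tilde + h, -5.0, 5.0)
           - inverse_by_bisection(f, y_tilde - h, -5.0, 5.0)) / (2 * h)
print(x_tilde, formula, numeric)
```

The formula needs only one value of the inverse, namely ${\tilde {x}}=f^{-1}({\tilde {y}})$ , together with $f'$ . The difference quotient in the last step serves purely as a plausibility check.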

So the formula holds in general under certain conditions. The question now is under which conditions on $f$ the derivative of $f^{-1}$ exists.

First, $f^{-1}$ must exist. This is exactly the case if $f$ is bijective, which holds in particular if $f$ is surjective and strictly monotone.
As we have seen above, $f$ must be differentiable at the point ${\tilde {x}}=f^{-1}({\tilde {y}})$ with $f'({\tilde {x}})\neq 0$ .
We will see that we need one more condition, namely that $f^{-1}$ is continuous at ${\tilde {y}}$ . If the domain of definition $D$ of $f$ is an interval, then this is always fulfilled according to the theorem about continuity of the inverse function.

These are the conditions we need for our formula to hold. Let us put them into a theorem:

Theorem: derivative of the inverse function

Theorem and proof

Theorem (Derivative of the inverse function)

Let $D,W\subseteq \mathbb {R}$ and let $D$ be an interval. Further, let $f:D\to W$ be a surjective and strictly monotone function that is differentiable at ${\tilde {x}}\in D$ with $f'({\tilde {x}})\neq 0$ . Then $f$ has an inverse function $f^{-1}:W\to D$ , which is differentiable at ${\tilde {y}}:=f({\tilde {x}})$ with

(f^{-1})'({\tilde {y}})={\frac {1}{f'(f^{-1}({\tilde {y}}))}}

Remarks:

The surjectivity of $f$ is equivalent to $W=f(D)$ .
If $f$ is differentiable on all of $D$ , then by the monotonicity criterion strict monotonicity follows most easily from $f'>0$ or $f'<0$ .
As we have seen above for the square root function $y\mapsto {\sqrt {y}}$ at ${\tilde {y}}=f({\tilde {x}})=0$ , the condition $f'({\tilde {x}})\neq 0$ cannot be omitted. Without it the formula would give ${\tfrac {1}{0}}$ , an "infinite derivative", which is not defined.
The theorem also holds if $D$ is not an interval. But then we must additionally require that $f^{-1}$ is continuous at ${\tilde {y}}$ . Furthermore, ${\tilde {x}}$ and ${\tilde {y}}$ must be accumulation points of $D$ and $W$ respectively.
If $f$ is additionally continuous, then the intermediate value theorem implies that $W=f(D)$ is an interval.

Summary of proof (Derivative of the inverse function)

First we justify that $f^{-1}$ exists. Then we conclude by the theorem about the continuity of the inverse function that $f^{-1}$ is continuous. Finally we show that the limit of the difference quotient $\lim \limits _{y\to {\tilde {y}}}{\tfrac {f^{-1}(y)-f^{-1}({\tilde {y}})}{y-{\tilde {y}}}}$ exists and has the value ${\tfrac {1}{f'(f^{-1}({\tilde {y}}))}}$ . That is, for every sequence $(y_{n})$ in $W\setminus \{{\tilde {y}}\}$ with $y_{n}\to {\tilde {y}}$ we have $\lim \limits _{n\to \infty }{\tfrac {f^{-1}(y_{n})-f^{-1}({\tilde {y}})}{y_{n}-{\tilde {y}}}}={\tfrac {1}{f'(f^{-1}({\tilde {y}}))}}$ .

Proof (Derivative of the inverse function)

$f:D\to W$ is surjective and strictly monotone, hence bijective. So the inverse function $f^{-1}:W\to D$ exists. Since we have assumed that $D$ is an interval, the theorem about the continuity of the inverse function implies that $f^{-1}$ is continuous on $W$ . Thus $\lim \limits _{y\to {\tilde {y}}}{f^{-1}(y)}=f^{-1}({\tilde {y}})$ with ${\tilde {y}}:=f({\tilde {x}})\in W$ . Now let $(y_{n})_{n\in \mathbb {N} }=(f(x_{n}))_{n\in \mathbb {N} }$ be a sequence in $W\setminus \{{\tilde {y}}\}$ with $\lim _{n\to \infty }y_{n}={\tilde {y}}$ . By continuity of $f^{-1}$ we have $x_{n}=f^{-1}(y_{n})\to {\tilde {x}}$ , and $x_{n}\neq {\tilde {x}}$ because $f^{-1}$ is injective. Then

{\begin{aligned}&\lim \limits _{n\to \infty }{\frac {f^{-1}(y_{n})-f^{-1}({\tilde {y}})}{y_{n}-{\tilde {y}}}}\\[0.3em]&{\color {OliveGreen}\left\downarrow \ f(f^{-1}(y_{n}))=y_{n}{\text{ and }}f(f^{-1}({\tilde {y}}))={\tilde {y}}\right.}\\[0.3em]=\ &\lim \limits _{n\to \infty }{\frac {f^{-1}(y_{n})-f^{-1}({\tilde {y}})}{f(f^{-1}(y_{n}))-f(f^{-1}({\tilde {y}}))}}\\[0.3em]=\ &\lim \limits _{n\to \infty }{\frac {1}{\frac {f(f^{-1}(y_{n}))-f(f^{-1}({\tilde {y}}))}{f^{-1}(y_{n})-f^{-1}({\tilde {y}})}}}\\[0.3em]&{\color {OliveGreen}\left\downarrow \ f^{-1}(y_{n})=x_{n}{\text{ and }}f^{-1}({\tilde {y}})={\tilde {x}}\right.}\\[0.3em]=\ &\lim \limits _{n\to \infty }{\frac {1}{\frac {f(x_{n})-f({\tilde {x}})}{x_{n}-{\tilde {x}}}}}\\[0.3em]=\ &{\frac {1}{\lim \limits _{n\to \infty }{\frac {f(x_{n})-f({\tilde {x}})}{x_{n}-{\tilde {x}}}}}}\\[0.3em]&{\color {OliveGreen}\left\downarrow \ f{\text{ differentiable at }}{\tilde {x}}\right.}\\[0.3em]=\ &{\frac {1}{f'({\tilde {x}})}}\\[0.3em]=\ &{\frac {1}{f'(f^{-1}({\tilde {y}}))}}\end{aligned}}

Hence, $f^{-1}$ is differentiable at ${\tilde {y}}$ with $(f^{-1})'({\tilde {y}})={\frac {1}{f'(f^{-1}({\tilde {y}}))}}$ .
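The sequence argument of the proof can be made tangible with a short Python sketch. It is only an illustration under assumptions of our own: we take $f(x)=x^{2}$ on $\mathbb {R} ^{+}$ , the point ${\tilde {y}}=4$ , and the sample sequence $y_{n}=4+{\tfrac {(-1)^{n}}{n}}$ , evaluated for a few hand-picked indices.

```python
import math

f_inv = math.sqrt            # inverse of f(x) = x^2 on the positive reals
y_tilde = 4.0                # so x~ = f^{-1}(4) = 2 and f'(x~) = 2 * 2 = 4
limit = 1.0 / (2.0 * f_inv(y_tilde))   # 1 / f'(f^{-1}(y~))

def difference_quotient(y):
    """Difference quotient of f^{-1} between y and y_tilde (requires y != y_tilde)."""
    return (f_inv(y) - f_inv(y_tilde)) / (y - y_tilde)

# Sample indices chosen so that y_n lies alternately below and above y~.
quotients = []
for n in [1, 10, 101, 1000, 10001]:
    y_n = y_tilde + (-1) ** n / n
    quotients.append(difference_quotient(y_n))
    print(n, y_n, quotients[-1])

print("value claimed by the theorem:", limit)
```

The printed quotients should approach ${\tfrac {1}{f'(2)}}={\tfrac {1}{4}}$ as $n$ grows, in line with the chain of equalities in the proof.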

Alternative proof (Derivative of the inverse function)

Another proof uses an equivalent characterization of the derivative: $f$ is differentiable at ${\tilde {x}}\in D$ if and only if there is a function $\varphi :D\to \mathbb {R}$ that is continuous at ${\tilde {x}}$ with

f(x)-f({\tilde {x}})=\varphi (x)(x-{\tilde {x}})

for all $x\in D$ . If this is the case, then $\varphi ({\tilde {x}})=f'({\tilde {x}})$ . By assumption, $\varphi ({\tilde {x}})=f'({\tilde {x}})\neq 0$ . For $x\neq {\tilde {x}}$ , strict monotonicity gives $f(x)\neq f({\tilde {x}})$ , so $\varphi (x)\neq 0$ as well. Hence $\varphi (x)\neq 0$ for all $x\in D$ . If we now set $y=f(x)$ , so that $x=f^{-1}(y)$ , the above equation becomes

y-{\tilde {y}}=\varphi (f^{-1}(y))(f^{-1}(y)-f^{-1}({\tilde {y}}))

This is now equivalent to

f^{-1}(y)-f^{-1}({\tilde {y}})={\frac {1}{\varphi (f^{-1}(y))}}(y-{\tilde {y}})

Since $f^{-1}$ is continuous at ${\tilde {y}}=f({\tilde {x}})\in W$ and $\varphi$ is continuous at ${\tilde {x}}=f^{-1}({\tilde {y}})$ , the function ${\tfrac {1}{\varphi \circ f^{-1}}}$ is continuous at ${\tilde {y}}$ . If we now apply the equivalent characterization of differentiability again, this time to $f^{-1}$ , it follows from the last equation that $f^{-1}$ is differentiable at ${\tilde {y}}$ with

(f^{-1})'({\tilde {y}})={\frac {1}{\varphi (f^{-1}({\tilde {y}}))}}={\frac {1}{f'(f^{-1}({\tilde {y}}))}}

Memory rule and visualization

Using Leibniz's notation for the derivative, the formula for the derivative of the inverse function can be remembered by a simple fraction-swap trick: for $f^{-1}(y)=x$ and $f(x)=y$ we have

{\frac {\mathrm {d} x}{\mathrm {d} y}}={\frac {1}{\frac {\mathrm {d} y}{\mathrm {d} x}}}

We can also visualize the formula graphically: if the function $f$ is differentiable at $x_{0}$ , then $f'(x_{0})$ is the slope of the tangent to the graph at the point $(x_{0},f(x_{0}))$ . Hence,

f'(x_{0})={\frac {\mathrm {d} y}{\mathrm {d} x}}=m

We now obtain the graph of the inverse function in two steps:

First we rotate the graph of $f$ by $90^{\circ }$ (clockwise or counter-clockwise). The resulting graph has the slope $-{\tfrac {1}{m}}$ at the image of the point $(x_{0},f(x_{0}))$ , because the tangent there is perpendicular to the original tangent.
Then we mirror the graph (horizontally or vertically). This reverses the sign of the tangent slope.

Altogether we get

(f^{-1})'(f(x_{0}))=(f^{-1})'(y_{0})={\frac {\mathrm {d} x}{\mathrm {d} y}}=-\left(-{\frac {1}{m}}\right)={\frac {1}{m}}

Extension to the whole domain

The converse of the theorem also holds:

Theorem (Converse of the theorem about inverse function derivative)

Let $D,W\subseteq \mathbb {R}$ and let $D$ be an interval. Further, let $f:D\to W$ be a surjective, strictly monotone function that is differentiable at ${\tilde {x}}\in D$ . If, in addition, the inverse function $f^{-1}:W\to D$ is differentiable at ${\tilde {y}}:=f({\tilde {x}})$ , then $f'({\tilde {x}})\neq 0$ and

(f^{-1})'({\tilde {y}})={\frac {1}{f'(f^{-1}({\tilde {y}}))}}

Proof (Converse of the theorem about inverse function derivative)

The proof works with the trick from the introduction. For all $x\in D$ we have

f^{-1}(f(x))=x

Under the above conditions, the left-hand side is differentiable at ${\tilde {x}}\in D$ (chain rule) with

(f^{-1})'(f({\tilde {x}}))\cdot f'({\tilde {x}})=1

Since a product of two real numbers can only equal 1 if neither factor is 0, we must have $f'({\tilde {x}})\neq 0$ , and we get

(f^{-1})'(f({\tilde {x}}))={\frac {1}{f'({\tilde {x}})}}

Let us now additionally demand in the original theorem that $f$ is differentiable on all of $D$ with $f'\neq 0$ . Then we can determine the derivative function of $f^{-1}$ on all of $W$ :

Theorem (Derivative of the inverse function)

Let $D,W\subseteq \mathbb {R}$ and let $D$ be an interval. Further, let $f:D\to W$ be a surjective, differentiable, strictly monotone function with $f'(x)\neq 0$ for all $x\in D$ . Then $f$ has a differentiable inverse function, whose derivative is given by

(f^{-1})'={\frac {1}{f'\circ f^{-1}}}

Examples

Example (linear functions)

Let $a\in \mathbb {R} \setminus \{0\}$ , $b\in \mathbb {R}$ and let

f:\mathbb {R} \to \mathbb {R} ,\ f(x)=ax+b

be a linear function. Then $f$ is surjective, and it is strictly monotonically increasing if $a>0$ and strictly monotonically decreasing if $a<0$ . Furthermore, $f$ is differentiable on all of $\mathbb {R}$ with derivative $f'\equiv a$ . By the theorem about the derivative of the inverse function, we thus have for all $y\in \mathbb {R}$

(f^{-1})'(y)={\frac {1}{f'(f^{-1}(y))}}={\frac {1}{a}}

We could also have calculated this directly, as above.

Example (Root functions)

For $k\in \mathbb {N}$ let

f:\mathbb {R} ^{+}\to \mathbb {R} ^{+},\ f(x)=x^{k}

Then $f$ is differentiable with derivative $f'(x)=kx^{k-1}>0$ . So $f$ is strictly monotonically increasing. Furthermore, $f$ is surjective. The inverse function is the $k$ -th root function

f^{-1}:\mathbb {R} ^{+}\to \mathbb {R} ^{+},\ f^{-1}(y)={\sqrt[{k}]{y}}

For every $y\in \mathbb {R} ^{+}$ our theorem now yields

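(f^{-1})'(y)={\frac {1}{f'(f^{-1}(y))}}={\frac {1}{k{\sqrt[{k}]{y}}^{k-1}}}

If $k$ is odd, then $x\mapsto x^{k}$ is even a bijection from $\mathbb {R}$ to $\mathbb {R}$ , and the formula holds for all $y\in \mathbb {R} \setminus \{0\}$ . For $k\geq 3$ the point $y=0$ has to be excluded, because $f'(0)=0$ there.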
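As a plausibility check of the root formula, the following Python sketch compares it with a finite-difference estimate for a few exponents. The helper names, the sample point $y=2$ and the step size are illustrative assumptions, and only positive $y$ are considered.

```python
def kth_root(y, k):
    """Real k-th root of a positive number y."""
    return y ** (1.0 / k)

def root_derivative_formula(y, k):
    """1 / f'(f^{-1}(y)) for f(x) = x^k, i.e. 1 / (k * (y^(1/k))^(k-1))."""
    return 1.0 / (k * kth_root(y, k) ** (k - 1))

def central_difference(g, y, h=1e-6):
    """Approximate g'(y) by the symmetric difference quotient."""
    return (g(y + h) - g(y - h)) / (2 * h)

for k in (2, 3, 5):
    y = 2.0
    numeric = central_difference(lambda t: kth_root(t, k), y)
    print(k, numeric, root_derivative_formula(y, k))
```

For each $k$ the two printed values should agree up to the accuracy of the difference quotient.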

Example (Logarithmic functions)

Let us look at the exponential function

f:\mathbb {R} \to \mathbb {R} ^{+},\ f(x)=\exp(x)

We have learned that $f'=\exp$ . So the function is differentiable and, because $f'>0$ , strictly monotonically increasing. Furthermore, $f$ is surjective. The inverse function is the (natural) logarithm function

f^{-1}:\mathbb {R} ^{+}\to \mathbb {R} ,\ f^{-1}(y)=\ln y

Our theorem now implies for $y\in \mathbb {R} ^{+}$ :

(f^{-1})'(y)={\frac {1}{f'(f^{-1}(y))}}={\frac {1}{e^{\ln y}}}={\frac {1}{y}}
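The result $\ln '(y)={\tfrac {1}{y}}$ can be sanity-checked numerically. The Python sketch below uses the standard `math` module; the helper `central_difference`, the step size and the sample points are assumptions made for illustration.

```python
import math

def central_difference(g, y, h=1e-6):
    """Approximate g'(y) by the symmetric difference quotient."""
    return (g(y + h) - g(y - h)) / (2 * h)

for y in (0.5, 1.0, 3.0):
    numeric = central_difference(math.log, y)   # estimate of ln'(y)
    formula = 1.0 / math.exp(math.log(y))       # 1 / f'(f^{-1}(y)) with f = exp
    print(y, numeric, formula, 1.0 / y)
```

All three values in each row should coincide up to small numerical errors.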

Exercises

Exercise (Derivative of the inverse function)

Prove that the function

f:\mathbb {R} ^{+}\to \mathbb {R} :x\mapsto f(x)=-4x^{2}-3-2\ln(x)

has a differentiable inverse function $f^{-1}$ . Determine the domain of definition of $f^{-1}$ and calculate $(f^{-1})'(-7)$ .

Solution (Derivative of the inverse function)

We have to check all conditions of the theorem about the derivative of the inverse function, one after the other.

Proof step: $f$ is surjective

$f$ is continuous on $\mathbb {R} ^{+}$ as it is a composition of continuous functions. Further, there is

{\begin{aligned}\lim _{x\to 0+}f(x)&=\infty \\\lim _{x\to \infty }f(x)&=-\infty \end{aligned}}

By the intermediate value theorem, for every $y\in \mathbb {R}$ there is an $x\in \mathbb {R} ^{+}$ with $f(x)=y$ . So $f$ is surjective.

Proof step: $f$ is strictly monotone

$f$ is differentiable on $\mathbb {R} ^{+}$ as it is a composition of differentiable functions, and

f'(x)=-8x-{\frac {2}{x}}<0

for all $x\in \mathbb {R} ^{+}$ . According to the monotonicity criterion, $f$ is strictly monotonically decreasing, and therefore injective on $\mathbb {R} ^{+}$ .

So $f$ is bijective, and thus has an inverse function $f^{-1}:\mathbb {R} \to \mathbb {R} ^{+}$ . The domain of definition $\mathbb {R}$ of $f^{-1}$ is the range of $f$ .

Proof step: $f$ is differentiable on $\mathbb {R} ^{+}$ and $f'(x)\neq 0$ for all $x\in \mathbb {R} ^{+}$

Differentiability was proven in the previous step. Since $f'(x)<0$ , we also have $f'(x)\neq 0$ for all $x\in \mathbb {R} ^{+}$ .

According to the theorem about the derivative of the inverse function, $f^{-1}$ is differentiable on all of $\mathbb {R}$ .

Proof step: Computation of $(f^{-1})'(-7)$

There is $f(1)=-4-3-2\ln(1)=-7$ . Hence $f^{-1}(-7)=1$ , and with the formula for the derivative of the inverse function there is

(f^{-1})'(-7)={\frac {1}{f'(1)}}={\frac {1}{-8-{\frac {2}{1}}}}=-{\frac {1}{10}}
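The value $-{\tfrac {1}{10}}$ can be cross-checked numerically. In the Python sketch below, the bisection routine `f_inverse`, its search interval and the tolerances are assumptions made for illustration. They are not part of the solution above.

```python
import math

def f(x):
    return -4.0 * x**2 - 3.0 - 2.0 * math.log(x)

def f_prime(x):
    return -8.0 * x - 2.0 / x

def f_inverse(y, lo=1e-6, hi=10.0, tol=1e-13):
    """Invert the strictly decreasing f by bisection (valid for f(hi) <= y <= f(lo))."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(mid) > y:      # f is decreasing: the solution lies to the right of mid
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

x0 = f_inverse(-7.0)                 # should be close to 1, since f(1) = -7
formula = 1.0 / f_prime(x0)          # 1 / f'(f^{-1}(-7))

# Independent check: symmetric difference quotient of the numerical inverse.
h = 1e-5
numeric = (f_inverse(-7.0 + h) - f_inverse(-7.0 - h)) / (2 * h)
print(x0, formula, numeric)
```

Both `formula` and `numeric` should be close to $-0.1$ .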

Exercise (Second derivative of the inverse function)

Let $f:(a,b)\to (c,d)$ with $-\infty \leq a<b\leq \infty$ and $-\infty \leq c<d\leq \infty$ be a twice differentiable bijective function with $f'>0$ . Show that the inverse function $f^{-1}$ is twice differentiable as well, and express the second derivative of $f^{-1}$ at a point $y\in (c,d)$ in terms of derivatives of $f$ at a suitable point.

As application: Compute for the polynomial $f(x)=x^{3}+2x+4$ the derivatives $(f^{-1})'(1)$ and $(f^{-1})''(1)$ .

Solution (Second derivative of the inverse function)

Proof step: First derivative of $f^{-1}$

$(a,b)$ is an interval and $f$ is bijective. For $y\in (c,d)$ there is an $x\in (a,b)$ with $f(x)=y$ . Since $f'>0$ , the function $f$ is strictly monotonically increasing and $f'(x)\neq 0$ . According to the theorem about the derivative of the inverse function, $f^{-1}$ is differentiable at $y$ with

(f^{-1})'(y)={\frac {1}{f'(f^{-1}(y))}}

Proof step: Second derivative of $f^{-1}$

$f$ is twice differentiable. This means that $f'$ is differentiable. According to the quotient and chain rule $(f^{-1})'={\tfrac {1}{f'\circ f^{-1}}}$ is therefore also differentiable at $y$ and there is

(f^{-1})''(y)={\frac {0-f''(f^{-1}(y))\cdot (f^{-1})'(y)}{(f'(f^{-1}(y)))^{2}}}=-{\frac {f''(f^{-1}(y))}{(f'(f^{-1}(y)))^{2}\cdot f'(f^{-1}(y))}}=-{\frac {f''(f^{-1}(y))}{(f'(f^{-1}(y)))^{3}}}

Proof step: Computing the derivatives $(f^{-1})'(1)$ and $(f^{-1})''(1)$

$f$ is differentiable on $\mathbb {R}$ with $f'(x)=3x^{2}+2>0$ . So $f$ is strictly monotonically increasing and therefore injective. Because $\lim _{x\to \pm \infty }f(x)=\pm \infty$ and $f$ is continuous, the intermediate value theorem shows that $f:\mathbb {R} \to \mathbb {R}$ is also surjective. So, in total, $f$ is bijective. Since $f(-1)=(-1)^{3}+2(-1)+4=1$ , we have $f^{-1}(1)=-1$ , and the theorem about the derivative of the inverse function implies

(f^{-1})'(1)={\frac {1}{f'(f^{-1}(1))}}={\frac {1}{f'(-1)}}={\frac {1}{3(-1)^{2}+2}}={\frac {1}{5}}

Further, $f$ is twice differentiable with $f''(x)=6x$ . With the formula proven in step 2, we hence obtain

(f^{-1})''(1)=-{\frac {f''(f^{-1}(1))}{(f'(f^{-1}(1)))^{3}}}=-{\frac {f''(-1)}{(f'(-1))^{3}}}=-{\frac {6(-1)}{(3(-1)^{2}+2)^{3}}}={\frac {6}{5^{3}}}={\frac {6}{125}}
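Both values can be cross-checked numerically. The Python sketch below inverts $f$ by bisection and compares the two formulas with finite-difference estimates. The routine `f_inverse`, the search interval, the step size `h` and the tolerances are assumptions chosen for illustration.

```python
def f(x):
    return x**3 + 2.0 * x + 4.0

def f_prime(x):
    return 3.0 * x**2 + 2.0

def f_second(x):
    return 6.0 * x

def f_inverse(y, lo=-10.0, hi=10.0, tol=1e-14):
    """Invert the strictly increasing f by bisection (valid for f(lo) <= y <= f(hi))."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(mid) < y:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

x0 = f_inverse(1.0)                                   # close to -1, since f(-1) = 1
first_formula = 1.0 / f_prime(x0)                     # 1 / f'(f^{-1}(1))
second_formula = -f_second(x0) / f_prime(x0) ** 3     # -f'' / (f')^3 at f^{-1}(1)

# Finite-difference estimates of the first and second derivative of f^{-1} at 1.
h = 1e-3
g_minus, g_mid, g_plus = f_inverse(1.0 - h), f_inverse(1.0), f_inverse(1.0 + h)
first_numeric = (g_plus - g_minus) / (2 * h)
second_numeric = (g_plus - 2.0 * g_mid + g_minus) / h**2
print(first_formula, first_numeric)
print(second_formula, second_numeric)
```

The first pair should be close to ${\tfrac {1}{5}}=0.2$ and the second pair close to ${\tfrac {6}{125}}=0.048$ .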


This article is licensed under the free license CC-BY-SA 3.0. You may use, modify or share it freely, as long as you name „Serlo“ as the source and put your changes under the same CC-BY-SA 3.0 license or a compatible one. On the page „Kopier uns!“ we explain what you have to pay attention to when using our texts, pictures or videos.