Extreme value theorem

In the following we are going to deal with continuous functions on compact intervals. These are intervals that are closed and bounded, i.e. have the form $[a,b]$ . We will see that such functions are always bounded and attain a minimum and a maximum. This theorem is called the Extreme Value Theorem. It is used in mathematics to prove the existence of extrema, i.e. points at which a function takes its lowest value (the minimum) or its highest value (the maximum).

Motivation

Let's take a continuous function $f$ which is defined on a compact interval $[a,b]$ . I.e. we are considering a function $f:[a,b]\to \mathbb {R}$ . This function has the value $f(a)$ at the point $a$ and the value $f(b)$ at the point $b$ .

Now $f$ is defined for every intermediate point between $a$ and $b$ . Intuitively, for functions without gaps in the domain of definition, continuity means that we are able to draw the graph without lifting the pen from the paper. Hence, the graph of $f$ connects the points $(a,f(a))$ and $(b,f(b))$ by a continuous path without jumps. The following graph provides an example for such a function $f$ :

We note that the function $f$ above is bounded. And it attains a maximum and minimum value:

Is it always that way? Try for yourself to connect the points $(a,f(a))$ and $(b,f(b))$ by different graphs without lifting the pen. Can you imagine drawing the graph of an unbounded function, even if your paper were infinitely large?

Intuitively, the answer is no. No matter how far your graph goes up or down, you need to return to the end points at $(a,f(a))$ or $(b,f(b))$ . Going to infinity "forces you to lift the pen" and is therefore not allowed. However, the function $f$ can attain very large values (like $10^{100}$ or more) while staying bounded, as long as you return in order to reach the end point. This situation is illustrated in the following figure:

So our first intuition tells us that when connecting the two points $(a,f(a))$ and $(b,f(b))$ without lifting the pen, our function stays bounded. And it attains a maximum and a minimum. Now let's think about what could go wrong when phrasing this intuition in a mathematical way. The end points $a$ and $b$ of the domain of definition $[a,b]$ could become problematic: For an open domain of definition $(a,b)$ , the function could run towards $\pm \infty$ at $a$ or $b$ , or it could converge towards a value without attaining it. Including the boundary points $\{a,b\}$ in the domain of definition excludes these cases as it "catches the function" at the end points $(a,f(a))$ and $(b,f(b))$ . If we move a boundary to infinity, let's say by considering the domain of definition $[a,\infty )$ , the function could have "infinitely much time" and might run towards infinity while being continuous. This happens for instance for $f(x)=x^{2}$ . So we also expect problems with unbounded domains of definition. A statement like "the maximum and minimum are attained" can only be expected to hold true on a compact interval $[a,b]$ . Now, in between $a$ and $b$ , the function could also "break out" and tend towards $\pm \infty$ (like $f(x)={\tfrac {1}{x}}$ near $x=0$ ). This scenario will be prevented by assuming continuity of the function $f$ .

In the following, we will mathematically verify that our intuition is true. That means we prove a statement like "the maximum and minimum are attained" for continuous functions defined on a compact interval $[a,b]$ and discuss what may go wrong if we choose other domains of definition.

Theorem (Extreme value theorem)

Explanation of the extreme value theorem (in German). (YouTube video published by Quatematik)

Every continuous function defined on a compact interval $[a,b]$ is bounded and attains a maximum and a minimum (extreme values). That means, if $f:[a,b]\to \mathbb {R}$ with $a,b\in \mathbb {R}$ and $a<b$ is a continuous function, then there are arguments ${\tilde {x}},{\hat {x}}\in [a,b]$ , such that for all arguments $x\in [a,b]$ the inequality $f({\hat {x}})\leq f(x)\leq f({\tilde {x}})$ holds.

Example (Extreme value theorem)

Consider $f:[0,1]\to \mathbb {R}$ with $f(x)=x^{2}\cdot \cos(x)\cdot e^{\cos(x)}-\ln(x+1)$ . The domain of definition $[0,1]$ is a compact interval. In addition, $f$ is continuous as it is composed out of continuous functions $x\mapsto x^{2}$ , $x\mapsto e^{x}$ , $x\mapsto \cos(x)$ and $x\mapsto \ln(x+1)$ with domains of definition $[0,1]$ . Hence, $f$ must attain a maximum and a minimum.
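The theorem only guarantees that the extreme values exist; it does not say where they are. As a rough numerical illustration, the following sketch samples $f$ on an evenly spaced grid of $[0,1]$ and reports the smallest and largest sampled value. The helper name `approximate_extrema` and the grid size are assumptions made for this sketch, and the result is only an approximation of the true extrema.

```python
import math


def f(x: float) -> float:
    """The example function f(x) = x^2 * cos(x) * e^(cos(x)) - ln(x + 1)."""
    return x**2 * math.cos(x) * math.exp(math.cos(x)) - math.log(x + 1)


def approximate_extrema(func, a: float, b: float, samples: int = 10_001):
    """Sample func on an evenly spaced grid of [a, b] (hypothetical helper).

    Returns the (argument, value) pairs with the smallest and the largest
    sampled value. This approximates the extrema, it does not prove them.
    """
    xs = [a + (b - a) * i / (samples - 1) for i in range(samples)]
    points = [(x, func(x)) for x in xs]
    lowest = min(points, key=lambda p: p[1])
    highest = max(points, key=lambda p: p[1])
    return lowest, highest


(x_hat, f_min), (x_tilde, f_max) = approximate_extrema(f, 0.0, 1.0)
print(f"approximate minimum {f_min:.4f} near x = {x_hat:.4f}")
print(f"approximate maximum {f_max:.4f} near x = {x_tilde:.4f}")
```

A finer grid gives a better approximation, but only the theorem tells us that there really are arguments in $[0,1]$ at which the extreme values are attained.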

Proof (Extreme value theorem)

Let $f:[a,b]\to \mathbb {R}$ be a continuous function with $a,b\in \mathbb {R}$ and $a<b$ . We will only prove explicitly that $f$ is bounded from above and attains a maximum. The analogous statement that $f$ is bounded from below and attains a minimum can be shown the same way.

So let us consider the image $f([a,b])$ . This is the set of all function values attained by $f$ . Let us take the supremum $\sup f([a,b])$ of the set $f([a,b])$ , where we explicitly allow the extended value $\sup f([a,b])=\infty$ . If $f$ is not bounded from above, then $\sup f([a,b])=\infty$ ; otherwise, $\sup f([a,b])\in \mathbb {R}$ (since $f([a,b])\neq \emptyset$ , the case $\sup \emptyset =-\infty$ cannot occur).

Now, we know that there is a sequence in $f([a,b])$ which tends towards the supremum $\sup f([a,b])$ (for each nonempty set $M$ there is a sequence in $M$ tending towards $\sup M$ ). Hence there is a sequence $(x_{n})_{n\in \mathbb {N} }$ of arguments in $[a,b]$ with $\lim _{n\to \infty }f(x_{n})=\sup f([a,b])$ .

We now make use of the Bolzano-Weierstraß theorem. This theorem tells us that each sequence in a compact interval $[a,b]$ with $a,b\in \mathbb {R}$ and $a<b$ has a convergent subsequence. Hence, $(x_{n})_{n\in \mathbb {N} }$ also has a convergent subsequence $\left(x_{n_{k}}\right)_{k\in \mathbb {N} }$ . Let ${\tilde {x}}$ be the limit of the convergent subsequence $\left(x_{n_{k}}\right)_{k\in \mathbb {N} }$ . Since $a\leq x_{n_{k}}\leq b$ for all $k\in \mathbb {N}$ , we also have $a\leq {\tilde {x}}\leq b$ and therefore ${\tilde {x}}\in [a,b]$ . So, ${\tilde {x}}$ must be an argument of the function $f$ . As a subsequence of $\left(f(x_{n})\right)_{n\in \mathbb {N} }$ , the sequence $\left(f(x_{n_{k}})\right)_{k\in \mathbb {N} }$ also tends towards $\sup f([a,b])$ . Since $f$ is continuous, we can make use of the sequential definition of continuity:

f\left({\tilde {x}}\right)=f\left(\lim _{k\to \infty }x_{n_{k}}\right)=\lim _{k\to \infty }f\left(x_{n_{k}}\right)=\sup f([a,b])

$\sup f([a,b])$ is a function value of $f$ and hence a real number. Therefore we know that $f$ is bounded from above. And we have shown that $f$ attains its upper bound $\sup f([a,b])$ at the argument ${\tilde {x}}$ . Therefore, $f(x)\leq f({\tilde {x}})$ for all $x\in [a,b]$ and indeed, $f({\tilde {x}})$ is the maximum among all function values of $f$ .
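The central object of the proof is a sequence of arguments $(x_{n})$ whose function values approach the supremum. The following sketch illustrates this idea numerically. The function $f(x)=x(1-x)$ on $[0,1]$ and the helper `best_grid_point` are assumptions chosen for this illustration; here $x_{n}$ is the best point of a grid with $n$ subintervals. In this simple example the whole sequence converges, whereas in general the proof can only rely on a convergent subsequence.

```python
def f(x: float) -> float:
    """Illustrative continuous function on [0, 1]; its maximum is 1/4 at x = 1/2."""
    return x * (1.0 - x)


def best_grid_point(func, a: float, b: float, n: int) -> float:
    """Return the point of the grid a, a + (b-a)/n, ..., b with the largest value."""
    grid = [a + (b - a) * i / n for i in range(n + 1)]
    return max(grid, key=func)


# x_n is the best point of a grid with n subintervals.
# The values f(x_n) approach sup f([0, 1]) = 1/4 and x_n approaches 1/2.
for n in (3, 7, 31, 127, 1023):
    x_n = best_grid_point(f, 0.0, 1.0, n)
    print(n, x_n, f(x_n))
```

The limit $1/2$ of the arguments lies in $[0,1]$ , and by continuity the function value there equals the limit of the values $f(x_{n})$ , which is exactly the step carried out in the proof.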

Assumptions of the theorem

Let's take a look at the assumptions made within the extreme value theorem:

$f$ is a continuous function
$f$ is defined on a compact interval $[a,b]$

Are those assumptions really necessary or can we relax them without losing validity of the extreme value theorem?

Assumption of continuity

First, we note that continuity prevents the function $f$ from "breaking out" to $+\infty$ or $-\infty$ within its domain of definition. If we just allow any function $f:[a,b]\to \mathbb {R}$ , no matter whether it is continuous or not, we will find non-continuous functions which violate the extreme value theorem. The following function is unbounded (so it does not attain any extrema) and non-continuous at $x=0$ :

f:[-1,1]\to \mathbb {R} :x\mapsto {\begin{cases}{\frac {1}{x}}&;x\neq 0\\0&;x=0\end{cases}}

So we cannot simply drop the assumption that $f$ is continuous.
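To see the unboundedness concretely, one can evaluate this function at arguments close to $0$ . The sketch below does so; the helper `witness_above` is a hypothetical name introduced here, and it returns an argument in $(0,1]$ whose function value exceeds a given non-negative bound.

```python
def f(x: float) -> float:
    """f(x) = 1/x for x != 0 and f(0) = 0, considered on [-1, 1]."""
    return 0.0 if x == 0 else 1.0 / x


def witness_above(bound: float) -> float:
    """Return an argument x in (0, 1] with f(x) > bound (for bound >= 0)."""
    return 1.0 / (bound + 1.0)


# Arguments 10^(-k) lie in [-1, 1] and approach 0,
# yet their function values grow beyond any fixed bound.
for k in range(0, 7):
    x = 10.0 ** (-k)
    print(f"f({x:g}) = {f(x):g}")
```

No matter which bound we propose, `witness_above` produces an argument in the domain that beats it, so no upper bound can exist. The same works for lower bounds with negative arguments.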

Interval-assumption

The domain of definition is also important. It must include its boundary (i.e. be closed). This way we "catch" the function at the interval boundary and make sure it does not "run away" towards infinity. The function $g:(0,1]\to \mathbb {R} :x\mapsto {\tfrac {1}{x}}$ is an example which "runs away" as we approach $x=0$ .

Unboundedly large domains are also problematic, since the function has "infinitely much time" to run away. An easy example is the function $h:[0,\infty )\to \mathbb {R} :x\mapsto x^{2}$ . And there are functions, which are defined on a bounded domain of definition, continuous and do not "run away" towards infinity, but do not attain an extremum. This happens if there are open boundaries or gaps within the domain of definition. The extremum would then be attained at the boundary (or the gap) - but this argument has been removed from the domain of definition.

Question: Does the continuous function $f$ always attain a maximum or a minimum, if it is defined on a bounded interval?

No. For instance, the function $f:(0,1)\to \mathbb {R}$ , $f(x)=x$ does not. We have $f((0,1))=(0,1)$ . So the infimum and supremum of the image are 0 and 1, which would be attained at the boundary ( $x=0$ and $x=1$ ), if $f$ was defined there. However, we removed 0 and 1 from the domain of definition, so $f$ no longer attains an extremum.
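The missing maximum can also be seen constructively: every candidate argument in $(0,1)$ is beaten by another argument in $(0,1)$ . The following sketch uses exact fractions to avoid rounding effects; the helper name `larger_argument` is an assumption made for this illustration.

```python
from fractions import Fraction


def f(x: Fraction) -> Fraction:
    """The identity function, considered only on the open interval (0, 1)."""
    return x


def larger_argument(x: Fraction) -> Fraction:
    """For x in (0, 1), return the midpoint of x and 1, which again lies in (0, 1)."""
    if not 0 < x < 1:
        raise ValueError("x must lie in the open interval (0, 1)")
    return (x + 1) / 2


# Starting from any candidate, we keep finding arguments with larger values.
candidate = Fraction(9, 10)
for _ in range(5):
    print(candidate, "->", larger_argument(candidate))
    candidate = larger_argument(candidate)
```

Since the midpoint of $x$ and $1$ always has a larger function value than $x$ , no argument in $(0,1)$ can be a maximum. Using the midpoint of $0$ and $x$ shows the same for the minimum.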

The same may happen when removing a maximum or minimum from the interior of the domain of definition instead of the boundary - which creates a gap. Of course, the function may also "run away" at such a gap. An example for this effect is the continuous but unbounded function $j:[-1,1]\setminus \{0\}\to \mathbb {R} :x\mapsto {\tfrac {1}{x}}$ . The argument $x=0$ is excluded from the domain of definition $[-1,1]\setminus \{0\}=[-1,0)\cup (0,1]$ . So this function is well defined and continuous, but it "runs away" at the gap. In a mathematical language, the function is unbounded and hence violates the conclusion of the extreme value theorem.

Outlook: Generalization of the theorem

So far, we only considered intervals (possibly with gaps) as candidates for the domain of definition. Is this restriction really necessary? This time, the answer is no. For instance, we can take the union of two intervals $D=[a,b]\cup [c,d]$ with $a<b<c<d$ and define some continuous and real-valued function $j$ on $D$ . If we restrict $j$ to only $[a,b]$ or $[c,d]$ , we can apply the extreme value theorem. Both the functions $j_{1}=j|_{[a,b]}$ and $j_{2}=j|_{[c,d]}$ with restricted domain of definition are bounded and attain a maximum and a minimum. The function $j$ must therefore also be bounded. Its maximum is the larger of the two maxima of $j_{1}$ and $j_{2}$ , so $j$ also attains a maximum (the same holds for the minimum). Therefore, every continuous function defined on the union of two closed intervals $[a,b]\cup [c,d]$ fulfills the conclusion of the extreme value theorem. The same holds if we consider three or more closed intervals, or an even larger class of domains of definition. In fact, we can precisely state what this larger class of domains of definition is:

If we take a second look at the proof, we note that the domain of definition is only mentioned at one point: where we make use of the Bolzano-Weierstraß theorem. We used it to show that any sequence from the domain of definition contains a convergent subsequence. Hence, the proof arguments hold true, as long as the domain of definition allows for the usage of the Bolzano-Weierstraß theorem.

So we can generalize the above theorem. It will hold not only on closed intervals $[a,b]$ , but on all sets for which the statement of the Bolzano-Weierstraß theorem holds. We will call these sets sequentially compact:

Definition (Sequential compactness)

A subset of the real numbers is called sequentially compact iff every sequence in this set has a convergent subsequence whose limit again lies in the set.

If the domain of definition $D$ of a continuous function $f:D\to \mathbb {R}$ is sequentially compact, then the function $f$ must fulfill the extreme value theorem. The generalization of sequential compactness from real numbers to other sets of mathematical objects is one of the topics dealt with in topology.

Exercise: Image of polynomials of even degree

Exercise (Image of polynomials)

Let

p:\mathbb {R} \to \mathbb {R} ,\ p(x)=a_{n}x^{n}+a_{n-1}x^{n-1}+\ldots +a_{1}x+a_{0}

be a polynomial function with $a_{0},a_{1},\ldots ,a_{n-1},a_{n}\in \mathbb {R}$ and $a_{n}\neq 0$ . Let $p$ further have an even degree $n\geq 2$ . Show that the image of $p$ is given by

p(\mathbb {R} )={\begin{cases}[y_{\min },\infty )&{\text{ for }}a_{n}>0,\\(-\infty ,y_{\max }]&{\text{ for }}a_{n}<0.\end{cases}}

Here, $y_{\min }$ (in case $a_{n}>0$ ) and $y_{\max }$ (in case $a_{n}<0$ ) are real numbers.

Proof (Image of polynomials)

We will consider the case $a_{n}>0$ . The proof for the case $a_{n}<0$ works analogously. At first, we note that the polynomial $p$ is a composition of continuous functions and hence continuous itself on $\mathbb {R}$ . It is tempting to use the extreme value theorem in order to show that $p$ attains a minimum. However, $\mathbb {R} =(-\infty ,\infty )$ is not a compact interval. Still, we can cut it off at very large values and make it compact this way. Since $n$ is even, we have $\lim _{x\to \pm \infty }x^{n}=\infty$ . The $x^{n}$ -term dominates the other ones, so we also have

\lim _{x\to \pm \infty }p(x)=\lim _{x\to \pm \infty }\underbrace {x^{n}} _{\to \infty }(\underbrace {a_{n}+a_{n-1}x^{-1}+\ldots +a_{1}x^{-n+1}+a_{0}x^{-n}} _{\to a_{n}>0})=\infty

Now, let us take any function value of $p$ , for instance $p(0)$ . Since $\lim _{x\to \infty }p(x)=\infty$ there is an $S\in \mathbb {R}$ , such that $p(x)>p(0)$ for all $x\geq S$ . Analogously, since $\lim _{x\to -\infty }p(x)=\infty$ there is an $s\in \mathbb {R}$ , such that $p(x)>p(0)$ for all $x\leq s$ . So both on $(-\infty ,s]$ and on $[S,\infty )$ , the polynomial $p$ is larger than the function value $p(0)$ .

We can hence cut off the real number axis and restrict to the interval $[s,S]$ . Since on $[S,\infty )$ the function is larger than $p(0)$ , the argument $0$ does not belong to this set and $S>0$ . Analogously, $s<0$ . Therefore, $[s,S]$ is a nonempty, closed and bounded interval. Hence it is compact and we can apply the extreme value theorem. The polynomial $p$ indeed attains a minimum $y_{\min }$ on $[s,S]$ . Now, $y_{\min }\leq p(0)$ (since $0\in [s,S]$ ), and outside of $[s,S]$ we have $p(x)>p(0)\geq y_{\min }$ . Therefore, $y_{\min }$ is also a global minimum of the polynomial.

The intermediate value theorem additionally yields that the image of $p$ is an interval (see also Conclusions from the intermediate value theorem). Since $\lim _{x\to \infty }p(x)=\infty$ and $y_{\min }$ is a global minimum of $p$ , the image of $p$ must be of the form $[y_{\min },\infty )$ .
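The cut-off idea from the proof can be tried out numerically. The proof only uses that suitable $s$ and $S$ exist; the sketch below makes one possible explicit choice, namely $s=-R$ and $S=R$ with $R=1+\sum _{k=1}^{n-1}|a_{k}|/a_{n}$ . This formula for $R$ , the helper names and the sample polynomial $p(x)=x^{2}-2x+3$ are assumptions of this sketch and not part of the exercise. The minimum on $[-R,R]$ is then approximated by a simple grid search.

```python
def evaluate(coeffs, x):
    """Evaluate p(x) = coeffs[0] + coeffs[1]*x + ... + coeffs[n]*x^n (Horner scheme)."""
    result = 0.0
    for c in reversed(coeffs):
        result = result * x + c
    return result


def cutoff_radius(coeffs):
    """Return R >= 1 with p(x) > p(0) whenever |x| >= R.

    Assumes even degree n >= 2 and a positive leading coefficient a_n.
    For |x| >= 1 we have
        p(x) - p(0) >= |x|^(n-1) * (a_n * |x| - sum_{k=1}^{n-1} |a_k|),
    which is positive as soon as |x| > sum_{k=1}^{n-1} |a_k| / a_n.
    """
    *lower, a_n = coeffs
    n = len(coeffs) - 1
    if n < 2 or n % 2 != 0 or a_n <= 0:
        raise ValueError("need even degree >= 2 and a_n > 0")
    # lower[0] is a_0, which cancels in p(x) - p(0), so it is skipped.
    return 1.0 + sum(abs(a) for a in lower[1:]) / a_n


def approximate_minimum(coeffs, samples=6001):
    """Grid search for the minimum of p on the compact interval [-R, R]."""
    r = cutoff_radius(coeffs)
    xs = [-r + 2.0 * r * i / (samples - 1) for i in range(samples)]
    return min((evaluate(coeffs, x), x) for x in xs)


coeffs = [3.0, -2.0, 1.0]  # p(x) = x^2 - 2x + 3
y_min, x_min = approximate_minimum(coeffs)
print("cut-off radius R =", cutoff_radius(coeffs))
print("approximate minimum", y_min, "near x =", x_min)
```

For the sample polynomial, completing the square gives $p(x)=(x-1)^{2}+2$ , so the grid search should report a value close to $y_{\min }=2$ near $x=1$ .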

Exercise: Continuous functions on [0,1]

Exercise (There is no continuous function on a compact interval attaining all function values exactly twice)

Show that there is no continuous function $f:[0,1]\to \mathbb {R}$ attaining all its function values exactly twice. That means, there is no continuous function $f:[0,1]\to \mathbb {R}$ , such that for all $y\in f([0,1])$ exactly two numbers $x_{1},x_{2}\in [0,1]$ with $f(x_{1})=f(x_{2})=y$ exist.

Solution (There is no continuous function on a compact interval attaining all function values exactly twice)

We perform a proof by contradiction. Let $f:[0,1]\to \mathbb {R}$ be a continuous function attaining all its values exactly twice. Since $f$ is continuous and $[0,1]$ is a compact interval, $f$ has to be bounded and to attain a maximum $M$ . By assumption, this maximum is attained exactly twice. So there are two arguments $x_{M},{\tilde {x}}_{M}\in [0,1]$ with $f(x_{M})=f({\tilde {x}}_{M})=M$ . Without loss of generality, let $x_{M}<{\tilde {x}}_{M}$ .

Now, $f$ must also attain a minimum on the interval $[x_{M},{\tilde {x}}_{M}]$ , which we call $m$ . Since $M$ is the maximum of $f$ , it is also the maximum of the restriction $f|_{[x_{M},{\tilde {x}}_{M}]}$ . Therefore, $M\geq m$ . In case $m=M$ , the function $f$ would have to be constant on $[x_{M},{\tilde {x}}_{M}]$ and hence attain exactly one value infinitely often. Therefore, $m<M$ .

Since the minimum $m$ is attained by $f$ on the interval $[x_{M},{\tilde {x}}_{M}]$ and the function attains the maximum $M$ on both ends of the interval, there is an $x_{m}$ with $x_{M}<x_{m}<{\tilde {x}}_{M}$ and $f(x_{m})=m$ . And we know that $m$ is attained at some second argument ${\tilde {x}}_{m}\in [0,1]$ . This argument ${\tilde {x}}_{m}$ may be situated on the inside of the interval $[x_{M},{\tilde {x}}_{M}]$ or on the outside.

Case 1: ${\tilde {x}}_{m}\notin [x_{M},{\tilde {x}}_{M}]$

First, we consider the case where ${\tilde {x}}_{m}$ is not in $[x_{M},{\tilde {x}}_{M}]$ and consider, without loss of generality, the case $x_{M}<x_{m}<{\tilde {x}}_{M}<{\tilde {x}}_{m}$ . The mean ${\tfrac {M+m}{2}}$ of $M$ and $m$ is an intermediate value and will hence be attained by the function between $x_{M}$ and $x_{m}$ by means of the intermediate value theorem. Analogously, ${\tfrac {M+m}{2}}$ is attained in the intervals $[x_{m},{\tilde {x}}_{M}]$ and $[{\tilde {x}}_{M},{\tilde {x}}_{m}]$ . Since $f$ takes only the values $M$ and $m$ at the end points of these intervals, the three arguments are distinct. So ${\tfrac {M+m}{2}}$ is attained at least at three arguments, which contradicts the assumption that the function attains each value exactly twice.

Case 2: ${\tilde {x}}_{m}\in [x_{M},{\tilde {x}}_{M}]$

Now, we consider the case where ${\tilde {x}}_{m}$ is situated inside the interval $[x_{M},{\tilde {x}}_{M}]$ . Without loss of generality, we assume $x_{M}<x_{m}<{\tilde {x}}_{m}<{\tilde {x}}_{M}$ . Within the interval $[x_{m},{\tilde {x}}_{m}]$ , $f$ must attain a maximum $M_{2}$ . Since $m$ is the minimum on $[x_{m},{\tilde {x}}_{m}]$ , we have $M_{2}\geq m$ . In addition, $m$ has already been attained twice, so we need the strict inequality $M_{2}>m$ (else, the function would be constant on $[x_{m},{\tilde {x}}_{m}]$ ). As $M$ is the maximum of $f$ , which was already attained twice at $x_{M}$ and ${\tilde {x}}_{M}$ , we have $M>M_{2}>m$ .
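The counting argument of this case can be checked on a concrete zigzag. The sketch below takes a piecewise linear function through the values $M,m,M,m$ (with the assumed sample values $M=1$ and $m=0$ at the arguments $0,{\tfrac {1}{3}},{\tfrac {2}{3}},1$ ) and counts how often a given value is attained. The helper `count_preimages` is a hypothetical name introduced for this illustration.

```python
from fractions import Fraction


def count_preimages(nodes, y):
    """Count the arguments at which the piecewise linear function through
    `nodes` (a list of (x, value) pairs with increasing x) takes the value y.

    Assumes that neighbouring nodes have different values, so the function
    is strictly monotone on every segment.
    """
    count = sum(1 for _, value in nodes if value == y)
    for (_, v0), (_, v1) in zip(nodes, nodes[1:]):
        if min(v0, v1) < y < max(v0, v1):
            count += 1  # exactly one crossing inside a strictly monotone segment
    return count


M, m = Fraction(1), Fraction(0)
# Arguments play the roles of x_M < x_m < x~_M < x~_m from Case 1.
nodes = [(Fraction(0), M), (Fraction(1, 3), m), (Fraction(2, 3), M), (Fraction(1), m)]

print("M is attained", count_preimages(nodes, M), "times")
print("m is attained", count_preimages(nodes, m), "times")
print("(M+m)/2 is attained", count_preimages(nodes, (M + m) / 2), "times")
```

In this example $M$ and $m$ are each attained twice, but the mean ${\tfrac {M+m}{2}}$ is attained once on each of the three segments, which is the conflict the argument above derives in general.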

Now, the intermediate value theorem tells us that $M_{2}$ is attained within the interval $[x_{M},x_{m}]$ (since $f(x_{M})=M>M_{2}>m=f(x_{m})$ ). For the same reason ( $f({\tilde {x}}_{m})=m<M_{2}<M=f({\tilde {x}}_{M})$ ), the value $M_{2}$ is also attained on $[{\tilde {x}}_{m},{\tilde {x}}_{M}]$ . So $M_{2}$ is a value attained at least three times: inside the open interval $(x_{M},x_{m})$ , inside of $(x_{m},{\tilde {x}}_{m})$ and inside of $({\tilde {x}}_{m},{\tilde {x}}_{M})$ . This contradicts the assumption that every value is attained exactly twice.

In addition, the statement of the above exercise can be generalized in multiple ways:

We have shown that on a compact interval $[a,b]$ , there is no continuous function attaining each of its values exactly twice.
Similarly, one may show that there is no continuous function $f:\mathbb {R} \to \mathbb {R}$ attaining each of its values exactly twice.
And for each given number $n\in \mathbb {N}$ , $n>1$ , one can show that there is no continuous function $f:[a,b]\to \mathbb {R}$ attaining each of its values exactly $n$ times.

Exercise for understanding: Give an example of:

A continuous function $f:[0,1]\to \mathbb {R}$ , attaining each of its function values exactly once.
A function $f:(0,1]\to \mathbb {R}$ (non-continuous) attaining all of its function values exactly twice.

Possible solution:

$f:[0,1]\to \mathbb {R}$ with $f(x)=x$
$f:(0,1]\to \mathbb {R}$ with $f(x)={\begin{cases}2x&0<x\leq {\frac {1}{2}}\\2x-1&{\frac {1}{2}}<x\leq 1\end{cases}}$
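For the second function one can check directly that each value $y\in (0,1]$ has exactly the two preimages ${\tfrac {y}{2}}\in (0,{\tfrac {1}{2}}]$ and ${\tfrac {y+1}{2}}\in ({\tfrac {1}{2}},1]$ , one on each branch. The following sketch verifies this for a few sample values with exact fractions; the helper name `preimages` is an assumption made for this illustration.

```python
from fractions import Fraction


def f(x: Fraction) -> Fraction:
    """The piecewise function from the possible solution, defined on (0, 1]."""
    if not 0 < x <= 1:
        raise ValueError("x must lie in (0, 1]")
    return 2 * x if x <= Fraction(1, 2) else 2 * x - 1


def preimages(y: Fraction):
    """Return the two arguments in (0, 1] that f maps to y, for y in (0, 1]."""
    return y / 2, (y + 1) / 2


# Check a few sample values y = 1/10, 2/10, ..., 10/10.
for k in range(1, 11):
    y = Fraction(k, 10)
    x1, x2 = preimages(y)
    print(y, x1, x2, f(x1) == y and f(x2) == y)
```

Both branches are strictly increasing and each maps onto $(0,1]$ , so there are no further preimages. The jump at $x={\tfrac {1}{2}}$ is what makes this possible, in line with the exercise above.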

This article is licensed under the free license CC-BY-SA 3.0. You may use, modify and share it freely, as long as you name "Serlo" as the source and release your changes under CC-BY-SA 3.0 or a compatible license. The page "Kopier uns!" explains what you have to pay attention to when using our texts, pictures or videos.