Supremum and infimum – Serlo
Introduction
[Bearbeiten]Supremum (from Latin „supremum“ = "the highest/supreme“) sounds, as if it were "the maximum“ (that is, the largest element of the set). In the course of the article, however, we will see that the supremum generalzes the maximum. Let's start by remembering the following:
Every maximum is a supremum, but not every supremum is a maximum.
While the maximum has to be an element of a considered set, this need not apply to the supremum. Therefore we should aptly translate "supremum“ as "the number immediately restricting from the top“. It is "restricting from the top“, because it is like the maximum greater than or equal to any number of the set. And it is "immediate" because it is the smallest of all "upward limiting numbers".
Similarly, the infimum is a generalization of the minimum. It is the "number that immediately restricts downwards", i.e. the largest of all the "numbers that restrict downwards" of a set. We will get to know concrete examples in the coming sections.
For us the concept of the supremum is important, because with it the completeness of the real numbers can be described alternatively. In addition, the supremum is a useful tool in proofs or in defining new terms.
Explanation of the supremum
[Bearbeiten]To explain the Supremum, we will examine how to arrive at its precise definition. For this we will determine how the supremum can be generalized from the maximum. Remember: the maximum of a set is its largest element. The maximum of a quantity has the following properties:
- is an element of .
- For every is .
In the second property there is therefore a smaller-equal and no smaller sign, because in the statement could also be equal to . For finite quantities, the maximum is always defined, but this is not necessarily the case for infinite quantities.
First of all, we may encounter the problem that the set under consideration is unlimited upwards. Take for example the set . This set cannot have a maximum or the like, since there is a larger number of for each real number. This set cannot have a largest element. There is also no element that could be "directly the largest" element. Therefore, a question about this with this set simply does not make sense.
For the transfer of the term "maximum" to infinite sets, the set must therefore be limited upwards. So there must be a number , which is greater than or equal to each element of the set. As a result does not necessarily have to be an element of the set.
But even then problems can still arise. Take for example the set . This set is limited to the top, because for any number greater than can be selected.
Does the quantity have a maximum? Unfortunately not. For each is another number from with property (the number is in the middle between and ). However, cannot have a maximum element, because for each number from there is at least one larger number from .
Thus, when looking at infinite quantities, the maximum loses one property. Namely, that it is element of the set[1]:
m is an element of M.- For every is .
The only property that remains is that the number you are looking for is greater than any element in the set. Such a number is called the "upper limit" of the set:
Definition (upper bound)
Let be a subset of . Then a number , which is greater than or equal to each element of , is called an upper bound. So it is for all .
Similarly, a lower bound is a number that limits a quantity downwards:
Definition (Lower bound)
Let be a subset of . Then a number , which is less than or equal to any element of , is called a lower bound. So it is for all .
When we look at our new definition, we see two things. First: Upper and lower limits do not have to be elements of the considered set, because this is not required by the definition. And secondly: the definition says nothing about a possible uniqueness of the bounds.
For example, consider the set . Here we certainly first think of as the upper bound. However, is also an upper bound and meets the requirements of the definition. Apart from the fact that is far above our example set, both numbers are not elements of the set. This example shows that there can be more than one upper bound. But it becomes even more disturbing: A limited subset of the real numbers always has infinitely many upper bounds. If is an upper bound of , any larger number, i.e. for all , is also an upper bound.
On closer inspection, the terms upper and lower bound are not very appropriate. They provide much less than a maximum term. The maximum is always unique: there can only be one of them. That is not the case with the upper bound. Let us therefore try to improve the concept.
On closer inspection, the terms upper and lower bound are not very accurate. They provide much less than a maximum term. The maximum is always unique: there can only be one of them. That is not the case with the upper barrier. Let us therefore try to improve the concept.
Consider as an example again the set . Which number could be used to generalize the maximum for ? Intuitively the number occurs to us. But why choose this number?
We want a general term that works even when the set is no longer so clearly described. Therefore, all upper limits of , i.e. all numbers greater than or equal to , are possible. Now our number should be optimal in the sense that it is as small as possible. So we get to the number . It is not only an upper bound, it is also the smallest upper bound of . We have already seen that for each there is another number with (namely ). Thus no number smaller than can be an upper bound of . is what we consider to be the "immediately above" number of .
Question: What might a set look like where it is not intuitively "clear" what number the Supremum could be?
Let's briefly have a look at this beautiful looking set of numbers: The Mandelbrot set. They are obtained by inserting all points in a two-dimensional coordinate system into a certain function . It takes a coordinate and turns it into another coordinate . This result is put back into this function and then again and again and again and again.... The coordinates you get with every step become very large for some starting points very fast, for others they remain small. Once the coordinates have moved far enough away from their starting point (have exceeded a limit ), they never come back and "run for it". If a point for the start value always remains below , the point belongs to the set and is colored black. If it exceeds , it gets a certain color, depending on when it exceeded . What we see on the right is the resulting image.
The Mandelbrot set is now in the plane, its points have - and coordinates, therefore it is not suitable for our supremum concept at first. But we can simply "look at the set of all coordinates of the Mandelbrot set" and try to find its supremum. To put it more clearly: We would like to know how far up the black dots in the picture reach and are looking for the smallest upper bound. Which value it has exactly, however, is completely unclear at the first (and also at the second) look[2].
Die kleinste obere Schranke wird durch folgende zwei Eigenschaften charakterisiert:
- ist obere Schranke von : Für jedes ist .
- Jede obere Schranke von ist mindestens so groß wie : Gilt für alle , so gilt auch . Anders formuliert: Für jedes gibt es mindestens eine Zahl mit .
Das können wir als Definition des Supremums verwenden, da es offenbar die kleinste obere Schranke charakterisiert. Das Infimum wird analog als die größte untere Schranke definiert.
}}Definition of the Supremum and Infimum
[Bearbeiten]Die Definition des Supremums und des Infimums lautet:
Definition (Supremum)
Let be a subset of . The supremum of the set is the smallest upper bound of . The supremum is characterized by the following two properties:
- For every it holds .
- There is no number less than that is an upper bound of : For all there exists at least one number with .
Definition (Infimum)
Let be a subset of . The infimum of the set is the largest lower bound of . The infimum is characterized by the following two properties:
- For every it holds .
- No number larger than is a lower bound of : For all there exists at least one number with .
The Epsilon Definition
[Bearbeiten]In the second property in the definition of the supremum of the set , which is an element of the set , it says:
"Every number less than is not an upper bound of : For all there exists at least one number with .“
In mathematical literature and textbooks, authors often set with . This is a way to write the second propery of the supremum as a formal mathematical claim. Namely, we could replace the second property given above with the equivalent statement:
"For all there exists some with .“
Since both statements are equivalent and just differently worded, it is up to our discretion which variant we choose to use in proofs.
Question: What is the epsilon definition of the infimum?
is an infimum of if is a lower bound of and if for any there exists some such that holds.
Maximum and Minimum
[Bearbeiten]For the maximum and minimum we have the following well-known definitions:
Definition (Maximum)
The maximum of a set is a number with the two following properties:
- .
- For all it holds .
Definition (Minimum)
The minimum of a set is a number with the following two properties:
- .
- For all it holds .
From these definitions it follows immediately that the maximum of a set is also the supremum of the set. I.e. let be the maximum of the set . For one, is by definition the upper bound of . Furthermore, for every with there exists a with , namely . On the other hand, not every supremum is a maximum, like we saw above using the set . The number is the supremum of this set, but not the maximum! Similar statements are true for the minimum and infimum.
Notation
[Bearbeiten]Notation | Meaning |
---|---|
Supremum of | |
Supremum of | |
Infimum of | |
Infimum of | |
Maximum of | |
Minimum of |
The Duality Principle
[Bearbeiten]We've already seen in the above definitions and explanations that the terms supremum and infimum can be considered and used similarly. This is a result of the fact that, by switching around the ordering of the real numbers, i.e. replacing with , the supremum becomes the infimum and the infimum becomes the supremum. This means we can introduce a new ordering such that holds if and only if (this is the same as us reflecting the real numbers around zero). With this new ordering, the supremum acts as the infimum and vice versa. Both orderings and have the same mathematical ordering properties. This means they are isomorphic to one another. Therefore, the properties of the infimum and supremum must be the same for this reversed ordering. This means that any statements we make in the future for suprema will also apply to infima and vice versa. The same applies to the interchangeability of statements regarding the maximum and minimum.
Example (Duality Principle)
For all it is true that . Similarly for all we have the inequality .
Existence and Uniqueness
[Bearbeiten]Up until this point we have exclusively been speaking of the supremum. This sounds as if the supremum always exists and is always unique. This hints at answers to some important mathematical questions, namely: why did we bother defining the supremum in the first place? If the maximum of a set does not always exist, defining a supremum does not actually solve this existence problem. What is the advantage of introducing the concept of a supremum? Intuitively we know that of all upper bounds of a set, there should be exactly one smallest upper bound, i.e. intuitively this smallest upper bound should always exist and be unique. However, we haven't yet proven this mathematically. Indeed, the existence and uniqueness is a true property of the supremum, which we will now show formally.
In the following theorem we will prove the uniqueness of the supremum (and infimum), i.e. that a set can have at most one supremum and one infimum.
Theorem (Uniqueness of the Supremum and Infimum)
A set can have at most one supremum and one infimum.
Proof (Uniqueness of the Supremum and Infimum)
We can use the standard proof method for showing uniqueness: first we assume there exists some set with two suprema and . Then we will show . Both of the suprema have the following properties:
- and are upper bounds of .
- No number less than and is an upper bound of .
By the second property and since is an upper bound o , can't be smaller than and must therefore be greater than or equal to . Similarly it must hold . Since and we can conclude . The proof for the uniqueness of the infimum is similar.
Using the completeness axiom we can also prove the existence of the supremum of a non-empty subset of the real numbers that is bounded above. However, we will not deal with this in this chapter. We can also prove a similar statement about the exitence of the infimum of a non-empty subset of the real numbers that is bounded below. It is indeed the case that the supremum and infimum of a non-empty subset of the real numbers exist when this set is bounded above and below and are always unique.
Exegesis: Suprema and Infima in Partial Orderings
[Bearbeiten]We introduced the above definitions for suprema and infima for sets of real numbers. This is sufficient for an Introduction to Real Analysis class, since such classes often deal with subsets of . In higher-level theoretical math classes, the concept of the partial ordering, which satisfies the reflexitivity, anti-symmetry, and transitivity properties, but does not satisfy the totality property, i.e. there may be elements for which neither nor holds. In the case of partial orderings, the definitions we provided above are not sufficient to construct a sensible supremum or infimum. The main issue with these definitions in the case of partial orderings is that we lose the uniqueness of the supremum and infimum. In order to ensure the uniqueness of the supremum, we instead introduce the following definition:
Definition (Supremum in Partial Orderings)
In the partially ordered set , an element is the supremum of the set when it holds:
- is an upper bound of : for every it holds .
- For every other upper bound of it holds:
In order to show that these definitions is a sensible generalization of the supremum with respect to a partial ordering, we must show that both definitions coincide on a subset of the real numbers:
Theorem (Equivalent Definition of the Supremum)
Let be arbitrary. Our definition of the supremum is:
- For every it holds .
- Every number less than is not an upper bound of : for all there exists at least one number with .
This definition is equivalent to the definition of the supremum with respect to partial orderings:
- is an upper bound of : for every it holds .
- For every other upper bound of it holds:
Proof (Equivalent Definition of the Supremum)
Let be arbitrary. Since the first property of each pairs of properties are identical, we only have to show that the second properties coincide. I.e. we have to show the equivalence of:
- Every number less than is not an upper bound of : for al there exists at least one number with .
- For every other upper bound of it holds .
We can formalize the two claims in the following:
We can show the equivalence of these two claims in the following way: