Cosets of a subspace – Serlo

From Wikibooks

Derivation: coset or affine subspace

Lines in $\mathbb{R}^2$

You probably already know the concept of a straight line. But how do we describe a line in $\mathbb{R}^2$ mathematically? You know from school that you can parameterise a straight line by $v + \lambda u$, where $v, u \in \mathbb{R}^2$ are two fixed vectors with $u \neq 0$ and $\lambda$ takes all values in $\mathbb{R}$. That is, all points on the straight line form the set $g = \{v + \lambda u \mid \lambda \in \mathbb{R}\}$. Geometrically, this is the (infinitely long) line running through $v$ in the direction of $u$.

An affine line is described by the support vector v and the direction vector u.

In general, a line $g$ does not pass through the origin $0$. In that case $g$ is not a subspace of $\mathbb{R}^2$, since by definition every subspace contains the origin. However, the line $g$ is a displaced version of the line $U = \{\lambda u \mid \lambda \in \mathbb{R}\}$ by the vector $v$. Here $U$ is a line passing through the origin. It is a subspace because it contains the origin and is closed under addition and scalar multiplication. That is, every straight line is given by the choice of a (one-dimensional) subspace $U$ and a vector $v$. This justifies the notation $g = v + U$. This notation can also be formalised:

For a subspace $U \subseteq \mathbb{R}^2$ and a vector $v \in \mathbb{R}^2$, define the set $v + U := \{v + w \mid w \in U\}$. Then the sets $g$ and $U$ defined above satisfy $g = v + U$.
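The description of a line as a shifted subspace can be tried out numerically. The following is a minimal sketch, not part of the original article: the function name `on_line` and the concrete vectors are assumptions chosen for illustration. It tests whether a point lies on the line through a support vector in a given direction by checking that the difference to the support vector is a multiple of the direction vector.

```python
def on_line(p, v, u):
    """Return True if the point p lies on the line v + span{u} in R^2.

    Assumes u is not the zero vector.
    """
    # p lies on the line exactly when p - v is a scalar multiple of u,
    # i.e. when the 2x2 determinant with columns (p - v) and u vanishes.
    dx, dy = p[0] - v[0], p[1] - v[1]
    return dx * u[1] - dy * u[0] == 0

v = (1, 2)  # hypothetical support vector
u = (3, 1)  # hypothetical direction vector

print(on_line((4, 3), v, u))  # (4, 3) = v + 1 * u
print(on_line((0, 0), v, u))  # is the origin on this line?
```

With these sample vectors the origin is not on the line, so this particular line is not a subspace, although the line through the origin in the same direction is one.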

Planes in $\mathbb{R}^3$

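Let's increase the dimension and consider $\mathbb{R}^3$. In analogy to $\mathbb{R}^2$, we can describe a line as the set $g = \{v + \lambda u \mid \lambda \in \mathbb{R}\}$ with vectors $v \in \mathbb{R}^3$ and $u \in \mathbb{R}^3 \setminus \{0\}$. This is a displaced version of the line through the origin $U = \{\lambda u \mid \lambda \in \mathbb{R}\}$ by the vector $v$. So formally again, any line is of the form $v + U$ for a vector $v \in \mathbb{R}^3$ and a one-dimensional subspace $U \subseteq \mathbb{R}^3$.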

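What about the planes in $\mathbb{R}^3$? We parameterise them by $v + \lambda_1 u_1 + \lambda_2 u_2$, where $v, u_1, u_2 \in \mathbb{R}^3$ are fixed vectors and $\lambda_1, \lambda_2$ pass through all values in $\mathbb{R}$. The vectors $u_1$ and $u_2$ must not be scalar multiples of each other, otherwise we would get a line. All points on the plane form the set $E = \{v + \lambda_1 u_1 + \lambda_2 u_2 \mid \lambda_1, \lambda_2 \in \mathbb{R}\}$. As in the case of lines, the plane $E$ is generally not a subspace, since the origin need not lie in $E$. However, the plane $E$ is a displaced version of the subspace $U = \{\lambda_1 u_1 + \lambda_2 u_2 \mid \lambda_1, \lambda_2 \in \mathbb{R}\}$ by the vector $v$. It is therefore analogously true that every plane is given by a two-dimensional subspace and a vector, i.e. that $E = v + U$.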
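The same membership question can be asked for planes. The sketch below is an illustration under assumptions (the helper names `cross`, `dot`, `on_plane` and the sample vectors are not from the article). It uses the cross product of the two direction vectors as a normal vector: a point lies in the shifted plane exactly when its difference to the support vector is orthogonal to that normal.

```python
def cross(a, b):
    """Cross product of two vectors in R^3."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    """Standard scalar product."""
    return sum(x * y for x, y in zip(a, b))

def on_plane(p, v, u1, u2):
    """Return True if p lies in the plane v + span{u1, u2} in R^3."""
    normal = cross(u1, u2)
    if normal == (0, 0, 0):
        # u1 and u2 are linearly dependent, so they do not span a plane.
        raise ValueError("direction vectors are linearly dependent")
    difference = tuple(pi - vi for pi, vi in zip(p, v))
    return dot(difference, normal) == 0

v = (0, 0, 1)                      # hypothetical support vector
u1, u2 = (1, 0, 0), (0, 1, 0)      # hypothetical direction vectors

print(on_plane((5, -3, 1), v, u1, u2))
print(on_plane((0, 0, 0), v, u1, u2))
```

The explicit error for dependent direction vectors mirrors the requirement in the text that the two direction vectors must not be multiples of each other.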

An affine plane is described by the support vector v and the direction vectors u_1 and u_2.

Lines in $(\mathbb{Z}/5\mathbb{Z})^2$

We can also look at certain straight lines in a more complicated space: We consider the $\mathbb{Z}/5\mathbb{Z}$-vector space $(\mathbb{Z}/5\mathbb{Z})^2$. In the article on vector spaces we have already seen that we can think of this vector space as regularly arranged points on a torus. Now what is a "straight line" on this torus? We have seen in the previous two sections how we can describe straight lines in the vector spaces $\mathbb{R}^2$ and $\mathbb{R}^3$: There a straight line is the same as a set $\{v + \lambda u \mid \lambda \in \mathbb{R}\}$ with a support vector $v$ and a direction vector $u$. In other words, it is the set $v + U$, where $U$ is a one-dimensional subspace. We can transfer this construction to $(\mathbb{Z}/5\mathbb{Z})^2$, that is, we can consider a straight line $g$ as $v + U$, where $U$ is a one-dimensional subspace of $(\mathbb{Z}/5\mathbb{Z})^2$. That is, $g$ is of the form $\{v + \lambda u \mid \lambda \in \mathbb{Z}/5\mathbb{Z}\}$. We can visualise this set on a torus:

points of an affine line in (Z/5Z)^2 on a torus

The points appear to lie on a line. If we connect each point to the next along the shortest path, we get a closed curve that feels like a straight line on the torus.

Points of an affine straight line in (Z/5Z)^2 on a torus, connected by a line

Thus, displaced one-dimensional subspaces also correspond to straight lines here.

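Straight lines in $(\mathbb{Z}/5\mathbb{Z})^2$ can also be written down completely. Consider a one-dimensional subspace $U = \{\lambda u \mid \lambda \in \mathbb{Z}/5\mathbb{Z}\}$ with $u \neq 0$. We shift this by a vector $v$. Thus we obtain the line $v + U = \{v + \lambda u \mid \lambda \in \mathbb{Z}/5\mathbb{Z}\}$. Since $\lambda$ only takes five values, a line here consists of only five vectors.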
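Because the field has only five elements, such a line can be listed in full. The following sketch does this for one choice of subspace and shift. The vectors `u = (1, 2)` and `v = (0, 1)` are hypothetical values picked for illustration, not the ones from the article's example, and `affine_line` is a name introduced here.

```python
P = 5  # we compute in the field Z/5Z

def affine_line(v, u, p=P):
    """Return the set v + span{u} in (Z/pZ)^2 for a nonzero direction u."""
    return {((v[0] + lam * u[0]) % p, (v[1] + lam * u[1]) % p)
            for lam in range(p)}

u = (1, 2)  # hypothetical direction vector spanning U
v = (0, 1)  # hypothetical shift

line = affine_line(v, u)
print(sorted(line))  # [(0, 1), (1, 3), (2, 0), (3, 2), (4, 4)]
print(len(line))     # 5
```

The line contains exactly five points and, for this choice of shift, does not contain the origin.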

We have characterised geometric objects (e.g. lines and planes) as displaced subspaces in various vector spaces. Let's give them a name.

Definition: coset or affine subspace

Definition (Affine subspace or coset)

Let $V$ be a $K$-vector space and $U$ a subspace of $V$, so $U \subseteq V$. Further let $v \in V$. Then, the set $v + U := \{v + u \mid u \in U\}$ is called the affine subspace or coset obtained by translating $U$ by the vector $v$.

Derivation: set of cosets of a subspace

We have defined cosets as displaced subspaces. Consider the following example of a subspace of $\mathbb{R}^2$ displaced by two different vectors $v$ and $v'$:

Different displacements of a subspace leading to the same affine subspace

In the example above, we see that different displacements of a subspace can lead to the same affine subspace. So we ask ourselves the following question:

When are two shifted subspaces $v + U$ and $v' + U'$ the same?

Let us first imagine the whole thing in $\mathbb{R}^2$, where both shifted subspaces are lines. If they are equal, they have the same slope. The slope determines the lines $U$ and $U'$ passing through the origin. It follows that $U$ and $U'$ must be equal.

Let us now consider the question for general vector spaces. So let $V$ be a vector space, $U, U' \subseteq V$ be subspaces, $v, v' \in V$ be vectors, and let $v + U = v' + U'$ as sets. We would like to first conclude (as in $\mathbb{R}^2$) that $U = U'$. To do this, it would be nice to get $U$ back from $v + U$. This is done by taking all vectors of $v + U$ and subtracting $v$, which indeed gives us $U$. Hence, we can write $U$ as:

$$U = -v + (v + U) = -v + (v' + U') = (v' - v) + U'.$$

Since $U$ is a subspace, we have $0 \in U$. The above equation thus implies $0 \in (v' - v) + U'$, i.e., there is a $u' \in U'$ such that $0 = (v' - v) + u'$, i.e., $v - v' = u'$. In particular, $v - v' \in U'$.

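More generally, for each subspace $W$ and vector $w \in W$, we have $w + W = W$. The reason is that each $x \in W$ can be written as $x = w + (x - w)$. Since $x - w \in W$, we have $x \in w + W$. Conversely, $w + W \subseteq W$ because $W$ is closed under addition. Geometrically, you can also imagine the whole thing like this: If you move the subspace in a direction in which it already lies, it is mapped onto itself.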

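Back to our original question: Since $v - v' \in U'$ and $U'$ is a subspace, we also have $v' - v \in U'$ and therefore $(v' - v) + U' = U'$. So all in all we get the desired $U = (v' - v) + U' = U'$. On the way we have also seen that $v - v' \in U$ is a necessary criterion for $v + U = v' + U'$.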

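Are these criteria also sufficient? Yes: Suppose we have vectors $v, v' \in V$ and subspaces $U, U' \subseteq V$ with $U = U'$ and $v - v' \in U$. Then $(v - v') + U = U = U'$ and hence, by adding $v'$ on both sides, we have $v + U = v' + U'$.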

Let us summarise: Two shifted subspaces $v + U$ and $v' + U'$ are equal exactly if the (non-shifted) subspaces are equal, i.e. $U = U'$, and the difference of the shifts lies in $U$, i.e., $v - v' \in U$.
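The criterion can be checked exhaustively in a small example. The sketch below is an illustration under assumptions: it works in $(\mathbb{Z}/5\mathbb{Z})^2$, considers only the one-dimensional subspaces, and uses helper names (`span`, `shift`, `criterion`, `check_all`) introduced here. It compares "the two shifted sets are equal" with "the subspaces are equal and the difference of the shifts lies in the subspace" for every combination.

```python
from itertools import product

P = 5
VECTORS = list(product(range(P), repeat=2))  # all vectors of (Z/5Z)^2

def span(u):
    """One-dimensional subspace spanned by the nonzero vector u."""
    return frozenset(((lam * u[0]) % P, (lam * u[1]) % P) for lam in range(P))

def shift(v, U):
    """The shifted set v + U."""
    return frozenset(((v[0] + x) % P, (v[1] + y) % P) for x, y in U)

def criterion(v, U, v2, U2):
    """U = U2 and v - v2 lies in U."""
    diff = ((v[0] - v2[0]) % P, (v[1] - v2[1]) % P)
    return U == U2 and diff in U

def check_all():
    """Compare set equality with the criterion for all shifts and lines."""
    subspaces = {span(u) for u in VECTORS if u != (0, 0)}
    return all(
        (shift(v, U) == shift(v2, U2)) == criterion(v, U, v2, U2)
        for U in subspaces for U2 in subspaces
        for v in VECTORS for v2 in VECTORS
    )

print(check_all())  # True
```

A brute-force check like this is of course no proof, but it is a quick way to catch a wrongly stated criterion.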

Given a subspace $U$, we can now find out whether two displacements by $v$ or $v'$ give the same affine subspace. We can thus construct a kind of "new equality" by considering $v$ and $v'$ to be "equal" if they produce the same affine subspace. Such new equalities behave reasonably if they are equivalence relations.

Recall the definition of an equivalence relation.

Definition (Equivalence relation)

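An equivalence relation is a homogeneous binary relation $\sim$ on a basic set $M$ that has the following properties:

  • reflexive: $x \sim x$ for all $x \in M$
  • symmetric: $x \sim y$ implies $y \sim x$ for all $x, y \in M$
  • transitive: $x \sim y$ and $y \sim z$ imply $x \sim z$ for all $x, y, z \in M$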

Two elements that are related by an equivalence relation are called equivalent. If two elements $x$ and $y$ are equivalent to each other with respect to an equivalence relation $R$, one often writes $x \sim_R y$ or simply $x \sim y$.

To formally write down the "new equality" mentioned above, we define a relation $\sim$ on $V$ given by $v \sim v' :\iff v - v' \in U$. Intuitively, our relation should be an equivalence relation, since it says when two shifted subspaces are equal. We now check this formally:

Theorem ($\sim$ is an equivalence relation)

The relation $\sim$ defined by $v \sim v' :\iff v - v' \in U$ is an equivalence relation. This means that the relation is reflexive, symmetric and transitive.

How to get to the proof? ($\sim$ is an equivalence relation)

To show the assertion, we need to verify the three axioms of an equivalence relation: reflexivity, symmetry and transitivity. For reflexivity, we have to show for all $v \in V$ that $v \sim v$ is satisfied. By definition of $\sim$ we must show that $v - v \in U$. Now $v - v = 0$ and $U$ is a subspace. Therefore $0 \in U$ and hence reflexivity holds.

For transitivity and symmetry we proceed in the same way: we insert the definition and infer the desired property from the fact that $U$ is a subspace.

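Proof ($\sim$ is an equivalence relation)

Proof step: Reflexivity

Since $U$ is a subspace, we have $0 \in U$. For an arbitrary vector $v \in V$ we have $v - v = 0 \in U$. By definition of the relation, it follows that $v \sim v$ for all $v \in V$.

Proof step: Symmetry

We want to show that from $v \sim v'$ we can infer $v' \sim v$. So let $v \sim v'$. Therefore, $v - v' \in U$. As $U$ is a subspace, $U$ must be closed under taking additive inverses. So $-(v - v') \in U$. But this is equivalent to $v' - v \in U$. Hence, $v' \sim v$.

Proof step: Transitivity

Finally, we need to show that from $v \sim v'$ and $v' \sim v''$ we get $v \sim v''$. So let $v \sim v'$, i.e., $v - v' \in U$, and $v' \sim v''$, i.e., $v' - v'' \in U$. Since $U$ is a subspace, $U$ must be closed under addition. So in particular $(v - v') + (v' - v'') \in U$. As $(v - v') + (v' - v'') = v - v''$, we also have $v - v'' \in U$ and hence $v \sim v''$.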
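The three properties can also be confirmed by enumeration in a finite example. The following sketch assumes the vector space $(\mathbb{Z}/5\mathbb{Z})^2$ and the hypothetical subspace spanned by $(1, 2)^T$. The function `related` is a name introduced here for the relation "the difference lies in the subspace".

```python
from itertools import product

P = 5
V = list(product(range(P), repeat=2))  # all vectors of (Z/5Z)^2
# Hypothetical subspace U = span{(1, 2)}
U = {((lam * 1) % P, (lam * 2) % P) for lam in range(P)}

def related(v, w):
    """v ~ w if and only if v - w lies in U."""
    return ((v[0] - w[0]) % P, (v[1] - w[1]) % P) in U

reflexive = all(related(v, v) for v in V)
symmetric = all(related(w, v) for v in V for w in V if related(v, w))
transitive = all(
    related(v, x)
    for v in V for w in V for x in V
    if related(v, w) and related(w, x)
)

print(reflexive, symmetric, transitive)  # True True True
```

Replacing `U` by a set that is not a subspace (for instance one that does not contain the zero vector) makes at least one of the three checks fail, which shows where the subspace properties enter the proof.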

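We can now consider the equivalence classes of this relation, that is, to $v \in V$ we consider the set $[v] := \{v' \in V \mid v' \sim v\}$. So the set $[v]$ consists of all vectors $v'$ that displace $U$ to the same affine subspace $v' + U = v + U$. How else can we characterise these equivalence classes? We have

$$[v] = \{v' \in V \mid v' \sim v\} = \{v' \in V \mid v' - v \in U\} = \{v + u \mid u \in U\} = v + U.$$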

That is, the equivalence classes of our relation are precisely the cosets $v + U$.

Just as we can look at an equivalence relation and its equivalence classes, we can also construct a space in which the "new equality" of the equivalence relation becomes a real equality. This is the set of equivalence classes to which we now want to give a special name.

Definition: set of cosets of a subspace

Definition (Set of cosets of a subspace)

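Let $V$ be a $K$-vector space and $U$ a subspace of $V$, so $U \subseteq V$. Further, let $v, v' \in V$. Define $v \sim v' :\iff v - v' \in U$. Then, $\sim$ is an equivalence relation on $V$ and the equivalence class of an element $v \in V$ is the set $v + U = \{v + u \mid u \in U\}$. We call this the coset generated by $v$ with respect to $U$.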

We define

$$V/U := \{v + U \mid v \in V\}$$

to be the set of cosets of $U$ in $V$.


We have defined the set of cosets $V/U$ as the set of equivalence classes according to $\sim$. In the last section we saw that the equivalence class generated by $v$ is given exactly by the affine subspace $v + U$. Thus an equivalence class with respect to $\sim$ is the same as a displaced version of $U$. This provides two equivalent views of the set $V/U$: on the one hand, $V/U$ is the set of equivalence classes with respect to $\sim$; on the other hand, it is the set of displaced versions of $U$.
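Both views of the set of cosets can be computed side by side in a finite example. The sketch below again assumes $(\mathbb{Z}/5\mathbb{Z})^2$ with the hypothetical subspace spanned by $(1, 2)^T$. The names `coset`, `equivalence_class` and `quotient` are introduced here for illustration.

```python
from itertools import product

P = 5
V = list(product(range(P), repeat=2))
# Hypothetical subspace U = span{(1, 2)}
U = frozenset(((lam * 1) % P, (lam * 2) % P) for lam in range(P))

def coset(v):
    """View 1: the displaced subspace v + U."""
    return frozenset(((v[0] + x) % P, (v[1] + y) % P) for x, y in U)

def equivalence_class(v):
    """View 2: all w with w ~ v, i.e. w - v in U."""
    return frozenset(
        w for w in V if ((w[0] - v[0]) % P, (w[1] - v[1]) % P) in U
    )

# The set of cosets: many vectors v generate the same coset,
# so the set has fewer elements than V.
quotient = {coset(v) for v in V}

print(len(quotient))                                     # 5
print(all(coset(v) == equivalence_class(v) for v in V))  # True
```

The 25 vectors fall into 5 cosets of 5 vectors each, and each coset agrees with the corresponding equivalence class.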

Hint

Depending on which of the two interpretations of the elements of $V/U$ one uses, one uses different names for the elements of $V/U$. If one uses the equivalence relation to divide the elements of $V$ into different sets, one speaks of cosets. If, on the other hand, one considers a displacement of $U$, one speaks of an affine subspace.

Examples for cosets

Example (Physics: Change of potential energy)

We consider the three-dimensional space $\mathbb{R}^3$ with a gravitational field on it, with a positive gravitational constant $g$. As coordinates, we use $x$, $y$ and $z$. Such a space can be, for example, the room where you are reading this article. We put our origin at some point on your table, so we define the potential energy at that point as $0$. From that point, you can move an object to different points, assigning to each of those destination points the potential energy of a point particle of mass $m$ that we move there, which depends only on its height above the table. We can also take it to mean that we want to assign to each movement from the origin its change in potential energy. Let the table in our consideration be within the $x$-$y$ plane. The potential energy of a particle, or the change of the potential energy by a movement from the origin to $(x, y, z)^T$, is thus:

$$E_{\text{pot}}(x, y, z) = m \cdot g \cdot z$$

We want to classify the possible rectilinear displacements from the origin based on their change in potential energy, and call two displacements equivalent if they change the potential energy of a point particle by the same amount. We want to group displacements that change the potential energy in the same way into one class. The mass, as well as the gravitational constant, are given for our point particle. Therefore, two displacements under consideration cause the same change in potential energy exactly if they have the same change in height. The displacements are therefore in the same class if their $z$-values match.

Let us now abstract our illustrative example. Our space is the $\mathbb{R}$-vector space $\mathbb{R}^3$. Mathematically, displacements from the origin are described by vectors. Displacements that cause the same change in potential energy for a point particle move it from the origin in $\mathbb{R}^3$ to the same plane parallel to the $x$-$y$ plane, since exactly the point particles on this plane have the same potential energy. For a given change in potential energy, we can choose any of the vectors causing such a change as a representative.

In the chapter on subspaces we have already seen that the $x$-$y$ plane is a subspace of $\mathbb{R}^3$. In our physical example, we saw that the planes obtained by shifting it along the $z$-axis are the equivalence classes with respect to the change of potential energy. These classes are also called cosets.
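The grouping of displacements by their change in potential energy can be sketched in a few lines. This is an illustration under assumptions: the energy is taken to be mass times gravitational constant times height, and the function names and sample values (mass 2, constant 10, the listed displacements) are hypothetical.

```python
from collections import defaultdict

def potential_energy(displacement, mass, g):
    """Change in potential energy for a displacement (x, y, z) from the origin."""
    x, y, z = displacement
    return mass * g * z

def same_class(d, e):
    """Two displacements are equivalent if their difference lies in the
    x-y plane, i.e. if the z-component of the difference is zero."""
    return d[2] - e[2] == 0

def group_by_energy(displacements, mass, g):
    """Sort displacements into classes with equal change in potential energy."""
    classes = defaultdict(list)
    for d in displacements:
        classes[potential_energy(d, mass, g)].append(d)
    return dict(classes)

sample = [(1, 0, 2), (5, -3, 2), (0, 0, 0), (7, 7, 0)]  # hypothetical displacements
print(group_by_energy(sample, mass=2, g=10))
# {40: [(1, 0, 2), (5, -3, 2)], 0: [(0, 0, 0), (7, 7, 0)]}
```

Each class collects displacements that end on the same plane parallel to the table, which is exactly one coset of the $x$-$y$ plane.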

Example (Finance: Change in the balance of two accounts)

Let us assume that each person always has exactly two bank accounts. Now we want to know how much money each person has in total. So we are interested in the sum of all the money each person has in their bank accounts. We look at the two bank accounts that Anna has. In these she has saved amounts of $a_1$ and $a_2$ respectively. So the total money saved by Anna is $a_1 + a_2$.

Let us now consider two people, Emma and Fritz. Emma has on her accounts $e = (e_1, e_2)^T$. Fritz has on his accounts $f = (f_1, f_2)^T$. Emma and Fritz therefore have exactly the same amount of money if $e_1 + e_2 = f_1 + f_2$. We call the pairs of accounts $e$ and $f$ equivalent if there is the same amount of money on them, i.e. if $e_1 + e_2 = f_1 + f_2$.

With this definition, the following pairs of accounts are equivalent, for example:

The reason is .

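The two accounts of Emma and Fritz are therefore equivalent if $e_1 + e_2 = f_1 + f_2$, i.e., $(e_1 - f_1) + (e_2 - f_2) = 0$. We define the difference of the vectors $e$ and $f$ as

$$d := e - f = (e_1 - f_1, e_2 - f_2)^T.$$

The vectors $e$ and $f$ are equivalent if and only if $d_1 + d_2 = 0$.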

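In other words, the sum of the funds from two accounts is given by the following linear map:

$$s \colon \mathbb{R}^2 \to \mathbb{R}, \quad (x_1, x_2)^T \mapsto x_1 + x_2$$

Thus the kernel of $s$ is the set of pairs of accounts whose sum is zero. So two pairs of accounts are equivalent if they differ only by a vector of $\ker(s)$. We can further transform the kernel of $s$:

$$\ker(s) = \{(x_1, x_2)^T \in \mathbb{R}^2 \mid x_1 + x_2 = 0\} = \{(x_1, -x_1)^T \mid x_1 \in \mathbb{R}\} = \operatorname{span}\{(1, -1)^T\}$$

The equivalence classes with respect to the sum of the account balances are thus exactly the cosets modulo the subspace $\ker(s)$. All cosets are of the form

$$v + \ker(s) = v + \operatorname{span}\{(1, -1)^T\}$$

with $v \in \mathbb{R}^2$.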

We can also think of it like this: We want to look at the summed balance of the two accounts. In the process of summation, information on the single accounts is lost. We still know how much money a person has in total, but no longer how the money is distributed between the two accounts.
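The account example translates directly into a short computation. The sketch below is an illustration with hypothetical balances (the amounts are not taken from the article), and the function names `total`, `in_kernel` and `equivalent` are introduced here. Two pairs of accounts are compared by checking whether their difference sums to zero.

```python
def total(accounts):
    """Sum of the balances of a pair of accounts."""
    return accounts[0] + accounts[1]

def in_kernel(x):
    """True if the pair x sums to zero, i.e. x is a multiple of (1, -1)."""
    return total(x) == 0

def equivalent(e, f):
    """Two pairs of accounts are equivalent if their difference sums to zero."""
    d = (e[0] - f[0], e[1] - f[1])
    return in_kernel(d)

emma = (100, 50)   # hypothetical balances
fritz = (70, 80)   # hypothetical balances

print(total(emma), total(fritz))  # 150 150
print(equivalent(emma, fritz))    # True
```

Here the difference of the two pairs is $(30, -30)^T$, a multiple of $(1, -1)^T$: money has only been moved between the two accounts, which the total cannot detect.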

Example (Two switches for one light source)

We consider the following scenario: A hallway has two light switches, both of which are to control the ceiling lamp. For this purpose, one can use a multiway switching circuit. We want to model the behaviour of this circuit using vector spaces.

A light switch that points upwards.

We start with the light switches. Every light switch has two states: it points either up or down. So we can model it with $\mathbb{Z}/2\mathbb{Z} = \{0, 1\}$. The $1$ stands for the light switch in the upper position and the $0$ for the light switch in the lower position. When you flip the light switch, the state changes from $0$ to $1$ or from $1$ to $0$, depending on the current position of the light switch. This change of state corresponds to the mathematical operation $+1$ within the field $\mathbb{Z}/2\mathbb{Z}$.

Now that we know how to model a light switch and the flipping of a light switch, we turn to all the input data of the multiway switching: these are the two light switches. We have four states, as each light switch has the state up ($1$) and down ($0$). Thus $(\mathbb{Z}/2\mathbb{Z})^2$ is a suitable vector space for modelling the states. The state of the first light switch is given by the first component and the state of the second one by the second component. In this model, flipping the first light switch corresponds to adding $(1, 0)^T$ and flipping the second light switch corresponds to adding $(0, 1)^T$. If both light switches are in the upper position (this corresponds to $(1, 1)^T$) and we flip the first light switch, we arrive at the state $(1, 1)^T + (1, 0)^T = (0, 1)^T$.

To read off from this model whether the lamp is on or off, we must first understand how the multiway switching works. The circuit leaves us a choice at this point: We can build it in such a way that both switches have to be in the same position for the lamp to be on. We can also build it such that the lamp is on when the two switches are in different positions. For this example, we consider the case where the lamp is on exactly when both switches are in different positions. Thus we identify the states $(1, 0)^T$ and $(0, 1)^T$ with the lamp being on. We identify the other states $(0, 0)^T$ and $(1, 1)^T$ with a switched-off lamp. The latter form a subspace $U = \{(0, 0)^T, (1, 1)^T\}$. If we shift this by $(1, 0)^T$ or $(0, 1)^T$, we get the states of a switched-on lamp, $(1, 0)^T + U = (0, 1)^T + U = \{(1, 0)^T, (0, 1)^T\}$. These are all cosets of $U$ in $(\mathbb{Z}/2\mathbb{Z})^2$. Thus we can summarise the multiway switching as follows: Given a switch state $v$, we find out whether the lamp is on by determining the coset $v + U$ generated by $v$ in $(\mathbb{Z}/2\mathbb{Z})^2$. If this is $U$, the lamp is off. If it is $(1, 0)^T + U$, the lamp is on.

With this understanding of the lamp state, we can also describe the influence that flipping a switch has. Each of the operations $+(1, 0)^T$ and $+(0, 1)^T$ changes the state of the lamp. If the lamp is in the off state $U$ and the second switch is flipped with $+(0, 1)^T$, it is then in the on state $(0, 1)^T + U$. If the first switch is now flipped, the lamp changes back to the switched-off state. Mathematically, this is represented by $(1, 0)^T + \left((0, 1)^T + U\right) = (1, 1)^T + U = U$.
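The switching behaviour can be simulated with arithmetic modulo 2. The following is a minimal sketch under assumptions: switch positions are encoded as 0 and 1, the lamp is off exactly for the states $(0,0)$ and $(1,1)$ as in the case considered above, and the names `flip`, `coset_of` and `lamp_is_on` are introduced here.

```python
# Switch states for which the lamp is off; this is the subspace U of (Z/2Z)^2.
U = frozenset({(0, 0), (1, 1)})

# Flipping switch 1 or switch 2 adds the corresponding vector.
FLIP = {1: (1, 0), 2: (0, 1)}

def flip(state, switch):
    """Return the state after flipping the given switch (1 or 2)."""
    e = FLIP[switch]
    return ((state[0] + e[0]) % 2, (state[1] + e[1]) % 2)

def coset_of(state):
    """The coset state + U."""
    return frozenset(((state[0] + a) % 2, (state[1] + b) % 2) for a, b in U)

def lamp_is_on(state):
    """The lamp is on exactly when the coset of the state is not U itself."""
    return coset_of(state) != U

state = (1, 1)
print(lamp_is_on(state))   # False
state = flip(state, 2)
print(lamp_is_on(state))   # True
state = flip(state, 1)
print(lamp_is_on(state))   # False
```

Only the coset of the state matters for the lamp, so each single flip moves the state from one coset to the other, regardless of which switch is used.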

Properties of equivalence classes applied to cosets

We have seen above that cosets of a one-dimensional subspace in $\mathbb{R}^2$ are parallel straight lines. We can also explain this by characterising cosets as equivalence classes: Two equivalence classes, as sets, are either equal or disjoint. For us, this means that two cosets, i.e. two straight lines, are either equal or have no point of intersection. The latter means that they are parallel.

Furthermore, we know about equivalence classes that they cover the whole space, i.e. the union of all equivalence classes results in the whole set. From this we conclude that the union of all cosets (in our case parallel straight lines) gives the whole of $\mathbb{R}^2$. We can therefore decompose the vector space into the cosets, like leaves. This decomposition is also called a partition. So the cosets partition the vector space. In our example, this means that we can decompose $\mathbb{R}^2$ into displaced versions of a line $U$ through the origin. This is illustrated in the following picture:

$V = \mathbb{R}^2$ is partitioned by a line

Both points mentioned also work in general (not only in $\mathbb{R}^2$), since we have not used any property of $\mathbb{R}^2$ in any of our arguments. It is therefore true for a vector space $V$ and a subspace $U$ that:

$V$ is the union of the cosets $v + U$ with $v \in V$, and any two distinct cosets are disjoint.
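The partition property can be verified by enumeration in a finite vector space. The sketch below assumes $(\mathbb{Z}/5\mathbb{Z})^2$ and hypothetical direction vectors. The helper names `cosets` and `is_partition` are introduced here.

```python
from itertools import combinations, product

P = 5
V = set(product(range(P), repeat=2))  # all vectors of (Z/5Z)^2

def cosets(u):
    """All cosets v + span{u} of the line spanned by the nonzero vector u."""
    U = [((lam * u[0]) % P, (lam * u[1]) % P) for lam in range(P)]
    return {
        frozenset(((v[0] + x) % P, (v[1] + y) % P) for x, y in U)
        for v in V
    }

def is_partition(blocks, whole):
    """True if the blocks cover `whole` and are pairwise disjoint."""
    covers = set().union(*blocks) == whole
    disjoint = all(a.isdisjoint(b) for a, b in combinations(blocks, 2))
    return covers and disjoint

print(is_partition(cosets((1, 2)), V))  # True
print(len(cosets((1, 2))))              # 5
```

Collecting the cosets in a Python set removes duplicates, so the blocks that remain are exactly the distinct cosets, and those are pairwise disjoint.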

Outlook

Cosets occur when solving systems of linear equations: The solutions of the associated homogeneous system of equations form a subspace $U$. If the linear system of equations has a solution $x_0$, its solutions form the affine subspace $x_0 + U$.
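This statement can be tried out on a small system. The following sketch is an illustration under assumptions: it works over the finite field $\mathbb{Z}/5\mathbb{Z}$ so that all solutions can be found by enumeration, and the system $x_1 + 2 x_2 = 3$ as well as the names `solutions` and `translate` are hypothetical choices.

```python
from itertools import product

P = 5  # we compute in the field Z/5Z

def solutions(A, b, p=P):
    """All solutions x of A x = b over Z/pZ, found by enumeration."""
    n = len(A[0])
    return {
        x for x in product(range(p), repeat=n)
        if all(sum(a * xi for a, xi in zip(row, x)) % p == bi % p
               for row, bi in zip(A, b))
    }

def translate(x0, W, p=P):
    """The shifted set x0 + W."""
    return {tuple((c + w) % p for c, w in zip(x0, vec)) for vec in W}

A = [[1, 2]]  # hypothetical system: x1 + 2 * x2 = 3
b = [3]

sol = solutions(A, b)      # solutions of the inhomogeneous system
ker = solutions(A, [0])    # solutions of the homogeneous system (a subspace)
x0 = min(sol)              # any particular solution works

print(translate(x0, ker) == sol)  # True
```

If the system has no solution, `sol` is empty and there is no particular solution to shift by, which matches the condition in the text that the system must be solvable.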
