Linear Algebra II
Peter Philip∗
Lecture Notes Originally Created for the Class of Spring Semester 2019 at LMU Munich Includes Subsequent Corrections and Revisions†
September 19, 2021
Contents
1 Affine Subspaces and Geometry 4 1.1 AffineSubspaces ...... 4 1.2 AffineHullandAffineIndependence ...... 6 1.3 AffineBases...... 10 1.4 BarycentricCoordinates and Convex Sets...... 12 1.5 AffineMaps ...... 14 1.6 AffineGeometry...... 20
2 Duality 22 2.1 LinearFormsandDualSpaces...... 22 2.2 Annihilators...... 27 2.3 HyperplanesandLinearSystems ...... 31 2.4 DualMaps...... 34
3 Symmetric Groups 38
∗E-Mail: [email protected] †Resources used in the preparation of this text include [Bos13, For17, Lan05, Str08].
1 CONTENTS 2
4 Multilinear Maps and Determinants 46 4.1 MultilinearMaps ...... 46 4.2 Alternating Multilinear Maps and Determinants ...... 49 4.3 DeterminantsofMatricesandLinearMaps ...... 57
5 Direct Sums and Projections 71
6 Eigenvalues 76
7 Commutative Rings, Polynomials 87
8 CharacteristicPolynomial,MinimalPolynomial 106
9 Jordan Normal Form 119
10 Vector Spaces with Inner Products 136 10.1 Definition,Examples ...... 136 10.2 PreservingNorm,Metric,InnerProduct ...... 138 10.3Orthogonality ...... 145 10.4TheAdjointMap ...... 152 10.5 Hermitian,Unitary,andNormalMaps ...... 158
11 Definiteness of Quadratic Matrices over K 172
A Multilinear Maps 175
B Polynomials in Several Variables 176
C Quotient Rings 192
D Algebraic Field Extensions 199 D.1 BasicDefinitionsandProperties ...... 199 D.2 AlgebraicClosure...... 205 CONTENTS 3
References 212 1 AFFINE SUBSPACES AND GEOMETRY 4
1 Affine Subspaces and Geometry
1.1 Affine Subspaces
Definition 1.1. Let V be a vector space over the field F . Then M V is called an affine subspace of V if, and only if, there exists a vector v V and a (vector)⊆ subspace U V such that M = v + U. We define dim M := dim U∈ to be the dimension of M (this⊆ notion of dimension is well-defined by the following Lem. 1.2(a)). —
Thus, the affine subspaces of a vector space V are precisely the translations of vector subspaces U of V , i.e. the cosets of subspaces U, i.e. the elements of quotient spaces V/U. Lemma 1.2. Let V be a vector space over the field F .
(a) If M is an affine subspace of V , then the vector subspace corresponding to M is unique, i.e. if M = v1 + U1 = v2 + U2 with v1,v2 V and vector subspaces U1, U2 V , then ∈ ⊆ U = U = u v : u,v M . (1.1) 1 2 { − ∈ } (b) If M = v + U is an affine subspace of V , then the vector v in this representation is unique if, and only if, U = 0 . { } Proof. (a): Let M = v + U with v V and vector subspace U V . Moreover, let 1 1 1 ∈ 1 ⊆ U := u v : u,v M . It suffices to show U1 = U. Suppose, u1 U1. Since v1 M and v{+−u M, we∈ have} u = v + u v U, showing U U.∈ If a U, then there∈ 1 1 ∈ 1 1 1 − 1 ∈ 1 ⊆ ∈ are u1,u2 U1 such that a = v1 + u1 (v1 + u2) = u1 u2 U1, showing U U1, as desired. ∈ − − ∈ ⊆ (b): If U = 0 , then M = v and v is unique. If M = v + U with 0 = u U, then M = v + U ={ v} + u + U with{ v}= v + u. 6 ∈ 6 Definition 1.3. In the situation of Def. 1.1, we call affine subspaces of dimension 0 points, of dimension 1 lines, and of dimension 2 planes – in R2 and R3, such objects are easily visualized and they then coincide with the points, lines, and planes with which one is already familiar. —
Affine spaces and vector spaces share many structural properties. In consequence, one can develop a theory of affine spaces that is in many respects analogous to the theory 1 AFFINE SUBSPACES AND GEOMETRY 5
of vector spaces, as will be illustrated by some of the notions and results presented in the following. We start by defining so-called affine combinations, which are, for affine spaces, what linear combinations are for vector spaces:
Definition 1.4. Let V be a vector space over the field F with v1,...,vn V and n ∈ λ1,...,λn F , n N. Then i=1 λi vi is called an affine combination of v1,...,vn if, ∈ n ∈ and only if, λi = 1. i=1 P Theorem 1.5.P Let V be a vector space over the field F , = M V . Then M is an affine subspace of V if, and only, if M is closed under∅ affine 6 combinations.⊆ More precisely, the following statements are equivalent:
(i) M is an affine subspace of V .
n n (ii) If n N, v1,...,vn M, and λ1,...,λn F with i=1 λi =1, then i=1 λi vi M. ∈ ∈ ∈ ∈ P P If char F =2, then (i) and (ii) are also equivalent to1 6 (iii) If v ,v M and λ ,λ F with λ + λ =1, then λ v + λ v M. 1 2 ∈ 1 2 ∈ 1 2 1 1 2 2 ∈ Proof. Exercise.
The following Th. 1.6 is the analogon of [Phi19, Th. 5.7] for affine spaces: Theorem 1.6. Let V be a vector space over the field F .
(a) Let I = be an index set and (M ) a family of affine subspaces of V . Then the 6 ∅ i i∈I intersection M := i∈I Mi is either empty or it is an affine subspace of V . (b) In contrast to intersections,T unions of affine subspaces are almost never affine sub- spaces. More precisely, if M1 and M2 are affine subspaces of V and char F =2 (i.e. 1 = 1 in F ), then 6 6 − M M is an affine subspace of V M M M M (1.2) 1 ∪ 2 ⇔ 1 ⊆ 2 ∨ 2 ⊆ 1 (where “ ” also holds for char F =2, but cf. Ex. 1.7 below). ⇐ 1 For char F = 2, (iii) does not imply (i) and (ii): Let F := Z2 = 0, 1 . Let V be a vector space over F with #V 4 (e.g. V = F 2). Let p,q,r V be distinct, M{:= }p,q,r (i.e. #M = 3). If ≥ ∈ { } λ1,λ2 F with λ1 +λ2 = 1, then (λ1,λ2) (0, 1), (1, 0) and (iii) is trivially true. On the other hand, v := p ∈+ q + r is an affine combination of p,q,r∈ { , since 1+1+1=1} in F ; but v / M: v = p + q + r = p implies q = r = r, and v = r, q likewise leads to a contradiction (this counterexample∈ was pointed out by Robin Mader).− 1 AFFINE SUBSPACES AND GEOMETRY 6
Proof. (a): Let M = . We use the characterization of Th. 1.5(ii) to show M is an 6 ∅ n affine subspace: If n N, v1,...,vn M, and λ1,...,λn F with k=1 λk = 1, then n ∈ ∈ ∈ v := k=1 λk vk Mi for each i I, implying v M. Thus, M is an affine subspace of V . ∈ ∈ ∈ P P (b): If M1 M2, then M1 M2 = M2, which is an affine subspace of V . If M2 M , then M⊆ M = M ,∪ which is an affine subspace of V . For the converse, we⊆ 1 1 ∪ 2 1 now assume char F = 2, M1 M2, and M1 M2 is an affine subspace of V . We have to show M 6 M . Let 6⊆m M M ∪and m M . Since M M is an 2 ⊆ 1 1 ∈ 1 \ 2 2 ∈ 2 1 ∪ 2 affine subspace, m2 + m2 m1 M1 M2 by Th. 1.5(ii). If m2 + m2 m1 M2, then m = m + m (m− + m∈ m∪) M , in contradiction to m /−M .∈ Thus, 1 2 2 − 2 2 − 1 ∈ 2 1 ∈ 2 m2 + m2 m1 M1. Since char F = 0, we have 2 := 1+1 = 0 in F , implying m = 1 (m−+ m ∈ m )+ 1 m M , i.e.6 M M . 6 2 2 2 2 − 1 2 1 ∈ 1 2 ⊆ 1 2 Example 1.7. Consider F := Z2 = 0, 1 and the vector space V := F over F . Then M := U := (0, 0), (1, 0) = (1, 0){ is} a vector subspace and, in particular, an affine 1 1 { } h{ }i subspace of V . The set M2 := (0, 1) + U1 = (0, 1), (1, 1) is also an affine subspace. Then M M = V is an affine subspace, even{ though neither} M M nor M M . 1 ∪ 2 1 ⊆ 2 2 ⊆ 1
1.2 Affine Hull and Affine Independence
Next, we will define the affine hull of a subset A of a vector space, which is the affine analogon to the linear notion of the span of A (which is sometimes also called the linear hull of A): Definition 1.8. Let V be a vector space over the field F , = A V , and ∅ 6 ⊆ := M (V ): A M M is affine subspace of V , M ∈P ⊆ ∧ where we recall that (V ) denotes the power set of V . Then the set P aff A := M M∈M \ is called the affine hull of A. We call A a generating set of aff A. —
The following Prop. 1.9 is the analogon of [Phi19, Prop. 5.9] for affine spaces: Proposition 1.9. Let V be a vector space over the field F and = A V . ∅ 6 ⊆ (a) aff A is an affine subspace of V , namely the smallest affine subspace of V containing A. 1 AFFINE SUBSPACES AND GEOMETRY 7
(b) aff A is the set of all affine combinations of elements from A, i.e.
n n aff A = λ a : n N λ ,...,λ F a ,...,a A λ =1 . i i ∈ ∧ 1 n ∈ ∧ 1 n ∈ ∧ i ( i=1 i=1 ) X X (1.3) (c) If A B V , then aff A aff B. ⊆ ⊆ ⊆ (d) A = aff A if, and only if, A is an affine subspace of V . (e) aff aff A = aff A.
Proof. (a): Since A aff A implies aff A = , (a) is immediate from Th. 1.6(a). ⊆ 6 ∅ (b): Let W denote the right-hand side of (1.3). If M is an affine subspace of V and A M, then W M, since M is closed under affine combinations, showing W aff A. ⊆ ⊆ k k ⊆ On the other hand, suppose N,n1,...,nN N, a1,...,ank A for each k 1,...,N , k k ∈ ∈ ∈ { } λ ,...,λ F for each k 1,...,N , and α1,...,αN F such that 1 nk ∈ ∈ { } ∈
nk N k λi = αi =1. k∈{1∀,...,N} i=1 i=1 X X Then N nk α λk ak W, k i i ∈ i=1 Xk=1 X since N nk N k αkλi = αk =1, i=1 Xk=1 X Xk=1 showing W to be an affine subspace of V by Th. 1.5(ii). Thus, aff A W , completing the proof of aff A = W . ⊆ (c) is immediate from (b). (d): If A = aff A, then A is an affine subspace by (a). For the converse, while it is clear that A aff A always holds, if A is an affine subspace, then A , where is as in Def. 1.8,⊆ implying aff A A. ∈M M ⊆ (e) now follows by combining (d) with (a). Proposition 1.10. Let V be a vector space over the field F , A V , M = v + U with v A and U a vector subspace of V . Then the following statements⊆ are equivalent: ∈ (i) aff A = M. 1 AFFINE SUBSPACES AND GEOMETRY 8
(ii) v + A = U. h− i Proof. Exercise.
We will now define the notions of affine dependence/independence, which are, for affine spaces, what linear dependence/independence are for vector spaces:
Definition 1.11. Let V be a vector space over the field F .
(a) A vector v V is called affinely dependent on a subset U of V (or on the vectors in U) if, and∈ only if, there exists n N and u ,...,u U such that v is an affine ∈ 1 n ∈ combination of u1,...,un. Otherwise, v is called affinely independent of U. (b) A subset U of V is called affinely independent if, and only if, whenever 0 V is written as a linear combination of distinct elements of U such that the coefficients∈ have sum 0, then all coefficients must be 0 F , i.e. if, and only if, ∈
n N W U #W = n λ u =0 ∈ ∧ ⊆ ∧ ∧ u u∈W X
λu F λu =0 λu =0. (1.4) ∧u∈ ∀W ∈ ∧ ⇒u∈ ∀W u∈W ! X Sets that are not affinely independent are called affinely dependent.
—
As a caveat, it is underlined that, in Def. 1.11(b) above, one does not consider affine combinations of the vectors u U, but special linear combinations (this is related to the fact that 0 is only an affine∈ combination of vectors in U, if aff U is a vector subspace of V ).
Remark 1.12. It is immediate from Def. 1.11 that if v V is linearly independent of U V , then it is also affinely independent of U, and, if U∈ V is linearly independent, then⊆ U is also affinely independent. However, the converse⊆ is, in general, not true (cf. Ex. 1.13(b),(c) below).
Example 1.13. Let V be a vector space over the field F .
(a) is affinely independent: Indeed, if U = , then the left side of the implication in (1.4)∅ is always false (since W U means∅ #W = 0), i.e. the implication is true. ⊆ 1 AFFINE SUBSPACES AND GEOMETRY 9
(b) Every singleton set v , v V , is affinely independent, since λ = 1 λ = 0 { } ∈ 1 i=1 i means λ1 = 0 (if v = 0, then v is not linearly independent, cf. [Phi19, Ex. 5.13(b)]). { } P
(c) Every set v,w with two distinct vectors v,w V is affinely independent (but not linearly{ independent} for w = αv with some α∈ F ): 0= λv λw = λ(v w) implies λ =0 or v = w. ∈ − −
—
There is a close relationship between affine independence and linear independence:
Proposition 1.14. Let V be a vector space over the field F and U V . Then the following statements are equivalent: ⊆
(i) U is affinely independent.
(ii) If u U, then U := u u : u U u is linearly independent. 0 ∈ 0 { − 0 ∈ \ { 0}} (iii) The set X := (u, 1) V F : u U is a linearly independent subset of the { ∈ × ∈ } vector space V F . × Proof. Exercise.
The following Prop. 1.15 is the analogon of [Phi19, Prop. 5.14(a)-(c)] for affine spaces:
Proposition 1.15. Let V be a vector space over the field F and U V . ⊆
(a) U is affinely dependent if, and only if, there exists u0 U such that u0 is affinely dependent on U u . ∈ \ { 0} (b) If U is affinely dependent and U M V , then M is affinely dependent as well. ⊆ ⊆ (c) If U is affinely independent and M U, then M is affinely independent as well. ⊆ Proof. (a): Suppose, U is affinely dependent. Then there exists W U, #W = n N, such that λ u = 0 with λ F , λ = 0, and there exists⊆ u W ∈with u∈W u u ∈ u∈W u 0 ∈ λu = 0. Then 0 6 P P u = λ−1 λ u = ( λ−1λ )u, ( λ−1λ )=( λ−1) ( λ )=1, 0 − u0 u − u0 u − u0 u − u0 · − u0 u∈WX\{u0} u∈WX\{u0} u∈WX\{u0} 1 AFFINE SUBSPACES AND GEOMETRY 10
showing u to be affinely dependent on U u . Conversely, if u U is affinely 0 \ { 0} 0 ∈ dependent on U u0 , then there exists n N, distinct u1,...,un U u0 , and λ ,...,λ F with\ { }n λ = 1 such that ∈ ∈ \ { } 1 n ∈ i=1 i P n n u = λ u u + λ u =0, 0 i i ⇒ − 0 i i i=1 i=1 X X showing U to be affinely dependent, since the coefficient of u0 is 1 = 0 and 1+ n − 6 − i=1 λi = 0. P(b) and (c) are now both immediate from (a).
1.3 Affine Bases
Definition 1.16. Let V be a vector space over the field F , let M V be an affine subspace, and B V . Then B is called an affine basis of M if, and⊆ only if, B is a generating set for ⊆M (i.e. M = aff B) that is also affinely independent. —
There is a close relationship between affine bases and vector space bases: Proposition 1.17. Let V be a vector space over the field F , let M V be an affine subspace, and let B M with v B. Then the following statements are⊆ equivalent: ⊆ ∈ (i) B is an affine basis of M.
(ii) B := b v : b B v is a vector space basis of the vector space U := 0 { − ∈ \ { }} v v : v ,v M . { 1 − 2 1 2 ∈ } Proof. As a consequence of Lem. 1.2(a), we know U to be a vector subspace of V and M = a + U for each a M. Moreover, v B M implies B U. According ∈ ∈ ⊆ 0 ⊆ to Prop. 1.14, B is affinely independent if, and only if, B0 is linearly independent. According to Prop. 1.10, aff B = M holds if, and only if, v + B = U, which, since B =( v + B) 0 , holds if, and only if, B = U. h− i 0 − \ { } h 0i The following Th. 1.18 is the analogon of [Phi19, Th. 5.17] for affine spaces: Theorem 1.18. Let V be a vector space over the field F , let M V be an affine subspace, and let = B V . Then the following statements (i) – (iii)⊆are equivalent: ∅ 6 ⊆ (i) B is an affine basis of M. 1 AFFINE SUBSPACES AND GEOMETRY 11
(ii) B is a maximal affinely independent subset of M, i.e. B is affinely independent and each set A M with B ( A is affinely dependent. ⊆ (iii) B is a minimal generating set for M, i.e. aff B = M and aff A ( M for each A ( B.
Proof. Let v B, and let B and U be as in Prop. 1.17 above. Then, due to Prop. ∈ 0 1.14, B is a maximal affinely independent subset of M if, and only if, B0 is a maximal linearly independent subset of U. Moreover, due to Prop. 1.10, B is a minimal (affine) generating set for M if, and only if, B0 is a minimal (linear) generating set for U. Thus, the equivalences of Th. 1.18 follow by combining Prop. 1.17 with [Phi19, Th. 5.17].
The following Th. 1.19 is the analogon of [Phi19, Th. 5.23] for affine spaces: Theorem 1.19. Let V be a vector space over the field F and let M V be an affine subspace. ⊆
(a) If S M is affinely independent, then there exists an affine basis of M that contains S. ⊆ (b) M has an affine basis B M. ⊆ (c) Affine bases of M have a unique cardinality, i.e. if B M and B˜ M are both affine bases of M, then there exists a bijective map φ : ⊆B B˜. ⊆ −→ (d) If B is an affine basis of M and S M is affinely independent, then there exists C B such that B˜ := S ˙ C is an affine⊆ basis of M. ⊆ ∪ Proof. Let v V and let U be a vector subspace of V such that M = v + U. Then v M and U∈= v v : v ,v M according to Lem. 1.2(a). ∈ { 1 − 2 1 2 ∈ } (a): It suffices to consider the case S = . Thus, let v S. According to Prop. 1.14(ii), S := x v : x S v is a6 linearly∅ independent∈ subset of U. According to 0 { − ∈ \ { }} [Phi19, Th. 5.23(a)], U has a vector space basis S0 B0 U. Then, by Prop. 1.17, S (v + B ) ˙ v is an affine basis of M. ⊆ ⊆ ⊆ 0 ∪{ } (b) is immediate from (a). (c): Let B M and B˜ M be affine bases of M. Moreover, let b B and ˜b B˜. ⊆ ⊆ ∈ ∈ Then, by Prop. 1.17, B0 := x b : x B b and B˜0 := x ˜b : x B˜ ˜b are both vector space bases of{ U−. Thus, by∈ [Phi19,\ { }} Th. 5.23(c)],{ there− exists∈ a bijective\ { }} map ψ : B B˜ . Then, clearly, the map 0 −→ 0 ˜b for x = b, φ : B B,˜ φ(x) := −→ ˜b + ψ(x b) for x = b, ( − 6 1 AFFINE SUBSPACES AND GEOMETRY 12
is well-defined and bijective, thereby proving (c). (d): If B aff S, then, according to Prop. 1.9(c),(e), M = aff B aff aff S = aff S, i.e. aff S =⊆ M, as M is an affine subspace containing S. Thus, S⊆is itself an affine basis of M and the statement holds with C := . It remains to consider the case, where there exists b B S such that S b is affinely∅ independent. Then, by Prop. 1.17, B := x b :∈x \B b is a vector∪ { } space basis of U and, by Prop. 1.14(ii), 0 { − ∈ \ { }} S0 := x b : x S is a linearly independent subset of U. Thus, by [Phi19, Th. 5.23(d)],{ there− exists∈ C} B such that B˜ := S ˙ C is a vector space basis of U, and, 0 ⊆ 0 0 0 ∪ 0 then, using Prop. 1.17 once again, (b + B˜0) ˙ b = S ˙ C with C := (b + C0) ˙ b B is an affine basis of M. ∪{ } ∪ ∪{ }⊆
1.4 Barycentric Coordinates and Convex Sets
The following Th. 1.20 is the analogon of [Phi19, Th. 5.19] for affine spaces:
Theorem 1.20. Let V be a vector space over the field F and assume M V is an affine subspace with affine basis B of M. Then each vector v M has unique barycentric⊆ coordinates with respect to the affine basis B, i.e., for each v∈ M, there exists a unique finite subset B of B and a unique map c : B F 0 such∈ that v v −→ \ { } v = c(b) b c(b)=1. (1.5) ∧ bX∈Bv bX∈Bv
Proof. The existence of Bv and the map c follows from the fact that the affine basis B is an affine generating set, aff B = M. For the uniqueness proof, consider finite sets B , B˜ B and maps c : B F 0 ,c ˜ : B˜ F 0 such that v v ⊆ v −→ \ { } v −→ \ { } v = c(b) b = c˜(b) b c(b)= c˜(b)=1. ∧ b∈B ˜ b∈B ˜ Xv bX∈Bv Xv bX∈Bv
Extend both c andc ˜ to A := Bv B˜v by letting c(b) := 0 for b B˜v Bv andc ˜(b):=0 for b B B˜ . Then ∪ ∈ \ ∈ v \ v 0= c(b) c˜(b) b, − b∈A X such that the affine independence of A implies c(b)=˜c(b) for each b A, which, in ∈ turn, implies Bv = B˜v and c =˜c. Example 1.21. With respect to the affine basis 0, 1 of R over R, the barycentric coordinates of 1 are 2 and 1 , whereas the barycentric{ coordinates} of 5 are 4 and 5. 3 3 3 − 1 AFFINE SUBSPACES AND GEOMETRY 13
Remark 1.22. Let V be a vector space over the field F and assume M V is an affine subspace with affine basis B of M. ⊆
(a) Caveat: In the literature, one also finds the notion of affine coordinates, however, this notion of affine coordinates is usually (but not always, so one has to use care) defined differently from the notion of barycentric coordinates as defined in Th. 1.20 above: For the affine coordinates, one designates one point x B to be the origin 0 ∈ of M. Let v M and let c : Bv F 0 be the map yielding the barycentric coordinates according∈ to Th. 1.20.−→ We\ write { } x B = x ,x ,...,x with { 0}∪ v { 0 1 n} distinct elements x1,...,xn M (if any) and we set c(x0) := 0 in case x0 / Bv. Then ∈ ∈ n n v = c(x ) x c(x )=1, i i ∧ i i=0 i=0 X X which, since 1 n c(x )= c(x ), is equivalent to − i=1 i 0 P n n v = x + c(x )(x x ) c(x )=1. 0 i i − 0 ∧ i i=1 i=0 X X
One calls the c(x1),...,c(xn), given by the map ca := c↾Bv\{x0}, the affine coordi- nates of v with respect to the affine coordinate system x ( x +B) (for v = x , { 0}∪ − 0 0 ca turns out to be the empty map).
(b) If x1,...,xn M are distinct points that are affinely independent and n := n 1 =0 in F , then one∈ sometimes calls · 6
1 n v := x M n i ∈ i=1 X
the barycenter of x1,...,xn. Definition and Remark 1.23. Let V be a vector space over R (we restrict ourselves to vector spaces over R, since, for a scalar λ we will need to know what it means for λ to be positive, i.e. λ > 0 needs to be well-defined). Let v1,...,vn V and n ∈ λ1,...,λn R, n N. Then we call the affine combination i=1 λi vi of v1,...,vn a ∈ ∈ n convex combination of v1,...,vn if, and only if, in addition to λi = 1, one has P i=1 λi 0 for each i 1,...,n . Moreover, we call C V convex if, and only if, C ≥ ∈ { } ⊆ P is closed under convex combinations, i.e. if, and only if, n N, v1,...,vn C, and + n n ∈ ∈ λ1,...,λn R0 with i=1 λi = 1, implies i=1 λi vi C (analogous to Th. 1.5, C V is then convex∈ if, and only if, each convex combination∈ of merely two elements of⊆ C is again in C). NoteP that, in contrast toP affine subspaces, we allow convex sets to 1 AFFINE SUBSPACES AND GEOMETRY 14
be empty. Clearly, the convex subsets of R are precisely the intervals (open, closed, half-open, bounded or unbounded). Convex subsets of R2 include triangles and disks. Analogous to the proof of Th. 1.6(a), one can show that arbitrary intersections of convex sets are always convex, and, analogous to the definition of the affine hull in Def. 1.8, one defines the convex hull conv A of a set A V by letting ⊆ := C (V ): A C C is convex subset of V , C ∈P ⊆ ∧ conv A := C. C∈C \ Then Prop. 1.9 and its proof still work completely analogously in the convex situation and one obtains conv A to be the smallest convex subset of V containing A, where conv A consists precisely of all convex combinations of elements from A; A = conv A holds if, and only if, A is convex; convconv A = conv A; and conv A conv B for each A B V . If n N and A = x ,x ,...,x V is an affinely⊆ independent set, consisting⊆ ⊆ of ∈ 0 { 0 1 n} ⊆ the n + 1 distinct points x0,x1,...,xn, then conv A is called an n-dimensional simplex (or simply an n-simplex) with vertices x0,x1,...,xn – 0-simplices are called points, 1- simplices line segments, 2-simplices triangles, and 3-simplices tetrahedra. If e1,...,ed denotes the standard basis of Rd, d N, then conv e ,...,e , 0 n 1.5 Affine Maps We first study a special type of affine map, namely so-called translations. Definition 1.24. Let V be a vector space over the field F . If v V , then the map ∈ T : V V, T (x) := x + v, v −→ v is called a translation, namely, the translation by v or the translation with translation vector v. Let (V ) := T : v V denote the set of translations on V . T { v ∈ } Proposition 1.25. Let V be a vector space over the field F . (a) If v V and A, B V , then Tv(A + B) = v + A + B. In particular, translations map∈ affine subspaces⊆ of V into affine subspaces of V . (b) If v V , then T is bijective with (T )−1 = T . In particular, (V ) S , where ∈ v v −v T ⊆ V SV denotes the symmetric group on V according to [Phi19, Ex. 4.9(b)]. (c) Nontrivial translations are not linear: More precisely, T with v V is linear if, v ∈ and only if, v =0 (i.e. Tv = Id). 1 AFFINE SUBSPACES AND GEOMETRY 15 (d) If v,w V , then T T = T = T T . ∈ v ◦ w v+w w ◦ v (e) ( (V ), ) is a commutative subgroup of (SV , ). Moreover, ( (V ), ) ∼= (V, +), whereT ◦ ◦ T ◦ I : (V, +) ( (V ), ), I(v) := T , −→ T ◦ v constitutes a group isomorphism. Proof. Exercise. We will now define affine maps, which are, for affine spaces, what linear maps are for vector spaces: Definition 1.26. Let V and W be vector spaces over the field F . Amap A : V W −→ is called affine if, and only if, there exists a linear map L (V,W ) and w W such that ∈ L ∈ A(x)=(Tw L)(x)= w + L(x) (1.6) x∈∀V ◦ (i.e. the affine maps are precisely the compositions of linear maps with translations). We denote the set of all affine maps from V into W by (V,W ). A Proposition 1.27. Let V,W,X be vector spaces over the field F . (a) If L (V,W ) and v V , then L T = T L (V,W ). ∈L ∈ ◦ v Lv ◦ ∈A (b) If A (V,W ), L (V,W ), and w W , then A = T L if, and only if, ∈ A ∈ L ∈ w ◦ T−w A = L. In particular, A = Tw L is injective (resp. surjective, resp. bijective) if, and◦ only if, L is injective (resp. surjective,◦ resp. bijective). (c) If A : V W is an affine and bijective, then A−1 is also affine. −→ (d) If A : V W and B : W X are affine, then so is B A. −→ −→ ◦ (e) Define GA(V ) := A (V,V ): A bijective . Then (GA(V ), ) forms a subgroup { ∈A } ◦ of the symmetric group (SV , ) (then, clearly, GL(V ) forms a subgroup of GA(V ), cf. [Phi19, Cor. 6.23]). ◦ Proof. (a): If L (V,W ) and v,x V , then ∈L ∈ (L T )(x)= L(v + x)= Lv + Lx =(T L)(x), ◦ v Lv ◦ proving L T = T L. ◦ v Lv ◦ 1 AFFINE SUBSPACES AND GEOMETRY 16 (b) is due to the bijectivity of T : One has, since T T = Id, w −w ◦ w A = T L T A = T T L = Id L = L. w ◦ ⇔ −w ◦ −w ◦ w ◦ ◦ Moreover, for each x,y V and z W , one has ∈ ∈ Ax = w + Lx = w + Ly = Ay Lx = Ly, ⇔ z + w = Ax = w + Lx z = Lx, ⇔ z w = Lx z = w + Lx = Ax, − ⇔ proving A = Tw L is injective (resp. surjective, resp. bijective) if, and only if, L is injective (resp. surjective,◦ resp. bijective). (c): If A = Tw L with L (V,W ) and w W is affine and bijective, then, by (b), L is bijective. Thus,◦ A−1 = L∈L−1 (T )−1 = L−∈1 T , which is affine by (a). ◦ w ◦ −w (d): If A = T L, B = T K with L (V,W ), w W , K (W, X), x X, then w ◦ x ◦ ∈L ∈ ∈L ∈ (B A)(a)= B(w + La)= x + Kw +(K L)(a)= TKw+x (K L) (a), a∈∀V ◦ ◦ ◦ ◦ showing B A to be affine. ◦ (e) is an immediate consequence of (c) and (d). Proposition 1.28. Let V and W be vector spaces over the field F . (a) Let v V , w W , L (V,W ), and let U be a vector subspace of V . Then ∈ ∈ ∈L (T L)(v + U)= w + Lv + L(U) w ◦ (in particular, each affine image of an affine subspace is an affine subspace). More- over, if A := Tw L and S V such that M := v + U = aff S, then A(M) = w + Lv + L(U) =◦ aff(A(S)). ⊆ (b) Let y W , L (V,W ), and let U be a vector subspace of W . Then L−1(U) is a ∈ ∈L vector subspace of V and L−1(y + U)= v + L−1(U) v∈L−∀1{y} (in particular, each linear preimage of an affine subspace is either empty or an affine subspace). (c) If M W is an affine subspace of W and A (V,W ), then A−1(M) is either ⊆ ∈ A empty or an affine subspace of V . 1 AFFINE SUBSPACES AND GEOMETRY 17 Proof. Exercise. The following Prop. 1.29 is the analogon of [Phi19, Prop. 6.5(a),(b)] for affine spaces (but cf. Caveat 1.30 below): Proposition 1.29. Let V and W be vector spaces over the field F , and let A : V W be affine. −→ (a) If A is injective, then, for each affinely independent subset S of V , A(S) is an affinely independent subset of W . (b) A is surjective if, and only if, for each subset S of V with V = aff S, one has W = aff(A(S)). Proof. Let w W and L (V,W ) be such that A = T L. ∈ ∈L w ◦ (a): If A is injective, S V is affinely independent, and λ1,...,λn F ; s1,...,sn S distinct; n N; such that⊆ n λ = 0 and ∈ ∈ ∈ i=1 i n n P n n n 0= λiA(si)= λi(w + Lsi)= λi w + L λisi = L λisi , i=1 i=1 i=1 ! i=1 ! i=1 ! X X X X X n then i=1 λisi = 0 by [Phi19, Prop. 6.3(d)], implying λ1 = = λn = 0 and, thus, showing that A(S) is also affinely independent. ··· P (b): If A is not surjective, then aff(A(V )) = A(V ) = W , since A(V ) is an affine subspace of W by Prop. 1.28(a). Conversely, if A is surjective,6 S V , and aff(S)= V , then ⊆ Prop. 1.28(a) W = A(V )= A(aff S) = aff(A(S)), thereby establishing the case. Caveat 1.30. Unlike in [Phi19, Prop. 6.5(a)], the converse of Prop. 1.29(a) is, in general, not true: If dim V 1 and A w W is constant, then A is affine, not injective, but it maps every nonempty≥ affinely≡ independent∈ subset of V (in fact, every nonempty subset of V ) onto the affinely independent set w . { } Corollary 1.31. Let V and W be vector spaces over the field F , and let A : V W be affine and injective. If M V is an affine subspace and B is an affine basis−→ of M, then A(B) is an affine basis of⊆A(M) (Caveat 1.30 above shows that the converse is, in general, not true). Proof. Since B is affinely independent, A(B) is affinely independent by Prop. 1.29(a). On the other hand, A(M) = aff(A(B)) by Prop. 1.28(a). 1 AFFINE SUBSPACES AND GEOMETRY 18 The following Prop. 1.32 shows that affine subspaces are precisely the images of vector subspaces under translations and also precisely the sets of solutions to linear systems with nonempty sets of solutions: Proposition 1.32. Let V be a vector space over the field F and M V . Then the ⊆ following statements are equivalent: (i) M is an affine subspace of V . (ii) There exists v V and a vector subspace U V such that M = T (U). ∈ ⊆ v (iii) There exists a linear map L (V,V ) and a vector b V such that = M = L−1 b = x V : Lx = b ∈(if LV is finite-dimensional,∈ then L−1 b =∅ 6 (L b), where{ } (L{b) denotes∈ the set} of solutions to the linear system Lx ={b according} L | to [Phi19,L Rem.| 8.3]). Proof. “(i) (ii)”: By the definition of affine subspaces, (i) is equivalent to the existence of v V and⇔ a vector subspace U V such that M = v + U = T (U), which is (ii). ∈ ⊆ v “(iii) (i)”: Let L (V,V ) and b V such that = M = L−1 b . Let x M. Then, ⇒ ∈L ∈ ∅ 6 { } 0 ∈ by [Phi19, Th. 4.20(f)], M = x0 + ker L, showing M to be an affine subspace. “(i) (iii)”: Now suppose M = v + U with v V and U a vector subspace of V . According⇒ to [Phi19, Th. 5.27(c)], there exists a subspace∈ W of V such that V = U W . Then, clearly, L : V V , L(u+w) := w (where u U, w W ), defines a linear⊕ map. Let b := Lv. Then M−→= L−1 b : Indeed, if u U, then∈ L(∈v + u) = Lv +0= Lv = b, showing M L−1 b ; if L(u+{w})= w = b = Lv∈, then u+w = v+u+w v v+U = M (since L(u +⊆w {v)} = Lw Lv = w Lv = 0 implies u + w v− ∈U), showing − − − − ∈ L−1 b M. { }⊆ The following Th. 1.33 is the analogon of [Phi19, Th. 6.9] for affine spaces: Theorem 1.33. Let V and W be vector spaces over the field F . Moreover, let MV = v + UV V and MW = w + UW W be affine subspaces of V and W , respectively, where v ⊆ V , w W , U is a vector⊆ subspace of V and U is a vector subspace of ∈ ∈ V W W . Let BV be an affine basis of MV and let BW be an affine basis of MW . Then the following statements are equivalent: (i) There exists a linear isomorphism L : U U such that V −→ W M =(T L T )(M ). W w ◦ ◦ −v V (ii) UV and UW are linearly isomorphic. 1 AFFINE SUBSPACES AND GEOMETRY 19 (iii) dim MV = dim MW . (iv) #BV =#BW (i.e. there exists a bijective map from BV onto BW ). Proof. “(i) (ii)” is trivially true. ⇒ “(ii) (i)” holds, since the restricted translations T : M U and T : U ⇒ −v V −→ V w W −→ MW are, clearly, bijective. “(ii) (iii)”: By Def. 1.1, (iii) is equivalent to dim UV = dim UW , which, according to [Phi19,⇔ Th. 6.9], is equivalent to (ii). “(iii) (iv)”: Let x B and y B . Then, by Prop. 1.17, S := b x : b B x ⇔ ∈ V ∈ W V { − ∈ V \{ }} is a vector space basis of UV and SW := b y : b BW y is a vector space basis of U , where the restricted translations T { : −B x∈ \S { }}and T : B y S W −x V \{ } −→ V −y W \{ } −→ W are, clearly, bijective. Thus, if dim MV = dim MW , then there exists a bijective map φ : SV SW , implying (Ty φ T−x): BV x BW y to be bijective as well. Conversely,−→ if ψ : B x ◦ B◦ y is bijective,\ { } −→ so is\ (T { } ψ T ): S S , V \ { } −→ W \ { } −y ◦ ◦ x V −→ W implying dim MV = dim MW . Analogous to [Phi19, Def. 6.17], we now consider, for vector spaces V,W over the field F , (V,W ) with pointwise addition and scalar multiplication, letting, for each A, B (V,WA ), λ F , ∈ A ∈ (A + B): V W, (A + B)(x) := A(x)+ B(x), −→ (λ A): V W, (λ A)(x) := λ A(x) for each λ F. · −→ · · ∈ The following Th. 1.34 corresponds to [Phi19, Th. 6.18] and [Phi19, Th. 6.21] for linear maps. Theorem 1.34. Let V and W be vector spaces over the field F . Addition and scalar multiplication on (V,W ), given by the pointwise definitions above, are well-defined in the sense that, if A,A B (V,W ) and λ F , then A+B (V,W ) and λA (V,W ). Moreover, with these pointwise∈A defined∈ operations, (V,W∈A) forms a vector∈A space over F . A Proof. According to [Phi19, Ex. 5.2(c)], it only remains to show that (V,W ) is a V A vector subspace of (V,W )= W . To this end, let A, B (V,W ) with A = Tw1 L1, B = T L , whereF w ,w W , L ,L (V,W ), and let∈Aλ F . If v V , then ◦ w2 ◦ 2 1 2 ∈ 1 2 ∈L ∈ ∈ (A + B)(v)= w1 + L1v + w2 + L2v = w1 + w2 +(L1 + L2)v, (λA)(v)= λw1 + λL1v, proving A + B = T (L + L ) (V,W ) and λA = T (λL ) (V,W ), as w1+w2 ◦ 1 2 ∈A λw1 ◦ 1 ∈A desired. 1 AFFINE SUBSPACES AND GEOMETRY 20 1.6 Affine Geometry The subject of affine geometry is concerned with the relationships between affine sub- spaces, in particular, with the way they are contained in each other. Definition 1.35. Let V be a vector space over the field F and let M,N V be affine subspaces. ⊆ (a) We define the incidence M I N by M I N : M N N M . ⇔ ⊆ ∨ ⊆ If M I N holds, then we call M,N incident or M incident with N or N incident with M. (b) If M = v + UM and N = w + UN with v,w V and UM , UN vector subspaces of V , then we call M,N parallel (denoted M N∈) if, and only if, U I U . k M N Proposition 1.36. Let V be a vector space over the field F and let M,N V be affine subspaces. ⊆ (a) If M N, then M I N or M N = . k ∩ ∅ (b) If n N0 and n denotes the set of affine subspaces with dimension n of V , then the parallelity∈ relationA of Def. 1.35(b) constitutes an equivalence relation on . An (c) If denotes the set of all affine subspaces of V , then, for dim V 2, the parallelity relationA of Def. 1.35(b) is not transitive (in particular, not an equivalence≥ relation) on . A Proof. (a): Let M = v + UM and N = w + UN with v,w V and UM , UN vector subspaces of V . Without loss of generality, assume U U∈ . Assume there exists M ⊆ N x M N. Then, if y M, then y x UM UN , implying y = x +(y x) N and M∈ N∩. ∈ − ∈ ⊆ − ∈ ⊆ (b): It is immediate from Def. 1.35 that is reflexive and symmetric. It remains to k show is transitive on n. Thus, suppose M = v + UM , N = w + UN , P = z + UP with v,w,zk V and U A, U , U vector subspaces of dimension n of V . If M N, then ∈ M N P k UM I UN and dim UM = dim UN = n implies UM = UN by [Phi19, Th. 5.27(d)]. In the same way, N P implies UN = UP . But then UM = UP and M P , proving transitivity of . k k k (c): Let u,w V be linearly independent, U := u , W := w . Then U V and ∈ h{ }i h{ }i k W V , but U ∦ W (e.g., due to (a)). k 1 AFFINE SUBSPACES AND GEOMETRY 21 Caveat 1.37. The statement of Prop. 1.36(b) becomes false if n N0 is replaced by an infinite cardinality: In an adaptation of the proof of Prop. 1.36(c),∈ suppose V is a vector space over the field F , where the distinct vectors v1,v2,... are linearly independent, and define B := vi : i N , U := B v1 , W := B v2 , X := B . Then, clearly, U X and W{ X,∈ but}U ∦ W (e.g.,h \ { due}i to Prop.h 1.36(a)).\ { }i h i k k Proposition 1.38. Let V be a vector space over the field F . (a) If x,y V with x = y, then there exists a unique line l V (i.e. a unique affine subspace∈ l of V with6 dim l =1) such that x,y l. Moreover,⊆ this affine subspace is ∈ given by l = x + x y . (1.7) h{ − }i (b) If x,y,z V and there does not exist a line l V such that x,y,z l, then there exists a unique∈ plane p V (i.e. a unique affine⊆ subspace p of V with∈ dim p = 2) such that x,y,z p. Moreover,⊆ this affine subspace is given by ∈ p = x + y x,z x . (1.8) h{ − − }i (c) If v ,...,v V , n N, then aff v ,...,v = v + v v ,...,v v . 1 n ∈ ∈ { 1 n} 1 h{ 2 − 1 n − 1}i Proof. Exercise. Proposition 1.39. Let V,W be vector spaces over the field F and let M,N V be affine subspaces. ⊆ (a) If A (V,W ), then M I N implies A(M)I A(N), and M N implies A(M) A(N)∈. A k k (b) If v V , then T (M) M. ∈ v k Proof. (a): Let A (V,W ). Then M I N implies A(M)I A(N), since M N im- plies A(M) A(N∈) and A A(M),A(N) are affine subspaces of W due to Prop.⊆ 1.28(a). ⊆ Moreover, if M = v + UM , N = w + UN with v,w V and UM , UN vector subspaces of V , A = T L with x W and L (V,W ), then∈ A(M) = x + Lv + L(U ) and x ◦ ∈ ∈ L M A(N) = x + Lw + L(UN ), such that M N implies A(M) A(N), since UM UN implies L(U ) L(U ). k k ⊆ M ⊆ N (b) is immediate from T (M)= v + w + U for M = w + U with w V and U a vector v ∈ subspace of V . 2 DUALITY 22 2 Duality 2.1 Linear Forms and Dual Spaces If V is a vector space over the field F , then maps from V into F are often of particular interest and importance. Such maps are sometimes called functionals or forms. Here, we will mostly be concerned with linear forms: Let us briefly review some examples of linear forms that we already encountered in [Phi19]: Example 2.1. Let V be a vector space over the field F . (a) Let B be a basis of V . If cv : Bv F 0 , Bv V , are the corresponding v c b−→b \ { }v V⊆ b B coordinate maps (i.e. = b∈Bv v( ) for each ), then, for each , the projection onto the coordinate with respect to b, ∈ ∈ P cv(b) for b Bv, πb : V F, πb(v) := ∈ −→ 0 for b / B , ( ∈ v is a linear form (cf. [Phi19, Ex. 6.7(b)]). (b) Let I be a nonempty set, V := (I,F ) = F I (i.e. the vector space of functions from I into F ). Then, for each i FI, the projection onto the ith coordinate ∈ π : V F, π (f) := f(i), i −→ i is a linear form (cf. [Phi19, Ex. 6.7(c)]). (c) Let F := K, where, as in [Phi19], we write K if K may stand for R or C. Let V be the set of convergent sequences in K. Then A : V K,A(zn)n∈N := lim zn, −→ n→∞ is a linear form (cf. [Phi19, Ex. 6.7(e)(i)]). (d) Let a,b R, a b, I := [a,b], and let V := (I, K) be the set of all K-valued Riemann∈ integrable≤ functions on I. Then R J : V K, J(f) := f, −→ ZI is a linear form (cf. [Phi19, Ex. 6.7(e)(iii)]). Definition 2.2. Let V be a vector space over the field F . 2 DUALITY 23 (a) The functions from V into F (i.e. the elements of (V,F ) = F V ) are called func- tionals or forms on V . In particular, the elementsF of (V,F ) are called linear functionals or linear forms on V . L (b) The set V ′ := (V,F ) (2.1) L is called the (linear2) dual space (or just the dual) of V (in the literature, one often also finds the notation V ∗ instead of V ′). We already know from [Phi19, Th. 6.18] that V ′ constitutes a vector space over F . Corollary 2.3. Let V be a vector space over the field F . Then each linear form α : V F is uniquely determined by its values on a basis of V . More precisely, if B is a basis−→ of V , (λ ) is a family in F , and, for each v V , c : B F 0 , B V , b b∈B ∈ v v −→ \ { } v ⊆ is the corresponding coordinate map (i.e. v = cv(b) b for each v V ), then b∈Bv ∈ P α : V F, α(v)= α cv(b) b := cv(b) λb, (2.2) −→ ! bX∈Bv bX∈Bv is linear, and α˜ V ′ with ∈ α˜(b)= λb, b∈∀B implies α =α ˜. Proof. Corollary 2.3 constitutes a special case of [Phi19, Th. 6.6]. Corollary 2.4. Let V be a vector space over the field F and let B be a basis of V . Using Cor. 2.3, define linear forms α V ′ by letting b ∈ 1 for a = b, αb(a) := δba = (2.3) (b,a)∈∀B×B 0 for a = b. ( 6 Define B′ := α : b B . (2.4) b ∈ (a) Then B′ is linearly independent. 2In Functional Analysis, where the vector space V over K is endowed with the additional structure n ′ of a topology (e.g., V might be the normed space K ), one defines the (topological) dual Vtop of V (there usually also just denoted as V ′ or V ∗) to consist of all linear functionals on V that are also continuous with respect to the topology on V (cf. [Phi17b, Ex. 3.1]). Depending on the topology on ′ ′ ′ V , Vtop can be much smaller than V – Vtop tends to be much more useful in an analysis context. 2 DUALITY 24 (b) If V is finite-dimensional, dim V = n N, then B′ constitutes a basis for V ′ (in particular, dim V = dim V ′). In this case,∈ B′ is called the dual basis of B (and B the dual basis of B′). (c) If dim V = , then B′ ( V ′ and, in particular, B′ is not a basis of V ′ (in fact, in this case,∞ one has hdimi V ′ > dim V , see [Jac75, pp. 244-248]). Proof. Cor. 2.4(a),(b),(c) constitute special cases of the corresponding cases of [Phi19, Th. 6.19]. Definition 2.5. If V is a vector space over the field F with dim V = n N and ′ ∈ B := (b1,...,bn) is an ordered basis of V , then we call B := (α1,...,αn), where ′ αi V αi(bj)= δij , (2.5) i∈{1∀,...,n} ∈ ∧ the ordered dual basis of B (and B the ordered dual basis of B′) – according to Cor. 2.4(b), B′ is, indeed, an ordered basis of V ′. 2 Example 2.6. Consider V := R . If b1 := (1, 0), b2 := (1, 1), then B := (b1,b2) is ′ ′ an ordered basis of V . Then the ordered dual basis B = (α1,α2) of V consists of the ′ maps α1,α2 V with α1(b1) = α2(b2) = 1, α1(b2) = α2(b1) = 0, i.e. with, for each (v ,v ) V , ∈ 1 2 ∈ α (v ,v )= α (v v )b + v b = v v , 1 1 2 1 1 − 2 1 2 2 1 − 2 α (v ,v )= α (v v )b + v b = v . 2 1 2 2 1 − 2 1 2 2 2 Notation 2.7. Let V be a vector space over the field F , dim V = n N, with ordered ′ ∈ basis B = (b1,...,bn). Moreover, let B = (α1,...,αn) be the corresponding ordered dual basis of V ′. If one then denotes the coordinates of v V with respect to B as the column vector ∈ v1 . v = . , vn then one typically denotes the coordinates of γ V ′ with respect to B′ as the row vector ∈ γ = γ1 ... γn (this has the advantage that one then can express γ(v) as a matrix product, cf. Rem. 2.8(a) below). Remark 2.8. We remain in the situation of Not. 2.7 above. 2 DUALITY 25 (a) We obtain n n n n n n γ(v)= γkαk vlbl = γkvlαk(bl)= γkvlδkl ! ! Xk=1 Xl=1 Xl=1 Xk=1 Xl=1 Xk=1 n v1 . = γkvk = γ1 ... γn . . k=1 vn X (b) Let B˜ := (˜v ,..., v˜ ) be another ordered basis of V and (c ) GL (F ) such that V 1 n ji ∈ n n v˜i = cjivj. i∈{1∀,...,n} j=1 X ˜′ ˜ If BV := (˜α1,..., α˜n) denotes the ordered dual basis corresponding to BV and −1 (dji):=(cji) , then n n t α˜i = djiαj = dijαj, i∈{1∀,...,n} j=1 j=1 X X t where (dji) denotes the transpose of (dji), i.e. t t (dji) GLn(F ) with dji := dij : ∈ (j,i)∈{1,...,n∀}×{1,...,n} Indeed, for each k,l 1,...,n , we obtain ∈ { } n n n n n n dkjαj (˜vl)= dkjαj cilvi = dkjcilδji = dkjcjl = δkl. j=1 ! j=1 ! i=1 ! j=1 i=1 j=1 X X X X X X Proposition 2.9. Let V be a vector space over the field F . If U is a vector subspace of V and v V U, then there exists α V ′, satisfying ∈ \ ∈ α(v)=1 α(u)=0. ∧u∈ ∀U Proof. Let B be a basis of U. Then B := v ˙ B is linearly independent and, U v { } ∪ U according to [Phi19, Th. 5.23(a)], there exists a basis B of V such that Bv B. According to Cor. 2.3, ⊆ 1 for b = v, α : V F, α(b) := −→ b∈∀B 0 for b = v, ( 6 defines an element of V ′, which, clearly, satisfies the required conditions. 2 DUALITY 26 Definition 2.10. Let V be a vector space over the field F . (a) The map , : V V ′ F, v,α := α(v), (2.6) h· ·i × −→ h i is called the dual pairing corresponding to V . (b) The dual of V ′ is called the bidual or the second dual of V . One writes V ′′ := (V ′)′. (c) The map Φ: V V ′′, v Φv, (Φv): V ′ F, (Φv)(α) := α(v), (2.7) −→ 7→ −→ is called the canonical embedding of V into V ′′ (cf. Th. 2.11 below). Theorem 2.11. Let V be a vector space over the field F . (a) The canonical embedding Φ: V V ′′ of (2.7) is a linear monomorphism (i.e. a −→ linear isomorphism Φ: V = Im Φ V ′′). ∼ ⊆ ′′ (b) If dim V = n N, then Φ is a linear isomorphism Φ : V ∼= V (in fact, the converse is also∈ true, i.e., if Φ is an isomorphism, then dim V < , cf. the remark in Cor. 2.4(c)). ∞ Proof. (a): Exercise. (b): According to Cor. 2.4(b), n = dim V = dim V ′ = dim V ′′. Thus, by [Phi19, Th. 6.10], the linear monomorphism Φ is also an epimorphism. Corollary 2.12. Let V be a vector space over the field F , dim V = n N. If B′ = α ,...,α is a basis of V ′, then there exists a basis B of V such that ∈B and B′ are { 1 n} dual. Proof. According to Th. 2.11(b), the canonical embedding Φ : V V ′′ of (2.7) ′′ −→′′ constitutes a linear isomorphism. Let B = f1,...,fn be the basis of V that is dual ′ −{1 } to B and, for each i 1,...,n , bi := Φ (fi). Then, as Φ is a linear isomorphism, B := b ,...,b is a∈ basis { of V .} Moreover, B and B′ are dual: { 1 n} αi(bj)=(Φbj)(αi)= fj(αi)= δij, i,j∈{∀1,...,n} where we used that B′ and B′′ are dual. 2 DUALITY 27 2.2 Annihilators Definition 2.13. Let V be a vector space over the field F , M V , S V ′. Moreover, let Φ : V V ′′ denote the canonical embedding of (2.7). Then⊆ ⊆ −→ V ′ for M = , M ⊥ := α V ′ : α(v)=0 = ∅ ∈ v∈∀M ker(Φv) for M = ( v∈M 6 ∅ is called the (forward) annihilator of M in V ′, T V for S = , S⊤ := v V : α(v)=0 = ∅ ∈ α∀∈S ker α for S = ( α∈S 6 ∅ is called the (backward) annihilator of S in V . InT view of Rem. 2.15 and Ex. 2.16(b) below, one also calls v V and α V ′ such that ∈ ∈ (2.6) α(v) = v,α =0 h i perpendicular or orthogonal and, in consequence, sets M ⊥ and S⊤ are sometimes called M perp and S perp, respectively. Lemma 2.14. Let V be a vector space over the field F , M V , S V ′. Then M ⊥ is a subspace of V ′ and S⊤ is a subspace of V . Moreover, ⊆ ⊆ M ⊥ = M ⊥, S⊤ = S ⊤. (2.8) h i h i Proof. Since M ⊥ and S⊤ are both intersections of kernels of linear maps, they are subspaces, since kernels are subspaces by [Phi19, Prop. 6.3(c)] and intersections of sub- spaces are subspaces by [Phi19, Th. 5.7(a)]. Moreover, it is immediate from Def. 2.13 that M ⊥ M ⊥ and S⊤ S ⊤. On the other hand, consider α M ⊥ and v S⊤. Let λ ,...,λ⊇ h iF , n N.⊇ If hv ,...,vi M, then ∈ ∈ 1 n ∈ ∈ 1 n ∈ n n α∈M ⊥ α λivi = λiα(vi) = 0, i=1 ! i=1 X X showing α M ⊥ and M ⊥ M ⊥. Analogously, if α ,...,α S, then ∈h i ⊆h i 1 n ∈ n n v∈S⊤ λiαi (v)= λiαi(v) = 0, i=1 ! i=1 X X showing v S ⊤ and S⊤ S ⊤. ∈h i ⊆h i 2 DUALITY 28 Remark 2.15. On real vector spaces V , one can study so-called inner products (also called scalar products), , : V V R, (v,w) v,w R, which, as part of their definition, haveh· the·i requirement× −→ of being bilinear7→ h formsi, ∈ i.e., for each v V , v, : V R is a linear form and, for each w V , ,w : V R is a linear∈ form (weh ·i will come−→ back to vector spaces with inner products∈ h· againi Sec.−→ 10 below). One then calls vectors v,w V perpendicular or orthogonal with respect to , if, and only if, v,w = 0 so that∈ the notions of Def. 2.13 can be seen as generalizingh· ·i orthogonality h i with respect to inner products (also cf. Ex. 2.16(b) below). Example 2.16. (a) Let V be a vector space over the field F and let U be a subspace of V with BU being a basis of U. Then, according to [Phi19, Th. 5.23(a)], there exists a basis B of V such that B B. Then Cor. 2.3 implies U ⊆ α U ⊥ α(b)=0 . ′ α∈∀V ∈ ⇔b∈ ∀BU (b) Consider the real vector space R2 and let , : R2 R2 R, (v ,v ), (w ,w ) := v w + v w , h· ·i × −→ h 1 2 1 2 i 1 1 2 2 denote the so-called Euclidean inner product on R2. Then, clearly, for each w = (w ,w ) R2, 1 2 ∈ α : R2 R, α (v) := v,w = v w + v w , w −→ w h i 1 1 2 2 2 defines a linear form on R . Let v := (1, 2). Then the span of v, i.e. lv := (λ, 2λ): λ R , represents the line through v. Moreover, for each w =(w ,w ) {R2, ∈ } 1 2 ∈ α v ⊥ = l⊥ α (v)= w +2w =0 w = 2w w ∈ { } v ⇔ w 1 2 ⇔ 1 − 2 w l := ( 2λ,λ): λ R . ⇔ ∈ ⊥ { − ∈ } Thus, l is spanned by ( 2, 1) and we see that l⊥ consists precisely of the linear ⊥ − v forms αw that are given by vectors w that are perpendicular to v in the Euclidean geometrical sense (i.e. in the sense usually taught in high school geometry). — The following notions defined for linear forms in connection with subspaces can some- times be useful when studying annihilators: Definition 2.17. Let V be a vector space over the field F and let U be a subspace of V . Then R : V ′ U ′, Rf := f ↾ , −→ U 2 DUALITY 29 is called the restriction operator from V to U; I : (V/U)′ V ′, (Ig)(v) := g(v + U), −→ is called the inflation operator from V/U to V . Theorem 2.18. Let V be a vector space over the field F and let U be a subspace of V with the restriction operator R and the inflation operator I defined as in Def. 2.17. (a) R : V ′ U ′ is a linear epimorphism with ker R = U ⊥. Moreover, −→ dim U ⊥ + dim U ′ = dim V ′ (2.9) and ′ ′ ⊥ U ∼= V /U . (2.10) (see [Phi19, Th. 6.8(a)] for the precise meaning of (2.9) in case at least one of the occurring cardinalities is infinite). If dim V = n N, then one also has ∈ n = dim V = dim U ⊥ + dim U. (2.11) ′ ⊥ (b) I is a linear isomorphism I : (V/U) ∼= U . Proof. (a): Let α, β V ′ and λ,µ F . Then, for each u U, ∈ ∈ ∈ R(λα + µβ)(u)= λα(u)+ µβ(u)= λ(Rα)(u)+ µ(Rβ)(u)= λ(Rα)+ µ(Rβ) (u), showing R to be linear. Moreover, for each α V ′, one has ∈ α ker R α(u)=0 α U ⊥, ∈ ⇔u∈ ∀U ⇔ ∈ ⊥ proving ker R = U . Let BU be a basis of U. Then, according to [Phi19, Th. 5.23(a)], ′ there exists C V such that BU ˙ C is a basis of V . Consider α U . Using Cor. 2.3, define β V ′ by⊆ setting ∪ ∈ ∈ α(b) for b B , β(b) := ∈ U 0 for b C. ( ∈ Then, clearly, Rβ = α, showing R to be surjective. Thus, we have [Phi19, Th. 6.8(a)] dim V ′ = dimker R + dimIm R = dim U ⊥ + dim U ′, thereby proving (2.9). Next, applying the isomorphism theorem of [Phi19, Th. 6.16(a)] yields ′ ′ ′ ⊥ U = Im R ∼= V / ker R = V /U , 2 DUALITY 30 which is (2.10). Finally, if dim V = n N, then ∈ Cor. 2.4(b) (2.9) Cor. 2.4(b) n = dim V = dim V ′ = dim U ⊥ + dim U ′ = dim U ⊥ + dim U, proving (2.11). (b): Exercise. Theorem 2.19. Let V be a vector space over the field F . (a) If U is a subspace of V , then (U ⊥)⊤ = U. (b) If S is a subspace of V ′, S (S⊤)⊥. If dim V = n N, then one even has (S⊤)⊥ = S. ⊆ ∈ (c) If U1, U2 are subspaces of V , then (U + U )⊥ = U ⊥ U ⊥, (U U )⊥ = U ⊥ + U ⊥. 1 2 1 ∩ 2 1 ∩ 2 1 2 ′ (d) If S1,S2 are subspaces of V , then (S + S )⊤ = S⊤ S⊤, (S S )⊤ S⊤ + S⊤. 1 2 1 ∩ 2 1 ∩ 2 ⊇ 1 2 If dim V = n N, then one also has ∈ (S S )⊤ = S⊤ + S⊤. 1 ∩ 2 1 2 Proof. (a): Exercise. (b): According to Def. 2.13, we have (S⊤)⊥ := α V ′ : α(v)=0 , ∈ v∈∀S⊤ showing S (S⊤)⊥. Now assume dim V = n N and suppose there exists α (S⊤)⊥ S. Then, according⊆ to Prop. 2.9, there exists f ∈ V ′′, satisfying ∈ \ ∈ f(α)=1 f(β)=0. ∧β ∀∈S Since dim V = n N, we may employ Th. 2.11(b) to conclude that the canonical embedding Φ : V ∈ V ′′ is a linear isomorphism, in particular, surjective. Thus, there exists v V such−→ that f = Φv, i.e. f(γ) = γ(v) for each γ V ′. Since f S⊥, we have β(v∈) = f(β) = 0 for each β S, showing v S⊤. Thus,∈ α (S⊤)⊥ implies∈ the ∈ ∈ ∈ contradiction 0 = α(v)= f(α) = 1. In consequence, (S⊤)⊥ S = , proving (b). \ ∅ 2 DUALITY 31 ⊥ ⊥ (c): Let α (U1 + U2) . Then U1 U1 + U2 implies α U1 , U2 U1 + U2 implies ⊥ ∈ ⊥ ⊥ ⊆ ⊥ ⊥ ⊥∈ ⊆ ⊥ ⊥ α U2 , showing α U1 U2 and (U1 + U2) U1 U2 . Conversely, if α U1 U2 ∈ ∈ ∩ ⊆ ∩ ∈ ∩ ⊥ and u1 U1, u2 U2, then α(u1 + u2) = α(u1)+ α(u2) = 0, showing α (U1 + U2) ⊥∈ ⊥ ∈ ⊥ ∈ ⊥ ⊥ and U1 U2 (U1 + U2) . To prove the second equality in (c), first, let α U1 + U2 , ∩ ⊆ ⊥ ⊥ ∈ i.e. α = α1 + α2 with α1 U1 , α2 U2 . Then, if v U1 U2, one has α(v) = ∈ ∈ ⊥ ⊥ ⊥∈ ∩ ⊥ α1(v)+ α2(v) = 0, showing α (U1 U2) and U1 + U2 (U1 U2) . Conversely, let α (U U )⊥. We now proceed∈ ∩ similar to the proof of⊆ [Phi19,∩ Th. 5.30(c)]: We ∈ 1 ∩ 2 choose bases B , B , B of U U , U , and U , respectively, such that B B and ∩ U1 U2 1 ∩ 2 1 2 ∩ ⊆ U1 B∩ BU2 , defining B1 := BU1 B∩, B2 := BU2 B∩. Then it was shown in the proof of [Phi19,⊆ Th. 5.30(c)] that \ \ B := B B = B ˙ B ˙ B + U1 ∪ U2 1 ∪ 2 ∪ ∩ is a basis of U1 + U2 and we may choose C V such that B := B+ ˙ C is a basis of V . Using Cor. 2.3, we now define α ,α V ′ by⊆ setting ∪ 1 2 ∈ α(b) for b B B1, α(b) for b B1, α1(b) := ∈ \ α2(b) := ∈ b∈∀B 0 for b B , 0 for b B B . ( ∈ 1 ( ∈ \ 1 Since α↾ 0, we obtain α ↾ = α ↾ ˙ 0 and α ↾ = α ↾ ˙ 0, showing B∩ 1 BU1 1 B1 ∪ B∩ 2 BU2 2 B2 ∪ B∩ ⊥ ≡ ⊥ ≡ ≡ α1 U1 and α2 U2 . On the other hand, we have, for each b B, α(b)= α1(b)+α2(b), ∈ ∈ ⊥ ⊥ ⊥ ⊥ ∈ ⊥ showing α = α1 + α2, α U1 + U2 , and (U1 U2) U1 + U2 , thereby completing the proof of (c). ∈ ∩ ⊆ (d): Exercise. 2.3 Hyperplanes and Linear Systems In the present section, we combine duality with the theory of affine spaces of Sec. 1 and with the theory of linear systems of [Phi19, Sec. 8]. Definition 2.20. Let V be a vector space over the field F . If α V ′ 0 and r F , then the set ∈ \ { } ∈ H := α−1 r = v V : α(v)= r V α,r { } { ∈ }⊆ is called a hyperplane in V . Notation 2.21. Let V be a vector space over the field F , v V , and α V ′. We then write ∈ ∈ v⊥ := v ⊥, α⊤ := α ⊤. { } { } Theorem 2.22. Let V be a vector space over the field F . 2 DUALITY 32 (a) Each hyperplane H in V is an affine subspace of V , where dim V = 1+dim H, i.e. dim V = dim H if V is infinite-dimensional, and dim H = n 1 if dim V = n N. More precisely, if 0 = α V ′ and r F , then − ∈ 6 ∈ ∈ w ⊤ w Hα,r = r + α = r + ker α. (2.12) w∈V : ∀α(w)6=0 α(w) α(w) (b) If dim V = n N and M is an affine subspace of V with dim M = n 1, then M is a hyperplane∈ in V , i.e. there exist 0 = α V ′ and r F such that M− = H . 6 ∈ ∈ α,r (c) Let α, β V ′ 0 and r, s F . Then ∈ \ { } ∈ Hα,r = Hβ,s β = λα s = λr . ⇔06= ∃λ∈F ∧ Moreover, if α = β, then Hα,r and Hβ,s are parallel. Proof. (a): Let 0 = α V ′ and r F with w V such that α(w) = 0. Define w 6 ∈ ∈ −1 ∈ 6 v := r α(w) . Then α(v) = r, i.e. v Hα,r = α r . Thus, by [Phi19, Th. 4.20(f)], we have ∈ { } H = v + ker α = v + x V : α(x)=0 = v + α⊤, α,r { ∈ } proving (2.12). In particular, we have dim Hα,r = dimker α and, by [Phi19, Th. 6.8(a)], dim V = dimker α + dimIm α = dim Hα,r + dim F = dim Hα,r +1, thereby completing the proof of (a). (b),(c): Exercise. Proposition 2.23. Let V be a vector space over the field F , dim V = n N. If M V ∈ ⊆ is an affine subspace of V with dim M = m N0, m ri := αi(v). i∈{1,...,n∀ −m} We claim M = N := n−m H : Indeed, if x = v + u with u U, then i=1 αi,ri ∈ ⊥ T αi∈U αi(x)= αi(v)+ αi(u) = ri +0= ri, i∈{1,...,n∀ −m} 2 DUALITY 33 showing x N and M N. Conversely, let x N. Then ∈ ⊆ ∈ ⊤ αi(x v)= ri ri =0, i.e. x v αi , i∈{1,...,n∀ −m} − − − ∈ implying Th. 2.19(a) x v α ,...,α ⊤ =(U ⊥)⊤ = U, − ∈ h{ 1 n−m}i showing x v + U = M and N M as claimed. ∈ ⊆ Example 2.24. Let F be a field. As in [Phi19, Sec. 8.1], consider the linear system n ajk xk = bj, (2.13) j∈{1∀,...,m} Xk=1 where m,n N; b ,...,b F and the a F , j 1,...,m , i 1,...,n . ∈ 1 m ∈ ji ∈ ∈ { } ∈ { } We know that we can also write (2.13) in matrix form as Ax = b with A = (aji) t m ∈ (m,n,F ), m,n N, and b =(b1,...,bm) (m, 1,F ) ∼= F . The set of solutions toM (2.13) is ∈ ∈M (A b)= x F n : Ax = b . L | { ∈ } If we now define the linear forms v1 v1 n n . . αj : F F, αj . := aj1 ... ajn . = ajk vk, j∈{1∀,...,m} −→ k=1 vn vn X then we can rewrite (2.13) as x1 m . x = . Hαj ,bj x Hαj ,bj . (2.14) j∈{1∀,...,m} ∈ ⇔ ∈ j=1 xn \ m Thus, we have (A b)= j=1 Hαj ,bj and we can view (2.14) as a geometric interpretation of (2.13), namelyL that| the solution vectors x are required to lie in the intersection of the T m hyperplanes Hα1,b1 ,...,Hαm,bm . Even though we know from [Phi19, Th. 8.15] that the elementary row operations of [Phi19, Def. 8.13] do not change the set of solutions (A b), it might be instructive to reexamine this fact in terms of linear forms and hyperplanes:L | The elementary row operation of row switching merely corresponds to changing the order of the H in the intersection yielding (A b). The elementary row operation of row αj ,bj L | multiplication rj λrj (0 = λ F ) does not change (A b) due to Hαj ,bj = Hλαj ,λbj according to Th.7→ 2.22(c). The6 elementary∈ row operationL of|row addition r r + λr j 7→ j i 2 DUALITY 34 (λ F , i = j) replaces Hαj ,bj by Hαj +λαi,bj +λbi . We verify, once again, what we already know∈ form6 [Phi19, Th. 8.15], namely m m (A b)= Hα ,b = M := Hα ,b Hα +λα ,b +λb : L | k k k k ∩ j i j i k=1 k=1, \ k\6=j If x (A b), then (α + λα )(x) = b + λb , showing x H and x M. ∈ L | j i j i ∈ αj +λαi,bj +λbi ∈ Conversely, if x M, then αj(x)=(αj +λαi)(x) λαi(x)= bj +λbi λbi = bj, showing x H and x∈ (A b). − − ∈ αj ,bj ∈L | 2.4 Dual Maps Theorem 2.25. Let V,W be vector spaces over the field F . If A (V,W ), then there exists a unique map A′ : W ′ V ′ such that (using the notation∈L of (2.6)) −→ (A′β)(v)= v, A′β = Av, β = β(Av). (2.15) v∈∀V β∈∀W ′ h i h i Moreover, this map turns out to be linear, i.e. A′ (W ′,V ′). ∈L Proof. Clearly, given A (V,W ), (2.15) uniquely defines a map A′ : W ′ V ′ (for each β W ′, (2.15) defines∈ L the map (A′β)= β A (V,F )= V ′). It merely−→ remains to check∈ that A′ is linear. To this end, let β, β ,◦ β ∈LW ′, λ F , and v V . Then 1 2 ∈ ∈ ∈ ′ ′ ′ A (β1 + β2) (v)=(β1 + β2)(Av)= β1(Av)+ β2(Av)=(A β1)(v)+(A β1)(v) ′ ′ =(A β1 + A β2)(v), A′(λβ) (v)=(λβ)(Av)= λ(A′β)(v),