Lecture Notes in Algebraic Number Theory
Lectures by Dr. Sheng-Chi Liu
Throughout these notes, signifies end proof, and N signifies end of example.
Table of Contents
Table of Contents i
Lecture 1 Review 1 1.1 Field Extensions ...... 1
Lecture 2 Ring of Integers 4 2.1 Understanding Algebraic Integers ...... 4 2.2 The Cyclotomic Fields ...... 6 2.3 Embeddings in C ...... 7 Lecture 3 Traces, Norms, and Discriminants 8 3.1 Traces and Norms ...... 8 3.2 Relative Trace and Norm ...... 10 3.3 Discriminants ...... 11
Lecture 4 The Additive Structure of the Ring of Integers 12 4.1 More on Discriminants ...... 12 4.2 The Additive Structure of the Ring of Integers ...... 14
Lecture 5 Integral Bases 16 5.1 Integral Bases ...... 16 5.2 Composite Field ...... 18
Lecture 6 Composition Fields 19 6.1 Cyclotomic Fields again ...... 19 6.2 Prime Decomposition in Rings of Integers ...... 21
Lecture 7 Ideal Factorisation 22 7.1 Unique Factorisation of Ideals ...... 22
Lecture 8 Ramification 26 8.1 Ramification Index ...... 26 Notes by Jakob Streipel. Last updated June 13, 2021.
i TABLE OF CONTENTS ii
Lecture 9 Ramification Index continued 30 9.1 Proof, continued ...... 30
Lecture 10 Ramified Primes 32 10.1 When Do Primes Split ...... 32
Lecture 11 The Ideal Class Group 35 11.1 When are Primes Ramified in Cyclotomic Fields ...... 35 11.2 The Ideal Class Group and Unit Group ...... 36
Lecture 12 Minkowski’s Theorem 38 12.1 Using Geometry to Improve λ ...... 38
Lecture 13 Toward Minkowski’s Theorem 42 13.1 Proving Minkowski’s Theorem ...... 42 13.2 The Dirichlet Unit Theorem ...... 45
Lecture 14 Dirichlet Unit Theorem 46 14.1 Dirichlet Unit Theorem ...... 46 14.2 Fermat’s Last Theorem ...... 48
Lecture 15 Kummer’s Theorem 49 15.1 Using the Dirichlet Unit Theorem ...... 49 15.2 Kummer’s Theorem ...... 49
Lecture 16 Kummer’s Theorem, continued 51 16.1 Proof continued ...... 51
Lecture 17 Local Fields 53 17.1 Valuations ...... 53
Lecture 18 Ostrowski’s Theorem 56 18.1 Ostrowski’s Theorem ...... 56 18.2 Completions ...... 57
Lecture 19 Ostrowski’s Theorem continued 58 19.1 Proof of Ostrowki’s Theorem ...... 58
Lecture 20 Local Fields 62 20.1 Local Fields ...... 62 20.2 p-adic Numbers ...... 63
Lecture 21 Hensel’s Lemma 65 21.1 Generalisation of p-adic Numbers ...... 65 21.2 Topology of Local Fields ...... 66 21.3 Hensel’s Lemma ...... 67
Lecture 22 Hensel’s Lemma revisited 69 22.1 Second Form of Hensel’s Lemma ...... 69 22.2 Extension of Valuations ...... 71
Lecture 23 Krasner’s Lemma 72 TABLE OF CONTENTS iii
23.1 Proof continued ...... 72 23.2 Krasner’s Lemma ...... 74
Lecture 24 Eisenstein Extensions 76 24.1 Proof continued ...... 76 24.2 Eisenstein Extension ...... 77
Lecture 25 Totally Ramified Extensions 79 25.1 Proof of Lemma ...... 79 25.2 Unramified Extensions ...... 81
Lecture 26 Unramified Extensions 81 26.1 Classifying Unramified Extensions ...... 81
Lecture 27 Unramified Extensions 84 27.1 Tamely and Wildly Ramified Extensions ...... 84 27.2 Decomposition Group and Intertia Group ...... 87
Lecture 28 Decomposition and Inertia Group 88 28.1 Decomposition and Inertia ...... 88
Lecture 29 More on Ramification of Extensions 91 29.1 Splitting of Primes ...... 91
Lecture 30 The Different Ideal 94 30.1 Duals ...... 94
Index 97 REVIEW 1
Lecture 1 Review
Algebraic number theory is fundamentally the study of finite extensions of the rational numbers Q, called number fields. To this end we will first review some of the theory of field extensions.
1.1 Field Extensions Definition 1.1.1 (Finite field extension). A finite field extension of a field K is a field L ⊃ K such that dimK L is finite (as a vector space over K). The degree of the extension L/K is defined to be deg(L/K) := dimK L. Fact. If we have a tower of finite field extensions, say K ⊂ L ⊂ F , then deg(F/K) = deg(F/L) · deg(L/K). The proof of this boils down to considering a basis of either extension and expressing one with the help of the other, and the amount of elements in the basis between F and K will be the product of the number of basis elements in either intermediate extensions.
Definition 1.1.2 (Algebraic element). Let L/K be a field extension, and let α ∈ L. Then α is called algebraic over K if it satisfies f(α) = 0 for some f ∈ K[x]. Fact. Any element of a finite extension of K is algebraic over K.
To see this, consider an extension L ⊃ K of degree n and let α ∈ L. Then 1, α, α2, . . . , αn are linearly dependent in K, since there are more than n of them, and therefore there is a nontrivial linear combination over K of them that adds to 0, which in turn gives us a polynomial in K[x] with α as a root. Fact. An algebraic element α ∈ L has a minimal polynomial over K, i.e. there exists a unique pα(x) ∈ K[x] such that if f(x) ∈ K[x] with f(α) = 0, then pα(x) | f(x), with the uniqueness being up to multiplication by an element in K. Equivalently, therefore, there is a unique monic polynomial of least degree with α as its root.
Fact. For an algebraic element α ∈ L with minimal polynomial pα(x), we have the isomorphism K[x] K(α) =∼ , hpα(x)i with the isomorphism being the map that evaluates x in α. Moreover
deg(K(α)/K) = deg(pα(x)) = n and 1, α, α2, . . . , αn−1 is a bases for K(α) over K. Thus K(α) = K[α], and in the latter αn and higher degrees can be reduced to lower degrees.
Definition 1.1.3 (Number field). An (algebraic) number field K (⊂ C) is a finite extension of Q. Date: January 9th, 2018. 1.1 Field Extensions 2
Theorem 1.1.4 (Primitive element theorem). Let K be a number field. Then K = Q(α) = Q[α] for some α ∈ K. Two famous examples of number fields are the following: √ Example 1.1.5 (Quadratic fields). Consider K = Q( D), where D is a square-√ free integer (otherwise, simply pull out the square part).√ If D > 0, then Q( D) is called a real quadratic field, and if D < 0 then Q( D) is an imaginary quadratic field. √ These extensions are of degree 2, since the minimal polynomial of D is x2 − D, and moreover they are Galois extensions since they are both separable and normal. N
Example 1.1.6 (Cyclotomic fields). Consider K = Q(ξm), where ξm is a prim- m n itive mth root of unity, meaning that ξm = 1 and ξm 6= 1 for 0 < n < m. 2πi/m For instance, and by convention what we’ll always use, ξm = e . The other k primitive mth roots of unity are ξm where gcd(m, k) = 1, so deg(K/Q) = ϕ(m), where by ϕ we mean Euler’s totient function. Note that if m = 1, ξ1 = 1 and if m = 2 we have ξ2 = −1, so K = Q in both cases, making neither of these interesting. Hence when discussing cyclotomic 2πi/3 fields we are interested in m ≥ 3, since when m = 3 we get ξ3 = e yielding K = Q(ξ3) which is an extension of degree ϕ(3) = 2. 4 2 2 Now if m = 6, then ξ6 = −ξ6 = −(ξ6 ) , by straight forward computation, 2 2 meaning that Q(ξ6 ) = Q(ξ6), but note that ξ6 = ξ3, so in fact Q(ξ3) = Q(ξ6). N This last idea generalises:
Proposition 1.1.7. For odd m, Q(ξm) = Q(ξ2m). 2πi/m 2πi/(2m) 2 Proof. Take ξm = e and likewise ξ2m = e . Then clearly ξ2m = ξm, 2 so Q(ξm) = Q(ξ2m) ⊂ Q(ξ2m). m πi m+1 On the other hand, ξ2m = e = −1, meaning that ξ2m = −ξ2m. Since m+1 2 m is odd, m + 1 is even, and therefore ξ2m = −ξ2m ∈ Q(ξ2m), implying that 2 Q(ξ2m) ⊂ Q(ξ2m) = Q(ξm).
Remark 1.1.8. We will show that for even m, all Q(ξm) are distinct. Note also ∼ × that Gal(Q(ξm)/Q) = (Z/mZ) . Definition 1.1.9 (Algebraic integer). An element α ∈ C is called an algebraic integer if it is a root of a monic polynomial f(x) ∈ Z[x]. √ Example 1.1.10. Many elements are algebraic integers. For instance, 2 is, 2 since its minimal polynomial is x −2, which√ is monic. Similarly 5 is an algebraic integer, since it is the root of x − 5, and D is the root of x2 − D. m The primitive roots of unity are too, since ξm is a root of f(x) = x − 1. N This notion is equivalent with the notion from commutative algebra of α being integral over Z. Note also that since there are only countably many polynomials over Z, the fundamental theorem of algebra implies that there are only countably many algebraic integers.
Theorem 1.1.11. The following are equivalent for α ∈ C: 1.1 Field Extensions 3
(i) α is an algebraic integer.
(ii) Z[α] is a finitely generated Z-module (i.e. Abelian group). (iii) There exists a subring of C containing α that is finitely generated. (iv) There exists a finitely generated nonzero Z-module M ⊂ C such that αM ⊂ M. Proof. That (i) implies (ii) is clear: since α is an algebraic integer we have an equation n n−1 α + cn−1α + ... + c1α + c0 = 0 n for some ci ∈ Z. Rearranging this for α we have
n n−1 α = −(cn−1α + ... + c1α + c0) meaning that Z[α] is finitely generated by 1, α, α2, . . . , αn−1. (ii) trivially implies (iii) by just taking Z[α] itself, and in the same way (iii) implies (iv) using the subring from (iii) itself. Finally (iv) implies (i) by letting x1, x2, . . . , xn be generators of M, and since αM ⊂ M, we have
αxi = ai1x1 + ai2x2 + ... + ainxn for all i = 1, 2, . . . , n, with aij ∈ Z. As matrix equations we then have αx1 a11 a12 ··· a1n x1 αx2 a21 a22 ··· a2n x2 = , . . . .. . . . . . . . . αxn an1 an2 ··· ann xn which rearranged and calling the coefficient matrix A and the xi-vector x, that (αIn − A)x = 0, where In is the identity matrix. Now since x1, x2, . . . , xn generate a nontrivial Z-module, x 6= 0, so (αIn − A) has linearly dependent columns, making its determinant 0, so
n 0 = det(αIn − A) = α + lower order terms in α.
Corollary 1.1.12. Let A be the set of algebraic integers in C. Then A is a ring. Proof. We need to show that α + β and αβ are both in A if α and β are in A. This follows almost trivially from the above theorem, however: we have that Z[α] and Z[β] are finitely generated Z-modules, and therefore Z[α, β] is finitely generated as a Z-module as well. This latter ring contains α + β and αβ, and therefore they are algebraic integers.
Definition 1.1.13. Let K be a number field. The ring of integers OK of K is the set of algebraic integers in K, i.e. OK = A ∩ K. By the above corollary, it is a subring of K. Note also that in the language of commutative algebra, this makes OK the integral closure of Z in K, meaning in particular that OK is integrally closed. RING OF INTEGERS 4
Theorem 1.1.14. Let K be a number field, and let α ∈ K. Then α can be written as α = β/d, where β ∈ OK and d ∈ Z. Hence K is the quotient field of OK .
Proof. Since α ∈ K is algebraic over Q we have
n n−1 α + cn−1α + ... + c1α + c0 = 0 with ci ∈ Q. We can write all of the fractions ci with common denominator, say ci = bi/d, with bi, d ∈ Z, and d 6= 0. Hence b b b αn + n−1 αn−1 + ... + i α + 0 = 0. d d d Multiplying this by dn we get
n n−1 n−2 n−1 (dα) + bn−1(dα) + ... + b1d (dα) + b0d = 0, which we can interpret as a monic polynomial in dα, meaning that dα = β ∈ OK , and so α = β/d.
Lecture 2 Ring of Integers
2.1 Understanding Algebraic Integers
Recall that α is an algebraic integer, or integral over Z, if f(α) for some monic f(x) ∈ Z[x]. The question with which we will concern√ ourselves for the foreseeable future is this: what is OK if K is, say, Q, Q( D), or Q(ξm). The answers, respectively, are Z, depends on whether D ≡ 1 (mod 4), and Z[ξm]. We will spend some time proving these claims.
Theorem 2.1.1. Let α be an algebraic integer. Let f(x) ∈ Z[x] be a monic polynomial of least degree such that f(α) = 0. (Note that this implies that f is irreducible over Z.) Then f(x) is irreducible over Q. We will prove this in a moment, but first note the following straight-forward corollary: Corollary 2.1.2. The number α is an algebraic integer if and only if the monic minimal polynomial of α over Q has coefficients in Z. Proof. The latter implying the former is true by definition, and the former implying the latter is the above theorem. We will use the following result to prove the theorem:
Lemma 2.1.3 (Gauss Lemma). Let f(x) ∈ Z[x] be a monic polynomial. Sup- pose f(x) = g(x)h(x) with g(x), h(x) ∈ Q[x] monic. Then g(x), h(x) ∈ Z[x]. More generally, we can replace Z by any unique factorisation domain and Q by its field of fractions. Date: January 11th, 2018. 2.1 Understanding Algebraic Integers 5
Proof. Let m and n be the smallest positive integers such that mg(x) and nh(x) have coefficients in Z. Hence the coefficients of mg(x) have no common factors, and the same is true for the coefficients of nh(x). We claim that m = n = 1. Suppose not, i.e. mn > 1. Take a prime p such that p | mn, and consider mnf(x) = (mg(x))(nh(x)). Reducing this modulo p we get 0 = mg(x)·nh(x) ∈ Z/pZ[x]. Now Z/pZ is a field, which in particular makes it an integral domain, so the polynomial ring is an integral domain too. Hence mg(x) = 0 or nh(x) = 0. Therefore p divides all coefficients of mg(x), or p divides all coefficients of nh(x), which is a contradiction since we chose m and n minimal in order to make sure the coefficients had no common factors. Using this we are equipped to prove the theorem:
Proof of 2.1.1. Suppose f(x) is reducible over Q, i.e. f(x) = g(x)h(x), where g(x), h(x) ∈ Q[x]. Since f(x) is monic and since Q is a field, we can assume without loss of generality that g and h are monic too. Gauss Lemma therefore implies that g(x), h(x) ∈ Z[x]. Moreover deg g, deg h < deg f, but since 0 = f(α) = g(α)h(α) implies that g(α) = 0 or h(α) = 0, we have a contradiction, for f was chosen to be of minimal degree with α as a zero.
Corollary 2.1.4. The only algebraic integers in Q is Z. Proof. The minimal monic polynomial of α ∈ Q over Q is f(x) = x − α, nat- urally. Now α is an algebraic integer if and only if f(x) ∈ Z[x], if and only if α ∈ Z. We are now ready to find the ring of integers for quadratic integers. √ Corollary 2.1.5. Consider K = Q( D), with D being square-free. Then ( √ √ [ D] = { a + b D | a, b ∈ }, if D 6≡ 1 (mod 4), Z √ √ Z OK = 1+ D 1+ D Z[ 2 ] = { a + b 2 | a, b ∈ Z }, if D ≡ 1 (mod 4). Proof. Let ( √ √ [ D] = { a + b D | a, b ∈ }, if D 6≡ 1 (mod 4), Z √ √ Z R = 1+ D 1+ D Z[ 2 ] = { a + b 2 | a, b ∈ Z }, if D ≡ 1 (mod 4). √ 2 First we√ claim that OK ⊃ R. The minimal√ polynomial of D is f(x) = x −D ∈ Z[x], so D ∈ OK , implying that Z[ D] ⊂ OK . (Regardless of whether D is congruent to 1 modulo 4 or not, as it happens). If D ≡ 1 (mod 4), then √ √ 1 + D 1 − D D − 1 f(x) = x − x − = x2 − x + ∈ [x] 2 2 4 Z √ √ 1− D since D−1 is a multiple of 4, implying that (1+ D)/2 ∈ OK , and so Z[ 2 ] ⊂ OK . √ Secondly we show that R ⊃ OK . Suppose α = s+t D ∈ OK , with s, t ∈ Q. 2.2 The Cyclotomic Fields 6
If t = 0, α = s ∈ Q ∩ OK = Z implying, by the previous theorem, that α ∈ Z ⊂ R. Hence assume t 6= 0. Then α ∈ OK if and only if the minimal polynomial of α over Q, namely √ √ 2 2 2 pα(x) = (x − (s + t D))(x − (s − t D)) = x + 2sx + s − t D is in Z[x]. This in turn is equivalent with 2s ∈ Z and s2 − t2D ∈ Z. Hence set s0 = 2s, or s = s0/2. We have two possibilities: s0 is either even or odd. 0 2 Suppose s is even. Then√s ∈ Z, so t D ∈ Z which, since D is square-free, means t ∈ Z. Thus α = s + t D ∈ R since s, t ∈ Z, so OK ⊂ R. Next suppose s0 is odd. Let n = s2 − t2D ∈ Z, which we rewrite as n = (s0/2)2 − t2D, or 4n = (s0)2 − (2t)2D meaning that (2t)2D ∈ Z which, again by D being square-free, means 2t ∈ Z. Set t0 = 2t ∈ Z. Then 4n = (s0)2 − (t0)2D. Reducing this modulo 4 we get 0 ≡ 1 − (t0)2D (mod 4) since s0 is odd, so (t0)2D ≡ 1 (mod 4). Now (t0)2 is either 0 or 1 modulo 4, but we can’t have it being 0 since the product is 1, so (t0)2 ≡ 1 (mod 4), and D ≡ 1 (mod 4). Hence t0 is odd, and we learned along the way that this second case, with s0 being odd, never happens if D 6≡ 1 (mod 4). Now D ≡ 1 (mod 4) means √ √ s0 t0 √ s0 − t0 1 + D α = s + t D = + D = +t0 ∈ R 2 2 2 2 | {z } ∈Z since s0 and t0 are both odd.
2.2 The Cyclotomic Fields
We learned last time that it suffices to consider K = Q(ξm) with m even, since Q(ξm) = Q(ξ2m) if m is odd. There are two things we aspire to show: first, that Q(ξm) are all distinct for even m, and secondly that OK = Z[ξm]. We’ll recall some basic properties we’ll need. Let ω = e2πi/m. 1.A (Galois) conjugate of ω is a root of a minimal polynomial of ω over Q. Note that this has coefficients in Z since ω is an algebraic integer. 2. Every conjugate of ω is a primitive mth root of unity, i.e. a root of xm − 1 but not a root of xn − 1 for n < m. 3. The conjugates are ωk for 1 ≤ k ≤ m, gcd(k, m) = 1.
Let ξm be a primitive mth root of unity and let K = Q(ξm). Then
4. deg(K/Q) = ϕ(m), 5. Gal(K/Q) =∼ (Z/mZ)×. 2.3 Embeddings in C 7
Proposition 2.2.1. If m is even, then the only roots of unity in Q(ξm) are n m mth roots of unity. In other words, θ ∈ Q(ξm) with θ = 1 implies θ = 1. In addition, if m is off, then the only roots of unity in Q(ξm) are (2m)th roots of unity.
Proof. The second result follows immediately from the first since Q(ξm) = Q(ξ2m) for m odd. Hence assume m is even. Let θ ∈ Q(ξm) be a primitive kth root of unity. Then Q(ξm) contains a primitive rth root of unity where r = lcm(k, m), since 2πi/k 2πi/m if θ = e and ξm = e and gcd(k, m) = d = ak + bm, we have
b a bm+ak d b a 2πi( k + m ) 2πi mk 2πi mk 2πi/ lcm(m,k) 2πi/r ξr = θ ξm = e = e = e = e = e .
Now ϕ(r) | ϕ(m). However m | r, meaning that m = r so k | m.
Corollary 2.2.2. Let m be even. Then the mth cyclotomic fields Q(ξm) are all distinct.
In order to prove OK = Z[ξm], we first need to discuss embeddings in C.
2.3 Embeddings in C Let K = Q(α) be a number field of degree n, meaning that α has n conjugates (since its minimal polynomial is of degree n). Each conjugate of β determines a unique embedding of K in C, namely σ : K → C, α 7→ β. Hence there are exactly n embeddings of K in C. √ √ Example 2.3.1. Let√ K = Q( D√) with D square-free. Then D has two conjugates,√ namely√ D itself and − D. So we have two embeddings,√ σ1 : K√ → C by D 7→ D, namely the identity map, and σ2 : K 7→ C by D 7→ − D. Since both of the embeddings are contained in K, this is a Galois extension. N √ √ 3 3 Example 2.3.2. Let K = Q( 2). Since the minimal polynomial of 2 over Q 3 is x − 2, there are three conjugates,√ √ and we√ can see them all by factoring√ this 3 3 3 2 polynomial over C. We get 2, 2ω, and 2ω , where√ ω =√ (−1√ + −√3)/2. 3 3 3 3 Hence√ we have√ three embeddings, σ1, σ2, σ3 : K → C by 2 7→ 2, 2 7→ 2ω, 3 3 2 and 2 7→ 2ω respectively. √ √ 3 3 2 This extension is not Galois since 2ω and 2ω are not in K. N More generally, if K m L n
Q meaning that K and L are number fields, then every embedding σ : L → C can be extended to m embeddings of K in C. In particular K has m embeddings in C that fix L pointwise. TRACES, NORMS, AND DISCRIMINANTS 8
Lecture 3 Traces, Norms, and Discriminants
3.1 Traces and Norms
Let K be a number field of degree n, and let σ1, σ2, . . . , σn be the embeddings of K in C.
Definition 3.1.1. If σi(K) ⊂ R, then σi is called a real embedding. Otherwise σi is called a complex embedding. Fact. The complex embeddings come in pairs. Since σ(x) := σ(x) is also a complex embedding of K in C.
Letting r1 denote the number of real embeddings and r2 denote the number of conjugate pairs of complex embeddings, we of course have n = r1 + 2r2. Definition 3.1.2 (Trace and norm). For any α ∈ K, we define the trace of α to be
TrK/Q(α) = σ1(α) + σ2(α) + ... + σn(α) and the norm of α to be
NK/Q(α) = σ1(α)σ2(α) · ... · σn(α). When it is obvious from context what field K we are working over we will omit the subscript K/Q.
Exercise 3.1.3. Consider the Q-linear map Lα : K → K defined by Lα(x) = αx. Then TrK/Q(α) = tr(Lα) and NK/Q(α) = det(Lα). The trace and norm behave largely as expected:
1. Tr(α + β) = Tr(α) + Tr(β),
2. If r ∈ Q, then Tr(rα) = r Tr(α) and Tr(r) = nr, 3.N( αβ) = N(α) N(β), and
4. If r ∈ Q, then N(rα) = rn N(α) and N(r) = rn. Theorem 3.1.4. Let K be a number field of degree n and let α ∈ K. Then n Tr (α) = Tr (α) K/Q d Q(α)/Q and n/d NK/Q(α) = NQ(α)/Q(α) where d = [Q(α): Q], meaning that d | n.
Proof. There exist d embeddings of Q(α) in C, say σ1, σ2, . . . , σd. Each σi extends to exactly n/d embeddings of K in C, say σi,1, σi,2, . . . , σi,n/d. Then
{ σi,j } 1≤i≤d 1≤j≤n/d
Date: January 16th, 2018. 3.1 Traces and Norms 9 gives all n embeddings of K in C. Hence X X n n Tr (α) = σ (α) = σ (α) = Tr (α) K/Q i,j d i d Q(α)/Q i,j i since for α ∈ K we have σi,j(α) = σi(α) for all j. Similarly
n/d Y Y n/d NK/Q(α) = σi,j(α) = (σi(α)) = NQ(α)/Q(α) . i,j i
Corollary 3.1.5. For α ∈ K we have
TrK/Q(α), NK/Q(α) ∈ Q and if α ∈ OK we have
TrK/Q(α), NK/Q(α) ∈ Z.
d d−1 Proof. Let pα(x) = x + ad−1x + ... + a1x + a0 be the monic minimal polynomial of α over Q. Then we know moreover that
pα(x) = (x − σ1(α))(x − σ2(α)) · ... · (x − σd(α)) since σi(α) are the Galois conjugates of α. Hence, multiplying out,
TrQ(α)/Q(α) = −ad−1 ∈ Q and
NQ(α)/Q(α) = ±a0 ∈ Q (with the plus or minus depending on the degree d). By Theorem 3.1.4 we therefore moreover have TrK/Q(α) and NK/Q(α) in Q, since they are rationals times integers. If α ∈ OK , then ai ∈ Z by definition, whence the above become the products of integers, hence integers. Note that
TrK/Q : K → (Q, +) is an additive homomorphism and that
∗ NK/Q : K → (Q, ·) is a multiplicative homomorphism, whereby K∗ we mean K \{ 0 }. √ √ Example 3.1.6. Let K = Q( D) with D square-free, and let α = a + b D with a, b ∈ Q. Then √ √ Tr(α) = (a + b D) + (a − b D) = 2a and √ √ 2 2 N(α) = (a + b D)(a − b D) = a − b D. N 3.2 Relative Trace and Norm 10
These constructs are not just idle curiosities. Indeed, computing norms can offer insight into whether something is a unit. −1 Let α ∈ OK be a unit. Then N(α) = ±1. Seeing this is easy: α, α ∈ OK implies N(αα−1) = N(1) = 1, meaning that N(α) N(α−1) = 1, so N(α) | 1. Another observation: let α ∈ OOK so that N(α) ∈ Z is a prime. Then α is irreducible, since α = β1β2 is equivalent with N(α) = N(β1) N(β2), so if the left-hand side is a prime, one of the elements in the right-hand side must be a unit. √ Example 3.1.7. This tells us that, for instance, Z[ −2]× = { 1, −1 } since 2 2 N(α) = a + 2b = 1, forcing b =√ 0, and hence a = ±1.√ √ It also tells us that, say, 9 + 10 is irreducible in Z[ 10] since N(9 + 10) = 71, which is prime. N
3.2 Relative Trace and Norm
We don’t necessarily have to consider traces and norms with respect to Q. Definition 3.2.1 (Relative trace and norm). Let K and L be two number fields such that K ⊂ L and [L : K] = n. Let σ1, σ2, . . . , σn be the n embeddings of L in C that fix K. For all α ∈ L we define the relative trace
TrL/K (α) = σ1(α) + σ2(α) + ... + σn(α) and relative norm by
NL/K (α)σ1(α)σ2(α) · ... · σn(α).
As before, these are homomorphisms, and if α ∈ OL, then TrL/K (α) ∈ OK . Theorem 3.2.2 (Transitivity). Let K, L, and M be number fields with K ⊂ L ⊂ M. Then for α ∈ M we have
TrL/K (TrM/L(α)) = TrM/K (α) along with NL/K (NM/L(α)) = NM/K (α).
Proof. Let [M : L] = m and [L : K] = n, and let τ1, τ2, . . . , τm be the embed- dings of M in C which fix L, and σ1, σ2, . . . , σn be the embeddings of L in C which fix K. Take N to be a normal extension of Q such that N ⊃ M. The extension being normal means that every embedding can be extended to it, whereby we letτ ¯i be the extension of τi to N andσ ¯j the extension of σj to N. In other words they are automorphisms from N to N. Then we have X X X X X TrL/K (TrM/K (α)) = TrL/K τi(α) = σj(τi(α)) = σ¯j(¯τi(α)) i j i j i
(where we brought the inner sum outside since these are homomorphisms). We now need to show that this is the same as TrM/K (α). 3.3 Discriminants 11
Now we know by degree considerations that there are nm embeddings of M in C that fix K, so if we can show that
{ σ¯jτ¯i }1≤i≤m 1≤j≤n are all distinct, then they must be the mn embeddings of M in C that fix K.
Suppose therefore thatσ ¯j1 τ¯i1 =σ ¯j2 τ¯i2 on M. For x ∈ L,
σ¯j1 (¯τi1 (x)) =σ ¯j1 (τi1 (x)) =σ ¯j1 (x) and
σ¯j2 (¯τi2 (x)) =σ ¯j2 (τi2 (x)) =σ ¯j2 (x) since in L,τ ¯i(x) = τi(x) = x. Moreover these two remain equal, soσ ¯j1 (x) =
σ¯j2 (x) for all x ∈ L. By the same argument again therefore this means σj1 (x) =
σj2 (x), so j1 = j2, whenceτ ¯i1 =τ ¯i2 on M, so i1 = i2. The argument for the norm is almost identical, but with products in place of sums.
3.3 Discriminants Definition 3.3.1 (Discriminant). Let K be a number field of degree n and let σ1, σ2, . . . , σn be the n embeddings of K in C. For any n elements
α1, α2, . . . , αn ∈ K, we define the discriminant of α1, α2, . . . , αn by 2 disc(α1, α2, . . . , αn) = det[σi(αj)]1≤i≤n . 1≤j≤n Note that because of the square the discriminant is independent of the order of embeddings or elements, since permuting these simply changes the sign, but squaring nullifies this. Theorem 3.3.2. With the setup as in the definition above, we have
disc(α1, α2, . . . , αn) = det([TrK/Q(αiαj)]). Proof. Note that
det([TrK/Q(αiαj)]) = [σj(αi)][σi(αj)] where the two matrices in the right-hand side are each others transposes. Taking determinants now we get the expression sought.
Corollary 3.3.3. Again with the setup as above, we have disc(α1, α2, . . . , αn) ∈ Q and if αi ∈ OK , then disc(α1, α2, . . . , αn) ∈ Z. THE ADDITIVE STRUCTURE OF THE RING OF INTEGERS 12
Lecture 4 The Additive Structure of the Ring of Integers
4.1 More on Discriminants
Theorem 4.1.1. With the same setup as above, disc(α1, α2, . . . , αn) = 0 if and only if α1, α2, . . . , αn are linearly dependent over Q.
Proof. We start by assuming α1, α2, . . . , αn are linearly dependent over Q. This means that [σi(αj)] has linearly dependent column vectors over Q, whence its determinant is 0, and so the discriminant is 0 too. For the converse, assume disc(α1, α2, . . . , αn) = 0. This means that the matrix [Tr(αiαj)] has linearly dependent row vectors. Let Ri be the ith row vector of this matrix. Then
a1R1 + a2R2 + ... + anRn = 0 (4.1.1) for some ai ∈ Q with not all ai = 0. Let
α = a1α1 + a2α2 + ... + anαn.
The jth coordinate of (4.1.1) is
a1 Tr(α1αj) + a2 Tr(α2αj) + ... + an Tr(αnαj) = 0 which by linearity of trace we can rewrite as
Tr((a1α1 + a2α2 + ... + anαn)αj) = 0 and so in all Tr(ααj) = 0 for all j. Now suppose α1, α2, . . . , αn are linearly independent over Q, meaning that they form a basis for K over Q and α 6= 0 implies αα1, αα2, . . . , ααn is also a basis for K over Q. But the last equation above then means that Tr is 0 on a basis, whence Tr(x) = 0 for all x ∈ K. This is a contradiction, since we know Tr(1) = n 6= 0.
We get a special case of the discriminant if we select α1, α2, . . . , αn to be powers of α:
Proposition 4.1.2. Let K = Q(α) with [K : Q] = n. Then 1, α, α2, . . . , αn−1 is a basis for K over Q, and
2 n−1 Y 2 disc(α) := disc(1, α, α , . . . , α ) = (αi − αj) 1≤i Proof. We have j 2 j 2 disc(α) = det([σi(α )]) = det([αi ]) which is a Vandermode matrix, so this is equal to 2 Y 2 Y n(n−1)/2 Y Y (αi − αj) = (αi − αj) = (−1) (αi − αj) 1≤i 2πi/p Example 4.1.3. Let K = Q(ξp) with ξp = e and p a prime. Say p ≥ 3, since otherwise K = Q. Then [K : Q] = ϕ(p) = p − 1, whence we consider p−2 disc(ξp) = disc(1, ξp, . . . , ξp ). The monic minimal polynomial of ξp over Q is f(x) = xp−1 + xp−2 + ... + x + 1 0 0 and we would like to compute f (ξp) and in particular NK/Q(f (ξp)). Rather than doing this directly, note that xp − 1 = (x − 1)f(x), so by the product rule pxp−1 = f(x) + (x − 1)f 0(x) whereby if we plug in x = ξp we have p−1 0 pξp = (ξ − 1)f (ξp) which we rearrange to p−1 0 pξp f (ξp) = . ξp − 1 Taking norms and recalling that it is a multiplicative homomorphism, we get p−1 0 N(p) N(ξp) N(f (ξp)) = . N(ξp − 1) We evaluate the factors of the right-hand side one at a time. First, N(p) = pp−1. Next N(ξp) by definition is the product of then conjugates of ξp, so 2 p−1 p(p−1)/2 p (p−1)/2 N(ξp) = ξpξp · ... · ξp = ξp = (ξp ) = 1. Finally p−1 p−2 2 p−1 f(x) = x + x + ... + x + 1 = (x − ξp)(x − ξp) · ... · (x − ξp ) whereby 2 p−1 f(1) = p = (1 − ξp)(1 − ξp) · ... · (1 − ξp ) = N(1 − ξp) = N(ξp − 1). 4.2 The Additive Structure of the Ring of Integers 14 Hence pp−1 · 1 N(f 0(ξ )) = = pp−2, p p whereby p(p−1)/2 p−2 disc(ξp) = (−1) p , p−2 meaning that disc(ξp) | p . N Something more general is true: 2πi/m ϕ(m) Example 4.1.4. Let K = Q(ξm), with ξm = e . Then disc(ξm) | m . m Let f(x) be the monic minimal polynomial of ξm over Q again. Then x − 1 = f(x)g(x), wherein f(x), g(x) ∈ Z[x] by Gauss lemma. Therefore again by the product rule mxm−1 = f 0(x)g(x) + f(x)g0(x) m−1 0 whence mξm = f (ξm)g(ξm), since ξm is a root of f(x) itself. Multiplying both sides by ξm we get 0 m = ξmf (ξm)g(ξm) which if we take norms becomes ϕ(m) 0 m = N(f (ξm)) N(g(ξm)ξm) wherein g(ξm)ξm ∈ OK so the second norm is in Z. The same is true for the ϕ(m) first one, in both cases since ξm is an algebraic integer. Hence disc(ξm) | m , since note that the first norm is disc(ξm) save for maybe a sign. N 4.2 The Additive Structure of the Ring of Integers As per usual, let K be a number field of degree n. The goal now is to show that OK is a free abelian group of rank n (or in other words a free Z-module of rank n). First recall some facts about such matters: Fact. A free abelian group of finite rank n is isomorphic to n Z ⊕ Z ⊕ ... ⊕ Z = Z . Fact. Every subgroup of a free abelian group of rank n is also a free abelian group of rank m ≤ n. For an example that doesn’t decrease the rank, consider for instance replacing one of the Z’s above with 2Z. Corollary 4.2.1. Let G1 ⊂ G2 ⊂ G3 be groups. If G1 and G3 are free abelian groups of rank n, then so is G2. Hence our strategy is to show that OK is between two free abelian groups of rank n. Remark 4.2.2. We know that K has a basis α1, α2, . . . , αn over Q. In fact, we can arrange for this basis to be elements from OK , since any αi = βi/mi with βi ∈ OK and mi ∈ Z, so scaling to get rid of the denominator yields a basis in OK . Hence K = Q(α), and we can assume α ∈ OK , and the basis is 1, α, . . . , αn−1. 4.2 The Additive Structure of the Ring of Integers 15 Notice now that K ⊃ OK )Z[α]. The latter is a free abelian group of rank n though, so half of our inclusion is trivially true. Letting α1, α2, . . . , αn be a basis contained in OK , consider G = Zα1 ⊕ Zα2 ⊕ ... ⊕ Zαn = { m1α1 + m2α2 + ... + mnαn | mi ∈ Q }. This is a free abelian group of rank n on α1, α2, . . . , αn. We claim, and soon prove, that 1 α α α G ⊂ O ⊂ G = 1 ⊕ 2 ⊕ ... ⊕ n , K d Z d Z d Z d where d = disc(α1, α2, . . . , αn). 1 Theorem 4.2.3. Let d = disc(α1, α2, . . . , αn). Then OK ⊂ d G. In fact, every α ∈ OK can be written as m α + m α + ... + m α α = 1 1 2 2 n n d 2 with mi ∈ Z and d | mi . Before we prove this, note two things: d 6= 0 since αi is a basis, and d ∈ Z since αi ∈ OK . Note also that this being true, we have the immediate corollary we want: Corollary 4.2.4. Let K be a number field of degree n. Then OK is a free abelian group of rank n. Proof of theorem. Let α ∈ OK , whereby α = x1α1 + x2α2 + ... + xnαn for some xi ∈ Q with αi forming a basis. Moreover let σ1, σ2, . . . , σn be the embeddings of K in C. Then we have the following system of equations σ1(α1)x1 + σ1(α2)x2 + ... + σ1(αn)xn = σ1(α) σ2(α1)x1 + σ2(α2)x2 + ... + σ2(αn)xn = σ2(α) . . σn(α1)x1 + σn(α2)x2 + ... + σn(αn)xn = σn(α) which we rewrite as x1 σ1(α) x2 σ2(α) σ (α ) = . i j . . . . xn σn(α) 2 If we solve for xi using Cramer’s rule, letting δ = det([σi(αj)]), whence δ = d ∈ Z, we have xi = γi/δ, where σi is the determinant of the original matrix [σi(αj)] except with the ith column replaced by the right-hand side of the system. 2 Hence dxi = δ γi/δ = δγi. INTEGRAL BASES 16 Lecture 5 Integral Bases 5.1 Integral Bases We start by finishing the proof from last time. Proof continued. We have dxi = δγi, where since d ∈ Z and xi ∈ Q the left- hand side is in Q, and the right-hand side is an algebraic integer if α ∈ OK , so indeed it is an integer. Then dxi = mi ∈ Z, and we write xi = mi/d, whence m α + m α + ... + m α α = 1 1 2 2 n n . d Next we have m2 d2x2 γ2 i = i = dx2 = δ2 i = γ2, d d i δ2 i 2 which is an algebraic integer in Q, so it’s an integer. Hence d | mi . Definition 5.1.1 (Integral basis). Let K be a number field of dimension n. Since OK has a basis over Z (since it’s a free abelian group) there exists β1, β2, . . . , βn ∈ OK such that every α ∈ OK can be written as α = m1β1 + m2β2 + ... + mnβn, with mi ∈ Z. Such a basis { β1, β2, . . . , βn } is called an integral basis for OK (or K). Note that if we take mi ∈ Q, the above will span K. √ Example√ 5.1.2. Let K = Q( D),√ with D being square-free. Then OK is Z[ D] if D 6≡ 1 (mod 4) and Z[(1 + D)√/2] if D ≡ 1 (mod 4). √ Hence OK has the integral basis { 1, D } in the former case and { 1, (1 + D)/2 } in the latter. N 2πi/pr r Theorem 5.1.3. Let K = Q(ξpr ), with ξpr = e and p ≥ 3 for p prime. Then OK = Z[ξpr ]. We will need two lemmas to prove this. Lemma 5.1.4. With ξ = ξpr as above, Z[1−ξ] = Z[ξ], and disc(1−ξ) = disc(ξ). Proof. The first statement is clear: Z[ξ] ⊂ Z[1 − ξ] since 1 − ξ ∈ Z[ξ], and since 1 − (1 − ξ) = ξ the opposite inclusion is true as well. For the second statement we have Y 2 Y 2 disc(ξ) = (αi − αj) = ((1 − αi) − (1 − αj)) = disc(1 − ξ) 1≤i Lemma 5.1.5. With the same setup, Y (1 − ξk) = p 1≤k≤pr p-k meaning that N(1 − ξ) = p. Proof. The monic minimal polynomial of ξ over Q is r Y xp − 1 tp − 1 f(x) = (x − ξk) = = = tp−1 + tp−2 + ... + t + 1 xpr−1 − 1 t − 1 1≤k≤pr p-k r−1 with t = xp . Hence taking x = 1 gives t = 1 i.e. Y f(1) = p = (1 − ξk). 1≤k≤pr p-k We are now ready to prove the theorem: 2 n−1 Proof of theorem. Taking, again, ξ = ξpr , we have { 1, ξ, ξ , . . . , ξ } ⊂ OK with n = ϕ(pr) is a basis for K over Q. So therefore is 2 n−1 { 1, (1 − ξ), (1 − ξ) ,..., (1 − ξ) } ⊂ OK . Hence we can write α ∈ OK as m + m (1 − ξ) + ... + m (1 − ξ)n−1 α = 1 2 n , d where d = disc(1 − ξ) = disc(ξ) by the first lemma. We now claim that OK = Z[1 − ξ], which by the first lemma is the same as Z[ξ]. Suppose OK is strictly larger than Z[1 − ξ], i.e. there exists some α ∈ OK r ϕ(pr ) r such that d - mi for some i. Note that d | (p ) , so d = p for some r. There must exist some element in OK of the form t (1 − ξ)i−1 + ... + t (1 − ξ)n−1 β = i n p with t - ti. r Hint. Let mj1 , ··· , mjk be all mis that are not divisible by d = p . Write rjs mjs = p tjs with (tjs , p) = 1 and rjs < r. Take i to be the smallest index such r−ri−1 that ri = min{ rjs }. Consider p α subtracting its first (i − 1) terms. r Note that 1 − ξ | 1 − ξk for all k, so (1 − ξ)ϕ(p ) = (1 − ξ)n | p, meaning that p/(1 − ξ)i ∈ Z[ξ] for 1 ≤ i ≤ n. Therefore p ti β = + t + t (1 − ξ) + ... + t (1 − ξ)n−i−1, (1 − ξ)i 1 − ξ i+1 i+2 n where the sum of the latter terms are in Z[1 − ξ] ⊂ OK . This implies that ti/(1 − ξ) ∈ OK , whence t N i ∈ K/Q 1 − ξ Z n n so ti / N(1 − ξ) = ti /p ∈ Z, meaning that p | ti, which is a contradiction. Hence OK = Z[1 − ξ] = Z[ξ]. 5.2 Composite Field 18 Theorem 5.1.6. Let { α1, α2, . . . , αn } and { β1, β2, . . . , βn } be two integral bases for OK . Then disc(α1, α2, . . . , αn) = disc(β1, β2, . . . , βn). Proof. Express the αi in terms of βj’s, say α1 β1 α2 β2 = M . . . . αn βn where M is an n × n matrix with elements in Z. Hence 2 2 2 disc(α1, α2, . . . , αn) = det([σi(αj)]) = det(M[σi(βj)]) = det(M) disc(β1, β2, . . . , βn) whence disc(β1, β2, . . . , βn) | disc(α1, α2, . . . , αn) and by reversing the argument we have the opposite divisibility as well. Now since they both have the same sign, since det(M)2 > 0, they must be equal. Definition 5.1.7. Let K be a number field, and let { α1, α2, . . . , αn } be an integral basis for OK . Define the discriminant of K (or OK ) by disc(K) = disc(OK ) = disc(α1, α2, . . . , αn) which by the above theorem is independent of the choice of integral basis. Exercise 5.1.8. Consider { α1, α2, . . . , αn } ⊂ OK . Then { α1, α2, . . . , αn } is an integral basis for OK if and only of disc(α1, α2, . . . , αn) = disc(K). 5.2 Composite Field Definition 5.2.1 (Composite Field). Let K be a number field of degree m and let L be a number field of degree n. Then n r o X KL = αiβi αi ∈ K, βi ∈ L i=1 is a subfield of C called the composition of K and L. We have the following inclusions: C KL K L K ∩ L Q A few facts follow immediately from this definition: COMPOSITION FIELDS 19 • [KL : K] ≤ [L : L ∩ K], • [KL : Q] ≤ [K : Q] · [L : Q], • Define n r o X OK ·OL = αiβi αi ∈ OK , βi ∈ OL i=1 then OK ·OL ⊂ OKL. Lecture 6 Composition Fields 6.1 Cyclotomic Fields again We are soon ready to prove what has been our goal for quite some time now, namely that if K = Q(ξm), then OK = Z[ξm]. We know that this is true for m = pr, so what about general m? The strategy is to prime factorise m, because r s if, say, m = p q with p and q distinct primes, and L = Q(ξpr ) and M = Q(ξqs ), we have K = Q(ξm) L = Q(ξpr ) M = Q(ξqs ) Q and we will show that K = LM and moreover OK = Z[ξpr ]Z[ξqs ] = Z[ξm]. Proposition 6.1.1. Let K and L be two number fields with [K : Q] = m and [L : Q] = n. Assume [KL : Q] = mn, i.e. K ∩ L = Q, and let d = gcd(disc(K), disc(L)). Then 1 O ⊂ O ·O . KL d K L An immediate result that follows is Corollary 6.1.2. Under the same assumptions, if d = 1, we have OK ·OL = n m OKL and disc(KL) = disc(K) disc(L) . To prove the proposition, we first establish the following lemma: Lemma 6.1.3. Assume [KL : Q] = mn. Let σ be an embedding of K in C and let τ be an embedding of L in C. Then there exists an embedding of KL in C which restricted to K is σ and restricted to L is τ. Proof. We know that σ has n distinct extensions to embeddings of KL in C, sayσ ˜1,..., σ˜n, and moreoverσ ˜i|L 6=σ ˜j|L for i 6= j since otherwiseσ ˜i =σ ˜j on KL, but they’re all distinct. Henceσ ˜i|L give n distinct embeddings of L in C, but there are only n distinct embeddings of L in C since [L : Q] = n, so we must haveσ ˜i|L = τ for some i. Date: January 25th, 2018. 6.1 Cyclotomic Fields again 20 Proof of proposition. Let { α1, α2, . . . , αm } be an integral basis for OK and let { β1, β2, . . . , βn } be an integral basis for OL. Then { αiβj } for 1 ≤ i ≤ m and 1 ≤ j ≤ n is a basis for KL over Q. It is also a basis for OK ·OL over Z, and { αiβj } ⊂ OKL. By Theorem 4.2.3, for γ ∈ OKL we can write X aij γ = α β M i j i,j where M, aij ∈ Z and we have reduced in such a way that gcd(M, a11, . . . , amn) = 1, and we know that M | disc(αiβj). 1 Our goal is to show that M | d since then OKL ⊂ d OK OL. Let σ1, σ2, . . . , σn be the embeddings of K in C. For each σi there is an extensionσ ˜i to KL such thatσ ˜i|L = IdL by the lemma. Apply this to γ above, whence X aij X σ˜ (γ) = σ (α )β = σ (α )x k M k i j k i i i,j j where X aij x = β . i M j j Setting this up as a matrix equation we have x1 σ˜1(γ) x2 σ˜2(γ) σ (α ) = k i . . . . xm σ˜m(γ) 2 which we solve using Cramer’s rule by taking δ = det([σk(αi)]), so δ = disc(K) and letting γi be the determinant of [σk(αi)] with the ith column replaced by the right-hand side above. Then xi = γi/δ. Note that δ, γi ∈ A, the algebraic integrals in C, and so γ disc(K)x = δ2 i = δx ∈ A, i δ i but the left-hand side is X disc Kaij β ∈ L ∩ A = O M j L j since βj form an integral basis for OL. Hence disc(K)a ij ∈ , M Z so M | disc(K) since by construction M and aij are all coprime. By the exact same argument but replacing K with L we have M | disc(L), so M | gcd(disc(K), disc(L)). If d = 1, then M = ±1, so { αiβj } is an integral basis for OKL, and then by computation n m disc(KL) = disc(αiβj) = disc(K) disc(L) . 6.2 Prime Decomposition in Rings of Integers 21 2πi/m Corollary 6.1.4. Let K = Q(ξm) with ξm = e . Then OK = Z[ξm]. r1 r2 Proof. It suffices to consider m = p1 p2 with p1 and p2 distinct primes. Let r1 r2 m1 = p1 and m2 = p2 , and let K1 = Q(ξm1 ) and K2 = Q(ξm2 ). Then m1 OK1 = Z[ξm1 ] and OK2 = Z[ξm2 ]. Note that gcd(m1, m2) = 1 and ξm = ξm2 and ξm2 = ξ , and so ξ = ξs ξt where sm + tm = 1, s, t ∈ . m m1 m m1 m2 1 2 Z ϕ(mi) Since disc(Ki) | mi , we have gcd(disc(K1), disc(K2)) = 1, and hence by the corollary above OK = OK1 ·OK2 = Z[ξm1 ]Z[ξm2 ] = Z[ξm]. Remark 6.1.5. With K = Q(α) ⊂ OK , we do not in general have OK = Z[α]. 6.2 Prime Decomposition in Rings of Integers Before proceeding we need some prerequisites from commutative algebra. Definition 6.2.1 (Dedekind domain). A Dedekind domain D is a Noethe- rian, integrally closed domain of dimension 1. Definition 6.2.2 (Noetherian ring). A Noetherian ring R is a ring that satisfies the ascending chain condition, meaning that a chain of ideals I1 ⊂ I2 ⊂ I3 ⊂ ... is stationary, i.e. eventually Ik = Ik+1 = .... This is in turn equivalent with every ideal I ⊂ R being finitely generated as an R-module. Definition 6.2.3 (Integrally closed). Let D be an integral domain and let K be its quotient field. Then D is integrally closed if α ∈ K and f(α) = 0 for some monic f(x) ∈ D[x] implies α ∈ D. Definition 6.2.4 (Dimension 1). A Noetherian domain R is of dimension 1 if every nonzero prime ideal is maximal. Theorem 6.2.5. The ring OK is a Dedekind domain. ∼ n Proof. To see that OK is Noetherian, take I ⊂ OK = Z , with this isomorphism being as Z-modules. Then I is finitely generated as a Z-module, and since Z ⊂ OK , it is also finitely generated as an OK -module. By definition of OK , it is the integral closure of Z in K and hence it is integrally closed. Finally to see that OK is of dimension 1, take P ⊂ OK a nonzero prime ideal. We want to show that P is maximal, which is equivalent to OK /P being a field. Since P is prime by assumption, we know first of all that OK /P is an integral domain. It suffices to show that OK /P is finite since every finite integral domain is a field. Let α 6= 0 in P and let Z 3 m = NK/Q(α) = αβ, where by β we mean the product of the remaining Galois conjugates of α. Then m β = ∈ K ∩ A = O α K whence m = αβ ∈ P , since α ∈ P and β ∈ OK . Hence (m) ⊂ P ⊂ OK , and so O O n K ≤ K = Z < ∞ P (m) nZn by the aforementioned isomorphism. IDEAL FACTORISATION 22 Definition 6.2.6. Let D be a Dedekind domain and K its field of fractions. Let I,J ⊂ D be ideals. We say that I divides J, denoted I | J if I ⊃ J. We also call gcd(I,J) = I + J, lcm(I,J) = I ∩ J, and n o X IJ = aibi ai ∈ I, bi ∈ J i and then I ∩ J = IJ if gcd(I,J) = 1 = (1) = D, i.e. I and J are coprime. Lecture 7 Ideal Factorisation 7.1 Unique Factorisation of Ideals In the following discussion, D is a Dedekind domain and K is its field of fractions. Lemma 7.1.1. For every ideal I ⊂ D, I 6= 0, there exists nonzero prime ideals P1,P2,...,Pr such that P1 · P2 · ... · Pr ⊂ I, P1,P2,...,Pr not necessarily distinct. Note how this corresponds to how every integer has at least one prime factor. Proof. Let S be the set of proper ideals of D for which the assumption in the lemma is false. Assume S 6= ∅—we clearly wish for this to lead to a contradiction. Since D is Noetherian, S has a maximal element, say M. Then M is not a prime ideal, for otherwise it is its own factor P1 above. Hence there exists some b1, b2 ∈ D \ M such that b1b2 ∈ M. Now consider I1 = (b1) + M ) M and I2 = (b2) + M ) M. Since M is maximal in S, we must have I1,I2 6∈ S. Therefore they do contain products of prime ideals, so there exist P1 · P2 · ... · Pr ⊂ I1 and Q1 · Q2 · ... · Qs ⊂ I2, where Pi and Qj are prime ideals. Then P1P2 ··· PrQ1Q2 ··· Qs ⊂ I1I2 ⊂ M, since b1b2 ∈ M, which is a contradiciton. Hence S = ∅. Lemma 7.1.2. Let P be a nonzero prime ideal in D and define P −1 = { x ∈ K | xP ⊂ D }. Then IP −1 6= I for every nonzero ideal I ⊂ D. Note that n r o −1 X −1 IP = aixi ai ∈ I, xi ∈ P i=1 so, for example, 1 P −1 = (p )−1 = Z pZ in Z. Date: January 30th, 2018. 7.1 Unique Factorisation of Ideals 23 Proof. We claim first that P −1 6⊂ D. Let 0 6= a ∈ P with (a) ( P . Take P1P2 ··· Pr ⊂ (a) ( P with r as small as possible, which is of course possible by the above lemma. Note r ≥ 2. Then if Pi 6⊂ P for every i = 1, 2, . . . , r, then there exist αi ∈ Pi, hence αi ∈ P for every i = 1, 2, . . . , r, such that α1α2 ··· αr 6∈ P. But since P is a prime ideal, this is a contradiction. Hence Pi ⊂ P for some i. But in a Dedekind domain every nonzero prime ideal is maximal, so Pi = P . Without loss of generality, let us assume P1 = P (if not, just rearrange and relabel). Now P2P3 ··· Pr 6⊂ (a) since r was chosen as small as possible. Take b ∈ P2P3 ··· Pr, b 6∈ (a). Then b P ⊂ D, a which means by definition that b/a ∈ P −1, but b/a 6∈ D since b 6∈ (a). Now let I 6= 0, I ⊂ D, and let α1, α2, . . . , αn be the generators of I as a D-module (which must exist since D is Noetherian). Suppose IP −1 = I. For any x ∈ P −1, we have n X xαi = aijαj j=1 with aij ∈ D. Let A = [aij] and so α1 α1 α2 α2 A = x . . . . αn αn meaning that α1 α2 (xI − A) = 0 . . αn and so det(xI − A) = 0, and expanding this determinant we get some monic polynomial xn + ... ∈ D[x], whence x is integral over D. Since D is Dedekind it is also integrally closed, meaning that x ∈ D, whereby P −1 ⊂ D. That’s a contradiction, which means that IP −1 6= I. Theorem 7.1.3. Every nonzero proper ideal I ( D had a unique factorisation e1 e2 er I = P1 P2 ··· Pr with ei > 0 and Pi distinct primes. 7.1 Unique Factorisation of Ideals 24 Proof. We start with the existence proof. Let S be the set of nonzero proper ideals in D such that they do not have a prime factorisation. Assume, as before, that S 6= ∅. Then since D is Noetherian there must be a maximal element in S, call it M. Then we must have M ⊂ P ⊂ D for some maximal (and prime) ideal P . By definition, D ⊂ P −1, and by the lemma −1 −1 M ( MP ⊂ PP = D. The last equality comes from P ⊂ PP −1 ⊂ D, and since P is maximal we must have P = PP −1 or PP −1 = D, and by the lemma it can’t be the former. Since M ( P , and MP −1 6= D, for otherwise MP −1 = D which implies P = DP = MP −1P = M, which is a contradiction. Thus M ( MP −1 ( D, and since M is maximal in S this means that MP −1 6∈ S, so it by assumption has a unique prime factorisation, say −1 e1 e2 er MP = P1 P2 ··· Pr , where Pi are prime ideals and ei > 0. But if we just multiply by P we get e1 e2 er M = PP1 P2 ··· Pr which is a contradiction since M ∈ S, and so therefore S = ∅. Now let us consider uniqueness. Suppose e1 e2 er a1 a2 a2 I = P1 P2 ··· Pr = Q1 Q2 ··· Qs . This means that a1 a2 a2 Q1 Q2 ··· Qs ⊂ P1 implying that Qi = Pi for some i; let’s say Q1 = P1. Then −1 e1−1 e2 er a1−1 a2 a2 P1 I = P1 P2 ··· Pr = Q1 Q2 ··· Qs . Repeat this, yielding eventually Pi = Qi, ei = ai, and r = s. Note that the uniqueness proof is essentially the same as the standard proof of the same for integers. Note also that, as with integers, it is often very hard to actually factor ideals, even though we know we can. Definition 7.1.4 (Fractional ideal). Let D be a Dedekind domain and K its field of fractions. A fractional ideal of K is a finitely generated D-submodule of K. Example 7.1.5. Any ideal I ⊂ D is a fractional ideal. In particular, these are called integral ideals of K. N Remark 7.1.6. Consider I = (α1, α2, . . . , αk) as a D-module, with αi ∈ K. Then there exist di ∈ D such that diαi ∈ D, and so there exists a common denominator d ∈ D such that dI is an integral ideal. Lemma 7.1.7. For any ideal I ⊂ D, I 6= 0, I−1 = { x ∈ K | xI ⊂ D } is a fractional ideal. 7.1 Unique Factorisation of Ideals 25 Proof. Let x 6= 0 in I. Then xI−1 ⊂ D by the definition of I−1. Since D is Noetherian, we moreover know that xI−1 is finitely generated as a D-module, −1 say by xα1, xα2, . . . , xαn. Then for any y ∈ I we have n X xy = aixαi i=1 where ai ∈ D. Since D is an integral domain, we have cancellation laws, so n X y = aiαi, i=1 −1 meaning that I is generated by α1, α2, . . . , αn as a D-module, whence it’s a fractional ideal. Theorem 7.1.8. Let IK denote the set of all nonzero fractional ideals of K. Then IK is an abelian group under multiplication with identity D = (1) and the −1 inverse of I ⊂ IK is I = { x ∈ K | xI ⊂ D }. Proof. First we prove that it is closed under multiplication. For any I,J ∈ IK , if I is generated by α1, α2, . . . , αn and J by β1, β2, . . . , βm, then IJ is generated by αiβj. Hence IJ ∈ IK . Secondly, let us show that D = (1) is the identity, i.e. that DI = I for any I ⊂ IK . But this is immediately true since I is a D-module. Finally let us verify that the inverse does what it should. Let P be a prime ideal in D. Then P ( PP −1 ⊂ D, with the previous lemma guarantueeing P ( PP −1. Now since P is maximal, we must have PP −1 = D. For I ⊂ D, use the prime factorisation to conclude II−1 = D. Now for general I ∈ IK , there exists some d ∈ D such that dI ⊂ D, so (dI)(dI)−1 = D. Remark 7.1.9. Every nonzero fractional ideal I has a unique factorisation e1 e2 ek I = P1 P2 ··· Pk where, as compared to I ⊂ D, ei ∈ Z \{ 0 }, i.e. we allow negative powers. In other words IK is a free abelian group on the set of nonzero prime ideals of D. Definition 7.1.10 (Principal fractional ideal). The principal fractional ide- als of K are the D-modules in K of the form xD = (x), with x ∈ K. Let PK denote the set of nonzero principal fractional ideals in K. Then PK is a subgroup of IK . Definition 7.1.11 (Class group, class number). The class group of K is Cl(K) = IK /PK . Moreover h(K) = # Cl(K) is called the class number of K. Remark 7.1.12. The class group Cl(K) measures the failure of unique factori- sation in D, since if Cl(K) is trivial, i.e. h(K) = 1, then D is a principal ideal domain, which for a Dedekind domain means its a unique factorisation domain. We will show later that if K is a number field, then h(K) < ∞. RAMIFICATION 26 Example 7.1.13. Note that prime ideals in Z are√ not, when lifted, necessarily prime ideal in number fields. Suppose, say K = Q( −5), and consider √ √ 2 K ⊃ OK = Z[ −5] ⊃ 2OK = (2, 1 + −5) Q ⊃ Z ⊃ 2Z. Here 2Z√is a prime ideal in Z, but 2OK is not a prime ideal in OK . However, (2, 1 + −5) is. N Lecture 8 Ramification 8.1 Ramification Index Given a number field K, with a prime ideal pZ in Z ⊂ Q, it in general won’t be the case that pOK is a prime. However, by the discussion last time, it must be the case that e1 e2 er pOK = q1 q2 ··· qr where qi ⊂ OK are prime ideals. Definition 8.1.1 (Ramification). Let L ⊃ K be number fields. Let P ⊂ OK be a prime ideal. e1 e2 er (i) If P OL = q1 q2 ··· qr , then ei is called the ramification index of P at qi. We denote this by e(qi/P ). (ii) The prime P is said to be ramified in L if some ei > 1. Proposition 8.1.2. Let L ⊃ K be number fields. Let P ⊂ OK be a prime in OK , and q ⊂ OL be a prime in OL. The following are equivalent: (i) q | P OL, i.e. P OL ⊂ q; (ii) q ⊃ P ; (iii) q ∩ OK = P ; and (iv) q ∩ K = P . Remark 8.1.3. We say that q lies over P or P lies under q in the above proposition. Proof. (i) and (ii) are trivially equivalent by definition. Likewise (iii) and (iv) are equivalent since q ⊂ OL and OL ∩ K = OK . (iii) implies (ii) again trivially, and finally (ii) implies (iii) by noting that P ⊂ q ∩ OK ( OK , because it q ∩ OK = OK = (1), then 1 ∈ q meaning that q = OL, a contradiction. So P in the left-hand side is nonzero, making it maximal, so P = q ∩ OK . Theorem 8.1.4. Let L ⊃ K be number fields. Date: February 1st, 2018. 8.1 Ramification Index 27 (i) Every prime q ⊂ OL lies over a unique prime P ⊂ OK . (ii) Every prime P ⊂ OK lies under at least one prime q ⊂ OL. Proof. (i) follows from q ∩ OK being a prime in OK . Similarly, (ii) follows from P OL 6= OL, hence has a prime factorisation. −1 −1 Suppose P OL = OL. Then 1 ∈ P OL. Consider P ⊂ K, P 6⊂ OK . −1 Then there exists some e ∈ K \OK , r ∈ P , i.e. rP ∈ OK . Then rP OL ⊂ OK OL = OL, but 1 ∈ P OL, meaning that r ∈ OL ∩K = OK , a contradiction. In light of this, consider the following setting: L ⊃ OL ⊃ q K ⊃ OK ⊃ P. Consider the ring homomorphism O ϕ: O → L , K q where since q is a nonzero prime ideal it is also maximal, whence OL/q is a field. The kernel of this map is ker ϕ = OK ∩ q = P , so this induces a homomorphism O O ϕ¯: K ,→ L P q where both of the above quotients are fields. Hence OL q f OK P is a field extension. We call OL/q the residue field associated with q. The dimension of OL/q over OK /P as a fector space is called the inertial degree of q over P , denoted f(q/P ). Example 8.1.5. Consider 2 K = Q(i) ⊃ Z[i] = OK ⊃ 2OK = (1 − i) Q ⊃ Z ⊃ (2), where we then consider Z[i] (1−i) Z . 2Z 8.1 Ramification Index 28 2 Then we have 2OK = (1 − i) , so e((1 − i)/2) = 2 and [i] Z = 2 (1 − i) and |Z/2Z| = 2, so f((1 − i)/2) = 1. N Exercise 8.1.6. As expected, if we let M ⊃ L ⊃ K be number fields, and let R ⊃ q ⊃ P be primes in OM ⊃ OL ⊃ OK respectively, such that M ⊃ OM ⊃ R L ⊃ OL ⊃ q K ⊃ OK ⊃ P, we have e(R/P ) = e(R/q)e(q/P ) and f(R/P ) = f(R/q)f(q/P ). Theorem 8.1.7. Let e1 e2 er L ⊃ OL ⊃ P OL = q1 q2 ··· qr n K ⊃ OK ⊃ P and hence OL/qi fi OK /P with fi = f(qi/P ) and ei = e(qi/P ). Then r X eifi = n. i=1 Before we prove this, let us define a new notion and state a proposition which we’ll prove alongside the theorem. Definition 8.1.8 (Norm of ideal). Let K be a number field and let I ⊂ OK be an ideal. The norm of I is O N(I) = N (I) = K . K I Proposition 8.1.9. Let L ⊃ K be number fields with [L : K] = n. (i) For ideals I,J ⊂ OK , N(IJ) = N(I) N(J). 8.1 Ramification Index 29 n (ii) Let I ⊂ OK . Then NL(IOL) = NK (I) . (iii) Let α 6= 0, α ∈ OK . For the principal ideal (α) we have NK ((α)) = |NK/Q(α)|, where the first norm is that of an ideal, and the second is that of an element. Proof. We start by proving (i) from the proposition. First assume I and J are coprime, i.e. I +J = OK , or equivalently IJ = I ∩J. By the Chinese remainder theorem we have that OK /IJ = OK /I∩J is isomorphic to OK /I×OK /J. Hence O O O K = K · K , IJ I J whereby N(IJ) = N(I) N(J). m Secondly, assume I = P for P ⊂ OK prime, i.e. I is a prime power. Then the goal is to show that N(I) = N(P )m, and O N(I) = K P m by definition. Now note that 2 3 m OK ⊃ P ⊃ P ⊃ P ⊃ ... ⊃ P , where the norm between each is N(P ). Hence the claim is true if O P k N(P ) = K = . P P k+1 Fix any α ∈ P k \ P k+1, i.e. nonzero in the latter quotient. We have an isomor- phism OK /P → αOK /αP which is induced by the natural projection OK → αOK /αP. Now consider k k+1 the homomorphism ϕ: αOK → P /P , which is the natural inclusion since k αOK ⊂ P . k+1 Then ker ϕ = αOK ∩ P = αP , and αO + P k+1 P k Im ϕ = K = P k+1 P k+1 k+1 k+1 k since P ( αOK + P = P . k k+1 Therefore ϕ induces an isomorphism between αOK /(αP ) and P /P , whence O αO P k K = K = . P αP P k+1 Hence N is multiplicative for coprime ideals and prime powers, so for two arbi- trary ideals, first factor them, then apply these two. Next let us tackle the theorem for the special case K = Q. Then P = pZ, with p a prime, and e1 e2 er NL(pOL) = N(q1) N(q2) ··· N(qr) RAMIFICATION INDEX CONTINUED 30 fi by the above. Hence N(qi) = p , and r Pr Y fiei fiei NL(pOL) = p = p i=1 . i=1 Moreover O n N (pO ) = L = Z = pn, L L n pOL pZ whereby r X n = eifi, i=1 so the theorem holds for K = Q. Lecture 9 Ramification Index continued 9.1 Proof, continued We showed last time that the theorem we are working on is true for K = Q. Now let us use what we know so far to tackle the second part of the propo- sition, but first a lemma: Lemma 9.1.1. Let D be a Dedekind domain with the field of fractions K. Let B ⊂ A ( D be ideals. Then there exists r ∈ K such that rB ⊂ D but rB 6⊂ A. Proof. We know from last time that BB−1 = (1) = D, where B−1 is a frac- tional ideal. Hence there exists some α ∈ D such that αB−1 ⊂ D, and hence (αB−1)B = αD. Now since A ( D we have aA ( αD, and we select β ∈ αB−1 such that βB ⊂ αD but βB 6⊂ αA. Take r = β/α. Then rB = β/αB ⊂ D but rB 6⊂ A. Proof of Proposition (ii). Since N is multiplicative, it suffices to consider I = P , a prime ideal. Then OL/P OL m OK /P where OL/P OL is a vector space over OK /P since the latter is a field. Our goal is to prove that the dimension of this vector space is n. First let us establish that the dimension is bounded above by n, i.e. [OL/P OL : OK /P ] ≤ n. Let α1, α2, . . . , αn+1 ∈ OL. We then want to show that the corresponding elements in OL/P OL are linearly dependent over OK /P . Note first of all that the elements are linearly dependent over K, since [L : K] = n, and hence they are also linearly dependent over OK (to see this, find a common denominator of the linear sum in K). Hence β1α1 + β2α2 + ... + βn+1αn+1 = 0 Date: February 6th, 2018. 9.1 Proof, continued 31 for some βi ∈ OK . It is then our goal to show that not all βi ≡ 0 (mod P ). Applying the lemma with A = P and B = (β1, β2, . . . , βn+1), there exists some r ∈ K such that rB ⊂ OK , i.e. rβi ∈ OK for all i, but at the same time rB 6⊂ P , i.e. not all rβi ≡ 0 (mod P )¿ Hence (rβ1)α1 + (rβ2)α2 + ... + (rβn+1)αn+1 = 0 and we are done. Using this we can prove that in fact [OL/P OL : OK /P ] = n. Consider P ∩ Z = pZ, with p a prime number. Then e1 e2 er pOK = P1 P2 ··· Pr , and one of them must be equal to P , say P1 = P . Let fi = f(Pi/p), whence r Y ei pOL = pOK ·OL = Pi OL i=1 whence by the first part of the proposition r Y ei NL(pOL) = NL(PiOL) . i=1 Set ni = [OL/PiOLLOK /Pi]. Then ni ≤ n for all i by the first claim above, ni fi and NL(PiOL) = NK (Pi) , and moreover NK (Pi) = p . Hence r r Y niei Y finiei NL(pOL) = NK (Pi) = p . i=1 i=1 But we also have O mn N (pO ) = L = Z = pmn, L L mn pOL pZ where m = [K : Q], whereby r X finiei = nm, i=1 and since the theorem is true for the base field being Q we have r X eifi = m, i=1 so, since ni ≤ n, we get ni = n for all i. We are finally ready to prove the general case of the theorem. e1 e2 er Proof of Theorem. Consider the factorisation P OL = q1 q2 ··· qr in L. By n (ii) in the proposition, NL(P OL) = NK (¶) , where n = [L : K]. At the same time r r Pr Y ei Y fiei fiei NL(P OL) = NL(qi) = NK (P ) = NK (P ) i=1 , i=1 i=1 so r X eifi = n. i=1 RAMIFIED PRIMES 32 We end the lecture by proving the last part of the proposition. Proof of Proposition (iii). Let M be a normal extension of Q containing K, say M n K m Q and let σ : K,→ C extend toσ ¯ : M,→ C (actuallyσ ¯ : M,→ M since M is normal). Moreoverσ ¯(OM ) = OM , hence NM (αOM ) = N(σ(α)OM ) for α ∈ K. Let a = NK/Q(α) = σ1(α)σ2(α) ··· σm(α) ∈ Z. Then NM (aOM ) = NM (σ1(α)OM )NM (σ2(α)OM ) ··· NM (σm(α)OM ) and since the extension is Galois all the embeddings just permute themselves, so this is equal to m mn NM (αOM ) = NK (αOK ) . mn mn Now since a ∈ Z we have NM (aOM ) = NQ(aZ) = |a| , whence NK (αOK ) = |a| = |NK/Q(α)|. Lecture 10 Ramified Primes 10.1 When Do Primes Split Proposition 10.1.1. Let L ⊃ K be a Galois extension, and let G = Gal(L/K). Then G acts transitively on the set of prime ideals q of OL lying over P ⊂ OK . e1 e2 er The idea, then, is that P OL = q1 q2 ··· qr and if σ ∈ Gal(L/K), then σ : L → L and σ(OL) = OL. The proposition says that for each pair qi and qj there exists some σ ∈ Gal(L/K) such that σ(qi) = qj. 0 Proof. Suppose there exists q and q being prime ideals in OL lying above P such that q 6= σ(q0) for all σ ∈ G. Then by the Chinese remainder theorem there exists some x ∈ OL such that ( x ≡ 0 (mod q) x ≡ 1 (mod σ(q0)) for all σ ∈ G. Now Y NL/K (x) = σ(x) ∈ q ∩ K = P, σ∈G meaning that x 6∈ σ(q0) for all σ ∈ G, so σ−1(x) 6∈ q0 for every σ ∈ G. Date: February 8th, 2018. 10.1 When Do Primes Split 33 Therefore σ(x) 6∈ q0 for every σ ∈ G, so Y 0 NL/K (x) = σ(x) 6∈ q σ∈G since q0 is prime, but q0 ∩ K = P , so this is a contradiction. Corollary 10.1.2. Suppose L ⊃ K is a Galois extension, and let P ⊂ OK be a nonzero prime ideal. Let q1, q2 ⊂ OL be two prime ideals lying over P . Then e(q1/P ) = e(q2/P ) and f(q1/P ) = f(q2/P ), and hence [L : K] = ref where r is the number of primes q ⊂ OL lying above P . The fundamental question we wish to ask and answer is this: Let K be a number field. For which primes p ∈ Z is p ramified in K, i.e. e(q/p) > 1 for some q ∈ OK ? The answer is very simple to state, but not nearly as simple to prove: p is ramified in K if and only if p | disc(K). We will prove the forward direction today, and the converse much, much later. Theorem 10.1.3. Let p be a prime in Z. Suppose p is ramified in a number field K. Then p | disc(K). Proof. Let q ∈ OK lie above p with e(q/p) > 1. We have e e1 e2 er e−1 e1 e2 er pOK = q q1 q2 ··· qr = q(q q1 q2 ··· qr ). | {z } = I Then pOK = qI with pOK ( I and I is divisible by all primes in OK lying above p by construction. Let L be a Galois extension of Q such that K ⊂ L, and let σ1, σ2, . . . , σn be the embeddings of K in C. Each of them extends to L, call itσ ¯i. Let α1, α2, . . . , αn be any integral basis for OK , and let d = disc(K) = disc(α1, α2, . . . , αn). Since pOK ( I, we can find α ∈ I \ pOK . Write α = m1α1 + m2α2 + ... + mnαn with mi ∈ Z, not all mi ≡ 0 (mod p) since α 6∈ pOK . Without loss of generality, say m1 6≡ 0 (mod p), i.e. p - m1. Then 2 disc(α, α2, α3, . . . , αn) = disc(m1α1, α2, . . . , αn) = m1 disc(α1, α2, . . . , αn). It then suffices to show p | disc(α, α2, . . . , αn) since p - m1. Now fix any prime P ⊂ OL lying above p. Then α ∈ I means that α ∈ qi ⊂ OK lying above p for every i, and so α ∈ Pi ⊂ OL lie above p for every i too. Hence α ∈ P1 ∩ P2 ∩ ... ∩ Ps whereby σ(α) ∈ P1 ∩ P2 ∩ ... ∩ Ps for every σ ∈ Gal(L/Q). Fix P = P1. Then σ(α) ∈ P for all σ ∈ Gal(L/Q), and in particular σi(α) ∈ P for σi : K,→ C. 10.1 When Do Primes Split 34 Hence 2 m1 disc(α1, α2, . . . , αn) = disc(α, α2, . . . , αn) ∈ P ∩ Z = pZ where the left-hand side is in Z, so p | disc(α, α2, . . . , αn). Corollary 10.1.4. Let K = Q(α) with α ∈ OK . Let f(x) be the monic minimal 0 polynomial of α over Q. Suppose p ∈ Z is a prime such that p - NK/Q(f (α)). Then p is unramified in K. Proof. We have K = Q(α) ⊃ OK ⊃ Z[α], and so 2 disc(α) = [OK : Z[α]] disc(K), 0 0 where the left-hand side is ± NK/Q(f (α)). Hence p - NK/Q(f (α)) implies p - disc(K), and so by the contrapositive of the theorem p is unramified. Corollary 10.1.5. There are only finitely many primes in Z that are ramified in a number field K. Corollary 10.1.6. Let L ⊃ K be number fields. Then there are only finitely many prime ideals that are unramified in OL. √ Example 10.1.7. Let K = Q( D), with D being square-free. We know that ( √ [ D], if D 6≡ 1 (mod 4), Z √ OK = 1+ D Z[ 2 ] if D ≡ 1 (mod 4), along with ( 4D, if D 6≡ 1 (mod 4), disc(K) = D if D ≡ 1 (mod 4). Let p ∈ Z be prime. Then 2 P if p | disc(K), disc(K) pOK = P if p - disc(K), p = −1, disc(K) P1P2 if p - disc(K), p = 1. We call the second case inert and the third case split. N 2πi/m Example 10.1.8. Consider the cyclotomic field K = Q(ξm) where ξm = e , and m ≥ 3. We know that OK = Z[ξm]. First suppose m = pr, with p ∈ Z a prime. Then we know that disc(K) | mϕ(m) = pn for some power n, and so disc(K) = pv for some power v. Hence p is the only prime ramified in K. r1 r2 Next suppose m = p1 p2 , with p1 6= p2 primes in Z. Then as we know K L1 = (ξ r1 ) L2 = (ξ r2 ) Q p1 Q p2 n1 n2 Q THE IDEAL CLASS GROUP 35 n2 n1 n1 n2 and disc(K) = disc(L1) disc(L2) = p2 p1 so p is ramified in K if and only r1 r2 if p | m where m = p1 p2 for some powers r1 and r2. Inductively, for any m ≥ 3, p is ramified in Q(ξm) if and only if p | m. N Note that if K = Q(ξm) we also have that K is a Galois extension of Q, and so e pOK = (q1q2 ··· qr) and f = f(qi/p) for all i, and so ref = ϕ(m). Now let us see if we can determine r, e, and f. The first case to consider is p - m. Then p is unramified in K, so e = 1, and we claim that f is the multiplicative order of p modulo m. The way to think about this is that if gcd(p, m) = 1, then p ∈ (Z/mZ)×, whereby pf ≡ (mod m), so r = ϕ(m)/f. So assume first that m = qk for q ∈ Z a prime such that q 6= p. Then if Q lies above P in OK , then OK /Q = Fpf f Z/pZ = Fp p and Gal(Fpf /Fp) = (τ) is cyclic, and in particular τ(x) = x is the Frobenius automorphism, and naturally |(τ)| = f. ∼ × a On the other hand, Gal(K/Q) = (Z/mZ) by the isomorphism σ1(x) = x p corresponding to a (mod m), and in particular σp(x) = x corresponding to p (mod m). Lecture 11 The Ideal Class Group 11.1 When are Primes Ramified in Cyclotomic Fields Let K = Q(ξm) and let p ∈ Z be a prime. Then p is ramified in K if and only if p | m. e We know that K is a Galois extension of Q, so pOK = (q1q2 ··· qr) , and f = f(q1/p) = ... = f(qr/p). We therefore want to determine r, e, and f, knowing that ref = n = ϕ(m). Let us first consider the case p - m, so that p is unramified, meaning that e = 1, and so pOK = q1q2 ··· qr. We claim that f is the multiplicative order of p modulo m. × This makes sense since p - m means that (p, m) = 1, and thereby p ∈ Zm, and we claim that f is the smallest positive integer such that pd ≡ 1 (mod m). k Assume m = q for q ∈ Z a prime, but p 6= q. Let ξ = ξqk . We then have ∼ OK /q1 = Fpf f ∼ Z/pZ = Fp p and Gal(Fpd /Fp) = (τ), where τ : Fpf → Fpf , τ(x) = x is the Frobenius automorphism, and of course |(τ)| = f. We also have Gal(K/Z) =∼ (Z/mZ)×, a p by σa(ξ) = ξ ↔ a (mod m), and so in particular σp(ξ) = ξ ↔ p (mod m). Date: February 11th, 2018. 11.2 The Ideal Class Group and Unit Group 36 The order of p modulo m is equal to the order of σp in Gal(K/Q). Hence we want to show that the order of σp in Gal(K/Q) is in turn equal to the order of τ in Gal(Fpf /Fp). a a This is the same as saying σp = Id if and only if τ = Id, for every a ∈ N. a a pa First σp = Id is equivalent with σp (ξ) = ξ if and only if ξ = ξ which is the case if and only if pa ≡ 1 (mod m) since ξ is a primitive mth root of unity. a a pa Secondly, τ = Id if and only if τ (ξ) ≡ ξ (mod q1), if and only if ξ ≡ ξ a (mod q1). We claim that this, in turn, is equivalent with p ≡ 1 (mod m). The reverse direction is trivial since ξ is an mth root of unity. pa The forward direction is a little trickier. Assume ξ ≡ ξ (mod q1), and a write pa ≡ b (mod m) for 0 ≤ b < m, meaning that ξp = ξb. Then b 6= 0 since otherwise ξ = 1 (and indeed p | m if it’s the case). × × m pa−1 We have ξ ∈ OK = Z[ξ] since N(ξ) = 1 (and ξ = 1). So ξ ≡ 1 b−1 (mod q1), implying that ξ ≡ 1 (mod q1). Then (1 − ξ)(1 − ξ2) ··· (1 − ξm−1) = m since xm−1 + xm−2 + ... + x + 1 = (x − ξ)(x − ξ2) ··· (x − ξm−1) so we plug in x = 1. b−1 Now if b > 1, 1−ξ appears somewhere in the above, whence m ∈ q1 ∩Z = pZ, but this is a contradiction since p - m. Therefore b = 1. So the order of σp is the same as the order of τ. This same argument works for any (m, p) = 1¡ so for p - m, with e = 1, we know f, and therefore we can find r. Now consider instead the case p | m, whereby p is ramified in K, so e pOK = (q1q2 ··· qr) k ϕ(m) with e > 1. Assume m = p . We claim pOK = (1 − ξ) . We have N(1 − ξ) = p, and moreover Y k Y 1−ξk ϕ(m) N(1 − ξ) = (1 − ξ ) = 1−ξ × (1 − ξ) = u(1 − ξ) , ∈OK (k,m)=1 (k,m)=1 1≤k ϕ(m) ϕ(m) so pOK = u(1−ξ) OK = (1−ξ) , so ref = ϕ(m) and e ≥ ϕ(m) (because (1−ξ) could in principle split), but since the we have ref = ϕ(m) we must have e = ϕ(m) and so r = f = 1, meaning that (1 − ξ) is a prime ideal in OK . The case we haven’t dealt with is the possibility that m = pk · n, where (n, p) = 1. 11.2 The Ideal Class Group and Unit Group Let K be a number field and let IK denote the set of fractional ideals in K and PK denote the set of principal fractional ideals in K. The class group is Cl(K) = IK /PK , and the class number is h(K) = # Cl(K). The goal of this discussion is to show that h(K) < ∞. Theorem 11.2.1. Let K be a number field. Then there exists λ > 0 (depending only on K) such that for every nonzero I ⊂ OK there exists a nonzero α ∈ I with |NK/Q(α)| ≤ λ NK (I). 11.2 The Ideal Class Group and Unit Group 37 Proof. Let n = [K : Q]. Let α1, α2, . . . , αn be an integral basis for OK and let σ1, σ2, . . . , σn be the embeddings of K in C. Set n n Y X λ = |σi(αj)| i=1 j=1 depending on K. n For any nonzero ideal I ⊂ OK , let m ∈ N such that m ≤ NK (I) = |OK /I| < (m + 1)n. n Consider the following (m + 1) elements in OK defiend by n X mjαj j=1 with 0 ≤ mj ≤ m. By construction this is a larger set of elements than NK (I), so two must be equal modulo I. Take the difference of those two elements, call it n X rjαj ∈ I j=1 with |rj| ≤ m, and call this element our α. Then we have n n n n Y X Y X |N (α)| = r σ (α ) ≤ mn |σ (α )| ≤ λ N (I) K/Q j i j i j K i=1 j=1 i=1 j=1 since mn ≤ NK (I) by construction. Corollary 11.2.2. Every ideal class of K contains an integral ideal J ⊂ OK with NK (J) ≤ λ, with λ as above. −1 Proof. Given an ideal class C ∈ Cl(K) = IK /PK , we know that C exists since the class group is indeed a group. Fix any integral ideal I ∈ C−1. By the previous theorem there exists some nonzero α ∈ I such that |NK/Q(α)| ≤ λ NK (I). Note that (α) ⊂ I ⊂ OK , meaning that (α) = IJ for some J ⊂ OK . Consider this equation modulo PK , i.e. in IK /PK , we get IJ ≡ 1 (mod PK ) since (α) is principal. Now I is a representative for the coset C−1 and J one for C, so J ∈ C. Then |NK/Q(α)| = N((α)) = N(I) N(J) ≤ λ N(I) and so N(J) ≤ λ. Corollary 11.2.3. The class number h(K) is finite. Proof. There are only finitely many primes p ∈ Z such that |p| < λ, and above each of those there is a finite number of ideals J, whose norms are greater than the size of the prime they lie above, and so there are only finitely many ideals in OK with norm less than λ. Hence by the above corollary there are only finitely many ideal classes, since each ideal class contains an integral ideal with norm less than or equal to λ. MINKOWSKI’S THEOREM 38 √ √ Example√ 11.2.4. Let K = Q( 2), so OK = Z[ 2], with the integral basis { 1, 2 }. Then √ √ √ λ = (1 + 2)(1 + |− 2|) = (1 + 2)2 ≈ 5.8 so the only candidate√ primes we need consider are 2, 3, and 5. 2 Now 2OK = ( 2) factors, but 3OK and 5OK are prime ideals. For the 2 latter two we must therefore have e = 1 and f = 2, and N(3OK ) = 3 = 9 along 2 with N(5OK ) = 5 = 25, both of which exceed λ. √ √ Hence the ideals in OK with norms less than λ are OK = (1), 2OK = ( 2), and 2OK = (2), all of which are principal, so OK is a principal ideal domain. N Lecture 12 Minkowski’s Theorem 12.1 Using Geometry to Improve λ We showed last time that given a number field K, there exists some λ (depending on K) such that for every nonzero ideal I ⊂ OK there is some nonzero α ∈ I for which |NK/Q(α)| ≤ λ NK (I). Last time we showed that n n Y X λ = |σi(αj)| i=1 j=1 works, where { αi } is an integral basis for OK and σj : K,→ C are the embed- dings of K into C. The choice of this λ, as we saw at the end last time, decides how many primes we need to study in order to tell if OK is a principal ideal domain or not. It is therefore in our interest to sharpen λ, making it smaller. Definition 12.1.1 (Lattice). A lattice Λ in V = Rn is an additive subgroup of V of the form Λ = Zv1 + Zv2 + ... + Zvn with v1, v2, . . . , vn linearly independent over R. Example 12.1.2. The standard lattice that comes to mind is the integer lattice, 2 2 i.e. in V = R we have Λ = Z = Ze1 + Ze2, where e1 = (1, 0) and e2 = (0, 1). N Let K be a number field of degree n, and let σ1, σ2, . . . , σr be the real embeddings of K, with τ1, τ1, . . . , τ2, τs being the complex embeddings of K Consider the following function ϕ: K → Rn defined by ϕ(α) = (σ1(α), σ2(α), . . . , σr(α), Re τ1(α), Im τ1(α),..., Re τs(α), Im τs(α)). This is an additive homomorphism with ker ϕ = { 0 }, and hence ϕ is an em- bedding. Date: February 15th, 2018. 12.1 Using Geometry to Improve λ 39 Proposition 12.1.3. Let K be a number field with ring of integers OK . Then n Λ = ΛOK = ϕ(OK ) is a lattice in R . Proof. Let α1, α2, . . . , αn be an integral basis for OK , and set vi = ϕ(αi) for i = 1, 2, . . . , n. Then Λ = Zv1 + Zv2 + ... + Zvn, since OK = Zα1 + Zα2 + ... + Zαn. We need to verify that vi are linearly independent over R, so let us compute the determinant of the matrix made up of them as rows. Let v1 v2 δ = det . . vn which expanded is σ1(α1) ··· σr(α1) Re τ1(α1) Im τ1(α1) ··· Re τs(α1) Im τs(α1) . . . . . σ1(αn) ··· σr(αn) Re τ1(αn) Im τ1(αn) ··· Re τs(αn) Im τs(αn) If we multiply the first Im column by i, and then add the Im-column to the Re-column, we get σ1(α1) ··· σr(α1) τ1(α1) i Im τ1(α1) ··· Re τs(α1) Im τs(α1) 1 . . . . . i σ1(αn) ··· σr(αn) τ1(αn) i Im τ1(αn) ··· Re τs(αn) Im τs(αn) Next add −2 times the τ1-column to the i Im-column, whence we have σ1(α1) ··· σr(α1) τ1(α1) τ1(α1) ··· Re τs(α1) Im τs(α1) 1 . . . . . −2i σ1(αn) ··· σr(αn) τ1(αn) τ1(αn) ··· Re τs(αn) Im τs(αn) Repeat this for tall the columns corresponding to complex embeddings to get σ1(α1) ··· σr(α1) τ1(α1) τ1(α1) ··· τs(α1) τs(α1) 1 . . . s . . (−2i) σ1(αn) ··· σr(αn) τ1(αn) τ1(αn) ··· τs(αn) τs(αn) But the determinant there is the discriminant of K, so 1 |δ|2 = |disc(K)|= 6 0 22s and so δ 6= 0, meaning that vi are indeed linearly independent over R. Definition 12.1.4 (Fundamental parallelotope). A fundamental parallelo- tope for Λ is the set n n o X D = aivi 0 ≤ ai < 1 . i=1 We denote D = Rn/Λ. 12.1 Using Geometry to Improve λ 40 Example 12.1.5. With V = R2 and Λ = Z2, the fundamental parallelotope is the unit square between (0, 0), (1, 0), (1, 1), and (0, 1), with the right and top edges excluded. N Note that G V = (λ + D) λ∈Λ meaning that shifted D tile the space V . Proposition 12.1.6. We have v1 v2 Vol( n/Λ) = det R . . vn n and Vol(R /Λ) is independent of the choice of vi. These are elementary linear algebra considerations. The first one can in fact be taken as the definition of the determinant, and the second one follows from there being a transformation between any two integral bases whose determinant is ±1. n Theorem 12.1.7. The map ϕ: K,→ R sends OK to a lattice Λ = ϕ(OK ) p with Vol(Rn/Λ) = 2−s |disc(K)|. Moreover ϕ(K) = Qv1 + Qv2 + ... + Qvn is dense in Rn. n Let V = R and assume Λ1 ⊂ Λ2 ⊂ V are lattices. Then Λ2/Λ1 is a finite group, and n n Vol(R /Λ1) = Vol(R /Λ2) · |Λ2/Λ1|. 2 Example 12.1.8. Consider V = R and Λ2 = Z × Z ⊃ Λ1 = 2Z × Z, i.e. in the second one we only have even integers in our x-coordinates. Then the volume of the first fundamental parallelotope is 1·1 = 1, and the second one is 2·1 = 2, and the factor between them of course is 2. N We are now almost at the point where this excursion in geometry ties into our number theory: n Example 12.1.9. Let ϕ: K,→ R be as before, and let Λ = ϕ(OK ) be the same lattice. Let I ⊂ OK be an ideal. Then ΛI = ϕ(I) ⊂ Λ is a sublattice, and so n n n Vol(R /ΛI ) = Vol(R /Λ) · |OK /I| = Vol(R /Λ) NK (I) 1 = p|disc(K)| N (I). 2s K N Let us now define a special norm on Rn depending on K, a number field of degree n with r real embeddings and 2s complex embeddings. For x = n (x1, . . . , xr, xr+1, . . . , rn) ∈ R we define 2 2 2 2 2 2 N(x) = x1x2 ··· xr(xr+1 + xr+2)(xr+3 + xr+4) ··· (xn−1 + xn) 12.1 Using Geometry to Improve λ 41 n which is of note since if ϕ: K,→ R by α 7→ ϕ(α) = x, then NK/Q(α) = N(x) since r s Y Y NK/Q(α) = σi(α) τj(α)τj(α). i=1 j=1 Note that whilst we defined this in terms of a number field K, we could equally well forget about the number field and define such a norm on Rn for any partition n = r + 2s. Theorem 12.1.10 (Minkowski). Let n = r + 2s, r, s ∈ Z. For any lattice Λ ⊂ Rn there exists a nonzero x ∈ Λ such that n! 8 s |N(x)| ≤ Vol( n/Λ). nn π R Notice the complete lack of number fields above. It works for any partition n = r + 2s; it’s pure analysis. Theorem 12.1.11. For every nonzero ideal I ⊂ OK , there exists a nonzero α ∈ I such that n! 4 s |N (α)| ≤ p|disc(K)| N (I). K/Q nn π K Proof. Simply apply Minkowski’s theorem to ΛI = ϕ(I). In other words we can use n! 4 s λ = p|disc(K)| nn π in the type of considerations made last time: Corollary 12.1.12. Every ideal class of K contains an integral ideal J ⊂ OK with n! 4 s N (J) ≤ p|disc(K)|. K nn π Remark 12.1.13. The quantity n! 4 s nn π is called the Minkowski constant and notably decays very, very quickly, mean- ing that for number fields of fixed discriminant, as n → ∞ almost all of them are principal ideal domains. Example 12.1.14. Let K = Q(ξ5) with OK = Z[ξ5] and [K : Q] = ϕ(5) = 4 = n. We have disc(K) = 55−2 = 125 since s = 2, and so 4! 4 2√ N (J) ≤ 125 < 2 K 44 π and therefore NK (J) = 1, so J = OK , meaning that every ideal class contains OK , and therefore OK is a principal ideal domain. N Let us repeat the computation from last time as well, to show how much better this λ is. TOWARD MINKOWSKI’S THEOREM 42 √ Example 12.1.15. Let K = Q( 2), so s = 0 and n = 2, and disc(K) = 4 · 2 = 8. Then 2! 4 0√ √ N (J) ≤ 8 = 2, K 22 π so NK (J) = 1, and OK is a principal ideal domain. Notice how this time we didn’t have to try to factor a single prime ideal in OK . N Corollary 12.1.16. For K 6= Q, i.e. [K : Q] > 1, we have |disc(K)| > 1. Proof. Simyply use the last corollary. Since NK (J) ≥ 1, we have n! 4 s 1 ≤ p|disc(K)| nn π which rearranged becomes nn π s p|disc(K)| ≥ > 1 n! 4 for every n > 1. It remains to prove Minkowski’s theorem. This will take some work, but first: Lemma 12.1.17. Let Λ ⊂ Rn be a lattice. Let E be a convex, measurable, symmetric subset of Rn. If Vol(E) > 2n Vol(Rn/Λ), then E contains a nonzero element of Λ. Moreover if E is also compact, then the assumption can be weakened to Vol(E) ≥ 2n Vol(Rn/Λ). Before we prove this, let us first recall what the various properties above mean. We call a set convex if for every x, y ∈ E we also have ax + (1 − a)y ∈ E for every 0 ≤ a ≤ 1, i.e. the line segment between any two points in the set is also contained in the set. By measurable we mean a Lebesgue measurable set, so it is in the σ-algebra generated by the opens sets in Rn. When we say symmetric we mean that if x ∈ E then −x ∈ E as well. Lecture 13 Toward Minkowski’s Theorem 13.1 Proving Minkowski’s Theorem We’ll start by proving the lemma stated at the end of last lecture. Proof. Let F = Rn/Λ be the fundamental parallelotope for Λ. Then n G R = x + F x∈Λ meaning that 1 G 1 E = E ∩ (x + F ) . 2 2 x∈Λ Date: February 20th, 2018. 13.1 Proving Minkowski’s Theorem 43 By assumption, 1 X X Vol(F ) < Vol(E) = Vol(1/2E) = Vol(1/2E∩(x+F )) = Vol((1/2E−x)∩F ) 2n x∈Λ x∈Λ since the Lebesgue measure is translation invariant. Hence (1/2E −x)∩(1/2E − y) 6= ∅ for some x, y ∈ Λ, x 6= y, since otherwise the volume would be less than or equal to Vol(F ), which is a contradiction. So 1/2e1 − x = 1/2e2 − y for some e1, e2 ∈ E, meaning that 1 1 1 x − y = e − e = (e + (−e )) ∈ E 2 1 2 2 2 1 2 since E is symmetric, so −e2 ∈ E, and E is convex, with the last step above being a convex combination of e1 and −e2. Moreover x − y ∈ Λ since Λ is a group, whence x − y 6= 0 is in Λ ∩ E. Suppose now that E is compact and Vol(E) ≥ 2n Vol(Rn/Λ). For m = 1, 2,..., we have n n n Vol((1 + 1/m)E) = (1 + 1/m) Vol(E) > 2 Vol(R /Λ), so by the above there exists some nonzero xm ∈ Λ ∩ (1 + 1/m)E. ∞ Hence { xm }m=1 ⊂ Λ ∩ (2E), where 2E is compact (it’s closed and bounded n in R ), and Λ ∩ (2E) is finite since Λ is discrete. Hence { xm } takes finitely many values, meaning that there exists some subsequence { xk } with xk = x for all k. Therefore x = xk ∈ (1 + 1/k)E for all k, so when in the limit (1 + 1/k)E goes to E = E since E is compact. Hence x ∈ E ∩ Λ. Corollary 13.1.1. Let A be a compact, convex, symmetric subset of Rn with Vol(A) > 0 and |N(a)| ≤ 1 for all a ∈ A, where by N we mean the special norm on Rn defined by n = r + 2s as before. Then for every lattive Λ ⊂ Rn there exists some nonzero x ∈ Λ with 2n |N(x)| ≤ Vol( n/Λ). Vol(A) R Proof. Let E = tA where 2n tn = Vol( n/Λ). Vol(A) R Then n n n Vol(E) = Vol(tA) = t Vol(A) = 2 Vol(R /Λ). By the lemma, there exists a nonzero x ∈ Λ ∩ E such that x = ta for some a ∈ A. Then 2n |N(x)| = |N(ta)| = |tn N(a)| ≤ Vol( n/Λ). Vol(A) R Theorem 13.1.2 (Minkowski). For every lattice Λ ⊂ Rn there exists a nonzero x ∈ Λ such that n! 8 s |N(x)| ≤ Vol( n/Λ) nn π R where n = r + 2s. 13.1 Proving Minkowski’s Theorem 44 Proof. Define the set A by q q 2 2 2 2 |x1| + |x2| + ... + |xr| + 2( xr+1 + xr+2 + ... + xn−1 + xn) ≤ n, where n = r + 2s. Then A is compact (it’s certainly closed and bounded), it is convex (it suffices to study the middle points between any two points), and it’s trivially symmetric. We also have Vol(A) > 0 since we at the very least contain, say, some small intervals in each coordinate. Moreover for a ∈ A we ahve |N(a)| ≤ 1 since by the AM-GM inequality q q 2 2 2 2 |x1| + |x2| + ... + |xr| + 2( xr+1 + xr+2 + ... + xn−1 + xn) 1 ≥ n 2 2 2 2 1/n 1/n ≥ (|x1||x2| · · · |xr|(xr+1 + xr+2) ··· (xn−1 + xn)) = N(a) . We now claim nn π s Vol(A) = 2r . n! 2 Then the theorem follows from the corollary above. r+2s Let Vr,s(t) be the volume of the subset of R defined by q q 2 2 2 2 |x1| + |x2| + ... + |xr| + 2( xr+1 + xr+2 + ... + xn−1 + xn) ≤ t. r+2s Then Vol(A) = Vr,s(n) and by scaling Vr,s(t) = t Vr,s(1). We will show that 1 π s V (1) = 2r . r,2 (r + 2s)! 2 Now if r > 0, we have could move |xr| to the right-hand side and then bound by 1 − t, so Z 1 Z 1 r−1+2s Vr,s(1) = 2 Vr−1,s(1 − t) dt = 2 (1 − t) Vr−1,s(1) dt 0 0 2 = V (1), r + 2s r−1,s which by induction becomes 2r V (1) = V (1). r,s (r + 2s)(r + 2s − 1) ··· (2s + 1) 0,s Note that if s = 0 we define V0,0(1) = 1 (since V1,0(1) = 2 = 2V0,0(1)), and we are done. Now if s > 0, V0,s(1) is given by q q 2 2 2 2 2( xr+1 + xr+2 + ... + xn−1 + xn) ≤ 1, which if we move one of the square roots over yields ZZ p 2 2 V0,s(1) = V0,s−1(1 − 2 x + y ) dx dy R 13.2 The Dirichlet Unit Theorem 45 where R = { x2 + y2 ≤ 1/4 } is the circle of radius 1/2. We therefore switch to polar coordinates, x = ρ cos(θ) and y = ρ sin(θ), with dx dy = ρ dρ dθ, whence our integral becomes Z 2π Z 1/2 V0,s(1) = V0,s−1(1 − 2ρ)ρ dρ dθ 0 0 Z 2π Z 1/2 2(s−1) = (1 − 2ρ) V0,s−1(1)ρ dρ dθ 0 0 π 1 = V (1), 2 (2s)(2s − 1) 0,s−1 so by induction π s 1 V (1) = V (1) 0,s 2 (2s)(2s − 1) ··· 2 · 1 0,0 whence 1 π s V (1) = 2r . r,s (r + 2s)! 2 13.2 The Dirichlet Unit Theorem × Let K be a number field and OK its ring of integers. Let OK be the set of units in OK . Next let σ1, σ2, . . . , σr be the real embeddins of K and τ1, τ2, . . . , τs be the distinct nonconjugate compled embeddings of K, with n = r + 2s. Define a logarithm map log: K× → Rr+2 by log(α) = log|σ1(α)|, log|σ2(α)|,..., log|σr(α)|, 2 log|τ1(α)|,..., 2 log|τs(α)| . It is easy to check log(αβ) = log(α)+log(β), making it a multiplicative-additive group homomorphism. × r+s × Therefore log(OK ) is an additive subgroup of R . For α ∈ OK , we have 2 2 1 = |NK/Q(α)| = |σ1(α)σ2(α) ··· σr(α)|τ1(α)| · · · |τs(α)| |, means that 0 = log|σ1(α)| + ··· + log|σr(α)| + 2 log|τ1(α)| + ... + 2 log|τs(α)| which means that × r+s log(OK ) ⊂ H = { x ∈ R | x1 + x2 + ... + xr+s = 0 }, a hyperplane with dimR H = r + s − 1. Proposition 13.2.1. Consider log| × , by which we mean the logarithm re- OK × stricted to OK . Then (i) ker(log) = µ(K) is the group of roots of unity in K, which is finite. (ii) Im(log) is a lattice in H. Hence Im(log) =∼ Zr+s−1. DIRICHLET UNIT THEOREM 46 × ∼ r+s−1 Corollary 13.2.2 (Dirichlet unit theorem). We have OK = µ(K) × Z . To prove these, we first need the following lemma. Lemma 13.2.3. The number of elements on OK with bounded embeddings is finite, i.e. #{ α ∈ OK | |σi(α)| < M, |τj(α)| < M, i = 1, 2, . . . , r, j = 1, 2, . . . , s } is finite for every M > 0. Proof. Note that α ∈ OK is a root of a monic polynomial in Z[x] of degree less than or equal to n, with coefficients bounded by n max M i 1≤i≤n i by the binomial theorem. There exists only finitely many such polynomials, and therefore only finitely many such roots. Lecture 14 Dirichlet Unit Theorem 14.1 Dirichlet Unit Theorem We start by proving the proposition stated last time. Proof. The first part, that µ(K) ⊂ ker(log), is trivial, since roots of unity have absolute value 1, so the constituent logarithms all become 0. For the opposite inclusion, ker(log) ⊂ µ(K), consider × ker(log) = { α ∈ OK | |σi(α)| = 1, |τj(α)| = 1, i = 1, 2, . . . , r, j = 1, 2, . . . , s }. By the lemma this is finite, i.e. # ker(log) < ∞. So if we pick some α ∈ ker(log), the set { α, α2, α3,... } must be finite—all of these elements are in the kernel, since log is a homomorphism. Hence αm = αh for some m > h, so αm−h = 1, whence α is a root of unity, meaning that α ∈ µ(K). × For the second part we need to show that log(OK ) is discrete, and that × log(OK ) spans H over R. For any bounded set in H, it is contained in A = [−M,M]r+s ∩ H for some −1 M. Now the pre-image log (A) in OK is contained in M M/2 { α ∈ OK | |σi(α)| < e , |τj(α)| < e , i = 1, 2, . . . , r, j = 1, 2, . . . , s } × × which by the lemma is finite. Hence log(OK ) ∩ A is a finite set, and so log(OK ) is discrete. To prove the second part we need two lemmata. Lemma 14.1.1. Fix any k, 1 ≤ k ≤ r + s. For each α ∈ OK , there exists some nonzero β ∈ OK depending on α such that 2 s |N (β)| ≤ p|disc(K)| K/Q π Date: February 22nd, 2018. 14.1 Dirichlet Unit Theorem 47 independent of α, and if log(α) = (a1, a2, . . . , ar+s) and log(β) = (b1, b2, . . . , br+1) then bi < ai for all i 6= k. 2 Proof. Take E as the subset defined by |xi| ≤ ci for i = 1, 2, . . . , r, and xr+1 + 2 2 2 ai xr+2 ≤ cr+1, . . . , xn−1 + xn ≤ cr+s, where ci are chosen to satisfy 0 < ci < e for all i 6= k, and moreover 2 s c c ··· c = p|disc(K)|, 1 2 r+s π which then gives us ck. Then r s n n Vol(E) = 2 π c1c2 ··· cr+s = 2 Vol(R /ΛOK ), n −sp where we know Vol(R /ΛOK ) = 2 |disc(K)|. This set E is convex, compact, symmetric, and Lebesgue measurable, so we can apply Minkowski’s theorem, saying that there exists a nonzero x ∈ E ∩ΛOK . n Now take β ∈ OK such that ϕ(β) = x, where ϕ: OK ,→ R ∩ E simply sends β a vector with each component being an embedding of β. Then 2 s |N (β)| = |N(x)| ≤ p|disc(K)| K/Q π and β satisfies pur requirements by the choice of ci. × Lemma 14.1.2. Fix any k, 1 ≤ k ≤ r + s. Then there exists a unit u ∈ OK such that if log(u) = (y1, y2, . . . , yr+s), then yi < 0 for all i 6= k and yk > 0 (they couldn’t all be negative, since their sum must be 0). Proof. Take any α1 6= 0 in OK and apply the last lemma repeatedly to construct α2, then α3, and so forth. Thus for each i 6= k the ith coordinate of log(αj+1) is less than the ith coordinate of log(αj). Now since 2 s N ((α )) = |N (α )| ≤ p|disc(K)| K j K/Q j π if a uniform bound, and since there exists only finitely many distinct ideals of bounded norm (since they all lie above a finite number of primes less than some bound), we must have (αj) = (αh) for some j < h, and so αj = uαj for some × unit u ∈ OK . Hence log(αh) = log(uαj) = log(u) + log(αj). Call the ith coordinate of the left-hand side ah,i and the ith coordinates of the two addends in the right-hand side yi and aj,i respectively. Then for j < h we have ah,i = yi + aj,i for i 6= k, and so ah,i < aj,i meaning that yi < 0 for every i 6= k. 14.2 Fermat’s Last Theorem 48 With this in hand we apply the second lemma for each 1 ≤ k ≤ r + s − 1 to × obtain u1, u2, . . . , ur+s−1 ∈ OK , where log(uj) has positive jth coordinate and all other coordinates negative. Set vi = log(ui) = (yi,1, yi,2, . . . , yi,r+s−1), so vi,j < 0 if i 6= j and yi,i > 0. We claim that the set { v1, v2, . . . , vr+s−1 } spans H over R. In other words we want to show that it is a linearly independent set, since the dimension of H over R is exactly r + s − 1. Suppose t1w1 + t2w2 + ... + tr+s−1wr+s−1 = 0 for ti ∈ R, not all ti = 0, where wi are the columns of the matrix whose rows are vi. We may assume tk = 1 for some k, and |ti| ≤ 1 for i 6= k (simply find the largest coefficient and divide both sides by it). Then the kth coordinate is r+s−1 X X 0 = tiyi,k = yk,k + tiyi,k i=1 i6=k where yk,k > 0 and yi,k < 0. Note that at least one tj 6= 0 for some j 6= k since otherwise we’d have 0 = yk,k > 0, which is impossible. Hence X 0 > yk,k + yi,k = 0 i6=k since log(vi) ∈ H. Hence vi are linearly independent, and there are as many of them as there are dimensions in H, so they span H. 14.2 Fermat’s Last Theorem Theorem 14.2.1 (Fermat’s last theorem). Let n ≥ 3 be an integer. Then xn + yn = zn has no nontrivial solutions in Z (nontrivial meaning xyz 6= 0). There is a direct reduction to be made by considering primes p | n, since if n = pm, then (xm)p + (ym)p = (zm)p. Hence if xn + yn = zn has a nontrivial solution, then xp + yp = zp also has a nontrivial solution. Hence by contrapositive for any off prime p, xp +yp = zp having no nontrivial solutions implies that Fermat’s last theorem is true. Now assume gcd(x, y, z) = 1 (otherwise factor the gcd out). We have two cases: First, p - xyz, and secondly p divides exactly one of x, y, and z. For the second case, if p | x, then yp ≡ zp (mod p), and so y ≡ z (mod p), so if p | x and p | y, then gcd(x, y, z) > 1. The first case is provided a partial answer by Kummer’s theorem, namely with p - h(K), for K = Q(ζp). KUMMER’S THEOREM 49 Lecture 15 Kummer’s Theorem 15.1 Using the Dirichlet Unit Theorem × ∼ r+s−1 The Dirichlet unit theorem says that OK = µ(K) × Z , which in turn is isomorphic to Z/ωZ × Zr+s−1, where |µ(K)| = ω. We call Z/ωZ the torsion part and Zr+s−1 the free part. × n1 n2 nr+s−1 For any u ∈ OK , we therefore have u = ξε1 ε2 ··· εr+s−1 , where ξ ∈ µ(K) and ni, with ε1, ε2, . . . , εr+s−1 being free generators. Then { ε1, ε2, . . . , εr+s−1 } is called a fundamental system of units. √ Example 15.1.1. Let K = Q( D) with D being square free. We have two cases: × 2 First, D < 0. We know that OK is { ±1, ±i } if D = −1, it’s { ±1, ±ξ3, ±ξ3 } is D = −3, and it’s { ±1 } otherwise. × ∼ We have n = 2, r = 0, and s = 1, so r+s−1 = 0, i.e. rank 0, so OK = µ(K). Next consider D > 0. Then n = 2, r = 2, and s = 0, so r + s − 1 = 1. We have µ(K) = { ±1 }, and so O× ∼ { ±1 } × . K = Z √ Let ε be a fundamental unit of K with ε > 1. Then ε = a + b D if D 6≡ 1 (mod 4) (and similar but with half integers for D ≡ 1 (mod 4)). Then N(ε) = a2 −b2D = ±1, and ε is a fundamental solution of this equation since all units u = ±εm for some m ∈ Z. This equation is known as Pell’s equation. N 15.2 Kummer’s Theorem Theorem 15.2.1 (Kummer’s theorem). Suppose p > 2 is prime. If p - h(K), p p p with K = Q(ξp), then there are no nontrivial solutions to x + y = z with p - xyz. Remark 15.2.2. A prime p satisfying the above is called a regular prime, so this theorem says that Fermat’s last theorem is true for all regular primes. Before we proceed we recall some properties of the cyclotomic field K = Q(ξp). 1. OK = Z[ξp], and (1 − ξp)OK is a prime ideal. n p−1 2. (1 − ξp) = (1 − ξp) = pOK , so p is totally ramified—this is what implies that (1 − ξp)O)K is prime. 3.N K/Q(1 − ξp) = ±p. ∼ 4. Z/pZ = OK /(1 − ξp) since f = 1. s 5. µ(K) = { ±ξp }0≤s≤p−1. In order to prove Kummer’s theorem, we first need the following lemma. Lemma 15.2.3. (i) For each α ∈ OK , there exists some a ∈ Z such that p p p α ≡ a (mod (1 − ξp) ). × r (ii) Every unit ε ∈ OK is of the form ε = u · ξp with u ∈ R and r ∈ Z. Date: February 27th, 2018. 15.2 Kummer’s Theorem 50 Proof. (i) By the fourth property above, there exists some a ∈ Z such that α = a + (1 − ξp)β for β ∈ OK , so p−1 X p αp = ap + an((1 − ξ )β)p−n + (1 − ξ )pβp. n p p n=1