Class Field Theory
Andrew Kobin 2013-2015 Contents Contents
Contents
0 Introduction 1
1 Algebraic Number Fields 2 1.1 Rings of Algebraic Integers ...... 2 1.2 Dedekind Domains ...... 4 1.3 Ramification of Primes ...... 8 1.4 The Decomposition and Inertia Groups ...... 12 1.5 Norms of Ideals ...... 15 1.6 Discriminant and Different ...... 17 1.7 The Class Group ...... 24 1.8 The Hilbert Class Field ...... 32 1.9 Orders ...... 42 1.10 Units in a Number Field ...... 49
2 Class Field Theory 58 2.1 Valuations and Completions ...... 58 2.2 Frobenius Automorphisms and the Artin Map ...... 65 2.3 Ray Class Groups ...... 69 2.4 L-series and Dirichlet Density ...... 75 2.5 The Frobenius Density Theorem ...... 83 2.6 The Second Fundamental Inequality ...... 89 2.7 The Artin Reciprocity Theorem ...... 94 2.8 The Conductor Theorem ...... 99 2.9 The Existence and Classification Theorems ...... 101 2.10 The Cebotarevˇ Density Theorem ...... 103 2.11 Ring Class Fields ...... 110
3 Quadratic Forms and n-Fermat Primes 115 3.1 Binary Quadratic Forms ...... 115 3.2 The Form Class Group ...... 119 3.3 n-Fermat Primes ...... 124
A Appendix 128 A.1 The Four Squares Theorem ...... 128 A.2 The Snake Lemma ...... 129 A.3 Cyclic Group Cohomology ...... 130 A.4 Helpful Magma Functions ...... 132
i 0 Introduction
0 Introduction
These notes are a product of nearly two years of research in class field theory as part of my Master’s thesis at Wake Forest University. The main topics covered are:
Algebraic number fields and their extensions
Factorization of primes in number field extensions
The class group
The Hilbert class field
Dirichlet’s unit theorem
Valuations and completions
Ray class groups
Dirichlet L-series, Dirichlet density and the proof of Dirichlet’s theorem on primes in arithmetic progression
The main theorems of class field theory:
– Artin reciprocity – The Conductor Theorem – The fundamental equality – The Existence and Classification Theorems
An extended discussion of Frobenius’ and Cebotarev’sˇ density theorems
Ring class fields and orders
Quadratic forms and n-Fermat primes
In fact the first motivation for studying these topics is to fully answer the question, as described in [7], “Given a positive integer n, when can a prime number be written in the form x2 + ny2?” The reader will see that although the question has a rather elementary statement, it requires the depth and power of class field theory to fully understand. After describing the answer to this first question, we will turn our attention to the much more difficult, and unanswered question, “Given a positive integer n, if x2 + ny2 is prime, when is y2 + nx2 also prime?” In certain sections (1.8, 2.10 and 3.3) we use the Magma Computational Algebra System to handle large or complicated computations. Many of the basic commands can be found in the Magma handbook, available at http://magma.maths.usyd.edu.au/magma/handbook/ through the University of Sydney’s Computational Algebra Group.
1 1 Algebraic Number Fields
1 Algebraic Number Fields
In the first chapter we provide a detailed description of the main topics in algebraic number theory: algebraic number fields, rings of integers, the behavior of prime ideals in extensions, norms of ideals, the discriminant and different, the class group, the Hilbert class field, orders and Dirichlet’s unit theorem.
1.1 Rings of Algebraic Integers
Let Q be an algebraic closure of Q. Then Q is an infinite dimensional Q-vector space and every polynomial f ∈ Q[x] splits in Q[x]. An example of such an algebraic closure is Q = {u ∈ C | f(u) = 0 for some f ∈ Q[x]}. Then Q ⊂ Q ⊂ C. Note that any two choices of Q are isomorphic. One of the most important elements of a number field we will be working with is:
Definition. An element α ∈ Q is an algebraic integer if it is a root of some monic polynomial with coefficients in Z. √ 2 1 Example 1.1.1. 2 is an algebraic integer since it is a root of x − 2. However, 2 , π and 1 e are not algebraic integers. We will see in a moment why 2 is not algebraic, but the proof for π and e is famously difficult.
Note that the set of algebraic integers in Q is precisely the integers Z. In a moment we will generalize this set to fields other than Q.
Definition. The minimal polynomial of α ∈ Q is the monic polynomial f ∈ Q[x] of minimal degree such that f(α) = 0.
The minimal polynomial of α is unique, as the following lemma shows.
Lemma 1.1.2. Suppose α ∈ Q. Then the minimal polynomial f of α divides any other polynomial h such that h(α) = 0.
Proof. Suppose h(α) = 0. Then by the division algorithm, h = fq + r with deg r < deg f. Note that r(α) = h(α) − f(α)q(α) = 0 so α is a root of r. But since deg f is minimal among all polynomials of which α is a root, r must be 0. This shows that f divides h.
Lemma 1.1.3. If α ∈ Q is an algebraic integer then the minimal polynomial has coefficients in Z.
Proof. Let f ∈ Q[x] be the minimal polynomial of α. Since α is an algebraic integer, there is some g ∈ Z[x] such that g(α) = 0. By Lemma 1.1.2, g = fh for some monic h ∈ Q[x]. Suppose f 6∈ Z[x]. Then there is some prime p dividing the denominator of at least one of the coefficients of f; let pi be the largest power of p that divides a denominator. Likewise let pj be the largest power of p that divides the denominator of a coefficient of h. Then pi+jg = (pif)(pjh) and reducing mod p gives 0 on the left, but two nonzero polynomials in Fp[x] on the right, a contradiction. Hence f ∈ Z[x].
2 1.1 Rings of Algebraic Integers 1 Algebraic Number Fields
An important characterization of algebraic integers is proven in the next proposition.
Pn i Proposition 1.1.4. α ∈ Q is an algebraic integer if and only if Z[α] = { i=0 ciα : ci ∈ Z, n ≥ 0} is a finitely generated Z-module.
Proof. ( =⇒ ) Suppose α is integral with minimal polynomial f ∈ Z[x], where deg f = k. Then Z[α] is generated by 1, α, . . . , αk−1. ( ⇒ = ) Suppose α ∈ Q and Z[α] is generated by f1(α), . . . , fn(α). Let d ≥ M where M = max{deg fi | 1 ≤ i ≤ n}. Then
n d X α = aifi(α) i=1
n d X for some choice of ai ∈ Z. Hence α is a root of x − aifi(x) so it is integral. i=1
1 1 Example 1.1.5. α = 2 is not an algebraic integer since Z 2 is not finitely generated as a Z-module.
Definition. For a given algebraic closure Q of Q, we will denote the set of all algebraic integers in Q by Z.
This set inherits the binary operations + and · from Q, and an important property is that Z is closed under these operations:
Proposition 1.1.6. The set Z of all algebraic integers is a ring.
Proof. Note that 0 is a root of the zero polynomial, so 0 ∈ Z. Then it suffices to prove closure under addition and multiplication. Suppose α, β ∈ Z and let m and n be the degrees of their respective minimal polynomial. Then 1, α, . . . , αm−1 span Z[α] and 1, β, . . . , βn−1 likewise span Z[β]. So the elements αiβj for 1 ≤ i ≤ m, 1 ≤ j ≤ n span Z[α, β], so this Z-module is finitely generated. This implies that the submodules Z[α + β] and Z[αβ] of Z[α, β] are also finitely generated, so it follows by Proposition 1.1.4 that α + β and αβ are algebraic integers. The two most important objects of study in algebraic number theory are number fields and their associated rings of integers, which are defined below.
Definition. A number field is a subfield K ⊂ Q such that K is a finite dimensional vector space over Q. The dimension of K/Q is called the degree of the field extension, denoted [K : Q]. Definition. The ring of integers of a number field K is
OK = K ∩ Z = {α ∈ K | α is an algebraic integer}.
Example 1.1.7. Q is the unique number field of degree 1, and its ring of integers is the rational integers Z.
3 1.2 Dedekind Domains 1 Algebraic Number Fields
Example 1.1.8. Q(i) is a number field of degree 2. Its ring of integers is Z[i], the Gaussian integers. √ √ Example 1.1.9. K = Q( √5) has ring of integers OK = Z[(1 + 5)/2]. The reader may recognize this number (1 + 5)/2 as the golden ratio.
An object we will study in Section 1.9 is:
Definition. An order in OK is any subring O ⊂ OK such that the quotient OK /O of abelian groups is finite.
Example 1.1.10. For OQ(i) = Z[i], the subring Z+niZ is an order for every nonzero n ∈ Z. However, Z ⊂ Z[i] is not an order since Z does not have finite index in Z[i].
Example 1.1.11. For K = Q(α) where α is an algebraic integer, Z[α] is an order in OK but in general Z[α] 6= OK . We study orders in further detail in Section 1.9 and see some important examples where Z[α] 6= OK .
Lemma 1.1.12. For any number field K, OK ∩ Q = Z and QOK = K. a Proof. Suppose α ∈ OK ∩ Q such that α = b in lowest terms. We may assume b > 0. Since a α is integral, Z b is finitely generated as a Z-module so b = 1. On the other hand, suppose α ∈ K with minimal polynomial f(x) ∈ Q[x], where deg f = n x n. For any positive integer d, the minimal polynomial of dα is d f d . In particular, let d n x be the least common multiple of the denominators of the coefficients of f. Then d f d has integer coeffients, so dα ∈ OK . Hence QOK = K.
Definition. A lattice in a number field K is a subset L such that QL = K and L is an abelian group of rank [K : Q]. Proposition 1.1.13. For any number field K, the ring of integers is a lattice in K.
Proof. QOK = K was proven in Lemma 1.1.12, and the second statement can be shown by choosing a basis for K consisting of elements in OK .
Corollary 1.1.14. OK is a noetherian ring.
Proof. By the proposition, OK is finitely generated as a Z-module, so it is clearly finitely generated as a ring. It is well known (cf. 2.2.7 in [25]) that this implies OK is noetherian.
1.2 Dedekind Domains
A nice property of the integers Z is unique factorization: every integer n can be written as a unique product of powers of prime numbers. This unique factorization property fails in general for rings of algebraic integers. However, OK has the special property that every nonzero ideal factors uniquely as a product of prime ideals.
Definition. An integral domain R is integrally closed in its field of fractions K if every α ∈ K that is a root of a monic polynomial f ∈ R[x] is itself in R.
4 1.2 Dedekind Domains 1 Algebraic Number Fields
Proposition 1.2.1. Z is integrally closed and for any number field K, its ring of integers OK is integrally closed.
Proof. First suppose α ∈ Q is integral over Z. Then there is some monic polynomial f(x) n in Z[x] such that f(α) = 0. If f(x) = a0 + a1x + ... + x then the ai all lie in OK , where K = Q(a0, . . . , an−1). Since OK is finitely generated as a Z-module, so is Z[a0, . . . , an−1]. Now f(α) = 0 means that we can write αn as a combination of αi for i < n, with weights ci ∈ Z[a0, . . . , an−1]. Thus Z[a0, . . . , an−1, α] is also a finitely generated Z-module. But notice that Z[α] is a submodule of Z[a0, . . . , an−1, α], so it too is finitely generated. Hence α is integral over Z, meaning α ∈ Z. This proves the first statement, and the second statement now follows easily. Suppose α ∈ K is integral over OK . Then since Z is integrally closed, α ∈ Z, implying α ∈ K ∩ Z = OK .
This property of OK is important in establishing it as a special type of domain, called a Dedekind domain.
Definition. An integral domain R is a Dedekind domain if
(1) R is noetherian.
(2) R is integrally closed in its field of fractions.
(3) Every nonzero prime ideal p ⊂ R is maximal. √ √ √ Example 1.2.2.√ Z[ 5] is not integrally closed since√ for example (1√ + 5)/2 ∈ Q( 5) is integral over Z[ 5] but is not itself an element of Z[ 5]. Therefore Z[ 5] is not a Dedekind√ domain, but as we shall see in Section 1.9 it is an order of the ring of integers Z[(1 + 5)/2]. Example 1.2.3. Any field is (trivially) a Dedekind domain.
Example 1.2.4. Z is integrally closed and every nonzero prime ideal is maximal, but Z is not noetherian and hence not a Dedekind domain.
Proposition 1.2.5. OK is a Dedekind domain.
Proof. We have shown (Corollary 1.1.14 and Proposition 1.2.1) that OK is integrally closed and noetherian, so it suffices to show that prime ideals are maximal. n n−1 Suppose p is a nonzero prime ideal of OK . Let α ∈ p and let f(x) = x +an−1x +...+a0 n n−1 be its minimal polynomial. Then f(α) = α + an−1α + ... + a1α + a0 = 0 so
n n−1 a0 = −α − an−1α − ... − a1α ∈ p.
Then a0 ∈ Z ∩ p so every element of the quotient OK /p is killed by a0, which implies OK /p is finite. Since p is prime, OK /p is an integral domain, and every finite integral domain is a field, which proves that p is maximal. Hence OK is a Dedekind domain. The crucial property of Dedekind domains is that their nonzero ideals factor uniquely into prime ideals. In fact, unique factorization holds for a more general class of objects in a Dedekind domain called fractional ideals.
5 1.2 Dedekind Domains 1 Algebraic Number Fields
Definition. Let R be a Dedekind domain and K be its field of fractions. A fractional ideal of R is a nonzero R-submodule of K that is finitely generated as an R-module.
Note that since fractional ideals are finitely generated, we can clear denominators of a generating set to realize every fractional ideal in the form
aI = {ab | b ∈ I}
where a ∈ K and I is an integral ideal of the ring R.
1 Example 1.2.6. 2 Z is a fractional ideal of Z. Lemma 1.2.7. Let R be a Dedekind domain. For every nonzero ideal I ⊂ R, there exist prime ideals p1,..., pn such that p1 ··· pn ⊂ I. Proof. Let S be the set of nonzero ideals in R that do not satisfy the conclusion of the lemma. The idea here is to use the fact that R is noetherian to show that S must be empty. Supposing to the contrary that S is not empty, the noetherian property allows us to choose a maximal element I ∈ S. If I were prime, it would trivially contain a product of primes so we know this is not the case. Then there exist a, b ∈ RrI such that ab ∈ I. Let J1 = I +(a) and J2 = I + (b). Then neither J1 nor J2 is in S since I is maximal, so each contains the product of primes, say
p1 ··· pr ⊂ J1 and q1 ··· qs ⊂ J2.
2 Then p1 ··· prq1 ··· qs ⊂ J1J2 = I + I(b) + (a)I + (ab) ⊂ I. We have shown I to contain a product of primes, producing the necessary contradiction to show that S is empty. Hence every nonzero ideal of R contains a product of primes. The critical property of fractional ideals is proven next.
Theorem 1.2.8. The set of fractional ideals of a Dedekind domain R forms an abelian group under ideal multiplication, with identity R.
Proof. The product of two fractional ideals is again finitely generated, hence a fractional ideal. Also, for any nonzero ideal I, IR = R so it suffices to show the existence of inverses. First we prove that if p ⊂ K is prime, it has an inverse. Let I = {a ∈ K | ap ⊂ R}; we will show this is an inverse of p. Fix a nonzero b ∈ p. Since I is an R-module, bI is an ideal in R. And since R ⊂ I we have p ⊂ Ip ⊂ R, but p is maximal (R is a Dedekind domain) so either p = Ip or Ip = R. If Ip = R then I is an inverse of p and we’re done. Instead suppose Ip = p. By Lemma 1.2.7 we can choose a minimal product of prime ideals p1p2 ··· pm ⊂ (b) ⊂ p. If no Q pi is contained in p then for each i there is some ai ∈ pi with ai 6∈ p, but ai ∈ p which contradicts that p is a prime ideal. Thus there is some pi ⊂ p. However, every prime is maximal so pi = p. Since m was minimal, p2p3 ··· pm 6⊂ (b) and so there is some c 6∈ (b) that c lies in p2p3 ··· pm. Then p(c) ⊂ (b) so we have d := b ∈ I. However d 6∈ R since if it were, it c would lie in (b). But note that d preserves p as an R-module – that is, dp ⊂ p since d = b – so d must be in R, a contradiction. Hence Ip = R, so every prime ideal has an inverse in R.
6 1.2 Dedekind Domains 1 Algebraic Number Fields
Now we turn to fractional ideals. Every fractional ideal is of the form aI for some a ∈ K and I an ideal of R. Since the prime ideals are maximal in R, I ⊂ p for some prime p. Multiplying both sides of this containment by p−1, we have
I ⊂ p−1I ⊂ p−1p = R.
By the same argument as above, p−1I = R so every fractional ideal has an inverse. In the next two theorems we show that unique factorization of ideals holds in any Dedekind domain. Theorem 1.2.9. Every nonzero ideal I in a Dedekind domain R can be written as a unique (up to order) product of prime ideals. Proof. Suppose I is maximal among those ideals that cannot be factored into primes. Every ideal is contained in a maximal ideal so I ⊂ p for some maximal p which is also prime. If Ip−1 = I then p−1 = R by group properties, but this is impossible. However, R ⊂ p−1 −1 −1 which implies I ( Ip . By maximality of I, Ip = p1 ··· pn for prime ideals pi. Then I = p1 ··· pnp, which shows I can in fact be written as a product of primes, contradicting our initial assumption. Hence every ideal has a prime factorization. To prove uniqueness, suppose p1 ··· pn = q1 ··· qm. If no qi is contained in p1 then for each i there is some ai ∈ qi r p1. But then a1 ··· am ∈ q1 ··· qm = p1 ··· pn ⊂ p1 which contradicts primality of p1. Thus p1 = qi for some i, and this argument can be repeated for each pj to show that pj = qi for some i. Thus the factorization is unique up to order.
Theorem 1.2.10. If I is a fractional ideal of R then there exist prime ideals p1,..., pn and q1,..., qm so that −1 I = (p1 ··· pn)(q1 ··· qm) and this factorization is unique up to order. Proof. We can clear denominators to write aI = J for some a ∈ R and J an integral ideal of R. Apply unique factorization to J and (a) and the result follows from Theorem 1.2.8. √ √ √ Example 1.2.11. Let K = Q( −6) with ring of integers OK = Z[ −6]. If ab = −6 with neither a unit, then Norm(a)Norm(b) = 6 (see√ Section 1.5). Without loss of generality let 2 2 Norm(a) = 2 and Norm(b) = 3. If a√= x + y −6 then Norm(a) = x + 6y = 2 which has no solutions in Z.√ This shows√ that −6 is irreducible, and even if a or b were a unit, the other would equal −6 so −6 would be irreducible anyways. So 6 cannot be written as a product of irreducibles in OK . However, (6) factors into prime ideals as Theorem 1.2.9 suggests: √ √ (6) = (2, 2 + −6)2(3, 3 + −6)2. This is not trivial to calculate, but we will develop the techniques required to determine such a factorization in subsequent sections. A special case of a Dedekind domain is: Definition. An integral domain R is a discrete valuation ring if it is noetherian, inte- grally closed and contains exactly one nonzero prime ideal.
7 1.3 Ramification of Primes 1 Algebraic Number Fields
In Section 2.1, we will see where the name ‘discrete valuation ring’ comes from, as well as study some of the properties of a DVR as they relate to absolute values on a field. We proved in Theorem 1.2.8 that the set of fractional ideals of a Dedekind domain forms a group under ideal multiplication, but there is an even stronger characterization. Theorem 1.2.12. Let R be an integral domain. Then the following are equivalent: (1) R is a Dedekind domain.
(2) For every prime ideal p ⊂ R, the local ring Rp is a discrete valuation ring. (3) The fractional ideals of R form a group.
(4) For every fractional ideal I ⊂ R there is an ideal J ⊂ R such that IJ = R. Proof. See VIII.6.10 in [14].
In the case of OK , there are some important groups that arise from fractional ideals, the most important being the class group.
Definition. Let IK denote the group of fractional ideals of OK and let PK denote the sub- group of all principal fractional ideals of OK :
∗ PK = {αOK | α ∈ K }.
Then the quotient IK /PK is called the ideal class group of K, denoted C(OK ). In Section 1.7 we explore this group fully. A major result we will prove is
Theorem. C(OK ) is a finite group.
1.3 Ramification of Primes
Let K be a number field and suppose L/K is any finite extension. If p is a prime ideal of OK then pOL is an ideal of OL and hence has prime factorization
e1 eg pOL = P1 ··· Pg where Pi are the distinct prime ideals of OL containing p. We will sometimes say a prime Pi lies over p, Pi contains p or Pi divides pOL.
Definition. For each Pi, the integer ei is called the ramification index of p in Pi. If any of these are greater than 1, p is said to ramify in L.
Definition. Each ideal Pi lying over p gives a residue field extension OL/Pi ⊃ OK /p. The degree of this extension, denoted fi, is called the inertial degree of p in Pi.
Definition. A prime p is said to split completely in L if ei = fi = 1 for all Pi in the prime factorization of pOL. If in addition pOL is itself a prime ideal, i.e. g = 1, we say p is inert.
8 1.3 Ramification of Primes 1 Algebraic Number Fields
The set of all prime ideals of a ring R together with (0) is called the spectrum of R, denoted Spec(R). We will also occasionally use Spec(p) to denote the set of primes P ⊂ OL lying over p ⊂ OK .
2 Example 1.3.1. In Z[i], (2) = (1 + i) so (2) ramifies with e1 = 2. By contrast, (3) is inert ∼ in Q(i) with residue field Z[i]/(3) = F9, and (5) = (2 + i)(2 − i) is unramified.
The next lemma characterizes the primes of OL which divide pOL.
Lemma 1.3.2. A prime ideal P ⊂ OL divides pOL if and only if p = P ∩ K.
Proof. ( =⇒ ) Clearly p ⊂ P ∩ K 6= OK . Since p is maximal, this implies p = P ∩ K. ( ⇒ = ) If p ⊂ P then we have seen that pOL ⊂ P and this implies that P occurs in the prime factorization of pOL. There is an important relation between the ramification indices, inertial degrees and number of primes in Spec(p) that is described in the next theorem, known as the efg theorem.
Theorem 1.3.3. Let m = [L : K] and let P1,..., Pg be the prime OL-ideals containing p ⊂ OK . Then g X eifi = m. i=1
Furthermore, if L/K is Galois, then all the ramification indices are equal to e = e1 and all the inertial degrees are equal to f = f1, so efg = m.
Proof. The first statement is proven by showing both sides are equal to [OL/pOL : OK /p]. By the Chinese remainder theorem,
OL OL Y OL = ∼= . Q ei ei pOL Pi Pi
For each i = 1, . . . , g, fi is the degree of the extension OL/Pi ⊃ OK /p, and for each ri, ri ri+1 ri ri+1 Pi /Pi is an OL/Pi-module. Since there is no ideal between Pi and Pi –(OL)Pi is a DVR – this module has dimension 1 as an OL/Pi-vector space, and hence dimension fi as an OK /p-vector space. Therefore each quotient in the chain
2 ei OL ⊃ Pi ⊃ Pi ⊃ · · · ⊃ Pi
ei has dimension fi over OK /p. Thus [OL/Pi : OK /p] = eifi. This shows that the left side equals [OL/pOL : OK /p]. For the other equality, we first prove it when OL is a free OK -module (e.g. when OK is a ∼ n = n PID). On one hand, OK −→OL induces an isomorphism K → L which shows that n = m. ∼ n = n On the other hand, OK −→OL also induces an isomorphism (OK /p) → OL/pOL which shows that m = n = [OL/pOL : OK /p]. In the general case, localize OK at p to obtain a 0 0 DVR OK = (OK )p. Since a DVR is always a PID, OL = (OL)p satisfies
0 Y 0 ei pOL = (PiOL)
9 1.3 Ramification of Primes 1 Algebraic Number Fields
0 0 0 0 so [OL/pOL : OK /pOK ] = m. This completes the first part of the proof. Now assume L is Galois over K. Take σ ∈ G = Gal(L/K). Then if P ⊂ OL is a prime ideal, so is σ(P). Moreover, if P contains p then by Lemma 1.3.2 so must σ(P). Clearly e(σ(P) | p) = e(P | p) and f(σ(P) | p) = f(P | p). To complete the proof, we will show that G acts transitively on Spec(p), the set of prime ideals of OL lying over p. Suppose P and Q both contain p but are not Galois conjugates. By the Chinese remainder theorem we can find an element β ∈ Q that does not lie in σ(P) for any σ ∈ G. Define b = N(β), where N denotes the norm (see Section 1.5). Then b ∈ OK and −1 since β ∈ Q, b ∈ Q as well. Thus b ∈ OK ∩ Q = p. On the other hand, β 6∈ σ (P) for any σ ∈ G so σ(β) 6∈ P. However, N(σ(β)) = N(β) = b ∈ p so we have p ⊂ P which contradicts primality of p. Hence Gal(L/K) acts transitively on the primes containing p and the result follows by the preceding paragraph since e and f are invariant under Gal(L/K). As we saw in Example 1.2.11, it is hardly easy to determine the factorization of ideals in a number field. The next theorem will be of immense importance going forward, as it allows us to describe the splitting behavior of a prime p ⊂ OK as we pass to an extension L/K.
Theorem 1.3.4. Let L/K be Galois, where L = K(α) for some α ∈ OL. Let f(x) ∈ OK [x] be the minimal polynomial of α over K. Suppose p is a prime ideal of OK and f(x) is separable mod p. Then (1) p is unramified in L.
(2) If f(x) ≡ f1(x) ··· fg(x) mod p for distinct fi(x) which are irreducible mod p, then Pi = pOL + fi(α)OL is a prime ideal of OL, and the prime factorization of pOL is
pOL = P1 ··· Pg.
Furthermore, deg fi = f(Pi | p) for all i, and since L/K is Galois, these are all the same.
(3) p splits completely in L ⇐⇒ f(x) ≡ 0 mod p has a solution in OK . Proof. (1) and (3) will follow during the course of proving (2). To prove (2), observe that since f(x) is separable mod p, f(x) ≡ f1(x) ··· fg(x) mod p for distinct, irreducible (mod p) polynomials fi(x). If P ⊂ OL is a prime lying over p, then fi(α) ∈ P for some i; we may relabel the fi so that f1(α) ∈ P. Then by Galois theory,
[OL/P : OK /p] ≥ [L : K] = deg f.
Now for any σ ∈ Gal(L/K) such that σ(P) = P, f1(σ(α)) ∈ P and f1(x) is separable by hypothesis, so deg f1 ≥ ef, where e and f are the ramification index and inertial degree, respectively, of p in P. This shows that e = 1 and f = deg f1, so (1) is proved. Now let pOL = P1 ··· Pg be the prime factorization of pOL into prime ideals of OL. Theorem 1.3.3 implies that deg fi = f for all i, so it remains to prove that each Pi is generated by p and fi(α). On one hand, pOL + fi(α)OL is contained in Pi since fi(α) ∈ Pi Y (reindexing if necessary). On the other hand, (pOL + fi(α)OL) ⊂ pOL. Each ideal on the left is contained in a prime ideal in the factorization of pOL, and this must be Pi for each i. This completes the proof of (2).
10 1.3 Ramification of Primes 1 Algebraic Number Fields
We will develop further techniques for deciding when a prime ramifies/splits/stays inert in Section 1.6. For the moment, we do not even know if there are an infinite number of primes splitting in an extension L/K; this question will finally be given an answer in Section 2.10.
Example 1.3.5. In this example we provide a full characterization√ of the splitting behavior of primes in quadratic extensions. Suppose K = Q( n) where n is a squarefree integer. Then K/Q is Galois, so for each prime p ∈ Z we have 2 = efg by Theorem 1.3.3. There are exactly three possibilities for e, f and g:
2 e = 2 and f, g = 1. In this case p ramifies in OK so pOK = P for some prime ideal P. It turns out that there are only finitely many such primes since by (3) of the previous theorem, p ramifies in K if and only if x2 + n ≡ 0 (mod p) has a multiple root. This ties in with the idea that the discriminant of a polynomial determines its number of roots – in Section 1.6, we will see that the connection between ramification and discriminants runs even deeper.
f = 2 and e, g = 1. In this case p is inert, so pOK is prime. It turns out that this happens half the time (minus the finitely many cases when a prime ramifies).
g = 2 and e, f = 1. Here p splits completely in OK , so pOK = P1P2 for prime ideals P1 6= P2. This happens the other half of the time. √ Definition. For a quadratic field K = Q( n), the discriminant of K is ( n if n ≡ 1 (mod 4) dK = 4n otherwise.
For any integer q we also define the Kronecker symbol by 0 if q ≡ 0 (mod 4) q = 1 if q ≡ 1 (mod 8) 2 −1 if q ≡ 5 (mod 8). √ As a consequence of the above characterization of primes in OK , where K = Q( n), we have the following characterization of the splitting of primes in a quadratic extension. √ Proposition 1.3.6. A prime p ramifies in K = Q( n) if and only if p | dK , and p splits dK completely in K if and only if p = 1. The first statement follows from the general case in Sections 1.6 and the second is a −4n −n 2 consequence of (3) of Theorem 1.3.4, since p = p = 1 if and only if x + n ≡ 0 (mod p) for some integer x. For now, let’s take a look at a familiar example.
Example 1.3.7. Let K = Q(i) and recall that the Gaussian integers Z[i] are the ring of integers for K. In this example we will describe the splitting behavior of primes in Z[i]. From the last few results, we claim that for an odd prime integer p (excluding p = 2) the following are equivalent:
11 1.4 The Decomposition and Inertia Groups 1 Algebraic Number Fields
(i) p ≡ 1 (mod 4).
(ii)( p) splits completely in Z[i]. (iii) p = x2 + y2 for some integers x, y.
Proof. To prove our claim, note that Z[i] is the ring of integers for K = Q(i) so we may take the α in Theorem 1.3.4 to be i, which has minimal polynomial x2 + 1 over Q. Thus we know that (p) splits completely in Z[i] if and only if x2 + 1 splits modulo p. This in turn happens × if and only if Fp contains a fourth root of unity, i.e. Fp contains an element of order 4. Since × Fp has order p − 1, this means 4 | p − 1 and so (i) ⇐⇒ (ii) is proven. Next suppose (p) splits in Z[i]; let (p) = p1p2 for prime ideals p1, p2 ∈ Z[i]. In Exam- ple 1.7.2, we will prove that the ring of Gaussian integers Z[i] is a PID. Using this fact, we 2 2 know p1 = (x + yi) for integers x and y, but then p2 must be (x − yi). Therefore p = x + y up to multiplication by a unit in Z[i]. However the only units are ±1, ±i so clearly p must just be x2 + y2. Conversely, if p = x2 + y2 then p = (x + yi)(x − yi) in Z[i]. Note that this solves Fermat’s theorem characterizing primes of the form x2 + y2. It will be a continuing theme in these notes to fully characterize primes of the form x2 + ny2 for all integers n.
1.4 The Decomposition and Inertia Groups
In this section we describe two important subgroups of Gal(L/K) for a Galois extension L/K of number fields.
Definition. For a Galois extension L/K and a prime ideal P ⊂ OL lying over p ⊂ OK , the decomposition group of P is
DP = {σ ∈ Gal(L/K) | σ(P) = P} and the inertia group of P is
IP = {σ ∈ Gal(L/K) | σ(α) ≡ α mod P for all α ∈ OL}.
Let k = OK /p and ` = OL/P denote the respective residue fields of p and P. We will prove that there is an exact sequence
1 → IP → DP → Gal(`/k) → 1.
Recall from the proof of Theorem 1.3.3 that G = Gal(L/K) acts transitively on Spec(p). Then we can interpret DP as the stabilizer of P under this action. The Orbit-Stabilizer Theorem tells us that [G : DP] = g, where g is the number of distinct primes in the factorization of pOL. Hence |Dp| = ef.
Lemma 1.4.1. For a fixed prime ideal p ⊂ OK , the decomposition groups DP of the prime ideals lying over p are conjugate subgroups of Gal(L/K).
12 1.4 The Decomposition and Inertia Groups 1 Algebraic Number Fields
Proof. This is a more general fact about the stabilizers of a transitive group action. Note that for σ, τ ∈ Gal(L/K),
−1 −1 τ στ ∈ DP ⇐⇒ τ στP = P ⇐⇒ στP = τP ⇐⇒ σ ∈ DτP
−1 −1 which implies that σ ∈ DP ⇐⇒ τστ ∈ DP. Hence τDPτ = DτP. The decomposition group is useful because we can view an extension L/K as a tower of extensions so that we understand the splitting of primes better in each step of the tower.
Proposition 1.4.2. Let L/K be a Galois extension and fix a prime p ⊂ OK . Let D = DP be the decomposition group for a particular prime P lying over p. Then the fixed field LD = {α ∈ L | σ(α) = α for all σ ∈ D}
is the smallest subfield E of L such that g = 1 for P ∩ OE. Proof. First suppose E = LD. By Galois theory, Gal(L/E) ∼= D and as in the last section, D acts transitively on the set of primes of OL lying over PE := P∩OE. One of these primes is P itself, and D fixes P by definition, so this must be the only prime lying over PE, i.e. g = 1. On the other hand, if g = 1 for PE then Gal(L/E) fixes P: it’s the only prime over PE. So Gal(L/E) ≤ D and by Galois correspondence, LD ⊂ E. This shows that p does not split when moving from LD to L: it either ramifies or stays D inert. Let E = L and denote P ∩ OE by PE, e = e(P | p), f = f(P | p) and g = g(P | p). To piece together more of the puzzle, we have the following.
Proposition 1.4.3. Given K, L, E, p, P and PE as above, e(PE | p) = f(PE | p) = 1, g = [E : K], e = e(P | PE) and f = f(P | PE). Proof. As mentioned in the remarks preceding Lemma 1.4.1, the Orbit-Stabilizer Theorem implies that g = [Gal(L/K): D]. Then by Galois theory, [Gal(L/K): D] = [E : K], so this equals g. The previous proposition gives us g(P | PE) = 1, and by Theorem 1.3.3 we have [L : K] e(P | P )f(P | P ) = [L : E] = E E [E : K] efg = = ef. [E : K]
Now e(P | PE) ≤ e and f(P | PE) ≤ f, so we must have that e(P | PE) = e and f(P | PE) = f. It follows easily that e(PE | p) and f(P | p) are 1.
Fix a prime P ⊂ OL lying over p ⊂ OK . Observe that each σ ∈ DP acts on the finite field ` = OL/P and fixes k = OK /p so we obtain a group homomorphism
ϕ : DP −→ Gal(`/k). The next two results establish that ϕ is surjective, which we will use to prove exactness of the sequence described at the start of this section.
13 1.4 The Decomposition and Inertia Groups 1 Algebraic Number Fields
Lemma 1.4.4. The residue field extension `/k is Galois. Proof. First we show that `/k is normal. To do this, take anyα ˜ ∈ ` and let f(x) be its minimal polynomial over k. Let α ∈ OL be a lift ofα ˜. Then Y f(x) = (x − σ(α)) ∈ OK [x]
σ∈DP
splits completely over OK and hasα ˜ as a root, when taken mod p. Thus `/k is normal. Furthermore, `/k will be Galois whenever it is separable, but since OK /p is a finite field, it is perfect and therefore any finite extension is separable [10]. Proposition 1.4.5. ϕ is surjective.
Proof. By the lemma, `/k is Galois. We will show that ϕ(DP) acts transitively on the conjugates ofα ˜ over k. By the Chinese remainder theorem, one may choose α ∈ OL such that ( α˜ mod P α ≡ 0 mod P0 for any other P0 lying over p.
−1 Then for any σ ∈ G r DP we have α ≡ 0 mod σ P and hence σ(α) ≡ 0 mod P. This implies that Y Y f˜(x) = x − σ](α) x
σ∈DP σ6∈DP Y Y = (x − ϕ(σ)(˜α)) x
σ∈DP σ6∈DP
which lies in k[x]. Notice that the first product lies in k[x], so it is divisible by the minimal polynomial of α over k. So given any conjugateα ˜0 ofα ˜,(x − α˜0) divides the first product 0 above and thusα ˜ must equal ϕ(σ)(˜α) for some σ ∈ DP. Hence the action of ϕ(DP) on the conjugates ofα ˜ is transitive and it follows that ϕ is surjective since the image has at least [` : k] = | Gal(`/k)| elements.
Next we relate the inertia group IP to the map ϕ, and use it to prove that the original sequence we defined is exact.
Proposition 1.4.6. The inertia group IP is the kernel of ϕ : DP → Gal(`/k). Proof. By definition
ker ϕ = {σ ∈ DP | σ(α) ≡ α mod P for all α ∈ OL} so it suffices to show that if σ 6∈ DP then there exists an α ∈ OL such that σ(α) 6≡ α mod P. −1 −1 −1 If σ 6∈ DP then of course σ 6∈ DP so σ (P) 6= P. Since both σ (P) and P are maximal ideals, there exists some α ∈ P with α 6∈ σ−1(P), which implies σ(α) 6∈ P. Thus σ(α) 6≡ α mod P and it follows that IP = ker ϕ. We now summarize our findings.
14 1.5 Norms of Ideals 1 Algebraic Number Fields
Corollary 1.4.7. If L/K is a Galois extension, the sequence
1 → IP → DP → Gal(`/k) → 1 is exact. Moreover, |IP| = e and |DP| = ef, where e and f are the ramification index and inertial degree, respectively, for L/K.
Notice that the inertia group is a very useful measure of how a prime p ramifies in a Galois extension L. This is a common theme in algebraic number theory: the behavior of primes in an extension is often encoded in the automorphisms of the field itself.
1.5 Norms of Ideals
In this section we define the norm of an ideal. As in previous sections, all of these definitions and results generalize to any Dedekind domain A with integral closure B – see [19] for the general cases. In our context, we will replace A with OK and B with OL, which have fields of fractions K and L, respectively. Let IK and IL denote the groups of fractional ideals of OK and OL, respectively. We want to define a group homomorphism N : IL → IK . Since IL is the free abelian group on the set of prime ideals in OL, we only have to define N for p prime. Let p be a prime ideal of OL and factor
Y ei pOL = Pi for Pi prime. Suppose p = (π) is principal. Then we should have
m m N (pOL) = N (πOL) = N (π)OK = (π) = p where m = [L : K]. We also want N to be a homomorphism, so we must have Y ei Y ei N (pOL) = N Pi = N (Pi) .
X Recall that m = eifi, so the correct definition for N is
Definition. For a prime P ⊂ OL lying over p ⊂ OK , the norm of P is defined to be
N (P) = pf where f = [OL/P : OK /p]. To distinguish this norm from a similar norm to be defined shortly, we will sometimes refer to N as the ideal norm. If the norm is taken with respective to an extension L/K, we write NL/K but when the context is clear we will often drop the decoration. Remark. By the properties of inertial degree f, it is easy to see that for a tower M ⊃ L ⊃ K,
NL/K (NM/L(a)) = NM/K (a).
15 1.5 Norms of Ideals 1 Algebraic Number Fields
Next we check that the properties discussed above hold for the norm we have defined.
Proposition 1.5.1. Let L/K, OK and OL be as above.
m (a) For any nonzero ideal a ⊂ OK , N (aOL) = a where m = [L : K].
(b) If L/K is Galois and P ⊂ OL is any nonzero prime ideal with p = P ∩ OK and e pOL = (P1 ··· Pg) , then
ef Y N (P) = (P1 ··· Pg) = σ(P). σ∈Gal(L/K)
(c) For any nonzero element β ∈ OL, N(β)OK = N (βOL), where N denotes the regular field norm.
Proof. (a) It suffices to prove this for prime ideals, for which we have
P Y ei eifi m N (pOL) = N Pi = p = p using Theorem 1.3.3. f (b) Since N (Pi) = p for any prime Pi in the prime factorization of pOL, the left equality is clear. Recall that G = Gal(L/K) acts transitively on the set Spec(p) = {P1,..., Pg}. Then by the Orbit-Stabilizer Theorem, each Pi occurs | Gal(L/K)| m = = ef |Spec(p)| g times in the collection {σ(P) | σ ∈ G}, which implies the right equality. (c) First suppose L/K is Galois. Denote βOL by b. The map IK → IL given by a 7→ aOL is injective since IK and IL are free on nonzero prime ideals, so it suffices to show that N(β)OL = N (b). But by (b), ! Y Y Y N (b) = σ(b) = (σ(β)OL) = σ(β) OL = N(β)OL. σ∈G σ∈G σ∈G
In the general case, let E be a finite Galois extension of K containing L, with d = [E : L] and OE the integral closure of OL in E. Then we have
d NL/K (βOL) = NE/K (βOE) by the remark
= NE/K (β)OK by the Galois case d = NL/K (β) OK .
Lastly since IK is torsion-free, the above implies that NL/K (βOL) = NL/K (β)OK for all nonzero β ∈ OL.
For a Galois extension K/Q, we define a different norm taking ideals of OK to integers. We will see that the definition below coincides with the ideal norm.
16 1.6 Discriminant and Different 1 Algebraic Number Fields
Definition. Let a ⊂ OK be a nonzero ideal. The numerical norm of a is its index in the lattice of integers: N(a) = [OK : a].
In order to justify this definition, we need to check that [OK : a] is always finite.
Proposition 1.5.2. Every nonzero ideal a in OK has finite index in the lattice OK .
Proof. Let a be a nonzero OK -ideal. The proof of Proposition 1.2.5 shows that a contains a nonzero integer m (a0 from that proof). So consider ϕ : OK /mOK → OK /a, which is clearly surjective. By Proposition 1.1.13, OK is a free Z-module of rank n = [K : Q]. This n OK ∼ Z n means that = n is a finite quotient of order m . Since ϕ is surjective, it follows mOK m n Z that |OK /a| ≤ m < ∞.
Notice that the ideal norm is defined for any extension L/K and outputs an ideal of OK . On the other hand, the numerical norm is defined on K/Q and outputs an integer in Z. The connection between the two norms is described in the next proposition. Proposition 1.5.3. Let K be any number field.
(a) For any ideal a ⊂ OK , NK/Q(a) = (N(a)) and therefore N(ab) = N(a)N(b). −1 (b) For any fractional ideals b ⊂ a of OK , [a : b] = N(a b).
Y ei fi Proof. (a) Write a = pi and let fi = f(pi | pi) where (pi) = Z ∩ pi. Then N (pi) = (pi) . ∼ Y ei By the Chinese remainder theorem, OK /a = OK /pi and thus
Y ei [OK : a] = [OK : pi ].
ei eifi Y eifi We previously proved that [OK : pi ] = pi , thus [OK : a] = (pi ) = NK/Q(a). When we identify the set of nonzero ideals of Z with the set of positive integer generators, N and N are seen to coincide, and multiplicativity of N follows from the same property of the ideal norm. (b) We can multiply by some integer d to make a and b integral ideals. Then part (a) gives us [O : db] N(db) [a : b] = [da : db] = K = = N(a−1b). [OK : da] N(da)
1.6 Discriminant and Different
One may recall the definition of discriminant from field theory. Here we present it in the context of extensions of number fields.
Definition. For a number field extension L/K with rings of integers OL ⊃ OK , suppose OL has a basis {β1, . . . , βm} over OK . Then the discriminant of OL is
D(OL) = D(β1, . . . , βm) = det(T rL/K (βiβj)) where T r denotes the trace.
17 1.6 Discriminant and Different 1 Algebraic Number Fields
In this section we use the discriminant to characterize which primes ramify in Galois extensions of a number field.
Definition. Let D = D(OL) as above. The discriminant ideal of OL, denoted ∆(L/K) or simply ∆, is the ideal of OK generated by D. We will prove
Theorem. The primes which ramify in OL are those that divide ∆. First, we establish some properties of the discriminant. The details can be found in [19]. Definition. Let L = K(β) for some β ∈ L and let f be the minimal polynomial of β over K, setting deg f = m. Then the discriminant of f is defined to be
2 m−1 m(m−1)/2 0 D(f) := D(1, β, β , . . . , β ) = (−1) NL/K (f (β)). Proposition 1.6.1. D(f) = 0 if and only if f has a repeated root, i.e. is not separable. Example 1.6.2. Let f(x) = xn + ax + b for some a, b ∈ K. We may assume f is irreducible and separable over K. Let β be any root of f and set
γ = f 0(β) = nβn−1 + a.
To compute the norm of γ, note that
nβ−1(βn + aβ + b) = 0 nβn−1 + na + nbβ−1 = 0 or γ = −(n − 1)a − nbβ−1.
Solving for β yields −nb β = γ + (n − 1)a and we can see that K(β) = K(γ), so the minimal polynomial of γ over K also has degree n. Next let g(x) −nb = f . h(x) x + (n − 1)a We do this because the object on the right may not be a proper polynomial, but it is rational. g(γ) Then h(γ) = f(β) = 0 so g(γ) = 0. Since
g(x) = (x + (n − 1)a)n − na(x + (n − 1)a)n−1 + (−1)nnnbn−1
is monic with degree n, this must be the minimal polynomial of γ. Moreover, N (γ) is just (−1)n times the constant term of g, so N (γ) = nnbn−1 + (−1)n−1(n − 1)n−1an. This suggests the following. Proposition 1.6.3. Let f(x) = xn + ax + b with f(β) = 0 for some β ∈ K. Then
D(f) = (−1)n(n−1)/2(nnbn−1 + (−1)n−1(n − 1)n−1an).
18 1.6 Discriminant and Different 1 Algebraic Number Fields
Proof. By the work above,
N (f 0(β)) = N (γ) = nnbn−1 + (−1)n−1(n − 1)n−1an.
Apply Proposition 1.6.3 to obtain the desired formula. ¯ ¯ Lemma 1.6.4. Let OL have basis {β1, . . . , βm} over OK . Then for any OK -ideal a, {β1,..., βm} is a basis for OL/aOL over OK /a and the discriminant satisfies ¯ ¯ D(β1,..., βm) ≡ D(β1, . . . , βm) mod a,
where the discriminant on the left is taken with respect to OL/aOL (as a module over OK /a) and on the right with respect to OL over OK . Proof. See 3.36 in [19]. We are now ready to prove the main result.
Theorem 1.6.5. A prime p ⊂ OK ramifies in OL if and only if p | ∆(L/K).
Proof. By definition ∆ = ∆(L/K) is the ideal generated by D = D(OL). Thus p | ∆ if and only if D ∈ p, which in turn happens if and only if ¯ ¯ ¯ ¯ D(β1,..., βm) = det(T r(βiβj)) = 0
e1 eg in OK /p by Lemma 1.6.4. Let p have factorization pOL = P1 ··· Pg . By the Chinese remainder theorem, ∼ e1 eg OL/pOL = OL/P1 ⊕ · · · ⊕ OL/Pg .
First suppose p is not ramified in L. Then each ei = 1 and OL/Pi is a separable extension of OK /p. Let ti denote the trace map OL/Pi → OK /p. Select a basis {ui} for OL/pOL such that {u1, . . . , uk} is a basis for OL/P1, {uk+1, . . . , uk+l} is a basis for OL/P2, etc. Then for eachy ¯ ∈ OL/pOL,y ¯ = y1 + ... + yg with yi ∈ OL/Pi. Each multiplication map ri : x 7→ xyi takes OL/Pi to itself, and if ri has standard matrix Ai then the matrix for r : x 7→ xy¯ decomposes into the block matrix A1 0 A 2 A = .. 0 . Ag
Then T r(¯y) = t1(y1) + ... + tg(yg). More importantly, the discriminant matrix has block form ∆1 ∆ 2 B = .. . ∆g
19 1.6 Discriminant and Different 1 Algebraic Number Fields
where ∆i is the discriminant of the chosen basis of OL/Pi over OK /p. But OL/Pi is separable over OK /p if and only if det ∆i 6= 0. Hence if we view the B above as a change of basis matrix, we have
¯ ¯ 2 D(β1,..., βg) = (det B) (det ∆1)(det ∆2) ··· (det ∆g) 6= 0.
By the initial comments, this shows that when p is unramified, p - ∆. On the other hand, if some ei > 1 then OL/Pi is not separable over OK /p. We may e1 reindex the primes lying over p so that e1 > 1. Choose a basis {vi} for OL/P1 such that e1 e1 v1 ∈ P1/P1 . In the quotient, v1 = 0 so the multiplication map rv1 : x 7→ xv1 from above e1 (now defined for the vi) is trivial. Moreover, (v1vj) = 0 so the characteristic polynomial
for the map rv1vj only has roots for its eigenvalues. Thus ti(v1vj) = T r(rv1vj ) = 0 so the discriminant matrix for OL/P1 over OK /p has a row of zeros. Hence det ∆1 = 0 which ¯ ¯ implies D(β1,..., βm) = 0. By the preliminary comments, this shows that p divides ∆.
This tells us when a prime in OK ramifies in L, but the discriminant misses some critical information:
Which primes in OL lying over p ramify? That is, which primes P in the factorization of pOL have ramification index greater than 1? How do we determine the multiplicity of a prime dividing the discriminant? The rest of this section follows K. Conrad’s paper “The Different Ideal”, which outlines the techniques required to answer these questions. To motivate the problem, consider the following example.
Example 1.6.6. Let K = Q(α) where α is a root of f(x) = x3 −x−1. By Proposition 1.6.3, this polynomial has discriminant −23 so 23 is the only integer prime which ramifies in OK . Since [K : Q] = 3 and x3 − x − 1 ≡ (x − 3)(x − 10)2 mod 23,
2 the factorization we obtain from Theorem 1.3.4 is 23OK = pq where p 6= q and both are prime. In general, how do we know that q ramifies but p doesn’t? Definition. For a lattice L in a number field K, its dual lattice is
∨ L = {α ∈ K | T rK/Q(αL) ⊂ Z}.
Proposition 1.6.7. For a lattice L with Z-basis {e1, . . . , en}, the dual lattice may be written
n ∨ M ∨ L = Zei i=1
∨ ∨ where {ei } is the dual basis of {ei} relative to the trace product on K/Q. In particular, L is a lattice. Proof. See 3.4 in [6].
20 1.6 Discriminant and Different 1 Algebraic Number Fields
We will see in the next section just how useful lattices can be in algebraic number theory, but for now we will focus on the lattice OK and its fractional ideals. Consider the dual lattice ∨ ∨ OK . First, we should recognize that OK is not just the elements of K with trace in Z – ∨ actually it’s smaller. But since algebraic integers have integral trace, we see that OK ⊂ OK .
∨ ∨ −1 ∨ Proposition 1.6.8. For any fractional ideal a in K, a is a fractional ideal and a = a OK . ∨ Moreover, OK is the largest fractional ideal of K whose elements all have integral trace.
∨ Proof. By definition, a = {α ∈ K | T rK/Q(αa) ⊂ Z}. First, since any dual lattice is a lattice by the previous proposition, we know a∨ is a finitely generated Z-module. Take ∨ α ∈ a and x ∈ OK . Then for any β ∈ a,
T rK/Q((xα)β) = T rK/Q(α(xβ)) ∈ Z
∨ ∨ ∨ since xβ ∈ a and α ∈ a . Thus xα ∈ a so a is a fractional OK -ideal. ∨ ∨ Next we check the formula for a . Take α ∈ a again. Then for any β ∈ a, T r(αβOK ) ⊂ Z ∨ −1 ∨ ∨ −1 ∨ since βOK ⊂ a. Thus αa ⊂ OK which implies α ∈ a OK . This shows a ⊂ a OK and the reverse containment is similarly shown. For the last statement, note that any fractional OK -ideal satisfies a = aOK . Therefore
∨ T r(a) ⊂ Z ⇐⇒ T r(aOK ) ⊂ Z ⇐⇒ a ⊂ OK .
∨ Since OK is a fractional ideal containing OK , its inverse is an integral ideal contained in OK , from which we define: Definition. The different of K is the ideal
∨ −1 ∨ DK = (OK ) = {x ∈ K | xOK ⊂ OK }.
Example 1.6.9. For K = Q(i) with OK = Z[i], T r(a + bi) ∈ Z precisely when 2a ∈ Z, so ∨ 1 we see that Z[i] = 2 Z[i]. Thus the different of K is 2Z[i]. This can be verified with the next proposition.
0 Proposition 1.6.10. If OK = Z[α] then DK = (f (α)) where f(x) is the minimal polynomial of α over Q. Proof. See 4.3 in [6]. √ Example 1.6.11. For a quadratic field K = Q( n) where n is squarefree, ( √ (2 n) if n 6≡ 1 (mod 4) DK = √ ( n) if n ≡ 1 (mod 4).
The different is related to the field discriminant dK by the following.
Theorem 1.6.12. For a number field K of discriminant dK , NK/Q(DK ) = |dK |.
21 1.6 Discriminant and Different 1 Algebraic Number Fields
Proof. Let {β1, . . . , βn} be a Z-basis for OK so that we have
n M OK = Zβi. i=1
n −1 ∨ M ∨ Then DK = OK = Zβi by Proposition 1.6.7. Using the definition of norm, we have i=1
−1 ∨ N (DK ) = [OK : DK ] = [DK : OK ] = [OK : OK ].
∨ We can calculate [OK : OK ] by finding | det A| where A is the matrix expressing the basis ∨ ∨ ∨ {β1, . . . , βn} in terms of the dual basis {β1 , . . . , βn }. Since {βi } is a dual basis of {βi} it follows that
A = (T rK/Q(βiβj)) and by definition det A = D(OK ) = dK . The result follows.
−1 Lemma 1.6.13. For any nonzero ideal a ⊂ OK , a | DK if and only if T r(a ) ⊂ Z.
∨ −1 −1 ∨ Proof. This may be stated as a ⊃ DK = (OK ) which in turn is equivalent to a ⊂ OK . By Proposition 1.6.8, this is equivalent to T r(a−1) ⊂ Z. We next prove the main characterization of ramified primes in terms of the different ideal.
Theorem 1.6.14 (Dedekind). The prime factors of DK are exactly the primes in K that ramify over Q. In particular, for any prime ideal p ⊂ OK lying over a prime p ∈ Z with ramification index e, the multiplicity of p in DK is e − 1 when e 6≡ 0 (mod p), and at least e when p | e.
e−1 Proof. Note that if we assume the last statement holds and p | DK , then every ramified prime divides DK and every unramified prime has e = 1 and hence does not divide DK since 1 and p are relatively prime. Therefore it suffices to prove the last statement. e−1 First, we may write pOK = p a for some fractional ideal a because OK is a Dedekind e domain. Since p divides pOK by definition of e, we must have p | a. By Lemma 1.6.13, e−1 −(e−1) −(e−1) 1 p | DK is equivalent to T r(p ) ⊂ Z. Note that p = p a so that
−(e−1) T r(p ) ⊂ Z ⇐⇒ T r(a) ⊂ pZ. Furthermore, this is the same as writing T r(α) ≡ 0 (mod p) for all α ∈ a. Let α ∈ a. Then
T rK/Q(α) = T rOK /Z(α) and
T rOK /Z(α) mod p = T r(OK /pOK )/Fp (α)
where on the right, α is viewed as an element of a which divides pOK . Moreover, since a is divisible by p, we will have p | aq for some high exponent q. Therefore a high power of α is in OK /pOK which means the map x 7→ xα defined on OK /pOK is nilpotent and thus has e−1 zero trace. By the above work, T r(α) ≡ 0 (mod p) and so p | DK .
22 1.6 Discriminant and Different 1 Algebraic Number Fields
e e To finish the proof, we will show p | DK if and only if p | e. We may write pOK = p b where p - b since e is the ramification index of p in p. By Lemma 1.6.13, e p | DK ⇐⇒ T rOK /Z(b) ⊂ pZ ⇐⇒ T r(OK /pOK )/Fp (β) = 0 for all β ∈ b. ∼ e By the Chinese remainder theorem, OK /pOK = OK /p × OK /b (as rings) so we can use properties of trace to write
e T r(OK /pOK )/Fp (x) = T r(OK /p )/Fp (x) + T r(OK /b)/Fp (x) for any x ∈ OK . Note that if x ∈ b, x = 0 in OK /b so we just have
e T r(OK /pOK )/Fp (x) = T r(OK /p )/Fp (x).
For any y ∈ OK , there is an x ∈ OK such that x ≡ y mod pe and x ≡ 0 mod b
e by the Chinese remainder theorem, so the traces are equal for (OK /pOK )/Fp and (OK /p )/Fp. e Thus it suffices to prove T r(OK /p )/Fp (y) = 0 for all y ∈ OK if and only if p | e. e To do this, filter OK /p by powers of p:
e e 2 e e−1 e OK /p ⊃ p/p ⊃ p /p ⊃ · · · ⊃ p /p ⊃ {1}.
i Each p is an ideal, so the multiplication map my(x) = xy is well-defined on each subspace above. Using a well-known property from linear algebra (see Conrad), we can write
e−1 X i i+1 i i+1 e T r(OK /p )/Fp (y) = T r(my : p /p → p /p ) i=1
2 i i where the traces on the right are in Fp. Take π ∈ p r p so that (π ) is divisible by p but i+1 i i i+1 ∼ i i+1 not p . In other words, p = (π ) + p . This implies OK /p = p /p as OK -modules via i x 7→ π x. Moreover, this isomorphism commutes with my on both sides, so
i i+1 i i+1 T r(my : p /p → p /p ) = T r(my : OK /p → OK /p).
e We have thus proven that T r(OK /p )/Fp (y) = e T r(OK /p)/Fp (y) for all y ∈ OK . We will complete the proof when we have shown that e T r(OK /p)/Fp (y) = 0 for all y ∈ OK /p if and only if p | e. But since OK /p is a finite field, T r : OK /p → Fp is not the zero map, so the above holds if and only if e = 0 in Fp, which is the same as p | e.
Corollary 1.6.15. The primes in Z that ramify in K are precisely the prime divisors of dK .
Proof. Use the fact that |dK | = N (DK ) and Theorem 1.6.14. Note that this also proves the rest of Proposition 1.3.6 which characterized ramified primes in quadratic extensions. The theorem we just proved showed the true power of the different: while the discriminant also tells us if a prime ramifies in an extension, it does not tell us anything about the ramification indices of the primes in the larger field. This information is conveyed by the different, but if we know the full factorization of pOK , we can relate this multiplicity to dK :
23 1.7 The Class Group 1 Algebraic Number Fields
e1 eg Corollary 1.6.16. Suppose pOK = p1 ··· pg with inertial degrees denoted by fi. Then the multiplicity of p in dK is at least
(e1 − 1)f1 + ... + (eg − 1)fg = n − (f1 + ... + fg).
Furthermore, if p - ei for all i then this is the exact multiplicity of p in dK .
It turns out that the multiplicity of a prime p ∈ DK is bounded by
e − 1 ≤ ordp(DK ) ≤ e − 1 + e ordp(e). The left is Theorem 1.6.14 and the right was proven by Hensel (see [6]). ∨ We can extend the ideas of OK and DK to an arbitrary extension of number fields L/K in the following way. Define the fractional ideal
∨ OL = {x ∈ L | T rL/K (xy) ∈ OK for all y ∈ OK }. Then we have Definition. For an extension of number fields L/K, the relative different is
∨ −1 ∨ δL/K = (OL) = {x ∈ L | xOL ⊂ OL}
which is an integral ideal of OL.
As in the case with K/Q we have several important results for the relative different. See section 15 of [13] for details. Theorem 1.6.17. For any extension of number fields L/K, the discriminant and relative different are related by DL/K = NL/K (δL/K ).
Theorem 1.6.18. Let dL and dK be the field discriminants of L and K, respectively. Then dL = ±dK NL/K (DL/K ).
Theorem 1.6.19. For all number field extensions L/K, DL = DK δL/K .
Corollary 1.6.20. The primes of OL that ramify over K are precisely those that divide the relative different δL/K .
1.7 The Class Group
Recall that the class group of a number field K is C(OK ) = IK /PK , where IK is the group of fractional ideals of OK and PK is the subgroup of principal fractional ideals. There is an exact sequence ∗ ∗ 0 → OK → K → IK → C(OK ) → 0. In this section we explore the structure of the class group and prove that its order, called the class number hK of K, is finite. In the previous section we defined the discriminant of a number field K to be dK = D(OK ). We will prove
24 1.7 The Class Group 1 Algebraic Number Fields
Theorem. Let K be a number field with [K : Q] = n and discriminant dK . Let 2s be the number of nonreal embeddings of K into C. Then there exists a set of representatives for the ideal class group C(OK ) consisting of ideals a ⊂ OK with
n! 4 s N(a) ≤ p|d |. nn π k
The value on the right is called the Minkowski bound and is often denoted BK . According to [24], BK is currently the best known bound for a generating set of C(OK ) that does not depend on unproven conjectures. In the statement of the main theorem, 2s counted the number of nonreal embeddings K,→ C. Alternatively, by the primitive element theorem we may write K = Q(α) for some α ∈ K with minimal polynomial f(x) ∈ Q[x]. Then 2s is the number of nonreal roots of f. We will also denote by r the number of real roots of f, so that we have
∼ r s ∼ r+2s K ⊗Q R = R × C = R (as Q-vector spaces). Before proving the main theorem, we present some applications and examples using the Minkowski bound. The first result is an important property of the class group.
Theorem 1.7.1. The class number hK := |C(OK )| is finite for any number field K.
Proof. It suffices to show there are only finitely many ideals a ⊂ OK whose norms fall under Y ri Y rifi the bound. Let a = pi so that N(a) = pi where (pi) = Z ∩ pi. Since N(a) is bounded by BK , there are only finitely many possibilities for the pi – and hence for the pi – and only finitely many possibilities for the ri. Hence the number of such a is finite, and it follows that the class group is finite.
Note that hK = 1 if and only if OK is a principal ideal domain. Thus the class group is a direct measure of how far the ring of integers is from being a PID. Since every PID is also a UFD, the class number is related to how badly unique factorization fails in OK . An open question in class field theory asks if there are infinitely many number fields with hK = 1. However, it is known√ [7] that there are only nine imaginary quadratic fields with class number 1; these are Q( n) for n = −1, −2, −3, −7, −11, −19, −43, −67, −163.
Example 1.7.2. Let K = Q(i). Then n = 2, s = 1 and |dK | = 4 so the Minkowski bound is
2! 4 1 √ 4 4 = < 2. 22 π π
Thus every fractional ideal is equivalent to an ideal of norm 1. Since the only ideal of norm 1 is (1), every ideal is principal. Hence hK = 1, which reflects the fact that Z[i] is a PID. √ √ Example 1.7.3. Let K = Q( 10) with OK = Z[ 10]. Then n = 2, s = 0 and |dK | = 40, so the Minkowski bound is
2! 4 0 √ 1 √ √ 40 = · 2 10 = 10 < 4. 22 π 2
25 1.7 The Class Group 1 Algebraic Number Fields
The main theorem implies that every ideal class has an integral representative with norm 1, 2 or 3. We will use the techniques in Section 1.3 to compute the class√ group. 2 The ideal 2OK is ramified in OK and we see that 2OK √= (2, 10) . If this were a principal ideal, we would have 2OK = (α) for some α = a + b 10 which would have norm 2 2 ±2. Equivalently the equation a − 10b = ±2 would have an integer solution. However,√ 0 and ±1 are the only squares mod 5 so a2 −10b2 = ±2 has no integer solutions. Thus (2, 10) is a nontrivial element in the class group and has order 2 since its square is the principal ideal 2OK . This shows that 2 | hK . Next we find integral ideals with norm 3. By Proposition 1.3.6, 3OK splits and we compute its factorization to be √ √ 3OK = (3, 2 + 10)(3, 4 + 10).
2 2 If either of these prime divisors were principal, then x − 10y √= ±3 would have√ integer solutions. Since it doesn’t for the same reasons as above, (3, 2 + 10) and (3, 4 + 10) are both nontrivial elements of the class group. Finally we must√ decide if any√ of these prime ideals belong to the same ideal class in C(O ). Let u = 4+√10 = 1 (1 + 10). Then K 2+ 10 3 √ √ √ √ √ (3, 2 + 10) · u = (3u, 4 + 10) = (1 + 10, 4 + 10) = (3, 4 + 10)
so the classes with norm 3 are equal. We have shown that everything in C(OK ) is equivalent to one of √ √ (1) (2, 10) or (3, 2 + 10). Thus the class group has order ≤ 3 and contains an element of order 2. This implies |C(OK )| = 2. √ Example 1.7.4. Let K √= Q( −5). Then a set of representatives for C(OK ) may be chosen with N(a) ≤ BK ≈ 0.63 20 < 3. Thus every ideal that satisfies this must divide 2OK . In fact we can use Theorem 1.3.4 to compute the factorization of 2 in K as √ 2 2OK = (2, 1 + −5) . √ 2 Since N√(2OK ) = 2 = 4, it must be that N((2, 1 + −5)) = 2. This shows√ that OK and (2, 1 + −5) form a set of representatives for C√(OK ). Further, (2, 1 + −5) cannot be principal because there is no element α = a + b −5 with N(α) = a2 + 5b2 = 2. Hence |C(OK )| = 2. Another useful application of the Minkowski bound is to prove
Theorem 1.7.5. Every extension of Q ramifies at some prime.
Proof. We will prove this for K/Q a finite extension. A set of representatives of C(OK ) has at least one element and the element has numerical norm ≥ 1. Define a sequence an by nn π n/2 an = n! 4 and note that by the Minkowski bound, nn π s a ≤ ≤ p|d |. n n! 4 K
26 1.7 The Class Group 1 Algebraic Number Fields
We can also see that a2 > 1 and for all n ≥ 2,
a π 1/2 1 n n+1 = 1 + > 1. an 4 n
So the sequence (an) is monotone increasing. This implies |dK | > 1 and Corollary 1.6.15 tells us that some prime ramifies.
This shows that the only unramified extension of Q is Q itself. However, there may exist unramified extensions of a number field other than Q. In the next section we will describe the Hilbert class field of K, which is the maximal unramified abelian extension L of K. This field has the special property that Gal(L/K) is isomorphic to the class group C(OK ). Constructing further abelian extensions of the Hilbert class field is called the class field tower problem. Let K1 be a number field with class number h1 > 1. Let K2 be the Hilbert class field of K1, let K3 be the Hilbert class field of K2, and so on. It is an open question to decide when the tower · · · ⊃ K3 ⊃ K2 ⊃ K1 ⊃ Q is infinite, or terminates with a field of class number 1 in a finite number of steps. Golod and Shafarevich [11] proved that there are fields K1 with infinite class field towers. For the rest of the section, we develop the mechanics required to prove the Minkowski bound. First we redefine the notion of a lattice in a vector space – a slight generalization of the definition given in Section 1.1, where lattices were assumed to have full rank.
Definition. Let V be an n-dimensional vector space over R.A lattice in V is a subgroup of the form Λ = Ze1 + ... + Zer
where e1, . . . , er are linearly independent vectors in V . When r = n, Λ is said to be a full lattice in V .
Remark. A lattice is a free abelian subgroup of V generated by elements of V that are linearly independent over R. A full lattice Λ ⊂ V is a subgroup such that the map
R ⊗Z Λ −→ V X X ri ⊗ xi 7−→ rixi
is an isomorphism.
Since V is isomorphic to Rn, this induces a topology on V . Definition. A subgroup W ⊂ V is a discrete subgroup if every point in W is open in the topology on V , i.e. if every w ∈ W has a neighborhood U such that U ∩ W = {w}.
Proposition 1.7.6. A subgroup Λ ⊂ V is a lattice if and only if it is a discrete subgroup.
Proof. ( =⇒ ) is clear. See 4.15 in [19] for the other direction.
27 1.7 The Class Group 1 Algebraic Number Fields
X Definition. Let Λ = Zei be a full lattice in V . Then for any λ0 ∈ Λ, the set n X o D = λ0 + aiei : 0 ≤ ai < 1 is called a fundamental parallelopiped for Λ.
The shape of the parallelopiped depends on the choice of the ei, but for a fixed basis we may vary the λ ∈ Λ so that the parallelopipeds cover Rn without overlaps. Furthermore, if X D is a fundamental parallelopiped for Λ = Zei, the volume of D is given by
µ(D) = | det(e1, . . . , en)|. (Here µ is actually Lebesgue measure, but all our sets will have well-defined volumes.) Notice that if Λ = Zf1 + ... + Zfn then the change-of-basis matrix between {ei} and {fi} has determinant ±1, so the volume of D does not depend on the choice of basis for Λ. Definition. For a set T ⊂ Rn, we say T is convex if every pair of points in T is connected by a line that lies in T . Definition. A set T ⊂ Rn is symmetric about the origin if α ∈ T implies −α ∈ T . Lemma 1.7.7. Let D be a fundamental parallelopiped for a full lattice Λ in V and suppose S is a measurable subset of V . If µ(S) > µ(D) then S contains distinct points α and β such that β − α ∈ Λ. Proof. See 4.17 in [19].
1 It will be useful to let T be a subset of V such that for any α, β ∈ T , 2 (α − β) ∈ T , and 1 let S = 2 T . Then by Lemma 1.7.7, T contains the difference of any two points in S and so 1 −n T will contain a point of Λ r {0} whenever µ(D) < µ 2 T = 2 µ(T ). The main theorem en route to proving the Minkowski bound is a classic theorem in the geometry of numbers in its own right: Theorem 1.7.8 (Minkowski). Let T be a subset of V that is compact, convex and symmetric about the origin. If Λ is a lattice in V with fundamental parallelopiped D such that µ(T ) ≥ 2nµ(D) then T contains a point of Λ other than the origin. Proof. Let ε > 0. Then µ((1 + ε)T ) = (1 + ε)nµ(T ) > 2nµ(D)
so by the preceding comments, (1 + ε)T contains a point of Λ r {0}. T only contains finitely many points in Λ r {0} since Λ is discrete and T is compact (and so is (1 + ε)T ). Now since T is closed, \ T = (1 + ε)T ε>0 so if none of the finitely many points in Λ∩(1+ε)T other than the origin were in T , we could keep making ε > 0 smaller and smaller so that (1 + ε)T contains no point of Λ other than the origin. This of course contradicts the lemma, hence T contains a point in Λ r {0}.
28 1.7 The Class Group 1 Algebraic Number Fields
For a fascinating application of Minkowski’s theorem to the proof of the Four Squares Theorem, see Appendix A.1. Moving forward, let K be a number field with [K : Q] = n. Suppose K has r real embeddings {σ1, . . . , σr} and 2s complex embeddings {σr+1, σ¯r+1, . . . , σr+s, σ¯r+s}, so that n = r + 2s. Then we have an embedding
r s σ : K,→ R × C α 7→ (σ1(α),..., σ¯r+s(α)).
Let V = Rr × Cs and identify V with Rn using {1, i} as a basis for C. The relation between ideals of OK and lattices in V is contained in the following proposition.
Proposition 1.7.9. Let a ⊂ OK be any nonzero ideal. Then σ(a) is a full lattice in V and −s p the volume of any fundamental parallelopiped for σ(a) is 2 N(a) |dK |.
Proof. Let {α1, . . . , αn} be a basis for a as a Z-module. We claim that {σ(α1), . . . , σ(αn)} is a basis for σ(a). To prove this, we will show that the matrix A whose ith row is
(σ1(αi), . . . , σr(αi), re(σr+1(αi)), im(σr+1(αi)),..., re(σr+s(αi)), im(σr+s(αi)))
has nonzero determinant. First consider the matrix B with ith row
(σ1(αi), . . . , σr(αi), σr+1(αi), σ¯r+1(αi),..., σ¯r+s(αi)).
2 By definition of the discriminant, (det B) = D(α1, . . . , αn) 6= 0. Next we relate the determinants of A and B. If we perform the column operations 1 Cr+2 + Cr+1 → Cr+1 and − 2 Cr+1 + Cr+2 → Cr+2 to matrix B, we will have 2 re(σr+1(αi)) in Cr+1 and −i · im(σr+1(αi)) in Cr+2. Repeat this for the remaining pairs of columns to obtain a matrix A0. These column operations do not change det B and it’s easy to scale A0 to obtain A, so we have det B = det A0 = (−2i)s det A, or
−s −s 1/2 det A = (−2i) det B = ±(−2i) D(α1, . . . , αn) 6= 0.
Thus {σ(αi)} is a basis for σ(a), which proves σ(a) is a lattice in V of rank n. X Now we can write σ(a) = Zσ(αi) so the volume of the fundamental parallelopiped for σ(a) is simply | det A|. One can prove that
2 2 |D(α1, . . . , αn)| = [OK : a] |D(OK /Z)| = N(a) |dK |
−s 1/2 −s p (see 4.26 in [19] for details). Hence µ(D) = 2 |D(α1, . . . , αn)| = 2 N(a) |dK |.
∗ Lemma 1.7.10. Let a ⊂ OK be an integral ideal. Then a contains an element α ∈ K whose norm is bounded by
n! 4 s |N(α)| ≤ N(a)p|d |. nn π K
29 1.7 The Class Group 1 Algebraic Number Fields
Proof. For a fixed positive t ∈ R, define X(t) = {v ∈ V : ||v|| ≤ t}, where || · || is the Euclidean norm on V . Using complex analysis (see 4.27 in [19]), one can calculate
π s tn µ(X(t)) = 2r . 2 n! The set X(t) is compact (it is closed and bounded), convex and symmetric about the origin, and we may choose a large enough t so that
µ(X(t)) ≥ 2nµ(D)
where D is a fundamental parallelopiped for a. Then Minkowski’s theorem (1.7.8) says that X(t) contains some σ(α) 6= 0, for some α ∈ a. Consider
2 2 |N(α)| = |σ1(α)| · · · |σr(α)| · |σr+1(α)| · · · |σr+s(α)| (P |σ (α)| + P 2|σ (α)|)n ≤ i i (geometric mean ≤ arithmetic mean) nn tn ≤ . nn Now in the case that µ(X(t)) ≥ 2nµ(D), we must have by Proposition 1.7.9 that
π s tn 2n−r 2r ≥ 2n2−sN(a)p|d | ⇐⇒ tn ≥ n! N(a)p|d |. 2 n! k πs K
n 2n−r p So choose t ∈ R so that t = n! πs N(a) |dK |. Then the above work gives the result: n! 2n−r n! 4 s |N(α)| ≤ · N(a)p|d | = N(a)p|d |. nn πs K nn π K
We are now ready to prove the main theorem: Theorem 1.7.11. For a number field K, there exists a set of representatives for the class group C(OK ) consisting of integral ideals a whose norms satisfy n! 4 s N(a) ≤ p|d |. nn π K
∗ Proof. Let c be a fractional ideal in IK . There is some d ∈ K that clear the denominators of c−1, so b := dc−1 is an integral ideal. By Lemma 1.7.10 there exists a nonzero element β ∈ b with |N(β)| ≤ BK N(b). Note that βOK ⊂ b which implies βOK = ab for some integral −1 ideal a, with a ∼ b ∼ c in the class group. Then N(a)N(b) = |N(β)| ≤ BK N(b). Since N(b) = [OK : b] > 0, we can cancel to obtain N(a) ≤ BK .
In general we can compute C(OK ) with the following approach:
1) Use the results from Section 1.3 to list all prime ideals p ⊂ OK that appear in the factorization of any prime p ≤ BK .
30 1.7 The Class Group 1 Algebraic Number Fields
2) Find the group generated by the ideal classes [p] for the primes found in Step 1. √ Example 1.7.12. Let K = Q( −6). Note that n = 2, r = 0, s = 1 and dK = −24 so
2! 4 1 √ B = 24 ≈ 3.1. K 22 π √ Thus C(OK ) is generated by the√ prime ideals lying over 2 and 3. Note that OK = Z[ −6] and the minimal polynomial of −6 over Q is x2 + 6. Factoring this mod 2 and 3, we see that √ √ p2 = (2, −6) and p3 = (3, −6) 2 2 generate the class group. Also, 2 and 3 ramify so 2OK = p2 and 3OK = p3 so each of these prime ideals has order at most 2 in C(O√K ). Suppose p2 = (α) for some α = a + b −6 ∈ OK . Then
2 2 2 = N(p2) = |N(α)| = a + 6b ,
2 2 but a + 6b = 2 has no integer solutions. Thus p2 is not principal. By a similar argument, p3 is not principal either. Hence p2 and p3 both belong to classes of order 2 in C(OK ). Furthermore, observe that √ √ √ √ √ p2p3 = (2, −6)(3, −6) = (6, 2 −6, 3 −6) ⊂ ( −6) √ √ √ but the norms of (6, 2 −6, 3 −6) and ( −6) are both 6, so they must be the same ideal. Hence p2p3 is principal so C(OK ) = hp2i and hK = 2. √ √ Example 1.7.13. Let K = Q( −19) with ring of integers OK = Z[(1 + −19)/2]. Since n = 2, r = 0, s = 1 and dK = −19, the Minkowski bound for K is
2! 4 1 √ B = 19 ≈ 2.775. K 22 π
So every class in C(OK ) is represented by a prime ideal with norm either 1 or 2.√ The ideal 2OK is unramified in K since 2 - dK . The minimal polynomial of α = (1 + −19)/2 is 2 −19 f(x) = x − x + 5, so because 2 = −1 and f has no roots mod 2, Theorem 1.3.4 tells us that 2OK is inert and thus prime in K. Clearly this is principal,√ so the class group is trivial. By previous comments h(−19) = 1 implies that Z[(1 + −19)/2] is a PID. √ √ Example 1.7.14. Let K = Q( −2) with OK = Z[ −2]. Note that n = 2, r = 0, s = 1 and dK = −8 so the Minkowski bound is calculated to be
2! 4 1 √ B = 8 ≈ 1.801. K 22 π √ √ It easily follows that C(OK ) is trivial and hence Z[ −2] is a PID. In particular, Z[ −2] has unique factorization. We will use this fact to deduce a famous theorem of Fermat whose proof was first discovered by Euler.
31 1.8 The Hilbert Class Field 1 Algebraic Number Fields
Theorem 1.7.15 (Fermat). The only integer solutions to x3 = y2 + 2 are (3, ±5). √ Proof. First suppose ab = u3 in Z[ −2] where a and b are relatively prime. We will show √ √ Y that a and b must be cubes in [ −2]. Since [ −2] is a UFD, we may write u = γ pei √ Z Z i for primes pi ∈ Z[ −2], integers ei and some unit γ. Then 3 3 Y ei 3 Y 3ei ab = u = γ pi = γ pi .
Since a and b are relatively prime, each pi appears in exactly one of the factorizations for a and b. So by the above equality, a and b each factor into products of primes whose exponents are all 3ei. We have not worried about√ the unit γ yet, but that is because the units in K are ±1, each of√ which is a cube in Z[ −2] anyways. Thus we conclude that a and b are both cubes in Z[ −2]. √ √ 3 2 Now suppose√ (x, y) is an integer√ solution to x = y + 2 = (y + −2)(y − −2). If d divides both y + −2 and y − −2, then it divides their difference: √ √ √ (y + −2) − (y − −2) = 2 −2. √ √ However −2 is prime in Z[ −2] (norm is multiplicative), so d must divide 2. Suppose x were even. Then we would have y2 + 2 ≡ x3 ≡ 0 (mod 8), or y2 ≡ −2 (mod 8). Of course 2 −2 is not a square mod 8, so x must√ be odd. This√ forces y to be odd as well, so d | y + 2 implies that d must be 1. Hence y + √−2 and y − √−2 are relatively prime. √ By the first part of the proof, y + −2 and y − −2 are both cubes in Z[ −2]. Write √ √ √ y + −2 = (a + b −2)3 = (a3 − 6ab2) + (3a2b − 2b3) −2.
We now solve for a and b to show that (3, ±5) are the only valid choices for (x, y). From the above, we see that 1 = 3a2b − 2b3 = b(3a2 − 2b2). Since a and b are integers, this implies b = ±1. If b = −1, the other factor is 3a2 + 2 = 1, which can be written 3a2 = −1. This of course is impossible. So b = 1 and this means 3a2 − 2 = 1 which has solutions a = ±1. Plugging these values in above, we see that y = ±5 and x = 3.
1.8 The Hilbert Class Field
Prime ideals p ⊂ OK are often referred to as finite primes to distinguish them from infinite primes, which are defined as
Definition. A real infinite prime of a number field K is an embedding σ : K,→ R, while a complex infinite prime is a pair of conjugate embeddings σ, σ¯ : K,→ C. We will see why it is useful to include infinite primes in our list of primes of K in Section 2.1. For now, we use it to define unramified extensions of a number field, but first we need to define when an infinite prime ramifies.
Definition. Given an extension L/K, an infinite prime σ of K is said to ramify in L if σ is real and has an extension to L which is complex.
32 1.8 The Hilbert Class Field 1 Algebraic Number Fields
√ Example√ 1.8.1. The infinite prime σ : Q ,→ R is unramified in Q( 2) but σ is ramified in Q( −2). Definition. We say an extension of number fields L/K is unramified if every prime in K, finite or infinite, is unramified in L.
A number field may have unramified extensions of arbitrary degree – the work of Golod and Shafarevich [11] in the 1960s was famous for its rather complicated examples. However, if we restrict our focus to unramified abelian extensions, the theory becomes more tractable.
Theorem. For every number field K, there exists a finite Galois extension L ⊃ K such that L is an unramified abelian extension of K, and L contains every other unramified abelian extension of K.
Proof. This will follow from a more general result established in Section 2.9.
Definition. The Hilbert class field of a number field K is the maximal unramified abelian extension of K.
For now we will assume the existence of the Hilbert class field and further develop the connections between Hilbert class fields and rings of integers. The main tool in describing this relationship is the Artin symbol, whose existence is proved in the following lemma.
Lemma 1.8.2. Let L/K be a Galois extension, p ⊂ OK an unramified prime and P a prime of OL lying over p. Then there is a unique element σ ∈ Gal(L/K) such that for all α ∈ OL,
σ(α) ≡ αN(p) mod P
where N(p) = [OK : p] is the norm of p.
Proof. Let D = DP and I = IP be the decomposition and inertia groups of P ⊃ p. Let ` = OL/P and k = OK /p, with Ge = Gal(`/k). Recall from Proposition 1.4.5 that each σ ∈ D maps via ϕ to an elementσ ˜ ∈ Ge. Since p is unramified in L, |I| = e(P | p) = 1 and since ker ϕ = I by Proposition 1.4.6, ϕ is an isomorphism. Let q = N(p) = |OK /p|. It is well known that Ge is a cyclic group generated by the Frobenius automorphism x 7→ xq. Thus there is a unique σ ∈ G which maps to the Frobenius automorphism. Finally, since q = N(p), this σ satisfies the lemma.
Definition. For a given prime P ⊂ OL, the unique element σ ∈ DP described above is called L/K the Artin symbol, denoted . It satisfies P
L/K (α) ≡ αN(p) mod P P
L/K for all α ∈ O , where p = P ∩ O . If p = O ∩ P then is called a Frobenius L K K P element for p.
33 1.8 The Hilbert Class Field 1 Algebraic Number Fields
We will describe Frobenius automorphisms in greater detail in Section 2.2 but for now we will focus on their relation to the Hilbert class field.
Proposition 1.8.3. For a Galois extension L/K, an unramified prime p ⊂ OK and a prime P ⊃ p, the Artin symbol has the following properties. L/K L/K (i) For all σ ∈ Gal(L/K), = σ σ−1. σ(P) P L/K (ii) The order of in D is the inertial degree f = f(P | p). P P L/K (iii) p splits completely in L ⇐⇒ = 1. P L/K Proof. (i) follows from the uniqueness of and the fact, proven in Section 1.3, that P all primes lying over p are conjugates under the action of Gal(L/K). ∼ (ii) From Lemma 1.8.2, DP = Ge = Gal(`/k) and the order of Ge is [OL/P : OK /p] = f. L/K By definition, the Artin symbol maps to a generator of Ge so the order of is f. P (iii) Recall that p splits completely if and only if e = f = 1. Then e = 1 since we are L/K assuming p is unramified in L, and f = 1 ⇐⇒ = 1 follows from part (ii). P Since L/K is abelian, the Artin symbol only depends on the underlying prime p: if P 0 0 and P are both primes of OL containing p, then P = σ(P) for some σ ∈ Gal(L/K) as we have already shown. Thus (i) of the proposition implies L/K L/K L/K L/K L/K = = σ σ−1 = σσ−1 = . P0 σ(P) P P P L/K We will write the Artin symbol as to indicate that it is determined by the underlying p prime p ⊂ OK . The Artin symbol is the first step in establishing a powerful tool in class field theory called Artin reciprocity (Section 2.7). The name comes from the fact that it is a generalization of more elementary reciprocity laws, such as quadratic, cubic and biquadratic reciprocities established by Euler, Legendre and Gauss. The next example shows that the Artin symbol properly encapsulates cubic reciprocity. √ √ 3 2πi/3 Example√ 1.8.4. Let K = Q( −3) and L = K( 2). Here OK = Z[ω] where ω = e = −1+ −3 2 . Note that for the extension K/Q, we have n = 2, r = 0, s = 1 and dK = −3 so the Minkowski bound for K is 2! 4 1 √ B = 3 ≈ 1.103. K 22 π As we have seen before, this shows that K has class number 1, which is equivalent to Z[ω] being a PID.
34 1.8 The Hilbert Class Field 1 Algebraic Number Fields
Knowing that the ring of integers is a PID is important, since any prime ideal can be ∼ written πZ[ω] for some prime element π ∈ Z[ω]. One can calculate that Gal(L/K) = Z/3Z L/K but the important part is that Gal(L/K) is abelian, so is defined. In fact the entire √ π automorphism is determined by its action on 3 2: L/K √ 2 √ ( 3 2) = 3 2 π π 3 2 where is the cubic Legendre symbol, defined to be the unique cubic root of unity to π 3 (N(π)−1)/3 which 2 is congruent mod π. Specifically, let P be a prime of OL lying over π. Then by definition, L/K √ √ 2 √ ( 3 2) ≡ 2(N(π)−1)/3 · 3 2 ≡ 3 2 mod P. π π 3 Hence the Artin symbol generalizes the cubic Legendre symbol!
When L/K is an unramified abelian extension, things are especially nice. Let IK be the Y ri group of fractional ideals of OK . For any a ∈ IK with prime factorization a = pi we can define the Artin symbol on a by
r L/K Y L/K i = . a pi Definition. The Artin map for an extension L/K is the homomorphism
L/K : I −→ Gal(L/K). · K
Notice that if L/K is ramified at any primes, the Artin map is not defined for all of IK . Likewise if Gal(L/K) is not abelian, the Artin symbol may not be uniquely defined for all p ∈ IK . For this reason many of the main theorems in class field theory are complicated to state, as we will see in Chapter 2. However when L is the Hilbert class field of K we have the following characterization of the Artin map. Theorem 1.8.5 (Artin Reciprocity for the Hilbert Class Field). If L is the Hilbert class field of a number field K, the Artin map L/K : I −→ Gal(L/K) · K ∼ is surjective and its kernel is PK . Therefore the Artin map induces an isomorphism C(OK ) = Gal(L/K) where C(OK ) = IK /PK is the ideal class group. Proof. This will follow from the full Artin reciprocity theorem in Section 2.7. Using Galois theory, we have the following classification of unramified abelian extensions of K.
35 1.8 The Hilbert Class Field 1 Algebraic Number Fields
Corollary 1.8.6. For a number field K, there is a one-to-one correspondence
unramified abelian extensions subgroups ←→ . M ⊃ K H ≤ C(OK )
Furthermore, if the extension M/K corresponds to the subgroup H, then the Artin map ∼ induces an isomorphism C(OK )/H = Gal(M/K). Proof. This too will be proven in a more general setting in Section 2.9. This is a good example of the general strategy employed in class field theory: describe a certain type of extensions of K – in this case unramified abelian extensions – using informa- tion encoded in K itself, e.g. subgroups of the class group.
Corollary 1.8.7. Let L be the Hilbert class field of a number field K and let p ⊂ OK be a prime ideal. Then p splits completely in L ⇐⇒ p is a principal ideal. L/K Proof. By (iii) of Proposition 1.8.3, p splits completely if and only if = 1. Since the p ∼ Artin map induces C(OK ) = Gal(L/K) by the Artin reciprocity theorem (Theorem 1.8.5), L/K = 1 ⇐⇒ [p] is trivial in the class group, which is equivalent to p being a principal p ideal. The Hilbert class field has an important application to the study of primes of the form p = x2 + ny2.
Theorem 1.8.8 ([7]). Let n > 0 be a squarefree integer such that n 6≡ 3 (mod 4). Then there is√ a monic irreducible polynomial fn(x) ∈ Z[x] of degree h(−4n) – the class number of K = Q( −n) – such that if p is an odd prime that does not divide n or the discriminant of fn, then −n p = x2 + ny2 ⇐⇒ = 1 and f (x) ≡ 0 (mod p) has an integer solution. p n
Furthermore, any choice of fn(x) will be the minimal polynomial of a real algebraic integer α for which L = K(α) is the Hilbert class field of K.
We devote the rest of this section to the proof of Theorem 1.8.8 and its applications. The first step is to relate p = x2 + ny2 to the splitting behavior of p in the Hilbert class field. √ Theorem 1.8.9. Let L be the Hilbert√ class field of K = Q( −n), where n > 0 is squarefree and n 6≡ 3 (mod 4), so that OK = Z[ −n]. If p is an odd prime not dividing n, then p = x2 + ny2 ⇐⇒ p splits completely in L.
Proof. We will prove √ dK = −4n ⇐⇒ OK = Z[ −n] ⇐⇒ n is squarefree and n 6≡ 3 (mod 4)
36 1.8 The Hilbert Class Field 1 Algebraic Number Fields
in the next section. For now, assume the conditions on n imply that dK = −4n. Let p be an odd prime not dividing n, so that p - dK . By Theorem 1.6.5 this means that p is unramified in K. To prove the theorem, we will prove
2 2 (i) p = x + ny ⇐⇒ pOK = pq where p 6= q and p is principal in OK (ii)
⇐⇒ pOK = pq, p 6= q and p splits completely in L (iii) ⇐⇒ p splits completely in L. (iv)
2 2 √ √ √ (i) ⇐⇒ (ii) Suppose p = x + ny = (x + y −n)(x − y −n). Let p = (x + y√−n)OK . Then pOK = pq must be the prime factorization of pOK , where q = p¯ = (x − y −n)OK . Since p is unramified, p 6= q. This entire argument is reversible, so we have proved the first equivalence. (ii) ⇐⇒ (iii) follows from Corollary 1.8.7. (iii) ⇐⇒ (iv) First we prove that L is Galois over Q. To do this, let τ denote complex conjugation. It is easy to see that τ(L) is an unramified abelian extension of τ(K) = K. Then since [τ(L): K] = [L : K] and L is the maximal unramified abelian extension of K by definition, we must have τ(L) = L. Hence τ ∈ Gal(L/K) and this implies L/Q is Galois by conventional Galois theory arguments. To finish the final equivalence, note that condition (iii) says that p splits in K and some prime lying over p splits in L. Since L/Q is Galois, this is the same as p splitting in L. Hence p = x2 + ny2 if and only if p splits completely in L. The next step is to further describe the criteria for when p splits in L.
Theorem 1.8.10. Let K be an imaginary quadratic field and L be a finite extension of K that is Galois over Q. Then (1) There exists a real algebraic integer α such that L = K(α).
(2) Let f denote the minimal polynomial of α over Q, with f(x) ∈ Z[x]. If p is an odd prime not dividing the discriminant of f(x), then
d p splits in L ⇐⇒ K = 1 and f(x) ≡ 0 (mod p) has an integer solution. p
Proof. (1) By hypothesis, L/Q is Galois so [L ∩ R : Q] = [L : K] since L ∩ R is the fixed field of complex conjugation. Then for any α ∈ L ∩ R, L ∩ R = Q(α) precisely when L = K(α). Hence if α ∈ OL ∩ R such that L ∩ R = Q(α) then α is a real algebraic integer generating the extension L/K. Such an element exists by the primitive element theorem. (2) Now let f be the minimal polynomial of α over Q. By the first part, [L ∩ R : Q] = [L : K] so f is also the minimal polynomial of α over K. Let p be a prime not dividing the discriminant of f(x). Then f(x) is separable mod p, so by Proposition 1.3.6,
d pO = pp¯ where p 6= p¯ ⇐⇒ K = 1. K p
37 1.8 The Hilbert Class Field 1 Algebraic Number Fields
∼ We may assume p splits completely in K, so that Z/pZ = OK /p. Since f(x) is separable over Z/pZ, it is separable over OK /p. Then Theorem 1.3.4 gives us
p splits completely in L ⇐⇒ f(x) ≡ 0 mod p is solvable in OK ⇐⇒ f(x) ≡ 0 mod p is solvable in Z. Finally (2) is proven using (iii) ⇐⇒ (iv) from the previous proof. We are now ready to prove Theorem 1.8.8. √ Proof. Since the Hilbert class field L of K = Q( −n) is Galois over Q, Theorem 1.8.10 says there is a real algebraic integer α which is a primitive element of the extension L/K. Let fn be its minimal polynomial and let p be a prime that does not divide n or the discriminant of fn. Then the previous two theorems show that p = x2 + ny2 ⇐⇒ p splits completely in L −n ⇐⇒ = 1 and f (x) ≡ 0 mod p is solvable in . p n Z
As discussed in the proof of Theorem 1.8.9, the hypotheses imply that dK = −4n so d −n K = . p p
It remains to show that deg fn = h(−4n), but by√ Artin reciprocity, [L : K] = | Gal(L/K)| = |C(OK )|, and h(−4n) = |C(OK )| when K = Q( −n), so the theorem is proved.
The polynomial fn(x) is not unique since L/K has infinitely many primitive elements. We can at least use this theorem to predict deg fn, and later we will see that fn(x) completely describes the Hilbert class field – quite an amazing result indeed! The Hilbert class field also allows us to relate the ideal class group C(OK ) to the form class group C(dK ) for binary quadratic forms. In Section 3.2 we prove
Theorem. Let K be an imaginary quadratic field of discriminant dK = −4n, n ≥ 1. (1) If f(x, y) = ax2 + bxy + cy2 is a primitive positive definite quadratic form of discrim- inant dK , then p p [a, (−b + dK )/2] = {ma + n(−b + dK )/2 | m, n ∈ Z}
is an ideal of OK . √ ∼ (2) The map f(x, y) 7→ [a, (−b + dK )/2] is an isomorphism between C(OK ) = C(dK ) and hence |C(OK )| = h(dK ) which is the number of reduced forms of discriminant dK . √ Example 1.8.11. Let K = Q( −14). Here dK = −56 and the reduced forms of discrimi- nant −56 are: x2 + 14y2 2x2 + 7y2 3x2 ± 2xy + 5y2.
38 1.8 The Hilbert Class Field 1 Algebraic Number Fields
Moreover, only x2 + 14y2 and 2x2 + 7y2 belong to classes of order at most 2 (see chapter 3 ∼ ∼ in [7]). Thus C(−56) = Z/4Z and by the above theorem C(OK ) = Z/4Z. We know from Theorem 1.8.8 that there is a polynomial f14(x) such that −14 p = x2 + 14y2 ⇐⇒ = 1 and f (x) ≡ 0 mod p has an integer solution. p 14
We determined above that h(−56) = so deg f14 = 4, but we don’t yet know how to find this p √ polynomial. Let L = K(α) where α = 2 2 − 1. Cox [7] suggests that L is the Hilbert class field of K. To check this, we need the following lemma. √ Lemma 1.8.12. Let L = K( β) for some β ∈ OK and let p ⊂ OK be a prime ideal. Then p is unramified in L if either of the following two conditions are met: (i) 2β 6∈ p, or
2 (ii) 2 ∈ p, β 6∈ p and β = b − 4c for some b, c ∈ OK . Proof. (i) Since the discriminant of x2 − β is 4β 6∈ p, x2 − β is separable mod p and hence p
is unramified by Theorem 1.3.4. √ −b+ β 2 (ii) Note that L = K(γ) as well, where γ = 2 is a root of x +bx+c. The discriminant of x2 + bx + c is b2 − 4c 6∈ p so by Theorem 1.3.4 again, p is unramified. √ Now we can prove the claim about the Hilbert class field H of K = Q( −14). The reciprocity theorem tells us that [H : K] = h(−56) = 4 and H is unique, so it suffices to prove that L = K(α) is an unramified abelian extension of degree 4 over K. It’s easy to see that [L : K] = 4 by standard arguments, and this means L/K is guaranteed to be abelian, so the only thing we must check is that L/K is unramified at√ every prime. Note that every infinite√ prime is unramified,√ since K = Q( −14) is imaginary quadratic. Observe that α2 = 2 2 − 1 implies that 2 ∈ L, so we have a tower √ K ⊂ K( 2) ⊂ L. √ √ The result will follow if we show that K( 2)/K and L/K( √2) are both unramified. First suppose p ⊂ OK is prime (and finite). Let E = K( 2).√ By (i) of Lemma√ 1.8.12, p is unramified√ in E when 2 6∈ p so√ let us assume 2 ∈ p. Since −14 ∈ K and 2 ∈ E, we also have −7 ∈ E, i.e. E = K( −7). Then −7 6∈ p and −7 = 12 − 4 · 2 imply by (ii) of the Lemma that p is unramified in E. √ √ 0 Now consider the other extension√ L/E. If we let µ = 2 2 − 1 and µ = −2 2 − 1, it’s √ 0 0 easy to see that L = E( µ) = E( µ ). Let p ⊂ OE be prime. If 2 6∈ p then µ + µ = −2 0 implies that either µ 6∈ p or µ 6∈ p. By (i) of Lemma 1.8.12, this shows√ that p is unramified in L. On√ the other hand, if 2 ∈ p we see that µ 6∈ p since µ = 2 2 − 1. Also note that µ = (1 + 2)2 − 4 so (ii) of the Lemma proves p is unramified in L. Hence L = K(α), where p √ √ α = 2 2 − 1, is the Hilbert class field of K = Q( −14). This example allows us to prove the following characterization for primes p = x2 + 14y2. Note that this is our first big application of class field theory to the main question of when primes have the form x2 + ny2.
39 1.8 The Hilbert Class Field 1 Algebraic Number Fields
Theorem 1.8.13. Let p 6= 7 be an odd prime. Then −14 p = x2 + 14y2 ⇐⇒ = 1 and (x2 + 1)2 ≡ 8 mod p for some x ∈ . p Z p √ √ Proof. As above, let α = 2 2 − 1 and K = Q( −14), so that L = K(α) is the Hilbert 4 2 2 2 class field of K. It can be shown that f14(x) = x + 2x − 7 = (x + 1) − 8 is the minimal 14 polynomial of α by basic root analysis. The discriminant of f14 is −2 · 7 which explains why we exclude p = 2, 7. Then by the main result, Theorem 1.8.8, −14 p = x2 + 14y2 ⇐⇒ = 1 and f (x) ≡ 0 mod p. p 14
2 2 Since f14(x) = (x + 1) − 8, the theorem follows immediately. √ Example 1.8.14. Let K = Q( −17). We will repeat the steps of the last example and prove a result for primes of the form p = x2 + 17y2 similar to Theorem 1.8.13. Note that n = 2, r = 0, s = 1 and dK = −68 so the Minkowski bound is computed as
2! 4 1 √ B = 68 ≈ 5.250. K 22 π
Thus the class group C(OK ) is generated by prime ideals with norm ≤ 5. These correspond to ideals pOK for p = 2, 3 and 5. Corollary 1.6.15 tells us that of these, only 2 ramifies, so we have the following factorizations:
2 2OK = p2 where p2 is prime. Using quadratic reciprocity, we calculate −17 −1 17 −1 2 = = = −1 · −1 = 1. 3 3 3 3 3 Thus by our characterization of quadratic extensions in Proposition 1.3.6, 3 splits in 0 0 K and we write 3OK = p3p3 for prime ideals p3 6= p3. Likewise, for 5 we have −17 −1 17 −1 2 = = = 1 · −1 = −1. 5 5 5 5 5
So 5 is inert, i.e. 5OK is prime.
0 This shows that C(OK ) may be generated by [p2] and [p3],√ since p3p3 is principal. 2 2 Suppose p2 is principal, say p2 = αOK for α = a + b −17. Then 2OK = p2 = α OK 2 2 2 so we must have 4 = N (2OK ) = N (α) , or N (α) = ±2. However a + 17b = ±2 has no integer solutions, so p2 must not be principal. Thus its ideal class is an element of order 2 2 in the class group. Similar arguments shows that p3 is not principal, and that p3 = p2. Therefore |C(OK )| = 4.
40 1.8 The Hilbert Class Field 1 Algebraic Number Fields
q √ We claim that the Hilbert class field of K is L = K(α), where α = (1 + 17)/2, following a suggestion [7]. The work above shows the Hilbert class field is a degree 4 extension of K, so it suffices to show that L = K(α) is an unramified abelian extension of degree 4 over K, from which it will follow from the uniqueness of the Hilbert class field.√ It’s easy to verify, using the minimal polynomial x2 − x − 4 for α2 = (1 + 17)/2, that the minimal polynomial for α is f(x) = x4 − x2 − 4 which splits in L. This shows that L/K is Galois, so [L : K] = 4. Of course every group of order 4 is abelian, so L/K is an abelian extension. It remains to check that L/K is ramified at every√ prime of OK . Of course any infinite prime is unramified since K = Q( −17) is imaginary quadratic and thus has√ no real embeddings. We will use Lemma 1.8.12 to show that E/K and L/E, where E = K( 17), are both unramified√ extensions and it√ will follow that L/K√is unramified. As a sidenote, observe that α2 = (1 + 17)/2 implies 17 ∈ L, so K ⊂ K( 17) ⊂ L and thus it makes sense to define the extensions E/K and L/E. Let p be a prime ideal of OK . Since (i) of Lemma 1.8.12 tells us that p is unramified in E whenever 2 6∈ p, let us assume 2 ∈ p. Note that 17 6∈ p and 17 can be written
17 = 12 − 4(−4)
and 1, −4 ∈ Z ⊂ OK so (ii) of the lemma tells us that p is unramified in E. Thus E/K is an unramified extension. √ √ 0 Now we turn our√ attention to L/E. Let µ = (1 + 17)/2 and µ = (1 − 17)/2, so that √ 0 L = E( µ) = E( µ ). Suppose p ⊂ OE is a prime ideal; we may assume 2 ∈ p by (i), and furthermore 1 6∈ p, else it’s the whole ring of integers. Notice that µ + µ0 = 1 6∈ p, so that either µ 6∈ p or µ0 6∈ p. But these each satisfy x = x2 − 4 so (ii) of the lemma tells us that p is unramified. We have shown L/K to be an unramified abelian extension of degree 4, so by uniqueness it is the Hilbert class field. We now use this to prove a theorem for primes of the form x2 + 17y2 as we did before for n = −14.
Theorem 1.8.15. Let p 6= 17 be an odd prime. Then
−17 p = x2 + 17y2 ⇐⇒ = 1 and x2(x2 − 1) ≡ 4 mod p has an integer solution. p √ Proof. Let K = Q( −17). We proved that the Hilbert class field of K is L = K(α) where q √ 4 2 α = (1 + 17)/2. We also know that the minimal polynomial for α is f17(x) = x −x −4 = 2 2 16 2 x (x − 1) − 4. Note that the discriminant of f17 is −2 · 17 which explains why we remove p = 2 and 17 from consideration. The result follows from Theorem 1.8.8. It is clear that even when K is only quadratic, the Hilbert class field is nontrivial to compute. In the next example we show how Magma can be used to facilitate these compu- tations. √ Example 1.8.16. Let K = Q( −47). According to Magma, the class number of K is 5:
41 1.9 Orders 1 Algebraic Number Fields
1.9 Orders
In the previous section we were able to prove a full characterization of when a prime is of the form p = x2 + ny2 given certain restrictions on n. We have thus described the main question for infinitely many√ n, but what about the rest? In general, if K = Q( n) we have the following characterization (see [7]) of the ring of integers: ( √ Z[ n] if n 6≡ 1 (mod 4) √ OK = h 1+ n i Z 2 if n ≡ 1 (mod 4). Recall that for a quadratic extension, the field discriminant is given by ( n if n ≡ 1 (mod 4) dK = 4n otherwise.
Using this allows us to write the ring of integers more succinctly: √ d + d O = K K . K Z 2
42 1.9 Orders 1 Algebraic Number Fields
The√ important thing is that when n does not√ satisfy the criteria in Section 1.8, i.e. when Z[ −n] is√ not the full ring of integers for Q( −n), we still have a characterization that involves Z[ −n]. We will make some headway on the x2 + ny2 question towards the end of this section, but a full characterization of primes of the form x2 + ny2 will not be possible until we have the√ theorems of class field theory at our disposal. The ring Z[ −n] is an example of an order. In Lemma 1.9.2 we will prove that the following definition is equivalent to the one given in Section 1.1. Definition. Let K be a number field. Then a subring O ⊂ K is an order if
1K ∈ O
O is finitely generated as a Z-module
O contains a Q-basis of K. There is a more general notion of an order in an arbitrary ring R, but the behavior is quite different even when R is not a field. We will primarily make use of orders in quadratic fields. Proposition 1.9.1. Let O be an order in a quadratic number field K.
(1) O is a free Z-module of rank 2. (2) K is the field of fractions of O.
(3) OK is an order in K containing every other order. In other words OK is the maximal order in K.
Proof. (1) Clearly O is torsion free, so since it is a Z-module it is free. Also, since O contains a Q-basis of a quadratic field, O is at least rank 2, so it must be exactly rank 2. (2) follows from the fact that O contains a Q-basis for K. (3) Since 1K ∈ OK and OK is a Z-module of rank [K : Q] = 2 by Proposition 1.1.13, it suffices to show that OK contains a√ basis for K/Q. But this follows from the discussion dK + dK above: OK is generated by 1 and 2 . Now let O be any order in K. Since O is a free Z-module, it is noetherian. Let α ∈ O and consider the chain of Z-submodules I0 ⊂ I1 ⊂ I2 ⊂ · · · where I0 = Z and for n ≥ 1,
2 n In = Z + αZ + α Z + ... + α Z.
By the noetherian condition, there is some n such that for all m ≥ n, Im = In. So for all such m we have Z + αZ + ... + αmZ = Z + αZ + ... + αnZ. This implies αm = αi for some 1 ≤ i ≤ n and thus the powers of α are finite. This shows that Z[α] is finitely generated as a Z-module, so Proposition 1.1.4 shows α ∈ OK . Thus O ⊂ OK . Recall that Z + niZ is an order in Q(i) for every nonzero n ∈ Z. The next lemma shows that this is essentially the form of every order in a quadratic field.
Lemma 1.9.2. Let O be an order in a quadratic field K with discriminant dK and ring of integers OK . Then f = [OK : O] is finite and O = Z + fOK .
43 1.9 Orders 1 Algebraic Number Fields
Proof. The finiteness of f is a result of the fact that O and OK are both free Z-modules of rank 2. On one hand, since f = [OK : O] we have
fOK ⊂ O =⇒ Z + fOK ⊂ O.
On the other hand, our description of OK at the beginning of the section allows us to write Z + fOK = [1, fwK ], where √ d + d w = K K . K 2
Clearly [1, fwK ] has index f in [1, wK ] = OK , which proves the result.
Definition. The index f = [OK : O] is called the conductor of the order. This is not to be confused with the conductor of an extension in class field theory, which will be discussed in Section 2.8. To add to the clutter, each order has an associated value called the discriminant which is distinct from, although related to, the field discriminant. Definition. For an order [α, β], its discriminant is defined to be
α β 2 D = det α0 β0 where α0 and β0 denote the respective images of α and β under the nontrivial automorphism of K/Q. α β The discriminant of an order is independent of the basis chosen, since if A = then α0 β0 changing basis is done by conjugating A by some invertible matrix B, but this doesn’t change the determinant calculation above. Therefore we can let O = [1, fwK ] as in Lemma 1.9.2 2 and have D = f dK . This shows that an order is determined by its conductor. Moreover, the maximal order OK has conductor 1 which shows that the discriminant of OK is dK . √By our description of dK for quadratic√ fields, we see that D ≡ 0, 1 (mod 4). Let K = Q( −n) for any integer n. Then Z[ −n] is an order in K with discriminant −4n. By the 2 comments√ above, −4n = f dK which makes it relatively easy to compute the conductor of Z[ −n]. In fact, if D ≡ 0 or 1 (mod 4) there will be an in order in a quadratic field whose discriminant is D. For D ≡ 0 (mod√ 4), we may write D = 4n and see that the maximal order OK = [1, wK ] in K = ( n) has discriminant dK = 4n = D. On the other hand, √ Q √ h 1+ D i if D ≡ 1 (mod 4), Q( D) has ring of integers OK = Z 2 which has discriminant dK = D. Recall that OK is a Dedekind domain and has unique factorization of ideals. Unfortu- nately this is not true in general for an order O ( OK so our description of the ideals of O requires a bit more care. It turns out that we can still define a class group C(O) by restrict- ing to certain types of ideals. One should view the subsequent construction as a precursor to the types of constructions used in class field theory in Chapter 2. Proposition 1.9.3. Let a be a nonzero ideal in an order O of K. Then the quotient O/a is finite.
44 1.9 Orders 1 Algebraic Number Fields
Proof. By Proposition 1.5.2, every nonzero ideal a of the maximal order OK has finite index in OK . If b is a nonzero ideal in an order O of K, Proposition 1.9.1 tells us that O ⊂ OK so that b ⊂ OK . Then [OK : b] = [OK : O][O : b] and the left side is finite, so [O : b] must also be finite. This allows us to define
Definition. For an order O, the norm of an O-ideal a is N(a) = [O : a].
For any nonzero ideal a ⊂ O, O ⊆ {β ∈ K : βa ⊂ a}, but equality may not always hold. The ideals for which equality does hold have a special name.
Definition. An ideal a of an order O is a proper ideal if O = {β ∈ K : βa ⊂ a}.
Notice that principal ideals are always proper. Also, every ideal of the maximal order OK is proper. From this definition we proceed with our construction of a class group for O by defining an analog of fractional ideals.
Definition. For an order O, a fractional O-ideal is a subset of K which is finitely gener- ated as an O-module. We say a fractional O-ideal b is proper if O = {β ∈ K : βb ⊂ b}.
Proposition 1.9.4. Every fractional O-ideal is of the form αa for some nonzero α ∈ K and ideal a ⊂ O.
Proof. This is identical to the property for fractional ideals of a Dedekind domain; see Section 1.2.
Lemma 1.9.5. Let K = Q(α) be a quadratic field and suppose ax2 + bx + c is the minimal polynomial for α – we may assume (a, b, c) = 1. Then [1, α] is a proper fractional ideal of the order [1, aα] in K.
Proof. First, [1, aα] is an order by Lemma 1.9.2 since [1, aα] = Z + aαOK and aα is an algebraic integer. Now suppose β ∈ K such that β[1, α] ⊂ [1, α]. This is equivalent to
β · 1 ∈ [1, α] and β · α ∈ [1, α].
The first of these gives us β = j + kα for j, k ∈ Z, so we can write the second as k ck bk β · α = (j + kα)α = jα + kα2 = jα + (−bα − c) = − + − + j α. a a a
By hypothesis (a, b, c) = 1 so the above shows β · α ∈ [1, α] if and only if a | k. This implies
{β ∈ K : β[1, α] ⊂ [1, α]} = [1, aα]
proving [1, α] is a proper fractional ideal of [1, aα]. For orders in a quadratic field, we have a nice characterization of their fractional ideals.
Proposition 1.9.6. A fractional O-ideal a is proper if and only if a is invertible.
45 1.9 Orders 1 Algebraic Number Fields
Proof. ( ⇒ = ) If a is invertible, there exists some fractional O-ideal b such that ab = O. Suppose β ∈ K such that βa ⊂ a. Then βO = β(ab) = (βa)b ⊂ ab = O. This implies β ∈ O so a is a proper fractional O-ideal. ( =⇒ ) Suppose a ⊂ O is a proper fractional ideal. Since K is quadratic, a is a free γ Z-module of rank 2, so a = [β, γ] for some β, γ ∈ K. Let α = β ; then a = β[1, α] and Lemma 1.9.5 implies that O = [1, aα] where ax2 + bx + c is the minimal polynomial of α over Q. Let z 7→ z0 be the nontrivial automorphism in Gal(K/Q). Since α0 is also a root of ax2 + bx + c, Lemma 1.9.5 also shows that a0 = β0[1, α0] is a fractional O-ideal. We will show that aaa0 = N(β)O. Note that aaa0 = aββ0[1, α][1, α0] = N(β)[a, aα, aα0, aαα0].
0 b 0 c Also observe that α + α = − a and αα = a , so aaa0 = N(β)[a, aα, −b, c] = N(β)[1, aα] = N(β)O since (a, b, c) = 1. This proves the claim, and it follows that a is invertible. √ √ Example 1.9.7.√ O = Z[ −3] is an order of conductor 2 in K = Q( −3). Consider the ideal [2, 1 + −3] in O. It’s easy to see that √ √ O ( {β ∈ K : β[2, 1 + −3] ⊂ [2, 1 + −3]} = OK . √ √ √ √ Further, 2, 1+ −3 and 1− −3 are all irreducible in O, but 4 = 2·2 = (1+ −3)(1− −3) showing that unique factorization fails in O. In the next theorem we construct a class group C(O) for an order in a quadratic number field. As with the class group in Section 1.7, we take a quotient of a fractional ideal group by some principal fractional ideals, but in this context we must restrict our consideration to proper fractional ideals in O. Theorem 1.9.8. Given an order O in a quadratic number field, the set I(O) of proper fractional O-ideals forms a group under ideal multiplication. Moreover, the set P (O) of principal O-ideals is a subgroup of I(O) and hence the ideal class group C(O) = I(O)/P (O) is defined. Proof. Let a and b be proper fractional ideals of the order O. By Proposition 1.9.6, it is equivalent to consider invertible ideals. First note that O is clearly the identity in I(O). Since a is invertible, there is some fractional O-ideal which we will denote a−1, such that aa−1 = O. This shows that a−1 is also invertible and hence proper, so I(O) has inverses. Now consider the product (ab)c, where we set c = b−1a−1. Then (ab)c = abb−1a−1 = aOa−1 = aa−1 = O so we see that ab is invertible and hence proper. This proves that I(O) is a group. Clearly P (O) is a subgroup of I(O) since every principal ideal is proper, and the product of principal ideals is again principal. C(O) = I(O)/P (O) is a quotient of abelian groups, so it is a group. This completes the proof of the theorem.
46 1.9 Orders 1 Algebraic Number Fields
In order to make our work on orders in quadratic fields more compatible with the rest of class field theory, it will be advantageous to translate O-ideals into the language of OK -ideals. Definition. Given an order O of conductor f, we say that a nonzero O-ideal a is prime to f if a + fO = O.
Lemma 1.9.9. Let O be an order of conductor f.
(1) An O-ideal a is prime to f ⇐⇒ N(a) is relatively prime to f.
(2) Every O-ideal that is prime to f is proper.
Proof. (1) Define the map ϕf : O/a → O/a to be multiplication by f. Note that
a + fO = O ⇐⇒ ϕf is surjective
⇐⇒ ϕf is an isomorphism ⇐⇒ f and |O/a| are relatively prime
where the last equivalence comes from the fundamental theorem of finite abelian groups. Then by definition of numerical norm, |O/a| = N(a) so (1) is proved. (2) Suppose a is prime to the conductor. Let β ∈ K and suppose βa ⊂ a. Then
βO = β(a + fO) = βa + βfO ⊂ a + fOK .
But fOK ⊂ O so βO ⊂ O which proves β ∈ O. Hence a is proper. Note that since norm is multiplicative, (1) can be used to show that the set of O-ideals prime to the conductor forms a subgroup I(O, f) ≤ I(O). Moreover, the set
P (O, f) = {αO | α ∈ O, (N(α), f) = 1}
is a subgroup of I(O, f). The next proposition describes the class group C(O) in terms of O-ideals prime to the conductor.
Proposition 1.9.10. I(O, f)/P (O, f) ∼= I(O)/P (O) = C(O). Proof. A result in Chapter 3 will imply that every ideal class in C(O) contains a proper O-ideal whose norm is prime to a fixed M ∈ Z. Thus the map I(O, f) → C(O) is surjective with kernel I(O, f) ∩ P (O), so it suffices to show P (O, f) = I(O, f) ∩ P (O). On one hand, P (O, f) ⊂ I(O, f) ∩ P (O) is clear from the definitions of these subgroups. On the other hand, every element of I(O, f) ∩ P (O) is a fractional ideal of the form αO = ab−1, where α ∈ K and a, b are O-ideals prime to f. Let m = N(b). Then mO = bb¯ ∈ P (O, f) and mb−1 = b¯ which implies
mαO = mab−1 = a(mb−1) = ab¯ ⊂ O.
So mαO ∈ P (O, f). It follows that αO = (mαO)(mO)−1 ∈ P (O, f) and hence the kernel is equal to P (O, f).
47 1.9 Orders 1 Algebraic Number Fields
Given any positive integer m, an OK -ideal a is prime to m provided that a+mOK = OK . By Lemma 1.9.9, this is equivalent to (N(a), m) = 1. This implies that for every ring of integers OK , inside the group of fractional OK -ideals we have a subgroup IK (m) ≤ IK . In Section 2.3 we will generalize this construction using class field theory, but for now we have Theorem 1.9.11. Let O be the order of conductor f in an imaginary quadratic field K.
(1) If a is an OK -ideal prime to f, then a ∩ O is an O-ideal prime to f and N(a ∩ O) = N(a), where the first norm is taken with respect to O and the second with respect to OK .
(2) If b is an O-ideal prime to f, then bOK is an OK -ideal prime to f with the same norm. ∼ (3) IK (f) = I(O, f).
Proof. (1) Let a be an OK -ideal prime to f. By the natural injection ν : O/(a∩O) ,→ OK /a, (N(a), f) = 1 implies (N(a ∩ O), f) = 1 as well. This shows a ∩ O is prime to f. As in Lemma 1.9.9, the map ϕf is an automorphism of OK /a, but fOK ⊂ O so the injection ν is also a surjection. Hence the norms are equal. (2) and (3) Let b be an O-ideal prime to f. Then
bOK + fOK = (b + fO)OK = OOK = OK
which shows that bOK is an OK -ideal prime to f. In a moment we will show the norms are equal, but first consider
bOK ∩ O = (bOK ∩ O)O
= (bOK ∩ O)(b + fO)
⊂ b + f(bOK ∩ O)
⊂ b + b(fOK ).
Since fOK ⊂ O this proves bOK ∩ O ⊂ b. The other containment, b ⊂ bOK ∩ O, is clear so we have bOK ∩ O = b. On the other hand, suppose a is an OK -ideal prime to f. Then
a = aO = a(a ∩ O + fO) ⊂ (a ∩ O)OK + fa,
but fa ⊂ fOK ⊂ O so fa ⊂ a ∩ O ⊂ (a ∩ O)OK and it follows that a ⊂ (a ∩ O)OK . Again the other inclusion is obvious, so we have (a ∩ O)OK = a. These two identities for O- and OK -ideals, along with (1), prove the equality of norms in (2). Furthermore we have established a bijection
IK (f) ←→ I(O, f) a 7−→ a ∩ O
bOK 7−→ b. To show this is an isomorphism, we must simply check that it is multiplicative: 0 0 (aa )OK = (aOK )(a OK ) and we have proven the theorem.
48 1.10 Units in a Number Field 1 Algebraic Number Fields
Using unique factorization of ideals in OK , we have Corollary 1.9.12. Every O-ideal prime to the conductor has a unique decomposition as a product of prime O-ideals which are prime to the conductor.
Finally we describe C(O) in terms of the maximal order.
Theorem 1.9.13. Let O be the conductor of order f in an imaginary quadratic field K and define PK,Z(f) of IK (f) by
PK,Z(f) = {αOK | α ∈ OK and α ≡ a mod fOK for some a ∈ Z, (a, f) = 1}. ∼ Then C(O) = IK (f)/PK,Z(f). Proof. We have proven that C(O) ∼= I(O, f)/P (O, f). In the proof of Theorem 1.9.11 ∼ we saw that I(O, f) = IK (f), so it suffices to show that the image of P (O, f) under this
isomorphism is PK,Z(f). To do so, we will prove that for α ∈ OK ,
α ≡ a mod fOK , a ∈ Z, (a, f) = 1 ⇐⇒ α ∈ O, (N(α), f) = 1.
( =⇒ ) Assume α ≡ a mod fOK where a ∈ Z is relatively prime to f. By definition of the numerical norm in a quadratic field, N(α) ≡ a2 (mod f) which implies (N(α), f) = 2 (a , f) = 1. Since fOK ⊂ O we see that α ∈ O. ( ⇒ = ) Conversely, suppose α ∈ O = [1, fwK ] with (N(α), f) = 1. We may write 2 α = a + bfwK for a, b ∈ Z, so α ≡ a mod fOK . Since (N(α), f) = 1, N(α) ≡ a (mod f) again implies (a, f) = 1. This proves the stated equivalence. Now by definition P (O, f) is generated by ideals αO, where α ∈ O and (N(α), f) = 1. =∼ Thus we see that the image of P (O, f) under the isomorphism I(O, f) −→ IK (f) is generated by the corresponding ideals αOK . By the equivalence proven above, this proves the image
is precisely PK,Z(f).
We are by no means finished working with orders. In Section 2.11 we will realize PK,Z(f) as a congruence subgroup for the conductor, and show that there is a corresponding field ∼ extension L/K with the special property that Gal(L/K) = IK (f)/PK,Z(f). This will allow us to provide a full solution to the question of when a prime is of the form p = x2 + ny2, which we have only answered partially as of Section 1.8.
1.10 Units in a Number Field
In this section we further describe the structure of OK by characterizing the group of units UK , which is the group of all elements in OK which have a multiplicative inverse. In particular we will prove Dirichlet’s unit theorem, which is the main structure theorem for the group of units in a number field. At the end of the section we will discuss a nice application of Dirichlet’s unit theorem to Pell’s equation x2 − dy2 = 1. × ∼ t By the theory of finitely generated abelian groups, OK = Z × T where t is the rank of × × × OK and T is the torsion subgroup of OK . Henceforth we will write OK = UK and T = µ(K), which is equivalently the set of roots of unity which lie in OK .
49 1.10 Units in a Number Field 1 Algebraic Number Fields
Definition. A set of units u1, . . . , ut is called a fundamental system of units if it forms m1 mt a basis for UK modulo torsion, i.e. if every unit u ∈ UK can be written u = ζu1 ··· ut for ζ ∈ µ(K) and mi ∈ Z. Proposition 1.10.1. The torsion subgroup µ(K) is finite and cyclic.
Proof. Recall that if ζ is a primitive mth root of unity then Q(ζ) is a Galois extension of Q ∼ × with Gal(Q(ζ)/Q) = (Z/mZ) . Moreover, [Q(ζ): Q] = | Gal(Q(ζ)/Q)| = φ(m) where φ is Euler’s function, defined to be the number of positive integers less than m that are relatively prime to m – we will provide a proof of this well-known fact using the Frobenius density r1 rs theorem in Section 2.5. There is a product formula for φ(m): if m = p1 ··· ps is the prime Q ri−1 factorization of m, then φ(m) = pi (pi − 1). Now note that for any number field K,
ζ ∈ µ(K) =⇒ ζ ∈ K =⇒ K(ζ) ⊂ K =⇒ φ(m) | [K : Q].
Since [K : Q] is finite, there are only a finite number of choices for ζ, the primitive root, and each one has finite order. It follows easily that µ(K) is cyclically generated by ζ.
Using the field norm N = NK/Q, we have the following characterization of the units of a number field K.
Proposition 1.10.2. An element α ∈ K is a unit if and only if α ∈ OK and NK/Q(α) = ±1.
Proof. ( =⇒ ) If α is a unit, then there exists a β ∈ OK such that αβ = 1. Then N(α),N(β) ∈ Z and since norm is multiplicative, 1 = N(αβ) = N(α)N(β). Therefore N(α) = ±1. ( ⇒ = ) Fix an embedding σ0 : K,→ C. By properties of the field norm, Y Y N(α) = σ(α) = α σ(α).
σ:K,→C σ6=σ0
Y −1 Let β = σ(α). Then α ∈ OK implies β ∈ OK as well. If N(α) = ±1, then β = ±α so
σ6=σ0 α has an inverse ±β in OK and is therefore a unit.
For all real fields K, UK = {±1}. This turns out to be the unit group for many nonreal fields as well. √ Example 1.10.3. Let K = Q( d), a quadratic field. Recall that √ ( [ d] n 6≡ 1 (mod 4) Z √ OK = h 1+ d i Z 2 n ≡ 1 (mod 4).
In each case, the units in OK are the solutions to one of
x2 − dy2 = ±1 (2x + y)2 − dy2 = ±4.
50 1.10 Units in a Number Field 1 Algebraic Number Fields
When d < 0 (i.e. K is imaginary quadratic), these equations only have finitely many solutions, so UK = µ(K). In fact, ζm ∈ K if and only if φ(m) ≤ 2 which only happens for m dividing 4 or 6. Thus µ(K) = {±1} except for the following cases:
Q(i): µ(K) = {±1, ±i} √ √ 2 1+ −3 Q( −3) : µ(K) = {±1, ±ρ, ±ρ } where ρ = 2 . For d > 0, there are infinitely many solutions. The equation x2 − dy2 = ±1 is known as Pell’s equation; we will describe this case further after proving the unit theorem. Proposition 1.10.4. For any m, M ∈ Z, the set of all algebraic integers α such that the degree of α is ≤ m and |α0| ≤ M for all conjugates α0 of α is finite. Proof. If the degree of α is bounded by m then α is equivalently the root of a monic irreducible polynomial with degree at most m. On the other hand, |α0| ≤ M for all conjugates α0 of α implies the coefficients of the polynomial are all bounded. Since the degree and coefficients of such a polynomial are all integers, and in this case they are all bounded, there are only a finite number of polynomials satisfying the requirements, each one having only finitely many roots. Hence the set of these α described above is finite. Corollary 1.10.5. An algebraic integer α is a root of unity ⇐⇒ |α0| = 1 for all conjugates α0 of α. Proof. Apply the proposition to the set {1, α, α2,...} to see that it is finite, and hence αn = 1 for some n ∈ N. Example 1.10.6. Note that the condition that α be an algebraic integer is necessary. For 3+4i 0 3−4i 0 instance, α = 5 has only one conjugate α = 5 . Both α and α have modulus 1 (in the complex plane) but the set {1, α, α2,...} is not finite. As in Section 1.7, consider the map
r s σ : K −→ R × C α 7−→ (σ1(α), . . . , σr(α), σr+1(α), σ¯r+1(α),..., σ¯r+s(α)) where {σ1, . . . , σr, σr+1, σ¯r+1,..., σ¯r+s} is the set of all embeddings of K into C. This map takes sums to sums, but to describe a multiplicative group such as UK we want to map products to sums instead. The solution is to construct a map using logarithms:
r+s L : UK −→ R α 7−→ log |σ1(α)|,..., log |σr(α)|, log |σr+1(α)|,..., log |σr+s(α)| .
If u is a unit in OK , then by Proposition 1.10.2, N(u) = ±1 and so 2 2 |σ1(u)| · · · |σr(u)| |σr+1(u)| · · · |σr+s(u)| = 1. Taking the log of both sides shows that the image L(u) lands in the hyperplane
H : x1 + ... + xr + 2xr+1 + ... + 2xr+s = 0. This is a linear system with one degree of dependence, so we see that H is isomorphic to Rr+s−1. This suggests the main result in this section, Dirichlet’s unit theorem.
51 1.10 Units in a Number Field 1 Algebraic Number Fields
Theorem 1.10.7 (Dirichlet’s Unit Theorem). Let K be a number field with ring of integers ∼ r+s−1 OK . Then UK = Z × µ(K) where r is the number of real embeddings of K and s is the number of pairs of complex embeddings.
r+s−1 We will prove this theorem by establishing that L(UK ) is a lattice in R with full rank. First we have
Lemma 1.10.8. The image of L : UK → H is a lattice in H and the kernel of L is µ(K). Proof. Let C be a bounded subset of H, say
C = {(xi) ∈ H : |xi| ≤ M for all i}.
M If L(u) ∈ C for a unit u ∈ UK then |σj(u)| ≤ e for each embedding σj. By Proposi- tion 1.10.4, this implies there are only a finite number of such u, and thus L(UK ) ∩ C is finite. This implies L(UK ) is a lattice in H. Further, u ∈ ker L if and only if |σi(u)| = 1 for all i and thus Corollary 1.10.5 says that ker L = µ(K). We will also need
Lemma 1.10.9. Suppose A = (aij) is an m × m matrix such that aij < 0 for all i 6= j
ai1 + ai2 + ... + aim > 0 for each i. Then A is invertible.
Proof. If A is not invertible, then the system of equations
m X aijxj = 0, i = 1, . . . , m j=1
has a nontrivial solutionx ¯ = (x1, . . . , xm). Suppose xk is a component such that |xk| = max{|xj|}. We may scalex ¯ so that xk = 1. Then |xj| ≤ 1 for each j 6= k which implies
m X X X 0 = akjxj = akk + akjxj ≥ akk + akj > 0, j=1 j6=k j6=k
a contradiction. Thus A must be invertible. We now proceed to the proof of the Unit Theorem (1.10.7).
Proof. We will show that L(UK ) is a full lattice in H, which, since ker L is finite, will imply r s that UK has rank r + s − 1. Again we will make use of the map σ : K,→ R × C . Set V = Rr × Cs. For eachx ¯ ∈ V , define
N(¯x) = x1 ··· xrxr+1x¯r+1 ··· x¯r+s.
52 1.10 Units in a Number Field 1 Algebraic Number Fields
2 2 Then N(σ(α)) = NK/Q(α). Also note that |NK/Q(α)| = |x1 · · · |xr| |xr+1| · · · |xr+s| . Recall from Proposition 1.7.9 that σ(OK ) is a full lattice in V and the volume of its fundamental −sp parallelopiped is 2 |dK |. Equivalently, if {α1, . . . , αn} is a Z-basis for OK then we showed that the absolute value of the determinant of the matrix whose ith row is σ(αi) = σ1(αi),..., re(σr+1(αi)), im(σr+1(αi)),..., im(σr+s(αi))
−sp 1 is equal to 2 |dK |. For the remainder of the proof, letx ¯ ∈ V with 2 ≤ N(¯x) ≤ 1. Define an action of V on σ(OK ) byx ¯·σ(OK ) = {x¯·σ(α) | α ∈ OK }, where u·v is the multiplication operation of V as a ring. Thenx ¯ · σ(OK ) is again a lattice in V and the volume of its fundamental parallelopiped is the determinant of the matrix with ith row
(x1 · σ1(αi),..., re(xr+1 · σr+1(αi)), im(xr+1 · σr+1(αi)),...) −sp 1 which equals 2 |dK | |N(¯x)|. Observe that asx ¯ ranges over the set of points with 2 ≤ N(¯x) ≤ 1 these volumes remain bounded. Let T be a compact subset of V such that T is symmetric about the origin and convex, and large enough in volume so that by Minkowski’s Theorem (1.7.8) T contains a pointx ¯ · σ(γ) for some nonzero γ ∈ OK . The points in T have bounded coordinates (by compactness and the Heine-Borel Theorem) and thus bounded norms; say M is a bound on their norms. Then sincex ¯ · σ(γ) ∈ T , |N(¯x · σ(γ))| ≤ M and hence M |N (γ)| ≤ ≤ 2M. K/Q N(¯x)
Consider the set of ideals γOK where γ runs through the chosen algebraic integers above for eachx ¯, i.e. those γ such thatx ¯ · σ(γ) ∈ T . The norm of these ideals is bounded by 2M, so there are only finitely many: {γ1OK , . . . , γtOK }. If γ is any nonzero algebraic integer in OK such thatx ¯ · σ(γ) ∈ T then γOK = γiOK for some 1 ≤ i ≤ t, and thus −1 for some unit u we have γ = γiu. This shows thatx ¯ · σ(u) ∈ σ(γi )T . Note that the set 0 St −1 T = i=1 σ(γi )T is bounded and does not depend onx ¯. We have therefore shown that 1 for eachx ¯ with 2 ≤ N(¯x) ≤ 1 there exists a unit u for which the coordinates ofx ¯ · σ(u) are bounded uniformly. To prove L(UK ) is a full lattice in H, we may assume r + s − 1 ≥ 1, since otherwise the 1 proof is trivial. For each i between 1 and r + s, we may choose anx ¯ ∈ V with 2 ≤ N(¯x) ≤ 1 0 such that all the coordinates xj for j 6= i are large compared to ally ¯ ∈ T , and xi is small enough so that |N(¯x)| = 1. For each i = 1, . . . , r + s − 1, we have shown that there exists a unit ui such that the coordinates ofx ¯ · σ(ui) are all bounded, and so for all j 6= i,
|σj(ui)| < 1 =⇒ log |σj(ui)| < 0.
We claim that L(u1),...,L(ur+s−1) are linearly independent vectors in L(UK ), which we prove by showing that the matrix with ith row log |σ1(ui)|,..., log |σr+s−1(ui)|
is invertible. The non-diagonal entries of the matrix are all negative, but ui ∈ ker L and so
log |σ1(ui)| + ... + log |σr+s(ui)| = 0
=⇒ log |σ1(ui)| + ... + log |σr+s−1(ui)| = − log |σr+s(ui)| > 0.
53 1.10 Units in a Number Field 1 Algebraic Number Fields
In other words the sum of the entries across each row are positive. Then Lemma 1.10.9 implies that our matrix is invertible. Thus rank L(UK ) ≥ r + s − 1, but H has rank r + s − 1. It follows that L(UK ) is a full lattice in H, completing the proof of the Unit Theorem. We now apply the Unit Theorem to Pell’s equation.
Example 1.10.10. The famous problem of Pell is to find all positive√ integer solutions 2 2 (x, y) to x√ − dy = 1, where d > 0 is squarefree. Let K = Q( d). Note that for any α = x + y d ∈ K, √ √ N(α) = (x + y d)(x − y d) = x2 − dy2.
Thus the solutions to Pell’s equation form a finite-index subgroup of UK , the group of units. By Dirichlet’s Unit Theorem, the solution set is an infinite abelian group of rank 1, a fact that is exponentially harder to show using just elementary number theory.
Definition. Let t = r + s − 1 and let u1, . . . , ut be a fundamental system of units for UK . The regulator of K is the determinant of the matrix whose ith row is L(ui) = log |σ1(ui)|,..., 2 log |σt(ui)| .
In other words, the regulator is the signed volume of the fundamental parallelopiped for L(UK ). We will encounter the regulator again in Section 2.4. We next work out an example with cubic fields of negative discriminant, combining techniques from the last few sections to fully describe the class group of such a field. First s note that since the sign of dK is (−1) , which implies in this case that r = s = 1, the unit group consists of all elements of the form ±εm for some ε, called a fundamental unit of K.
Lemma 1.10.11. Let K be a cubic number field with dK < 0 and let ε be the fundamental 3 unit in UK with ε > 1. Then |dK | < 4ε + 24.
Proof. Since ε 6∈ Q we must have K = Q(ε). The two other conjugates must be complex conjugates, and the product of all three must be +1. Write ε = u2 for u ∈ R, u > 1. Then the other conjugates of ε can be written as
u−1eiθ and u−1e−iθ for some 0 ≤ θ ≤ π.
Let D = D(1, ε, ε2) be the discriminant of the minimal polynomial for ε. Then √ D = (u2 − u−1eiθ)(u2 − u−1e−iθ)(u−1eiθ − u−1e−iθ) = 2i(u3 + u−3 − 2 cos θ) sin θ.
If we set 2ξ = u3 + u−3 then p|D| = 4(ξ − cos θ) sin θ. For a given u, this equation has a maximum where its derivative is 0:
ξ cos θ − cos2 θ + sin2 θ = 0.
Set g(x) = −ξx + 2x2 − 1. We are thus seeking a root of g with |x| < 1. Note that since u3+u−3 1 3 −6 u > 1 and ξ = 2 , g(1) = 1 − ξ < 0 and g − 2u3 = 4 (u − 1) < 0. Then it appears 1 that g(x) has one root greater than 1, and that the desired root is less than 2u3 .
54 1.10 Units in a Number Field 1 Algebraic Number Fields
If x0 is this root, consider 1 x2 > =⇒ u−6 − 4x2 < 0 =⇒ u−6 − 4x−2 − 4x−4 < 0. 0 4u6 0 0 0
2 2 2 This yields |D| ≤ 16(ξ − 2ξx0 + x0)(1 − x0). Also note that by the above, we may write
2 2 2 4 2 ξx0 = 2x0 − 1 =⇒ ξ x0 = 4x0 − 4x0 + 1.
Then
2 2 4 |D| ≤ 16(ξ + 1 − x0 − x0) 6 −6 2 4 = 4u + 24 + 4(u − 4x0 − 4x0) < 4u6 + 24 = 4e3 + 24.
2 Finally since D = dK · m for some m ∈ Z, we have proven the lemma. Let’s apply this to a couple examples.
Example 1.10.12. Let K = Q(α) where α is a real root of f(x) = x3 + 10x + 1. One may calculate dK = −4027 so by Lemma 1.10.11
r4027 − 24 ε > 3 > 10 4 where ε is the fundamental unit in UK with ε > 1. Note that N(α) = −1 so by Proposi- tion 1.10.2, α is a unit. Explicitly, α = −0.099903 ... and −α−1 = 10.00993 ... which means −1 m we must have ε = −α and UK = {±α | m ∈ Z}. Once we know ε it’s easier to compute the class group. It turns out that p = (2, 1 + α) 6 (α−1)3 generates the class group, and it’s easy to check that p is generated by α+2 so it suffices to show that p2 and p3 are not principal. 3 2 m (α−1)3 First suppose p = (γ) for some γ ∈ OK . Then γ = ±α α+2 for some m ∈ Z. This implies that at least one of the numbers below is a square: α − 1 α − 1 α − 1 α − 1 − α − α . α + 2 α + 2 α + 2 α + 2
Let β be the one that’s a square. If β ∈ OK /q for some prime ideal q, then we should find that β is still a square mod q. First let q = (29, α − 2). We have
x3 + 10x + 1 ≡ (x + 5)(x − 3)(x − 2) mod 29.
The residue field is OK /q = F29 and under the evaluation homomorphism Z[α] → F29, α 7→ 2 (mod 29), we see that α − 1 7→ 1 (α + 2)−1 7→ 22 α + 2 7→ 4 − 1 7→ −1.
55 1.10 Units in a Number Field 1 Algebraic Number Fields
α−1 Now 1, 4 and −1 are all squares mod 29, but 22 is not; hence m must be 0. Since α+2 < 0 it α−1 can’t be a square (in fact it’s non-real) so the only possibility is β = − α+2 . However, if we look at r = (7, α + 3) and the residue field OK /r = F7, under the map Z[α] → F7 we have α 7→ −3 ≡ 4 (mod 7) α − 1 3 1 − 7−→ − = − ≡ −4 ≡ 3 (mod 7). α + 2 6 2 Then 3 is not a square mod 7, so we have eliminated all choices for β and shown that p3 in fact cannot be principal. By a similar argument, p2 is not principal. After establishing this, it follows that C(OK ) = Z/6Z. √ 3 Example 1.10.13. Let K = Q(θ) where θ = 11. Then Z[θ] ⊆ OK – in fact Z[θ] is the whole ring of integers but we won’t need that here. We can compute the discriminant to be
D = D(1, θ, θ2) = −33113 = −3267.
Then dK | D so we will use D in the Minkowski bound: 3! 4 √ B = 3267 ≈ 16.17. K 27 π
Thus C(OK ) is generated by the ideal classes with representatives p such that N(p) < 17; then it suffices to consider the primes lying over p = 2, 3, 5, 7, 11 and 13. Using the techniques from Section 1.3, we see that 3 2 0 0 x − 11 ≡ (x − 1)(x + x + 1) mod 2 so 2OK = p2p2 with N(p2) = 2 and N(p2) = 4. 3 2 0 x − 11 ≡ (x − 1)(x + x + 1) mod 5 as well, so 5OK = p5p5 with N(p5) = 5 and 0 N(p5) = 25.
3 x − 11 is irreducible mod 7, so 7OK = p7 is prime and N(p7) = 343.
3 x − 11 is also irreducible mod 13, so 13OK = p13 is prime as well and N(p13) = 2197. 3 11 is ramified since it divides the discriminant. Then N(θ) = 11 so 11OK = p11, where p11 = θOK .
3 is also ramified so 3OK = p3, prime.
Note that for any k ∈ Z, θ + k has minimal polynomial (x − k)3 − 11 and so N(θ + k) = k3 + 11. This fact will be useful in several calculations below. In particular, N(θ − 2) = 3 3 so p3 = (θ − 2)OK is prime. It follows that 3OK = p3. We can immediately throw out p3, p7, p11 and p13 since they are all principal. Further, 0 0 p2p2 and p5p5 are each principal, so C(OK ) is generated by p2 and p5. Also, by the fact above N(θ − 1) = 10 = 2 · 5 so (θ − 1)OK is the product of primes with norm 2 and 5. This must be p2 and p5 so we conclude that p2 is the sole generator of C(OK ). m To use the power of the Unit Theorem (1.10.7), note that r = s = 1 and so UK = {±u } for a fundamental unit u. It turns out that u = 89 + 40θ + 18θ2 (see [15]). Now suppose
56 1.10 Units in a Number Field 1 Algebraic Number Fields
2 3 p2 = αOK for some α ∈ OK . By a similar trick as above, N(θ +k) = k +121 for any k ∈ Z, 2 2 2 0 and so N(θ − 5) = −4, showing N((θ − 5)OK ) = 4. It turns out that (θ − 5)OK 6= p2 2 2 2 2 2 (again see [15]), so we must have (θ − 5)OK = p2. Then p2 = α OK = (θ − 5)OK which 2 2 means α = (θ − 5)w for some unit w ∈ UK . For any prime ideal p, it must be that
±ud(θ2 − 5) ≡ β mod p
where β is a square mod p, the sign is fixed and d = 0, 1 (since w = ±um). First consider p3 = (θ − 2)OK . The map OK → OK /p3 is given by θ 7→ 2. Then
β ≡ ±(89 + 40(2) + 18(4))d(4 − 5) ≡ ±(1)d(−1) (mod 3).
Since −1 is not a square mod 3, the sign must be negative. Next, the trick allows us to calculate N(θ + 9) = 740 = 22 · 5 · 37 ∼ so (θ +9)OK is divisible by a prime p37 with norm 37 and residue degree 1. In OK /p37 = F37, we map θ 7→ −9 and compute
β ≡ −(89 − 40(9) + 18(81))d(81 − 5) ≡ −(3)d(2) (mod 37).
However, note that
3 −1 2 = 1 = 1 and = −1 37 37 37
β 3 d −1 2 so = = −1 which shows β is not a square mod 37. Hence p is 37 37 37 37 2 not principal, and we have proven that h(K) = |C(OK )| = 2.
57 2 Class Field Theory
2 Class Field Theory
In this chapter we develop the main concepts and theorems in class field theory, including valuations on a field, the Artin map, ray class groups, Dirichlet L-series, Artin reciprocity, the Conductor and Existence Theorems and two density theorems. At the end of the section, 2 2 we prove the main characterization√ of primes of the form x + ny by applying class field theory to the ring class field of Z[ −n]. For the sake of reaching this goal efficiently, some of the proofs of the main CFT theorems are condensed, but references are always provided for full discussions of these topics.
2.1 Valuations and Completions
Definition. A function |·| : K → R on a field K is called an absolute value, or valuation, if it satisfies (1) |x| ≥ 0 for all x ∈ K and |x| = 0 ⇐⇒ x = 0.
(2) |xy| = |x| |y| for all x, y ∈ K.
(3) |x + y| ≤ |x| + |y|. This is called the triangle inequality. There is an additional axiom that defines an important type of valuation. Definition. An absolute value on K is nonarchimedean if |x + y| ≤ max{|x|, |y|} for all x, y ∈ K. Otherwise, | · | is said to be archimedean. By (1) and (2), | · | is a multiplicative homomorphism from K× to the positive reals. × Moreover, since R>0 is torsion-free, | · | maps K to 1. This implies the following properties. (a) | − 1| = 1.
(b) | − x| = |x| for all x ∈ K. Examples.
1 For any number field K, let σ : K,→ C be any complex embedding. Then |x| := |σ(x)| is a valuation, where the second set of absolute values denotes the usual absolute value on C.
2 Let ord : K× → Z be a discrete (additive) valuation. Then for any real number c > 1, the following gives us a nonarchimedean absolute value on K: ( c− ord(x) if x 6= 0 |x| = 0 if x = 0.
3 The most important example of the valuation in 2 is the p-adic valuation for any − ordp(a) −v prime number p. It is a map | · |p : Q → R defined by |a|p = p = p where a = pvm such that p does not divide the numerator of m.
58 2.1 Valuations and Completions 2 Class Field Theory
4 This can be generalized to extensions of Q in the following way. For any prime ideal p in the ring of integers of a number field K, we define the normalized p-adic valuation
| · | : K −→ R − ordp(a) a 7−→ |a|p := N(a)
where a = aOK , N(a) is the ideal norm from Section 1.5 and ordp(a) is the largest integer n such that pn divides a.
5 On any field, the trivial absolute value is |x| = 1 for all x 6= 0. Note that the only absolute value on a finite field F is the trivial absolute value since every nonzero element of F is a root of 1F . Remark. The condition |x + y| ≤ max{|x|, |y|}, called the nonarchimedean condition, is equivalent to
X xi ≤ max{|xi|}. We also have the following characterization.
Proposition 2.1.1. An absolute value | · | is nonarchimedean if and only if it is bounded on the set {m · 1K | m ∈ Z}. Proof. ( =⇒ ) If | · | is nonarchimedean, then for any m > 0,
|m · 1| = |1 + 1 + ... + 1| ≤ |1| = 1.
By property (a) following the definition of absolute values, | − m · 1| = |m · 1| so the values are bounded for all m ∈ Z. ( ⇒ = ) Conversely, suppose there is some N such that |m · 1| ≤ N for all m ∈ Z. Then n n n X n r n−r X n r n−r |x + y| = x y ≤ |x| |y| r r r=0 r=0
by the triangle inequality for sums. Clearly |x|r|y|n−r ≤ max{|x|n, |y|n} = max{|x|, |y|}n n and r is an integer, so we see that
|x + y|n ≤ N(1 + n) max{|x|, |y|}n =⇒ |x + y| ≤ N 1/n(1 + n)1/n max{|x|, |y|}.
As n → ∞, N 1/n(1 + n)1/n tends to 1 by limit laws, so we have |x + y| ≤ max{|x|, |y|}. Hence | · | is nonarchimedean.
Corollary 2.1.2. If K is a field of characteristic p 6= 0, then every absolute value on K is nonarchimedean.
Proof. If char K 6= 0 then the set {m · 1K | m ∈ Z} is finite and therefore bounded under | · |. Apply Proposition 2.1.1.
59 2.1 Valuations and Completions 2 Class Field Theory
In example 2 , we saw that an additive map ord on K induces an absolute value |x| = c− ord(x), where c is any positive real number. Taking logs, we have
logc |x| = − ord(x), or ord(x) = − logc |x| which suggests the following connection between additive and multiplicative valuations. Definition. An absolute value | · | on K is said to be discrete if |K×|, the image of the units of K, is a discrete subgroup of R>0. Proposition 2.1.3. For any nonarchimedean absolute value | · | on K, define v : K× → R by v(x) = − log |x| (with v(0) defined to be 0). Then (i) v(xy) = v(x) + v(y),
(ii) v(x + y) ≥ min{v(x), v(y)}.
Furthermore, if the image v(K×) is discrete in R, then v is a multiple of some discrete valuation ord : K×→→ Z ⊂ R. Proof. Obvious. See [19] for the details. We next define several important objects which arise from a nonarchimedean valuation on a field. Definition. For any nonarchimedean valuation | · | on K, define
A = {a ∈ K : |a| ≤ 1} U = {a ∈ K : |a| = 1} m = {a ∈ K : |a| < 1}.
Proposition 2.1.4. Let | · | be a nonarchimedean valuation on K. Then A is a local subring of K with U as its group of units and m as its unique maximal ideal. Furthermore, | · | is discrete if and only if m is principal, in which case A is a DVR. Proof. It follows easily from the properties of a nonarchimedean valuation that A is a ring. By property (a) following the definition of valuations, it is easy to see that U is the group of units of A. By the nonarchimedean condition, m is an ideal of A. To see that it is the unique maximal ideal, let y ∈ A r m. Then |y| = 1 by definition, and |y−1| = 1 so y−1 ∈ A as well. Thus every element in A outside of m is a unit, which implies that m is the unique maximal ideal of A. Turning to the last statement, if | · | is discrete then v(|x|−1) = −v|x| implies that one of v(|A|), −v(|A|) contains all the positive integers. We may assume 1 ∈ v(|A|). Let π ∈ A such that v|π| = 1. For any x ∈ A, v|x| = n ∈ Z+ and so v|xπ−n| = 0. However, v : K× → Z is an isomorphism so |xπ−n| = 1. Thus u = xπ−n is a unit, and x = uπn. We have proven that every nonzero x ∈ A can be written as a unit times a power of π, so the only ideals in A are powers of πA. Hence m must be principal, which of course implies A is a DVR. The other direction follows from showing that v : K× → Z is again an isomorphism and using the fact that Z is a discrete subgroup of R.
60 2.1 Valuations and Completions 2 Class Field Theory
Definition. The ring A defined above is called the valuation ring for | · |, and m is called its valuation ideal.
The last statement in Proposition 2.1.4 says that | · | is discrete exactly when A is a discrete valuation ring, which explains where this term comes from. Every absolute value defines a metric on K given by d(x, y) = |x − y| and hence induces a metric topology on K. For each x ∈ K, the sets
B(x, ε) = {y ∈ K : 0 < |x − y| < ε}
form a basis of neighborhoods around x.
Example 2.1.5. The p-adics are a perfect case study of absolute values and their related groups, rings and topologies. In the p-adic topology induced on Q by | · |p, two rationals a and b are considered “close” if their difference is divisible by a high power of p. For n 2 example, the sequence xn = p = (1, p, p ,...) converges to 0 quite rapidly in the p-adic P∞ n topology. Even the series n=1 p converges in this topology. Gouvˆea[12] provides a great introductory-level examination of the p-adics.
Definition. Two absolute values | · |1 and | · |2 on a field K are said to be equivalent if they induce the same topology on K.
Proposition 2.1.6. For two absolute values | · |1, | · |2 : K → R, the following are equivalent:
(1) | · |1 and | · |2 are equivalent.
(2) |x|1 < 1 ⇐⇒ |x|2 < 1 for all x ∈ K.
c (3) There exists some c > 0 such that |x|1 = |x|2 for all x. Proof. This is a standard proof in the study of absolute values. See [19] or any topology book for details. Nonarchimedean absolute values cause the metric topology on K to have some strange properties. For example,
If y ∈ B(x, ε) then B(y, ε) = B(x, ε).
The open ball B(x, ε) is also closed.
Under this topology, K is totally disconnected.
The next theorem characterizes all possible (equivalence classes of) absolute values on Q. We will write | · |∞ for the usual absolute value on R.
Theorem 2.1.7 (Ostrowski). Let | · | be a nontrivial absolute value on Q.
(1) If | · | is archimedean, it is equivalent to | · |∞.
(2) If | · | is nonarchimedean, it is equivalent to | · |p for exactly one prime p.
61 2.1 Valuations and Completions 2 Class Field Theory
r Proof. Let m, n ∈ Z. Then we can write m = a0 + a1n + ... + arn where ai ∈ Z, 0 ≤ ai < n and nr ≤ m. Let N = max{1, |n|}. By the triangle inequality,
r r X i X r |m| ≤ |ai| |n| ≤ |ai| N . i=0 i=0
log m Note that r ≤ log n and |ai| ≤ |1 + ... + 1| = ai|1| = ai ≤ n. So we have r log m log m |m| ≤ (1 + r)nN ≤ 1 + nN log n . log n
If we replace m with mt for an integer t, this gives us
1/t t log m 1/t log m |m| ≤ 1 + n N log n . log n
log m Then as t → ∞, this becomes |m| ≤ N log n . For the first case, suppose that |n| > 1 for all integers n > 1. Here N = |n|, so we see that |m|1/ log m ≤ |n|1/ log n. By symmetry, these must be equal so there exists a number c > 1 such that c = |m|1/ log m for all integers m. Hence
|m| = clog m = elog c log m = mlog c
log c for all m > 1. This shows that |m| = |m|∞ for all m ∈ Z, m > 1. Since | · | and | · |∞ are equivalent on a set of generators for Q (the integers), they must be equivalent absolute values. Now suppose there exists some integer n > 1 such that |n| ≤ 1. In this case N = 1 and the inequality proved in the first part of the proof implies |m| ≤ 1 for all m ∈ Z. By Proposition 2.1.1, | · | is nonarchimedean. Let A and m be its valuation ring and ideal, respectively. By definition, Z ⊂ A and so m ∩ Z is a prime ideal in Z since it is nonzero. Thus m ∩ Z = (p) for some prime p. This implies that |m| = 1 if p - m, so for all rationals q such that p does not divide its numerator or denominator, |qpr| = |p|r. Let c be a number −c c such that |p| = p . Then |x| = |x|p for all x ∈ Q and we have shown that | · | is equivalent to the p-adic valuation | · |p. Definition. For a number field K, an equivalence class of valuations on K is called a place or prime of K.
By Theorem 2.1.7, the places on Q are in one-to-one correspondence with the prime integers – it is for this reason that places are also called primes – with the exception of ∞. To rectify this, we refer to ∞ as the infinite prime, which thus corresponds to the equivalence class of the archimedean absolute values on Q. This motivates our use of the term infinite prime in Section 1.8.
Definition. For each place p of Q, let p be its prime integer representative. There is a 1 valuation | · |p in p that satisfies |p|p = p , called the normalized absolute value of p. By convention, the normalized absolute value of the infinite place is taken to be | · |∞.
62 2.1 Valuations and Completions 2 Class Field Theory
The next theorem shows an important relation between the values of any x ∈ Q under the normalized absolute values of all primes on Q.
Theorem 2.1.8 (The Product Formula). For each prime p = 2, 3, 5,..., ∞, let | · |p be the corresponding normalized absolute value on Q. Then for all x ∈ Q, Y |x|p = 1. p
a Proof. Let x = b for a, b ∈ Z. Then |x|p = 1 except when p | a or p | b. (Importantly, this establishes that the product above really is finite.) The map
× × Q −→ R Y x 7−→ |x|p p is a homomorphism, so it suffices to show that the image of −1 and all primes q ∈ Z is 1. But | − 1| = 1 for any absolute value, and since q is prime we have q p = ∞ 1 |q|p = q p = q 1 p 6= q.
Therefore the product over all primes of Q is equal to 1. One of the first objectives in class field theory is to prove an analog of the product formula for finite extensions K/Q. This will require a description of how to extend an absolute value to a field extension of Q. Recall that a field K is said to be complete with respect to an absolute value | · | if every Cauchy sequence in K converges with respect to | · |. Of course not every field is complete: for example, many sequences in Q itself do not converge to rational numbers. For this reason we embed K into a complete field Kˆ , called the completion of K. The most common method is to define Kˆ to consist of equivalence classes of Cauchy sequences in K. This is a common construction in many first courses in analysis; the reader may consult [16] for the general case or [15] for the construction in the number field context. We highlight one important result. Theorem 2.1.9. The completion (K,ˆ | · |) of (K, | · |) is unique up to valuation-preserving isomorphism.
Example 2.1.10. For Q, the completion with respect to | · |∞ is isomorphic to R. For any ˆ prime p,(Q, | · |p) is called the p-adic numbers and is denoted Qp. The ring of integers in this completion is called the p-adic integers, denoted Zp. We briefly state, without proof, some results about nonarchimedean valuations in com- pletions. Our objective is to obtain a description of all possible completions of a number field K, so we will content ourselves with highlights from the lengthier discussions in [15] and [19]. The objects related to nonarchimedean absolute values extend nicely in a completion.
63 2.1 Valuations and Completions 2 Class Field Theory
Proposition 2.1.11. If | · | is a nonarchimedean absolute value on K whose valuation ring A is a DVR, then the valuation ring Aˆ of the completion (K,ˆ | · |) is also a DVR, and its maximal ideal has the same generator as m ⊂ A.
Proposition 2.1.12. Let Kˆ be a completion of a number field K with respect to | · |. Every element α ∈ Kˆ r {0} has a unique representation as a power series
r 2 α = π (s0 + s1π + s2π + ...)
where si ∈ A and π is a generator of m. Theorem 2.1.13. Let K be complete with respect to a nonarchimedean absolute value | · | and let L be a finite extension of K, with n = [L : K]. Then | · | extends uniquely to L by
1/n |x| = |NL/K (x)| .
The next result is a famous one in the study of absolute values.
Theorem 2.1.14 (Hensel’s Lemma). Suppose A is a commutative ring that is complete with respect to an ideal m. Let k denote the residue field for A, and for any f(x) ∈ A[x] denote its image in k[x] by f¯(x). Then for every monic polynomial f(x) ∈ A[x] such that ¯ f(x) = g0(x)h0(x) where g0 and h0 are monic and relatively prime in k[x], we can factor ¯ f(x) = g(x)h(x) where g¯ = g0 and h = h0.
p−1 Corollary 2.1.15. In Qp, x − 1 has p − 1 distinct roots. Let K be an algebraic number field, A a DVR such that K is its field of fractions, and p the maximal ideal of A. Let L be a finite extension of K and let B be a DVR which is the integral closure of A in L, with P the maximal ideal of B. Now complete both fields with respect to the p-adic valuation | · |p, which by Theorem 2.1.13 extends uniquely to | · |P on L. The next proposition completely describes the behavior of primes in the extension L/ˆ Kˆ in the language of Section 1.3.
Proposition 2.1.16. Let A,ˆ B,ˆ pˆ and Pˆ denote the DVRs and maximal ideals, respectively, of the extension L/ˆ Kˆ . Then
(a) pˆ = pAˆ and Pˆ = PBˆ.
(b) pˆBˆ = Pˆ e where e = e(P | p), the ramification index in L/K.
(c) Moreover, e(Pˆ | pˆ) = e(P | p) = e and f(Pˆ | pˆ) = f(P | p) = f.
(d) [Lˆ : Kˆ ] = ef.
Proof. It suffices to prove (a) and (b), since the main results in Section 1.3 will then imply that (c) and (d) are true. Proposition 2.1.11 directly implies (a), since pˆ has the same generator as p and Pˆ has the same generator as P. e1 eg To prove (b), let pB have prime factorization pB = P1 ··· Pg where P = P1 – this is possible since B is a DVR and hence Dedekind. Each Pi, i ≥ 2, contains elements of B
64 2.2 Frobenius Automorphisms and the Artin Map 2 Class Field Theory
outside of the maximal ideal P, and since Proposition 2.1.11 says that Bˆ is a DVR with PBˆ ˆ ˆ as its maximal ideal, this implies that PiB = B for i ≥ 2. Hence
ˆ ˆ ˆ e1 eg ˆ ˆ e1 ˆ e1 pˆB = (pA)B = (P1 ··· Pg )B = (P1B) = P .
For now, this concludes our description of nonarchimedean absolute values in extensions. We conclude the section by discussing how to extend archimedean absolute values. Theorem 2.1.17. If K is complete with respect to an archimedean absolute value | · |, then K is isomorphic to either R or C, and | · | is equivalent to the usual absolute value on these. Proof. Since | · | is archimedean, the values for |n| where n ∈ Z are unbounded. Thus K must have characteristic zero and therefore | · | restricts to an archimedean valuation on Q. By Theorem 2.1.7 (Ostrowski’s theorem), we may replace | · | with the usual (archimedean) absolute value on Q. K is complete, so the completion of Q with respect to | · | is contained in K. We know this completion is isomorphic to R, with | · | equivalent to the usual absolute value on R, so it remains to show that either K = R already or K = C (up to isomorphism). 2 ∼ Suppose K contains i, a root of x + 1 = 0. Then K = R(i) = C. If not, adjoin i to obtain a field K(i). Then | · | extends to K(i) by
|a + bi| = p|a|2 + |b|2
and K(i) is complete under this valuation. It is straightforward to check that this is equiv- alent to the usual absolute value on C. In any case we may at this point assume C ⊆ K or replace K with K(i). Janusz [15] shows that C 6= K (or K(i)) produces a contradiction, so we must have K = C or K(i) = C, proving the theorem.
Corollary 2.1.18. Let K be an algebraic number field and {σ1, . . . , σr, σr+1,..., σ¯r+s} be the set of embeddings of K into C. Then every archimedean absolute value on K is equivalent to exactly one of the form |x|i = |σi(x)|. From this description of absolute values on K, both archimedean and nonarchimedean, we have a generalized product formula for the places of K. Theorem 2.1.19 (Product Formula). Let K be an algebraic number field. For each prime p of K, finite or infinite, we may select a valuation | · |p in p such that for all nonzero x ∈ K, Y |x|p = 1. p
2.2 Frobenius Automorphisms and the Artin Map
Fix a Galois extension L of a number field K and let G be the Galois group of this extension. Recall from Section 1.8 that for an unramified prime P ⊂ OL, there is an automorphism q σ ∈ G called the Artin symbol such that σ(α) = α for all α ∈ OL/P, where q = |OK /p|
65 2.2 Frobenius Automorphisms and the Artin Map 2 Class Field Theory
L/K if p = P ∩ O . Cox [7] denotes the Artin symbol by since it is used to define the K P L/K Artin map : I → G in the abelian case. On the other hand, Janusz [15] and many · K other authors refer to this element as the Frobenius automorphism, denoted FrobL/K (P). We will use these names and notations interchangeably, since each has its uses in particular contexts and neither is really preferred in the literature. There should be no confusion. We’ve already proven the existence and uniqueness of the Frobenius automorphism (Lemma 1.8.2) and in Proposition 1.8.3 we gave some nice properties, which we recall here: L/K L/K (i) For all σ ∈ G, = σ σ−1. σ(P) P
(ii) FrobL/K (P) has order f = [OL/P : OK /p] in G. L/K (iii) p splits completely in L ⇐⇒ = 1 for any prime P lying over p. P
Note that (i) means that in general, the set {FrobL/K (P) | P ⊂ OL divides p} is a conjugacy class in G. If L/K is abelian, this represents a single element of G which we denote with L/K or Frob (p). p L/K It will be useful to know how the Frobenius automorphism behaves in towers. Suppose L ⊃ E ⊃ K and denote P ∩ E by pE. If p = P ∩ K is unramified in L, pE is clearly also unramified in L so there is a Frobenius automorphism FrobL/E(P) which relates to FrobL/K (P) by the next few results. L/K f0 L/E Proposition 2.2.1. Let f = f(p | p). Then = . 0 E P P Proof. The residue fields are related in the following way:
OL/P ⊃ OE/pE ⊃ OK /p
f f0 0 and they have orders q , q and q, respectively. Consider G = Gal(`/ε), where ` = OL/P qf0 and ε = OE/pE. This group is generated by the automorphism x 7→ x which is the f0th power of the generator of Gal(`/k). The proposition then follows from the definitions of the Frobenius automorphisms. Proposition 2.2.2. Suppose L ⊃ E ⊃ K is a tower of fields so that L/K is abelian and E/K is normal. Let m be a modulus on K and let mE denote the modulus of E defined by the primes lying over each p | m. Then the following diagram commutes:
Frob (·) m L/K IK Gal(L/K) σ
σ|E mE IE Gal(E/K) FrobE/K (·)
66 2.2 Frobenius Automorphisms and the Artin Map 2 Class Field Theory
Proof. Let P ∈ OL and set pE = P ∩ E. Since E/K is normal, FrobE/K (pE) is defined. To show the diagram commutes, it suffices to prove that the restriction of FrobL/K (P) to E is q q exactly FrobE/K (pE). For any α ∈ OE, σ(α) ≡ α mod P if and only if σ(α) ≡ α mod pE since pE = P ∩ E is fixed by all of G when E/K is normal. Therefore
FrobL/K (P) E = FrobE/K (pE).
Corollary 2.2.3. Suppose E1 and E2 are normal extensions of K and L = E1E2. Define p1 = P ∩ E1 and p2 = P ∩ E2 so that their Frobenius elements are all defined. Then the homomorphism
Gal(L/K) −→ Gal(E1/K) × Gal(E2/K)
σ 7−→ (σ |E1 , σ |E2 ) is one-to-one and therefore L/K E /K E /K = 1 × 2 . P p1 p2 Proof. The previous proposition shows that the map is a well-defined homomorphism. Then the fact that p splits completely in L ⇐⇒ p splits completely in E1 and E2 proves the map is one-to-one. Let’s take a look at Frobenius automorphisms in our favourite example. K/ Example 2.2.4. Let K = (i) and take any prime integer p. Since K/ is abelian, Q Q Q p represents a single element. We claim that ( K/ complex conjugation if p ≡ 3 (mod 4) Q = p 1 if p ≡ 1 (mod 4).
To prove this, first let p ≡ 3 (mod 4). Then p remains prime in Q(i) and the residue fields are given by ` = Z[i]/pZ[i] = Fp2 and k = Z/pZ = Fp. The Frobenius element for p in `/k must be x 7→ xp:
(a + bi)p = ap + bpip ≡ a − bi (mod p).
So the Frobenius element of any prime p ≡ 3 (mod 4) is complex conjugation. On the other hand, recall that if p ≡ 1 (mod 4), (p) splits completely in Q(i). If pZ[i] = p1p2, these prime ideals must be complex conjugates. Then we have
Z[i]/p1 = Z[i]/p2 = Fp and Z/pZ = Fp so the Frobenius automorphism is the identity.
67 2.2 Frobenius Automorphisms and the Artin Map 2 Class Field Theory
Next we describe Frobenius automorphisms in general cyclotomic extensions. 2πi/n ∼ Example 2.2.5. Let K = Q(ζn) where ζn = e for some n ≥ 2. Then Gal(L/K) = × × k (Z/nZ) via the automorphism identifying [k] ∈ (Z/nZ) with the map ζn 7→ ζn. For a prime p - n, this implies that K/ Q = (ζ 7→ ζp) ←→ p (mod n). p n n
In particular, this implies that (p) splits completely in Q(ζn) if and only if p ≡ 1 (mod n). For the rest of the section, we focus on setting up the right conditions for a generalization of the Artin map. The definition is simpler when it is a map on unramified primes of OK so we need a way to restrict to these primes.
Definition. For a number field K, let IK be the group of fractional OK -ideals and let S be S a finite set of primes in OK . Then IK is defined to be the subgroup of IK generated by those prime ideals which are not in S. In practice we will take S to be the set of primes that ramify in an extension L/K. For this choice of S, we define
Definition. Suppose L/K is abelian and let S = {primes p ⊂ OK | p ramifies in L} so S that IK is generated by the unramified primes in OK . Define the Artin map to be the homomorphism
S ϕL/K : IK −→ G = Gal(L/K) e Y L/K i a 7−→ pi pi
Q ei where a is a fractional ideal with prime factorization a = pi . Since L/K is abelian, this map is well-defined. We will later (Section 2.10) generalize the Artin map to non-abelian extensions. Suppose E is a finite extension of K. Then EL/E is an abelian extension whose Galois S group, say H, is a subgroup of Gal(L/K) when we restrict elements of H to L. Let IE denote the subgroup of IE generated by primes in OE that do not lie over any prime in S. Note S S that this is equivalent to saying IE is generated by the primes of OE which have norm in IK . Proposition 2.2.6. Let G = Gal(L/K) and H = Gal(EL/E). Then restricting H to L S gives us ϕEL/E = ϕL/K NE/K on IE.
Proof. Let P ⊂ OEL be prime and let PE = P ∩ E, PL = P ∩ L and p = P ∩ K. Then f q := NK/Q(p) is a prime power and NE/K (PE) = p . Let σ = FrobEL/E(PE). Then for qf each α ∈ OEL we have σ(α) ≡ α mod P. Recall that σ(P) = P and σ(PL) = PL. Let τ = FrobL/K (p). Then when α ∈ OL we have
q f qf τ(α) ≡ α mod PL =⇒ τ (α) ≡ α mod PL Since the Frobenius automorphism is unique, τ f = σ on L. This proves the property for all S S primes in IE and since they generate IE we’re done.
68 2.3 Ray Class Groups 2 Class Field Theory
S Corollary 2.2.7. Let ϕ be the Artin map in an extension L/K. Then NL/K (IL ) ⊆ ker ϕ.
Proof. Let E = L and apply Proposition 2.2.6 to obtain ϕL/K NL/K = ϕL/L = 1.
From this we obtain a nice description of ϕ for any abelian extension K of Q. Theorem 2.2.8. Let K/Q and let S be the set of prime ideals containing (m) for some positive integer m. Then the Artin map ϕ : IS → Gal(K/ ) is surjective with Q Q n a o ker ϕ = fractional ideals : a ≡ b (mod m) . b Proof. See III.3.3 of [15]. Surjectivity of ϕ will follow from the Frobenius Density Theorem in Section 2.5. When L/K is not an abelian extension, a description of the Artin map becomes more difficult. For this reason many theorems in class field theory are complicated to state. It is our goal in the next few sections to provide a glimpse of some of the constructions required to prove a more general description of the Artin map.
2.3 Ray Class Groups
In this section we generalize the class group from Chapter 1. Definition. A modulus m is a formal product of places of K: Y m = pn(p). p This product is taken over all places of K, and the n(p) are nonnegative integers subject to the following conditions: (1) If p is finite then n(p) ≥ 0 and only finitely many of these are nonzero. (2) If p is a real infinite prime, n(p) = 0 or 1. (3) If p is a complex infinite prime, n(p) = 0.
It is common to write a modulus as m = m0m∞ where m0 denotes the product of all finite primes with positive exponent and m∞ denotes the product of the real primes in m. In this way m0 may be realized as an integral ideal in OK . Fix a place p of K and take α ∈ K∗. If p is a real infinite place, we say α ≡ 1 mod p n(p) if αp > 0. Otherwise α 6≡ 1 mod p. If p is finite, we say α ≡ 1 mod p if α is in the valuation ring corresponding to p and α−1 ∈ pn(p). We can extend this notion of congruence for elements of K∗ to any modulus m by α ≡ 1 mod m if and only if α ≡ 1 mod pn(p) for all primes with n(p) > 0. Definition. For a modulus m of a number field K, define the following subgroups of K∗:
a Km = b | a, b ∈ OK and aOK , bOK are relatively prime to m0 Km,1 = {α ∈ Km | α ≡ 1 mod m}.
69 2.3 Ray Class Groups 2 Class Field Theory
S S Let IK be as in the last section; that is, for any set of primes S, IK is the subgroup of IK generated by primes outside S. We define a special case of this for moduli of K.
Definition. Let S be the set of primes dividing m0 for some modulus m. Then we denote S m the subgroup IK ≤ IK by I . ∗ There is a natural inclusion i : K → IK given by α 7→ (α); we denote the image of Km,1 under this map by PK (m, 1) := i(Km,1). This allows us to define
m Definition. The ray class group of a modulus m is CK (m) = I /PK (m, 1). The cosets of PK (m, 1) in this quotient are referred to as ray classes mod m.
Example 2.3.1. If m = 1 then PK (m, 1) is just the subgroup of principal ideals and thus CK (m) is the full ideal class group C(OK ). Y Example 2.3.2. If m = ν then CK (m) = IK /{(a): |a|ν > 0 for all real ν} is called the ν real narrow class group of K.
3 2 3 2 Example 2.3.3. Let m = (2) (17) (19) · ∞, a modulus of Q. Then m0 = (2) (17) (19) so Qm,1 consists of all x ∈ Q satisfying x > 0 x ≡ 1 mod 23 x ≡ 1 mod 172 x ≡ 1 mod 19.
a For example, if x = b for a, b ∈ Z and b 6= 0 then the condition at the place 2 tells us a and b are odd and ab−1 ≡ 1 mod 8. This looks similar to the Chinese remainder theorem. The connection is made clear in the weak approximation theorem.
Theorem 2.3.4 (Weak Approximation). Let | · |1,..., | · |n be inequivalent, nontrivial val- ∗ uations on K and let β1, . . . , βn ∈ K . Then for any ε > 0 there exists an element α ∈ K such that |α − βi|i < ε for all i = 1, . . . , n.
Proof. Following the proof in [15], we first prove the existence of elements y1, . . . , yn in K that satisfy |yi|i > 1 and |yi|j < 1 for all i 6= j. We do this by inducting on n. The base case n = 2 is proven by the negation of the definition of equivalent valuations. Now suppose there is an element y ∈ K satisfying
|y|1 > 1 and |y|j < 1 for j = 2, . . . , n − 1.
By the base case there exists some t ∈ K such that |t|1 > 1 and |t|n < 1. Now choose y1 according to y if |y| < 1 n r y1 = y t if |y|n = 1 yrt 1+yr if |y|n > 1
70 2.3 Ray Class Groups 2 Class Field Theory
r for a real number r yet to be chosen. If |y|n = 1 then |y1|j = |y|j |t|j for all 2 ≤ j ≤ n. Thus for sufficiently large r, |y1|j < 1 for all 2 ≤ j ≤ n. In the case that |y|n > 1, note that
r |y |j 1 r < −r −→ 0 as r → ∞. |1 + y |j |y |j
In all cases we have y1 ∈ K that is “large” at | · |1 and “small” at the other valuations. We could have picked another valuation to start with, so the same proof produces y2, . . . , yn with the desired properties. Now let n X yr α = i β . 1 + yr i i=1 i We claim that α is the element prescribed by the theorem when r is chosen appropriately. By the triangle inequality, r βi X yj |α − βi|i ≤ + βj . 1 + yr 1 + yr i i j6=i j i For any ε > 0 we can choose r large enough so that both terms on the right are less than ε. This completes the proof.
Remark. When p is an infinite place of K, the statement |α − β|p < ε for small ε > 0 is α v(α) equivalent to β > 0, i.e. α ≡ β mod p. When p is a finite place, recall that |α|p = c p for some real number c, 0 < c < 1. Then we see that |α − β|p < ε is equivalent to ε α 0 β − 1 < =: ε . p |β|p
0 0 n α α In turn when ε is small, say ε < c for some n, then v β − 1 > 1 which means β − 1 is in the valuation ring for p. Recall that this is the same as saying α ≡ β mod pn. So in n general we see that |α − β|p < ε is equivalent to α ≡ β mod p for a sufficiently large n. As suggested in Example 2.3.3, the reformulation of the weak approximation theorem (2.3.4) in terms of congruences allows us to view it as a generalization of the Chinese remainder theorem. The weak approximation theorem and this remark allow us to prove Theorem 2.3.5. For every modulus m of K, there is an exact sequence
0 → UK /Um,1 → Km/Km,1 → CK (m) → C(OK ) → 0 and isomorphisms
∼ Y Y n(p) × ∼ Y × Km/Km,1 = {±1} × (OK /p ) = {±1} × (OK /m0)
p real p|m0 p real p|m p|m where Um,1 = UK ∩ Km,1.
71 2.3 Ray Class Groups 2 Class Field Theory
m Proof. First, the inclusion I ,→ IK induces a homomorphism CK (m) → C(OK ). Consider the sequence m 0 → UK → Km → I → C(OK ) → 0. m We will show that it is exact. In particular, to show I → C(OK ) is surjective, we must prove that every ideal class is represented by an ideal in Im. Let a be a fractional ideal; we may write a = bc−1 where b and c are integral ideals. For any c ∈ c, a · (c) = bc−1(c) is integral so we may assume a is integral in the first place. Write Y a = pn(p)b p|m
m 2 where b ∈ I . For each p | m, choose πp ∈ p r p such that πp ≡ 1 mod p. By the weak n(p) n(p)+1 approximation theorem (2.3.4), there is some a ∈ OK so that a ≡ πp mod p for all p | m. This means we can write Y (a) = pn(p)b0 where b0 ∈ Im p|m
−1 m m but then a a ∈ I and this belongs to the same ideal class as a. Hence I → C(OK ) is m surjective. Next, if a ∈ I maps to the trivial class in C(OK ) then a = (α) for some α ∈ Km and this α is uniquely determined up to multiplication by a unit u ∈ UK . This implies exactness of the rest of the sequence. f g m Now consider the maps Km,1 −→ Km −→ I . By the work above, ker g = UK and coker g = C(OK ). By definition, coker(g ◦ f) = CK (m) and ker(g ◦ f) = Km,1 ∩ UK = Um,1. Finally, f is injective by the definitions of Km and Km,1. Hence by Lemma A.2.2 we have an exact sequence 0 → Um,1 → UK → Km/Km,1 → CK (m) → C(OK ) → 0.
Next we prove the isomorphisms. Let p | m. If p is an infinite prime we map α ∈ Km to the sign (+ or −) of the image of α under the embedding (·)p : K,→ C. If p is finite, we −1 n(p) × map α to [a][b] ∈ (OK /p ) where a, b ∈ OK such that a ≡ b ≡ 1 mod m0. Since a and b are in particular relatively prime to p, it makes sense to define their equivalence classes n(p) × and take inverses in (OK /p ) . Consider the map we have defined:
Y Y n(p) × ϕ : Km −→ {±} × (OK /p ) .
p real p|m0 By the weak approximation theorem (2.3.4) and subsequent remark, ϕ is surjective. More- over, its kernel is Km,1 by the way this subroup is defined. This shows the first isomorphism, and the second is easily concluded from the Chinese remainder theorem.
Corollary 2.3.6. The ray class group CK (m) for any modulus m is a finite group of order
r0 hK 2 N(m0) Y 1 hm = 1 − [UK : Um,1] N(p) p|m0 where r0 is the number of real primes dividing m.
72 2.3 Ray Class Groups 2 Class Field Theory
n n Proof. First, OK /p is a local ring with maximal ideal p/p ; this can be seen by the cor- respondence between its ideals and the ideals of OK containing p. Moreover, the units n n n × in OK /p are precisely those elements not in p/p . It follows that (OK /p ) has order n−1 q (q − 1) where q = N(p) = [OK : p]. Then by Theorem 2.3.5,
|CK (m)| = |(Km/Km,1)/(UK /Um,1)| · |C(OK )|
Y Y n(p) × −1 = {±1} × (OK /p ) [UK : Um,1] · hK
p real p|m0
r0 −1 Y n(p)−1 = hK 2 [UK : Um,1] N(p) (N(p) − 1).
p|m0
Furthermore, this expression is equal to the desired one when we factor out N(m0) from the product on the right, using that N is multiplicative.
The most important implication of Corollary 2.3.6 is that every ray class group CK (m) is finite. Let’s take a look at some examples.
Example 2.3.7. For K = Q, the narrow class group is trivial. √ Example 2.3.8. Let K = Q( n) for n > 0. Here there are two real primes and UK = m ∼ {±ε } = Z/2Z × Z for a fundamental unit ε. Letε ¯ be the conjugate of ε. Then ( 2hK if ε, ε¯ have the same sign hm = hK otherwise.
Also note that N(ε) = −1 if and only if ε andε ¯ have different signs. For the first few values of n we have
n hK ε√ N(ε) 2 1 1 + √2 −1 3 1 2 +√ 3 1 5 1 (1 + √5)/2 −1 6 1 5 + 2 6 1 √ √ so we see that the narrow class numbers for Q( 3) and Q( 6) are 2, whereas the others have narrow class number 1. Example 2.3.9. Let’s look at the important example of cyclotomic extensions. Let L = 2πi/m Q(ζm) where ζm = e for m > 2. Define the modulus m = (m)∞ on L. We claim that all ramified primes of L divide m. The minimal polynomial of ζm over Q is well known: it is the mth cyclotomic polynomial Φm(x). These polynomials are constructed by setting Φ1(x) = x − 1 and recursively defining xm − 1 Φm(x) = Y . Φd(x) d|m d 73 2.3 Ray Class Groups 2 Class Field Theory m The relevant property we will use is that Φm(x) is a factor of x − 1. For a prime p, m m m−1 consider x − 1 over the finite field Fp. Since the formal derivative of x − 1 is mx , these polynomials are relatively prime unless m = 0 in Fp, i.e. p | m. In particular this shows that m if p - m, x − 1 is separable mod p and so are all of its irreducible factors, namely Φm(x). Hence by Theorem 1.3.4, p is unramified in L. This allows us to consider the Artin map ϕ : Im → Gal(L/ ) ∼ ( /m )×. L/Q Q Q = Z Z We know from Section 2.2 that in any abelian extension L/K, the Artin map takes a m q prime p ∈ IK to the Frobenius automorphism x 7→ x where q = |OK /p|. In this example K = Q and L = Q(ζm) so OK = Z and p = (p) for a prime integer p. The isomorphism ∼ × k Gal(L/Q) = (Z/mZ) is exhibited by (σ : ζm 7→ ζm) 7→ [k]. Using this description, we can Q sp Q tr extend ϕL/Q to all fractional ideals. If a = p and b = r then we should have −1