MA3A6 Algebraic Number Theory
David Loeffler
Term 2, 2014–15 Chapter 0
Introduction
Lecture 1 0.1 What is this module about?
This is a module about algebraic number fields. An algebraic number field is a special kind of field, which contains the rational numbers Q, but is a little bit bigger. We’ll give a formal definition soon enough, but a good example to bear in mind is the Gaussian field
Q(i) = {a + bi : a, b ∈ Q}, which comes with its subring of Gaussian integers
Z[i] = {a + bi : a, b ∈ Z}.
Exercise. Why is the Gaussian field a field? (Most of the axioms are straightforward, but why is it closed under inverses?)
In Algebra 2 you saw that Z[i] was a unique factorization domain, and you used this to show that any prime number p = 1 mod 4 could be written as the sum of two squares,
p = x2 + y2.
So rings like Z[i] have some interesting structure; and they tell us new things about Z.
0.2 Logistics
• There will be 4 problem sheets, which will be distributed as we go along. These count for 15% of your grade. The deadlines will be – Sheet 1: distributed Thursday, week 2; deadline 3pm Monday, week 4. – Sheet 2: distributed Thursday, week 4; deadline 3pm Monday, week 6. – Sheet 3: distributed Thursday, week 6; deadline 3pm Monday, week 8. – Sheet 4: distributed Thursday, week 8; deadline 3pm Monday, week 10. • Weekly office hour: Tuesdays 13.30–14.30, Zeeman B1.25. • Support classes with Heline Deconinck: Fridays 11–12, MS.04, from week 2 onwards.
1 • Books: see list on Undergraduate Handbook page. The main reference is Stewart & Tall, which is also probably the friendliest of the books on the list; Swinnerton-Dyer’s book is harder going, but was the book which inspired me to become a number theorist. • Most of you have done Galois theory, and about half of you are doing Commutative Algebra.
2 Chapter 1
Algebraic number fields
1.1 Extensions of fields
Notation 1.1.1. Let K and L be fields. If K is a subfield of L, we say L is a field extension of K, and we write L | K.
For instance, C | Q is a field extension, as is C | R.
Definition 1.1.2. Let L | K be a field extension, and let α ∈ L. We say α is algebraic over K if there exists a nonzero polynomial g ∈ K[X] such that g(α) = 0. Example 1.1.3. In the extension C | R, the element iπ is algebraic over R (it’s a root of X2 + π2). However, it is not algebraic over Q. Proposition 1.1.4. Let α be algebraic over K. Then there is a unique polynomial f ∈ K[X] such that f (α) = 0 and f is irreducible and monic (its leading coefficient is 1). We call this the minimal polynomial of f over K.
Proof. Recall from Algebra 2 the concept of an ideal and a principal ideal. The set I ⊂ K[X] of polynomials g such that g(α) = 0 is an ideal of K[X]; the ring K[X] is a Euclidean domain, so every ideal of this ring is principal, i.e. consists of the multiples of some polynomial f (which we can assume is monic, by multiplying it by an element of K× if necessary). To see that f is irreducible, we suppose that we can write f = gh. Then g(α)h(α) = 0; since L is a field, we must have either g(α) = 0 or h(α) = 0, and thus at least one of g and h is in I. So f divides one of g and h, WLOG g. Since g also divides f , we have deg(g) = deg( f ) and hence h is constant. Thus f is irreducible. Remark. For Commutative Algebra students: a slightly posher way of stating the last part is that I is the kernel of the homomorphism K[X] → L g 7→ g(α). L is an integral domain (being a field); the kernel of a homomorphism to an integral domain is a prime ideal; and a generator of a principal prime ideal is a prime element, and hence must be irreducible.
Definition 1.1.5. Let L | K be an extension. We say L | K is algebraic if every α ∈ L is algebraic over K. We say L | K is finite if L has finite dimension as a K-vector space. Example 1.1.6. The extension C | R is finite (of degree 2), since {1, i} is a basis of C over R. It is also algebraic, because every a + bi ∈ C satisfies the polynomial (X − a)2 + b2 = X2 − 2aX + (a2 + b2) ∈ R[X].
Notation 1.1.7. If L | K is finite, we define the degree [L : K] to be the dimension of L as a K-vector space.
3 √ Example 1.1.8. Let α = i + 2 ∈ C. But α is also algebraic over Q: we have √ √ α − 2 = i ⇒ α2 − 2 2α + 2 = −1 √ ⇒ α2 + 3 = 2 2α ⇒ (α2 + 3)2 = 8α2 ⇒ α4 − 2α2 + 9 = 0.
We’ll see later that X4 − 2X2 + 9 is irreducible in Q[X], so it is the minimal polynomial of α over Q. Lecture √ 2 On the other hand, the minimal polynomial of α over R is X2 − 2 2X + 3, by the previous example. This shows that the minimal polynomial of α over K really depends on which K we use! Remark. I forgot to point out in the last lecture that in Proposition 1.1.4, the minimal polynomial f of α over K has the property that any polynomial g ∈ K[X] such that g(α) = 0 is necessarily a multiple of f . This is clear from the proof. We’ll use this fact a lot, so make sure it’s in your notes! Proposition 1.1.9. Let L | K be a field extension. An element α ∈ L is algebraic over K if and only if there exists a finite extension of K inside L which contains α.
(In particular, any finite extension is algebraic, and any algebraic extension is a union of finite extensions. There are algebraic extensions which aren’t finite, as we’ll see later.)
Proof. Firstly, let’s prove the “if” part. It suffices to show that if L | K is a finite extension and α ∈ L, then α is algebraic. Suppose [L : K] = d < ∞. Then the powers 1, α, α2, ... , αd are d + 1 elements of a d-dimensional vector space over K, so they must be linearly dependent: that is, we can find elements c0, ... , cd of K, not all zero, such that d c0 + c1α + ··· + cdα = 0. i Thus α satisfies the non-zero polynomial g(X) = ∑ ciX ∈ K[X] of degree ≤ d. Thus α is algebraic over K. The “only if” part is a little harder. Let f be the minimal polynomial of α over K, and d its degree. We will show that the K-subspace M of L spanned by the powers of α is d-dimensional over L, with basis S = {1, ... , αd−1}, and is a subfield of L. Since S is a finite set, this shows that M is a finite field extension of K inside L which contains α. Claim 1: M is a subring of L. N By definition, M is exactly the elements of L which are of the form a0 + a1α + ··· + aNα for some α0, ... , αN ∈ K; that is, L is the image of the ring homomorphism K[X] → L given by mapping g to g(α). But the image of a ring homomorphism is always a subring (Algebra 2). Claim 2: M is spanned by S. By the division algorithm for polynomials, for each g ∈ K[X] we can write g(X) = a(X) f (X) + b(X) where a, b ∈ K[X] and deg(b) ≤ d − 1. But this implies that
g(α) = a(α) f (α) + b(α) = b(α),
since f (α) = 0. As b has degree ≤ d − 1, b(α) is a K-linear combination of the elements of S. Claim 3: M is closed under taking inverses of nonzero elements. This is the most difficult bit! There are many possible proofs, but here’s one. We know by this stage that M is finite-dimensional over K. Let x ∈ M be non-zero, and consider the map mx : M → M given by mx(y) = xy. (This is called the “multiplication-by-x map”).
I claim that mx is injective. If not, there would be some nonzero y such that xy = 0; but this equality takes place inside L, which is a field, so either x = 0 or y = 0, which is a contradiction.
4 By the rank–nullity theorem, it follows that mx is surjective. In particular, 1 ∈ image(mx), which shows that 1/x ∈ M. This concludes the proof that M is a field.
As a by-product of the proof of the “only if” part, we get two interesting pieces of information. Corollary 1.1.10. (i) An element α ∈ L is algebraic over K if and only if the powers of α span a finite-dimensional K-subspace of L. (ii) If α is algebraic over L, then there is a unique smallest extension of K in L which contains α, namely the K-subspace spanned by the powers of α; and this has a K-basis 1, α, ... , αd−1, where d is the degree of α over K.
Proof. The only thing we have left to check is that if α has degree d over K, the set S = {1, ... , αd−1} is linearly independent over K. Suppose S is linearly dependent. Then there are c0, ... , cd−1 ∈ K, not all zero, d−1 i such that c0 + c1α + ··· + cd−1α = 0; in other words, g(α) = 0 where g is the polynomial ∑ ciX , whose degree is ≤ d − 1. But this implies g must be divisible by the minimal polynomial f of α over K, which is impossible, since f has degree d. √ 2 Example 1.1.11. It’s clear that√ 2 is algebraic over Q, and its minimal polynomial is X − 2. Thus the smallest extension of Q containing 2 is the field √ {a + b 2 : a, b ∈ Q}. Fact 1.1.12. For any extension L | K and α ∈ L, there’s always a unique smallest extension of K inside L containing α (whether or not α is algebraic). We denote this smallest extension by K(α). So Proposition 1.1.9 shows that α is algebraic over K if and only if [K(α) : K] < ∞. We’ll occasionally have to consider stacking field extensions on top of each other: if we have three fields K, L, M with K ⊆ L ⊆ M, then we have three field extensions, L | K, M | L, and M | K. Proposition 1.1.13 (Tower law). The extension M | K is finite if and only if L | K and M | L are both finite, and in this case, we have [M : K] = [M : L][L : K].
Proof. Suppose [M : L] = r and [L : K] = s are finite. Then let `1, ... , `r be a K-basis of L and let m1, ... , ms be an L-basis of M. It’s easy to see that {`imj : 1 ≤ i ≤ r, 1 ≤ j ≤ s} is a K-basis of M, so [M : K] = rs and in particular M | K is a finite extension. Conversely, if [M : K] is finite, then L is a K-vector subspace of a finite-dimensional K-vector space, hence is itself finite-dimensional over K, so L | K is finite; and any set spanning M as a K-vector space certainly spans M as an L-vector space, so M | L is also finite. √ Example 1.1.14. Consider the field Q(α), where α = 2 + i, as in Example 1.1.8. We know that [Q(α) : Q] ≤ 4, since we have written down a polynomial of degree 4 that α satisfies. 2 √ √ α +3 = Q( ) Q( ) Q( ) On√ the other hand, 2α 2, so 2 is a subfield√ of α . We√ know that α must be bigger than Q( 2) (since α isn’t in R), and thus both [Q(α) : Q( 2)] and [Q( 2) : Q] are ≥ 2. Hence √ √ [Q(α) : Q] = [Q(α) : Q( 2)][Q( 2) : Q] ≥ 4 by the tower law. So the degree is exactly 4. √ √ Moreover, by√ Proposition 1.1.9, we√ know that {1, 2} is a basis√ of Q( 2) over Q, and {1, i} is a basis of Q(α) over√ Q( √2) (since i = α − 2 is in Q(α) but not in Q( 2)). So, by the proof of the tower law, we see that {1, 2, i, i 2} is a basis of Q(α) over Q.
5 1.2 Algebraic numbers and number fields Lecture Definition 1.2.1. An algebraic number is a complex number α ∈ C which is algebraic over Q: that is, there exists a 3 non-zero polynomial g ∈ Q[X] such that g(α) = 0. We write A for the set of all algebraic numbers (so A ⊂ C). √ Example 1.2.2. Any rational number√ α is algebraic (it’s a root of f (X) = X − α). The numbers i, 3, etc are algebraic; and we saw above that 2 + i was algebraic, although this took a bit of work to show. Remark. We’ll see in the next section that A is a field, so in particular the sum of any two algebraic numbers is always algebraic; but we’ll need to develop a bit more theory first. Definition 1.2.3. An algebraic number field, or just a number field, is a subfield of C which is finite as an extension of Q.
Exercise. Can you see why every subfield of C must automatically contain Q?
As a special case of Proposition 1.1.9, we see that α ∈ C is algebraic if and only if Q(α) is a number field. This gives us a massive supply of number fields: if we take any irreducible polynomial f ∈ Q[X], then we can find a root α of f in C (by the Fundamental Theorem of Algebra), and then Q(α) will be a number field. Example 1.2.4 (Quadratic fields). Let√ d be a non-square in Q. Then there are exactly two square roots of d in C; choose one of them and call it d (it doesn’t matter which we choose). Then the field √ √ Q( d) = {a + b d : a, b ∈ Q}
is a number field, of degree 2 over Q.
These are called quadratic fields and they’re some of the simplest number fields; we’ll use them as one of our main sources of examples. √ √ √ Of course there is some redundancy here: the fields Q( 2), Q( 8) and Q( 18) are the same. Let’s say an integer d is square-free if it is not divisible by m2 for any integer m > 1. (Thus 1 is squarefree, but 0 is not.) √ Proposition 1.2.5. Any number field K such that [K : Q] = 2 is equal to Q( d) for a unique square-free integer d 6= 1.
Proof. Let K be a number field of degree 2, and let α ∈ K be such that α ∈/ Q. Then {1, α} must be a basis 2 = + ∈ − of K, so we have α xα y for some x, y Q. Replacing α with√ α x/2, which doesn’t change the field generated by α, we can assume that α2 = y; so K is the field Q( y) for some rational number y.
n1 nr Let us factorize y into prime powers, y = ±p1 ... pr (where some of the nr may be negative). Replacing α −n1/2 (1−n1)/2 with p1 α if n1 is even, and with p1 if n1 is odd, and similarly for the other factors, we may arrange that y is a square-free integer d. If we end up with d = 1 then this is a contradiction, since this forces α to be ±1, contradicting the assumption that α 6= Q. √ √ We still need to check that the fields Q( d1) and Q( d2) are different if d1 and d2 are distinct squarefree integers. This is left as an exercise (see coursework #1).
1.3 Extensions of number fields
We defined number fields as finite extensions of Q, and this gave us a bunch of new and interesting fields. We might expect to get even more new fields by taking finite extensions of number fields; but we don’t get anything new if we do this.
6 Proposition 1.3.1. Let K be a number field, and let α ∈ C. Suppose α is algebraic over K. Then K(α) is a number field, and in particular α ∈ A.
Proof. Applying Proposition 1.1.9, we see that K(α) is a finite extension of K; but K is also finite as an extension of Q. The tower law now shows that K(α) | Q is a finite extension. Thus K(α) is a number field.
Notation 1.3.2. Let L | K be a field extension. For a finite set S = {a1, a2, ..., an} ⊂ L, we denote by K(S) = K(a1, a2,..., an) the smallest extension of K inside L that contains S. √ √ Example 1.3.3. The field Q( 2, i) is the smallest√ extension of√Q inside C that contains i and 2. The field Q(α)√from Example 1.1.14√ contains i and 2, so Q√(α) ⊇ Q(i, 2); on the other hand, any field containing i and 2 must contain α = 2 + i, so Q(α) = Q(i, 2). Corollary 1.3.4. If S is any finite set of algebraic numbers, then Q(S) is a number field.
Proof. We will show, by induction on n, that if S is any set of algebraic numbers with #S = n, then Q(S) is a number field. For n = 0 this is trivial (Q is a number field). So let us assume it is true for n − 1. Write S = {a1, ... , an}. We have Q(S) = Q(a1, ... , an) = K(an), where K is the field Q(a1, ... , an−1). By the induction hypothesis, K is a number field. Since an is algebraic over Q it is certainly algebraic over K, so, by the previous proposition, K(an) = Q(S) is a number field. So the induction hypothesis holds for n and we are done. √ We can now show that all the hard work we had to do in Example 1.1.8, to prove that 2 + i was algebraic, has been washed away by the rising sea of theory! Theorem 1.3.5. The set A of algebraic numbers is a field.
Proof. We need to show that A contains 0 and 1 (easy), and is closed under addition, multiplication, and inversion of non-zero elements. If α ∈ A is nonzero, then Q(α) is a number field and 1/α ∈ Q(α), so 1/α ∈ A. d i (Exercise: If fα(X) = ∑i=0 ciX is the minimal polynomial of α over Q, write down explicitly a nonzero polynomial over Q satisfied by 1/α.) Now let α, β ∈ A. By the previous corollary, Q(α, β) is a number field, so Q(α, β) ⊂ A. However, Q(α, β) obviously contains α + β and αβ so we are done.
Remark. Note that A is not itself a number field (why?) Lecture 4 1.4 Interlude: Number fields and matrices
Recall from the proof of Proposition 1.1.9 that if K is a number field, and α ∈ K, then we can associate to α a linear operator mα : K → K.
If we choose a basis of K as a Q-vector space, we can write mα as a matrix. √ √ √ Example 1.4.1. Let K = Q( d) be a quadratic field. Then {1, d} is a basis of K. If we take α = a + b d, then we have √ mα(1) = a + b d √ √ mα( d) = bd + a d
7 a bd so the matrix of m is . α b a
It turns out that lots of useful algebraic information about α is encapsulated in the operator mα. Proposition 1.4.2. Let K be a number field.
(i) The map α 7→ mα is an injective Q-linear map, and a ring homomorphism, from K to the ring of Q-linear operators on K.
(ii) If g is the characteristic polynomial of mα, then g(α) = 0.
Proof. Part (i) is obvious, so we give the proof of part (ii).
We know that g(mα) is the zero matrix by the Cayley–Hamilton theorem. However, for any polynomial h ∈ Q[X], we have h(mα) = mh(α) by part (i). So mg(α) is the zero linear operator; but by injectivity this forces g(α) = 0. Example 1.4.3. Let K = Q(θ) where θ is the unique real root of f (X) = X3 + X + 1. Then {1, θ, θ2} is a basis of K over Q. Let’s let α = 1 + 3θ2 and calculate the matrix of α. We have
α · 1 = 1 + 3θ2 α · θ = θ + 3θ3 = −3 − 2θ α · θ2 = −3θ − 2θ2
so the matrix of mα in this basis is 1 −3 0 0 −2 −3 3 0 −2 Hence α satisfies the characteristic polynomial of this matrix, which is X3 + 3X2 − 31. 1 We can also use this method to calculate α : we have 4 −6 9 1 m1/α = 1/mα = 31 −9 −2 3 6 −9 −2
1 2 and the first column of this shows that 1/α = m1/α(1) = 31 (4 − 9θ + 6θ ). Remark. Some textbooks refer to the characteristic polynomial of mα as the field polynomial of α√. Notice that unlike the√ minimal polynomial, it really√ depends on the field K, e.g. the field polynomials of 2 as an element of Q( 2) and as an element of Q( 2, i) aren’t the same.
1.5 Embeddings
Definition 1.5.1. An embedding of a number field K is a ring homomorphism ϕ : K → C.
Any such homomorphism is necessarily injective, and satisfies ϕ(x) = x for all x ∈ Q. Note that K is by definition a subfield of C, so there is a distinguished identity embedding (sending x to x for all x); but there might be more. √ √ √ For instance, we can embed Q( 3) into C by sending a + b 3 ∈ K to a − b 3 ∈ C. If L | K is an extension of number fields, then any embedding of L restricts to an embedding of K; but different embeddings L → C can give the same embedding K → C.
8 √ √ √ Example 1.5.2. Let√ K be the field Q( 2). Let ϕ : K → C be the embedding a + b 2 7→ a − b 2. Let L be the extension Q(i, 2) of K; as in Example 1.1.14, every element of L can be written uniquely as √ √ a + b 2 + ci + di 2
for some a, b, c, d ∈ Q.
There are two embeddings Φ1, Φ2 of L which restrict to ϕ, given by √ √ √ √ Φ (a + b 2 + ci + di 2) = a − b 2 + ci − di 2, 1 √ √ √ √ Φ2(a + b 2 + ci + d 2) = a − b 2 − ci + di 2.
There’s a close link between embeddings of K, and roots of the minimal polynomials of elements of K. We’ll need a preliminary lemma: Lemma 1.5.3 (Separability Lemma). Let K be a number field, let f ∈ K[X] be an irreducible polynomial of degree d ≥ 1, and let ϕ be an embedding of K. Let ϕ( f ) ∈ C[X] be the polynomial obtained by applying ϕ to the coefficients of f . Then ϕ( f ) has d distinct roots in C.
Proof. Replacing K with its image under ϕ, which is also a number field, we can assume that ϕ is the identity embedding. The Fundamental Theorem of Algebra tells us that any complex polynomial of degree d has d roots in C counted with multiplicity; so we need to show that f cannot have repeated roots. Let f 0 be the derivative of f , which is also in K[X] and is non-zero. Let h ∈ K[X] be the GCD of f and f 0. Then h has degree ≤ d − 1, but divides f , so h must be a constant. So f cannot have roots in common with f 0. But any repeated root of f is a common root of f and f 0. Remark. We call this the Separability Lemma because it’s related to the concept of “separable extensions” in Galois theory (but we won’t need to know that here).
This now gives us a pretty good handle on embeddings: Proposition 1.5.4. (i) Let L | K be an extension of number fields. For any embedding ϕ of K, there are exactly [L : K] distinct embeddings of L extending ϕ. (ii) Any number field K has [K : Q] embeddings.
Proof. For (i), let us suppose first that L = K(α) for a single element α. Let f be the minimal polynomial of α over K. I claim that the extensions of ϕ to an embedding Φ of L biject with the roots of ϕ( f ) in C.
By Proposition 1.1.9, we know that every ` ∈ L can be written uniquely in the form ` = k0 + k1α + ··· + d−1 kd−1α , for some ki ∈ K, where d = [L : K]. Thus, if Φ is an embedding of L extending ϕ, we must have i Φ(`) = ∑ ϕ(ci)Φ(α) ; thus Φ is uniquely determined by where it sends α. Moreover, we have
(ϕ( f ))(Φ(α)) = Φ( f (α)) = Φ(0) = 0, so Φ(α) must be a root of ϕ( f ). Lecture It remains to show that, for every root ρ of ϕ( f ), there is an embedding sending α to ρ. We define a map Φ 5 d−1 i i by sending ` = ∑i=0 kiα to ∑ ϕ(ki)ρ . This is obviously compatible with addition, but we need to show it is compatible with multiplication. i i i Let ` = ∑ aiα and ` = ∑ biα be elements of L. We can write ` = r(α) and m = s(α) where r = ∑ aiX and i s = ∑ biX are polynomials in K[X] of degree ≤ d − 1. Then `m = t(α) where t is the remainder of rs divided by f . Under the map Φ, we have ` 7→ ϕ(r)(ρ) and m 7→ ϕ(s)(ρ). Hence we have
Φ(`)Φ(m) = ϕ(r)(ρ)ϕ(s)(ρ) = ϕ(rs)(ρ),
9 but Φ(`m) = ϕ(t)(ρ). Since ϕ(t) differs from ϕ(rs) by a multiple of ϕ( f ), and ϕ( f )(ρ) = 0, we have ϕ(rs)(ρ) = ϕ(t)(ρ) as required. This proves (i) when L = K(α). Now let’s prove the general case. It’s clear that we can find a finite set α1, ... , αn such that L = K(α1, ... , αn) (for example, any basis of L as a K-vector space will do). Let Ki = K(α1, ... , αi). Then each embedding of K extends to [K1 : K] embeddings of K1, and these extend to [K2 : K1] embeddings of K2, etc; so the number of embeddings of Kn = L is
[L : Kn−1][Kn−1 : Kn−2] ... [K1 : K] = [L : K]
by the tower law. To prove (ii), we simply apply (i) to the extension K/Q.
Remark. Note that the√ image of an embedding√ √ of K doesn’t always land in K. For instance, there is an embedding of K = Q( 3 2) mapping 3 2 to ω 3 2, where ω = e2πi/3; this isn’t in K, since K is contained in R (and ω isn’t).
From the proof of (i), we see that if α ∈ A, the embeddings of Q(α) biject with the roots in C of the minimal polynomial of α (over Q). These have a special name:
Definition 1.5.5. Let α be an algebraic number, and let fα be its minimal polynomial. Then the roots of fα in C are called the conjugates of α. If ϕ1, ... , ϕd are the embeddings of Q(α) in C, then the conjugates of α are α1 = ϕ1(α),..., αd = ϕd(α).
Proposition 1.5.6. Let α ∈ A, and let α1 = α, α2, ... , αd be the conjugates of α and f its minimal polynomial. Then
d f (X) = ∏(X − αi). i=1
d Proof. We know that f is monic of degree d and the αi are its roots, and the same is true of ∏i=1(X − αi), so the two polynomials must coincide. √ √ √ √ √ √ Example 1.5.7. Let K = Q( 2 + 5), so [K : Q] = 4. The conjugates of 2 + 5 are ± 2 ± 5 and we calculate that √ √ √ √ √ √ √ √ (X − 2 − 5)(X − 2 + 5))(X + 2 − 5)(X + 2 + 5) √ √ = ((X − 2)2 − 5)((X + 2)2 − 5) √ √ = (X2 − 2 2X − 3)(X2 + 2 2X − 3) √ = (X2 − 3)2 − (2 2X)2 = X4 − 14X2 + 9, √ √ which is the minimal polynomial of 2 + 5.
Remark. If K is a number field and α ∈ K, and ϕ1,..., ϕd are the embeddings of K, then
d r ∏(X − αi) = fα(X) i=1 where r = [K : Q(α)]. This follows easily from Prop 1.5.4 and Prop 1.5.6.
10 1.6 Primitive elements √ Corollary 1.3.4 gives us lots of examples of number fields, like Q( 2, i), which aren’t given to us in the form Q(α) for a single α. However, sometimes these fields are “secretly” of this form: for instance, we saw above that √ √ Q( 2, i) = Q( 2 + i).
This is an instance of a more general fact: Theorem 1.6.1 (Primitive element theorem). For any number field K, we can find an element α ∈ K such that K = Q(α) (a “primitive element” for K over Q).
The proof is a little technical but the idea is fairly simple: if we let α be any “sufficiently random” element of K, then α will be a primitive element. We’ll need a lemma first. Lemma 1.6.2. Let K be a number field, and let α ∈ K. If the only embedding ϕ of K such that ϕ(α) = α is the identity embedding, then α is a primitive element (i.e. K = Q(α)).
Proof. Suppose α is not a primitive element. Then Q(α) is a proper subfield of K, and thus e = [Q(α) : Q] is < d. By Proposition 1.5.4, the identity embedding of Q(α) extends to more than one embedding of K, and these all satisfy ϕ(α) = α.
Proof of Theorem 1.6.1. We can certainly find a finite set S such that K = Q(S) (any Q-basis of K will do). So, by induction on the size of S, it is sufficient to show that in any field extension of the form K = Q(α, β) there is a primitive element.
Let f (t), g(t) ∈ Q[t] be the minimal polynomials of α and β over Q, respectively. Let ϕ1, ... , ϕr be the embeddings of Q(α), and ψ1, ... , ψs the embeddings of β, and write αi = ϕi(α), βj = ψj(β). WLOG, α1 = α and β1 = β. Choose c ∈ Q so that α + cβ 6= αi + cβj unless i = j = 1. (1.1) This is possible since Q is infinite and each of the equations
α + cβ = αi + cβj has at most one solution for c. Now let θ = α + cβ; we will show that Q(α, β) = Q(θ). Let ϕ be an embedding of K, and suppose that ϕ(θ) = θ. We know that ϕ(α) must be one of the αi, and ϕ(β) must be one of the βj. By the condition (1.1), this implies that ϕ(α) = α and ϕ(β) = β, so ϕ is the identity on Q(α, β). By the lemma, it follows that Q(α, β) = Q(θ).
1.7 Norm and trace Lecture The last purely field-theoretic topic we’ll cover is to do with ways of passing between elements of K and 6 elements of Q. Recall that if K is a number field and α ∈ K, then multiplication by α defines a linear map mα : K → K. Definition 1.7.1. We define the trace of α by
TrK/Q(α) = Tr(mα) ∈ Q and the norm of α by
NmK/Q(α) = Det(mα) ∈ Q.
11 (We sometimes omit the subscripts if it’s clear what field K we are talking about.) √ √ √ ( + ) = √ ( + ) = 2 − 2 So in Example 1.4.1 we have TrQ( d)/Q a b d 2a and NmQ( d)/Q a b d a db . In Example
1 −3 0
1.4.3 we have Tr (α) = −3, and Nm (α) = 0 −2 −3 = 31. K/Q K/Q 3 0 −2 Proposition 1.7.2. The trace is additive, and the norm is multiplicative; for any α, β in K we have
TrK/Q(α + β) = TrK/Q(α) + TrK/Q(β), NmK/Q(αβ) = NmK/Q(α) NmK/Q(β).
Proof. This follows immediately from the equalities of linear operators
mα+β = mα + mβ,
mαβ = mαmβ, which are just the associativity of addition and multiplication.
Theorem 1.7.3. Let ϕ1, ... , ϕd be the embeddings K → C, and let α ∈ K. Then the characteristic polynomial of mα is given by d ∏(X − ϕi(α)), i=1 so in particular we have
d TrK/Q(α) = ∑ ϕi(α), i=1 d NmK/Q(α) = ∏ ϕi(α). i=1
Proof. We first prove the theorem assuming that K = Q(α). Consider the linear map mα, and let gα be its characteristic polynomial. By vector-space theory, we have
d d−1 d gα(X) = X − TrK/Q(α)X + ··· + (−1) NmK/Q(α).
On the other hand, α must be a root of the characteristic polynomial gα of mα, by the Cayley-Hamilton theorem. Since gα is of degree d and is monic, we must have
gα(X) = fα(X) = ∏(x − αi). i
We can then compare coefficients to conclude. This deals with “almost all” α. To clinch the result in general, choose a primitive element β of the exten- sion K|Q. The result above shows that the matrix of mβ (with respect to any choice of Q-basis of K) is diagonalizable over C, with distinct eigenvalues; so there exists an invertible matrix T over C such that
−1 mβ = TDβT ,
12 where Dβ is the diagonal matrix with entries β1 = ϕ1(β),..., βd = ϕd(β). Now an arbitrary element α of K i can be written in the form ∑i ci β , and exploiting associativity and distributivity again, we get
d−1 i Mα = ∑ ci Mβ i=0 d−1 −1 i = ∑ ci(TDβT ) i=0 d−1 i −1 = T( ∑ ciDβ)T . i=0
d−1 i But since the ϕi are ring homomorphisms, the matrix ∑i=0 ciDβ is diagonal with its j-th diagonal entry being d−1 d−1 i ∑i=0 ci ϕj(β) = ϕj(∑i=0 ci β ) = ϕj(α). The result once again follows by taking trace and determinant.
13 Chapter 2
Algebraic integers
2.1 Motivation and definitions
We now understand the purely field-theoretic structure of number fields pretty thoroughly. But there’s a limit to the interesting things you can say about a field. For instance, Q is a pretty boring ring: there are no nontrivial ideals (only the zero ideal), and every nonzero element divides every other element, so there is no interesting theory of factorisation, etc. On the other hand, the ring Z of integers is a much richer object – we can factor integers into primes, for instance, and this is a genuinely subtle and interesting process. The aim of this chapter is to show that inside the field A of algebraic numbers, there’s a subset of “nice” elements R, with R sitting inside A in the same nice way that Z sits inside Q. Here are some natural things we might ask for:
• R should be a subring of A (the sum and product of algebraic integers should be an integer). • We know what it means for a rational number to be integral, so it should be true that R ∩ Q = Z.
• If α ∈ R, then all the conjugates of α should be in R.
Proposition 2.1.1. Suppose that a subring R ⊂ A exists with these properties. Then for any α ∈ R, the minimal polynomial fα(X) has integer coefficients.
Proof. Let α = α1,..., αd be the conjugates of α. Then we have
d fα(X) = ∏(X − αi) ∈ R[X], i=1 so the coefficients of fα are in R. But they are also in Q, and we’re assuming that R ∩ Q = Z.
Warning: we haven’t yet proven that a ring R satisfying our wishlist actually exists, or that it is unique. But this gives us a strong hint what R should be! Definition 2.1.2. We define the algebraic integers as the subset B ⊂ A given by
{α ∈ C : the minimal polynomial of α over Q lies in Z[X]} .
14 We’ll see shortly that B satisfies our wishlist above; and it’s clear that any other subset R satisfying our wishlist must be contained in B, so B is somehow the “best choice”. First, we give a slightly more useful criterion for identifying elements of B. Proposition 2.1.3. Let α ∈ A be such that g(α) = 0 for some monic polynomial g ∈ Z[X]. Then α ∈ B.
Proof. Recall “Gauss’ Lemma” from Algebra 2, which states that if f , g ∈ Q[X] are monic polynomials with f | g, and g ∈ Z[X], then f ∈ Z[X] as well. We apply this with g as in the statement, and f equal to the minimal polynomial of α. We know that f must divide g, so by Gauss’ Lemma we have f ∈ Z[X]. Lecture √ √ 2 1+ 5 7 Example 2.1.4. Clearly 2 ∈ B, since its minimal polynomial is X − 2. A more subtle example is 2 (the “Golden Ratio”). This has minimal polynomial X2 − X − 1 = 0, so it’s in B, even though it might not look integral at first sight!
We’ll now give a version of Proposition 1.1.9 (and Corollary 1.1.10) for algebraic integers. Proposition 2.1.5. Let α ∈ C. Then α ∈ B iff there is a subring of C containing α which is finitely-generated as an abelian group. Moreover, for any α ∈ C there is a unique smallest subring of C containing α, denoted by Z[α], which is generated as an abelian group by the powers of α; so α ∈ B iff Z[α] is finitely-generated as an abelian group.
Proof. Define Z[α] to be the subgroup of C generated by {1, α, ... } under addition. This is a subring, since it is the image of the ring Z[X] under the evaluation-at-α homomorphism. Moreover, any subring of C containing α must contain Z[α] so it’s the unique smallest such subring. Now, suppose α ∈ B. Let the minimal polynomial of α be f ∈ Z[X]. Take any x ∈ Z[α]; then we have x = g(α) for some polynomial g ∈ Z[X]. By polynomial division, we can write g = a f + b for some polynomials a, b with b of degree < deg( f ); and since f is monic, we have a, b in Z[X]. Thus x = g(α) = b(α) is in the group generated by 1, α,..., αd−1. So Z[α] is finitely-generated as an abelian group, as required. Conversely, suppose α lies in a subring R ⊆ C which is finitely-generated as an abelian group. Then R ⊇ Z[α], so Z[α] is itself finitely-generated. Hence there must be some N such that {1, α, ... , αN−1} is a generating set. So αN is a Z-linear combination of {1, ... , αN−1}, which shows that α is a root of a monic polynomial (of degree N) with coefficients in Z. Hence α ∈ B by Proposition 2.1.3. Remark. Whenever you see two theorems with virtually identical proofs, you should be thinking “Can I formulate a single theorem of which both of these are special cases?”. It is indeed possible to formulate a theorem of which Proposition 1.1.9 and Proposition 2.1.5 are special cases, but you need to use the notion of a module over a commutative ring – this is a concept which you’ll meet if you’re doing Commutative Algebra. Proposition 2.1.6. The set B satisfies our wishlist above.
Proof. It is clear that B ∩ Q = Z, since the minimal polynomial of α ∈ Q is X − α, which is in Z[X] iff α ∈ Z. Moreover, if α ∈ B then the conjugates of α are in B, since they have the same minimal polynomial as α. So let’s show that B is a ring. Let α, β ∈ B. I claim that the abelian group generated by the expressions {αi βj : i, j ≥ 0} is finitely-generated. If α has degree r and β has degree s, then one sees by induction on max(i, j) that any term αi βj can be written as a linear combination of αpβq with 0 ≤ p < r, 0 ≤ q < s, and there are finitely many of these, which proves the claim. But this group is a ring (it’s the image of Z[X, Y] under the map f (X, Y) 7→ f (α, β)) and it contains α and β, so it contains αβ and α ± β. Thus αβ and α ± β are contained in a subring that’s a finitely-generated abelian group; so they’re both in B by the previous proposition.
15 We also have another property of B, which shows that B is “big enough” in some sense. Proposition 2.1.7. Let α ∈ A. Then there is an integer n ≥ 1 such that nα ∈ B.
i Proof. Suppose the minimal polynomial of α is fα(X) = ∑ ciX , with cd = 1 and ci ∈ Q. Let di ≥ 1 be the denominator of ci (as a fraction in lowest terms); and let n be the lowest common multiple of the di. Then d n d−1 2 d−2 d n fα(X/n) = X + ncd−1X + n cd−2X + ··· + n c0 ∈ Z[X] is a monic polynomial satisfied by nα, so nα ∈ B.
α Remark. This certainly implies that any element of A can be written as β with α, β ∈ B; so A is the field of fractions of B, in the sense of Commutative Algebra.
2.2 Rings of integers and integral bases
Definition 2.2.1. If K is a number field, the ring of integers of K, denoted OK, is the ring K ∩ B. √ Proposition 2.2.2 (Integers of quadratic fields). Let d 6= 1 be a square-free integer and K = Q( d). I claim that ( √ Z[ d] if d 6= , √ 1 mod 4 OK = h 1+ d i Z 2 if d = 1 mod 4. √ √ Proof. It is clear that Z[ d] ⊆ OK (for any value of d). Conversely, if α = a + b d ∈ OK, then either b = 0, 2 2 2 in which case α ∈ OK ∩ Q = Z; or the minimal polynomial of α is X − 2aX + (a − db ), so 2a ∈ Z and 4db2 ∈ Z. Since d is square-free, the last condition implies that 2b is also in Z. √ √ √ Z[ d] { 1 d 1+ d } 1 O Hence√α differs by an element of from√ one of the elements 0, 2 , 2 , 2 . Clearly 2 is not in K, and d 1+ d 2 1−d nor is 2 . The minimal polynomial of 2 is X − X + 4 , so it is integral if and only if d = 1 mod 4. Lecture 8 Definition 2.2.3. An integral basis of a number field K is a set of elements b1, ... , bn ∈ OK which are a Z-basis for OK; that is, a set such that every x ∈ OK can be written uniquely in the form n1b1 + ··· + ndbd with ni ∈ Z. √ √ √ 1+ d So {1, d} is an integral basis of Q( d) if d 6= 1 mod 4, and {1, 2 } is an integral basis if d = 1 mod 4. Note that we haven’t shown, yet, that every number field actually has an integral basis! We’ll prove this in the next section. Notice that any integral basis of K must in particular be a basis of K as a Q-vector space (use Proposition 2.1.7 to see that it spans). √ Remark. √Not every basis of K consisting of algebraic integers is an integral basis – for instance, if K = Q( 5), then {1, 5} is a basis of K contained in OK, but not an integral basis of K.
2.3 The trace pairing and the discriminant
In chapter 1 we thought a lot about number fields K | Q as vector spaces over Q. There is some more Q-linear structure on K, which comes from a special Q-bilinear form on K, namely
(α, β) 7−→ TrK/Q(αβ).
We call this pairing the trace pairing. It’s a symmetric bilinear form.
16 Proposition 2.3.1. This pairing is perfect: if α is an element of K, and TrK/Q(αβ) = 0 for all β ∈ K, then α = 0.
−1 −1 Proof. If α 6= 0, then α ∈ K and TrK/Q(αα ) = [K : Q] is non-zero.
We’ll use the trace pairing to study bases of K, and in particular determine which bases are integral bases. Let b1, ... , bd be a basis of K. The matrix of the trace pairing with respect to B is the d × d matrix TB given by
(TB)ij = Tr(bibj).
The determinant of the trace-pairing matrix is rather important, and it has a special name:
Definition 2.3.2. The discriminant of K relative to the basis B, denoted by ∆K(B) or ∆K(b1, ... , bd), is the determinant of the matrix TB.
(Notations vary: Stewart–Tall write ∆[b1,..., bd].)
Remark. One can define ∆K(b1, ... , bd) for any d elements of K as the determinant of the matrix with (i, j)- entry Tr(bibj), whether or not b1, ... , bd is a basis. In fact ∆K(b1, ... , bd) is non-zero if and only if b1, ... , bd is a basis – can you see how to prove this? √ √ Example 2.3.3. Let K = Q( d) for d 6= 1 squarefree. Then B = {1, d} is a basis of K and we have √ Tr(1) Tr( d) 2 0 T = √ = B Tr( d) Tr(d) 0 2d so ∆K(B) = 4d. √ √ √ 2 √ 1+ d 1+ d 1+ d 1+d+2 d 1+d If we use instead B = {1, 2 }, then we have Tr 2 = 1 and Tr 2 = Tr 4 = 2 , so 2 1 TB = 1+d 1 2
so ∆K(B) = d.
Proposition 2.3.4. Let B and C be bases of K, and let S be the change-of-basis matrix (so S = (Sij), where ci = ∑j Sjibj). Then 2 ∆K(C) = Det(S) ∆K(B).
t Proof. By Algebra 1, the matrix of the trace pairing with respect to the basis C is given by TC = S TBS where t t 2 S is the transpose of S. Hence Det TC = Det(S ) Det(TB) Det(S) = Det(S) Det(TB).
Proposition 2.3.5. If the bi are in OK, then ∆(b1,..., bd) ∈ Z.
Proof. This is clear since bibj is an algebraic integer, so TrK/Q(bibj) is in Z. Thus TB is a matrix of integers, so its determinant is an integer.
Fact 2.3.6. In fact one can show that if the bi are in OK then ∆(b1, ... , bd) is always congruent to 0 or 1 modulo 4. See Swinnerton-Dyer’s book for the proof.
Theorem 2.3.7. Let B = {b1, ... , bd} be a basis of K contained in OK, and such that |∆K(B)| is as small as possible among bases of K contained in OK. Then B is an integral basis of K. In particular, every number field admits integral bases.
17 Proof. For the first part, it suffices to prove the following: if B = {b1, ... , bd} is any basis of K contained in 0 OK, and there exists an element α ∈ OK that is not in the Z-span of B, then we can find a new basis B ⊂ OK 0 such that |∆K(B )| < |∆K(B)|.
Let H be the abelian group generated by the bi, and let G be the larger abelian group generated by the bi and α. Note that both G and H are subgroups of OK, and by assumption G is strictly bigger than H. It is clear that G is finitely-generated, and G has no nonzero elements of finite order (because it’s a subgroup of C). By the classification of finitely-generated abelian groups, we must have G =∼ Zr for some r. We must have r ≥ d, because G contains H which is itself isomorphic to Zd; on the other hand, we must have r ≤ d, since any d + 1 elements of G are linearly dependent over Q and hence linearly dependent over Z. ∼ d Thus G = Z , so we can pick a set of elements C = {c1, ... , cd} which are a basis of G as an abelian group, and the ci are also a basis for K as a Q-vector space.
If we let S be the matrix whose columns are the coefficients of the bi in the basis ci, then we have Det(S) = 2 [G : H] > 1 (by the Smith normal form theorem from Algebra 1). Hence ∆K(B) = Det(S) ∆K(C), and so |∆K(C)| < |∆K(B)|, as required. Now, we show existence. Any number field has some basis B as a Q-vector space, and by Proposition 2.1.7 we may scale B so it is contained in OK. Hence the set
{|∆K(B)| : B ⊂ OK basis of K}
is a non-empty set of positive integers and hence has a smallest element. By the first part, this implies that K has integral bases. Lecture 9 Corollary 2.3.8. Let B ⊂ OK be a basis of K. If ∆K(B) is a square-free integer, then B is an integral basis.
Proof. Let C be an integral basis of K and let S be the change-of-basis matrix. Then we have ∆K(B) = 2 Det(S) ∆K(C), but ∆K(B) is squarefree, so we must have Det(S) = ±1. Thus ∆K(B) = ∆K(C), so B is itself an integral basis. Example 2.3.9. Let Q(θ) be the cubic field from Example 1.4.3, where θ3 + θ + 1 = 0. We compute that
Tr(1) = 3, Tr(θ) = 0, Tr(θ2) = −2, Tr(θ3) = Tr(−1 − θ) = −3, Tr(θ4) = Tr(−θ − θ2) = 2.
Hence the discriminant of K in the basis {1, θ, θ2} is given by
3 0 −2
0 −2 −3 = −31.
−2 −3 2
Since −31 is squarefree, it follows that OK = Z[θ]. Note that Corollary 2.3.8 is not an “if and only if” criterion!
Example 2.3.10. Recall that if K = Q(i), then we know that {1, i} is an integral basis. However, ∆K(1, i) = −4, which is certainly not square-free.
Definition 2.3.11. We define the discriminant of K, denoted ∆K, to be the discriminant ∆K(B), where B is an integral basis.
18 Notice that any two integral bases have the same discriminant, because if B and C are integral bases, the 2 basis-change matrix S in Proposition 2.3.4 has determinant ±1, so ∆K(C) = Det(S) ∆K(B) = ∆K(B). Thus ∆K is well-defined. By Theorem 2.3.7, |∆K| is the smallest value of |∆K(B)| as B varies over bases of K contained in OK.
2.4 Interlude: formulae for discriminants
There are lots of rather pretty formulae for discriminants. The first one is elegant, but not particularly useful in practice:
Proposition 2.4.1. Let ϕ1,..., ϕd be the embeddings of K into C. Then