3 Zeta and L-functions
In this section we will use analytic methods to (i) develop a formula for class numbers, and (ii) use this to prove Dirichlet’s theorem in arithmetic progressions: that any arithmetic progression: a + m, a +2m, a +3m, . . . contains infinitely many primes gcd(a, m)=1. This chapter follows [Cohn], though our presentation is reversed from his, together with some supplementary material taken from various other sources. More general treatments are found in [Marcus] and [Neukirch], though they do not do everything we will do here.
3.1 Zeta functions Recall one defines the Riemann zeta function by 1 ⇣(s)= . (3.1) ns X One knows from calculus that this converges for s>1 (compare with
1 x1 s 1 1 1 dx = = < .) xs 1 s 1 s 1 Z1 x=1 Euler observed that (for s>1) one also has the product expansion
1 1 1 1 ⇣(s)= = 1+ + + = . ns ps p2s ··· 1 p s p X Y ✓ ◆ Y Here p runs over all primes of N. The last equality just follows from the formula for a geometric n 1 series: 1 a = if a < 1. To see the why product expansion (middle equality) is valid, it’s n=0 1 a | | perhaps easiest to first notice that it is formally true for s =1, where it says P ⇤ 1 1 1 1 1 1 1 1 1 1 = 1+ + + + 1+ + + + 1+ + + + (3.2) n 2 22 23 ··· 3 32 33 ··· 5 52 53 ··· ··· X ✓ ◆✓ ◆✓ ◆ What does this (formal) infinite product on the right mean? It just means a (formal) limit of the sequence of finite subproducts: 1 1 1 1 1+ + + + = 2 22 23 ··· n n N2 X2
1 1 1 1 1 1 1 1 1 1 1 1 1 1+ + + + 1+ + + + =1+ + + + + + + + 2 22 23 ··· 3 32 33 ··· 2 3 2 3 22 32 22 3 2 32 ··· ✓ ◆✓ ◆ · · · 1 = n n N2,3 2X
⇤When s =1, neither side of the equality actually converges, but the explanation for why both sides should be equal is perhaps more transparent. Here “formally” is not to be confused with rigorously—we mean we can formally manipulate one side to get to the other.
39 1 1 1 1 1 1 1 1 1 1 1+ + + + 1+ + + + 1+ + + + = 2 22 23 ··· 3 32 33 ··· 5 52 53 ··· n ✓ ◆✓ ◆✓ ◆ n N2,3,5 2X . .
e1 e2 ek where Np1,p2,...,pk = p p p : ei N 0 , i.e., Np1,p2,...,pk is the set of natural numbers 1 2 ··· k 2 [{ } which only contain the primes p ,...p in their prime decomposition. 1 k We now prove rigorously that the formal product expansion for ⇣(s) given above is valid for s>1.
Definition 3.1.1. Let p denote the set of primes of N.Letap C for each p. We define { } 2
ap =lim ap. n p !1 n Hence we will say ap converges (diverges) if the limit on the right does. We say ap con- verges absolutely if Q Q lim ap n i !1 i Example 3.1.2. If some ap =0,thenaftersomepoint(nomatterhowthep’s are ordered), we will have a finite subproduct of ap =0.Thus ap will converge to 0 absolutely. Note that if every ap > 0, thenQ log( ap)=Q log ap. An immediate consequence is that ap converges (absolutely) if and only if the series log ap converges (absolutely). Q P Q Proposition 3.1.3. Let (an)n1=1 be a totally multiplicativeP sequence of complex numbers, i.e., amn = 1 aman for any m, n N,andassumea1 =1.If an converges absolutely, then so does 2 p 1 ap and 1 P 1 Q a = , n 1 a n=1 p p X Y where the product is taken over all primes p of N. Proof. Suppose an converges absolutely. Let ✏>0. Then for some N N we can say 2 P a <✏. | n| n>NX Let p1,p2,... be any ordering of the set of primes of N. Then there is some K N such that { } 2 p ,...,p contains all N. Observe { 1 K } K K 1 2 = 1+ap + ap + = an. 1 api ··· i=1 i=1 n Np1,...,pK Y Y 2 X 40 Since Np1,...,pK contains 1,...,N,wehave K 1 1 an an an <✏. 1 api | | n=1 i=1 n>N n>N X Y X X Corollary 3.1.4. For any s>1 the Euler product expansion 1 ⇣(s)= s (3.3) 1 p Y is valid. s Proof. Apply the proposition with an = n . The Euler product expansion demonstrates that the zeta function captures information about primes. In fact, it contains a surprising amount of information about primes. The simplest appli- cation of the zeta function to the study of primes is Euler’s proof of the infinitude of primes. Theorem 3.1.5. There are infinitely many primes. Proof. Assume there are finitely many primes, p1,...,pk. Then 1 1 1 1 1 1 ⇣(s)= s s s < 1 p · 1 p ···1 p ! 1 1/p1 · 1 1/p2 ···1 1/pk 1 1 2 k as s 1. On the other hand ! 1 1 ⇣(s)= = ns ! n 1 as s 1. Contradiction. X X ! Exercise 3.1. For any integer k>1,onecanshow1/⇣(k) represents the probability that k “ran- domly chosen” integers are coprime (have gcd 1). Let f(x)=x on [ ⇡,⇡),computetheFourier coefficients and apply Parseval’s identity. Use this to compute ⇣(2),andhencedeterminetheproba- bility that 2 randomly chosen integers are coprime. (Alternatively, you can try to derive the product expansion sin x 1 x 2 = 1 , x n⇡ n=1 Y ✓ ⇣ ⌘ ◆ and look at the x2 coefficient to find ⇣(2).) We will briefly discuss some deeper connections of ⇣(s) to the study of primes, but first let us give a generalization of the Riemann zeta function. Definition 3.1.6. Let K be a number field. The Dedekind zeta function for K is 1 ⇣ (s)= , K N(a)s a X for s>1 where a runs over all (nonzero) ideals of . OK 41 As before, one can show this series indeed converges for all s>1, and we have an Euler product expansion 1 ⇣ (s)= K 1 N(p) s p Y valid for s>1 as above. Hence the Dedekind zeta function can be used to study the prime ideals of of K. We remark that another way to write the above definition is a ⇣ (s)= n K ns X where an denotes the number of ideals of K with norm n (convince yourself of this). Consequently, the Dedekind zeta function can be used to study the number of ideals of norm n. However, we will be interested in it for its applications to the class number hK of K. 3.2 Interlude: Riemann’s crazy ideas Riemann published a single paper in number theory, On the Number of Primes Less Than a Given Magnitude in 1859, which was 8 pages long, contained no formal proofs, and essentially gave birth to all of analytic number theory. We will summarize the main ideas here. We only defined the Riemann zeta function for real s>1, but in fact Riemann considered it for complex values of s. In general if a>0 and z C, then one defines 2 az = ez ln a where zn ez = . n! This allows one formally to make sense of the definitionX 1 ⇣(s)= ns X for s C, and one can show the sum actually converges provided Re(s) > 1. Riemann showed that 2 ⇣(s) can be extended (uniquely) to a differentiable function on all of C except at s =1, where ⇣(s) has a pole (must be ). However the above series expression is only valid for Re(s) > 1. 1 1 Riemann showed that ⇣(s) has a certain symmetry around the line Re(s)= 2 , namely one has the functional equation ⇣(1 s)= ⇤(s)⇣(s) where ⇤(s) is a function closely related to the function. The functional equation says one can compute ⇣(1 s) in terms of ⇣(s), so we can indirectly use the series for ⇣(s) to compute ⇣(s) when Re(s) < 0. The region 0 < Re(s) < 1 is called the critical strip, and the central line of symmetry 1 Re(s)= 2 is called the critical line. Let ⇢ denote the set of zeroes of ⇣(s) inside of the critical strip. There are countably (infinitely) { } many, and let us order them by their absolute value. Let 1 1 f(x)=⇡(x)+ ⇡(x1/2)+ ⇡(x1/3)+ 2 3 ··· 42 where ⇡(x) is the number of primes less than x. Riemann discovered the following formula for f(x) dt f(x) = Li(x) Li(x⇢) log(2) + 1 t(t2 1) ln t ⇢ x X Z x dt where Li(x)= 0 ln t . Hence this formula relates ⇡(x) with the (values of Li at the) zeroes of ⇣(s). In fact, using Möbius inversion, one can rewrite ⇡(x) in terms of f(x) (and therefore the zeroes of R ⇣(s))as 1 1 ⇡(x)=f(x) f(x1/2) (x1/3) 2 3 ··· Essentially this says the following: if we know exactly where all the zeroes ⇢ of ⇣(s) are we know exactly where the primes are (these are the places on the real line where ⇡(x) jumps). Here is where Riemann made his famous conjecture, the Riemann hypothesis, that all the zeroes of ⇣(s) lying in the critical strip actually lie on the critical line. (It is easy to see from series expansion that ⇣(s) =0for Re(s) 1. Then by the functional equation, ⇣(1 s)=0for Re(s) > 1 6 if and only if ⇤(s)=0, which happens precisely for s a positive odd integer. Thus the only zeroes of ⇣(s) outside of the critical strip, are the so-called trivial zeroes occurring when s = 2k, k N.) 2 In 1896, Hadamard and de la Vallée Poussin used Riemann’s ideas to prove the prime number theorem,that x ⇡(x) Li(x) ⇠ ⇠ ln x as x . (This notation means ⇡(x) is approximately x for x large.) This is important, for !1 ln x example, in cryptography where one wants to know that the primes don’t get too thinly spread out, so that large primes provide suitably secure keys for RSA. The Riemann hypothesis is equivalent to the “best possible bound” for the error term in the prime number theorem, precisely that 1 ⇡(x) Li(x) < px ln(x) | | 8⇡ for x 2657. The Riemann hypothesis has natural generalizations to Dedekind zeta functions and L-functions (see below). Due to a host of applications, the generalized Riemann hypothesis is considered one of the most important open problems in mathematics. 3.3 Dirichlet L-functions Let m N. In 1837 (before Riemann!)⇤, Dirichlet introduced L-functions as a generalization of 2 the Riemann zeta function in order to study the primes mod m. In particular, Dirichlet used these L-functions to show that there are infinitely many primes a mod m, which will be one of the ⌘ main results of this chapter. This result had been conjectured by Euler for a =1and by Legendre in general. Let’s start with the example of p 1 mod 4. (Last semester we were able to use a trick together ⌘ with the first supplemental law of quadratic reciprocity to show there are infinitely many primes p 1 mod 4, and the case of p 3 mod 4 was an exercise using a different trick. Legendre tried ⌘ ⌘ to use quadratic reciprocity to treat the general case, but was unsuccessful, and as far as I know, there is no proof of the general case which does not use Dirichlet L-functions.) ⇤Riemann’s zeta function was studied before Riemann as a function of natural numbers by Euler. 43 Knowing Euler’s proof of the infinitude of primes, one might be tempted to try to define a series by 1 . 1 p s p 1 mod 4 ⌘ Y This expands out as a sum 1 ns n N1 X2 where N1 is the set of all natural numbers which only contain primes 1 mod 4 in their prime ⌘ factorization. If the number of such primes is finite then the product expansion converges as s 1, and one would like to show the series expansion diverges to obtain a contradiction. However ! summing over N1 is not a natural thing to do and there is no simple way to directly analyze it. Hence we will have to be a little more subtle than this. Dirichlet, being smarter than this, had the following idea using characters. Let’s first recall a couple things about characters of finite abelian groups. Definition 3.3.1. Let G be a finite abelian group. A character (or 1-dimensional representa- ˆ tion)ofG is a group homomorphism into C⇥.ThesetofcharactersofG is denoted by G,andis called the dual of G. Exercise 3.2. Let G be a finite abelian group. Let , Gˆ. 2 (i) Show , defined by ( )(g)= (g) (g),isalsoinGˆ. (ii) Show , defined by (g)= (g),isalsoinGˆ.(Herethebardenotescomplexconjugation.) (iii) For any g G,show (g) is a (not necessarily primitive) n-th root of unity where n is the 2 † order of g in G. (iv) Deduce that = ,where denotes the trivial character,i.e., (g)=1for all g G. 0 0 0 2 (v) Conclude that Gˆ is an abelian group. Proposition 3.3.2. Let G be finite abelian group. Then Gˆ G. ' Proof. Let us first prove the proposition in the case G = Cn (the cyclic group of order n). Let ↵ be a generator of G. By (iii) of the exercise above, if Gˆ, then (↵) must be an n-th root of unity 2 ⇣. Furthermore any n-th root of unity ⇣ defines (uniquely) a character on G by setting (↵k)=⇣k. (Observe this is character, and that nothing else can be.) In other words, a character is determined by what it does to a generator of G. 2⇡i/n Let ⇣n denote a primitive n-th root of unity (take ⇣n = e if you wish). There are nn-th 0 2 n 1 roots of unity, given by ⇣n =1,⇣n,⇣n,...,⇣n . Hence there are precisely n distinct character of i i+j G, 0, 1, 2,..., n 1 given by i(↵)=⇣n. It is obvious then that ( i j)↵ = ⇣n , i.e., i j = ˆ i+j mod n. Hence G Z/nZ Cn. ' ' Now that we have prove the proposition in the case where G is cyclic, let us assume G is an arbitrary finite abelian group. Then we know by the classification of such groups, G Cri . Set ' ni 2⇡i/ni ⇣ni = e and let ↵i G be a generator for Cni . As above, we can define characters ij on Cni j 2 Q by ij(↵i)=⇣ni . We can extend each ij to a character on G by setting ij(↵k)=1whenever i = k and using multiplicativity. In other words, view and g G as 6 2 e1 e2 et g =(↵1 ,↵2 ,...,↵t ) n d †⇣ C is a primitive n-th root of unity if ⇣ =1but ⇣ =1for any d n. For example there are four 4-th roots of 2 6 | unity: 1, i,butonly i are primitive. ± ± ± 44 and set jei ij(g)=⇣ni . Hence for each i, we get disjoint subgroups of Gˆ each isomorphic to Cˆ C .Itisstraightforward ni ' ni to check that any character of G is a product of some ij’s, i.e., we have Gˆ Cˆ C G. ' ni ' ni ' Y Y We will be interested in the case where G =(Z/mZ)⇥. ˆ Example 3.3.3. Suppose G =(Z/2Z)⇥.ThenG = 1 = C1 so G = 0 .Inotherwords,the { } { } only character of G is the trivial one 0 which sends 1 to 1. Example 3.3.4. Suppose G =(Z/3Z)⇥.ThenG = 1, 2 C2 (here 1 and 2 represent the { }' corresponding congruence classes in Z/3Z). Since 2 generates G and has order 2,therearetwo possibilities for characters: :1 1, 2 1 0 7! 7! and :1 1, 2 1 1 7! 7! So Gˆ = , C . { 0 1}' 2 Note that G =(Z/4Z)⇥ C2 also, so this case is essentially the same as (Z/3Z)⇥. ' Example 3.3.5. Suppose G =(Z/5Z)⇥.ThenG = 1, 2, 3, 4 C4.Here2 generates G,soa { }' character of G is determined by what 4-th root of unity 2 maps to. Explicitly, we have 4 characters, whose values are read offof the following character table: 12 3 4 0 11 1 1 1 1 11 1 1 i i 1 2 1 ii 1 3 Looking at this table, it is easy to see Gˆ is cyclic of order four, generated either by 2 or 3. Exercise 3.3. Determine all characters of (Z/15Z)⇥ and (Z/16Z)⇥.(Writethemdownincharacter tables like our (Z/5Z)⇥ case. Feel free to order the columns and rows however you find easiest.) Theorem 3.3.6. Let CG = f : G C denote the space of complex valued functions from a finite { ! } abelian group G into C (the group algebra of G). This is a G -dimensional vector space over C. | | The characters Gˆ form a C-basis for CG. 2 We omit the proof, and in fact we do not need this precise result, but it is helpful for motivation. What this means for us is the following. We want to study the primes p in a congruence class mod m.Aslongasp - m,wehavep a mod m for some a (Z/mZ)⇥. Let denote the the set of ⌘ 2 { } characters of G =(Z/mZ)⇥ and fix a (Z/mZ)⇥. Consider the function f CG given by 2 2 f :(Z/mZ)⇥ C ! 45 1 x = a f(x)= (0 else. In other words, f tells me if p mod m is a or not. What the above says is that f(x)= c (x) X for some c C. Hence knowing (p mod m) for all tells us whether p a mod m or not. Slightly 2 ⌘ more generally, the above theorem says knowing (p mod m) for all and p - m is the same as knowing p mod m for all p - m. Put another way, the characters of (Z/mZ)⇥ distinguish all the (invertible) congruence classes mod m. This suggest it is possible to study the primes a mod m ⌘ using the characters of (Z/mZ)⇥. To somehow connect this with something like the zeta function, one actually wants to think of the characters of (Z/mZ)⇥ as “characters” of Z. Definition 3.3.7. Let be a character of (Z/mZ)⇥.Weextend to a function