<<

MATH 324 Summer 2006 Elementary

Notes on the Riemann Zeta

Department of Mathematical and Statistical Sciences University of Alberta

Riemann Zeta Function

In this note we give an introduction to the , which connects the ideas of with the arithmetic of the .

Define a function of a real variable s as follows,

∞ 1 ζ(s) = . ns n=1 X This function is called the Riemann zeta function, after the German mathematician Bernhard Riemann who systematically studied the deeper properties of this function beginning in 1859.

The infinite for ζ(s) was actually first introduced by Euler nearly 100 years before Riemann’s work.

Note that from the test, the series for ζ(s) converges for s > 1, and therefore the function ζ(s) is defined for all real numbers s > 1. Also, since

1 n+1 dx , ns ≥ xs Zn for all n 1, we have ≥ ∞ 1 ∞ dx 1 = ns ≥ xs s 1 n=1 1 X Z − for all s > 1. Now we let s 1+, and we see that → lim ζ(s) = . s 1+ ∞ → Infinite Products

In order to discuss the connection of ζ(s) with the primes, we need the concept of an infinite product, which is very similar to the notion of an infinite sum.

Definition. Given a of real numbers an n 1, let { } ≥ p1 = a1, p = a a , 2 1 · 2 p = a a a , 3 1 · 2 · 3 . . n p = a a a = a . n 1 · 2 · · · n k kY=1 th The ordered pair of an , pn is called an infinite product. The pn is called the n {{ } {th }} partial product and an is called the n factor of the product, and the following symbols are used to denote the infinite product defined above:

∞ a a a or a . 1 · 2 · · · n · · · n n=1 Y

Using the the analogy with infinite series, where we say that the series converges if and only if the sequence of partial sums converges, it is tempting to say that the infinite product converges if and only if the sequence of partial products converges. However, if we do this, then every product which has one factor equal to zero would converge, regardless of the behavior of the remaining factors. The definition below is more useful:

n ∞ th Definition. Given an infinite product an, let pn = ak be the n partial product. n=1 Y kY=1

(a) If infinitely many factors an are zero, then we say the product diverges to zero.

(b) If no factor a is zero, then we say that the product converges if and only if there exists a p = 0, such n 6 ∞ that p p as n . In this case, p is called the value of the product, and we write p = a . n → → ∞ n n=1 If p 0 as n , then we say the product diverges to zero. Y n → → ∞

∞ (c) If there exists an N such that n > N implies that a = 0, then we say that a converges, n 6 n n=1 Y ∞ provided that an converges as described in part (b). In this case, the value of the product is n=YN+1 ∞ a a a a . 1 · 2 · · · N · n n=YN+1

∞ (d) an is said to be divergent if it does not converge as described in parts (b) and (c) above. n=1 Y Note: The value of a convergent infinite product can be zero, but this is the case if and only if a finite number of the factors are zero.

Also, the convergence of an infinite product is not affected by inserting or removing a finite number of factors, zero or not.

Now we give a criterion for the convergence of infinite products, completely analogous to the corresponding criterion for convergence of infinite series.

Theorem (Cauchy Criteria for Convergence of an Infinite Product).

∞ The infinite product an converges if and only if given any  > 0, there exists an integer n0 such that n=1 Y m ak 1 <  − k=n+1 Y whenever m > n n0. ≥

∞ Proof. Assume first that the product an converges, we may also assume that no an is zero (discarding n=1 a finite number of terms if necessary). LetY

pn = a1 a2 an and p = lim pn, n · · · · →∞ so that p = 0, and there exists an M > 0 such that p > M for all n 1. Now, since p satisfies the 6 | n| ≥ { n} Cauchy criteria for sequences, given an  > 0, there exists an integer n0 such that

p p < M | m − n| m whenever m > n n0. Dividing by pn we get ak 1 <  whenever m > n n0. ≥ | | − ≥ k=n+1 Y

Conversly, suppose that given any  > 0, there exists an integer n0 such that

m ak 1 <  ( ) − ∗ k=n+1 Y whenever m > n n0. ≥

First note that if m > n0 this implies that am = 0, since (assuming that 0 <  < 1), taking n = m 1, we have 6 − a 1 a 1 < , | m| − ≤ | m − | that is,

0 < 1  < a < 1 +  − | m| for m n . ≥ 0 1 Now take  = 2 , and let q = a a a m n0+1 n0+2 · · · m for m > n0, then 1 < q < 3 , 2 | m| 2 for all m > n . Therefore, if q converges, it cannot converge to zero. 0 { m}

1 To see that the sequence qn does actually converge, let 0 <  < 2 be arbitrary, then there exists an n0 such that { } q m 1 < 2  q − 3 n

whenever m > n n0, that is, ≥ q q < 2  q < 3 2  =  | m − n| 3 | n| 2 · 3 whenever m > n n . Thus, the sequence q is a of real numbers, and hence converges, ≥ 0 { n} ∞ and so the infinite product an converges. n=1 Y

∞ Note: Taking n = m 1 in ( ), we see that if the infinite product a converges, then − ∗ n n=1 Y lim an = 1. n →∞

Because of this, the factors of a product are often written as 1 + an, so that convergence of a product ∞ (1 + an) implies that lim an = 0. n n=1 →∞ Y

∞ Theorem. If a > 0 for all n 1, then the infinite product (1 + a ) converges if and only if the infinite n ≥ n n=1 Y ∞ series an converges. n=1 P Proof. Let the partial sums and partial products be given by

s = a + a + + a and p = (1 + a )(1 + a ) (1 + a ) n 1 2 · · · n n 1 2 · · · n for n 1. Since a > 0 for all n 1, then both of the sequences s and p are monotone increasing, ≥ n ≥ { n} { n} and to prove the theorem, we only have to show that sn is bounded above if and only if pn is bounded above. { } { }

First we note that s < p for all n 1. Now using the fact that 1 + x ex for all real numbers x, we have n n ≥ ≤ p = (1 + a )(1 + a ) (1 + a ) ea1 ea2 ean = esn n 1 2 · · · n ≤ · · · for all n 1. Therefore the sequence pn is bounded above if and only if the sequence sn is bounded above. ≥ { } { }

Also, note that the sequence of partial products pn cannot converge to zero, since pn 1 for n 1. Finally, note that { } ≥ ≥ p + if and only if s + . n → ∞ n → ∞ ∞ Definition. The infinite product (1 + an) is said to converge absolutely if and only if the infinite product n=1 Y ∞ (1 + a ) converges. | n| n=1 Y

∞ Theorem. If the infinite product (1 + an) converges absolutely, then it converges. n=1 Y Proof. Use the Cauchy criterion together with the inequality

(1 + a )(1 + a ) (1 + a ) 1 (1 + a )(1 + a ) (1 + a ) 1. | n+1 n+2 · · · m − | ≤ | n+1| | n+2| · · · | m| −

∞ Note: For positive terms a > 0 for all n 1, we know that (1 + a ) converges if and only if ∞ a n ≥ n n n=1 n=1 Y P ∞ ∞ converges, so that (1 + an) converges absolutely if and only if an converges absolutely. However, if n=1 n=1 the terms are not pYositive, we have the following examples to show Pthe results need not be valid.

( 1)n ∞ Exercise 1. Let a = − for n 1, show that (1 + a ) diverges, but ∞ a converges. n √n ≥ n n n=1 n=1 Y P Exercise 2. Let 1 1 1 a2n 1 = − and a2n = + − √n √n n

∞ ∞ for n = 1, 2, . . . , show that (1 + an) converges, but an diverges. n=1 n=1 Y P

We do have a theorem analogous to the positive term result:

∞ Theorem. If a 0 for all n 1, then the infinite product (1 a ) converges if and only if the infinite n ≥ ≥ − n n=1 Y ∞ series an converges. n=1 P

∞ Proof. Note that convergence of ∞ a implies the of (1 a ), and hence the n − n n=1 n=1 Y ∞ P convergence of (1 a ). − n n=1 Y ∞ Conversely, suppose that ∞ a diverges, if a does not converge to zero, then (1 a ) also diverges. n { n} − n n=1 n=1 Y Therefore, we may assumePthat an 0 as n , and by discarding finitely many terms if necessary, we may assume that a 1 for all n →1. But then,→ ∞1 a 1 for all n 1, and so a = 0 for n 1. n ≤ 2 ≥ − n ≥ 2 ≥ n 6 ≥ Let p = (1 a )(1 a ) (1 a ) and q = (1 + a )(1 + a ) (1 + a ) n − 1 − 2 · · · − n n 1 2 · · · n for all n 1, then since ≥ (1 a )(1 + a ) = 1 a2 1 − k k − k ≤ 1 we have pn for all n 1. ≤ qn ≥

∞ Now since ∞ a diverges, then (1 + a ) diverges also, and q + as n . Therefore, p 0 as n n n → ∞ → ∞ n → n=1 n=1 Y P ∞ n and so by part (b) of the definition of convergence, it follows that (1 a ) diverges to 0. → ∞ − n n=1 Y

Euler’s Product Formula for the Riemann Zeta Function

We now have enough information to prove Euler’s theorem and make the connection between infinite products and prime numbers.

Theorem (Euler’s Product Formula)

th Let pk denote the k prime number, if s > 1, then

∞ 1 ∞ 1 ζ s , ( ) = s = s n p− n=1 1 k X kY=1 − and the infinite product converges absolutely.

Proof. For m 1, let ≥ m 1 Pm = s 1 pk− kY=1 − be the mth partial product of the infinite product. If we write each factor as a convergent geomteric series, we have m 1 1 Pm = 1 + s + 2s + , pk pk · · · kY=1   which is a product of a finite number of absolutely convergent infinite series, and multiplying these series to- gether and rearranging the terms according to increasing denominators, we get another absolutely convergent infinite series, and a typical term looks like 1 1 a1s a2s ams = s p p pm n 1 2 · · · where n = pa1 pa2 pam , and a 0 for i = 1, 2, . . . , m. Therefore, 1 2 · · · m i ≥ 1 P = : all prime factors of n are p m ns ≤ m X   and by the Fundamental Theorem of Arithmetic, each such n occurs exactly once in the summation. Therefore, 1 ζ(s) P = : at least one prime factor of n is > p − m ns m X   and since these prime factors occur among the integers n > pm, we have 1 ζ(s) P . | − m| ≤ ns n>pm X 1 m , , ∞ As the sum on the right tends to 0 since s converges, and therefore → ∞ n=1 n P lim Pm = ζ(s). m →∞

∞ Finally, the product has the form (1 + an) where n=1 Y 1 1 a , n = s + 2s + pn pn · · ·

∞ ∞ 1 and the series a converges absolutely since it is dominated by , and therefore the infinite product n ns n=1 n=1 X X ∞ (1 + an) also converges absolutely. n=1 Y

Evaluating the Zeta Function

In 1734, Euler found that ∞ 1 π2 ζ(2) = = , n2 6 n=1 X and he later found closed formulas for

∞ 1 ζ(4) = n4 n=1 X ∞ 1 ζ(6) = n6 n=1 X ∞ 1 ζ(8) = n8 n=1 X . .

∞ 1 but he failed to find a closed formula for ζ(3) = . In fact, it was not even known if ζ(3) was irrational n3 n=1 or not until 1979, when R. Apery proved that ζX(3) is irrational. It is still unknown whether or not ζ(3) is transcendental. π2 An elementary proof that ζ(2) = uses the theory of to sum the series. There is another 6 elementary proof given below that uses only the binomial theorem and some elementary facts about the roots of polynomials with integer coefficients.

Theorem. ∞ 1 π2 ζ(2) = = . n2 6 n=1 X

π 1 Proof. Note that for 0 < x < , we have 0 < sin x < x < tan x, so that 0 < cot x < < csc x, and 2 x therefore 1 cot2 x < < csc2 x (1) x2 π for 0 < x < . 2

kπ π Now let m N, then for k = 1, . . . , m we have 0 < < , so that ∈ 2m + 1 2 m kπ m (2m + 1)2 m kπ cot2 < < csc2 2m + 1 k2π2 2m + 1 Xk=1 kX=1 Xk=1 for all m 1; therefore, ≥ m m m π2 kπ 1 π2 kπ cot2 < < csc2 (2) (2m + 1)2 · 2m + 1 k2 (2m + 1)2 · 2m + 1 Xk=1 kX=1 Xk=1 for all m 1. ≥ m m 2 kπ 2 kπ Now we find explicit formulas for cot 2m+1 and csc 2m+1 . From DeMoivre’s formula and the bino- k=1 k=1 mial theorem, we have, for x R andP n N, P ∈ ∈ n cos nx + i sin nx = cos x + i sin x n n n j j = cos x − i sin x j j=0 X   n   n−1 [ 2 ] [ 2 ] n n 2j 2j 2j n n 2j 1 2j+1 2j+1 = cos − x (i) sin x + cos − − x (i) sin x 2j 2j + 1 j=0 j=0 X   X   n n−1 [ 2 ] [ 2 ] j n n 2j 2j j n n 2j 1 2j+1 = 1 cos − x sin x + i 1 cos − − x sin x, − 2j − 2j + 1 j=0   j=0   X  X  and equating real and imaginary parts in this expression, we have

n [ 2 ] j n n 2j 2j cos nx = 1 cos − x sin x (3) − 2j j=0   X  and

n−1 [ 2 ] j n n 2j 1 2j+1 sin nx = 1 cos − − x sin x. (4) − 2j + 1 j=0   X  Now, if sin x = 0, and m N, then taking n = 2m + 1 in (4), we have 6 ∈ m j 2m + 1 2m 2j 2j+1 sin(2m + 1)x = 1 cos − x sin x − 2j + 1 j=0 X    m j 2m + 1 m j j m = sin2m+1 x 1 cos2 x − sin2 x − · − 2j + 1 j=0 X   m    j 2m + 1 m j = sin2m+1 x 1 cot2 x − , · − 2j + 1 j=0   X   that is,

m j 2m + 1 m j sin(2m + 1)x = sin2m+1 x 1 cot2 x − (5) · − 2j + 1 j=0   X   for all m N, provided that sin x = 0. ∈ 6

kπ Taking x = 2m+1 , for k = 1, 2, . . . , m; then

sin kπ = 0 but sin (2m + 1) kπ = sin kπ = 0 2m+1 6 2m+1 for k = 1, 2, . . . , m, and (5) yields 

m j 2m + 1 m j 1 cot2 kπ − = 0 (6) − 2j + 1 2m+1 j=0   X   for k = 1, 2, . . . , m. Therefore, the m roots of the equation

m j 2m + 1 m j 1 t − = 0 − 2j + 1 j=0   X  2 kπ are t = cot 2m+1 , k = 1, 2, . . . , m.

m Now, if p(t) = a0 + a1t + + amt is a polynomial in t of degree m, and if λ1, λ2, . . . , λm are the roots of p, then · · ·

p(t) = a (t λ )(t λ ) (t λ ) m − 1 − 2 · · · − m m m 1 = a t a (λ + + λ )t − + m − m 1 · · · m · · · m m 1 j 2m+1 m j m 1 so the coefficient of t − is am(λ1 + +λm). Now, for p(t) = ( 1) 2j+1 t − , the coefficient of t − − · · · j=0 − is 2m+1 and a = 2m+1 , and therefore from (6) we have P  − 3 m 1   m kπ 2m+1 m(2m 1) cot2 = 3 = − (7) 2m + 1 2m+1 3 k=1 1  X  for all m 1. and since csc2 x = 1 + cot2 x, then ≥ m kπ 2m(m + 1) csc2 = (8) 2m + 1 3 kX=1 for all m 1. ≥ Combining (2), (7) and (8), we have

m 2m(2m 1) π2 1 2m(2m + 2) π2 − < < , (9) (2m + 1)2 · 6 k2 (2m + 1)2 · 6 Xk=1 for all m 1, and letting m , we have ≥ → ∞ ∞ 1 π2 = . k2 6 Xk=1 as required.

Example. The usual Fourier series evaluation of ζ(2) looks at the function

f(x) = x2, 0 < x < π,

and then finds the Fourier cosine series of f, the even 2π-periodic extension of f, to be

b a0 ∞ + a cos nx, 2 n n=1 X where 2 π a = f(x) cos nx dx n π Z0 for n = 0, 1, 2, . . . .

For n = 0, we have 2 π 2 x3 π 2 π3 2π2 a = x2 dx = = = . 0 π π · 3 π · 3 3 Z0 0

For n 1, we have ≥ 2 π 2 x2 sin nx π 2 π 4 π a = x2 cos nx dx = x sin nx dx = x sin nx dx n π π n − n −nπ Z0  0 Z0  Z0 π π π 4 x cos nx 1 4 n 4 = + cos nx dx = ( 1) sin nx −nπ − n n n2 − − n3π  0 Z0  0 4( 1)n = − . n2

Therefore, the Fourier cosine series for f is given by

b π2 ∞ ( 1)n f(x) + 4 − cos nx ∼ 3 n2 n=1 X b for all x R. ∈ From Dirichlet’s theorem, since the even 2π-periodic extension of f is continuous at each x R, then the Fourier cosine series converges to f(x) for all x R, and we can write ∈ ∈ b π2 ∞ ( 1)n f(x) = + 4 − cos nx 3 n2 n=1 X b for all x R, in particular, when x = π, the series converges to π2, so that ∈ π2 ∞ 1 π2 = + 4 , 3 n2 n=1 X and therefore ∞ 1 π2 ζ(2) = = . n2 6 n=1 X

π2 Exercise 3. Show that ζ(2) = by evaluating the double integral 6 1 1 dx dy I = 1 xy Z0 Z0 − in two different ways.

(a) Obtain I = ζ(2) from the expansion 1 = 1 + xy + x2y2 + x3y3 + , 1 xy · · · − which is valid for xy < 1. | | (b) Evaluate the integral directly by rotating the coordinate system about the origin through an angle of π to obtain 4

1/√2 u √2 √2 u dv − dv I = 4 du + 4 du, 2 u2 + v2 2 u2 + v2 Z0 Z0 −  Z1/√2 Z0 − ! integrating with respect to v and then making the substitution u = √2 cos θ.