Notes for the course in Analytic Number Theory
G. Molteni
Fall 2019
revision 8.0
Disclaimer
These are the notes I have written for the course in Analytical Number Theory in A.Y. 2011–’20. I wish to thank my former students (alphabetical order): Gu- glielmo Beretta, Alexey Beshenov, Alessandro Ghirardi, Davide Redaelli and Fe- derico Zerbini, for careful reading and suggestions improving these notes. I am the unique responsible for any remaining error in these notes.
The image appearing on the cover shows a picture of the 1859 Riemann’s scratch note where in 1932 C. Siegel recognized the celebrated Riemann–Siegel formula (an identity allowing to computed with extraordinary precision the values of the Riemann zeta function inside the critical strip). This image is a resized version of the image in H. M. Edwards Riemann’s Zeta Function, Dover Publications, New York, 2001, page 156. The author has not been able to discover whether this image is covered by any Copyright and believes that it can appear here according to some fair use rule. He will remove it in case a Copyright infringement would be brought to his attention.
Giuseppe Molteni
This work is licensed under a Creative Commons Attribution-Non- Commercial-NoDerivatives 4.0 International License. This means that: (Attribu- tion) You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use. (NonCommercial) You may not use the material for commercial purposes. (NoDerivatives) If you re- mix, transform, or build upon the material, you may not distribute the modified material. i Contents
Disclaimer i Notation 1 Chapter 1. Prime number theorem 2 1.1. Preliminary facts: a warm-up 2 1.2. Two general formulas 5 1.3. The ring of arithmetical functions 13 1.4. Dirichlet series: as formal series 16 1.5. Dirichlet series: as complex functions 19 1.6. The analytic continuation of ζ(s) 28 1.7. Some elementary results 29 1.8. The Prime Number Theorem 39 Chapter 2. Primes in arithmetic progressions 61 Chapter 3. Sieve methods 71 3.1. Eratosthenes-Legendre’s sieve 71 3.2. Selberg’s Λ2-method 77 3.3. Sifting more classes 83 3.4. Two sets with positive density 91 Chapter 4. Sumsets 103 Chapter 5. Waring’s problem 113 5.1. First step: cancellation in exponential sums 115 5.2. Second step: integral representation 120 Appendix. Bibliography 129
ii Notation
• Let f and g : R → [0, +∞) functions. Then • f(x) = O(g(x)) as x → x0 ∈ R := R ∪ {±∞} means that the quotient f(x)/g(x) is locally bounded in a neighborhood of x0, i.e. that there exist + a constant C ∈ R and an open set U(x0) such that
f(x)/g(x) ≤ C ∀x ∈ U(x0).
• f(x) g(x) as x → x0 ∈ R := R ∪ {±∞} is an equivalent notation for f(x) = O(g(x)).
• f(x) = o(g(x)) as x → x0 ∈ R := R ∪ {±∞} means that the quotient f(x)/g(x) tends to 0 as x → x0 (in other words, the constant C in previous item can be taken arbitrarily small).
• f(x) g(x) as x → x0 ∈ R := R ∪ {±∞} means that both the quotients f(x)/g(x) and g(x)/f(x) are locally bounded in a neighborhood of x0, i.e. + that there exist a constant C ∈ R and an open set U(x0) such that 1 g(x) ≤ f(x) ≤ Cg(x) ∀x ∈ U(x ). C 0
• f(x) = Ω(g(x)) as x → x0 ∈ R := R ∪ {±∞} means that f(x) = O(g(x)) is false. This means that for every constant C ∈ R and every open set U(x0) there exists x ∈ U(x0) such that |f(x)|/|g(x)| > C.
• Given x ∈ R, the integer part of x is defined bxc := max{n ∈ Z: n ≤ x}. It must be not confused with dxe := min{n ∈ Z: n ≥ x}. The fractional part of x is {x} := x − bxc (in signal processing this is called sawtooth func- tion). According to the definition, {x} is a 1-periodic function R → R with discontinuities in every x ∈ Z: lim {x} = 0, lim {x} = 1, ∀n ∈ Z. x→n+ x→n− • Usually we denote by s the complex argument of a complex function. In A. N. Th. it is customary to use σ and t to denote the real and the imaginary parts of s, respectively. In other words, s = σ + it with σ, t ∈ R. • Given two integers m and n, m|n means that m divides n.(m, n) denotes their greatest common divisor, and [m, n] their smallest common multiple, so that mn = (m, n)[m, n]. • e(x) denotes the function e(x) := e2πix.
1 Chapter 1
Prime number theorem
1.1. Preliminary facts: a warm-up + Let P be the set of prime numbers. For every x ∈ R , let π(x) := ]{p ∈ P : p ≤ x}. How much large π(x) can be? Proposition 1.1 (Euclid) P is not finite, therefore π(x) → ∞ as x → ∞.
Proof. Let p1, . . . , pn be any set of primes. Let N := 1 + p1p2 ··· pn. N is an integer, thus it has a prime factor p. p is not equal to any pj, since N = 1 (mod pj). The argument can be modified in such a way to produce a quantitative result. Proposition 1.2 π(x) ln ln x as x → ∞.
Proof. Let p1 = 2 < p2 < . . . be the complete set of primes (an infinite set, according to the previous result). We use the previous argument to prove that 2n−1 21−1 pn ≤ 2 for every n. In fact, the claim is true for n = 1 (because p1 = 2 ≤ 2 ). By induction on n and following the argument proving Proposition 1.1, we know that n−1 Y 2j−1 Pn−1 2j−1 2n−1−1 2n−1 pn ≤ 1 + p1p2 ··· pn−1 ≤ 1 + 2 = 1 + 2 j=1 = 1 + 2 ≤ 2 . j=1 For every x ≥ 2, let n be such that 22n−1 ≤ x < 22n . Then 2n−1 π(x) ≥ π(2 ) ≥ n ≥ log2 log2 x ln ln x. There are several alternative and elementary proofs of these facts. 2n P´olya. For every n ∈ N let Fn := 2 +1, the nth Fermat number. These numbers are pairwise coprime, i.e.
(Fn,Fm) = 1 ∀n 6= m. Proof. The sequence of Fermat numbers satisfies a kind of recursive formula; Qm−1 in fact k=0 Fj = Fm − 2 for every m ≥ 1, an identity which can be easily proved by induction. The formula shows that Fn divides Fm − 2 whenever n < m; in particular every prime dividing both Fn and Fm is also a factor of Fm and Fm − 2, hence it must be 2. Nevertheless, Fermat’s numbers are odd, therefore they cannot have 2 as common factor. 2 CAP. 1: PRIME NUMBER THEOREM 3
The co-primality implies that every Fn has a special prime factor, pn, which divides Fn and does not divide every Fm with m 6= n. In particular, there are infinity many primes (because there are infinity many Fermat’s numbers) and 2n−2 2n−1 the nth prime is lower than Fn−2 = 2 + 1 ≤ 2 as proved before. (Note that we use here the fact that p1 = 2 and p2 = 3 = F0). n P´olya (variation). For every n ∈ N let Mn := 2 − 1, the nth Mersenne num- m−n ber. These numbers satisfy the relation Mm = 2 Mn + Mm−n for every m ≥ n, so that the greatest common divisor of Mm and Mn is M(m,n). Let
p1 = 2, . . . , pk be any set of distinct primes. Then Mp1 ,...,Mpk are pairwise coprime, therefore there are at least k distinct odd prime numbers (because every Mn is an odd number), in particular there is at least one odd prime num- ber which is greater than pk, and hence one more prime. The argument proves pn that if 2 = p1 < p2 < p3 < ··· is the sequence of prime, then pn+1 < 2 . This upper bound for pn can be used to produce a lower bound for π(x), but of incredibly low quality. Erd˝os. Every integer n can be written in a unique way as a product of a square 2 m and a squarefree q. Fix√ x > 2 and apply that decomposition to every integer n ≤ x. There are x possible values for m, and 2π(x) values for q, at most. Hence √ π(x) bxc = ]{n ∈ N: n ≤ x} ≤ ]{m}· ]{q} ≤ x · 2 implying that π(x) ln x. Note that also that this simple argument already improves Proposition 1.2. Euler. Consider the product Y Y 1 1 1 (1 − 1/p)−1 = 1 + + + + ··· . p p2 p3 p≤x p≤x Every integer n can be written in a unique way as product of prime powers and when n is ≤ x then also the primes appearing in its factorization are ≤ x (trivial). Therefore the previous product gives Y Y 1 1 1 X 1 (1.1) (1 − 1/p)−1 = 1 + + + + ··· ≥ . p p2 p3 n p≤x p≤x n≤x The right hand side diverges in x, hence this inequality already proves the existence of infinitely many primes: only in this case the product appearing to the left hand side diverges. The following argument deduces an interesting lower bound from this argu- ment. The inequality (1 − y) ≥ e−2y holds whenever y ∈ [0, 1/2], hence 4 1.1. PRELIMINARY FACTS: A WARM-UP
(1 − 1/p)−1 ≤ e2/p for every prime p, so that
P 2 Y Y X 1 e p≤x p = e2/p ≥ (1 − 1/p)−1 ≥ , n p≤x p≤x n≤x
i.e.,
X 1 1 X 1 (1.2) ≥ ln . p 2 n p≤x n≤x
P 1 P 1 This inequality proves that p≤x p ln ln x because n≤x n ∼ ln x. Exercise. 1.1 The following steps improve the lower bound (1.2) by removing the constant 1/2 appearing there. 1) Prove that (1 − x)ex = 1 + O(x2) for x → 0 and use this equality to prove that
P 1 Y 1 Y 1−1 Y O(1) e p≤x p = e p = 1 − · 1 + . p p2 p≤x p≤x p≤x
Q −2 2) Prove that p≤x(1 + c · p ) converges to a nonzero constant for every fixed constant c ≥ 0; deduce that X 1 Y 1−1 = ln 1 − + O(1). p p p≤x p≤x 3) Use (1.1) to deduce that X 1 X 1 ≥ ln + O(1). p n p≤x n≤x 4) Conclude that X 1 (1.3) ≥ ln ln x + O(1). p p≤x In Proposition 1.14 we will see that ln ln x is the right behavior for the sum of inverse of primes, since X 1 ∼ ln ln x. p p≤x This is part of a very famous set of results proved with totally elementary tools by Mertens long before the proof of the Prime Number Theorem. CAP. 1: PRIME NUMBER THEOREM 5
1.2. Two general formulas The following formula is due to Abel. It is essentially the discrete version of the well known formula for the partial integration.
Proposition 1.3 (Partial summation formula: 1 v.) Let f, g : N → C. Let F (x) P := 1≤k≤x f(k) for x ≥ 1, and F (x) := 0 if x < 1. Then N N−1 X X f(n)g(n) = F (N)g(N) − F (n)(g(n + 1) − g(n)). n=1 n=1 Proof. f(n) = F (n) − F (n − 1) for every n ≥ 1, hence N N N N X X X X f(n)g(n) = F (n) − F (n − 1)g(n) = F (n)g(n) − F (n − 1)g(n) n=1 n=1 n=1 n=1 N N−1 X X = F (n)g(n) − F (n)g(n + 1) n=1 n=1 N−1 X = F (N)g(N) − F (n)(g(n + 1) − g(n)). n=1
1 R n+1 0 Suppose now that g ∈ C ([0, +∞)), then g(n + 1) − g(n) = n g (x) dx so that the formula becomes N N−1 X X Z n+1 f(n)g(n) = F (N)g(N) − F (n) g0(x) dx. n=1 n=1 n Here F (x) = F (n) for x ∈ [n, n + 1), therefore
N−1 X Z n+1 = F (N)g(N) − F (x)g0(x) dx n=1 n Z N = F (N)g(N) − F (x)g0(x) dx. 1 In this way we have proved the following useful formula.
Proposition 1.4 (Partial summation formula 2 v.) Let f : N → C, g : [0, +∞) → 1 P C, g ∈ C ([0, +∞)). Let F (x) := 1≤k≤x f(k) for x ≥ 1, and F (x) := 0 if x < 1. Then N X Z N (1.4) f(n)g(n) = F (N)g(N) − F (x)g0(x) dx. n=1 1 6 1.2. TWO GENERAL FORMULAS
The importance of the partial summation formula comes from the fact that P the function F (x) := n≤x f(n) (sometime called the cumulating function of f) is no less regular than f. The following result is a simple instance of this fact.
Proposition 1.5 (Ces`aromean value) Let f : N → R and suppose that f(k) → 1 ` ∈ R as k diverges. Then x F (x) → `, too. 1 Note that x F (x) is the mean value of the set {f(k)}k≤x, thus the proposition claims that the mean value of f is at least as regular as the original sequence f.
Proof. Suppose ` ∈ R. Let > 0 be fixed. There exists an integer K such that k ≥ K implies f(k) ∈ (` − , ` + ). Let x ≥ K and let x ∈ N, then 1 1 X 1 X F (x) − ` = f(k) − ` = (f(k) − `) x x x k≤x k≤x so that 1 1 X 1 X X F (x) − ` ≤ |f(k) − `| = |f(k) − `| + |f(k) − `| x x x k≤x k≤K K≤k≤x c 1 X c ≤ + ≤ + , x x x K≤k≤x where c is independent of x. This formula proves the claim when x diverges in N. Trivial bounds involving x and bxc prove the claim for the general case x ∈ R. At last, the statement for ` ∈ {±∞} can be proved in similar way. Usually F (x) has a better behavior (in some sense) than f(n). The following exercise show this principle in action: there we have f(n) = exp(2πinθ) oscillates as a function of n, but F has a finite mean value, and this allows the convergence of the series. Exercise. 1.2 Recall that e(x) := exp(2πix). 1) Prove that X X e2πi(bxc+1)θ − 1 sin(π(bxc + 1)θ) F (x, θ):= e(nθ)= e2πinθ = = eπibxcθ e2πiθ − 1 sin(πθ) 0≤n≤x 0≤n≤bxc for every θ ∈ R\Z.
2) Deduce that for every θ ∈ R\Z there is a constant cθ > 0 such that |F (x, θ)| ≤ cθ independently of x. 3) Using Proposition 1.4 deduce that the series ∞ X e(nθ) converges for every θ ∈ \ . n R Z n=1 CAP. 1: PRIME NUMBER THEOREM 7
N Exercise. 1.3 Let {zj}j=1 be any finite set of distinct complex numbers, with |zj| = 1 for every j. Let α1, . . . , αN be complex numbers, not all equal to zero. Prove that k k sk := α1z1 + ··· + αN zN = Ω(1) as k → ∞, i.e., that the claim limk→∞ sk = 0 is false. Hint: by absurd, suppose that the limit exists and is 0. Suppose α1 6= 0. Deduce 1 P −k a contradiction by computing the limit of x k≤x skz1 in two different ways. The following formula compares a sum with the corresponding integral.
Proposition 1.6 (Euler, Maclaurin) Let c be an integer and f :[c, +∞) → C be a C1 function. Then N X Z N 1 Z N (1.5) f(k) = f(x) dx + f(N) + f(c) + f 0(x)({x} − 1 ) dx. 2 2 k=c c c Proof. It is immediate to verify that Z 1 Z 1 1 0 1 g(0) + g(1) = g(x) dx + g (x)(x − 2 ) dx 2 0 0 1 for every g ∈ C ([0, 1]). Now write this formula for the functions gk(x) := f(x + k) where k ∈ N is arbitrarily fixed; we get Z 1 Z 1 1 0 1 f(k) + f(k + 1) = f(x + k) dx + f (x + k)(x − 2 ) dx. 2 0 0 In the integrals we take the shift x + k → x, so that Z k+1 Z k+1 1 0 1 f(k) + f(k + 1) = f(x) dx + f (x)(x − k − 2 ) dx. 2 k k For x ∈ [k, k + 1), we have x − k = {x}, hence the equality can be written also as Z k+1 Z k+1 1 0 1 f(k) + f(k + 1) = f(x) dx + f (x)({x} − 2 ) dx. 2 k k Now we add the equality for k = c, . . . , N − 1, obtaining N X 1 Z N Z N f(k) − (f(N) + f(c)) = f(x) dx + f 0(x)({x} − 1 ) dx, 2 2 k=c c c which is the claim. Exercise. 1.4 We have proved Proposition 1.6 directly, but it can be deduced also from the partial summation formula (1.4). Hint: set in that formula f(x) = 1, so that F (x) = bxc = x − {x}, and integrate by part the integral containing the factor x. 8 1.2. TWO GENERAL FORMULAS
Example. 1.1 Applying the formula to f(x) = 1/x we have N X 1 (1.6) = ln N + γ + O(1/N) k k=1 R ∞ {x} where γ := 1 − 1 x2 dx = 0.5772 ... is the Euler–Mascheroni constant. In fact, N 1 X 1 Z N 1 1 1 Z N {x} − = dx + 1 + − 2 dx. k x 2 N x2 k=1 1 1
R ∞ {x}−1/2 1 1 The integral 1 x2 dx converges absolutely because | {x} − 2 | ≤ 2 , thus we can write the equality as Z ∞ 1 Z ∞ 1 1 1 {x} − 2 {x} − 2 (1.7) = ln N + 1 + − 2 dx + 2 dx. 2 N 1 x N x 1 R ∞ {x}−1/2 R ∞ {x} Now the claim follows by setting γ := 2 − 1 x2 dx = 1 − 1 x2 dx and noticing that Z ∞ {x} − 1 1 Z ∞ 1 1 (1.8) 2 dx ≤ dx = . 2 2 N x 2 N x 2N The constant γ is probably the most famous and important constant of the Ma- thematics after 0, 1, π, e and i, and is not totally well understood. For instance, the known algorithms for its computation are not very efficient (when compared to the analogous algorithms for other constants, π for instance). One conjectures that γ is a transcendental number, but it is still unknown if γ ∈ Q. Exercise. 1.5 From (1.7) and (1.8), deduce that N X 1 1 0 ≤ − ln N − γ ≤ ∀N ≥ 1. k N k=1 This formula can be used to compute the value of γ, but it is not very efficient (it needs N terms to get γ with an approximation 1/N). Exercise. 1.6 Use (1.6) to prove that: N N X 1 1 γ X 1 1 γ = ln(N/2) + + O(1/N), = ln(2N) + + O(1/N). k 2 2 k 2 2 k=1 k=1 k even k odd Use this result to prove that N X (−1)k − = ln 2 + O(1/N). k k=1 CAP. 1: PRIME NUMBER THEOREM 9
With the same tool, prove that for every m, n ≥ 1 the alternating sum
1 1 1 1 1 1 + 3 + ··· + 2m−1 −( 2 + 4 + ··· + 2n ) | {z } | {z } m terms n terms 1 + 1 + 1 +···+ 1 −( 1 + 1 +···+ 1 ) + ··· = ln(4m/n). 2m+1 2m+3 4m−1 2n+2 2n+4 4n 2 | {z } | {z } m terms n terms This is a concrete example of the phenomenon proved by Riemann: a series con- verges unconditionally (i.e. the convergence and the value of the series are inde- pendent of any reordering of its terms) if and only if it converges absolutely. In other words, if a series converges only simply, then there are reorderings of the same numbers producing new series which converge to different values.
Exercise. 1.7 Use Proposition 1.6 to prove that for every ` ∈ N, N X (ln k)` = N(ln N)` + O(N(ln N)`−1). k=1 With an inductive argument on ` and using also Proposition 1.4 prove the more precise equality N X ` ` (ln k) = NP`(ln N) + O((ln N) ), k=1 P` `−n `! n where P`(x) is the polynomial n=0(−1) n! x . The formula (1.5) can be considerably extended. For every integer n, let {Bn(x)} be the set of Bernoulli polynomials, i.e. the family of polynomials which are defined recursively as: Z 1 0 B0(x) = 1,Bn(x) = nBn−1(x), Bn(x) dx = 0, ∀n ≥ 1. 0 For example: 4 3 2 1 B0(x) = 1,B4(x) = x − 2x + x − 30 , 1 5 5 4 5 3 1 B1(x) = x − 2 ,B5(x) = x − 2 x + 3 x − 6 x, 2 1 6 5 5 4 1 2 1 B2(x) = x − x + 6 ,B6(x) = x − 3x + 2 x − 2 x + 42 , 3 3 2 1 7 7 6 7 5 7 3 1 B3(x) = x − 2 x + 2 x, B7(x) = x − 2 x + 2 x − 6 x + 6 x.
Exercise. 1.8 The following steps give a uniform bound for Bn(x). R 1 1) Deduce from 0 Bn(x) dx = 0 that Bn has at least a root in (0, 1) when n ≥ 1. 10 1.2. TWO GENERAL FORMULAS
0 2) Using Lagrange’s intermediate values theorem deduce that kBnk∞ ≤ kBnk∞, when n ≥ 1, where the sup norms are for x ∈ [0, 1].
3) Use the recursive definition of Bn to deduce that kBnk∞ ≤ nkBn−1k∞, and by induction conclude that kBnk∞ ≤ n!. This is not the correct order of growth for kBnk∞, since it is known that kBnk∞ n! (2π)n , but the simple argument captures the main features of the polynomials, i.e. their over-exponential growth: I’m indebt with Guglielmo Beretta for this argument (Thanks!). Exercise. 1.9 The following steps prove some relations among Bernoulli polyno- mials. P∞ tn 1) Let F (x, t) := n=0 Bn(x) n! . Prove that text F (x, t) = . et − 1 Hint: use the recursive formula to deduce that ∂xF (x, t) = tF (x, t), so that F (x, t) = a(t)ext for some function a(t). Then, integrating term by term the et−1 R 1 definition of F (x, t) prove that a(t) t = 0 F (x, t) dx = 1. n 2) From 1) deduce that F (1−x, t) = F (x, −t), i.e. that Bn(1−x) = (−1) Bn(x). t 3) From 1) deduce that F (0, t) + 2 is an even function of t, and that therefore Bn(0) = 0 when n is odd and > 2.
4) Conclude that Bn(1) = Bn(0) for every n ≥ 2. Pn n n−k 5) Prove that Bn(x) = k=0 k Bk(0)x for every n. Hint: use the equality F (x, t) = extF (0, t). 6) Specializing the previous formula to x = 1 and using 4), deduce a recursive formula for the sequence Bn(0), n ∈ N. |x| 7) Prove that |Bn(x)| ≤ n!e for every x ∈ R and every n. Hint: use the formula in Step 5 and the bound in Ex. 1.8.
Pn n+1 n 8) Prove that k=0 k Bk(x) = (n + 1)x for every n. Hint: use the equality (et − 1)F (x, t) = text.
n−1 9) Prove that Bn(x + 1) − Bn(x) = nx for every n. Hint: use the equality F (x + 1, t) − F (x, t) = text.
PN n 10) For every N ≥ 0 and n ∈ N, n ≥ 1 let Sn(N) := k=0 k . Prove that 1 Sn(N) = n+1 (Bn+1(N + 1) − Bn+1(0)). CAP. 1: PRIME NUMBER THEOREM 11
This is Faulhaber’s formula, giving the value of the sum of nth power of integers up to N as polynomial in N. 2 Note that B1(x) = x − 1/2, so that {x} − 1/2 is B1({x}). Suppose that f ∈ C and let k be any integer, then Z k+1 Z k+1− 0 1 0 f (x)({x} − 2 ) dx = lim f (x)B1({x}) dx k →0+ k+ 1 where we have introduced the parameter because 2 B2(x) is a primitive for B1(x) 1 for every x, but 2 B2({x}) is a primitive for B1({x}) only for x ∈ R\Z. Now we can integrate by parts, getting h B ({x}) k+1− Z k+1− B ({x}) i = lim f 0(x) 2 − f 00(x) 2 dx + →0 2 k+ k+ 2 B ({x}) k+1− Z k+1 B ({x}) = lim f 0(x) 2 − f 00(x) 2 dx + →0 2 k+ k 2 B (1 − ) B () Z k+1 B ({x}) = lim f 0(k + 1 − ) 2 − f 0(k + ) 2 − f 00(x) 2 dx →0+ 2 2 k 2 B (1) B (0) Z k+1 B ({x}) = f 0(k + 1) 2 − f 0(k) 2 − f 00(x) 2 dx 2 2 k 2 B (0) Z k+1 B ({x}) = 2 (f 0(k + 1) − f 0(k)) − f 00(x) 2 dx. 2 k 2 Summing for k = c, . . . , N − 1 we get Z N Z N 0 1 B2(0) 0 0 00 B2({x}) f (x)({x} − 2 ) dx = (f (N) − f (c)) − f (x) dx, c 2 c 2 so that (1.6) becomes
N X Z N (1.9) f(k) = f(x) dx − B1(0)(f(N) + f(c)) k=c c Z N B2(0) 0 0 1 00 + (f (N) − f (c)) − f (x)B2({x}) dx. 2 2 c Now it is evident in which way the formula can be further iterated for sufficiently regular functions. R 1 Remark. 1.1 Why the polynomial Bn are normalized by setting 0 Bn(x) dx = 0? Because this condition says that integral mean value of Bn is zero, and this is R N 00 convenient for the estimation of the remainder term 0 f (x)B2({x}) dx (or even R N (n) the more general 0 f (x)Bn({x}) dx). 12 1.2. TWO GENERAL FORMULAS
Example. 1.2 Applying (1.9) to f(x) = ln x we have N X 1 (1.10) ln(N!) = ln k = N ln N − N + ln N + c + O(1/N), 2 k=1 where c is a constant that the method does not allow to determine but which can be found with a different argument (for instance as a consequence of Wallis’ 1 formula): c = 2 ln(2π). The resulting formula for ln(N!) is due to Stirling. Exercise. 1.10 Probably you are curious about Wallis’ formula we mentioned before, i.e. about a possible way to identify the constant c in (1.10). Here the sketch of the proof. R π n n−1 1) Let In := 0 (sin x) dx. Prove that I0 = π, I1 = 2 and that In = n In−2 for every n ≥ 2. 2) Recall that the double factorial !! is defined as 0!! = 1!! = 1 and n!! := n(n − 2)(n − 4) ··· for n ≥ 2, where the product decreases up to 1 or 2, according to the parity of n. Prove that (2k + 1)! (2k + 1)! (2k)!! = 2kk!, (2k + 1)!! = = . (2k)!! 2kk!
R π 2k+1 R π 2k 2k+1 3) Prove that I2k+1= 0 (sin x) dx≤ 0 (sin x) dx=I2k I2k (2k − 1)!!(2k + 1)!! I0 (2k − 1)!(2k + 1)!k = = 4k 4 π. I2k+1 (2k)!!(2k)!! I1 2 k! √ N c O(1/N) 5) The result in Example√ 1.2 can be written as N! = (N/e) e Ne , so that N! ∼ (N/e)N ec N. Inserting this asymptotic into 4), after some simpli- fications prove that the right hand side tends to 2πe−2c. This constant must ln(2π) be 1, by 3). This proves that c = 2 , i.e. that √ N! ∼ (N/e)N 2πN. There are several alternative proofs. Some of them are collected in the first chapter of [EL]. 1 Exercise. 1.11 Use (1.9) and the fact that |B2(x)| ≤ 6 for x ∈ [0, 1] to prove that N 1 X 1 1 − ≤ − ln N − − γ ≤ 0 ∀N ≥ 1. 6N 2 k 2N k=1 CAP. 1: PRIME NUMBER THEOREM 13 This formula can be used to√ compute the value of γ and is more efficient than the one in Ex. 1.5 (it needs N terms to get γ with an approximation 1/N). This formula for γ can be further improved using the Euler-Maclaurin formula at higher levels (involving Bernoulli polynomials of higher order), and was used by Euler exactly for this purpose. Exercise. 1.12 Let α ∈ (0, 1). Prove that there exists a constant cα such that X 1 N 1−α ξ(α, N) = + c + ∀N ≥ 1, nα 1 − α α N α n≤N with ξ(α, N) ∈ [0, 1] for every N. 1.3. The ring of arithmetical functions In Analytic Number Theory it is customary to call arithmetical function every function f : N\{0} → C; the notion of arithmetical function therefore overlaps with the one of sequence. The set of arithmetical functions has a natural structure of commutative and associative ring with unit, with respect to the pointwise sum: (f + g)(n) := f(n) + g(n) ∀n ∈ N\{0}, and the Dirichlet product: X X (f ∗ g)(n) := f(d)g(n/d) = f(d)g(d0). d|n d,d0 dd0=n The second representation shows immediately the equality f ∗ g = g ∗ f. The unit with respect to this product is the function δ which is defined as: δ(1) = 1, δ(n) = 0 for every n > 1. Exercise. 1.13 Prove that f is invertible in the ring of the arithmetical functions if and only if f(1) 6= 0. A function f is called: -) multiplicative, when f(mn) = f(m)f(n) for every couple of coprime integers m, n; -) completely multiplicative, when f(mn) = f(m)f(n) for every couple of integers m, n; -) additive, when f(mn) = f(m)+f(n) for every couple of coprime integers m, n; -) completely additive, when f(mn) = f(m) + f(n) for every couple of integers m, n. 14 1.3. THE RING OF ARITHMETICAL FUNCTIONS We will see several examples of arithmetical functions having (one of) these pro- perties. The most interesting is certainly the multiplicativity, as a consequence of the following fact. Proposition 1.7 Let f and g be multiplicative, then f ∗ g is multiplicative, too. Proof. Let m and n be coprime. Then every divisor of mn can be factorized in a unique way as product of two integers d and d0, with d dividing m and d0 dividing n. Vice-versa, every couple d, d0 of divisors of m and n respectively, produces a divisors dd0 of mn. As a consequence X m n X X m n (f ∗ g)(mn) = f(dd0)g = f(dd0)g d d0 d d0 d|m d|m d0|n d0|n X X m n X m X n = f(d)f(d0)g g = f(d)g f(d0)g d d0 d d0 d|m d0|n d|m d0|n = (f ∗ g)(m)(f ∗ g)(n), where for the intermediate equality we have used the fact that (m, n) = 1, d|m, 0 0 0 d |n imply (d, d ) = 1 = (m/d, n/d ), and the multiplicativity of f and g. Exercise. 1.14 Prove that if f is invertible and multiplicative, then also f −1 is multiplicative; this proves that the multiplicative and invertible arithmetical functions form an abelian group. Here we recall some of the most useful arithmetical functions. δP , the characteristic function of the primes, defined as: δP (n) = 1 when n ∈ P, 0 otherwise; δ, the delta function, defined as: δ(1) = 1, δ(n) = 0 for all n > 1; 1, the unit function, defined as: 1(n) = 1 for all n ≥ 1; I, the identity function, defined as I(n) := n for all n; ω, the omega function, defined as: ω(n) := #{p: p|n} (i.e., the number of distinct primes dividing n); µ, the M¨obiusfunction, defined as: µ(n) := 0 if n not squarefree, µ(n) = (−1)ω(n) if n is squarefree; Λ, the Von Mangoldt function, defined as: Λ(n) := 0 if n not a prime power, Λ(n) := ln p if n = pk for some prime p and some power k > 0; P τ στ , the τ-divisor function, defined as: στ (n) := d|n d for every n, when τ is a fixed parameter in C. CAP. 1: PRIME NUMBER THEOREM 15 P d, the divisor function, defined as: d(n) := σ0(n) = d|n 1 for every n; P σ, the sum of divisors function, defined as: σ(n) := σ1(n) = d|n d for every n; ϕ, the totient Euler function, defined as ϕ(n) := the cardinality of the set of ∗ integers in 1, . . . , n which are coprime to n; in other words, ϕ(n) = ](Z/nZ) . The following facts can be verified directly: 1) ω is additive; 2) µ, 1, στ , d, σ are multiplicative; 3) ϕ is multiplicative, for instance as a consequence of the Chinese remainder Q theorem, and ϕ(n) = n p|n(1 − 1/p); τ 4) d = 1 ∗ 1, σ = 1 ∗ I, στ = 1 ∗ I . The following equality is less trivial and is fundamental for our purposes. It is called second form of the M¨obiusidentity (the first one being a slightly different identity): (1.11) 1 ∗ µ = δ, P i.e., d|n µ(d) = 0 for every n > 1. This equality shows that µ is the inverse of 1 in the ring of the arithmetical functions. Proof. Both δ and 1 ∗ µ are multiplicative, therefore it is sufficient to prove their equality for prime powers. Let k > 0 and p be any prime number, then X (1 ∗ µ)(pk) = µ(d) = µ(1) + µ(p) = 0 = δ(pk), d|pk which proves the claim. The associativity of the ring implies that (1.12) F = 1 ∗ f ⇐⇒ f = µ ∗ F, i.e., that X X (1.13) F (n) = f(d) ⇐⇒ f(n) = F (d)µ(n/d). d|n d|n Exercise. 1.15 (Erd˝os)Let f(n): N → (0, +∞) be a multiplicative and monotone α 1 function. Then, there exists α ∈ R such that f(n) = n for every n. 1The original proof was very difficult but now it has been considerably simplified, for instance see E. Howe: A new proof of Erd˝os’theorem on monotone multiplicative functions, Amer. Math. Monthly 93(8), 593–595, 1986. See also [T], Ch. 1.2 Ex. 10. 16 1.4. DIRICHLET SERIES: AS FORMAL SERIES P Exercise. 1.16 Prove that I = 1 ∗ ϕ and that ϕ = I ∗ µ, i.e. that n = d|n ϕ(d) P and ϕ(n) = d|n dµ(n/d). P Exercise. 1.17 Prove that ln = 1∗Λ and that Λ = µ∗ln, i.e. that ln n = d|n Λ(d) P P and Λ(n) = d|n µ(d) ln(n/d). Deduce that Λ(n) = − d|n µ(d) ln d. Exercise. 1.18 Let h be a completely additive map. Let Dh be defined on the ring of arithmetical functions by setting Dhf : n → (Dhf)(n) := f(n)h(n) (i.e., the pointwise multiplication by the values of h). Prove that Dh is a derivation, i.e. that Dh(f ∗ g) = (Dhf) ∗ g + f ∗ Dhg. The ring of arithmetical functions supports also other derivations which are not of this kind.2 Exercise. 1.19 The ring of arithmetical functions is a Unique Factorization Do- main, i.e. every arithmetical function can be written in a unique way (up to reordering) as product of irreducible arithmetical functions.3. This is not known for the ring of somewhere converging Dirichlet series and this is a pity because, if proved, such unique factorization property would have many important conse- quences for the number theory. 1.4. Dirichlet series: as formal series To every arithmetical function f we associate a formal series F and vice-versa, in the following way: ∞ X f(n) f : \{0} → ⇐⇒ F (s) := . N C ns n=1 Note that we are not assuming any hypothesis about the convergence of the series defining F (s), so that we consider it (for the moment) only as a formal series. Any series of the form F is called Dirichlet series. For instance ∞ X 1 1 ⇐⇒ =: ζ(s) Riemann’s zeta function, ns n=1 ∞ X δ(n) δ ⇐⇒ = 1, ns n=1 2See H. N. Shapiro: On the convolution ring of arithmetic functions, Comm. Pure Appl. Math. 25, 287–336, 1972. 3See E. D. Cashwell, C. J. Everett: The ring of number-theoretic functions, Pacific. J. Math. 9, 975–985, 1959; and Formal power series Pacific. J. Math. 13, 45–64, 1963. CAP. 1: PRIME NUMBER THEOREM 17 ∞ X n ⇐⇒ = ζ(s − 1), I ns n=1 ∞ X nτ τ ⇐⇒ = ζ(s − τ). I ns n=1 As usual when dealing with formal series, we consider two Dirichlet series F and G as equal if and only if they have the same coefficients. Also the set of formal Dirichlet series is a ring, with respect to the pointwise sum ∞ ∞ ∞ X f(n) X g(n) X f(n) + g(n) F (s) = ,G(s) = , =⇒ (F + G)(s) = , ns ns ns n=1 n=1 n=1 and product ∞ ∞ ∞ ∞ ∞ X f(m) X g(n) X X f(m)g(n) X (f ∗ g)(n) (FG)(s) := = = . ms ns (mn)s ns m=1 n=1 m=1 n=1 n=1 The last formula shows that the ring of arithmetical functions (with the ∗ Dirichlet product) and the ring of formal Dirichlet series (and pointwise sum and product) are isomorphic, so that we can prove identities about arithmetical functions simply multiplying the corresponding Dirichlet series (and vice-versa, of course). For instance, we have ∞ X d(n) d = 1 ∗ 1 =⇒ = ζ2(s), ns n=1 ∞ X σ(n) σ = 1 ∗ =⇒ = ζ(s)ζ(s − 1), I ns n=1 ∞ X στ (n) σ = 1 ∗ τ =⇒ = ζ(s)ζ(s − τ), τ I ns n=1 ∞ X µ(n) µ = 1−1 =⇒ = ζ−1(s). ns n=1 For multiplicative arithmetical functions f an alternative representation is pos- sible (for the moment only as formal identity, without any notion of convergence). In fact, from the unique factorization of every integer as product of prime powers we have ∞ ν ν ν Y f(p) f(p2) f(p3) X f(p 1 ) · f(p 2 ) ··· f(p k ) 1 + + + + ··· = 1 2 k ps p2s p3s ns p n=1 18 1.4. DIRICHLET SERIES: AS FORMAL SERIES ν1 ν2 νk where n = p1 · p2 ··· pk is the factorization of n in prime powers (for n = 1 the product is empty and its value is taken equal to 1, by definition). When f ν1 ν2 νk is multiplicative we have f(p1 ) · f(p2 ) ··· f(pk ) = f(n), so that we get the final equality ∞ Y f(p) f(p2) f(p3) X f(n) (1.14) 1 + + + + ··· = , ps p2s p3s ns p n=1 which is called representation as Euler product. When f is completely multiplica- tive the identity can be further elaborated because in this case f(p) f(p2) f(p3) f(p) f(p)2 f(p)3 f(p)−1 1 + + + + ··· = 1 + + + + ··· = 1 − , ps p2s p3s ps p2s p3s ps again as formal identity between power series in the variable f(p)/ps. In this case the Euler product becomes ∞ Y f(p)−1 X f(n) (1.15) 1 − = . ps ns p n=1 The multiplicativity of the functions 1 and µ gives the representations ∞ Y 1 −1 X 1 1 − = = ζ(s) ps ns p n=1 ∞ Y 1 X µ(n) 1 − = = ζ−1(s), ps ns p n=1 so that now the equality (1.11) saying that 1 ∗ µ = δ simply becomes ∞ ∞ X 1 X µ(n) = ζ(s)ζ−1(s) = 1, ns ns n=1 n=1 and Equalities (1.12) and (1.13) can be formulated by saying that ζ(s)F (s) = G(s) ⇐⇒ F (s) = ζ−1(s)G(s). Exercise. 1.20 Using the multiplicativity and the representation as Euler product it is now easy to prove that: ∞ X ϕ(n) ζ(s − 1) = . ns ζ(s) n=1 Exercise. 1.21 Prove that X 1 Y 1 = 1 + ; ns ps n squarefree p CAP. 1: PRIME NUMBER THEOREM 19 deduce that X 1 ζ(s) = . ns ζ(2s) n squarefree Exercise. 1.22 This is a generalization of the previous exercise. Let r be a fixed r positive integer. An integer n is called r-power free when p|n implies that p - n (i.e., when n is not divisible by any r-power which is not equal to 1). Prove that X 1 ζ(s) = . ns ζ(rs) n r-power free Exercise. 1.23 Again, using the multiplicativity and the representation as Euler product prove that for every couple of arbitrarily fixed τ, ν ∈ C, one has ∞ X στ (n)σν(n) ζ(s)ζ(s − τ)ζ(s − ν)ζ(s − τ − ν) = . ns ζ(2s − τ − ν) n=1 This identity is due to Ramanujan. As special cases we have ∞ 2 X |στ (n)| ζ(s)ζ(s − τ)ζ(s − τ)ζ(s − τ − τ) = ∀ τ ∈ , ns ζ(2s − τ − τ) C n=1 ∞ 2 2 X |σiτ (n)| ζ (s)ζ(s − iτ)ζ(s + iτ) = ∀ τ ∈ , ns ζ(2s) R n=1 ∞ X d(n)2 ζ4(s) = . ns ζ(2s) n=1 1.5. Dirichlet series: as complex functions We have seen how the Dirichlet series are already useful when are considered as formal series. Nevertheless, their full strength appears when they are considered as complex functions, but for this we have to discuss their convergence as series. There is very good introduction to this topic in [Titch], Ch. IX, and Hardy and Riesz [HR] have dedicated an entire book to this subject. P∞ s Theorem 1.1 Suppose that the Dirichlet series F (s) := n=1 f(n)/n converges for s = s0 ∈ C. Then it converges for every s with Re(s) > Re(s0) and the convergence is uniform in the sector S` := {s: |Im(s − s0)| ≤ ` Re(s − s0)}, for every ` > 0. 20 1.5. DIRICHLET SERIES: AS COMPLEX FUNCTIONS SH Im(s) s Im(s0) s0 s0) Re(s)Re( Figure 1.1. Proof. Without loss of generality we can assume that F (s0) = 0, because we can satisfy this condition simply by change the value of f(1) into f(1) − F (s0). Let P s0 S(x) := n≤x f(n)/n , with S(x) = 0 when x < 1. By Proposition 1.4 we get that for every M < N ∈ N N N X f(n) X f(n) 1 = ns ns0 ns−s0 n=M+1 n=M+1 S(N) S(M) Z N s0−s−1 = s−s − s−s + (s − s0) S(x)x dx. N 0 M 0 M Now, fix > 0 and take M large enough to have |S(m)| < for every m ≥ M: such an M exists because F (s0) = 0, by hypothesis. Moreover, we take s with Re(s) ≥ Re(s0), so that we deduce that N f(n) |S(N)| |S(M)| Z N X Re(s0−s)−1 ≤ + +|s − s0| |S(x)|x dx ns N Re(s−s0) M Re(s−s0) n=M+1 M Z N Z +∞ Re(s0−s)−1 Re(s0−s)−1 ≤ 2 + |s − s0| x dx ≤ 2 + |s − s0| x dx M 1 |s − s | |Im(s − s )| = 2 + 0 ≤ 3 + 0 . Re(s − s0) Re(s − s0) When s ∈ S` the last inequality becomes ≤ (3 + `), so that the convergence (uniform in S`) follows by the Cauchy test. In analogy with the power series, the previous result motivates the introduction of the notion of abscissa of convergence, which is σc := inf{Re(s0): F (s) converges at s = s0}, CAP. 1: PRIME NUMBER THEOREM 21 with σc = +∞ when the series does not converge in any point, and σc = −∞ when the series converges everywhere. In fact, from the previous theorem we deduce immediately the following fact. P∞ s Corollary 1.1 Suppose that the Dirichlet series F (s) := n=1 f(n)/n converges somewhere (hence σc ∈ [−∞, +∞)), then the series converges in the half-plain Re(s) > σc and the convergence is uniform in every compact subset. s Each function 1/n = exp(−s ln n) is holomorphic in C, so that from Morera’s theorem we deduce immediately the following regularity result. P∞ s Corollary 1.2 Suppose that the Dirichlet series F (s) := n=1 f(n)/n converges somewhere (hence σc ∈ [−∞, +∞)), then F (s) is holomorphic in the half-plain H := {s ∈ C: σ > σc} and its derivative can be computed termwise, so that ∞ X f(n) ln n F 0(s) = − ∀s: Re(s) > σ . ns c n=1 Proof. F (s) is continuous in Re(s) > σc, because the convergence is uniform in every compact (so we have uniform convergence in a suitable open neighborhood of every fixed point) and the summand are evidently continuous. Let Γ be any closed (simple and regular) curve contained in H. Then ∞ ∞ Z Z X f(n) X Z f(n) F (s) ds = ds = ds ns ns Γ Γ n=1 n=1 Γ where the inversion of the sum and the integral is allowed by the uniform conver- R f(n) gence (the curve Γ is evidently a compact set). Each integral Γ ns ds is null, −s because the map n is holomorphic in C, hence Z F (s) ds = 0. Γ Therefore F (s) is a complex map which is continuous and whose integral over every closed curve is zero: Morera’s theorem allows to conclude that F (s) is holomorphic in Re(s) > σc. 0 The proof of the formula for F (s) runs as follow. Let s0 ∈ H be fixed, and let Γ be a circle centered at s0, sufficiently small to be contained in H and positively oriented. Then the Cauchy formula for the derivative of a holomorphic function says that ∞ 1 Z F (z) 1 Z X f(n)n−z F 0(s ) = dz = dz. 0 2πi (z − s )2 2πi (z − s )2 Γ 0 Γ n=1 0 22 1.5. DIRICHLET SERIES: AS COMPLEX FUNCTIONS The uniform convergence in Γ allows to exchange the integral and the series, so that ∞ X 1 Z n−z F 0(s ) = f(n) dz. 0 2πi (z − s )2 n=1 Γ 0 −s The inner integral is the derivative of the map s 7→ n at s0 (because this map is holomorphic), thus its value is −n−s0 ln n, so that ∞ 0 X −f(n) ln n F (s0) = , ns0 n=1 which is the claim. Exercise. 1.24 Note that Corollary 1.2 shows that when F (s) is a Dirichlet series, 0 0 then F (s) is a Dirichlet series too, and that if we denote by σc and σc the abscissa 0 0 of convergence for F (s) and F (s) respectively, then σc ≤ σc. Prove that actually 0 σc = σc. Hint: F (s) is a primitive of F 0(s), and the integration along every compact path 0 0 can be done termwise, by the uniform convergence of F (s) in Re(s) > σc. The notion of abscissa of convergence is similar to the one of radius of conver- gence for power series, nevertheless there is an important difference: a Dirichlet series can have a finite abscissa σc without having any singularity along the vertical line Re(s) = σc. For power series, there is always a singularity on the critical circle. This different behavior is due essentially to the lack of compactness (the vertical line is not compact, while the critical circle is evidently a compact set). An exam- P∞ n s ple of this phenomenon is the series n=1(−1) /n for which σc = 0 (prove it, for instance by using Proposition 1.3 with f(n) = (−1)n and g(n) = n−s to prove that PN n s limN→∞ n=1(−1) /n exists and is finite whenever Re(s) > 0) and that admits an analytic continuation to C as holomorphic function (see Ex. 1.28). However, for Dirichlet series with non-negative coefficients this phenomenon cannot happen: this is the claim of the following result, due to Landau. P∞ s Theorem 1.2 (Landau) Suppose that the Dirichlet series F (s) := n=1 f(n)/n converges somewhere, and that F (s) has an analytic continuation in Ω\{σc} where Ω is an open set containing the point s = σc. If f(n) ≥ 0 for every n, then σc is a singularity for F (s). Proof. Without loss of generality we can assume that σc = 0 (because we can always translate the problem to the analogous problem for F (s+σc) whose abscissa of convergence is 0). By absurd, suppose that F (s) is holomorphic in an open CAP. 1: PRIME NUMBER THEOREM 23 neighborhood U of s = 0, so that its Taylor power series centered at 1 ∞ X F (k)(1) F (s) = (s − 1)k k! k=0 has a convergence radius strictly greater than 1. By Corollary 1.2 ∞ X f(n) lnk n F (k)(1) = (−1)k ∀k, n n=1 hence ∞ ∞ X X (1 − s)k f(n) lnk n F (s) = . k! n k=0 n=1 Let s ∈ U be negative, then each term appearing in this double series is non- negative (here we use the assumption f(n) ≥ 0 for all n) so that the series can be exchanged without modifying its value; in this way we get ∞ ∞ X f(n) X (1 − s)k lnk n F (s) = . n k! n=1 k=0 The inner sum here is exp((1 − s) ln n) = n1−s, hence we have proved that ∞ X f(n) F (s) = ns n=1 holds for some negative s. This means that the Dirichlet series converges for some negative s, which is impossible since we have assumed that σc = 0. In its essence, the previous theorem holds because a non-negative double series can be reordered without assuming its convergence. The following exercise provides an other instance of this fact, this time in the more familiar setting of the power series. P∞ n Exercise. 1.25 (Pringsheim’s Theorem) Let F (z) := n=0 a(n)z be a power series with convergence radius equal to 1 and suppose that it has an analytic continuation in an open set containing z = 1. Prove that if a(n) ≥ 0 for every n then the point z = 1 is a singularity for F (z). Hint: Imitate the proof of the Landau theorem. Suppose (by absurd) that F (s) is holomorphic in an open set containing z = 1. Consider the representation as power series at s = 1/2 of F . The convergence radius of this power series is strictly greater than 1/2 (why?). Use the representation as power series at z = 0 to find F (k)(1/2) and substitute in the power series at z = 1/2. In this way you get a double series which can be exchanged (because its terms are non-negative). 24 1.5. DIRICHLET SERIES: AS COMPLEX FUNCTIONS P∞ s A Dirichlet series n=1 f(n)/n is called absolutely convergent in s0 ∈ C when ∞ ∞ X f(n) X |f(n)| = < ∞. ns0 nRe(σ0) n=1 n=1 Note that the absolute convergence depends only on the behavior of the series at Re(σ0), so that if the series converges absolutely at s0 then it converges absolutely in every point of the vertical line Re(s) = Re(s0). The absolute convergence implies the usual convergence (a simple application of the Cauchy test). Moreover, it is immediate to prove that if a Dirichlet series converges absolutely at s0, then it converges absolutely in every point s with Re(s) ≥ Re(s0) and that the convergence is uniform in every half-plain H := {s: Re(s) > Re(s0)+}. Therefore, it is useful to introduce the notion of abscissa of absolute convergence, which is σa := inf{σ ∈ R: F (σ) converges absolutely}. Evidently σc is always ≤ σa. P∞ s Exercise. 1.26 Let F (s) = n=1 f(n)/n . Suppose that it converges somewhere, so that σc < +∞. Prove that σa ≤ σc + 1, i.e. that the convergence is absolute in every point s with Re(s) > σc + 1. The previous exercise proves that σc ≤ σa ≤ σc +1; in general it is not possible to be more precise, in fact for every choice of u ∈ [0, 1] it is possible to define a Dirichlet series with σc = 0 and σa = u. P∞ n `s Exercise. 1.27 Let ` ∈ N. Let F`(s) := n=1(−1) /n . Note that this is a Dirichlet series, since ` ∈ N. Prove that for this series σc = 0 (using the partial summation formula) and σa = 1/`. As a more elaborated example, let α ≥ 1. Let Fα be the Dirichlet series ∞ X (−1)n F (s) = α dnαes n=1 where dxe := inf{n ∈ Z: x ≤ n}. Prove that for this series σc = 0 and σa = 1/α. For multiplicative functions the alternative representation as Euler product is possible. We see now that this representation is valid whenever the Dirichlet series converges absolutely. P∞ s Theorem 1.3 Let F (s) = n=1 f(n)/n converge absolutely at s0. Let f(n) be multiplicative, then the infinite product ∞ Y X f(pk) (1.16) 1 + pks p k=1 converges absolutely at s0 to F (s0). CAP. 1: PRIME NUMBER THEOREM 25 Q Proof. Recall that an infinite product (1 + an) converges absolutely (by defi- Q n P nition), when n(1 + |an|) converges, and that this happens if and only if n |an| converges too. In fact, the inequality 1 + y ≤ ey implies that Y X (1 + |an|) ≤ exp( |an|) n n so that X Y |an| < ∞ =⇒ (1 + |an|) < ∞. n n y/2 Q On the other hand, for y ∈ [0, 1] we have e ≤ 1 + y. If n(1 + |an|) < +∞ then 4 |an| goes to zero , so that |an| < 1 if n is large enough, n > N say. Then 1 X Y (1.17) exp( 2 |an|) ≤ (1 + |an|) n>N n>N so that Y X (1 + |an|) < ∞ =⇒ |an| < ∞. n n Therefore, in order to prove the absolute convergence of (1.16) at s0 it is sufficient to prove that ∞ X X f(pk) < ∞. pks0 p k=1 This is almost immediate, since ∞ ∞ ∞ X X f(pk) X X |f(pk)| X |f(n)| ≤ ≤ < ∞, pks0 pkRe(s0) nRe(s0) p k=1 p k=1 n=1 by hypothesis. Now we have to prove that the product converges to F (s0). We fix P > 0 and consider the finite product ∞ Y X f(pk) 1 + . pks0 p≤P k=1 In this product each factor is a power series converging absolutely, therefore we can rearrange the terms without modifying its value. The multiplicativity of f 4 QN In fact, the sequence n=1(1 + |an|) increases and is larger than 1. Thus its limit `, say, is not QN n=1(1+|an|) ` zero and 1 + |aN | = QN−1 goes to ` = 1. Notice that this argument does not apply n=1 (1+|an|) to simply convergent products, since in that case it is not possible to exclude the case that the limit ` is 0. 26 1.5. DIRICHLET SERIES: AS COMPLEX FUNCTIONS shows that the rearrangement is the sum X f(n) ns0 n∈A where A denotes the set of integers whose prime factors are smaller than P . We note that X f(n) X f(n) F (s0) − = ns0 ns0 n∈A n∈B where B = Ac is the set of integers having at least one prime factor greater then P . Hence ∞ Y X f(pk) X f(n) X f(n) X f(n) 1 + − F (s0) = ≤ ≤ , pks0 ns0 ns0 ns0 p≤P k=1 n∈B n∈B n≥P since each integer in B is not lower than P . The last sum tends to 0 when P → ∞, P f(n) since n ns0 converges, by hypothesis. The following proposition proves an important fact about the localization of zeros of an absolute convergent product; its relevance for Dirichlet series with multiplicative coefficients comes from the previous theorem. Q Proposition 1.8 Let n(1+an) be an absolutely converging infinite product. Then its value is 0 if and only if some factor 1 + an is zero. −2y P Proof. For y ∈ [0, 1/2] we have e ≤ 1 − y. By hypothesis n an converges absolutely, thus |an| < 1/2 if n is large enough, n > N say. Then X Y Y Y exp(−2 |an|) ≤ (1 − |an|) ≤ |1 + an| = (1 + an) . n>N n>N n>N n>N Q Q In particular (1 + an) is strictly positive. This proves that (1 + an) can n>N Q n be equal to 0 if and only if the finite product n≤N (1 + an) is equal to zero, which is the claim. When applied to the Dirichlet series defining the Riemann zeta function, the previous theorems prove that P s Corollary 1.3 In the half-plain Re(s) > 1 the Dirichlet series ζ(s) = n 1/n converges absolutely, is a holomorphic function, has the representation ∞ X 1 Y 1 −1 ζ(s) = = 1 − ns ps n=1 p and is not equal to zero. CAP. 1: PRIME NUMBER THEOREM 27 Remark. 1.2 It is sufficient to modify even only the sign of a finite set of coefficients of ζ to produce a new function having zeros in Re(s) > 1, and therefore badly violating the Riemann hypothesis. The new function has no more a representation as Euler product: this fact suggests that if RH is true, then for its proof in some place the arithmetic will have a fundamental contribution (see Ex. 1.53). There is still one important aspect about the Dirichlet series that we have to dis- P∞ s cuss. When we consider them as formal series, we define the equality n=1 f(n)/n P∞ s = n=1 g(n)/n by saying that this happens if and only if f(n) = g(n) for every n. What happen to this condition when the series converge somewhere and therefore can be considered as functions of complex variable? The following proposition will allow to show that the series are equal as complex functions if and only if they are equal as formal series, i.e. that they are equal if and only if f(n) = g(n) for every n. P∞ s Proposition 1.9 Let F (s) = n=1 f(n)/n converge somewhere. Let σ0 be any real number greater then σa, the abscissa of absolute convergence. Then Z T ∞ 2 1 2 X |f(n)| lim |F (σ0 + it)| dt = < ∞. T →∞ 2T n2σ0 −T n=1 As a consequence, if F (s) is the null function then f(n) = 0 for every n. Proof. The absolute convergence at σ0 implies the absolute convergence of the series in every point of the vertical line {s: s = σ0 + it, t ∈ R}, uniformly in t. The theorem about the absolute convergence of product of series proves that ∞ ∞ 2 X f(m) X f(n) |F (σ0 + it)| = · mσ0+it nσ0−it m=1 n=1 ∞ X f(m)f(n) n it = (mn)σ0 m m,n=1 and that this double series converges absolutely and uniformly for t ∈ R. The uniform convergence gives ∞ Z T Z T it 1 2 X f(m)f(n) 1 n |F (σ0 + it)| dt = dt. 2T (mn)σ0 2T m −T m,n=1 −T A simple computation shows that ( 1 Z T n it 1 if n = m dt = sin(T ln(n/m)) 2T −T m T ln(n/m) if n 6= m. 28 1.6. THE ANALYTIC CONTINUATION OF ζ(S) 1 Z T n it 1 Z T n it 1 Z T Moreover, dt ≤ dt = dt = 1, so that 2T −T m 2T −T m 2T −T the convergence of the double series in m, n is uniform in T . Hence the limit as T → ∞ can be computed termwise, and we get ∞ Z T Z T it 1 2 X f(m)f(n) 1 n lim |F (σ0 + it)| dt = lim dt T →∞ 2T (mn)σ0 T →∞ 2T m −T m,n=1 −T ∞ ∞ X f(m)f(n) X |f(n)|2 = δm,n = . (mn)σ0 n2σ0 m,n=1 n=1 1.6. The analytic continuation of ζ(s) P∞ s From the previous section we know that ζ(s) = n=1 1/n is a holomorphic function in Re(s) > 1. In this section we prove that ζ(s) admits a ‘natural’ extension as a function in C. For every fixed s, from Proposition 1.6 we get that N Z N Z N X 1 dx 1 1 B1({x}) = + + 1 − s dx ns xs 2 N s xs+1 n=1 1 1 1−s Z N 1 N 1 1 B1({x}) = + + s + 1 − s s+1 dx. s − 1 1 − s 2 N 1 x Now, suppose that Re(s) > 1, then we can take the limit N → ∞, getting Z ∞ 1 1 B1({x}) ζ(s) = + − s s+1 dx. s − 1 2 1 x This relation is an identity for Re(s) > 1, but the integral exists in the larger region Re(s) > 0 (the integrand here decays to ∞ as 1 , because B ({x}) xRe(s)+1 1 is bounded). Moreover, this integral defines a holomorphic function in Re(s) > 0 (standard argument, again based upon Morera’s theorem), and (s − 1)−1 is evidently a meromorphic function in C with a unique pole at s = 1, which is simple and where the residue is equal to 1. Thus, the previous equality can be used to define ζ(s) in the larger region Re(s) > 0, as meromorphic function in Re(s) > 0 and having a unique pole (simple, and residue equal to 1) at s = 1. The general theory of analytic continuation allows to conclude that this definition is the unique one which extends ζ(s) as meromorphic function. We can pursuit this argument. With an integration by parts we get Z ∞ Z ∞ B1({x}) B2(0) s + 1 B2({x}) s+1 dx = − + s+2 dx, when Re(s) > 0. 1 x 2 2 1 x CAP. 1: PRIME NUMBER THEOREM 29 As before, this is an equality in Re(s) > 0 but the integral to the right hand side exists in Re(s) > −1 and defines a holomorphic function here. Thus this equality can be used to define the function to the left hand side in this larger region. Iterating this argument m-times we get a formula providing the analytic extension of ζ(s) in Re(s) > 1 − m. Concluding, we have proved the following result. Corollary 1.4 The Riemann zeta function ζ(s) admits an analytic continuation in C as meromorphic function, with a unique pole at s = 1 which is simple and with Ress=1 ζ(s) = 1. There are more elegant ways to prove the same conclusion. For example we can extend ζ to Re(s) > 0 with the previous argument and then use the functional equation satisfied by ζ (a relation connecting the values of ζ(s) and ζ(1 − s)) to get the analytic continuation. The functional equation is a fundamental tool for the comprehension of the deep analytical properties of ζ(s), but it is not useful for our limited purpose (the proof of the Prime Number Theorem). We will not mention it anymore in these notes. P∞ n s 1−s Exercise. 1.28 Prove that f(s) := n=1(−1) /n = (2 − 1)ζ(s) when Re(s) > 1. Notice that this equality provides the analytic continuation to C of f(s), as holomorphic function. P s P s Hint: Look for a representation of n even 1/n and n odd 1/n in terms of ζ(s), 1−s and subtract. For the second part, note that (2 − 1) is holomorphic in C and has a zero at s = 1. 1.7. Some elementary results Before to face the Prime Number Theorem it is a good idea to begin with a more modest target. Mertens’ result (in its simplest formulation) and Dirichlet’s result about the mean value of the divisor function will be two good tests. We need a preliminary upper bound for π(x). When we split the integers ≤ x in couples, each couple contains at most one prime; this simple remark gives the upper bound π(x) ≤ bx/2c which with a more analytical language we write as x π(x) ≤ + O(1). 2 This idea can be pushed far away. Let we take another integer, say 6. We split the integers up to x into distinct blocks containing six consecutive integers each one. Besides the first block, in each other block only two primes can appear, at most, because any term which is not coprime to 6 cannot be a prime when it is 30 1.7. SOME ELEMENTARY RESULTS not a divisor of 6 self, and there are only ϕ(6) = 2 integers coprime to 6 in every 2x block. This proves that π(x) ≤ 6 + R where R ≤ 6. In other words, 2x π(x) ≤ + O(1). 6 Repeating this argument with a generic integer N, we get that ϕ(N) (1.18) π(x) ≤ x + O(N), N where O(N) denotes a quantity which is ≤ N. This innocent bound shows that π(x) ϕ(N) lim sup ≤ ∀N, x→∞ x N which is an interesting upper bound, since now we have the possibility to act on the parameter N in order to bound π(x) in a non-trivial way. For instance, let N Q be the product of all prime numbers below L (a new parameter): N = p≤L p. Then ϕ(N) Y 1 Y 1 = 1 − = 1 − . N p p p|N p≤L The inequality 1 − y ≤ e−y shows that ϕ(N) − P 1 (1.19) ≤ e p≤L p . N P 1 We have already proved that p≤L p → ∞ as L diverges, this implies that ϕ(N) lim inf = 0 N→∞ N so that from (1.18) we conclude that (1.20) π(x) = o(x). With a slightly bigger effort we can improve this result. From (1.18), (1.19) Q and (1.3) with N = p≤L p we get L π(x) ϕ(N) N − P 1 N O(1) L ≤ + O ≤ e p≤L p + O ≤ + O x N x x ln L x Q L because N = p≤L p ≤ L . The bound suggests to select L in such a way that the two terms have (approximatively) the same size, in other words in such a way that LL/x ≈ 1/ ln L, i.e., LL ln L ≈ x. The equation LL ln L = x has a complicated ln x solution for L = L(x), but it is easy to see that L(x) is asymptotic to ln ln x . Setting ln x L = ln ln x the bound becomes π(x) O(1) ≤ + O(exp(L ln L − ln x)) x ln ln x CAP. 1: PRIME NUMBER THEOREM 31 O(1) ln x ln x = + O exp ln − ln x ln ln x ln ln x ln ln x O(1) ln x ln ln ln x O(1) 1 = + O exp − = + O ln x . ln ln x ln ln x ln ln x (ln ln x) ln ln x This bound proves the following result. x Proposition 1.10 π(x) ln ln x as x → ∞. The previous claim is interesting, but it is still far from the truth. Next result is still elementary in tools but represents a major improvement on all previous x results because it claims that π(x) ln x : it is due to Chebyshev. 2n n Let n be any positive integer, then the binomial n is lower than 4 , because 2n 2n X 2n ≤ = (1 + 1)2n = 4n. n k k=0 On the other hands, every prime in (n, 2n] divides (2n)! but does not divide n!, hence Y 2n p ≤ ≤ 4n n n