ANALYTIC NUMBER THEORY NOTES
AARON LANDESMAN
1. INTRODUCTION Kannan Soundararajan taught a course (Math 249A) on Analytic Number Theory at Stanford in Fall 2017. These are my “live-TeXed“ notes from the course. Conventions are as follows: Each lecture gets its own “chapter,” and appears in the table of contents with the date. Of course, these notes are not a faithful representation of the course, either in the mathematics itself or in the quotes, jokes, and philo- sophical musings; in particular, the errors are my fault. By the same token, any virtues in the notes are to be credited to the lecturer and not the scribe. 1 Please email suggestions to [email protected]
1This introduction has been adapted from Akhil Matthew’s introduction to his notes, with his permission. 1 2 AARON LANDESMAN
2. 9/26/17 2.1. Overview. This will be somewhat of an introductory course in analytic methods, but more like a second introduction. We’ll assume familiarity with the prime number theorem, connecting contribu- tions of primes from zeros of the zeta function. You might look at the first half of Davenport’s book or so as a prerequisite. We’ll assume the students know how to prove there are infinitely many primes in arithmetic progressions. To get started, we’ll do the first four or five lectures proving Vino- gradov’s three prime theorem: Theorem 2.1 (Vinogradov). Every large odd number is the sum of three primes. When we say “large,” one can actually compute the bound explic- itly (i.e., it is effective). Remark 2.2. Helfgott, a few years back, made the bound accessible so that one could compute exactly which odd numbers were not ex- pressible as the sum of three primes. He showed something like all primes more than 7 could be written as the sum of three primes.
To start the proof, write N = p1 + p2 + p3, and we’ll count the number of ways to write N as such. In fact, we’ll consider
∑ Λ(n1)Λ(n2)Λ(n3) N=n1+n2+n3 where ( log p if n = pk Λ(n) := 0 else If we define √ Ψ(x) := ∑ Λ(n) = x + O(xe−c log x). n≤x This is equivalent to saying √ π(x) = li(x) + o(xe−c log x). Here Z x dt li(x) = . 2 log t x li(x) is about log x . ANALYTIC NUMBER THEORY NOTES 3
2.2. Heuristic of proof. A first guess is that there are about π(N) choices for each of p1, p2, p3. Their sum must add up to a given num- 1 ber N. The chance that p1 + p2 + p3 is exactly N is roughly N . Hence, the number of such ways is approximately N 3 1 N2 = . log N N (log N)3 We can also estimate 3 2 1+1/2 R3(N) := ∑ Λ(n1)Λ(n2)Λ(n3) ∼ N /N = N + O(N + log N) N=n1+n2+n3 where the error comes from contributions of powers of primes. 2.3. Hardy and Littlewood’s circle method. Let S(α) := ∑ Λ(n)e(nα) n≤N where e(x) = e2πix. Then, Z 1 Z 1 3 S(α) e(−Nα)dα = ∑ Λ(n1)Λ(n2)Λ(n3)e (n1 + n2 + n3 − N)α) dα 0 0 n1,n2,n3≤N = R3(N). To bound this, note that S(0) = Ψ(N) ∼ N. We’d like to bound it by about N2. Also, S(1/2) is pretty big because S(1/2) = (Λ(2) + λ(4) + ··· ) − ∑ Λ(n) n≤N,n odd because e(x) = −1 if x is odd. Then, for all 10−6 |λ| ≤ , N we have .99N. Then, Z S(α)3e(−Nα)dα ' 10−6N2. |α|≤10−6/N We could similarly make an argument in a small neighborhood of 1 2 . This gives an analytic reason that the number of representations might be on the scale of N2. So there are portions of the integral which give the correct answer. 4 AARON LANDESMAN
Exercise 2.3 (Waring’s problem). We want to know whether we can k k k write N = x1 + x2 + ··· + x3 (i.e., as a sum of four squares or nine cubes, etc.) (1) First, find a probabilistic guess for the number of such repre- sentations. (2) Use the circle method S Z 1 k ∑ e(n α) e(−Nα)dα. 0 1 n≤N k Then, find portions of the integrand that correspond to the right probabilistic answer. Returning to our integral for three primes, let’s think about when S(α) is big.
1 Example 2.4. Let’s try 3 . ∑ Λ(n)e(n/3). n≤N We have a contribution from powers of 3 which is about log N, so ∑ Λ(n)e(n/3) = O(log N) + e(1/3)Ψ(N; 3, 1) + e(2/3)Ψ(N; 3, 2) n≤N ∼ N/2. where N N Ψ(N; 3, 1) ∼ = . φ(3) 2 where Ψ(N; a, b) counts the number of primes up to N which is b mod a. Remark 2.5. Note that a S( ) q counts approximately the distribution of primes in progressions mod q with (a, q) = 1. Sometimes when q is small, since we’re only count primes coprime to q, we will get an answer substantially away from 0. We’ll later need to think through the uniformity of q in terms of N. ANALYTIC NUMBER THEORY NOTES 5
Remark 2.6 (Insight). S(α) is big near most rational numbers with small denominators. It’s not big near 1/4, so we’re only saying it’s big near certain ones. This might have something to do with whether the denominator of the rational number is square free: if it is not square free, you essentially get translates over roots of unity of that prime whose square divides q, and things cancel out Exercise 2.7. Show ∗ ak ∑ e( ) = µ(q), k mod q q where the ∗ means (k, q) = 1 and µ is the Mobius¨ function. Goal 2.8. If α is not near a rational number with small denominator then |S(α)| is small. To accomplish this, Hardy and Littlewood decided to split [0, 1] into two parts - major arcs M and minor arcs m. The major arcs are a close to q for q small, and the minor arcs are the rest. The measure of the minor arcs are big while the measure of the major arcs have small measure. That is, the minor arcs have nearly full measure. So, there is a very small set on which the generating function is big. There is also a big set on which the generating function is small. There is a trivial bound |S(α)| ≤ Ψ(N) ∼ N, using the triangle inequality. One can also work out Lemma 2.9. We have Z 1 |S(α)|2dα ∼ N log N. 0 Proof. Z 1 Z 1 2 |S(α)| dα = ∑ Λ(n1)Λ(n2) e ((n1 − n2)α) dα 0 0 n1,n2≤N = ∑ Λ(n)2 ≤ n N √ = ∑ log nΛ(n) + O( N log N) n≤N = N log N.
√ Exercise 2.10. Verify the above, where the O( N log N) difference is coming from prime powers. The idea for the last step is that most 6 AARON LANDESMAN numbers less than N are on the order of N. One might use partial summation, which is integration by parts. Usually, p |S(α)| ∼ N log N. If α is far from every rational number, such as the golden ratio, φ, 1 +ε we might try to compute S(φ). We might expect that S(φ) N 2 . We don’t know whether this is true, but we do know ∑ Λ(n)e(nφ) N1−δ n≤N for some δ > 0 (and it will probably even be a pretty large δ). We will now develop a technique saying that once you are far away from a rational number, you can get this sort of power saving.
2.4. Strategy for determining asymptotics for R3(N). We have the integral Z 1 Z Z S(α)3e(−Nα)dα = ( + )S(α)3e(−Nα)dα 0 M m We want the second part over the minor arcs to be smaller than N2. The idea is that we can bound ! Z Z 1 S(α)3e(−Nα)dα ≤ ∑ |S(α)| |S(α)|2dα m α∈m 0 (N log N) ∑ |S(α)|. m So, it is enough to have εN ∑ |S(α)| ≤ . α∈m log N This will show the contribution from the minor arcs is less than that of the major arcs, assuming we know the major arcs contribute N2.
Exercise 2.11 (Roth). For all δ > 0, there is N0 = N(δ) so that for all N ≥ N0, such that every subset A ⊂ [1, N] with |A | ≥ δN has a (nontrivial) three term arithmetic progression. Letting A (α) = ∑ e(aα), a∈A ANALYTIC NUMBER THEORY NOTES 7 we obtain Z 1 A (α)2A (−2α)dα 0 counts the number of triples (x, y, z) with x + z = 2y. This includes |A | trivial solutions, so we want to see this integral is larger. We might expect δ3N2 solutions. But now, it’s a bit hard to see how to actually bound this integral. Exercise 2.12 (Vague exercise). If, “away from 0,” |A (α)| ≤ ε|A | then the contribution of that portion of the integral Z 1 A (α)2A (−2α)dα 0 is bounded by ε|A |2. (We’d like to know something like ε ≤ δ/106.) The idea is that either we have this bound above, or else we get some additive structure in A which we exploit to get a bigger den- sity set. There are notes on this on Sound’s web-page from a course he taught on additive combinatorics. Now, we want to focus on showing that for some definition of the minor arcs, the sum S(α) has a little bit of cancellation. 2.5. Vinogradov’s method. Here is the key idea from Vinogradov’s method. This comes up many times throughout analytic number theory. We’d like to understand the sequence S(α) = ∑ Λ(n)e(nα). n≤N We could similarly study ∑ Λ(n)e( f (n)), n≤N √ √ where, say, f (n) is e( n) or e( n + (log n)2). We could similarly study √ ∑ e( n) n≤N or ∑ e(t log n) n≤N 8 AARON LANDESMAN for looking at 0’s of the zeta function. Let’s start with the simplest version of these, where instead of summing over primes and prime powers, we only sum over all the integers. Say we want to consider ∑ e(nα). n≤N This is a geometric progression, so it is easy to sum: e(α) (1 − e(Nα)) ∑ e(nα) = n≤N 1 − e(α) x = sin πα where x is bounded by 2, and the numerator is approximately sin πNα.
Exercise 2.13. Show
2 ∑ e(nα) ≤ min N, n≤N sin πα 1 min N, . ||α|| letting ||α|| denote the distance from the nearest integer. Let Φ be a smooth function. Then, ∑ Φ(n/N)e(nα) n is some smooth version of what we are trying to approximate. We might try to use the Poisson summation formula. We can write ∑ Φ(n/N)e(nα) = N ∑ φˆ(N(k + α)) n k and work out the Poisson summation formula. For φ smooth, the Fourier transform is rapidly decreasing.
3. 9/28/17 Recall last time we had
R3(N) := ∑ Λ(n1)Λ(n2)Λ(n3). n1+n2+n3=N The goal was to show this asymptotes to N2. We set S(α) = ∑ Λ(n)e(nα), n≤N ANALYTIC NUMBER THEORY NOTES 9 and found Z 1 3 R3(N) = S(α) e(−Nα)dα. 0 The idea is to show that S(α) is large only near rational numbers with small denominators (the minor arcs). On the complement, we want to show |S(α)| is small, and then bound Z Z 1 | S(α)3e(−Nα)dα| ≤ sup |S(α)| |S(α)|2dα. m α∈m 0 We then could use Parseval’s identity to bound this by N log N. Toward the end of last time, we found 1 ∑ e(nα) min(N, ). n≤N ||α||
Exercise 3.1. Count the number of ways of writing N = n1 + n2 + n3 asymptotically by writing down the associated integral using the circle method. The exponential sum will only be big for α near 0. There is only one major arc in this case. The answer should be about N2/2, and the point is to see where the 1/2 comes from.
Recall from elementary number theory that Λ(n) = ∑ab=n µ(a) log b. If we look at the Dirichlet series for ζ0 1 − (s) = · −ζ0(s) ζ ζ(s) 1 where the first term has Dirichlet series Λ(n), ζ has Dirichlet series µ and −ζ0(s) has Dirichlet series log. Then, the convolution of µ and log is Λ. Then,
Z Z N ! log ne(nα) = log td e(nα) − ∑ n≤N 1 n≤t Z N 1 = log N e(nα) − e(nα)dt ∑ − ∑ n≤N 1 t n≤t 1 (log N) min N, . ||α|| Then, ∑ Λ(n)e(nα) = ∑ µ(a) ∑ log be(abα). n≤N a b≤N/a 10 AARON LANDESMAN
Example 3.2. First, let’s try the case α = 0. Then, ∑ Λ(n) = ∑ µ(a) ∑ log b n≤N a b≤N/a
∞ µ(a) If we knew ∑a=1 a = 0 we could then prove the prime number theorem. This is essentially equivalent to proving the prime number theorem, so it would take some work. Things are good when a is small, but there is a problem when a is big. Goal 3.3. Our overall aim is to bound S(α). 3.1. Vinogradov’s idea. We’d like to somehow decompose Λ(n) into pieces, where either we use a simple exponential sum, or using the following idea. The idea has to do with bilinear forms. We notate m ∼ M meaning M < m ≤ 2M.
B(M, N) := ∑ ∑ ambn · f (m, n), m∼M n∼N with ai, bi arbitrary complex numbers, and f (m, n) is an oscillatory term, such as f (m, n) = e(mnα). Intuitively f (m, n) should have some “cancellation.” Goal 3.4. We’d like to bound the sum B(M, N) by something like !1/2 !1/2 2 2 ∑ |am| ∑ |bn| · Nf m∼M n∼N where g is some sort of operator norm of the matrix f (m, n). (1) We think of f (m, n) as something that cancels out. It does not always have the same sign. (2) We typically have | f (m, n)| small, e.g., ≤ 1. (3) We might also imagine am, bn ≤ 1. We’d then like to compare the bound we obtain to the trivial bound MN. We’d like to beat this trivial bound. This will be impossible to bound if (1) f (m, n) = 1. (2) f (m, n) = α(m)β(n). (3) Both M and N are big (or at least the associated matrix has large rank). In order to avoid these impossibilities, we will need to exploit that f (m, n) is genuinely a 2-variable function, and does not decouple. ANALYTIC NUMBER THEORY NOTES 11
To obtain the bound, we will use Cauchy-Schwarz. We have 2 ! 2
2 ∑ ∑ ambn f (m, n) ≤ ∑ |am| ∑ ∑ bn f (m, n) . m∼M n∼N m∼M m∼M n∼N Let ∗ denote 2
∑ ∑ bn f (m, n) m∼M n∼N Then, 2
∗ = ∑ ∑ bn f (m, n) m∼M n∼N ≤ ( ) ( ) ∑ bn1 bn2 ∑ f m, n1 f m, n2 . n1,n2∼N m∼M
From this we have gained that we have replaced the unknown am, bn by inner products of our known matrix f (m, n). If we knew f (m, n) were orthogonal, then the sum amounts to terms with n1 = n2 of the form 2 ∑ |bn| M. n∼N Things will never be quite so good that we will precisely get orthog- onality. But, we might have some approximate orthogonality. For example, if n1 6= n2, maybe we can bound the correlation by 1. Then, the off-diagonal terms are of the form | ∑ bn1 bn2 . n16=n2 Using Cauchy’s inequality, we obtain 2 2 |bn1 bn2 | |bn1 | + |bn2 | . Hence, | | |2 ∑ bn1 bn2 N ∑ bn . n16=n2 n∼N In the above favorable circumstances, putting the above together, we get a bound !1/2 !1/2 √ √ 2 2 ∑ ∑ ambn f (m, n) ∑ |am| ∑ |bn| M + N m∼M n∼N m∼M n∼N 12 AARON LANDESMAN
We might√ now√ try setting all |an| = |bn| = 1, and then our bound is M N + N M instead of MN so we save √1 + √1 . Again, this M N bound holds under various assumptions that the f (m, n) are approx- imately orthogonal. This is the key strategy. We now want to implement the above strategy in the situation we are in. The key point of the strategy is that we have transferred the problem from understanding the un- known an, bm to the known problem of understanding the correlation of f (m, n).
3.2. Applying Vinogradov’s idea. We now want
∑ ∑ ambne(mnα). m∼M n∼N
Thinking of the am as µ(a) and the bn as µ(n). We then want to bound
∗ = |b b | e (m(n − n ) ) ∑ n1 n2 ∑ 1 2 α . n1,n2∼N m∼M
Suppose we write n1 − n2 =: k. Then |k| ≤ N. We then have 1 ∗ | |2 + | |2 ∑ bn1 bn2 min M, ||(n1 − n2)α|| n1,n2∼N ! 1 |b |2 min M, . ∑ n1 ∑ ||kα|| n1∼N |k|≤N We conclude !1/2 1/2 1/2 2 2 1 ambne(mnα) ≤ |am| |bn| min M, . ∑ ∑ ∑ ∑ ||k || m,n n |k|≤N α
We’d like to show we get something from this if α is not close to a rational number with small denominator. So we keep in mind that α might be irrational. We start with Dirichlet’s theorem: Theorem 3.5 (Dirichlet). For all Q > 1 and all α ∈ R, there exists a rational number a/q with (a, q) = 1 and q ≤ Q so that
a 1 α − < . q qQ ANALYTIC NUMBER THEORY NOTES 13
So, we can get pretty good approximations to irrational numbers with small denominators. A crude version of this is
a 1 α − ≤ . q q2 We can get approximations of this type by continued fraction expan- sions. Let (∗) denote 1 (∗) := min M, ∑ ||k || |k|≤N α We should expect that if q is small, then we might revert to the trivial bound MN. Perhaps there is some inverse relationship√ with q. So maybe we get something like MN/q or MN/ q. So, the larger q gets, the more saving we should get over the trivial bound. So, very small values of q are not good, but we’d like to show that if q is in some intermediate range, we might hope to be in a good situation. So, the bound we will write down will depend on the Diophantine properties of α and the scale on which we are operating. So, assume α has the rational approximation given by Dirichlet’s theorem satisfying
a 1 α − ≤ . q q2 Split the interval from m to n of length k into several intervals of length q. How do |kα| vary on this interval - there is at most one value which is very close to an integer. Then, we have q ∑ min M, M + q log q. 0≤a≤q a The log q is unimportant and we can remove it if we’d like, using Poisson summation if we had a smooth function. It would then be min(M, q/a2) and we could remove the log. We now want to bound the following by dividing N into N/q + 1 intervals of length q. 1 (∗) = min M, ∑ ||k || |k|≤N α = (N/q + 1)(M + q log q) log q (M + q)(N + q) . q 14 AARON LANDESMAN
We have proven: | − a < 1 ( ) = Proposition 3.6. If α q q2 and a, q 1, then
∑ ∑ ambne(mnα) m∼M n∼N 1/2 1/2 1/2 log q √ p p |a |2 |b |2 MN + Mq + Nq + q . ∑ n ∑ n q Question 3.7. Why is the above bound useful? Just to summarize, this might be helpful to think of the case that the ai, bj are bounded in norm by 1. We then get approximately a bound by √ MN √ √ √ √ MN 1 1 √ √ √ MN + q M + N + q = √ + MN √ + √ + q MN. q q M N The middle term is what we would get from the orthogonality re- lation. If q is small or large, we don’t beat the trivial bound, as ex- pected. This is the crucial bound for our particular bilinear form. Next time, we’ll try and rewrite the coefficients as a bilinear form as a function of multiple summands. The final ingredient is to write a combinatorial identity to express Λ(n) in terms of things we un- derstand. That is, we want to write it as something like
e(bα) ∑ (log b) + ∑ ambne(mnα). m,n where both m and n are large in the second sum.
4. 10/3/17 4.1. Review. Last time, we were trying to bound sums like ! 2
2 2 | ∑ ∑ f (m, n)| ≤ ∑ |am| ∑ ∑ bn f (m, n) m n m m n
= |b b | f (m n ) f (m n ) ∑ n1 n2 ∑ , 1 , 2 n1,n2 m
= |b |2 + |b |2 f (m n ) f (m n ) ∑ n1 n2 ∑ , 1 , 2 , n1,n2 m ANALYTIC NUMBER THEORY NOTES 15
and we hoped that the correlations, i.e., the terms ∑m f (m, n1) f (m, n2) were bounded by M if n1 = n2 and O(1) if n1 6= n2. We then could 2 bound the above by (M + N) ∑i |bn| . Recall last time, we were trying to bound sums like
∑ ∑ ambne(mnα) m∼M n∼N where m ∼ M means M ≤ m ≤ 2M. We had
a 1 α − ≤ , q q2 with (a, q) = 1. We proved last time
∑ ∑ ambne(mnα) m∼M n∼N !1/2 !1/2 √ √ √ log q 2 2 √ √ ∑ |am| ∑ |bn| MN + q M + N + q . q m n We had Λ(n) = ∑ µ(a) log b S(α) = ∑ Λ(n)e(nα). n≤N Our tools are (1) ∑ log ne(nα) n≤x is some sort of geometric progression which after factoring out a log, which we understand (2)
∑ ∑ ambne(mnα) m∼M n∼N which is well bounded using Vinogradov’s method discussed last time (and above this lecture) assuming the covariances are small for off-diagonal terms and on the order of the num- ber of elements for the diagonal terms. Our goal is now the following combinatorial one: Goal 4.1. Write Λ(n) in a form where we can use the above two tools. 16 AARON LANDESMAN
Theorem 4.2 (Vaughan’s identity). We have
∞ Λ(n) ζ0 = − ( ) ∑ s s n=1 n ζ 1 µ(a) = ∑ s ζ(s) a a log b − 0( ) = ζ s ∑ s . b b Proof. This follows from straightforward manipulations of Dirichlet series, the first one comes from the derivative of log ζ(s).
4.2. Mollifying ζ(s). We now want to Mollify ζ(s). For this, we will use Selberg’s sieve. One way to study the zeta function could be an appropriate trun- cation. We may consider µ(n) ( ) = M s ∑ s . n≤U n which is a sort of approximation to the inverse of the Riemann ζ function, using the above identity that 1 µ(a) = ∑ s ζ(s) a a
One can compute,
∞ a(n) ( ) ( ) = ζ s M s ∑ s . n=1 n where a(n) is defined by a(n) = ∑ µ(d) d|n,d≤U 1 if n = 1 = 0 if 1 < n ≤ U the norm is bounded by d(n) if n > U where d(n) is the number of divisors of n. ANALYTIC NUMBER THEORY NOTES 17
We have ζ0 ζ0 − (s) = − (s) (1 − ζM + ζM) ζ ζ ζ0 = −ζ(s)M(s) − (s) (1 − ζM(s)) . ζ First, we should be fairly happy with the term −ζ(s)M(s) which has Dirichlet series given by ! log b M(n) ∑ s ∑ s , b n≤U n and we’ll have a long sum in the b’s, where we can hope to get some cancellation. Next, to understand ζ0 (s) (1 − ζM(s)) . ζ We try to think of this product as a sort of bilinear form. The terms from 1 − ζM(s) only matter for n larger than U. Thinking of this term as a bilinear term, we’re happy because 1 − ζM(s) is large. But, we have to ensure that ζ0/ζ is not too “skinny.” To deal with this, we can subtract out the small primes, and then later add them back. To accomplish this, we define Λ(n) ( ) = P s : ∑ s . n≤V n Then, ζ0 ζ0 − (s) = − (s) − P(s) + P(s) ζ ζ Λ(n) Λ(n) = + ∑ s ∑ s n>V n n≤V n
Then, ζ0 ζ0 − (s) = −ζ0(s)M(s) − (s) (1 − ζM(s)) ζ ζ −ζ0 = −ζ0(s)M(s) + (s) − P(s) (1 − ζM(s)) + P(s) (1 − ζM(s)) ζ −ζ0 = −ζ0(s)M(s) + (s) − P(s) (1 − ζM(s)) + P(s) − ζ(s)M(s)P(s) ζ 18 AARON LANDESMAN
The point of this breakdown is that we now have three terms we can handle using our two tools we have. The middle term decomposes into two parts, both of which are big, which gives a bilinear form. The last term has a long sum from the ζ(s) term in simple coeffi- cients. For P(s), we can just ignore it because V is small. The first term is similarly a long some from the ζ0 term. Remark 4.3. The first term which we handle via our first summation technique is called a “type 1 sum” and the second handled via our second bilinear form summation technique is called a “type 2 sum.” Example 4.4. Let’s say we want to write −ζ0 −ζ0 (s) = (s) (1 − ζM)2 + 2ζM − ζ2 M2 ζ ζ ζ0 = − + ζ0 M (1 − ζM) − 2ζ0 M + ζζ0 M2. ζ The first is a type 2 sum, the second is a type 1. The ζ and ζ0 are both somewhat a simple divisor function because if the product of ζ, ζ0 goes in a long range then at least one of them must be summed in a long range. So, the third term is also a type 1 sum. Remark 4.5 (Heath Brown identity, aka binomial theorem). Given −ζ0 (s) (1 − ζM)k ζ one can try expanding this in k via the binomial theorem, and try to bound various terms. 4.3. Proving Vinogradov’s theorem. Recall we have
a 1 α − ≤ . q q2 with (α, q) = 1. Our goal is to bound S(α) in terms of q. Trivially we know S(α) is bounded by N, and we want to save a bit more than one log on the minor arcs, and then we’ll have to concentrate on the major arcs. Using Vaughan’s identity, (where we have not yet specified U and V). There are three type 1 sums and one type 2 sum (from the bilinear form. Recall we have ζ0 −ζ0 − (s) = −ζ0(s)M(s) + (s) − P(s) (1 − ζM(s)) + P(s) − ζ(s)M(s)P(s) ζ ζ ANALYTIC NUMBER THEORY NOTES 19 and we are trying to bound the four sums. First we deal with the P(s) term, which is
∑ Λ(n)e(nα) V. n≤V
Next, we try to bound the first term, which is the contribution from primes coming from ζ0 M. This term is
N 1 µ(n) log re(nrα) min , . ∑ ∑ ∑ || || n≤U r≤N/n n≤U n nα
It is convenient to split over dyadic blocks 2k < n ≤ 2k+1.
Exercise 4.6. Carry out the argument from week 1 for dealing with sums like 1 min N, . ∑ || || |n|≤N nα
There is a small lie in what we will next do, and your job is to fix it by Thursday. You should check what happens for smaller n as well.
Pretending that only the large range matters, we can bound
N 1 µ(n) log re(nrα) min , ∑ ∑ ∑ || || n≤U r≤N/n n≤U n nα U N (log N) + 1 + q log q q U N N (log N)2 + U + + q q U
Now, we’ll aim to attack the last type 1 sum, which is the term corresponding to ζ(s)M(s)P(s) which is ∑ ∑ µ(n)Λ(`) ∑ e(n`rα) n≤U `≤V r≤N/n` if we let n` = a then a ≤ UV. Then, the terms in a are bounded by something like ∑n`=a Λ(`) = log a (using that the left hand side is the convolution of ζ with ζ0/ζ which is ζ0 which has coefficients 20 AARON LANDESMAN given by log. Therefore,
∑ ∑ µ(n)Λ(`) ∑ e(n`rα) ∑ log a ∑ e (arα) n≤U `≤V r≤N/n` a≤UV r≤N/a N 1 (log N) ∑ min , a≤UV a ||αa|| Nq N (log N)2 + q + UV + . q UV
Exercise 4.7. Verify the above bounds using a method similar to the type 2 bound of the first ζ0 M term. Adding up our three type one sums, and removing terms trivially bounded by others, we get N N (log N)2 + q + UV + . q U This handles three of the four terms. The last term remaining to be handled is the type 2 sum corresponding to −ζ0 (s) − P(s) (1 − ζM(s)) ζ The first sum only contains terms larger than V and the second only contains terms larger than U. Using 1 1 − ζM(s) = µ(d) ∑ ns ∑ n d|n,d>U we obtain the sum ∑ Λ(n) ∑ ∑ µ(s)e(mnα) n>V m>U d|m,d>U with mn ≤ N. Remark 4.8. We now have two terms with variables in our bilinear form, both with large values. Both will range over dyadic intervals. It starts to look like a bilinear form, though there is the caveat that the two variables are connected by the condition that mn ≤ N. Hence, we need some technical device to separate the variables. Essentially, this is saying these are like points lying below a hy- perbola and we would like to approximate the hyperbola by some rectangle. ANALYTIC NUMBER THEORY NOTES 21
Morally, ∑ Λ(n) ∑ ∑ µ(s)e(mnα) n>V m>U d|m,d>U with mn ≤ N. Ignoring the condition mn ≤ N, (which we will fix next time) the above sum is approximated by ∑ ∑ a(m)b(n)e(mnα). m∼A n∼B for A > U, B > V, AB ≤ N. This is the kind of bilinear form we want for our type 2 sum. We can now use the bilinear form estimate from type 2, we get the estimate !1/2 !1/2 log q √ √ √ √ ∑ a(m)2 ∑ b(n)2 √ AB + A + B q + q . m∼A n∼B q Note that the correlation is as needed because we have checked it for the particular bilinear form ambne(mnα). Here, bn = Λ(n). Then, ∑ Λ(n)2 B log B. n∼B Next, ∑ a(m)2 ∑ d(m)2 m∼A m∼A
Exercise 4.9. Show ∑ d(n) ∼ x log x. n≤x
(write this as ∑n≤x ∑d|n 1 and interchange the two sums).
2 3 It turns out ∑n≤x d(n) ∼ Cx (log x) . We end up getting a bound from ∑ a(m)2 ∑ d(m)2 Cx (log x)3 . m∼A m∼A So, we have some loose ends which we shall address next time including (1) thinking about these sum of divisor functions up to x, (2) thinking through the type one bounds for the first and fourth terms, (3) and putting these things all together. 22 AARON LANDESMAN
5. 10/5/17 5.1. Recap of last time. Recall that we have defined S(α) := ∑ Λ(n)e(n). n≤N Our goal is to bound these exponential sums. We assume
a 1 α − ≤ q q2 for (a, q) = 1. We had a way of approaching this bound with expo- nential sums and bilinear forms. We used the combinatorial identity −ζ0 −ζ0 (s) = P(s) + ζ0(s)M(s) − ζ(s)M(s)P(s) + (s) − P(s) (1 − ζ(s)M(s)) ζ ζ where Λ(n) ( ) = P s ∑ s . n≤V n and µ(n) ( ) = M s ∑ s . n≤U n The first three terms are type 1 sums, and the last term is a type 2 sum. Last time, we discussed the bound for the type 1 sums. We saw they were bounded by N N (log N)2 + q + UV + . q U For example, to bound the term ζ0(s)M(s), we had to bound N 1 ∑ min , . n≤U n ||nα|| We could split this into dyadic blocks and carry out the usual sum. When 1 ≤ n ≤ q, we cannot split it into intervals of length q. Exercise 5.1. For the blocks, we should take a dyadic sum over inter- vals 2kq ≤ n ≤ 2k+1q, and then we should pay attention to the case 1 ≤ n ≤ q, and we should get a bound around q log q or something like that for the sum of the first q terms. ANALYTIC NUMBER THEORY NOTES 23
At the end of last class, we were discussing the type 2 sums. There were many small things we needed to keep track of. We wrote the sum as ∑ Λ(n) ∑ ∑ µ(d) e(mnα) n>V m>U,mn≤N d|m,d>U Last time, we divided this sum into dyadic intervals with m ∼ A and n ∼ B. Remark 5.2. We have to justify why the sum can be split the sum into dyadic blocks subject to the condition that mn ≤ N. We then bounded the above by !1/2 !1/2 log q √ √ √ √ ∑ d(m)2 ∑ Λ(n)2 √ AB + A + B q + q m∼A n∼B q !1/2 log q √ √ √ √ ∑ d(m)2 (B log B)1/2 √ AB + A + B q + q m∼A q 5.2. Bounding the sum of the divisor function. We can see ∑ d(n) = ∑ ∑ 1 n≤x a≤x b≤x/a x = ∑ ( + O(1)) a≤x a = x log x + O(x). This is a wasteful O(1) when a is small. Dirichlet’s idea was to deal with the hyperbola ab = x and count b ≤ B and a ≤ A. One could count points a certain portion of the hyperbola based on whether A or B is smaller on the outside or inside. When one carries this out, one gets an error term on the√ size of A + B, instead of x (with A + B = x. One can take A √= B = x. One ends up getting an error of x log x + (2γ − 1) x + O( x). Exercise 5.3. Carry out Dirichlet’s idea and check this error term. Then, we can compute
dk(n) = ∑ 1 a1···ak=n = ∑ dk−1(b), ab=n 24 AARON LANDESMAN and use induction. If we knew ∑b≤y dk−1(b), we could then use the hyperbola method for a ≤ A, ab ≤ B and choose the parameters A, B with AB = x. 5.3. A second method for bounding the sum of the divisor func- tion. We now want a second method of calculating this. We are try- ing to bound d(n) ζ(s)2 = . ∑ ns We have 1 Z c+i∞ xs ∑ d(n) = ζ(s)2 ds n≤x 2πi c−i∞ s for c > 1. Exercise 5.4. Show ( 1 Z ds 1 if y > 1 ys = 2πi (c) s 0 if y < 1 (see davenport’s book) where the path (c) means that from c − i∞ to c + i∞. Essentially, one can prove this by noting the integral is 0 for very small y, and there is only one pole at y = 1 which has residue 1. When we expand xs = x (1 + (s − 1) log x + ··· ) and 1 ζ(s) = + γ + O(s − 1). s − 1 The residue of the pole at s = 1 is x log x + (2γ − 1) x. It would be useful, and can be done easily, to have some bounds for |ζ(s)| (1 + |t|)1/2 where s = σ + iτ, 0 ≤ σ ≤ 1. We are trying to bound 1 Z xs ζ(s)2 ds 2πi (c) s 1 Z c+iT xs xc = ζ(s)2 ds + O( ). 2πi c−iT s T ANALYTIC NUMBER THEORY NOTES 25
We can then try to bound this integral by something like x log x + (2γ − 1) x, similarly to the way done in Davenport’s book. In potential hope of formalizing this method, we are trying to we want to bound
∑ dk(n), n≤x and we consider ∞ d (n) ( )k = k ζ s ∑ s . n=1 n We then can compute these by examining 1 Z xs ζ(s)kds. 2πi (c) s So, we want to know vaguely what the residue of this integral at s = 1. The residue is something like
x (log x)k (k − 1)! with some lower order terms we can work out. The main term will be a polynomial of log x of degree k − 1. Exercise 5.5. Show we end up getting a residue of the form
1−δk xPk(log x) + O(x ) for Pk a degree k − 1 polynomial. Remark 5.6. Gauss should that the number of lattice points in a circle of radius R is N(R) = πR2 + O(R1/2+ε).
(the best currently known is only error R2/3−δ Dirichlet’s divisor problem is to show ∑ d(n) = x log x + (2γ − 1) x + O(x1/4+ε). n≤x In both cases, the main term is the area of the region you are consid- ering. The best error known is only about O(x1/3−δ). 26 AARON LANDESMAN
5.4. Calculating the sum of squares of the divisor function. Now, we’d like to calculate ∑ d(n)2. n≤x We will instead calculate ∞ d(n)2 4 9 = + + + ··· ∑ s ∏ 1 s 2s . n=1 n p p p Note that this will converge absolutely whenever we are to the right of 1. We have 1 1 ( ) = + + + ··· ζ s ∏ 1 s 2s . p p p We can approximate ∞ d(n)2 4 9 = + + + ··· ∑ s ∏ 1 s 2s n=1 n p p p = ζ(s)4F(s) where α ( ) = + + ··· F s ∏ 1 2s p p for some α, which converges absolutely if Re(s) > 1/2. One then obtains that ∑ d(n)2 ∼ Cx (log x)3 , n≤x 4 using the bound for ζ as xPk(log x) with Pk of degree k − 1. with an asymptotic power saving error term. We could similarly use the hyperbola method to approximate 2 d(n) = (d4 ∗ f ) (n) for a multiplicative function f with f (p) = 0. Remark 5.7. When one actually calculates what d(n)2 ∑ s n n one might find something like ζ(s)4 ζ(2s) ANALYTIC NUMBER THEORY NOTES 27 although this identity is not relevant to finding the correct asymp- totic formula. Exercise 5.8 (Fun exercise!). Let a(n) be the number of abelian groups of order n. First make a guess for ∑ a(n) n≤x asymptotically. Then compute the asymptotics. Hint: Use the iden- tity for the partition function ∞ − ∑ p(n)xn = ∏ (1 − xn) 1 n=0 to get the constant in the asymptotics, which ends up being some- thing like ζ(2) · ζ(3) ··· . Remark 5.9. One might also try computing
∑ dπ(n) n≤x
∑ di(n) n≤x π where dπ(n) are the coefficients of the Dirichlet series of ζ(s) . Re- latedly, one might try to count ∑ 1, n≤x,n=a2+b2 and one can work out 1/2 1/4 ∑ 1 = ζ(s) L(s, χ−4) F(s), n=a2+b2 where f (s) is regular to the left of 1. It’s not completely obvious how these functions continue analyti- cally. We could make sense of ζ(s) = exp (π log ζ(s)) , which makes sense to the right of 1. But, it also can be extended to regions where there are no zeros or poles of the ζ function. If we understand the zero-free region of the zeta function, then we can make sense of this function in this zero-free region. In the region c γ > 1 − , log T 28 AARON LANDESMAN
ζ(s) 6= 0 as shown in davenport. It turns out this function has a singularity which is not a pole (nor essential nor removable) and it turns out to be something like − x (log x)π 1 Γ(π) for the function dπ(n). This idea is called the Selberg, Delange method (or in a paper to- day on arXiv, the LSD method). Remark 5.10. We only wanted an upper bound for ∑ d(n)2. n≤x We didn’t need an asymptotic. In analytic number theory, this is called Rankin’s method. We can bound d(n)2 ( )2 ≤ α ∑ d n x ∑ α n≤x n≤x n ∞ d(n)2 ≤ α x ∑ α n=1 n 4 = α + + ··· x ∏ 1 α p p Then, we want to optimize to choose the best α. Making α close to 1, the product blows up and xα gets small. From calculus, there will be some choice of α which minimizes this product. 1 α For example, if you guess α = 1 + log x , you find x is about x and 4 1 4 + + ··· ∼ + ∏ 1 α ζ 1 . p p log x
This yields x (log x)4 as a bound, and you are only off by one log. Exercise 5.11. Verify this. Exercise 5.12. Let p(n) denote the number of partitions of n. Prove that √ p(n) ≤ exp π 2/3n .
Moreover, find the optimal constant so that p(n) ≤ eαn. (Sound thinks the constant above is optimal). ANALYTIC NUMBER THEORY NOTES 29
Hardy and Ramanujan found √ exp π 2/3n p(n) ∼ √ . 4n 3 Hint: Show ∞ − ∑ ∏ (1 − xn) 1 n=1 Then, − p(N) ≤ ∏ (1 − xn) 1 x−N. Exercise 5.13 (Fun mathoverflow problem). Let N be a parameter. How many subsets S ⊂ [1, N] are there so that 1 ∑ < 1. s∈S s Obviously the answer is ≤ 2N, and the exercise is to find a better bound. Hint: This is not an application of what we’ve discussed, but it is an application of the ideas we’ve discussed. 5.5. Returning to our type 2 sum. Recall we had A > U, B > V. We can now bound !1/2 !1/2 log q √ √ √ √ ∑ d(m)2 ∑ Λ(n)2 √ AB + A + B q + q m∼A n∼B q 1/2 log q √ √ √ √ A(log A)3 (B log B)1/2 √ AB + A + B q + q q √ √ 3 AB p (log N) √ + A B + B A + qAB q 3 N N N p (log N) √ + √ + √ + qN . q V U using for the last step that AB ≤ N. We are doing well here because both variable A and B vary only in long ranges (i.e., U and V are reasonably large). We then have to add the error from the type 1 sum which was N N (log N)2 + q + UV + . q U 30 AARON LANDESMAN
Adding these together, we get 3 N N N p S(α) (log N) √ + √ + √ + qN + UV . q V U
By symmetry, we may as√ well choose U = V, and so we should optimize by choosing N/ U = U2. Hence, U = N2/5. Then, one obtains 3 N p S(α) (log N) √ + qN + N4/5 . q There is one small caveat, where we must ensure how to separate the variables m and n subject to the condition mn ≤ N. We’ll have to finish this next time. Believing this for the moment, we’ve proven. So, if N > q > (log N)10 . (log N)10 So, as long as we can approximate α by some rational q in this range, we get a good bound. These will be called the “minor arcs.” Theorem 5.14. Let φ be the golden ration. Then, ∑ Λ(n)e(nφ) N4/5 (log N)3 . n≤N √ Proof. We can plug in q to be around N using Fibonacci number approximations plugged into the above formula, and then the bound 3 is (log N) N4/5. Remark 5.15. The bound also works well for bounding things like ∑ Λ(n)ein n≤N using that
1 a C − ≥ , π q q20 so one can always find a pretty good approximation to π. So, we get a bound of about ∑ Λ(n)ein N.99 n≤N ANALYTIC NUMBER THEORY NOTES 31
6. 10/10/17 6.1. Exercises and questions. Last time, we let q be a number with
a 1 α − < q q2
3 N p ∑ Λ(n)e(nα) (log N) √ + qN + N4/5 . n≤N q Exercise 6.1. Let φ be the Euler totient function, let ! φ(n)k ∑ n≤N n Find asymptotics for this. Why might these asymptotics be interest- ing. 6.2. Recapping what we have seen in the Proof of Vinogradov’s theorem. Last time, we had some sum in terms of m ∼ A, n ∼ B with a condition mn ≤ N. We want to separate m and n. We have ( 1 Z c+i∞ N s ds 1 if mn ≤ N = 2πi c−i∞ mn s 0 if mn > N When we plug this into our bilinear form
∑ ∑ f1 f2e(mnα), m∼A n∼B
(for appropriate f1, f2) we get Z c+i∞ 1 f1 f2e(mnα) s ds ∑ ∑ s s N . 2πi c−i∞ m∼A n∼B m n s This separates the variables at the cost of log N when we integrate ds s . Question 6.2 (Possibly open question). We have ∑ Λ(n)e(nφ) N4/5+ε, n≤N for φ the golden ratio. Can one say something better? Presumably the right answer is N1/2, though that may be hard. Maybe one could show something like N3/4. The key is that we have rational approx- imations at every scale. 32 AARON LANDESMAN
Recapping what we have done so far, we were trying to bound Z 1 S(α)3e(−Nα)dα. 0 We split this up into major and minor arcs. On the minor arcs, we bounded this by Z | S(α)3e(−Nα)dα| < max |S(α)| N log N. m m We expect a main term on the order of N2. We have a good bound on S(α) so long as N ≥ q ≥ (log N)10 (log N)10 The minor arcs will be all points which satisfy an approximation of this type, and the major arcs will be all points which do not satisfy an approximation of this type. Let Q = N . By Dirichlet’s theorem, for all α ∈ (0, 1), there (log N)10 exists (a, q) = 1, q ≤ Q and
a 1 1 α − ≤ ≤ . q qQ q2
Definition 6.3. We say α ∈ m (in a minor arc) if there exists such an approximation with
q ≥ (log N)10 . Otherwise, there exists α ∈ M (in a major arc). That is,
a 1 α − ≤ q qQ with q < (log N)10. The major arcs M are disjoint. The total measure of the major arcs is
φ(q)2 C (log N)10 |M| = ∼ , ∑ qQ Q q≤(log N)10 which is roughly (log N)20/N. ANALYTIC NUMBER THEORY NOTES 33
We now wish to understand S(α) for α on a major arc. Let α = a 1 a q + β for q small, |β| ≤ qQ . The idea is to understand S( q ). Let’s instead try to understand an ∑ Λ(n) exp . n≤x q 6.3. Riemann hypothesis and counting primes. To start, let us re- call what the Riemann hypothesis says about counting the number of primes up to x. Let Ψ(x) be the number of primes up to x. It implies Ψ(x) = x + O x1/2+ε .
If one further assumes the generalized Riemann hypothesis, one finds x Ψ (x; q, a) = + O(x1/2+ε). φ(q) Further, the constant in O(x1/2+ε) is independent of q. In particular, this means φ(q) ∼ q, Thus, we have a nice asymptotic for q ≤ x1/2−ε.
Conjecture 6.4 (Montgomery). We have ! x x1/2+ε Ψ(x; q, a) = + O √ . φ(q) q
Plugging in the generalized Riemann hypothesis, we get na ∗ ak ∑ Λ(n) exp = ∑ ∑ Λ(n) exp n≤x q k mod q n≡k mod q,n≤x q ∗ ak x = exp + O(x1/2+ε) ∑ ( ) k mod q q φ q µ(q) = x + O(qx1/2+ε) φ(q) where the superscript ∗ again means k is coprime to q and we are using
∗ k Exercise 6.5. Show ∑k mod q exp q = µ(q). Now, suppose (n, q) = 1 and we want to express exp (n/q) in terms of characters χ mod q. 34 AARON LANDESMAN
× Letting χ0 denote the identity on (Z/qZ) We can consider ∗ k ∗ k 1 exp χ = exp χ(k)χ(n) ∑ 0 ∑ ( ) ∑ k mod q q k mod q q φ q χ mod q 1 = χ(n)τ(χ), ( ) ∑ φ q χ mod q where k τ(χ) = ∑ χ(k) exp . k mod q q Then, an 1 Λ(n) exp = τ(χ) Λ(n)χ(an). ∑ ( ) ∑ ∑ n≤x q φ q χ mod q n≤x Define ψ(x; χ) := ∑ Λ(n)χ(n). n≤x Then, Ψ (x; q, a) = ∑ Λ(x) n≤x,n≡a mod q 1 = χ(a)Ψ(x, χ). ( ) ∑ φ q χ mod q The generalized Riemann hypothesis (GRH) is essentially the state- ment that for χ = χ0, we have Ψ(x, χ) = Ψ(x) up to a small error (which is just the usual Riemann hypothesis) and for χ 6= χ0, we have |Ψ(x, χ)| = O(x1/2+ε).
In the case χ = χ0, we get the main term with ∗ k τ(χ0) = ∑ exp = µ(q). k mod q q So, the main term is µ(q) Ψ(x). φ(q) ANALYTIC NUMBER THEORY NOTES 35
Exercise 6.6 (A bit tricky, perhaps). Using orthogonality of charac- ters show that if χ is primitive√ modq (meaning not having a pe- riod dividing q) then |τ(χ)| = q. Hint: See Davenport’s section on Gauss sums. Plugging this in the above formulas and the GRH bounds, we see a refined GRH bound an µ(q) √ ∑ Λ(n) exp = x + O x1/2+ε + O qx1/2+ε . n≤x q φ(q) So, compared to our previous error bound with O(qx1/2+ε) error, we √ only get O( qx1/2+ε). We are not assuming GRH, rather we want an unconditional proof, so the above discussion assuming GRH can now be ignored. We have an 1 Λ(n) exp = χ(a)τ(χ)ψ(x, χ) ∑ ( ) ∑ n≤x q φ q χ mod q √ ! µ(q) q = Ψ(x, χ ) + O |Ψ(x, χ)| . φ(q) 0 φ(q) ∑ χ6=χ0 We have 2 Ψ (x, χ0) = Ψ(x) + O (log x) . From the prime number theorem, we have p Ψ(x) = x + O x exp −c log x for some c > 0. The key step in the proof of this is that the region > − c ( ) = σ 1 log 2+|t| has no zeros of the zeta function ζ s , where s σ + it. We therefore get a bound 2 Ψ (x, χ0) = Ψ(x) + O (log x) xρ x ∼ x − + O . ∑ ρ T |ρ|≤T In the best case, we might have p Ψ(x; χ) x exp −c log x , p for χ 6= χ0 mod q and q ≤ exp log x . 36 AARON LANDESMAN
The conclusion is that if p q ≤ exp c log x for c small, then an µ(q) p ∑ Λ(n) exp = x + O x exp −c log x . n≤x q φ(q) In a similar way, one would like to show x p ψ(x; q, a) = + O x exp −c log x . φ(q) The short version of the story is that we can basically do this, but with one important caveat, which is called a Landau-Siegel zero.
6.4. Siegel Zeros. We want to understand Ψ(x; χ). If χ 6= χ0, we have ! xρχ x (log x)2 Ψ(x, χ) = − ∑ + O ρχ T ρχ,|ρχ|≤T where ρχ are the zeros of ∞ χ(n) ( ) = L s, χ : ∑ s . n=1 n One can find proofs of all of these things in Davenport. If χ is primitive then L(s, χ) has a functional equation of the form q s/2 s + α Γ L(s, χ) π 2 where α is either 0 or 1 depending on whether χ(−1) = 1, (so α = 0) or χ(−1) = −1 (so α = 1). This yields the volume at 1 − s. One can count the number of zeros of ζ(s) or L(s, χ) up to height T, which is approximately T log qT . 2π It is also useful to know the Hadamard factorization, which, once you know this is an order 1 function, tells you this has a factorization in terms of its zeros. That is, ! s L(s, χ) = ∏ 1 − es/ρ eA+Bs. ρ ρ So the sum of the reciprocals of the squares of the 0’s converge, but possibly not the sum of the reciprocals of the 0’s. ANALYTIC NUMBER THEORY NOTES 37
For the zeta function we have, ξ(s) = s(s − 1)π−s/2Γ(s/2)ζ(s). which kills the pole at s = 1. This satisfies the functional equation ξ(s) = ξ(1 − s). The main difference between the L functions and the ζ function, we will need to know something about the zero free region. For ζ(s), there is a zero free region of the form c σ > 1 − , log 2 + |t| with σ = im s. We want c σ > 1 − , q(log 2 + |t|) is free of zeros of L(s, χ). This would imply p Ψ(x, χ) = O x exp −c log x p for q ≤ exp c log x. This holds if χ is a complex character modq. But for quadratic characters χ mod q, there is the unfortunate pos- sibility that there could be one exceptional real simple zero β Theorem 6.7 (Siegel). Let β be the possible zero of L(s, χ) as above. Then, C(ε) β < 1 − qε for any ε > 0 for some constant C(ε) which cannot be computed (i.e., the proof is ineffective). Next time we’ll say a bit more about Siegel’s theorem. It might be helpful to review things about the prime number theorem in pro- gressions which we will go over as needed on Thursday. Sound also says he is happy to look at or discuss solutions if you do end up solving problems.
7. 10/12/17 7.1. Review. Let χ be some character Z/qZ → C×. We have Ψ(x, χ) = ∑ Λ(n)χ(n). n≤x
If χ = χ0, we have p Ψ(x) + O (log x)2 = x + O x exp −c log x . 38 AARON LANDESMAN
If χ 6= χ0, GRH implies |Ψ(x, χ)| x1/2+ε. We would like an unconditional bound around p (7.1) |Ψ(x, χ)| x exp −c log x p and we would like to say when q ≤ exp log x . We have an 1 Λ(n) exp = τ(χ)χ (a) Ψ(x, χ). ∑ ( ) ∑ n≤x q φ q χ mod q
The main term comes from χ = χ0 where τ(χ0) = µ(q), using an exercise on computing Gauss sums from last time. The error term, assuming GRH is of the form q1/2 + x1/2+ε. If Equation 7.1 holds, then we can bound an µ(q) p ∑ Λ(n) exp + O x exp −c log x n≤x q φ(q) p in the range q ≤ exp c log x). We don’t actually know Equation 7.1, but for our application to sums of three primes, we thought of q as only going up to (log x)10 p and not all the way to exp c log x. If χ is complex, (i.e., not a real character) then p |Ψ(x, χ)| x exp −c log x p for q ≤ exp c log x then there are no zeros of L(s, χ) for c σ > 1 − . log q(2 + |t|) with σ = im s. If instead χ is real or quadratic, then the zero free region above holds except possibly for one real simple zero. Theorem 7.1 (Siegel). The real zero β (if it exists) must satisfy C(ε) β < 1 − qε for any ε > 0 and some ineffective constant C(ε). Remark 7.2. If the zero does not exist, then we can obtain Equa- tion 7.1. If there does exist a Siegel zero for χ mod q, then −xβ p Ψ(x, χ) = + O x exp −c log x β ANALYTIC NUMBER THEORY NOTES 39
If q ≤ (log x)A, we can choose ε small enough, we can ensure β < β 1 − √ C , and then we can absorb the main term for Ψ(x, χ) − x log x β into the error term. In the presence of the Siegel zero, we can only get this uniform desired result for q ≤ (log x)A, but not for q ≤ p exp c log x. This is also ineffective. Therefore, we obtain an µ(q) p ∑ Λ(n) exp x + O x exp −c log x . n≤x q φ(q) Remark 7.3. Suppose β is very close to 1. Pretend β = 1. Then there is one character χ mod q so that Ψ(x, χ) is approximately −x. If you think of 1 x xβ χ(a) Ψ(x; q, a) = χ(a)ψ(x, χ) = − . φ(q) ∑ φ(q) β φ(q) and here χ is real so χ(a) = χ(a). Then, half of the progression get most of the primes and the other half get none of them (this happens depending on whether χ(a) = ±1). 7.2. Proving Vinogradov’s theorem using Siegel’s theorem. For the moment, we’ll assume Siegel’s theorem and finish the proof of Vino- gradov’s theorem. We’ll later come back to discuss Siegel’s theorem. We have seen that if q ≤ (log x)A (A is around 10) then an µ(q) p ∑ Λ(n) exp = + O x exp −c log x . n≤x q φ(q) The major arcs are of the form
a 1 α − ≤ , q qQ with N Q = (log N)10 for q ≤ (log N)10. Recall we have already bounded the minor arcs, a and we are now trying to bound the major arcs. Set α = q + β. We 40 AARON LANDESMAN would like to understand an S(α) := ∑ Λ(n) exp exp (nβ) . n≤N q
an We can think of the the product of Λ(n) exp q whose partial sums we understand and exp (nβ) which doesn’t vary very much. So, an S(α) = ∑ Λ(n) exp exp (nβ) n≤N q ! Z N an = exp(xβ)d ∑ Λ(n) exp 1 n≤x q µ(q) Z N p Z N p = exp(xβ)dx + O N exp −c log N + O βx exp −c log x dx φ(q) 1 1 p = O 1 + N|β|N exp −c log N p = O N exp −c log x .
Where we used integration by parts to get the above bounds on the 1 error terms, and then we used that β ≤ qQ , and we might have to adjust the constant c to absorb some factors of log N. a Remark 7.4. The above bound makes sense: If β is very close to q , we pick up the same error term we had before. But if β is very far, then the error term should group approximately proportionally to N |β|, which indeed it does. We now want to evaluate the major arc contribution Z S(α)3e (−Nα) dα. M We are hoping this is of size N2 · C with C some constant we can evaluate. Indeed, Z ∗ Z 1/qQ a 3 a S(α)3e (−Nα) dα = ∑ ∑ S + β exp −N + β dβ. M −1/qQ q q q≤(log N)10 a mod q We know a 3 µ(q) Z N + = ( ) + 3 − p S β 3 exp xβ dx O N exp c log N q φ(q) 0 ANALYTIC NUMBER THEORY NOTES 41
The error term in the integral over the major arcs is then p 1 p O N3 exp −c log N = O N2 exp −c log N . ∑ Q q≤(log N)10 So the error terms are under control. We now want to understand the main term. The main term is almost independent of a except for the a factor exp −N q + β . We want to understand the main term of Z S(α)3e (−Nα) dα. M which is ∗ Z 1/qQ µ(q) Z N a ( ) − + ∑ ∑ 3 exp xβ dx exp N β dβ. −1/qQ φ(q) 0 q q≤(log N)10 a mod q Recall the Ramanujan sum aN c (N) := exp . q ∑ q (a,q)=1 The main term is then µ(q) Z 1/qQ Z N 3 ( ) ( ) (− ) ∑ 3 cq N exp xβ dx e Nβ dβ. φ(q) −1/qQ 0 q≤(log N)10 We can now replace Z N Z 1 exp(xβ)dx = N exp (Nxβ) dx. 0 0 This yields
Z 1/qQ Z N 3 exp (xβ) dx e (−Nβ) dβ −1/qQ 0 Z 1/qQ Z 1 3 = N) N3 exp (Nxβ) dx e (−Nβ) dβ −1/qQ 0 Z N/qQ Z 1 3 = N2 exp (Nxβ) dx e (β) dβ −N/qQ 0 Z ∞ Z 1 3 q2Q2 = 2 ( ) ( ) + N exp Nxβ dx e β dβ O 2 −∞ 0 N 42 AARON LANDESMAN where the last step uses that the tail is
Z dβ q2Q2 = O 3 O 2 . |β|>N/qQ β N This integral above is called the singular integral. Plugging in this remainder term, we get that the error contribution is 1 10 O Q2 φ(q)q2 = O Q2 (log N) ∑ φ(q)3 q≤(log N)10 ! N2 = O . (log N)10 So, we can replace our integral from −N/qQ to N/qQ by an inte- gral going off to infinity. This integral is essentially computing the number of ways to write N as a sum of three numbers, which is es- sentially N2/2. But, we can also compute it since this is essentially a Fourier transform. That is,
Z ∞ Z 1 3 N2 exp (Nxβ) dx e (β) dβ −∞ 0 is the convolution of χ[0,1] ∗ χ[0,1] ∗ χ[0,1] which has Fourier transform above, and for this convolution we get Z δ (t1 + t2 + t3 = 1) . t1,t2,t3∈[0,1] Then one can use Parseval’s identity to compute the Fourier trans- form of this. Here, Parseval is counting the number of ways of writ- ing 1 as a sum of three real numbers. Before we were writing N as a sum of three integers. Now, let’s finish our calculation. We were trying to compute
µ(q) Z 1/qQ Z N 3 ( ) ( ) (− ) ∑ 3 cq N exp xβ dx e Nβ dβ. φ(q) −1/qQ 0 q≤(log N)10 which is approximated, using our above discussion, by
µ(q) N2 c (N) . ∑ φ(q)3 q 2 q≤(log N)10 ANALYTIC NUMBER THEORY NOTES 43
This sum is called the singular sum The tail of this sum is roughly
2! 1 − O (log N) 10 . ∑ φ(q) q>(log N)10
Therefore, the main term is of the form
N2 ∞ µ(q) c (N). ∑ ( )3 q 2 q=1 φ q
Let ∞ µ(q) S (N) := c (N). ∑ ( )3 q q=1 φ q
We can write, using the Chinese remainder theorem so that cp1 p2 (N) = cp1 (N)cp2 (N). So we have 1 S (n) = ∏ 1 − s cp(n) . p (p − 1)
Then,
− p 1 aN cp(N) = ∑ exp a=1 p ( −1 if p N = - p − 1 if p | N
Then, 1 S (n) = ∏ 1 − s cp(n) p (p − 1) ! 1 1 = + · − ∏ 1 3 ∏ 1 2 . p-N (p − 1) p|N (p − 1)
Remark 7.5. We see this cancels out when N is even. When N is even, the major arc at 0 is cancelled by the major arc at 1/2. And in general, the major arc at a/q is canceled by a similar one at a/2q. 44 AARON LANDESMAN
Finally, we have
Z 1 S(α)3e (−Nα) = ∑ 0 n1+n2+n3=N N2 N = S (N) + O . 2 log N Because the contribution of prime squares and cubes is negligible, we get that every sufficiently large odd number is the sum of three primes (and in fact it is the sum of three primes in many ways), where here we are using that S (N) ≥ c for all N where c is some universal constant bounded below by 1 2 · 1 − . ∏ ( − )2 p≥3 p 1 This finishes the proof. Remark 7.6 (Philosophy). Under suitable situation, we’d like to say we can get an answer by counting contributions at each place and then multiplying them together. For example, say we’d like to count the number of ways to write 2N = p1 + p2. We can try to do the same computation mod p (i.e., the counting the number of (a, b) so that N = a + b mod p for a, b relatively prime to p, and similarly over the infinite place). We could then approximate
∑ Λ(n1)Λ(n2) ∼ S (N)2N, n1+n2=2N R 1 ( )2 (− ) we can then try to use the circle method to approximate 0 S α e 2Nα dα. But we can no longer use Parseval’s identity to bound the minor arcs because Z 1 |S(α)|2 = ∑ Λ(n)2 ∼ 2N log N. 0 n≤2N Remark 7.7. At the beginning of this course, we mentioned we could try to count the number of ways to write N as a sum
k k N = x1 + ··· + xs . ANALYTIC NUMBER THEORY NOTES 45
Letting P = N1/k, this is approximated by the integral !s Z 1 ∑ exp nkα e (−Nα) dα. 0 x≤P Then, we might expect Ps/N = Ps−k. There might then be local obstructions (e.g., squares are always 1 mod 8 or 0 mod 8). Then, if S is large enough in terms of 4k, one might try to show this can be done. Instead of trying to understand exponen- tial sums over primes, we would want to understand exponential sums over powers. But, once S ≥ k + 1 and there are no congruence constructions, this sort of result should hold. For example, every large number should be a sum of four squares. But for three squares, there is a congruence obstruction - 7 mod 8 can never be written as a sum of three squares. It turns out you can write numbers as sums of 7 cubes. But for fifth powers, the problem turns out to be much harder. Next time we’ll talk about effectivity and Siegel’s theorem.
8. 10/17/17 8.1. Exercises to solidify the ideas thus far. Here are two exercises, which are a bit longer and harder than usual. Exercise 8.1 (Difficult exercise). Assume GRH. Give a bound for ∑ Λ(n) exp (nα) n≤x
− a ≤ 1 for α q q2 without using bilinear forms, but instead using GRH and thinking about the prime number theorem and arithmetic pro- gressions. Hint: We discussed how to write na 1 ∑ Λ(n) exp ∼ ∑ χ(a)τ(χ)Ψ(x, χ), n≤x q φ(q) and one can input information about Ψ using GRH. na One would then try to write exp q in terms of exp (nβ) with 1 |β| ≤ qQ and then one should try to obtain good minor arc estimates for this summation using a “quasi-Riemann hypothesis” (i.e., assuming there 46 AARON LANDESMAN are no zeros with σ ≥ 2/3. Unconditionally, we know information about primes in progressions up to some√ modulus. Assuming GRH, we know estimates for primes up to x. We can then√ find approxi- mations for numbers with the denominator up to x. We can then use the prime number theorem for everything, and then we won’t even have to worry about major and minor arcs, we can hit the whole problem in both cases using GRH. If one is more careful (via a result due to Hardy and Littlewood) it is enough to assume there are no zeros with σ ≥ 3/4. Remark 8.2. On GRH, one should be able to prove ∑ Λ(n) exp (nφ) x3/4+ε, n≤x whereas Vinogradov’s method only gave an x4/5. To get the 3/4 estimate, we would need Hardy and Littlewood’s refinement. This refinement due to Hardy and Littlewood is a refinement of the Gauss sum idea. One might decompose exp an as a sum of multiplicative √ q characters. This incurs a loss of q. When one writes it as 1 τ(χ)χ(ax), ( ) ∑ φ q χ mod q one√ rewrites a number on the order of 1 as a number on the order of q. For n ≤ x, one can write exp(nβ) in terms of integral of the form Z f (y)niydy |y|≤x We try to replace this additive character in x in terms of a multiplica- tive character in y. A gauss sum is then of the form n τ(χ) = exp χ(n). ∑ q
There will then be an integral√ which is an analog of a Gauss sum. One then saves a factor of q, instead of just writing it naively by breaking it up into progressions. Exercise 8.3 (Difficult exercise). This exercise is to prove a theorem of Davenport. Let µ(n) be the Mobius¨ function. Show
x ( ) ( ) sup ∑ µ n exp nα A A α∈R n≤x (log x) ANALYTIC NUMBER THEORY NOTES 47 for any A > 0. You will have to figure out what happens when α is on a major or minor arc. When α is on a minor arc, you will want to use Vinogradov’s√ √ method and use a bilinear estimate, you will have x/ q + q · x. ζ0 We use Vaughn’s identity for obtaining bilinear forms for − ζ (s), 1 and we would need an analog for identifying ζ (s). One would take ζ · M for M a modifier, and play around with powers of that. To deal with the case when α is on a major arc. This has to do with understanding an ∑ µ(n) exp , n≤x q for q ≤ (log x)A. One might rewrite this in terms of χ mod q. The goal would then be to understand ∑ µ(n)χ(n). n≤x Many results holding for prime numbers also hold for the Mobius¨ function. For studying primes we look at something like 1 Z −ζ0 ds (s)xs , 2πi (c) ζ s and in this case we would be looking at 1 Z 1 ds xs . 2πi (c) L(s, χ) s For primes there is a pole at s = 1, but the pole at s = 1 becomes a 0 for 1/L. So there is some cancellation. On the major arcs, there are savings with powers of log, and on the minor arcs there is another method using bilinear forms which gives savings of powers of log. Exercise 8.4. Assuming GRH, show
3/4+ε sup ∑ µ(n) exp (nα) x . α∈R n≤x Maybe 5/6 instead of 3/4 would be easier to prove. Presumably the correct answer is x1/2+ε. The supremum does obtain x1/2 because Parseval tells us 2 Z 1 2 ∑ µ(n) exp (nα) dα = ∑ µ(n) . 0 n≤x n≤x 48 AARON LANDESMAN
Remark 8.5. The minor arc technology gave use an estimate of the form N p √ + qN + E q where E is some error term endemic to the method.√ The first two terms are optimized when q is on the order of N, in which case the first two terms give N3/4, so you cannot really do better than N3/4 with this minor arcs method. We can write exp(nα) in terms of Z ∑ χ(n)nit f , χ t a for some function f . Here n ≤ N, α ∈ (0, 1). One than writes α = q + β. One does not want to use too many q’s and too many t’s. Roughly one uses q characters χ and integrates over t which is roughly 1 + |β|x. One needs to balance what weight to put on the sum and what weight to put on the integral. You can always choose√ q ≤ Q, |β| ≤ 1 . Then, 1 + |β|N ≤ 1 + N . One can choose Q ∼ N so the integral qQ √ Q √ goes up to N and the sum adds over N terms. One looses N1/2 complexity when doing the above procedure. So it is very hard to beat N3/4 in these major and minor arc estimates. 8.2. Zeros of ζ and L-functions. It’s good to have some intuition for where the 0’s come from and how you might prove these functions have a 0-free region. We’d like to prove ζ(1 + it) 6= 0, L(1, χ) 6= 0, where − χ(p) 1 L (1, χ) = ∏ 1 − . p p We can consider the product − 1 1 ( + ) = − ζ 1 it ∏ 1 1+it . p p How will you find t with |ζ (1 + it)| being large or small. The small primes have a much bigger impact on this product above than the large primes. The maximum impact occurs when the small primes are as big or small as possible. ANALYTIC NUMBER THEORY NOTES 49
To make |ζ (1 + it)| large, we would like pit ∼ 1 and to make it small we would like pit ∼ −1 for many small primes p. Exercise 8.6 (Simple exercise). Show that for any N, there are qua- dratic characters χ with χ(p) = 1 for all p ≤ N (and similarly χ(p) = −1 for all p ≤ N). Remark 8.7. Then, χ will occur to some modulus q, and q might be very large in terms of N. If one tries to compute one of these via the Chinese remainder theorem, one might then have the modulus exponentially large in N. Conjecture 8.8 (Vinogradov). If χ(p) = 1 for all p ≤ N, then q > NA for arbitrarily large A. Conversely, χ(p) = −1 for all p ≤ N then q > NA for all large A. Example 8.9. The least quadratic non-residue must be smaller than qε, if Conjecture 8.8 were to hold true. Remark 8.10. The chance that all the first N primes land heads, one should expect the chance is around 1/2N. So, one would expect the number of primes would have to be exponentially large. Every once in a while, there can be a surprise. For example, if D = −163. Then, −163 = −1 p for p < 41. There are 12 such primes. If you think of things as coin tosses, there would only be a 1/212 (since there are 12 primes up to and including 37) but 163 is substantially smaller than 212 = 4096. L(1, χ−163) should be very small. Indeed, the Class number for- mula gives √ πh Q −163 π L (1, χ−163) √ ∼ . 163 13 50 AARON LANDESMAN and the class number is 1 here (and h denotes class√ number). Recall that the class number formula implies that for Q −D, √ πh(Q −D L (1, χD) = √ . D Goldfeld and Gross-Zagier’s result implies C log D L 1, χ−D ≤ √ . D Siegel’s theorem implies that if χ is a quadratic character mod q, then L(1, χ) ≥ C (ε) q−ε for all ε > 0. Remark 8.11. The zero free region is determined as follows. If L (β, χ) = 0, then L (1, χ) = (1 − β) L0 (σ, χ) for some β ≤ σ ≤ 1. Exercise 8.12. Prove that if 1 1 ≤ σ ≤ 1 − , log q then 0 2 L (σ, χ) ≤ C (log q) . See Davenport. Essentially you can make sense of this a little left of the 1 line. Then you can differentiate it and deduce this. So, you have a bound on how close a zero can be to 1. So, if there’s a bad Siegel 0 it has to be bounded away from 1. 8.3. The zero-free region of the Zeta function. We’d like to instead look at the completed zeta function ξ(s) = s (s − 1) π−s/2Γ(s/2)ζ(s). The functional equation says ξ(s) = ξ(s − 1). The Hadamard product formula gives s eA+Bs ∏ 1 − es/ρ. ρ ρ The trivial zeros come because the Γ(s/2) function has zeros at s = 0, −2, −4, ··· . ANALYTIC NUMBER THEORY NOTES 51
Exercise 8.13. The Riemann hypothesis is equivalent to |ξ (σ + it)| is monotonically increasing in σ ≥ 1/2 Hint: Show that 0 by 0, the Hadamard product above will be increasing. Furthermore, |ζ (σ + it)| is monotone increasing on σ ≥ 1. Exercise 8.14. Let χ be an even character, i.e., χ(−1) = 1. Define ξ (s, χ) := π−s/2Γ(s/2)L(s, χ). Then prove |ξ (s, χ)| is monotone increasing in σ > 1. If ζ(1 + it) = 0, then pit ∼ −1 for many small primes p. This implies p2it ∼ 1 for many small primes p. This implies ζ(1 + 2it) is very big. This relates to the classical inequality ζ(σ)3 |ζ (σ + it)|4 |ζ (σ + 2it)| ≥ 1. Then, χ(p)pit ∼ −1 implies χ(p)2 p2it ∼ 1, which implies L 1 + 2it, χ2 is big. This would yield a contradiction unless 2 χ = χ0 and t = 0. In this case, we are considering the ζ function at 1, which is big because it has a pole. This is the Siegel zero situation where we have a quadratic character and want a lower bound for L(1, χ). 52 AARON LANDESMAN
8.4. Siegel zero situation. We’ll now discuss a proof due to Gold- feld of Siegel’s theorem. We want to show that a lower bound for L(1, χ) L(1, χ) C(ε)q−ε for χ a quadratic character modq. We look at the region h ε i 1 − , 1 . 10 Either (1) All quadratic Dirichlet L-functions have no zero in this region ε We take β = 1 − 10 . We define Ψ to some character mod3. (2) There is some quadratic character Ψ mod r for some r with ε L(β, Ψ) = 0 with 1 ≥ β ≥ 1 − 10 . Consider a(n) ζ(s)L(s, χ)L(s, Ψ)L(s, χΨ) = . ∑ ns
Exercise 8.15. Check a(n) ≥ 0 for all n by just expanding the defini- tions of Dirichlet characters for the various L functions. This function above is the Dedekind ζ function for the biquadratic extension defined by χ and Ψ. This function is always non-negative on primes because (1 + χ(p))(1 + Ψ(p)) ≥ 0 and both χ, Ψ takes values ±1, 0. Then, for c > 1, consider 1 Z I := ζ(s + β)L(s + β, χ)L(s + β, Ψ)L(s + β, χΨ)Γ(s)Xsds 2πi (c) where X is some large parameter that we haven’t yet defined, which is roughly (qr)10. Exercise 8.16. Show that 1 Z XsΓ(s)ds = e−1/x. 2πi (c) Look at the 0’s of Γ, compute the residues, and you will see the Taylor expansion for e−1/x. ANALYTIC NUMBER THEORY NOTES 53
Here we have nice absolutely convergent integrals. But instead of picking up the characteristic function, we pick up a “smoothed” version of the characteristic function. Then, we have ∞ a(n) = −n/x I ∑ β e . n=1 n where we are plugging in a(n) ∑ nβ and (X/n)s We have Then, we have ∞ a(n) = −n/x ≥ −1/x ≥ I ∑ β e e 1/2. n=1 n Dirichlet L-functions are entire. The only L-function with a pole is the Riemann zeta function. For any other character, the L function terms cancel out every q-steps. For example, using integration by parts ! Z ∞ 1 L(s, χ) = d χ(n) − s ∑ 1 y n≤y ! Z ∞ 1 = s χ(n) dy. − s+1 ∑ 1 y n≤y Moving the line of integration to the left, we encounter poles at 1 − β from the ζ function, there are poles from the Γ function. We take Re s = −β + 1/2, so this is negative, but not as negative as −1. We encounter poles at s = 1 − β, s = 0 coming from ζ(s + β) and Γ(s). The pole at s = 1 − β has residue Computing the residue at 1 − β we get L (1, χ) L (1, Ψ) L (1, χΨ) X1−βΓ(1 − β).
1 The residue at 0 is given as follows: near 0, we have Γ(0) ∼ s using that s · Γ(s) = Γ(s + 1) and Γ(1) = 1 and is smooth. The residue at 0 is ζ(β)L(β, χ)L(β, Ψ)L(β, χΨ) ≤ 0. 54 AARON LANDESMAN
Indeed, if all Dirichlet functions have no 0’s, L(β, χ) is positive and L(β, Ψ), L(β, χΨ) is positive, and ζ(β) is negative. In the second case L(β, Ψ) = 0. We then have a lower bound for the residue at 1 − β. This is what we want, because we want a lower bound for L(1, χ). We would be done if we had upper bounds for the latter Dirichlet L functions. We can just replace using integration by parts ! Z ∞ 1 L(s, χ) = d χ(n) − s ∑ 1 y n≤y ! Z ∞ 1 = s χ(n) dy. − s+1 ∑ 1 y n≤y as above. Exercise 8.17. Indeed show that for χ a character modq, show |L (1, χ) | log q. for X a large power of qr (using Re(s) = −β + 1/2). Therefore, we would conclude a bound of the form − L (1, χ) (qr) ε . We could get an effective bound, but we don’t know what r is. In case 1, r = 3, so things would be fine. But, if there is some violation to the Riemann hypothesis, then r depends on what the violation to the Riemann hypothesis is. So this r is the source of the ineffectivity in Siegel’s theorem. Next time, we’ll discuss effectivity of the 3-prime theorem. Then we’ll move on to discussing a theorem of Maynard: Theorem 8.18. There are infinitely many primes with no 7 in their decimal expansion.
9. 10/24/17 9.1. Quick recap of the proof of Siegel’s theorem. Recall that last time we proved Siegel’s theorem: Theorem 9.1 (Siegel). We have C(ε) L(1, χ) > qε (with C(ε) ineffective). ANALYTIC NUMBER THEORY NOTES 55
The idea of the proof was to construct an auxiliary character Ψ mod r. There were two cases. In the first case, all characters have no ze- 1 ros [1 − ε/10, 1] and we took ψ mod r = 3 In the second case we ε assume there exists some r with a zero β ≥ 1 − 10 . The idea was to consider 1 Z a(n)e−n/x ζ(s + β)L(s + β, χ)L(s + β, Ψ)L(s + β, χΨ)XsΓ(s)ds = ∑ ≥ e−1/x 2πi (c) nβ We then move the line of integration to Re(s) = 1/2 − β. This has a pole at 1 − β. We obtain L(1, χ)L(1, Ψ)L(1, χΨ)X1−βΓ(1 − β). Then, at s = 0, we have ζ(β)L(β, χ)L(β, Ψ)L(β, χΨ) ≤ 0. At the end of last time, we claimed 1 Lemma 9.2. The integral on 2 − β is negligible. That is, 1 Z ζ(s + β)L(s + β, χ)L(s + β, Ψ)L(s + β, χΨ)XsΓ(s)ds 1 1 2πi ( 2 −β) for appropriate values of x (we will take (qr)20). Proof. Indeed, 1 Z ζ(s + β)L(s + β, χ)L(s + β, Ψ)L(s + β, χΨ)XsΓ(s)ds 1 2πi ( 2 −β) Z ∞ − −| | 1 1 1 1 x1/2 β e t ζ( + it)L( + it, χ)L( + it, Ψ)L( + it, χΨ) dt −∞ 2 2 2 2 We want some kind of polynomial bound to show this integral is negligible. We have ξ(s) = s (s − 1) π−s/2Γ(s/2)ζ(s) is entire of order 1 (meaning it doesn’t grow more than exponen- tially). We want to use the maximum modulus principal in a com- plex strip with real part between −1 and 2. It’s easy to bound ξ because ζ is a bounded function on Re(s) = 2. Similarly, we can un- derstand asymptotics of the other terms. By the functional equation, we then also understand the value at Re(s) = −1. So, by this variant of the maximum modulus principal, we can bound |ξ (1/2 + it)| by, essentially, |ξ(2 + it)|. This cannot literally be true because it would imply the Riemann hypothesis, but if we restrict to a rectangular re- gion, bounding things from above and below, we will have good 56 AARON LANDESMAN enough bounds. But, in any case, after making this precise, we can bound Γ by a sterling approximation, and then bound |ζ (1/2 + it)| (1 + |t|) .
Remark 9.3. If we instead carry this out between −ε and 1 + ε, one can obtain the convexity bound + |ζ(1/2 + it)| (1 + |t|)1/4 ε . The Lindelof¨ hypothesis says we can replace 1/4 + ε by any positive exponent. Altogether, we can bound 1 Z ζ(s + β)L(s + β, χ)L(s + β, Ψ)L(s + β, χΨ)XsΓ(s)ds 1 2πi ( 2 −β) Z ∞ − −| | 1 1 1 1 x1/2 β e t ζ( + it)L( + it, χ)L( + it, Ψ)L( + it, χΨ) dt −∞ 2 2 2 2 Z ∞ x1/2−β e−|t| ((1 + |t|) qr)4 dt −∞ (qr)4 x−.4.
Now, choose x = qr20 so that (qr)4 x−.4 1. Using the above bound from the lemma, together with 1 Z a(n)e−n/x ζ(s + β)L(s + β, χ)L(s + β, Ψ)L(s + β, χΨ)XsΓ(s)ds = ∑ ≥ e−1/x 2πi (c) nβ (with the last term bounded by .9) we get 1 xβ−1 L (1, χ) L (1, Ψ) L (1, χΨ) ≥ 3 Γ(1 − β) 1 ≥ (1 − β) x−ε/10 5 − = (1 − β) /5 (qr) 2ε . Then, note L(1, χ) ≤ c log r, L(1, χΨ) ≤ c log qr. ANALYTIC NUMBER THEORY NOTES 57
So, one obtains − L (1, χ) ≥ C (1 − β)(qr) 3ε . The constant C is calculatable, but the reason for the ineffectivity is that we do not know what r and β are. Remark 9.4. One can effectively prove C L (1, χ) ≥ √ , q
√1 so β must be at least r away from 1, or something like that. So really the constant above only depends on r, since we can get a bound on β from that. 9.2. Effectivity of ternary Goldbach. Returning to ternary Goldbach, we considered an ∑ Λ(n) exp . n≤N q Using q ≤ (log N)10, we found ∑ Λ(n)χ(n), n≤N β −N √c is bounded by something like β for β > 1 − q . Then, √ Nβ ≤ N1−c/ q is small compared to N only when q ≤ (log N)1.99 . But, we wanted q to go up to (log N)10 rather than (log N)2. So, we will have to use Siegel’s theorem in some range. Even though Siegel’s theorem is not effective, we can use that Siegel zeros are rare to still get effectivity of ternary Goldbach. Lemma 9.5. There cannot exist two primitive quadratic characters
χ1(modq1), χ2(modq2) 100 10−10 with Q ≤ q1, q2 ≤ Q and both L functions having a 0 at least 1 − log Q .
Proof. Suppose we have two such characters χ1, χ2. We’ll now play these two characters against each other. Consider
ζ(s)L(s, χ1)L(s, χ2)L(s, χ1χ2). This is the Dedekind zeta function of a biquadratic field, so its Dirich- let coefficients are all positive. 58 AARON LANDESMAN
Instead, consider
ξ(s)ξ(s, χ1)ξ(s, χ2)ξ(s, χ1χ2).
Consider its logarithmic derivative and evaluate at some real num- ber σ > 1. We have ξ0 ξ0 ξ0 ξ0 (σ) + (σ, χ ) + (σ, χ ) + (σ, χ χ ) ξ ξ 1 ξ 2 ξ 1 2
Using the Hadamard product formula we have
s ξ(s) = eA+Bs ∏ 1 − es/ρ ρ ρ then ξ0 1 (s) = ∑ . ξ ρ s − ρ
Then, 1 σ − β Re = . s − ρ |s − ρ|2 with s = σ + it, ρ = β + iγ. On the one hand, the expression
ξ0 ξ0 ξ0 ξ0 (σ) + (σ, χ ) + (σ, χ ) + (σ, χ χ ) ξ ξ 1 ξ 2 ξ 1 2 is always positive. On the other hand, if we have two real zeros, we would obtain ξ0 ξ0 ξ0 ξ0 1 1 (σ) + (σ, χ1) + (σ, χ2) + (σ, χ1χ2) ≥ + . ξ ξ ξ ξ σ − β1 σ − β2 We also know
ξ(σ) = σ (σ − 1) π−σ/2Γ(σ/2)ζ(σ), and with α equal to either 0 or 1,
q σ/2 ξ(σ, χ ) = 1 Γ(σ + α/2)L(σ, χ ). 1 π 1 ANALYTIC NUMBER THEORY NOTES 59
We get similar expressions for the other two ξ functions. Then, we obtain ξ0 ξ0 ξ0 ξ0 (σ) + (σ, χ ) + (σ, χ ) + (σ, χ χ ) ξ ξ 1 ξ 2 ξ 1 2 1 1 1 1 + log q + log q + log q q + O(1) 1 − σ 2 1 2 2 2 1 2 ζ0 L0 L0 L0 + (σ) + (σ, χ ) + (σ, χ ) + (σ, χ χ ). ζ L 1 L 2 L 1 2 We then obtain ζ0 L0 L0 L0 (σ) + (σ, χ ) + (σ, χ ) + (σ, χ χ ). ζ L 1 L 2 L 1 2 is approximated by Λ(n) − (1 + χ (n) + χ (n) + χ χ (n)) , ∑ nr 1 2 1 2 which has all Dirichlet coefficients negative. Therefore, we have ξ0 ξ0 ξ0 ξ0 (σ) + (σ, χ ) + (σ, χ ) + (σ, χ χ ) ξ ξ 1 ξ 2 ξ 1 2 1 1 1 1 + log q + log q + log q q + O(1) 1 − σ 2 1 2 2 2 1 2 ζ0 L0 L0 L0 + (σ) + (σ, χ ) + (σ, χ ) + (σ, χ χ ) ζ L 1 L 2 L 1 2 1 ≤ + log q q + O(1). σ − 1 1 2
If β1, β2 were close to 1, we have a lower bound by something close to 2/1 − σ.
100 Exercise 9.6. If q1, q2 are comparable to each other, say q1 ≤ q2 , q2 < q100, then we can’t have both a lower bound by 1 + 1 and an 1 σ−β1 σ−β2 upper bound by 1 + log(q q ) + O(1) θ − 1 1 2
−6 Here, we are choosing σ to be around 1 + 10 . log q1q2 60 AARON LANDESMAN
9.3. Discriminants of number fields. Let K be a number field over Q. Then, some prime must be ramified in K because the discriminant is more than 1. In general, if K has degree n over Q, what can we say about dK := disc K. Question 9.7 (Open question). Take f ∈ Z[x] of degree n irreducible. How does disc( f ) grow? Theorem 9.8 (Minkowski, Stark-Odlyzko). The discriminant of a num- ber field K is bounded below by cn with c > 1. Remark 9.9. Minkowski got this by thinking about lattices and using the geometry of numbers. One way to think of this is the following idea going back to Stark: Let r1 be the number of real embeddings, r2 be the number of com- plex embeddings, so that r1 + 2r2 = n. Consider the Dedekind zeta function r s/2 −s/2 1 −s r2 ξK(s) = s(s − 1)dK ζK(s) π Γ(s/2) (2π) Γ(s) s = (··· ) 1 − (··· ) ∏ ρ ρK K where ··· indicate factors we must include to make the product con- verge. This will satisfy a functional equation ξK(1 − s) = ξK(s), and will have a Hadamard product, and so on. Then, ξ0 1 K (σ) = ≥ 0. ξ ∑ σ − ρ K ρK K Then, 0 0 0 ξK 1 1 1 −1 1 Γ ζK (σ) = + + log dK + r1 log π + (σ/2) + r2 (··· ) + (σ). ξK σ σ − 1 2 2 2 Γ ζK
ζ0 Using that the last term K (σ) is negative, and the whole sum is pos- ζK Γ0 itive, choosing σ near 1 optimally and knowledge of Γ (1/2) and Γ0 Γ (1) gives a lower bound for the discriminant. One must choose an c appropriate value of σ, Sound suggests something like σ = 1 + n . So, if you have a field with small discriminant, this also means there are not many primes of small norm. The zeros of such an L function are then also nicely behaved. Exercise 9.10. Work out the details in the above remark. ANALYTIC NUMBER THEORY NOTES 61
There is a nice survey by Odlyzko (if one searches “discriminants Odlyzko“) and also an article by Serre on “Minorations of discrimi- nants” (in French). Remark 9.11. The ring of integers in a number field may not be monogenic, so the discriminant of a polynomial may be much larger than the discriminant of a number field. We don’t have a good lower bound on the discriminant of a polynomial. Suppose (log N)1.9 q ≤ (log N)10 . suppose there is some χ mod q0 with a Siegel zero at β0. All we have a to worry about are α = q + β with q0 | q. We have an expression of the form an Nµ(q) τ(χ) ∑ Λ(n) exp = + (··· ) + χ(a)Ψ(N, χ). n≤N q φ(q) φ(q) β The last Ψ(N, χ) is bounded by something like N 0 /β0. So, the above is approximated by Nµ(q) τ(χ) Nβ0 + χ(a) φ(q) φ(q) β0 and then we can then approximate these things by major and minor arcs. We then have to change whatever main term we had before with this new main term coming from τ(χ) Nβ0 χ(a) φ(q) β0 That is, we have Z Z a 3 a S(α)3 exp (−Nα) dα = S + β exp −N + β dβ. ∑ 1 M 10 |β|≤ qQ q q q≤(log N) ,q0|q Then, to find the contribution of the cube of this main term, we find ∗ τ(χ)3 N3β0 −aN χ(a) exp ∑ ( )3 3 a mod q φ q β0 q
−aN From χ(a) exp q we get another Gauss sum so ∗ τ(χ)3 N3β0 −aN τ(χ)4 N3β0−1 χ(a) exp ∼ . ∑ ( )3 3 ( )3 3 a mod q φ q β0 q φ q β0 62 AARON LANDESMAN − − ( ) where the 1√ in 3β0 1 is coming from the integral. Then, τ χ is bounded by q. So, the above is bounded by
q2 N3β0−1 3 3 . q β0 So, in conclusion, we get a bound like 2 N 2 ∑ N q0 log log N. 10 q q≤(log N) ,q0|q 1.9 for q0 > (log N) . Therefore, the proof is effective.
10. 10/26/17 Remark 10.1. Goldfeld Gross Zagier says c log |D| L (1, χ) ≥ p |D| for imaginary quadratic fields. Then, effectively, h(−D) > c log |D|. √ Remark 10.2 (Euler’s idoneal numbers). Consider Q −D. Can it p be that all p ≤ |D| are either ramified or inert? Gauss’ genus theorem tells us h (−D) ≥ 2 part of the class group = 2# primes |D = d (|D|) . The divisor function grows as O(|D|ε). The problem is to find all discriminants −D < 0 where √ cl Q −D = (Z/2Z)r .
Remark 10.3.√ There is work of Biro on class numbers of fields of the form Q n2 + 4 .
10.1. Primes with missing digits. In general it is quite hard to an- swer questions of the form: (1) If p is a prime, is p + 2 a primes? (2) If n is even, when is n2 + 1 prime? Here are some theorems coming out of sieve methods. Theorem 10.4 (Piatetski-Shapiro, 1950s). If 1 < α < 1.1, then there are infinitely many primes of the form bnαc. ANALYTIC NUMBER THEORY NOTES 63
Remark 10.5. This is quite a sparse set of numbers up to x, there are only x1/α such numbers. Recall from Fermat that every p ≡ 1 mod 4 can be written as a sum of two squares. Theorem 10.6 (Fouvry and Iwaniec). Infinitely often, one can write p ≡ 1 mod 4 as p = n2 + m2 with n a prime. Theorem 10.7 (Friedlander and Iwaniec). For p ≡ 1 mod 4 one can write p = m2 + n4 for infinitely many primes. Theorem 10.8 (Heath-Brown and Li). There are infinitely many primes p ≡ 1 mod 4 with p = m2 + q4 for q primes. Theorem 10.9 (Heath-Brown). There are infinitely many primes of the form a3 + 2b3. Remark 10.10. This answers an old question of Hardy and Little- wood asking if there are infinitely many primes which are sums of three cubes. Friedlander and Iwaniec only involves pairs m, n over sets of size x3/4 = x1/2 · x1/4 and in Heath-Brown’s result, this only involves a set of size x2/3 up to some x. Question 10.11. Are there infinitely many primes p = a2 + b3? Remark 10.12. This is analogous to the question of whether there are infinitely many elliptic curves with prime discriminant or conductor (since the discriminant of an elliptic curve in short Weierstrass form is something like 4a3 + 27b2). The main result we’ll spend the next few lectures proving is of a similar flavor. Theorem 10.13 (Maynard). If q is a sufficiently large base (e.g. q = 107), k j write n = ∑j=0 njq with 0 ≤ nj ≤ q − 1 as a base q expansion. Select a forbidden digit 0 ≤ a0 ≤ q − 1 Let
A := {n ∈ N : n does not have the digit a0 base q } . Then, n o # n < qk : n ∈ A = (q − 1)k log(q−1)/ log q = qk . 64 AARON LANDESMAN
Then,
( ) ∼ ( ) ( − )k ∑ Λ n κa0 q q 1 . n One can also find elements of A that are, say, squares. j Theorem 10.15 (Mauduit and Rivat). Write primes in binary p = ∑ aj2 . Count s(p) := ∑j aj. Then, s(p) is equally likely to be 0 or 1 mod 2. More generally, this can be done with any base replacing 2, with obvious exceptions. There is also the following cute result: One might ask if one can find Fermat primes, with two 1’s in the binary expansion and all other digits 0. This might be a hard problem because the set is quite sparse, but, one can try to further ask if there are infinitely many primes with k 1’s. One might try an easier problem asking if there simply exist primes with exactly k 1’s in their binary expansion. The following theorem shows the answer is yes. Theorem 10.16 (Drmota, Mauduit, and Rivat)√. Let K be an integer and let k be on the scale of K/2 (say k − K/2 = O( k)). Then, n o p < 2K : p prime , there are k digits equal to 1 has an asymptotic formula, with about 1/k of the numbers in this set prime. K Remark 10.17. One would expect about ( K ) ∼ √2 K/2 K Theorem 10.18 (Bourgain). There are primes p ≤ 2k for which you can specify any αK of the binary digits, for some fixed α > 0, where the last digit must be 1 (so that the number is not even). 10.2. Beginning the proof of Maynard’s theorem. We now turn to proving Maynard’s theorem on primes without a specified digit. Re- call we have fixed a base q, an integer k, and defined A as the set of k primes up to q without a digit a0. ANALYTIC NUMBER THEORY NOTES 65 We are trying to count ∑ Λ(n) = ∑ Λ(n)1A(n). n = ∑ Λ(n)1A(n). n≤qk The last equality holds because m, n < qk. For the penultimate one, writing a am = ( ) S k ∑ Λ m exp k q m q and −a −na = ( ) A k ∑ 1A n exp k , q n q and the only terms that survive are m ≡ n mod qk. This is an excel- lent approximation to the integral from the circle method. Exercise 10.19. Verify the above equalities. 66 AARON LANDESMAN We now separate terms into major and minor arcs, as in the circle method. We now write a ` = + β qk d d ≤ qk | | ≤ 1 with /2 and β dqk/2 . We try to approximate a/qk using rational numbers with denomi- nator at most qk/2. By Dirichlet’s theorem, we can always write num- bers in this form. We write A as some large positive number. The major arcs are those values of a with A (log qk)A d ≤ log qk , |β| ≤ . qk The minor arcs are the remaining values of a. The major arcs are distinct because the denominators are small and we are taking small intervals around each rational number. We’ll first deal with the major arcs. The harder part will come later when we deal with the minor arcs. 10.4. The major arc contribution. There are two cases: (1) The denominator d is a small power of q (2) The denominator d is not a small power of q. The main terms will come from the first case. Consider the sum S(α) on the major arcs of the first case, so ` b α = + d qk with d a power of q. b is small (at most d) because β is bounded. Consider `n S (`/d) = Λ(n) exp ∑ d n