CARLETON UNIVERSITY
SCHOOL OF MATHEMATICS AND STATISTICS
HONOURS PROJECT
TITLE: The Circle Method Applied to
Goldbach’s Weak Conjecture
AUTHOR: Michael Cayanan
SUPERVISOR: Brandon Fodden
DATE: August 27, 2019
Acknowledgements
First and foremost, I would like to thank my supervisor Brandon Fodden not only for all of his advice and guidance throughout the writing of this paper, but also for being such a welcoming and understanding instructor during my first year of undergraduate study. I would also like to thank the second reader Kenneth Williams for graciously providing his time and personal insight into this paper. Of course, I am also very grateful for the countless professors and instructors who have offered me outlets to explore my curiosity and to cultivate my appreciation of mathematical abstraction through the courses that they have organized.
I am eternally indebted to the love and continual support provided by my family and my dearest friends throughout my undergraduate journey. They were always able to provide assurance and uplift me during my worst and darkest moments, and words cannot describe how grateful I am to be surrounded with so much compassion and inspiration. This work is as much yours as it is mine, and I love you all 3000. THE CIRCLE METHOD APPLIED TO GOLDBACH’S WEAK CONJECTURE
Michael Cayanan
August 27, 2019
Abstract. Additive number theory is the branch of number theory that is devoted to the study of repre- sentations of integers subject to various arithmetic constraints. One of, if not, the most popular question that additive number theory asks arose in a letter written by Christian Goldbach to Leonhard Euler, fa- mously dubbed the Goldbach conjecture. Dated June 7th, 1742, the translated conjecture states that “Every number n which is a sum of two primes is a sum of as many primes including unity as one wishes (up to n), and that every number > 2 is a sum of three primes”. The primary goal of this paper is to tackle a much easier problem concerning the representation of every odd integer greater than 5 as a sum of three primes (appropriately named the weak Goldbach conjecture, or the ternary Goldbach conjecture), under the frame- work of the famed “circle method”, mainly attributed to Hardy, Littlewood, Ramanujan and Vinogradov. The circle method will allow us to translate the weak conjecture into a complex-analytic problem relying on the estimation of integrals performed over a constructed partition of [0, 1]. We will also explore the extent and the limitations of the circle method when applied to Goldbach’s binary conjecture, as well as its an application concerning Waring’s problem.
1. Notation and Number Theory Preliminaries
Before we introduce the circle method, some preliminary definitions and results from elementary number theory will be briefly recalled. As usual, Z denotes the set of all integers and N the set of all positive integers. In what follows, p will almost exclusively denote a prime number and P will denote the set of all prime numbers. We say that a nonzero integer d divides an integer n if n = cd for some c ∈ Z, and this will be denoted by d | n. The greatest common divisor of integers a and b (not both 0) will always be presented as (a, b). If (a, b) = 1, then we say that a and b are relatively prime, or co-prime.
Definition 1.1. An arithmetic function f is a complex valued function defined on the positive integers. If f is not identically zero and f(mn) = f(m)f(n) whenever (m, n) = 1, then f is a multiplicative function.
We now briefly describe two elementary arithmetic functions and some of their useful properties.
x1 xk Definition 1.2. For n > 1 with prime factorization n = p1 . . . pk , we define the M¨obiusfunction µ(n) as k (−1) if x1 = ··· = xk = 1 µ(n) = 0 otherwise.
If n = 1, then we set µ(1) = 1. 2 THE CIRCLE METHOD APPLIED TO GOLDBACH’S WEAK CONJECTURE 3
Definition 1.3. For n ≥ 1, the Euler totient function φ(n) is equal to the number of positive integers k ≤ n such that (n, k) = 1. That is, n X φ(n) = 1. k=1 (n,k)=1
It is well known that both µ and φ are multiplicative functions. Evaluating the totient function at prime k k k−1 k 1 Q 1 powers yields φ(p ) = p − p = p 1 − p , and more generally, φ(n) = n p|n 1 − p . Summing over all of the divisors d of n for φ(n) and µ(n), this gives X X 1 if n = 1 φ(d) = n, and µ(d) = d|n d|n 0 if n > 1.
All of these results may be found in any elementary number theory texts such as Apostol [1] (Chapters 2.2 and 2.3) or Nathanson [15] (Sections A.5 and A.6).
Proposition 1.4. Given ε > 0, we have n1−ε < φ(n) < n for all sufficiently large n.
Proof. Clearly, φ(n) < n, ∀n > 1. Now, we want to show that
n1−ε lim = 0. n→∞ φ(n)
Since φ is multiplicative, it suffices to show that the result holds for prime powers. Observe that for every p prime p, we have that p−1 ≤ 2, hence
pm(1−ε) pm(1−ε) p pm(1−ε) 2 = = ≤ . φ(pm) pm − pm−1 p − 1 pm pmε
Thus, pm(1−ε) lim = 0. pm→∞ φ(pm)
For notational convenience, for x ∈ R, we will set e(x) := e2πix. We will now define an exponential sum and prove an important reformulation using the M¨obius function which we will make use of.
Definition 1.5. Let q, n ∈ Z and q ≥ 1. We define the Ramanujan sum as the exponential sum q X kn c (n) = e . q q k=1 (k,q)=1
Theorem 1.6. The Ramanujan sum can be expressed in the form
X q c (n) = µ d. q d d|(q,n)
Furthermore, if (q, n) = 1, we have cq(n) = µ(q). 4 Michael Cayanan
P Proof. As noted above, we know that d|1 µ(d) = 1, hence
q q q X kn X kn X kn X c (n) = e = e · 1 = e µ(d). q q q q k=1 k=1 k=1 d|(k,q) (k,q)=1 (k,q)=1
Therefore, q q/d q/d X X kn X X dln X X ln c (n) = µ(d) e = µ(d) e = µ(d) e . q q q q/d d|q k=1 d|q l=1 d|q l=1 d|k k=dl
Pd ln Define fd(n) := l=1 e d , and note that d if d | n fd(n) = 0 otherwise.
From this, we then have that
q/d X X ln c (n) = µ(d) e q q/d d|q l=1 X = µ(d)fq/d(n) d|q X q = µ f (n) d d d|q X q = µ d d d|q d|n X q = µ d. d d|(q,n)
If (q, n) = 1, then the formula simplifies to cq(n) = µ(q), as required.
Using this result, we may now show that cq(n) is a multiplicative function in q. For if q1, q2 ≥ 1 with
(q1, q2) = 1, we have
X q1 X q2 X q1 q2 cq1 (n) · cq2 (n) = µ d1 µ d2 = µ µ d1d2. d1 d2 d1 d2 d1|(q1,n) d2|(q2,n) d1|(q1,n) d2|(q2,n)
Writing d = d1d2 and noting that µ is multiplicative, we rewrite the previous summation as
X q1q2 X q1q2 µ d = µ d = c (n). d d q1·q2 d|(q1,n)·(q2,n) d|(q1·q2,n)
To conclude this introductory section, we introduce the last few number theoretic results and definitions which will greatly aid us in the proof of Vinogradov’s Theorem. Excluding Theorem 1.11 and its following remark, the proofs of all of these results may be found in the appendix of Nathanson [15]. THE CIRCLE METHOD APPLIED TO GOLDBACH’S WEAK CONJECTURE 5
P∞ Theorem 1.7 (Euler Products). Let f be a multiplicative function. Suppose that the series n=1 f(n) converges absolutely. Then
∞ ∞ X Y Y X f(n) = (1 + f(p) + f(p2) + ··· ) = 1 + f(pk). n=1 p p k=1 If f is completely multiplicative, then
∞ X Y f(n) = (1 − f(p))−1. n=1 p
Definition 1.8. Let f be any complex valued function, and let g be any positive function. Then f = O(g) if and only if there exists C > 0 such that |f(x)| ≤ Cg(x) for all x large enough. The constant C is called the implied constant. We may occasionally write f g (notation due to Vinogradov) in place of f = O(g) as well.
Definition 1.9. We say that f is asymptotic to g, denoted f ∼ g, if
f(x) lim = 1. x→∞ g(x)
Theorem 1.10 (Partial Summation). Let a(n) be an arithmetic function, and suppose that f is a function P with continuous derivative on an interval [y, x] with 0 ≤ y < x. Define the sum function A(x) = n≤x a(n), and A(x) = 0 if x < 1. Then
X Z x a(n)f(n) = A(x)f(x) − A(y)f(y) − A(t)f 0(t)dt. y In particular, taking 0 < y < 1 yields X Z x a(n)f(n) = A(x)f(x) − A(t)f 0(t)dt. n≤x 1 P Theorem 1.11 (Prime Number Theorem). Define π(x) = p≤x 1 as the prime-counting function. Then x π(x) ∼ . log x Remark 1.12. There are several equivalent forms for the Prime Number Theorem. Several equivalences involve weighted prime counting functions: Chebyshev’s θ-function and Chebyshev’s ψ-function, respectively given by the sums X X θ(x) = log p and ψ(x) = log p. p≤x pk≤x The Prime Number Theorem is equivalent to either of the asymptotic relations θ(x) ∼ x and ψ(x) ∼ x. Moreover, one can show that there exists c1, c2 > 0 such that c1x ≤ θ(x) ≤ ψ(x) ≤ π(x) log x ≤ c2x. 6 Michael Cayanan This is known as Chebyshev’s Theorem. A proof of the Prime Number Theorem is presented in great detail in Chapter 13 of Apostol [1], while Chebyshev’s Theorem may be found in Section 6.2 of Nathanson [15]. 2. The Circle Method Consider a set A of nonnegative integers. For some positive integers n and s, we define rs,A(n) as the number of representations of n as the sum of s elements of A (the number of solutions to the equation a1 + ··· + as = n, where a1, . . . as ∈ A). That is, X rs,A(n) = 1. a1+···+as=n a1,...,as∈A Note that the order of the summands from A is taken into account, so for example, 2 + 3 and 3 + 2 both contribute to r2,N(5). If the set A is clear from the context, we will simply denote this by rs(n). We are interested in estimating rs,A(n) for specific values of s and sets A. For example, if we take s = 2 and A = P as the set of all prime numbers, then r2,P (n) counts the number of ways n can be written as a sum of two primes. Indeed, showing that such a quantity is positive for all even n ≥ 4 is equivalent to the verification of Goldbach’s conjecture. Similarly, setting s = 3 allows one to study Goldbach’s weak conjecture, which asserts that every odd number greater than 5 can be represented as the sum of three primes. In order to motivate the overall idea, we will define the generating function for A as the power series P a f(z) = a∈A z . Let a(n) = 1 if n ∈ A, and a(n) = 0 otherwise. Note that squaring f(z) yields 2 ∞ 2 X a X n f(z) = z = cnz , a∈A n=0 where XX cn = a(h)a(k). 0≤h,k≤n h+k=n Note that a(h)a(k) = 1 if h, k ∈ A, and a(h)a(k) = 0 otherwise. Thus, cn counts the number of ways n may be expressed as a sum of two elements belonging in A, and so cn = r2,A(n). Using the same argument as above, one may deduce that for s ∈ N, the sth power of f(z) is given by s ∞ s X a X n f(z) = z = rs,A(n)z . a∈A n=0 s From this, we see that f(z) is actually the generating function for rs,A(n). Explicitly determining the value of rs,A(n) for specific values of s and n can be difficult, especially when these parameters are rather large. However, certain choices for the set A can lead to easily obtainable formulas for rs,A(n). Example 2.1. Let s ≥ 1 and K = {0, 1k, 2k,... } be the set of k powers, where k is a positive integer. Waring’s problem asks whether each nonnegative integer is the sum of a bounded number of kth powers. THE CIRCLE METHOD APPLIED TO GOLDBACH’S WEAK CONJECTURE 7 For k = 1 and n ≥ s, we may obtain an explicit formula for rs(n). We are now studying the number of representations for the expression n = a1 + ··· + as. Subtracting s on both sides, we obtain n − s = (a1 −1)+···+(as −1). If we denote by rs(n) the number of representations of n as the sum of s nonnegative integers, then the above argument tells us that rs(n) = rs(n − s). We will show that n − 1 r (n) = r (n − s) = . s s s − 1 P∞ n 1 Proof. Let |z| < 1. Then the geometric series f(z) = n=0 z converges to the sum 1−z . Taking s powers, s P∞ n we obtain the generating function for rs(n): f(z) = n=0 rs(n)z . On the other hand, we have 1 f(z)s = (1 − z)s 1 ds−1 1 = (s − 1)! dzs−1 1 − z ∞ 1 ds−1 X = zk (s − 1)! dzs−1 k=0 ∞ 1 X = k(k − 1) ··· (k − s + 2)zk−s+1 (s − 1)! k=s−1 ∞ X k(k − 1) ··· (k − s + 2) (k − s + 1)! = · zk−s+1 (s − 1)! (k − s + 1)! k=s−1 ∞ X k = zk−s+1. s − 1 k=s−1 Re-indexing the last sum with n = k − s + 1, we obtain ∞ X n + s − 1 f(z)s = zn. s − 1 n=0 Thus, n + s − 1 r (n) = , s s − 1 and by our previous observation we have that n − 1 r (n) = r (n − s) = . s s s − 1 Now continuing from our original generating function, let Cρ = {z ∈ C : |z|= ρ} denote the circle centred at the origin with radius ρ. Using Cauchy’s Theorem, we can recover the coefficients of the generating function by considering a contour integral. We have Z s Z P∞ k ∞ Z 1 f(z) 1 rs,A(k)z 1 X dz = k=0 dz = r (k) zk−n−1 = r (n) 2πi zn+1 2πi zn+1 2πi s,A s,A Cρ Cρ k=0 Cρ 8 Michael Cayanan since 1 R zk−n−1 = 1, if k − n − 1 = −1 (i.e. k = n), and is 0 otherwise. 2πi Cρ This was the original precursor to the circle method developed by Hardy, Littlewood, and Ramanujan in 1918-1920. Assuming the Generalized Riemann Hypothesis, Hardy and Littlewood were able to conditionally show that every large odd number is a sum of three primes, and that almost every even number is a sum of two primes. In layman terms, they established bounds for the contour integral by showing that there exists a constant N such that for all odd n > N, the integral is at least 1 (i.e. r3,P (n) ≥ 1 for all such n). Establishing such bounds was accomplished by performing the integration over a specific partition of the unit interval. We shall elaborate more on this principle when dealing with Vinogradov’s refinement of the circle method. 3. The Singular Series, Major Arcs, and Minor Arcs Before we proceed further, a new arithmetic function needs to be formally introduced. Definition 3.1. The function ∞ X µ(q)cq(n) S(n) = φ(q)3 q=1 is called the singular series for the ternary Goldbach problem. Under the assumption of the Generalized Riemann Hypothesis, Hardy and Littlewood’s original efforts resulted in the asymptotic relation n2 r (n) ∼ S(n) , 3,P 2(log n)3 where n is any sufficiently large odd integer. Therefore, the quantity S(n) can be bounded away from 0, implying that there are many ways to write n as the sum of three distinct prime numbers and proving Goldbach’s weak conjecture for large enough odd n. In 1937, Vinogradov was able to refine the work of Hardy and Littlewood by showing that the bounds are not conditional on the assumption of the Generalized Riemann Hypothesis. Theorem 3.2 (Vinogradov’s Theorem). There exists positive constants c1 and c2 such that c1 < S(n) < c2 for all sufficiently large odd integers n. Furthermore, for any sufficiently large odd integer n, we have ! n2 log log n r (n) = S(n) 1 + O . 3,P 2(log n)3 log n A large amount of time and effort will be spent towards proving the latter statement, while the former is much easier in comparison. We first observe the following. THE CIRCLE METHOD APPLIED TO GOLDBACH’S WEAK CONJECTURE 9 Theorem 3.3. For odd n, the singular series S(n) converges absolutely and uniformly with Euler product Y 1 Y 1 S(n) = 1 + 1 − . (p − 1)3 p2 − 3p + 3 p p|n Furthermore, for any ε > 0, X µ(q)cq(n) S(n, Q) := = S(n) + O(Q−(1−ε)), φ(q)3 q≤Q where the implied constant depends only on ε. Observe that proving this theorem for odd n gives Y 1 Y 1 1 − < S(n) < 1 + . p2 − 3p + 3 (p − 1)3 p|n p 1 Note that 1 − p2−3p+3 < 1, and so the resulting product over all the prime divisors of n will provide a lower Q 1 bound for c1. Setting p 1 + (p−1)3 as c2 finishes our argument. One may also interpret this inequality as S(n) 1 for odd n. Proof. Let ε > 0 be given. Proposition 1.4 gives us q1−ε < φ(q) for sufficiently large q. By definition of cq(n) and φ(q), we also have cq(n) = O(φ(q)) since q q X kn X |cq(n)| = e ≤ 1 = φ(q). q k=1 k=1 (k,q)=1 (k,q)=1 Noting that |µ(q)| ≤ 1, the summands of the singular series can then be estimated, as µ(q)c (n) 1 1 q = O = O . φ(q)3 φ(q)2 q2−ε That is, the singular series converges absolutely and uniformly. From this, we also obtain ! X µ(q)cq(n) X 1 1 S(n) − S(n, Q) = = O = O . φ(q)3 q2−ε Q1−ε q>Q q>Q Now observe that if p | n, then p p−1 X kn X c (n) = e = 1 = p − 1. p p k=1 k=1 (k,p)=1 µ(q)cq (n) On the other hand, p - n gives cp(n) = µ(p) = −1 by Theorem 1.6. The expression φ(q)3 is multiplicative with respect to q (as it is a product and quotient of multiplicative functions), and together with absolute convergence, we may utilize Theorem 1.7. Since µ(pk) = 0 for k ≥ 2 we can now obtain ∞ k Y X µ(p )cpk (n) S(n) = 1 + φ(pk)3 p k=1 10 Michael Cayanan Y cp(n) = 1 − φ(p)3 p Y 1 Y 1 = 1 + 1 − (p − 1)3 (p − 1)2 p-n p|n Q 1 p 1 + (p−1)3 Y 1 = 1 − Q 1 + 1 (p − 1)2 p|n (p−1)3 p|n 1 Y 1 Y 1 − (p−1)2 = 1 + . (p − 1)3 1 + 1 p p|n (p−1)3 Simplifying the second product yields 1 2 3 2 1 − (p−1)2 p − 2p (p − 1) (p − 2)(p − 1) p − 3p + 3 − 1 1 1 = 3 2 · 2 = 2 = 2 = 1 − 2 . 1 + (p−1)3 p − 3p + 3p (p − 1) p − 3p + 3 p − 3p + 3 p + 3p + 3 Thus, we have Y 1 Y 1 S(n) = 1 + 1 − , (p − 1)3 p2 − 3p + 3 p p|n as required. Remark 3.4. The notion of a singular series S(n) extends to other applications of the circle method, and is not exclusive to Goldbach’s weak conjecture. Typically, one requires S(n) to be some arithmetic function uniformly bounded above and below by positive constants c1 and c2. Other examples of singular series can be found in Sections 7 and 8. Remark 3.5. The condition of n being odd in Theorem 3.3 cannot be stressed enough. For even n, we see that Y 1 Y 1 S(n) = 1 + 1 − (p − 1)3 (p − 1)2 p-n p|n 1 Y 1 Y 1 = 1 − 1 + 1 − (2 − 1)2 (p − 1)3 (p − 1)2 p-n p|n p>2 = 0. Note that if Vinogradov’s result held true for even n, then one of p1, p2, or p3 would have to be even. Without loss of generality, let p1 = 2. Then we may write n − 2 = p2 + p3, and we have seemingly proved that every even number can be written as a sum of two primes. That is, if we were able to take n to be even in Theorem 3.3, Goldbach’s strong conjecture would follow. We now reformulate the definition of rs,A(n) slightly and consider a weighted sum over the number of representations of n as a sum of s elements of A: X Rs,A(n) = log a1 log a2 ··· log as. a1+···+as=n a1,...,as∈A THE CIRCLE METHOD APPLIED TO GOLDBACH’S WEAK CONJECTURE 11 In what follows, we will fix s = 3 and A = P, so that r3,P (n) = r(n) and R3,P (n) = R(n). Working analogously as before, we will define a specific generating function for R(n) from which we can retrieve its coefficients through integration. The main difference in the approach here is that the generating function is now an exponential sum, rather than a traditional power series. Define X F (x) := (log p)e(px). p≤n Then, the orthogonality relation Z 1 1 if m = n e(mx)e(−nx)dx = 0 0 if m 6= n gives us Z 1 Z 1 3 X F (x) e(−nx)dx = (log p1)(log p2)(log p3)e((p1 + p2 + p3)x)e(−nx)dx 0 0 p1,p2,p3≤n X Z 1 = (log p1)(log p2)(log p3) e((p1 + p2 + p3)x)e(−nx)dx 0 p1,p2,p3≤n X = (log p1)(log p2)(log p3) p1+p2+p3=n = R(n). The logarithmic weights are introduced for analytic reasons, and will simplify future computations. It was mentioned before that Hardy and Littlewood’s original result involves a partition for [0, 1]. These disjoint partitions are called major and minor arcs, and the above integral was then evaluated over both sets separately. Definition 3.6. For odd n > 1, fix B > 0 and let Q = (log n)B. For 1 ≤ q ≤ Q, 1 ≤ a ≤ q, and (a, q) = 1, we define the major arc M(q, a) as the set a Q M(q, a) = x ∈ [0, 1] : x − ≤ . q n The set of all major arcs will be denoted and given by Q q [ [ M = M(q, a). q=1 a=1 (a,q)=1 The set of minor arcs is correspondingly given by m = [0, 1] \ M. So the major arcs consist of real numbers x ∈ [0, 1] that are well approximated by rationals with sufficiently Q small denominators that are within a distance of n from x. 12 Michael Cayanan Remark 3.7. For large values of n, the major arcs M(q, a) are mutually disjoint for all a and q. Indeed, 0 0 a a0 0 0 suppose x ∈ M(q, a) ∩ M(q , a ) and q 6= q0 . Hence, |aq − a q| ≥ 1, and with the triangle inequality, one obtains 0 0 0 0 1 1 |aq − a q| a a a a ≤ ≤ = − = − + x − x Q2 qq0 qq0 q q0 q q0 0 B a a 2Q 2(log n) ≤ x − + x − ≤ = . q q0 n n We may rearrange this to obtain n ≤ 2Q3 = 2(log n)3B, which is a contradiction for sufficiently large n. Remark 3.8. Let µ be any measure on the σ-algebra of measurable sets on R. We remark that the measure of the set M tends to 0 as n goes to infinity. Since µ([a, b]) = b − a and µ is additive on disjoint sets, we have Q q Q q Q q Q Q X X X X 2Q X 2Q X X 2Q 2Q X µ(M) = µ(M(q, a)) = = 1 = φ(q) ≤ q n n n n q=1 a=1 q=1 a=1 q=1 a=1 q=1 q=1 (a,q)=1 (a,q)=1 (a,q)=1 2Q Q(Q + 1) 2Q3 2(log n)3B ≤ · ≤ = , n 2 n n and this upper bound approaches 0 as n gets large. This last remark also tells us that the measure of m tends to 1 for increasing values of n, and so our minor arcs are actually considerably larger than our major arcs. This is a curious fact as we will end up showing that the main term in Vinogradov’s theorem will come from the integral over the major arcs, while the contribution from the minor arcs will be negligible in comparison: Z 1 Z Z R(n) = F (x)3e(−nx)dx = F (x)3e(−nx)dx + F (x)3e(−nx)dx. 0 M m 4. Estimating the Major Arcs We now introduce the singular integral 1 Z 2 J(n) = u(β)3e(−nβ)dβ, 1 − 2 Pn where u(β) is an exponential sum given by u(β) = m=1 e(mβ). The aforementioned contribution from the major arcs in Vinogradov’s theorem will actually be the product of the singular series and the singular integral, with the latter being easy to deal with. Analogous to the argument utilizing the orthogonality relations preceding Definition 3.6, J(n) can be seen to be nothing more than the number of ways to express n as a sum of three positive integers. Hence, J(n) = rs(n) from Example 2.1. An asymptotic formula for J(n) is then given by n − 1 (n − 1)(n − 2)(n − 3)! n2 − 3n + 2 n2 J(n) = = = = + O(n). 2 2(n − 3)! 2 2 THE CIRCLE METHOD APPLIED TO GOLDBACH’S WEAK CONJECTURE 13 The next few results will rely on the following theorem. Theorem 4.1 (Siegel-Walfisz). Let C, B > 0 and (q, a) = 1. Then for any q ≤ (log x)B, we have X x x log p = + O . φ(q) (log x)C p≤x p≡a mod q Here, the implied constant is dependent on B and C. The Siegel-Walfisz theorem provides a uniform error term for primes in arithmetic progressions up to q ≤ (log x)B. As φ(q) is at most q, which in turn is at most (log x)B, taking C > B allows for the main term to be significantly larger then the error term. So it is in our best interest to keep C far away from B.A formal proof and the historical development of the Siegel-Walfisz theorem can be found in Chapters 21 and 22 of Davenport [4]. Lemma 4.2. Let X Fx(α) := (log p)e(pα). p≤x Let B, C > 0. Suppose that 1 ≤ q ≤ Q = (log n)B, and (q, a) = 1. For 1 ≤ x ≤ n, we then have a µ(q) Qn F = x + O . x q φ(q) (log n)C Here the implied constant depends only on B and C. Proof. Let p ≡ r mod q. Note that p | q ⇐⇒ (r, q) > 1, and so q X X pa X pa X (log p)e = (log p)e = O log p = O(log q) = O(log Q). q q r=1 p≤x p≤x p|q (r,q)>1 p≡r mod q p|q