Faculty of Sciences Department of Mathematics
Exponential sums and applications in number theory and analysis
Frederik Broucke
Promotor: Prof. dr. J. Vindas
Master’s thesis submitted in partial fulfilment of the requirements for the degree of Master of Science in Mathematics
Academic year 2017–2018 ii Voorwoord
Het oorspronkelijke idee voor deze thesis was om het bewijs van het ternaire vermoeden van Goldbach van Helfgott [16] te bestuderen. Al snel werd mij duidelijk dat dit een monumentale opdracht zou zijn, gezien de omvang van het bewijs (ruim 300 bladzij- den). Daarom besloot ik om in de plaats de basisprincipes van de Hardy-Littlewood- of cirkelmethode te bestuderen, de techniek die de ruggengraat vormt van het bewijs van Helfgott, en die een zeer belangrijke plaats inneemt in de additieve getaltheorie in het algemeen. Hiervoor heb ik gedurende het eerste semester enkele hoofdstukken van het boek “The Hardy-Littlewood method” van R.C. Vaughan [37] gelezen. Dit is waarschijnlijk de moeilijkste wiskundige tekst die ik tot nu toe gelezen heb; de weinige tussenstappen, het gebrek aan details, en zinnen als “one easily sees that” waren vaak frustrerend en demotiverend. Toch heb ik doorgezet, en achteraf gezien ben ik echt wel blij dat ik dat gedaan heb. Niet alleen heb ik enorm veel bijgeleerd over het onderwerp, ik heb ook het gevoel dat ik beter of vlotter ben geworden in het lezen van (moeilijke) wiskundige teksten in het algemeen. Na het lezen van dit boek gaf mijn promotor, professor Vindas, me de opdracht om de idee¨en en technieken van de cirkelmethode toe te passen in de studie van de functie van Riemann, een “pathologische” continue functie die een heel onregelmatig puntsgewijs gedrag vertoont. De combinatie van deze twee onderwerpen leidde uiteindelijk tot de keuze voor exponenti¨ele sommen als onderwerp van de thesis, met een grote nadruk op de toepassingen voor de cirkelmethode en de studie van Riemanns functie.
Persoonlijk vind ik dat je een wiskundig bewijs op twee niveaus kan begrijpen. Ener- zijds kan je elke overgang of stap afzonderlijk begrijpen, inzien waarom regel n + 1 volgt uit regel n voor elke n. Anderzijds (en misschien belangrijker) kan je het bewijs op een globaal niveau begrijpen: wat zijn de achterliggende idee¨en en motivaties, waarom doet men iets op die manier, waarom werkt dit? In mijn thesis heb ik geprobeerd om beide niveaus voldoende te belichten. Ik heb geprobeerd zo veel mogelijk resultaten rigoureus en volledig bewezen. Het is hierbij nagenoeg onmogelijk om (soms vervelende) technische details te vermijden. Toch hoop ik dat de techniciteiten de achterliggende idee¨en niet verbloemen, en dat de lezer door het lezen van deze thesis inzicht en appreciatie kan krijgen voor het mooie onderwerp van exponenti¨ele sommen en hun toepassingen in vele fascinerende problemen.
Ik heb er bewust voor gekozen om deze masterproef in het Engels te schrijven. Dit lijkt mij passend voor de masterproef als (mini-)wetenschappelijk onderzoek, aangezien de overgrote meerderheid van de wetenschappelijke literatuur in het Engels wordt ge- schreven.
Ten slotte wil ik mijn promotor, professor Vindas, bedanken. Hij stond me bij met
iii iv VOORWOORD advies, hielp me met problemen tijdens het lezen en het schrijven, en voorzag nuttige referenties. Anderzijds gaf hij me ook voldoende vrijheid om mijn eigen ding te doen, wat ik zeer op prijs stel. Ten slotte maakte hij veel tijd vrij in zijn drukke schema om de talrijke voorlopige versies nauwgezet na te lezen.
De auteur geeft de toelating deze masterproef voor consultatie beschikbaar te stellen en delen van de masterproef te kopi¨eren voor persoonlijk gebruik. Elk ander gebruik valt onder de beperkingen van het auteursrecht, in het bijzonder met betrekking tot de verplichting de bron uitdrukkelijk te vermelden bij het aanhalen van resultaten uit deze masterproef.
Frederik Broucke 30 mei 2018 Contents
Voorwoord iii
List of symbols vii
1 Introduction 1
2 Exponential sums 3 2.1 Some elementary estimates ...... 3 2.2 Characters and Gauss sums ...... 4 2.2.1 Ramanujan sums ...... 6 2.2.2 Separable Gauss sums and primitive characters ...... 7 2.2.3 Quadratic Gauss sums ...... 9 2.3 kth-power Gauss sums ...... 13 2.3.1 kth-powers modulo pl ...... 13 2.3.2 The exponential sums S(q, a) and S(q, a, b) ...... 15 2.4 Weyl sums ...... 18 2.4.1 Weyl’s method ...... 18 2.4.2 Vinogradov’s method ...... 22 2.4.3 Vinogradov’s mean value theorem ...... 28
3 The Hardy-Littlewood method 35 3.1 Generalities ...... 35 3.2 Waring’s problem ...... 37 3.2.1 Approximating the generating function ...... 37 3.2.2 The singular series ...... 43 3.2.3 The singular integral ...... 49 3.2.4 The contribution from the major arcs ...... 51 3.2.5 The contribution from the minor arcs ...... 53 3.3 The ternary Goldbach problem ...... 55 3.3.1 The contribution from the major arcs ...... 55 3.3.2 The contribution from the minor arcs ...... 60
4 The Vinogradov-Korobov zero-free region for ζ 65
5 Riemann’s non-differentiable function 73 5.1 Introduction ...... 73 5.2 Behaviour at rational points ...... 75 5.3 Behaviour at irrational points ...... 78 5.3.1 The upper bound for α(ρ)...... 78 5.3.2 The lower bound for α(ρ)...... 79
v vi CONTENTS
6 Conclusion 85
A Nederlandse samenvatting 87
B Populariserende samenvatting 91 B.1 Oplossingen van vergelijkingen detecteren ...... 91
C Additional theorems 95 C.1 Diophantine approximation ...... 95 C.2 The Poisson summation formula ...... 99 C.3 The continuous wavelet transform ...... 100 List of symbols
Symbol Description N The set of natural numbers including zero, {0, 1, 2,...}. Z The set of integers. Q The set of rational numbers. R The set of real numbers. C The set of complex numbers. R× The unit group of a ring R. d | n d divides n. α α α α+1 p k n p exactly divides n; p | n and p - n. (n, m) The greatest common divisor of n and m. d(n) The number of positive divisors of n. ω(n) The number of distinct prime factors of n. µ(n) The M¨obiusfunction; µ(n) = (−1)ω(n) if n is square-free, µ(n) = 0 otherwise. ϕ(n) Euler’s totient function; the number of a, 1 ≤ a ≤ n for which (a, n) = 1. Λ(n) The von Mangoldt function; Λ(n) = log p if n = pα for some prime p, Λ(n) = 0 otherwise. (n) The unit function for Dirichlet convolution; (1) = 1 and (n) = 0 if n 6= 1. P ϑ(x) The first Chebyshev function; ϑ(x) = p≤x log p. P ψ(x) The second Chebyshev function; ψ(x) = n≤x Λ(n). P π(x) The prime counting function; π(x) = p≤x 1. indg a The index of a. If g is a primitive root mod q, and (a, q) = 1, then m indg a is the unique m mod ϕ(q) such that a ≡ g mod q. e(z) The complex exponential with period 1; e(z) = exp(2πiz). G(n, χ) The Gauss sum associated with n and the character χ mod q: q X G(n, χ) = χ(m)e(mn/q). m=1 q X cq(n) The Ramanujan sum: cq(n) = e(mn/q). m=1 (m,q)=1 q X amk S (q, a) The kth-power Gauss sum: e . k q m=1 q X amk + bm S (q, a, b) A variant of the kth-power Gauss sum: e . k q m=1
vii viii LIST OF SYMBOLS
Symbol Description M X Sf (M) Weyl sums; Sf (M) = e(f(m)). m=1 [x] The integer part of x; the unique integer such that [x] ≤ x < [x]+1. {x} The fractional part of x; {x} = x − [x]. dxe The ceiling function of x; the unique integer such that dxe − 1 < x ≤ dxe. kxk The distance from x to the nearest integer.
f(x) = O(g(x)) f(x) ≤ Cg(x) for some absolute constant C. f(x) = o(g(x)) lim f(x)/g(x) = 0. f(x) = Ω(g(x)) The negation of f(x) = o(g(x)). f(x) g(x) f(x) = O(g(x)). f(x) g(x) g(x) = O(f(x)), g non-negative. f(x) ∼ g(x) lim f(x)/g(x) = 1. f(x) g(x) f(x) g(x) and f(x) g(x).
Concerning the asymptotic notations: the range in which the inequalities or limits hold is usually clear from the context; if needed it will be specified, e.g.
f(x) = o(g(x)) as x → 0 means that f(x) lim = 0. x→0 g(x) If the implicit constant in the notation is not absolute, but depends on some additional parameter, this will be notated via a subscript, e.g.
f(x) k g(x) where f and g are functions which also depend on some additional parameter k, means that there is a constant Ck, only depending on k, such that f(x) ≤ Ckg(x).
We use the convention that for a complex number z, its argument arg z is a number in the interval ] − π, π].
We use the notation fˆ for the Fourier transform of a function f, defined as follows Z fˆ(y) = f(x)e−ixy dx. R
Sometimes (e.g. in summations), the expression min(X, 1/0) will occur; this will be taken to be X. Chapter 1
Introduction
We denote the complex exponential with period 1 by e, that is ∀z ∈ C : e(z) = exp(2πiz). This thesis is devoted to the study of exponential sums, namely sums of the form
N X e(f(n)), f real-valued n=1 and their applications. Except for some special cases, these sums cannot be evaluated explicitly. The first objective of this thesis is to study means to establish upper bounds for exponential sums which are sharper than the trivial bound
N X e(f(n)) ≤ N.
n=1
This requires proving that there occurs some cancellation within the sum, i.e. that the arguments of the exponentials, the f(n), do not all have the same value mod 1. This is generally a very difficult task, and proofs of such sharper bounds are often delicate and subtle. Quite often, the gains over the trivial estimate N are very small, for example N −δ for a tiny positive δ, or even smaller. These small gains can nonetheless have vast ramifications in applications. The second objective of this thesis is to investigate some of the applications of expo- nential sums in specific problems in number theory and analysis.
In Chapter 2, some general methods for estimating exponential sums will be inves- tigated. After some introductory examples in Section 2.1, we will prove some basic properties of character and Gauss sums in Section 2.2. These sums are ubiquitous in number theory. We devote special attention to quadratic Gauss sums and their gener- alisations, the kth-power Gauss sums, who will be treated in Section 2.3. In the last section of this chapter, Section 2.4, we will discuss Weyl sums, exponential sums of which the argument f is a polynomial, or more general, a smooth function. We will present two methods for obtaining estimates for them, Weyl’s method and Vinogradov’s method. One of the essential ingredients in Vinogradov’s method, Vinogradov’s mean value theorem, will also be treated.
The next chapters of the thesis are devoted to applications in number theory and analysis of the results obtained in Chapter 2.
1 2 CHAPTER 1. INTRODUCTION
In Chapter 3, we will explore an important technique in additive number theory, the Hardy-Littlewood. After a historical introduction and an outline of the general ideas in Section 3.1, we will consider its application to two problems: Waring’s problem (Section 3.2) and the ternary Goldbach problem (Section 3.3). In Chapter 4 we will deduce the Vinogradov-Korobov zero-free region for the Rie- mann zeta function ζ. Although this result is about sixty years old, it is to this day the asymptotically best zero-free region for ζ (but unfortunately nowhere near the hoped zero-free region which would follow from the Riemann Hypothesis). The implications for the error term in the Prime Number Theorem are also stated. Finally, Chapter 5 is devoted to the study of Riemann’s so called “non-differentiable” function. In particular, we will prove at which points it is differentiable and at which points it is not, and moreover we will determine the H¨olderexponent at every point.
Chapters 2–4 are the result of a literature study and are heavily based on the books by Vaughan [37], and Iwaniec and Kowalski [23]. Specifically, we list the primary sources per section in table 1.1. Chapter 5 contains some original results (or rather a new method of proving al- ready existing results), but still relies heavily on other works, especially the articles by Duistermaat [4] and Jaffard [24].
Table 1.1: Primary sources per section
Section Source 2.1 [29, Section 2] 2.2 [1, Chapter 8] 2.3 [37, Section 4.2] 2.4 [37, Sections 2.2, 5.1 and 5.2] and [23, Section 8.5] 3.2 [37, Chapter 4 and Section 5.3] 3.3 [37, Chapter 3] and [23, Sections 13.4 and 13.5] 4 [23, Section 8.5] 5.2 [4] 5.3 [4] and [24] Chapter 2
Exponential sums
2.1 Some elementary estimates
To get some intuition and a taste of exponential sums, we begin our exposition with some elementary estimates. First, we consider the simple but very important example of an exponential sum over an arithmetic progression. For these sums we have the following upper bound.
Proposition 2.1.1. Let α, β be real numbers and M a positive integer. Then
M X 1 e(αm + β) ≤ min M, , 2kαk m=1 where kαk denotes the distance from α to the nearest integer.
Proof. The upper bound M follows via estimating trivially via the triangular inequality. Upon using the formula for the partial sums of a geometric series, we see that
M X 1 − e(αM) 2 e(αm + β) = e(α + β) ≤ . 1 − e(α) 1 − e(α) m=1
Without loss of generality, we may replace α by kαk, since we may change the sign of
α and add an arbitrary integer to it without altering the value of 1 − e(α) . Using elementary trigonometry, 1 − e(kαk) = 2 sin(πkαk), which by Jordan’s inequality is bounded from below by (2/π)πkαk = 2kαk, since πkαk ∈ [0, π/2].
If one has M real numbers x1, . . . , xM , then one can picture the partials sums sm = e(x1) + ··· + e(xm) as part of a walk starting in 0, with step length 1, and where the th direction of the m step is determined by the fractional part {xm} of xm. Intuitively, if the values of the fractional parts of the xm are close together, the walk will be more or less in one general direction, and the resulting sum will be large. If the fractional parts are more evenly distributed mod 1, the direction of each step will appear more “random”, and we expect the resulting sum to be smaller1. This idea is captured in the following theorem, taken from [29].
1One can show that for a random walk with M steps in two dimensions the expected value of the square of the distance√ from the endpoint to the starting point is M, while the expected value of the π √ distance itself is ∼ M. 2
3 4 CHAPTER 2. EXPONENTIAL SUMS
Theorem 2.1.2 (Kusmin-Landau). Let x1, . . . , xM be real numbers and set δm = xm+1− xm. Suppose there is a positive ∆ such that ∆ < δ1 ≤ ... ≤ δM−1 < 1 − ∆. Then M X π∆ e(xm) ≤ cot . 2 m=1
Proof. Let zm = e(xm), wm = zm+1/zm = e(δm) and ρm = 1/(1 − wm). Then
M M−1 X X e(xm) = ρm(zm − zm+1) + zM . m=1 m=1 By partial summation this equals
M−1 X ρ1z1 + (ρm − ρm−1)zm + (1 − ρM−1)zM , m=2 so that M M−1 X X e(xm) ≤|ρ1| + |ρm − ρm−1| +|1 − ρM−1| .
m=1 m=2 If ρ = 1/(1 − w) and w = e(δ) with 0 < δ < 1, then ρ = (1 + i cot πδ)/2 and |ρ| = |1 − ρ| = 1/(2 sin πδ). Since cot is decreasing on [0, π], the above is
M−1 1 1 X 1 ≤ + (cot πδ − cot πδ ) + 2 sin πδ 2 m−1 m 2 sin πδ 1 m=2 M−1 1 1 1 = + cot πδ1 − cot πδM−1 + 2 sin πδ1 sin πδM−1 1 π∆ ≤ + cot π∆ = cot . sin π∆ 2
Corollary 2.1.3. Let f be a real-valued continuous function on [a, b], differentiable on
]a, b[ such that f 0 is increasing. Suppose that ∀x ∈ ]a, b[ : f 0(x) ≥ ∆ for some positive ∆. Then,
X 2 e(f(m)) ≤ . π∆ a≤m≤b Proof. Let N be an integer chosen so that ∀x ∈ ]a, b[ : N < f 0(x) < N + 1. Such an integer exists in view of Darboux’s theorem (f 0 has the intermediate value property). If we replace f(x) by f(x) − Nx, the sum is unchanged and ∆ ≤ f 0(x) ≤ 1 − ∆. We can apply the previous theorem with xm = f(m). We have δm = f(m + 1) − f(m) = 0 0 f (ξm) for some ξm ∈ ]m, m + 1[ by the mean value theorem. Since f is increasing, the hypothesis on the δm is indeed satisfied. The result follows by noting that cot y < 1/y for y ∈ ]0, π/2].
2.2 Characters and Gauss sums
We assume the reader is acquainted with the basics of (Dirichlet) characters, but we will restate the important properties. For an introduction to characters, see for example [1, Chapter 6] 2.2. CHARACTERS AND GAUSS SUMS 5
Definition 2.2.1. Suppose (G, ·) is a group. A character χ of G is a morphism of G to × the multiplicative group of the complex numbers: χ : (G, ·) → (C , ·). It is easily seen that for any character χ of G, χ(e) = 1, where e is the identity of G, and that for any element g ∈ G of finite order n, χ(g) is an nth root of unity. The character which maps every element to 1 is called the principal character, and is denoted by χ0.
Example 2.2.2. If G = hgi for some g of finite order n, then the characters of G are m given by χk : g 7→ e(km/n), for k = 0, . . . , n − 1.
Define Gˆ to be the set of all characters of G. We can make Gˆ into a group by defining the multiplication of two characters χ1, χ2 pointwise: (χ1 · χ2)(g) = χ1(g)χ2(g); the identity element of Gˆ is the principal character, and for a character χ, its inverse is given by the character 1/χ which maps g to 1/χ(g). The set Gˆ with this multiplication is called the character group of G. Using the above example, and the characterisation of finite abelian groups, it is easy to show that every finite abelian group is isomorphic to its character group. The following theorem is one of the main reasons why characters are so useful.
Theorem 2.2.3 (Orthogonality relations for characters). Suppose G is a finite abelian group of order n. Then for any χ ∈ Gˆ and for any g ∈ G we have ( X n if χ = χ0, χ(g) = 0 otherwise; g∈G ( X n if g = e, χ(g) = 0 otherwise. χ∈Gˆ
The orthogonality relations imply that the characters form an orthogonal basis for the vector space all functions f : G → C. Indeed, consider such a function f, and define its Fourier transform fˆ : Gˆ → C via 1 1 X fˆ(χ) = hf, χi = f(g)χ(g). |G| |G| g∈G X Then f = fˆ(χ)χ: χ∈Gˆ X X 1 X fˆ(χ)χ(g) = f(h)χ(h)χ(g) |G| χ∈Gˆ χ∈Gˆ h∈G X 1 X = f(h) χ(h−1g) = f(g). |G| h∈G χ∈Gˆ
Example 2.2.4. Suppose f is a periodic arithmetic function with period q. Then
q q X 1 X f(n) = fˆ(k)e(kn/q), where fˆ(k) = f(n)e(−nk/q). q k=1 n=1
(Here, the character group of (Z/qZ, +) is identified with the group (Z/qZ, +) itself.) 6 CHAPTER 2. EXPONENTIAL SUMS
In number theory, we are mainly interested in the so called Dirichlet characters. For a positive integer q, a Dirichlet character mod q is an arithmetical function χ such that × there exists a characterχ ˜ of the group of reduced residues mod q,(Z/qZ) , with the property that ( χ(n) =χ ˜(n + qZ) if (n, q) = 1, χ(n) = 0 if (n, q) > 1. + × A Dirichlet character is hence just the lift to N of a character of (Z/qZ) ; often the × distinction between a character of (Z/qZ) and a q-periodic completely multiplicative arithmetical function f with f(n) 6= 0 ⇐⇒ (n, q) = 1 will not be made, and they will be both referred to as a Dirichlet character. Dirichlet characters satisfy the orthogonality relations q ( ( X ϕ(q) if χ = χ0, X ϕ(q) if n ≡ 1 mod q, χ(m) = χ(n) = 0 otherwise; 0 otherwise. m=1 χ mod q Also, every q-periodic arithmetical function supported on the integers coprime with q can be expressed as a sum over Dirichlet characters. It is therefore useful to study sums involving Dirichlet characters. Definition 2.2.5. Suppose q is a positive integer. Given an integer n and a Dirichlet character χ mod q, the Gauss sum G(n, χ) associated with n and χ is given by q X G(n, χ) = χ(m)e(mn/q). m=1 From the definition it is obvious that G(n, χ) is periodic in n with period q. The Gauss sum G(n, χ) can be viewed as the inner product of the multiplicative character χ with the additive character m 7→ e(−mn/q), and hence as a discrete analog of the gamma function.
2.2.1 Ramanujan sums We will first consider a special case of the Gauss sums, the Ramanujan sums.
Definition 2.2.6. Suppose q and n are positive integers. The Ramanujan sum cq(n) is the Gauss sum associated with the principle character mod q: q X cq(n) = G(n, χ0) = e(mn/q). m=1 (m,q)=1
Lemma 2.2.7. For fixed n, cq(n) is a multiplicative function of q.
Proof. Suppose (q1, q2) = 1. By the Chinese remainder theorem, for each m mod q1q2 there are unique m1 mod q1, m2 mod q2 such that m = m1q2 + m2q1. Furthermore, (m, q) = 1 ⇐⇒ (mi, qi) = 1. Therefore, q1 q2 X X (m1q2 + m2q1)n cq1q2 (n) = e q1q2 m1=1 m2=1 (m1,q1)=1 (m2,q2)=1 q1 q2 X m1n X m2n = e e = cq1 (n)cq2 (n). q1 q2 m1=1 m2=1 (m1,q1)=1 (m2,q2)=1 2.2. CHARACTERS AND GAUSS SUMS 7
Theorem 2.2.8. X µ(q/(q, n)) c (n) = dµ(q/d) = ϕ(q). q ϕ(q/(q, n)) d|(n,q)
Proof. Write for the characteristic function of {1}. Then = µ∗1, and we can eliminate the condition (m, q) = 1 via this relation:
q q q X X X X cq(n) = e(mn/q) = ((m, q))e(mn/q) = e(mn/q) µ(d) m=1 m=1 m=1 d|(m,q) (m,q)=1 q/d X X X X = µ(d) e(kdn/q) = µ(d)q/d = dµ(q/d). d|q k=1 d|q d|(n,q) (q/d)|n
Evaluating in prime powers pl yields:
0 if pl−1 n, - l−1 l−1 cpl (n) = −p if p k n, pl − pl−1 if pl | n µ(pl/(pl, n)) = ϕ(pl). ϕ(pl/(pl, n))
Since both cq(n) and µ(q/(q, n))ϕ(q)/ϕ(q/(q, n)) are multiplicative and they coincide on prime powers, they are equal.
2.2.2 Separable Gauss sums and primitive characters
We now consider the Gauss sum G(n, χ) mod q and suppose (n, q) = 1. Then we can write
q q X X G(n, χ) = χ(m)e(mn/q) = χ(n)χ(mn)e(mn/q) m=1 m=1 = χ(n)G(1, χ). (2.1)
In the last step we used that mn runs over a complete residu system mod q whenever m does, since n is invertible mod q. This nice property is called separability.
Definition 2.2.9. The Gauss sum G(n, χ) mod q is called separable if G(n, χ) = χ(n)G(1, χ).
Since χ(n) = 0 whenever (n, q) > 1, G(n, χ) is separable for every n if and only if G(n, χ) = 0 whenever (n, q) > 1. The absolute value of these Gauss sums is easily determined.
√ Theorem 2.2.10. Suppose G(n, χ) is separable for every n. Then G(1, χ) = q. 8 CHAPTER 2. EXPONENTIAL SUMS
Proof.
q 2 X G(1, χ) = G(1, χ)G(1, χ) = G(1, χ) χ(m)e(−m/q) m=1 q q q X X X = G(m, χ)e(−m/q) = χ(k)e(m(k − 1)/q) m=1 m=1 k=1 q q X X = χ(k) e(m(k − 1)/q) = χ(1)q = q. k=1 m=1
Next, we would like to determine which characters give rise to (everywhere) separable Gauss sums.
Lemma 2.2.11. Let χ be a Dirichlet character mod q and suppose G(n, χ) 6= 0 for some n with (n, q) > 1. Then there exists an integer d with d | q, d < q such that
χ(a) = 1 whenever (a, q) = 1 and a ≡ 1 mod d.
Proof. Let k = (n, q) > 1 and d = q/k. Now suppose a is a natural number with (a, q) = 1 and a ≡ 1 mod d. Since a is invertible mod q, am runs over a complete residue system mod q whenever m does. Hence
q X G(n, χ) = χ(am)e(amn/q) = χ(a)G(an, χ). m=1 Since a ≡ 1 mod d, a = 1 + bd = 1 + bq/k for some b. Now anm/q = nm/q + bnm/k ≡ nm/q mod 1, since k | n. Therefore,
G(n, χ) = χ(a)G(an, χ) = χ(a)G(n, χ), and since G(n, χ) 6= 0, χ(a) = 1.
Definition 2.2.12. Suppose χ is a Dirichlet character mod q.
• A positive divisor d of q is called an induced modulus of χ if
χ(a) = 1 whenever (a, q) = 1 and a ≡ 1 mod d.
• The smallest induced modulus of χ is called the conductor of χ.
• χ is called primitive if its conductor equals q, i.e. if it has no induced modulus smaller than q.
Remark 2.2.13.
• 1 is an induced modulus for χ if and only if χ is the principal character. Therefore, every non-principal character modulo a prime p is primitive.
• One can show that, if d is an induced modulus for χ, there is a character ψ mod d which induces χ, in the sense that χ = ψχ0, where χ0 is the principal character mod q. If d is the conductor, then ψ can be taken primitive mod d. 2.2. CHARACTERS AND GAUSS SUMS 9
Lemma 2.2.11 implies the following theorem. Theorem 2.2.14. Suppose χ is a primitive Dirichlet character. Then G(n, χ) is sepa- √ rable for every n, and so G(1, χ) = q. Remark 2.2.15. • The converse of the above theorem also holds, if G(n, χ) is separable for every n, then χ is primitive.
• For general Gauss sums, there is the following theorem (see for example [30, page 290]). Let χ be a character mod q induced by the primitive character ψ modulo its conductor d. Put r = q/(q, n). If d - r, then G(n, χ) = 0, and if d | r, then ϕ(q) G(n, χ) = ψ(n/(q, n))ψ(r/d)µ(r/d) G(1, ψ). ϕ(r)
2.2.3 Quadratic Gauss sums Definition 2.2.16. Let q, a, b, k be positive integers with (a, q) = 1. The exponential sums Sk(q, a) and Sk(q, a, b) are defined as:
q X amk S (q, a) = e , k q m=1 q X amk + bm S (q, a, b) = e . k q m=1 If the exponent k is clear from the context, the subscript k will be omitted in the notation for these sums. In this subsection, we will determine the values of these sums for k = 2, which is a classical result due to Gauss. Definition 2.2.17. • Suppose p is an odd prime. The Legendre symbol is defined as 1 if p - a and a is a quadratic residue mod p, a = −1 if p - a and a is a quadratic non-residue mod p, p 0 if p | a.
α1 αN • Suppose q is an odd integer, q = p1 ··· pN for odd primes pi. The Jacobi symbol is defined as a a α1 a αN = ··· . q p1 pN Since the Legendre and Jacobi symbol coincide when they are both defined, we do not make a distinction in the notation. Notice also that when p 6= 2 is prime
a p−1 ≡ a 2 mod p. p It is easy to see that the Jacobi symbol for fixed q is a Dirichlet character mod q: it is q-periodic, completely multiplicative and supported on the integers coprime with q. It is in fact a quadratic character: a non-principal character whose square is principal. 10 CHAPTER 2. EXPONENTIAL SUMS
Lemma 2.2.18. Suppose p is an odd prime and a is an integer with p - a. Then a S(p, a) = S(p, 1). p m Proof. The number of solutions of x2 ≡ m mod p equals 1 + . Therefore p
p X m · S(p, a) = 1 + e(am/p) = G a, p p m=1 a · a = G 1, = S(p, 1), p p p where we used the fact that (a, p) = 1 and the separability of G(a, χ) for any non- principal character χ mod p (see Theorem 2.2.14).
We now determine the value of the quadratic Gauss sums in the case a = 1. Theorem 2.2.19. Suppose a and q are positive integers with at least one of them even, and define the exponential sum (remark the 2 in the denominator)
q X am2 S0(q, a) = e . 2q m=1 Then r q 1 + i S0(q, a) = √ S0(a, q). a 2 Proof. We present an analytic proof given in [30, Chapter 9]. Define ( e(ax2/(2q)) for 1/2 < x < q + 1/2, f(x) = 0 otherwise.
By the Poisson summation formula (C.2.1),
K X S0(q, a) = lim fˆ(2πk). K→+∞ k=−K For non-zero k,
Z q+1/2 ax2 Z q+1/2 −1 0 ax2 fˆ(2πk) = e − kx dx = e(−kx) e dx 1/2 2q 1/2 2πik 2q −1 1 Z q+1/2 2πia ax2 1 = O(1) + e(−kx) xe dx q,a , 2πik 2πik 1/2 q 2q |k| by integration by part. By completing the square, ax2 a k2q − kx = (x − kq/a)2 − , 2q 2q 2a and by the change of variables u = (x − kq/a)/q, we have that
k2q Z 1+1/(2q)−k/a fˆ(2πk) = qe − e(aqu2/2) du. 2a 1/(2q)−k/a 2.2. CHARACTERS AND GAUSS SUMS 11
Since at least one of a and q is even, if k ≡ r mod a, then qk2 ≡ qr2 mod 2a, so we can group the residues mod a in the sum over k:
K a [K/a] X X −qr2 X Z 1+1/(2q)−m−r/a fˆ(2πk) = q e e(aqu2/2) du + O (1/K). 2a q,a k=−K r=1 m=−[K/a] 1/(2q)−m−r/a
When K → +∞, the latter sum converges to
Z 1 1 + i e(aqu2/2) du = √ √ , R aq 2 which is called the Fresnel integral. This is a “classical” improper integral which can be evaluated via contour integration and residue calculus. We get eventually:
r q 1 + i S0(q, a) = √ S0(a, q). a 2
By taking a = 2, we obtain a famous result of Gauss:
Corollary 2.2.20 (Gauss). √ q if q ≡ 1 mod 4, q 2 X m 0 if q ≡ 2 mod 4, S(q, 1) = e = √ q i q if q ≡ 3 mod 4, m=1 √ (1 + i) q if q ≡ 0 mod 4.
Consider two distinct odd primes p1 and p2. It is easy to verify (see also Lemma 2.3.4 below) that S(p1p2, 1) = S(p1, p2)S(p2, p1). By evaluating this quadratic Gauss sum in two different ways, using the above corollary and Lemma 2.2.18, we get a quick proof of the famous reciprocity law for the Legendre symbol:
p p 2 1 = (−1)(p1−1)(p2−1)/4. p1 p2
The above theorem also implies a reciprocity law for the quadratic Gauss sum S(q, a).
Corollary 2.2.21. If a is odd, then
r q 1 + i S(q, a) = (1 + e(−qa/4))S(a, q). a 2
Proof.
r r 2a 2 q 1 + i q 1 + i X qm S(q, a) = S0(q, 2a) = S0(2a, q) = e − . a 2 a 2 4a m=1
We split the last sum in a sum over even and odd m. For the odd m, observe that (m + 2a)2 ≡ m2 mod 4a, so we may sum over the odd numbers in any complete residu 12 CHAPTER 2. EXPONENTIAL SUMS system mod 2a. Choosing the complete residu system {a + 1, a + 2,..., 3a} and using the fact that a is odd, we get:
2a a a X qm2 X q(2m)2 X q(2m + a)2 e − = e − + e − 4a 4a 4a m=1 m=1 m=1 a a X qm2 X qm2 = e − + e − e(−qa/4) a a m=1 m=1 = S(a, q)(1 + e(−qa/4)).
Theorem 2.2.22. Suppose a and q are positive integers with (a, q) = 1. For odd n, define ( 1 if n ≡ 1 mod 4, εn = i if n ≡ 3 mod 4.
Then a √ εq q if q is odd, q S(q, a) = 0 if q ≡ 2 mod 4, q √ (1 + i)ε q if q ≡ 0 mod 4. a a
We have all the essential ingredients for the proof, but it will be postponed to Sub- section 2.3.2 since it makes use of a lemma (Lemma 2.3.6) which also holds for general exponents k.
Finally, we are left to treat the sums S(q, a, b), but these can be easily related to the sums S(q, a) as follows. Suppose first that b ≡ 2b0 mod q for some b0. Then we can complete the square and we get:
q X a(m + a−1b0)2 a−1b02 a−1b02 S(q, a, b) = e e − = e − S(q, a). q q q m=1
Here, a−1 is the multiplicative inverse of a mod q. If there is no such b0, then q is even and b odd. In this case we have
2q 2q X a(2m + a−1b)2 X a(2m)2 S(4q, a) = e + e 4q 4q m=1 m=1 a−1b2 = 2e S(q, a, b) + 2S(q, a), 4q since 2m + a−1b runs over all odd residues mod 4q when m runs over {1,..., 2q}. There- fore, 1 a−1b2 e − S(4q, a) if q ≡ 2 mod 4, S(q, a, b) = 2 4q 0 if q ≡ 0 mod 4. 2.3. kth-POWER GAUSS SUMS 13
2.3 kth-power Gauss sums
Throughout this section, we fix an integer k > 1.
2.3.1 kth-powers modulo pl We will first examine the distribution of kth-powers modulo a prime power pl. The following theorem about the structure of the unit group is well known (see for example [1, Chapter 10]). Theorem 2.3.1. • For the prime 2, we have × × × Z =∼ 1, Z 2 =∼ Z , Z l =∼ Z × Z l−2 for l ≥ 2 . 2Z 2 Z 2Z 2 Z 2Z 2 Z An explicit isomorphism for the latter case is
× l−2 a b l Z × Z l−2 → Z l : (a + 2 , b + 2 ) 7→ (−1) 5 + 2 . (2.2) 2Z 2 Z 2 Z Z Z Z • For the odd primes, we have × Z l =∼ Z l−1 . p Z p (p − 1)Z