Research Collection
Doctoral Thesis
Primes of the shape x² + ny² The distribution on average and prime number races
Author(s): Ditchen, Jakob J.
Publication Date: 2013
Permanent Link: https://doi.org/10.3929/ethz-a-010138958
Rights / License: In Copyright - Non-Commercial Use Permitted
This page was generated automatically upon download from the ETH Zurich Research Collection. For more information please consult the Terms of use.
ETH Library Diss. ETH No. 21502
Primes of the shape x2 + ny2 The Distribution on Average and Prime Number Races
A dissertation submitted to ETH Zürich for the degree of Doctor of Sciences
Presented by Jakob Johann Ditchen
Dipl.-Math. techn. Universität Karlsruhe (TH) Certificate of Advanced Study in Mathematics University of Cambridge
Born on 10 November 1982 Citizen of Germany
Accepted on the recommendation of Prof. Dr. Emmanuel Kowalski Examiner Prof. Dr. Özlem Imamoglu Co-examiner Prof. Dr. Philippe Michel Co-examiner
2013
Abstract
This thesis focuses on uniformities and discrepancies in the distribution of prime numbers represented by positive definite integral binary quadratic forms of various discriminants. We prove results of Bombieri–Vinogradov and Barban–Davenport–Halberstam type on the average distribution of the primes with respect to their representability by these forms. Our results imply that the corresponding prime number theorem holds uniformly and with a non-trivial error term for almost all negative fundamental discriminants in long ranges. Moreover, we investigate a variant of “Chebyshev’s bias” between primes of the shapes x2 + ny2 and x2 + my2 for certain distinct positive integers n and m.
Deutsche Zusammenfassung
Die vorliegende Dissertation befasst sich mit Gleichmäßigkeiten und Diskrepanzen in der Verteilung von Primzahlen, die durch positiv definite, ganzzahlige binäre quadratische Formen mit unterschiedlichen Diskriminanten darstellbar sind. Wir beweisen Varianten des Satzes von Bombieri-Vinogradov sowie des Satzes von Barban-Davenport-Halberstam und zeigen so, dass der betreffende Primzahlsatz, von höchstens „wenigen“ Ausnahmen abgesehen, für negative Fundamentaldiskrimi- nanten in langen Intervallen ein gleichmäßiges und nicht-triviales Restglied aufweist. Des Weiteren untersuchen wir für gewisse Paare (n, m) positiver ganzer Zahlen eine Diskrepanz zwischen den Verteilungen von Primzahlen der Form x2 + ny2 und solchen der Form x2 + my2; dies stellt ein Gegenstück zu einer klassischen Beobach- tung von Tschebyscheff bezüglich Primzahlen der Formen 4k + 1 und 4k + 3 dar, die in den letzten Jahren intensiv untersucht wurde.
Binäre quadratische Formen, das heißt homogene Polynome der Gestalt
2 2 f(x, y) = ax + bxy + cy (x, y ∈ Z) mit ganzzahligen Koeffizienten a, b und c, sind neben arithmetischen Folgen die einfachsten Polynome von denen bekannt ist, dass jedes von ihnen unendlich viele Primzahlen darzustellen vermag, sofern dem nicht Kongruenzbeziehungen der Koeffizienten offensichtlich entgegenstehen. Die analytische Theorie der Darstellung von Primzahlen durch fest gewählte binäre quadratische Formen ist ähnlich gut erforscht wie jene der Primzahlen in fest gewählten arithmetischen Folgen. Es ist hingegen nur wenig darüber bekannt, wie sich diese Eigenschaften im Durchschnitt über mehrere binäre quadratische Formen unterschiedlicher Diskriminante verhalten oder wie sie im Vergleich zweier verschiedener Formen von einander abweichen – andererseits existieren zahlreiche solcher Resultate für Primzahlen in arithmetischen Folgen. Für Primzahlen in arithmetischen Folgen wurden für die erstgenannte Art von Problemen mittels des Großen Siebs ab den 1960er Jahren Resultate erzielt, die es in Anwendungen häufig ermöglichen, auf den Gebrauch der verallgemeinerten Riemannschen Vermutung zu verzichten. Die bekanntesten dieser Ergebnisse sind der Satz von Bombieri-Vinogradov sowie der Satz von Barban-Davenport-Halberstam. Diese zeigen zum einen, dass das Restglied im Primzahlsatz für arithmetische Folgen im Durchschnitt dem durch die Riemannsche Vermutung vorhergesagten entspricht; dabei wird der Durchschnitt über die Moduln der arithmetischen Folgen im selben Bereich betrachtet, in welchem die Riemannsche Vermutung nicht-triviale Ergebnisse liefert. Zum anderen konnte gezeigt werden, dass der mittlere quadratische Fehler im Primzahlsatz sehr klein ist, wenn sowohl über die Moduln als auch über deren Restklassen gemittelt wird; der hier zulässige Bereich für die Moduln übersteigt dabei sogar den durch die Riemannsche Vermutung kontrollierten Bereich. In der vorliegenden Arbeit werden analoge Resultate für positiv definite, ganzzahlige binäre quadratische Formen gefunden: Sei X eine große, positive Zahl. Für die Anzahl der Primzahlen p 6 X, für welche – bei gegebener ganzer Zahl n – ganze Zahlen x und y existieren, so dass p sich in der Form p = x2 + ny2 schreiben lässt, zeigen wir insbesondere, dass der zugehörige Primzahlsatz für die quadratfreien, positiven ganzen Zahlen n ≡ 1 (mod 4) unterhalb von etwa X1/10 gleichmäßig in n gilt – abgesehen von höchstens „wenigen“ Ausnahmen. Allgemeiner beweisen wir konkret, dass für alle A > 0 eine Konstante B = B(A) existiert, so dass für alle ε > 0 die Beziehung
X0 li(X) 1/2 −A max π(X; q, C) − ε,A Q X(log X) C∈K(q) e(C)h(q) q>−Q
10+ε −B gilt, falls Q 6 X(log X) ist. Hierbei bezeichnet π(X; q, C) die Anzahl der Primzahlen p 6 X, welche durch die quadratischen Formen der Formenklasse C aus der Formenklassen- gruppe K(q) zur Diskriminante q darstellbar sind, h(q) bezeichnet die Klassenzahl zu dieser Diskriminante, li steht für das logarithmische Integral und e(C) ist eine von der Klasse abhän- gige Konstante; die Summe auf der linken Seite läuft über negative Fundamentaldiskriminanten q > −Q mit q 6≡ 0 (mod 8). Ferner zeigen wir, dass das Restglied im Primzahlsatz für positiv definite binäre quadra- tische Formen im quadratischen Mittel über sowohl Fundamentaldiskriminanten als auch die zugehörigen Formenklassen in einem größeren Bereich klein ist: Für alle A > 0 existiert eine Konstante B = B(A), so dass für alle ε > 0 die Beziehung
2 X0 X li(X) 1/2 2 −A π(X; q, C) − ε,A Q X (log X) e(C)h(q) q>−Q C∈K(q)
3+ε −B gilt, falls Q 6 X(log X) ist. Beide Ergebnisse erreichen nicht die Stärke der oben genannten Resultate für arithmetische Folgen. Dies ist unter anderem auf den Umstand zurückzuführen, dass es uns lediglich gelingt eine schwächere Version einer Ungleichung zum Großen Sieb für komplexe Klassengruppen- charaktere zu finden, welche für Ergebnisse dieses Typs unentbehrlich scheint. Während die bisher genannten Ergebnisse sich mit der Untersuchung von Uniformität in der Verteilung von durch arithmetische Folgen respektive binäre quadratische Formen darstellba- ren Primzahlen befassen, ist die Frage nach Diskrepanzen in diesen Verteilungen nicht minder interessant. Tschebyscheff bemerkte bereits, dass die Anzahl der Primzahlen in der Folge 4k + 1 unterhalb einer gegebenen Zahl meist kleiner ist als diejenige in der Folge 4k + 3. Dem Prim- zahlsatz zufolge sind beide Anzahlen asymptotisch gleich, so dass die Ursachen dieser „Vorliebe“ der Primzahlen für die zweite Folge nicht offensichtlich sind. Erst in den letzten Jahren wurde diese Diskrepanz in allgemeiner Form für arithmetische Folgen eingehend untersucht. Wir untersuchen in dieser Arbeit einen ähnlichen Effekt, der sich für die Anzahl der Prim- zahlen der Form x2 + ny2 und solche der Form x2 + my2 unterhalb einer gegebenen Zahl offenbart, wenn sich die zugehörigen Diskriminanten der beiden Formen in der Klassenzahl zwar gleichen – und somit, nach dem Primzahlsatz, auch das asymptotische Verhalten der Verteilun- gen übereinstimmt –, sie sich in der Anzahl ihrer ungeraden Primfaktoren aber unterscheiden. Von den großen Meistern wie Fermat, Euler, Gauß und Dirichlet wurde Primzahlen der Form x2 + ny2 mindestens ebenso viel Beachtung geschenkt wie Primzahlen der Form a + nk. Und noch de la Vallée Poussin bewies in seiner Arbeit zum Primzahlsatz diesen nicht nur in der gewöhnlichen Form und in der Form für arithmetische Folgen, sondern ebenfalls gleich in der Form für positiv definite binäre quadratische Formen. Wiewohl Primzahlen, die durch binäre quadratische Formen darstellbar sind, seither immer wieder prominent in Erscheinung getreten sind – beispielsweise als wichtiger Bestandteil bestimmter Faktorisierungsalgorithmen –, haben die Untersuchungen bezüglich Gleichmäßigkeiten und Diskrepanzen in ihrer Verteilung jedoch bei weitem nicht mehr dieselbe Aufmerksamkeit erhalten wie die entsprechende Forschung zu Primzahlen in arithmetischen Folgen, die sich häufig einfacher gestaltet. Die vorliegende Arbeit möchte einen Beitrag dazu leisten, diese Lücke dereinst zu schließen. Meinen Eltern gewidmet
Contents
Primes of the shape x2 + ny2: The Distribution on Average and Prime Number Races
Preface 1 Notation 5
1 Primes represented by positive definite binary quadratic forms 7
1.1 The composition of binary quadratic forms and form classes ...... 9
1.2 Algebraic methods for arithmetic objects ...... 11
1.3 The Chebotarev density theorem and conditional results ...... 17
2 The average distribution of primes represented by positive definite binary quadratic forms with varying discriminant 21
2.1 Mean-value results for primes in arithmetic progressions ...... 22
2.2 A large sieve inequality for complex ideal class group characters ...... 25
2.3 Results of Bombieri–Vinogradov type ...... 34
2.4 The mean square distribution ...... 57
2.5 Applications and open questions ...... 63
3 Chebyshev’s bias and prime number races for binary quadratic forms 71
3.1 Bias in the distribution of primes in arithmetic progressions ...... 72
3.2 Primes represented by different classes of forms with a fixed discriminant . . . . 74
3.3 Prime number races for forms of the shape x2 + ny2 ...... 76
Bibliography 89
Preface 1
Preface
Apart from arithmetic progressions, integral binary quadratic forms – that is, homogeneous polynomials of the shape 2 2 f(x, y) = ax + bxy + cy (x, y ∈ Z) with integral coefficients a, b and c – are the simplest polynomials which are known to represent infinitely many prime numbers unless there is an obvious obstacle by means of a common prime divisor of the coefficients. Analytic questions about prime numbers which are representable by any fixed binary quadratic form have been studied almost as extensively as analytic questions about prime numbers in arithmetic progressions. There is, however, not much known about the average behaviour of these representation properties when averaged over binary quadratic forms of distinct discriminants or about differences in these properties between two distinct binary quadratic forms. In contrast, there exist plenty of comparable results for prime numbers in arithmetic progressions. Results for questions on “uniformity on average” in the prime number theorem for arithmetic progressions of various moduli have been achieved by means of the Large Sieve since the 1960s. In applications they often allow to dispense with the assumption of the Generalized Riemann Hypothesis (GRH). The Bombieri–Vinogradov theorem and the Barban–Davenport–Halberstam theorem are the most famous and important of these results: The first one shows that the error term in the prime number theorem for arithmetic progressions is small – as small as predicted by GRH – for all reduced residue classes, “on average” over moduli in about the same range of moduli in which GRH yields non-trivial results. The second theorem shows that the mean square of the error term is small if one averages over both moduli and their reduced residue classes; here the admissible range for the moduli even exceeds the range that may be controlled by GRH. In this dissertation we find analogous results for positive definite integral binary quadratic forms. For all large positive numbers X, we show that the prime number theorem for primes 2 2 p 6 X of the shape p = x + ny holds, with at most “few” exceptions, for almost all squarefree positive integers n ≡ 1 (mod 4) up to about X1/10 with a uniform, small remainder term. In fact, we prove more generally that for all A > 0 and all ε > 0, there exists a constant B = B(A) such that
X0 li(X) 1/2 −A max π(X; q, C) − ε,A Q X(log X) C∈K(q) e(C)h(q) q>−Q 10+ε −B if Q 6 X(log X) . Here π(X; q, C) denotes the number of primes p 6 X which are representable by quadratic forms lying in the form class C of the form class group K(q) of discriminant q; the corresponding class number is h(q), the logarithmic integral is denoted by li, and e(C) is a constant which depends on the form class only; the sum on the left-hand side is over negative fundamental discriminants q > −Q with q 6≡ 0 (mod 8). Furthermore, we show that the mean square of the remainder term in the prime number theorem for positive definite binary quadratic forms is small in a longer range – if we average over both the discriminants and the corresponding form classes: For all A > 0 and all ε > 0, there exists a constant B = B(A) such that the bound 2 X0 X li(X) 1/2 2 −A π(X; q, C) − ε,A Q X (log X) e(C)h(q) q>−Q C∈K(q)
3+ε −B holds if Q 6 X(log X) . 2 Preface
Both statements do not reach the strength of the aforementioned theorems for arithmetic progressions. Among other reasons, this paucity is due to the fact that we only succeed to find a weaker version of a large sieve inequality for complex ideal class group characters, which seems to be essential for results of this type.
The results that we have mentioned so far have been concerned with the examination of uniformities in the distributions of primes in arithmetic progressions and primes which may be represented by binary quadratic forms. The question on discrepancies in these distributions is scarcely less interesting. Chebyshev already noticed that the number of primes below a given bound and lying in the progression 4k + 1 is usually smaller than the number of primes lying in the progression 4k + 3. The reason for this “bias” is not obvious as, by the corresponding prime number theorem, the cardinalities of both sets are asymptotically equal. It has only been recently that this discrepancy has been analysed in a more general setting. We examine a similar “bias” which shows itself between primes of the shape x2 + ny2 and primes of the shape x2 +my2 when the discriminants of these forms have the same class number but a different number of distinct odd prime divisors.
The old masters like Fermat, Euler, Gauß and Dirichlet paid at least as much regard to primes of the form x2 + ny2 as to primes of the form a + nk. Likewise de la Vallée Poussin, in his seminal work on the Prime Number Theorem, not only proved it in the ordinary form for all prime numbers but he also gave the proofs of the prime number theorem for arithmetic progressions as well as the prime number theorem for positive definite binary quadratic forms in the same work. Since then, prime numbers that can be represented by specific binary quad- ratic forms have seen many important applications, e.g. as ingredients of certain factorization algorithms, to name only one example. However, research on uniformities and discrepancies in the distribution of such prime numbers has received much less attention than the corresponding research on prime numbers in arithmetic progressions, which often proves to be easier. This thesis aims to provide a modest step towards closing this gap.
Outline Chapter 1 serves as an introduction to the fundamental results of the theory of binary quadratic forms and the representability of primes by such forms of a fixed discriminant. We start the chapter with some historical remarks and give a review of the primary notions of this theory. The basis of the theory of binary quadratic forms was mainly established by Gauß in his seminal Disquisitiones Arithmeticae. In particular, he fleshed out the definition of equivalence classes of binary quadratic forms and the theory of composition of these forms (which both have their origins in the work of Lagrange). In Section 1.1, we give a brief review of these concepts, which led to the first results by Dirichlet, Weber and de la Vallée Poussin on the number of primes that are representable by any given binary quadratic form. In the subsequent Section 1.2, we summarize the relation between binary quadratic forms and ideals in quadratic fields. This relationship has been essential for most analytic investigations on primes and binary quadratic forms. In particular, Landau’s improvement of de la Vallée Poussin’s prime number theorem for binary quadratic forms, which we also state in this section, is built on this connection and our own results in Chapters 2 and 3 will also rely on it. We pick up this topic in Section 1.3, in which we state the Chebotarev density theorem and relate it to the prime number theorem for binary quadratic forms. The emphasis is laid on conditional results that depend on appropriate versions of the Generalized Riemann Hypothesis; they will be used in Chapter 3. Preface 3
In Chapter 2, the main part of this thesis, we investigate the uniformity of the distribution of primes “in” form classes (i.e., with respect to the representability by these classes) when the discriminant of the classes varies over negative fundamental discriminants q 6≡ 0 (mod 8) and we demonstrate that good error terms in the corresponding prime number theorem hold “on average” (however, we do not achieve the above-mentioned conditional error terms). We start with a review of analogous results for primes in arithmetic progressions in Section 2.1. The first original results are proved in Section 2.2, in which we find a large sieve inequality for complex ideal class group characters. This inequality lies at the heart of the subsequent two sections. Our restriction to fundamental discriminants q with q 6≡ 0 (mod 8) in this chapter is mainly due to the cumbersome proof that this large sieve inequality would require for more general discriminants. In Section 2.3, we prove results of Bombieri–Vinogradov type for the counting function for primes represented by binary quadratic forms as well as similar results for smooth versions of appropriate Chebyshev functions and for special subsets of negative fundamental discriminants. The results for the latter functions and sets show an interesting feature in that we end up with more than the usual saving of just a power of a logarithm over “trivial” bounds. We also notice that the results may be improved if we assume the Lindelöf Hypothesis for Rankin–Selberg convolutions of holomorphic cusp forms of weight one. The second type of uniformity results are of Barban–Davenport–Halberstam type; they are the topic of Section 2.4. We show that general arithmetic functions exhibit an “average behaviour” with respect to the representability of integers by form classes – for most form classes to most discriminants in long ranges – if the functions satisfy Siegel–Walfisz conditions for both arithmetic progressions and form classes (and an additional technical condition). Two applications and an outlook on possible extensions and generalizations are provided in Section 2.5. The first application deals with the question about the size of the least primes that are representable by binary quadratic forms of a given discriminant. The second application is a uniformity result for integers of the form k = x2 + ny2 which are the product of two primes that are representable by forms of discriminant −4n.
In spite of these uniformity results, frequency discrepancies still occur between primes of the shapes x2 +ny2 and x2 +my2 for distinct positive integers n and m – even when their frequencies show the same asymptotic behaviour. These discrepancies are the subject of the original results of Chapter 3, which can be considered as a counterpoint to the results of Chapter 2. Before we come to the new results, the so-called Chebyshev bias (or prime number race) for primes in arithmetic progressions is reviewed in Section 3.1. Many important results on this topic have been found only recently. Previous research has also investigated an analogous bias in the distribution of prime ideals in distinct ideal classes of a fixed imaginary quadratic field; we look at the corresponding results in Section 3.2. Due to the close relation between ideal classes and form classes which we have mentioned above, some of these results may be interpreted as prime number races for primes represented by binary quadratic forms in different form classes of the same discriminant. Finally, in Section 3.3, we demonstrate that there exists a bias in the distribution of primes of the shapes x2 +ny2 and x2 +my2 when −4n and −4m are negative fundamental discriminants with a different number of odd prime divisors but the same class number. Similarly to most other recent results in comparative prime number theory, our results are conditional on the Generalized Riemann Hypothesis and a linear independence hypothesis for the zeros of certain class group L-functions and Dirichlet L-functions. In the proofs of the results of this section – unlike the results of Chapter 2 – almost no major new difficulties arise that would require a significant deviation from the proofs of results on Chebyshev’s bias for arithmetic progressions; 4 Preface thus, after the initial setting of the scene, we mostly only stress the differences that occur and do not repeat the proofs in full detail. We close the section with a list of questions and possible extensions that could be investigated in future work.
About this Thesis The research described in this dissertation was performed in the Department of Mathematics (D-MATH) at ETH Zürich between December 2008 and October 2013, and was supervised by Professor Dr. Emmanuel Kowalski. Chapter 1 as well as Sections 2.1, 3.1 and 3.2 are of an expository nature and do not contain any original results. The work presented in the remaining sections of this dissertation is original. It is influenced primarily by the following earlier works:
• The proof of the large sieve inequality in Section 2.2 is based on a similar result of Duke and Kowalski [DK00].
• The proof of the results in Section 2.3 follows roughly Gallagher’s method of proof for the original Bombieri–Vinogradov theorem as presented in [Bom87].
• The proof of the results in Section 2.4 follows roughly the method of proof for the original Barban–Davenport–Halberstam theorem and its generalizations as described in [IK04], for example.
• The proofs of the results in Section 3.3 largely parallel some of the proofs of results on Chebyshev’s bias in [RS94] and [FM13].
Jakob J. Ditchen Zürich, Autumn 2013
Acknowledgements I am greatly indebted to Professor Emmanuel Kowalski for suggesting the problems from which this dissertation arose and for patiently guiding me with valuable advice. I would also like to thank Professor Özlem Imamoglu and Professor Philippe Michel for accepting to examine this thesis. Many colleagues made the time that I spent at the Department of Mathematics at ETH Zürich a very pleasant and inspiring one, for which I thank them. Zutiefst dankbar bin ich schließlich auch meiner geliebten Andrea, die mir in ihrer solch wundervollen und unterstützenden Art stets zur Seite steht, Martha und Izabela, die mir großartige Schwestern sind, und meinen lieben Eltern, die mir den Weg zur Mathematik sowie der Kunst des Lernens aufgezeigt haben und mich seit all den Jahren in einzigartiger Weise unterstützen. Notation 5
Notation
We list here the main notation, symbols and assumptions that will be used throughout this thesis; many of them are standard in (analytic) number theory. The letters m and n denote positive integers, q denotes a negative discriminant, q0 and q00 denote fundamental discriminants, s denotes a complex number, Q and X denote positive real numbers and C denotes a form class when used as parameters or arguments of functions in the definitions below.
Symbol Meaning ϕ(n) the Euler totient function µ(n) the Möbius function π(X) the number of primes p 6 X Z X 1 li(X) the logarithmic integral, i.e. li(X) = dt log t Z 2 · ds the integral on the line Re(s) = c in the complex plane (c) Γ(s) the gamma function Λ(n) the von Mangoldt function ψ(X) the Chebyshev function, i.e. ψ(X) = X Λ(n) n6X τ(n) the number of positive integer divisors of n ω(n) the number of prime divisors of n counted without multiplicity (m, n) the greatest common divisor of m and n [m, n] the least common multiple of m and n D the set of all negative discriminants, i.e. the set of all negative integers q satisfying q ≡ 0 (mod 4) or q ≡ 1 (mod 4) F the set of all negative fundamental discriminants; see (1.3) F the set of all negative fundamental discriminants q 6≡ 0 (mod 8) F(Q) the set of negative fundamental discriminants q ∈ F with |q| 6 Q Fex(Q) the subset of exceptional discriminants in F(Q); see Section 2.3.2 K(q) the form class group of binary quadratic forms with discriminant q C0(q), C0 the form class which contains the principal form with discrimin- ant q, i.e. the identity element of K(q) R(q, C) the set of all positive integers which can be represented by all forms f of the form class C ∈ K(q) √ Oq the ring of integers of Q( q) O(q) the order of discriminant q in an imaginary quadratic field K; it equals Oq if q ∈ F (which we will usually assume; see below) I(q) the group of invertible fractional O(q)-ideals, i.e. the group of invertible finitely generated O(q)-submodules of K P (q) the subgroup of principal fractional O(q)-ideals H(q) the quotient I(q)/P (q), i.e. the ideal class group of the order O(q) Bq the bijection K(q) → H(q) given in Lemma 1.4 h(q) the class number for the discriminant q, i.e. h(q) = |K(q)| = |H(q)| Z(q) the set of non-zero integral O(q)-ideals N(a) the norm of the ideal a ∈ Z(q), i.e. the size of the quotient ring O(q)/a (the dependence on q is suppressed) Hb(q) the dual group of H(q), i.e. the set of (ideal) class group characters (q) χ0 the trivial character in Hb(q) 6 Notation
P λχ(n) the sum a∈Z(q) χ(a) for χ ∈ Hb(q); here and throughout we set N(a)=n χ(a) = χ(C), where C ∈ H(q) is the ideal class of a ∈ I(q) q0 0 χq0 the Kronecker symbol ( · ) for the fundamental discriminant q , i.e. the primitive real Dirichlet character modulo |q0| if q0 6≡ 0 (mod 8) χq0,q00 the real (ideal) class group character arising from the convolution of the Dirichlet characters χq0 and χq00 ; see (2.69) and (2.98) L(s, λχ) the L-function for the class group character χ L(s, χ) the L-function for the Dirichlet character χ (it may also denote a class group L-function in the displayed sums in Section 3.3) π(X; m, n) the number of primes p 6 X with p ≡ n (mod m) π(X; q, C) the number of primes p 6 X such that for every form f in C ∈ K(q) there exist x, y ∈ Z satisfying f(x, y) = p ψ(X; q, C) the corresponding Chebyshev function ψk(X; q, C) the smoothed and weighted Chebyshev function which is defined in equation (2.22) e(C) the constant which equals 1 if the form class C has order > 3 in K(q) and equals 2 otherwise κ(q) the number of form classes C in K(q) with e(C) = 2, i.e. the number of ambiguous classes; see equation (1.2) w(C, n) the number of ideals a ∈ Bq(C) with N(a) = n; see Remark 1.8 ν a divisor frequency associated to a subset of F; see Definition 2.12 π0(X; n) the number of primes p 6 X such that there exist x, y ∈ Z satisfying p = x2 + ny2
Complex numbers are generally denoted by s = σ + it with σ, t ∈ R. Non-trivial zeros of L-functions are generally denoted by ρ = β + iγ with β, γ ∈ R. When used as a variable, the letter p will always denote a rational prime and the Fraktur letter p will always denote a prime ideal (of a ring which will be clear from the context).
Asymptotic notation: For arithmetic functions f and g, we write f(x) = O(g(x)), or equivalently f(x) g(x), when there is an absolute constant c such that |f(x)| 6 cg(x) for all values of x under consideration. We usually write f(x) = Oα(g(x)) or f(x) α g(x) if the constant depends on some para- meter α; we may suppress such dependencies if they are sufficiently clear from the context. We write f(x) g(x) if lim f(x)/g(x) = 1, and f(x) = o(g(x)) when lim f(x)/g(x) = 0. ∼ x→∞ x→∞ General assumptions on binary quadratic forms and discriminants: All binary quadratic forms in this thesis are assumed to be integral, primitive and positive definite; in particular, all discriminants of forms, orders and fields are negative integers q which satisfy either q ≡ 0 (mod 4) or q ≡ 1 (mod 4). In addition, all discriminants which will appear after Remark 1.7 – in particular, all discriminants in Chapter 2 and Chapter 3 – are assumed to be fundamental discriminants, i.e. each of these discriminants q is assumed to satisfy either (a) q ≡ 1 (mod 4) with q squarefree, or
q q (b) q ≡ 0 (mod 4) with 4 ≡ 2 or 3 (mod 4) and 4 squarefree. Moreover, none of the discriminants in Chapter 2 will be a multiple of 8. In Chapter 3, we will only consider forms of the shape x2 + ny2 for positive integers n; note that the discriminant of x2 +ny2 is a negative fundamental discriminant if and only if n 6≡ 3 (mod 4) and n is squarefree. Chapter 1
Primes represented by positive definite binary quadratic forms
The history of questions about prime numbers of the shape x2 + ny2 probably starts with Fermat, who stated the following three assertions in letters to Mersenne in 1640 and to Pascal in 1654: For all odd primes p,
2 2 ∃ x, y ∈ Z : p = x + y if and only if p ≡ 1 (mod 4), 2 2 ∃ x, y ∈ Z : p = x + 2y if and only if p ≡ 1 or 3 (mod 8), (1.1) 2 2 ∃ x, y ∈ Z : p = x + 3y if and only if p = 3 or p ≡ 1 (mod 3).
Fermat claimed to have proofs for all these statements, but there is no evidence that this was indeed the case. It took about one hundred years before Euler actually provided complete proofs for Fermat’s assertions (see [Cox97, §1.1] and the references there). Similar statements for primes of the shape x2 + ny2 for particular values of n > 3 were conjectured by Euler and most of them were later proved by Lagrange and Gauß by means of quadratic reciprocity. However, it slowly became clear that congruence relations like these could not exist for all positive integers n, even less for arbitrary binary quadratic forms. Only the evolution of the theory of the composition of forms and form classes as well as the development of ideal theory and the ensuing relation between classes of quadratic forms and ideal classes in quadratic fields opened up the possibility to find conditions for the representability of primes by general binary quadratic forms and statements for the frequency of such primes. Before going into details in the upcoming sections, let us recall the basic definitions and fix certain assumptions concerning binary quadratic forms: An integral binary quadratic form is a homogeneous polynomial of the shape
f(x, y) = ax2 + bxy + cy2 in two variables over the ring of rational integers. Such a form is called primitive if the coefficients a, b and c share no common prime divisor; all forms in this thesis will be assumed to be integral and primitive. We say that the binary quadratic form f represents an integer m if there exist integers x and y such that f(x, y) = m. The discriminant of such a binary quadratic form is defined to be q = b2 − 4ac; note that this implies q ≡ 0 or 1 (mod 4). If q = 0, then the form can only represent squares of integers. If q > 0, then the form can represent both positive and negative integers; such a form is called indefinite. If q < 0, then the form can represent either only negative integers or only positive integers (depending on the sign of the coefficient a); the form is accordingly called negative definite or positive definite. 8 Primes represented by positive definite binary quadratic forms
In this thesis we shall only deal with binary quadratic forms of negative discriminant (in fact, only with positive definite forms, since we are interested in the representability of primes) as the theory of these forms is considerably simpler than the theory of forms of positive discriminants. The difference is basically due to the fact that, if the discriminant is negative, then the equation k = ax2 + bxy + cy2 (for fixed integers k, a, b and c) is the equation of an ellipse, which contains only finitely many lattice points; it is, however, the equation of a hyperbola, which contains an infinite number of lattice points, if the discriminant is positive. We quote Gauß: Formae vero determinantium positivorum, quae tractationem prorsus peculiarem requirunt, commentationi alteri reservatae manere debebunt1 and we also hope that the questions which are examined in this thesis for forms of negative dis- criminant will be investigated for forms of positive discriminant in another work. Consequently, we will also state all classical results in this chapter for positive definite binary quadratic forms only – regardless of the possible existence of analogous results for indefinite forms. In order to avoid excessive repetitions of these assumptions, we adopt the following convention: Convention: Whenever we will say form or quadratic form or binary quadratic form in this thesis, we will mean a positive definite integral primitive binary quadratic form.
This introductory chapter intends to give a brief account of the basic, mostly classical results in the prime number theory for binary quadratic forms of a fixed discriminant. It is organized as follows: In the next section we start with a review of the theory of composition of form classes of binary quadratic forms, which we basically owe to Dirichlet, but which has its origins already in the works of Lagrange and Gauß who composed forms instead of form classes. It was exactly this transition that was essential to answer a broad variety of questions on the representability of integers by such forms and made it possible to prove general qualitative as well as the first quantitative statements on the infinitude of primes that are representable by any given primitive binary quadratic form. The convenience to work with classes became even clearer after the introduction of the theory of “ideal numbers” by Kummer and its development by Dedekind. The relation between form classes and ideal classes is exhibited in Section 1.2 and the resulting more precise quantitative prime number theorem is given. Advances in class field theory (in the shape of the Chebotarev density theorem) and in analytic number theory led to even better results, conditional and unconditional ones, which are covered in Section 1.3. There exist plenty of excellent books and papers that present many of these topics in a much more detailed and more elaborated way than it would be possible and appropriate to provide here. The author profited particularly from the books Primes of the form x2 + ny2 [Cox97], Zetafunktionen und quadratische Körper [Zag81] and The shaping of arithmetic after C. F. Gauss’s Disquisitiones arithmeticae [GSS07] while writing this introductory chapter. We would like to end this opening section by reminding the reader that binary quadratic forms have found important applications in cryptography, which we cannot discuss here. Extensive accounts of the underlying algorithms can be found in [BV07], [Bue89] and [Coh93], for example.
1“The forms with positive determinant [discriminant], which require a special treatment, must remain reserved for other studies.”; quoted from the introduction to Gauß’s article De nexu inter multitudinem classium, in quas formae binariae secundi gradus distribuuntur, earumque determinantem, which can be found in the second volume of his collective works as well as in the cumulative German translation Untersuchungen über höhere Arithmetik of his number-theoretic works. Similarly, de la Vallée Poussin stated that “tandis que l’extension se fait naturellement à ces dernières [i.e., forms with negative discriminant], les formes de déterminant positif exigent une analyse beaucoup plus compliquée” in the third part [dlVP96] of his Recherches analytiques sur la théorie des nombres premiers in which he proved the prime number theorem for positive definite binary quadratic forms; he provided the analysis for indefinite forms one year later, in the fourth part [dlVP97] of his work. 1.1 The composition of binary quadratic forms and form classes 9
1.1 The composition of binary quadratic forms and form classes
Gauß’s Disquisitiones Arithmeticae, published in 1801, are widely regarded as the beginning of modern number theory. And about half of this work is devoted to the theory of binary quadratic forms. Building on former definitions and results from Lagrange’s Recherches d’Arithmétique, Gauß revealed here the importance and depth of the notion of equivalence of binary quadratic forms and of the way in which these forms can be composed. Edwards [GSS07, §II.2.1] notes that one of the purposes for Gauß to present the theory of composition in its full generality was to give another proof of quadratic reciprocity – a simpler proof than the one he gave in an earlier part of the Disquisitiones. However, in order to be able to derive the laws and properties of the composition for arbi- trarily given forms, long and complicated computations are necessary and the resulting com- position is not even a binary operation. Later, Dirichlet “simplified” Gauß’s composition by forfeiting the capability of composing arbitrary forms but contented himself with the ability to compose certain forms which are equivalent to the given ones. Thus, his composition of forms is really a composition of equivalence classes of forms. Yet, this kind of composition was sufficient for his questions on the representability of numbers by binary quadratic forms – and therefore it is also sufficient for the questions we will be concerned with in this thesis. We now describe this method of composing equivalence classes of binary quadratic forms. First of all, we must say what we mean by equivalence of quadratic forms: Two binary quadratic forms f and g are (properly) equivalent if there exists an element ! r s ∈ SL(2, ) t u Z such that f(x, y) = g(rx + sy, tx + uy) for all x, y ∈ Z. A short calculation shows that equivalent forms have the same discriminant. Moreover, it can be shown that equivalence of binary quadratic forms is indeed an equivalence relation and the number of equivalence classes – which we call form classes – is finite for any given discriminant of forms; see [Cox97, §2], for example. We denote the set of form classes of forms with discriminant q by K(q) and its cardinality by h(q). The main importance of this classification lies in the following fact (see [Zag81, §8], for example): Equivalent forms represent the same numbers. Therefore, we may define the set R(q, C) = n ∈ Z | ∀f ∈ C ∃x, y ∈ Z : f(x, y) = n for every negative integer q ≡ 0, 1 (mod 4) and every form class C ∈ K(q). We proceed to the definition of the composition of equivalence classes: Let
f(x, y) = ax2 + bxy + cy2 and g(x, y) = a0x2 + b0xy + c0y2
0 b+b0 be two forms of negative discriminant q and assume that the coefficients a, a and 2 have no common prime divisor. Then the (Dirichlet) composition F of f and g is the form
B2 − q F (x, y) = aa0x2 + Bxy + y2, 4aa0 where B is the unique integer modulo 2aa0 such that
B ≡ b (mod 2a),B ≡ b0 (mod 2a0) and B2 ≡ q (mod 4aa0). 10 Primes represented by positive definite binary quadratic forms
See [Cox97, Lemma 3.2] for a proof of the uniqueness of B. Now, one can show (see the references given in [Cox97, §3]) that this composition of special forms induces a well-defined binary operation on K(q) and turns it into an abelian group – the form class group of discriminant q – with order h(q) and the following identity element and inverses: Given a negative integer q ≡ 0, 1 (mod 4), the principal form of discriminant q is defined by q x2 − y2 if q ≡ 0 (mod 4), 4 1 − q x2 + xy + y2 if q ≡ 1 (mod 4). 4 The form class containing the principal form is the identity element of the class group K(q) and it is called its principal class; we denote it by C0(q) (or simply by C0 if the corresponding discriminant is clear from the context). The inverse of the class which contains the form ax2 + bxy + cy2 is the class which contains the form ax2 − bxy + cy2. We say that a class is an ambiguous class if its order in K(q) is at most 2; any form in an ambiguous class is called an ambiguous form. Switching the attention from quadratic forms to their equivalence classes led to the advent of the first results on the number of primes that may be represented by any given form. Dirichlet already sketched a proof of the infinitude of primes representable by certain binary quadratic forms in 1840, but it was Weber [Web82] who gave the first complete proof that held for all primitive forms:
Theorem 1.1 (Weber). Every positive definite integral primitive binary quadratic form repres- ents infinitely many primes.
De la Vallée Poussin is best known for his proof of the Prime Number Theorem (which was independently proved by Hadamard at about the same time). This work [dlVP96] is even more remarkable if one recalls that it not only contains the proofs of the ordinary prime number theorem and the corresponding one for primes in arithmetic progressions, but he also proved there:
Theorem 1.2 (De la Vallée Poussin). Let π(X; q, C) be the number of primes p 6 X that may be represented by forms in the form class C of the form class group of the negative discriminant q. Then li(X) π(X; q, C) = · 1 + o(1) as X → ∞, e(C)h(q) where e(C) = 2 if C is an ambiguous class and e(C) = 1 otherwise.
The original proofs of these theorems were quite long-winded. In the next section, we will see how the groups K(q), which consist of arithmetic objects, can be linked to groups of algebraic objects, which turn out to be more convenient to work with. This link led to shorter and more precise forms of the above statements and will also be the basis of our results.
Remark 1.3. There exists another natural classification of binary quadratic forms, which is also due to Gauß: We say that two forms of discriminant q lie in the same genus if they represent ∗ the same values in (Z/qZ) . Equivalent forms are always in the same genus, but the converse is usually not true. The most important properties are:
(a) All genera of forms of discriminant q consist of the same number of form classes. If this number is 1 and q = −4n for some positive integer n, then there exists a congruence condition like (1.1) which characterizes the primes of the shape x2 + ny2. 1.2 Algebraic methods for arithmetic objects 11
(b) The genus containing the principal form is called the principal genus. It consists of the squares in the form class group.
(c) Let q be the discriminant of a positive definite form. The number of genera of forms of discriminant q is given by 2ω(q)−1 if q is odd, ω(q)−2 q 2 if q is even and 4 ≡ 1, 5 (mod 8), κ(q) = q (1.2) 2ω(q)−1 if q is even and ≡ 2, 3, 4, 6, 7 (mod 8), 4 ω(q) q 2 if q is even and 4 ≡ 0 (mod 8),
where ω(q) denotes the number of distinct prime divisors of q. Both the number of ambiguous classes in K(q) and the index of the subgroup (K(q))2 in K(q) are also equal to κ(q).
The proofs of these properties can be found in [Cox97, §3]. We will come across genera and the numbers κ(q) in Chapter 3, but for the most part of this thesis form classes will be more important for us.
1.2 Algebraic methods for arithmetic objects
Kummer, in his endeavours to find a way to compensate the lack of unique factorization in the rings of integers of cyclotomic number fields, introduced the notion of “ideal numbers” in 1847. According to [GSS07, §II.2.1], Kummer was led to the definition of equivalence classes of these numbers by the way Gauß had partitioned binary quadratic forms into classes. The intimate relation to binary quadratic forms persisted when Dedekind generalized Kummer’s concept and introduced the language of ideals. Before we can state this connection explicitly, we have to fix the notation that we will use for certain notions of algebraic number theory: Every quadratic number field K can be written √ uniquely in the form Q( q) for a squarefree integer q 6= 0, 1. Its discriminant dK is given by dK = q if q ≡ 1 (mod 4) and by dK = 4q otherwise. The union of the set of integers which are discriminants of quadratic fields and {1} is called the set of fundamental discriminants. We will denote the set of all negative fundamental discriminants by F, i.e. F = d ∈ Z | d < 0, d ≡ 1 (mod 4) and d is squarefree (1.3) d ∪ d ∈ Z | d < 0, d ≡ 0 (mod 4) and 4 6≡ 1 (mod 4) is squarefree . Furthermore, the set of all negative integers q ≡ 0, 1 (mod 4), i.e. the set of negative discriminants of quadratic forms, will be denoted by D. Let q ∈ D. Then there exist a unique positive integer r (which is called the conductor of q) and a fundamental discriminant q ∈ F such that q = r2q . Moreover, there exists a unique √ 0 0 order of discriminant q in Q( q0): Recall that an order O in a quadratic field K is any subring of K containing 1 such that O is a finitely generated Z-module that contains a Q-basis of K. For example, the ring of integers of K is always an order in K and, in fact, the maximal one. The discriminant of any order O in K is the product of the square of the index of O in the ring of integers times the discriminant of the field. For any given discriminant q ∈ D of a binary √ quadratic form, we will only be interested in the order of discriminant q in ( q ), which we √ Q 0 denote by O(q); the ring of integers of Q( q0) will be denoted by Oq. Note that O(q) equals Oq if q ∈ F. 12 Primes represented by positive definite binary quadratic forms
For all q ∈ D we define: • I(q), the group of invertible fractional O(q)-ideals, i.e. the group of invertible finitely √ generated O(q)-submodules of Q( q0); note that fractional O(q)-ideals are usually not ring ideals of O(q);
• P (q), the subgroup of principal fractional O(q)-ideals;
•H(q), the quotient I(q)/P (q), i.e. the ideal class group of the order O(q);
• Z(q), the set of non-zero integral O(q)-ideals;
• N(a), the norm of the ideal a ∈ Z(q), i.e. the size of the quotient ring O(q)/a (the dependence on q is suppressed).
The algebraic properties of all these objects are explained in [Cox97, §5 and §7], for example. At this point, we just recall that H(q) is always a finite abelian group and say, analogously to the notion for form classes, that an ideal class K ∈ H(q) is ambiguous if K = K−1 in H(q). Binary quadratic forms and ideal classes are linked through the following result, which is due to Dedekind:
Lemma 1.4 (Dedekind). For every negative discriminant q, there exists an isomorphism
Bq : K(q) → H(q)
2 2 which is induced by the map that sends the binary√ quadratic form f(x, y) = ax + bxy + cy to −b+ q the ideal of O(q) that is generated by a and 2 . In particular, we have h(q) = |K(q)| = |H(q)|.
Moreover, a positive integer m is represented by the positive definite binary quadratic forms in the class C ∈ K(q) if and only if there exists an ideal a ∈ Bq(C) such that N(a) = m. A proof can be found in [Cox97, Theorem 7.7], for example. This relation helped to drive forward the development of algebraic number theory thanks to the extensive theory Gauß had created on the arithmetic side of this bijection. On the other hand, it turned out that many statements on binary quadratic forms can be proved in a simpler way by using the amenities of the algebraic side.
Remark 1.5. A major drawback of orders is the fact that they are usually not Dedekind domains (i.e., the factorization is not unique at the level of ideals); on the other hand, the ring of integers Oq is always a Dedekind domain. However, it turns out that this problem is not a severe one for most questions on primes represented by binary quadratic forms. In fact, if 2 q ∈ D with q = r q0, where q0 ∈ F and r is a positive integer, we let Ir denote the group of fractional Oq-ideals a satisfying a + rOq = Oq. Moreover, let Pr denote the subgroup of Ir generated by principal ideals of the form αOq such that α ∈ Oq satisfies α = a (mod rOq) for some integer a with (a, r) = 1. Then one can show (see [Cox97, §7]) that there exists an isomorphism Beq : H(q) → Ir/Pr. In particular, it follows with Lemma 1.4 that a positive integer m satisfying (m, r) = 1 is represented by the positive definite binary quadratic forms in C ∈ K(q) if and only if there exists an Oq-ideal a ∈ Bbq(C) := Beq(Bq(C)) such that |Oq/a| = m. We state the resulting qualitative information about the representability of integers by binary quadratic forms of a given discriminant in a more explicit way: 1.2 Algebraic methods for arithmetic objects 13
Proposition 1.6. Let q be a negative discriminant and let C ∈ K(q) be a form class. Write 2 q = r q0, where r is the conductor of q and q0 ∈ F.
(a) Let p be a prime which does not divide r. Then there exist x, y ∈ Z and a binary quadratic form f of discriminant q such that f(x, y) = p if and only if either
2 (i) p ramifies in Oq, i.e. there exists a prime ideal p in Oq with pOq = p ; in this case p may be represented by forms of the class C if and only if p ∈ Bbq(C) and p is then representable by forms of the class C only; or
(ii) p splits in Oq, i.e. there exist distinct prime ideals p1, p2 in Oq with
pOq = p1p2.
In this case p may be represented by forms of the class C if and only if p1 ∈ Bbq(C) −1 or p2 ∈ Bbq(C); in particular, p1 ∈ Bbq(C) if and only if p2 ∈ (Bbq(C)) , i.e. p is representable exactly by forms of the classes C and C−1.
(b) Let n be a positive integer which is coprime to r and let
Y αi Y βj Y γk Y δ` n = pi rj sk t`
be its prime factorization, where the first product is taken over all primes which split in Oq and are representable by forms of ambiguous classes, the second product is taken over all primes which split in Oq and are representable by forms of non-ambiguous classes, the third product is taken over all primes which remain prime in Oq and the fourth product is taken over all primes which ramify in Oq.
Denote the class which represents the prime pi by Cpi , the classes which represent the −1 2 prime rj by Crj and Crj , the class which represents the square sk by Csk (which must
always be the principal class) and the class which represents the prime t` by Ct` (which must always be an ambiguous class). Then n is representable by forms of discriminant q if and only if all the exponents γk are even and it is then representable by exactly the classes of the form
Y αj Y βj −vj Y Y Y Y βj −vj Y C C Cγk/2 Cδ` = Cαi C Cδ` pi rj sk t` pi rj t`
for all tuples (vj) of integers vj ∈ {0, 2,..., 2βj}.
Remark 1.7. Most of the classical results that we will present in this chapter are known to hold for both non-fundamental and fundamental discriminants. It should also be possible to prove many of the original results in Chapter 2 and Chapter 3 in a general form for both kinds of discriminants – along the lines of the proofs that we will give for fundamental discriminants only. However, we believe that the amount of additional technical details that are usually necessary for general proofs – due to the peculiarities of the square factors of non-fundamental discriminants – would often eclipse the main arguments. From now on, we will therefore restrict our attention to fundamental discriminants. In particular, we may henceforth always assume that O(q) = Oq. 14 Primes represented by positive definite binary quadratic forms
Remark 1.8. If a negative fundamental discriminant q and a form class C ∈ K(q) are given and we want to estimate a sum of the form
X g(n) n6X n∈R(q,C) for some arithmetic function g, then Proposition 1.6 allows us to equivalently estimate the sum g(n) X X , v(C, n) n6X a∈Bq(C) N(a)=n where the weight function v(C, n) accounts for the fact that, in general, the number of ideals a ∈ Bq(C) with norm n does not equal 1. Thus, if there exists in Bq(C) an ideal a with norm n, then v(C, n) is the number of ideals a ∈ Bq(C) with N(a) = n. For further use, we also set
w(C, n) = X 1. (1.4)
a∈Bq(C) N(a)=n
Note that v(C, n) remains undefined if there is no ideal a ∈ Bq(C) with N(a) = n, while w(C, n) = 0 in this case; thus, we have w(C, n) = 0 if and only if n∈ / R(q, C), by Lemma 1.4. Using Proposition 1.6, we may also give an expression for w(C, n) which does not use the language of ideals but only the language of form classes: We have
Y Y βj −vj Y Y w(C, n) = (v ): C = Cαi C Cδ` · (α + 1); (1.5) j pi rj t` i the second factor arises from the (αi + 1) possibilities when choosing the prime ideals which lie over each split prime pi that is representable by forms of an ambiguous class. Remark 1.9. The arithmetic functions we will be most interested in are the characteristic function for the set of rational primes and (smooth versions of) the von Mangoldt function. For q all (positive or negative) fundamental discriminants q, let χq denote the Kronecker symbol ( · ) (see [IK04, §3.5] or [MV07, §9.3] for the explicit definition); it equals the unique primitive real Dirichlet character modulo |q| if q 6≡ 0 (mod 8) (there are two primitive real Dirichlet characters if q ≡ 0 (mod 8)). For each rational prime p, the number of solutions m (mod p) to m2 ≡ q (mod p) equals 1 + χq(p) and one can easily show (see [Cox97, Proposition 5.16], for example):
2 • If χq(p) = 0, i.e. if p divides q, then p ramifies in O(q), i.e. pO(q) = p for some prime ideal p of O(q) and N(p) = p;
• if χq(p) = 1, then p splits in O(q), i.e. pO(q) = p1p2 for two distinct prime ideals p1, p2 of O(q) and N(p1) = N(p2) = p;
• if χq(p) = −1, then p remains prime in O(q), i.e. pO(q) = p is a prime ideal in O(q) and N(p) = p2. Consequently, Proposition 1.6 implies that, if n = p` for a prime p and a positive integer ` and if n can be represented by the forms in the class C ∈ K(q), then 6 ` + 1 if χq(p) = 1, ` w(C, n) = w(C, p ) = 1 if χq(p) = 0, (1.6) = 1 if χq(p) = −1 (and ` must be even). 1.2 Algebraic methods for arithmetic objects 15
Only a small set of primes ramifies in O(q). Thus, if the number w(C, p) is positive, it will usually be given by w(C, p) = 2 if C is ambiguous, and w(C, p) = 1 otherwise. For further use, we therefore put (2 if C is ambiguous, e(C) = (1.7) 1 if C is not ambiguous. Note that we thus have
X X X X X X w(C, p) = e(C) 1 − 1 = (1 + χq(p)). (1.8) C∈K(q) p6X C∈K(q) p6X p6X p6X p∈R(q,C) p|q
In Chapter 2, we will be interested in questions of uniformity. That is, given a large real number X and a “reasonable” arithmetic function g, we would like to know whether there exists an estimate for X g(n) (1.9) n6X n∈R(q,C) which is uniform in (i.e., independent of) the choice of the form class C ∈ K(q) and the error term of which is also uniform in the choice of the discriminant q. Thus, one would instinctively expect a “reasonable” function to be a function which shows no obvious reason to favour any classes or discriminants; the sum in (1.9) should, for any specific form class, therefore not differ much from the average over all C ∈ K(q) of such sums, for all q in some large range. However, the distinct behaviour of ambiguous and non-ambiguous classes in their capability to represent primes, which is evident from Proposition 1.6 and the remarks above, shows that we usually cannot expect estimates that are independent of the given form class. Nevertheless, if g is the characteristic function for the set of prime numbers, for example, we may still hope that a “uniformity up to the factors e(C)” holds. Due to the close relation between forms and ideals, we may then ask the same question on uniformity for sums of the type X X ge(n) (1.10) n6X a∈C N(a)=n for (fundamental) discriminants q, ideal classes C ∈ H(q) and arithmetic functions ge. It turns out that there is less reason to expect a significant dependence on the given class here.2 Ana- lytic methods also often tend to cooperate better with algebraic objects like ideals than with arithmetic objects like quadratic forms. Hence, chances are better to estimate a sum like (1.10) g(n) for a function ge(n) which is “usually” close to v(C,n) (see Remark 1.8), then translate the result by means of the bijection Bq to an estimate for (1.9) and hope that the term “usually” indeed means “sufficiently often” in order to give an additional error term which is small (and still uniform). We will often benefit from this procedure. Leaving the uniformity in q aside (which will be the topic of Chapter 2), there exist classical results which give uniformity in C ∈ H(q) in (1.10) for certain functions ge. For constant functions we have:
2Note, however, that this is basically only true if the inner sum in (1.10) is only over ideals which are products of prime ideals that lie over split primes: By Remark 1.9, prime ideals that lie over ramified primes or over primes that remain prime in O(q) may only be contained in an ambiguous class or in the principal class, respectively. Conveniently, these prime ideals are rare (or have a relatively large norm) and are therefore negligible for most of our considerations. 16 Primes represented by positive definite binary quadratic forms
Theorem 1.10 (The ideal theorem for ideal classes). Let q ∈ F with |q| > 4 and let C ∈ H(q). Then X X πX 1/3 1 = + Oq X ). p|q| n6X a∈C N(a)=n
This was proved by Landau in 1918; see [Nar04, §7.4.13] and the references there.
Of even greater interest to us, when ge is the characteristic function for the set of primes, we have li(X) √ X X 1 = + O Xe−c log X (1.11) h(q) q p6X a∈C prime N(a)=p for all q ∈ F, all C ∈ H(q) and a constant c = c(q) > 0. This is a consequence of:
Theorem 1.11 (The prime ideal theorem for ideal classes). Let q ∈ F and let C ∈ H(q). Then there exists a constant c = c(q) > 0 such that
li(X) √ X 1 = + O Xe−c log X , (1.12) h(q) q p∈C N(p)6X where the sum on the left side is over prime ideals of O(q) only.
Landau proved a general version of this statement with a weaker error term in 1907. The version above was shown by him in 1918 and the error term has been only slightly improved since then; see [Nar04, §7.2 and §7.4.12] and the references there. Note that the left sides of (1.11) and (1.12) may only differ by the prime ideals which lie over rational primes that remain prime in O(q) (and this may only happen if C is the principal class). By Remark 1.9, the norm of these prime√ ideals is the square of the respective rational primes. Thus, their contribution is less than X and therefore negligible in (1.11). From (1.11) and our previous remarks, we easily derive:
Theorem 1.12 (Landau’s prime number theorem for binary quadratic forms). Let q ∈ F and let C ∈ K(q). Then there exists a constant c = c(q) > 0 such that
li(X) √ π(X; q, C) := X 1 = + O Xe−c log X . e(C)h(q) q p6X prime p∈R(q,C)
Indeed, whenever a prime ideal of O(q) lies over a split prime p in Bq(C), there are, by Proposition 1.6 and Remark 1.9, exactly e(C) prime ideals lying over p in Bq(C). Thus,
e(C)π(X; q, C) = X X 1 + O(log |q|), p6X a∈C N(a)=p where the error term takes into account potential prime ideals which lie over ramified primes, i.e. whose norm divides q; note that there are ω(q) log |q| such prime ideals. This error term is, of course, negligible in Theorem 1.12. Therefore, Theorem 1.12 follows from (1.11).3
3It should be remarked that Landau [Lan14] gave a direct proof of Theorem 1.12 already in 1914 – without using ideals and even with an absolute constant c. 1.3 The Chebotarev density theorem and conditional results 17
Remark 1.13. As for the distribution of all integers representable by forms in a given form class C ∈ K(q), Bernays [Ber12] proved – without using ideals – that there exists a constant b(q), which does not depend on C, such that
b(q)X X X 1 = + O (1.13) (log X)1/2 a,q (log X)1/2+a n6X n∈R(q,C)