ASYMPTOTICS FOR THE PARTITION FUNCTION VIA THE CIRCLE METHOD
DAVID SCHWEIN
1. Introduction A partition of a natural number n is a multiset of natural numbers that sum to n. For instance, 7=1+2+4=2+4+1 are two (identical) partitions of 7. The partition function p(n) counts the numbers of partitions of n. The first few values of this function are tabulated below.
Table 1. Values of the Partition Function
n 0 1 2 3 4 5 6 7 8 9 10 p(n) 1 1 2 3 5 7 11 15 22 30 42
Our goal is to understand the asymptotic behavior of the partition function. P n Let F (x) = n≥0 p(n)x be the (power series) generating function of the partition function. We can turn information about F (x) into information about p(n) because the partition function can be recovered from its generating function:
1 Z F (x) (1) p(n) = n+1 dx, r < 1. 2πi |x|=r x Perhaps the most important information about F (x) is that it factors as an infinite product of rational functions: Y F (x) = (1 − xk)−1. k≥1 This identity follows from the definition of the partition function by expanding the factor (1 − xk)−1 as a power series, then multiplying out the product. Since the poles of the factor (1 − xk)−1 are the kth roots of unity, we can think of F (x) as having a pole (of “infinite order”) at every root of unity. In reality, of course, F (x) has no poles: it does not admit an analytic continuation to any connected open subset properly containing the unit disc. To get a sense for how F (x) behaves, consider the partial product
N Y k −1 FN (x) = (1 − x ) . k=1
At each kth root of unity with k ≤ N, the rational function FN (x) has a pole of multiplicity bN/kc; there are no other poles. When N ≥ n, the power series expansions of F (x) and FN (x) have the same coefficient n of x , namely p(n). Consequently, replacing F by FN in the integral (1) leaves its value unchanged. After making this replacement we can dilate the contour of integration past the unit circle, increasing the n+1 radius r in (1) to be larger than 1. This has the effect of subtracting the residues of all poles of FN (x)/x on the unit disc. The resulting contour integral actually vanishes, as can be seen by estimating its size in
Date: 15 November 2017. 1 2 DAVID SCHWEIN
terms of r and letting r tend to infinity. We conclude that the partition function is related to the residues of FN (x) by the equation N X X Res(FN , ω) p(n) + = 0, ωn+1 k=1 ω∈µk × where µk ⊂ C is the set of kth roots of unity. (Alternatively, this equation is a consequence of the fact that the residues of the poles of a meromorphic function on the Riemann sphere sum to zero, applied to the n+1 rational function FN (x)/x .) The obvious next step is to simplify the residues in the expression above, each the value of a certain rational function at a root of unity. Executing this strategy would (probably) require a great deal of algebraic manipulation; preferring analysis over algebra, we are forced to abandon it. But not all hope is lost. This failed plan has taught us that the behavior of F (x) near the roots of unity carries information about p(n). Around 1918, these considerations led Hardy and Ramanujan to analyze the integral (1) using a contour of integration formed by arcs of circles approaching roots of unity. Their method produced the following asymptotic formula. exp πp2n/3 Theorem 1 ([5, p. 79]). p(n) ∼ √ . 4n 3 In accordance with Stigler’s law of eponymy, Hardy and Ramanujan’s method is known today as the Hardy- Littlewood circle method. (The name likely derives from a paper of Hardy and Littlewood published soon thereafter, in 1919, on Waring’s problem of representing integers as sums of powers [4].) The circle method has become a standard tool in the repertoire of analytic number theorists, and in Section 4 we will give an overview of its applications. In the 1930s, Rademacher modified Hardy and Ramanujan’s argument to give a convergent infinite series expansion for p(n); earlier, Hardy and Ramanujan had only given a divergent asymptotic expansion (different from Theorem 1). Theorem 2 ([6, p. 422]). For n ≥ 1, the partition function p(n) is represented by the convergent series ∞ 1 X √ d −1/2 2π 1/2 p(n) = √ A (n) k n − 1 sinh √ n − 1 , k dn 24 24 π 2 k=1 k 6 where Ak(n) is a certain sum of roots of unity. Our goal in this paper is to derive Rademacher’s asymptotic expansion of the partition function and (in a future version of this paper) to use it to prove Hardy and Ramanujan’s simpler asymptotic formula. To analyze the partition function using the circle method, instead of replacing F by its partial product FN and deforming the contour of integration across the roots of unity, as in our first attempt at the problem, we will retain F and deform the contour of integration to approach the roots of unity of order at most N. The deformed contour consists of arcs of circles, one arc for every such root of unity. Adding up the contributions from each arc and letting N tend to infinity gives Rademacher’s infinite sum decomposition of the partition function. The arcs of Rademacher’s contour are arcs of Ford circles, a family of circles first studied by L. R. Ford in 1938 as a “visual representation of arithmetical results of diverse kinds” [3] (though not including the asymptotics of the partition function; it was Rademacher who synthesized these ideas). The Ford circles of order N are indexed by the roots of unity of order at most N, in other words, by the fractions of denominator at most N in the interval [0, 1]. The fraction h/k corresponds to the circle of center h/k + i/(2k2) and radius 1/(2k2). Figure 1 depicts the Ford circles of order 4. The set indexing the Ford circles of order N, that is, the rational numbers in [0, 1] with denominator at most N, is known as the Farey sequence of order N after the British geologist John Farey, who reported them in an 1816 letter to the Philosophical Magazine [2]. Understanding Ford circles requires understanding basic properties of the Farey sequence, most notably that for any three consecutive fractions in the sequence, the middle term is the mediant of the outer terms (this observation was the content of Farey’s letter, although he could not prove it). Constructing Rademacher’s contour is only part of the argument. Understanding the behavior of F (x) on each individual arc of the contour relies on a remarkable connection between F (x) and a certain modular ASYMPTOTICS FOR THE PARTITION FUNCTION VIA THE CIRCLE METHOD 3
1 1 1 2 3 0 4 3 2 3 4 1
Figure 1. The Ford Circles of Order 4 form η(τ), the Dedekind eta function. It is defined for τ in the upper half plane by the infinite product Y −1 η(τ) = e(τ/24) 1 − e(k) , Re(τ) > 0, e(τ) := exp(2πiτ). k≥1 Much is known about the eta function, but for us, the key property is its functional equation η(γτ) = ε(γ)(cτ + d)1/2η(τ), a b where γ = c d ∈ SL2(Z) and where ε : SL2(Z) → µ24 is a certain function whose precise definition is not (yet) important. (Here SL2(Z) acts on the upper half plane in the usual way, by fractional linear aτ+b transformations: γτ = cτ+d .) In other words, η is a twisted modular form of weight one half. Comparing the definition of η with the infinite product for F , we find that e(τ/24) η(τ) = . F (e(τ)) Substituting this equation into the functional equation for η gives a transformation formula for the function F (e(τ)), which we write in the form F (e(τ)) = ε(γ)(cτ + d)1/2e (τ − γτ)/24F (e(γτ)). When x is close to zero, F (x) is close to one; so when τ is close to i∞, that is, when τ is large, F (e(τ)) is close to one. Under this heuristic, the transformation formula for F (e(τ)) implies that for τ close to the pole of γτ, (2) F (e(τ)) ≈ ε(γ)(cτ + d)1/2e (τ − γτ)/24. This approximation replaces the function F (e(τ)) by elementary functions, which are much more amenable to analysis. When x = e(τ) is a root of unity, τ = h/k is a rational number. Therefore, to focus our attention on the root of unity e(h/k) we ought to choose a transformation γ that sends h/k to ∞. This forces the denominator of γτ to be kτ − h, and the requirement that γ has determinant one forces the numerator of γτ to be Hτ − (hH + 1)/k, where H is any integer such that hH + 1 = 0 mod k. So in summary, we should take H −(hH + 1)/k γ = γ = h,k k −h in the functional equation for F . The remainder of the analysis consists of manipulating contour integrals. The variable transformation z = i(τ −h/k)/k2 normalizes the contour for h/k, mapping it to an arc of the circle of radius and center 1/2. The endpoints of this arc depend on h and k, and to estimate error terms we will need to bound the 4 DAVID SCHWEIN
magnitude of the endpoints; these bounds ultimately depend on properties of the Farey sequence. Finally, after exploiting a certain integral representation of a Bessel function built up from the elementary function (2), we will have Rademacher’s formula. None of the core mathematical ideas presented here are new. My exposition follows Chapter 5 of Apostol’s second elementary analytic number theory text [1], based in turn on Rademacher’s original paper [6].
2. Preliminaries 2.1. The Transformation Formula. Definition 3. The Dedekind eta function η : H → C is the holomorphic function defined by the infinite product Y η(τ) = e(τ/24) (1 − e(kτ)), e(τ) := exp(2πiτ). k≥1 Definition 4 (Dedekind Sum). Let h and k be integers with k ≥ 1. The Dedekind sum S(h, k) is k−1 ( X r hr x − bxc − 1 if x∈ / S(h, k) = , where ((x)) = 2 Z k k 0 if x ∈ r=0 Z a b Proposition 5 (Functional Equation for η). For every γ = c d ∈ SL2(Z) and every τ ∈ H, η(γτ) = ε(γ)(cτ + d)1/2η(γ),
where ε : SL2(Z) → µ24 is the function defined by a + d 1 1 ε(γ) = e + S(−d, c) − . 24c 2 8 Proof. See the supplement to Chapter 3 of Apostol’s book [1]. The strategy is to first prove the theorem in the special case where γτ = −1/τ, then deduce the general theorem by an inductive argument on the length of γ in terms of the standard generators for SL2(Z), using a certain reciprocity law of Dedekind sums. An inductive proof of this sort makes clear that the codomain of ε is the 24th roots of unity; this claim is not at all clear from the definition of ε. Proposition 6 (Functional Equation for F (x)). Let h and k be integers with (h, k) = 1 and k 6= 0, let H be any integer such that hH + 1 = 0 mod k, and let γh,k ∈ SL2(Z) be the element acting on H by τ 7→ (Hτ − (hH + 1)/k)/(kτ − h). For every x ∈ D, 1/2 1 F (e(τ)) = ε(γh,k)(kτ − h) e 24 (τ − γh,kτ) F (e(γh,k)). Proof. Substitute η(τ) = e(τ/24)/F (e(τ)) and γ = γh,k into the functional equation for η. 2.2. Farey Fractions. Definition 7. The Farey sequence of order N is the ordered set of rational numbers of denominator at most N in the interval [0, 1]. Proposition 8. Let a/b < h/k < c/d be rational numbers with bc − ad = 1. Then k ≥ b + d. Proof. Start with the equation c a c h h a − = − + − , d b d k k b which simplifies to 1 ck − dh bh − ak = + . ab dk bk Let m = bh − ak and let n = ck − dh. Multiplying the equation above by abk shows that k = bn + dm. Then k ≥ b + d because m and n are at least 1. Definition 9. Let q = a/b and r = c/d be rational numbers with (a, b) = (c, d) = 1 and b, d ≥ 0. The mediant of q and r is the rational number a + c . b + d Proposition 10. Let q < r be rational numbers and let s be their mediant. ASYMPTOTICS FOR THE PARTITION FUNCTION VIA THE CIRCLE METHOD 5
(a) The mediant satisfies the inequality q < s < r. Suppose in addition that q = a/b and r = c/d with bc − ad = 1, and that s = h/k with (h, k) = 1. (b) The mediant is the unique rational number in the interval (q, r) of denominator at most b + d. (c) ak − bh = 1 and ck − dh = 1. Proof. The first part amounts to proving that a a + c c < < . b b + d d The first inequality follows from the following computation: a + c a b(a + c) − a(b + d) bc − ad − = = ; b + d b b(b + d) b(b + d) this is positive because c/d > a/b. The second inequality is proved in the same way. For the second part, if q and r are both integers or half-integers then the claim is easy to check. Otherwise, suppose that s0 is a rational number of denominator at most b + d in the interval (q, r). By Proposition 8, the denominator of s0 is at least b + d, hence equal to it. Simplifying the inequality (b − 1)(d − 1) > 1, we conclude that bd > b + d, so that 1/(bd) < 1/(b + d). As the length of the interval [q, r] is 1/(bd), it is impossible for the interval [q, r] to contain two consecutive rational numbers of denominator b + d. Proposition 11. The Farey sequence of order N is obtained from the Farey sequence of order N − 1 by inserting all mediants s of consecutive terms q < r such that the denominator of s is N. In particular, if a/b and c/d with (a, b) = (c, d) = 1 are consecutive fractions in a Farey sequence then bc − ad = 1. Proof. The proof is by induction on N. We already know that the Farey sequence of order N is obtained from the Farey sequence of order N −1 by inserting the ϕ(N) fractions h/N with (h, N) = 1 and 0 < h < N. Suppose that the fraction s = h/N is inserted between the fractions q < r in the N − 1th Farey sequence. By induction, q = a/b and r = c/d with bc − ad = 1. Since q and r were adjacent in the N − 1th Farey sequence, s is the rational number of least denominator strictly contained between them. By Proposition 10, s must be the mediant of q and r. By the same proposition, if s = h/k with (h, k) = 1 then hb − ak = 1 and ck − dh = 1, finishing the proof. The following proposition assembles the results about Farey fractions that we will require in the following sections; the proof is clear from the previous propositions. Proposition 12. Let a/b < c/d with (a, b) = (c, d) = 1 be successive rational numbers in the Farey sequence of order N. Then bc − ad = 1 and b + d ≥ N + 1. 2.3. Ford Circles. Fix a natural number N. Definition 13. Let h and k be integers with (h, k) = 1 and k ≥ 1. The h/kth Ford circle C(h, k) is the circle in the upper half plane H of radius 1/(2k2) and center h/k + i/(2k2). Proposition 14. Let h/k < h0/k0 be successive fractions in the Farey sequence of order N, with (h, k) = (h0, k0) = 1. Then the Ford circles C(h, k) and C(h0, k0) intersect tangentially at the point h k0 i h0 k i + + = − + . k k(k2 + k02) k2 + k02 k0 k0(k2 + k02) k2 + k02 Proof. As a preliminary calculation, note that since h0k − hk0 = 1 by Proposition 12, h h0 1 − = . k k0 kk0 A circle of radius r and center z in the complex plane is tangent to a second circle of radius s and center w if and only if |z − w| = r + s. In this case, the point of tangency is a convex linear combination of z and w, specifically, the point sz + rw . r + s See Figure 2 for a depiction of two such circles. 6 DAVID SCHWEIN
r s z w
Figure 2. The intersection point of two tangent circles
Applying these facts to our particular choice of circles, we see that the two Ford circles are tangent because 0 2 0 h i h i h h 2 1 1 2 + − + = − + − k 2k2 k0 2k02 k k0 2k2 2k02 1 1 1 2 = + − (kk0)2 2k2 2k02 1 1 2 = + 2k2 2k02 by the algebraic identity 4xy + (x − y)2 = (x + y)2. The point of intersection is 1 1 −1 1 h i 1 h0 i + + + + 2k2 2k02 2k02 k 2k2 2k2 k0 2k02 h 1 1 −1 1 i 1 h0 h i = + + + − + k k2 k02 k02 2k2 k2 k0 k 2k02 h k2k02 1 i h k0 i = + + = + + . k k2 + k02 k3k0 k2k02 k k(k2 + k02) k2 + k02 A similar calculation gives the other formula.
Proposition 15. Let h0/k0 < h/k < h1/k1 be successive fractions in the Farey sequence FN , with (h0, k0) = (h, k) = (h1, k1) = 1, and let zj(h, k) denote the image of the intersection point of C(h, k) and C(hj, kj) under the linear transformation z = −ik2(τ − h/k). Then √ k |z (h, k)| ≤ 2 . j N 2 2 1 2 Proof. Proposition 12 and the easily-verified inequality x + y ≥ 2 (x + y) give the lower bound 2 2 1 2 1 2 k + k0 ≥ 2 (k + k0) ≥ 2 N . By Proposition 14, 2 kk0 ik z0(h, k) = 2 2 − 2 2 . k + k0 k + k0 Hence k √ k |z0(h, k)| = ≤ 2 . p 2 2 N k + k0 A similar (but not completely symmetric) argument using the other part of Proposition 14 gives the inequality for |z1(h, k)|. 2.4. Rademacher Contour. Fix a natural number N. The h/kth Ford circle tangentially intersects the Ford circles for the fractions adjacent to h/k in the Nth Farey sequence; let α(h, k) be the clockwise- oriented arc along C(h, k) that connects these points and does not intersect the real axis. Figure 3 depicts the Rademacher contour for N = 4. The careful reader will have noticed that this picture contains a slight lie: in the edge case (h, k) = (0, 1), the Ford circle C(0, 1) is divided into halves and one half of it is translated to the other side of the contour. This modification does change the contour, but it won’t change the integral in which the contour eventually appears, since the integrand will be periodic of period 1. (It might be even better to view the Ford circles ASYMPTOTICS FOR THE PARTITION FUNCTION VIA THE CIRCLE METHOD 7
i i + 1
1 1 1 2 3 0 4 3 2 3 4 1
Figure 3. The Rademacher Contour for N = 4
of order N as parameterized by all rational numbers of denominator at most N, so that Figure 3 is a slice of a bi-infinite periodic sequence of circles, but we decided against this somewhat cumbersome approach.)
2.5. Bessel Functions. The final steps of our analysis will use a small piece of the theory of Bessel functions, a special family of functions used widely in applied mathematics. I will cite the two results we need from Watson’s treatise on Bessel functions [10]. It would be nice to have a more conceptual explanation for the appearance of these formulas.
Definition 16. Let ν be a complex number. The Bessel function of order ν, denoted by Jν (z), is the meromorphic function defined by the power series expansion ∞ X (−1)m z 2m+ν J (z) = . ν m! · Γ(m + ν + 1) 2 m=0
The modified Bessel function of order ν, denoted by Iν (z), is the meromorphic function ∞ X 1 z 2m+ν I (z) = i−ν J (iz) = . ν ν m! · Γ(m + ν + 1) 2 m=0 Proposition 17. Suppose c > 0 and Re(ν) > 0. Then 1 ν Z ( 2 z) −ν−1 2 Iν (z) = t exp(t + z /(4t)) dt, 2πi (c) R where (c) is the contour integral over the infinite vertical line Re t = c taken in the upward direction.
Proof. See [10, p. 181]. r2z d sinh z Proposition 18. I (z) = . 3/2 π dz z
Proof. The Bessel functions of half-integral order are elementary functions; see [10, Ch. III]. Putting these two propositions together gives a closed-form expression for the following integral. Proposition 19. Suppose c > 0 and Re(ν) > 0. Then Z √ d sinh z t−5/2 exp(t + z2/(4t)) dt = 8 πiz−1 . (c) dz z 8 DAVID SCHWEIN
3. Rademacher’s Asymptotic Expansion of the Partition Function In this section we will prove the following theorem. Theorem 20. The partition function p(n) has the infinite series expansion 1 X √ d −1/2 2π 1/2 p(n) = √ A (n) k n − 1 sinh √ n − 1 , k dn 24 24 π 2 k≥1 k 6 where X 1 Ak(n) = e 2 S(h, k) e(−nh/k) (h,k)=1 0≤h 1 Z F (x) p(n) = n+1 dx. 2πi |x|=e(i) x Change the domain to the upper half plane via the coordinate transformation x = e(τ), giving the integral Z i+1 F (e(τ))e(−nτ) dτ i taken along a straight-line contour. Fix a large integer N and let α(h, k) be the h/kth arc of the Rademacher contour, with (h, k) = 1. (Note: α depends on N.) Decompose the integral operator above as N Z i+1 X X Z = , i k=1 (h,k)=1 α(h,k) 0≤h N X X Z X X p(n) = F (e(τ))e(−nτ) dτ, = . k=1 h α(h,k) h (h,k)=1 0≤h N Z X X 1/2 1 p(n) = ε(γh,k)(kτ − h) e 24 (τ − γh,kτ) F (e(γh,kτ))e(−nτ) dτ k=1 h α(h,k) Next, consider the (h, k)th integral above. Change variables to move the Ford circle C(h, k) to the 1 2 circle of radius and center 2 , using the linear transformation τ = h/k + iz/k . A calculation shows that γh,kτ = H/k + i/z. Then