Computation and theory of Mordell-Tornheim-Witten sums II
(Dedicated to Richard Askey on the occasion of his eightieth birthday)
D. H. Bailey∗ J. M. Borwein.† July 29, 2014
Abstract In [8] the current authors, along with the late and much-missed Richard Crandall (1947– 2012), considered generalized Mordell–Tornheim–Witten (MTW) zeta-function values along with their derivatives, and explored connections with multiple-zeta values (MZVs). This en- tailed use of symbolic integration, high precision numerical integration, and some interesting combinatorics and special-function theory. The original motivation was to represent objects such as Eulerian log-gamma integrals; and all such integrals were expressed in terms of a MTW basis. Herein, we extend the research envisaged in [8] by analyzing the relations be- tween a significantly more general class of MTW sums. This has required significantly more subtle scientific computation and concomitant special function theory.
1 Introduction
In [8] we defined an ensemble of extended Mordell–Tornheim–Witten (MTW) zeta function values [23, 36, 29, 30,5, 15, 37, 38]. There is by now a huge literature on these sums; in part because of the many connections with fields such as combinatorics, number theory, and mathematical physics. Unlike previous authors we included derivatives with respect to the order of the terms. We in- vestigated interrelations between MTW evaluations, and explored some deeper connections with multiple-zeta values (MZVs). To achieve these results, we used symbolic and numerical integration, special function theory and some less-than-obvious combinatorics and generating function analysis. Our original motivation was that of representing previously unresolved constructs such as Eu- lerian log-gamma integrals Z 1 n LGn := log Γ(x) dx. 0
For that purpose it was necessary to consider only one differentiation of Lis(z) with respect to s—though the requisite tools for higher order derivatives were laid out in [8]. For additional introductory material we refer to [8]. These approaches are extended to character polylogarithms in [6]. Some related results and computational algorithms for the incomplete gamma function, the Hurwitz zeta function, and the Dirichlet L-series are presented in [7].
∗Lawrence Berkeley National Lab (retired), Berkeley, CA 94720, and University of California, Davis, Dept. of Computer Science, Davis, CA 95616, USA. E-mail: [email protected]. †CARMA, University of Newcastle NSW 2303, Australia, E-mail: [email protected]
1 In this article we continue the research envisaged in [8] by analyzing the relations between higher-order MTW sums. This paper is, in consequence, in part in homage to our friend and collaborator Richard Crandall who died after a short illness on December 20, 2012. It is also a fitting tribute to Dick Askey, whose role in the development of modern special function theory has been prodigious (see [1] and the discussion in [19]). In Indiscrete Thoughts [34, p. 216] Gian-Carlo Rota writes “special functions [is] a Wisconsin subject,” but in this setting ‘Askey’ is a synonym for ‘Wisconsin’.
1.1 Organization The organization of the paper is as follows. In Section2 we recall an ensemble D capturing the values we wish to study and provide effective integral representations in terms of polylogarithms on the unit circle. (In Section 4.5 we reprise a subensemble D1 sufficient for the study log gamma integrals.) In Section3 we provide the necessary polylogarithmic algorithms for computation of our sums/integrals to high precision (400 digits up to 3100 digits). To do so we first have to provide similar tools for the zeta function and its derivatives at integer points. These methods, and some extensions, are of independent value and are further pursued in this paper. In Section4 we record various reductions and interrelations of our MTW values (see Theorems 5,6 and7). In Section5 we reprise two rigorous experiments [8] designed to use integer relation methods [16] to first explore the structure of the ensemble D1 and then to begin to study D. Section 6 we present our new experimental work on the structure of D. It also provides proofs of some experimentally discovered results. Finally, in Section7 we make some summatory remarks.
2 Mordell–Tornheim–Witten ensembles
The multidimensional Mordell–Tornheim–Witten (MTW) zeta function
X 1 ω(s1, . . . , sK+1) := (1) s1 sK sK+1 m1 ··· m (m1 + ··· + mK ) m1,...,mK > 0 K enjoys known relations [32], but remains mysterious with respect to many combinatorial phenomena, especially when we contemplate derivatives with respect to the si parameters. We shall refer to Pk+1 K + 1 as the depth and j=1 sj as the weight of ω. A previous work [5] introduced and discussed a novel generalized MTW zeta function for positive integers M,N and nonnegative integers si, tj—with constraints M ≥ N ≥ 1—together with a polylogarithm-integral representation:
M N X Y 1 Y 1 ω(s1, . . . , sM | t1, . . . , tN ) := s t (2) mj j nk k m1,...,mM ,n1,...,nN > 0 j=1 k=1 PM PN j=1 mj = k=1 nk M N 1 Z 2π Y Y = Li eiθ Li e−iθ dθ. (3) 2π sj tk 0 j=1 k=1
P n s Here the polylogarithm of order s denotes Lis(z) := n≥1 z /n and its analytic extensions [31] and the (complex) number s is its order.
2 When some s-parameters are zero, there are convergence issues with this integral representation. One may, however, use principal-value calculus, or alternative representations given in [8] and expanded upon in Section 4.4. When N = 1 the representation (3) devolves to the classic MTW form, in that
ω(s1, . . . , sM+1) = ω(s1, . . . , sM | sM+1). (4)
2.1 Generalized MTW sums We also explore a wider MTW ensemble involving outer derivatives, introduced in [5], according to
M N dj ek s1, . . . , sM | t1, . . . , tN X Y (− log mj) Y (− log nk) ω := s t (5) d1, . . . , dM | e1, . . . eN mj j nk k m1,...,mM ,n1,...,nN > 0 j=1 k=1 PM PN j=1 mj = k=1 nk Z 2π M N 1 Y Y (e ) = Li(dj ) eiθ Li k e−iθ dθ, (6) 2π sj tk 0 j=1 k=1
(d) ∂ d where the s-th outer derivative of a polylogarithm is denoted Lis (z) := ∂s Lis(z). Thus, the (d) effective computation of (6) requires really robust and efficient methods for computing Lis (Section 3.1.3 and sequela) and for high precision quadrature (Section 3.6). We emphasize that all ω are real since we integrate over a full period; or more directly since the summand is real. Consistent with earlier usage, we now refer to M + N as the depth and PM PN j=1(sj + dj) + k=1(tk + ek) as the weight of ω. To summarize, we study the MTW ensemble comprising the set s1, . . . , sM | t1, . . . , tN + D := ω : si, di, tj, ej ≥ 0; M ≥ N ≥ 1,M,N ∈ Z . (7) d1, . . . , dM | e1, . . . eN
3 Underlying special function and computational tools
To study ensemble D intensively, we must repeatedly differentiate polylogarithms with respect to their order, and be able to compute these and related functions, and the integrals (2) and (6) to very high precision (typically hundreds of digits or more). The reason that such high precision is required stems from our applications of the PSLQ in- teger relation algorithm [11] in this research. Given an input vector of computed real numbers (x1, x2, ··· , xn), PSLQ finds a nontrivial integer vector (a1, a2, ··· , an), if it exists, such that a1x1 + a2x2 + ··· anxn = 0. Such an algorithm is useful in this context because it often per- mits one to identify a high-precision computed numerical value in terms of an analytic formula involving certain well-known mathematical constants. Suppose, for example, one conjectures an integral I is given as a sum of terms (with unknown rational coefficients), each of which is some known constant. Then by computing x1 = I and the terms (xi, 2 ≤ i ≤ n) to sufficiently high precision, and applying PSLQ to (x1, x2, ··· , xn), the integer coefficients of such a relation (if it exists) can be found. Solving this relation for x1 = I produces a rational linear for I involving the conjectured constants. Even if PSLQ fails to find
3 a numerically significant integer relation, it produces exclusion bounds within which no integer relation exists; often valuable information in this type of exploration. For PSLQ (or any other relation-finding algorithm) to reliably recover a relation (a1, a2, ··· , an) d of length n, where the coefficients (ai) have maximum absolute value 10 , requires at least dn-digit precision in the input vector, and at least dn-digit arithmetic in the operation of the algorithm.
3.1 Polylogarithms and their derivatives with respect to order In regard to the needed polylogarithm values, [5] gives formulas such as below. Proposition 1. When s = n is a positive integer,
∞ 0 X logm z logn−1 z Li (z) = ζ(n − m) + (H − log(− log z)) , (8) n m! (n − 1)! n−1 m=0
1 1 1 P0 valid for | log z| < 2π. Here Hn := 1 + 2 + 3 + ··· + n , and the primed sum means to avoid the singularity at ζ(1). For any complex order s not a positive integer,
X logm z Li (z) = ζ(s − m) + Γ(1 − s)(− log z)s−1. (9) s m! m≥0
(This formula is valid for s = 0.) In formula (8), the condition | log z| < 2π precludes its use when |z| < e−2π ≈ 0.0018674. For such small |z|, however, it typically suffices to use the definition
∞ X zk Li (z) = . (10) s ks k=1
Note that Li0(z) = z/(1 − z) and Li1(z) = − log(1 − z). In fact, we found that formula (10) is generally faster than (8) whenever |z| < 1/4, at least for precision levels in the range of 100 to 4000 digits.
3.1.1 Outer derivatives of general order polylogarithms On carefully manipulating (9) for integer k ≥ 0 we have for | log z| < 2π and τ ∈ [0, 1):
∞ X logn z logk z X Li (z) = ζ(k + 1 + τ − n) + c (L) τ j, (11) k+1+τ n! k! k,j 0≤n6=k j=0 see [27, §9, eqn. (51)]. Here L := log(− log z) and the c coefficients engage the Stieltjes constants γn, where γ0 = γ [27, §7.1], which occur in the asymptotic expansion
∞ 1 X (−1)n ζ(z) = + γ (z − 1)n. z − 1 n n n=0
4 Precisely
(−1)j c (L) = γ − b (L), (12) k,j j! j k,j+1 where the bk,j terms—corrected from [27, §7.1]—are given by
X L p Γ(t)(1) b (L) := (−1)tf , (13) k,j p! t! k,q p+t+q=j p,t,q≥0
q Qk 1 and fk,q is the coefficient of x in m=1 1+x/m . This is calculable recursively via f0,0 = 1, f0,q = 0 (q > 0), fk,0 = 1 (k > 0) and
q X (−1)h f = f . (14) k,q kh k−1,q−h h=0 Above we used the functional equation for the Γ function to remove singularities at negative integers. While (11) has little directly to recommend it computationally, it is highly effective in deter- mining derivative values with respect or order, as we shall see in (17). Remark 1 (Harmonic numbers). The next formula from [21] is helpful in organizing differentiation and highlights the relationship between the rising factorial or binomial coefficient and harmonic numbers: for n ≥ 1 α α n (−1) d λ (−1) [α−1] = Hn−1 . (15) α! dλ n λ=0 n λ Herein, n denotes the binomial function Γ(λ+1)/(Γ(n+1)Γ(λ−n+1)), and we denote a multiple harmonic number by [α] X 1 Hn−1 := . (16) i1i2 ··· iα n>i1>i2>...>iα
[0] (β) P β If α = 0 we set Hn−1 := 1. We also recall that Hn−1 := k [2] 1 2 1 (2) Then, fk,1 = −Hk and fk,2 = Hk = 2 Hk + 2 Hk , in terms of classical generalized harmonic numbers, while ck,0 = Hk − L. With k = τ = 0 this yields (8). (1) To obtain the first derivative Lik+1(z), we differentiate (11) at zero and so require the evaluation ck,1. With k = 0 and j = 1 this supplies (25) below. More generally: Theorem 1 (Derivatives for positive order). Fix k = 0, 1, 2 ... and m = 1, 2 .... For | log z| < 2π and L = log(− log z) one has n k (m) X log z log z Li (z) = ζ(m)(k + 1 − n) + m! c (L) . (17) k+1 n! k,m k! 0≤n6=k 5 (−1)j Here, for k ≥ 1, ck,j(L) = j! γj − bk,j+1(L). (as in (13), (19) valid for k = 0) above X L p Γ(t)(1) β(q)(k, 1) X L p Γ(t)(1) b (L) := k (−1)t = (−1)tf , (18) k,m p! t! q! p! t! k,q p+t+q=m p+t+q=j p,t,q≥0 p,t,q≥0 where β(q)(k, 1) is the q-th derivative of the beta function β(k, x) wrt x at 1, the f coefficients are given recursively by f0,0 = 1, f0,q = 0 (q > 0), fk,0 = 1 (k > 0) while q X (−1)h f = f . (19) k,q kh k−1,q−h h=0 Note that symmetric divided differences allow one to rapidly check (17) to moderate precision (m) (say 50 digits). For k = −1, or, in other words, for Li0 (z), things are simpler, as we may use (9): Theorem 2 (Derivatives for zero order). With Γ(t)(1) and L = log(− log z) as above for arbitrary z, we have for m any positive integer n m m−t (m) X log z X m L Li (z) = ζ(m)(−n) − (−1)t Γ(t)(1) . (20) 0 n! t log z n≥0 t=0 Below we give an effective algorithm for Γ(t)(1) in (39). We also provide the necessary tools for computation of ζ(m)(−n) as required in (17) and (20). 3.1.2 Another potential method An alternative way to calculate derivatives of polylogarithms avoids recourse to ζ or η derivatives, the tradeoff being that one needs a side-quadrature calculation (albeit one involving only an el- ementary integrand). P. Jodr´aobserves (http://rspa.royalsocietypublishing.org/content/ 464/2099/3081.full) that polylogarithms can be written z Z 1 (− log u)s Lis(z) = 2 du. Γ(s + 1) 0 (1 − zu) Multiplying through by Γ(s + 1) =: s!, one has convenient recursion for the d-th derivative: d−1 1 X d 1 Li(d)(z) = − Γ(d−j)(s + 1)Li(j)(z) + I (z), (21) s s! j s s! s,d j=0 where Z 1 s d (− log u) Is,d(z) := z log (− log u) 2 du. (22) 0 (1 − zu) (d) The Γ=derivatives here are again fundamental. The idea, then is to build a table of Lis using this recursion, with a log log-log integral required—but only once—for every new derivative. In this 6 way, for each abscissa z in an overall MTW quadrature, a chain of derivatives can be calculated and stored. Moreover, changing variables (x = − log u) and using the binomial theorem leads to ∞ d ! n X X d k z I (z) = z Γ(d)(s + 1) + (− log n) Γ(d−k)(s + 1) . (23) s,d k ns n=2 k=0 Hence, we opt to use the left side of (23), as the right includes calculating the series we wish to avoid summing. That said, (23) leads to a quick verification of (21) and (22). 3.1.3 The special case s = 1 and z = eiθ Most importantly, in light of integral (6) we may write, for 0 < θ < 2π, θ (π − θ) Li (eiθ) = − log 2 sin + i. (24) 1 2 2 0 As described , the order-derivatives Lis(z) = d(Lis(z))/ ds for integer s, can be computed via ∞ n X log z 1 1 2 L0 (z) = ζ0 (1 − n) − γ − π2 − (γ + log (− log z)) , (25) 1 n! 1 12 2 n=1 which, as before, is valid whenever | log z| < 2π. Here γ1 is the second Stieltjes constant [3, 27]. For small |z|, it again suffices to use the elementary form ∞ X zk log k Li0 (z) = − . (26) s ks n=1 Relation (25) can be applied to yield the formula ∞ n X (iθ) 1 1 2 Li0 (eiθ) = ζ0 (1 − n) − γ − π2 − (γ + log (−iθ)) , (27) 1 n! 1 12 2 n=1 valid and convergent for |θ| < 2π. Given such formulas, to evaluate MTW values one may use pure quadrature, a convergent series, or a combination of quadrature and series. All of these are exploited in the MTW examples of [27]. 3.2 Values of ζ at integer arguments Effective use of (8,9) requires precomputed values of the zeta function and its derivatives at integer arguments (see [3, 25]). 3.2.1 Values of ζ at positive even integer arguments As we shall require ζ(n) for many integers, the following approach, used in [9], is efficient. First, to compute ζ(2n), observe that ∞ −2 X coth(πx) = ζ(2k)(−1)kx2k = cosh(πx)/ sinh(πx) πx k=0 1 1 + (πx)2/2! + (πx)4/4! + (πx)6/6! + ··· = · . (28) πx 1 + (πx)2/3! + (πx)4/5! + (πx)6/7! + ··· 7 Let P (x) and Q(x) be the numerator and denominator polynomials obtained by truncating these series to n terms. The approximate reciprocal R(x) of Q(x) can be gotten from the Newton iteration Rk+1(x) := Rk(x) + [1 − Q(x) · Rk(x)] · Rk(x), (29) where the degree of the polynomial and numeric precision of the coefficients are dynamically increased, approximately doubling when convergence has been achieved—at a given degree and precision—until the final desired degree and precision are achieved. When complete, the quotient P/Q is the product P (x) · R(x). Desired ζ(2k) values can then be obtained from coefficients of this product polynomial as in [9]. Recall that ζ(0) = −1/2. The Bernoulli numbers B2k are also needed. They now can be obtained from [33, Eqn. (25.6.2)] 2(2k)!ζ(2k) B = (−1)k+1 . (30) 2k (2π)2k 3.2.2 Values of ζ at positive odd integer arguments Positive odd zeta values can be efficiently found via two Ramanujan-style formulas: [9, 18]: ∞ 2N+2 X 1 X B2kB4N+4−2k ζ(4N + 3) = −2 − π(2π)4N+2 (−1)k , (31) k4N+3(exp(2kπ) − 1) (2k)!(4N + 4 − 2k)! k=1 k=0 ∞ 4N 2N+1 1 X (2πk + 2N) exp(2πk) − 2N π(2π) X B2kB4N+2−2k ζ(4N + 1) = − − (−1)k . N k4N+1(exp(2kπ) − 1)2 2N (2k − 1)!(4N + 2 − 2k)! k=1 k=1 3.2.3 Values of ζ at negative integer arguments Finally, zeta can be evaluated at negative integers by the following formulas [33, (25.6.3), (25.6.4)]: B ζ(−2n + 1) = − 2n and ζ(−2n) = 0. (32) 2n 3.3 Derivatives of ζ at integer arguments Precomputed zeta-derivative values are prerequisite for the efficient use of formulas we have pre- sented so far, including the crucial formulas (17), (20), (25) and (27). 3.3.1 Derivatives of ζ at positive integer arguments Remark 2 (Using Computer Algebra Systems). For the simplest instances of the just-mentioned formulas, built-in zeta derivative facilities of Maple or Mathematica suffice; but from our experience this approach is not robust. For example, Maple 16 produces 2000 digits of quantities such as ζ(10(2) quite rapidly, and 5000-digit results in roughly 100 seconds. Yet while Mathematica 9 verifies the 2000-digit results in cases we tried (more slowly), it fails at 5000-digit precision. Also, Mathematica (4) 9 inexplicably refuses to compute ζ (0) at all; similarly for higher derivatives at zero. ♦ 8 For positive integers, derivatives of the zeta function can be computed via a series-accelerated algorithm for derivatives of the Dirichlet eta function (or alternating zeta function), given as ∞ X (−1)n−1 η(s) := = (1 − 21−s)ζ(s). (33) ns n=1 For practical computation of eta or its derivatives, any of several alternating series acceleration schemes can be used. The corresponding values of zeta derivatives can then be found by solving (33) for ζ(s) and then taking formal derivatives, for example η0(s) 21−sη(s) log 2 ζ0(s) = − . (34) (1 − 21−s) (1 − 21−s)2 Example 1 (Alternating series acceleration [27, 26]). This is illustrated in the following Mathe- matica code (for argument ss, and precision prec digits): zetaprime[ss_] := Module[{s, n, d, a, b, c}, n = Floor[1.5*prec]; d = (3 + Sqrt[8])^n; d = 1/2*(d + 1/d); {b, c, s} = {-1, -d, 0}; Do[c = b - c; a = 1/(k + 1)^ss *(-Log[k + 1]); s = s + c*a; b = (k + n)*(k - n)*b/((k + 1)*(k + 1/2)), {k, 0, n - 1}]; (s/d - 2^(1 - ss)*Log[2]*Zeta[ss])/(1 - 2^(1 - ss))] In this algorithm, that logarithm and zeta values can be precalculated, and so do not significantly add to run time. A similar approach works well for higher derivatives of zeta, although the resulting generalization of (34) becomes progressively more complicated. ♦ 3.3.2 First derivative of of ζ at zero and negative integer arguments s−1 πs 0 The functional equation ζ(s) = 2(2π) sin 2 Γ(1 − s) ζ(1 − s) lets one extract ζ (0) = −(log 2π)/2 and for even m = 2, 4, 6,... d (−1)m/2m! ζ0(−m) := ζ(s)| = ζ(m + 1) (35) ds s=−m 2m+1πm [27, p. 15], while for odd m = 1, 3, 5 ... on the other hand, ζ0(m + 1) ζ0(−m) = ζ(−m) γ + log 2π − H − . (36) m ζ(m + 1) We now delineate methods more suited to higher derivatives at negative integers. 3.4 Higher derivatives of ζ at negative integers To approach these values we again need recourse to the gamma function. 9 3.4.1 Derivatives of Γ at positive integers (n) (a) Let gn := Γ (1). Now it is well known [33, (5.7.1) and (5.7.2)] that Γ(z + 1) C(z) = z Γ(z) C(z) = z (37) P∞ k where C(z) := k=1 ckz with c0 = 0, c1 = 1, c2 = γ and k (k − 1)ck = γck−1 − ζ(2) ck−2 + ζ(3) ck−3 − · · · + (−1) ζ(k − 1) c1, (38) Thus, differentiating (37) by Leibnitz’ formula, for n ≥ 1 we have n−1 X n! g = − g c . (39) n k! k n+1−k k=0 (b) More generally, for positive integer m we have Γ(z + m) C(z) = (z)m (40) (n) where (z)m := z(z + 1) ··· (z + m − 1) is the rising factorial. Whence, letting gn(m) := Γ (m) so that gn(1) = gn, we may apply the product rule to (40) and obtain n−1 X n! Dn+1 g (m) = − g (m) c + m . (41) n k! k n+1−k n + 1 k=0 n Here Dm is the n-th derivative of (x)m evaluated at x = 0; zero for n > m. For n ≤ m values are easily obtained symbolically or in terms of Stirling numbers of the first kind: m−n n X k m+n+1 Dm = s (m, k + n)(k + 1)n (m − 1) = (n + 1)! (−1) s (m, 1 + n) . (42) k=0 n Dm Thus, (n+1) = n!|s(m, 1 + n)| and so for n, m > 1 we obtain the recursion n−1 gn(m) X gk(m) = − c + |s(m, 1 + n)|. (43) n! k! n+1−k k=0 where for integer n, k ≥ 0 s(n, k) = s(n − 1, k − 1) − (n − 1) s(n − 1, k) , (44) as in [33, Equation (26.8.18)]. 3.4.2 Apostol’s formulas for ζ(k)(m) at negative integers 1 For n = 1, 2,..., with κ := − log(2π)− 2 πi, we have Apostol’s formulas [33, (25.6.13) and (25.6.14)]: k m 2(−1)n X X k m (−1)kζ(k)(1 − 2n) = Re(κk−m)Γ(r)(2n) ζ(m−r)(2n) , (45) (2π)2n m r m=0 r=0 k m 2(−1)n X X k m (−1)kζ(k)(−2n) = Im(κk−m)Γ(r)(2n + 1) ζ(m−r)(2n + 1) . (2π)2n+1 m r m=0 r=0 (46) 10 Since in (41) only initial conditions rely on m, equations (45) and (46) are well fitted to work with (41) (along with (44), and (38)). The derivatives ζ(m)(0) needed in (45) can be computed by either the methods of [3, Thm. 3] or [25, §5(c)]. Indeed Theorem 3 (Apostol). If z = − log(2π) − iπ/2 and n ≥ 0, we have n−1 ζ(n)(0) 1 Im(zn+1) 1 X Im(zn−k) (−1)n = + a (47) n! π (n + 1)! π k (n − k)! k=1 where the coefficients ak are determine by the Laurent expansion ∞ 1 X Γ(s)ζ(s) = + a (s − 1)n, (48) s − 1 n n=0 Pn n so that an = cn+1 + k=0 cn−kγk where cn = Γ (1)/n! = gn/n! is computable as in (39). 3.5 More general character L-series We make a brief detour to general real L-series, see [20], given by d−1 X χ±d(n) 1 X ±d k L (s) := = ζ s, . (49) ±d ns ds k d n>0 k=1