Arxiv:1512.00906V3 [Math.NA] 27 Feb 2017
Total Page:16
File Type:pdf, Size:1020Kb
Butcher series A story of rooted trees and numerical methods for evolution equations Robert I. McLachlan · Klas Modin · Hans Munthe-Kaas · Olivier Verdier February 28, 2017 To appear in Asia Pacific Mathematics Newsletter Abstract Butcher series appear when Runge–Kutta methods for ordinary differential equations are expanded in power series of the step size parameter. Each term in a Butcher series consists of a weighted elementary differential, and the set of all such differentials is isomorphic to the set of rooted trees, as noted by Cayley in the mid 19th century. A century later Butcher discovered that rooted trees can also be used to obtain the order conditions of Runge–Kutta methods, and he found a natural group structure, today known as the Butcher group. It is now known that many numerical methods also can be expanded in Butcher series; these are called B-series methods. A long-standing problem has been to characterize, in terms of qualitative features, all B-series methods. Here we tell the story of Butcher series, stretching from the early work of Cayley, to modern developments and connections to abstract algebra, and finally to the resolution of the characterization problem. This resolution introduces geometric tools and perspectives to an area traditionally explored using analysis and combinatorics. Keywords Butcher series · order conditions · numerical integrators · ordinary differential equations · rooted trees · elementary differentials · affine equivariance Mathematics Subject Classification (2000) 65-03 · 01-08 · 65L06 R.I. McLachlan Institute of Fundamental Sciences, Massey University, New Zealand E-mail: [email protected] K. Modin Mathematical Sciences, Chalmers University of Technology and University of Gothenburg, Sweden E-mail: [email protected] H. Munthe-Kaas Department of Mathematics, University of Bergen, Norway E-mail: [email protected] O. Verdier arXiv:1512.00906v3 [math.NA] 27 Feb 2017 Department of Computing, Mathematics and Physics, Western Norway University of Applied Sciences E-mail: [email protected] 2 McLachlan, Modin, Munthe-Kaas, and Verdier 1 From Cayley to Butcher Butcher series are mathematical objects that were introduced by the New Zealand mathematician John Butcher in the 1960s. He introduced them as part of his study of Runge–Kutta methods, a popular class of numerical methods for evolution equations such as initial-value problems for ordinary differential equations, and they remain indispensable in the numerical analysis of differential equations. In this article we provide a brief introduction to Butcher series, survey their early history up to their introduction by John Butcher, and relate the story of the many connections that have recently been discovered between Butcher series and other parts of mathematics, no- tably algebra and geometry.1 We begin, however, with the traditional definition. Butcher series are intimately associated with the set of smooth (infinitely differ- entiable) vector fields on vector spaces. Indeed, let f be a smooth vector field on a vector space V, defining the ordinary differential equation (ODE) x˙ = f (x); (1.1) dx wherex ˙ = dt denotes the derivative with respect to time t. One way to study (1.1) is to develop the Taylor series of its solutions. Let x(h) be the solution to (1.1) at time t = h subject to the initial condition x(0) = x0. The Taylor series of x(h) in h is 1 x(h) = x(0) + hx˙(0) + h2x¨(0) + :::: (1.2) 2 We already know that x(0) = x0 andx ˙(0) = f (x0). The additional terms can be found by repeatedly applying the chain and product rules. For example, d d x¨ = x˙ = f (x) = f 0(x)x˙ = f 0(x) f (x); dt dt 1 n or, relative to a basis in which x = x e1 + ::: + x en, n i i ¶ f j x¨ = ∑ j (x) f (x); j=1 ¶x 1 n where f (x) = f (x)e1 + ::: + f (x)en. Continuing in this way gives x˙ = f (x); x¨ = f 0(x) f (x); ... x = f 0(x) f 0(x) f (x) + f 00(x)( f (x); f (x)); .... x = f 0(x) f 0(x) f 0(x) f (x) + f 0(x) f 00(x)( f (x); f (x))+ (1.3) 3 f 00(x)( f 0(x) f (x); f (x)) + f 000(x)( f (x); f (x); f (x)); . 1 This article is not a comprehensive review and is focussed on our own interests. Useful companions to this article are the detailed mathematical review of Butcher series by Sanz-Serna and Murua [35] and the textbook treatments of Hairer et al. [21,23]. Butcher series 3 Here the kth derivative f (k)(x) of the vector field f is regarded as a multilinear map V k ! V. For example, f 00( f ; f ) is the vector field on V whose ith coordinate is n 2 i ¶ f j k ∑ j k (x) f (x) f (x): j;k=1 ¶x ¶x A vector field of the form appearing in (1.3), combining f and its derivatives, is called an elementary differential. Using (1.3), the Taylor series (1.2) for the solution of (1.1) can be written as 1 1 1 x(h) = x + h f + h2 f 0 f + h3 f 0 f 0 f + h3 f 00( f ; f ) + ::: (1.4) 0 2 6 6 where each elementary differential is evaluated at x0. Notice that the power of h in each term is determined by the multiplicity of f in the elementary differential. However, the coefficients 1, 1, 1=2, 1=6, 1=6, and so on are not determined by their corresponding elementary differentials. A Butcher series, shortly denoted B-series, is a generalization of (1.4) allowing arbitrary coefficients, i.e., a formal series of the form 2 0 3 0 0 3 00 B(c; f ) := c0x0 + c1h f + c2h f ( f ) + c3h f ( f ( f )) + c4h f ( f ; f ) + ::: (1.5) where ci 2 R. Although presented here in coordinates, we shall see that Butcher series do not depend on the choice of basis. 2 Early history Butcher series are named in honour of the New Zealand mathematician John Butcher. In a publication career spanning (so far) 60 years he has written 167 papers and books, all but 18 of them concerned with Runge–Kutta methods and their generalisations. Most of them involve in some way the fundamental structure that bears his name. Butcher series were introduced in a remarkable series of ten sole-authored papers in the years 1963–1972. A Runge–Kutta method is a numerical approximation xn 7! xn+1 of the exact flow of (1.1) defined by the following equations in xn, xn+1, X1;:::;Xn 2 V: n Xi = xn + h ∑ ai j f (Xj); j=1 (2.1) n xn+1 = xn + h ∑ b j f (Xj): j=1 Here n is the number of stages of the method and ai j, b j are real numbers parame- terising the Runge–Kutta method. Associated with the abstract Runge–Kutta method (2.1) are its order conditions, polynomials equations in ai j and b j—one equation per elementary differential—that determine the order of convergence of the method and its local error. Their derivation has been simplified over the years; a modern exposi- tion can be found in Hairer, Lubich and Wanner [21], and a detailed history in Butcher and Wanner [9]. 4 McLachlan, Modin, Munthe-Kaas, and Verdier The first breakthrough paper dates from 1963 [5]. Here Butcher found for the first time the coefficients ci of the B-series (1.5) of xn+1 of the Taylor expansion in h of an arbitrary Runge–Kutta method. This gave the order conditions for Runge– Kutta methods in complete generality. As previous studies had laboriously expanded the solutions of particular (e.g. explicit) methods by hand, this was an enormously important development. Butcher did have, however, some precursors. The most notable example is the paper of Merson [32] from 1957. Robert Henry ‘Robin’ Merson (1921–1992) was a scientist at the Royal Aircraft Establishment, Farnborough, UK, who was invited along with more senior numerical analysts to a conference on Data Processing and Automatic Computing Machines at Australia’s Weapons Research Establishment in Salisbury, South Australia.2 It seems like a long way to go for a conference in 1957. However, the UK was still performing above-ground atomic bomb tests in South Australia at that time and the Australian government was very keen to be a part of the emerging era. Merson’s work is bound up with one of the most significant events of 1957, the launch of Sputnik 1 on 4 October 1957, and the tale of Farnborough’s involvement is told in detail by one of the key participants, Desmond King-Hele, in his book A Tapestry of Orbits [28]. The short version is that with the aid of a large radio antenna hastily erected in a nearby field, and some calculations of Robin Merson, within two weeks they had an accurate orbit for Sputnik 1. This allowed them to estimate the density of the upper atmosphere and (after Sputnik 2) the shape of the earth. Robin Merson became an expert in practical numerical analysis and orbit determination. Merson’s paper explains clearly the structure of the elementary differentials f 0( f ), f 00( f ; f ), etcetera, and, crucially, shows how they are in one-to-one correspondence with rooted trees. He also introduces various basic operations on rooted trees. This de- velopment, perhaps regarded initially as a bookkeeping device for finding and keep- ing track of the different terms, has over time become central to the combinatorial and algebraic study of B-series.