COMPUTATION OF SCALING INVARIANT LAX PAIRS WITH APPLICATIONS TO CONSERVATION LAWS
by Jacob Rezac
A thesis submitted to the Faculty and the Board of Trustees of the Colorado School of Mines in partial fulfillment of the requirements for the degree of Master of Science (Mathematical and Computer Sciences).
Golden, Colorado Date
Signed: Jacob Rezac
Signed: Dr. Willy Hereman Thesis Advisor
Golden, Colorado Date
Signed: Dr. Willy Hereman Professor and Interim Department Head Department of Applied Mathematics and Statistics
ii
ABSTRACT
There has been a large amount of research on methods for solving nonlinear par- tial differential equations (PDEs) since the 1950s. A completely integrable nonlinear PDE is one which admits solutions, given specific constraints. These integrable non- linear PDEs can be associated with a system of linear PDEs through a compatibility condition. Such a system is called a Lax pair. While Lax pairs are very important in the theory of solving nonlinear PDEs, few methods exist to compute them. This The- sis presents a method for generating Lax pairs for a specific class of nonlinear PDEs. Conservation laws, well-known from physics, are a related concept. Indeed, the ex- istence of an infinite number of conservation laws for a nonlinear PDE also predicts it complete integrability. We discuss a method for the computation of conservation laws, given a PDE’s Lax pair. This method is based on work done by Drinfel’d and Sokolov in 1985, which does not seem to have been fully explored in literature. The goal of this Thesis is to demonstrate the efficacy of these two construction methods. As such, we also list a number of Lax pairs and conservation laws computed by the methods presented in the Thesis.
iii
TABLE OF CONTENTS
ABSTRACT ...... iii
LIST OF TABLES ...... vi
ACKNOWLEDGMENTS ...... vii
CHAPTER 1 INTRODUCTION ...... 1
CHAPTER 2 PRELIMINARIES ...... 5
2.1 Lax Pairs ...... 5
2.2 AKNS Scheme ...... 11
2.3 Gauge Transformations ...... 16
2.4 Conservation Laws ...... 21
CHAPTER 3 CONSTRUCTION OF LAX PAIRS ...... 25
3.1 Scaling Invariance ...... 25
3.2 Exhaustive Lax Pair Computation Methods ...... 28
3.3 Uniform Weight Construction ...... 34
3.4 Weak Lax Pairs and Triviality Concerns ...... 39
CHAPTER 4 THE CONSTRUCTION OF CONSERVATION LAWS FROM LAX PAIRS ...... 45
4.1 The Drinfel’d Sokolov Method for Computing Conservation Laws . . . 45
4.2 Triviality and Simplification ...... 51
CHAPTER 5 FURTHER EXAMPLES ...... 55
5.1 Lax Pairs Computed with Exhaustive Methods ...... 55
iv
5.1.1 Modified Korteweg-de Vries Equation ...... 55
5.1.2 Kaup-Kuperschmidt Equation ...... 56
5.1.3 Lax 5th-Order Equation ...... 57
5.1.4 Sawada-Kotera Equation ...... 57
5.1.5 Harry-Dym Equation ...... 58
5.1.6 Drinfel’d-Sokolov System ...... 59
5.2 Lax Pairs Computed by Uniform Weight Method ...... 59
5.2.1 Nonlinear Schr¨odingerEquation ...... 59
5.2.2 Gardner Equation ...... 60
5.3 Conservation Laws Produced by the Drinfel’d-Sokolov Method . . . . . 61
5.3.1 Korteweg-de Vries Equation ...... 61
5.3.2 Modified Korteweg-de Vries Equation ...... 61
5.3.3 Sine-Gordon Equation ...... 62
5.3.4 Sinh-Gordon Equation ...... 63
5.3.5 Nonlinear Schr¨odingerEquation ...... 63
CHAPTER 6 CONCLUSIONS AND FURTHER RESEARCH ...... 67
REFERENCES CITED ...... 69
APPENDIX - OUTLINE OF DRINFEL’D-SOKOLOV ALGORITHM AND USE OF MATHEMATICA ...... 73
v
LIST OF TABLES
Table 5.1 The first four conservation laws for the KdV equation (2.4). . . . . 61
Table 5.2 The first four conservation laws for the mKdV equation (2.31). . . 62
Table 5.3 The first four conservation laws for the sine-Gordon equation (5.4). 63
Table 5.4 The first four conservation laws for the sinh-Gordon equation (5.5). 64
Table 5.5 The first four conservation laws for the NLS equation (5.2). . . . . 65
vi
ACKNOWLEDGMENTS
Thanks to my Thesis advisor, Willy Hereman, for his countless hours of advice (mathematical and otherwise), for his friendship, and for constantly pushing me to be a better mathematician. Thanks to the Department of Applied Math and Statistics at the Colorado School of Mines, including all the faculty and staff who have helped me during my time here. In particular, thanks to my Thesis committee Drs. Paul Martin and Luis Tenorio, both of whom have taught me more math than I care to admit. Also thanks to Dr. Mark Hickman, whose brief sabbatical in Golden put the following work on a surer mathematical footing than could have occurred without him. Further thanks to the many students in my research group the last three years, Terry Bridgman, Jennifer Larue, Tony McCollom, Janeen Neri, Sara Clifton, Oscar Aguilar, Jon Tran, and Allen Voltz. Their work ethic, insightful ideas, and pointed questions all helped push this Thesis to completion. This research was partially funded by NSF research award no. CCF-0830783. Finally, thanks to my family and friends, who both distracted and encouraged me enough to complete this document, whether they realize it or not. Also to Golden’s numerous fine coffee shops (in particular Higher Grounds), which kept me caffeinated and gave me a place to work whenever I needed one.
vii
CHAPTER 1 INTRODUCTION
In the early 1950s, physicists Fermi, Pasta, and Ulam (FPU) investigated [1] an apparent paradox related to the field of statistical mechanics1. The research group was attempting to model wave motion in a nonlinear crystal lattice. Scientific under- standing at the time predicted that energy put into the system would become equally partitioned across the crystal lattice. In an unexpected turn of events, however, nu- merical experiments showed that nearly all of the energy put into the crystal would remain in its original position; e.g., if the energy were initially put into the lowest vibrational mode of a wave, all but 3% of the energy would remain in that same mode [1]. This problem plagued physicists for almost 10 years. In 1965, however, Zabusky and Kruskal [3] explored the problem in its continuous limit. Numerical experiments conducted by Zabusky and Kruskal revealed surprising interactions between solutions of the continuous FPU model: solitary wave solutions passed through each other, emerging with no changes in velocity, amplitude, or shape. Although the problem was nonlinear in nature, wave solutions interacted as though they were linear and obeyed the superposition principle. This discovery hinted at a solution to the FPU paradox. In the continuous limit, these non-interacting waves were analogous to energy not mixing in different wave modes of a nonlinear lattice. Zabusky and Kruskal called these waves solitons2 to emphasize both the solitary nature of the waves and the similarities these waves shared with particles (e.g., the electron or positron).
1In recent years [2] it has been argued that Tsingou should also be credited with these inves- tigations. Most of the numerical computations presented in the paper attributed to FPU were, in fact, done by Tsingou. 2Zabusky originally planned to call these waves solitrons. Solitron, however, was already the name of a manufacturing company (which still exists today) [4].
1 It was soon noticed that the equation studied by Zabusky and Kruskal was, in fact, a nonlinear partial differential equation (PDE) discovered much earlier in the context of water waves. This so-called Korteweg-de Vries (KdV) equation was derived in 1895, by Dutch mathematicians Korteweg and de Vries, to describe solitary wave behavior observed by naval engineer John Scott Russell in 1834 [5]. Soon, other equations with soliton solutions were found. The modified KdV (mKdV) for example, which is related to the KdV equation by a simple transformation, was investigated by the Zabusky-Kruskal team [3], as well as Miura [6], in the mid-’60s. The sine- Gordon equation, which had been known by plasma physicists for some time, was soon discovered [7] to have soliton solutions as well. As more and more equations of this type were uncovered, the study of exact solutions to nonlinear PDEs exhibiting soliton behavior quickly became a subject important to mathematicians. In a series of papers [6, 8] published in the 1960s and ‘70s, Gardner, Greene, Kruskal, and Miura (GGKM) found analytic solutions to describe the interaction of waves governed by the KdV equation. Importantly, GGKM developed a general method to find these solutions. The papers also describe other important analytic properties of the KdV equation and its solutions, such as conservation laws. In 1968, Lax proposed a formal technique [9] for finding soliton solutions to non- linear evolution equations based on work done by GGKM. His technique involved relating the original nonlinear PDE to two linear operators via a compatibility condi- tion. These linear operators, called a Lax pair, are the main topic of this Thesis. The work done by Lax was further generalized by Zakharov and Shabat [10] and Ablowitz, Kaup, Newell, and Segur (AKNS) [11]. Soliton solutions were found for more compli- cated nonlinear PDEs, such as the Nonlinear Schr¨odingerequation (NLS), and Lax pairs were again required for the solutions. The solution technique created by these researchers is called the Inverse Scattering Transform (IST). It has been very success- ful over the last 40 years, and has been called one of the most important techniques
2 developed in applied mathematics over that time. During the time-period Lax pairs were first being discovered, mathematicians be- gan to examine conditions under which soliton waves could occur. Many nonlinear waves which had been studied before the discovery of solitons had exhibited “shock wave” solutions, requiring jump boundary conditions and conservation law formu- lations. Applying this analogy to the new nonlinear waves, Miura [6], along with Kruskal, Zabusky, and Whitham [9], discovered nine conservation laws related to the KdV equation in the late 1960s. The first few conservation laws correspond to physical conserved quantities of the system - conservation of mass, momentum, and energy [12]. The rest, however, have no obvious physical meaning. More and more conservation laws were soon found, and it was proven that an infinite number existed. It was soon found that most PDEs having the property of infinite conservation laws also have a Lax pair and analytic soliton solutions [13]. Conservation laws, and their relationship to Lax pairs, are another subject discussed in this Thesis. The connections between exact solutions to a PDE and the existence of either Lax pairs or an infinite number of conservation laws are well-studied [13]. In the years since these properties were first discovered, there have been many attempts to algorithmically compute Lax pairs and conservation laws for a given PDE. In partic- ular, the computation of conservation laws has been successful [12,14]. However, the algorithmic construction of Lax pairs has received much less attention [15]. This The- sis will outline algorithmic methods for the computation of both. The methods will require stringent properties for the PDEs being examined, but many of the equations important in nonlinear physics have these properties. The Thesis will proceed as follows: Chapter 2 will discuss the technical aspects of Lax pairs, conservation laws, and a related concept, gauge transformations. This Chapter will cover the theory and give concrete examples to further clarify the con- cepts. Chapter 3 will present two related methods for the computation of Lax pairs.
3 These methods are loosely based on past research, but have not appeared in lit- erature. Particular attention is paid to a type of triviality often ignored by those researching Lax pairs [16]. Chapter 4 will discuss the computation of conservation laws from Lax pairs. Specifically, it will discuss a method based on work by Drinfel’d and Sokolov [17] from which conservation laws can be constructed algorithmically. Finally, Chapter 5 will list a number of Lax pairs and conservation laws constructed by the methods described in Chapters 3 and 4. Many of the Lax pairs are more gen- eral than those published in literature. The conservation laws listed in this chapter are compared to published results.
4 CHAPTER 2 PRELIMINARIES
The attempts in the 1960s and ’70s to create a general theory for solving nonlin- ear PDEs led to the discovery of a number of properties related to exactly solvable equations. Some of these properties will be discussed in this chapter and a number of examples will be given to further elucidate concepts.
2.1 Lax Pairs
Perhaps the most basic nonlinear PDE studied in applied mathematics is Burgers’ equation [18],
ut + uux − κuxx = 0. (2.1)
Here, and throughout this document, u(x, t) is a dependent variable, and x and t are spatial and temporal independent variables, respectively. Moreover, subscripts denote partial derivatives. Equation (2.1) developed as a simplification of the Navier- Stokes equations [19] has applications ranging from fluid flow to optical tomography and traffic flow [20,21]. A solution to (2.1) can be found by the so called Hopf-Cole3 transformation [23,24], which linearizes (2.1) into the (linear) heat equation,
H(v) = vt − κvxx = 0. (2.2)
In particular, the change of variables governed by ∂ ln v v u = −2κ = −2κ x (2.3) ∂x v reduces (2.1) to
2κ ∂ v − v H(v) = 0. v2 x ∂x
3While Hopf and Cole independently discovered this transformation in the early 1950s [18], it appears as though Florin found the same solution first while studying properties of soil mechanics as early as 1948 [22].
5 ∂ ln v That is, if v solves (2.2), then u = −2κ ∂x solves (2.1). This Hopf-Cole solution to Burgers’ equation suggests a technique for solving nonlinear PDEs:
1. Find a transformation which changes a given nonlinear PDE into a linear one. 2. Solve the new linear equation. 3. Transform back to find solutions to the original nonlinear problem.
This is exactly what the IST attempts to do, though in a much more complicated way. Of particular interest to the rest of this Thesis is step 1 - finding transformations. This process of finding transformations such as (2.3) will be generalized and made algorithmic in the sections that follow. To develop a linearization technique for nonlinear PDEs, we consider the KdV equation [5],
ut + 6uux + uxxx = 0, (2.4) for u(x, t). Note that the 6 in front of the uux-term can be scaled to any constant. The similarity between (2.4) and (2.1) suggests a linearizing transformation akin to the Hopf-Cole transform, (2.3). Indeed, consider the transformation ψ u = − xx , ψ for a new dependent variable ψ(x, t). Rewriting yields
ψxx + uψ = 0. (2.5)
Moreover, the KdV equation is invariant under a Galilean transformation [7], u → u + κ, for some constant κ. So, for some constant λ, we can write (2.5) as
ψxx − (λ − u)ψ = 0. (2.6)
Equation (2.6) is a well-known Sturm-Liouville equation, related to the Schr¨odinger
∂2 equation of quantum physics [7]. We then define the linear operator L = ∂x2 + uI by
6 Lψ = λψ. (2.7)
Note that I in the definition of L is the identity operator. While this sort of transformation specified the solution for Burgers’ equation, it is unlikely to do so for more complicated PDEs. To allow for additional flexibility, also consider an operator M to describe the time evolution of the eigenfunction ψ,
Mψ = ψt. (2.8)
4 Assuming λt = 0, consider the t-derivative of (2.7),
Ltψ + Lψt = λψt.
Substituting in from (2.7) and (2.8) gives that
Ltψ + L(Mψ) = λ (Mψ) = M(λψ) = MLψ,
or
(Lt + [L, M])ψ = 0,
where [L, M] = LM −ML is the commutator of L and M. Continuing, we can readily verify [15] that
3 Mψ = −4ψx − 6uψx − 3uxψ
fits the role of (2.8). The preceding process suggests a general formulation for finding linear PDEs compatible with a nonlinear PDE. Let a pair of operators L and M be such that
Lψ = λψ and Mψ = ψt and, consequently,
(Lt + [L, M])ψ ˙=0, (2.9)
where ˙=means equal on solutions of the nonlinear PDE. Then L and M are known as a scalar operator Lax pair of the PDE. Moreover, (2.9) is called the operator Lax
4This assumption is related to solution theory regarding the linear Schr¨odingerequation and is required [5] for the IST to work.
7 equation. Example 1 Substituting the pair of operators
∂2 ∂3 ∂ L = + uI and M = −4 − 6u − 3u I. ∂x2 ∂x3 ∂x x into the Lax equation, (2.9), yields
(Lt + [L, M]) ψ = −(ut + 6uux + uxxx)ψ.
So, on solutions of the KdV equation (2.4), we see that the operators L and M are a Lax pair compatible with the KdV equation. The concept of the Lax pair was expanded further by both the Zakharov-Shabat team and the AKNS research group in the 1970s. Rather than searching for L and M operators consistent with a PDE, they considered n × n-matrices. In particular, they considered matrices X = X(λ, u, ux, uxx,...) and T = T (λ, u, ux, uxx,...) such that
XΦ = Φx and T Φ = Φt, (2.10) where Φ is a vector of eigenfunctions. Indeed, as X and T depended on the spectral parameter λ, AKNS and Zakharov and Shabat considered a more general scheme than that considered in the operator formalism discussed above. Note that, again, we required λt = 0. Following a similar technique as above, a compatibility condition relating X, T , and the original nonlinear PDE can be derived. Requiring Φxt = Φtx gives that
Φxt = XtΦ + XΦt = TxΦ + T Φx = Φtx.
Simplifying this and replacing Φt and Φx with (2.10) gives
(Xt − Tx + [X,T ])Φ = 0, where [X,T ] is the standard commutator,
[X,T ] = XT − TX.
8 Thus, we take
Xt − Tx + [X,T ] ˙=0 (2.11) to be the compatibility condition for a matrix Lax pair X and T . Here, 0 means the n × n zero-matrix (in the remainder of this Thesis, the distinction between the zero matrix and zero scalar will be clear from context). Equation (2.11) is called the matrix form Lax equation (or zero-curvature condition). Note that trivial pairs are avoided again by requiring compatibility with solutions of the PDE. The matrix formulation of the Lax equation will be used more commonly throughout the Thesis, so we say “Lax pair” to mean matrix form of the Lax pair. Example 2 Consider again the KdV equation, (2.4). Moreover, let Φ = [ψ φ]T be a vector-valued function such that ψ and φ are functions of u, ux, and higher- order x−derivatives of u. This type of dependence is said to be on the jet space of u. Then, we want to find a matrix X which satisfies XΦ = Φx and a matrix T such that T Φ = Φt. If such X and T exist, there must be a functional relationship
5 between ψ, φ, ψx, and φx. So, we make a simple guess that φ = ψx. We assume φ and ψ are sufficiently smooth so that φx = ψxx. Furthermore, we know from (2.6) that ψxx = λψ − uψ. Thus, we can replace ψxx with lower-order terms, giving
φx = λψ − uψ. Thus, there is enough information to construct a matrix X,
0 1 X = , λ − u 0 which satisfies Φx = XΦ. Finding the T matrix is not as immediate as finding X. The process, however, is the same. Assume again that φ = ψx. Then, again assuming that φ and ψ are sufficiently smooth, φt = (ψx)t = (ψt)x. However, we know that ψt = Mψ =
−4ψxxx − 6uψx − 3uxψ. So,
5This guess is motivated by the way second-order ordinary differential equations can be recast into matrix form.
9 φt = (−4ψxxx − 6uψx − 3uxψ)x.
This can be further reduced with the relations ψx = φ, ψxx = λψ − uψ, and ψxxx =
λψx − uxψ − uψx, yielding
2 2 φt = (−4λ + 2λu + 2u + uxx)ψ − uxφ.
Similarly,
ψt = −4ψxxx − 6uψx − 3uxψ
= −4(λφ − uφ − uxψ) − 6uφ − 3uxψ
= uxψ − (4λ + 2u)φ.
We have accomplished the goal of this example: φt and ψt are expressed in terms of φ and ψ after all x-derivatives of these functions have been removed. Thus, since
T Φt = [ψt φt] , we have computed the matrix T as ux −4λ − 2u T = 2 2 . −4λ + 2λu + 2u + uxx −ux A direct computation shows that these X and T satisfy the matrix Lax pair equation along with the KdV equation, (2.4). Indeed, after some algebra, substituting X and T into (2.11) gives
0 0 Xt − Tx + [X,T ] = − , ut + 6uux + uxxx 0 and substitution of (2.4) will give the zero-matrix. The above matrices can be generalized to generic α in the KdV equation,
ut + αuux + uxxx = 0. (2.12)
In this case,
0 1 X = 1 , (2.13) λ − 6 αu 0 and
10 1 1 6 αux −4λ − 3 αu T = 2 1 1 2 2 1 1 . (2.14) −4λ + 3 αλu + 18 α u + 6 αuxx − 6 αux Notice that substituting α = 6 into (2.13) and (2.14) generates the matrices found in Example 2, as expected.
2.2 AKNS Scheme
Finding a Lax pair for the KdV equation was somewhat intuitive. Other equations, however, do not produce compatible pairs so easily. Of particular interest, then, is work done by the AKNS group to construct integrable equations from classes of compatible Lax pairs. They took a reverse route: rather than constructing a pair for a given equation, these researchers started from general Lax pairs and tried to fit them to specific equations. In the simplified version of the AKNS method, only 2 × 2 matrices are considered (for a review of higher-dimensional matrices, see [25]). We will do the same. So, begin by assuming a general form of Lax matrices such that
−iκ q X = , (2.15) r iκ where κ is an eigenvalue and q(x, t) and r(x, t) are undetermined functions. The form of X comes from a generalized approach to the IST taken in [10]. For reasons which will become clear in section 2.3, we take κ to be the spectral parameter, rather than λ. Moreover, consider a very general T matrix,
AB T = , (2.16) CD where A, B, C and D are general functions of κ, and the jet space of u. Then, keeping in mind that the Lax equation is equivalent to XΦ = Φx and T Φ = Φt, where Φ = [ψ φ]T , we have
ψx = −iκψ + qφ, and ψt = Aψ + Bφ, (2.17)
φx = rψ + iκφ, φt = Cψ + Dφ. (2.18)
11 If we require symmetry of second-derivatives, ψxt = ψtx and φxt = φtx, and that
κt = 0, we find
qtφ + qφt − iκψt = Axψ + Aψx + Bxφ + Bφx
rtψ + rψt + iκφt = Cxψ + Cψx + Dxφ + Dφx.
Replacing ψx, φx, ψt, and φt from their definition in (2.17) and simplifying yields
(Ax − iAκ + rB)ψ + (qA + Bx + iBκ)φ = (qC − iAκ)ψ + (qt − iBκ + qD)φ
(Cx − iCκ + rD)φ + (qC + Dx + iDκ)φ = (rt + rA + iCκ)ψ + (rB + iDκ)φ.
So, requiring that φ and ψ be non-zero, we find the system of equations
Ax = qC − rB,
Bx + 2iκB = qt − (A − D)q,
Cx − 2iκC = rt + (A − D)r,
−Dx = qC − rB.
We can reduce this system of differential equations, without loss of generality [5], by taking D = −A. While there are a number of ways to find solutions to this system, we consider expanding A, B, and C into power series in κ (other authors e.g., [26] expand A, B, and C differently). In doing so, certain evolution equations will be found to hold on q or r, which will, in turn, determine the PDEs with which the Lax pairs are compatible. An example clarifies this procedure. Example 3 Expand A, B, and C up to the third power of κ,
2 3 A = A0 + A1κ + A2κ + A3κ , 2 3 B = B0 + B1κ + B2κ + B3κ , 2 3 C = C0 + C1κ + C2κ + C3κ ,
12 where Ai,Bi and Ci are functions on the jet space of q and r. While the reason behind expanding to the third order in κ is not clear a priori, we will find that this expansion produces useful results6. Then, substituting this expansion into (2.11) and simplifying, gives
2 0 = (rB0 − qC0 + (A0)x) + (rB1 − qC1 + (A1)x) κ + (rB2 − qC2 + (A2)x) κ 3 + (rB3 − qC3 + (A3)x) κ , 2 qt = (2qA0 + (B0)x) + (2(iB0 + qA1) + (B1)x) κ + (2(iB1 + qA2) + (B2)x) κ 3 4 + (2(iB2 + qA3) + (B3)x) κ + 2iB3κ , 2 −rt = (2rA0 − (C0)x) + (2(iC0 + rA1) − (C1)x) κ + (2(iC1 + rA2) − (C2)x) κ 3 4 + (2(iC2 + rA3) − (C3)x) κ + 2iC3κ .
We hope to solve for A, B, C, q, and r which specify a Lax pair associated with a PDE. So, splitting these equations in powers of κ yields
0 κ : rB0 − qC0 + (A0)x = 0, qt = 2qA0 + (B0)x , −rt = 2rA0 − (C0)x , 1 κ : rB1 − qC1 + (A1)x = 0, 2(iB0 + qA1) + (B1)x = 0, 2(iC0 + rA1) − (C1)x = 0, 2 κ : rB2 − qC2 + (A2)x = 0, 2(iB1 + qA2) + (B2)x = 0, 2(iC1 + rA2) − (C2)x = 0, 3 κ : rB3 − qC3 + (A3)x = 0, 2(iB2 + qA3) + (B3)x = 0, 2(iC2 + rA3) − (C3)x = 0, 4 κ : 2iB3 = 0, 2iC3 = 0.
4 From the equations at order κ , we see that B3 = C3 = 0. Substituting B3 = C3 = 0
3 into the equations at order κ will give A3,x = 0, or A3 = a3, where a3 is some constant. Hence, the equations at order k3 reduce to
iB2 + a3q = 0 and iC2 + a3r = 0.
Consequently, B2 = ia3q and C2 = ia3r. Substituting these into the equations at
2 order κ gives A2,x = 0, or A2 = a2, for some constant a2. Then,
ia3qx + 2(iB1 + a2q) = 0 and ia3rx − 2(iC1 + a2r) = 0.
1 1 Thus, B1 = ia2q − 2 a3qx and C1 = ia2r + 2 a3rx.
6In general, there is a lack of motivation in the order of power series expansion. This is one drawback to the AKNS method.
13 1 We continue by solving the equations at order κ . These determine that (A1)x = 1 1 2 a3(qrx + qxr), so that A1 = 2 (a3qr + a1), for some constant a1. Then, from the remaining equations at order κ1,
1 1 2iB + a q + ia q + a q2r − q = 0, 0 2 1 2 x 3 2 xx and
1 1 2iC + a r − ia r + a qr2 − r = 0. 0 2 1 2 x 3 2 xx Solving gives
1 1 B = ia q − a q + ia (q2r − q ) , 0 2 1 2 x 3 2 xx and
1 1 C = ia r + a r + ia (qr2 − r ) . 0 2 1 2 x 3 2 xx Finally, from the equations at order κ0, we see
1 1 (A ) = a (q r + qr ) + ia (q r − qr ) , 0 x 2 2 x x 4 3 xx xx so 1 1 A = a + a qr + ia (q r − qr ) , 0 0 2 2 4 3 x x
0 where a0 is some constant. Moreover, the remaining equations at order κ specify q and r as evolution equations. Specifically,
1 1 1 1 q = 2a q + ia q + a q2r + q + ia 3qq r − q , (2.19) t 0 2 1 x 2 2 xx 2 3 x 2 xxx and
1 1 1 r = −2a r + ia r + a qr2 + r + ia (3qrr − r ) . (2.20) t 0 2 1 x 2 2 xx 2 3 x xxx
Substituting a0 = a1 = a2 = 0, a3 = −4i, and r = −1 into (2.19) and (2.20) yields
14 qt + 6qqx + qxxx = 0,
the KdV equation. The goal of this exercise was to represent T as a cubic polynomial on the jet space of u. From the above method, we have found q, r, A, B, and C which form a valid Lax pair when substituted into (2.15) and (2.16). Thus, we have constructed a Lax pair for the KdV equation,
−iκ q X = , (2.21) AKNS −1 iκ and
3 2 2 −4iκ + 2iqκ − qx 4qκ + 2iqxκ − 2q − qxx TAKNS = 2 3 . (2.22) −4κ + 2q 4iκ − 2iqκ + qx Substituting this back into the Lax equation (2.11) yields
0 q + 6uu + u (X ) − (T ) + [X ,T ] = t x xxx , AKNS t AKNS x AKNS AKNS 0 0 so the pair is indeed valid. A number of other Lax pairs can be found this way. Important examples include the NLS equation, which comes from expanding A, B and C as second-order polyno- mials and the sine-Gordon equation, which comes from expanding in inverse powers of κ. For a larger list of these Lax pairs, as well as a more detailed version of the above discussion, see [5]. The AKNS scheme was successful in helping to classify integrable systems. It also broadened the form of Lax pairs mathematicians were willing to use. However, the whole approach had one major flaw: if a researcher had an equation which needed to be solved, the scheme would not necessarily produce a Lax pair for it. Attempts to fix the problem require much creativity on the part of a researcher, and were not guaranteed to work [27]. Moreover, some require techniques and language difficult for non-experts to interpret [28]. The methods described in Chapter 3 are an attempt to fix this. Before considering new computation techniques, however, we discuss two
15 more important properties of integrable PDEs.
2.3 Gauge Transformations
At first glance, the above sections seem to be surprising, if not contradictory. In particular, two separate Lax pairs have been given for the same equation. Given that both of these pairs are valid, some questions arise: is there a relationship between these two pairs? Is one pair better than the other? If so, can one pair easily be trans- formed into the more useful pair? The first of these questions will be addressed by the following section on gauge transformations. The other questions will be addressed in the following Chapters. Since we know that more than one matrix Lax pair exists for the KdV equation, we hypothesize that this is true in general. In particular, we look at the vector Φ used in the definition of X and T , (2.10); if a different Φ, say Φˆ is used, it seems likely that a different X and T , say Xˆ and Tˆ will be required. Thus, consider a nonsingular matrix G such that Φˆ = GΦ, where Φ = [ψ φ]T , as before. Then, we can relate X and Xˆ by noting that ˆ −1 ˆ ˆ ˆ Φ = G (Φ)x + GxΦ = (GX + Gx)Φ = (GX + Gx)G Φ = XΦ. x Rewriting this, we see that
ˆ ˆ −1 −1 XΦ = (GXG + GxG )Φ. (2.23)
Similarly, we find that
ˆ ˆ −1 −1 T Φ = (GT G + GtG )Φ. (2.24)
Thus, for any pair of matrices which satisfy (2.11), an infinite number of equivalent pairs may be found through a gauge transformation of the form (2.23)-(2.24). In general, constructing a gauge between two given Lax pairs (or, more generally, from one known Lax pair to a pair of a given form) is nontrivial. In the simplest cases, however these gauge transformations are fairly straightforward to create.
16 Example 4 Consider again the two Lax pairs found for the KdV equation. Let
0 1 ux −4λ − 2u X = ,T = 2 2 , λ − u 0 −4λ + 2uλ + uxx + 2u −ux
and
3 2 2 ˆ −iκ u ˆ −4iκ + 2iuκ − ux 4uκ + 2iuxκ − 2u − uxx X = , T = 2 3 . −1 iκ −4κ + 2u 4iκ − 2iuκ + ux
Rewriting equations (2.23) and (2.24), we need to find a nonsingular matrix G such that
ˆ ˆ XG = GX + Gx and TG = GT + Gt.
Because both X and Xˆ are linear in u, let us assume that G is a matrix with only constant entries g11, g12, g21, g22,
g g G = 11 12 . (2.25) g21 g22
Since Gx = Gt = 0, this significantly reduces the gauge relationship. So, we have
−ig κ − g u −ig κ + g u XGˆ = 11 21 12 22 , ig21κ − g11 ig22κ − g12 and
g λ − g u g GX = 12 12 11 . g22λ − g22u g21 These must match element-wise, which yields
g11 = −ig12κ + g22u, (2.26)
g21 = ig22κ − g12, (2.27)
ig11κ − g21u = −g12λ + g12u, (2.28)
ig21κ − g11 = g22λ − g22u. (2.29)
Substituting g11 and g21 from (2.26)-(2.27) into (2.28)-(2.29) yields
17 2 g12(κ + λ) = 0, 2 g22(κ + λ) = 0.
If we set λ = −κ2, this is satisfied. Thus, if G is nonsingular, we have that
−ig κ + g u g G = 12 22 12 , (2.30) ig22κ − g12 g22 and λ = −κ2, will relate X and Xˆ. Moreover,
2 2 det(G) = g12 − 2ig12g22κ + g22u, so as long as g12 or g22 is nonzero, G will be a legitimate gauge transformation.
However, g22 multiplies a u term in the (1, 1)-element of (2.30). Since we assumed that G was a constant matrix, we must set g22 = 0. This yields
−ig κ g G = 12 12 . −g12 0
To match the gauge transformation found in literature [15], take g12 = 1. This gives
−iκ 1 G = . −1 0 In fact, using this G gives
−iκ −κ2 − λ + u GXG−1 = . −1 iκ Substituting λ = −κ2 gives Xˆ, as expected. More complicated gauge transformations can also be found. In the next example, we consider a more complicated case, for which a constant gauge matrix cannot be found. Rather than turning (2.23) and (2.24) into a simple algebraic system, we will be forced to work with a system of differential equations. Example 5 Consider the mKdV equation [6],
2 ut + αu ux + uxxx = 0.
18 Common values for α are ±6 and ±1. In this example, we will consider the case α = −6,
2 ut − 6u ux + uxxx = 0. (2.31)
Consider the following two Lax pairs from the mKdV equation,
0 1 X = 2 (2.32) λ + u + ux 0 with
2 −2uux − uxx −4λ + 2u + 2ux T = 2 2 4 2 . −4λ − 2λu − 2λux + 2u + 4u ux − 2uuxx − uxxx 2uux + uxx (2.33)
This pair can be computed in the same way as were X1 and T1 from Example 4. A second pair [5] is
−ik u Xˆ = (2.34) u ik with
3 2 2 3 ˆ −4ik − 2iku 4k u + 2ikux + 2u − uxx T = 2 3 3 2 , (2.35) 4k u − 2ikux + 2u − uxx 4ik + 2iku
obtained through the AKNS-scheme. Now, we want to find a nonsingular matrix G
ˆ −1 −1 such that X = GXG + GxG . We first naively consider a constant gauge matrix, as in Example 4,
g g G = 11 12 . g21 g22 Equating terms element-wise as before, we get
g11 = −ig12κ + g22u,
g21 = ig22κ + g12u, 2 g12(λ + u + ux) = −ig11κ + g21u, 2 g22(λ + u + ux) = ig21κ + g11u.
19 Substituting g11 and g21 from the first two equations in the above system into the last two equations gives
2 g12(λ + ux) = −g12κ , 2 g22(λ + ux) = −g22κ .
Assuming ux is non-constant, the only solution is g12 = g22 = 0. However, this would also mean that g11 = g21 = 0, leading to G = 0. Thus, the choice of a constant matrix is too limited, and must be modified. Because the sets of Lax pairs have u2 as their highest order, we assume G is linear in u and its first derivative,
g (u, u ) g (u, u ) G = 11 x 12 x , (2.36) g21(u, ux) g22(u, ux)
for arbitrary functions g11, g12, g21, and g22. Then, substituting G into (2.23) gives the system of equations
ˆ XG − GX − Gx = 0.
ˆ Substituting X and X gives 4 equations in terms of g11, g12, g21, g22, u, and spectral parameters. Specifically, the following must be satisfied
ig12κ − g22u + g12,x + g11 = 0, (2.37)
ig22κ + g12u − g22,x − g21 = 0, (2.38) 2 ig11κ − g21u + g11,x + g12 λ + u + ux = 0, (2.39) 2 ig21κ + g11u − g21,x − g22 λ + u + ux = 0. (2.40)
Assume for the moment that g12 and g22 are non-constant. Say, e.g., they are
functions of u. Then, (2.37) and (2.38) require that g11 and g21 be quadratic functions of u. However, this violates the assumption that G is linear in u. Thus, we assume
that g12 and g22 are constants, say g12 = α and g22 = β. Substituting these values into (2.39) and (2.40) yields
20 2 ig11κ − g21u + g11,x + α(λ + u + ux) = 0, (2.41) 2 ig21κ + g11u − g21,x − β(λ + u + ux) = 0, (2.42)
while (2.37) and (2.38) give that g11 = βu − iαk and g21 = αu + iβk. Now replacing these values of g11 and g21 into the remaining equations and simplifying gives
2 α(κ + λ) + (β + α)ux = 0, 2 β(κ + λ) + (β + α)ux = 0.
One solution to this system is λ = −k2 and (α + β) = 0. That is, α = −β. Thus, the system (2.37)-(2.40) is satisfied if
2 g12 = −g22 = α, g11 = −α(iκ + u), g21 = α(−iκ + u), and λ = −κ .
Substituting the above into G gives
−α(iκ + u) α G = . α(−iκ + u) −α For α = −1, this simplifies into
ik + u −1 . ik − u 1
2.4 Conservation Laws
The concept of a conservation law is well-known from physics. Conservation of mass, energy, and momentum are common phrases even among non-specialists. While this Thesis focuses primarily on Lax pairs, conservations law are also discussed. We introduce the concept now, in anticipation of what will follow in Chapters 4 and 5. In particular, a conservation law for a given PDE is an equation of the form
ρt + Jx ˙=0. (2.43)
21 The function ρ(x, t) is called a conserved density and the function J(x, t) is the associated flux. Both ρ and J are functions on the jet space of u. As before, ˙= means equality on solutions of the PDE. Earlier we saw that a PDE with a Lax pair could be solved exactly with the IST. Similarly, the existence of an infinite number of conservation laws predicts integrabil- ity7 [13]. Moreover, conservation laws can be used to find other properties of interest for a PDE. For example, the Miura transformation, which helped lead to the discovery of a Lax pair for the KdV equation, was discovered [7] in the context of conservation laws. Furthermore, the knowledge of a conservation law is often helpful for finding a numerical solution to a PDE. Thus, finding conserved quantities is an important topic. In section 4, we will discuss a construction technique based on Lax pairs. In this section, however, we will demonstrate the concept of conservation laws and their relationship to Lax pairs with an example. Example 6 Consider again the KdV equation, (2.4). It is well-known that there are an infinite amount of conservation laws related to the KdV equation [6]. The first two conservation laws are simple: