An Introduction to Lévy and Feller Processes
– Advanced Courses in Mathematics - CRM Barcelona 2014 –
René L. Schilling
TU Dresden, Institut für Mathematische Stochastik, 01062 Dresden, Germany
[email protected] http://www.math.tu-dresden.de/sto/schilling
These course notes will be published, together Davar Khoshnevisan’s notes on Invariance and Comparison Principles for Parabolic Stochastic Partial Differential Equations as From Lévy-Type Processes to Parabolic SPDEs by the CRM, Barcelona and Birkäuser, Cham 2017 (ISBN: 978-3-319-34119-4). The arXiv-version and the published version may differ in layout, pagination and wording, but not in content. arXiv:1603.00251v2 [math.PR] 17 Oct 2016
Contents
Preface 3
Symbols and notation 5
1. Orientation 7
2. Lévy processes 12
3. Examples 16
4. On the Markov property 24
5. A digression: semigroups 30
6. The generator of a Lévy process 36
7. Construction of Lévy processes 44
8. Two special Lévy processes 49
9. Random measures 55
10.A digression: stochastic integrals 64
11.From Lévy to Feller processes 75
12.Symbols and semimartingales 84
13.Dénouement 93
A. Some classical results 97
Bibliography 104
1
Preface
These lecture notes are an extended version of my lectures on Lévy and Lévy-type processes given at the Second Barcelona Summer School on Stochastic Analysis organized by the Centre de Recerca Matemàtica (CRM). The lectures are aimed at advanced graduate and PhD students. In order to read these notes, one should have sound knowledge of measure theoretic probability theory and some background in stochastic processes, as it is covered in my books Measures, Integals and Martingales [54] and Brownian Motion [56].
My purpose in these lectures is to give an introduction to Lévy processes, and to show how one can extend this approach to space inhomogeneous processes which behave locally like Lévy processes. After a brief overview (Chapter 1) I introduce Lévy processes, explain how to char- acterize them (Chapter 2) and discuss the quintessential examples of Lévy processes (Chapter 3). The Markov (loss of memory) property of Lévy processes is studied in Chapter 4. A short analytic interlude (Chapter 5) gives an introduction to operator semigroups, resolvents and their generators from a probabilistic perspective. Chapter 6 brings us back to generators of Lévy processes which are identified as pseudo differential operators whose symbol is the characteristic exponent of the Lévy process. As a by-product we obtain the Lévy–Khintchine formula. Continuing this line, we arrive at the first construction of Lévy processes in Chapter 7. Chap- ter 8 is devoted to two very special Lévy processes: (compound) Poisson processes and Brownian motion. We give elementary constructions of both processes and show how and why they are special Lévy processes, indeed. This is also the basis for the next chapter (Chapter 9) where we construct a random measure from the jumps of a Lévy process. This can be used to provide a further construction of Lévy processes, culminating in the famous Lévy–Itô decomposition and yet another proof of the Lévy–Khintchine formula. A second interlude (Chapter 10) embeds these random measures into the larger theory of ran- dom orthogonal measures. We show how we can use random orthogonal measures to develop an extension of Itô’s theory of stochastic integrals for square-integrable (not necessarily continuous) martingales, but we restrict ourselves to the bare bones, i.e. the L2-theory. In Chapter 11 we in- troduce Feller processes as the proper spatially inhomogeneous brethren of Lévy processes, and we show how our proof of the Lévy–Khintchine formula carries over to this setting. We will see, in particular, that Feller processes have a symbol which is the state-space dependent analogue of the characteristic exponent of a Lévy process. The symbol describes the process and its gener- ator. A probabilistic way to calculate the symbol and some first consequences (in particular the semimartingale decomposition of Feller processes) is discussed in Chapter 12; we also show that
3 4 R. L. Schilling: An Introduction to Lévy and Feller Processes the symbol contains information on global properties of the process, such as conservativeness. In the final Chapter 13, we summarize (mostly without proofs) how other path properties of a Feller process can be obtained via the symbol. In order to make these notes self-contained, we collect in the appendix some material which is not always included in standard graduate probability courses.
It is now about time to thank many individuals who helped to bring this enterprise on the way. I am grateful to the scientific board and the organizing committee for the kind invitation to deliver these lectures at the Centre de Recerca Matemàtica in Barcelona. The CRM is a wonderful place to teach and to do research, and I am very happy to acknowledge their support and hospitality. I would like to thank the students who participated in the CRM course as well as all students and readers who were exposed to earlier (temporally & spatially inhomogeneous. . . ) versions of my lectures; without your input these notes would look different! I am greatly indebted to Ms. Franziska Kühn for her interest in this topic; her valuable comments pinpointed many mistakes and helped to make the presentation much clearer. And, last and most, I thank my wife for her love, support and forbearance while these notes were being prepared.
Dresden, September 2015 René L. Schilling Symbols and notation
This index is intended to aid cross-referencing, so notation that is specific to a single chapter is generally not listed. Some symbols are used locally, without ambiguity, in senses other than those given below; numbers following an entry are page numbers. Unless otherwise stated, functions are real-valued and binary operations between functions such n→∞ as f ± g, f · g, f ∧ g, f ∨ g, comparisons f 6 g, f < g or limiting relations fn −−−→ f , limn fn, liminfn fn, limsupn fn, supn fn or infn fn are understood pointwise.
−ix·ξ General notation: analysis eξ (x) e positive always in the sense > 0 General notation: probability negative always in the sense 6 0 N 1,2,3,... ∼ ‘is distributed as’ inf /0 inf /0 = +∞ ⊥⊥ ‘is stochastically independent’ a ∨ b maximum of a and b a.s. almost surely (w. r. t. P) a ∧ b minimum of a and b iid independent and identically distributed bxc largest integer n 6 x Rd 2 2 2 N,Exp,Poi normal, exponential, Poisson |x| norm in : |x| = x1 + ··· + xd distribution Rd d x · y scalar product in : ∑ j=1 x jy j P,E probability, expectation 1, x ∈ A V C 1A 1A(x) = , ov variance, covariance 0, x ∈/ A (L0)–(L3) definition of a Lévy process, 7 δx point mass at x (L20) 12 D domain ∆ Laplace operator Sets and σ-algebras ∂ c ∂ j partial derivative A complement of the set A ∂x j ∂ ∂ > A closure of the set A ∇, ∇x gradient ,..., ∂x1 ∂xd · F f , fb Fourier transform A ∪ B disjoint union, i.e. A ∪ B for (2π)−d R e−ix·ξ f (x)dx disjoint sets A ∩ B = /0 F−1 f , f inverse Fourier transform Br(x) open ball, R eix·ξ f (x)dx centre x, radius r q supp f support, { f 6= 0}
5 6 R. L. Schilling: An Introduction to Lévy and Feller Processes
B(E) Borel sets of E Spaces of functions X Ft canonical filtration σ(Xs : s 6 t) B(E) Borel functions on E σ S F∞ t>0 Ft Bb(E) – – , bounded Fτ 75 C(E) continuous functions on E 29 Fτ+ Cb(E) – – , bounded predictable σ-algebra, 101 P C∞(E) – – , lim f (x) = 0 |x|→∞
Stochastic processes Cc(E) – – , compact support Px,Ex law and mean of a Markov Cn(E) n times continuously diff’ble process starting at x, 24 functions on E n Xt− left limit lims↑t Xs Cb(E) – – , bounded (with all derivatives) ∆Xt jump at time t: Xt − Xt− Cn (E) – – , 0 at infinity (with all σ,τ stopping times: {σ 6 t} ∈ Ft , ∞ derivatives) t > 0 n x Cc(E) – – , compact support τr ,τr inf{t > 0 : |Xt − X0| > r}, first p p p p exit time from the open ball Br(x) L (E, µ),L (µ),L (E) L space w. r. t. the
centered at x = X0 measure space (E,A , µ) càdlàg right continuous on [0,∞) with S(Rd) rapidly decreasing smooth finite left limits on (0,∞) functions on Rd, 36 1. Orientation
Stochastic processes with stationary and independent increments are classical examples of Markov processes. Their importance both in theory and for applications justifies to study these processes and their history. The origins of processes with independent increments reach back to the late 1920s and they are closely connected with the notion of infinite divisibility and the genesis of the Lévy–Khintchine formula. Around this time, the limiting behaviour of sums of independent random variables
X0 := 0 and Xn := ξ1 + ξ2 + ··· + ξn, n ∈ N, was well understood through the contributions of Borel, Markov, Cantelli, Lindeberg, Feller, de Finetti, Khintchine, Kolmogorov and, of course, Lévy; two new developments emerged, on the one hand the study of dependent random variables and, on the other, the study of continuous-time analogues of sums of independent random variables. In order to pass from n ∈ N to a continuous parameter t ∈ [0,∞) we need to replace the steps ξk by increments Xt − Xs. It is not hard to see that Xt , t ∈ N, with iid (independent and identically distributed) steps ξk enjoys the following properties:
X0 = 0 a.s. (L0) stationary increments Xt − Xs ∼ Xt−s − X0 ∀s 6 t (L1) independent increments Xt − Xs ⊥⊥σ(Xr,r 6 s) ∀s 6 t (L2) where ‘∼’ stands for ‘same distribution’ and ‘⊥⊥’ for stochastic independence. In the non-discrete setting we will also require a mild regularity condition
continuity in probability limP(|Xt − X0| > ε) = 0 ∀ε > 0 (L3) t→0 which rules out fixed discontinuities of the path t 7→ Xt . Under (L0)–(L2) one has that
n Xt = ξk,n(t) and ξk,n(t) = (X kt − X (k−1)t ) are iid (1.1) ∑ n k=1 n for every n ∈ N. Letting n → ∞ shows that Xt arises as a suitable limit of (a triangular array of) iid random variables which transforms the problem into a question of limit theorems and infinite divisibility.
7 8 R. L. Schilling: An Introduction to Lévy and Feller Processes
This was first observed in 1929 by de Finetti [15] who introduces (without naming it, the name is due to Bawly [6] and Khintchine [29]) the concept of infinite divisibility of a random variable X n ∀n ∃ iid random variables ξi,n : X ∼ ∑ ξi,n (1.2) i=1 and asks for the general structure of infinitely divisible random variables. His paper contains two remarkable results on the characteristic function χ(ξ) = Eeiξ·X of an infinite divisible random variable (taken from [39]):
De Finetti’s first theorem. A random variable X is infinitely divisible if, and only if, its charac- teristic function is of the form χ(ξ) = limn→∞ exp − pn(1 − φn(ξ)) where pn > 0 and φn is a characteristic function.
De Finetti’s second theorem. The characteristic function of an infinitely divisible random vari- able X is the limit of finite products of Poissonian characteristic functions
ihnξ χn(ξ) = exp − pn(1 − e ) ,
and the converse is also true. In particular, all infinitely divisible laws are limits of convo- lutions of Poisson distributions.
Because of (1.1), Xt is infinitely divisible and as such one can construct, in principle, all indepen- dent-increment processes Xt as limits of sums of Poisson random variables. The contributions of Kolmogorov [31], Lévy [37] and Khintchine [28] show the exact form of the characteristic function of an infinitely divisible random variable
Z iξ·X 1 iy·ξ − logEe = −il · ξ + ξ · Qξ + 1 − e + iξ · y1(0,1)(|y|) ν(dy) (1.3) 2 y6=0 where l ∈ Rd, Q ∈ Rd×d is a positive semidefinite symmetric matrix, and ν is a measure on Rd R 2 \{0} such that y6=0 min{1,|y| }ν(dy) < ∞. This is the famous Lévy–Khintchine formula. The exact knowledge of (1.3) makes it possible to find the approximating Poisson variables in de
Finetti’s theorem explicitly, thus leading to a construction of Xt . A little later, and without knowledge of de Finetti’s results, Lévy came up in his seminal paper
[37] (see also [38, Chap. VII]) with a decomposition of Xt in four independent components: a deterministic drift, a Gaussian part, the compensated small jumps and the large jumps ∆Xs :=
Xs − Xs−. This is now known as Lévy–Itô decomposition: p Z ! Xt = tl + QWt + lim ∆Xs −t yν(dy) + ∆Xs (1.4) ε→0 ∑ ∑ 0
Lévy uses results from the convergence of random series, notably Kolmogorov’s three series the- orem, in order to explain the convergence of the series appearing in (1.4). A rigorous proof based on the representation (1.5) are due to Itô [23] who completed Lévy’s programme to construct Xt . The coefficients l,Q,ν are the same as in (1.3), W is a d-dimensional standard Brownian mo- tion, and Nω ((0,t] × B) is the random measure #{s ∈ (0,t] : Xs(ω) − Xs−(ω) ∈ B} counting the jumps of X; it is a Poisson random variable with intensity EN((0,t]×B) = tν(B) for all Borel sets B ⊂ Rd \{0} such that 0 ∈/ B. Nowadays there are at least six possible approaches to constructing processes with (stationary and) independent increments X = (Xt )t>0.
The de Finetti–Lévy(–Kolmogorov–Khintchine) construction. The starting point is the observation that each Xt satisfies (1.1) and is, therefore, infinitely divisible. Thus, the character- istic exponent logEeiξ·Xt is given by the Lévy–Khintchine formula (1.3), and using the triplet √ (l,Q,ν) one can construct a drift lt, a Brownian motion QWt and compound Poisson processes, i.e. Poisson processes whose intensities y ∈ Rd are mixed with respect to the finite measure
νε (dy) := 1[ε,∞)(|y|)ν(dy). Using a suitable compensation (in the spirit of Kolmogorov’s three series theorem) of the small jumps, it is possible to show that the limit ε → 0 exists locally uni- formly in t. A very clear presentation of this approach can be found in Breiman [10, Chapter 14.7–8], see also Chapter 7.
The Lévy–Itô construction. This is currently the most popular approach to independent-in- crement processes, see e.g. Applebaum [2, Chapter 2.3–4] or Kyprianou [36, Chapter 2]. Origi- nally the idea is due to Lévy [37], but Itô [23] gave the first rigorous construction. It is based on the observation that the jumps of a process with stationary and independent increments define a
Poisson random measure Nω ([0,t]×B) and this can be used to obtain the Lévy–Itô decomposition (1.5). The Lévy–Khintchine formula is then a corollary of the pathwise decomposition. Some of the best presentations can be found in Gikhman–Skorokhod [18, Chapter VI], Itô [24, Chapter 4.3] and Bretagnolle [11]. A proof based on additive functionals and martingale stochastic integrals is due to Kunita & Watanabe [35, Section 7]. We follow this approach in Chapter 9.
Variants of the Lévy–Itô construction. The Lévy–Itô decomposition (1.5) is, in fact, the semimartingale decomposition of a process with stationary and independent increments. Using the general theory of semimartingales – which heavily relies on general random measures – we can identify processes with independent increments as those semimartingales whose semimartingale characteristics are deterministic, cf. Jacod & Shiryaev [27, Chapter II.4c]. A further interesting derivation of the Lévy–Itô decomposition is based on stochastic integrals driven by martingales. The key is Itô’s formula and, again, the fact that the jumps of a process with stationary and in- dependent increments defines a Poisson point process which can be used as a good stochastic 10 R. L. Schilling: An Introduction to Lévy and Feller Processes integrator; this unique approach1 can be found in Kunita [34, Chapter 2].
Kolmogorov’s construction. This is the classic construction of stochastic processes starting from the finite-dimensional distributions. For a process with stationary and independent incre- ments these are given as iterated convolutions of the form
E f (Xt0 ,...,Xtn ) Z Z = ··· f (y0,y0 + y1,...,y0 + ··· + yn) pt0 (dy0)pt1−t0 (dy1)... ptn−tn−1 (dyn)
R iξ·y with pt (dy) = P(Xt ∈ dy) or e pt (dy) = exp[−tψ(ξ)] where ψ is the characteristic exponent (1.3). Particularly nice presentations are those of Sato [51, Chapter 2.10–11] and Bauer [5, Chapter 37].
The invariance principle. Just as for a Brownian motion, it is possible to construct Lévy processes as limits of (suitably interpolated) random walks. For finite dimensional distributions this is done in Gikhman & Skorokhod [18, Chapter IX.6]; for the whole trajectory, i.e. in the space of càdlàg2 functions D[0,1] equipped with the Skorokhod topology, the proper references are Prokhorov [42] and Grimvall [19].
Random series constructions. A series representation of an independent-increment pro- cess (Xt )t∈[0,1] is an expression of the form
n Xt = lim Jk1[0,t](Uk) −tck a.s. n→∞ ∑ k=1
The random variables Jk represent the jumps, Uk are iid uniform random variables and ck are suitable deterministic centering terms. Compared with the Lévy–Itô decomposition (1.4), the main difference is the fact that the jumps are summed over a deterministic index set {1,2,...n} while the summation in (1.4) extends over the random set {s : |∆Xs| > 1/n}. In order to construct a process with characteristic exponent (1.3) where l = 0 and Q = 0, one considers a disintegration Z ∞ ν(dy) = σ(r,dy)dr. 0
It is possible, cf. Rosinski´ [47], to choose σ(r,dy) = P(H(r,Vk) ∈ dy) where V = (Vk)k∈N is any sequence of d-dimensional iid random variables and H : (0,∞) × Rd → Rd is measurable. Now let Γ = (Γk)k∈N be a sequence of partial sums of iid standard exponential random variables and
U = (Uk)k∈N iid uniform random variables on [0,1] such that U,V,Γ are independent. Then Z k Z Jk := H(Γk,Vk) and ck = yσ(r,dy)dr k−1 |y|<1
1It reminds of the elegant use of Itô’s formula in Kunita-and-Watanabe’s proof of Lévy’s characterization of Brow- nian motion, see e.g. Schilling & Partzsch [56, Chapter 18.2]. 2A french acronym meaning ‘right-continuous and finite limits from the left’. Chapter 1: Orientation 11 is the sought-for series representation, cf. Rosinski´ [47] and [46]. This approach is important if one wants to simulate independent-increment processes. Moreover, it still holds for Banach space valued random variables. 2. Lévy processes
P Throughout this chapter, (Ω,A , ) is a fixed probability space, t0 = 0 6 t1 6 ... 6 tn and 0 6 s < t d are positive real numbers, and ξk,ηk, k = 1,...,n, denote vectors from R ; we write ξ · η for the Euclidean scalar product.
Rd Definition 2.1. A Lévy process X = (Xt )t>0 is a stochastic process Xt : Ω → satisfying (L0)– (L3); this is to say that X starts at zero, has stationary and independent increments and is continu- ous in probability.
One should understand Lévy processes as continuous-time versions of sums of iid random vari- ables. This can easily be seen from the telescopic sum
n X − X = X − X , s < t, n ∈ N, t s ∑ tk tk−1 (2.1) k=1
k where tk = s + n (t − s). Since the increments Xtk − Xtk−1 are iid random variables, we see that all Xt of a Lévy process are infinitely divisible, i.e. (1.2) holds. Many properties of a Lévy process will, therefore, resemble those of sums of iid random variables. Let us briefly discuss the conditions (L0)–(L3).
X Remark 2.2. We have stated (L2) using the canonical filtration Ft := σ(Xr, r 6 t) of the process X. Often this condition is written in the following way
Xt − Xt ,...,Xt − Xt are independent random variables n n−1 1 0 (L20) for all n ∈ N, t0 = 0 < t1 < ··· < tn.
It is easy to see that this is actually equivalent to (L2): From
bi-measurable (Xt1 ,...,Xtn ) ←−−−−−−−→ (Xt1 − Xt0 ,...,Xtn − Xtn−1 ) it follows that
X Ft = σ (Xt1 ,...,Xtn ), 0 6 t1 6 ... 6 tn 6 t = σ (Xt1 − Xt0 ,...,Xtn − Xtn−1 ), 0 = t0 6 t1 6 ... 6 tn 6 t (2.2) = σ Xu − Xv, 0 6 v 6 u 6 t , and we conclude that (L2) and (L20) are indeed equivalent. The condition (L3) is equivalent to either of the following
12 Chapter 2: Lévy processes 13
• ‘t 7→ Xt is continuous in probability’;
1 • ‘t 7→ Xt is a.s. càdlàg’ (up to a modification of the process). The equivalence with the first claim, and the direction ‘⇐’ of the second claim are easy:
P P 1E lim (|Xu − Xt | > ε) = lim (|X|t−u|| > ε) 6 lim (|Xh| ∧ ε), (2.3) u→t |t−u|→0 h→0 ε but it takes more effort to show that continuity in probatility (L3) guarantees that almost all paths are càdlàg.2 Usually, this is proved by controlling the oscillations of the paths of a Lévy process, cf. Sato [51, Theorem 11.1], or by the fundamental regularization theorem for submartingales, see Revuz & Yor [44, Theorem II.(2.5)] and Remark 11.2; in contrast to the general martingale setting [44, Theorem II.(2.9)], we do not need to augment the natural filtration because of (L1) and (L3). Since our construction of Lévy processes gives directly a càdlàg version, we do not go into further detail. The condition (L3) has another consequence. Recall that the Cauchy–Abel functional equa- tions have unique solutions if, say, φ, ψ and θ are (right-)continuous:
φ(s +t) = φ(s) · φ(t) φ(t) = φ(1)t , ψ(s +t) = ψ(s) + ψ(t)(s,t > 0) =⇒ ψ(t) = ψ(1) ·t (2.4) c θ(st) = θ(s) · θ(t) θ(t) = t , c > 0. The first equation is treated in Theorem A.1 in the appendix. For a thorough discussion on condi- tions ensuring uniqueness we refer to Aczel [1, Chapter 2.1]. Rd Proposition 2.3. Let (Xt )t>0 be a Lévy process in . Then iξ·X iξ·X t d Ee t = Ee 1 , t > 0, ξ ∈ R . (2.5)
Proof. Fix s,t > 0. We get (L2) (L1) Eeiξ·(Xt+s−Xs)+iξ·Xs = Eeiξ·(Xt+s−Xs) Eeiξ·Xs = Eeiξ·Xt Eeiξ·Xs , or φ(t + s) = φ(t) · φ(s), if we write φ(t) = Eeiξ·Xt . Since x 7→ eiξ·x is continuous, there is for every ε > 0 some δ > 0 such that
E iξ·(Xt −Xs) P P |φ(t) − φ(s)| 6 e − 1 6 ε + 2 (|Xt − Xs| > δ) = ε + 2 (|X|t−s|| > δ). Thus, (L3) guarantees that t 7→ φ(t) is continuous, and the claim follows from (2.4).
Notice that any solution f (t) of (2.4) also satisfies (L0)–(L2); by Proposition 2.3 Xt + f (t) is a Lévy process if, and only if, f (t) is continuous. On the other hand, Hamel, cf. [1, p. 35], constructed discontinuous (non-measurable and locally unbounded) solutions to (2.4). Thus, (L3) means that t 7→ Xt has no fixed discontinuities, i.e. all jumps occur at random times.
1‘Right-continuous and finite limits from the left’ 2More precisely: that there exists a modification of X which has almost surely càdlàg paths. 14 R. L. Schilling: An Introduction to Lévy and Feller Processes
P Corollary 2.4. The finite-dimensional distributions (Xt1 ∈ dx1,...,Xtn ∈ dxn) of a Lévy process are uniquely determined by
n ! n h itk−tk−1 E E exp i ∑ ξk · Xtk = ∏ exp(i(ξk + ··· + ξn) · X1) (2.6) k=1 k=1
Rd N for all ξ1,...,ξn ∈ , n ∈ and 0 = t0 6 t1 6 ... 6 tn.
Proof. The left-hand side of (2.6) is just the characteristic function of (Xt1 ,...,Xtn ). Consequently, the assertion follows from (2.6). Using Proposition 2.3, we have
n ! n−2 ! E E exp i ∑ ξk · Xtk = exp i ∑ ξk · Xtk + i(ξn + ξn−1) · Xtn−1 + iξn · (Xtn − Xtn−1 ) k=1 k=1 n−2 ! (L2) iξn·X1 tn−tn−1 = Eexp i ξk · Xt + i(ξn + ξn−1) · Xt Ee . (L1) ∑ k n−1 k=1
Since the first half of the right-hand side has the same structure as the original expression, we can iterate this calculation and obtain (2.6).
It is not hard to invert the Fourier transform in (2.6). Writing pt (dx) := P(Xt ∈ dx) we get
Z Z n P(X ∈ B ,...,X ∈ B ) = ··· (x + ··· + x )p (dx ) t1 1 tn n ∏ 1Bk 1 k tk−tk−1 k (2.7) k=1 Z Z n = ··· (y )p (dy − y ). ∏ 1Bk k tk−tk−1 k k−1 (2.8) k=1
iξ·X Let us discuss the structure of the characteristic function χ(ξ) = Ee 1 of X1. From (2.1) 2 we see that each random variable Xt of a Lévy process is infinitely divisible. Clearly, |χ(ξ)| is 0 0 the (real-valued) characteristic function of the symmetrization Xe1 = X1 −X1 (X1 is an independent copy of X1) and Xe1 is again infinitely divisible:
n n h 0 0 i Xe1 = (Xek − Xek−1 ) = (X k − X k−1 ) − (X k − X k−1 ) . ∑ n n ∑ n n k=1 k=1 n n
2 2n 2 In particular, |χ| = |χ1/n| where |χ1/n| is the characteristic function of Xe1/n. Since everything is real and |χ(ξ)| 6 1, we get
2 2/n d θ(ξ) := lim |χ1/n(ξ)| = lim |χ(ξ)| , ξ ∈ R , n→∞ n→∞ which is 0 or 1 depending on |χ(ξ)| = 0 or |χ(ξ)| > 0, respectively. As χ(ξ) is continuous at
ξ = 0 with χ(0) = 1, we have θ ≡ 1 in a neighbourhood Br(0) of 0. Now we can use Lévy’s continuity theorem (Theorem A.5) and conclude that the limiting function θ(ξ) is continuous everywhere, hence θ ≡ 1. In particular, χ(ξ) has no zeroes. Chapter 2: Lévy processes 15
Rd Corollary 2.5. Let (Xt )t>0 be a Lévy process in . There exists a unique continuous function ψ : Rd → C such that −tψ(ξ) d Eexp(iξ · Xt ) = e , t > 0, ξ ∈ R . The function ψ is called the characteristic exponent.
Proof. In view of Proposition 2.3 it is enough to consider t = 1. Set χ(ξ) := Eexp(iξ · X1). An obvious candidate for the exponent is ψ(ξ) = −log χ(ξ), but with complex logarithms there is always the trouble which branch of the logarithm one should take. Let us begin with the unique- ness: −ψ −φ −(ψ−φ) e = e =⇒ e = 1 =⇒ ψ(ξ) − φ(ξ) = 2π ikξ Z for some integer kξ ∈ . Since φ,ψ are continuous and φ(0) = ψ(0) = 1, we get kξ ≡ 0. To prove the existence of the logarithm, it is not sufficient to take the principal branch of the logarithm. As we have seen above, χ(ξ) is continuous and has no zeroes, i.e. inf|ξ|6r |χ(ξ)| > 0 3 for any r > 0; therefore, there is a ‘distinguished’, continuous version of the argument arg◦ χ(ξ) such that arg◦ χ(0) = 0.
This allows us to take a continuous version of log χ(ξ) = log|χ(ξ)| + arg◦ χ(ξ).
Corollary 2.6. Let Y be an infinitely divisible random variable. Then there exists at most one4
Lévy process (Xt )t>0 such that X1 ∼ Y.
Proof. Since X1 ∼ Y, infinite divisibility is a necessary requirement for Y. On the other hand, Proposition 2.3 and Corollary 2.4 show how to construct the finite-dimensional distributions of a
Lévy process, hence the process, from X1.
So far, we have seen the following one-to-one correspondences
1:1 1:1 E iξ·X1 E iξ·X1 (Xt )t>0 Lévy process ←→ e ←→ ψ(ξ) = −log e and the next step is to find all possible characteristic exponents. This will lead us to the Lévy– Khintchine formula.
3A very detailed argument is given in Sato [51, Lemma 7.6], a completely different proof can be found in Dieudonné [16, Chapter IX, Appendix 2]. 4We will see in Chapter 7 how to construct this process. It is unique in the sense that its finite-dimensional distributions are uniquely determined by Y. 3. Examples
We begin with a useful alternative characterisation of Lévy processes.
Rd P Theorem 3.1. Let X = (Xt )t>0 be a stochastic process with values in , (X0 = 0) = 1 and X Ft = Ft = σ(Xr, r 6 t). The process X is a Lévy process if, and only if, there exists an exponent ψ : Rd → C such that E iξ·(Xt −Xs) −(t−s)ψ(ξ) Rd e Fs = e for all s < t, ξ ∈ . (3.1)
Proof. If X is a Lévy process, we get
(L2) (L1) Cor. 2.5 E iξ·(Xt −Xs) E iξ·(Xt −Xs) E iξ·Xt−s −(t−s)ψ(ξ) e Fs = e = e = e .
Conversely, assume that X0 = 0 a.s. and (3.1) holds. Then
Eeiξ·(Xt −Xs) = e−(t−s)ψ(ξ) = Eeiξ·(Xt−s−X0) which shows Xt − Xs ∼ Xt−s − X0 = Xt−s, i.e. (L1).
For any F ∈ Fs we find from the tower property of conditional expectation h i E 1 iξ·(Xt −Xs) E 1 E iξ·(Xt −Xs) E1 −(t−s)ψ(ξ) F · e = F e Fs = F · e . (3.2)
iu1 iu c Observe that e F = 1Fc + e 1F for any u ∈ R; since both F and F are in Fs, we get iu1F iξ·(Xt −Xs) iξ·(Xt −Xs) iu iξ·(Xt −Xs) E e e = E 1Fc e + E 1F e e
(3.2) iu −(t−s)ψ(ξ) = E 1Fc + e 1F e
(3.2) 1 = Eeiu F Eeiξ·(Xt −Xs).
Thus, 1F ⊥⊥(Xt − Xs) for any F ∈ Fs, and (L2) follows. iξ·X −tψ(ξ) Finally, limt→0 Ee t = limt→0 e = 1 proves that Xt → 0 in distribution, hence in proba- bility. This gives (L3).
Theorem 3.1 allows us to give concrete examples of Lévy processes. Example 3.2. The following processes are Lévy processes.
d a) Drift in direction l/|l|, l ∈ R , with speed |l|: Xt = tl and ψ(ξ) = −il · ξ.
16 Chapter 3: Examples 17
Rd×d b) Brownian motion with (positive semi-definite) covariance matrix Q ∈ : Let (Wt )t>0 be a d √ standard Wiener process on R and set Xt := QWt . 1 P −d/2 −1/2 −1 Then ψ(ξ) = 2 ξ · Qξ and (Xt ∈ dy) = (2πt) (detQ) exp(−y · Q y/2t)dy. c) Poisson process in R with jump height 1 and intensity λ. This is an integer-valued counting process (Nt )t>0 which increases by 1 after an independent exponential waiting time with mean λ. Thus, ∞ 1 Nt = ∑ [0,t](τk), τk = σ1 + ··· + σk, σk ∼ Exp(λ) iid. k=1 Using this definition, it is a bit messy to show that N is indeed a Lévy process (see e.g. Çinlar [12, Chapter 4]). We will give a different proof in Theorem 3.4 below. Usually, the first step is to show that its law is a Poisson distribution
(λt)k P(N = k) = e−tλ , k = 0,1,2,... t k!
(thus the name!) and from this one can calculate the characteristic exponent
∞ (λt)k EeiuNt = ∑ eiuke−tλ = e−tλ expλteiu = exp −tλ(1 − eiu), k=0 k! i.e. ψ(u) = λ(1 − eiu). Mind that this is strictly weaker than (3.1) and does not prove that N is a Lévy process.
Rd d) Compound Poisson process in with jump distribution µ and intensity λ. Let N = (Nt )t>0 be a Poisson process with intensity λ and replace the jumps of size 1 by independent iid jumps of d random height H1,H2,... with values in R and H1 ∼ µ. This is a compound Poisson process:
Nt Ct = ∑ Hk, Hk ∼ µ iid and independent of (Nt )t>0. k=1
We will see in Theorem 3.4 that compound Poisson processes are Lévy processes.
Let us show that the Poisson and compound Poisson processes are Lévy processes. For this we need the following auxiliary result. Since t 7→ Ct is a step function, the Riemann–Stieltjes integral R f (u)dCu is well-defined.
Lemma 3.3 (Campbell’s formula). Let Ct = H1 + ··· + HNt be a compound Poisson process as in
Example 3.2.d) with iid jumps Hk ∼ µ and an independent Poisson process (Nt )t>0 with intensity λ. Then Z ∞ Z ∞Z iy f (s+t) Eexp i f (t + s)dCt = exp λ (e − 1) µ(dy)dt (3.3) 0 0 y6=0
d holds for all s > 0 and bounded measurable functions f : [0,∞) → R with compact support. 18 R. L. Schilling: An Introduction to Lévy and Feller Processes
Proof. Set τk = σ1 + ··· + σk where σk ∼ Exp(λ) are iid. Then Z ∞ φ(s) :=Eexp i f (s +t)dCt 0 ∞ ! E = exp i ∑ f (s + σ1 + ··· + σk)Hk k=1 Z ∞ ∞ ! iid E E P = exp i ∑ f (s + x + σ2 + ··· + σk)Hk exp(i f (s + x)H1) (σ1 ∈ dx) 0 k=2 | {z }| {z }| {z } =φ(s+x) =:γ(s+x) =λe−λx dx Z ∞ =λ φ(s + x)γ(s + x)e−λx dx 0 Z ∞ =λeλs γ(t)φ(t)e−λt dt. s This is equivalent to Z ∞ e−λsφ(s) = λ (φ(t)e−λt )γ(t)dt s and φ(∞) = 1 since f has compact support. This integral equation has a unique solution; it is now a routine exercise to verify that the right-hand side of (3.3) is indeed a solution.
Theorem 3.4. Let Ct = H1 + ··· + HNt be a compound Poisson process as in Example 3.2.d) with iid jumps Hk ∼ µ and an independent Poisson process (Nt )t>0 with intensity λ. Then (Ct )t>0 (and also (Nt )t>0) is a d-dimensional Lévy process with characteristic exponent Z ψ(ξ) = λ (1 − eiy·ξ ) µ(dy). (3.4) y6=0
Proof. Since the trajectories of t 7→ Ct are càdlàg step functions with C0 = 0, the properties (L0) Rd and (L3), see (2.3), are satisfied. We will show (L1) and (L2). Let ξk ∈ , 0 = t0 6 ... 6 tn, and a < b. Then the Riemann–Stieltjes integral Z ∞ ∞ 1 1 (a,b](t)dCt = ∑ (a,b](τk)Hk = Cb −Ca 0 k=1 exists. We apply the Campbell formula (3.3) to the function n f (t) := 1 (t) ∑ ξk (tk−1,tk] k=1 and with s = 0. Then the left-hand side of (3.3) becomes the characteristic function of the incre- ments n ! E · (C −C ) , exp i ∑ ξk tk tk−1 k=1 while the right-hand side is equal to " # Z n Z tk n Z iξk·y iξk·y exp λ ∑ (e − 1)dt µ(dy) = ∏ exp λ(tk −tk−1) (e − 1) µ(dy) y6=0 k=1 tk−1 k=1 y6=0 n = E ·C ∏ exp iξk tk−tk−1 k=1 Chapter 3: Examples 19
(use Campbell’s formula with n = 1 for the last equality). This shows that the increments are 0 independent, i.e. (L2 ) holds, as well as (L1): Ctk −Ctk−1 ∼ Ctk−tk−1 .
If d = 1 and Hk ∼ δ1, Ct is a Poisson process.
∗k ∗0 Denote by µ the k-fold convolution of the measure µ; as usual, µ := δ0.
Corollary 3.5. Let (Nt )t>0 be a Poisson process with intensity λ and Ct = H1 + ··· + HNt a com- pound Poisson process with iid jumps Hk ∼ µ. Then, for all t > 0,
(λt)k P(N = k) = e−λt , k = 0,1,2,... (3.5) t k! ∞ ( t)k P −λt λ ∗k Rd (Ct ∈ B) = e ∑ µ (B), B ⊂ Borel. (3.6) k=0 k!
Proof. If we use Theorem 3.4 for d = 1 and µ = δ1, we see that the characteristic function of Nt is iu χt (u) = exp[−λt(1−e )]. Since this is also the characteristic function of the Poisson distribution
(i.e. the r.h.s. of (3.5)), we get Nt ∼ Poi(λt).
Since (Hk)k∈N ⊥⊥(Nt )t>0, we have for any Borel set B
∞ P P (Ct ∈ B) = ∑ (Ct ∈ B, Nt = k) k=0 ∞ P P P = δ0(B) (Nt = 0) + ∑ (H1 + ··· + Hk ∈ B) (Nt = k) k=1 ∞ (λt)k = e−λt ∑ µ∗k(B). k=0 k!
Example 3.2 contains the basic Lévy processes which will also be the building blocks for all Lévy processes. In order to define more specialized Lévy processes, we need further assumptions on the distributions of the random variables Xt .
Rd Definition 3.6. Let (Xt )t>0 be a stochastically continuous process in . It is called self-similar, if
∀a > 0 ∃b = b(a) : (Xat )t>0 ∼ (bXt )t>0 (3.7) in the sense that both sides have the same finite-dimensional distributions.
Lemma 3.7 (Lamperti). If (Xt )t>0 is self-similar and non-degenerate, then there exists a unique H index of self-similarity H > 0 such that b(a) = a . If (Xt )t>0 is a self-similar Lévy process, then 1 H > 2 .
0 Proof. Since (Xt )t>0 is self-similar, we find for a,a > 0 and each t > 0
0 0 b(aa )Xt ∼ Xaa0t ∼ b(a)Xa0t ∼ b(a)b(a )Xt , 20 R. L. Schilling: An Introduction to Lévy and Feller Processes
0 0 1 and so b(aa ) = b(a)b(a ) as Xt is non-degenerate. By the convergence of types theorem (Theo- rem A.6) and the continuity in probability of t 7→ Xt we see that a 7→ b(a) is continuous. Thus, the Cauchy functional equation b(aa0) = b(a)b(a0) has the unique continuous solution b(a) = aH for some H > 0. 1 Assume now that (Xt )t>0 is a Lévy process. We are going to show that H > 2 . Using self- similarity and the properties (L1), (L2) we get (primes always denote iid copies of the respective random variables)
H 00 0 H 00 H 0 (n + m) X1 ∼ Xn+m = (Xn+m − Xm) + Xm ∼ Xn + Xm ∼ n X1 + m X1. (3.8)
1 Any standard normal random variable X1 satisfies (3.8) with H = 2 . On the other hand, if X1 has V V V 00 V 0 V 00 V 0 a second moment, we get (n + m) X1 = Xn+m = Xn + Xm = n X1 + m X1 by Bienaymés 1 identity for variances, i.e. (3.8) can only hold with H = 2 . Thus, any self-similar X1 with finite 1 1 second moment has to satisfy (3.8) with H = 2 . If we can show that H < 2 implies the existence of a second moment, we have reached a contradiction. 1 H If Xn is symmetric and H < 2 , we find because of Xn ∼ n X1 some u > 0 such that 1 P(|X | > unH ) = P(|X | > u) < . n 1 4 By the symmetrization inequality (Theorem A.7), 1 1 1 − exp{−nP(|X | > unH )} P(|X | > unH ) < 2 1 6 n 4 P H N P 0 −1/H which means that n (|X1| > un ) 6 c for all n ∈ . Thus, (|X1| > x) 6 c x for all x > u+1, and so Z ∞ Z ∞ E 2 P 0 1−1/H |X1| = 2 x (|X1| > x)dx 6 2(u + 1) + 2c x dx < ∞ 0 u+1 1 0 0 as H < 2 . If Xn is not symmetric, we use its symmetrization Xn − Xn where Xn are iid copies of Xn.
Definition 3.8. A random variable X is called stable if
N Rd 0 0 ∀n ∈ ∃bn > 0, cn ∈ : X1 + ··· + Xn ∼ bnX + cn (3.9)
0 0 where X1,...,Xn are iid copies of X. If (3.9) holds with cn = 0, the random variable is called strictly stable. A Lévy process (Xt )t>0 is (strictly) stable if X1 is a (strictly) stable random variable.
1We use here that bX ∼ cX =⇒ b = c if X is non-degenerate. To see this, set χ(ξ) = Eeiξ·X and notice