Arxiv:2003.06465V2 [Math.PR]
Total Page:16
File Type:pdf, Size:1020Kb
OPTIMAL STOPPING OF STOCHASTIC TRANSPORT MINIMIZING SUBMARTINGALE COSTS NASSIF GHOUSSOUB, YOUNG-HEON KIM AND AARON ZEFF PALMER Abstract. Given a stochastic state process (Xt)t and a real-valued submartingale cost process (St)t, we characterize optimal stopping times τ that minimize the expectation of Sτ while realizing given initial and target distributions µ and ν, i.e., X0 ∼ µ and Xτ ∼ ν. A dual optimization problem is considered and shown to be attained under suitable conditions. The optimal solution of the dual problem then provides a contact set, which characterizes the location where optimal stopping can occur. The optimal stopping time is uniquely determined as the first hitting time of this contact set provided we assume a natural structural assumption on the pair (Xt,St)t, which generalizes the twist condition on the cost in optimal transport theory. This paper extends the Brownian motion settings studied in [15, 16] and deals with more general costs. Contents 1. Introduction 1 2. Weak Duality and Dynamic Programming 4 2.1. Notation and Definitions 4 2.2. Dual formulation 5 2.3. Dynamic programming dual formulation 6 3. Pointwise Bounds 8 4. Dual Attainment in the Discrete Setting 10 5. Dual Attainment in Hilbert Space 11 5.1. When the cost is the expected stopping time 14 6. The General Twist Condition 14 Appendix A. Recurrent Processes 16 A.1. Dual attainment 17 A.2. When the cost is the expected stopping time 20 Appendix B. Path Monotonicity 21 References 21 arXiv:2003.06465v2 [math.PR] 22 Dec 2020 1. Introduction Given a state process (Xt)t valued in a complete metric space O, an initial distribution µ, and a target distribution ν on O, we consider the set T (µ,ν) of -possibly randomized- stopping times τ that satisfy X0 ∼ µ and Xτ ∼ ν, where here, and in the sequel, the notation Y ∼ λ means that the law of the random variable Y is the probability measure λ. The problem of finding such stopping times (i.e., when T (µ,ν) is non-empty) is known as the Skorokhod embedding problem and has a long history ever since it was initiated by Skorokhod [27] in the early 1960s in the case where (Xt)t is Brownian motion (Wt)t and O = R, and followed by Date: December 24, 2020. The first two authors are partially supported by the Natural Sciences and Engineering Research Council of Canada (NSERC). ©2020 by the author. 1 2 NASSIFGHOUSSOUB,YOUNG-HEONKIMANDAARONZEFFPALMER important contributions from Root [25], Rost [26], Chacon-Walsh [8] and others. In this case, the set T (µ,ν) of embedding stopping times is empty unless µ and ν satisfies a certain order: µ ≺ ν that is φ(x)µ(dx) ≤ φ(y)ν(dy), for all subharmonic functions φ (∆φ ≥ 0). ZR ZR This condition is indeed sufficient and illustrates a duality principle of embedding stopping times, which is given here as Corollary 3.1 to the duality Theorem 2.3 (see [3] as well as [13]). Since then, the problem and its variants were investigated by a large number of researchers, and have led to several important results in probability theory and stochastic processes. We refer to Ob loj [24] for an excellent survey of the subject. For our present purpose, we note that no special role is played here by O = R or by (Xt)t being Brownian motion, and the Skorokhod embedding theorem was eventually extended to more general state spaces and Markov processes (see the previously related works of Baxter-Chacon [1], Falkner [11], Strassen [28], Rost [26], Dellacherie-Meyer [9], etc). For example, the result holds generally when ∆ is the infinitesimal generator of a diffusion process (Xt)t. Our analysis here will distinguish between two cases: processes that are absorbed into a ‘cemetery’ state, and those that are ergodic. We shall detail the absorbing case in the main body of the paper and repeat the results for ergodic processes in Appendix A. Given now a real valued cost process (St)t, the optimal Skorokhod embedding problem is to minimize the expected cost over all such embedding stopping times, cf. [3], (1.1) PS(µ,ν) := inf E Sτ ; X0 ∼ µ, Xτ ∼ ν . τ n o The optimization problem (1.1) and its variants have been considered in mathematical finance, for ex- ample, with applications to option-pricing by Hobson [19], Beiglb¨ock and Juillet [4], Ghoussoub-Kim-Lim [14, 13], Beiglb¨ock-Cox-Huesmann [3]. Many applications consider processes beyond Brownian motion, and the cost processes possessing a variety of structural properties. A starting point for our work is that the cost should be a submartingale, in other words, a process that increases in conditional expectation for each increment. There are two particular cases of submartingale costs that have already been analyzed when the state process Xt = Wt is a multi-dimensional Brownian motion: t • St = 0 L(s, Ws)ds, where the Lagrangian L is non-negative was analyzed in [15, 16]; • St = cR(W0, Wt) where y → c(x, y) is subharmonic, i.e., ∆yc(x, y) ≥ 0, which was analyzed in [16] Note that the first case is Markovian, while the second is not though it depends only on initial/final position. The dual problem to PS(µ,ν) has been expressed in [3], as Pµ (1.2) D (µ,ν) = sup ψ(y)ν(dy) − E M , S Z 0 (ψ,(Mt)t)∈AS n O o µ where P is the distribution of (Xt)t with initial X0 ∼ µ, and AS consists of the end potential, ψ : O → R, and a martingale (Mt)t that satisfies Mσ ≥ ψ(Xσ) − Sσ for any stopping time σ almost surely on the probability space. In general, the dual maximization problem can be reduced to a maximization of ψ, as M0 can be deter- ψ ψ mined from ψ using the Snell envelope of (ψ(Xt)−St)t, which we denote by (Gt )t. In other words, Gt is the conditional expected value of the optimal stopping problem that maximizes ψ(Xσ) − Sσ over stopping times ψ ψ ψ σ ≥ t. The martingale (Mt )t such that (ψ, (Mt )t) ∈ AS can be recovered from (Gt )t by the Doob-Meyer decomposition. These results are covered by Lemma 2.2 and Theorem 2.3. The optimizer ψ of the dual problem characterizes an optimal embedding stopping time, τ ∈ T (µ,ν), by the following ‘verification’ principles of Theorem 2.4: ψ i. The value process does not change predictably before τ ((Gt∧τ )t is a martingale); ψ ii. The terminal value is given by the end potential minus the cost (Gτ = ψ(Xτ ) − Sτ ). The first focus of this paper is on the attainment of the dual maximization problems, DS(µ,ν), which so far has been elusive. In one-dimension, dual attainment has been achieved for Brownian motion in [6] and [18], and has been extended to the multi-dimensional case in [15] and [16]. We also note an alternate approach to attainment has been undertaken by weakening the dual formulation in [5]. Our analysis identifies natural structural situations that yield bounds, hence compactness in suitable function spaces, on the end OPTIMAL STOPPING OF STOCHASTIC TRANSPORT 3 potential ψ. Some structure is required, since if the cost is (the supermartingale!) St = −|W0 − Wt|, then the dual maximizer ψ does not exist [4] (see also [14] for related results). In §3 we prove bounds on ψ under the following set of assumptions: • The state process (Xt)t is a stationary Feller process with an absorbing state C ∈ O; • The initial and target distribution are in the order prescribed by (Xt)t, µ ≺ ν, i.e. φ(x)µ(dx) ≤ φ(y)ν(dy), whenever φ is X-subharmonic (or ∆φ(x) ≥ 0), ZO ZO where we abuse the notation and let ∆ denote the generator of X. E C • [τC] < ∞ for τC = inf{t | Xt = } and Sτ = SτC for τ ≥ τC. (This is naturally satisfied when the process is either killed exponentially or by an absorbed state). • The cost process (St)t is a submartingale with S0 = 0 and SτC ≤ K < +∞. Proposition 3.3 provides the essential normalization for absorbing processes, which shows that the end potential ψ can be restricted to values in [−K, 0]. This procedure also restricts ψ to have 0 value on C and any absorbing state of the Markov process. These bounds are sufficient for dual attainment if O is discrete (Theorem 4.1). To illustrate the generality of our approach, we shall also handle the case where the time is discrete. In §5, we aim for a more refined bound and make the following assumptions: • The state process (Xt)t is symmetric with Dirichlet form E that satisfies a Poincar´einequality, 2 2 2 kfk2 := |f(x)| m(dx) ≤ CpkfkH, ZO 2 where kfkH := E(f,f)= O f(x) − ∆f(x) m(dx). • We also assume E satisfies a regularityR property that makes the viscosity and variational formulations of supersolutions equivalent. • The initial distribution µ lies in the dual space H∗; • The cost process (St)t is such that (Dt − St)t is also a submartingale for some D> 0. In other words, E Sr|Ft] − St ≤ D(r − t) for r>t. We then obtain a uniform superharmonic estimate on the end potentials of the form ∆ψ(x) ≤ D for all x ∈ O, which translates to a uniform bound of kψkH, by combining Lemma 2.5 with viscosity and weak solution theory in Proposition 5.6, and prove semi-continuity of the dual value with respect to the topology of BD in Proposition 5.5.