Ronan Thesis 2021
Total Page:16
File Type:pdf, Size:1020Kb
NEW TECHNIQUES IN OPTIMAL TRANSPORT A Thesis Submitted to the Faculty in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Mathematics by James Gordon Ronan DARTMOUTH COLLEGE Hanover, New Hampshire May 2021 Examining Committee: AnneGeet Anne Gelb, Chair Mather Pour Matthew Parno Peter Doyle Douglas Cochran F. Jon Kull, Ph.D. Dean of the Guarini School of Graduate and Advanced Studies Abstract This thesis develops a new technique for applications of optimal transport and presents anewperspectiveonoptimaltransportthroughthemeasuretheoretictooloftransi- tion kernels. Optimal transport provides a way of lifting a distance metric on a space to probability measures over that space. This makes the field well suited for certain types of image analysis. Part of this thesis focuses on a new application for optimal transport, while the other focuses on a new approach to optimal transport itself. With respect to the first part of this thesis, we propose using semi-discrete optimal transport for the estimation of parameters in physical scenes and show how to do so. Optimal transport is a natural setting when studying images because displacements of the objects in the image directly correspond to a change in the optimal transport cost. In the second part of this thesis we discuss transition kernels, which provide a mathematical tool that can be used to map measures to measures. It therefore seems intuitive to incorporate transition kernels into optimal transport problems. However, this requires changing the traditional perspective of viewing optimal transport as primarily a tool to measure distances between two fixed measures. To that end, this thesis develops theory to show how kernels may be used to extend optimal transport to signed measures. ii Preface IamfortunateandgratefultobewhereIamtoday.Myparentshavesupportedand encouraged me along the way, and pushed me to take all of the opportunities that I have been given. My sincere gratitude goes to my advisor Anne Gelb for accepting me as her odd duck of a student. She has helped me to grow into a better mathematician and think about how mathematics should serve the world. She and my secondary advisor, Matthew Parno have been resiliently positive and optimistic despite the challenges of the past year. I am excited to continue to work with and to learn from them in the future. Thank you to the remainder of my committee, Douglas Cochran and Peter Doyle, for meeting with me and helping to improve this thesis. iii Contents Abstract..................................... ii Preface . iii 1Overview 1 2 Preliminaries 4 2.1 Measure Theory Background . 4 2.2 OT Background . 10 2.2.1 Historical development of optimal transport . 10 2.2.2 The Kantorovich Problem . 11 2.2.3 Duality Theory . 17 2.3 Wasserstein Distances . 25 2.3.1 Examples of Wasserstein distances . 29 2.3.2 Geodesics in Wasserstein Space . 31 2.4 1-D OT . 35 2.4.1 One Dimensional Transport . 36 2.4.2 Structure of Monotone Coupling . 38 2.4.3 c-Cyclic Monotonicity in One Dimension . 41 2.4.4 OptimalTransportMaps. 44 3 SDOT 45 iv 3.1 SDOT . 46 3.1.1 Laguerre Cells . 47 3.1.2 Theoretical Results on SDOT . 52 3.2 AlgorithmicSDOT ............................ 54 3.2.1 AlgorithmicModifications . 60 3.2.2 Other Regularized Optimal Transport . 66 3.2.3 Quantization . 67 4ParameterEstimation 69 4.1 Simple Examples . 70 4.1.1 Centers of Mass . 70 4.1.2 AngleofRotation......................... 73 4.1.3 Rotating and Translating Object . 76 4.2 Misfits in Time . 77 4.2.1 Velocity Estimation . 78 4.2.2 Colliding Balls . 80 4.3 Cantilever Beam . 85 4.3.1 Model Set-up . 86 4.3.2 Results . 90 4.3.3 AdjointGradient ......................... 92 4.4 Further Questions . 98 4.4.1 Representation of Objects . 98 4.4.2 Time Sensitivity . 99 4.4.3 BlurringandNoise ........................ 99 4.4.4 AnalysisofSolutionOperators. 99 5 OT Kernels 101 v 5.1 Kernels Background . 102 5.2 Kernels for OT . 105 5.2.1 Optimal transport kernels . 107 5.2.2 Geodesics and Kernels . 112 5.3 Signed OT with Kernels . 114 5.3.1 One Dimensional Signed Optimal Transport . 118 5.4 FutureQuestions ............................. 123 6 Conclusion 124 References 127 vi Chapter 1 Overview Optimal transport is an area of mathematics that combines analysis, probability and geometry while o↵ering applications in diverse areas. Optimal transport studies various kinds of transport costs that describe how to rearrange one measure into another. The initial description of optimal transport codified these rearrangements as transport maps, T , and the mass of the measure at x was sent to T (x). Just as there are many ways to go from point A to point B, there are many di↵erent possible transport maps which send one measure into another. Optimal transport is a way of assigning a cost to these rearrangements and understanding the transport map with the lowest total cost. This can be useful when looking at physical systems because it acts like Occam’s razor. The optimal transport map is the one that required the least e↵ort to get things done. While an optimal map might not perfectly reflect what happened, in some ways it is the simplest. It is important to note that a transport map sends one measure to another on the global scale, but it does so by coordinating the local paths at each point supported in the measure. The optimal transport cost represents perfect cooperation, where every point acts in accordance with everyone else to minimize the cost. Hence we see that optimal transport exists in a perfect world while still allowing us to understand our 1 Overview Overview imperfect one. The roots of optimal transport date back to Monge in 1781, and the idea was originally motivated by Monge’s interest in quantifying the amount of work it would take to excavate a hole of a particular shape and construct a pile in another shape [38]. While we have moved on from focusing on digging ditches, applications are an integral part of optimal transport to this day. In 1942 Kantorovich o↵ered a new perspective and re-framed the problem and showed the field’s applicability to economic problems [23]. The trends of re-interpretation and application can be found running through the history of optimal transport. A notable re-formulation was to view the problem in a continuous time setting in [8], which opened up the geometric aspects of the field which were explored further in [35, 40, 33]. The di↵erent formulations of the problems lent themselves to di↵ering applications and improved implementations that reinforce each other – more applications become feasible as the implementations improve, while the increase in potential applications drives the desire for better implementations. A few notable implementations include using an elliptic partial di↵erential equation (PDE) formulation of the problem, [9], entropic regularization, which adds a regularization term to the objective function of the problem, [14, 51], and semi-discrete optimal transport (SDOT) which exploits connections to computational geometry that arise when the class of measures un- der consideration is restricted, [24, 26, 28, 37]. Applications arise in diverse fields including economics [36], fluid dynamics [21], and seismic imaging [16, 17, 18]. This thesis presents a new way to apply optimal transport for parameter estima- tion. We show how to apply SDOT to form misfit functions for parameter estimation of physical systems. We show how we can use quantitization to create a discrete representation of the objects in the system, and use SDOT to compare our model to the observation. This approach can be used for a variety of applications and we 2 Overview Overview demonstrate this through examples. Additionally, this thesis demonstrates that there is a latent measure theoretic ker- nel structure in optimal transport, which allows us to restore some of the functionality that has been lost in reformulations of the problem. The original formulation of opti- mal transport su↵ered from many disadvantages, but when it was reframed in terms of couplings we lost the ability to send multiple measures through the same map. Kernels allow us to understand couplings in a new way to restore some of the lost functionality. We discuss how they may be exploited as a new approach to extending optimal transport to signed measures. This remainder of this thesis proceeds in four parts: (a) Chapter 2 reviews the conceptual and mathematical foundations of optimal transport. This provides a somewhat broad introduction to the subject and establishes the notational conventions used throughout the thesis. (b) Chapter 3 presents a focused review of semi-discrete optimal transport. Both the theoretical background of the area as well as topics related to the imple- mentation are discussed. (c) Chapter 4 demonstrates how to construct an optimal transport misfit function and provides examples. We show how this formulation can be used to estimate parameters of physical systems. (d) Chapter 5 introduces the kernel based framework and presents a new approach to signed optimal transport. We conclude this chapter by focusing on the one dimensional case. Small results are presented throughout the thesis, with the major contributions taking place in Chapters 4 and 5. 3 Chapter 2 Preliminaries The purpose of this chapter is to serve as an easy reference for the necessary back- ground on optimal transport. This includes first defining and introducing our notation for measure theoretic concepts before providing an introduction to optimal transport. A few examples are included in this chapter to illustrate key ideas. Section 2.1 Definitions and Conventions in Measure Theory Measure theory provides an essential building block of optimal transport. This section therefore summarizes measure theoretic concepts used in the thesis.