Optimal Transport and Concentration of Measure

Total Page:16

File Type:pdf, Size:1020Kb

Optimal Transport and Concentration of Measure OPTIMAL TRANSPORT AND CONCENTRATION OF MEASURE NATHAEL GOZLAN*, TRANSCRIBED BY BARTLOMIEJPOLACZYK Abstract. This is a preliminary version of the notes of the lectures on the optimal trans- port and concentration of measure that were given during the 2nd Warsaw Summer School in Probability on July 2017. We introduce here the notion of concentration of measure and develop its connections with some functional inequalities via the methods of optimal trans- port theory. At the end we turn to the convex concentration property and weak transport costs. Any comments or corrections are welcome. In case of such, please contact notes' author at [email protected]. 1. Concentration of measure phenomenon 1.1. Preliminaries. Concentration of measure phenomenon states in general that a random variable that depends on many i.i.d. variables via some Lipschitz function is essentially concentrated around its mean. This phenomenon has many consequences in various fields of probability theory or statistics. Throughout the notes we will denote by (X; d; µ) an abstract metric space equipped with a distance d and a probability measure µ on some sigma algebra Σ. Moreover, we set P(X) to be the set of all probability measures on X. We will start with the core definition of this lecture. Definition. Let α : [0; +1) ! [0; 1] be non increasing. We say that a probability measure µ 2 P(X) satisfies concentration of measure property with the concentration profile α if 1 µ(A) ≥ ) µ(A ) ≥ 1 − α(t) 8 8 ; 2 t A⊆X t≥0 where At := fx 2 X : d(x; A) ≤ tg. We will denote this fact by µ 2 Cα(X; d). Example. Some popular profiles are of the form k αk(t) = a exp −b(t − t0)+ ; for k 2 N; a; b; t0 2 R. Setting k = 1; 2 results in exponential and Gaussian profile respectively. Proposition 1.1. The following statements are equivalent: 1. µ 2 Cα(X; d), 2. µ(f > mµ(f) + t) ≤ α(t) 8f2Lip1 (X; d). Here 1 mµ(f) := arg inf µ(f ≥ t) ≤ t2R 2 *Universit Paris Descartes; [email protected]. 1 2 NATHAEL GOZLAN is a median of f w.r.t. µ and Lipk(X; d) is a space of all k-Lipschitz functions on X w.r.t. the metric d. In general will write Lipk if there is no confusion about the underlying space and the metric. Proof. (1: ) 2:): 1 Set A := ff ≤ mµ(f)g for some arbitrary function f 2 Lip1. Clearly µ(A) ≥ 2 and using the fact that A is closed At = fx : d(x; A) ≤ tg = fx : 9y2X : d(x; y) ≤ t ^ f(y) ≤ mµ(f)g ⊆ ff ≤ mµ(f) + tg; whence µ(f > mµ(f) + t) ≤ 1 − µ(At) ≤ α(t): (2: ) 1:): 1 Let A ⊆ X such that µ(A) ≥ 2 and set f(x) = d(x; A). Whence, f 2 Lip1 by the triangle inequality and mµ(f) = 0, thus µ(At) = µ(f ≤ t) = 1 − µ(f > mµ(f) + t) ≥ 1 − α(t): Remark 1.2. From the proof of Proposition 1.1 it follows that f has to be Lipschitz only in the support of µ. Remark 1.3. If µ 2 Cα(X; d) then applying the above proposition for −f results in the following inequality (1) µ(jf − mµ(f)j > t) ≤ 2α(t); which holds for any f 2 Lip1. Depending on the situation, it can sometimes be easier to work with a mean instead of a median. Fortunately, concentration inequalities involving the median are, up to the constant, equivalent to the ones with the mean which illustrates the following remark. Remark 1.4 (Mean vs. median). Let µ 2 Cα(X; d). Integrating (1) with respect to t results in Z (1) Z jµ(f) − mµ(f)j ≤ jf − mµ(f)j dµ ≤ 2 α(t) dt =: t0; from which it follows that µ(f > µ(f) + t) ≤ α(t − t0); R where µ(f) := X f dµ is a mean of f w.r.t. the measure µ. 1.2. Concentration inequalities. Let us focus now on some important results in the con- centration theory, beginning with the result due to Hoeffding [16]. Proposition 1.5 (Hoeffding’s inequality). Let Xi be an i.i.d sequence with a compact support enclosed in the interval [a; b]. It follows that t2 X¯ ≥ [X ] + t ≤ exp −2n ; P E 1 (b − a)2 ¯ 1 P were X = n Xi. The above result can be seen as an easy consequence of the following proposition. OPTIMAL TRANSPORT AND CONCENTRATION OF MEASURE 3 Proposition 1.6 (Hoeffding’s concentration inequality). Let µ 2 P(X) be a probability mea- ⊗n Nn ⊗n n sure on the space X and let µ := j=1 µ be the product measure. Then µ 2 Cαn (X ; dH ), 2 n where αn = exp(−2t =n) and dH is the Hamming distance on X given by the formula 1 d (x; y) = jfi : x 6= y gj : H n i i Remark 1.7. Proposition 1.6 implies Proposition 1.5. b−a Proof. Set µ = L(X1) be the law of X1. Then f(x) =x ¯ is n Lipschitz on the support of ⊗n ⊗n µ (recall Remark 1.2) w.r.t. dH . Applying the assumption that µ 2 Cαn (X; dH ) yields the proof. We now turn to the proof of Proposition 1.6. Proof of Proposition 1.6. The idea is to make use of exponential Chebyshev's inequality for F (X1;:::;Xn) where Xi are an i.i.d. sample, X1 ∼ µ and F is some 1-Lipschitz function w.r.t. tF (X ;:::;Xn) the metric dH . To that end, we need an estimate on the Laplace transform Ee 1 , which can be obtained using conditioning and some basic properties of the moment generating function. Let us split the proof into separate parts. Part 1. Firstly, let Z be a centered random variable on the interval [−1; 1]. And let φ(t) tZ denote its moment generating function, i.e. φ(t) = log Ee . Since ZetZ Z2etZ ZetZ 2 Z2etZ φ0(t) = E ; φ00(t) = E − E ≤ E ; EetZ EetZ EetZ EetZ then φ0(0) = 0 and φ00(t) ≤ 1 from which it follows that tZ t2=2 (2) Ee ≤ e : Part 2. Let EXk denote integration w.r.t. the variable Xk, that is EXk [f(X1;:::;Xn)] = E[f(X1;:::;Xn)jX1;:::;Xk−1;Xk+1;:::;Xn] n for any measurable function f. Note that since F 2 Lip1(X ; dH ), we have jF (x) − F (y)j ≤ 1 and thus F (X1;:::;Xn) − EF is a centered random variable on the interval [−1; 1], where EF := EF (X1;:::;Xn). We can now estimate h i h i tF (X1;:::;Xn) tF (X1;:::;Xn) E e = EX1;:::;Xn−1 EXn e (2) 2 ≤ EX1;:::;Xn−1 exp t =2 + EXn tF (X1;:::;Xn . 2 ≤ exp nt =2 + tEF: Part 3. Finally, it suffices to apply Chebyschev's exponential inequality s(F (X1;:::;Xn)−EF )−st 2 P (F (X1;:::;Xn) ≥ EF + t) ≤ Ee ≤ exp ns =2 − st : Optimizing with respect to s yields the proof. While Proposition 1.6 is much stronger than Proposition 1.5, it still suffers weak dimension scaling which demonstrates the following example. 4 NATHAEL GOZLAN Example. Take any f 2 Lip1(X; dH ). By Proposition 1.6 2 µ(f > mµ(f) + t) ≤ exp(−2t ): n But on the other hand, setting fn : X ! R, fn(x) = f(x1) results in ⊗n 2 µ (fn > mµ⊗n (f) + t) ≤ exp(−2t =n); which is clearly not optimal since ⊗n µ (fn > mµ⊗n (f) + t) = µ(f > mµ(f) + t): The example above gives a motivation to introduce a notion of a concentration that is not dependent on the dimension. Definition (dimension-free concentration property). A measure µ on (X; d) satisfies the 1 dimension-free concentration property with a concentration profile α, denoted as µ 2 Cα (X; d), ⊗n n if µ 2 Cα(X ; d2) for every n ≥ 1, where 1=2 X 2 d2(x; y) = d(x; y) : 1 Obviously Cα (X; d) ⊆ Cα(X; d). Converse is false which illustrates the following observa- tion left as an exercise. Exercise 1.1. Show that if µ satisfies the dimension-free concentration property, then supp µ is connected. A prominent example of a measure that satisfies the dimension-free concentration property d is Gaussian measure on R . Theorem 1.8 (Borell[4], Sudakov { Tsirelson[31]). Setting γ to be standard Gaussian mea- ¯ 1 sure on R, φ { its c.d.f. and φ(t) = 1 − φ(t) = γ (]t; +1[) we have γ 2 Cα¯ (R; |·|), where α = φ¯. Remark 1.9. The concentration profile φ¯ is optimal, which can be seen by plugging f(x) = x into the equivalent definition of the concentration property (see Proposition 1.1). The following exercise proves that obtained concentration profile is in fact subgaussian. Exercise 1.2. Show that for t > 0 ¯ 1 2 1. φ(t) ≤ 2 exp(−t =2), q 1 1 2 2. φ(t) ≤ 2π t exp(−t =2). The proof of Theorem 1.8 we are about to show is due to G. Piser [26]. It is much less technical than the original proof but provides a weaker concentration profile. Proof of Theorem 1.8 for α(t) = e−2t2/π2 . In the same manner as in the previous proof we will try to arrive at some estimate on the Laplace transform of a pushforward of γn by some Lipschitz function and then make use of Chebyshev's inequality.
Recommended publications
  • Weak Convergence of Probability Measures Revisited
    WEAK CONVERGENCE OF PROBABILITY MEASURES REVISITED Cabriella saltnetti' Roger J-B wetse April 1987 W P-87-30 This research was supported in part by MPI, Projects: "Calcolo Stocastico e Sistemi Dinamici Statistici" and "Modelli Probabilistic" 1984, and by the National Science Foundation. Working Papers are interim reports on work of the lnternational Institute for Applied Systems Analysis and have received only limited review. Views or opinions expressed herein do not necessarily represent those of the Institute or of its National Member Organizations. INTERNATIONAL INSTITUTE FOR APPLIED SYSTEMS ANALYSlS A-2361 Laxenburg, Austria FOREWORD The modeling of stochastic processes is a fundamental tool in the study of models in- volving uncertainty, a major topic at SDS. A number of classical convergence results (and extensions) for probability measures are derived by relying on new tools that are particularly useful in stochastic optimization and extremal statistics. Alexander B. Kurzhanski Chairman System and Decision Sciences Program ABSTRACT The hypo-convergence of upper semicontinuous functions provides a natural frame- work for the study of the convergence of probability measures. This approach also yields some further characterizations of weak convergence and tightness. CONTENTS 1 About Continuity and Measurability 2 Convergence of Sets and Semicontinuous Functions 3 SC-Measures and SGPrerneasures on 7 (E) 4 Hypo-Limits of SGMeasures and Tightness 5 Tightness and Equi-Semicontinuity Acknowledgment Appendix A Appendix B References - vii - WEAK CONVERGENCE OF PROBABILITY MEASURES REVISITED Gabriella salinettil and Roger J-B wetsL luniversiti "La Sapienza", 00185 Roma, Italy. 2~niversityof California, Davis, CA 95616. 1. ABOUT CONTINUITY AND MEASURABILITY A probabilistic structure - a space of possible events, a sigma-field of (observable) subcollections of events, and a probability measure defined on this sigma-field - does not have a built-in topological structure.
    [Show full text]
  • Arxiv:1009.3824V2 [Math.OC] 7 Feb 2012 E Fpstv Nees Let Integers
    OPTIMIZATION AND CONVERGENCE OF OBSERVATION CHANNELS IN STOCHASTIC CONTROL SERDAR YUKSEL¨ AND TAMAS´ LINDER Abstract. This paper studies the optimization of observation channels (stochastic kernels) in partially observed stochastic control problems. In particular, existence and continuity properties are investigated mostly (but not exclusively) concentrating on the single-stage case. Continuity properties of the optimal cost in channels are explored under total variation, setwise convergence, and weak convergence. Sufficient conditions for compactness of a class of channels under total variation and setwise convergence are presented and applications to quantization are explored. Key words. Stochastic control, information theory, observation channels, optimization, quan- tization AMS subject classifications. 15A15, 15A09, 15A23 1. Introduction. In stochastic control, one is often concerned with the following problem: Given a dynamical system, an observation channel (stochastic kernel), a cost function, and an action set, when does there exist an optimal policy, and what is an optimal control policy? The theory for such problems is advanced, and practically significant, spanning a wide variety of applications in engineering, economics, and natural sciences. In this paper, we are interested in a dual problem with the following questions to be explored: Given a dynamical system, a cost function, an action set, and a set of observation channels, does there exist an optimal observation channel? What is the right convergence notion for continuity in such observation channels for optimization purposes? The answers to these questions may provide useful tools for characterizing an optimal observation channel subject to constraints. We start with the probabilistic setup of the problem. Let X ⊂ Rn, be a Borel set in which elements of a controlled Markov process {Xt, t ∈ Z+} live.
    [Show full text]
  • Arxiv:2102.05840V2 [Math.PR]
    SEQUENTIAL CONVERGENCE ON THE SPACE OF BOREL MEASURES LIANGANG MA Abstract We study equivalent descriptions of the vague, weak, setwise and total- variation (TV) convergence of sequences of Borel measures on metrizable and non-metrizable topological spaces in this work. On metrizable spaces, we give some equivalent conditions on the vague convergence of sequences of measures following Kallenberg, and some equivalent conditions on the TV convergence of sequences of measures following Feinberg-Kasyanov-Zgurovsky. There is usually some hierarchy structure on the equivalent descriptions of convergence in different modes, but not always. On non-metrizable spaces, we give examples to show that these conditions are seldom enough to guarantee any convergence of sequences of measures. There are some remarks on the attainability of the TV distance and more modes of sequential convergence at the end of the work. 1. Introduction Let X be a topological space with its Borel σ-algebra B. Consider the collection M˜ (X) of all the Borel measures on (X, B). When we consider the regularity of some mapping f : M˜ (X) → Y with Y being a topological space, some topology or even metric is necessary on the space M˜ (X) of Borel measures. Various notions of topology and metric grow out of arXiv:2102.05840v2 [math.PR] 28 Apr 2021 different situations on the space M˜ (X) in due course to deal with the corresponding concerns of regularity. In those topology and metric endowed on M˜ (X), it has been recognized that the vague, weak, setwise topology as well as the total-variation (TV) metric are highlighted notions on the topological and metric description of M˜ (X) in various circumstances, refer to [Kal, GR, Wul].
    [Show full text]
  • Statistics and Probability Letters a Note on Vague Convergence Of
    Statistics and Probability Letters 153 (2019) 180–186 Contents lists available at ScienceDirect Statistics and Probability Letters journal homepage: www.elsevier.com/locate/stapro A note on vague convergence of measures ∗ Bojan Basrak, Hrvoje Planini¢ Department of Mathematics, Faculty of Science, University of Zagreb, Bijeni£ka 30, Zagreb, Croatia article info a b s t r a c t Article history: We propose a new approach to vague convergence of measures based on the general Received 17 April 2019 theory of boundedness due to Hu (1966). The article explains how this connects and Accepted 6 June 2019 unifies several frequently used types of vague convergence from the literature. Such Available online 20 June 2019 an approach allows one to translate already developed results from one type of vague MSC: convergence to another. We further analyze the corresponding notion of vague topology primary 28A33 and give a new and useful characterization of convergence in distribution of random secondary 60G57 measures in this topology. 60G70 ' 2019 Elsevier B.V. All rights reserved. Keywords: Boundedly finite measures Vague convergence w#–convergence Random measures Convergence in distribution Lipschitz continuous functions 1. Introduction Let X be a Polish space, i.e. separable topological space which is metrizable by a complete metric. Denote by B(X) the corresponding Borel σ –field and choose a subfamily Bb(X) ⊆ B(X) of sets, called bounded (Borel) sets of X. When there is no fear of confusion, we will simply write B and Bb. A Borel measure µ on X is said to be locally (or boundedly) finite if µ(B) < 1 for all B 2 Bb.
    [Show full text]
  • Weak Convergence of Measures
    Mathematical Surveys and Monographs Volume 234 Weak Convergence of Measures Vladimir I. Bogachev Weak Convergence of Measures Mathematical Surveys and Monographs Volume 234 Weak Convergence of Measures Vladimir I. Bogachev EDITORIAL COMMITTEE Walter Craig Natasa Sesum Robert Guralnick, Chair Benjamin Sudakov Constantin Teleman 2010 Mathematics Subject Classification. Primary 60B10, 28C15, 46G12, 60B05, 60B11, 60B12, 60B15, 60E05, 60F05, 54A20. For additional information and updates on this book, visit www.ams.org/bookpages/surv-234 Library of Congress Cataloging-in-Publication Data Names: Bogachev, V. I. (Vladimir Igorevich), 1961- author. Title: Weak convergence of measures / Vladimir I. Bogachev. Description: Providence, Rhode Island : American Mathematical Society, [2018] | Series: Mathe- matical surveys and monographs ; volume 234 | Includes bibliographical references and index. Identifiers: LCCN 2018024621 | ISBN 9781470447380 (alk. paper) Subjects: LCSH: Probabilities. | Measure theory. | Convergence. Classification: LCC QA273.43 .B64 2018 | DDC 519.2/3–dc23 LC record available at https://lccn.loc.gov/2018024621 Copying and reprinting. Individual readers of this publication, and nonprofit libraries acting for them, are permitted to make fair use of the material, such as to copy select pages for use in teaching or research. Permission is granted to quote brief passages from this publication in reviews, provided the customary acknowledgment of the source is given. Republication, systematic copying, or multiple reproduction of any material in this publication is permitted only under license from the American Mathematical Society. Requests for permission to reuse portions of AMS publication content are handled by the Copyright Clearance Center. For more information, please visit www.ams.org/publications/pubpermissions. Send requests for translation rights and licensed reprints to [email protected].
    [Show full text]
  • Lecture 20: Weak Convergence (PDF)
    MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 20 11/25/2013 Introduction to the theory of weak convergence Content. 1. σ-fields on metric spaces. 2. Kolmogorov σ-field on C[0;T ]. 3. Weak convergence. In the first two sections we review some concepts from measure theory on metric spaces. Then in the last section we begin the discussion of the theory of weak convergence, by stating and proving important Portmentau Theorem, which gives four equivalent definitions of weak convergence. 1 Borel σ-fields on metric space We consider a metric space (S; ρ). To discuss probability measures on metric spaces we first need to introduce σ-fields. Definition 1. Borel σ-field B on S is the field generated by the set of all open sets U ⊂ S. Lemma 1. Suppose S is Polish. Then every open set U ⊂ S can be represented as a countable union of balls B(x; r). Proof. Since S is Polish, we can identify a countable set x1; : : : ; xn;::: which is dense. For each xi 2 U, since U is open we can find a radius ri such that ∗ B(xi; ri) ⊂ U. Fix a constant M > 0. Consider ri = min(arg supfr : B(x; r) ⊂ Ug;M) > 0 and set r = r∗=2. Then [ B(x ; r ) ⊂ U. In i i i:xi2U i i order to finish the proof we need to show that equality [i:xi2U B(xi; ri) = U holds. Consider any x 2 U. There exists 0 < r ≤ M such that B(x; r) ⊂ U.
    [Show full text]
  • Probability Measures on Metric Spaces
    Probability measures on metric spaces Onno van Gaans These are some loose notes supporting the first sessions of the seminar Stochastic Evolution Equations organized by Dr. Jan van Neerven at the Delft University of Technology during Winter 2002/2003. They contain less information than the common textbooks on the topic of the title. Their purpose is to present a brief selection of the theory that provides a basis for later study of stochastic evolution equations in Banach spaces. The notes aim at an audience that feels more at ease in analysis than in probability theory. The main focus is on Prokhorov's theorem, which serves both as an important tool for future use and as an illustration of techniques that play a role in the theory. The field of measures on topological spaces has the luxury of several excellent textbooks. The main source that has been used to prepare these notes is the book by Parthasarathy [6]. A clear exposition is also available in one of Bour- baki's volumes [2] and in [9, Section 3.2]. The theory on the Prokhorov metric is taken from Billingsley [1]. The additional references for standard facts on general measure theory and general topology have been Halmos [4] and Kelley [5]. Contents 1 Borel sets 2 2 Borel probability measures 3 3 Weak convergence of measures 6 4 The Prokhorov metric 9 5 Prokhorov's theorem 13 6 Riesz representation theorem 18 7 Riesz representation for non-compact spaces 21 8 Integrable functions on metric spaces 24 9 More properties of the space of probability measures 26 1 The distribution of a random variable in a Banach space X will be a probability measure on X.
    [Show full text]
  • The Linear Topology Associated with Weak Convergence of Probability
    The linear topology associated with weak convergence of probability measures Liang Hong 1 Abstract This expository note aims at illustrating weak convergence of probability measures from a broader view than a previously published paper. Though the results are standard for functional analysts, this approach is rarely known by statisticians and our presentation gives an alternative view than most standard probability textbooks. In particular, this functional approach clarifies the underlying topological structure of weak convergence. We hope this short note is helpful for those who are interested in weak convergence as well as instructors of measure theoretic probability. MSC 2010 Classification: Primary 00-01; Secondary 60F05, 60A10. Keywords: Weak convergence; probability measure; topological vector space; dual pair arXiv:1312.6589v3 [math.PR] 2 Oct 2014 1Liang Hong is an Assistant Professor in the Department of Mathematics, Robert Morris University, 6001 University Boulevard, Moon Township, PA 15108, USA. Tel.: (412) 397-4024. Email address: [email protected]. 1 1 Introduction Weak convergence of probability measures is often defined as follows (Billingsley 1999 or Parthasarathy 1967). Definition 1.1. Let X be a metrizable space. A sequence {Pn} of probability measures on X is said to converge weakly to a probability measure P if lim f(x)dPn(dx)= f(x)P (dx) n→∞ ZX ZX for every bounded continuous function f on X. It is natural to ask the following questions: 1. Why the type of convergence defined in Definition 1.1 is called “weak convergence”? 2. How weak it is? 3. Is there any connection between weak convergence of probability measures and weak convergence in functional analysis? 4.
    [Show full text]
  • BILLINGSLEY Tee University of Chicago Chicago, Illinois
    Convergence of Probability Measures WILEY SERIES IN PROBABILITY AND STATISTICS PROBABILITY AND STATISTICS SECTION Established by WALTER A. SHEWHART and SAMUEL S. WILKS Editors: Vic Barnett, Noel A. C. Cressie, Nicholas I. Fisher, Iain M, Johnstone, J. B. Kadane, David G. Kendall, David W. Scott, Bernard W. Silverman, Adrian F. M. Smith, Jozef L. Teugels; Ralph A. Bradley, Emeritus, J. Stuart Hunter, Emeritus A complete list of the titles in this series appears at the end of this volume. Convergence of Probability Measures Second Edition PATRICK BILLINGSLEY TEe University of Chicago Chicago, Illinois A Wiley-Interscience Publication JOHN WILEY & SONS,INC. NewYork Chichester Weinheim Brisbane Singapore Toronto This text is printed on acid-free paper. 8 Copyright 0 1999 by John Wiley & Sons, Inc. All rights reserved. Published simultaneously in Canada. No patt of this publication may be reproduced, stored in a retrieval system or transmitted in any form or by any means, electronic, mechanical, photocopying, recording, scanning or otherwise, except as permitted under Section 107 or 108 of the 1976 United States Copyright Act, without either the prior written permission of the Publisher, or authorization through payment of the appropriate per-copy fee to the Copyright Clearance Center, 222 Rosewood Drive, Danvers, MA 01923, (978) 750-8400, fax (978) 750-4744. Requests to the Publisher for permission should be addressed to the Permissions Department, John Wiley & Sons, Inc., 605 Third Avenue, New York, NY 10158-0012, (212) 850-601 1, fax (212) 850-6008, E-Mail: PERMREQ @, WILEY.COM. For ordering and customer service, call 1-800-CALL-WILEY.
    [Show full text]
  • Lecture 1 Monge and Kantorovich Problems: from Primal to Dual
    Lecture 1 Monge and Kantorovich problems: from primal to dual Luca Nenna February 5, 2020 These notes are based on the ones by Quentin Mérigot Some motivations for studying optimal transport. • Variational principles for (real) Monge-Ampère equations occuring in geometry (e.g. Gaussian curvature prescription) or optics. • Wasserstein/Monge-Kantorovich distance between probability measures µ, ν on e.g. d R : how much kinetic energy does one require to move a distribution of mass de- scribed by µ to ν ? −! interpretation of some parabolic PDEs as Wasserstein gradient flows, construc- tion of (weak) solutions, numerics, e.g. 8 >@tρ + div(ρv) = 0 ( > @ ρ + div(ρv) = 0 <>v = −∇p − rV t or v = −∇ log ρ p(1 − ρ) = 0 > > :p > 0; ρ 6 1 −! interesting geometry on P(X), with an embedding X,!P(X). Applications in geometry (synthetic notion of Ricci curvature for metric spaces), machine learning, inverse problems, etc. • Quantum physics: electronic configuration in molecules and atoms. References. Introduction to optimal transport, with applications to PDE and/or calculus of variations can be found in books by Villani [6] and Santambrogio [5]. Villani’s second book [7] concentrates on the application of optimal transport to geometric questions (e.g. synthetic definition of Ricci curvature), but its first chapters might be useful. We also mention Gigli, Ambrosio and Savaré [2] for the study of gradient flows with respect to the Monge- Kantorovich/Wasserstein metric. Notation. In the following, we assume that X is a complete and separable metric space. We denote C(X) the space of continuous functions, C0(X) the space of continuous function vanishing 1 at infinity Cb(X) the space of bounded continuous functions.
    [Show full text]
  • Ronan Thesis 2021
    NEW TECHNIQUES IN OPTIMAL TRANSPORT A Thesis Submitted to the Faculty in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Mathematics by James Gordon Ronan DARTMOUTH COLLEGE Hanover, New Hampshire May 2021 Examining Committee: AnneGeet Anne Gelb, Chair Mather Pour Matthew Parno Peter Doyle Douglas Cochran F. Jon Kull, Ph.D. Dean of the Guarini School of Graduate and Advanced Studies Abstract This thesis develops a new technique for applications of optimal transport and presents anewperspectiveonoptimaltransportthroughthemeasuretheoretictooloftransi- tion kernels. Optimal transport provides a way of lifting a distance metric on a space to probability measures over that space. This makes the field well suited for certain types of image analysis. Part of this thesis focuses on a new application for optimal transport, while the other focuses on a new approach to optimal transport itself. With respect to the first part of this thesis, we propose using semi-discrete optimal transport for the estimation of parameters in physical scenes and show how to do so. Optimal transport is a natural setting when studying images because displacements of the objects in the image directly correspond to a change in the optimal transport cost. In the second part of this thesis we discuss transition kernels, which provide a mathematical tool that can be used to map measures to measures. It therefore seems intuitive to incorporate transition kernels into optimal transport problems. However, this requires changing the traditional perspective of viewing optimal transport as primarily a tool to measure distances between two fixed measures. To that end, this thesis develops theory to show how kernels may be used to extend optimal transport to signed measures.
    [Show full text]
  • Arxiv:2106.00086V2 [Math.LO] 2 Jun 2021 Esrsudrdffrn Ovrec Oin 3 9]
    EFFECTIVE NOTIONS OF WEAK CONVERGENCE OF MEASURES ON THE REAL LINE TIMOTHY H. MCNICHOLL AND DIEGO A. ROJAS Abstract. We establish a framework for the study of the effective theory of weak convergence of measures. We define two effective notions of weak convergence of measures on R: one uniform and one non-uniform. We show that these notions are equivalent. By means of this equivalence, we prove an effective version of the Portmanteau Theorem, which consists of multiple equivalent definitions of weak convergence of measures. 1. Introduction Beginning with Alan Turing’s pioneering work on the computability of real num- bers in the mid-1930s [15], mathematicians have understood that concepts central to the study of mathematical analysis have analogues in computability theory. Com- putable analysis examines these computable analogues and attempts to recover as much of the classical theory of analysis as possible while working in a framework for computation. Current research in the field has investigated the interactions between computability and measure theory, one of the main sub-fields of classical analysis, especially with its deep connections to algorithmic randomness [13, 14]. In particular, one aspect of measure theory in algorithmic randomness lies in the connections between random points and measures defined as limits of sequences of measures under different convergence notions [3, 9]. The most commonly studied notion of convergence for sequences of measures in analysis is weak convergence. A sequence of finite Borel measures {µn}n∈N on a separable metric space X weakly converges to a measure µ if, for every bounded continuous real-valued function f on X, the sequence of real numbers arXiv:2106.00086v2 [math.LO] 2 Jun 2021 limn X f dµn = X f dµ.
    [Show full text]