Probability Theory II

Total Page:16

File Type:pdf, Size:1020Kb

Probability Theory II Probability Theory II Spring 2016 Peter Orbanz Contents Chapter 1. Martingales1 1.1. Martingales indexed by partially ordered sets1 1.2. Martingales from adapted processes4 1.3. Stopping times and optional sampling4 1.4. Tail bounds for martingales7 1.5. Notions of convergence for martingales 10 1.6. Uniform integrability 10 1.7. Convergence of martingales 12 1.8. Application: The 0-1 law of Kolmogorov 17 1.9. Continuous-time martingales 18 1.10. Application: The P´olya urn 20 1.11. Application: The Radon-Nikodym theorem 22 Chapter 2. Measures on nice spaces 27 2.1. Topology review 27 2.2. Metric and metrizable spaces 30 2.3. Regularity of measures 31 2.4. Weak convergence 33 2.5. Polish spaces and Borel spaces 35 2.6. The space of probability measures 39 2.7. Compactness and local compactness 42 2.8. Tightness 44 2.9. Some additional useful facts on Polish spaces 45 Chapter 3. Conditioning 47 3.1. A closer look at the definition 47 3.2. Conditioning on σ-algebras 48 3.3. Conditional distributions given σ-algebras 50 3.4. Working with pairs of random variables 52 3.5. Conditional independence 56 3.6. Application: Sufficient statistics 58 3.7. Conditional densities 60 Chapter 4. Pushing forward and pulling back 61 4.1. Integration with respect to image measures 61 4.2. Pullback measures 63 Chapter 5. Stochastic processes 65 5.1. Constructing spaces of mappings 66 5.2. Extension theorems 69 i ii CONTENTS 5.3. Processes with regular paths 72 5.4. Brownian motion 75 5.5. Markov processes 77 5.6. Processes with stationary and independent increments 79 5.7. Gaussian processes 82 5.8. The Poisson process 84 Bibliography Main references Other references Index CHAPTER 1 Martingales The basic limit theorems of probability, such as the elementary laws of large numbers and central limit theorems, establish that certain averages of independent variables converge to their expected values. A sequence of such averages is a random sequence, but it completely derandomizes in the limit, and this is usually a direct consequence of independence. For more complicated processes|typically, if the variables are stochastically dependent|the limit is not a constant, but is itself random. In general, random limits are very hard to handle mathematically, since we have to precisely quantify the effect of dependencies and control the aggregating randomness as we get further into the sequence. It turns out that it is possible to control dependencies and randomness if a process (Xn)n2N has the simple property E[Xn Xm] =a:s: Xm for all m n ; j ≤ which is called the martingale property. From this innocuous identity, we can derive a number of results which are so powerful, and so widely applicable, that they make martingales one of the fundamental tools of probability theory. Many of these results still hold if we use another index set than N, condition on more general events than the value of Xm, or weaken the equality above to or . We will see all this in detail in this chapter, but for now, I would like you≤ to absorb≥ that martingales provide tools for working with random limits. They are not the only such tools, but there are few others. Notation. We assume throughout that all random variables are defined on a single abstract probability space (Ω; ; P). Any random variable X is a measurable A map X :Ω into some measurable space ( ; x). Elements of Ω are always denoted !.!X Think of any ! as a possible \state ofX theA universe"; a random variable X picks out some limited aspect X(!) of ! (the outcome of a coin flip, say, or the path a stochastic process takes). The law of X is the image measure of P under X, and generically denoted X(P) =: (X). The σ-algebra generated by X −1 L is denoted σ(X) := X x . Keep in mind that these conventions imply the A ⊂ A conditional expectation of a real-valued random variable X :Ω R is a random ! variable E[X ]:Ω R, for any σ-algebra . jC ! C ⊂ A 1.1. Martingales indexed by partially ordered sets The most common types of martingales are processes indexed by values in N (so- called \discrete-time martingales") and in R+ (\continuous-time martingales"). However, martingales can much more generally be defined for index sets that need 1 2 1. MARTINGALES not be totally ordered, and we will later on prove the fundamental martingale convergence results for such general index sets. Partially ordered index sets. Let T be a set. Recall that a binary relation on T is called a partial order if it is (1) reflexive: s s for every s T. (2) antisymmetric: If s t and2t s, then s = t. (3) transitive: If s t and t u, then s u. In general, a partially ordered set may contain elements that are not comparable, i.e. some s; t for which neither s t nor t s (hence \partial"). If all pairs of elements are comparable, the partial order is called a total order. We need partially ordered index sets in various contexts, including martingales and the construction of stochastic processes. We have to be careful, though: Us- ing arbitrary partially ordered sets can lead to all kinds of pathologies. Roughly speaking, the problem is that a partially ordered set can decompose into subsets be- tween which elements cannot be compared at all, as if we were indexing arbitrarily by picking indices from completely unrelated index sets. For instance, a partially ordered set could contain two sequences s1 s2 s3 ::: and t1 t2 t3 ::: of elements which both grow larger and larger in terms of the partial order, but whose elements are completely incomparable between the sequences. To avoid such pathologies, we impose an extra condition: If s; t T, there exists u T such that s u and t u : (1.1) 2 2 A partially ordered set (T; ) which satisfies (1.1) is called a directed set. 1.1 Example. Some directed sets: (a) The set of subsets of an arbitrary set, ordered by inclusion. (b) The set of finite subsets of an infinite set, ordered by inclusion. (c) The set of positive definite n n matrices over R, in the L¨ownerpartial order. × (d) Obviously, any totally ordered set (such as N or R in the standard order). / Just as we can index a family of variables by N and obtain a sequence, we can more generally index it by a directed set; the generalization of a sequence so obtained is called a net. To make this notion precise, let be a set. Recall that, X formally, an (infinite) sequence in is a mapping N , that is, each index X !X s is mapped to the sequence element xi. We usually denote such a sequence as (xi)i2N, or more concisely as (xi). 1.2 Definition. Let (T; ) be a directed set. A net in a set is a function X x : T , and we write xt := x(t) and denote the net as (xt)t2 . / !X T Clearly, the net is a sequence if (T; ) is specifically the totally ordered set (N; ). Just like sequences, nets may converge to a limit: ≤ 1.3 Definition. A net (xt)t2T is said to converge to a point x if, for every open neighborhood U of x, there exists an index t0 T such that 2 xt U whenever t t : (1.2) 2 0 / Nets play an important role in real and functional analysis: To establish certain properties in spaces more general than Rd, we may have to demand that every net satisfying certain properties converges (not just every sequence). Sequences 1.1. MARTINGALES INDEXED BY PARTIALLY ORDERED SETS 3 have stronger properties than nets; for example, in any topological space, the set consisting of all elements of a convergent sequence and its limit is a compact set. The same need not be true for a net. Filtrations and martingales. Let (T; ) be a directed set. A filtration is a family = ( t)t2 of σ-algebras i, indexed by the elements of T, that satisfy F F T F s t = s t : (1.3) )F ⊂ F The index set T = N is often referred to as discrete time; similarly, T = R+ is called continuous time. The filtration property states that each σ-algebra s contains all preceding ones. For a filtration, there is also a uniquely determined,F smallest σ- algebra which contains all σ-algebras in , namely F S 1 := σ s : (1.4) F s2TF Now, let (T; ) be a partially ordered set and = ( s)s2 a filtration. We F F T call a family (Xs)s2 of random variables adapted to if Xs is s-measurable for T F F every s. Clearly, every random sequence or random net (Xs)s2T is adapted to the filtration defined by S t := σ σ(Xs) ; (1.5) F st which is called the canonical filtration of (Xs). An adapted family (Xs; s)s2T is called a martingale if (i) each variable Xs is real-valued and integrableF and (ii) it satisfies Xs =a:s: E[Xt s] whenever s t : (1.6) jF Note (1.6) can be expressed equivalently as Z Z A s : XsdP = XtdP whenever s t : (1.7) 8 2 F A A If (Xs; s)s2T satisfies (1.6) only with equality weakened to (i.e. Xs E(Xt s)), it is calledF a submartingale; for , it is called a supermartingale≤ .≤ jF Intuitively, the martingale property≥ (1.6) says the following: Think of the in- dices s and t as times.
Recommended publications
  • Lectures on Fractal Geometry and Dynamics
    Lectures on fractal geometry and dynamics Michael Hochman∗ March 25, 2012 Contents 1 Introduction 2 2 Preliminaries 3 3 Dimension 3 3.1 A family of examples: Middle-α Cantor sets . 3 3.2 Minkowski dimension . 4 3.3 Hausdorff dimension . 9 4 Using measures to compute dimension 13 4.1 The mass distribution principle . 13 4.2 Billingsley's lemma . 15 4.3 Frostman's lemma . 18 4.4 Product sets . 22 5 Iterated function systems 24 5.1 The Hausdorff metric . 24 5.2 Iterated function systems . 26 5.3 Self-similar sets . 31 5.4 Self-affine sets . 36 6 Geometry of measures 39 6.1 The Besicovitch covering theorem . 39 6.2 Density and differentiation theorems . 45 6.3 Dimension of a measure at a point . 48 6.4 Upper and lower dimension of measures . 50 ∗Send comments to [email protected] 1 6.5 Hausdorff measures and their densities . 52 1 Introduction Fractal geometry and its sibling, geometric measure theory, are branches of analysis which study the structure of \irregular" sets and measures in metric spaces, primarily d R . The distinction between regular and irregular sets is not a precise one but informally, k regular sets might be understood as smooth sub-manifolds of R , or perhaps Lipschitz graphs, or countable unions of the above; whereas irregular sets include just about 1 everything else, from the middle- 3 Cantor set (still highly structured) to arbitrary d Cantor sets (irregular, but topologically the same) to truly arbitrary subsets of R . d For concreteness, let us compare smooth sub-manifolds and Cantor subsets of R .
    [Show full text]
  • Normal Convergence
    Normal convergence Uniform convergence: A sequence fn converges uniformly on K ⊆ C means 8" > 0 9N 8n ≥ N sup jf(z) − g(z)j < ". z2K 1 1 Weierstrass M-test: If 8z 2 K jfn(z)j ≤ Mn and Mn converges, then fn converges uniformly on K. n=1 n=1 X X Normal convergence: Given a domain Ω ⊆ C, a sequence of functions fn : Ω ! C converges locally uniformly means 8z 2 Ω 9δ > 0 such that the functions fn restricted to Bδ(z) converge uniformly. Heine-Borel theorem implies that locally uniform convergence is equivalent to convergence that is uniform on compact subsets. Topologists call this compact convergence, while complex analysts call it normal convergence. Compact-open topology: Let X and Y be topological spaces and C(X; Y ) be the set of all continuous functions X ! Y . For each compact K ⊆ X and open U ⊆ Y let S = ff: f(K) ⊆ Ug. The topology on C(X; Y ) generated by all such S is called the compact-open topology. In this topology functions are near when their values are close on compact sets. The compact-open topology on C(Ω; C) is exactly the topology of normal convergence (see Theorems XII.7.2 [3], 5.1 [4]). The space of holomorphic functions: Let H(Ω) denote the space of holomorphic functions on Ω. We can construct a metric for the compact-open topology on H(Ω) ⊆ C(Ω; C) by writing Ω as a union of a tower of compact subsets and using a bounded uniform metric on these subsets.
    [Show full text]
  • Modes of Convergence in Probability Theory
    Modes of Convergence in Probability Theory David Mandel November 5, 2015 Below, fix a probability space (Ω; F;P ) on which all random variables fXng and X are defined. All random variables are assumed to take values in R. Propositions marked with \F" denote results that rely on our finite measure space. That is, those marked results may not hold on a non-finite measure space. Since we already know uniform convergence =) pointwise convergence this proof is omitted, but we include a proof that shows pointwise convergence =) almost sure convergence, and hence uniform convergence =) almost sure convergence. The hierarchy we will show is diagrammed in Fig. 1, where some famous theorems that demonstrate the type of convergence are in parentheses: (SLLN) = strong long of large numbers, (WLLN) = weak law of large numbers, (CLT) ^ = central limit theorem. In parameter estimation, θn is said to be a consistent ^ estimator or θ if θn ! θ in probability. For example, by the SLLN, X¯n ! µ a.s., and hence X¯n ! µ in probability. Therefore the sample mean is a consistent estimator of the population mean. Figure 1: Hierarchy of modes of convergence in probability. 1 1 Definitions of Convergence 1.1 Modes from Calculus Definition Xn ! X pointwise if 8! 2 Ω, 8 > 0, 9N 2 N such that 8n ≥ N, jXn(!) − X(!)j < . Definition Xn ! X uniformly if 8 > 0, 9N 2 N such that 8! 2 Ω and 8n ≥ N, jXn(!) − X(!)j < . 1.2 Modes Unique to Measure Theory Definition Xn ! X in probability if 8 > 0, 8δ > 0, 9N 2 N such that 8n ≥ N, P (jXn − Xj ≥ ) < δ: Or, Xn ! X in probability if 8 > 0, lim P (jXn − Xj ≥ 0) = 0: n!1 The explicit epsilon-delta definition of convergence in probability is useful for proving a.s.
    [Show full text]
  • Probability and Statistics Lecture Notes
    Probability and Statistics Lecture Notes Antonio Jiménez-Martínez Chapter 1 Probability spaces In this chapter we introduce the theoretical structures that will allow us to assign proba- bilities in a wide range of probability problems. 1.1. Examples of random phenomena Science attempts to formulate general laws on the basis of observation and experiment. The simplest and most used scheme of such laws is: if a set of conditions B is satisfied =) event A occurs. Examples of such laws are the law of gravity, the law of conservation of mass, and many other instances in chemistry, physics, biology... If event A occurs inevitably whenever the set of conditions B is satisfied, we say that A is certain or sure (under the set of conditions B). If A can never occur whenever B is satisfied, we say that A is impossible (under the set of conditions B). If A may or may not occur whenever B is satisfied, then A is said to be a random phenomenon. Random phenomena is our subject matter. Unlike certain and impossible events, the presence of randomness implies that the set of conditions B do not reflect all the necessary and sufficient conditions for the event A to occur. It might seem them impossible to make any worthwhile statements about random phenomena. However, experience has shown that many random phenomena exhibit a statistical regularity that makes them subject to study. For such random phenomena it is possible to estimate the chance of occurrence of the random event. This estimate can be obtained from laws, called probabilistic or stochastic, with the form: if a set of conditions B is satisfied event A occurs m times =) repeatedly n times out of the n repetitions.
    [Show full text]
  • Exercise Set 13 Top04-E013 ; November 26, 2004 ; 9:45 A.M
    Prof. D. P.Patil, Department of Mathematics, Indian Institute of Science, Bangalore August-December 2004 MA-231 Topology 13. Function spaces1) —————————— — Uniform Convergence, Stone-Weierstrass theorem, Arzela-Ascoli theorem ———————————————————————————————– November 22, 2004 Karl Theodor Wilhelm Weierstrass† Marshall Harvey Stone†† (1815-1897) (1903-1989) First recall the following definitions and results : Our overall aim in this section is the study of the compactness and completeness properties of a subset F of the set Y X of all maps from a space X into a space Y . To do this a usable topology must be introduced on F (presumably related to the structures of X and Y ) and when this is has been done F is called a f unction space. N13.1 . (The Topology of Pointwise Convergence 2) Let X be any set, Y be any topological space and let fn : X → Y , n ∈ N, be a sequence of maps. We say that the sequence (fn)n∈N is pointwise convergent,ifforeverypoint x ∈X, the sequence fn(x) , n∈N,inY is convergent. a). The (Tychonoff) product topology on Y X is determined solely by the topology of Y (even if X is a topological X X space) the structure on X plays no part. A sequence fn : X →Y , n∈N,inY converges to a function f in Y if and only if for every point x ∈X, the sequence fn(x) , n∈N,inY is convergent. This provides the reason for the name the topology of pointwise convergence;this topology is also simply called the pointwise topology andisdenoted by Tptc .
    [Show full text]
  • Measure and Integration (Under Construction)
    Measure and Integration (under construction) Gustav Holzegel∗ April 24, 2017 Abstract These notes accompany the lecture course ”Measure and Integration” at Imperial College London (Autumn 2016). They follow very closely the text “Real-Analysis” by Stein-Shakarchi, in fact most proofs are simple rephrasings of the proofs presented in the aforementioned book. Contents 1 Motivation 3 1.1 Quick review of the Riemann integral . 3 1.2 Drawbacks of the class R, motivation of the Lebesgue theory . 4 1.2.1 Limits of functions . 5 1.2.2 Lengthofcurves ....................... 5 1.2.3 TheFundamentalTheoremofCalculus . 5 1.3 Measures of sets in R ......................... 6 1.4 LiteratureandFurtherReading . 6 2 Measure Theory: Lebesgue Measure in Rd 7 2.1 Preliminaries and Notation . 7 2.2 VolumeofRectanglesandCubes . 7 2.3 Theexteriormeasure......................... 9 2.3.1 Examples ........................... 9 2.3.2 Propertiesoftheexteriormeasure . 10 2.4 Theclassof(Lebesgue)measurablesets . 12 2.4.1 The property of countable additivity . 14 2.4.2 Regularity properties of the Lebesgue measure . 15 2.4.3 Invariance properties of the Lebesgue measure . 16 2.4.4 σ-algebrasandBorelsets . 16 2.5 Constructionofanon-measurableset. 17 2.6 MeasurableFunctions ........................ 19 2.6.1 Some abstract preliminary remarks . 19 2.6.2 Definitions and equivalent formulations of measurability . 19 2.6.3 Properties 1: Behaviour under compositions . 21 2.6.4 Properties 2: Behaviour under limits . 21 2.6.5 Properties3: Behaviourof sums and products . 22 ∗Imperial College London, Department of Mathematics, South Kensington Campus, Lon- don SW7 2AZ, United Kingdom. 1 2.6.6 Thenotionof“almosteverywhere”. 22 2.7 Building blocks of integration theory .
    [Show full text]
  • The Probability Set-Up.Pdf
    CHAPTER 2 The probability set-up 2.1. Basic theory of probability We will have a sample space, denoted by S (sometimes Ω) that consists of all possible outcomes. For example, if we roll two dice, the sample space would be all possible pairs made up of the numbers one through six. An event is a subset of S. Another example is to toss a coin 2 times, and let S = fHH;HT;TH;TT g; or to let S be the possible orders in which 5 horses nish in a horse race; or S the possible prices of some stock at closing time today; or S = [0; 1); the age at which someone dies; or S the points in a circle, the possible places a dart can hit. We should also keep in mind that the same setting can be described using dierent sample set. For example, in two solutions in Example 1.30 we used two dierent sample sets. 2.1.1. Sets. We start by describing elementary operations on sets. By a set we mean a collection of distinct objects called elements of the set, and we consider a set as an object in its own right. Set operations Suppose S is a set. We say that A ⊂ S, that is, A is a subset of S if every element in A is contained in S; A [ B is the union of sets A ⊂ S and B ⊂ S and denotes the points of S that are in A or B or both; A \ B is the intersection of sets A ⊂ S and B ⊂ S and is the set of points that are in both A and B; ; denotes the empty set; Ac is the complement of A, that is, the points in S that are not in A.
    [Show full text]
  • Borel Structure in Groups and Their Dualso
    BOREL STRUCTURE IN GROUPS AND THEIR DUALSO GEORGE W. MACKEY Introduction. In the past decade or so much work has been done toward extending the classical theory of finite dimensional representations of com- pact groups to a theory of (not necessarily finite dimensional) unitary repre- sentations of locally compact groups. Among the obstacles interfering with various aspects of this program is the lack of a suitable natural topology in the "dual object"; that is in the set of equivalence classes of irreducible representations. One can introduce natural topologies but none of them seem to have reasonable properties except in extremely special cases. When the group is abelian for example the dual object itself is a locally compact abelian group. This paper is based on the observation that for certain purposes one can dispense with a topology in the dual object in favor of a "weaker struc- ture" and that there is a wide class of groups for which this weaker structure has very regular properties. If .S is any topological space one defines a Borel (or Baire) subset of 5 to be a member of the smallest family of subsets of 5 which includes the open sets and is closed with respect to the formation of complements and countable unions. The structure defined in 5 by its Borel sets we may call the Borel structure of 5. It is weaker than the topological structure in the sense that any one-to-one transformation of S onto 5 which preserves the topological structure also preserves the Borel structure whereas the converse is usually false.
    [Show full text]
  • Generalized Inverse Limits and Topological Entropy
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Repository of Faculty of Science, University of Zagreb FACULTY OF SCIENCE DEPARTMENT OF MATHEMATICS Goran Erceg GENERALIZED INVERSE LIMITS AND TOPOLOGICAL ENTROPY DOCTORAL THESIS Zagreb, 2016 PRIRODOSLOVNO - MATEMATICKIˇ FAKULTET MATEMATICKIˇ ODSJEK Goran Erceg GENERALIZIRANI INVERZNI LIMESI I TOPOLOŠKA ENTROPIJA DOKTORSKI RAD Zagreb, 2016. FACULTY OF SCIENCE DEPARTMENT OF MATHEMATICS Goran Erceg GENERALIZED INVERSE LIMITS AND TOPOLOGICAL ENTROPY DOCTORAL THESIS Supervisors: prof. Judy Kennedy prof. dr. sc. Vlasta Matijevic´ Zagreb, 2016 PRIRODOSLOVNO - MATEMATICKIˇ FAKULTET MATEMATICKIˇ ODSJEK Goran Erceg GENERALIZIRANI INVERZNI LIMESI I TOPOLOŠKA ENTROPIJA DOKTORSKI RAD Mentori: prof. Judy Kennedy prof. dr. sc. Vlasta Matijevic´ Zagreb, 2016. Acknowledgements First of all, i want to thank my supervisor professor Judy Kennedy for accept- ing a big responsibility of guiding a transatlantic student. Her enthusiasm and love for mathematics are contagious. I thank professor Vlasta Matijevi´c, not only my supervisor but also my role model as a professor of mathematics. It was privilege to be guided by her for master's and doctoral thesis. I want to thank all my math teachers, from elementary school onwards, who helped that my love for math rises more and more with each year. Special thanks to Jurica Cudina´ who showed me a glimpse of math theory already in the high school. I thank all members of the Topology seminar in Split who always knew to ask right questions at the right moment and to guide me in the right direction. I also thank Iztok Baniˇcand the rest of the Topology seminar in Maribor who welcomed me as their member and showed me the beauty of a teamwork.
    [Show full text]
  • MTH 304: General Topology Semester 2, 2017-2018
    MTH 304: General Topology Semester 2, 2017-2018 Dr. Prahlad Vaidyanathan Contents I. Continuous Functions3 1. First Definitions................................3 2. Open Sets...................................4 3. Continuity by Open Sets...........................6 II. Topological Spaces8 1. Definition and Examples...........................8 2. Metric Spaces................................. 11 3. Basis for a topology.............................. 16 4. The Product Topology on X × Y ...................... 18 Q 5. The Product Topology on Xα ....................... 20 6. Closed Sets.................................. 22 7. Continuous Functions............................. 27 8. The Quotient Topology............................ 30 III.Properties of Topological Spaces 36 1. The Hausdorff property............................ 36 2. Connectedness................................. 37 3. Path Connectedness............................. 41 4. Local Connectedness............................. 44 5. Compactness................................. 46 6. Compact Subsets of Rn ............................ 50 7. Continuous Functions on Compact Sets................... 52 8. Compactness in Metric Spaces........................ 56 9. Local Compactness.............................. 59 IV.Separation Axioms 62 1. Regular Spaces................................ 62 2. Normal Spaces................................ 64 3. Tietze's extension Theorem......................... 67 4. Urysohn Metrization Theorem........................ 71 5. Imbedding of Manifolds..........................
    [Show full text]
  • Arxiv:2102.05840V2 [Math.PR]
    SEQUENTIAL CONVERGENCE ON THE SPACE OF BOREL MEASURES LIANGANG MA Abstract We study equivalent descriptions of the vague, weak, setwise and total- variation (TV) convergence of sequences of Borel measures on metrizable and non-metrizable topological spaces in this work. On metrizable spaces, we give some equivalent conditions on the vague convergence of sequences of measures following Kallenberg, and some equivalent conditions on the TV convergence of sequences of measures following Feinberg-Kasyanov-Zgurovsky. There is usually some hierarchy structure on the equivalent descriptions of convergence in different modes, but not always. On non-metrizable spaces, we give examples to show that these conditions are seldom enough to guarantee any convergence of sequences of measures. There are some remarks on the attainability of the TV distance and more modes of sequential convergence at the end of the work. 1. Introduction Let X be a topological space with its Borel σ-algebra B. Consider the collection M˜ (X) of all the Borel measures on (X, B). When we consider the regularity of some mapping f : M˜ (X) → Y with Y being a topological space, some topology or even metric is necessary on the space M˜ (X) of Borel measures. Various notions of topology and metric grow out of arXiv:2102.05840v2 [math.PR] 28 Apr 2021 different situations on the space M˜ (X) in due course to deal with the corresponding concerns of regularity. In those topology and metric endowed on M˜ (X), it has been recognized that the vague, weak, setwise topology as well as the total-variation (TV) metric are highlighted notions on the topological and metric description of M˜ (X) in various circumstances, refer to [Kal, GR, Wul].
    [Show full text]
  • Dirichlet Is Natural Vincent Danos, Ilias Garnier
    Dirichlet is Natural Vincent Danos, Ilias Garnier To cite this version: Vincent Danos, Ilias Garnier. Dirichlet is Natural. MFPS 31 - Mathematical Foundations of Program- ming Semantics XXXI, Jun 2015, Nijmegen, Netherlands. pp.137-164, 10.1016/j.entcs.2015.12.010. hal-01256903 HAL Id: hal-01256903 https://hal.archives-ouvertes.fr/hal-01256903 Submitted on 15 Jan 2016 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. MFPS 2015 Dirichlet is natural Vincent Danos1 D´epartement d'Informatique Ecole normale sup´erieure Paris, France Ilias Garnier2 School of Informatics University of Edinburgh Edinburgh, United Kingdom Abstract Giry and Lawvere's categorical treatment of probabilities, based on the probabilistic monad G, offer an elegant and hitherto unexploited treatment of higher-order probabilities. The goal of this paper is to follow this formulation to reconstruct a family of higher-order probabilities known as the Dirichlet process. This family is widely used in non-parametric Bayesian learning. Given a Polish space X, we build a family of higher-order probabilities in G(G(X)) indexed by M ∗(X) the set of non-zero finite measures over X. The construction relies on two ingredients.
    [Show full text]