On Collapsed Representation of Hierarchical Completely Random Measures

Gaurav Pandey, Ambedkar Dukkipati
Department of Computer Science and Automation, Indian Institute of Science, Bangalore-560012, India

arXiv:1509.01817v2 [math.ST] 2 Jun 2016

Proceedings of the 33rd International Conference on Machine Learning, New York, NY, USA, 2016. JMLR: W&CP volume 48. Copyright 2016 by the author(s).

Abstract

The aim of the paper is to provide an exact approach for generating a Poisson process sampled from a hierarchical CRM, without having to instantiate the infinitely many atoms of the random measures. We use completely random measures (CRM) and hierarchical CRM to define a prior for Poisson processes. We derive the marginal distribution of the resultant point process, when the underlying CRM is marginalized out. Using well known properties unique to Poisson processes, we were able to derive an exact approach for instantiating a Poisson process with a hierarchical CRM prior. Furthermore, we derive Gibbs sampling strategies for hierarchical CRM models based on the Chinese restaurant franchise sampling scheme. As an example, we present the sum of generalized gamma process (SGGP), and show its application in topic-modelling. We show that one can determine the power-law behaviour of the topics and words in a Bayesian fashion, by defining a prior on the parameters of SGGP.

1. Introduction

Mixed membership modelling is the problem of assigning an object to multiple latent classes/features simultaneously. Depending upon the problem, one can allow a single latent feature to be exhibited once or multiple times by the object. For instance, a document may comprise several topics, with each topic occurring in the document with variable multiplicity. The corresponding problem of mapping the words of a document to topics is referred to as topic modelling.

While parametric solutions to mixed membership modelling have been available in the literature for more than a decade (Landauer & Dumais, 1997; Hofmann, 1999; Blei et al., 2001), the first non-parametric approach, which allowed the number of latent classes to be determined as well, was the hierarchical Dirichlet process (HDP) (Teh et al., 2006). Both approaches model the object as a set of repeated draws from an object-specific distribution, whereby the object-specific distribution is itself sampled from a common distribution. On the other hand, recent approaches such as the hierarchical beta-negative binomial process (Zhou et al., 2012; Broderick et al., 2015) and the hierarchical gamma-Poisson process (Titsias, 2008; Zhou & Carin, 2015) model the object as a point process, sampled from an object-specific random measure, which is itself sampled from a common random measure. In some sense, these approaches are more natural for mixed membership modelling, since they model the object as a single entity rather than as a sequence of draws from a distribution.

A straightforward implementation of any of the above non-parametric models would require sampling the atoms in the non-parametric distribution for the base as well as the object-specific measure. However, since the number of atoms in these distributions is often infinite, a truncation step is required to ensure tractability. Alternatively, for the HDP, a Chinese restaurant franchise scheme (Teh et al., 2006) can be used for collapsed inference in the model (that is, without explicitly instantiating the atoms). Fully collapsed inference schemes have also been proposed for the beta-negative binomial process (BNBP) (Heaukulani & Roy, 2013; Zhou, 2014) and the Gamma-Gamma-Poisson process (Zhou et al., 2015). Of particular relevance is the work by Roy (2014), whereby a Chinese restaurant franchise scheme has been proposed for hierarchies of beta processes (and their generalizations), when coupled with a Bernoulli process.

In this paper, it is our aim to extend fully collapsed sampling so as to allow any completely random measure (CRM) for the choice of base and object-specific measure. As proposed in Roy (2014) for hierarchies of generalized beta processes, we propose Chinese restaurant franchise schemes for hierarchies of CRMs, when coupled with a Poisson process. We hope that this will encourage the use of hierarchical random measures, other than HDP and BNBP, for mixed-membership modelling and will lead to further research into an understanding of the applicability of the various random measures. To give an idea about the flexibility that can be obtained by using other measures, we propose the sum of generalized gamma process (SGGP), which allows one to determine the power term in the power-law distribution of topics with documents, by defining a prior on the parameters of SGGP. Alternatively, one can also define a prior directly on the discount parameter.

The main contributions in this paper are as follows:

• We derive marginal distributions of the Poisson process, when coupled with CRMs.

• We provide an exact approach for generating a Poisson process sampled from a hierarchical CRM, without having to instantiate the infinitely many atoms of the random measure.

• We provide a Gibbs sampling approach for sampling a Poisson process from a hierarchical CRM.

• In the experiments section, we propose the sum of generalized gamma process (SGGP), and show its applicability for topic-modelling. By defining a prior on the parameters of SGGP, one can determine the power-law distribution of the topics and words in a Bayesian fashion.

2. Preliminaries and background

In this section, we fix the notation and recall a few well known results from the theory of point processes.

2.1. Poisson process

Let $(S, \mathcal{S})$ be a measurable space and $\Pi$ be a random countable collection of points on $S$. Let $N(A) = |\Pi \cap A|$, for any measurable set $A$. $N$ is also known as the counting process of $\Pi$. $\Pi$ is called a Poisson process if $N(A)$ is independent of $N(B)$ whenever $A$ and $B$ are disjoint measurable sets, and $N(A)$ is Poisson distributed with mean $\mu(A)$ for a fixed $\sigma$-finite measure $\mu$. In the sequel, we refer to both the random collection $\Pi$ and its counting process $N$ as a Poisson process.

Let $(T, \mathcal{T})$ be another measurable space and $f : S \to T$ be a measurable function. If the push-forward measure of $\mu$ via $f$, that is, $\mu \circ f^{-1}$, is non-atomic, then $f(\Pi) = \{f(x) : x \in \Pi\}$ is also a Poisson process with mean measure $\mu \circ f^{-1}$. This is known as the mapping proposition for Poisson processes (Kingman, 1992). Moreover, if $\Pi_1, \Pi_2, \dots$ is a countable collection of independent Poisson processes with mean measures $\mu_1, \mu_2, \dots$ respectively, then the union $\Pi = \bigcup_{i=1}^{\infty} \Pi_i$ is also a Poisson process with mean measure $\mu = \sum_{i=1}^{\infty} \mu_i$. This is known as the superposition proposition. Equivalently, if $N_i$ is the counting process of $\Pi_i$, then $N = \sum_{i=1}^{\infty} N_i$ is the counting process of a Poisson process with mean measure $\mu = \sum_{i=1}^{\infty} \mu_i$.

Finally, let $g$ be a measurable function from $S$ to $\mathbb{R}$, and $\Sigma = \sum_{x \in \Pi} g(x)$. By Campbell's proposition (Kingman, 1992), $\Sigma$ is absolutely convergent with probability one if and only if

$$\int_S \min(|g(x)|, 1)\, \mu(dx) < \infty. \qquad (1)$$

If this condition holds, then for any $t > 0$,

$$E[e^{-t\Sigma}] = \exp\left\{ -\int_S \left(1 - e^{-t g(x)}\right) \mu(dx) \right\}. \qquad (2)$$

2.2. Completely random measures

Let $(\Omega, \mathcal{F}, \mathbb{P})$ be some probability space. Let $(M(S), \mathcal{B})$ be the space of all $\sigma$-finite measures on $(S, \mathcal{S})$ supplied with an appropriate $\sigma$-algebra. A completely random measure (CRM) $\Lambda$ on $(S, \mathcal{S})$ is a measurable mapping from $\Omega$ to $M(S)$ such that

1. $\mathbb{P}\{\Lambda(\emptyset) = 0\} = 1$,

2. For any disjoint countable collection of sets $A_1, A_2, \dots$, the random variables $\Lambda(A_i)$, $i = 1, 2, \dots$ are independent, and $\Lambda(\bigcup_i A_i) = \sum_i \Lambda(A_i)$ holds almost surely (the independent increments property).

An important characterization of CRMs in terms of Poisson processes is as follows (Kingman, 1967). For any CRM $\Lambda$ on $(S, \mathcal{S})$ without any fixed atoms or deterministic component, there exists a Poisson process $N$ on $(\mathbb{R}^+ \times S, \mathcal{B}_{\mathbb{R}^+} \otimes \mathcal{S})$ such that $\Lambda(dx) = \int_{\mathbb{R}^+} z\, N(dz, dx)$. Using Campbell's proposition, the Laplace transform of $\Lambda(A)$ for a measurable set $A$ is given by the following formula:

$$E[e^{-t\Lambda(A)}] = \exp\left\{ -\int_{\mathbb{R}^+ \times A} \left(1 - e^{-tz}\right) \nu(dz, dx) \right\}, \quad t \ge 0, \qquad (3)$$

where $\nu$ denotes the mean measure of the underlying Poisson process $N$. $\nu$ is also referred to as the Poisson intensity measure of $\Lambda$. If $\nu(dz, dx) = \rho(dz)\mu(dx)$, for a $\sigma$-finite measure $\mu$ on $S$ and a $\sigma$-finite measure $\rho$ on $\mathbb{R}^+$ that satisfies $\int_{\mathbb{R}^+} (1 - e^{-tz})\, \rho(dz) < \infty$, then $\Lambda(\cdot)$ is known as a homogeneous CRM. In the sequel, we assume $\mu(\cdot)$ to be finite. Moreover, unless specified, whenever we refer to a CRM, it means a homogeneous completely random measure without any fixed atoms or deterministic component.

Let $N$ be the Poisson process of the CRM $\Lambda$, that is, $\Lambda(dx) = \int_{\mathbb{R}^+} z\, N(dz, dx)$. If $\Pi$ is the random collection of points corresponding to $N$, then $\Lambda$ can equivalently be written as $\Lambda = \sum_{(z,x) \in \Pi} z\, \delta_x$. The values $\{z : (z, x) \in \Pi\}$ constitute the weights of the CRM $\Lambda$. By the mapping

Our aim is to infer the latent features $N_i$, $1 \le i \le n$, from $X_i$, $1 \le i \le n$.
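
Campbell's proposition in (2) can be checked numerically. The following Python sketch is only an illustration under assumptions that are not part of the paper: it takes $S = [0, 1]$, a mean measure $\mu$ equal to a constant rate times Lebesgue measure, and $g(x) = x$, for which the integral in (2) has the closed form $\int_0^1 (1 - e^{-tx})\,dx = 1 - (1 - e^{-t})/t$. All function names are hypothetical. The Poisson process is simulated through exponential inter-arrival times, and the Monte Carlo average of $e^{-t\Sigma}$ is compared with the right-hand side of (2).

```python
import math
import random


def sample_poisson_points(rate, rng):
    """Points of a homogeneous Poisson process on [0, 1].

    Assumption for this sketch: the mean measure is rate * Lebesgue, so the
    points can be generated from i.i.d. exponential inter-arrival times.
    """
    points = []
    position = rng.expovariate(rate)
    while position <= 1.0:
        points.append(position)
        position += rng.expovariate(rate)
    return points


def campbell_laplace_exact(rate, t):
    """Right-hand side of (2) for g(x) = x on S = [0, 1]."""
    # Closed form of \int_0^1 (1 - exp(-t x)) dx.
    integral = 1.0 - (1.0 - math.exp(-t)) / t
    return math.exp(-rate * integral)


def campbell_laplace_mc(rate, t, n_samples, rng):
    """Monte Carlo estimate of E[exp(-t * Sigma)], Sigma = sum of g over the points."""
    total = 0.0
    for _ in range(n_samples):
        sigma = sum(sample_poisson_points(rate, rng))  # g(x) = x
        total += math.exp(-t * sigma)
    return total / n_samples


rng = random.Random(0)  # fixed seed so the run is reproducible
rate, t = 3.0, 2.0
exact = campbell_laplace_exact(rate, t)
estimate = campbell_laplace_mc(rate, t, 20000, rng)
print(f"exact = {exact:.4f}, Monte Carlo = {estimate:.4f}")
```

The same comparison applies to (3) once the points carry weights: replacing `sum(points)` by a sum of the weights $z$ of a simulated CRM would estimate the Laplace transform of $\Lambda(A)$, although simulating a CRM with infinitely many atoms needs a truncation that this sketch does not attempt.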