3 Random Variables & Distributions
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Arxiv:1906.06684V1 [Math.NA]
Randomized Computation of Continuous Data: Is Brownian Motion Computable?∗ Willem L. Fouch´e1, Hyunwoo Lee2, Donghyun Lim2, Sewon Park2, Matthias Schr¨oder3, Martin Ziegler2 1 University of South Africa 2 KAIST 3 University of Birmingham Abstract. We consider randomized computation of continuous data in the sense of Computable Analysis. Our first contribution formally confirms that it is no loss of generality to take as sample space the Cantor space of infinite fair coin flips. This extends [Schr¨oder&Simpson’05] and [Hoyrup&Rojas’09] considering sequences of suitably and adaptively biased coins. Our second contribution is concerned with 1D Brownian Motion (aka Wiener Process), a prob- ability distribution on the space of continuous functions f : [0, 1] → R with f(0) = 0 whose computability has been conjectured [Davie&Fouch´e’2013; arXiv:1409.4667,§6]. We establish that this (higher-type) random variable is computable iff some/every computable family of moduli of continuity (as ordinary random variables) has a computable probability distribution with respect to the Wiener Measure. 1 Introduction Randomization is a powerful technique in classical (i.e. discrete) Computer Science: Many difficult problems have turned out to admit simple solutions by algorithms that ‘roll dice’ and are efficient/correct/optimal with high probability [DKM+94,BMadHS99,CS00,BV04]. Indeed, fair coin flips have been shown computationally universal [Wal77]. Over continuous data, well-known closely connected to topology [Grz57] [Wei00, 2.2+ 3], notions of proba- § § bilistic computation are more subtle [BGH15,Col15]. 1.1 Overview Section 2 resumes from [SS06] the question of how to represent Borel probability measures. -
Set Theory, Including the Axiom of Choice) Plus the Negation of CH
ANNALS OF ~,IATltEMATICAL LOGIC - Volume 2, No. 2 (1970) pp. 143--178 INTERNAL COHEN EXTENSIONS D.A.MARTIN and R.M.SOLOVAY ;Tte RockeJ~'ller University and University (.~t CatiJbrnia. Berkeley Received 2 l)ecemt)er 1969 Introduction Cohen [ !, 2] has shown that tile continuum hypothesis (CH) cannot be proved in Zermelo-Fraenkel set theory. Levy and Solovay [9] have subsequently shown that CH cannot be proved even if one assumes the existence of a measurable cardinal. Their argument in tact shows that no large cardinal axiom of the kind present;y being considered by set theorists can yield a proof of CH (or of its negation, of course). Indeed, many set theorists - including the authors - suspect that C1t is false. But if we reject CH we admit Gurselves to be in a state of ignorance about a great many questions which CH resolves. While CH is a power- full assertion, its negation is in many ways quite weak. Sierpinski [ 1 5 ] deduces propcsitions there called C l - C82 from CH. We know of none of these propositions which is decided by the negation of CH and only one of them (C78) which is decided if one assumes in addition that a measurable cardinal exists. Among the many simple questions easily decided by CH and which cannot be decided in ZF (Zerme!o-Fraenkel set theory, including the axiom of choice) plus the negation of CH are tile following: Is every set of real numbers of cardinality less than tha't of the continuum of Lebesgue measure zero'? Is 2 ~0 < 2 ~ 1 ? Is there a non-trivial measure defined on all sets of real numbers? CIhis third question could be decided in ZF + not CH only in the unlikely event t Tile second author received support from a Sloan Foundation fellowship and tile National Science Foundation Grant (GP-8746). -
A Convenient Category for Higher-Order Probability Theory
A Convenient Category for Higher-Order Probability Theory Chris Heunen Ohad Kammar Sam Staton Hongseok Yang University of Edinburgh, UK University of Oxford, UK University of Oxford, UK University of Oxford, UK Abstract—Higher-order probabilistic programming languages 1 (defquery Bayesian-linear-regression allow programmers to write sophisticated models in machine 2 let let sample normal learning and statistics in a succinct and structured way, but step ( [f ( [s ( ( 0.0 3.0)) 3 sample normal outside the standard measure-theoretic formalization of proba- b ( ( 0.0 3.0))] 4 fn + * bility theory. Programs may use both higher-order functions and ( [x] ( ( s x) b)))] continuous distributions, or even define a probability distribution 5 (observe (normal (f 1.0) 0.5) 2.5) on functions. But standard probability theory does not handle 6 (observe (normal (f 2.0) 0.5) 3.8) higher-order functions well: the category of measurable spaces 7 (observe (normal (f 3.0) 0.5) 4.5) is not cartesian closed. 8 (observe (normal (f 4.0) 0.5) 6.2) Here we introduce quasi-Borel spaces. We show that these 9 (observe (normal (f 5.0) 0.5) 8.0) spaces: form a new formalization of probability theory replacing 10 (predict :f f))) measurable spaces; form a cartesian closed category and so support higher-order functions; form a well-pointed category and so support good proof principles for equational reasoning; and support continuous probability distributions. We demonstrate the use of quasi-Borel spaces for higher-order functions and proba- bility by: showing that a well-known construction of probability theory involving random functions gains a cleaner expression; and generalizing de Finetti’s theorem, that is a crucial theorem in probability theory, to quasi-Borel spaces. -
Jointly Measurable and Progressively Measurable Stochastic Processes
Jointly measurable and progressively measurable stochastic processes Jordan Bell [email protected] Department of Mathematics, University of Toronto June 18, 2015 1 Jointly measurable stochastic processes d Let E = R with Borel E , let I = R≥0, which is a topological space with the subspace topology inherited from R, and let (Ω; F ;P ) be a probability space. For a stochastic process (Xt)t2I with state space E, we say that X is jointly measurable if the map (t; !) 7! Xt(!) is measurable BI ⊗ F ! E . For ! 2 Ω, the path t 7! Xt(!) is called left-continuous if for each t 2 I, Xs(!) ! Xt(!); s " t: We prove that if the paths of a stochastic process are left-continuous then the stochastic process is jointly measurable.1 Theorem 1. If X is a stochastic process with state space E and all the paths of X are left-continuous, then X is jointly measurable. Proof. For n ≥ 1, t 2 I, and ! 2 Ω, let 1 n X Xt (!) = 1[k2−n;(k+1)2−n)(t)Xk2−n (!): k=0 n Each X is measurable BI ⊗ F ! E : for B 2 E , 1 n [ −n −n f(t; !) 2 I × Ω: Xt (!) 2 Bg = [k2 ; (k + 1)2 ) × fXk2−n 2 Bg: k=0 −n −n Let t 2 I. For each n there is a unique kn for which t 2 [kn2 ; (kn +1)2 ), and n −n −n thus Xt (!) = Xkn2 (!). Furthermore, kn2 " t, and because s 7! Xs(!) is n −n left-continuous, Xkn2 (!) ! Xt(!). -
A Test for Singularity 1
STATISTIII & PROBABIUTY Lk'TTERS ELSEVIER Statistics & Probability Letters 40 (1998) 215-226 A test for singularity 1 Attila Frigyesi *, Ola H6ssjer Institute for Matematisk Statistik, Lunds University, Box 118, 22100 Lund, Sweden Received March 1997; received in revised form February 1998 Abstract The LP-norm and other functionals of the kernel estimate of density functions (as functions of the bandwidth) are studied as means to test singularity. Given an absolutely continuous distribution function, the LP-norm, and the other studied functionals, will converge under some mild side-constraints as the bandwidth decreases. In the singular case, the functionals will diverge. These properties may be used to test whether data comes from an arbitrarily large class of absolutely continuous distributions or not. (~) 1998 Elsevier Science B.V. All rights reserved AMS classification." primary 60E05; secondary 28A99; 62G07 Keywords: Singularity; Absolute continuity; Kernel density estimation 1. Introduction In this paper we aim to test whether a given set of data (= {XI ..... X,}), with a marginal distribution #, stems from an absolutely continuous distribution function or from a singular one. A reference to this problem is by Donoho (1988). Consider a probability measure # on (g~P,~(~P)), where ~(~P) is the Borel a-algebra on R p. The measure tt is said to be singular with respect to the Lebesgue measure 2 if there exists a measurable set A such that /~(AC)=0 and 2(A)=0. If we only require #(A)>0 and 2(A)=0, then /~ is said to have a singular part. If tt does not have a singular part, then it is absolutely continuous with respect to 2. -
Lecture Notes
MEASURE THEORY AND STOCHASTIC PROCESSES Lecture Notes José Melo, Susana Cruz Faculdade de Engenharia da Universidade do Porto Programa Doutoral em Engenharia Electrotécnica e de Computadores March 2011 Contents 1 Probability Space3 1 Sample space Ω .....................................4 2 σ-eld F .........................................4 3 Probability Measure P .................................6 3.1 Measure µ ....................................6 3.2 Probability Measure P .............................7 4 Learning Objectives..................................7 5 Appendix........................................8 2 Chapter 1 Probability Space Let's consider the experience of throwing a dart on a circular target with radius r (assuming the dart always hits the target), divided in 4 dierent areas as illustrated in Figure 1.1. 4 3 2 1 Figure 1.1: Circular Target The circles that bound the regions 1, 2, 3, and 4, have radius of, respectively, r , r , 3r , and . 4 2 4 r Therefore, the probability that a dart lands in each region is: 1 , 3 , 5 , P (1) = 16 P (2) = 16 P (3) = 16 7 . P (4) = 16 For this kind of problems, the theory of discrete probability spaces suces. However, when it comes to problems involving either an innitely repeated operation or an innitely ne op- eration, this mathematical framework does not apply. This motivates the introduction of a measure-theoretic probability approach to correctly describe those cases. We dene the proba- bility space (Ω; F;P ), where Ω is the sample space, F is the event space, and P is the probability 3 measure. Each of them will be described in the following subsections. 1 Sample space Ω The sample space Ω is the set of all the possible results or outcomes ! of an experiment or observation. -
Vector Spaces
ApPENDIX A Vector Spaces This appendix reviews some of the basic definitions and properties of vectm spaces. It is presumed that, with the possible exception of Theorem A.14, all of the material presented here is familiar to the reader. Definition A.I. A set.A c Rn is a vector space if, for any x, y, Z E .A and scalars a, 13, operations of vector addition and scalar multiplication are defined such that: (1) (x + y) + Z = x + (y + z). (2) x + y = y + x. (3) There exists a vector 0 E.A such that x + 0 = x = 0 + x. (4) For any XE.A there exists y = -x such that x + y = 0 = y + x. (5) a(x + y) = ax + ay. (6) (a + f3)x = ax + f3x. (7) (af3)x = a(f3x). (8) 1· x = x. Definition A.2. Let .A be a vector space and let JV be a set with JV c .A. JV is a subspace of .A if and only if JV is a vector space. Theorem A.3. Let .A be a vector space and let JV be a nonempty subset of .A. If JV is closed under addition and scalar multiplication, then JV is a subspace of .A. Theorem A.4. Let .A be a vector space and let Xl' ••• , Xr be in .A. The set of all linear combinations of Xl' .•• ,xr is a subspace of .A. Appendix A. Vector Spaces 325 Definition A.S. The set of all linear combinations of Xl'OO',X, is called the space spanned by Xl' .. -
Borel Structure in Groups and Their Dualso
BOREL STRUCTURE IN GROUPS AND THEIR DUALSO GEORGE W. MACKEY Introduction. In the past decade or so much work has been done toward extending the classical theory of finite dimensional representations of com- pact groups to a theory of (not necessarily finite dimensional) unitary repre- sentations of locally compact groups. Among the obstacles interfering with various aspects of this program is the lack of a suitable natural topology in the "dual object"; that is in the set of equivalence classes of irreducible representations. One can introduce natural topologies but none of them seem to have reasonable properties except in extremely special cases. When the group is abelian for example the dual object itself is a locally compact abelian group. This paper is based on the observation that for certain purposes one can dispense with a topology in the dual object in favor of a "weaker struc- ture" and that there is a wide class of groups for which this weaker structure has very regular properties. If .S is any topological space one defines a Borel (or Baire) subset of 5 to be a member of the smallest family of subsets of 5 which includes the open sets and is closed with respect to the formation of complements and countable unions. The structure defined in 5 by its Borel sets we may call the Borel structure of 5. It is weaker than the topological structure in the sense that any one-to-one transformation of S onto 5 which preserves the topological structure also preserves the Borel structure whereas the converse is usually false. -
Shape Analysis, Lebesgue Integration and Absolute Continuity Connections
NISTIR 8217 Shape Analysis, Lebesgue Integration and Absolute Continuity Connections Javier Bernal This publication is available free of charge from: https://doi.org/10.6028/NIST.IR.8217 NISTIR 8217 Shape Analysis, Lebesgue Integration and Absolute Continuity Connections Javier Bernal Applied and Computational Mathematics Division Information Technology Laboratory This publication is available free of charge from: https://doi.org/10.6028/NIST.IR.8217 July 2018 INCLUDES UPDATES AS OF 07-18-2018; SEE APPENDIX U.S. Department of Commerce Wilbur L. Ross, Jr., Secretary National Institute of Standards and Technology Walter Copan, NIST Director and Undersecretary of Commerce for Standards and Technology ______________________________________________________________________________________________________ This Shape Analysis, Lebesgue Integration and publication Absolute Continuity Connections Javier Bernal is National Institute of Standards and Technology, available Gaithersburg, MD 20899, USA free of Abstract charge As shape analysis of the form presented in Srivastava and Klassen’s textbook “Functional and Shape Data Analysis” is intricately related to Lebesgue integration and absolute continuity, it is advantageous from: to have a good grasp of the latter two notions. Accordingly, in these notes we review basic concepts and results about Lebesgue integration https://doi.org/10.6028/NIST.IR.8217 and absolute continuity. In particular, we review fundamental results connecting them to each other and to the kind of shape analysis, or more generally, functional data analysis presented in the aforeme- tioned textbook, in the process shedding light on important aspects of all three notions. Many well-known results, especially most results about Lebesgue integration and some results about absolute conti- nuity, are presented without proofs. -
Randomized Algorithms
Chapter 9 Randomized Algorithms randomized The theme of this chapter is madendrizo algorithms. These are algorithms that make use of randomness in their computation. You might know of quicksort, which is efficient on average when it uses a random pivot, but can be bad for any pivot that is selected without randomness. Analyzing randomized algorithms can be difficult, so you might wonder why randomiza- tion in algorithms is so important and worth the extra effort. Well it turns out that for certain problems randomized algorithms are simpler or faster than algorithms that do not use random- ness. The problem of primality testing (PT), which is to determine if an integer is prime, is a good example. In the late 70s Miller and Rabin developed a famous and simple random- ized algorithm for the problem that only requires polynomial work. For over 20 years it was not known whether the problem could be solved in polynomial work without randomization. Eventually a polynomial time algorithm was developed, but it is much more complicated and computationally more costly than the randomized version. Hence in practice everyone still uses the randomized version. There are many other problems in which a randomized solution is simpler or cheaper than the best non-randomized solution. In this chapter, after covering the prerequisite background, we will consider some such problems. The first we will consider is the following simple prob- lem: Question: How many comparisons do we need to find the top two largest numbers in a sequence of n distinct numbers? Without the help of randomization, there is a trivial algorithm for finding the top two largest numbers in a sequence that requires about 2n − 3 comparisons. -
MA3K0 - High-Dimensional Probability
MA3K0 - High-Dimensional Probability Lecture Notes Stefan Adams i 2020, update 06.05.2020 Notes are in final version - proofreading not completed yet! Typos and errors will be updated on a regular basis Contents 1 Prelimaries on Probability Theory 1 1.1 Random variables . .1 1.2 Classical Inequalities . .4 1.3 Limit Theorems . .6 2 Concentration inequalities for independent random variables 8 2.1 Why concentration inequalities . .8 2.2 Hoeffding’s Inequality . 10 2.3 Chernoff’s Inequality . 13 2.4 Sub-Gaussian random variables . 15 2.5 Sub-Exponential random variables . 22 3 Random vectors in High Dimensions 27 3.1 Concentration of the Euclidean norm . 27 3.2 Covariance matrices and Principal Component Analysis (PCA) . 30 3.3 Examples of High-Dimensional distributions . 32 3.4 Sub-Gaussian random variables in higher dimensions . 34 3.5 Application: Grothendieck’s inequality . 36 4 Random Matrices 36 4.1 Geometrics concepts . 36 4.2 Concentration of the operator norm of random matrices . 38 4.3 Application: Community Detection in Networks . 41 4.4 Application: Covariance Estimation and Clustering . 41 5 Concentration of measure - general case 43 5.1 Concentration by entropic techniques . 43 5.2 Concentration via Isoperimetric Inequalities . 50 5.3 Some matrix calculus and covariance estimation . 54 6 Basic tools in high-dimensional probability 57 6.1 Decoupling . 57 6.2 Concentration for Anisotropic random vectors . 62 6.3 Symmetrisation . 63 7 Random Processes 64 ii Preface Introduction We discuss an elegant argument that showcases the -
LEBESGUE MEASURE and L2 SPACE. Contents 1. Measure Spaces 1 2. Lebesgue Integration 2 3. L2 Space 4 Acknowledgments 9 References
LEBESGUE MEASURE AND L2 SPACE. ANNIE WANG Abstract. This paper begins with an introduction to measure spaces and the Lebesgue theory of measure and integration. Several important theorems regarding the Lebesgue integral are then developed. Finally, we prove the completeness of the L2(µ) space and show that it is a metric space, and a Hilbert space. Contents 1. Measure Spaces 1 2. Lebesgue Integration 2 3. L2 Space 4 Acknowledgments 9 References 9 1. Measure Spaces Definition 1.1. Suppose X is a set. Then X is said to be a measure space if there exists a σ-ring M (that is, M is a nonempty family of subsets of X closed under countable unions and under complements)of subsets of X and a non-negative countably additive set function µ (called a measure) defined on M . If X 2 M, then X is said to be a measurable space. For example, let X = Rp, M the collection of Lebesgue-measurable subsets of Rp, and µ the Lebesgue measure. Another measure space can be found by taking X to be the set of all positive integers, M the collection of all subsets of X, and µ(E) the number of elements of E. We will be interested only in a special case of the measure, the Lebesgue measure. The Lebesgue measure allows us to extend the notions of length and volume to more complicated sets. Definition 1.2. Let Rp be a p-dimensional Euclidean space . We denote an interval p of R by the set of points x = (x1; :::; xp) such that (1.3) ai ≤ xi ≤ bi (i = 1; : : : ; p) Definition 1.4.