Measure and Integration [Pdf]
Total Page:16
File Type:pdf, Size:1020Kb
Load more
										Recommended publications
									
								- 
												  Measure-Theoretic Probability IMeasure-Theoretic Probability I Steven P.Lalley Winter 2017 1 1 Measure Theory 1.1 Why Measure Theory? There are two different views – not necessarily exclusive – on what “probability” means: the subjectivist view and the frequentist view. To the subjectivist, probability is a system of laws that should govern a rational person’s behavior in situations where a bet must be placed (not necessarily just in a casino, but in situations where a decision must be made about how to proceed when only imperfect information about the outcome of the decision is available, for instance, should I allow Dr. Scissorhands to replace my arthritic knee by a plastic joint?). To the frequentist, the laws of probability describe the long- run relative frequencies of different events in “experiments” that can be repeated under roughly identical conditions, for instance, rolling a pair of dice. For the frequentist inter- pretation, it is imperative that probability spaces be large enough to allow a description of an experiment, like dice-rolling, that is repeated infinitely many times, and that the mathematical laws should permit easy handling of limits, so that one can make sense of things like “the probability that the long-run fraction of dice rolls where the two dice sum to 7 is 1/6”. But even for the subjectivist, the laws of probability should allow for description of situations where there might be a continuum of possible outcomes, or pos- sible actions to be taken. Once one is reconciled to the need for such flexibility, it soon becomes apparent that measure theory (the theory of countably additive, as opposed to merely finitely additive measures) is the only way to go.
- 
												  A Convenient Category for Higher-Order Probability TheoryA Convenient Category for Higher-Order Probability Theory Chris Heunen Ohad Kammar Sam Staton Hongseok Yang University of Edinburgh, UK University of Oxford, UK University of Oxford, UK University of Oxford, UK Abstract—Higher-order probabilistic programming languages 1 (defquery Bayesian-linear-regression allow programmers to write sophisticated models in machine 2 let let sample normal learning and statistics in a succinct and structured way, but step ( [f ( [s ( ( 0.0 3.0)) 3 sample normal outside the standard measure-theoretic formalization of proba- b ( ( 0.0 3.0))] 4 fn + * bility theory. Programs may use both higher-order functions and ( [x] ( ( s x) b)))] continuous distributions, or even define a probability distribution 5 (observe (normal (f 1.0) 0.5) 2.5) on functions. But standard probability theory does not handle 6 (observe (normal (f 2.0) 0.5) 3.8) higher-order functions well: the category of measurable spaces 7 (observe (normal (f 3.0) 0.5) 4.5) is not cartesian closed. 8 (observe (normal (f 4.0) 0.5) 6.2) Here we introduce quasi-Borel spaces. We show that these 9 (observe (normal (f 5.0) 0.5) 8.0) spaces: form a new formalization of probability theory replacing 10 (predict :f f))) measurable spaces; form a cartesian closed category and so support higher-order functions; form a well-pointed category and so support good proof principles for equational reasoning; and support continuous probability distributions. We demonstrate the use of quasi-Borel spaces for higher-order functions and proba- bility by: showing that a well-known construction of probability theory involving random functions gains a cleaner expression; and generalizing de Finetti’s theorem, that is a crucial theorem in probability theory, to quasi-Borel spaces.
- 
												  Jointly Measurable and Progressively Measurable Stochastic ProcessesJointly measurable and progressively measurable stochastic processes Jordan Bell [email protected] Department of Mathematics, University of Toronto June 18, 2015 1 Jointly measurable stochastic processes d Let E = R with Borel E , let I = R≥0, which is a topological space with the subspace topology inherited from R, and let (Ω; F ;P ) be a probability space. For a stochastic process (Xt)t2I with state space E, we say that X is jointly measurable if the map (t; !) 7! Xt(!) is measurable BI ⊗ F ! E . For ! 2 Ω, the path t 7! Xt(!) is called left-continuous if for each t 2 I, Xs(!) ! Xt(!); s " t: We prove that if the paths of a stochastic process are left-continuous then the stochastic process is jointly measurable.1 Theorem 1. If X is a stochastic process with state space E and all the paths of X are left-continuous, then X is jointly measurable. Proof. For n ≥ 1, t 2 I, and ! 2 Ω, let 1 n X Xt (!) = 1[k2−n;(k+1)2−n)(t)Xk2−n (!): k=0 n Each X is measurable BI ⊗ F ! E : for B 2 E , 1 n [ −n −n f(t; !) 2 I × Ω: Xt (!) 2 Bg = [k2 ; (k + 1)2 ) × fXk2−n 2 Bg: k=0 −n −n Let t 2 I. For each n there is a unique kn for which t 2 [kn2 ; (kn +1)2 ), and n −n −n thus Xt (!) = Xkn2 (!). Furthermore, kn2 " t, and because s 7! Xs(!) is n −n left-continuous, Xkn2 (!) ! Xt(!).
- 
												  Integration 1 Measurable FunctionsIntegration References: Bass (Real Analysis for Graduate Students), Folland (Real Analysis), Athreya and Lahiri (Measure Theory and Probability Theory). 1 Measurable Functions Let (Ω1; F1) and (Ω2; F2) be measurable spaces. Definition 1 A function T :Ω1 ! Ω2 is (F1; F2)-measurable if for every −1 E 2 F2, T (E) 2 F1. Terminology: If (Ω; F) is a measurable space and f is a real-valued func- tion on Ω, it's called F-measurable or simply measurable, if it is (F; B(<))- measurable. A function f : < ! < is called Borel measurable if the σ-algebra used on the domain and codomain is B(<). If the σ-algebra on the domain is Lebesgue, f is called Lebesgue measurable. Example 1 Measurability of a function is related to the σ-algebras that are chosen in the domain and codomain. Let Ω = f0; 1g. If the σ-algebra is P(Ω), every real valued function is measurable. Indeed, let f :Ω ! <, and E 2 B(<). It is clear that f −1(E) 2 P(Ω) (this includes the case where f −1(E) = ;). However, if F = f;; Ωg is the σ-algebra, only the constant functions are measurable. Indeed, if f(x) = a; 8x 2 Ω, then for any Borel set E containing a, f −1(E) = Ω 2 F. But if f is a function s.t. f(0) 6= f(1), then, any Borel set E containing f(0) but not f(1) will satisfy f −1(E) = f0g 2= F. 1 It is hard to check for measurability of a function using the definition, because it requires checking the preimages of all sets in F2.
- 
												  Shape Analysis, Lebesgue Integration and Absolute Continuity ConnectionsNISTIR 8217 Shape Analysis, Lebesgue Integration and Absolute Continuity Connections Javier Bernal This publication is available free of charge from: https://doi.org/10.6028/NIST.IR.8217 NISTIR 8217 Shape Analysis, Lebesgue Integration and Absolute Continuity Connections Javier Bernal Applied and Computational Mathematics Division Information Technology Laboratory This publication is available free of charge from: https://doi.org/10.6028/NIST.IR.8217 July 2018 INCLUDES UPDATES AS OF 07-18-2018; SEE APPENDIX U.S. Department of Commerce Wilbur L. Ross, Jr., Secretary National Institute of Standards and Technology Walter Copan, NIST Director and Undersecretary of Commerce for Standards and Technology ______________________________________________________________________________________________________ This Shape Analysis, Lebesgue Integration and publication Absolute Continuity Connections Javier Bernal is National Institute of Standards and Technology, available Gaithersburg, MD 20899, USA free of Abstract charge As shape analysis of the form presented in Srivastava and Klassen’s textbook “Functional and Shape Data Analysis” is intricately related to Lebesgue integration and absolute continuity, it is advantageous from: to have a good grasp of the latter two notions. Accordingly, in these notes we review basic concepts and results about Lebesgue integration https://doi.org/10.6028/NIST.IR.8217 and absolute continuity. In particular, we review fundamental results connecting them to each other and to the kind of shape analysis, or more generally, functional data analysis presented in the aforeme- tioned textbook, in the process shedding light on important aspects of all three notions. Many well-known results, especially most results about Lebesgue integration and some results about absolute conti- nuity, are presented without proofs.
- 
												  LEBESGUE MEASURE and L2 SPACE. Contents 1. Measure Spaces 1 2. Lebesgue Integration 2 3. L2 Space 4 Acknowledgments 9 ReferencesLEBESGUE MEASURE AND L2 SPACE. ANNIE WANG Abstract. This paper begins with an introduction to measure spaces and the Lebesgue theory of measure and integration. Several important theorems regarding the Lebesgue integral are then developed. Finally, we prove the completeness of the L2(µ) space and show that it is a metric space, and a Hilbert space. Contents 1. Measure Spaces 1 2. Lebesgue Integration 2 3. L2 Space 4 Acknowledgments 9 References 9 1. Measure Spaces Definition 1.1. Suppose X is a set. Then X is said to be a measure space if there exists a σ-ring M (that is, M is a nonempty family of subsets of X closed under countable unions and under complements)of subsets of X and a non-negative countably additive set function µ (called a measure) defined on M . If X 2 M, then X is said to be a measurable space. For example, let X = Rp, M the collection of Lebesgue-measurable subsets of Rp, and µ the Lebesgue measure. Another measure space can be found by taking X to be the set of all positive integers, M the collection of all subsets of X, and µ(E) the number of elements of E. We will be interested only in a special case of the measure, the Lebesgue measure. The Lebesgue measure allows us to extend the notions of length and volume to more complicated sets. Definition 1.2. Let Rp be a p-dimensional Euclidean space . We denote an interval p of R by the set of points x = (x1; :::; xp) such that (1.3) ai ≤ xi ≤ bi (i = 1; : : : ; p) Definition 1.4.
- 
												  1 Measurable Functions36-752 Advanced Probability Overview Spring 2018 2. Measurable Functions, Random Variables, and Integration Instructor: Alessandro Rinaldo Associated reading: Sec 1.5 of Ash and Dol´eans-Dade; Sec 1.3 and 1.4 of Durrett. 1 Measurable Functions 1.1 Measurable functions Measurable functions are functions that we can integrate with respect to measures in much the same way that continuous functions can be integrated \dx". Recall that the Riemann integral of a continuous function f over a bounded interval is defined as a limit of sums of lengths of subintervals times values of f on the subintervals. The measure of a set generalizes the length while elements of the σ-field generalize the intervals. Recall that a real-valued function is continuous if and only if the inverse image of every open set is open. This generalizes to the inverse image of every measurable set being measurable. Definition 1 (Measurable Functions). Let (Ω; F) and (S; A) be measurable spaces. Let f :Ω ! S be a function that satisfies f −1(A) 2 F for each A 2 A. Then we say that f is F=A-measurable. If the σ-field’s are to be understood from context, we simply say that f is measurable. Example 2. Let F = 2Ω. Then every function from Ω to a set S is measurable no matter what A is. Example 3. Let A = f?;Sg. Then every function from a set Ω to S is measurable, no matter what F is. Proving that a function is measurable is facilitated by noticing that inverse image commutes with union, complement, and intersection.
- 
												  Notes 2 : Measure-Theoretic Foundations IINotes 2 : Measure-theoretic foundations II Math 733-734: Theory of Probability Lecturer: Sebastien Roch References: [Wil91, Chapters 4-6, 8], [Dur10, Sections 1.4-1.7, 2.1]. 1 Independence 1.1 Definition of independence Let (Ω; F; P) be a probability space. DEF 2.1 (Independence) Sub-σ-algebras G1; G2;::: of F are independent for all Gi 2 Gi, i ≥ 1, and distinct i1; : : : ; in we have n Y P[Gi1 \···\ Gin ] = P[Gij ]: j=1 Specializing to events and random variables: DEF 2.2 (Independent RVs) RVs X1;X2;::: are independent if the σ-algebras σ(X1); σ(X2);::: are independent. DEF 2.3 (Independent Events) Events E1;E2;::: are independent if the σ-algebras c Ei = f;;Ei;Ei ; Ωg; i ≥ 1; are independent. The more familiar definitions are the following: THM 2.4 (Independent RVs: Familiar definition) RVs X, Y are independent if and only if for all x; y 2 R P[X ≤ x; Y ≤ y] = P[X ≤ x]P[Y ≤ y]: THM 2.5 (Independent events: Familiar definition) Events E1, E2 are indepen- dent if and only if P[E1 \ E2] = P[E1]P[E2]: 1 Lecture 2: Measure-theoretic foundations II 2 The proofs of these characterizations follows immediately from the following lemma. LEM 2.6 (Independence and π-systems) Suppose that G and H are sub-σ-algebras and that I and J are π-systems such that σ(I) = G; σ(J ) = H: Then G and H are independent if and only if I and J are, i.e., P[I \ J] = P[I]P[J]; 8I 2 I;J 2 J : Proof: Suppose I and J are independent.
- 
												  (Measure Theory for Dummies) UWEE Technical Report Number UWEETR-2006-0008A Measure Theory Tutorial (Measure Theory for Dummies) Maya R. Gupta {gupta}@ee.washington.edu Dept of EE, University of Washington Seattle WA, 98195-2500 UWEE Technical Report Number UWEETR-2006-0008 May 2006 Department of Electrical Engineering University of Washington Box 352500 Seattle, Washington 98195-2500 PHN: (206) 543-2150 FAX: (206) 543-3842 URL: http://www.ee.washington.edu A Measure Theory Tutorial (Measure Theory for Dummies) Maya R. Gupta {gupta}@ee.washington.edu Dept of EE, University of Washington Seattle WA, 98195-2500 University of Washington, Dept. of EE, UWEETR-2006-0008 May 2006 Abstract This tutorial is an informal introduction to measure theory for people who are interested in reading papers that use measure theory. The tutorial assumes one has had at least a year of college-level calculus, some graduate level exposure to random processes, and familiarity with terms like “closed” and “open.” The focus is on the terms and ideas relevant to applied probability and information theory. There are no proofs and no exercises. Measure theory is a bit like grammar, many people communicate clearly without worrying about all the details, but the details do exist and for good reasons. There are a number of great texts that do measure theory justice. This is not one of them. Rather this is a hack way to get the basic ideas down so you can read through research papers and follow what’s going on. Hopefully, you’ll get curious and excited enough about the details to check out some of the references for a deeper understanding.
- 
												  Measure Theory and ProbabilityMeasure theory and probability Alexander Grigoryan University of Bielefeld Lecture Notes, October 2007 - February 2008 Contents 1 Construction of measures 3 1.1Introductionandexamples........................... 3 1.2 σ-additive measures ............................... 5 1.3 An example of using probability theory . .................. 7 1.4Extensionofmeasurefromsemi-ringtoaring................ 8 1.5 Extension of measure to a σ-algebra...................... 11 1.5.1 σ-rings and σ-algebras......................... 11 1.5.2 Outermeasure............................. 13 1.5.3 Symmetric difference.......................... 14 1.5.4 Measurable sets . ............................ 16 1.6 σ-finitemeasures................................ 20 1.7Nullsets..................................... 23 1.8 Lebesgue measure in Rn ............................ 25 1.8.1 Productmeasure............................ 25 1.8.2 Construction of measure in Rn. .................... 26 1.9 Probability spaces ................................ 28 1.10 Independence . ................................. 29 2 Integration 38 2.1 Measurable functions.............................. 38 2.2Sequencesofmeasurablefunctions....................... 42 2.3 The Lebesgue integral for finitemeasures................... 47 2.3.1 Simplefunctions............................ 47 2.3.2 Positivemeasurablefunctions..................... 49 2.3.3 Integrablefunctions........................... 52 2.4Integrationoversubsets............................ 56 2.5 The Lebesgue integral for σ-finitemeasure.................
- 
												  1 Probability Measure and Random Variables1 Probability measure and random variables 1.1 Probability spaces and measures We will use the term experiment in a very general way to refer to some process that produces a random outcome. Definition 1. The set of possible outcomes is called the sample space. We will typically denote an individual outcome by ω and the sample space by Ω. Set notation: A B, A is a subset of B, means that every element of A is also in B. The union⊂ A B of A and B is the of all elements that are in A or B, including those that∪ are in both. The intersection A B of A and B is the set of all elements that are in both of A and B. ∩ n j=1Aj is the set of elements that are in at least one of the Aj. ∪n j=1Aj is the set of elements that are in all of the Aj. ∩∞ ∞ j=1Aj, j=1Aj are ... Two∩ sets A∪ and B are disjoint if A B = . denotes the empty set, the set with no elements. ∩ ∅ ∅ Complements: The complement of an event A, denoted Ac, is the set of outcomes (in Ω) which are not in A. Note that the book writes it as Ω A. De Morgan’s laws: \ (A B)c = Ac Bc ∪ ∩ (A B)c = Ac Bc ∩ ∪ c c ( Aj) = Aj j j [ \ c c ( Aj) = Aj j j \ [ (1) Definition 2. Let Ω be a sample space. A collection of subsets of Ω is a σ-field if F 1.
- 
												  Measurable Functions and Simple FunctionsMeasure theory class notes - 8 September 2010, class 9 1 Measurable functions and simple functions The class of all real measurable functions on (Ω, A ) is too vast to study directly. We identify ways to study them via simpler functions or collections of functions. Recall that L is the set of all measurable functions from (Ω, A ) to R and E⊆L are all the simple functions. ∞ Theorem. Suppose f ∈ L is bounded. Then there exists an increasing sequence {fn}n=1 in E whose uniform limit is f. Proof. First assume that f takes values in [0, 1). We divide [0, 1) into 2n intervals and use this to construct fn: n− 2 1 k − k k+1 fn = 1f 1 n , n 2n ([ 2 2 )) Xk=0 k k+1 k Whenever f takes a value in 2n , 2n , fn takes the value 2n . We have • For all n, fn ≤ f. 1 • For all n, x, |fn(x) − f(x)|≤ 2n . This is clear from the construction. • fn ∈E, since fn is a finite linear combination of indicator functions of sets in A . k 2k 2k+1 • fn ≤ fn+1: For any x, if fn(x)= 2n , then fn+1(x) ∈ 2n+1 , 2n+1 . ∞ So {fn}n=1 is an increasing sequence in E converging uniformly to f. Now for a general f, if the image of f lies in [a,b), then let f − a1 g = Ω b − a (note that a1Ω is the constant function a) ∞ Im g ⊆ [0, 1), so by the above we have {gn}n=1 from E increasing uniformly to g.