Advanced Probability

Total Page:16

File Type:pdf, Size:1020Kb

Advanced Probability Advanced Probability Alexander Sokol Anders Rønn-Nielsen Department of Mathematical Sciences University of Copenhagen Department of Mathematical Sciences University of Copenhagen Universitetsparken 5 DK-2100 Copenhagen Copyright 2013 Alexander Sokol & Anders Rønn-Nielsen ISBN 978-87-7078-999-8 Contents Preface v 1 Sequences of random variables 1 1.1 Measure-theoretic preliminaries . 1 1.2 Convergence of sequences of random variables . 3 1.3 Independence and Kolmogorov's zero-one law . 15 1.4 Convergence of sums of independent variables . 21 1.5 The strong law of large numbers . 24 1.6 Exercises . 30 2 Ergodicity and stationarity 35 2.1 Measure preservation, invariance and ergodicity . 35 2.2 Criteria for measure preservation and ergodicity . 40 2.3 Stationary processes and the law of large numbers . 44 2.4 Exercises . 55 3 Weak convergence 59 3.1 Weak convergence and convergence of measures . 60 3.2 Weak convergence and distribution functions . 67 3.3 Weak convergence and convergence in probability . 69 3.4 Weak convergence and characteristic functions . 72 3.5 Central limit theorems . 85 3.6 Asymptotic normality . 92 3.7 Higher dimensions . 95 3.8 Exercises . 98 4 Signed measures and conditioning 103 4.1 Decomposition of signed measures . 103 4.2 Conditional Expectations given a σ-algebra . 115 iv CONTENTS 4.3 Conditional expectations given a random variable . 124 4.4 Exercises . 128 5 Martingales 133 5.1 Introduction to martingale theory . 134 5.2 Martingales and stopping times . 137 5.3 The martingale convergence theorem . 145 5.4 Martingales and uniform integrability . 151 5.5 The martingale central limit theorem . 164 5.6 Exercises . 175 6 The Brownian motion 191 6.1 Definition and existence . 192 6.2 Continuity of the Brownian motion . 197 6.3 Variation and quadratic variation . 206 6.4 The law of the iterated logarithm . 215 6.5 Exercises . 223 7 Further reading 227 A Supplementary material 229 A.1 Limes superior and limes inferior . 229 A.2 Measure theory and real analysis . 233 A.3 Existence of sequences of random variables . 239 A.4 Exercises . 240 B Hints for exercises 241 B.1 Hints for Chapter 1 . 241 B.2 Hints for Chapter 2 . 244 B.3 Hints for Chapter 3 . 246 B.4 Hints for Chapter 4 . 248 B.5 Hints for Chapter 5 . 251 B.6 Hints for Chapter 6 . 256 B.7 Hints for Appendix A . 258 Bibliography 263 Preface The purpose of this monograph is to present a detailed introduction to selected fundamentals of modern probability theory. The focus is in particular on discrete-time and continuous-time processes, including the law of large numbers, Lindeberg's central limit theorem, martingales, the martingale convergence theorem and the martingale central limit theorem, as well as basic results on Brownian motion. The reader is assumed to have a reasonable grasp of basic analysis and measure theory, as can be obtained through Hansen (2009), Carothers (2000) or Ash (1972), for example. We have endeavoured throughout to present the material in a logical fashion, with detailed proofs allowing the reader to perceive not only the big picture of the theory, but also to understand the finer elements of the methods of proof used. Exercises are given at the end of each chapter, with hints for the exercises given in the appendix. The exercises form an important part of the monograph. We strongly recommend that any reader wishing to acquire a sound understanding of the theory spends considerable time solving the exercises. While we share the responsibility for the ultimate content of the monograph and in partic- ular any mistakes therein, much of the material is based on books and lecture notes from other individuals, in particular \Videreg˚aendesandsynlighedsregning" by Martin Jacobsen, the lecture notes on weak convergence by Søren Tolver Jensen, \Sandsynlighedsregning p˚a M˚alteoretiskGrundlag" by Ernst Hansen as well as supplementary notes by Ernst Hansen, in particular a note on the martingale central limit theorem. We are also indebted to Ketil Biering Tvermosegaard, who diligently translated the lecture notes by Martin Jacobsen and thus eased the migration of their contents to their present form in this monograph. We would like to express our gratitude to our own teachers, particularly Ernst Hansen, Martin Jacobsen and Søren Tolver Jensen, who taught us measure theory and probability theory. vi CONTENTS Also, many warm thanks go to Henrik Nygaard Jensen, who meticulously read large parts of the manuscript and gave many useful comments. Alexander Sokol Anders Rønn-Nielsen København, August 2012 Since the previous edition of the book, a number of misprints and errors have been cor- rected, and various other minor amendments have been made. We are grateful to the many students who have contributed to the monograph by identifying mistakes and suggesting improvements. Alexander Sokol Anders Rønn-Nielsen København, June 2013 Chapter 1 Sequences of random variables In this chapter, we will consider sequences of random variables and the basic results on such sequences, in particular the strong law of large numbers, which formalizes the intuitive notion that averages of independent and identically distributed events tend to the common mean. We begin in Section 1.1 by reviewing the measure-theoretic preliminaries for our later results. In Section 1.2, we discuss modes of convergence for sequences of random variables. The results given in this section are fundamental to much of the remainder of this monograph, as well as modern probability in general. In Section 1.3, we discuss the concept of independence for families of σ-algebras, and as an application, we prove the Kolmogorov zero-one law, which shows that for sequences of independent variables, events which, colloquially speaking, depend only on the tail of the sequence either have probability zero or one. In Section 1.4, we apply the results of the previous sections to prove criteria for the convergence of sums of independent variables. Finally, in Section 1.5, we prove the strong law of large numbers, arguably the most important result of this chapter. 1.1 Measure-theoretic preliminaries As noted in the introduction, we assume given a level of familiarity with basic real analysis and measure theory. Some of the main results assumed to be well-known in the following are reviewed in Appendix A. In this section, we give an independent review of some basic 2 Sequences of random variables results, and review particular notation related to probability theory. We begin by recalling some basic definitions. Let Ω be some set. A σ-algebra F on Ω is a set of subsets of Ω with the following three properties: Ω 2 F, if F 2 F then F c 2 F as 1 well, and if (Fn)n≥1 is a sequence of sets with Fn 2 F for n ≥ 1, then [n=1Fn 2 F as well. We refer to the second condition as F being stable under complements, and we refer to the third condition as F being stable under countable unions. From these stability properties, 1 it also follows that if (Fn)n≥1 is a sequence of sets in F, \n=1Fn 2 F as well. We refer to the pair (Ω; F) as a measurable space, and we refer to the elements of F as events. A probability measure P on (Ω; F) is a mapping P : F! [0; 1] such that P (;) = 0, P (Ω) = 1 P1 and whenever (Fn) is a sequence of disjoint sets in F, it holds that n=1 P (Fn) is convergent 1 P1 and P ([n=1Fn) = n=1 P (Fn). We refer to the latter property as the σ-additivity of the probability measure P . We refer to the triple (Ω; F;P ) as a probability space. Next, assume given a measure space (Ω; F), and let H be a set of subsets of Ω. We may then form the set A of all σ-algebras on Ω containing H, this is a subset of the power set of the power set of Ω. We may then define σ(H) = \F2AF, the intersection of all σ-algebras in A, that is, the intersection of all σ-algebras containing H. This is a σ-algebra as well, and it is the smallest σ-algebra on Ω containing H in the sense that for any σ-algebra G containing H, we have G 2 A and therefore σ(H) = \F2AF ⊆ G. We refer to σ(H) as the σ-algebra generated by H, and we say that H is a generating family for σ(H). Using this construction, we may define a particular σ-algebra on the Euclidean spaces: The d d Borel σ-algebra Bd on R for d ≥ 1 is the smallest σ-algebra containing all open sets in R . We denote the Borel σ-algebra on R by B. Next, let (Fn)n≥1 be a sequence of sets in F. If Fn ⊆ Fn+1 for all n ≥ 1, we say that (Fn)n≥1 is increasing. If Fn ⊇ Fn+1 for all n ≥ 1, we say that (Fn)n≥1 is decreasing. Assume that D is a set of subsets of Ω such that the following holds: Ω 2 D, if F; G 2 D with F ⊆ G 1 then G n F 2 D and if (Fn)n≥1 is an increasing sequence of sets in D then [n=1Fn 2 D. If this is the case, we say that D is a Dynkin class. Furthermore, if H is a set of subsets of Ω such that whenever F; G 2 H then F \ G 2 H, then we say that H is stable under finite intersections.
Recommended publications
  • Conditional Generalizations of Strong Laws Which Conclude the Partial Sums Converge Almost Surely1
    The Annals ofProbability 1982. Vol. 10, No.3. 828-830 CONDITIONAL GENERALIZATIONS OF STRONG LAWS WHICH CONCLUDE THE PARTIAL SUMS CONVERGE ALMOST SURELY1 By T. P. HILL Georgia Institute of Technology Suppose that for every independent sequence of random variables satis­ fying some hypothesis condition H, it follows that the partial sums converge almost surely. Then it is shown that for every arbitrarily-dependent sequence of random variables, the partial sums converge almost surely on the event where the conditional distributions (given the past) satisfy precisely the same condition H. Thus many strong laws for independent sequences may be immediately generalized into conditional results for arbitrarily-dependent sequences. 1. Introduction. Ifevery sequence ofindependent random variables having property A has property B almost surely, does every arbitrarily-dependent sequence of random variables have property B almost surely on the set where the conditional distributions have property A? Not in general, but comparisons of the conditional Borel-Cantelli Lemmas, the condi­ tional three-series theorem, and many martingale results with their independent counter­ parts suggest that the answer is affirmative in fairly general situations. The purpose ofthis note is to prove Theorem 1, which states, in part, that if "property B" is "the partial sums converge," then the answer is always affIrmative, regardless of "property A." Thus many strong laws for independent sequences (even laws yet undiscovered) may be immediately generalized into conditional results for arbitrarily-dependent sequences. 2. Main Theorem. In this note, IY = (Y1 , Y2 , ••• ) is a sequence ofrandom variables on a probability triple (n, d, !?I'), Sn = Y 1 + Y 2 + ..
    [Show full text]
  • The Fundamental Theorem of Calculus for Lebesgue Integral
    Divulgaciones Matem´aticasVol. 8 No. 1 (2000), pp. 75{85 The Fundamental Theorem of Calculus for Lebesgue Integral El Teorema Fundamental del C´alculo para la Integral de Lebesgue Di´omedesB´arcenas([email protected]) Departamento de Matem´aticas.Facultad de Ciencias. Universidad de los Andes. M´erida.Venezuela. Abstract In this paper we prove the Theorem announced in the title with- out using Vitali's Covering Lemma and have as a consequence of this approach the equivalence of this theorem with that which states that absolutely continuous functions with zero derivative almost everywhere are constant. We also prove that the decomposition of a bounded vari- ation function is unique up to a constant. Key words and phrases: Radon-Nikodym Theorem, Fundamental Theorem of Calculus, Vitali's covering Lemma. Resumen En este art´ıculose demuestra el Teorema Fundamental del C´alculo para la integral de Lebesgue sin usar el Lema del cubrimiento de Vi- tali, obteni´endosecomo consecuencia que dicho teorema es equivalente al que afirma que toda funci´onabsolutamente continua con derivada igual a cero en casi todo punto es constante. Tambi´ense prueba que la descomposici´onde una funci´onde variaci´onacotada es ´unicaa menos de una constante. Palabras y frases clave: Teorema de Radon-Nikodym, Teorema Fun- damental del C´alculo,Lema del cubrimiento de Vitali. Received: 1999/08/18. Revised: 2000/02/24. Accepted: 2000/03/01. MSC (1991): 26A24, 28A15. Supported by C.D.C.H.T-U.L.A under project C-840-97. 76 Di´omedesB´arcenas 1 Introduction The Fundamental Theorem of Calculus for Lebesgue Integral states that: A function f :[a; b] R is absolutely continuous if and only if it is ! 1 differentiable almost everywhere, its derivative f 0 L [a; b] and, for each t [a; b], 2 2 t f(t) = f(a) + f 0(s)ds: Za This theorem is extremely important in Lebesgue integration Theory and several ways of proving it are found in classical Real Analysis.
    [Show full text]
  • [Math.FA] 3 Dec 1999 Rnfrneter for Theory Transference Introduction 1 Sas Ihnrah U Twl Etetdi Eaaepaper
    Transference in Spaces of Measures Nakhl´eH. Asmar,∗ Stephen J. Montgomery–Smith,† and Sadahiro Saeki‡ 1 Introduction Transference theory for Lp spaces is a powerful tool with many fruitful applications to sin- gular integrals, ergodic theory, and spectral theory of operators [4, 5]. These methods afford a unified approach to many problems in diverse areas, which before were proved by a variety of methods. The purpose of this paper is to bring about a similar approach to spaces of measures. Our main transference result is motivated by the extensions of the classical F.&M. Riesz Theorem due to Bochner [3], Helson-Lowdenslager [10, 11], de Leeuw-Glicksberg [6], Forelli [9], and others. It might seem that these extensions should all be obtainable via transference methods, and indeed, as we will show, these are exemplary illustrations of the scope of our main result. It is not straightforward to extend the classical transference methods of Calder´on, Coif- man and Weiss to spaces of measures. First, their methods make use of averaging techniques and the amenability of the group of representations. The averaging techniques simply do not work with measures, and do not preserve analyticity. Secondly, and most importantly, their techniques require that the representation is strongly continuous. For spaces of mea- sures, this last requirement would be prohibitive, even for the simplest representations such as translations. Instead, we will introduce a much weaker requirement, which we will call ‘sup path attaining’. By working with sup path attaining representations, we are able to prove a new transference principle with interesting applications. For example, we will show how to derive with ease generalizations of Bochner’s theorem and Forelli’s main result.
    [Show full text]
  • Generalizations of the Riemann Integral: an Investigation of the Henstock Integral
    Generalizations of the Riemann Integral: An Investigation of the Henstock Integral Jonathan Wells May 15, 2011 Abstract The Henstock integral, a generalization of the Riemann integral that makes use of the δ-fine tagged partition, is studied. We first consider Lebesgue’s Criterion for Riemann Integrability, which states that a func- tion is Riemann integrable if and only if it is bounded and continuous almost everywhere, before investigating several theoretical shortcomings of the Riemann integral. Despite the inverse relationship between integra- tion and differentiation given by the Fundamental Theorem of Calculus, we find that not every derivative is Riemann integrable. We also find that the strong condition of uniform convergence must be applied to guarantee that the limit of a sequence of Riemann integrable functions remains in- tegrable. However, by slightly altering the way that tagged partitions are formed, we are able to construct a definition for the integral that allows for the integration of a much wider class of functions. We investigate sev- eral properties of this generalized Riemann integral. We also demonstrate that every derivative is Henstock integrable, and that the much looser requirements of the Monotone Convergence Theorem guarantee that the limit of a sequence of Henstock integrable functions is integrable. This paper is written without the use of Lebesgue measure theory. Acknowledgements I would like to thank Professor Patrick Keef and Professor Russell Gordon for their advice and guidance through this project. I would also like to acknowledge Kathryn Barich and Kailey Bolles for their assistance in the editing process. Introduction As the workhorse of modern analysis, the integral is without question one of the most familiar pieces of the calculus sequence.
    [Show full text]
  • Stability in the Almost Everywhere Sense: a Linear Transfer Operator Approach ∗ R
    CORE Metadata, citation and similar papers at core.ac.uk Provided by Elsevier - Publisher Connector J. Math. Anal. Appl. 368 (2010) 144–156 Contents lists available at ScienceDirect Journal of Mathematical Analysis and Applications www.elsevier.com/locate/jmaa Stability in the almost everywhere sense: A linear transfer operator approach ∗ R. Rajaram a, ,U.Vaidyab,M.Fardadc, B. Ganapathysubramanian d a Dept. of Math. Sci., 3300, Lake Rd West, Kent State University, Ashtabula, OH 44004, United States b Dept. of Elec. and Comp. Engineering, Iowa State University, Ames, IA 50011, United States c Dept. of Elec. Engineering and Comp. Sci., Syracuse University, Syracuse, NY 13244, United States d Dept. of Mechanical Engineering, Iowa State University, Ames, IA 50011, United States article info abstract Article history: The problem of almost everywhere stability of a nonlinear autonomous ordinary differential Received 20 April 2009 equation is studied using a linear transfer operator framework. The infinitesimal generator Available online 20 February 2010 of a linear transfer operator (Perron–Frobenius) is used to provide stability conditions of Submitted by H. Zwart an autonomous ordinary differential equation. It is shown that almost everywhere uniform stability of a nonlinear differential equation, is equivalent to the existence of a non-negative Keywords: Almost everywhere stability solution for a steady state advection type linear partial differential equation. We refer Advection equation to this non-negative solution, verifying almost everywhere global stability, as Lyapunov Density function density. A numerical method using finite element techniques is used for the computation of Lyapunov density. © 2010 Elsevier Inc. All rights reserved. 1. Introduction Stability analysis of an ordinary differential equation is one of the most fundamental problems in the theory of dynami- cal systems.
    [Show full text]
  • Chapter 6. Integration §1. Integrals of Nonnegative Functions Let (X, S, Μ
    Chapter 6. Integration §1. Integrals of Nonnegative Functions Let (X, S, µ) be a measure space. We denote by L+ the set of all measurable functions from X to [0, ∞]. + Let φ be a simple function in L . Suppose f(X) = {a1, . , am}. Then φ has the Pm −1 standard representation φ = j=1 ajχEj , where Ej := f ({aj}), j = 1, . , m. We define the integral of φ with respect to µ to be m Z X φ dµ := ajµ(Ej). X j=1 Pm For c ≥ 0, we have cφ = j=1(caj)χEj . It follows that m Z X Z cφ dµ = (caj)µ(Ej) = c φ dµ. X j=1 X Pm Suppose that φ = j=1 ajχEj , where aj ≥ 0 for j = 1, . , m and E1,...,Em are m Pn mutually disjoint measurable sets such that ∪j=1Ej = X. Suppose that ψ = k=1 bkχFk , where bk ≥ 0 for k = 1, . , n and F1,...,Fn are mutually disjoint measurable sets such n that ∪k=1Fk = X. Then we have m n n m X X X X φ = ajχEj ∩Fk and ψ = bkχFk∩Ej . j=1 k=1 k=1 j=1 If φ ≤ ψ, then aj ≤ bk, provided Ej ∩ Fk 6= ∅. Consequently, m m n m n n X X X X X X ajµ(Ej) = ajµ(Ej ∩ Fk) ≤ bkµ(Ej ∩ Fk) = bkµ(Fk). j=1 j=1 k=1 j=1 k=1 k=1 In particular, if φ = ψ, then m n X X ajµ(Ej) = bkµ(Fk). j=1 k=1 In general, if φ ≤ ψ, then m n Z X X Z φ dµ = ajµ(Ej) ≤ bkµ(Fk) = ψ dµ.
    [Show full text]
  • 2. Convergence Theorems
    2. Convergence theorems In this section we analyze the dynamics of integrabilty in the case when se- quences of measurable functions are considered. Roughly speaking, a “convergence theorem” states that integrability is preserved under taking limits. In other words, ∞ if one has a sequence (fn)n=1 of integrable functions, and if f is some kind of a limit of the fn’s, then we would like to conclude that f itself is integrable, as well R R as the equality f = limn→∞ fn. Such results are often employed in two instances: A. When we want to prove that some function f is integrable. In this case ∞ we would look for a sequence (fn)n=1 of integrable approximants for f. B. When we want to construct and integrable function. In this case, we will produce first the approximants, and then we will examine the existence of the limit. The first convergence result, which is somehow primite, but very useful, is the following. Lemma 2.1. Let (X, A, µ) be a finite measure space, let a ∈ (0, ∞) and let fn : X → [0, a], n ≥ 1, be a sequence of measurable functions satisfying (a) f1 ≥ f2 ≥ · · · ≥ 0; (b) limn→∞ fn(x) = 0, ∀ x ∈ X. Then one has the equality Z (1) lim fn dµ = 0. n→∞ X Proof. Let us define, for each ε > 0, and each integer n ≥ 1, the set ε An = {x ∈ X : fn(x) ≥ ε}. ε Obviously, we have An ∈ A, ∀ ε > 0, n ≥ 1. One key fact we are going to use is the following.
    [Show full text]
  • Lecture 26: Dominated Convergence Theorem
    Lecture 26: Dominated Convergence Theorem Continuation of Fatou's Lemma. + + Corollary 0.1. If f 2 L and ffn 2 L : n 2 Ng is any sequence of functions such that fn ! f almost everywhere, then Z Z f 6 lim inf fn: X X Proof. Let fn ! f everywhere in X. That is, lim inf fn(x) = f(x) (= lim sup fn also) for all x 2 X. Then, by Fatou's lemma, Z Z Z f = lim inf fn 6 lim inf fn: X X X If fn 9 f everywhere in X, then let E = fx 2 X : lim inf fn(x) 6= f(x)g. Since fn ! f almost everywhere in X, µ(E) = 0 and Z Z Z Z f = f and fn = fn 8n X X−E X X−E thus making fn 9 f everywhere in X − E. Hence, Z Z Z Z f = f 6 lim inf fn = lim inf fn: X X−E X−E X χ Example 0.2 (Strict inequality). Let Sn = [n; n + 1] ⊂ R and fn = Sn . Then, f = lim inf fn = 0 and Z Z 0 = f dµ < lim inf fn dµ = 1: R R Example 0.3 (Importance of non-negativity). Let Sn = [n; n + 1] ⊂ R and χ fn = − Sn . Then, f = lim inf fn = 0 but Z Z 0 = f dµ > lim inf fn dµ = −1: R R 1 + R Proposition 0.4. If f 2 L and X f dµ < 1, then (a) the set A = fx 2 X : f(x) = 1g is a null set and (b) the set B = fx 2 X : f(x) > 0g is σ-finite.
    [Show full text]
  • MEASURE and INTEGRATION: LECTURE 3 Riemann Integral. If S Is Simple and Measurable Then Sdµ = Αiµ(
    MEASURE AND INTEGRATION: LECTURE 3 Riemann integral. If s is simple and measurable then N � � sdµ = αiµ(Ei), X i=1 �N where s = i=1 αiχEi . If f ≥ 0, then � �� � fdµ = sup sdµ | 0 ≤ s ≤ f, s simple & measurable . X X Recall the Riemann integral of function f on interval [a, b]. Define lower and upper integrals L(f, P ) and U(f, P ), where P is a partition of [a, b]. Set − � � f = sup L(f, P ) and f = inf U(f, P ). P P − A function f is Riemann integrable ⇐⇒ − � � f = f, − in which case this common value is � f. A set B ⊂ R has measure zero if, for any � > 0, there exists a ∞ ∞ countable collection of intervals { Ii}i =1 such that B ⊂ ∪i =1Ii and �∞ i=1 λ(Ii) < �. Examples: finite sets, countable sets. There are also uncountable sets with measure zero. However, any interval does not have measure zero. Theorem 0.1. A function f is Riemann integrable if and only if f is discontinuous on a set of measure zero. A function is said to have a property (e.g., continuous) almost ev­ erywhere (abbreviated a.e.) if the set on which the property does not hold has measure zero. Thus, the statement of the theorem is that f is Riemann integrable if and only if it is continuous almost everywhere. Date: September 11, 2003. 1 2 MEASURE AND INTEGRATION: LECTURE 3 Recall positive measure: a measure function µ: M → [0, ∞] such ∞ �∞ that µ (∪i=1Ei) = i=1 µ(Ei) for Ei ∈ M disjoint.
    [Show full text]
  • Lecture Notes for Math 522 Spring 2012 (Rudin Chapter
    Lebesgue integral We will define a very powerful integral, far better than Riemann in the sense that it will allow us to integrate pretty much every reasonable function and we will also obtain strong convergence results. That is if we take a limit of integrable functions we will get an integrable function and the limit of the integrals will be the integral of the limit under very mild conditions. We will focus only on the real line, although the theory easily extends to more abstract contexts. In Riemann integral the basic block was a rectangle. If we wanted to integrate a function that was identically 1 on an interval [a; b], then the integral was simply the area of that rectangle, so 1×(b−a) = b−a. For Lebesgue integral what we want to do is to replace the interval with a more general subset of the real line. That is, if we have a set S ⊂ R and we take the indicator function or characteristic function χS defined by ( 1 if x 2 S, χ (x) = S 0 else. Then the integral of χS should really be equal to the area under the graph, which should be equal to the \size" of S. Example: Suppose that S is the set of rational numbers between 0 and 1. Let us argue that its size is 0, and so the integral of χS should be 0. Let fx1; x2;:::g = S be an enumeration of the points of S. Now for any > 0 take the sets −j−1 −j−1 Ij = (xj − 2 ; xj + 2 ); then 1 [ S ⊂ Ij: j=1 −j The \size" of any Ij should be 2 , so it seems reasonable to say that the \size" of S is less than the sum of the sizes of the Ij's.
    [Show full text]
  • A Brief Introduction to Lebesgue Theory
    CHAPTER 3 A BRIEF INTRODUCTION TO LEBESGUE THEORY Introduction The span from Newton and Leibniz to Lebesgue covers only 250 years (Figure 3.1). Lebesgue published his dissertation “Integrale,´ longueur, aire” (“Integral, length, area”) in the Annali di Matematica in 1902. Lebesgue developed “measure of a set” in the first chapter and an integral based on his measure in the second. Figure 3.1 From Newton and Leibniz to Lebesgue. Introduction to Real Analysis. By William C. Bauldry 125 Copyright c 2009 John Wiley & Sons, Inc. ⌅ 126 A BRIEF INTRODUCTION TO LEBESGUE THEORY Part of Lebesgue’s motivation were two problems that had arisen with Riemann’s integral. First, there were functions for which the integral of the derivative does not recover the original function and others for which the derivative of the integral is not the original. Second, the integral of the limit of a sequence of functions was not necessarily the limit of the integrals. We’ve seen that uniform convergence allows the interchange of limit and integral, but there are sequences that do not converge uniformly yet the limit of the integrals is equal to the integral of the limit. In Lebesgue’s own words from “Integral, length, area” (as quoted by Hochkirchen (2004, p. 272)), It thus seems to be natural to search for a definition of the integral which makes integration the inverse operation of differentiation in as large a range as possible. Lebesgue was able to combine Darboux’s work on defining the Riemann integral with Borel’s research on the “content” of a set.
    [Show full text]
  • Lecture Notes 4 36-705 1 Reminder: Convergence of Sequences 2
    Lecture Notes 4 36-705 In today's lecture we discuss the convergence of random variables. At a high-level, our first few lectures focused on non-asymptotic properties of averages i.e. the tail bounds we derived applied for any fixed sample size n. For the next few lectures we focus on asymptotic properties, i.e. we ask the question: what happens to the average of n i.i.d. random variables as n ! 1. Roughly, from a theoretical perspective the idea is that many expressions will consider- ably simplify in the asymptotic regime. Rather than have many different tail bounds, we will derive simple \universal results" that hold under extremely weak conditions. From a slightly more practical perspective, asymptotic theory is often useful to obtain approximate confidence intervals. 1 Reminder: convergence of sequences When we think of convergence of deterministic real numbers the corresponding notions are classical. Formally, we say that a sequence of real numbers a1; a2;::: converges to a fixed real number a if, for every positive number , there exists a natural number N() such that for all n ≥ N(), jan − aj < . We call a the limit of the sequence and write limn!1 an = a: Our focus today will in trying to develop analogues of this notion that apply to sequences of random variables. We will first give some definitions and then try to circle back to relate the definitions and discuss some examples. Throughout, we will focus on the setting where we have a sequence of random variables X1;:::;Xn and another random variable X, and would like to define what is means for the sequence to converge to X.
    [Show full text]