Math.CO] 8 Mar 2018 Fsbeune N Hi Re.Freape Osdrtep the Consider Example, for Order
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Laws of Total Expectation and Total Variance
Laws of Total Expectation and Total Variance Definition of conditional density. Assume and arbitrary random variable X with density fX . Take an event A with P (A) > 0. Then the conditional density fXjA is defined as follows: 8 < f(x) 2 P (A) x A fXjA(x) = : 0 x2 = A Note that the support of fXjA is supported only in A. Definition of conditional expectation conditioned on an event. Z Z 1 E(h(X)jA) = h(x) fXjA(x) dx = h(x) fX (x) dx A P (A) A Example. For the random variable X with density function 8 < 1 3 64 x 0 < x < 4 f(x) = : 0 otherwise calculate E(X2 j X ≥ 1). Solution. Step 1. Z h i 4 1 1 1 x=4 255 P (X ≥ 1) = x3 dx = x4 = 1 64 256 4 x=1 256 Step 2. Z Z ( ) 1 256 4 1 E(X2 j X ≥ 1) = x2 f(x) dx = x2 x3 dx P (X ≥ 1) fx≥1g 255 1 64 Z ( ) ( )( ) h i 256 4 1 256 1 1 x=4 8192 = x5 dx = x6 = 255 1 64 255 64 6 x=1 765 Definition of conditional variance conditioned on an event. Var(XjA) = E(X2jA) − E(XjA)2 1 Example. For the previous example , calculate the conditional variance Var(XjX ≥ 1) Solution. We already calculated E(X2 j X ≥ 1). We only need to calculate E(X j X ≥ 1). Z Z Z ( ) 1 256 4 1 E(X j X ≥ 1) = x f(x) dx = x x3 dx P (X ≥ 1) fx≥1g 255 1 64 Z ( ) ( )( ) h i 256 4 1 256 1 1 x=4 4096 = x4 dx = x5 = 255 1 64 255 64 5 x=1 1275 Finally: ( ) 8192 4096 2 630784 Var(XjX ≥ 1) = E(X2jX ≥ 1) − E(XjX ≥ 1)2 = − = 765 1275 1625625 Definition of conditional expectation conditioned on a random variable. -
Probability Cheatsheet V2.0 Thinking Conditionally Law of Total Probability (LOTP)
Probability Cheatsheet v2.0 Thinking Conditionally Law of Total Probability (LOTP) Let B1;B2;B3; :::Bn be a partition of the sample space (i.e., they are Compiled by William Chen (http://wzchen.com) and Joe Blitzstein, Independence disjoint and their union is the entire sample space). with contributions from Sebastian Chiu, Yuan Jiang, Yuqi Hou, and Independent Events A and B are independent if knowing whether P (A) = P (AjB )P (B ) + P (AjB )P (B ) + ··· + P (AjB )P (B ) Jessy Hwang. Material based on Joe Blitzstein's (@stat110) lectures 1 1 2 2 n n A occurred gives no information about whether B occurred. More (http://stat110.net) and Blitzstein/Hwang's Introduction to P (A) = P (A \ B1) + P (A \ B2) + ··· + P (A \ Bn) formally, A and B (which have nonzero probability) are independent if Probability textbook (http://bit.ly/introprobability). Licensed and only if one of the following equivalent statements holds: For LOTP with extra conditioning, just add in another event C! under CC BY-NC-SA 4.0. Please share comments, suggestions, and errors at http://github.com/wzchen/probability_cheatsheet. P (A \ B) = P (A)P (B) P (AjC) = P (AjB1;C)P (B1jC) + ··· + P (AjBn;C)P (BnjC) P (AjB) = P (A) P (AjC) = P (A \ B1jC) + P (A \ B2jC) + ··· + P (A \ BnjC) P (BjA) = P (B) Last Updated September 4, 2015 Special case of LOTP with B and Bc as partition: Conditional Independence A and B are conditionally independent P (A) = P (AjB)P (B) + P (AjBc)P (Bc) given C if P (A \ BjC) = P (AjC)P (BjC). -
Probability Cheatsheet
Probability Cheatsheet v1.1.1 Simpson's Paradox Expected Value, Linearity, and Symmetry P (A j B; C) < P (A j Bc;C) and P (A j B; Cc) < P (A j Bc;Cc) Expected Value (aka mean, expectation, or average) can be thought Compiled by William Chen (http://wzchen.com) with contributions yet still, P (A j B) > P (A j Bc) of as the \weighted average" of the possible outcomes of our random from Sebastian Chiu, Yuan Jiang, Yuqi Hou, and Jessy Hwang. variable. Mathematically, if x1; x2; x3;::: are all of the possible values Material based off of Joe Blitzstein's (@stat110) lectures Bayes' Rule and Law of Total Probability that X can take, the expected value of X can be calculated as follows: P (http://stat110.net) and Blitzstein/Hwang's Intro to Probability E(X) = xiP (X = xi) textbook (http://bit.ly/introprobability). Licensed under CC i Law of Total Probability with partitioning set B1; B2; B3; :::Bn and BY-NC-SA 4.0. Please share comments, suggestions, and errors at with extra conditioning (just add C!) Note that for any X and Y , a and b scaling coefficients and c is our http://github.com/wzchen/probability_cheatsheet. constant, the following property of Linearity of Expectation holds: P (A) = P (AjB1)P (B1) + P (AjB2)P (B2) + :::P (AjBn)P (Bn) Last Updated March 20, 2015 P (A) = P (A \ B1) + P (A \ B2) + :::P (A \ Bn) E(aX + bY + c) = aE(X) + bE(Y ) + c P (AjC) = P (AjB1; C)P (B1jC) + :::P (AjBn; C)P (BnjC) If two Random Variables have the same distribution, even when they are dependent by the property of Symmetry their expected values P (AjC) = P (A \ B jC) + P (A \ B jC) + :::P (A \ B jC) Counting 1 2 n are equal. -
(Introduction to Probability at an Advanced Level) - All Lecture Notes
Fall 2018 Statistics 201A (Introduction to Probability at an advanced level) - All Lecture Notes Aditya Guntuboyina August 15, 2020 Contents 0.1 Sample spaces, Events, Probability.................................5 0.2 Conditional Probability and Independence.............................6 0.3 Random Variables..........................................7 1 Random Variables, Expectation and Variance8 1.1 Expectations of Random Variables.................................9 1.2 Variance................................................ 10 2 Independence of Random Variables 11 3 Common Distributions 11 3.1 Ber(p) Distribution......................................... 11 3.2 Bin(n; p) Distribution........................................ 11 3.3 Poisson Distribution......................................... 12 4 Covariance, Correlation and Regression 14 5 Correlation and Regression 16 6 Back to Common Distributions 16 6.1 Geometric Distribution........................................ 16 6.2 Negative Binomial Distribution................................... 17 7 Continuous Distributions 17 7.1 Normal or Gaussian Distribution.................................. 17 1 7.2 Uniform Distribution......................................... 18 7.3 The Exponential Density...................................... 18 7.4 The Gamma Density......................................... 18 8 Variable Transformations 19 9 Distribution Functions and the Quantile Transform 20 10 Joint Densities 22 11 Joint Densities under Transformations 23 11.1 Detour to Convolutions...................................... -
On the Method of Moments for Estimation in Latent Variable Models Anastasia Podosinnikova
On the Method of Moments for Estimation in Latent Variable Models Anastasia Podosinnikova To cite this version: Anastasia Podosinnikova. On the Method of Moments for Estimation in Latent Variable Models. Machine Learning [cs.LG]. Ecole Normale Superieure de Paris - ENS Paris, 2016. English. tel- 01489260v1 HAL Id: tel-01489260 https://tel.archives-ouvertes.fr/tel-01489260v1 Submitted on 14 Mar 2017 (v1), last revised 5 Apr 2018 (v2) HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. THÈSE DE DOCTORAT de l’Université de recherche Paris Sciences Lettres – PSL Research University préparée à l’École normale supérieure Sur la méthode des mo- École doctorale n°386 ments pour d’estimation Spécialité: Informatique des modèles à variable latentes Soutenue le 01.12.2016 On the method of moments for es- timation in latent variable models par Anastasia Podosinnikova Composition du Jury : Mme Animashree ANANDKUMAR University of California Irvine Rapporteur M Samuel KASKI Aalto University Rapporteur M Francis BACH École normale supérieure Directeur de thèse M Simon LACOSTE-JULIEN École normale supérieure Directeur de thèse M Alexandre D’ASPREMONT École normale supérieure Membre du Jury M Pierre COMON CNRS Membre du Jury M Rémi GRIBONVAL INRIA Membre du Jury Abstract Latent variable models are powerful probabilistic tools for extracting useful latent structure from otherwise unstructured data and have proved useful in numerous ap- plications such as natural language processing and computer vision. -
Identifying Sources of Variation and the Flow of Information in Biochemical
Identifying sources of variation and the flow PNAS PLUS of information in biochemical networks Clive G. Bowshera,1 and Peter S. Swainb,1 aSchool of Mathematics, University of Bristol, Bristol BS8 1TW, United Kingdom; and bSynthSys—Synthetic and Systems Biology, University of Edinburgh, Edinburgh EH9 3JD, United Kingdom Edited by Ken A. Dill, Stony Brook University, Stony Brook, NY, and approved March 6, 2012 (received for review November 24, 2011) To understand how cells control and exploit biochemical fluctua- Each Y could be, for example, the number of molecules of another tions, we must identify the sources of stochasticity, quantify their biochemical species, a property of the intra- or extracellular envir- effects, and distinguish informative variation from confounding onment, a characteristic of cell morphology (8), a reaction rate “noise.” We present an analysis that allows fluctuations of bio- that depends on the concentration of a cellular component such chemical networks to be decomposed into multiple components, as ATP, or even the number of times a particular type of reaction Y gives conditions for the design of experimental reporters to mea- has occurred. We emphasize that the variables are the stochastic sure all components, and provides a technique to predict the mag- variables whose effects are of interest: They are not all possible sources of stochasticity. nitude of these components from models. Further, we identify a Y Y Y particular component of variation that can be used to quantify the We wish to determine how fluctuations in 1, 2, and 3 af- fect fluctuations in Z, the output of the network. -
Methods of Monte Carlo Simulation
Methods of Monte Carlo Simulation Ulm University Institute of Stochastics Lecture Notes Dr. Tim Brereton Winter Term 2015 – 2016 Ulm 2 Contents 1 Introduction 5 1.1 Somesimpleexamples .............................. .. 5 1.1.1 Example1................................... 5 1.1.2 Example2................................... 7 2 Generating Uniform Random Numbers 9 2.1 Howdowemakerandomnumbers? . 9 2.1.1 Physicalgeneration . .. 9 2.1.2 Pseudo-randomnumbers . 10 2.2 Pseudo-randomnumbers . ... 10 2.2.1 Abstractsetting................................ 10 2.2.2 LinearCongruentialGenerators . ..... 10 2.2.3 ImprovementsonLCGs ........................... 12 3 Non-Uniform Random Variable Generation 15 3.1 TheInverseTransform ............................. ... 15 3.1.1 Thetablelookupmethod . 17 3.2 Integration on Various Domains and Properties of Estimators .......... 18 3.2.1 Integration over (0,b)............................. 18 3.2.2 Propertiesofestimators . ... 19 3.2.3 Integrationoverunboundeddomains . ..... 20 3.3 TheAcceptance-RejectionMethod . ..... 21 3.3.1 Thecontinuouscase ............................. 21 3.3.2 Thediscretecase ............................... 25 3.3.3 Multivariateacceptance-rejection . ........ 26 3.3.4 Drawing uniformly from complicated sets . ....... 27 3.3.5 The limitations of high-dimensional acceptance-rejection ......... 28 3.4 Location-ScaleFamilies. ...... 29 3.5 GeneratingNormalRandomVariables . ..... 30 3.5.1 Box-Mullermethod.............................. 31 3.6 Big O notation .................................... 31 -
Overdispersed Models for Claim Count Distribution
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by DSpace at Tartu University Library TARTU UNIVERSITY FACULTY OF MATHEMATICS AND COMPUTER SCIENCE Institute of Mathematical Statistics Frazier Carsten Overdispersed Models for Claim Count Distribution Master’s Thesis Supervisor: Meelis K¨a¨arik, Ph.D TARTU 2013 Contents 1 Introduction 3 2 Classical Collective Risk Model 4 2.1 Properties ................................ 4 2.2 CompoundPoissonModel . 8 3 Compound Poisson Model with Different Insurance Periods 11 4 Overdispersed Models 14 4.1 Introduction............................... 14 4.2 CausesofOverdispersion. 15 4.3 Overdispersion in the Natural Exponential Family . .... 17 5 Handling Overdispersion in a More General Framework 22 5.1 MixedPoissonModel.......................... 23 5.2 NegativeBinomialModel. 24 6 Practical Applications of the Overdispersed Poisson Model 28 Kokkuv˜ote (eesti keeles) 38 References 40 Appendices 41 A Proofs 41 B Program codes 44 2 1 Introduction Constructing models to predict future loss events is a fundamental duty of actu- aries. However, large amounts of information are needed to derive such a model. When considering many similar data points (e.g., similar insurance policies or in- dividual claims), it is reasonable to create a collective risk model, which deals with all of these policies/claims together, rather than treating each one separately. By forming a collective risk model, it is possible to assess the expected activity of each individual policy. This information can then be used to calculate premiums (see, e.g., Gray & Pitts, 2012). There are several classical models that are commonly used to model the number of claims in a given time period. -
2017 Timo Koski Department of Math
Lecture Notes: Probability and Random Processes at KTH for sf2940 Probability Theory Edition: 2017 Timo Koski Department of Mathematics KTH Royal Institute of Technology Stockholm, Sweden 2 Contents Foreword 9 1 Probability Spaces and Random Variables 11 1.1 Introduction...................................... .......... 11 1.2 Terminology and Notations in Elementary Set Theory . .............. 11 1.3 AlgebrasofSets.................................... .......... 14 1.4 ProbabilitySpace.................................... ......... 19 1.4.1 Probability Measures . ...... 19 1.4.2 Continuity from below and Continuity from above . .......... 21 1.4.3 Why Do We Need Sigma-Fields? . .... 23 1.4.4 P - Negligible Events and P -AlmostSureProperties . 25 1.5 Random Variables and Distribution Functions . ............ 25 1.5.1 Randomness?..................................... ...... 25 1.5.2 Random Variables and Sigma Fields Generated by Random Variables ........... 26 1.5.3 Distribution Functions . ...... 28 1.6 Independence of Random Variables and Sigma Fields, I.I.D. r.v.’s . ............... 29 1.7 TheBorel-CantelliLemmas ............................. .......... 30 1.8 ExpectedValueofaRandomVariable . ........... 32 1.8.1 A First Definition and Some Developments . ........ 32 1.8.2 TheGeneralDefinition............................... ....... 34 1.8.3 The Law of the Unconscious Statistician . ......... 34 1.8.4 Three Inequalities for Expectations . ........... 35 1.8.5 Limits and Integrals . ...... 37 1.9 Appendix: lim sup xn and lim inf xn ................................. -
Section 1: Basic Probability Theory
||||| Course Overview Three sections: I Monday 2:30pm-3:45pm Wednesday 1:00pm-2:15pm Thursday 2:30pm-3:45pm Goal: I Review some basic (or not so basic) concepts in probability and statistics I Good preparation for CME 308 Syllabus: I Basic probability, including random variables, conditional distribution, moments, concentration inequality I Convergence concepts, including three types of convergence, WLLN, CLT, delta method I Statistical inference, including fundamental concepts in inference, point estimation, MLE Section 1: Basic probability theory Dangna Li ICME September 14, 2015 Outline Random Variables and Distributions Expectation, Variance and Covariance Definitions Key properties Conditional Expectation and Conditional Variance Definitions Key properties Concentration Inequality Markov inequality Chebyshev inequality Chernoff bound Random variables: discrete and continuous I For our purposes, random variables will be one of two types: discrete or continuous. I A random variable X is discrete if its set of possible values X is finite or countably infinite. I A random variable X is continuous if its possible values form an uncountable set (e.g., some interval on R) and the probability that X equals any such value exactly is zero. I Examples: I Discrete: binomial, geometric, Poisson, and discrete uniform random variables I Continuous: normal, exponential, beta, gamma, chi-squared, Student's t, and continuous uniform random variables The probability density (mass) function I pmf: The probability mass function (pmf) of a discrete random variable X is a nonnegative function f (x) = P(X = x), where x denotes each possible value that X can take. It is always true that P f (x) = 1: x2X I pdf: The probability density function (pdf) of a continuous random variable X is a nonnegative function f (x) such that R b a f (x)dx = P(a ≤X≤ b) for any a; b 2 R. -
COURSE NOTES STATS 325 Stochastic Processes
COURSE NOTES STATS 325 Stochastic Processes Department of Statistics University of Auckland Contents 1. Stochastic Processes 4 1.1 Revision: Sample spaces and random variables . .... 8 1.2 StochasticProcesses ............................... .. 9 2. Probability 16 2.1 Samplespacesandevents ............................. 16 2.2 Probability Reference List . 17 2.3 Conditional Probability . 18 2.4 The Partition Theorem (Law of Total Probability) . ..... 23 2.5 Bayes’ Theorem: inverting conditional probabilities . ...... 25 2.6 First-Step Analysis for calculating probabilities in a process . ...... 28 2.7 Special Process: the Gambler’s Ruin . .. 32 2.8 Independence ..................................... 35 2.9 TheContinuityTheorem............................... 36 2.10RandomVariables................................... 38 2.11 ContinuousRandomVariables . .. 39 2.12 DiscreteRandomVariables. .. 41 2.13 IndependentRandomVariables . ... 43 3. Expectation and Variance 44 3.1 Expectation ...................................... 45 3.2 Variance, covariance, and correlation . ..... 48 3.3 Conditional Expectation and Conditional Variance . ..... 51 3.4 Examples of Conditional Expectation and Variance . ..... 57 3.5 First-Step Analysis for calculating expected reaching times . ........ 63 3.6 Probability as a conditional expectation . ... 67 3.7 Specialprocess: amodelforgenespread . ...... 71 4. Generating Functions 74 4.1 Commonsums .................................... 74 4.2 Probability Generating Functions . .. 76 4.3 Using the probability generating -
14.30 Introduction to Statistical Methods in Economics Spring 2009
MIT OpenCourseWare http://ocw.mit.edu 14.30 Introduction to Statistical Methods in Economics Spring 2009 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. 14.30 Introduction to Statistical Methods in Economics Lecture Notes 14 Konrad Menzel April 2, 2009 1 Conditional Expectations Example 1 Each year, a firm’s R&D department produces X innovations according to some random process, where E[X] = 2 and Var(X) = 2. Each invention is a commercial success with probability p = 0.2 (assume independence). The number of commercial successes in a given year are denoted by S. Since we know that the mean of S B(x, p) = xp, conditional on X = x innovations in a given year, xp of them should be successful on ave∼rage. The conditional expectation of X given Y is the expectation of X taken over the conditional p.d.f.: Definition 1 y yfY |X (y X) if Y is discrete E[Y X] = ∞ | | yfY |X (y X)dy if Y is continuous � �−∞ | Note that since fY |X (y X) carries th�e random variable X as its argument, the conditional expectation is also a random variable.| However, we can also define the conditional expectation of Y given a particular value of X, y yfY |X (y x) if Y is discrete E[Y X = x] = ∞ | | yfY |X (y x)dy if Y is continuous � �−∞ | which is just a number for any given v�alue of x as long as the conditional density is defined. Since the calculation goes exactly like before, only that we now integrate over the conditional distribution, won’t do a numerical example (for the problem set, just apply definition).