Characteristic Kernels and Infinitely Divisible Distributions

Journal of Machine Learning Research 17 (2016) 1-28. Submitted 3/14; Revised 5/16; Published 9/16

Yu Nishiyama
The University of Electro-Communications, 1-5-1 Chofugaoka, Chofu, Tokyo 182-8585, Japan

Kenji Fukumizu
The Institute of Statistical Mathematics, 10-3 Midori-cho, Tachikawa, Tokyo 190-8562, Japan

Editor: Ingo Steinwart

Abstract

We connect shift-invariant characteristic kernels to infinitely divisible distributions on $\mathbb{R}^d$. Characteristic kernels play an important role in machine learning applications with their kernel means to distinguish any two probability measures. The contribution of this paper is twofold. First, we show, using the Lévy–Khintchine formula, that any shift-invariant kernel given by a bounded, continuous, and symmetric probability density function (pdf) of an infinitely divisible distribution on $\mathbb{R}^d$ is characteristic. We mention some closure properties of such characteristic kernels under addition, pointwise product, and convolution. Second, in developing various kernel mean algorithms, it is fundamental to compute the following values: (i) kernel mean values $m_P(x)$, $x \in \mathcal{X}$, and (ii) kernel mean RKHS inner products $\langle m_P, m_Q \rangle_{\mathcal{H}}$, for probability measures $P, Q$. If $P$, $Q$, and kernel $k$ are Gaussians, then the computation of (i) and (ii) results in Gaussian pdfs that are tractable. We generalize this Gaussian combination to more general cases in the class of infinitely divisible distributions. We then introduce a conjugate kernel and a convolution trick, so that the above (i) and (ii) have the same pdf form, expecting tractable computation at least in some cases. As specific instances, we explore α-stable distributions and a rich class of generalized hyperbolic distributions, where the Laplace, Cauchy, and Student's t distributions are included.
Keywords: Characteristic Kernel, Kernel Mean, Infinitely Divisible Distribution, Conjugate Kernel, Convolution Trick

arXiv:1403.7304v3 [stat.ML] 25 Oct 2016

© 2016 Yu Nishiyama and Kenji Fukumizu.

1. Introduction

Let $(\mathcal{X}, \mathcal{B}(\mathcal{X}))$ be a measurable space and $\mathcal{M}_1(\mathcal{X})$ be the set of probability measures. Let $\mathcal{H}$ be the real-valued reproducing kernel Hilbert space (RKHS) associated with a bounded and measurable positive-definite (p.d.) kernel $k : \mathcal{X} \times \mathcal{X} \to \mathbb{R}$. In machine learning, kernel methods provide a technique for developing nonlinear algorithms, by mapping data $X_1, \ldots, X_n$ in $\mathcal{X}$ to higher- or infinite-dimensional RKHS functions $k(\cdot, X_1), \ldots, k(\cdot, X_n)$ in $\mathcal{H}$ (Schölkopf and Smola, 2002; Steinwart and Christmann, 2008).

Recently, an RKHS representation of a probability measure $P \in \mathcal{M}_1(\mathcal{X})$, called kernel mean, $m_P := \mathbb{E}_{X \sim P}[k(\cdot, X)] \in \mathcal{H}$ (Smola et al., 2007; Fukumizu et al., 2013), or equivalently,

$$m_P(x) = \int k(x, y)\, dP(y), \quad x \in \mathcal{X}, \tag{1}$$

has been used to handle probability measures in RKHSs. The kernel mean enables us to introduce a similarity and distance between two probability measures $P, Q \in \mathcal{M}_1(\mathcal{X})$, via the RKHS inner product $\langle m_P, m_Q \rangle_{\mathcal{H}}$ and the norm $\| m_P - m_Q \|_{\mathcal{H}}$, respectively. Using these quantities, different authors have proposed many algorithms, including density estimations (Smola et al., 2007; Song et al., 2008; McCalman et al., 2013), hypothesis tests (Gretton et al., 2012; Gretton et al., 2008; Fukumizu et al., 2008), kernel Bayesian inference (Song et al., 2009; Song et al., 2010; Song et al., 2011; Fukumizu et al., 2013; Song et al., 2013; Kanagawa et al., 2016; Nishiyama et al., 2016), classification (Muandet et al., 2012), dimension reduction (Fukumizu and Leng, 2012), and reinforcement learning (Grünewälder et al., 2012; Nishiyama et al., 2012; Rawlik et al., 2013; Boots et al., 2013).

In these applications, the characteristic property of a p.d. kernel $k$ is important: a p.d.
kernel is said to be characteristic if any two probability measures $P, Q \in \mathcal{M}_1(\mathcal{X})$ can be distinguished by their kernel means $m_P, m_Q \in \mathcal{H}$ (Fukumizu et al., 2004; Sriperumbudur et al., 2010, 2011). For a continuous, bounded, and shift-invariant p.d. kernel on $\mathbb{R}^d$ with $k(x, y) = \kappa(x - y)$, a necessary and sufficient condition for the kernel to be characteristic is known via the Bochner theorem (Sriperumbudur et al., 2010, Theorem 9). As the first contribution of this paper, we show, using the Lévy–Khintchine formula (Sato, 1999; F. W. Steutel, 2004; Applebaum, 2009), that if $\kappa$ is a continuous, bounded, and symmetric pdf of an infinitely divisible distribution $P$ on $\mathbb{R}^d$, then $k$ is a characteristic p.d. kernel. We call such kernels convolutionally infinitely divisible (CID) kernels. Examples of CID kernels are given in Example 3.4. In addition, we note some closure properties of the CID kernels with respect to addition, pointwise product, and convolution.

To describe the second contribution, we briefly explain what is essentially computed in kernel mean algorithms. In general kernel methods, the following computations are fundamental:

(i) RKHS function values: $f(x)$ for $f \in \mathcal{H}$, $x \in \mathcal{X}$,
(ii) RKHS inner products: $\langle f, g \rangle_{\mathcal{H}}$, $f, g \in \mathcal{H}$.

If $f \in \mathcal{H}$ is represented by $f := \sum_{i=1}^{n} w_i k(\cdot, X_i)$, $w \in \mathbb{R}^n$, then the function value (i) $f(x) = \sum_{i=1}^{n} w_i k(x, X_i)$ reduces to the evaluation of the kernel $k(x, y)$. Similarly, if two RKHS functions $f, g \in \mathcal{H}$ are represented by $f := \sum_{i=1}^{n} w_i k(\cdot, X_i)$ and $g := \sum_{j=1}^{l} \tilde{w}_j k(\cdot, \tilde{X}_j)$, respectively, then the inner product (ii) $\langle f, g \rangle_{\mathcal{H}} = \sum_{i=1}^{n} \sum_{j=1}^{l} w_i \tilde{w}_j k(X_i, \tilde{X}_j)$ reduces to the evaluation of the kernel $k(x, y)$, which is the so-called kernel trick $\langle k(\cdot, x), k(\cdot, y) \rangle_{\mathcal{H}} = k(x, y)$.

We consider a more general case in which $f, g \in \mathcal{H}$ are represented by $f := \sum_{i=1}^{n} w_i m_{P_i}$ and $g := \sum_{j=1}^{l} \tilde{w}_j m_{Q_j}$, respectively, where $\{m_{P_i}\}, \{m_{Q_j}\} \subset \mathcal{H}$ are kernel means of probability measures $\{P_i\}, \{Q_j\} \subset \mathcal{M}_1(\mathcal{X})$.
Kernel algorithms involving kernel means use this type of RKHS functions explicitly or implicitly. If $\{P_i\}, \{Q_j\}$ are delta measures $\{\delta_{X_i}\}, \{\delta_{\tilde{X}_j}\}$ (footnote 1), then these functions are specialized to the above kernel trick case, where $m_{\delta_x} = k(\cdot, x)$. The computation of (i) $f(x) = \sum_{i=1}^{n} w_i m_{P_i}(x)$ and (ii) $\langle f, g \rangle_{\mathcal{H}} = \sum_{i=1}^{n} \sum_{j=1}^{l} w_i \tilde{w}_j \langle m_{P_i}, m_{Q_j} \rangle_{\mathcal{H}}$ requires the following kernel mean evaluations (footnote 2):

(iii) kernel mean values: $m_P(x)$ for $P \in \mathcal{M}_1(\mathcal{X})$, $x \in \mathcal{X}$,
(iv) kernel mean inner products: $\langle m_P, m_Q \rangle_{\mathcal{H}}$, $P, Q \in \mathcal{M}_1(\mathcal{X})$.

Footnote 1: A probability measure $\delta_x(\cdot)$, $x \in \mathcal{X}$, is a delta measure; if $x \in B$, then $\delta_x(B) = 1$; otherwise, $\delta_x(B) = 0$, for $B \in \mathcal{B}(\mathcal{X})$.
Footnote 2: If kernel means $m_P, m_Q$ are also both expressed by a weighted sum, $m_P := \sum_{i=1}^{n_P} \eta_i k(\cdot, \dot{X}_i)$ and $m_Q := \sum_{i=1}^{n_Q} \tilde{\eta}_i k(\cdot, \ddot{X}_i)$, $\{\dot{X}_i\}, \{\ddot{X}_i\} \subset \mathcal{X}$, then the computation also reduces to the above kernel trick case.

Note that the kernel mean value (1) and the kernel mean inner product $\langle m_P, m_Q \rangle_{\mathcal{H}} = \iint k(x, y)\, dP(x)\, dQ(y)$ involve an integral, and their rigorous computation is not tractable in general.

The second contribution of this paper is to provide some classes of p.d. kernels and parametric models $P, Q \in \mathcal{P}_\Theta := \{P_\theta \mid \theta \in \Theta\}$ such that the kernel computation of (iii) and (iv) can be reduced to a kernel evaluation, where tractable computation is considered. For a shift-invariant kernel $k(x, y) = \kappa(x - y)$, $x, y \in \mathbb{R}^d$, on $\mathbb{R}^d$, as shown in Lemma 2.5, the computation of (iii) and (iv) reduces to the following convolution:

(iii)' kernel mean values: $m_P(x) = (\kappa * P)(x)$,
(iv)' kernel mean inner products: $\langle m_P, m_Q \rangle_{\mathcal{H}} = (\kappa * \tilde{P} * Q)(0) = (\kappa * P * \tilde{Q})(0)$,

where $\tilde{P}$ and $\tilde{Q}$ are the dual of $P$ and $Q$, respectively (footnote 3). This convolution representation motivates us to explore a set of parametric distributions $\mathcal{P}_\Theta$ that is closed under convolution, namely, a convolution semigroup $(\mathcal{P}_\Theta, *) \subset \mathcal{M}_1(\mathbb{R}^d)$, where $\kappa$ is a density function in $\mathcal{P}_\Theta$.
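The convolution representation (iii)' and (iv)' can be checked by hand on discrete measures, where every convolution is a finite sum. The following is a minimal sketch under stated assumptions, not the paper's implementation: it works on $\mathbb{R}$ only, it assumes the dual is the reflection $\tilde{P}(B) = P(-B)$ (the text of footnote 3 is not reproduced here), it uses a unit-scale Laplace pdf as $\kappa$, and all names (`Atoms`, `kappa`, `dual`, `convolve`, `kappa_conv`, `kernel_mean_inner`, `kernel_trick_inner`) are hypothetical.

```python
import math
from typing import List, Tuple

# A discrete measure on R as (location, weight) pairs. Hypothetical type alias.
Atoms = List[Tuple[float, float]]


def kappa(z: float) -> float:
    """Unit-scale Laplace pdf, used here as a symmetric kernel profile."""
    return 0.5 * math.exp(-abs(z))


def dual(p: Atoms) -> Atoms:
    """Reflection through the origin; assumes dual(P)(B) = P(-B)."""
    return [(-x, w) for x, w in p]


def convolve(p: Atoms, q: Atoms) -> Atoms:
    """Convolution of discrete measures: atoms at x + y with weights a * b."""
    return [(x + y, a * b) for x, a in p for y, b in q]


def kappa_conv(p: Atoms, x: float) -> float:
    """(kappa * P)(x) = sum_i a_i * kappa(x - x_i), i.e. the kernel mean value m_P(x)."""
    return sum(w * kappa(x - xi) for xi, w in p)


def kernel_mean_inner(p: Atoms, q: Atoms) -> float:
    """<m_P, m_Q>_H computed as (kappa * dual(P) * Q)(0)."""
    return kappa_conv(convolve(dual(p), q), 0.0)


def kernel_trick_inner(p: Atoms, q: Atoms) -> float:
    """The same inner product as the double sum of a_i * b_j * k(x_i, y_j)."""
    return sum(a * b * kappa(x - y) for x, a in p for y, b in q)


if __name__ == "__main__":
    P = [(0.0, 0.5), (1.0, 0.5)]
    Q = [(2.0, 0.25), (-1.0, 0.75)]
    print(kernel_mean_inner(P, Q), kernel_trick_inner(P, Q))
```

For discrete measures the two routes agree up to floating-point rounding, which is just the kernel trick case mentioned in footnote 2. The convolution form becomes useful when $P$, $Q$, and $\kappa$ lie in a family closed under convolution, so that the convolved density has a known closed form.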
To illustrate the basic idea, let us consider Gaussian distributions as a parametric class $\mathcal{P}_\Theta$, which is closed under convolution, and a Gaussian kernel. For simplicity, we consider the case of scalar variance matrices $\sigma^2 I_d$. Let $N_d(\mu, \sigma^2 I_d)$ and $f_d(x \mid \mu, \sigma^2 I_d)$ denote the $d$-dimensional Gaussian distribution with mean $\mu$ and variance-covariance matrix $\sigma^2 I_d$, and its pdf, respectively. If $P$ and $Q$ are Gaussian distributions $N_d(\mu_P, \sigma_P^2 I_d)$ and $N_d(\mu_Q, \sigma_Q^2 I_d)$, respectively, and $k$ is given by the pdf $f_d(x - y \mid 0, \tau^2 I_d)$, it is easy to see that

$$m_P(x) = f_d\big(x \mid \mu_P, (\sigma_P^2 + \tau^2) I_d\big) \quad \text{and} \quad \langle m_P, m_Q \rangle_{\mathcal{H}} = f_d\big(\mu_P \mid \mu_Q, (\sigma_P^2 + \sigma_Q^2 + \tau^2) I_d\big).$$

The kernel mean value and inner product are thus reduced to simply evaluating Gaussian pdfs, which are given by a parameter update following a specific rule. This type of computation appears in various applications: to list a few, Muandet et al. (2012) proposed a support measure classification by considering kernels $k(P, Q)$ between two input probability measures $P, Q$, including Gaussian models; Song et al. (2008) and McCalman et al. (2013) considered an approximation of a (target) probability measure $P$ with a Gaussian mixture $P_\theta$, via an optimization problem $\hat{\theta} = \arg\min_\theta \| m_P - m_{P_\theta} \|_{\mathcal{H}}^2$.
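The Gaussian parameter-update rule stated above is short enough to write down directly. The sketch below is illustrative only and assumes isotropic covariances $\sigma^2 I_d$ as in the text; the function names (`isotropic_gaussian_pdf`, `kernel_mean_value`, `kernel_mean_inner`) are hypothetical and do not come from the paper.

```python
import math
from typing import Sequence


def isotropic_gaussian_pdf(x: Sequence[float], mu: Sequence[float], var: float) -> float:
    """Density f_d(x | mu, var * I_d) of an isotropic d-dimensional Gaussian."""
    d = len(x)
    sq_dist = sum((xi - mi) ** 2 for xi, mi in zip(x, mu))
    return (2.0 * math.pi * var) ** (-d / 2.0) * math.exp(-sq_dist / (2.0 * var))


def kernel_mean_value(x: Sequence[float], mu_p: Sequence[float],
                      var_p: float, tau2: float) -> float:
    """m_P(x) for P = N_d(mu_p, var_p * I_d) and k(x, y) = f_d(x - y | 0, tau2 * I_d).

    The variances add: the result is a Gaussian pdf with variance var_p + tau2.
    """
    return isotropic_gaussian_pdf(x, mu_p, var_p + tau2)


def kernel_mean_inner(mu_p: Sequence[float], var_p: float,
                      mu_q: Sequence[float], var_q: float, tau2: float) -> float:
    """<m_P, m_Q>_H for Gaussian P, Q and the Gaussian kernel above.

    All three variances add, and the pdf is evaluated at mu_p with mean mu_q.
    """
    return isotropic_gaussian_pdf(mu_p, mu_q, var_p + var_q + tau2)


if __name__ == "__main__":
    # Two 2-dimensional Gaussians and a kernel bandwidth tau^2 = 0.2.
    print(kernel_mean_value([0.3, -0.1], [1.0, 0.0], 0.5, 0.2))
    print(kernel_mean_inner([1.0, 0.0], 0.5, [-0.4, 0.2], 0.3, 0.2))
```

Both functions are a single pdf evaluation, so neither the integral in (1) nor the double integral for the inner product is computed numerically. A one-dimensional quadrature of those integrals is a simple way to sanity-check the update rule.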