A Wrapped Normal Distribution on Hyperbolic Space for Gradient-Based Learning

Total Page:16

File Type:pdf, Size:1020Kb

A Wrapped Normal Distribution on Hyperbolic Space for Gradient-Based Learning A Wrapped Normal Distribution on Hyperbolic Space for Gradient-Based Learning Yoshihiro Nagano 1 Shoichiro Yamaguchi 2 Yasuhiro Fujita 2 Masanori Koyama 2 Abstract (a) A tree representation of the train- Hyperbolic space is a geometry that is known to ing dataset be well-suited for representation learning of data !!!!!!" "!!""!! with an underlying hierarchical structure. In this "!!!""! paper, we present a novel hyperbolic distribution !!"!"!" called pseudo-hyperbolic Gaussian, a Gaussian- !!!!!"" !!!!"!" like distribution on hyperbolic space whose den- sity can be evaluated analytically and differen- !!!"!"" !!"!!"" !"!!"!" "!!!"!" tiated with respect to the parameters. Our dis- tribution enables the gradient-based learning of the probabilistic models on hyperbolic space that could never have been considered before. Also, (b) Vanilla VAE (β = 1:0) (c) Hyperbolic VAE we can sample from this hyperbolic probability distribution without resorting to auxiliary means like rejection sampling. As applications of our distribution, we develop a hyperbolic-analog of variational autoencoder and a method of proba- bilistic word embedding on hyperbolic space. We demonstrate the efficacy of our distribution on various datasets including MNIST, Atari 2600 Breakout, and WordNet. Figure 1: The visual results of Hyperbolic VAE applied to 1. Introduction an artificial dataset generated by applying random pertur- bations to a binary tree. The visualization is being done Recently, hyperbolic geometry is drawing attention as a on the Poincare´ ball. The red points are the embeddings powerful geometry to assist deep networks in capturing fun- of the original tree, and the blue points are the embeddings damental structural properties of data such as a hierarchy. of noisy observations generated from the tree. The pink × Hyperbolic attention network (Gul¨ c¸ehre et al., 2019) im- represents the origin of the hyperbolic space. The VAE was proved the generalization performance of neural networks trained without the prior knowledge of the tree structure. on various tasks including machine translation by imposing Please see 6.1 for experimental details arXiv:1902.02992v2 [stat.ML] 10 May 2019 the hyperbolic geometry on several parts of neural networks. Poincare´ embeddings (Nickel & Kiela, 2017) succeeded in learning a parsimonious representation of symbolic data by determines the properties of the dataset that can be learned embedding the dataset into Poincare´ balls. from the embedding. For the dataset with a hierarchical structure, in particular, the number of relevant features can In the task of data embedding, the choice of the target space grow exponentially with the depth of the hierarchy. Eu- 1Department of Complexity Science and Engineering, The clidean space is often inadequate for capturing the structural University of Tokyo, Japan 2Preferred Networks, Inc., Japan. information (Figure1). If the choice of the target space Correspondence to: Yoshihiro Nagano <[email protected] of the embedding is limited to Euclidean space, one might tokyo.ac.jp>. have to prepare extremely high dimensional space as the tar- Proceedings of the 36 th International Conference on Machine get space to guarantee small distortion. However, the same Learning, Long Beach, California, PMLR 97, 2019. Copyright embedding can be done remarkably well if the destination 2019 by the author(s). is the hyperbolic space (Sarkar, 2012; Sala et al., 2018). A Wrapped Normal Distribution on Hyperbolic Space for Gradient-Based Learning Now, the next natural question is; “how can we extend these and Lorentz (hyperboloid/Minkowski) model, and Poincare´ works to probabilistic inference problems on hyperbolic half-plane model. Many applications of hyperbolic space space?” When we know in advance that there is a hierarchi- to machine learning to date have adopted the Poincare´ disk cal structure in the dataset, a prior distribution on hyperbolic model as the subject of study (Nickel & Kiela, 2017; Ganea space might serve as a good informative prior. We might et al., 2018a;b; Sala et al., 2018). In this study, however, also want to make Bayesian inference on a dataset with we will use the Lorentz model that, as claimed in Nickel hierarchical structure by training a variational autoencoder & Kiela(2018), comes with a simpler closed form of the (VAE) (Kingma & Welling, 2014; Rezende et al., 2014) with geodesics and does not suffer from the numerical instabili- latent variables defined on hyperbolic space. We might also ties in approximating the distance. We will also exploit the want to conduct probabilistic word embedding into hyper- fact that both exponential map and parallel transportation bolic space while taking into account the uncertainty that have a clean closed form in the Lorentz model. arises from the underlying hierarchical relationship among Lorentz model n (Figure 2(a)) can be represented as a set words. Finally, it would be best if we can compare different H of points z 2 n+1 with z > 0 such that its Lorentzian probabilistic models on hyperbolic space based on popular R 0 product (negative Minkowski bilinear form) statistical measures like divergence that requires the explicit form of the probability density function. n X hz; z0i = −z z0 + z z0; The endeavors we mentioned in the previous paragraph all L 0 0 i i require probability distributions on hyperbolic space that i=1 admit a parametrization of the density function that can be with itself is −1. That is, computed analytically and differentiated with respect to n n+1 the parameter. Also, we want to be able to sample from the H = z 2 R : hz; ziL = −1; z0 > 0 : (1) distribution efficiently; that is, we do not want to resort to auxiliary methods like rejection sampling. Lorentzian inner product also functions as the metric tensor on hyperbolic space. We will refer to the one-hot vector In this study, we present a novel hyperbolic distribution µ = [1; 0; 0; :::0] 2 n ⊂ n+1 as the origin of the called pseudo-hyperbolic Gaussian, a Gaussian-like distri- 0 H R hyperbolic space. Also, the distance between two points bution on hyperbolic space that resolves all these problems. z; z0 on n is given by d (z; z0) = arccosh (−hz; z0i ), We construct this distribution by defining Gaussian distri- H ` L which is also the length of the geodesic that connects z and bution on the tangent space at the origin of the hyperbolic z0. space and projecting the distribution onto hyperbolic space after transporting the tangent space to a desired location in the space. This operation can be formalized by a combina- 2.2. Parallel Transport and Exponential Map tion of the parallel transport and the exponential map for the The rough explanation of our strategy for the construction Lorentz model of hyperbolic space. of pseudo-hyperbolic Gaussian G(µ; Σ) with µ 2 Hn and We can use our pseudo-hyperbolic Gaussian distribution to a positive positive definite matrix Σ is as follows. We (1) construct a probabilistic model on hyperbolic space that can sample a vector from N (0; Σ), (2) transport the vector from be trained with gradient-based learning. For example, our µ0 to µ along the geodesic, and (3) project the vector onto distribution can be used as a prior of a VAE (Figure1, Figure the surface. To formalize this sequence of operations, we 6). It is also possible to extend the existing probabilistic em- need to define the tangent space on hyperbolic space as well bedding method to hyperbolic space using our distribution, as the way to transport the tangent space and the way to such as probabilistic word embedding. We will demonstrate project a vector in the tangent space to the surface. The the utility of our method through the experiments of proba- transportation of the tangent vector requires parallel trans- bilistic hyperbolic models on benchmark datasets including port, and the projection of the tangent vector to the surface MNIST, Atari 2600 Breakout, and WordNet. requires the definition of exponential map. Tangent space of hyperbolic space 2. Background n n Let us use TµH to denote the tangent space of H at µ n 2.1. Hyperbolic Geometry (Figure 2(a)). Representing TµH as a set of vectors in the same ambient space Rn+1 into which Hn is embedded, n Hyperbolic geometry is a non-Euclidean geometry with a TµH can be characterized as the set of points satisfying constant negative Gaussian curvature, and it can be visual- the orthogonality relation with respect to the Lorentzian ized as the forward sheet of the two-sheeted hyperboloid. product: There are four common equivalent models used for the hy- n perbolic geometry: the Klein model, Poincare´ disk model, TµH := fu: hu; µiL = 0g: (2) A Wrapped Normal Distribution on Hyperbolic Space for Gradient-Based Learning (a) (b) (c) !"#"$$%$&'#"()*+#, !"#$%&%'()*+,)# 1 1 Figure 2: (a) One-dimensional Lorentz model H (red) and its tangent space TµH (blue). (b) Parallel transport carries n v 2 Tµ0 (green) to u 2 Tµ (blue) while preserving k · kL . (c) Exponential map projects the u 2 Tµ (blue) to z 2 H (red). n The distance between µ and expµ(u) which is measured on the surface of H coincides with kukL. n TµH set can be literally thought of as the tangent space of norm of v. For hyperbolic space, this map (Figure 2(c)) is n the forward hyperboloid sheet at µ. Note that Tµ0 H con- given by n+1 p sists of v 2 R with v0 = 0, and kvkL := hv; viL = kvk2. u z = expµ(u) = cosh (kukL)µ+sinh (kukL) : (5) Parallel transport and inverse parallel transport kukL Next, for an arbitrary pair of point µ; ν 2 Hn, the par- allel transport from ν to µ is defined as a map PTν!µ As we can confirm with straightforward computation, this n n n from Tν H to TµH that carries a vector in Tν H exponential map is norm preserving in the sense that along the geodesic from ν to µ in a parallel manner d`(µ; expµ(u)) = arccosh −hµ; expµ(u)iL = kukL.
Recommended publications
  • Comparative Analysis of Normal and Logistic Distributions Modeling of Stock Exchange Monthly Returns in Nigeria (1995- 2014)
    International Journal of Business & Law Research 4(4):58-66, Oct.-Dec., 2016 © SEAHI PUBLICATIONS, 2016 www.seahipaj.org ISSN: 2360-8986 Comparative Analysis Of Normal And Logistic Distributions Modeling Of Stock Exchange Monthly Returns In Nigeria (1995- 2014) EKE, Charles N. Ph.D. Department of Mathematics and Statistics Federal Polytechnic Nekede, Owerri, Imo State, Nigeria [email protected] ABSTRACT This study examined the statistical modeling of monthly returns from the Nigerian stock exchange using probability distributions (normal and logistic distributions) for the period, 1995 to 2014.The data were analyzed to determine the returns from the stock exchange, its mean and standard deviation in order to obtain the best suitable distribution of fit. Furthermore, comparative analysis was done between the logistic distribution and normal distribution. The findings proved that the logistic distribution is better than the normal distribution in modeling the returns from the Nigeria stock exchange due to its p – value is greater than the 5% significance level alpha and also because of its minimum variance. Keywords: Stock Exchange, Minimum Variance, Statistical Distribution, Modeling 1.0 INTRODUCTION Stock exchange market in any economy plays an important role towards economic growth. Importantly, stock exchange is an exchange where stock brokers and traders can buy and sell stocks (shares), bounds and other securities; this stock exchange also provides facilities for issue and redemption of securities and other financial instruments and capital events including the payment of income and dividends (Diamond, 2000). In addition , the Nigeria stock exchange was established as the Lagos stock exchange, in the year 1960, while, the stock exchange council was constituted and officially started operation on august 25th 1961 with 19 securities that were listed for trading (Ologunde et al, 2006).
    [Show full text]
  • Sparsity Enforcing Priors in Inverse Problems Via Normal Variance
    Sparsity enforcing priors in inverse problems via Normal variance mixtures: model selection, algorithms and applications Mircea Dumitru1 1Laboratoire des signaux et systemes` (L2S), CNRS – CentraleSupelec´ – Universite´ Paris-Sud, CentraleSupelec,´ Plateau de Moulon, 91192 Gif-sur-Yvette, France Abstract The sparse structure of the solution for an inverse problem can be modelled using different sparsity enforcing pri- ors when the Bayesian approach is considered. Analytical expression for the unknowns of the model can be obtained by building hierarchical models based on sparsity enforcing distributions expressed via conjugate priors. We consider heavy tailed distributions with this property: the Student-t distribution, which is expressed as a Normal scale mixture, with the mixing distribution the Inverse Gamma distribution, the Laplace distribution, which can also be expressed as a Normal scale mixture, with the mixing distribution the Exponential distribution or can be expressed as a Normal inverse scale mixture, with the mixing distribution the Inverse Gamma distribution, the Hyperbolic distribution, the Variance-Gamma distribution, the Normal-Inverse Gaussian distribution, all three expressed via conjugate distribu- tions using the Generalized Hyperbolic distribution. For all distributions iterative algorithms are derived based on hierarchical models that account for the uncertainties of the forward model. For estimation, Maximum A Posterior (MAP) and Posterior Mean (PM) via variational Bayesian approximation (VBA) are used. The performances of re- sulting algorithm are compared in applications in 3D computed tomography (3D-CT) and chronobiology. Finally, a theoretical study is developed for comparison between sparsity enforcing algorithms obtained via the Bayesian ap- proach and the sparsity enforcing algorithms issued from regularization techniques, like LASSO and some others.
    [Show full text]
  • Handbook on Probability Distributions
    R powered R-forge project Handbook on probability distributions R-forge distributions Core Team University Year 2009-2010 LATEXpowered Mac OS' TeXShop edited Contents Introduction 4 I Discrete distributions 6 1 Classic discrete distribution 7 2 Not so-common discrete distribution 27 II Continuous distributions 34 3 Finite support distribution 35 4 The Gaussian family 47 5 Exponential distribution and its extensions 56 6 Chi-squared's ditribution and related extensions 75 7 Student and related distributions 84 8 Pareto family 88 9 Logistic distribution and related extensions 108 10 Extrem Value Theory distributions 111 3 4 CONTENTS III Multivariate and generalized distributions 116 11 Generalization of common distributions 117 12 Multivariate distributions 133 13 Misc 135 Conclusion 137 Bibliography 137 A Mathematical tools 141 Introduction This guide is intended to provide a quite exhaustive (at least as I can) view on probability distri- butions. It is constructed in chapters of distribution family with a section for each distribution. Each section focuses on the tryptic: definition - estimation - application. Ultimate bibles for probability distributions are Wimmer & Altmann (1999) which lists 750 univariate discrete distributions and Johnson et al. (1994) which details continuous distributions. In the appendix, we recall the basics of probability distributions as well as \common" mathe- matical functions, cf. section A.2. And for all distribution, we use the following notations • X a random variable following a given distribution, • x a realization of this random variable, • f the density function (if it exists), • F the (cumulative) distribution function, • P (X = k) the mass probability function in k, • M the moment generating function (if it exists), • G the probability generating function (if it exists), • φ the characteristic function (if it exists), Finally all graphics are done the open source statistical software R and its numerous packages available on the Comprehensive R Archive Network (CRAN∗).
    [Show full text]
  • Parametric Representation of the Top of Income Distributions: Options, Historical Evidence and Model Selection
    PARAMETRIC REPRESENTATION OF THE TOP OF INCOME DISTRIBUTIONS: OPTIONS, HISTORICAL EVIDENCE AND MODEL SELECTION Vladimir Hlasny Working Paper 90 March, 2020 The CEQ Working Paper Series The CEQ Institute at Tulane University works to reduce inequality and poverty through rigorous tax and benefit incidence analysis and active engagement with the policy community. The studies published in the CEQ Working Paper series are pre-publication versions of peer-reviewed or scholarly articles, book chapters, and reports produced by the Institute. The papers mainly include empirical studies based on the CEQ methodology and theoretical analysis of the impact of fiscal policy on poverty and inequality. The content of the papers published in this series is entirely the responsibility of the author or authors. Although all the results of empirical studies are reviewed according to the protocol of quality control established by the CEQ Institute, the papers are not subject to a formal arbitration process. Moreover, national and international agencies often update their data series, the information included here may be subject to change. For updates, the reader is referred to the CEQ Standard Indicators available online in the CEQ Institute’s website www.commitmentoequity.org/datacenter. The CEQ Working Paper series is possible thanks to the generous support of the Bill & Melinda Gates Foundation. For more information, visit www.commitmentoequity.org. The CEQ logo is a stylized graphical representation of a Lorenz curve for a fairly unequal distribution of income (the bottom part of the C, below the diagonal) and a concentration curve for a very progressive transfer (the top part of the C).
    [Show full text]
  • The Generalized Hyperbolic Model: Estimation, Financial Derivatives, and Risk Measures
    The Generalized Hyperbolic Model: Estimation, Financial Derivatives, and Risk Measures Dissertation zur Erlangung des Doktorgrades der Mathematischen Fakult¨at der Albert-Ludwigs-Universit¨at Freiburg i. Br. vorgelegt von Karsten Prause Oktober 1999 Dekan: Prof. Dr. Wolfgang Soergel Referenten: Prof. Dr. Ernst Eberlein Prof. Ph.D. Bent Jesper Christensen Datum der Promotion: 15. Dezember 1999 Institut f¨ur Mathematische Stochastik Albert-Ludwigs-Universit¨at Freiburg Eckerstraße 1 D–79104 Freiburg im Breisgau To Ulrike Preface The aim of this dissertation is to describe more realistic models for financial assets based on generalized hyperbolic (GH) distributions and their subclasses. Generalized hyperbolic distributions were introduced by Barndorff-Nielsen (1977), and stochastic processes based on these distributions were first applied by Eberlein and Keller (1995) in Finance. Being a normal variance-mean mixture, GH distributions possess semi- heavy tails and allow for a natural definiton of volatility models by replacing the mixing generalized inverse Gaussian (GIG) distribution by appropriate volatility processes. In the first Chapter we introduce univariate GH distributions, construct an estimation algorithm and examine statistically the fit of generalized hyperbolic distributions to log- return distributions of financial assets. We extend the hyperbolic model for the pricing of derivatives to generalized hyperbolic L´evy motions and discuss the calculation of prices by fast Fourier methods and saddle-point approximations. Chapter 2 contains on the one hand a general recipe for the evaluation of option pricing models; on the other hand the derivative pricing based on GH L´evy motions is studied from various points of view: The accordance with observed asset price processes is investigated statistically, and by simulation studies the sensitivity to relevant variables; finally, theoretical prices are compared with quoted option prices.
    [Show full text]
  • Package 'Hyperbolicdist'
    Package ‘HyperbolicDist’ February 19, 2015 Version 0.6-2 Date 2009-09-09 Title The hyperbolic distribution Author David Scott <[email protected]> Maintainer David Scott <[email protected]> Depends R (>= 2.3.0) Suggests VarianceGamma, actuar Encoding latin1 Description This package provides functions for the hyperbolic and related distributions. Density, distribution and quantile functions and random number generation are provided for the hyperbolic distribution, the generalized hyperbolic distribution, the generalized inverse Gaussian distribution and the skew-Laplace distribution. Additional functionality is provided for the hyperbolic distribution, including fitting of the hyperbolic to data. License GPL (>= 2) URL http://www.r-project.org Repository CRAN Repository/R-Forge/Project rmetrics Repository/R-Forge/Revision 4411 Date/Publication 2009-09-23 16:45:00 NeedsCompilation no R topics documented: Bessel K Ratio . .2 Functions for Moments . .4 Generalized Inverse Gaussian . .5 GeneralizedHyperbolic . .8 1 2 Bessel K Ratio GeneralizedHyperbolicPlots . 11 ghypCalcRange . 13 ghypChangePars . 14 ghypMom . 15 gigCalcRange . 17 gigChangePars . 18 gigCheckPars . 19 gigMom . 21 GIGPlots . 23 hyperbCalcRange . 25 hyperbChangePars . 26 hyperbCvMTest . 27 hyperbFit . 29 hyperbFitStart . 32 Hyperbolic . 34 HyperbolicDistribution . 37 HyperbPlots . 38 hyperbWSqTable . 40 is.wholenumber . 40 logHist . 41 mamquam . 44 momChangeAbout . 45 momIntegrated . 46 momRecursion . 48 resistors . 49 safeIntegrate . 50 Sample Moments . 52 SandP500 . 53 SkewLaplace . 54 SkewLaplacePlots . 55 Specific Generalized Hyperbolic Moments and Mode . 57 Specific Generalized Inverse Gaussian Moments and Mode . 58 Specific Hyperbolic Distribution Moments and Mode . 59 summary.hyperbFit . 60 Index 62 Bessel K Ratio Ratio of Bessel K Functions Description Calculates the ratio of Bessel K functions of different orders Usage besselRatio(x, nu, orderDiff, useExpScaled = 700) Bessel K Ratio 3 Arguments x Numeric, ≥ 0.
    [Show full text]
  • Generalized Hyperbolic Distributions and Value-At-Risk Estimation For
    CORE Metadata, citation and similar papers at core.ac.uk Provided by Clute Institute: Journals International Business & Economics Research Journal – March/April 2014 Volume 13, Number 2 Generalized Hyperbolic Distributions And Value-At-Risk Estimation For The South African Mining Index Chun-Kai Huang, University of KwaZulu-Natal, South Africa Knowledge Chinhamu, University of KwaZulu-Natal, South Africa Chun-Sung Huang, University of Cape Town, South Africa Jahvaid Hammujuddy, University of KwaZulu-Natal, South Africa ABSTRACT South Africa is a cornucopia of mineral riches and the performance of its mining industry has significant impacts on the economy. Hence, an accurate distributional assumption of the underlying mining index returns is imperative for the forecasting and understanding of the financial market. In this paper, we propose three subclasses of the generalized hyperbolic distributions as appropriate models for the Johannesburg Stock Exchange (JSE) Mining Index returns. These models are shown to outperform the traditional assumption of normality and accommodate for a number of stylized features, such as excess kurtosis and volatility clustering, embedded within the financial data. The models are compared using the Akaike Information Criterion (AIC), the Bayesian Information Criterion (BIC) and log-likelihoods. In addition, Value- at-Risk (VaR) estimation and backtesting were also performed to test the extreme tails. The various criteria utilized suggest the generalized hyperbolic (GH) skew Student’s t-distribution as the most robust model for the South African Mining Index returns. Keywords: FTSE/JSE Mining Index; Hyperbolic Distribution; Normal-Inverse Gaussian (NIG) Distribution; Generalized Hyperbolic Skew Student’s t-Distribution; Value-at-Risk INTRODUCTION s a country in profusion of natural resources, South Africa’s mining sector accounts for a considerable portion of the world’s production and reserves in mineral riches such as gold, platinum, and coal.
    [Show full text]
  • From the Classical Beta Distribution to Generalized Beta Distributions
    FROM THE CLASSICAL BETA DISTRIBUTION TO GENERALIZED BETA DISTRIBUTIONS Title A project submitted to the School of Mathematics, University of Nairobi in partial fulfillment of the requirements for the degree of Master of Science in Statistics. By Otieno Jacob I56/72137/2008 Supervisor: Prof. J.A.M Ottieno School of Mathematics University of Nairobi July, 2013 FROM THE CLASSICAL BETA DISTRIBUTION TO GENERALIZED BETA DISTRIBUTIONS Half Title ii Declaration This project is my original work and has not been presented for a degree in any other University Signature ________________________ Otieno Jacob This project has been submitted for examination with my approval as the University Supervisor Signature ________________________ Prof. J.A.M Ottieno iii Dedication I dedicate this project to my wife Lilian; sons Benjamin, Vincent, John; daughter Joy and friends. A special feeling of gratitude to my loving Parents, Justus and Margaret, my aunt Wilkista, my Grandmother Nereah, and my late Uncle Edwin for their support and encouragement. iv Acknowledgement I would like to thank my family members, colleagues, friends and all those who helped and supported me in writing this project. My sincere appreciation goes to my supervisor, Prof. J.A.M Ottieno. I would not have made it this far without his tremendous guidance and support. I would like to thank him for his valuable advice concerning the presentation of this work before the Master of Science Project Committee of the School of Mathematics at the University of Nairobi and for his encouragement and guidance throughout. I would also like to thank Dr. Kipchirchir for his questions and comments which were beneficial for finalizing the project.
    [Show full text]
  • Three Skewed Matrix Variate Distributions Arxiv:1704.02531V5
    Three Skewed Matrix Variate Distributions Michael P.B. Gallaugher and Paul D. McNicholas Dept. of Mathematics & Statistics, McMaster University, Hamilton, Ontario, Canada. Abstract Three-way data can be conveniently modelled by using matrix variate distributions. Although there has been a lot of work for the matrix variate normal distribution, there is little work in the area of matrix skew distributions. Three matrix variate distributions that incorporate skewness, as well as other flexible properties such as concentration, are discussed. Equivalences to multivariate analogues are presented, and moment generating functions are derived. Maximum likelihood parameter estimation is discussed, and simulated data is used for illustration. 1 Introduction Matrix variate distributions are useful in modelling three way data, e.g., multivariate longitu- dinal data. Although the matrix normal distribution is widely used, there is relative paucity arXiv:1704.02531v5 [stat.ME] 13 Aug 2018 in the area of matrix skewed distributions. Herein, we discuss matrix variate extensions of three already well established multivariate distributions using matrix normal variance-mean mixtures. Specifically, we consider a matrix variate generalized hyperbolic distribution, a matrix variate variance-gamma distribution, and a matrix variate normal inverse Gaussian (NIG) distribution. Along with the matrix variate skew-t distribution, mixtures of these respective distributions have been used for clustering (Gallaugher and McNicholas, 2018); 1 however, unlike the matrix variate skew-t distribution (Gallaugher and McNicholas, 2017), their properties have yet to be discussed and this letter aims to fill that gap. 2 Background 2.1 The Matrix Variate Normal and Related Distributions One of the most mathematically tractable examples of a matrix variate distribution is the matrix variate normal distribution.
    [Show full text]
  • Assessing the Relative Performance of Heavy-Tailed Distributions
    The Journal of Applied Business Research – July/August 2014 Volume 30, Number 4 Assessing The Relative Performance Of Heavy-Tailed Distributions: Empirical Evidence From The Johannesburg Stock Exchange Chun-Sung Huang, University of Cape Town, South Africa Chun-Kai Huang, University of Cape Town, South Africa Knowledge Chinhamu, University of KwaZulu-Natal, South Africa ABSTRACT It has been well documented that the empirical distribution of daily logarithmic returns from financial market variables is characterized by excess kurtosis and skewness. In order to capture such properties in financial data, heavy-tailed and asymmetric distributions are required to overcome shortfalls of the widely exhausted classical normality assumption. In the context of financial forecasting and risk management, the accuracy in modeling the underlying returns distribution plays a vital role. For example, risk management tools such as value-at-risk (VaR) are highly dependent on the underlying distributional assumption, with particular focus being placed at the extreme tails. Hence, identifying a distribution that best captures all aspects of the given financial data may provide vast advantages to both investors and risk managers. In this paper, we investigate major financial indices on the Johannesburg Stock Exchange (JSE) and fit their associated returns to classes of heavy tailed distributions. The relative adequacy and goodness-of-fit of these distributions are then assessed through the robustness of their respective VaR estimates. Our results indicate that the best model selection is not only variant across the indices, but also across different VaR levels and the dissimilar tails of return series. Keywords: Heavy-Tailed and Asymmetric Distributions; Value-At-Risk (VaR); Johannesburg Stock Exchange 1.
    [Show full text]
  • Probability Distributions
    Probability distributions [Nematrian website page: ProbabilityDistributionsIntro, © Nematrian 2020] The Nematrian website contains information and analytics on a wide range of probability distributions, including: Discrete (univariate) distributions - Bernoulli, see also binomial distribution - Binomial - Geometric, see also negative binomial distribution - Hypergeometric - Logarithmic - Negative binomial - Poisson - Uniform (discrete) Continuous (univariate) distributions - Beta - Beta prime - Burr - Cauchy - Chi-squared - Dagum - Degenerate - Error function - Exponential - F - Fatigue, also known as the Birnbaum-Saunders distribution - Fréchet, see also generalised extreme value (GEV) distribution - Gamma - Generalised extreme value (GEV) - Generalised gamma - Generalised inverse Gaussian - Generalised Pareto (GDP) - Gumbel, see also generalised extreme value (GEV) distribution - Hyperbolic secant - Inverse gamma - Inverse Gaussian - Johnson SU - Kumaraswamy - Laplace - Lévy - Logistic - Log-logistic - Lognormal - Nakagami - Non-central chi-squared - Non-central t - Normal - Pareto - Power function - Rayleigh - Reciprocal - Rice - Student’s t - Triangular - Uniform - Weibull, see also generalised extreme value (GEV) distribution Continuous multivariate distributions - Inverse Wishart Copulas (a copula is a special type of continuous multivariate distribution) - Clayton - Comonotonicity - Countermonotonicity (only valid for 푛 = 2, where 푛 is the dimension of the input) - Frank - Generalised Clayton - Gumbel - Gaussian - Independence
    [Show full text]
  • Models for Heavy-Tailed Asset Returns
    Munich Personal RePEc Archive Models for Heavy-tailed Asset Returns Borak, Szymon and Misiorek, Adam and Weron, Rafal Wrocław University of Technology, Poland September 2010 Online at https://mpra.ub.uni-muenchen.de/25494/ MPRA Paper No. 25494, posted 28 Sep 2010 20:15 UTC Models for Heavy-tailed Asset Returns1 Szymon Borak2, Adam Misiorek3, and Rafał Weron4 Abstract: Many of the concepts in theoretical and empirical finance developed over the past decades – including the classical portfolio theory, the Black- Scholes-Merton option pricing model or the RiskMetrics variance-covariance approach to VaR – rest upon the assumption that asset returns follow a normal distribution. But this assumption is not justified by empirical data! Rather, the empirical observations exhibit excess kurtosis, more colloquially known as fat tails or heavy tails. This chapter is intended as a guide to heavy-tailed models. We first describe the historically oldest heavy-tailed model – the stable laws. Next, we briefly characterize their recent lighter-tailed generalizations, the so- called truncated and tempered stable distributions. Then we study the class of generalized hyperbolic laws, which – like tempered stable distributions – can be classified somewhere between infinite variance stable laws and the Gaussian distribution. Finally, we provide numerical examples. Keywords: Heavy-tailed distribution; Stable distribution; Tempered stable distribution; Generalized hyperbolic distribution; Asset return; Random number generation; Parameter estimation JEL: C13,
    [Show full text]