Investigation and Comparison of Sampling Properties of L-Moments and Conventional Moments

Total Page:16

File Type:pdf, Size:1020Kb

Investigation and Comparison of Sampling Properties of L-Moments and Conventional Moments Journal of Hydrology 218 (1999) 13–34 Investigation and comparison of sampling properties of L-moments and conventional moments A. Sankarasubramanian1, K. Srinivasan* Department of Civil Engineering, Indian Institute of Technology Madras, Chennai 600 036, India Received 3 December 1997; accepted 26 January 1999 Abstract The first part of this article deals with fitting of regression equations for the sampling properties, variance of L-standard deviation (l2), and bias and variance of L-skewness (t3), based on Monte-Carlo simulation results, for generalised Normal (Lognormal-3) and Pearson-3 distributions. These fitted equations will be useful in formulating goodness-of-fit test statistics in regional frequency analysis. The second part presents a comparison of the sampling properties between L-moments and conventional product moments for generalised Normal, generalised Extreme Value, generalised Pareto and Pearson-3 distribu- tions, in a relative form. The comparison reveals that the bias in L-skewness is found to be insignificant up to a skewness of about 1.0, even for small samples. In case of higher skewness, for a reasonable sample size of 30, L-skewness is found to be nearly unbiased. However, the conventional skewness is found to be significantly biased, even for a low skewness of 0.5 and a reasonable sample size of 30. The overall performance evaluation in terms of “Relative-RMSE in third moment ratio” reveals that conventional moments are preferable at lower skewness, particularly for smaller samples, while L-moments are preferable at higher skewness, for all sample sizes. This point is illustrated through an application that seeks to obtain an appropriate regional flood frequency distribution for the 98 catchment areas located in the central region of India, spread over six hydrometeorologic subzones. ᭧ 1999 Elsevier Science B.V. All rights reserved. Keywords: Sampling methods; Generalised Normal, Pearson distributions; L-moments; L-skewness 1. Introduction distribution. Even though this method is conceptually simple, and the computations are straight-forward, it The method of moments has been one of the is found that the numerical values of the sample simplest and conventional parameter estimation tech- moments can be very different from those of the popu- niques used in statistical hydrology. In this method, lation from which the sample has been drawn, espe- while fitting a probability distribution to a sample, the cially when the sample size is small and/or the parameters are estimated by equating the sample skewness of the sample is considerable. If m, s 2, g, moments to those of the theoretical moments of the denote the mean, the variance and the skewness of the population, then the unbiased sample estimates of m, s 2, g, are expressed as: * Corresponding author. E-mail address: [email protected] (K. Srinivasan) 1 Xn 1 ; Presently Research Associate, Tufts University, MA 02155, m xi 1a USA. n i1 0022-1694/99/$ - see front matter ᭧ 1999 Elsevier Science B.V. All rights reserved. PII: S0022-1694(99)00018-9 14 A. Sankarasubramanian, K. Srinivasan / Journal of Hydrology 218 (1999) 13–34 Xn 1 estimated with M1,r,s considering x as linear. In fact, s2 x Ϫ m2; 1b Ϫ i either the quantity M1,0,s (a s: s 0,1,…) or M1,r,0 (b r: n 1 i 1 r 0,1,…) is sufficient to characterise a probability "#distribution. The quantity M which is known as b , 1 n Xn 1,r,0 r g x Ϫ m3 : 1c is more commonly used and can be defined as: 3 Ϫ Ϫ i s n 1 n 2 i 1 b r r E x F x 3 It is to be noted that Eq. 1(c) yields unbiased esti- b mate of skewness, only if the distribution is normal. The unbiased sample estimator (br)of r can be The attempts made to develop unbiased estimates of estimated using the expression given by Landwehr skewness for various non-normal distributions usually et al. (1979): result in a considerable increase in the variance of ! Xn Ϫ skewness (Vogel and Fennessey, 1993). Moreover, a i 1 x i notable drawback with conventional moment ratios 1 r b i 1 ! r 0; 1; …; n Ϫ 1 4 such as skewness and coefficient of variation, is that, r n n Ϫ 1 for finite samples, they are bounded. These bounds are shown to be dependent only on the sample size, irre- r spective of the population values (Kirby, 1974). where x is an ordered set of observations Wallis et al., 1974 have shown that the sample esti- (i) x Յ x Յ x Յ … Յ x . mates of skewness are highly biased for small 1 2 3 n The first four L-moments are given as: samples. Recently, Vogel and Fennessey (1993) have investigated the sampling properties of conven- l b 1 0 tional moment ratios for highly skewed Lognormal l b Ϫ b (LN-2) and generalised Pareto (GPA) distributions 2 2 1 0 for large samples, in terms of bias and root mean 5 l b Ϫ b ϩ b square error (RMSE) expressed in a relative form. 3 6 2 6 1 0 Their study reveals that sample estimates of coeffi- l b Ϫ b ϩ b Ϫ b cient of variation and skewness are remarkably biased 4 20 3 30 2 12 1 0 even for extremely large samples (n Ͼ 1000) drawn from highly skewed populations. The sample estimates (lr: r 1,2,3,…) of L- moments (l 1, l 2,…, l r) can be obtained by substitut- ing the unbiased sample estimates of b r given by Eq. 2. L-moments (4) into Eq. (5). More recently, Wang (1996) has derived direct esti- Recently, Hosking (1990) has defined L-moments, mators of L-moments by closely following the basic which are analogous to conventional moments, and definition of L-moments which is based on order can be expressed in terms of linear combinations of statistics. This eliminates the need to resort to order statistics. Basically, L-moments are linear func- PWMs, while defining the L-moments. However, tions of probability weighted moments (PWMs). the numerical values of the sample estimates from Greenwood et al. (1979) defined PWMs as the PWM-based indirect estimators and the order p: r Ϫ s statistics based direct estimators are the same. Mp;r;s E x F x 1 F x 2 where F(x) is the cumulative distribution function of x, with p,r,s being real numbers. 3. L-moment ratios This expression for Mp;r;s (Eq. (2)) is useful in char- acterising the statistical properties of an observed data Analogous to the conventional statistical moment ; ; ; ; set. The quantities {Mp;0;0 p 1 2 3 …} represent ratios, the L-moment ratios can be expressed as: the conventional non-central moments of x. But, in t l =l case of PWMs, the moments of a distribution are L-coefficient of variation 2 2 1 6a A. Sankarasubramanian, K. Srinivasan / Journal of Hydrology 218 (1999) 13–34 15 t l =l L-skewness 3 3 2 6b and L-moments can be obtained in terms of expecta- tions of quadratic functions of order statistics, t l =l ! ! L-kurtosis 4 4 2 6c Án Ϫ1 XX j Ϫ 1 Ϫ 2 E Xs:t Xr:t In general, 1ՅrϽsՅt t 1ՅjϽkՅn r Ϫ 1 8 t l =l r 3; 4; … 7 ! ! r r 2 k Ϫ j Ϫ 1 n Ϫ k × x Ϫ x 2; The sample estimates of L-moment ratios Ϫ Ϫ Ϫ k j ; ; ; s r 1 t s (t2 t3 … tr) can be obtained by replacing l ; l ; ; l ( 1 2 … r) by their respective sample estimates in a distribution free manner. But, these results depend ; ; ; l1 l2 … lr in the previous set of Eqs. (6a)–(6c) and on the joint distributions of order statistics, and hence (7). are extremely complex and the algebra of the same is L-moments are capable of characterising a wider not easily tractable. Such exact results are available range of distributions, compared to the conventional only for the uniform and the exponential distributions. moments. A distribution may be specified by its L- In contrast, the asymptotic distribution theory moments, even if some of its conventional moments provides an acceptable alternative to obtain practical, do not exist (Hosking, 1990). For example, in the case but slightly approximate estimates of variances of of the generalised Pareto distribution, the conven- sample PWMs and L-moments. If the underlying tional skewness is undefined beyond a value of 155 distribution has finite variance, then, it is possible to (shape parameter Ϫ 1/3), while the L-skewness can demonstrate asymptotic normality and also calculate be defined, even beyond that value. Further, L- the asymptotic bias and variance of the sample statis- moments are more robust to outliers in data than tics ar, br, lr and tr, using the set of expressions given conventional moments (Vogel and Fennessey, 1993) by Hosking (1986). The asymptotic theory usually and enable more reliable inferences to be made from provides a good approximation to the exact distribu- small samples about an underlying probability distri- tion of sample PWMs and L-moments for samples of bution. The advantages offered by L-moments over size n Ն 50 and in most cases adequate approximation conventional moments in hypothesis testing, bound- even for n 20 (Hosking, 1986). Using asymptotic edness of moment ratios and identification of distribu- theory, Hosking (1986) has derived approximate tions have been discussed in detail by Hosking (1986; analytical forms for the variance of L-moment and 1990).
Recommended publications
  • Concentration and Consistency Results for Canonical and Curved Exponential-Family Models of Random Graphs
    CONCENTRATION AND CONSISTENCY RESULTS FOR CANONICAL AND CURVED EXPONENTIAL-FAMILY MODELS OF RANDOM GRAPHS BY MICHAEL SCHWEINBERGER AND JONATHAN STEWART Rice University Statistical inference for exponential-family models of random graphs with dependent edges is challenging. We stress the importance of additional structure and show that additional structure facilitates statistical inference. A simple example of a random graph with additional structure is a random graph with neighborhoods and local dependence within neighborhoods. We develop the first concentration and consistency results for maximum likeli- hood and M-estimators of a wide range of canonical and curved exponential- family models of random graphs with local dependence. All results are non- asymptotic and applicable to random graphs with finite populations of nodes, although asymptotic consistency results can be obtained as well. In addition, we show that additional structure can facilitate subgraph-to-graph estimation, and present concentration results for subgraph-to-graph estimators. As an ap- plication, we consider popular curved exponential-family models of random graphs, with local dependence induced by transitivity and parameter vectors whose dimensions depend on the number of nodes. 1. Introduction. Models of network data have witnessed a surge of interest in statistics and related areas [e.g., 31]. Such data arise in the study of, e.g., social networks, epidemics, insurgencies, and terrorist networks. Since the work of Holland and Leinhardt in the 1970s [e.g., 21], it is known that network data exhibit a wide range of dependencies induced by transitivity and other interesting network phenomena [e.g., 39]. Transitivity is a form of triadic closure in the sense that, when a node k is connected to two distinct nodes i and j, then i and j are likely to be connected as well, which suggests that edges are dependent [e.g., 39].
    [Show full text]
  • The Statistical Analysis of Distributions, Percentile Rank Classes and Top-Cited
    How to analyse percentile impact data meaningfully in bibliometrics: The statistical analysis of distributions, percentile rank classes and top-cited papers Lutz Bornmann Division for Science and Innovation Studies, Administrative Headquarters of the Max Planck Society, Hofgartenstraße 8, 80539 Munich, Germany; [email protected]. 1 Abstract According to current research in bibliometrics, percentiles (or percentile rank classes) are the most suitable method for normalising the citation counts of individual publications in terms of the subject area, the document type and the publication year. Up to now, bibliometric research has concerned itself primarily with the calculation of percentiles. This study suggests how percentiles can be analysed meaningfully for an evaluation study. Publication sets from four universities are compared with each other to provide sample data. These suggestions take into account on the one hand the distribution of percentiles over the publications in the sets (here: universities) and on the other hand concentrate on the range of publications with the highest citation impact – that is, the range which is usually of most interest in the evaluation of scientific performance. Key words percentiles; research evaluation; institutional comparisons; percentile rank classes; top-cited papers 2 1 Introduction According to current research in bibliometrics, percentiles (or percentile rank classes) are the most suitable method for normalising the citation counts of individual publications in terms of the subject area, the document type and the publication year (Bornmann, de Moya Anegón, & Leydesdorff, 2012; Bornmann, Mutz, Marx, Schier, & Daniel, 2011; Leydesdorff, Bornmann, Mutz, & Opthof, 2011). Until today, it has been customary in evaluative bibliometrics to use the arithmetic mean value to normalize citation data (Waltman, van Eck, van Leeuwen, Visser, & van Raan, 2011).
    [Show full text]
  • Use of Statistical Tables
    TUTORIAL | SCOPE USE OF STATISTICAL TABLES Lucy Radford, Jenny V Freeman and Stephen J Walters introduce three important statistical distributions: the standard Normal, t and Chi-squared distributions PREVIOUS TUTORIALS HAVE LOOKED at hypothesis testing1 and basic statistical tests.2–4 As part of the process of statistical hypothesis testing, a test statistic is calculated and compared to a hypothesised critical value and this is used to obtain a P- value. This P-value is then used to decide whether the study results are statistically significant or not. It will explain how statistical tables are used to link test statistics to P-values. This tutorial introduces tables for three important statistical distributions (the TABLE 1. Extract from two-tailed standard Normal, t and Chi-squared standard Normal table. Values distributions) and explains how to use tabulated are P-values corresponding them with the help of some simple to particular cut-offs and are for z examples. values calculated to two decimal places. STANDARD NORMAL DISTRIBUTION TABLE 1 The Normal distribution is widely used in statistics and has been discussed in z 0.00 0.01 0.02 0.03 0.050.04 0.05 0.06 0.07 0.08 0.09 detail previously.5 As the mean of a Normally distributed variable can take 0.00 1.0000 0.9920 0.9840 0.9761 0.9681 0.9601 0.9522 0.9442 0.9362 0.9283 any value (−∞ to ∞) and the standard 0.10 0.9203 0.9124 0.9045 0.8966 0.8887 0.8808 0.8729 0.8650 0.8572 0.8493 deviation any positive value (0 to ∞), 0.20 0.8415 0.8337 0.8259 0.8181 0.8103 0.8206 0.7949 0.7872 0.7795 0.7718 there are an infinite number of possible 0.30 0.7642 0.7566 0.7490 0.7414 0.7339 0.7263 0.7188 0.7114 0.7039 0.6965 Normal distributions.
    [Show full text]
  • On Moments of Folded and Truncated Multivariate Normal Distributions
    On Moments of Folded and Truncated Multivariate Normal Distributions Raymond Kan Rotman School of Management, University of Toronto 105 St. George Street, Toronto, Ontario M5S 3E6, Canada (E-mail: [email protected]) and Cesare Robotti Terry College of Business, University of Georgia 310 Herty Drive, Athens, Georgia 30602, USA (E-mail: [email protected]) April 3, 2017 Abstract Recurrence relations for integrals that involve the density of multivariate normal distributions are developed. These recursions allow fast computation of the moments of folded and truncated multivariate normal distributions. Besides being numerically efficient, the proposed recursions also allow us to obtain explicit expressions of low order moments of folded and truncated multivariate normal distributions. Keywords: Multivariate normal distribution; Folded normal distribution; Truncated normal distribution. 1 Introduction T Suppose X = (X1;:::;Xn) follows a multivariate normal distribution with mean µ and k kn positive definite covariance matrix Σ. We are interested in computing E(jX1j 1 · · · jXnj ) k1 kn and E(X1 ··· Xn j ai < Xi < bi; i = 1; : : : ; n) for nonnegative integer values ki = 0; 1; 2;:::. The first expression is the moment of a folded multivariate normal distribution T jXj = (jX1j;:::; jXnj) . The second expression is the moment of a truncated multivariate normal distribution, with Xi truncated at the lower limit ai and upper limit bi. In the second expression, some of the ai's can be −∞ and some of the bi's can be 1. When all the bi's are 1, the distribution is called the lower truncated multivariate normal, and when all the ai's are −∞, the distribution is called the upper truncated multivariate normal.
    [Show full text]
  • On the Scale Parameter of Exponential Distribution
    Review of the Air Force Academy No.2 (34)/2017 ON THE SCALE PARAMETER OF EXPONENTIAL DISTRIBUTION Anca Ileana LUPAŞ Military Technical Academy, Bucharest, Romania ([email protected]) DOI: 10.19062/1842-9238.2017.15.2.16 Abstract: Exponential distribution is one of the widely used continuous distributions in various fields for statistical applications. In this paper we study the exact and asymptotical distribution of the scale parameter for this distribution. We will also define the confidence intervals for the studied parameter as well as the fixed length confidence intervals. 1. INTRODUCTION Exponential distribution is used in various statistical applications. Therefore, we often encounter exponential distribution in applications such as: life tables, reliability studies, extreme values analysis and others. In the following paper, we focus our attention on the exact and asymptotical repartition of the exponential distribution scale parameter estimator. 2. SCALE PARAMETER ESTIMATOR OF THE EXPONENTIAL DISTRIBUTION We will consider the random variable X with the following cumulative distribution function: x F(x ; ) 1 e ( x 0 , 0) (1) where is an unknown scale parameter Using the relationships between MXXX( ) ; 22( ) ; ( ) , we obtain ()X a theoretical variation coefficient 1. This is a useful indicator, especially if MX() you have observational data which seems to be exponential and with variation coefficient of the selection closed to 1. If we consider x12, x ,... xn as a part of a population that follows an exponential distribution, then by using the maximum likelihood estimation method we obtain the following estimate n ˆ 1 xi (2) n i1 119 On the Scale Parameter of Exponential Distribution Since M ˆ , it follows that ˆ is an unbiased estimator for .
    [Show full text]
  • Estimation of Common Location and Scale Parameters in Nonregular Cases Ahmad Razmpour Iowa State University
    Iowa State University Capstones, Theses and Retrospective Theses and Dissertations Dissertations 1982 Estimation of common location and scale parameters in nonregular cases Ahmad Razmpour Iowa State University Follow this and additional works at: https://lib.dr.iastate.edu/rtd Part of the Statistics and Probability Commons Recommended Citation Razmpour, Ahmad, "Estimation of common location and scale parameters in nonregular cases " (1982). Retrospective Theses and Dissertations. 7528. https://lib.dr.iastate.edu/rtd/7528 This Dissertation is brought to you for free and open access by the Iowa State University Capstones, Theses and Dissertations at Iowa State University Digital Repository. It has been accepted for inclusion in Retrospective Theses and Dissertations by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. INFORMATION TO USERS This reproduction was made from a copy of a document sent to us for microfilming. While the most advanced technology has been used to photograph and reproduce this document, the quality of the reproduction is heavily dependent upon the quality of the material submitted. The following explanation of techniques is provided to help clarify markings or notations which may appear on this reproduction. 1. The sign or "target" for pages apparently lacking from the document photographed is "Missing Page(s)". If it was possible to obtain the missing page(s) or section, they are spliced into the film along with adjacent pages. This may have necessitated cutting through an image and duplicating adjacent pages to assure complete continuity. 2. When an image on the film is obliterated with a round black mark, it is an indication of either blurred copy because of movement during exposure, duplicate copy, or copyrighted materials that should not have been filmed.
    [Show full text]
  • A Comparison of Unbiased and Plottingposition Estimators of L
    WATER RESOURCES RESEARCH, VOL. 31, NO. 8, PAGES 2019-2025, AUGUST 1995 A comparison of unbiased and plotting-position estimators of L moments J. R. M. Hosking and J. R. Wallis IBM ResearchDivision, T. J. Watson ResearchCenter, Yorktown Heights, New York Abstract. Plotting-positionestimators of L momentsand L moment ratios have several disadvantagescompared with the "unbiased"estimators. For generaluse, the "unbiased'? estimatorsshould be preferred. Plotting-positionestimators may still be usefulfor estimatingextreme upper tail quantilesin regional frequencyanalysis. Probability-Weighted Moments and L Moments •r+l-" (--1)r • P*r,k Olk '- E p *r,!•[J!•. Probability-weightedmoments of a randomvariable X with k=0 k=0 cumulativedistribution function F( ) and quantile function It is convenient to define dimensionless versions of L mo- x( ) were definedby Greenwoodet al. [1979]to be the quan- tities ments;this is achievedby dividingthe higher-orderL moments by the scale measure h2. The L moment ratios •'r, r = 3, Mp,ra= E[XP{F(X)}r{1- F(X)} s] 4, '", are definedby ßr-" •r/•2 ß {X(u)}PUr(1 -- U)s du. L momentratios measure the shapeof a distributionindepen- dently of its scaleof measurement.The ratios *3 ("L skew- ness")and *4 ("L kurtosis")are nowwidely used as measures Particularlyuseful specialcases are the probability-weighted of skewnessand kurtosis,respectively [e.g., Schaefer,1990; moments Pilon and Adamowski,1992; Royston,1992; Stedingeret al., 1992; Vogeland Fennessey,1993]. 12•r= M1,0, r = •01 (1 - u)rx(u) du, Estimators Given an ordered sample of size n, Xl: n • X2:n • ''' • urx(u) du. X.... there are two establishedways of estimatingthe proba- /3r--- Ml,r, 0 =f01 bility-weightedmoments and L moments of the distribution from whichthe samplewas drawn.
    [Show full text]
  • A Study of Non-Central Skew T Distributions and Their Applications in Data Analysis and Change Point Detection
    A STUDY OF NON-CENTRAL SKEW T DISTRIBUTIONS AND THEIR APPLICATIONS IN DATA ANALYSIS AND CHANGE POINT DETECTION Abeer M. Hasan A Dissertation Submitted to the Graduate College of Bowling Green State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY August 2013 Committee: Arjun K. Gupta, Co-advisor Wei Ning, Advisor Mark Earley, Graduate Faculty Representative Junfeng Shang. Copyright c August 2013 Abeer M. Hasan All rights reserved iii ABSTRACT Arjun K. Gupta, Co-advisor Wei Ning, Advisor Over the past three decades there has been a growing interest in searching for distribution families that are suitable to analyze skewed data with excess kurtosis. The search started by numerous papers on the skew normal distribution. Multivariate t distributions started to catch attention shortly after the development of the multivariate skew normal distribution. Many researchers proposed alternative methods to generalize the univariate t distribution to the multivariate case. Recently, skew t distribution started to become popular in research. Skew t distributions provide more flexibility and better ability to accommodate long-tailed data than skew normal distributions. In this dissertation, a new non-central skew t distribution is studied and its theoretical properties are explored. Applications of the proposed non-central skew t distribution in data analysis and model comparisons are studied. An extension of our distribution to the multivariate case is presented and properties of the multivariate non-central skew t distri- bution are discussed. We also discuss the distribution of quadratic forms of the non-central skew t distribution. In the last chapter, the change point problem of the non-central skew t distribution is discussed under different settings.
    [Show full text]
  • 1 One Parameter Exponential Families
    1 One parameter exponential families The world of exponential families bridges the gap between the Gaussian family and general dis- tributions. Many properties of Gaussians carry through to exponential families in a fairly precise sense. • In the Gaussian world, there exact small sample distributional results (i.e. t, F , χ2). • In the exponential family world, there are approximate distributional results (i.e. deviance tests). • In the general setting, we can only appeal to asymptotics. A one-parameter exponential family, F is a one-parameter family of distributions of the form Pη(dx) = exp (η · t(x) − Λ(η)) P0(dx) for some probability measure P0. The parameter η is called the natural or canonical parameter and the function Λ is called the cumulant generating function, and is simply the normalization needed to make dPη fη(x) = (x) = exp (η · t(x) − Λ(η)) dP0 a proper probability density. The random variable t(X) is the sufficient statistic of the exponential family. Note that P0 does not have to be a distribution on R, but these are of course the simplest examples. 1.0.1 A first example: Gaussian with linear sufficient statistic Consider the standard normal distribution Z e−z2=2 P0(A) = p dz A 2π and let t(x) = x. Then, the exponential family is eη·x−x2=2 Pη(dx) / p 2π and we see that Λ(η) = η2=2: eta= np.linspace(-2,2,101) CGF= eta**2/2. plt.plot(eta, CGF) A= plt.gca() A.set_xlabel(r'$\eta$', size=20) A.set_ylabel(r'$\Lambda(\eta)$', size=20) f= plt.gcf() 1 Thus, the exponential family in this setting is the collection F = fN(η; 1) : η 2 Rg : d 1.0.2 Normal with quadratic sufficient statistic on R d As a second example, take P0 = N(0;Id×d), i.e.
    [Show full text]
  • A Logistic Regression Equation for Estimating the Probability of a Stream in Vermont Having Intermittent Flow
    Prepared in cooperation with the Vermont Center for Geographic Information A Logistic Regression Equation for Estimating the Probability of a Stream in Vermont Having Intermittent Flow Scientific Investigations Report 2006–5217 U.S. Department of the Interior U.S. Geological Survey A Logistic Regression Equation for Estimating the Probability of a Stream in Vermont Having Intermittent Flow By Scott A. Olson and Michael C. Brouillette Prepared in cooperation with the Vermont Center for Geographic Information Scientific Investigations Report 2006–5217 U.S. Department of the Interior U.S. Geological Survey U.S. Department of the Interior DIRK KEMPTHORNE, Secretary U.S. Geological Survey P. Patrick Leahy, Acting Director U.S. Geological Survey, Reston, Virginia: 2006 For product and ordering information: World Wide Web: http://www.usgs.gov/pubprod Telephone: 1-888-ASK-USGS For more information on the USGS--the Federal source for science about the Earth, its natural and living resources, natural hazards, and the environment: World Wide Web: http://www.usgs.gov Telephone: 1-888-ASK-USGS Any use of trade, product, or firm names is for descriptive purposes only and does not imply endorsement by the U.S. Government. Although this report is in the public domain, permission must be secured from the individual copyright owners to reproduce any copyrighted materials contained within this report. Suggested citation: Olson, S.A., and Brouillette, M.C., 2006, A logistic regression equation for estimating the probability of a stream in Vermont having intermittent
    [Show full text]
  • The Asymmetric T-Copula with Individual Degrees of Freedom
    The Asymmetric t-Copula with Individual Degrees of Freedom d-fine GmbH Christ Church University of Oxford A thesis submitted for the degree of MSc in Mathematical Finance Michaelmas 2012 Abstract This thesis investigates asymmetric dependence structures of multivariate asset returns. Evidence of such asymmetry for equity returns has been reported in the literature. In order to model the dependence structure, a new t-copula approach is proposed called the skewed t-copula with individ- ual degrees of freedom (SID t-copula). This copula provides the flexibility to assign an individual degree-of-freedom parameter and an individual skewness parameter to each asset in a multivariate setting. Applying this approach to GARCH residuals of bivariate equity index return data and using maximum likelihood estimation, we find significant asymmetry. By means of the Akaike information criterion, it is demonstrated that the SID t-copula provides the best model for the market data compared to other copula approaches without explicit asymmetry parameters. In addition, it yields a better fit than the conventional skewed t-copula with a single degree-of-freedom parameter. In a model impact study, we analyse the errors which can occur when mod- elling asymmetric multivariate SID-t returns with the symmetric multi- variate Gauss or standard t-distribution. The comparison is done in terms of the risk measures value-at-risk and expected shortfall. We find large deviations between the modelled and the true VaR/ES of a spread posi- tion composed of asymmetrically distributed risk factors. Going from the bivariate case to a larger number of risk factors, the model errors increase.
    [Show full text]
  • On the Meaning and Use of Kurtosis
    Psychological Methods Copyright 1997 by the American Psychological Association, Inc. 1997, Vol. 2, No. 3,292-307 1082-989X/97/$3.00 On the Meaning and Use of Kurtosis Lawrence T. DeCarlo Fordham University For symmetric unimodal distributions, positive kurtosis indicates heavy tails and peakedness relative to the normal distribution, whereas negative kurtosis indicates light tails and flatness. Many textbooks, however, describe or illustrate kurtosis incompletely or incorrectly. In this article, kurtosis is illustrated with well-known distributions, and aspects of its interpretation and misinterpretation are discussed. The role of kurtosis in testing univariate and multivariate normality; as a measure of departures from normality; in issues of robustness, outliers, and bimodality; in generalized tests and estimators, as well as limitations of and alternatives to the kurtosis measure [32, are discussed. It is typically noted in introductory statistics standard deviation. The normal distribution has a kur- courses that distributions can be characterized in tosis of 3, and 132 - 3 is often used so that the refer- terms of central tendency, variability, and shape. With ence normal distribution has a kurtosis of zero (132 - respect to shape, virtually every textbook defines and 3 is sometimes denoted as Y2)- A sample counterpart illustrates skewness. On the other hand, another as- to 132 can be obtained by replacing the population pect of shape, which is kurtosis, is either not discussed moments with the sample moments, which gives or, worse yet, is often described or illustrated incor- rectly. Kurtosis is also frequently not reported in re- ~(X i -- S)4/n search articles, in spite of the fact that virtually every b2 (•(X i - ~')2/n)2' statistical package provides a measure of kurtosis.
    [Show full text]