Chapter 3 Robustness Properties of the Student T Based Pseudo

Total Page:16

File Type:pdf, Size:1020Kb

Chapter 3 Robustness Properties of the Student T Based Pseudo Chapter 3 Robustness Prop erties of the Student t Based Pseudo Maximum Likeliho o d Estimator In Chapter 2, some concepts from the robustness literature were intro duced. An imp ortant concept was the in uence function. In the presentchapter, the in uence function is used to assess the robustness prop erties of the Student t based pseudo maximum likeliho o d estimator with estimated degrees of freedom parameter. This estimator is often employed in the econometric literature as a rst relaxation of the usual normality assumption (see, e.g., de Jong et al. (1992), Kleib ergen and van Dijk (1993), Prucha and Kelejian (1984), and Spanos (1994)). In this chapter I show that the estimator is nonrobust in the sense that it has an unb ounded in uence function and an unb ounded change- 1 of-variance function if the degrees of freedom parameter is estimated rather than xed a priori. This result can already b e established in the setting of the simple lo cation/scale mo del and has obvious consequences for other robust estimators that estimate the tuning constant from the data. At the basis of the ab ove results lies the observation that the score func- tion for the pseudo maximum likeliho o d estimator for the degrees of freedom parameter is unb ounded. As a result, the in uence functions of the degrees of freedom and scale estimator are also unb ounded. In contrast, the in uence function of the lo cation parameter is b ounded due to the blo ck-diagonalityof the Fisher information matrix under the assumption of symmetry. The change- of-variance function of the estimator for the lo cation parameter, however, is unb ounded, suggesting that standard inference pro cedures for the lo cation pa- rameter are nonrobust if they are based on the Student t pseudo maximum likeliho o d estimator with estimated degrees of freedom parameter. These re- sults illustrate two basic p oints. First, one should carefully distinguish b etween parameters of interest and nuisance parameters when assessing the robustness prop erties of statistical pro cedures. Second, if a parameter can b e estimated robustly in the sense that an estimator can b e constructed with a b ounded 1 See Section 3.2 for a de nition of the change-of-variance function. 48 3. STUDENT t BASED MAXIMUM LIKELIHOOD ESTIMATION in uence function, this do es not imply that the corresp onding inference pro- cedure for that parameter is also robust. The chapter is set up as follows. Section 3.1 intro duces the mo del and the pseudo maximum likeliho o d estimator based on the Student t distribution. Section 3.2 derives the in uence function and change-of-variance function and provides a simple nite sample approximation to these functions. Section 3.3 provides a simulation based comparison of several robust and nonrobust estimators for the simple mo del of Section 3.1. Section 3.4 concludes this chapter. 3.1 The Mo del and Estimator Consider the simple lo cation/scale mo del y = + " ; (3:1) t t where is the lo cation parameter, is the scale parameter, f" ; ...;" g is a set 1 T of i.i.d. drawings with unit scale, and T denotes the sample size. Mo del (3.1) is extremely simple, but it suces to illustrate the problems studied in this chapter. The diculties that arise for (3.1) also show up in more complicated mo dels. The usual way of estimating and in (3.1) is by means of ordinary least- squares (OLS). This pro duces the arithmetic sample mean and the sample standard deviation as estimators for and , resp ectively. As describ ed in the previous chapter, the standard OLS estimator is sensitive to outliers in the data. One big outlier is enough to corrupt the estimates completely. In order 2 to reduce the sensitivity of the results to outliers, the class of M estimators was prop osed by Hub er (1964, 1981). In this chapter a sp eci c element from this class is studied, namely the pseudo maximum likeliho o d estimator based on the Student t distribution (MLT estimator). As was describ ed in Section 2.3, an M estimator minimizes T X (y ; ; ; ) (3:2) t t=1 with resp ect to and , with denoting some smo oth function and >0 denoting a user sp eci ed tuning constant. In this chapter ! 1 (+1) +1 2 2 ( ) (y ) t 2 p (y ; ; ; )= ln 1+ ; (3:3) t 2 ( ) 2 2 Metho ds for reducing the p ossibly bad e ects of outliers have a long history,ascanbe seen from the references in Section 1.3 of Hamp el et al. (1986). One of these references dates back to Greek antiquity. 3.1. THE MODEL AND ESTIMATOR 49 with () denoting the gamma function, such that can b e interpreted as the degrees of freedom parameter of a Student t distribution. The rst or- P T der conditions for the MLT estimators for and are (y )=0 and t t=1 P T (y ) = 0, resp ectively, with t t=1 (+ 1)(y ) t (y )= (3:4) t 2 2 +(y ) t and 1 (y ) = (1 + (y ) (y )) t t t 2 2 (y ) t = (3.5) 2 2 +(y ) t Although in the ab ove setup can b e regarded as a user sp eci ed tuning constant that determines the degree of robustness of the M estimator, it is not unusual to estimate together with and (see the references b elow). Several estimators for are available from the literature. Using the pseudo log likeliho o d , the most obvious estimator for is given by the (pseudo) P T maximum likeliho o d (ML) estimator, i.e., the value ^ that solves (y )= t t=1 0, with 2 1 (y ) +1 t (y )= ) ( )ln(1 + ) (y ) ; (3:6) ( t t 2 2 2 2 ( ) the digamma function ( ( )=dln(( ))=d ). , and () the gamma function. This estimator is used in, e.g., Fraser (1976), Little (1988), and Lange, Little, and Taylor (1989). Spanos (1994) used an alternative estimator for based on the sample kurtosis co ecient. A third estimator used in the literature is the one prop osed by Prucha and Kelejian (1984). They emb ed the family of Student t distributions in a more general class of distributions. Their estimator for uses an estimate of the rst absolute moment of the disturbance term. Yet another p ossibility for estimating is by using tail-index estimators 3 for the distribution of y (see, e.g., Gro enendijk, Lucas, and de Vries (1995)). t It is easily checked that the estimator of Spanos (1994) for is nonrobust. The nonrobustness of this estimator follows from the nonrobust estimation of the kurtosis co ecient. Similarly, the estimator used by Prucha and Kelejian 3 It is p erhaps useful to note here that (3.3) is closely linked to the assumption of i.i.d. Student t distributed errors in (3.1). Alternatively, one could study (3.1) under the as- sumption that (" ; ...;" ) has a multivariate Student t distribution with diagonal precision 1 T matrix, such that the errors are uncorrelated rather than indep endent. Zellner (1976, p. 402) proved that in this setting , , and cannot b e estimated simultaneously by means T of ML if only one realization of fy g is available. One way to solve this problem is by t t=1 T using several realizations of fy g , as is p ossible in a panel data context. One can then t t=1 construct a suitable estimator for the degrees of freedom parameter (see, e.g., Sutradhar and Ali (1986)). 50 3. STUDENT t BASED MAXIMUM LIKELIHOOD ESTIMATION (1984, p. 731) for estimating the rst absolute moment is also not robust to outliers, which results in a nonrobust estimator for . Finally, tail-index esti- mators in their usual implementation are intrinsically nonrobust, b ecause they concentrate on observations in the extreme quantiles of the distribution. So 4 the only remaining candidate for robust estimation of is the MLT estimator. The next section demonstrates, however, that the MLT estimator for is also nonrobust. 3.2 A Derivation of the In uence Function > > De ne =(; ;) and (y )= ( (y); (y ); (y )) , then the MLT t t t t P T > ^ estimator =(^; ;^ ^) solves (y )= 0. This section presents the t t=1 IF of the MLT estimator. First, some additional notation is needed. Let 0 > (y )=@ (y )=@ , with t t 0 1 (y ) (y ) (y ) t t t 0 @ A (y )= (y ) (y ) (y ) ; t t t t (y ) (y ) (y ) t t t and 2 2 2 2 2 (y ) = ( + 1)( (y ) )=( +(y ) ) ; t t t 2 2 2 (y ) = 2( + 1)(y )=( +(y ) ) ; t t t 2 2 2 2 2 (y ) = (y )( (y ) )=( +(y ) ) ; t t t t (y ) = ((y ) (y ) (y ))=; t t t t (y ) = (y ) (y )=; t t t 4 Of course, one can ob ject that the other estimators can easily b e extended such that they b ecome robust to at least some extent. For example, one can try to estimate the rst absolute moment of the errors in a robust way,thus `robustifying' the estimator of Prucha and Kelejian (1984). The problem with this approach is that one wants a consistent estimator for for the whole class of Student t distributions, as opp osed to an estimator that is constistent for only one sp eci c Student t distribution. If one constructs a robust estimator for the rst absolute momentbydownweighting extreme observations, one will end up estimating to o high, in general.
Recommended publications
  • Applied Biostatistics Mean and Standard Deviation the Mean the Median Is Not the Only Measure of Central Value for a Distribution
    Health Sciences M.Sc. Programme Applied Biostatistics Mean and Standard Deviation The mean The median is not the only measure of central value for a distribution. Another is the arithmetic mean or average, usually referred to simply as the mean. This is found by taking the sum of the observations and dividing by their number. The mean is often denoted by a little bar over the symbol for the variable, e.g. x . The sample mean has much nicer mathematical properties than the median and is thus more useful for the comparison methods described later. The median is a very useful descriptive statistic, but not much used for other purposes. Median, mean and skewness The sum of the 57 FEV1s is 231.51 and hence the mean is 231.51/57 = 4.06. This is very close to the median, 4.1, so the median is within 1% of the mean. This is not so for the triglyceride data. The median triglyceride is 0.46 but the mean is 0.51, which is higher. The median is 10% away from the mean. If the distribution is symmetrical the sample mean and median will be about the same, but in a skew distribution they will not. If the distribution is skew to the right, as for serum triglyceride, the mean will be greater, if it is skew to the left the median will be greater. This is because the values in the tails affect the mean but not the median. Figure 1 shows the positions of the mean and median on the histogram of triglyceride.
    [Show full text]
  • TR Multivariate Conditional Median Estimation
    Discussion Paper: 2004/02 TR multivariate conditional median estimation Jan G. de Gooijer and Ali Gannoun www.fee.uva.nl/ke/UvA-Econometrics Department of Quantitative Economics Faculty of Economics and Econometrics Universiteit van Amsterdam Roetersstraat 11 1018 WB AMSTERDAM The Netherlands TR Multivariate Conditional Median Estimation Jan G. De Gooijer1 and Ali Gannoun2 1 Department of Quantitative Economics University of Amsterdam Roetersstraat 11, 1018 WB Amsterdam, The Netherlands Telephone: +31—20—525 4244; Fax: +31—20—525 4349 e-mail: [email protected] 2 Laboratoire de Probabilit´es et Statistique Universit´e Montpellier II Place Eug`ene Bataillon, 34095 Montpellier C´edex 5, France Telephone: +33-4-67-14-3578; Fax: +33-4-67-14-4974 e-mail: [email protected] Abstract An affine equivariant version of the nonparametric spatial conditional median (SCM) is con- structed, using an adaptive transformation-retransformation (TR) procedure. The relative performance of SCM estimates, computed with and without applying the TR—procedure, are compared through simulation. Also included is the vector of coordinate conditional, kernel- based, medians (VCCMs). The methodology is illustrated via an empirical data set. It is shown that the TR—SCM estimator is more efficient than the SCM estimator, even when the amount of contamination in the data set is as high as 25%. The TR—VCCM- and VCCM estimators lack efficiency, and consequently should not be used in practice. Key Words: Spatial conditional median; kernel; retransformation; robust; transformation. 1 Introduction p s Let (X1, Y 1),...,(Xn, Y n) be independent replicates of a random vector (X, Y ) IR IR { } ∈ × where p > 1,s > 2, and n>p+ s.
    [Show full text]
  • 5. the Student T Distribution
    Virtual Laboratories > 4. Special Distributions > 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 5. The Student t Distribution In this section we will study a distribution that has special importance in statistics. In particular, this distribution will arise in the study of a standardized version of the sample mean when the underlying distribution is normal. The Probability Density Function Suppose that Z has the standard normal distribution, V has the chi-squared distribution with n degrees of freedom, and that Z and V are independent. Let Z T= √V/n In the following exercise, you will show that T has probability density function given by −(n +1) /2 Γ((n + 1) / 2) t2 f(t)= 1 + , t∈ℝ ( n ) √n π Γ(n / 2) 1. Show that T has the given probability density function by using the following steps. n a. Show first that the conditional distribution of T given V=v is normal with mean 0 a nd variance v . b. Use (a) to find the joint probability density function of (T,V). c. Integrate the joint probability density function in (b) with respect to v to find the probability density function of T. The distribution of T is known as the Student t distribution with n degree of freedom. The distribution is well defined for any n > 0, but in practice, only positive integer values of n are of interest. This distribution was first studied by William Gosset, who published under the pseudonym Student. In addition to supplying the proof, Exercise 1 provides a good way of thinking of the t distribution: the t distribution arises when the variance of a mean 0 normal distribution is randomized in a certain way.
    [Show full text]
  • Estimation of Common Location and Scale Parameters in Nonregular Cases Ahmad Razmpour Iowa State University
    Iowa State University Capstones, Theses and Retrospective Theses and Dissertations Dissertations 1982 Estimation of common location and scale parameters in nonregular cases Ahmad Razmpour Iowa State University Follow this and additional works at: https://lib.dr.iastate.edu/rtd Part of the Statistics and Probability Commons Recommended Citation Razmpour, Ahmad, "Estimation of common location and scale parameters in nonregular cases " (1982). Retrospective Theses and Dissertations. 7528. https://lib.dr.iastate.edu/rtd/7528 This Dissertation is brought to you for free and open access by the Iowa State University Capstones, Theses and Dissertations at Iowa State University Digital Repository. It has been accepted for inclusion in Retrospective Theses and Dissertations by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. INFORMATION TO USERS This reproduction was made from a copy of a document sent to us for microfilming. While the most advanced technology has been used to photograph and reproduce this document, the quality of the reproduction is heavily dependent upon the quality of the material submitted. The following explanation of techniques is provided to help clarify markings or notations which may appear on this reproduction. 1. The sign or "target" for pages apparently lacking from the document photographed is "Missing Page(s)". If it was possible to obtain the missing page(s) or section, they are spliced into the film along with adjacent pages. This may have necessitated cutting through an image and duplicating adjacent pages to assure complete continuity. 2. When an image on the film is obliterated with a round black mark, it is an indication of either blurred copy because of movement during exposure, duplicate copy, or copyrighted materials that should not have been filmed.
    [Show full text]
  • A Comparison of Unbiased and Plottingposition Estimators of L
    WATER RESOURCES RESEARCH, VOL. 31, NO. 8, PAGES 2019-2025, AUGUST 1995 A comparison of unbiased and plotting-position estimators of L moments J. R. M. Hosking and J. R. Wallis IBM ResearchDivision, T. J. Watson ResearchCenter, Yorktown Heights, New York Abstract. Plotting-positionestimators of L momentsand L moment ratios have several disadvantagescompared with the "unbiased"estimators. For generaluse, the "unbiased'? estimatorsshould be preferred. Plotting-positionestimators may still be usefulfor estimatingextreme upper tail quantilesin regional frequencyanalysis. Probability-Weighted Moments and L Moments •r+l-" (--1)r • P*r,k Olk '- E p *r,!•[J!•. Probability-weightedmoments of a randomvariable X with k=0 k=0 cumulativedistribution function F( ) and quantile function It is convenient to define dimensionless versions of L mo- x( ) were definedby Greenwoodet al. [1979]to be the quan- tities ments;this is achievedby dividingthe higher-orderL moments by the scale measure h2. The L moment ratios •'r, r = 3, Mp,ra= E[XP{F(X)}r{1- F(X)} s] 4, '", are definedby ßr-" •r/•2 ß {X(u)}PUr(1 -- U)s du. L momentratios measure the shapeof a distributionindepen- dently of its scaleof measurement.The ratios *3 ("L skew- ness")and *4 ("L kurtosis")are nowwidely used as measures Particularlyuseful specialcases are the probability-weighted of skewnessand kurtosis,respectively [e.g., Schaefer,1990; moments Pilon and Adamowski,1992; Royston,1992; Stedingeret al., 1992; Vogeland Fennessey,1993]. 12•r= M1,0, r = •01 (1 - u)rx(u) du, Estimators Given an ordered sample of size n, Xl: n • X2:n • ''' • urx(u) du. X.... there are two establishedways of estimatingthe proba- /3r--- Ml,r, 0 =f01 bility-weightedmoments and L moments of the distribution from whichthe samplewas drawn.
    [Show full text]
  • The Asymmetric T-Copula with Individual Degrees of Freedom
    The Asymmetric t-Copula with Individual Degrees of Freedom d-fine GmbH Christ Church University of Oxford A thesis submitted for the degree of MSc in Mathematical Finance Michaelmas 2012 Abstract This thesis investigates asymmetric dependence structures of multivariate asset returns. Evidence of such asymmetry for equity returns has been reported in the literature. In order to model the dependence structure, a new t-copula approach is proposed called the skewed t-copula with individ- ual degrees of freedom (SID t-copula). This copula provides the flexibility to assign an individual degree-of-freedom parameter and an individual skewness parameter to each asset in a multivariate setting. Applying this approach to GARCH residuals of bivariate equity index return data and using maximum likelihood estimation, we find significant asymmetry. By means of the Akaike information criterion, it is demonstrated that the SID t-copula provides the best model for the market data compared to other copula approaches without explicit asymmetry parameters. In addition, it yields a better fit than the conventional skewed t-copula with a single degree-of-freedom parameter. In a model impact study, we analyse the errors which can occur when mod- elling asymmetric multivariate SID-t returns with the symmetric multi- variate Gauss or standard t-distribution. The comparison is done in terms of the risk measures value-at-risk and expected shortfall. We find large deviations between the modelled and the true VaR/ES of a spread posi- tion composed of asymmetrically distributed risk factors. Going from the bivariate case to a larger number of risk factors, the model errors increase.
    [Show full text]
  • The Probability Lifesaver: Order Statistics and the Median Theorem
    The Probability Lifesaver: Order Statistics and the Median Theorem Steven J. Miller December 30, 2015 Contents 1 Order Statistics and the Median Theorem 3 1.1 Definition of the Median 5 1.2 Order Statistics 10 1.3 Examples of Order Statistics 15 1.4 TheSampleDistributionoftheMedian 17 1.5 TechnicalboundsforproofofMedianTheorem 20 1.6 TheMedianofNormalRandomVariables 22 2 • Greetings again! In this supplemental chapter we develop the theory of order statistics in order to prove The Median Theorem. This is a beautiful result in its own, but also extremely important as a substitute for the Central Limit Theorem, and allows us to say non- trivial things when the CLT is unavailable. Chapter 1 Order Statistics and the Median Theorem The Central Limit Theorem is one of the gems of probability. It’s easy to use and its hypotheses are satisfied in a wealth of problems. Many courses build towards a proof of this beautiful and powerful result, as it truly is ‘central’ to the entire subject. Not to detract from the majesty of this wonderful result, however, what happens in those instances where it’s unavailable? For example, one of the key assumptions that must be met is that our random variables need to have finite higher moments, or at the very least a finite variance. What if we were to consider sums of Cauchy random variables? Is there anything we can say? This is not just a question of theoretical interest, of mathematicians generalizing for the sake of generalization. The following example from economics highlights why this chapter is more than just of theoretical interest.
    [Show full text]
  • Notes Mean, Median, Mode & Range
    Notes Mean, Median, Mode & Range How Do You Use Mode, Median, Mean, and Range to Describe Data? There are many ways to describe the characteristics of a set of data. The mode, median, and mean are all called measures of central tendency. These measures of central tendency and range are described in the table below. The mode of a set of data Use the mode to show which describes which value occurs value in a set of data occurs most frequently. If two or more most often. For the set numbers occur the same number {1, 1, 2, 3, 5, 6, 10}, of times and occur more often the mode is 1 because it occurs Mode than all the other numbers in the most frequently. set, those numbers are all modes for the data set. If each number in the set occurs the same number of times, the set of data has no mode. The median of a set of data Use the median to show which describes what value is in the number in a set of data is in the middle if the set is ordered from middle when the numbers are greatest to least or from least to listed in order. greatest. If there are an even For the set {1, 1, 2, 3, 5, 6, 10}, number of values, the median is the median is 3 because it is in the average of the two middle the middle when the numbers are Median values. Half of the values are listed in order. greater than the median, and half of the values are less than the median.
    [Show full text]
  • A Bayesian Hierarchical Spatial Copula Model: an Application to Extreme Temperatures in Extremadura (Spain)
    atmosphere Article A Bayesian Hierarchical Spatial Copula Model: An Application to Extreme Temperatures in Extremadura (Spain) J. Agustín García 1,† , Mario M. Pizarro 2,*,† , F. Javier Acero 1,† and M. Isabel Parra 2,† 1 Departamento de Física, Universidad de Extremadura, Avenida de Elvas, 06006 Badajoz, Spain; [email protected] (J.A.G.); [email protected] (F.J.A.) 2 Departamento de Matemáticas, Universidad de Extremadura, Avenida de Elvas, 06006 Badajoz, Spain; [email protected] * Correspondence: [email protected] † These authors contributed equally to this work. Abstract: A Bayesian hierarchical framework with a Gaussian copula and a generalized extreme value (GEV) marginal distribution is proposed for the description of spatial dependencies in data. This spatial copula model was applied to extreme summer temperatures over the Extremadura Region, in the southwest of Spain, during the period 1980–2015, and compared with the spatial noncopula model. The Bayesian hierarchical model was implemented with a Monte Carlo Markov Chain (MCMC) method that allows the distribution of the model’s parameters to be estimated. The results show the GEV distribution’s shape parameter to take constant negative values, the location parameter to be altitude dependent, and the scale parameter values to be concentrated around the same value throughout the region. Further, the spatial copula model chosen presents lower deviance information criterion (DIC) values when spatial distributions are assumed for the GEV distribution’s Citation: García, J.A.; Pizarro, M.M.; location and scale parameters than when the scale parameter is taken to be constant over the region. Acero, F.J.; Parra, M.I. A Bayesian Hierarchical Spatial Copula Model: Keywords: Bayesian hierarchical model; extreme temperature; Gaussian copula; generalized extreme An Application to Extreme value distribution Temperatures in Extremadura (Spain).
    [Show full text]
  • Global Investigation of Return Autocorrelation and Its Determinants
    Global investigation of return autocorrelation and its determinants Pawan Jaina,, Wenjun Xueb aUniversity of Wyoming bFlorida International University ABSTRACT We estimate global return autocorrelation by using the quantile autocorrelation model and investigate its determinants across 43 stock markets from 1980 to 2013. Although our results document a decline in autocorrelation across the entire sample period for all countries, return autocorrelation is significantly larger in emerging markets than in developed markets. The results further document that larger and liquid stock markets have lower return autocorrelation. We also find that price limits in most emerging markets result in higher return autocorrelation. We show that the disclosure requirement, public enforcement, investor psychology, and market characteristics significantly affect return autocorrelation. Our results document that investors from different cultural backgrounds and regulation regimes react differently to corporate disclosers, which affects return autocorrelation. Keywords: Return autocorrelation, Global stock markets, Quantile autoregression model, Legal environment, Investor psychology, Hofstede’s cultural dimensions JEL classification: G12, G14, G15 Corresponding author, E-mail: [email protected]; fax: 307-766-5090. 1. Introduction One of the most striking asset pricing anomalies is the existence of large, positive, short- horizon return autocorrelation in stock portfolios, first documented in Conrad and Kaul (1988) and Lo and MacKinlay (1990). This existence of
    [Show full text]
  • Procedures for Estimation of Weibull Parameters James W
    United States Department of Agriculture Procedures for Estimation of Weibull Parameters James W. Evans David E. Kretschmann David W. Green Forest Forest Products General Technical Report February Service Laboratory FPL–GTR–264 2019 Abstract Contents The primary purpose of this publication is to provide an 1 Introduction .................................................................. 1 overview of the information in the statistical literature on 2 Background .................................................................. 1 the different methods developed for fitting a Weibull distribution to an uncensored set of data and on any 3 Estimation Procedures .................................................. 1 comparisons between methods that have been studied in the 4 Historical Comparisons of Individual statistics literature. This should help the person using a Estimator Types ........................................................ 8 Weibull distribution to represent a data set realize some advantages and disadvantages of some basic methods. It 5 Other Methods of Estimating Parameters of should also help both in evaluating other studies using the Weibull Distribution .......................................... 11 different methods of Weibull parameter estimation and in 6 Discussion .................................................................. 12 discussions on American Society for Testing and Materials Standard D5457, which appears to allow a choice for the 7 Conclusion ................................................................
    [Show full text]
  • Location-Scale Distributions
    Location–Scale Distributions Linear Estimation and Probability Plotting Using MATLAB Horst Rinne Copyright: Prof. em. Dr. Horst Rinne Department of Economics and Management Science Justus–Liebig–University, Giessen, Germany Contents Preface VII List of Figures IX List of Tables XII 1 The family of location–scale distributions 1 1.1 Properties of location–scale distributions . 1 1.2 Genuine location–scale distributions — A short listing . 5 1.3 Distributions transformable to location–scale type . 11 2 Order statistics 18 2.1 Distributional concepts . 18 2.2 Moments of order statistics . 21 2.2.1 Definitions and basic formulas . 21 2.2.2 Identities, recurrence relations and approximations . 26 2.3 Functions of order statistics . 32 3 Statistical graphics 36 3.1 Some historical remarks . 36 3.2 The role of graphical methods in statistics . 38 3.2.1 Graphical versus numerical techniques . 38 3.2.2 Manipulation with graphs and graphical perception . 39 3.2.3 Graphical displays in statistics . 41 3.3 Distribution assessment by graphs . 43 3.3.1 PP–plots and QQ–plots . 43 3.3.2 Probability paper and plotting positions . 47 3.3.3 Hazard plot . 54 3.3.4 TTT–plot . 56 4 Linear estimation — Theory and methods 59 4.1 Types of sampling data . 59 IV Contents 4.2 Estimators based on moments of order statistics . 63 4.2.1 GLS estimators . 64 4.2.1.1 GLS for a general location–scale distribution . 65 4.2.1.2 GLS for a symmetric location–scale distribution . 71 4.2.1.3 GLS and censored samples .
    [Show full text]