Notes on Median and Quantile Regression


James L. Powell
Department of Economics
University of California, Berkeley

Conditional Median Restrictions and Least Absolute Deviations

It is well known that the expected value of a random variable $Y$ minimizes the expected squared deviation between $Y$ and a constant; that is,

$$\mu_Y \equiv E[Y] = \arg\min_c E[(Y - c)^2],$$

assuming $E[Y^2]$ is finite. (In fact, it is only necessary to assume $E|Y|$ is finite, if the minimand is normalized by subtracting $Y^2$, i.e.,

$$\mu_Y \equiv E[Y] = \arg\min_c E[(Y - c)^2 - Y^2] = \arg\min_c \left[c^2 - 2cE[Y]\right],$$

and this normalization has no effect on the solution if the stronger condition holds.) It is less well known that a median of $Y$, defined as any number $\eta_Y$ for which

$$\Pr\{Y \le \eta_Y\} \ge \frac{1}{2} \quad \text{and} \quad \Pr\{Y \ge \eta_Y\} \ge \frac{1}{2},$$

minimizes an expected absolute deviation criterion,

$$\eta_Y = \arg\min_c E[\,|Y - c| - |Y|\,],$$

though the solution to this minimization problem need not be unique. When the c.d.f. $F_Y$ is strictly increasing everywhere (i.e., $Y$ is continuously distributed with positive density), uniqueness is not an issue, and $\eta_Y = F_Y^{-1}(1/2)$. In this case, the first-order condition for minimization of $E[\,|Y - c| - |Y|\,]$ is

$$0 = -E[\operatorname{sgn}(Y - c)],$$

for $\operatorname{sgn}(u)$ the "sign" (or "signum") function

$$\operatorname{sgn}(u) \equiv 1 - 2 \cdot 1\{u < 0\},$$

here defined to be right-continuous.

Thus, just as least squares (LS) estimation is the natural generalization of the sample mean to estimation of regression coefficients, least absolute deviations (LAD) estimation is the generalization of the sample median to the linear regression context. For the linear structural equation

$$y_i = x_i'\beta_0 + \varepsilon_i,$$

if the error terms $\varepsilon_i$ are assumed to have (unique) conditional median zero given the regressors $x_i$, i.e.,

$$E[\operatorname{sgn}(\varepsilon_i) \mid x_i] = 0, \qquad E[\operatorname{sgn}(\varepsilon_i - c) \mid x_i] \ne 0 \ \text{ if } c = c(x_i) \ne 0,$$

then the true regression coefficients $\beta_0$ are identified by

$$\beta_0 = \arg\min_\beta E[\,|y_i - x_i'\beta| - |\varepsilon_i|\,],$$

and, given an i.i.d. sample of size $n$ from this model, a natural sample analogue to $\beta_0$ is

$$\hat\beta \equiv \arg\min_\beta \frac{1}{n}\sum_{i=1}^n |y_i - x_i'\beta| \equiv \arg\min_\beta S_n(\beta),$$

a slightly nonstandard extremum estimator (because the minimand is not twice continuously differentiable for all $\beta$).

Consistency of LAD

Demonstration of consistency of $\hat\beta$ is straightforward, because the LAD minimand $S_n(\beta)$ is clearly continuous in $\beta$ with probability one; in fact, $S_n(\beta)$ is convex in $\beta$, so consistency follows if $S_n$ can be shown to converge pointwise to a function that is uniquely minimized at the true value $\beta_0$. (Typically we need to show uniform convergence, but pointwise convergence of convex functions implies their uniform convergence on compact subsets.)

To prove consistency, we need to impose some conditions on the model; here are some conditions that will suffice:

A1. The data $\{(y_i, x_i')'\}_{i=1}^n$ are independent and identically distributed across $i$;

A2. The regressors have bounded second moment, i.e., $E[\|x_i\|^2] < \infty$;

A3. The error terms $\varepsilon_i$ are continuously distributed given $x_i$, with conditional density $f(\varepsilon \mid x_i)$ satisfying the conditional median restriction, i.e.,

$$\int_{-\infty}^{0} f(\lambda \mid x_i)\,d\lambda = \frac{1}{2};$$

A4. The regressors and error density satisfy a "local identification" condition — namely, the matrix

$$C \equiv E[f(0 \mid x_i)\,x_i x_i']$$

is positive definite.

Note that moments of $y_i$ or $\varepsilon_i$ need not exist under these assumptions, which is why LAD estimation is attractive for heavy-tailed error distributions. Condition A4 combines a "unique median" assumption (implied by positivity of the conditional density $f(\varepsilon \mid x_i)$ at $\varepsilon = 0$) with the usual full-rank assumption on the second moments of the regressors.
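To make the definitions concrete, here is a minimal numerical sketch (our own illustration, not part of the notes; the simulation design and variable names are assumptions): it verifies that the sample median minimizes the average absolute deviation, then computes the LAD estimator $\hat\beta$ by direct minimization of $S_n(\beta)$.

```python
# Sketch only: checks eta_Y = argmin_c E|Y - c| at the sample level, then
# minimizes S_n(beta) = (1/n) sum |y_i - x_i' beta| numerically.
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)

# 1. The sample median as an argmin of the absolute-deviation criterion.
y = rng.standard_t(df=3, size=500)
grid = np.linspace(-2, 2, 2001)
crit = [np.mean(np.abs(y - c)) for c in grid]
print(grid[int(np.argmin(crit))], np.median(y))   # nearly identical

# 2. LAD regression: beta_hat = argmin_b (1/n) sum |y_i - x_i' b|.
n = 500
X = np.column_stack([np.ones(n), rng.normal(size=n)])
beta0 = np.array([1.0, 2.0])
eps = rng.standard_t(df=3, size=n)                # conditional median zero
y_reg = X @ beta0 + eps

S_n = lambda b: np.mean(np.abs(y_reg - X @ b))
beta_hat = minimize(S_n, x0=np.zeros(2), method="Nelder-Mead").x
print(beta_hat)                                   # close to (1, 2)
```

(Nelder–Mead is used because $S_n$ is not differentiable everywhere; any derivative-free or linear-programming method would do.)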
Imposing these conditions, the first step in the consistency proof is to calculate the probability limit of the minimand. To avoid assuming $E[|y_i|] < \infty$, it is convenient to normalize the minimand $S_n(\beta)$ by subtracting off its value at the true parameter $\beta_0$, which clearly does not affect the minimizing value $\hat\beta$. That is,

$$\hat\beta = \arg\min_\beta\,[S_n(\beta) - S_n(\beta_0)] = \arg\min_\beta \frac{1}{n}\sum_{i=1}^n [\,|y_i - x_i'\beta| - |y_i - x_i'\beta_0|\,] = \arg\min_\beta \frac{1}{n}\sum_{i=1}^n [\,|\varepsilon_i - x_i'\delta| - |\varepsilon_i|\,],$$

where $\delta \equiv \beta - \beta_0$. But since

$$-\|x_i\| \cdot \|\delta\| \;\le\; |\varepsilon_i - x_i'\delta| - |\varepsilon_i| \;\le\; \|x_i\| \cdot \|\delta\|$$

by the triangle and Cauchy–Schwarz inequalities, the normalized minimand is a sample average of i.i.d. random variables with finite first (and even second) moments under condition A2, so by Khintchine's Law of Large Numbers,

$$S_n(\beta) - S_n(\beta_0) \overset{p}{\to} \bar S(\delta) \equiv E[S_n(\beta) - S_n(\beta_0)] = E[\,|\varepsilon_i - x_i'\delta| - |\varepsilon_i|\,] = E\big[(\varepsilon_i - x_i'\delta)\operatorname{sgn}\{\varepsilon_i - x_i'\delta\} - \varepsilon_i\operatorname{sgn}\{\varepsilon_i\}\big] = E\big[(\varepsilon_i - x_i'\delta)\,(\operatorname{sgn}\{\varepsilon_i - x_i'\delta\} - \operatorname{sgn}\{\varepsilon_i\})\big] = E\left[2\int_{x_i'\delta}^{0} [\lambda - x_i'\delta]\,f(\lambda \mid x_i)\,d\lambda\right],$$

where the second-to-last equality uses the fact that $E[(x_i'\delta)\operatorname{sgn}\{\varepsilon_i\}] = E[E[(x_i'\delta)\operatorname{sgn}\{\varepsilon_i\} \mid x_i]] = 0$. (The integral in the last equality is well-defined for both positive and negative values of $x_i'\delta$, under the standard convention $\int_a^b dF = -\int_b^a dF$.)

By inspection, the limit $\bar S(\delta)$ equals zero at $\delta = \beta - \beta_0 = 0$, and is non-negative otherwise (since the sign of the integrand is the same as the sign of the lower limit $x_i'\delta$). Furthermore, since $S_n(\beta) - S_n(\beta_0)$ is convex for all $n$, so is its probability limit $\bar S(\beta - \beta_0)$; thus, if $\beta = \beta_0$ is a unique local minimizer, it is also a global minimizer, implying consistency of $\hat\beta$. But by Leibnitz's rule,

$$\frac{\partial \bar S(\delta)}{\partial \delta} = -2E\left[x_i \cdot \int_{x_i'\delta}^{0} f(\lambda \mid x_i)\,d\lambda\right], \qquad \frac{\partial \bar S(0)}{\partial \delta} = 0,$$

and

$$\frac{\partial^2 \bar S(\delta)}{\partial \delta\,\partial \delta'} = 2E[x_i x_i' \cdot f(x_i'\delta \mid x_i)], \qquad \frac{\partial^2 \bar S(0)}{\partial \delta\,\partial \delta'} = 2E[x_i x_i' \cdot f(0 \mid x_i)] \equiv 2C,$$

which is positive definite by condition A4. So $\delta = 0 = \beta - \beta_0$ is indeed a unique local (and global) minimizer of $\bar S(\delta) = \bar S(\beta - \beta_0)$, and thus

$$\hat\beta \overset{p}{\to} \beta_0. \qquad \blacksquare$$

To generalize this consistency result to nonlinear median regression models

$$y_i = g(x_i, \beta_0) + \varepsilon_i, \qquad 0 = E[\operatorname{sgn}\{\varepsilon_i\} \mid x_i],$$

the regularity conditions would have to be strengthened, since convexity of the corresponding LAD minimand $n^{-1}\sum_i |y_i - g(x_i, \beta)|$ in $\beta$ is no longer assured. Standard conditions would include the assumption that the LAD criterion is minimized over a compact parameter space $B$ (and not over all of $\mathbb{R}^p$), and a uniform Lipschitz continuity condition on the median regression function $g(x_i, \beta)$ would typically be imposed, with the Lipschitz term assumed to have finite moments. Finally, the identification condition A4 would have to be strengthened to a global identification condition, such as:

A4′. For some $\tau > 0$, the conditional density $f(\lambda \mid x_i) > \tau$ if $|\lambda| < \tau$, and $\Pr\{|g(x_i, \beta) - g(x_i, \beta_0)| \ge \tau\} > 0$ if $\beta \ne \beta_0$.
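The attraction of LAD under heavy tails can be seen in a small Monte Carlo. The following sketch is our own illustration (the iteratively reweighted least-squares device and all names are assumptions, not something the notes prescribe): with Cauchy errors, which have a median of zero but no moments, the LAD estimate settles down at $\beta_0$ as $n$ grows while OLS remains erratic.

```python
# Sketch only: LAD vs. OLS under Cauchy (median-zero, no-mean) errors.
import numpy as np

rng = np.random.default_rng(1)
beta0 = np.array([1.0, 2.0])

def lad_fit(y, X, iters=200):
    """Crude LAD via iteratively reweighted least squares:
    min sum |r_i| is approximated by weighted LS with weights 1/|r_i|."""
    b = np.linalg.lstsq(X, y, rcond=None)[0]
    for _ in range(iters):
        r = np.abs(y - X @ b)
        sw = 1.0 / np.sqrt(np.maximum(r, 1e-6))   # sqrt of weights 1/|r_i|
        b = np.linalg.lstsq(X * sw[:, None], y * sw, rcond=None)[0]
    return b

for n in [100, 1000, 10000]:
    X = np.column_stack([np.ones(n), rng.normal(size=n)])
    y = X @ beta0 + rng.standard_cauchy(size=n)
    ols = np.linalg.lstsq(X, y, rcond=None)[0]
    print(n, "OLS:", np.round(ols, 2), "LAD:", np.round(lad_fit(y, X), 2))
    # LAD approaches (1, 2) as n grows; OLS typically stays noisy at every n.
```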
Asymptotic Normality of LAD

Returning to the linear LAD estimator, while demonstration of consistency of $\hat\beta$ involves routine application of asymptotic arguments for extremum estimators, demonstration of $\sqrt{n}$-consistency and asymptotic normality is complicated by the fact that the LAD criterion $S_n(\beta)$ is not continuously differentiable in $\beta$.

For comparison, consider the "standard" theory for extremum estimators, where the estimator $\hat\theta$ is defined to minimize (or maximize) a twice-differentiable criterion, e.g.,

$$\hat\theta = \arg\min_{\theta \in \Theta} \frac{1}{n}\sum_{i=1}^n \rho(z_i, \theta),$$

and (for large $n$) to satisfy a first-order condition for an interior minimum,

$$0 = \frac{1}{n}\sum_{i=1}^n \frac{\partial \rho(z_i, \hat\theta)}{\partial \theta} \equiv \frac{1}{n}\sum_{i=1}^n \psi_i(\hat\theta),$$

assuming consistency of $\hat\theta$ for an (interior) parameter $\theta_0$ has been established. The true value $\theta_0$ satisfies the corresponding population first-order condition

$$0 = E[\psi_i(\theta_0)];$$

derivation of the asymptotic distribution of $\hat\theta$ is based upon a Taylor's series expansion of the sample first-order condition for $\hat\theta$ around $\hat\theta = \theta_0$:

$$0 \equiv \frac{1}{n}\sum_{i=1}^n \psi_i(\hat\theta) = \frac{1}{n}\sum_{i=1}^n \psi_i(\theta_0) + \left[\frac{1}{n}\sum_{i=1}^n \frac{\partial \psi_i(\theta_0)}{\partial \theta'}\right](\hat\theta - \theta_0) + o_p(\|\hat\theta - \theta_0\|), \tag{*}$$

which is solved for $\hat\theta$ to yield the asymptotic linearity expression

$$\hat\theta = \theta_0 + H_0^{-1}\,\frac{1}{n}\sum_{i=1}^n \psi_i(\theta_0) + o_p\!\left(\frac{1}{\sqrt{n}}\right),$$

where

$$H_0 \equiv -E\left[\frac{\partial \psi_i(\theta_0)}{\partial \theta'}\right] = -E\left[\frac{\partial^2 \rho(z_i, \theta_0)}{\partial \theta\,\partial \theta'}\right]$$

is minus one times the expected Hessian of the original minimand. From the linearity expression, it follows that

$$\sqrt{n}(\hat\theta - \theta_0) \overset{d}{\to} \mathcal{N}(0,\, H_0^{-1} V_0 H_0^{-1}),$$

where $V_0$ is the asymptotic covariance matrix of the sample average of $\psi_i(\theta_0)$, i.e.,

$$\frac{1}{\sqrt{n}}\sum_{i=1}^n \psi_i(\theta_0) \overset{d}{\to} \mathcal{N}(0, V_0),$$

which is established by appeal to a suitable central limit theorem.

In the LAD case (where $\theta \equiv \beta$), the criterion function

$$\rho(z_i, \theta) = |y_i - x_i'\beta|$$

is not continuously differentiable at values of $\beta$ for which $y_i = x_i'\beta$; furthermore, the (discontinuous) subgradient

$$\frac{\partial \rho(z_i, \theta)}{\partial \theta} = -\operatorname{sgn}\{y_i - x_i'\beta\}\,x_i$$

itself has a derivative that is identically zero wherever it is defined. Thus the Taylor's expansion (*) is not applicable to this problem, even though an approximate first-order condition

$$\frac{1}{n}\sum_{i=1}^n \operatorname{sgn}\{y_i - x_i'\hat\beta\}\,x_i = o_p\!\left(\frac{1}{\sqrt{n}}\right)$$

can be established for this problem.
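This excerpt stops before deriving the LAD limit distribution, but the sandwich form $H_0^{-1} V_0 H_0^{-1}$ suggests what it will be. As a hedged sketch (assuming, beyond anything stated above, homoskedastic errors with density $f$, so that $C = f(0)\,E[x_i x_i']$ and the standard result $\sqrt{n}(\hat\beta - \beta_0) \overset{d}{\to} \mathcal{N}\big(0,\, \tfrac{1}{4}f(0)^{-2}\,E[x_i x_i']^{-1}\big)$ applies — a fact we supply, not derived here), one can estimate LAD standard errors by plugging a kernel estimate of $f(0)$ into this formula:

```python
# Sketch only: plug-in LAD standard errors under assumed homoskedasticity.
# QuantReg at q = 0.5 is median (LAD) regression; f(0) is estimated by a
# Gaussian kernel density applied to the LAD residuals.
import numpy as np
from scipy.stats import gaussian_kde
import statsmodels.api as sm

rng = np.random.default_rng(2)
n = 2000
X = np.column_stack([np.ones(n), rng.normal(size=n)])
beta0 = np.array([1.0, 2.0])
y = X @ beta0 + rng.standard_t(df=3, size=n)

fit = sm.QuantReg(y, X).fit(q=0.5)        # LAD estimate of beta
resid = y - X @ fit.params
f0 = gaussian_kde(resid)(0.0)[0]          # estimated error density at zero

D = X.T @ X / n                           # estimate of E[x_i x_i']
avar = np.linalg.inv(D) / (4.0 * f0**2)   # (1/4) f(0)^{-2} D^{-1}
se = np.sqrt(np.diag(avar) / n)
print(fit.params, se)                     # compare with fit.bse
```

The kernel bandwidth choice matters in practice; statsmodels' own `fit.bse` uses a related plug-in and should be of the same order.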