Quantile Regression

Total Page:16

File Type:pdf, Size:1020Kb

Quantile Regression QUANTILE REGRESSION AN INTRODUCTION ROGER KOENKER AND KEVIN F. HALLOCK UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN Abstract. Quantile regression as intro duced in Ko enker and Bassett 1978 may b e viewed as a natural extension of classical least squares estimation of conditional mean mo dels to the estimation of an ensemble of mo dels for conditional quantile functions. The central sp ecial case is the median regression estimator that mini- mizes a sum of absolute errors. The remaining conditional quantile functions are estimated by minimizing an asymmetrically weighted sum of absolute errors. Taken together the ensemble of estimated conditional quantile functions o ers a much more complete view of the e ect of covariates on the lo cation, scale and shap e of the dis- tribution of the resp onse variable. This essay provides a brief tutorial intro duction to quantile regression metho ds, illustrating their application in several settings. Practical asp ects of computation, inference, and interpretation are discussed and suggestions for further reading are provided. 1. Introduction In the classical mythology of least-squares regression the conditional mean function, the function that describ es how the mean of y changes with the vector of covariates x, is almost all weneedtoknow ab out the relationship b etween y and x. In the resilient terminology of Frisch 1934 and Ko opmans 1937 it is \the `systematic comp onent' or `true value'," around which y uctuates due to an \erratic comp onent" or \accidental error." The crucial, and convenient, thing ab out this view is that the error is assumed to have precisely the same distribution whatever values maybetaken by the comp onents of the vector x.We refer to this as a pure lo cation shift mo del since it assumes that x a ects only the lo cation of the conditional distribution of y , not its scale, or any other asp ect of its distributional shap e. If this is the case, we can b e fully satis ed with an estimated mo del of the conditional mean function, supplemented p erhaps by an estimate of the conditional disp ersion of y around its mean. Version: Decemb er 28, 2000. Wewould like to thank Gib Bassett, John DiNardo, Olga Geling, and StevePortnoy for helpful comments on earlier drafts. This pap er has b een prepared for the Journal of Economic Perspectives \Symp osium on Econometric To ols". The researchwas partially supp orted by NSF grant SBR-9911184. 1 2 Quantile Regression When we add the further requirement that the errors are Gaussian, least-squares metho ds deliver the maximum likeliho o d estimates of the conditional mean function and achievea well-publicized optimality. Indeed, Gauss seems to have \discovered" the Gaussian densityasan ex post rationalization for the optimality of least-squares metho ds. But we will argue that there is more to econometric life than is dreamt of in this philosophy of the lo cation shift mo del. Covariates may in uence the con- ditional distribution of the resp onse in myriad other ways: expanding its disp ersion as in traditional mo dels of heteroscedasticity, stretching one tail of the distribution, compressing the other tail, and even inducing multimo dality. Explicit investigation of these e ects via quantile regression can provide a more nuanced view of the sto chastic relationship b etween variables, and therefore a more informative empirical analysis. The remainder of the pap er is organized as follows. Section 2 brie y explains how the ordinary quantiles, and consequently the regression quantiles, may b e de ned as the solution to a simple minimization of a weighted sum of absolute residuals. We then illustrate the technique in the classical bivariate setting of Ernst Engel's 1857 original fo o d exp enditure study. In Section 3 wesketch the outline of a more ambitious application to the analysis of infant birthweights in the U.S. In Section 4we o er a brief review of recent empirical applications of quantile regression in economics. Section 5 contains some practical guidance on matters of computation, inference, and software. Section 6 o ers several thumbnail sketches of \what can go wrong" and some p ossible remedies, and Section 7 concludes. 2. Whatisit? Wesay that a student scores at the th quantile of a standardized exam if he p erforms b etter than the prop ortion , of the reference group of students, and worse than the prop ortion 1 . Thus, half of students p erform b etter than the median student, and half p erform worse. Similarly, the quartiles divide the p opulation into four segments with equal prop ortions of the reference p opulation in each segment. The quintiles divide the p opulation into 5 parts; the deciles into 10 parts. The quantiles, or p ercentiles, or o ccasionally fractiles, refer to the general case. Quantile regression seeks to extend these ideas to the estimation of conditional quantile functions {models in which quantiles of the conditional distribution of the resp onse variable are expressed as functions of observed covariates. To accomplish this task weneeda newwayto de ne the quantiles. Quantiles, and their dual identities, the ranks, seem inseparably linked to the op er- ations of ordering and sorting that are generally used to de ne them. So it may come as a mild surprise to observethatwe can de ne the quantiles, and the ranks, through a simple alternative exp edient { as an optimization problem. Just as we can de ne the sample mean as the solution to the problem of minimizing a sum of squared resid- uals, we can de ne the median as the solution to the problem of minimizing a sum of absolute residuals. What ab out the other quantiles? If the symmetric absolute value Roger Koenker and Kevin F. Hallock 3 function yields the median, mayb e we can simply tilt the absolute value to pro duce the other quantiles. This \pinball logic" suggests solving X min 2.1 y i 2< where the function is illustrated in Figure 2.1. To see that this problem yields the sample quantiles as its solutions, it is only necessary to compute the directional derivative of the ob jective function with resp ect to ,taken from the left and from 1 the right. ρτ (u) τ−1 τ Figure 2.1. Quantile Regression Function Having succeeded in de ning the unconditional quantiles as an optimization prob- lem, it is easy to de ne conditional quantiles in an analogous fashion. Least squares regression o ers a mo del for how to pro ceed. If, presented with a random sample fy ;y ;::: ;y g,wesolve 1 2 n n X 2 2.2 min y ; i 2< i=1 1 1 Consider the median case with equal to : the directional derivatives are simply one-half 2 the sum of the signs of the residuals, y , with zero residuals counted as +1 from the rightand i ^ ^ as 1 from the left. If is taken to b e a value such that half the observations lie ab ove and half lie b elow, then b oth directional derivatives will b e p ositive, and since the ob jective function is ^ increasing in either direction must b e a lo cal minimum. But the ob jective function is a sum of convex functions and hence convex, so the lo cal minimum is also a global one. Such a solution is a median by our original de nition. The same argumentworks for the other quantiles, but nowwe have asymmetric weightingofthenumb er of observations with p ositive and negative residuals and ^ this leads to solutions corresp onding to the th quantiles. 4 Quantile Regression · · · · · · · · · · · · · · · · · · · · · · · · ·· ·· ·· · · ··· · ·· · · · · ········ · · Food Expenditure · ·· · · ········ · · · · ·· ········· ·· ·· · ············ ··· ····· ······ · ············ ·· · 500 1000········· 1500········ · 2000 ················· ·· ·········· · ······ 1000 2000 3000 4000 5000 Household Income Figure 2.2. Engel Curves for Fo o d: This gure plots data taken from Ernst Engel's 1857 study of the dep endence of households' fo o d ex- p enditure on household income. we obtain the sample mean, an estimate of the unconditional p opulation mean, EY . If wenow replace the scalar by a parametric function x; and solve n X 2 2.3 y x ; min i i p 2< i=1 we obtain an estimate of the conditional exp ectation function E Y jx. In quantile regression we pro ceed in exactly the same way.To obtain an estimate of the conditional median function, we simply replace the scalar in 2.1 by the 1 parametric function x ; and set to .Variants of this idea were prop osed in i 2 the mid 18th century by Boscovich, and subsequently investigated by Laplace and Edgeworth, among others. To obtain estimates of the other conditional quantile functions we simply replace absolute values by , and solve X 2.4 y x ; min i i p 2< ^ The resulting minimization problem, when x; is formulated as a linear func- tion of parameters, can b e solved very eciently by linear programming metho ds. We will defer further discussion of computational asp ects of quantile regression to Section 5. Roger Koenker and Kevin F. Hallock 5 To illustrate the basic ideas we brie y reconsider a classical empirical application, Ernst Engel's 1857 analysis of the relationship b etween household fo o d exp enditure and household income. In Figure 2.2 we plot the data taken from 235 Europ ean working class households. Sup erimp osed on the plot are seven estimated quantile regression lines corresp onding to the quantiles 2f:05;:1;:25;:5;:75;:9;:95g. The median = :5 t is indicated by the dashed line; the least squares t is plotted as the dotted line.
Recommended publications
  • Data Quality Management in Large-Scale Cyber-Physical Systems
    Data Quality Management in Large-Scale Cyber-Physical Systems Ahmed Abdulhasan Alwan School of Architecture, Computing and Engineering University of East London A thesis presented for the degree of Doctor of Philosophy July 19, 2021 Abstract Cyber-Physical Systems (CPSs) are cross-domain, multi-model, advance informa- tion systems that play a significant role in many large-scale infrastructure sectors of smart cities public services such as traffic control, smart transportation control, and environmental and noise monitoring systems. Such systems, typically, involve a substantial number of sensor nodes and other devices that stream and exchange data in real-time and usually are deployed in uncontrolled, broad environments. Thus, unexpected measurements may occur due to several internal and external factors, including noise, communication errors, and hardware failures, which may compromise these systems quality of data and raise serious concerns related to safety, reliability, performance, and security. In all cases, these unexpected measurements need to be carefully interpreted and managed based on domain knowledge and computational models. Therefore, in this research, data quality challenges were investigated, and a com- prehensive, proof of concept, data quality management system was developed to tackle unaddressed data quality challenges in large-scale CPSs. The data quality management system was designed to address data quality challenges associated with detecting: sensor nodes measurement errors, sensor nodes hardware failures, and mismatches in sensor nodes spatial and temporal contextual attributes. De- tecting sensor nodes measurement errors associated with the primary data quality dimensions of accuracy, timeliness, completeness, and consistency in large-scale CPSs were investigated using predictive and anomaly analysis models via utilising statistical and machine-learning techniques.
    [Show full text]
  • PDF Download
    Table of Contents About ICSEC 2019 ................................................................................................................... I Message from President of Prince of Songkla University .................................................. III Message from Rector of University North, Croatia ............................................................ IV Message from General Chair ................................................................................................ VI Keynote Speakers ................................................................................................................. VII Organizing Committee .......................................................................................................... XI Conference Venue ............................................................................................................... XIII Program at a Glance ........................................................................................................... XVI Technical Sessions Main Track ............................................................................................................................. 1 Special Session on Advanced Digital Media ....................................................................... 33 Special Session on Future SDN: Security, Virtualization, Systems and Architectures ....... 35 Author Index .......................................................................................................................... 36 List of Reviewers
    [Show full text]
  • Cyphyhouse: a Programming, Simulation, and Deployment Toolchain for Heterogeneous Distributed Coordination
    2020 IEEE International Conference on Robotics and Automation (ICRA) 31 May - 31 August, 2020. Paris, France CyPhyHouse: A programming, simulation, and deployment toolchain for heterogeneous distributed coordination Ritwika Ghosh1, Joao P. Jansch-Porto2, Chiao Hsieh1, Amelia Gosse2, Minghao Jiang3, Hebron Taylor2, Peter Du3, Sayan Mitra3, Geir Dullerud2 Abstract— Programming languages, libraries, and develop- With the aim of simplifying application development for ment tools have transformed the application development distributed and heterogeneous systems, we introduce CyPhy- processes for mobile computing and machine learning. This House1—an open source software toolchain for program- paper introduces CyPhyHouse—a toolchain that aims to provide similar programming, debugging, and deployment benefits for ming, simulating, and deploying mobile robotic applications. distributed mobile robotic applications. Users can develop In this work, we target distributed coordination applica- hardware-agnostic, distributed applications using the high-level, tions such as collaborative mapping [8], surveillance, de- event driven Koord programming language, without requiring livery, formation-flight, etc. with aerial drones and ground expertise in controller design or distributed network protocols. vehicles. We believe that for these applications, low-level The modular, platform-independent middleware of CyPhyHouse implements these functionalities using standard algorithms for motion control for the individual robots is standard but path planning (RRT), control (MPC), mutual exclusion, etc. tedious, and coordination across distributed (and possibly A high-fidelity, scalable, multi-threaded simulator for Koord heterogeneous) robots is particularly difficult and error- applications is developed to simulate the same application code prone. This motivates the two key abstractions provided for dozens of heterogeneous agents. The same compiled code by CyPhyHouse: (a) portability of high-level coordination can also be deployed on heterogeneous mobile platforms.
    [Show full text]
  • Behavioral Types in Programming Languages
    Behavioral Types in Programming Languages Behavioral Types in Programming Languages iv Davide Ancona, DIBRIS, Università di Genova, Italy Viviana Bono, Dipartimento di Informatica, Università di Torino, Italy Mario Bravetti, Università di Bologna, Italy / INRIA, France Joana Campos, LaSIGE, Faculdade de Ciências, Universidade de Lisboa, Portugal Giuseppe Castagna, CNRS, IRIF, Univ Paris Diderot, Sorbonne Paris Cité, Paris, France Pierre-Malo Deniélou, Royal Holloway, University of London, UK Simon J. Gay, School of Computing Science, University of Glasgow, UK Nils Gesbert, Université Grenoble Alpes, France Elena Giachino, Università di Bologna, Italy / INRIA, France Raymond Hu, Department of Computing, Imperial College London, UK Einar Broch Johnsen, Institutt for informatikk, Universitetet i Oslo, Norway Francisco Martins, LaSIGE, Faculdade de Ciências, Universidade de Lisboa, Portugal Viviana Mascardi, DIBRIS, Università di Genova, Italy Fabrizio Montesi, University of Southern Denmark Rumyana Neykova, Department of Computing, Imperial College London, UK Nicholas Ng, Department of Computing, Imperial College London, UK Luca Padovani, Dipartimento di Informatica, Università di Torino, Italy Vasco T. Vasconcelos, LaSIGE, Faculdade de Ciências, Universidade de Lisboa, Portugal Nobuko Yoshida, Department of Computing, Imperial College London, UK Boston — Delft Foundations and Trends R in Programming Languages Published, sold and distributed by: now Publishers Inc. PO Box 1024 Hanover, MA 02339 United States Tel. +1-781-985-4510 www.nowpublishers.com [email protected] Outside North America: now Publishers Inc. PO Box 179 2600 AD Delft The Netherlands Tel. +31-6-51115274 The preferred citation for this publication is D. Ancona et al.. Behavioral Types in Programming Languages. Foundations and Trends R in Programming Languages, vol. 3, no.
    [Show full text]
  • Insight MFR By
    Manufacturers, Publishers and Suppliers by Product Category 11/6/2017 10/100 Hubs & Switches ASCEND COMMUNICATIONS CIS SECURE COMPUTING INC DIGIUM GEAR HEAD 1 TRIPPLITE ASUS Cisco Press D‐LINK SYSTEMS GEFEN 1VISION SOFTWARE ATEN TECHNOLOGY CISCO SYSTEMS DUALCOMM TECHNOLOGY, INC. GEIST 3COM ATLAS SOUND CLEAR CUBE DYCONN GEOVISION INC. 4XEM CORP. ATLONA CLEARSOUNDS DYNEX PRODUCTS GIGAFAST 8E6 TECHNOLOGIES ATTO TECHNOLOGY CNET TECHNOLOGY EATON GIGAMON SYSTEMS LLC AAXEON TECHNOLOGIES LLC. AUDIOCODES, INC. CODE GREEN NETWORKS E‐CORPORATEGIFTS.COM, INC. GLOBAL MARKETING ACCELL AUDIOVOX CODI INC EDGECORE GOLDENRAM ACCELLION AVAYA COMMAND COMMUNICATIONS EDITSHARE LLC GREAT BAY SOFTWARE INC. ACER AMERICA AVENVIEW CORP COMMUNICATION DEVICES INC. EMC GRIFFIN TECHNOLOGY ACTI CORPORATION AVOCENT COMNET ENDACE USA H3C Technology ADAPTEC AVOCENT‐EMERSON COMPELLENT ENGENIUS HALL RESEARCH ADC KENTROX AVTECH CORPORATION COMPREHENSIVE CABLE ENTERASYS NETWORKS HAVIS SHIELD ADC TELECOMMUNICATIONS AXIOM MEMORY COMPU‐CALL, INC EPIPHAN SYSTEMS HAWKING TECHNOLOGY ADDERTECHNOLOGY AXIS COMMUNICATIONS COMPUTER LAB EQUINOX SYSTEMS HERITAGE TRAVELWARE ADD‐ON COMPUTER PERIPHERALS AZIO CORPORATION COMPUTERLINKS ETHERNET DIRECT HEWLETT PACKARD ENTERPRISE ADDON STORE B & B ELECTRONICS COMTROL ETHERWAN HIKVISION DIGITAL TECHNOLOGY CO. LT ADESSO BELDEN CONNECTGEAR EVANS CONSOLES HITACHI ADTRAN BELKIN COMPONENTS CONNECTPRO EVGA.COM HITACHI DATA SYSTEMS ADVANTECH AUTOMATION CORP. BIDUL & CO CONSTANT TECHNOLOGIES INC Exablaze HOO TOO INC AEROHIVE NETWORKS BLACK BOX COOL GEAR EXACQ TECHNOLOGIES INC HP AJA VIDEO SYSTEMS BLACKMAGIC DESIGN USA CP TECHNOLOGIES EXFO INC HP INC ALCATEL BLADE NETWORK TECHNOLOGIES CPS EXTREME NETWORKS HUAWEI ALCATEL LUCENT BLONDER TONGUE LABORATORIES CREATIVE LABS EXTRON HUAWEI SYMANTEC TECHNOLOGIES ALLIED TELESIS BLUE COAT SYSTEMS CRESTRON ELECTRONICS F5 NETWORKS IBM ALLOY COMPUTER PRODUCTS LLC BOSCH SECURITY CTC UNION TECHNOLOGIES CO FELLOWES ICOMTECH INC ALTINEX, INC.
    [Show full text]
  • Products Newsletter Emergiing Trends, Cuttiing-Edge Technollogiies, and Research Breakthroughs Iin Power Ellectroniics
    Products Newsletter Emergiing Trends, Cuttiing-Edge Technollogiies, and Research Breakthroughs iin Power Ellectroniics ​ ​ ​ November 20, 2020 | Issue 7 IEEE Power Electronics Magazine To empower next-generation engineers with hands-on skills in control and power electronics, Qing- Chang Zhong, Yeqin Wang, Yiting Dong, Beibei Ren, and Mohammad Amin have developed a versatile experimental tool that lowers the barriers to go real from simulations to experiments for various power electronic systems. Their Smart Grid Research and Educational Kit, which is a reconfigurable, open-source, multifunctional power electronic converter with the capability of directly downloading codes from MATLAB/Simulink. Besides minimizing the time, cost, and efforts needed to develop hardware systems, it removes the burden of coding. Read more in the September 2020 issue of IEEE Power Electronics Magazine! IEEE Transactions on Power Electronics (TPEL) The December 2020 Issue presents 86 papers with the latest research in power electronics! December Highlighted Papers: On Beat Frequency Oscillation of Two-Stage Wireless Power Receivers Kerui Li, Siew-Chong Tan, and Ron Shu Yuen Hui Novel iGSE-C Loss Modeling of X7R Ceramic Capacitors David Menzi, Dominik Bortis, Grayson Zulauf, Morris Heller, and Johann W. Kolar December Papers with Active/Multimedia Content Analysis and Design of the LLC LED Driver Based on State-Space Representation Direct Time- Domain Solution by Maikel Menke, João P. Duranti, Leandro Roggia, Fábio E. Bisogno, Rodrigo V. Tambara, Álysson R. Seidel provides a PowerPoint Presentation elucidating each step development of the proposed design procedure during the design of the LLC LED driver. In addition, the authors offer three Wolfram Mathematica notebook script files.
    [Show full text]
  • Homomorphic Encryption: Working and Analytical Assessment DGHV, Helib, Paillier, FHEW and HE in Cloud Security
    Master of Science in Computer Science February 2017 Homomorphic Encryption: Working and Analytical Assessment DGHV, HElib, Paillier, FHEW and HE in cloud security Srinivas Divya Papisetty Faculty of Computing Blekinge Institute of Technology SE-371 79 Karlskrona Sweden This thesis is submitted to the Faculty of Computing at Blekinge Institute of Technology in partial fulfillment of the requirements for the degree of Master of Science in Computer Science. The thesis is equivalent to 20 weeks of full time studies. Contact Information: Author(s): Srinivas Divya Papisetty E-mail: [email protected] University advisor: Dr. Emiliano Casalicchio Dept. of Computer Science & Engineering Faculty of Computing Internet : www.bth.se Blekinge Institute of Technology Phone : +46 455 38 50 00 SE-371 79 Karlskrona, Sweden Fax : +46 455 38 50 57 i i ABSTRACT Context. Secrecy has kept researchers spanning over centuries engaged in the creation of data protection techniques. With the growing rate of data breach and intervention of adversaries in confidential data storage and communication, efficient data protection has found to be a challenge. Homomorphic encryption is one such data protection technique in the cryptographic domain which can perform arbitrary computations on the enciphered data without disclosing the original plaintext or message. The first working fully homomorphic encryption scheme was proposed in the year 2009 and since then there has been a tremendous increase in the development of homomorphic encryption schemes such that they can be applied to a wide range of data services that demand security. All homomorphic encryption schemes can be categorized as partially homomorphic (PHE), somewhat homomorphic (SHE), leveled homomorphic (LHE), and fully homomorphic encryption (FHE).
    [Show full text]
  • How Machine Learning Has Been Applied in Software Engineering?
    How Machine Learning Has Been Applied in Software Engineering? Olimar Teixeira Borges a, Julia Colleoni Couto b, Duncan Dubugras A. Ruiz c and Rafael Prikladnicki1 d School of Technology, PUCRS, Porto Alegre, Brazil Keywords: Software Engineering, Machine Learning, Mapping Study. Abstract: Machine Learning (ML) environments are composed of a set of techniques and tools, which can help in solving problems in a diversity of areas, including Software Engineering (SE). However, due to a large number of possible configurations, it is a challenge to select the ML environment to be used for a specific SE domain issue. Helping software engineers choose the most suitable ML environment according to their needs would be very helpful. For instance, it is possible to automate software tests using ML models, where the model learns software behavior and predicts possible problems in the code. In this paper, we present a mapping study that categorizes the ML techniques and tools reported as useful to solve SE domain issues. We found that the most used algorithm is Na¨ıve Bayes and that WEKA is the tool most SE researchers use to perform ML experiments related to SE. We also identified that most papers use ML to solve problems related to SE quality. We propose a categorization of the ML techniques and tools that are applied in SE problem solving, linking with the Software Engineering Body of Knowledge (SWEBOK) knowledge areas. 1 INTRODUCTION automated to reduce human effort and project cost. However, to automate these tasks using ML, we need Machine Learning (ML) is an Artificial Intelligence to have the SE project’s data to explore.
    [Show full text]
  • Failure of the Point Blinding Countermeasure Against Fault Attack in Pairing-Based Cryptography Nadia El Mrabet, Emmanuel Fouotsa
    Failure of the Point Blinding Countermeasure Against Fault Attack in Pairing-Based Cryptography Nadia El Mrabet, Emmanuel Fouotsa To cite this version: Nadia El Mrabet, Emmanuel Fouotsa. Failure of the Point Blinding Countermeasure Against Fault Attack in Pairing-Based Cryptography. 2015, 10.1007/978-3-319-18681-8_21. hal-01197148 HAL Id: hal-01197148 https://hal.archives-ouvertes.fr/hal-01197148 Submitted on 11 Sep 2015 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Failure of the Point Blinding Countermeasure against Fault Attack in Pairing-Based Cryptography ? Nadia EL MRABET1 and Emmanuel FOUOTSA2 (1) LIASD − Universit´eParis 8 -France - [email protected] (1) SAS - CMP Gardanne - France - [email protected] (2) Dep of Mathematics, Higher Teacher's Training College, University of Bamenda - Cameroun (2) LMNO − Universit´ede Caen - France - [email protected] Abstract. Pairings are mathematical tools that have been proven to be very useful in the con- struction of many cryptographic protocols. Some of these protocols are suitable for implementation on power constrained devices such as smart cards or smartphone which are subject to side channel attacks. In this paper, we analyse the efficiency of the point blinding countermeasure in pairing based cryptography against side channel attacks.
    [Show full text]
  • Comparison of Three Common Statistical Programs Available to Washington State County Assessors: SAS, SPSS and NCSS
    Washington State Department of Revenue Comparison of Three Common Statistical Programs Available to Washington State County Assessors: SAS, SPSS and NCSS February 2008 Abstract: This summary compares three common statistical software packages available to county assessors in Washington State. This includes SAS, SPSS and NCSS. The majority of the summary is formatted in tables which allow the reader to more easily make comparisons. Information was collected from various sources and in some cases includes opinions on software performance, features and their strengths and weaknesses. This summary was written for Department of Revenue employees and county assessors to compare statistical software packages used by county assessors and as a result should not be used as a general comparison of the software packages. Information not included in this summary includes the support infrastructure, some detailed features (that are not described in this summary) and types of users. Contents General software information.............................................................................................. 3 Statistics and Procedures Components................................................................................ 8 Cost Estimates ................................................................................................................... 10 Other Statistics Software ................................................................................................... 13 General software information Information in this section was
    [Show full text]
  • STATA January 1993 TECHNICAL STB-11 BULLETIN a Publication to Promote Communication Among Stata Users
    STATA January 1993 TECHNICAL STB-11 BULLETIN A publication to promote communication among Stata users Editor Associate Editors Joseph Hilbe J. Theodore Anagnoson, Cal. State Univ., LA Department of Sociology Richard DeLeon, San Francisco State Univ. Arizona State University Paul Geiger, USC School of Medicine Tempe, Arizona 85287 Lawrence C. Hamilton, Univ. of New Hampshire 602-860-1446 FAX Stewart West, Baylor College of Medicine [email protected] EMAIL Subscriptions are available from Stata Corporation, email [email protected], telephone 979-696-4600 or 800-STATAPC, fax 979-696-4601. Current subscription prices are posted at www.stata.com/bookstore/stb.html. Previous Issues are available individually from StataCorp. See www.stata.com/bookstore/stbj.html for details. Submissions to the STB, including submissions to the supporting files (programs, datasets, and help files), are on a nonexclusive, free-use basis. In particular, the author grants to StataCorp the nonexclusive right to copyright and distribute the material in accordance with the Copyright Statement below. The author also grants to StataCorp the right to freely use the ideas, including communication of the ideas to other parties, even if the material is never published in the STB. Submissions should be addressed to the Editor. Submission guidelines can be obtained from either the editor or StataCorp. Copyright Statement. The Stata Technical Bulletin (STB) and the contents of the supporting files (programs, datasets, and help files) are copyright c by StataCorp. The contents of the supporting files (programs, datasets, and help files), may be copied or reproduced by any means whatsoever, in whole or in part, as long as any copy or reproduction includes attribution to both (1) the author and (2) the STB.
    [Show full text]
  • Reproducible Research in Computational Harmonic Analysis
    R EPRODUCIBLE RESEARCH Reproducible Research in Computational Harmonic Analysis Scientific computation is emerging as absolutely central to the scientific method, but the prevalence of very relaxed practices is leading to a credibility crisis. Reproducible computational research, in which all details of computations—code and data—are made conveniently available to others, is a necessary response to this crisis. The authors review their approach to reproducible research and describe how it has evolved over time, discussing the arguments for and against working reproducibly. assive computation is transform- academic scientist likely has more in common ing science, as researchers from with a large corporation’s information technol- numerous fields launch ambitious ogy manager than with a philosophy or English projects involving large-scale com- professor at the same university. Mputations. Emblems of our age include A rapid transition is now under way—visible particularly over the past two decades—that will • data mining for subtle patterns in vast data- finish with computation as absolutely central to bases; and scientific enterprise. However, the transition is • massive simulations of a physical system’s com- very far from completion. In fact, we believe that plete evolution repeated numerous times, as the dominant mode of scientific computing has al- simulation parameters vary systematically. ready brought us to a state of crisis. The prevalence of very relaxed attitudes about communicating ex- The traditional image of the scientist as a soli- perimental details and validating results is causing tary person working in a laboratory with beakers a large and growing credibility gap. It’s impossible and test tubes is long obsolete.
    [Show full text]