On the Examination of the Reliability of Statistical Software for Estimating Logistic
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Stan: a Probabilistic Programming Language
JSS Journal of Statistical Software MMMMMM YYYY, Volume VV, Issue II. http://www.jstatsoft.org/ Stan: A Probabilistic Programming Language Bob Carpenter Andrew Gelman Matt Hoffman Columbia University Columbia University Adobe Research Daniel Lee Ben Goodrich Michael Betancourt Columbia University Columbia University University of Warwick Marcus A. Brubaker Jiqiang Guo Peter Li University of Toronto, NPD Group Columbia University Scarborough Allen Riddell Dartmouth College Abstract Stan is a probabilistic programming language for specifying statistical models. A Stan program imperatively defines a log probability function over parameters conditioned on specified data and constants. As of version 2.2.0, Stan provides full Bayesian inference for continuous-variable models through Markov chain Monte Carlo methods such as the No-U-Turn sampler, an adaptive form of Hamiltonian Monte Carlo sampling. Penalized maximum likelihood estimates are calculated using optimization methods such as the Broyden-Fletcher-Goldfarb-Shanno algorithm. Stan is also a platform for computing log densities and their gradients and Hessians, which can be used in alternative algorithms such as variational Bayes, expectation propa- gation, and marginal inference using approximate integration. To this end, Stan is set up so that the densities, gradients, and Hessians, along with intermediate quantities of the algorithm such as acceptance probabilities, are easily accessible. Stan can be called from the command line, through R using the RStan package, or through Python using the PyStan package. All three interfaces support sampling and optimization-based inference. RStan and PyStan also provide access to log probabilities, gradients, Hessians, and data I/O. Keywords: probabilistic program, Bayesian inference, algorithmic differentiation, Stan. -
Introduction Rats Version 9.0
RATS VERSION 9.0 INTRODUCTION RATS VERSION 9.0 INTRODUCTION Estima 1560 Sherman Ave., Suite 510 Evanston, IL 60201 Orders, Sales Inquiries 800–822–8038 Web: www.estima.com General Information 847–864–8772 Sales: [email protected] Technical Support 847–864–1910 Technical Support: [email protected] Fax: 847–864–6221 © 2014 by Estima. All Rights Reserved. No part of this book may be reproduced or transmitted in any form or by any means with- out the prior written permission of the copyright holder. Estima 1560 Sherman Ave., Suite 510 Evanston, IL 60201 Published in the United States of America Preface Welcome to Version 9 of rats. We went to a three-book manual set with Version 8 (this Introduction, the User’s Guide and the Reference Manual; and we’ve continued that into Version 9. However, we’ve made some changes in emphasis to reflect the fact that most of our users now use electronic versions of the manuals. And, with well over a thousand example programs, the most common way for people to use rats is to pick an existing program and modify it. With each new major version, we need to decide what’s new and needs to be ex- plained, what’s important and needs greater emphasis, and what’s no longer topical and can be moved out of the main documentation. For Version 9, the chapters in the User’s Guide that received the most attention were “arch/garch and related mod- els” (Chapter 9), “Threshold, Breaks and Switching” (Chapter 11), and “Cross Section and Panel Data” (Chapter 12). -
Department of Geography
Department of Geography UNIVERSITY OF FLORIDA, SPRING 2019 GEO 4167c section #09A6 / GEO 6161 section # 09A9 (3.0 credit hours) Course# 15235/15271 Intermediate Quantitative Methods Instructor: Timothy J. Fik, Ph.D. (Associate Professor) Prerequisite: GEO 3162 / GEO 6160 or equivalent Lecture Time/Location: Tuesdays, Periods 3-5: 9:35AM-12:35PM / Turlington 3012 Instructor’s Office: 3137 Turlington Hall Instructor’s e-mail address: [email protected] Formal Office Hours Tuesdays -- 1:00PM – 4:30PM Thursdays -- 1:30PM – 3:00PM; and 4:00PM – 4:30PM Course Materials (Power-point presentations in pdf format) will be uploaded to the on-line course Lecture folder on Canvas. Course Overview GEO 4167x/GEO 6161 surveys various statistical modeling techniques that are widely used in the social, behavioral, and environmental sciences. Lectures will focus on several important topics… including common indices of spatial association and dependence, linear and non-linear model development, model diagnostics, and remedial measures. The lectures will largely be devoted to the topic of Regression Analysis/Econometrics (and the General Linear Model). Applications will involve regression models using cross-sectional, quantitative, qualitative, categorical, time-series, and/or spatial data. Selected topics include, yet are not limited to, the following: Classic Least Squares Regression plus Extensions of the General Linear Model (GLM) Matrix Algebra approach to Regression and the GLM Join-Count Statistics (Dacey’s Contiguity Tests) Spatial Autocorrelation / Regression -
The Evolution of Econometric Software Design: a Developer's View
Journal of Economic and Social Measurement 29 (2004) 205–259 205 IOS Press The evolution of econometric software design: A developer’s view Houston H. Stokes Department of Economics, College of Business Administration, University of Illinois at Chicago, 601 South Morgan Street, Room 2103, Chicago, IL 60607-7121, USA E-mail: [email protected] In the last 30 years, changes in operating systems, computer hardware, compiler technology and the needs of research in applied econometrics have all influenced econometric software development and the environment of statistical computing. The evolution of various representative software systems, including B34S developed by the author, are used to illustrate differences in software design and the interrelation of a number of factors that influenced these choices. A list of desired econometric software features, software design goals and econometric programming language characteristics are suggested. It is stressed that there is no one “ideal” software system that will work effectively in all situations. System integration of statistical software provides a means by which capability can be leveraged. 1. Introduction 1.1. Overview The development of modern econometric software has been influenced by the changing needs of applied econometric research, the expanding capability of com- puter hardware (CPU speed, disk storage and memory), changes in the design and capability of compilers, and the availability of high-quality subroutine libraries. Soft- ware design in turn has itself impacted applied econometric research, which has seen its horizons expand rapidly in the last 30 years as new techniques of analysis became computationally possible. How some of these interrelationships have evolved over time is illustrated by a discussion of the evolution of the design and capability of the B34S Software system [55] which is contrasted to a selection of other software systems. -
International Journal of Forecasting Guidelines for IJF Software Reviewers
International Journal of Forecasting Guidelines for IJF Software Reviewers It is desirable that there be some small degree of uniformity amongst the software reviews in this journal, so that regular readers of the journal can have some idea of what to expect when they read a software review. In particular, I wish to standardize the second section (after the introduction) of the review, and the penultimate section (before the conclusions). As stand-alone sections, they will not materially affect the reviewers abillity to craft the review as he/she sees fit, while still providing consistency between reviews. This applies mostly to single-product reviews, but some of the ideas presented herein can be successfully adapted to a multi-product review. The second section, Overview, is an overview of the package, and should include several things. · Contact information for the developer, including website address. · Platforms on which the package runs, and corresponding prices, if available. · Ancillary programs included with the package, if any. · The final part of this section should address Berk's (1987) list of criteria for evaluating statistical software. Relevant items from this list should be mentioned, as in my review of RATS (McCullough, 1997, pp.182- 183). · My use of Berk was extremely terse, and should be considered a lower bound. Feel free to amplify considerably, if the review warrants it. In fact, Berk's criteria, if considered in sufficient detail, could be the outline for a review itself. The penultimate section, Numerical Details, directly addresses numerical accuracy and reliality, if these topics are not addressed elsewhere in the review. -
Omegahat Packages for R
News The Newsletter of the R Project Volume 1/1, January 2001 Editorial by Kurt Hornik and Friedrich Leisch As all of R, R News is a volunteer project. The editorial board currently consists of the R core devel- Welcome to the first volume of R News, the newslet- opment team plus Bill Venables. We are very happy ter of the R project for statistical computing. R News that Bill—one of the authorities on programming the will feature short to medium length articles covering S language—has offered himself as editor of “Pro- topics that might be of interest to users or developers grammer’s Niche”, a regular column on R/S pro- of R, including gramming. This first volume already features a broad range Changes in R: new features of the latest release • of different articles, both from R core members and other developers in the R community (without Changes on CRAN: new add-on packages, • whom R would never have grown to what it is now). manuals, binary distributions, mirrors, . The success of R News critically depends on the ar- Add-on packages: short introductions to or re- ticles in it, hence we want to ask all of you to sub- • views of R extension packages mit to R News. There is no formal reviewing pro- cess yet, however articles will be reviewed by the ed- Programmer’s Niche: nifty hints for program- itorial board to ensure the quality of the newsletter. • ming in R (or S) Submissions should simply be sent to the editors by email, see the article on page 30 for details on how to Applications: Examples of analyzing data with • write articles. -
Estimating Regression Models for Categorical Dependent Variables Using SAS, Stata, LIMDEP, and SPSS*
© 2003-2008, The Trustees of Indiana University Regression Models for Categorical Dependent Variables: 1 Estimating Regression Models for Categorical Dependent Variables Using SAS, Stata, LIMDEP, and SPSS* Hun Myoung Park (kucc625) This document summarizes regression models for categorical dependent variables and illustrates how to estimate individual models using SAS 9.1, Stata 10.0, LIMDEP 9.0, and SPSS 16.0. 1. Introduction 2. The Binary Logit Model 3. The Binary Probit Model 4. Bivariate Logit/Probit Models 5. Ordered Logit/Probit Models 6. The Multinomial Logit Model 7. The Conditional Logit Model 8. The Nested Logit Model 9. Conclusion 1. Introduction A categorical variable here refers to a variable that is binary, ordinal, or nominal. Event count data are discrete (categorical) but often treated as continuous variables. When a dependent variable is categorical, the ordinary least squares (OLS) method can no longer produce the best linear unbiased estimator (BLUE); that is, OLS is biased and inefficient. Consequently, researchers have developed various regression models for categorical dependent variables. The nonlinearity of categorical dependent variable models (CDVMs) makes it difficult to fit the models and interpret their results. 1.1 Regression Models for Categorical Dependent Variables In CDVMs, the left-hand side (LHS) variable or dependent variable is neither interval nor ratio, but rather categorical. The level of measurement and data generation process (DGP) of a dependent variable determines the proper type of CDVM. Binary responses (0 or 1) are modeled with binary logit and probit regressions, ordinal responses (1st, 2nd, 3rd, …) are formulated into (generalized) ordered logit/probit regressions, and nominal responses are analyzed by multinomial logit, conditional logit, or nested logit models depending on specific circumstances. -
Kwame Nkrumah University of Science and Technology, Kumasi
KWAME NKRUMAH UNIVERSITY OF SCIENCE AND TECHNOLOGY, KUMASI, GHANA Assessing the Social Impacts of Illegal Gold Mining Activities at Dunkwa-On-Offin by Judith Selassie Garr (B.A, Social Science) A Thesis submitted to the Department of Building Technology, College of Art and Built Environment in partial fulfilment of the requirement for a degree of MASTER OF SCIENCE NOVEMBER, 2018 DECLARATION I hereby declare that this work is the result of my own original research and this thesis has neither in whole nor in part been prescribed by another degree elsewhere. References to other people’s work have been duly cited. STUDENT: JUDITH S. GARR (PG1150417) Signature: ........................................................... Date: .................................................................. Certified by SUPERVISOR: PROF. EDWARD BADU Signature: ........................................................... Date: ................................................................... Certified by THE HEAD OF DEPARTMENT: PROF. B. K. BAIDEN Signature: ........................................................... Date: ................................................................... i ABSTRACT Mining activities are undertaken in many parts of the world where mineral deposits are found. In developing nations such as Ghana, the activity is done both legally and illegally, often with very little or no supervision, hence much damage is done to the water bodies where the activities are carried out. This study sought to assess the social impacts of illegal gold mining activities at Dunkwa-On-Offin, the capital town of Upper Denkyira East Municipality in the Central Region of Ghana. The main objectives of the research are to identify factors that trigger illegal mining; to identify social effects of illegal gold mining activities on inhabitants of Dunkwa-on-Offin; and to suggest effective ways in curbing illegal mining activities. Based on the approach to data collection, this study adopts both the quantitative and qualitative approach. -
Investigating Data Management Practices in Australian Universities
Investigating Data Management Practices in Australian Universities Margaret Henty, The Australian National University Belinda Weaver, The University of Queensland Stephanie Bradbury, Queensland University of Technology Simon Porter, The University of Melbourne http://www.apsr.edu.au/investigating_data_management July, 2008 ii Table of Contents Introduction ...................................................................................... 1 About the survey ................................................................................ 1 About this report ................................................................................ 1 The respondents................................................................................. 2 The survey results............................................................................... 2 Tables and Comments .......................................................................... 3 Digital data .................................................................................... 3 Non-digital data forms....................................................................... 3 Types of digital data ......................................................................... 4 Size of data collection ....................................................................... 5 Software used for analysis or manipulation .............................................. 6 Software storage and retention ............................................................ 7 Research Data Management Plans......................................................... -
Package 'Ecdat'
Package ‘Ecdat’ November 3, 2020 Version 0.3-9 Date 2020-11-02 Title Data Sets for Econometrics Author Yves Croissant <[email protected]> and Spencer Graves Maintainer Spencer Graves <[email protected]> Depends R (>= 3.5.0), Ecfun Suggests Description Data sets for econometrics, including political science. LazyData true License GPL (>= 2) Language en-us URL https://www.r-project.org NeedsCompilation no Repository CRAN Date/Publication 2020-11-03 06:40:34 UTC R topics documented: Accident . .4 AccountantsAuditorsPct . .5 Airline . .7 Airq.............................................8 bankingCrises . .9 Benefits . 10 Bids............................................. 11 breaches . 12 BudgetFood . 15 BudgetItaly . 16 BudgetUK . 17 Bwages . 18 1 2 R topics documented: Capm ............................................ 19 Car.............................................. 20 Caschool . 21 Catsup . 22 Cigar ............................................ 23 Cigarette . 24 Clothing . 25 Computers . 26 Consumption . 27 coolingFromNuclearWar . 27 CPSch3 . 28 Cracker . 29 CRANpackages . 30 Crime . 31 CRSPday . 33 CRSPmon . 34 Diamond . 35 DM ............................................. 36 Doctor . 37 DoctorAUS . 38 DoctorContacts . 39 Earnings . 40 Electricity . 41 Fair ............................................. 42 Fatality . 43 FinancialCrisisFiles . 44 Fishing . 45 Forward . 46 FriendFoe . 47 Garch . 48 Gasoline . 49 Griliches . 50 Grunfeld . 51 HC.............................................. 52 Heating . 53 -
In This Issue
SCASA: SOUTHERN CALIFORNIA CHA P- E-Tidings Newsletter TER OF THE AMERI- CAN STATISTICAL ASSOCIATION SCASA Events and News VOLUME 8, ISSUES 1 - 2 JANUARY - FEBRUARY 2019 In This Issue Page 2: New SCASA board Pages 3-4: Presidential address Page 5: Book Club Page 6: Online store Page 7: Statistics Poster competition Page 8: DataFest Page 9: Careers Day Page 10: Applied Statistics Workshop Page 11: Traveling course Page 12: Job Opening Announcement Page 13: Dr. Normalcurvesaurus, Ph.D. presents The answer is at the bottom of this issue. http://community.amstat.org/scasa/newsletters VOLUME 8, ISSUES 1 - 2 P A G E 2 SCASA Officers 2019-2020 : CONGRATULATIONS TO ALL ELECTED AND RE-ELECTED!!! We have the newly elected SCASA board!!! Congratulations to Everyone!!! President: James Joseph, AKAKIA [[email protected]] President-Elect: Rebecca Le, County of Riverside [[email protected]] Immediate Past President: Olga Korosteleva, CSULB [[email protected]] Treasurer: Olga Korosteleva, CSULB [[email protected]] Secretary: Michael Tsiang, UCLA [[email protected]] Vice President of Professional Affairs: Anna Liza Antonio, Enterprise Analytics [[email protected]] Vice President of Academic Affairs: Shujie Ma, UCR [[email protected]] Vice President for Student Affairs: Anna Yu Lee, APU and Claremont Graduate University [[email protected]] The ASA Council of Chapters Representative: Harold Dyck, CSUSB [[email protected]] ENewsletter Editor-in-Chief: Olga Korosteleva, CSULB [[email protected]] Chair of the Applied Statistics Workshop Committee: James Joseph, AKAKIA [[email protected]] Treasurer of the Applied Statistics Workshop: Rebecca Le, County of Riverside [[email protected]] Webmaster: Anthony Doan, CSULB [[email protected]] http://community.amstat.org/scasa/newsletters P A G E 3 VOLUME 8, ISSUES 1 - 2 “GROW STRONG” Presidential Address Southern California may be the most diverse job market in the United States, if not the world. -
Applied Econometrics Using MATLAB
Applied Econometrics using MATLAB James P. LeSage Department of Economics University of Toledo CIRCULATED FOR REVIEW October, 1998 2 Preface This text describes a set of MATLAB functions that implement a host of econometric estimation methods. Toolboxes are the name given by the MathWorks to related sets of MATLAB functions aimed at solving a par- ticular class of problems. Toolboxes of functions useful in signal processing, optimization, statistics, nance and a host of other areas are available from the MathWorks as add-ons to the standard MATLAB software distribution. I use the termEconometrics Toolbox to refer to the collection of function libraries described in this book. The intended audience is faculty and students using statistical methods, whether they are engaged in econometric analysis or more general regression modeling. The MATLAB functions described in this book have been used in my own research as well as teaching both undergraduate and graduate econometrics courses. Researchers currently using Gauss, RATS, TSP, or SAS/IML for econometric programming might nd switching to MATLAB advantageous. MATLAB software has always had excellent numerical algo- rithms, and has recently been extended to include: sparse matrix algorithms, very good graphical capabilities, and a complete set of object oriented and graphical user-interface programming tools. MATLAB software is available on a wide variety of computing platforms including mainframe, Intel, Apple, and Linux or Unix workstations. When contemplating a change in software, there is always the initial investment in developing a set of basic routines and functions to support econometric analysis. It is my hope that the routines in the Econometrics Toolbox provide a relatively complete set of basic econometric analysis tools.