Profiling Heteroscedasticity in Linear Regression Models
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Generalized Linear Models and Generalized Additive Models
00:34 Friday 27th February, 2015 Copyright ©Cosma Rohilla Shalizi; do not distribute without permission updates at http://www.stat.cmu.edu/~cshalizi/ADAfaEPoV/ Chapter 13 Generalized Linear Models and Generalized Additive Models [[TODO: Merge GLM/GAM and Logistic Regression chap- 13.1 Generalized Linear Models and Iterative Least Squares ters]] [[ATTN: Keep as separate Logistic regression is a particular instance of a broader kind of model, called a gener- chapter, or merge with logis- alized linear model (GLM). You are familiar, of course, from your regression class tic?]] with the idea of transforming the response variable, what we’ve been calling Y , and then predicting the transformed variable from X . This was not what we did in logis- tic regression. Rather, we transformed the conditional expected value, and made that a linear function of X . This seems odd, because it is odd, but it turns out to be useful. Let’s be specific. Our usual focus in regression modeling has been the condi- tional expectation function, r (x)=E[Y X = x]. In plain linear regression, we try | to approximate r (x) by β0 + x β. In logistic regression, r (x)=E[Y X = x] = · | Pr(Y = 1 X = x), and it is a transformation of r (x) which is linear. The usual nota- tion says | ⌘(x)=β0 + x β (13.1) · r (x) ⌘(x)=log (13.2) 1 r (x) − = g(r (x)) (13.3) defining the logistic link function by g(m)=log m/(1 m). The function ⌘(x) is called the linear predictor. − Now, the first impulse for estimating this model would be to apply the transfor- mation g to the response. -
Publishing and Promotion in Economics: the Curse of the Top Five
Publishing and Promotion in Economics: The Curse of the Top Five James J. Heckman 2017 AEA Annual Meeting Chicago, IL January 7th, 2017 Heckman Curse of the Top Five Top 5 Influential, But Far From Sole Source of Influence or Outlet for Creativity Heckman Curse of the Top Five Table 1: Ranking of 2, 5 and 10 Year Impact Factors as of 2015 Rank 2 Years 5 Years 10 Years 1. JEL JEL JEL 2. QJE QJE QJE 3. JOF JOF JOF 4. JEP JEP JPE 5. ReStud JPE JEP 6. ECMA AEJae ECMA 7. AEJae ECMA AER 8. AER AER ReStud 9. JPE ReStud JOLE 10. JOLE AEJma EJ 11. AEJep AEJep JHR 12. AEJma EJ JOE 13. JME JOLE JME 14. EJ JHR HE 15. HE JME RED 16. JHR HE EER 17. JOE JOE - 18. AEJmi AEJmi - 19. RED RED - 20. EER EER - Note: Definition of abbreviated names: JEL - Journal of Economic Literature, JOF - Journal of Finance, JEP - Journal of Economic Perspectives, AEJae-American Economic Journal Applied Economics, AER - American Economic Review, JOLE-Journal of Labor Economics, AEJep-American Economic Journal Economic Policy, AEJma-American Economic Journal Macroeconomics, JME-Journal of Monetary Economics, EJ-Economic Journal, HE-Health Economics, JHR-Journal of Human Resources, JOE-Journal of Econometrics, AEJmi-American Economic Journal Microeconomics, RED-Review of Economic Dynamics, EER-European Economic Review; Source: Journal Citation Reports (Thomson Reuters, 2016). Heckman Curse of the Top Five Figure 1: Articles Published in Last 10 years by RePEc's T10 Authors (Last 10 Years Ranking) (a) T10 Authors (Unadjusted) (b) T10 Authors (Adjusted) Prop. -
Stochastic Process - Introduction
Stochastic Process - Introduction • Stochastic processes are processes that proceed randomly in time. • Rather than consider fixed random variables X, Y, etc. or even sequences of i.i.d random variables, we consider sequences X0, X1, X2, …. Where Xt represent some random quantity at time t. • In general, the value Xt might depend on the quantity Xt-1 at time t-1, or even the value Xs for other times s < t. • Example: simple random walk . week 2 1 Stochastic Process - Definition • A stochastic process is a family of time indexed random variables Xt where t belongs to an index set. Formal notation, { t : ∈ ItX } where I is an index set that is a subset of R. • Examples of index sets: 1) I = (-∞, ∞) or I = [0, ∞]. In this case Xt is a continuous time stochastic process. 2) I = {0, ±1, ±2, ….} or I = {0, 1, 2, …}. In this case Xt is a discrete time stochastic process. • We use uppercase letter {Xt } to describe the process. A time series, {xt } is a realization or sample function from a certain process. • We use information from a time series to estimate parameters and properties of process {Xt }. week 2 2 Probability Distribution of a Process • For any stochastic process with index set I, its probability distribution function is uniquely determined by its finite dimensional distributions. •The k dimensional distribution function of a process is defined by FXX x,..., ( 1 x,..., k ) = P( Xt ≤ 1 ,..., xt≤ X k) x t1 tk 1 k for anyt 1 ,..., t k ∈ I and any real numbers x1, …, xk . -
Analysis of Imbalance Strategies Recommendation Using a Meta-Learning Approach
7th ICML Workshop on Automated Machine Learning (2020) Analysis of Imbalance Strategies Recommendation using a Meta-Learning Approach Afonso Jos´eCosta [email protected] CISUC, Department of Informatics Engineering, University of Coimbra, Portugal Miriam Seoane Santos [email protected] CISUC, Department of Informatics Engineering, University of Coimbra, Portugal Carlos Soares [email protected] Fraunhofer AICOS and LIACC, Faculty of Engineering, University of Porto, Portugal Pedro Henriques Abreu [email protected] CISUC, Department of Informatics Engineering, University of Coimbra, Portugal Abstract Class imbalance is known to degrade the performance of classifiers. To address this is- sue, imbalance strategies, such as oversampling or undersampling, are often employed to balance the class distribution of datasets. However, although several strategies have been proposed throughout the years, there is no one-fits-all solution for imbalanced datasets. In this context, meta-learning arises a way to recommend proper strategies to handle class imbalance, based on data characteristics. Nonetheless, in related work, recommendation- based systems do not focus on how the recommendation process is conducted, thus lacking interpretable knowledge. In this paper, several meta-characteristics are identified in order to provide interpretable knowledge to guide the selection of imbalance strategies, using Ex- ceptional Preferences Mining. The experimental setup considers 163 real-world imbalanced datasets, preprocessed with 9 well-known resampling algorithms. Our findings elaborate on the identification of meta-characteristics that demonstrate when using preprocessing algo- rithms is advantageous for the problem and when maintaining the original dataset benefits classification performance. 1. Introduction Class imbalance is known to severely affect the data quality. -
A Simple Censored Median Regression Estimator
Statistica Sinica 16(2006), 1043-1058 A SIMPLE CENSORED MEDIAN REGRESSION ESTIMATOR Lingzhi Zhou The Hong Kong University of Science and Technology Abstract: Ying, Jung and Wei (1995) proposed an estimation procedure for the censored median regression model that regresses the median of the survival time, or its transform, on the covariates. The procedure requires solving complicated nonlinear equations and thus can be very difficult to implement in practice, es- pecially when there are multiple covariates. Moreover, the asymptotic covariance matrix of the estimator involves the density of the errors that cannot be esti- mated reliably. In this paper, we propose a new estimator for the censored median regression model. Our estimation procedure involves solving some convex min- imization problems and can be easily implemented through linear programming (Koenker and D'Orey (1987)). In addition, a resampling method is presented for estimating the covariance matrix of the new estimator. Numerical studies indi- cate the superiority of the finite sample performance of our estimator over that in Ying, Jung and Wei (1995). Key words and phrases: Censoring, convexity, LAD, resampling. 1. Introduction The accelerated failure time (AFT) model, which relates the logarithm of the survival time to covariates, is an attractive alternative to the popular Cox (1972) proportional hazards model due to its ease of interpretation. The model assumes that the failure time T , or some monotonic transformation of it, is linearly related to the covariate vector Z 0 Ti = β0Zi + "i; i = 1; : : : ; n: (1.1) Under censoring, we only observe Yi = min(Ti; Ci), where Ci are censoring times, and Ti and Ci are independent conditional on Zi. -
Particle Filtering-Based Maximum Likelihood Estimation for Financial Parameter Estimation
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Spiral - Imperial College Digital Repository Particle Filtering-based Maximum Likelihood Estimation for Financial Parameter Estimation Jinzhe Yang Binghuan Lin Wayne Luk Terence Nahar Imperial College & SWIP Techila Technologies Ltd Imperial College SWIP London & Edinburgh, UK Tampere University of Technology London, UK London, UK [email protected] Tampere, Finland [email protected] [email protected] [email protected] Abstract—This paper presents a novel method for estimating result, which cannot meet runtime requirement in practice. In parameters of financial models with jump diffusions. It is a addition, high performance computing (HPC) technologies are Particle Filter based Maximum Likelihood Estimation process, still new to the finance industry. We provide a PF based MLE which uses particle streams to enable efficient evaluation of con- straints and weights. We also provide a CPU-FPGA collaborative solution to estimate parameters of jump diffusion models, and design for parameter estimation of Stochastic Volatility with we utilise HPC technology to speedup the estimation scheme Correlated and Contemporaneous Jumps model as a case study. so that it would be acceptable for practical use. The result is evaluated by comparing with a CPU and a cloud This paper presents the algorithm development, as well computing platform. We show 14 times speed up for the FPGA as the pipeline design solution in FPGA technology, of PF design compared with the CPU, and similar speedup but better convergence compared with an alternative parallelisation scheme based MLE for parameter estimation of Stochastic Volatility using Techila Middleware on a multi-CPU environment. -
Alberto Abadie
ALBERTO ABADIE Office Address Massachusetts Institute of Technology Department of Economics 50 Memorial Drive Building E52, Room 546 Cambridge, MA 02142 E-mail: [email protected] Academic Positions Massachusetts Institute of Technology Cambridge, MA Professor of Economics, 2016-present IDSS Associate Director, 2016-present Harvard University Cambridge, MA Professor of Public Policy, 2005-2016 Visiting Professor of Economics, 2013-2014 Associate Professor of Public Policy, 2004-2005 Assistant Professor of Public Policy, 1999-2004 University of Chicago Chicago, IL Visiting Assistant Professor of Economics, 2002-2003 National Bureau of Economic Research (NBER) Cambridge, MA Research Associate (Labor Studies), 2009-present Faculty Research Fellow (Labor Studies), 2002-2009 Non-Academic Positions Amazon.com, Inc. Seattle, WA Academic Research Consultant, 2020-present Education Massachusetts Institute of Technology Cambridge, MA Ph.D. in Economics, 1995-1999 Thesis title: \Semiparametric Instrumental Variable Methods for Causal Response Mod- els." Centro de Estudios Monetarios y Financieros (CEMFI) Madrid, Spain M.A. in Economics, 1993-1995 Masters Thesis title: \Changes in Spanish Labor Income Structure during the 1980's: A Quantile Regression Approach." 1 Universidad del Pa´ıs Vasco Bilbao, Spain B.A. in Economics, 1987-1992 Specialization Areas: Mathematical Economics and Econometrics. Honors and Awards Elected Fellow of the Econometric Society, 2016. NSF grant SES-1756692, \A General Synthetic Control Framework of Estimation and Inference," 2018-2021. NSF grant SES-0961707, \A General Theory of Matching Estimation," with G. Imbens, 2010-2012. NSF grant SES-0617810, \The Economic Impact of Terrorism: Lessons from the Real Estate Office Markets of New York and Chicago," with S. Dermisi, 2006-2008. -
Rankings of Economics Journals and Departments in India
WP-2010-021 Rankings of Economics Journals and Departments in India Tilak Mukhopadhyay and Subrata Sarkar Indira Gandhi Institute of Development Research, Mumbai October 2010 http://www.igidr.ac.in/pdf/publication/WP-2010-021.pdf Rankings of Economics Journals and Departments in India Tilak Mukhopadhyay and Subrata Sarkar Indira Gandhi Institute of Development Research (IGIDR) General Arun Kumar Vaidya Marg Goregaon (E), Mumbai- 400065, INDIA Email (corresponding author): [email protected] Abstract This paper is the first attempt to rank economics departments of Indian Institutions based on their research output. Two rankings, one based on publications in international journals, and the other based on publications in domestic journals are derived. The rankings based on publications in international journals are obtained using the impact values of 159 journals found in Kalaitzidakis et al. (2003). Rankings based on publications in domestic journals are based on impact values of 20 journals. Since there are no published studies on ranking of domestic journals, we derived the rankings of domestic journals by using the iterative method suggested in Kalaitzidakis et al. (2003). The department rankings are constructed using two approaches namely, the ‘flow approach’ and the ‘stock approach’. Under the ‘flow approach’ the rankings are based on the total output produced by a particular department over a period of time while under the ‘stock approach’ the rankings are based on the publication history of existing faculty members in an institution. From these rankings the trend of research work and the growth of the department of a university are studied. Keywords: Departments,Economics, Journals, Rankings JEL Code: A10, A14 Acknowledgements: The auhtors would like to thank the seminar participants at Indira Gandhi Institute of Development Research. -
Cluster Analysis, a Powerful Tool for Data Analysis in Education
International Statistical Institute, 56th Session, 2007: Rita Vasconcelos, Mßrcia Baptista Cluster Analysis, a powerful tool for data analysis in Education Vasconcelos, Rita Universidade da Madeira, Department of Mathematics and Engeneering Caminho da Penteada 9000-390 Funchal, Portugal E-mail: [email protected] Baptista, Márcia Direcção Regional de Saúde Pública Rua das Pretas 9000 Funchal, Portugal E-mail: [email protected] 1. Introduction A database was created after an inquiry to 14-15 - year old students, which was developed with the purpose of identifying the factors that could socially and pedagogically frame the results in Mathematics. The data was collected in eight schools in Funchal (Madeira Island), and we performed a Cluster Analysis as a first multivariate statistical approach to this database. We also developed a logistic regression analysis, as the study was carried out as a contribution to explain the success/failure in Mathematics. As a final step, the responses of both statistical analysis were studied. 2. Cluster Analysis approach The questions that arise when we try to frame socially and pedagogically the results in Mathematics of 14-15 - year old students, are concerned with the types of decisive factors in those results. It is somehow underlying our objectives to classify the students according to the factors understood by us as being decisive in students’ results. This is exactly the aim of Cluster Analysis. The hierarchical solution that can be observed in the dendogram presented in the next page, suggests that we should consider the 3 following clusters, since the distances increase substantially after it: Variables in Cluster1: mother qualifications; father qualifications; student’s results in Mathematics as classified by the school teacher; student’s results in the exam of Mathematics; time spent studying. -
Variance Function Regressions for Studying Inequality
Variance Function Regressions for Studying Inequality The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters Citation Western, Bruce and Deirdre Bloome. 2009. Variance function regressions for studying inequality. Working paper, Department of Sociology, Harvard University. Citable link http://nrs.harvard.edu/urn-3:HUL.InstRepos:2645469 Terms of Use This article was downloaded from Harvard University’s DASH repository, and is made available under the terms and conditions applicable to Open Access Policy Articles, as set forth at http:// nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of- use#OAP Variance Function Regressions for Studying Inequality Bruce Western1 Deirdre Bloome Harvard University January 2009 1Department of Sociology, 33 Kirkland Street, Cambridge MA 02138. E-mail: [email protected]. This research was supported by a grant from the Russell Sage Foundation. Abstract Regression-based studies of inequality model only between-group differ- ences, yet often these differences are far exceeded by residual inequality. Residual inequality is usually attributed to measurement error or the in- fluence of unobserved characteristics. We present a regression that in- cludes covariates for both the mean and variance of a dependent variable. In this model, the residual variance is treated as a target for analysis. In analyses of inequality, the residual variance might be interpreted as mea- suring risk or insecurity. Variance function regressions are illustrated in an analysis of panel data on earnings among released prisoners in the Na- tional Longitudinal Survey of Youth. We extend the model to a decomposi- tion analysis, relating the change in inequality to compositional changes in the population and changes in coefficients for the mean and variance. -
The Detection of Heteroscedasticity in Regression Models for Psychological Data
Psychological Test and Assessment Modeling, Volume 58, 2016 (4), 567-592 The detection of heteroscedasticity in regression models for psychological data Andreas G. Klein1, Carla Gerhard2, Rebecca D. Büchner2, Stefan Diestel3 & Karin Schermelleh-Engel2 Abstract One assumption of multiple regression analysis is homoscedasticity of errors. Heteroscedasticity, as often found in psychological or behavioral data, may result from misspecification due to overlooked nonlinear predictor terms or to unobserved predictors not included in the model. Although methods exist to test for heteroscedasticity, they require a parametric model for specifying the structure of heteroscedasticity. The aim of this article is to propose a simple measure of heteroscedasticity, which does not need a parametric model and is able to detect omitted nonlinear terms. This measure utilizes the dispersion of the squared regression residuals. Simulation studies show that the measure performs satisfactorily with regard to Type I error rates and power when sample size and effect size are large enough. It outperforms the Breusch-Pagan test when a nonlinear term is omitted in the analysis model. We also demonstrate the performance of the measure using a data set from industrial psychology. Keywords: Heteroscedasticity, Monte Carlo study, regression, interaction effect, quadratic effect 1Correspondence concerning this article should be addressed to: Prof. Dr. Andreas G. Klein, Department of Psychology, Goethe University Frankfurt, Theodor-W.-Adorno-Platz 6, 60629 Frankfurt; email: [email protected] 2Goethe University Frankfurt 3International School of Management Dortmund Leipniz-Research Centre for Working Environment and Human Factors 568 A. G. Klein, C. Gerhard, R. D. Büchner, S. Diestel & K. Schermelleh-Engel Introduction One of the standard assumptions underlying a linear model is that the errors are inde- pendently identically distributed (i.i.d.). -
Flexible Signal Denoising Via Flexible Empirical Bayes Shrinkage
Journal of Machine Learning Research 22 (2021) 1-28 Submitted 1/19; Revised 9/20; Published 5/21 Flexible Signal Denoising via Flexible Empirical Bayes Shrinkage Zhengrong Xing [email protected] Department of Statistics University of Chicago Chicago, IL 60637, USA Peter Carbonetto [email protected] Research Computing Center and Department of Human Genetics University of Chicago Chicago, IL 60637, USA Matthew Stephens [email protected] Department of Statistics and Department of Human Genetics University of Chicago Chicago, IL 60637, USA Editor: Edo Airoldi Abstract Signal denoising—also known as non-parametric regression—is often performed through shrinkage estima- tion in a transformed (e.g., wavelet) domain; shrinkage in the transformed domain corresponds to smoothing in the original domain. A key question in such applications is how much to shrink, or, equivalently, how much to smooth. Empirical Bayes shrinkage methods provide an attractive solution to this problem; they use the data to estimate a distribution of underlying “effects,” hence automatically select an appropriate amount of shrinkage. However, most existing implementations of empirical Bayes shrinkage are less flexible than they could be—both in their assumptions on the underlying distribution of effects, and in their ability to han- dle heteroskedasticity—which limits their signal denoising applications. Here we address this by adopting a particularly flexible, stable and computationally convenient empirical Bayes shrinkage method and apply- ing it to several signal denoising problems. These applications include smoothing of Poisson data and het- eroskedastic Gaussian data. We show through empirical comparisons that the results are competitive with other methods, including both simple thresholding rules and purpose-built empirical Bayes procedures.