Shrinkage Estimation of Rate Statistics Arxiv:1810.07654V1 [Stat.AP] 17 Oct
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Shrinkage Priors for Bayesian Penalized Regression
Shrinkage Priors for Bayesian Penalized Regression. Sara van Erp1, Daniel L. Oberski2, and Joris Mulder1 1Tilburg University 2Utrecht University This manuscript has been published in the Journal of Mathematical Psychology. Please cite the published version: Van Erp, S., Oberski, D. L., & Mulder, J. (2019). Shrinkage priors for Bayesian penalized regression. Journal of Mathematical Psychology, 89, 31–50. doi:10.1016/j.jmp.2018.12.004 Abstract In linear regression problems with many predictors, penalized regression tech- niques are often used to guard against overfitting and to select variables rel- evant for predicting an outcome variable. Recently, Bayesian penalization is becoming increasingly popular in which the prior distribution performs a function similar to that of the penalty term in classical penalization. Specif- ically, the so-called shrinkage priors in Bayesian penalization aim to shrink small effects to zero while maintaining true large effects. Compared toclas- sical penalization techniques, Bayesian penalization techniques perform sim- ilarly or sometimes even better, and they offer additional advantages such as readily available uncertainty estimates, automatic estimation of the penalty parameter, and more flexibility in terms of penalties that can be considered. However, many different shrinkage priors exist and the available, often quite technical, literature primarily focuses on presenting one shrinkage prior and often provides comparisons with only one or two other shrinkage priors. This can make it difficult for researchers to navigate through the many prior op- tions and choose a shrinkage prior for the problem at hand. Therefore, the aim of this paper is to provide a comprehensive overview of the literature on Bayesian penalization. -
Linear Methods for Regression and Shrinkage Methods
Linear Methods for Regression and Shrinkage Methods Reference: The Elements of Statistical Learning, by T. Hastie, R. Tibshirani, J. Friedman, Springer 1 Linear Regression Models Least Squares • Input vectors • is an attribute / feature / predictor (independent variable) • The linear regression model: • The output is called response (dependent variable) • ’s are unknown parameters (coefficients) 2 Linear Regression Models Least Squares • A set of training data • Each corresponding to attributes • Each is a class attribute value / a label • Wish to estimate the parameters 3 Linear Regression Models Least Squares • One common approach ‐ the method of least squares: • Pick the coefficients to minimize the residual sum of squares: 4 Linear Regression Models Least Squares • This criterion is reasonable if the training observations represent independent draws. • Even if the ’s were not drawn randomly, the criterion is still valid if the ’s are conditionally independent given the inputs . 5 Linear Regression Models Least Squares • Make no assumption about the validity of the model • Simply finds the best linear fit to the data 6 Linear Regression Models Finding Residual Sum of Squares • Denote by the matrix with each row an input vector (with a 1 in the first position) • Let be the N‐vector of outputs in the training set • Quadratic function in the parameters: 7 Linear Regression Models Finding Residual Sum of Squares • Set the first derivation to zero: • Obtain the unique solution: 8 Linear Regression -
Linear, Ridge Regression, and Principal Component Analysis
Linear, Ridge Regression, and Principal Component Analysis Linear, Ridge Regression, and Principal Component Analysis Jia Li Department of Statistics The Pennsylvania State University Email: [email protected] http://www.stat.psu.edu/∼jiali Jia Li http://www.stat.psu.edu/∼jiali Linear, Ridge Regression, and Principal Component Analysis Introduction to Regression I Input vector: X = (X1, X2, ..., Xp). I Output Y is real-valued. I Predict Y from X by f (X ) so that the expected loss function E(L(Y , f (X ))) is minimized. I Square loss: L(Y , f (X )) = (Y − f (X ))2 . I The optimal predictor ∗ 2 f (X ) = argminf (X )E(Y − f (X )) = E(Y | X ) . I The function E(Y | X ) is the regression function. Jia Li http://www.stat.psu.edu/∼jiali Linear, Ridge Regression, and Principal Component Analysis Example The number of active physicians in a Standard Metropolitan Statistical Area (SMSA), denoted by Y , is expected to be related to total population (X1, measured in thousands), land area (X2, measured in square miles), and total personal income (X3, measured in millions of dollars). Data are collected for 141 SMSAs, as shown in the following table. i : 1 2 3 ... 139 140 141 X1 9387 7031 7017 ... 233 232 231 X2 1348 4069 3719 ... 1011 813 654 X3 72100 52737 54542 ... 1337 1589 1148 Y 25627 15389 13326 ... 264 371 140 Goal: Predict Y from X1, X2, and X3. Jia Li http://www.stat.psu.edu/∼jiali Linear, Ridge Regression, and Principal Component Analysis Linear Methods I The linear regression model p X f (X ) = β0 + Xj βj . -
Lasting Legacies
Tre Lag Stevne Clarion Hotel South Saint Paul, MN August 3-6, 2016 .#56+0).')#%+'5 6*'(7674'1(1742#56 Spotlights on Norwegian-Americans who have contributed to architecture, engineering, institutions, art, science or education in the Americas A gathering of descendants and friends of the Trøndelag, Gudbrandsdal and northern Hedmark regions of Norway Program Schedule Velkommen til Stevne 2016! Welcome to the Tre Lag Stevne in South Saint Paul, Minnesota. We were last in the Twin Cities area in 2009 in this same location. In a metropolitan area of this size it is not as easy to see the results of the Norwegian immigration as in smaller towns and rural communities. But the evidence is there if you look for it. This year’s speakers will tell the story of the Norwegians who contributed to the richness of American culture through literature, art, architecture, politics, medicine and science. You may recognize a few of their names, but many are unsung heroes who quietly added strands to the fabric of America and the world. We hope to astonish you with the diversity of their talents. Our tour will take us to the first Norwegian church in America, which was moved from Muskego, Wisconsin to the grounds of Luther Seminary,. We’ll stop at Mindekirken, established in 1922 with the mission of retaining Norwegian heritage. It continues that mission today. We will also visit Norway House, the newest organization to promote Norwegian connectedness. Enjoy the program, make new friends, reconnect with old friends, and continue to learn about our shared heritage. -
Kernel Mean Shrinkage Estimators
Journal of Machine Learning Research 17 (2016) 1-41 Submitted 5/14; Revised 9/15; Published 4/16 Kernel Mean Shrinkage Estimators Krikamol Muandet∗ [email protected] Empirical Inference Department, Max Planck Institute for Intelligent Systems Spemannstraße 38, T¨ubingen72076, Germany Bharath Sriperumbudur∗ [email protected] Department of Statistics, Pennsylvania State University University Park, PA 16802, USA Kenji Fukumizu [email protected] The Institute of Statistical Mathematics 10-3 Midoricho, Tachikawa, Tokyo 190-8562 Japan Arthur Gretton [email protected] Gatsby Computational Neuroscience Unit, CSML, University College London Alexandra House, 17 Queen Square, London - WC1N 3AR, United Kingdom ORCID 0000-0003-3169-7624 Bernhard Sch¨olkopf [email protected] Empirical Inference Department, Max Planck Institute for Intelligent Systems Spemannstraße 38, T¨ubingen72076, Germany Editor: Ingo Steinwart Abstract A mean function in a reproducing kernel Hilbert space (RKHS), or a kernel mean, is central to kernel methods in that it is used by many classical algorithms such as kernel principal component analysis, and it also forms the core inference step of modern kernel methods that rely on embedding probability distributions in RKHSs. Given a finite sample, an empirical average has been used commonly as a standard estimator of the true kernel mean. Despite a widespread use of this estimator, we show that it can be improved thanks to the well- known Stein phenomenon. We propose a new family of estimators called kernel mean shrinkage estimators (KMSEs), which benefit from both theoretical justifications and good empirical performance. The results demonstrate that the proposed estimators outperform the standard one, especially in a \large d, small n" paradigm. -
Generalized Shrinkage Methods for Forecasting Using Many Predictors
Generalized Shrinkage Methods for Forecasting using Many Predictors This revision: February 2011 James H. Stock Department of Economics, Harvard University and the National Bureau of Economic Research and Mark W. Watson Woodrow Wilson School and Department of Economics, Princeton University and the National Bureau of Economic Research AUTHOR’S FOOTNOTE James H. Stock is the Harold Hitchings Burbank Professor of Economics, Harvard University (e-mail: [email protected]). Mark W. Watson is the Howard Harrison and Gabrielle Snyder Beck Professor of Economics and Public Affairs, Princeton University ([email protected]). The authors thank Jean Boivin, Domenico Giannone, Lutz Kilian, Serena Ng, Lucrezia Reichlin, Mark Steel, and Jonathan Wright for helpful discussions, Anna Mikusheva for research assistance, and the referees for helpful suggestions. An earlier version of the theoretical results in this paper was circulated earlier under the title “An Empirical Comparison of Methods for Forecasting Using Many Predictors.” Replication files for the results in this paper can be downloaded from http://www.princeton.edu/~mwatson. This research was funded in part by NSF grants SBR-0214131 and SBR-0617811. 0 Generalized Shrinkage Methods for Forecasting using Many Predictors JBES09-255 (first revision) This revision: February 2011 [BLINDED COVER PAGE] ABSTRACT This paper provides a simple shrinkage representation that describes the operational characteristics of various forecasting methods designed for a large number of orthogonal predictors (such as principal components). These methods include pretest methods, Bayesian model averaging, empirical Bayes, and bagging. We compare empirically forecasts from these methods to dynamic factor model (DFM) forecasts using a U.S. macroeconomic data set with 143 quarterly variables spanning 1960-2008. -
Status for Interkommunalt Samarbeid I Troms Og Finnmark
NIVI Rapport 2019:4 Status for interkommunalt samarbeid i Troms og Finnmark Utarbeidet på oppdrag av Fylkesmannen Notat 2020- Av Geir Vinsand - NIVI Analyse AS FORORD På oppdrag fra Fylkesmannen i Troms og Finnmark har NIVI Analyse gjennomført en kartlegging av det formaliserte interkommunale samarbeidet i alle fylkets 43 kommuner. Kartleggingen har form av en kommunevis totalkartlegging og bygger på NIVIs kartleggingsmetodikk som er brukt i flere andre fylker. Prosjektet er gjennomført i nær dialog med Fylkesmannen og rådmennene i kommunene. Prosjektet ble startet opp i august 2019. Kontaktperson hos oppdragsgiver har vært fagdirektør Jan-Peder Andreassen. NIVI er ansvarlig for alle analyser av innsamlet materiale, inkludert løpende problematiseringer og anbefalinger. Ansvarlig konsulent i NIVI Analyse har vært Geir Vinsand. Sandefjord, 20. desember 2019 1 - NIVI Analyse AS INNHOLD HOVEDPUNKTER ................................................................................................. 3 1 METODISK TILNÆRMING ........................................................................ 6 1.1 Bakgrunn og formål ............................................................................. 6 1.2 Problemstillinger .................................................................................. 6 1.3 Definisjon av interkommunalt samarbeid ............................................ 7 1.4 Gjennomføring og erfaringer ............................................................... 8 1.5 Rapportering ....................................................................................... -
Cross-Validation, Shrinkage and Variable Selection in Linear Regression Revisited
Open Journal of Statistics, 2013, 3, 79-102 http://dx.doi.org/10.4236/ojs.2013.32011 Published Online April 2013 (http://www.scirp.org/journal/ojs) Cross-Validation, Shrinkage and Variable Selection in Linear Regression Revisited Hans C. van Houwelingen1, Willi Sauerbrei2 1Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, Leiden, The Netherlands 2Institut fuer Medizinische Biometrie und Medizinische Informatik, Universitaetsklinikum Freiburg, Freiburg, Germany Email: [email protected] Received December 8, 2012; revised January 10, 2013; accepted January 26, 2013 Copyright © 2013 Hans C. van Houwelingen, Willi Sauerbrei. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. ABSTRACT In deriving a regression model analysts often have to use variable selection, despite of problems introduced by data- dependent model building. Resampling approaches are proposed to handle some of the critical issues. In order to assess and compare several strategies, we will conduct a simulation study with 15 predictors and a complex correlation struc- ture in the linear regression model. Using sample sizes of 100 and 400 and estimates of the residual variance corre- sponding to R2 of 0.50 and 0.71, we consider 4 scenarios with varying amount of information. We also consider two examples with 24 and 13 predictors, respectively. We will discuss the value of cross-validation, shrinkage and back- ward elimination (BE) with varying significance level. We will assess whether 2-step approaches using global or pa- rameterwise shrinkage (PWSF) can improve selected models and will compare results to models derived with the LASSO procedure. -
Diploma Program__Magdalena Alfredova.Pdf (4.469Mb)
DIPLOMA PROGRAM FALL 2018 Diploma candidate: Magdalena Alfredova Institute: Architecture Main supervisor: Jørgen Tandberg Second supervisor: - External supervisor: - Company cooperation: Title of project: “ A fish processing facility: Sørvær” FISH PROCESSING FACILITY: SØRVÆR - BY MAGDALENA ALFREDOVA Fig. 1 Lofoten 1971, Robert Capa. Sjark vessels between 25-35 foot are still very common in small fishing villiages along the Norwefian coast. 1 “Et land mot ishavet, som alle gutter langs kysten drømte om å få komme til. Der ble det øvd storverk. Der kunne en vinne rikdom, der kappsegla karene med døden. En folkevandring hadde søkt der opp gjennom mange hundre år,og en god del bleiv på sjøen. Noen kom heim med en blank skilling, men de fleste segla livet igjennom i fattigdom. Og likevel – dit opp søkte de igjen, år etter år, den ene slekta etter den andre. Det var eventyret. Dit måtte de. Og nå var det hans tur. Nå så han Lofoten.” From the book “The last of the Vikings” by Johan Bojer, 1921 2 SUPERVISOR - Jørgen Johan Tandberg THESIS - In 2013 the Norwegian Coastal Administration registered a total of 673 fisheries, 93 of which are facilities located along the coastal line of Troms and Finnmark. The project will investigate these industries in transition and the architectural opportunities they have. Today the locally based fish production is economically challenged by the growing industry of fish farming and large fishing vessels with production on board. Because of its incapability of competing, the differences of these industries will only diverge further. The traditional industries along the coast decreases in size, but survives focusing on high quality products and local traditions (Fig.2). -
Mining Big Data Using Parsimonious Factor, Machine Learning, Variable Selection and Shrinkage Methods Hyun Hak Kim1 and Norman R
Mining Big Data Using Parsimonious Factor, Machine Learning, Variable Selection and Shrinkage Methods Hyun Hak Kim1 and Norman R. Swanson2 1Bank of Korea and 2Rutgers University February 2016 Abstract A number of recent studies in the economics literature have focused on the usefulness of factor models in the context of prediction using “big data” (see Bai and Ng (2008), Dufour and Stevanovic (2010), Forni et al. (2000, 2005), Kim and Swanson (2014a), Stock and Watson (2002b, 2006, 2012), and the references cited therein). We add to this literature by analyzing whether “big data” are useful for modelling low frequency macroeconomic variables such as unemployment, in‡ation and GDP. In particular, we analyze the predictive bene…ts associated with the use of principal component analysis (PCA), independent component analysis (ICA), and sparse principal component analysis (SPCA). We also evaluate machine learning, variable selection and shrinkage methods, including bagging, boosting, ridge regression, least angle regression, the elastic net, and the non-negative garotte. Our approach is to carry out a forecasting “horse-race” using prediction models constructed using a variety of model speci…cation approaches, factor estimation methods, and data windowing methods, in the context of the prediction of 11 macroeconomic variables relevant for monetary policy assessment. In many instances, we …nd that various of our benchmark models, including autoregressive (AR) models, AR models with exogenous variables, and (Bayesian) model averaging, do not dominate speci…cations based on factor-type dimension reduction combined with various machine learning, variable selection, and shrinkage methods (called “combination”models). We …nd that forecast combination methods are mean square forecast error (MSFE) “best” for only 3 of 11 variables when the forecast horizon, h = 1, and for 4 variables when h = 3 or 12. -
Shrinkage Improves Estimation of Microbial Associations Under Di↵Erent Normalization Methods 1 2 1,3,4, 5,6,7, Michelle Badri , Zachary D
bioRxiv preprint doi: https://doi.org/10.1101/406264; this version posted April 4, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Shrinkage improves estimation of microbial associations under di↵erent normalization methods 1 2 1,3,4, 5,6,7, Michelle Badri , Zachary D. Kurtz , Richard Bonneau ⇤, and Christian L. Muller¨ ⇤ 1Department of Biology, New York University, New York, 10012 NY, USA 2Lodo Therapeutics, New York, 10016 NY, USA 3Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, 10010 NY, USA 4Computer Science Department, Courant Institute, New York, 10012 NY, USA 5Center for Computational Mathematics, Flatiron Institute, Simons Foundation, New York, 10010 NY, USA 6Institute of Computational Biology, Helmholtz Zentrum M¨unchen, Neuherberg, Germany 7Department of Statistics, Ludwig-Maximilians-Universit¨at M¨unchen, Munich, Germany ⇤correspondence to: [email protected], cmueller@flatironinstitute.org ABSTRACT INTRODUCTION Consistent estimation of associations in microbial genomic Recent advances in microbial amplicon and metagenomic survey count data is fundamental to microbiome research. sequencing as well as large-scale data collection efforts Technical limitations, including compositionality, low sample provide samples across different microbial habitats that are sizes, and technical variability, obstruct standard -
Møteprotokoll Formannskapet 18.10.18.Pdf
v Hasvik kommune Møteprotokoll Utvalg: Formannskapet Møtested: Rådhuset - Kommunestyresalen I)ato: 18.10.2018 Tidspunkt: 09:30 - I l:30 Til stede: Navn Funksjon Medl. Varamedlem for Eva D. Husby Leder AP Lars Hustad Nestleder AP Elisabeth S. Mikalsen Varamedlem AP Siv Olsen Gamst Per Hansen Medlem AP Odd Ivar Gladsø Medlem H Følgende fra administrasjonen møtte: Navn stilling Odd Michelsen X'ormannskapssekretær Erik Arnesen Rådmann Gro Marie J. Nilssen Økonomileder Merknad i møte: Møteleder: Eva D. Husby Leder Møteprotokoll er godkjent av valgte representanter Saksliste Utvalgs- Innhold Lukket Arkivsaksnr saksnr PS 90/18 Godkjenning av innkalling PS 91l18 Godkjenning av saksliste PS 9211 8 Godkjenning av møteprotokoll fra forrige møte PS 9311 8 Delingssak - Rekvisisjon av oppmålingsforretning - Gnr.3 201811325 Bnr.4 - Kristine Brevik PS 9411 8 Delingssak - Søknad om deling av eiendom - Gnr. l6 Bnr 413 20181730 - Rick Larsen PS 9sl18 Søknad om konsesjon på erverv av fast eiendom - Gnr 16 Bnr 20t81t238 96 - Frode Karlsen PS 96118 Gammelgården - Søknad om dispensasjon fra arealplan - 20171862 Etablering av landbruksvirksomhet på GiBNR 715 og718 PS 97118 Kjøp av tomt - tilleggsareal - Henrik Sanden 201811423 PS 98/1 I Endring av ansvarsrett - Sørøya Transport og utleie AS 201811302 PS 99/r8 Søknad - Støtte angående oppgradering av drivstoffanlegg 201811410 Sørvær - Joker Sørvær PS 100/18 Søknad om tilskudd til kjøp av båt - Johnny Johansen 201811312 PS 101/18 Jakten på bolyst 20181509 PS 102/18 Regnskaps- og fi nansrapport 2. tertial 20 1 I 20181937 PS 103/18 Budsjettregulering 20 I I 2018/1465 PS 104/18 Opptak av lån til investeringer 2018 2018/1468 oRr 9/18 Orienteringssak - 1 8.