Uros2018.Pdf
Total Page:16
File Type:pdf, Size:1020Kb
Use of R in O cial Statistics 6th International Conference 2018 2018OV010 Eventbanner uRos2018 Rolbanner 100x200_DEF OPTIES .indd 1 23-7-2018 09:58:34 Eventbanner uRos2018 1920x400.jpg Eventbanner uRos2018 1920x400.bb Welcome The global community of R users is growing, and the number of Naonal and Interna- onal Stascal Offices that are adopng R is growing as well. About five years ago, when this conference was organized as an internaonal conference for the first me in Romania, we felt a bit like outlaws using Free and Open Source Soware (FOSS) in an area where commercial packages rule the land. How mes have changed: in the mean me FOSS, and in parcular R is considered a driving force of innovaon in academia, industry and government. The popularity of R is demonstrated by the hundreds of local R user groups, the thousands of R packages, and the RConsorum. The current conference, at Stascs Netherlands, marks the first occasion outside of the place where it was conceived: Romania. We are therefore especially pleased that our keynote speakers have roots in both countries. Alina Matei is a professor of stascs in Switzerland with Romanian roots. She will talk about opmal sample coordinaon using R. An important topic in mes where the reducon of response burden and increasing nonresponse rates force us to use smaller, more complex sampling methods. Not many R users are aware that there is a ‘touch of Dutch’ in R. Since 2017, Jeroen Ooms (UC Berkeley) is the maintainer of both Rtools and R for Windows. He will tell us about what it takes to compile, release, and modernize a system on which more than 12,500 R packages and millions of users rely every day. For the first me this year we have a full day of tutorials with topics including sample straficaon, data cleaning and processing, and geospaal modeling. Make sure to take full advantage of the experts that came here to share their knowledge. With about fiy contributed talks and around one hundred conference aendees, this uRos is the largest in its history. We are grateful to the speakers, tutorial orga- nizers and aendees for making this conference such a growing success. We wish you an conference. Welcome to uRos2018! uRos2018 iii Eventbanner uRos2018 1920x400.jpg Eventbanner uRos2018 1920x400.bb Organizing partners Stascs Netherlands Stascs Romania Stascs Austria University of Bucharest Ecological University of Bucharest Special Journal Issues Romanian Stascal Review: http://www.revistadestatistica.ro/ Austrian Journal of Stascs: https://www.ajs.or.at/ uRos2018 iv Eventbanner uRos2018 1920x400.jpg Eventbanner uRos2018 1920x400.bb Contents Welcome iii Praccal informaon 1 Program overview 3 Session overview 7 Tutorials 13 Fast & efficient data manipulaon with data.table (Jaap Walhout) . 14 Plong spaal data in R (Marjn Tennekes) . 15 So your Data is Tidy. But is it Clean? (Edwin de Jonge and Mark van der Loo) 16 Spaal Analysis in R with Open Geodata (Egge-Jan Pollé and Willy Tadema) 17 Use of R package SamplingStrata for the Opmal Straficaon of Sampling Frames for Mulpurpose Sampling Surveys (Giulio Barcaroli) . 18 Keynotes 19 Sample coordinaon and R (Alina Matei) . 20 The R infrastructure and Windows Build System (Jeroen Ooms) . 21 Conference presentaons 23 A Corporate Design Toolbox for R (Thomas Lo Russo) . 24 A First Step towards Stascal Disclosure Control on Mulple Tables Under the Presence of Differenal Aacks (Kazuhiro Minami and Yutaka Abe) 25 uRos2018 v Eventbanner uRos2018 1920x400.jpg Eventbanner uRos2018 1920x400.bb Alternave to LaTeX for high quality report generaon with rmarkdown (Romain Lesur) ............................ 26 An all-in-one R applicaon for validang model assumpons in linear re- gression analysis with visualizaons (Joy Chioma Nwabueze and Chisimk- wuo John) ............................... 27 An internal package for automated metadata documentaon (Mahias Gomolka) ............................... 28 Canadian Consumer Price Index (CPI) Dashboard built using R Shiny (Manolo Malaver-Vojvodic) ........................... 29 coder: An R-package for fast classificaon of item data into groups (Erik Bülow) ................................. 30 Combining JDemetra+ and R for Analysing and Visualising Time Series in Official Stascs (Atanaska Nikolova) . 31 Comparison of mulvariate outlier detecon methods for nearly ellip- cally distributed data (Kazumi Wada, Mariko Kawano and Hiroe Tsub- aki) .................................. 32 Development of R Shiny Dashboard on Paern and Characteriscs of Tu- berculosis Incidence in Malaysia (Kamarul Ariffin Mansor, Nurhuda Ismail, Asmahani Nayan and Abd Razak Ahmad) . 33 Easily translatable Shiny applicaons (Matjaž Jeran) . 34 Easy Bootstrapping for Rotaonal Surveys with ’surveysd’ (Johannes Gussen- bauer, Alexander Kowarik and Mahias Till) . 35 Errorlocate: finding errors in data (Edwin de Jonge) . 36 Esmang Differenal Mortality from EU-SILC UDB Longitudinal Data (To- bias Göllner, Johannes Klotz) ..................... 37 Evaluaon of esmaon methods for a new survey of the UK’s Office for Naonal Stascs (ONS) using R (Konstannos Soulanis) . 38 Evidence for the use of alternave data sources to track consumer and business confidence within emerging markets using senment based techniques (Hanjo Odenaal) ..................... 39 Experiences in the migraon to RStudio-Server in Stascs Austria (Bern- hard Meindl and Alexander Kowarik) . 40 From challenges to opportunies: The Romanian Case of Use R in Official Stascs (Nicoleta Caragea, Ana-Maria Ciuhu and Raluca Mariana Dragoescu) .............................. 41 uRos2018 vi Eventbanner uRos2018 1920x400.jpg Eventbanner uRos2018 1920x400.bb How R is improving the disseminaon of stascs within the Department for Work and Pensions (Aoife O’Neill) . 42 How the Scosh Government is moving towards R (Victoria Avila) . 43 Interacve data visualizaon web-based applicaon using R-Shiny (Hous- sam Hachimi) ............................. 44 Introducing R at Stascs Denmark – a not enrely completed how-to (Pe- ter Tibert Stoltze & other co-authors (to be added)) . 45 Introducon to ’flagr’ (Salva Maeo, Eurostat and Mészáros Mátyás) . 46 Invesgang Chaos in Time Series: Evidence from the Cryptocurrency Mar- ket (Sami Diaf) ............................ 47 Lack-of-fit tesng without replicates available – a modern clustering ap- proach (Tyler George) ......................... 48 Macroeconomic Stascal Forecasng for Engine Demand (Ankit kamboj, Debojyo Samadder and Ambica Rajagopal) . 49 Opmal Boundary Value for Creang Anonymized Microdata: Empirical Analysis based on Economic Survey Data (Yutaka Abe, Kiyomi Shi- rakawa and Hitotsubashi Ryota Chiba) . 50 Overlapping classificaon for autocoding system (Yukako Toko, Shinya Iijima and Mika Sato-Ilic) .......................... 51 R packages for opmal strafied sampling: a review and compared evalu- aon (Marco Ballin and Giulio Barcaroli) . 52 pesm - an R package to compute populaon esmaons using mobile phone data (: Bogdan Oancea, David Salgado, Luis Sanguiao, and Antoniade Ciprian Alexandru) ..................... 54 reclin: a package for record linkage and deduplicaon (Jan van der Laan (Stascs Netherlands)) ........................ 55 Responsive, web-based graphical user interfaces to R (Adrian Dușa) . 56 (R)evoluon of generalized systems and stascal tools at Stascs Canada (Susie Forer and Steven Thomas) . 57 R’s Shiny package and Survey Soluons for (Acve) Survey Management (single slot) (Michael Wild) ...................... 58 rtrim – an R implementaon of Trends and Indices in Monitoring data (Patrick Bogaart, Mark van der Loo, Jeroen Pannekoek) . 59 SelEdit… - a collecon of R packages to implement some opmizaon- based selecve eding techniques (Elisa Esteban, Soledad Saldaña, and David Salgado) .......................... 60 uRos2018 vii Eventbanner uRos2018 1920x400.jpg Eventbanner uRos2018 1920x400.bb The Use of R Shiny at the U.S. Bureau of Labor Stascs (Brandon Kopp) . 61 Transforming Health and Social Care Publicaons in Scotland (Anna Price, David Caldwell, Ewout Jaspers, Maighread Simpson) . 62 Two main uses of R in Stascs Portugal: sampling and confidenality (Pe- dro Sousa, Conceição Ferreira, Inês Rodrigues, Pedro Campos) . 63 Use of Choropleth Maps for Regional Stascs (Jillian Delaney) . 64 Using R for analysis and producon of Price Indices for the Producon and Services sector of the economy (Ma Mayhew) . 65 Using R for data cleaning, integraon and esmaon challenges in Stas- cs Poland - some conclusions aer VIP.ADMIN project (Beręsewicz Maciej and Pawlikowski Dawid) .................... 66 Using R for variance esmaon in social surveys (Eleanor Law, Vahé Nafilyan, Ria Sanderson) ............................ 67 Using R to access official data from the Guatemalan Naonal Instute of Stascs (Oscar de León) ....................... 68 Variance esmaon for annual point esmates and net changes for LFS using R package vardpoor (Juris Breidaks) . 69 uRos2018 viii Eventbanner uRos2018 1920x400.jpg Eventbanner uRos2018 1920x400.bb Praccal informaon Public wifi SSID CBS-Public User name uros2018 Password uros2018 Guests understand and acknowledge that we exercise no control over the nature, content, or reliability of the informaon and/or data passing through our network. Social dinner The social dinner will take place on Thursday 13 September at 19:00. Restaurant Luden Plein 6–7 2511CR Den Haag The easiest way to get there form CBS is to take the lightrail (tram) number 3 or 4 in the direcon of Den Haag. Get out at stop ‘Spui’, this is the first stop aer the Central Staon. From there it is two minutes