R Course for the Nsos in the Arab Countries Part 3: R Data Management

Total Page:16

File Type:pdf, Size:1020Kb

R Course for the Nsos in the Arab Countries Part 3: R Data Management R Course for the NSOs in the Arab countries Part 3: R Data Management Valentin Todorov1 1United Nations Industrial Development Organization, Vienna 18-20 May 2015 Todorov (UNIDO) R Course for the NSOs in the Arab countriesPart 3: R Data Management18-20 May 2015 1 / 41 Outline 1 Motivation 2 Data exchange with other statistical tools 3 Reading and Writing in Excel Format 4 Reading Data in SDMX Format 5 R data base interfaces and Relational DBMSs 6 Case study: UNIDO database 7 R packages for database access 8 Exercise: The Data Expo 2006 9 Accessing international statistical databases 10 Summary and conclusions Todorov (UNIDO) R Course for the NSOs in the Arab countriesPart 3: R Data Management18-20 May 2015 2 / 41 Motivation Motivation 1. A number of statistical software tools and other programs are in use in a statistical organization 2. The data exchange between such systems (SAS, SPSS, EViews, Stata, Excel, Matlab, Octave, etc) is essential 3. Reading and writing data from/to Excel is very important due to its extreme popularity 4. Often data are stored in relational databases (MS Access, MySql, DB2, MS SQL server, Sybase, etc.) and the size do not allow to extract them into flat files before analysis 5. Using SDMX for data and metadata exchange becomes more and more important Todorov (UNIDO) R Course for the NSOs in the Arab countriesPart 3: R Data Management18-20 May 2015 3 / 41 Data exchange with other statistical tools R as a mediator Todorov (UNIDO) R Course for the NSOs in the Arab countriesPart 3: R Data Management18-20 May 2015 4 / 41 Data exchange with other statistical tools Package foreign • Package foreign reads data stored by Minitab, S, SAS, SPSS, Stata, Systat, Weka, dBase and others and writes in the format of some of these. • Some limitations apply 1. read.dta(), write.dta() - reads and writes data in STATA format. Supports versions 9-12. Will not support versions higher than 12. 2. read.spss() - reads a file stored by the SPSS save or export commands. this was originally written in 2000 and has limited support for changes in SPSS format since. 3. read.ssd() - reads a SAS data set (sas7bdat). For this purpose it generates a SAS program to convert the ssd contents to SAS transport format and then uses read.xport() to obtain a data frame. Requires SAS to be available. Todorov (UNIDO) R Course for the NSOs in the Arab countriesPart 3: R Data Management18-20 May 2015 5 / 41 Data exchange with other statistical tools Example: Read and write STATA files > ## Read and write STATA files > library(foreign) > data(rock) # from 'datasets' in base > rockfile <- tempfile() # get a temporary file > write.dta(rock, file=rockfile) # write in STATA format > head(read.dta(rockfile)) # read STATA data area peri shape perm 1 4990 2791.90 0.0903296 6.3 2 7002 3892.60 0.1486220 6.3 3 7558 3930.66 0.1833120 6.3 4 7352 3869.32 0.1170630 6.3 5 7943 3948.54 0.1224170 17.1 6 7979 4010.15 0.1670450 17.1 Todorov (UNIDO) R Course for the NSOs in the Arab countriesPart 3: R Data Management18-20 May 2015 6 / 41 Data exchange with other statistical tools Example: Read and write SAS files 1. Using read.ssd() from foreign: > ## Read and write SAS files > library(foreign) > libname <- "C:/MYSASDATA" > members <- c("data1", "data2", "data3") > SAS <- "C:/Program Files/SAS/SAS 9.1/SAS.EXE" > sasdata <- read.ssd(libname=libname, sectionnames=members, + sascmd=SAS) > is.data.frame(sasdata$data1) 2. Using sas.get() from Hmisc (quite similar to read.ssd() though): > library(Hmisc) > sasdata <- sas.get("C:/MYSASDATA", "data1") 3. Using function read.sas7bdat() from sas7bdat: http://sas-and-r.blogspot.co.at/2011/07/ really-useful-r-package-sas7bdat.html Todorov (UNIDO) R Course for the NSOs in the Arab countriesPart 3: R Data Management18-20 May 2015 7 / 41 Reading and Writing in Excel Format Reading and Writing in Excel Format • Spreadsheets (especially Microsoft Excel), are one of the most common methods for data exchange • R provides several ways to access Excel files • The trivial way is to save the data to a CSV or a tab-delimited file and then use read.csv() or read.table() (manual intervention necessary) • Follows a non-comprehensive list of R packages: 1. gdata: Requires installation of Perl on Windows. Supports only reading from Excel. 2. RODBC: On Windows, function ODBConnectExcel(). Too complicated, there are better ways for reading Excel. Todorov (UNIDO) R Course for the NSOs in the Arab countriesPart 3: R Data Management18-20 May 2015 8 / 41 Reading and Writing in Excel Format Reading and Writing in Excel Format 3. xlsx: Based on Java, could be slow. Prefer the read.xlsx2() over read.xlsx()|significantly faster for large data sets. > ## Read and write Excel files using the package 'xlsx' > library(xlsx) > data(rock) # from datasets > fname <- paste(tempdir(), "/myfile.xlsx", sep="") > write.xlsx(rock, fname) > ## Now read the file and display the beginning > df <- read.xlsx2(fname, 1) # read first sheet > head(df) X. area peri shape perm 1 1 4990 2791.9 0.0903296 6.3 2 2 7002 3892.6 0.148622 6.3 3 3 7558 3930.66 0.183312 6.3 4 4 7352 3869.32 0.117063 6.3 5 5 7943 3948.54 0.122417 17.1 6 6 7979 4010.15 0.167045 17.1 Todorov (UNIDO) R Course for the NSOs in the Arab countriesPart 3: R Data Management18-20 May 2015 9 / 41 Reading and Writing in Excel Format Reading and Writing in Excel Format 4. xlsReadWrite: Available for Windows only; uses non-OSS DELPHI component; does not support .xlsx files which is a serious drawback. Not available for 64 bit platforms; It has been removed from CRAN lately. 5. XLConnect: Based on Java, might be slow for large data set but very powerful otherwise. http://miraisolutions.wordpress.com/ Uses Apache POI API as the underlying interface: http://poi.apache.org/. More details and examples follow. 6. RExcel: An add-in for Excel. Allows access to the R from within Excel, i.e. Excel becomes GUI for R. http://rcom.univie.ac.at/download.html. Todorov (UNIDO) R Course for the NSOs in the Arab countriesPart 3: R Data18-20 Management May 2015 10 / 41 Reading and Writing in Excel Format XLConnect • It seems that this is the best package for manipulating Excel files for the moment • Allows you to produce formatted Excel reports, including graphics • Main functions: I Reading and writing of Excel worksheets (via data.frames) I Reading and writing of named ranges (via data.frames) I Creating, removing, renaming and cloning worksheets I Adding graphics I Controlling sheet visibility I Defining column width and row height I Merge/unmerge, cell formulas, formula recalculation, auto-filters, cell styles Todorov (UNIDO) R Course for the NSOs in the Arab countriesPart 3: R Data18-20 Management May 2015 11 / 41 Reading and Writing in Excel Format XLConnect: writing data to a worksheet > ## Create a new workbook and write a data frame > ## to a worksheet > library(XLConnect) > data(rock) # from datasets > fname1 <- paste(tempdir(), "/rock-1.xlsx", sep="") > wb <- loadWorkbook(fname1, create=TRUE) > createSheet(wb, name="rock") > writeWorksheet(wb, data=rock, sheet="rock") > saveWorkbook(wb) # Do not forget this one • saveWorkbook() does the actual file creation. • Position the data anywhere using startRow and startCol Todorov (UNIDO) R Course for the NSOs in the Arab countriesPart 3: R Data18-20 Management May 2015 12 / 41 Reading and Writing in Excel Format XLConnect: writing data to a worksheet with one call > ## Writing to worksheet with one call > library(XLConnect) > data(rock) # from datasets > fname2 <- paste(tempdir(), "/rock-2.xlsx", sep="") > writeWorksheetToFile(fname2, data=rock, sheet="rock", + startRow=3, startCol=4) • writeWorksheetToFile() loads the workbook, creates the sheet and finally saves the workbook. • Useful when you only need to write one sheet into an Excel file • Position the data anywhere using startRow and startCol Todorov (UNIDO) R Course for the NSOs in the Arab countriesPart 3: R Data18-20 Management May 2015 13 / 41 Reading and Writing in Excel Format XLConnect: reading data from worksheet > ## Reading data from worksheet > library(XLConnect) > wb = loadWorkbook(fname1, create=TRUE) > data = readWorksheet(wb, sheet="rock") > head(data) area peri shape perm 1 4990 2791.90 0.0903296 6.3 2 7002 3892.60 0.1486220 6.3 3 7558 3930.66 0.1833120 6.3 4 7352 3869.32 0.1170630 6.3 5 7943 3948.54 0.1224170 17.1 6 7979 4010.15 0.1670450 17.1 • Sheets (and regions) can be referenced by name as well • Alternatively use startRow and startCol Todorov (UNIDO) R Course for the NSOs in the Arab countriesPart 3: R Data18-20 Management May 2015 14 / 41 Reading and Writing in Excel Format XLConnect: reading data from worksheet with one call > ## Reading data from worksheet with one call > library(XLConnect) > data = readWorksheetFromFile(fname2, sheet="rock") > head(data) area peri shape perm 1 4990 2791.90 0.0903296 6.3 2 7002 3892.60 0.1486220 6.3 3 7558 3930.66 0.1833120 6.3 4 7352 3869.32 0.1170630 6.3 5 7943 3948.54 0.1224170 17.1 6 7979 4010.15 0.1670450 17.1 • Recognizes square regions automatically • Useful when you only need to write one sheet into an Excel file • Deals with opening/closing connections Todorov (UNIDO) R Course for the NSOs in the Arab countriesPart 3: R Data18-20 Management May 2015 15 / 41 Reading Data in SDMX Format SDMX Example • Several publicly available files in SDMX format can be found on the IMF web site http://www.imf.org/oecd • We will consider the annual exchange rates for the period 1980 to 2007 • Starts by a Header and than continues with time series blocks for each country Listing 1: Read exchange rates in SDMX format.
Recommended publications
  • R-Based Strategies for DH in English Linguistics: a Case Study
    R-based strategies for DH in English Linguistics: a case study Nicolas Ballier Paula Lissón Université Paris Diderot Université Paris Diderot UFR Études Anglophones UFR Études Anglophones CLILLAC-ARP (EA 3967) CLILLAC-ARP (EA 3967) nicolas.ballier@univ- [email protected] paris-diderot.fr paris-diderot.fr language for research in linguistics, both in a Abstract quantitative and qualitative approach. Developing a culture based on the program- This paper is a position statement advocating ming language R (R Core Team 2016) for NLP the implementation of the programming lan- among MA and PhD students doing English Lin- guage R in a curriculum of English Linguis- guistics is no easy task. Broadly speaking, most tics. This is an illustration of a possible strat- of the students have little background in mathe- egy for the requirements of Natural Language matics, statistics, or programming, and usually Processing (NLP) for Digital Humanities (DH) studies in an established curriculum. R feel reluctant to study any of these disciplines. plays the role of a Trojan Horse for NLP and While most PhD students in English linguistics statistics, while promoting the acquisition of are former students with a Baccalauréat in Sci- a programming language. We report an over- ences, some MA students pride themselves on view of existing practices implemented in an having radically opted out of Maths. However, MA and PhD programme at the University of we believe that students should be made aware of Paris Diderot in the recent years. We empha- the growing use of statistical and NLP methods size completed aspects of the curriculum and in linguistics and to be able to interpret and im- detail existing teaching strategies rather than plement these techniques.
    [Show full text]
  • R for Statistics: First Steps
    R for statistics: first steps CAR meeting, Raleigh, Feb 26 2011 Peter Aldhous, San Francisco Bureau Chief [email protected] A note of caution before using any stats package: Beware running with scissors! You can bamboozle yourself, and your readers, by misusing statistics. Make sure you understand the methods you are using, in particular the assumptions that must be met for them to be valid (e.g. normal distribution for many common tests). So before rushing in, consult: • IRE tipsheets (e.g. Donald & LaFleur, #2752; Donald & Hacker, #2731) • Statistical textbooks (e.g. http://www.statsoft.com/textbook/ is free online) • Experts who can provide a reality check on your analysis! Getting started: Download R, instructions at: http://www.r-project.org/ Start the program: Prepare the data: 1) Save as a csv file 2) Point R at the data First steps in the R command line: Load and examine the data: View a basic summary: What is the mean age and salary for CEOs in each sector? An alternative, computing mean and standard deviation in one go: Is there a correlation between age and salary? Draw a graph to explore the correlation analysis: Does mean CEO age differ significantly across sectors? Does mean CEO salary differ significantly across sectors? Draw a graph to explore the distribution of salary by sector: Moving beyond the basic functions: R packages The R community has written 2800+ extensions to R’s basic statistical and graphical functions, available from the CRAN repository: http://cran.r-project.org/web/packages/ Follow the links to find a PDF Reference Manual for each package.
    [Show full text]
  • R in a Nutshell
    R IN A NUTSHELL Second Edition Joseph Adler Beijing • Cambridge • Farnham • Köln • Sebastopol • Tokyo R in a Nutshell, Second Edition by Joseph Adler Copyright © 2012 Joseph Adler. All rights reserved. Printed in the United States of America. Published by O’Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA 95472. O’Reilly books may be purchased for educational, business, or sales promotional use. Online editions are also available for most titles (http://my.safaribooksonline.com). For more infor- mation, contact our corporate/institutional sales department: 800-998-9938 or [email protected]. Editors: Mike Loukides and Meghan Blanchette Indexer: Fred Brown Production Editor: Holly Bauer Cover Designer: Karen Montgomery Proofreader: Julie Van Keuren Interior Designer: David Futato Illustrators: Robert Romano and Re- becca Demarest September 2009: First Edition. October 2012: Second Edition. Revision History for the Second Edition: 2012-09-25 First release See http://oreilly.com/catalog/errata.csp?isbn=9781449312084 for release details. Nutshell Handbook, the Nutshell Handbook logo, and the O’Reilly logo are registered trade- marks of O’Reilly Media, Inc. R in a Nutshell, the image of a harpy eagle, and related trade dress are trademarks of O’Reilly Media, Inc. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and O’Reilly Media, Inc., was aware of a trademark claim, the designations have been printed in caps or initial caps. While every precaution has been taken in the preparation of this book, the publisher and author assume no responsibility for errors or omissions, or for damages resulting from the use of the information contained herein.
    [Show full text]
  • Eviews 9 Command and Programming Reference Eviews 9 Command and Programming Reference Copyright © 1994–2015 IHS Global Inc
    EViews 9 Command and Programming Reference EViews 9 Command and Programming Reference Copyright © 1994–2015 IHS Global Inc. All Rights Reserved ISBN: 978-1-880411-29-2 This software product, including program code and manual, is copyrighted, and all rights are reserved by IHS Global Inc. The distribution and sale of this product are intended for the use of the original purchaser only. Except as permitted under the United States Copyright Act of 1976, no part of this product may be reproduced or distributed in any form or by any means, or stored in a database or retrieval system, without the prior written permission of IHS Global Inc. Disclaimer The authors and IHS Global Inc. assume no responsibility for any errors that may appear in this manual or the EViews program. The user assumes all responsibility for the selection of the pro- gram to achieve intended results, and for the installation, use, and results obtained from the pro- gram. Trademarks EViews® is a registered trademark of IHS Global Inc. Windows, Excel, PowerPoint, and Access are registered trademarks of Microsoft Corporation. PostScript is a trademark of Adobe Corpora- tion. X11.2 and X12-ARIMA Version 0.2.7, and X-13ARIMA-SEATS are seasonal adjustment pro- grams developed by the U. S. Census Bureau. Tramo/Seats is copyright by Agustin Maravall and Victor Gomez. Info-ZIP is provided by the persons listed in the infozip_license.txt file. Please refer to this file in the EViews directory for more information on Info-ZIP. Zlib was written by Jean-loup Gailly and Mark Adler.
    [Show full text]
  • The Use of Statistical Software to Teach Nonparametric Curve Estimation: from Excel to R
    ICOTS8 (2010) Invited Paper Refereed Cao & Naya THE USE OF STATISTICAL SOFTWARE TO TEACH NONPARAMETRIC CURVE ESTIMATION: FROM EXCEL TO R Ricardo Cao and Salvador Naya Research Group MODES, Department of Mathematics, University of A Coruña, Spain [email protected] The advantages of using R and Excel for teaching nonparametric curve estimation are presented in this paper. The use of these two tools for teaching nonparametric curve estimation is illustrated by means of several well-known data sets. Computation of histogram and kernel density estimators as well as kernel and local polynomial regression estimators is presented using Excel and R. Interactive changes in the sample and the smoothing parameter are illustrated using both tools. R incorporates sophisticated routines for crucial issues in nonparametric curve estimation, as smoothing parameter selection. The paper concludes summarizing the relative merits of these two tools for teaching nonparametric curve estimation and presenting RExcel, a free add-in for Excel that can be downloaded from the R distribution network. INTRODUCTION There has been an enormous expansion, over the past few years, on the use of computer and communication technologies for teaching statistics at different levels. Microsoft Excel is the most popular spreadsheet program that is used to store information in columns and rows, which can then be organized and/or processed. Many authors consider Microsoft Excel as an excellent tool for statistical education (see, for example, Giles, 2002). On the other hand, the use of free software is also one of the most interesting available tools for teaching statistics. Universal access to the Internet enables easy installation of free software.
    [Show full text]
  • Appendix C Some Details of Matrix.Xla(M)
    Appendix C Some details of Matrix.xla(m) C.1 Matrix nomenclature For the sake of notational compactness, we will denote a square diagonal matrix by D with elements dii, a square tridiagonal matrix by T with elements tij where | j – i | ≤ 1, most other square matrices by S, rectangular matrices by R, and all matrix elements by mij. A vector will be shown as v, with elements vi, and a scalar as s. Particular values are denoted by x when real, and by z when complex. All optional pa- rameters are shown in straight brackets, [ ]. All matrices, vectors, and scalars are assumed to be real, ex- cept when specified otherwise. All matrices are restricted to two dimensions, and vectors to one dimen- sion. Table C.1 briefly explains some matrix terms that will be used in subsequent tables. With some functions, the user is given the integer option Int of applying integer arithmetic. When a matrix only contains integer elements, selecting integer arithmetic may avoid most round-off problems. On the other hand, the range of integer arithmetic is limited, so that overflow errors may result if the ma- trix is large and/or contains large numbers. Another common option it Tiny, which defines the absolute value of quantities that can be regarded as most likely resulting from round-off errors, and are therefore set to zero. When not activated, the routine will use its user-definable default value. Condition of a matrix: ratio of its largest to smallest singular value Diagonal of a square matrix: the set of terms mij where i = j Diagonal matrix D square matrix with mij = 0 for all off-diagonal elements i ≠ j.
    [Show full text]
  • ASA JSM Program Book 06.Indd
    GENERAL PROGRAM SCHEDULE ✪ Themed Session ● Applied Session ❖ Presenter CC-Washington State Convention & Trade Center H-Grand Hyatt Seattle S-Sheraton Seattle Hotel & Towers TUESDAY, AUGUST 8 7:00 a.m.–10:00 p.m. CC-Level 4 South Lobby Cyber Center Tuesday 7:00 a.m.–10:00 a.m. S-Metropolitan Ballroom A Tours Council of Chapters Business Meeting and Breakfast (closed) 9:00 a.m.–1:00 p.m. CC-Convention Place Chair(s): Ronald Wasserstein, Washburn University TR06 - Historical Seattle Tour (fee event) 7:00 a.m.–10:00 a.m. S-Ballard 1:00 p.m.–5:00 p.m. CC-Convention Place Section on Physical and Engineering Sciences TR07 - Glassblowing Tour (fee event) Executive Committee Meeting (closed) Chair(s): Robert Wilkinson, Lubrizol Committee/Business Meetings 7:00 a.m.–6:00 p.m. CC-507, CC-508 & Other Activities Speaker Work Rooms 5:15 a.m.–7:00 a.m. Off Property 7:30 a.m.–9:00 a.m. H-Cayuse Gertrude Cox Scholarship Race Project on Filming of Distinguished Statisticians (closed) 7:00 a.m.–8:30 a.m. S-Cedar Room Organizer(s): Nitis Mukhopadhyay, University of Connecticut Section on Statisticians in Defense and National 7:30 a.m.–12:00 p.m. S-Aspen Room Security Business Meeting Chair(s): Ron Fricker, Naval Postgraduate School Biopharmaceutical Section Executive Committee Meeting (closed) 7:00 a.m.–8:30 a.m. H-Excelsior Chair(s): Stacy Lindborg, Eli Lilly and Company Technometrics Editorial Board Meeting (closed) Chair(s): Randy R. Sitter, Simon Fraser University 7:30 a.m.–4:30 p.m.
    [Show full text]
  • Creating and Deploying an Application with (R)Excel and R
    CONTRIBUTED RESEARCH ARTICLES 5 Creating and Deploying an Application with (R)Excel and R Thomas Baier, Erich Neuwirth and Michele De Meo Prediction of a quantitative or categorical variable, is done through a tree structure, which even non- Abstract We present some ways of using R in professionals can read and understand easily. The Excel and build an example application using the application of a computationally complex algorithm package rpart. Starting with simple interactive thus results in an intuitive and easy to use tool. Pre- use of rpart in Excel, we eventually package the diction of a categorical variable is performed by a code into an Excel-based application, hiding all classification tree, while the term regression tree is used details (including R itself) from the end user. In for the estimation of a quantitative variable. the end, our application implements a service- Our application will be built for Microsoft Excel oriented architecture (SOA) with a clean separa- and will make use of R and rpart to implement the tion of presentation and computation layer. functionality. We have chosen Excel as the primary tool for performing the analysis because of various advantages: Motivation • Excel has a familiar and easy-to-use user inter- Building an application for end users is a very chal- face. lenging goal. Building a statistics application nor- • Excel is already installed on most of the work- mally involves three different roles: application de- stations in the industries we mentioned. veloper, statistician, and user. Often, statisticians are programmers too, but are only (or mostly) fa- • In many cases, data collection has been per- miliar with statistical programming (languages) and formed using Excel, so using Excel for the anal- definitely are not experts in creating rich user inter- ysis seems to be the logical choice.
    [Show full text]
  • Non-Programming Introduction to R
    sheepsqueezers.com Non-Programming Introduction to R Copyright ©2011 sheepsqueezers.com Legal Stuff sheepsqueezers.com This work may be reproduced and redistributed, in whole or in part, without alteration and without prior written permission, provided all copies contain the following statement: Copyright ©2011 sheepsqueezers.com. This work is reproduced and distributed with the permission of the copyright holder. This presentation as well as other presentations and documents found on the sheepsqueezers.com website may contain quoted material from outside sources such as books, articles and websites. It is our intention to diligently reference all outside sources. Occasionally, though, a reference may be missed. No copyright infringement whatsoever is intended, and all outside source materials are copyright of their respective author(s). Copyright ©2011 sheepsqueezers.com R Lecture Series sheepsqueezers.com Non- Programming Programming Programming I II Introduction Graphics Advanced I Topics Copyright ©2011 sheepsqueezers.com Charting Our Course Goal of this Presentation sheepsqueezers.com What is R? Installing the Software Installing R Commander and Rattle Packages Installing RExcel for R Commander Installing RGGobi Installing Additional Packages Introducing Rterm and RGui Introducing R Commander Introducing Rattle Introducing RExcel Introducing RGGobi Appendix A: References Appendix B: R-Related Websites Copyright ©2011 sheepsqueezers.com Goal of this Presentation The goal of this presentation is to show you how to use R through its many sheepsqueezers.com graphical user interfaces (GUIs). We decided to start with this instead of charging directly into programming to ease you into R rather than throwing you to the wolves. Many of you will be casual R users and will not need or not want to learn the R programming language itself and it is our hope that this presentation will be to your benefit.
    [Show full text]
  • Installation of Rexcel
    Appendix A Installation of RExcel Abstract • Excel is the most prevalent software used for data storage, analysis, and interpre- tation. Elementary and medium-quality mathematical and statistical functions are included with Excel. Good statistical analysis in Excel with more advanced methods than just frequency counts, however, requires an add-in package. • R is one of the best and most powerful statistics programs currently available. • RExcel integrates a menu system, based on the R Commander package, that puts complete access to the full power of R onto the Excel menu bar. Results from the analyses in R can be returned to the spreadsheet. Ordinary formulas in spread- sheet cells can use functions written in R. A.1 Basic Installation Procedures The easiest way to install R, RExcel, and the additionally needed software mod- ules and tools is to download the current version of RAndFriendsSetup from http://rcom.univie.ac.at. Running this program will install everything needed for a working configuration on your machine. A detailed description of the installation is in Section A.3. You will need a working internet connection during the installa- tion process because one module, statconnDCOM, is not under the GPL license that covers most of R. statconnDCOM must be downloaded separately during the installation. More information on the license is in Section A.8. If you already have a working version of R (version 2.8.1 or later) on your ma- chine, you can simply install the R packages RExcelInstaller and RthroughExcel- WorkbooksInstaller (and the packages they require) from CRAN. Section A.4 gives more details about this process.
    [Show full text]
  • R for Dummies
    R 2nd Edition by Andrie de Vries and Joris Meys R For Dummies®, 2nd Edition Published by: John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ 07030‐5774, www.wiley.com Copyright © 2015 by John Wiley & Sons, Inc., Hoboken, New Jersey Media and software compilation copyright © 2015 by John Wiley & Sons, Inc. All rights reserved. Published simultaneously in Canada No part of this publication may be reproduced, stored in a retrieval system or transmitted in any form or by any means, electronic, mechanical, photocopying, recording, scanning or otherwise, except as permit- ted under Sections 107 or 108 of the 1976 United States Copyright Act, without the prior written permis- sion of the Publisher. Requests to the Publisher for permission should be addressed to the Permissions Department, John Wiley & Sons, Inc., 111 River Street, Hoboken, NJ 07030, (201) 748‐6011, fax (201) 748‐6008, or online at http://www.wiley.com/go/permissions. Trademarks: Wiley, For Dummies, the Dummies Man logo, Dummies.com, Making Everything Easier, and related trade dress are trademarks or registered trademarks of John Wiley & Sons, Inc. and may not be used without written permission. All trademarks are the property of their respective owners. John Wiley & Sons, Inc. is not associated with any product or vendor mentioned in this book. LIMIT OF LIABILITY/DISCLAIMER OF WARRANTY: THE PUBLISHER AND THE AUTHOR MAKE NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE ACCURACY OR COMPLETENESS OF THE CONTENTS OF THIS WORK AND SPECIFICALLY DISCLAIM ALL WARRANTIES, INCLUDING WITHOUT LIMITATION WARRANTIES OF FITNESS FOR A PARTICULAR PURPOSE. NO WARRANTY MAY BE CREATED OR EXTENDED BY SALES OR PROMOTIONAL MATERIALS.
    [Show full text]
  • Download Slides
    RR in the City Richard Saldanha Oxquant Consulting [email protected] LondonR Group Meeting 3rd November 2009 S Language Development 1965 Bell Labs pre-S work on a 1992 S version 3 statistical computing language Introduces structures to make 1977 Bell Labs S version 1 statistical modelling easier 1984 S to the world Parallel development of Quantitative Programming Environment (QPE) 1998 S version 4 by John Chambers Internal redesign and 1988 ‘New S ’ implements more formal object-oriented Combines S with QPE structure Typically referred to as S version 2 2 Oxquant Introducing R Combines a dialect of S with environment for data manipulation, calculation and graphical display http://www.r-project.org Can be thought of as an open source implementation of S Since 2008 commercial implementation of S exists in the form of TIBCO ’s Spotfire S+ (previously developed as S-PLUS from 1988) Attempts to commercialize R: R+, REvolution R, RStat Comprehensive R Archive Network (CRAN) http://cran.r-project.org – Precompiled binaries and source code including alpha and beta releases – Contributed packages (over 2,000 currently) – Manuals, FAQs and other contributed documentation – Mailing lists http://www.R-project.org/mail.html R-help: main mailing list for problems and solutions R-announce: major developments R-packages: new or enhanced contributed packages R-devel: questions and discussions about code development R-Forge http://r-forge.r-project.org – Platform for collaborative development of R packages 3 Oxquant Over 80 published books
    [Show full text]