Introduction to R
Total Page:16
File Type:pdf, Size:1020Kb
Load more
										Recommended publications
									
								- 
												  Tinn-R Team Has a New Member Working on the Source Code: Wel- Come Huashan ChenEditus eBook Series Editus eBooks is a series of electronic books aimed at students and re- searchers of arts and sciences in general. Tinn-R Editor (2010 1. ed. Rmetrics) Tinn-R Editor - GUI forR Language and Environment (2014 2. ed. Editus) José Cláudio Faria Philippe Grosjean Enio Galinkin Jelihovschi Ricardo Pietrobon Philipe Silva Farias Universidade Estadual de Santa Cruz GOVERNO DO ESTADO DA BAHIA JAQUES WAGNER - GOVERNADOR SECRETARIA DE EDUCAÇÃO OSVALDO BARRETO FILHO - SECRETÁRIO UNIVERSIDADE ESTADUAL DE SANTA CRUZ ADÉLIA MARIA CARVALHO DE MELO PINHEIRO - REITORA EVANDRO SENA FREIRE - VICE-REITOR DIRETORA DA EDITUS RITA VIRGINIA ALVES SANTOS ARGOLLO Conselho Editorial: Rita Virginia Alves Santos Argollo – Presidente Andréa de Azevedo Morégula André Luiz Rosa Ribeiro Adriana dos Santos Reis Lemos Dorival de Freitas Evandro Sena Freire Francisco Mendes Costa José Montival Alencar Junior Lurdes Bertol Rocha Maria Laura de Oliveira Gomes Marileide dos Santos de Oliveira Raimunda Alves Moreira de Assis Roseanne Montargil Rocha Silvia Maria Santos Carvalho Copyright©2015 by JOSÉ CLÁUDIO FARIA PHILIPPE GROSJEAN ENIO GALINKIN JELIHOVSCHI RICARDO PIETROBON PHILIPE SILVA FARIAS Direitos desta edição reservados à EDITUS - EDITORA DA UESC A reprodução não autorizada desta publicação, por qualquer meio, seja total ou parcial, constitui violação da Lei nº 9.610/98. Depósito legal na Biblioteca Nacional, conforme Lei nº 10.994, de 14 de dezembro de 2004. CAPA Carolina Sartório Faria REVISÃO Amek Traduções Dados Internacionais de Catalogação na Publicação (CIP) T591 Tinn-R Editor – GUI for R Language and Environment / José Cláudio Faria [et al.]. – 2. ed. – Ilhéus, BA : Editus, 2015. xvii, 279 p. ; pdf Texto em inglês.
- 
												  Rkward: a Comprehensive Graphical User Interface and Integrated Development Environment for Statistical Analysis with RJSS Journal of Statistical Software June 2012, Volume 49, Issue 9. http://www.jstatsoft.org/ RKWard: A Comprehensive Graphical User Interface and Integrated Development Environment for Statistical Analysis with R Stefan R¨odiger Thomas Friedrichsmeier Charit´e-Universit¨atsmedizin Berlin Ruhr-University Bochum Prasenjit Kapat Meik Michalke The Ohio State University Heinrich Heine University Dusseldorf¨ Abstract R is a free open-source implementation of the S statistical computing language and programming environment. The current status of R is a command line driven interface with no advanced cross-platform graphical user interface (GUI), but it includes tools for building such. Over the past years, proprietary and non-proprietary GUI solutions have emerged, based on internal or external tool kits, with different scopes and technological concepts. For example, Rgui.exe and Rgui.app have become the de facto GUI on the Microsoft Windows and Mac OS X platforms, respectively, for most users. In this paper we discuss RKWard which aims to be both a comprehensive GUI and an integrated devel- opment environment for R. RKWard is based on the KDE software libraries. Statistical procedures and plots are implemented using an extendable plugin architecture based on ECMAScript (JavaScript), R, and XML. RKWard provides an excellent tool to manage different types of data objects; even allowing for seamless editing of certain types. The objective of RKWard is to provide a portable and extensible R interface for both basic and advanced statistical and graphical analysis, while not compromising on flexibility and modularity of the R programming environment itself. Keywords: GUI, integrated development environment, plugin, R.
- 
												  Statistical Graphical User Interface Plug-In for Survival Analysis in R Statistical and Graphics Language and EnvironmentApplied Medical Informatics Original Research Vol. 23, No. 3-4/2008, pp: 57 - 62 Statistical Graphical User Interface Plug-In for Survival Analysis in R Statistical and Graphics Language and Environment Daniel C. LEUCUŢA*, Andrei ACHIMAŞ CADARIU „Iuliu Haţieganu” University of Medicine and Pharmacy Cluj-Napoca, Department of Medical Informatics and Biostatistics, 6 Louis Pasteur, 400349 Cluj-Napoca, Romania. E-mail: [email protected] * Author to whom correspondence should be addressed; Tel.: +4-0264-431697; Fax: +4-0264- 593847. Abstract: Introduction: R is a statistical and graphics language and environment. Although it is extensively used in command line, graphical user interfaces exist to ease the accommodation with it for new users. Rcmdr is an R package providing a basic-statistics graphical user interface to R. Survival analysis interface is not provided by Rcmdr. The AIM of this paper was to create a plug-in for Rcmdr to provide survival analysis user interface for some basic R survival analysis functions. Materials and Methods: The Rcmdr plug-in code was written in Tinn-R. The plug-in package was tested and built with Rtools. The plug-in was installed and tested in R with Rcmdr package on a Windows XP workstation with the "aml" and "kidney" data sets from survival R package. Results: The Rcmdr survival analysis plug-in was successfully built and it provides the functionality it was designed to offer: interface for Kaplan Meier and log log survival graph, interface for the log-rank test, interface to create a Cox proportional hazard regression model, interface commands to test and assess graphically the proportional hazard assumption, and influence observations.
- 
												  Deducer: a Data Analysis GUI for RJSS Journal of Statistical Software June 2012, Volume 49, Issue 8. http://www.jstatsoft.org/ Deducer: A Data Analysis GUI for R Ian Fellows University of California, Los Angeles Abstract While R has proven itself to be a powerful and flexible tool for data exploration and analysis, it lacks the ease of use present in other software such as SPSS and Minitab. An easy to use graphical user interface (GUI) can help new users accomplish tasks that would otherwise be out of their reach, and improves the efficiency of expert users by replacing fifty key strokes with five mouse clicks. With this in mind, Deducer presents dialogs that are understandable for the beginner, and yet contain all (or most) of the options that an experienced statistician, performing the same task, would want. An Excel-like spreadsheet is included for easy data viewing and editing. Deducer is based on Java's Swing GUI library and can be used on any common operating system. The GUI is independent of the specific R console and can easily be used by calling a text-based menu system. Graphical menus are provided for the JGR console and the Windows R GUI. Keywords: GUI, R. 1. Introduction R (R Development Core Team 2012) is a powerful statistical programming language that places the latest statistical techniques at one's fingertips through thousands of add-on packages available on the Comprehensive R Archive Network (CRAN) download servers. The price for all of this power is complexity. Because R analyses must be called as text commands, the user is required to find out the name of the function that will accomplish their task, and then remember that name along with the names of the variables to feed it, and its argument options.
- 
												  A New Package for the Birnbaum-Saunders DistributionNews The Newsletter of the R Project Volume 6/4, October 2006 Editorial by Paul Murrell Matthew Pocernich takes a moment to stop and smell the CO2 levels and describes how R is involved Welcome to the second regular issue of R News in some of the reproducible research that informs the for 2006, which follows the release of R version climate change debate. 2.4.0. This issue reflects the continuing growth in The next article, by Roger Peng, introduces the R’s sphere of influence, with articles covering a wide filehash package, which provides a new approach range of extensions to, connections with, and ap- to the problem of working with large data sets in R. plications of R. Another sign of R’s growing popu- Robin Hankin then describes the gsl package, which larity was the success and sheer magnitude of the implements an interface to some exotic mathematical second useR! conference (in Vienna in June), which functions in the GNU Scientific Library. included over 180 presentations. Balasubramanian The final three main articles have more statistical Narasimhan provides a report on useR! 2006 on page content. Wolfgang Lederer and Helmut Küchenhoff 45 and two of the articles in this issue follow on from describe the simex package for taking measurement presentations at useR!. error into account. Roger Koenker (another useR! We begin with an article by Max Kuhn on the presenter) discusses some non-traditional link func- odfWeave package, which provides literate statisti- tions for generalised linear models. And Víctor Leiva cal analysis à la Sweave, but with ODF documents as and co-authors introduce the bs package, which im- plements the Birnbaum-Saunders Distribution.
- 
												  Molecular Data in R Phylogeny, Evolution & RIntroduction R Data Alignment Basic analysis SNP DAPC Structure Spatial analysis Trees Evolution The end Molecular data in R Phylogeny, evolution & R Vojtěch Zeisek Department of Botany, Faculty of Science, Charles University, Prague Institute of Botany, Czech Academy of Sciences, Průhonice https://trapa.cz/, [email protected] February 8 to 11, 2021 . Vojtěch Zeisek (https://trapa.cz/) Molecular data in R February 8 to 11, 2021 1 / 421 Introduction R Data Alignment Basic analysis SNP DAPC Structure Spatial analysis Trees Evolution The end Outline I 1 Introduction 2 R Installation Let’s start with R Basic operations in R Tasks Packages for our work 3 Data Overview of data and data types Microsatellites AFLP Notes about data DNA sequences, SNP . VCF . Vojtěch Zeisek (https://trapa.cz/) Molecular data in R February 8 to 11, 2021 2 / 421 Introduction R Data Alignment Basic analysis SNP DAPC Structure Spatial analysis Trees Evolution The end Outline II Export Tasks 4 Alignment Overview and MAFFT MAFFT, Clustal, MUSCLE and T-Coffee Multiple genes Display and cleaning Tasks 5 Basic analysis First look at the data Statistics MSN . Genetic distances . Vojtěch Zeisek (https://trapa.cz/) Molecular data in R February 8 to 11, 2021 3 / 421 Introduction R Data Alignment Basic analysis SNP DAPC Structure Spatial analysis Trees Evolution The end Outline III AMOVA Hierarchical clustering NJ (and UPGMA) tree PCoA Tasks 6 SNP PCA and NJ 7 DAPC Bayesian clustering Discriminant analysis and visualization Tasks 8 Structure Running Structure from R . Vojtěch Zeisek (https://trapa.cz/) Molecular data in R February 8 to 11, 2021 4 / 421 Introduction R Data Alignment Basic analysis SNP DAPC Structure Spatial analysis Trees Evolution The end Outline IV ParallelStructure on Windows Post processing 9 Spatial analysis Moran’s I sPCA Monmonier Mantel test Geneland Plotting maps Tasks 10 Trees Manipulations .
- 
												  A Glossary of R JargonA Glossary of R jargon Below is a selection of common R terms defined first using Stata jargon (or plain English when possible) and then more formally using R jargon. Some definitions in Stata jargon are quite loose given the fact that they have no direct analog of some R terms. Definitions in R terms are often quoted (with permission) or paraphrased from S Poetry by Patrick Burns [3]. Apply The process of having a command work on variables or observations. Determines whether a procedure will act as a typical command or as a function instead. Also the name of a function that controls that process. More formally, the process of targeting a function on rows or columns. Also a function that does that. Argument The options that control what the commands do and the arguments that control what functions do. Confusing because in R, functions do what both commands and functions do in Stata. More formally, input(s) to a function that control it. Includes data to analyze. Array A matrix with more than two dimensions. All variables must be only one type (e.g., all numeric or all character). More formally, a vec- tor with a dim attribute. The dim controls the number and size of dimensions. Assignment function Assigns values like the equal sign in Stata. The two-key sequence, “<-”, that places data or results of procedures or transformations into a variable or data set. More formally, the two-key sequence, “<-”, that gives names to objects. Atomic object A variable whose values are all of one type, such as all numeric or all character.
- 
												  Kurt Hornik IR FAQ Frequently Asked Questions on R Version 2.6.2007-10-22 ISBN 3-900051-08-9 Kurt Hornik i Table of Contents 1 Introduction............................... 1 1.1 Legalese .................................................... 1 1.2 Obtaining this document..................................... 1 1.3 Citing this document ........................................ 1 1.4 Notation.................................................... 1 1.5 Feedback ................................................... 2 2 R Basics .................................. 3 2.1 What is R? ................................................. 3 2.2 What machines does R run on?............................... 3 2.3 What is the current version of R?............................. 4 2.4 How can R be obtained? ..................................... 4 2.5 How can R be installed? ..................................... 4 2.5.1 How can R be installed (Unix) ........................... 4 2.5.2 How can R be installed (Windows) ....................... 5 2.5.3 How can R be installed (Macintosh) ...................... 5 2.6 Are there Unix binaries for R? ............................... 6 2.7 What documentation exists for R? ............................ 6 2.8 Citing R .................................................... 8 2.9 What mailing lists exist for R? ............................... 9 2.10 What is CRAN? ........................................... 10 2.11 Can I use R for commercial purposes? ...................... 10 2.12 Why is R named R? ....................................... 11 2.13 What
- 
												  Kurt Hornik IR faq Frequently Asked Questions on R Version 2.7.2008-04-18 ISBN 3-900051-08-9 Kurt Hornik i Table of Contents 1 Introduction ............................... 1 1.1 Legalese ................................................ 1 1.2 Obtaining this document................................. 1 1.3 Citing this document .................................... 1 1.4 Notation................................................ 1 1.5 Feedback ............................................... 2 2 R Basics .................................. 3 2.1 What is R? ............................................. 3 2.2 What machines does R run on?........................... 3 2.3 What is the current version of R?......................... 4 2.4 How can R be obtained? ................................. 4 2.5 How can R be installed? ................................. 4 2.5.1 How can R be installed (Unix) ................... 4 2.5.2 How can R be installed (Windows) ............... 5 2.5.3 How can R be installed (Macintosh).............. 5 2.6 Are there Unix binaries for R? ........................... 6 2.7 What documentation exists for R? ........................ 7 2.8 Citing R................................................ 8 2.9 What mailing lists exist for R? ........................... 9 2.10 What is cran? ....................................... 10 2.11 Can I use R for commercial purposes? .................. 11 2.12 Why is R named R? ................................... 11 2.13 What is the R Foundation? ............................ 11 3 R and S .................................
- 
												  R Packages Installed on ITS MachinesR Packages Installed on ITS machines Bruce Dudek Fall Semester 2020 ITS classrooms have R/RStudio installed on Windows machines. This document contains a list of all packages available in those installations. Users who wish to have additional packages installed can contact bdudek at albany dot edu and the installation can happen rapidly. Pls note that the base system R packages are listed at the bottom of the table, below the zoo package. PackageName PackageTitle abd The Analysis of Biological Data abind Combine Multidimensional Arrays ACCLMA ACC & LMA Graph Plotting acepack ACE and AVAS for Selecting Multiple Regression Transformations ada The R Package Ada for Stochastic Boosting addinmanager RStudio ’addin’ Manager additivityTests Additivity Tests in the Two Way Anova with Single Sub-class Numbers ade4 Analysis of Ecological Data: Exploratory and Euclidean Methods in Environmental Sciences ade4TkGUI ’ade4’ Tcl/Tk Graphical User Interface adegenet Exploratory Analysis of Genetic and Genomic Data adegraphics An S4 Lattice-Based Package for the Representation of Multivariate Data adehabitat Analysis of Habitat Selection by Animals adehabitatLT Analysis of Animal Movements adehabitatMA Tools to Deal with Raster Maps adephylo Exploratory Analyses for the Phylogenetic Comparative Method ads Spatial Point Patterns Analysis AED What the package does (short line) AER Applied Econometrics with R afex Analysis of Factorial Experiments agricolae Statistical Procedures for Agricultural Research airports Data on Airports akima Interpolation of Irregularly
- 
												  R Front-Ends in QuantianUse R fifteen different ways: R front-ends in Quantian Dirk Eddelbuettel [email protected] Abstract submitted to useR! 2006 Quantian (Eddelbuettel, 2003) has become one of the 11. R can be invoked from Python using the Rpy most comprehensive environments for quantitative module; and scientific computing. Within Quantian, the R lan- guage and programming environment has always had 12. Similarly, RSPerl permits R to be driven from 3 a central focus: Quantian provides not only R itself, Perl – and the other way around; but numerous add-ons, interfaces as well as essen- 13. R can also be embedded into other applications: tially all packages from the CRAN and BioConductor one such example is provided by the Pl/R proce- repositories. dural R language for the PostgreSQL RDBMS; With release 0.7.9.2 of Quantian1, the list of ‘inter- faces’ (where the term is used in a fuzzy and encom- 14. rJava permits R to be used from Java programs; passing way) has increased considerably. The paper will briefly discuss and summarize all of the interfaces 15. SNOW provides a high-level interface to dis- to R that are now included and ready to be deployed tributed statistical computing, and the under- immediately: lying components Rmpi and Rpvm are also avail- able directly. 1. R can of course be used the traditional way from Quantian 0.7.9.2 also contain 877 packages provid- the command-line or shell as R; ing a complete collection of R code. This set com- 2. R can be used via the cross-platform graphical prises essentially all Unix-installable packages from user interface when started as R --gui=Tk; CRAN, the complete BioConductor relase 1.7, a few Omegahat packages as well as packages from J.
- 
												  An Introduction to RAn introduction to R Longhow Lam Under Construction Feb-2010 some sections are unfinished! longhowlam at gmail dot com Contents 1 Introduction 7 1.1 What is R? . .7 1.2 The R environment . .8 1.3 Obtaining and installing R . .8 1.4 Your first R session . .9 1.5 The available help . 11 1.5.1 The on line help . 11 1.5.2 The R mailing lists and the R Journal . 12 1.6 The R workspace, managing objects . 12 1.7 R Packages . 13 1.8 Conflicting objects . 15 1.9 Editors for R scripts . 16 1.9.1 The editor in RGui . 16 1.9.2 Other editors . 16 2 Data Objects 19 2.1 Data types . 19 2.1.1 Double . 19 2.1.2 Integer . 20 2.1.3 Complex . 21 2.1.4 Logical . 21 2.1.5 Character . 22 2.1.6 Factor . 23 2.1.7 Dates and Times . 25 2.1.8 Missing data and Infinite values . 27 2.2 Data structures . 28 2.2.1 Vectors . 28 2.2.2 Matrices . 32 2.2.3 Arrays . 34 2.2.4 Data frames . 35 2.2.5 Time-series objects . 37 2.2.6 Lists . 38 2.2.7 The str function . 41 3 Importing data 42 3.1 Text files . 42 1 3.1.1 The scan function . 44 3.2 Excel files . 44 3.3 Databases . 45 3.4 The Foreign package . 46 4 Data Manipulation 47 4.1 Vector subscripts . 47 4.2 Matrix subscripts . 51 4.3 Manipulating Data frames .