Getting Started with R and Rstudio
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Instructions for Creating Your Own R Package∗
Instructions for Creating Your Own R Package∗ In Song Kimy Phil Martinz Nina McMurryx Andy Halterman{ March 18, 2018 1 Introduction The following is a step-by-step guide to creating your own R package. Even beyond this course, you may find this useful for storing functions you create for your own research or for editing existing R packages to suit your needs. This guide contains three different sets of instructions. If you use RStudio, you can follow the \Ba- sic Instructions" in Section 2 which involve using RStudio's interface. If you do not use RStudio or you do use RStudio but want a little bit more of control, follow the instructions in Section 3. Section 4 illustrates how to create a R package with functions written in C++ via Rcpp helper functions. NOTE: Write all of your functions first (in R or RStudio) and make sure they work properly before you start compiling your package. You may also want to try compiling with a very simple function first (e.g. myfun <- function(x)fx + 7g). 2 Basic Instructions (for RStudio Users Only) All of the following should be done in RStudio, unless otherwise noted. Even if you build your package in RStudio using the \Basic Instructions," we strongly recommend that you carefully review the \Advanced Instructions" as well. RStudio has built-in tools that will do many of these steps for you, but knowing how to do them manually will make it easier for you to build and distribute your own packages in the future and/or adapt existing packages. -
Rstudio Connect: Admin Guide Version 1.5.12-7
RStudio Connect: Admin Guide Version 1.5.12-7 Abstract This guide will help an administrator install and configure RStudio Connect on a managed server. You will learn how to install the product on different operating systems, configure authentication, and monitor system resources. Contents 1 Introduction 4 1.1 System Requirements . .4 2 Getting Started 5 2.1 Installation . .5 2.2 Initial Configuration . .7 3 License Management 9 3.1 Capabilities . .9 3.2 Notification of Expiration . .9 3.3 Product Activation . .9 3.4 Connectivity Requirements . 10 3.5 Evaluations . 11 3.6 Licensing Errors . 12 3.7 Floating Licensing . 12 4 Files & Directories 15 4.1 Program Files . 15 4.2 Configuration . 15 4.3 Server Log . 15 4.4 Access Logs . 16 4.5 Application Logs . 16 4.6 Variable Data . 16 4.7 Backups . 18 4.8 Server Migrations . 18 5 Server Management 19 5.1 Stopping and Starting . 19 5.2 System Messages . 21 5.3 Health-Check . 21 5.4 Upgrading . 21 5.5 Purging RStudio Connect . 22 6 High Availability and Load Balancing 22 6.1 HA Checklist . 22 6.2 HA Limitations . 23 6.3 Updating HA Nodes . 24 6.4 Downgrading . 24 6.5 HA Details . 24 1 7 Running with a Proxy 25 7.1 Nginx Configuration . 26 7.2 Apache Configuration . 27 8 Security & Auditing 28 8.1 API Security . 28 8.2 Browser Security . 28 8.3 Audit Logs . 30 8.4 Audit Logs Command-Line Interface . 31 9 Database 31 9.1 SQLite . 31 9.2 PostgreSQL . -
A Practical Guide for Improving Transparency and Reproducibility in Neuroimaging Research Krzysztof J
bioRxiv preprint first posted online Feb. 12, 2016; doi: http://dx.doi.org/10.1101/039354. The copyright holder for this preprint (which was not peer-reviewed) is the author/funder. It is made available under a CC-BY 4.0 International license. A practical guide for improving transparency and reproducibility in neuroimaging research Krzysztof J. Gorgolewski and Russell A. Poldrack Department of Psychology, Stanford University Abstract Recent years have seen an increase in alarming signals regarding the lack of replicability in neuroscience, psychology, and other related fields. To avoid a widespread crisis in neuroimaging research and consequent loss of credibility in the public eye, we need to improve how we do science. This article aims to be a practical guide for researchers at any stage of their careers that will help them make their research more reproducible and transparent while minimizing the additional effort that this might require. The guide covers three major topics in open science (data, code, and publications) and offers practical advice as well as highlighting advantages of adopting more open research practices that go beyond improved transparency and reproducibility. Introduction The question of how the brain creates the mind has captivated humankind for thousands of years. With recent advances in human in vivo brain imaging, we how have effective tools to peek into biological underpinnings of mind and behavior. Even though we are no longer constrained just to philosophical thought experiments and behavioral observations (which undoubtedly are extremely useful), the question at hand has not gotten any easier. These powerful new tools have largely demonstrated just how complex the biological bases of behavior actually are. -
Fira Code: Monospaced Font with Programming Ligatures
Personal Open source Business Explore Pricing Blog Support This repository Sign in Sign up tonsky / FiraCode Watch 282 Star 9,014 Fork 255 Code Issues 74 Pull requests 1 Projects 0 Wiki Pulse Graphs Monospaced font with programming ligatures 145 commits 1 branch 15 releases 32 contributors OFL-1.1 master New pull request Find file Clone or download lf- committed with tonsky Add mintty to the ligatures-unsupported list (#284) Latest commit d7dbc2d 16 days ago distr Version 1.203 (added `__`, closes #120) a month ago showcases Version 1.203 (added `__`, closes #120) a month ago .gitignore - Removed `!!!` `???` `;;;` `&&&` `|||` `=~` (closes #167) `~~~` `%%%` 3 months ago FiraCode.glyphs Version 1.203 (added `__`, closes #120) a month ago LICENSE version 0.6 a year ago README.md Add mintty to the ligatures-unsupported list (#284) 16 days ago gen_calt.clj Removed `/**` `**/` and disabled ligatures for `/*/` `*/*` sequences … 2 months ago release.sh removed Retina weight from webfonts 3 months ago README.md Fira Code: monospaced font with programming ligatures Problem Programmers use a lot of symbols, often encoded with several characters. For the human brain, sequences like -> , <= or := are single logical tokens, even if they take two or three characters on the screen. Your eye spends a non-zero amount of energy to scan, parse and join multiple characters into a single logical one. Ideally, all programming languages should be designed with full-fledged Unicode symbols for operators, but that’s not the case yet. Solution Download v1.203 · How to install · News & updates Fira Code is an extension of the Fira Mono font containing a set of ligatures for common programming multi-character combinations. -
Econometric Data Science
Econometric Data Science Francis X. Diebold University of Pennsylvania October 22, 2019 1 / 280 Copyright c 2013-2019, by Francis X. Diebold. All rights reserved. All materials are freely available for your use, but be warned: they are highly preliminary, significantly incomplete, and rapidly evolving. All are licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. (Briefly: I retain copyright, but you can use, copy and distribute non-commercially, so long as you give me attribution and do not modify. To view a copy of the license, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.) In return I ask that you please cite the books whenever appropriate, as: "Diebold, F.X. (year here), Book Title Here, Department of Economics, University of Pennsylvania, http://www.ssc.upenn.edu/ fdiebold/Textbooks.html." The painting is Enigma, by Glen Josselsohn, from Wikimedia Commons. 2 / 280 Introduction 3 / 280 Numerous Communities Use Econometrics Economists, statisticians, analysts, "data scientists" in: I Finance (Commercial banking, retail banking, investment banking, insurance, asset management, real estate, ...) I Traditional Industry (manufacturing, services, advertising, brick-and-mortar retailing, ...) I e-Industry (Google, Amazon, eBay, Uber, Microsoft, ...) I Consulting (financial services, litigation support, ...) I Government (treasury, agriculture, environment, commerce, ...) I Central Banks and International Organizations (FED, IMF, World Bank, OECD, BIS, ECB, ...) 4 / 280 Econometrics is Special Econometrics is not just \statistics using economic data". Many properties and nuances of economic data require knowledge of economics for sucessful analysis. I Emphasis on predictions, guiding decisions I Observational data I Structural change I Volatility fluctuations ("heteroskedasticity") I Even trickier in time series: Trend, Seasonality, Cycles ("serial correlation") 5 / 280 Let's Elaborate on the \Emphasis on Predictions Guiding Decisions".. -
Reproducible Reports with Knitr and R Markdown
Reproducible Reports with knitr and R Markdown https://dl.dropboxusercontent.com/u/15335397/slides/2014-UPe... Reproducible Reports with knitr and R Markdown Yihui Xie, RStudio 11/22/2014 @ UPenn, The Warren Center 1 of 46 1/15/15 2:18 PM Reproducible Reports with knitr and R Markdown https://dl.dropboxusercontent.com/u/15335397/slides/2014-UPe... An appetizer Run the app below (your web browser may request access to your microphone). http://bit.ly/upenn-r-voice install.packages("shiny") Or just use this: https://yihui.shinyapps.io/voice/ 2/46 2 of 46 1/15/15 2:18 PM Reproducible Reports with knitr and R Markdown https://dl.dropboxusercontent.com/u/15335397/slides/2014-UPe... Overview and Introduction 3 of 46 1/15/15 2:18 PM Reproducible Reports with knitr and R Markdown https://dl.dropboxusercontent.com/u/15335397/slides/2014-UPe... I know you click, click, Ctrl+C and Ctrl+V 4/46 4 of 46 1/15/15 2:18 PM Reproducible Reports with knitr and R Markdown https://dl.dropboxusercontent.com/u/15335397/slides/2014-UPe... But imagine you hear these words after you finished a project Please do that again! (sorry we made a mistake in the data, want to change a parameter, and yada yada) http://nooooooooooooooo.com 5/46 5 of 46 1/15/15 2:18 PM Reproducible Reports with knitr and R Markdown https://dl.dropboxusercontent.com/u/15335397/slides/2014-UPe... Basic ideas of dynamic documents · code + narratives = report · i.e. computing languages + authoring languages We built a linear regression model. -
David O. Neville, Phd, MS Rev
David O. Neville, PhD, MS Rev. 05 March 2021 The Center for Teaching, Learning, and Assessment Email: [email protected] 1119 6th Avenue Website: https://doktorfrag.com Grinnell College Twitter: https://twitter.com/doktorfrag Grinnell, IA 50112 Education MS Utah State University (Logan, Utah, USA, 2007) Instructional Technology and Learning Sciences Concentration in Computer Science; Business Information Systems PhD Washington University in St. Louis (Missouri, USA, 2002) German Language and Literature Concentration in Latin Language and Literature; Medieval Studies Ludwig-Maximilians-Universität (Munich, Germany, 1999-2000) DAAD Annual Scholarship (Jahresstipendium) AM Washington University in St. Louis (Missouri, USA, 1997) German Language and Literature BA, Brigham Young University (Provo, Utah, USA, 1994) Honors German Language and Literature Minor in Russian Language and Literature Employment Grinnell College 2015- Digital Liberal Arts Specialist Present The Digital Liberal Arts Collaborative Elon University 2014-15 Associate Professor of German Department of World Languages and Cultures 2008-14 Assistant Professor of German and Director of Language Learning Technologies Department of World Languages and Cultures Utah State 2006-08 Instructional Designer and Blackboard Administrator University Faculty Assistance Center for Teaching (FACT) 2004-06 Visiting Assistant Professor Department of Languages, Philosophy, and Speech Communication Washington 2002-03 Lecturer and Instructional Technology Specialist University Department of Germanic Languages and Literatures in St. Louis Curriculum Vitæ: David O. Neville, PhD, MS Pg. 1 Fellowships and Awards 2017 Top Three Print Poster in the 2017 Humanities, Arts, Science and Technology Alliance and Collaboratory (HASTAC) Conference Poster Competition: “Visualizing Difficult Historical Realities: The Uncle Sam Plantation Project.” With Sarah Purcell (Co-Presenter). -
R Markdown Cheat Sheet I
1. Workflow R Markdown is a format for writing reproducible, dynamic reports with R. Use it to embed R code and results into slideshows, pdfs, html documents, Word files and more. To make a report: R Markdown Cheat Sheet i. Open - Open a file that ii. Write - Write content with the iii. Embed - Embed R code that iv. Render - Replace R code with its output and transform learn more at rmarkdown.rstudio.com uses the .Rmd extension. easy to use R Markdown syntax creates output to include in the report the report into a slideshow, pdf, html or ms Word file. rmarkdown 0.2.50 Updated: 8/14 A report. A report. A report. A report. A plot: A plot: A plot: A plot: Microsoft .Rmd Word ```{r} ```{r} ```{r} = = hist(co2) hist(co2) hist(co2) ``` ``` Reveal.js ``` ioslides, Beamer 2. Open File Start by saving a text file with the extension .Rmd, or open 3. Markdown Next, write your report in plain text. Use markdown syntax to an RStudio Rmd template describe how to format text in the final report. syntax becomes • In the menu bar, click Plain text File ▶ New File ▶ R Markdown… End a line with two spaces to start a new paragraph. *italics* and _italics_ • A window will open. Select the class of output **bold** and __bold__ you would like to make with your .Rmd file superscript^2^ ~~strikethrough~~ • Select the specific type of output to make [link](www.rstudio.com) with the radio buttons (you can change this later) # Header 1 • Click OK ## Header 2 ### Header 3 #### Header 4 ##### Header 5 ###### Header 6 4. -
The Architecture of Platforms: a Unified View Working Paper
The Architecture of Platforms: A Unified View Carliss Y. Baldwin C. Jason Woodard Working Paper 09-034 Copyright © 2008 by Carliss Y. Baldwin and C. Jason Woodard Working papers are in draft form. This working paper is distributed for purposes of comment and discussion only. It may not be reproduced without permission of the copyright holder. Copies of working papers are available from the author. The Architecture of Platforms: A Unified View Carliss Y. Baldwin* C. Jason Woodard† *corresponding author: Harvard Business School Boston, MA, 02163 [email protected] † Singapore Management University [email protected] Copyright © Carliss Y. Baldwin and C. Jason Woodard, 2008 2 The Architecture of Platforms: A Unified View Carliss Y. Baldwin and C. Jason Woodard Abstract The central role of “platform” products and services in mediating the activities of disaggregated “clusters” or “ecosystems” of firms has been widely recognized. But platforms and the systems in which they are embedded are very diverse. In particular, platforms may exist within firms as product lines, across firms as multi-product systems, and in the form of multi-sided markets. In this paper we argue that there is a fundamental unity in the architecture of platforms. Platform architectures are modularizations of complex systems in which certain components (the platform itself) remain stable, while others (the complements) are encouraged to vary in cross- section or over time. Among the most stable elements in a platform architecture are the modular interfaces that mediate between the platform and its complements. These interfaces are even more stable than the interior core of the platform, thus control over the interfaces amounts to control over the platform and its evolution. -
HGC: a Fast Hierarchical Graph-Based Clustering Method
Package ‘HGC’ September 27, 2021 Type Package Title A fast hierarchical graph-based clustering method Version 1.1.3 Description HGC (short for Hierarchical Graph-based Clustering) is an R package for conducting hierarchical clustering on large-scale single-cell RNA-seq (scRNA-seq) data. The key idea is to construct a dendrogram of cells on their shared nearest neighbor (SNN) graph. HGC provides functions for building graphs and for conducting hierarchical clustering on the graph. The users with old R version could visit https://github.com/XuegongLab/HGC/tree/HGC4oldRVersion to get HGC package built for R 3.6. License GPL-3 Encoding UTF-8 SystemRequirements C++11 Depends R (>= 4.1.0) Imports Rcpp (>= 1.0.0), RcppEigen(>= 0.3.2.0), Matrix, RANN, ape, dendextend, ggplot2, mclust, patchwork, dplyr, grDevices, methods, stats LinkingTo Rcpp, RcppEigen Suggests BiocStyle, rmarkdown, knitr, testthat (>= 3.0.0) VignetteBuilder knitr biocViews SingleCell, Software, Clustering, RNASeq, GraphAndNetwork, DNASeq Config/testthat/edition 3 NeedsCompilation yes git_url https://git.bioconductor.org/packages/HGC git_branch master git_last_commit 61622e7 git_last_commit_date 2021-07-06 Date/Publication 2021-09-27 1 2 CKNN.Construction Author Zou Ziheng [aut], Hua Kui [aut], XGlab [cre, cph] Maintainer XGlab <[email protected]> R topics documented: CKNN.Construction . .2 FindClusteringTree . .3 HGC.dendrogram . .4 HGC.parameter . .5 HGC.PlotARIs . .6 HGC.PlotDendrogram . .7 HGC.PlotParameter . .8 KNN.Construction . .9 MST.Construction . 10 PMST.Construction . 10 Pollen . 11 RNN.Construction . 12 SNN.Construction . 12 Index 14 CKNN.Construction Building Unweighted Continuous K Nearest Neighbor Graph Description This function builds a Continuous K Nearest Neighbor (CKNN) graph in the input feature space using Euclidean distance metric. -
Rmarkdown : : CHEAT SHEET RENDERED OUTPUT File Path to Output Document SOURCE EDITOR What Is Rmarkdown? 1
rmarkdown : : CHEAT SHEET RENDERED OUTPUT file path to output document SOURCE EDITOR What is rmarkdown? 1. New File Write with 5. Save and Render 6. Share find in document .Rmd files · Develop your code and publish to Markdown ideas side-by-side in a single rpubs.com, document. Run code as individual shinyapps.io, The syntax on the lef renders as the output on the right. chunks or as an entire document. set insert go to run code RStudio Connect Rmd preview code code chunk(s) Plain text. Plain text. Dynamic Documents · Knit together location chunk chunk show End a line with two spaces to End a line with two spaces to plots, tables, and results with outline start a new paragraph. start a new paragraph. narrative text. Render to a variety of 4. Set Output Format(s) Also end with a backslash\ Also end with a backslash formats like HTML, PDF, MS Word, or and Options reload document to make a new line. to make a new line. MS Powerpoint. *italics* and **bold** italics and bold Reproducible Research · Upload, link superscript^2^/subscript~2~ superscript2/subscript2 to, or attach your report to share. ~~strikethrough~~ strikethrough Anyone can read or run your code to 3. Write Text run all escaped: \* \_ \\ escaped: * _ \ reproduce your work. previous modify chunks endash: --, emdash: --- endash: –, emdash: — chunk run options current # Header 1 Header 1 chunk ## Header 2 Workflow ... Header 2 2. Embed Code ... 11. Open a new .Rmd file in the RStudio IDE by ###### Header 6 Header 6 going to File > New File > R Markdown. -
Statistics with Free and Open-Source Software
Free and Open-Source Software • the four essential freedoms according to the FSF: • to run the program as you wish, for any purpose • to study how the program works, and change it so it does Statistics with Free and your computing as you wish Open-Source Software • to redistribute copies so you can help your neighbor • to distribute copies of your modified versions to others • access to the source code is a precondition for this Wolfgang Viechtbauer • think of ‘free’ as in ‘free speech’, not as in ‘free beer’ Maastricht University http://www.wvbauer.com • maybe the better term is: ‘libre’ 1 2 General Purpose Statistical Software Popularity of Statistical Software • proprietary (the big ones): SPSS, SAS/JMP, • difficult to define/measure (job ads, articles, Stata, Statistica, Minitab, MATLAB, Excel, … books, blogs/posts, surveys, forum activity, …) • FOSS (a selection): R, Python (NumPy/SciPy, • maybe the most comprehensive comparison: statsmodels, pandas, …), PSPP, SOFA, Octave, http://r4stats.com/articles/popularity/ LibreOffice Calc, Julia, … • for programming languages in general: TIOBE Index, PYPL, GitHut, Language Popularity Index, RedMonk Rankings, IEEE Spectrum, … • note that users of certain software may be are heavily biased in their opinion 3 4 5 6 1 7 8 What is R? History of S and R • R is a system for data manipulation, statistical • … it began May 5, 1976 at: and numerical analysis, and graphical display • simply put: a statistical programming language • freely available under the GNU General Public License (GPL) → open-source