The Voice of the R Community
Total Page:16
File Type:pdf, Size:1020Kb
The Voice of the R Community David Smith @revodavid Director R Consortium R Consortium’s mission is to work with and provide support to key organizations and groups developing, maintaining, distributing and using R. The R Community Survey • Survey of R users Worldwide – 29 survey questions provided in English, Spanish and Chinese • Opened July 1, 2017; closed Aug 31, 2017 • 3618 Responses – 98% of respondents reported using R – 2% of respondents used R in the past, or planned to in future • Many verbatim (free-text responses) – Lots of interesting feedback! Where did the responses come from? 1616 respondents selected their location on an interactive map Demographics of respondents R usage by respondents • 83% use R as their primary data analysis tool • 88% use R for work – 37% use R in production • 37% have been using R for more than 5 years Where R users work Of respondents: 35% Academic 65% Industry What R interface do respondents use? 90% of respondents use R via RStudio (multiple selections allowed) What do people do with R? 37% of respondents have authored or contributed to an R package What other tools do respondents use? 66% also use Excel 44% also use Python (multiple selections allowed) What is “Big Data” for R users? 17% of respondents typically analyze 1M records or more Satisfaction with R 80% of respondents were “completely satisfied” or “very satisfied” by R overall What is the best aspect of working with R? Key themes: community and ecosystem the R language itself packages online resources What is the best aspect of working with R? Key themes: community and ecosystem the R language itself packages online resources What is the worst aspect of working with R? Key themes: steep learning curve performance & memory unfriendly docs / error messages issues with production use comparisons to Python What is the worst aspect of working with R? Key themes: steep learning curve performance & memory unfriendly docs / error messages issues with production use comparisons to Python Impacts from the survey data • R Localization – R translated into other languages • DBI Project – improve interface between R and databases • Infrastructure Development – R and package binaries • R in Medicine and R in Pharma working groups • R Community Diversity and Inclusion Working Group www.r-consortium.org/projects/awarded-projects R package developers embracing sustainability CII Best Practices Badge makes it easier for users of open source projects to see which projects take security seriously. Introduced in May 2016, over 1,000+ FOSS projects have registered for the badge. While only 10% of the projects successfully passed, every one of them made an improvement to achieve a badge. Learn more about the program and start the process with your package at https://bestpractices.coreinfrastructure.org/en R Packages that have or in progress of achieving CII Best Practices Badge ( as of 20180701 ) madrid.air R package to parse air quality data https://github.com/nramon/madri foghorn R package to summarize CRAN Check Results in https://github.com/fmichonneau/f published by http://datos.madrid.es/. d.air the Terminal oghorn An R Package for Quick ciTools is an R package that makes https://github.com/jthaman/ciToo osrm Shortest Paths and Travel Time from https://rgeomatic.hypotheses.org/ Uncertainty Intervals working with model uncertainty as easy ls OpenStreetMap with R category/osrm as possible. It gives the user easy access R_Matrix R package for Sparse and Dense Matrix Classes http://matrix.r-forge.r-project.org to confidence or prediction intervals... and Methods dodgr Distances on Directed Graphs in R https://ATFutures.github.io/dodgr A rich hierarchy of matrix classes, including netReg Network-penalized generalized linear https://dirmeier.github.io/netReg triangular, symmetric, and diagonal matrices,... models in R and C++. base64enc R tools for base64 encoding https://github.com/s-u/base64enc DBI A database interface (DBI) definition for http://dbi.r-dbi.org communication between R and RDBMSs ggplot2 An implementation of the Grammar of Graphics https://ggplot2.tidyverse.org datastructures Implementation of core data structures https://dirmeier.github.io/datastr in R for R. uctures covr Test coverage reports for R https://github.com/r-lib/covr Acknowledgements • Andrie de Vries • Mark Hornik, Augustina Ragwitz • John Mertic • R Consortium Members • You? – Looking for help to anonymize the survey data so we can release it for further analysis Thank you! David Smith @revodavid www.r-consortium.org.