Book of Abstracts
Total Page:16
File Type:pdf, Size:1020Kb
Book of Abstracts June 27, 2015 1 Conference Sponsors Diamond Sponsor Platinum Sponsors Gold Sponsors Silver Sponsors Open Analytics Bronze Sponsors Media Sponsors 2 Conference program Time Tuesday Wednesday Thursday Friday 08:00 Registration opens Registration opens Registration opens Registration opens 08:30 – 09:00 Opening session (by Rector peR! M. Johansen, Aalborg University) Aalborghallen 09:00 – 10:00 Romain François Di Cook Thomas Lumley Aalborghallen Aalborghallen Aalborghallen 10:00 – 10:30 Coffee break Coffee break Coffee break (15 min) ee break Sponsored by Quantide Sponsored by Alteryx ff Session 1 Session 4 10:30 – 12:00 Sponsor session (10:15) Kaleidoscope 1 Kaleidoscope 4 Aalborghallen Aalborghallen Aalborghallen incl. co Morning Tutorials DataRobot Ecology Medicine Gæstesalen Gæstesalen RStudio Teradata Networks Regression Musiksalen Musiksalen Revolution Analytics Reproducibility Commercial Offerings alteryx Det Lille Teater Det Lille Teater TIBCO H O Interfacing Interactive graphics 2 Radiosalen Radiosalen HP 12:00 – 13:00 Sandwiches Lunch (standing buffet) Lunch (standing buffet) Break: 12:00 – 12:30 Sponsored by Sponsored by TIBCO ff Revolution Analytics Ste en Lauritzen (12:30) Aalborghallen Session 2 Session 5 13:00 – 14:30 13:30: Closing remarks Kaleidoscope 2 Kaleidoscope 5 Aalborghallen Aalborghallen 13:45: Grab ’n go lunch 14:00: Conference ends Case study Teaching 1 Gæstesalen Gæstesalen Clustering Statistical Methodology 1 Musiksalen Musiksalen ee break Data Management Machine Learning 1 ff Det Lille Teater Det Lille Teater Computational Performance Visualisation 1 incl. co Radiosalen Radiosalen Afternoon Tutorials 14:30 – 15:00 Coffee (with cake) Coffee (with cake) Adrian Baddeley Susan Holmes 15:00 – 16:00 Aalborghallen Aalborghallen Session 3 Session 6 16:00 – 17:30 Kaleidoscope 3 Kaleidoscope 6 Aalborghallen Aalborghallen Business Teaching 2 Gæstesalen Gæstesalen Spatial Statistical Methodology 2 Musiksalen Musiksalen Database Machine Learning 2 Det Lille Teater Det Lille Teater Lightning talks Visualisation 2 Radiosalen Radiosalen Evening Welcome reception Poster session, drinks and buffet Conference dinner Sponsored by Teradata Sponsored by DataRobot Sponsored by RStudio (19:00 – 22:00) (18:00 – 21:00) (18:00 – 23:00) Contents Part I: Invited speakers Invited speakers 19 My R adventures ............................... 19 Romain François How R has changed spatial statistics ................... 20 Adrian Baddeley A Survey of Two Decades of Efforts to Build Interactive Graphics Capacity in R ................................. 21 Di Cook Multitype data integration : challenges from the Human Microbiome 22 Susan Holmes How flexible computing expands what an individual can do . 23 Thomas Lumley Linear estimating equations for Gaussian graphical models with symmetry ................................... 24 Steffen L. Lauritzen Part II: Oral Presentation Kaleidoscope 1 26 flowcatchR: A user-friendly workflow solution for the analysis of time-lapse cell flow imaging data ..................... 26 Federico Marini Image processing and alignment with RNiftyReg and mmand . 27 Jonathan Clayden rags2ridges: Ridge estimation and graphical modeling for high-dimensional precision matrices ................... 28 Carel F. W. Peeters dgRaph: Discrete factor graphs in R .................... 29 Henrik Tobias Madsen 3 Contents 4 Ecology 30 Optimized R functions for analysis of ecological community data using the R virtual laboratory (Rvlab) ................... 30 Costas Varsos and Theodore Patkos Building ecological models bit-by-bit ................... 31 David L Miller Simulating ecological microcosms with systems of differential equations: tools for the scientific, technical and communication challenges. .................................. 32 Andrew Dolman A Graphical User Interface for R in an Integrated Development Environment for Ecological Modeling, Scientific Image Analysis and Statistical Analysis ............................. 33 Marcel Austenfeld Networks 34 fbRads: Analyzing and managing Facebook ads from R . 34 Gergely Daroczi Web scraping with R - A fast track overview. 35 Peter Meißner multiplex: Analysis of Multiple Social Networks with Algebra . 36 Antonio Rivero Ostoic What’s new in igraph and networks .................... 37 Gabor Csardi Reproducibility 38 rOpenSci: A suite of reproducible research tools in R . 38 Karthik Ram Enhancing reproducibility and collaboration via management of R package cohorts ............................... 39 Michael Lawrence A Review of Meta-Analysis Packages in R . 40 Joshua R. Polanin & Emily A. Hennessy Simple reproducibility with the checkpoint package . 41 David Smith Interfacing 42 Some lessons relevant to including external libraries in your R package 42 Kasper D. Hansen Integrating R with the Go programming language using interprocess communication ............................... 43 Christoph Best Naturally Sweet Rcpp with Modern C++ and Boost . 44 Matt P. Dziubinski Contents 5 Linking R to the Spark MLlib Machine Learning Library . 45 Dan Putler Kaleidoscope 2 46 archivist: Tools for Storing, Restoring and Searching for R Objects . 46 Przemyslaw Biecek R User Groups ................................ 47 Joseph B. Rickert Computational Precision and Floating-Point Arithmetic: A Teacher’s Guide to Answering FAQ 7.31 ....................... 48 Richard M. Heiberger Tiny Data, Approximate Bayesian Computation and the Socks of Karl Broman .................................... 49 Rasmus Bååth Case study 50 Using R for small area estimation in the Norwegian National Forest Inventory ................................... 50 Johannes Breidenbach Using R for natural gas market balancing in the Czech republic . 51 Ivan Kasanický Heteroscedastic censored and truncated regression for weather forecasting .................................. 52 Jakob W. Messner Multinomial functional regression with application to lameness detection for horses ............................. 53 Helle Sørensen Clustering 54 Unsupervised Clustering and Meta-Analysis using Gaussian Mixture Copula Models ................................ 54 Anders Ellern Bilgrau Hierarchical Cluster Analysis of hyperspectral Raman images: a new point of view leads to 10000fold speedup . 55 Claudia Beleites Dirichlet process Bayesian clustering with the R package PReMiuM . 56 Silvia Liverani Examining the Environmental Characteristics of Tornado Outbreaks in the United States using Spatial Clustering. 57 Thomas Jagger Contents 6 Data Management 58 Taking testing to another level: testwhat . 58 Filip Schouwenaars Failing fast and early: assertive/defensive programming for R data analysis pipelines .............................. 59 Tony Fischetti Getting your data into R .......................... 60 Hadley Wickham A better way to manage hierarchical data . 61 Christoph Glur A proposal for distributed data-structures in R . 62 Indrajit Roy, Michael Lawrence Computational Performance 63 Running R+Hadoop using Docker Containers . 63 E. James Harner Algorithmic Differentiation for Extremum Estimation: An Introduction Using RcppEigen ....................... 64 Matt P. Dziubinski Improving computational performance with algorithm engineering . 65 Kirill Müller Performance Analysis for Parallel R Programs: Towards Efficient Resource Utilization ............................. 66 Helena Kotthaus Refactoring the xtable Package ....................... 67 David Scott Kaleidoscope 3 68 Coding for the enterprise server - what does it mean for you? . 68 Friedrich Schuster R as a citizen in a polyglot world - the promise of the Truffle framework 69 Lukas Stadler Architect. An IDE for Data Science (and R) . 70 Tobias Verbeke Distributed computing with R ....................... 71 Balasubramanian Narasimhan Business 72 Statistical consulting using R: a DRY approach from the Australian outback. ................................... 72 Peter Baker Using R in Production ............................ 73 Stefan Milton Bache Contents 7 Hedging and Risk Management of CDOs portfolio with R . 74 Giuseppe Bruno Data Driven Customer Segmentation with R . 75 Jim Porzak Spatial 76 Bringing Geospatial Tasks into the Mainstream of Business Analytics 76 Ian Cook Novel hybrid spatial predictive methods of machine learning and geostatistics with applications to terrestrial and marine environments in Australia .................................. 77 Jin Li Graphical Modelling of Multivariate Spatial Point Patterns . 78 Matthias Eckardt Spatial Econometrics Models with R-INLA . 79 Virgilio Gomez-Rubio Spatio-Temporal Analysis of Epidemic Phenomena Using the R Package surveillance ............................ 80 Sebastian Meyer Databases 81 Rango - Databases made easy ....................... 81 Willem Ligtenberg Ad-Hoc User-Defined Functions for MonetDB with R . 82 Hannes Mühleisen R database connectivity: what did we leave behind? . 83 Mateusz Żółtak jsonlite and mongolite ........................... 84 Jeroen Ooms Using R Efficiently with Large Databases . 85 Michael Wurst Kaleidoscope 4 86 While my base R gently weeps ....................... 86 A. Jonathan R. Godfrey Rapid Deployment of Automatic Scoring Models to Hadoop Production Systems ............................. 87 Amitai Golub Fast, stable and scalable true radix sorting . 88 Matt Dowle Fast, flexible and memory efficient data manipulation using data.table 89 Arunkumar Srinivasan Contents 8 Medicine 90 Phenotypic deconvolution: the next frontier in pharma . 90 Marvin Steijaert†, Vladimir Chupakhin‡, Hugo Ceulemans‡, Joerg Wegner‡ medplot: