Data Science with Postgresql

Total Page:16

File Type:pdf, Size:1020Kb

Data Science with Postgresql Data Science with PostgreSQL Data Science with PostgreSQL Balázs Bárány Data Scientist pgconf.eu 2015 Data Science with PostgreSQL Contents Introduction What is Data Science? Process model Tools and methods of Data Scientists Data Science with PostgreSQL Business & data understanding Preprocessing Modeling Evaluation Deployment Summary I Who is a Data Scientist? Data Science with PostgreSQL Introduction What is Data Science? Sexiest job of the 21st century I According to Google, LinkedIn, ... Data Science with PostgreSQL Introduction What is Data Science? Sexiest job of the 21st century I According to Google, LinkedIn, ... I Who is a Data Scientist? Data Science with PostgreSQL Introduction What is Data Science? Data Science Venn Diagram (c) Drew Conway, 2010. CC-BY-NC I Mash up & format for analysis I Analyze & visualize I Predict & prescribe I Operationalize Data Science with PostgreSQL Introduction What is Data Science? Tasks of data scientists I Get data from various sources I Big data? I Analyze & visualize I Predict & prescribe I Operationalize Data Science with PostgreSQL Introduction What is Data Science? Tasks of data scientists I Get data from various sources I Big data? I Mash up & format for analysis I Predict & prescribe I Operationalize Data Science with PostgreSQL Introduction What is Data Science? Tasks of data scientists I Get data from various sources I Big data? I Mash up & format for analysis I Analyze & visualize I Operationalize Data Science with PostgreSQL Introduction What is Data Science? Tasks of data scientists I Get data from various sources I Big data? I Mash up & format for analysis I Analyze & visualize I Predict & prescribe Data Science with PostgreSQL Introduction What is Data Science? Tasks of data scientists I Get data from various sources I Big data? I Mash up & format for analysis I Analyze & visualize I Predict & prescribe I Operationalize Data Science with PostgreSQL Introduction What is Data Science? Process model The Data Mining process Cross Industry Standard Process for Data Mining (Kenneth Jensen/Wikimedia Commons) Data Science with PostgreSQL Tools and methods of Data Scientists Tools and methods Tools and methods Data Science with PostgreSQL Tools and methods of Data Scientists Scripting and programming I R I Python with extensions I Octave/Matlab, other mathematic languages I Hadoop and Big Data programming libraries (mostly Java) I Cloud services Data Science with PostgreSQL Tools and methods of Data Scientists Integrated GUI tools I (partly) Open Source: RapidMiner, KNIME, Orange I Data Warehouse tools extended for analytics: Pentaho, Talend I Many commercial tools, e. g. SAS, IBM SPSS I Hadoop-related newcomers: e. g. Datameer Data Science with PostgreSQL Tools and methods of Data Scientists Data Infrastructure I Databases and data stores I Relational, NoSQL I Hadoop clusters I In-memory I Data streams I Free-form data: text, images, video, audio, ... I Web APIs I Open Data I Joining, aggregating, ltering, calculating, ... I Data cleansing I Missing values I Abnormal values I Result: data set suitable for analytics Data Science with PostgreSQL Tools and methods of Data Scientists Data acquisition and preprocessing I Data ingestion in raw format I Data cleansing I Missing values I Abnormal values I Result: data set suitable for analytics Data Science with PostgreSQL Tools and methods of Data Scientists Data acquisition and preprocessing I Data ingestion in raw format I Joining, aggregating, ltering, calculating, ... I Result: data set suitable for analytics Data Science with PostgreSQL Tools and methods of Data Scientists Data acquisition and preprocessing I Data ingestion in raw format I Joining, aggregating, ltering, calculating, ... I Data cleansing I Missing values I Abnormal values Data Science with PostgreSQL Tools and methods of Data Scientists Data acquisition and preprocessing I Data ingestion in raw format I Joining, aggregating, ltering, calculating, ... I Data cleansing I Missing values I Abnormal values I Result: data set suitable for analytics I Classication (supervised): Prediction of a class or category I Regression (supervised): Prediction of numeric value I Clustering (unsupervised): Automatic grouping of data I Association analysis, outlier detection, time series prediction, ... Data Science with PostgreSQL Tools and methods of Data Scientists Predictive Modeling I Supervised and unsupervised methods I Target variable known or not I Clustering (unsupervised): Automatic grouping of data I Association analysis, outlier detection, time series prediction, ... Data Science with PostgreSQL Tools and methods of Data Scientists Predictive Modeling I Supervised and unsupervised methods I Target variable known or not I Classication (supervised): Prediction of a class or category I Regression (supervised): Prediction of numeric value Data Science with PostgreSQL Tools and methods of Data Scientists Predictive Modeling I Supervised and unsupervised methods I Target variable known or not I Classication (supervised): Prediction of a class or category I Regression (supervised): Prediction of numeric value I Clustering (unsupervised): Automatic grouping of data I Association analysis, outlier detection, time series prediction, ... I Store in ERP or CRM I Tell someone (email, popup) I Add a label (e. g. mark email as spam) I Interrupt nancial transaction => prescription I Order supplies => prescription I ... Data Science with PostgreSQL Tools and methods of Data Scientists Deployment and operationalization I Model application to new data => prediction + condence I What to do with predictions? I Interrupt nancial transaction => prescription I Order supplies => prescription I ... Data Science with PostgreSQL Tools and methods of Data Scientists Deployment and operationalization I Model application to new data => prediction + condence I What to do with predictions? I Store in ERP or CRM I Tell someone (email, popup) I Add a label (e. g. mark email as spam) Data Science with PostgreSQL Tools and methods of Data Scientists Deployment and operationalization I Model application to new data => prediction + condence I What to do with predictions? I Store in ERP or CRM I Tell someone (email, popup) I Add a label (e. g. mark email as spam) I Interrupt nancial transaction => prescription I Order supplies => prescription I ... Data Science with PostgreSQL Data Science with PostgreSQL Data Science with PostgreSQL Doing Data Science with PostgreSQL I Must be root and postgres I Maintain your PostgreSQL yourself I Able to compile stu I You should ask ;-) I your boss I co-workers I customer Data Science with PostgreSQL Data Science with PostgreSQL Caveats I This stu is not easy I You should ask ;-) I your boss I co-workers I customer Data Science with PostgreSQL Data Science with PostgreSQL Caveats I This stu is not easy I Must be root and postgres I Maintain your PostgreSQL yourself I Able to compile stu Data Science with PostgreSQL Data Science with PostgreSQL Caveats I This stu is not easy I Must be root and postgres I Maintain your PostgreSQL yourself I Able to compile stu I You should ask ;-) I your boss I co-workers I customer I Project goals and challenges I Availability of data and resources I Success criteria I Not a technical activity, PostgreSQL can't help much Data Science with PostgreSQL Data Science with PostgreSQL Business & data understanding Business understanding I What is the purpose of the business? I What are existing processes? I Drivers of business success I Not a technical activity, PostgreSQL can't help much Data Science with PostgreSQL Data Science with PostgreSQL Business & data understanding Business understanding I What is the purpose of the business? I What are existing processes? I Drivers of business success I Project goals and challenges I Availability of data and resources I Success criteria Data Science with PostgreSQL Data Science with PostgreSQL Business & data understanding Business understanding I What is the purpose of the business? I What are existing processes? I Drivers of business success I Project goals and challenges I Availability of data and resources I Success criteria I Not a technical activity, PostgreSQL can't help much I Connecting separate data sources I Simple or complex IDs I Data size I Too small I Too big I Suitability for predictive modeling I Target variable? I Attribute types Data Science with PostgreSQL Data Science with PostgreSQL Business & data understanding Data understanding I Existing data I Entities and covered concepts I Complete? Correct? In suitable form? I Usable? (regulations, access constraints, ...) I Data size I Too small I Too big I Suitability for predictive modeling I Target variable? I Attribute types Data Science with PostgreSQL Data Science with PostgreSQL Business & data understanding Data understanding I Existing data I Entities and covered concepts I Complete? Correct? In suitable form? I Usable? (regulations, access constraints, ...) I Connecting separate data sources I Simple or complex IDs I Suitability for predictive modeling I Target variable? I Attribute types Data Science with PostgreSQL Data Science with PostgreSQL Business & data understanding Data understanding I Existing data I Entities and covered concepts I Complete? Correct? In suitable form? I Usable? (regulations, access constraints, ...) I Connecting separate data sources I Simple or complex IDs I Data size I Too small I Too big Data Science with PostgreSQL Data Science with PostgreSQL Business & data understanding Data understanding
Recommended publications
  • Software Corner: Rkward: Installation and Use Aaron Olaf Batty [email protected] SRB General Editor
    Software Corner Software Corner: RKWard: Installation and use Aaron Olaf Batty [email protected] SRB General Editor Many readers of SRB are likely familiar with the free, open-source statistical software package R. R offers a staggeringly wide range of analyses and outputs, besting every other general statistics package on the market, for free. This value, however, comes at a cost that may seem too steep for many users: it is command-line only. One must learn to write R code to take advantage of this software, and although the code itself is relatively memorable and intuitive, it cannot compete with the ease of use offered by the graphical user interfaces (GUIs) of competing packages, such as SPSS/PASW—a software package whose price can run over a thousand US dollars. What is RKWard? RKWard seeks to lower the usability barrier to entry into statistical analysis with R by providing a GUI frontend that hides much of the code from the use (although it is easy to inspect in the con- sole for those who wish to do so). Furthermore, it streamlines the sometimes-maddening process of importing data into R, and can read SPSS and Stata files natively in addition to standard text formats. Finally, its general look-and-feel will be immediately familiar to those who have used SPSS, further easing entry for novices. It has been in development since 2005, and achieved its first stable release in 2011. Although it uses the KDE desktop environment, which is usually found on Linux systems, it is not limited to computers running Linux/Unix.
    [Show full text]
  • Rkward: a Comprehensive Graphical User Interface and Integrated Development Environment for Statistical Analysis with R
    JSS Journal of Statistical Software June 2012, Volume 49, Issue 9. http://www.jstatsoft.org/ RKWard: A Comprehensive Graphical User Interface and Integrated Development Environment for Statistical Analysis with R Stefan R¨odiger Thomas Friedrichsmeier Charit´e-Universit¨atsmedizin Berlin Ruhr-University Bochum Prasenjit Kapat Meik Michalke The Ohio State University Heinrich Heine University Dusseldorf¨ Abstract R is a free open-source implementation of the S statistical computing language and programming environment. The current status of R is a command line driven interface with no advanced cross-platform graphical user interface (GUI), but it includes tools for building such. Over the past years, proprietary and non-proprietary GUI solutions have emerged, based on internal or external tool kits, with different scopes and technological concepts. For example, Rgui.exe and Rgui.app have become the de facto GUI on the Microsoft Windows and Mac OS X platforms, respectively, for most users. In this paper we discuss RKWard which aims to be both a comprehensive GUI and an integrated devel- opment environment for R. RKWard is based on the KDE software libraries. Statistical procedures and plots are implemented using an extendable plugin architecture based on ECMAScript (JavaScript), R, and XML. RKWard provides an excellent tool to manage different types of data objects; even allowing for seamless editing of certain types. The objective of RKWard is to provide a portable and extensible R interface for both basic and advanced statistical and graphical analysis, while not compromising on flexibility and modularity of the R programming environment itself. Keywords: GUI, integrated development environment, plugin, R.
    [Show full text]
  • Molecular Data in R Phylogeny, Evolution & R
    Introduction R Data Alignment Basic analysis SNP DAPC Structure Spatial analysis Trees Evolution The end Molecular data in R Phylogeny, evolution & R Vojtěch Zeisek Department of Botany, Faculty of Science, Charles University, Prague Institute of Botany, Czech Academy of Sciences, Průhonice https://trapa.cz/, [email protected] January 27 to 30, 2020 . Vojtěch Zeisek (https://trapa.cz/) Molecular data in R January 27 to 30, 2020 1 / 393 Introduction R Data Alignment Basic analysis SNP DAPC Structure Spatial analysis Trees Evolution The end Outline I 1 Introduction 2 R Installation Let’s start with R Basic operations in R Packages for our work 3 Data Overview of data and data types Microsatellites AFLP Notes about data DNA sequences, SNP VCF Export . Vojtěch Zeisek (https://trapa.cz/) Molecular data in R January 27 to 30, 2020 2 / 393 Introduction R Data Alignment Basic analysis SNP DAPC Structure Spatial analysis Trees Evolution The end Outline II Tasks 4 Alignment Overview and MAFFT MAFFT, Clustal, MUSCLE and T-Coffee Multiple genes Display and cleaning 5 Basic analysis First look at the data Statistics Genetic distances Hierarchical clustering AMOVA . MSN . Vojtěch Zeisek (https://trapa.cz/) Molecular data in R January 27 to 30, 2020 3 / 393 Introduction R Data Alignment Basic analysis SNP DAPC Structure Spatial analysis Trees Evolution The end Outline III NJ (and UPGMA) tree PCoA Tasks 6 SNP PCA and NJ 7 DAPC Bayesian clustering Discriminant analysis and visualization 8 Structure Running Structure from R ParallelStructure on Windows Post processing 9 Spatial analysis . Vojtěch Zeisek (https://trapa.cz/) Molecular data in R January 27 to 30, 2020 4 / 393 Introduction R Data Alignment Basic analysis SNP DAPC Structure Spatial analysis Trees Evolution The end Outline IV Moran’s I sPCA Monmonier Mantel test Geneland Plotting maps 10 Trees Manipulations MP Seeing trees in the forest Comparisons Notes about plotting the trees .
    [Show full text]
  • Pipenightdreams Osgcal-Doc Mumudvb Mpg123-Alsa Tbb
    pipenightdreams osgcal-doc mumudvb mpg123-alsa tbb-examples libgammu4-dbg gcc-4.1-doc snort-rules-default davical cutmp3 libevolution5.0-cil aspell-am python-gobject-doc openoffice.org-l10n-mn libc6-xen xserver-xorg trophy-data t38modem pioneers-console libnb-platform10-java libgtkglext1-ruby libboost-wave1.39-dev drgenius bfbtester libchromexvmcpro1 isdnutils-xtools ubuntuone-client openoffice.org2-math openoffice.org-l10n-lt lsb-cxx-ia32 kdeartwork-emoticons-kde4 wmpuzzle trafshow python-plplot lx-gdb link-monitor-applet libscm-dev liblog-agent-logger-perl libccrtp-doc libclass-throwable-perl kde-i18n-csb jack-jconv hamradio-menus coinor-libvol-doc msx-emulator bitbake nabi language-pack-gnome-zh libpaperg popularity-contest xracer-tools xfont-nexus opendrim-lmp-baseserver libvorbisfile-ruby liblinebreak-doc libgfcui-2.0-0c2a-dbg libblacs-mpi-dev dict-freedict-spa-eng blender-ogrexml aspell-da x11-apps openoffice.org-l10n-lv openoffice.org-l10n-nl pnmtopng libodbcinstq1 libhsqldb-java-doc libmono-addins-gui0.2-cil sg3-utils linux-backports-modules-alsa-2.6.31-19-generic yorick-yeti-gsl python-pymssql plasma-widget-cpuload mcpp gpsim-lcd cl-csv libhtml-clean-perl asterisk-dbg apt-dater-dbg libgnome-mag1-dev language-pack-gnome-yo python-crypto svn-autoreleasedeb sugar-terminal-activity mii-diag maria-doc libplexus-component-api-java-doc libhugs-hgl-bundled libchipcard-libgwenhywfar47-plugins libghc6-random-dev freefem3d ezmlm cakephp-scripts aspell-ar ara-byte not+sparc openoffice.org-l10n-nn linux-backports-modules-karmic-generic-pae
    [Show full text]
  • Awoken Icon Theme - Installation & Customizing Instructions 1
    Awoken Icon Theme - Installation & Customizing Instructions 1 AWOKEN ICON THEME Installation & Customizing Instructions Alessandro Roncone mail: [email protected] homepage: http://alecive.deviantart.com/ Awoken homepage (GNOME Version): link kAwoken homepage (KDE Version): link Contents 1 Iconset Credits 3 2 Copyright 3 3 Installation 3 3.1 GNOME........................................................3 3.2 KDE..........................................................4 4 Customizing Instructions 4 4.1 GNOME........................................................4 4.2 KDE..........................................................5 5 Overview of the customization script6 5.1 How to customize a single iconset..........................................7 6 Customization options 8 6.1 Folder types......................................................8 6.2 Color-NoColor.................................................... 11 6.3 Distributor Logos................................................... 11 6.4 Trash types...................................................... 11 6.5 Other Options.................................................... 11 6.5.1 Gedit icon................................................... 11 6.5.2 Computer icon................................................ 11 6.5.3 Home icon................................................... 11 6.6 Deprecated...................................................... 12 7 How to colorize the iconset 13 8 Icons that don't want to change (but I've drawed) 14 9 Conclusions 15 9.1 Changelog......................................................
    [Show full text]
  • Debian and Ubuntu
    Debian and Ubuntu Lucas Nussbaum lucas@{debian.org,ubuntu.com} lucas@{debian.org,ubuntu.com} Debian and Ubuntu 1 / 28 Why I am qualified to give this talk Debian Developer and Ubuntu Developer since 2006 Involved in improving collaboration between both projects Developed/Initiated : Multidistrotools, ubuntu usertag on the BTS, improvements to the merge process, Ubuntu box on the PTS, Ubuntu column on DDPO, . Attended Debconf and UDS Friends in both communities lucas@{debian.org,ubuntu.com} Debian and Ubuntu 2 / 28 What’s in this talk ? Ubuntu development process, and how it relates to Debian Discussion of the current state of affairs "OK, what should we do now ?" lucas@{debian.org,ubuntu.com} Debian and Ubuntu 3 / 28 The Ubuntu Development Process lucas@{debian.org,ubuntu.com} Debian and Ubuntu 4 / 28 Linux distributions 101 Take software developed by upstream projects Linux, X.org, GNOME, KDE, . Put it all nicely together Standardization / Integration Quality Assurance Support Get all the fame Ubuntu has one special upstream : Debian lucas@{debian.org,ubuntu.com} Debian and Ubuntu 5 / 28 Ubuntu’s upstreams Not that simple : changes required, sometimes Toolchain changes Bugfixes Integration (Launchpad) Newer releases Often not possible to do work in Debian first lucas@{debian.org,ubuntu.com} Debian and Ubuntu 6 / 28 Ubuntu Packages Workflow lucas@{debian.org,ubuntu.com} Debian and Ubuntu 7 / 28 Ubuntu Packages Workflow Ubuntu Karmic Excluding specific packages language-(support|pack)-*, kde-l10n-*, *ubuntu*, *launchpad* Missing 4% : Newer upstream
    [Show full text]
  • Upgrade Issues
    Upgrade issues Graph of new conflicts libsiloh5-0 libhdf5-lam-1.8.4 (x 3) xul-ext-dispmua (x 2) liboss4-salsa-asound2 (x 2) why sysklogd console-cyrillic (x 9) libxqilla-dev libxerces-c2-dev iceape xul-ext-adblock-plus gnat-4.4 pcscada-dbg Explanations of conflicts pcscada-dbg libpcscada2-dev gnat-4.6 gnat-4.4 Similar to gnat-4.4: libpolyorb1-dev libapq-postgresql1-dev adacontrol libxmlada3.2-dev libapq1-dev libaws-bin libtexttools2-dev libpolyorb-dbg libnarval1-dev libgnat-4.4-dbg libapq-dbg libncursesada1-dev libtemplates-parser11.5-dev asis-programs libgnadeodbc1-dev libalog-base-dbg liblog4ada1-dev libgnomeada2.14.2-dbg libgnomeada2.14.2-dev adabrowse libgnadecommon1-dev libgnatvsn4.4-dbg libgnatvsn4.4-dev libflorist2009-dev libopentoken2-dev libgnadesqlite3-1-dev libnarval-dbg libalog1-full-dev adacgi0 libalog0.3-base libasis2008-dbg libxmlezout1-dev libasis2008-dev libgnatvsn-dev libalog0.3-full libaws2.7-dev libgmpada2-dev libgtkada2.14.2-dbg libgtkada2.14.2-dev libasis2008 ghdl libgnatprj-dev gnat libgnatprj4.4-dbg libgnatprj4.4-dev libaunit1-dev libadasockets3-dev libalog1-base-dev libapq-postgresql-dbg libalog-full-dbg Weight: 5 Problematic packages: pcscada-dbg hostapd initscripts sysklogd Weight: 993 Problematic packages: hostapd | initscripts initscripts sysklogd Similar to initscripts: conglomerate libnet-akamai-perl erlang-base screenlets xlbiff plasma-widget-yawp-dbg fso-config- general gforge-mta-courier libnet-jifty-perl bind9 libplack-middleware-session-perl libmail-listdetector-perl masqmail libcomedi0 taxbird ukopp
    [Show full text]
  • Upgrade Issues
    Upgrade issues Graph of new conflicts libboost1.46-dev libboost-random-dev (x 18) libboost-mpi-python1.46.1 libboost-mpi-python-dev libwoodstox-java (x 7) liboss4-salsa-asound2 (x 2) libboost1.46-doc libboost-doc libgnutls28-dev libepc-dev libabiword-2.9-dev libcurl4-openssl-dev (x 5) python-cjson (x 2) nova-compute-kvm (x 4) printer-driver-all-enforce lprng (x 2) mdbtools-dev libiodbc2 tesseract-ocr-deu (x 8) tesseract-ocr libjpeg62-dev libcvaux-dev (x 2) ldtp python-pyatspi Explanations of conflicts ldtp python-pyatspi2 python-pyatspi Similar to ldtp: python-ldtp Weight: 29 Problematic packages: ldtp libboost-mpi-python-dev libboost-mpi-python1.48-dev libboost-mpi-python1.48.0 libboost-mpi-python1.46.1 Similar to libboost-mpi-python1.46.1: libboost1.46-all-dev libboost-mpi-python1.46-dev Similar to libboost-mpi-python-dev: libboost-all-dev Weight: 149 Problematic packages: libboost-mpi-python-dev tesseract-ocr-vie tesseract-ocr Similar to tesseract-ocr: slimrat tesseract-ocr-dev slimrat-nox Weight: 61 Problematic packages: tesseract-ocr-vie tesseract-ocr-spa tesseract-ocr Similar to tesseract-ocr: slimrat tesseract-ocr-dev slimrat-nox Weight: 295 Problematic packages: tesseract-ocr-spa tesseract-ocr-por tesseract-ocr Similar to tesseract-ocr: slimrat tesseract-ocr-dev slimrat-nox Weight: 133 Problematic packages: tesseract-ocr-por tesseract-ocr-nld tesseract-ocr Similar to tesseract-ocr: slimrat tesseract-ocr-dev slimrat-nox Weight: 112 Problematic packages: tesseract-ocr-nld tesseract-ocr-ita tesseract-ocr Similar to tesseract-ocr:
    [Show full text]
  • QCA with \Textsf{R}
    Adrian Dușa QCA with R A Comprehensive Resource 2021-06-15 Contents 1 R basics ................................................... 11 1.1 Working directory ....................................... 13 1.2 Workspace .............................................. 13 1.3 Object types ............................................ 14 1.4 Mathematical and logical operations ....................... 17 1.5 Indexing and subsetting .................................. 19 1.6 Data coercion ........................................... 22 1.7 User interfaces .......................................... 24 1.8 Working with script files ................................. 26 2 The QCA package ......................................... 29 2.1 Installing the QCA package .............................. 29 2.2 Structure ............................................... 30 2.3 Command line mode ..................................... 32 2.3.1 Getting help ...................................... 32 2.3.2 Function arguments ............................... 33 2.4 The GUI - graphical user interface ......................... 36 2.4.1 Description ....................................... 36 2.4.2 Starting the graphical user interface ................. 38 2.4.3 Creating an executable icon......................... 39 2.4.4 Command construction ............................ 42 2.4.5 The web R console ................................. 43 2.4.6 Graphics ......................................... 45 2.4.7 The data editor ................................... 48 2.4.8 Import, export and load
    [Show full text]
  • Introduction to Writing Plugins for Rkward
    Introduction to Writing Plugins for RKWard Thomas Friedrichsmeier Meik Michalke Introduction to Writing Plugins for RKWard 2 Contents 1 Introduction 9 2 Preliminaries: What are plugins in RKWard? How do they work? 10 3 Creating menu entries 11 3.1 Controlling the order of menu entries . 13 4 Defining the GUI 15 4.1 Defining a dialog . 15 4.2 Adding a wizard interface . 18 4.3 Some considerations on GUI design . 19 4.3.1 <radio> vs. <checkbox> vs. <dropdown> . 19 5 Generating R code from GUI settings 20 5.1 Using JavaScript in RKWard plugins . 20 5.1.1 preprocess() . 20 5.1.2 calculate() . 21 5.1.3 printout() . 21 5.2 Conventions, policies, and background . 22 5.2.1 Understanding the local() environment . 22 5.2.2 Code formatting . 22 5.2.3 Dealing with complex options . 23 5.3 Tips and tricks . 23 6 Writing a help page 25 7 Logic interactions between GUI elements 28 7.1 GUI logic . 28 7.2 Scripted GUI logic . 30 Introduction to Writing Plugins for RKWard 8 Embedding Plugins into Plugins 31 8.1 Use cases for embedding . 31 8.2 Embedding inside a dialog . 31 8.3 Code generation when embedding . 32 8.4 Embedding inside a wizard . 32 8.5 Less embedded embedding: Further Options button . 32 8.6 Embedding/defining incomplete plugins . 33 9 Dealing with many similar plugins 34 9.1 Overview on different approaches . 34 9.2 Using the JS include statement . 34 9.3 Including .xml files..................................... 35 9.4 Using <snippets> .
    [Show full text]
  • Scientific and Statistical Computing with R
    Introduction R Basics R Development Providing Services Over R Migration SCIENTIFIC AND STATISTICAL COMPUTING WITH R A. Mani Member, Calcutta Mathematical Society a:mani@member:ams:org Homepage: http://amani.topcities.com BARCAMP 2009, IIT KGP KOLKATA CENTER Introduction R Basics R Development Providing Services Over R Migration ABSTRACT The R environment is an integrated suite of FOSS facilities for data manipulation, knowledge discovery from databases, data analysis, scientific computing and graphical display. Technically R is an interpreted functional programming language with object-oriented programming features like inheritance. I intend to introduce some basic aspects of R, outline more advanced features and speak about providing scientific and statistical computing services over it. R caters to statistics and other methods of knowledge discovery like PCA, fuzzy techniques, data clustering, neural networks, DNA computing, belief networks, general mathematical modelling, econometrics and of course to the various sciences including bioinformatics. Introduction R Basics R Development Providing Services Over R Migration Contents 1 Introduction 2 R Basics 3 R Development 4 Providing Services Over R 5 Migration Introduction R Basics R Development Providing Services Over R Migration What is R? R is • A Free and Open Source Software under the GNU-GPL • A fully planned and coherent system unlike other data analysis software • A vehicle for developing new methods of interactive data analysis • Partly based on the statistical programming language
    [Show full text]
  • Selecting the Right Tool for the Job—R Commander Or Something Else? Jason Wilson∗
    Focus Article Statistical computing with R: selecting the right tool for the job—R Commander or something else? Jason Wilson∗ R Commander (Rcmdr) is one of the most popular graphical user interfaces (GUIs) for R. Why is R Commander so popular? What does it do? When should R Commander be used? When should it not be used? What are the alternatives to R Commander? What can be said about using R Commander, or an alternative, to teach introductory statistics courses? The answers to these questions form the sections of the paper. Included is a Box describing many of the alternative R GUIs (RExcel, Deducer, Rattle, and RKWard) and interactive development environments (IDEs—RStudio, REvolution, and RKWard). Advice is given to help the user ‘select the right tool for the job’. © 2012 Wiley Periodicals, Inc. How to cite this article: WIREs Comput Stat 2012, 4:518–526. doi: 10.1002/wics.1228 INTRODUCTION As R is open source (free), the R Core Team rightly focuses on maintaining and improving the Commander is a package which gives graphical base software. As a result, they do not offer a ‘pretty user interface (GUI) functionality like Minitab or R face’ GUI with push-button menus which contempo- SPSS to R. The purpose of this article is to provide the rary software users have come to love. This is where R reader with enough information to make a decision Commander comes in: it is probably the most popular whether or not R Commander is right for them and, if GUI for R.2 The R package name is ‘Rcmdr’, which is it is not, advice on suitable alternatives.
    [Show full text]