Leveraging SAS with KNIME
Total Page:16
File Type:pdf, Size:1020Kb
Leveraging SAS with KNIME Thomas Gabriel and Phil Winters Copyright © 2013 by KNIME.com AG. All rights reserved. SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. ® indicates USA registration. Copyright © 2013 by KNIME.com AG. All rights reserved. 2 A bit of History: The Original SAS Concept Access Manage Present Analyse Copyright © 2013 by KNIME.com AG. All rights reserved. 4 More Products to meet new requirements EG DI RTD DB2 Macro Oracle PC FILIE Connect Access Manage MM SCL Present Analyse EM ® AF DVD QC JMP OR IML IDP ETS STAT Graph Copyright © 2013 by KNIME.com AG. All rights reserved. 5 Even More Products…. Teradata Hadoop EG DI RTD DB2 Macro Oracle PC FILIE Connect Access Manage MM SCL Present Analyse EM PMML …Model Manager …Model PMML High Performance Analytics Performance High ® AF DVD QC JMP OR IML IDP ETS STAT Graph Text Mining Social Media Analytics R…. in IML in R…. Copyright © 2013 by KNIME.com AG. All rights reserved. 6 Our new reality: You must have Choice and Control New Infrastructures New Data New Other Science Applications New New Users Methods New Business Copyright © 2013 by KNIME.com AG. All rights reserved. Challenges 7 The KNIME Platform Open, Open Source, Free on the Desktop Copyright © 2013 by KNIME.com AG. All rights reserved. 8 The question is not “which is better”…. Copyright © 2013 by KNIME.com AG. All rights reserved. 9 The question is: What’s the Big Difference? SAS KNIME A script-oriented 4GL programming language A Script-free environment in four major parts: comprised of • The DATA step • Nodes and Connectors • Procedure steps • A macro language, • Metanodes and Flow variables for packaging a metaprogramming language • ODS statements • GUIs: are most often front-ends • GUI is the interface to facilitate SAS Program script generation • Through Proprietary Technology “Provide it all” • Through Open Source “Provide it all” Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 10 KNIME: Free on the desktop including over 1000 native and embedded nodes: MySQL, Oracle, etc. ETL Statistics via BIRT SAS, SPSS, etc. Row, Data Mining PMML Excel, Flat, etc. Column Machine Learning XML Hive etc. Matrix Web Analytics Databases XML, PMML Text, Image Text Mining Excel, Flat, etc. Text, Doc, Image Time Series Network Analysis Hive etc. Web Crawlers Java Social Media Analysis Text, Doc, Image Industry Specific Python WEKA Industry Specific Community / 3rd Community / 3rd R R Community / 3rd Community / 3rd JFreeChart Copyright © 2013 by KNIME.com AG. All rights reserved. 11 Community / 3rd Some 3rd party nodes are charged Accessing Data SAS Data can be read Each Node directly has a specific dialog Huge Range of Read (and write) nodes available At no cost. Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 13 Accessing Data Tables available at every node. Manually execute, start, stop, test nodes and sequences Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 14 Transforming data Resource allocation concepts similar: balance IO, Memory, Resources Nodes for columns, rows, matrix ETL can be mixed and matched with all other nodes. No Separate Progrmming step required Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 15 Packaging Workflows Drag and Mark nodes, Right Click, Collapse into Metanode To create a Metanode that can be reused Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 16 Our Example: “Next Best Offer! The Access and Transform Metanode we created Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 17 More Transformation Applying Color to Attributes KNIME nodes for transforming Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 18 More Transformation Calling your Favorite External Packages such as Java, Python, Matlab, SQL, REST, Or your favorite Database Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 19 Using SAS Transformation Calling SAS either Locally or Remotely. Automatically passing Data into and out of SAS Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 20 Using SAS Transformation Calling SAS either Locally or Remotely. Automatically passing Data into and out of SAS Controlling how that happens with Flow Variables and Quickforms Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 21 Giving you instant Customized Dialogs Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 22 Giving you instant Customized Dialogs No Macro Code. No SCL. No Java. No Other Programming Language required. A Nice Dialog for many Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 23 types of users….. Explore and Visualize Many nodes For Exploring and Visualizing All that allow Marking and highlighting across all nodes No seperate package required. Other packages can be used if available Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 24 Be Open to All Methods, and let the best model win !!! • KNIME Decision Tree • KNIME Logistic Regression • R Decision Tree • PMML model (from SPSS) • Other Methods • Weka • KNIME Uplift (courtesy of Dymatrix) • SAS Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 25 Allow for best practices without learning scripting • Missing Management • Partitioning • Binning and Bagging • Boosting • Logic and Flow Control • Cross Validation • Feature Elimination • Feature Selection • Error Handling KNIME examples available for all these techniques…. Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 26 Report with your favorite Tool Or use the built-in and Free BIRT open source BI Tool Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 27 Deploy the Models Score any Database Or how about scoring (including Hadoop) straight into SAS via Proc SQL and the (commercial) node from Dymatrix? Use the PMML on another system (Such as ADAPA) Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 28 Deploy to Excel, XML, etc.….. Or directly back to SAS! SAS Transport Dataset Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 29 Power Users in Teams The KNIME Server (Commercial Software) Copyright © 2013 by KNIME.com AG. All rights reserved. 31 Business Consumers via the web The KNIME Server (Commercial Software) Copyright © 2013 by KNIME.com AG. All rights reserved. 32 Business Consumers via the web The KNIME Server (Commercial Software) Copyright © 2013 by KNIME.com AG. All rights reserved. 33 Embedding, Automating, Security The KNIME Server (Commercial Software) KNIME Server KNIME Copyright © 2013 by KNIME.com AG. All rights reserved. 34 Hot Topics with KNIME (and SAS!) users: • Social Media Analysis • Recommendation/Next Best Offer/Market Basket/etc. Analysis • Text Mining • Network Analysis • Forecasting • Realtime • Machine Learning • Making R Usable • Automated Model generation, scoring and tuning with Dynamine from Dymatrix • Big Data Copyright © 2013 by KNIME.com AG. All rights reserved. 36 Thank you [email protected] Copyright © 2013 by KNIME.com AG. All rights reserved. .