Leveraging SAS with KNIME
Thomas Gabriel and Phil Winters
Copyright © 2013 by KNIME.com AG. All rights reserved. SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. ® indicates USA registration.
Copyright © 2013 by KNIME.com AG. All rights reserved. 2 A bit of History: The Original SAS Concept
Access Manage
Present Analyse
Copyright © 2013 by KNIME.com AG. All rights reserved. 4 More Products to meet new requirements
EG
DI
RTD
DB2
Macro
Oracle PCFILIE
Connect Access Manage MM
SCL Present Analyse EM
® AF
DVD QC JMP
OR
IML
IDP
ETS
STAT Graph
Copyright © 2013 by KNIME.com AG. All rights reserved. 5 Even More Products….
Teradata Hadoop
EG
DI
RTD
DB2
Macro
Oracle PCFILIE
Connect Access Manage MM
SCL Present Analyse EM
PMML …Model Manager …Model PMML High Performance Analytics Performance High
® AF
DVD QC JMP
OR
IML
IDP
ETS
STAT Graph
Text Mining Social Media Analytics R…. in IML in R….
Copyright © 2013 by KNIME.com AG. All rights reserved. 6 Our new reality: You must have Choice and Control
New Infrastructures
New Data
New Other Science Applications
New New Users Methods New Business
Copyright © 2013 by KNIME.com AG. All rights reserved. Challenges 7 The KNIME Platform Open, Open Source, Free on the Desktop
Copyright © 2013 by KNIME.com AG. All rights reserved. 8 The question is not “which is better”….
Copyright © 2013 by KNIME.com AG. All rights reserved. 9 The question is: What’s the Big Difference?
SAS KNIME A script-oriented 4GL programming language A Script-free environment in four major parts: comprised of • The DATA step • Nodes and Connectors • Procedure steps • A macro language, • Metanodes and Flow variables for packaging a metaprogramming language • ODS statements
• GUIs: are most often front-ends • GUI is the interface to facilitate SAS Program script generation
• Through Proprietary Technology “Provide it all” • Through Open Source “Provide it all”
Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 10 KNIME: Free on the desktop including over 1000 native and embedded nodes:
MySQL, Oracle, etc. ETL Statistics via BIRT SAS, SPSS, etc. Row, Data Mining PMML Excel, Flat, etc. Column Machine Learning XML Hive etc. Matrix Web Analytics Databases XML, PMML Text, Image Text Mining Excel, Flat, etc. Text, Doc, Image Time Series Network Analysis Hive etc. Web Crawlers Java Social Media Analysis Text, Doc, Image Industry Specific Python WEKA Industry Specific Community / 3rd Community / 3rd R R Community / 3rd Community / 3rd JFreeChart Copyright © 2013 by KNIME.com AG. All rights reserved. 11 Community / 3rd Some 3rd party nodes are charged Accessing Data SAS Data can be read Each Node directly has a specific dialog
Huge Range of Read (and write) nodes available
At no cost.
Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 13 Accessing Data
Tables available at every node. Manually execute, start, stop, test nodes and sequences
Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 14 Transforming data Resource allocation concepts similar: balance IO, Memory, Resources
Nodes for columns, rows, matrix
ETL can be mixed and matched with all other nodes.
No Separate Progrmming step required
Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 15 Packaging Workflows
Drag and Mark nodes, Right Click, Collapse into Metanode
To create a Metanode that can be reused
Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 16 Our Example: “Next Best Offer!
The Access and Transform Metanode we created
Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 17 More Transformation
Applying Color to Attributes
KNIME nodes for transforming
Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 18 More Transformation
Calling your Favorite External Packages such as Java, Python, Matlab, SQL, REST, Or your favorite Database
Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 19 Using SAS Transformation
Calling SAS either Locally or Remotely. Automatically passing Data into and out of SAS
Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 20 Using SAS Transformation
Calling SAS either Locally or Remotely. Automatically passing Data into and out of SAS
Controlling how that happens with Flow Variables and Quickforms
Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 21 Giving you instant Customized Dialogs
Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 22 Giving you instant Customized Dialogs
No Macro Code. No SCL. No Java. No Other Programming Language required.
A Nice Dialog for many Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 23 types of users….. Explore and Visualize
Many nodes For Exploring and Visualizing
All that allow Marking and highlighting across all nodes No seperate package required. Other packages can be used if available Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 24 Be Open to All Methods, and let the best model win !!!
• KNIME Decision Tree • KNIME Logistic Regression • R Decision Tree • PMML model (from SPSS) • Other Methods • Weka • KNIME Uplift (courtesy of Dymatrix) • SAS
Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 25 Allow for best practices without learning scripting
• Missing Management • Partitioning • Binning and Bagging • Boosting • Logic and Flow Control • Cross Validation • Feature Elimination • Feature Selection • Error Handling
KNIME examples available for all these techniques….
Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 26 Report with your favorite Tool
Or use the built-in and Free BIRT open source BI Tool
Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 27 Deploy the Models
Score any Database Or how about scoring (including Hadoop) straight into SAS via Proc SQL and the (commercial) node from Dymatrix?
Use the PMML on another system (Such as ADAPA)
Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 28 Deploy to Excel, XML, etc.….. Or directly back to SAS!
SAS Transport Dataset
Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 29 Power Users in Teams The KNIME Server (Commercial Software)
Copyright © 2013 by KNIME.com AG. All rights reserved. 31 Business Consumers via the web The KNIME Server (Commercial Software)
Copyright © 2013 by KNIME.com AG. All rights reserved. 32 Business Consumers via the web The KNIME Server (Commercial Software)
Copyright © 2013 by KNIME.com AG. All rights reserved. 33 Embedding, Automating, Security The KNIME Server (Commercial Software)
KNIME Server KNIME
Copyright © 2013 by KNIME.com AG. All rights reserved. 34 Hot Topics with KNIME (and SAS!) users:
• Social Media Analysis • Recommendation/Next Best Offer/Market Basket/etc. Analysis • Text Mining • Network Analysis • Forecasting • Realtime • Machine Learning • Making R Usable • Automated Model generation, scoring and tuning with Dynamine from Dymatrix • Big Data
Copyright © 2013 by KNIME.com AG. All rights reserved. 36 Thank you
Copyright © 2013 by KNIME.com AG. All rights reserved.