Leveraging SAS with KNIME

Thomas Gabriel and Phil Winters

Copyright © 2013 by KNIME.com AG. All rights reserved. SAS and all other SAS Institute Inc. product or service names are registered trademarks or trademarks of SAS Institute Inc. in the USA and other countries. ® indicates USA registration.

Copyright © 2013 by KNIME.com AG. All rights reserved. 2 A bit of History: The Original SAS Concept

Access Manage

Present Analyse

Copyright © 2013 by KNIME.com AG. All rights reserved. 4 More Products to meet new requirements

EG

DI

RTD

DB2

Macro

Oracle PCFILIE

Connect Access Manage MM

SCL Present Analyse EM

® AF

DVD QC JMP

OR

IML

IDP

ETS

STAT Graph

Copyright © 2013 by KNIME.com AG. All rights reserved. 5 Even More Products….

Teradata Hadoop

EG

DI

RTD

DB2

Macro

Oracle PCFILIE

Connect Access Manage MM

SCL Present Analyse EM

PMML …Model Manager …Model PMML High Performance Analytics Performance High

® AF

DVD QC JMP

OR

IML

IDP

ETS

STAT Graph

Text Mining Social Media Analytics …. in IML in R….

Copyright © 2013 by KNIME.com AG. All rights reserved. 6 Our new reality: You must have Choice and Control

New Infrastructures

New Data

New Other Science Applications

New New Users Methods New Business

Copyright © 2013 by KNIME.com AG. All rights reserved. Challenges 7 The KNIME Platform Open, Open Source, Free on the Desktop

Copyright © 2013 by KNIME.com AG. All rights reserved. 8 The question is not “which is better”….

Copyright © 2013 by KNIME.com AG. All rights reserved. 9 The question is: What’s the Big Difference?

SAS KNIME A script-oriented 4GL programming language A Script-free environment in four major parts: comprised of • The DATA step • Nodes and Connectors • Procedure steps • A macro language, • Metanodes and Flow variables for packaging a metaprogramming language • ODS statements

• GUIs: are most often front-ends • GUI is the interface to facilitate SAS Program script generation

• Through Proprietary Technology “Provide it all” • Through Open Source “Provide it all”

Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 10 KNIME: Free on the desktop including over 1000 native and embedded nodes:

MySQL, Oracle, etc. ETL Statistics via BIRT SAS, SPSS, etc. Row, PMML Excel, Flat, etc. Column XML Hive etc. Matrix Web Analytics Databases XML, PMML Text, Image Excel, Flat, etc. Text, Doc, Image Time Series Network Analysis Hive etc. Web Crawlers Social Media Analysis Text, Doc, Image Industry Specific Python Industry Specific Community / 3rd Community / 3rd R R Community / 3rd Community / 3rd JFreeChart Copyright © 2013 by KNIME.com AG. All rights reserved. 11 Community / 3rd Some 3rd party nodes are charged Accessing Data SAS Data can be read Each Node directly has a specific dialog

Huge Range of Read (and write) nodes available

At no cost.

Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 13 Accessing Data

Tables available at every node. Manually execute, start, stop, test nodes and sequences

Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 14 Transforming data Resource allocation concepts similar: balance IO, Memory, Resources

Nodes for columns, rows, matrix

ETL can be mixed and matched with all other nodes.

No Separate Progrmming step required

Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 15 Packaging Workflows

Drag and Mark nodes, Right Click, Collapse into Metanode

To create a Metanode that can be reused

Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 16 Our Example: “Next Best Offer!

The Access and Transform Metanode we created

Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 17 More Transformation

Applying Color to Attributes

KNIME nodes for transforming

Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 18 More Transformation

Calling your Favorite External Packages such as Java, Python, Matlab, SQL, REST, Or your favorite Database

Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 19 Using SAS Transformation

Calling SAS either Locally or Remotely. Automatically passing Data into and out of SAS

Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 20 Using SAS Transformation

Calling SAS either Locally or Remotely. Automatically passing Data into and out of SAS

Controlling how that happens with Flow Variables and Quickforms

Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 21 Giving you instant Customized Dialogs

Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 22 Giving you instant Customized Dialogs

No Macro Code. No SCL. No Java. No Other Programming Language required.

A Nice Dialog for many Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 23 types of users….. Explore and Visualize

Many nodes For Exploring and Visualizing

All that allow Marking and highlighting across all nodes No seperate package required. Other packages can be used if available Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 24 Be Open to All Methods, and let the best model win !!!

• KNIME Decision Tree • KNIME Logistic Regression • R Decision Tree • PMML model (from SPSS) • Other Methods • Weka • KNIME Uplift (courtesy of Dymatrix) • SAS

Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 25 Allow for best practices without learning scripting

• Missing Management • Partitioning • Binning and Bagging • Boosting • Logic and Flow Control • Cross Validation • Feature Elimination • Feature Selection • Error Handling

KNIME examples available for all these techniques….

Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 26 Report with your favorite Tool

Or use the built-in and Free BIRT open source BI Tool

Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 27 Deploy the Models

Score any Database Or how about scoring (including Hadoop) straight into SAS via Proc SQL and the (commercial) node from Dymatrix?

Use the PMML on another system (Such as ADAPA)

Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 28 Deploy to Excel, XML, etc.….. Or directly back to SAS!

SAS Transport Dataset

Copyright © 2013 by KNIME.com AG. All rights reserved. KNIME User Training 29 Power Users in Teams The KNIME Server (Commercial Software)

Copyright © 2013 by KNIME.com AG. All rights reserved. 31 Business Consumers via the web The KNIME Server (Commercial Software)

Copyright © 2013 by KNIME.com AG. All rights reserved. 32 Business Consumers via the web The KNIME Server (Commercial Software)

Copyright © 2013 by KNIME.com AG. All rights reserved. 33 Embedding, Automating, Security The KNIME Server (Commercial Software)

KNIME Server KNIME

Copyright © 2013 by KNIME.com AG. All rights reserved. 34 Hot Topics with KNIME (and SAS!) users:

• Social Media Analysis • Recommendation/Next Best Offer/Market Basket/etc. Analysis • Text Mining • Network Analysis • Forecasting • Realtime • Machine Learning • Making R Usable • Automated Model generation, scoring and tuning with Dynamine from Dymatrix • Big Data

Copyright © 2013 by KNIME.com AG. All rights reserved. 36 Thank you

[email protected]

Copyright © 2013 by KNIME.com AG. All rights reserved.