Process Mining: Data Science in Action

Process Mining Software

prof.dr.ir. www.processmining.org

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Data scientists need to have data (see previous lectures), tools, and preferably also questions. Process mining toolbox Spectrum of tools

part of a bigger suite

process mining specific Disco ProM

simple, easy powerful, many to use, ... functions, extendible, ... ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Spectrum of tools

Knime data RapidMiner mining ... R

part of a bigger suite

process mining specific Disco ProM

simple, easy powerful, many to use, ... functions, extendible, ... ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Spectrum of tools

Knime data RapidMiner mining ... R

part of a bigger suite Perceptive Process Mining ...... Celonis process mining Discovery specific Disco ProM

simple, easy powerful, many to use, ... functions, extendible, ... ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Spectrum of tools

Knime data RapidMiner mining ... R

part of a bigger suite RapidProM Perceptive Process Mining ...... Celonis process mining Discovery specific Disco ProM

simple, easy powerful, many to use, ... functions, extendible, ... ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Checklist for a process mining tool

people organizations business “world” machines processes documents • Which of the 10 process mining information system(s) activities are supported? provenance event logs

“pre “post mortem” current historic mortem” • Which perspectives are data data supported? navigation auditing cartography • What modeling notations are check detect predict explore promote discover enhance compare diagnose recommend supported? models de jure models de facto models • What import and export formats control-flow control-flow are supported (event logs, data/rules data/rules

resources/ resources/ models, alignments, etc.)? organization organization

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Example: ProM

people organizations business “world” machines • Aims to cover the whole process processes documents

information system(s) mining spectrum. provenance event logs • Notations supported: Petri nets “pre “post mortem” current historic mortem” data data (many types), BPMN, C-nets, fuzzy navigation auditing cartography models, transition systems, Declare, check detect predict explore promote discover

enhance etc. compare diagnose recommend

models • Also supports conformance de jure models de facto models checking and operational support. control-flow control-flow • Many plug-ins are experimental data/rules data/rules

resources/ resources/ prototypes and not user friendly. organization organization

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Example: Disco

people organizations business “world” machines processes documents • Focus on discovery and performance

information system(s) analysis (including animation). provenance event logs • Powerful filtering capabilities for “pre “post mortem” current historic mortem” data data comparative process mining and ad- navigation auditing cartography hoc checking of patterns. check detect predict explore promote discover enhance compare diagnose

recommend • Uses a variant of fuzzy models, etc.

models • Does not support conformance de jure models de facto models checking and operational support. control-flow control-flow

data/rules data/rules • Easy to use and excellent

resources/ resources/ organization organization performance.

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Other commercial tools (1/2)

• Perceptive Process Mining (before Futura Reflect and BPM|one) (Perceptive Software) • ARIS Process Performance Manager (Software AG) • QPR ProcessAnalyzer (QPR) • Celonis Discovery (Celonis) • Interstage Process Discovery (Fujitsu) • Discovery Analyst (StereoLOGIC) • XMAnalyzer (XMPro) • ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Other commercial tools (2/2)

• Most tools are similar to Disco in terms of scope (but often less user-friendly and less performant). • Typical coverage:

control-flow control-flow and ... only time resources data . discovery L M conformance L+M D enhancement L+M M

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Three examples (next to ProM and Disco)

Knime data RapidMiner mining ... R

part of a 1 bigger suite RapidProM 3 Perceptive Process Mining ... 2 ... Celonis process mining Discovery specific Disco ProM

simple, easy powerful, many to use, ... functions, extendible, ... ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) RapidProM

www.rapidprom.org

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Create a process mining

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) check conformance using alignments

copy objects create ProM (technicality) context (technicality)read event log discover a Petri net using the Alpha algorithm ©Wil van der Aalst & TU/e (use only with permission & acknowledgements)

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Another process mining workflow

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) create dotted chart create Petri net using inductive miner

read event create social log network based on handover of work ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Results

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) RapidProM

• Can be used to repeat process mining analyses (scientific experiments, reusable macros, ). • Can be used to combine process mining with , text mining, etc.

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Celonis Discovery web-based

supports process discovery

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) (but no concurrency) frequencies

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) time information

©Wil van der Aalst & TU/e (use only with permission & acknowledgements)

animation ©Wil van der Aalst & TU/e (use only with permission & acknowledgements)

frequency- based filtering (1/3)

©WilBPI van challengeder Aalst & TU/e (use2013 only withlog permission & acknowledgements)

frequency- based filtering (2/3)

©WilBPI van challengeder Aalst & TU/e (use2013 only withlog permission & acknowledgements)

frequency- based filtering (3/3)

©WilBPI van challengeder Aalst & TU/e (use2013 only withlog permission & acknowledgements)

performance-related dashboards ©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Perceptive Process Mining

web-based

supports process discovery

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) (two types)

©Wil van der Aalst & TU/e (use only with permission & acknowledgements)

various ways of filtering e.g. based on frequency

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Tokens colored based on flowtime

replay event log

tokens colored based on flowtime

©Wil van der Aalst & TU/e (use only with permission & acknowledgements)

replay based on social network (top-3)

©Wil van der Aalst & TU/e (use only with permission & acknowledgements)

performance- related charts

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Related tools Although related, (beyond pure process mining tools)How about BI tools?classical data mining and BPM systems can tools typically do not be extended with support process process mining mining. capabilities.

Related to operational support, but typically not process-oriented.

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) (BI) tools

• Possible definition (one of many): "BI is a set of methodologies, processes, architectures, and technologies that transform raw data into meaningful and useful information used to enable more effective strategic, tactical, and operational insights and decision making." [Boris Evelson, Forrester Research 2010] • Many products, e.g., IBM Cognos Business Intelligence (IBM), Oracle Business Intelligence (Oracle), SAP BusinessObjects (SAP), WebFOCUS (Information Builders), MS SQL Server (Microsoft), MicroStrategy (MicroStrategy), NovaView (Panorama Software), QlikView (QlikTech), SAS Enterprise Business Intelligence (SAS), TIBCO Spotfire Analytics (TIBCO), Jaspersoft (Jaspersoft), and Pentaho BI Suite (Pentaho).

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Typical functionality

• ETL (Extract, Transform, and Load). • Ad-hoc querying. • Reporting. • Interactive dashboards. • Alert generation.

©Wil van der Aalst & TU/e (use only with permission & acknowledgements) Tools are available, but process mining is still a relatively young discipline.

New tools will appear in Avoid the trap: coming years and process • process mining ⊆ data mining mining functionality will • process mining ⊆ BI be embedded in more • process mining ⊆ BI/BPM/DM suites. Use checklists! Part I: Preliminaries Part III: Beyond Process Discovery Chapter 1 Chapter 2 Chapter 3 Chapter 7 Chapter 8 Chapter 9 Introduction Process Modeling and Data Mining Conformance Mining Additional Operational Support Analysis Checking Perspectives

Part II: From Event Logs to Process Models Part IV: Putting Process Mining to Work

Chapter 4 Chapter 5 Chapter 6 Chapter 10 Chapter 11 Chapter 12 Getting the Data Process Discovery: An Advanced Process Tool Support Analyzing “Lasagna Analyzing “Spaghetti Introduction Discovery Techniques Processes” Processes”

Part V: Reflection Chapter 13 Chapter 14 Cartography and Epilogue Navigation

©Wil van der Aalst & TU/e (use only with permission & acknowledgements)©Wil van der Aalst & TU/e (use only with permission & acknowledgements)