The Business Analytics Directory

June 2016

About this Directory

Business Intelligence

KPI Dashboards

BI with Advanced Analytics

Visual Analytics Platforms

Free and Open Source Reporting Tools

MySQL Reporting

Machine Learning

Machine Learning as a Service

Commercial Predictive Analytics Platforms

Open Source Platforms

Free Data Mining Clustering Tools

Text Analytics

Text Mining and Text Analytics Platforms

Free NLP Machine Learning

Decision Support

Free and Open Source Decision Support Software

Social Network and Media Analysis

Free and Open Source Social Network Analysis Software

Commercial Social Network Analysis Tools

Developer and Operations

Free Python Data Tools

Free ETL Tools for MySQL

Free Data Ingestion Tools

IoT Platforms

Data Management

Data Quality Tools

Solutions

Sales Recommendation Engines

About this Directory

This directory is a work in progress and will be updated every month. It is free to download and distribute as desired.

Every supplier and product entry contains a hyperlink to the relevant web site.

Business Intelligence

KPI Dashboards

Key performance indicators (KPIs) provide measures of performance that are important to an organization, department, or even an individual. They are often represented on dashboards, where several KPIs can be seen together. Formats include a wide variety of charts including pie charts, histograms, dials, gauges, and even maps.

KPI dashboard software varies considerably. General purpose dashboard software from suppliers such as Qlik and Tableau is potentially more powerful, but is generally used as part of a larger business intelligence initiative. Specialized KPI platforms such as Bilbeo can provide specific KPI functionality, and may be more suitable for some organizations.

Bilbeo is perhaps the smartest KPI dashboard platform available.
• Cloud based service (on site options available) that predicts KPI performance and correlates KPIs to find leading indicators.
• API to access on-premises data sources, and alerts generated on KPI performance.
• KPIs rated from ‘Very Bad’ to ‘Excellent’ automatically, with drill down into individual KPIs.
• White labelling.
• Pricing based on size of data and number of users.

Bittle - good value online service.
• Ideal for the SME with white labelling option.
• Collaboration features and mobile access.
• Connectors to many database sources, spreadsheets, online apps.
• Graphics library includes donut, pie, histogram, 3D, bar, gauge, funnel, maps, radar, treemap etc.
• Free single user subscription up to $600/mo for large data volumes.

ClicData - easy to use but with sophisticated features.
• Cloud based dashboard platform with easy to use interface.
• Sophisticated features such as conditional output, pivot tables and complex calculations.
• Wide range of charts including maps, bar, pie, tables, line, dials, gauges etc.
• Connectors for database, spreadsheets, social data, online apps, and support for SQL if needed.
• Free single user version through to the Enterprise at just $50 per month.

Cyfe is easy to use and extremely good value.
• Extensive support for internal and external data sources.
• Many graph and chart types including line, spline, area, area-spline, stacked area, column, … , pie, donut, list, funnel, tables – and so on.
• A top contender for people considering a social media monitoring dashboard.
• The whole environment is widget driven, and so it is very easy to connect with social sources such as Facebook, LinkedIn and Twitter (many others too).
• A free account is offered for people with modest needs, or the Premium service offers pretty well unlimited everything for $19 per month.

Dundas Dashboard - high end sophisticated dashboarding.
• Extensive collaboration features allowing comments to be attached to data points.

© butleranalytics.com 5 Business Analytics Directory

• Support for OLAP processing and complex metrics.
• Support for enterprise databases and applications.
• Above average visualization capabilities and part of broader BI suite.
• Pricing model unknown.

Geckoboard - cloud service best for online data.
• Online dashboards service with commonly used visuals and support for online and social data sources.
• Chart types include bar, bullet graph, funnel, dials, Highcharts charting library support, leaderboard, line, pie, maps and several others.
• Over 60 integrations with online apps and data sources, spreadsheet data, social sources.
• Custom widgets support push (data pushed to dashboard) and pull (data pulled by dashboard from datasource).
• Starts at $49/mo for single user with 2 dashboards, up to $399/mo for 15 users and 25 dashboards, plus more if needed.

iDashboards - general cloud dashboard platform.
• Cloud (or on-premises) dashboard platform with particularly attractive visuals.
• Data Integrator for blending data from multiple sources (databases, social, spreadsheet, web apps etc).
• Drag and drop interface and full drill down into data from dashboards.
• Good mobile support.
• Pricing unavailable.

Klipfolio - very capable and reasonably priced.
• Sophisticated online dashboard service with good data handling.
• Hundreds of connectors for database, spreadsheet, online apps, social data.
• Ability to combine data sources and create complex formulas.
• Dozens of chart types - dials, line, pie, maps, gauges, tables etc.
• Pricing starts at $20/mo one user, to thousands of users.

Qlik Sense - sophisticated desktop and enterprise dashboards.
• Very powerful desktop and enterprise dashboard and platform.
• Supports large number of data sources with an optional service for a large number of online data services.
• Associative engine makes combining data sources straightforward.
• Large number of chart types and can be extended if needed.
• Desktop version free with a free cloud dashboard sharing service. Enterprise edition requires server license.

SimpleKPI - specifically targeted at KPI applications.
• Complete suite of KPI apps for dashboards, reporting, analytics, tracking.
• Extensive sharing and collaboration.
• Easy to use with pre-built templates.
• Supports most data sources and comes with a rich set of charts.
• £11/mo for a single user, £467 for 100 users and upward.

Slemma is a cloud BI and dashboard software that allows devs and non-devs alike to do ad hoc visual analysis against data. Slemma plugs in directly to the most popular databases (including XMLA data sources), cloud storage and cloud services, and lets users bypass the need for a warehouse. It is easy to create a dashboard by using the intuitive drag-and-drop interface or by writing and running SQL queries to build charts. Teams and clients can access dashboards created in Slemma through the web, and can slice and dice the data in real time by dimensions and measures for their own needs. The platform also supports data transformation features like aggregation, calculations, filtering, sorting and ordering data. A free version supports Excel and CSV files and integration with Google Drive, Dropbox and OneDrive. Paid subscriptions start at US $29 per month.

Tableau - rich graphics, cloud or on premises.
• Particularly rich graphics with desktop, server and cloud editions.
• Sharing, mobile and collaboration features.
• Fast in-memory processing for most data sets.
• Complete data visualization and discovery platform.
• Pricing based on number of users - typically around $2000 per seat.


BI with Advanced Analytics

Business Intelligence has traditionally delivered a look in the rear view mirror, providing a description of what has happened or is happening. Advanced analytics are more concerned with the future, allowing businesses to predict how customers, suppliers and other aspects of business will behave. Right now, advanced analytics is only supplied by a few BI vendors - eventually they will all have to deliver. The crucial property of advanced analytics in BI is ease of use - and this is what makes it difficult for suppliers to deliver. The platforms listed below offer varying degrees of support for advanced analytics, but new entrants such as BeyondCore set the pace to some degree.

Alteryx is an interesting product, filling a void that most other analytic platforms do not address. It allows skilled business users and analysts to analyse their data using a combination of data visualization and predictive analytics tools. It also supports spatial analytics where location is important.

BeyondCore - The four dominant modes of analysis are covered by BeyondCore – descriptive (traditional BI), discovery (looking for unknown facts), predictive (finding consistent patterns that can be used in future activities), and prescriptive (actions that can be taken to improve performance). What is particularly significant about BeyondCore is that all the complexity is hidden from the business user.

Birst is a cloud based BI platform that caters for self-service and production needs. The underlying data architecture is excellent, and places Birst ahead of the pack in this respect. It is one of only a few true enterprise BI platforms available as a cloud deployment (public or private). Advanced analytics are well supported via the Weka toolkit.

Microstrategy is a very deep and very broad business intelligence platform. It is most suitable for large businesses with extensive production reporting needs. The data visualization capabilities are above average, but not exceptional. Microstrategy does however support advanced analytics, and has done for some time.

SAS Visual Analytics is an easy-to-use, sophisticated analytics platform that stands apart from most of the competition. Several features guide and advise the user to make the experience informed and effective - something almost totally lacking in most other BI platforms.

Sisense majors on ease-of-use and performance. It is capable of handling very large data volumes at speed, and provides powerful data visualization capabilities. Sisense also supports good extensibility, and version 6 improves on this and adds support for advanced analytics via R.

Spotfire from TIBCO successfully combines advanced analytics with an easy-to-use interface. At the most basic level users can create charts and dashboards with considerable ease. As skills and experience increase so other forms of analytics are available. The implementations of additional analytic capabilities are of a very high quality, and Spotfire boasts one of the fastest R runtime engines available.

Yellowfin is an excellent all-round cloud enterprise BI solution with good data connectivity and management, numerous mechanisms for effective governance and support for embedding and the creation of cloud based services. It also supports some forms of advanced analytics.


Visual Analytics Platforms

Visual analytics is the act of finding meaning in data using visual artifacts such as charts, graphs, maps and dashboards. In addition, the user interface is typically driven by drag and drop actions using wholly visual constructs. The up side to this is ease of use - the down side can be limited functionality. Many platforms include some form of scripting language for more complex requirements, and some of the newer tools also include help with interpretation. Visual analytics are very prone to misinterpretation, but guidance and significance are increasingly being incorporated into visual analytics software.

BeyondCore - The four dominant modes of analysis are covered by BeyondCore – descriptive (traditional BI), discovery (looking for unknown facts), predictive (finding consistent patterns that can be used in future activities), and prescriptive (actions that can be taken to improve performance). What is particularly significant about BeyondCore is that all the complexity is hidden from the business user.

BIME is a cloud based visual analytics platform that connects with a wide range of data sources, both internal and external. It offers a very easy to use interface, supports extensive collaboration and does all the things that might be expected of a BI solution. All the usual charts are supported, in addition to highly functional maps, relational analysis, funnel charts – all in a very customizable environment.

ClearStory provides a cloud based platform that takes business users from data to visualization with minimal need for technical skills. In common with several other suppliers (Tamr, Paxata, Platfora and others) it uses machine learning techniques and Apache Spark in-memory processing to take data from its raw state to a state where business users can create the data visualizations they need.

DOMO is a cloud based BI solution that caters to management and executive information needs. It also comes with strong collaboration support, and is particularly easy to use. The downside of this is that DOMO provides relatively weak data visualization capability.

GoodData is a cloud based BI platform ideal for business users who need an easy-to-use interface. It supports the creation of charts and dashboards, and connectivity to a large number of data sources. This is not a particularly sophisticated platform, but does offer recommended visualizations and mechanisms to embed visualizations into other applications.

Inetsoft provides reporting and data visualization capability that is well appreciated by its OEM customers as well as enterprise users. It employs some novel techniques for high performance query execution, and addresses the needs of power business users in addition to production needs.

InfoCaptor is available through the cloud, or can be installed locally – in both cases it provides a browser interface. Technically this is a very competent offering, with a bewildering number of chart types, and the facility to easily create and distribute dashboards.

Logi Analytics provide Logi Info for operational reporting, dashboards and self-service needs, and Logi Vision for data visualization and exploration. It’s a very broad range of functionality, and Logi offers excellent extensibility, developer tools, embedding and ease-of-use.

Looker bucks the trend by placing a powerful modeling language at the heart of its product architecture, and by directly processing data held in external databases. It is particularly well suited to businesses that need to analyze complex metrics.

Microsoft Power BI data visualization sets a standard for ease-of-use with its natural language query. It is also evolving at a rapid rate, and is starting to challenge established players in this space. Expect to see it become very widely adopted.

Microstrategy is a very deep and very broad business intelligence platform. It is most suitable for large businesses with extensive production reporting needs. The data visualization capabilities are above average, but not exceptional. Microstrategy does however support advanced analytics, and has done for some time.

Prognoz provides a broadly based BI capability with several types of analysis – specifically OLAP, time-series-analysis, and modeling and forecasting. It also provides end-user tools, and specifically the ability to create charts and dashboards.

Qlik Sense is an easy to use visual analytics platform with an excellent data discovery engine and can easily be extended for bespoke needs. It comes as a free stand-alone desktop version, an enterprise server edition and also as a cloud service.

SAP Lumira is the obvious data visualization tool for businesses with a large existing SAP investment. It doesn’t distinguish itself in any particular way, and other users will find a large number of more suitable alternatives are available.

SAS Visual Analytics is an easy-to-use, sophisticated analytics platform that stands apart from most of the competition. Several features guide and advise the user to make the experience informed and effective - something almost totally lacking in most other BI platforms.

Sisense majors on ease-of-use and performance. It is capable of handling very large data volumes at speed, and provides powerful data visualization capabilities. Sisense also supports good extensibility, and version 6 improves on this and adds support for advanced analytics via R.

Spotfire from TIBCO successfully combines advanced analytics with an easy-to-use interface. At the most basic level users can create charts and dashboards with considerable ease. As skills and experience increase so other forms of analytics are available. The implementations of additional analytic capabilities are of a very high quality, and Spotfire boasts one of the fastest R runtime engines available.

Tableau is the benchmark for easy to use data visualization tools, and has certainly caused other suppliers to improve their act. The scalability of the product is excellent, but it lacks the sophisticated developer support of some other products. It is a very good adjunct to BI platforms which do not have particularly good visualization capabilities.

ThoughtSpot BI platform is given the tag line of ‘search driven analytics’. The user interface is driven by a search bar, much as a user of Google experiences, and natural language queries can be entered using the language of the business concerned.

Yellowfin is an excellent all-round cloud enterprise BI solution with good data connectivity and management, numerous mechanisms for effective governance and support for embedding and the creation of cloud based services. It also supports some forms of advanced analytics.


Free and Open Source Reporting Tools

Both free and open source reporting tools are listed here. The free versions are sometimes cut down versions of a commercial product, but some are fully functional.

BIRT (free BI suite) is an open source software project that provides the BIRT technology platform to create data visualizations and reports that can be embedded into rich client and web applications, especially those based on Java and Java EE. BIRT is a top-level software project within the Eclipse Foundation.

DataVision (free reporting tool) is an Open Source reporting tool similar to Crystal Reports. Reports can be designed using a drag-and-drop GUI. They may be run, viewed, and printed from the application or exported as HTML, XML, PDF, Excel, LaTeX2e, DocBook, or tab- or comma-delimited text files. The output files produced by LaTeX2e and DocBook can in turn be used to produce PDF, text, HTML, PostScript, and more.

DataVision is written in Java and runs almost anywhere. It can generate reports from databases or text data files. Any database with an available JDBC driver should work: Oracle, PostgreSQL, MySQL, Informix, hsqldb, Microsoft Access, Progress, and more. Columns read from text files can be separated by any character.

Report descriptions are stored as XML files. This means you can not only use the DataVision GUI but you may also edit reports using your favorite text editor.

dmyreports (free reporting tool) is a dynamic report generator for MySQL. End users will be able to select tables, fields and set conditions graphically using a web based interface to generate reports. These reports can then be saved or exported to other formats. (currently in alpha).

fyiReporting RDL Project is a powerful report and charting system based on Report Definition Language (RDL). Tabular, free form, matrix, charts are fully supported. Report output may be displayed as HTML, PDF, Excel, RTF, XML, .Net Control, Web Archive, and to a printer. Libraries exist for use in ASP.NET, Windows .Net, and command line applications. A WYSIWYG designer allows you to create reports without knowledge of RDL. Wizards are available for creating new reports and for inserting new Tables, Matrixes, and Charts into existing reports.

i-net Clear Reports is a full-blown reporting solution completely coded in Java and also available for .NET. It comes with a free report designer program to custom design report template files and is Platform Independent. It provides Crystal Reports Support, Ad Hoc Reporting, Report Repository and free mobile repository apps for Android OS and iPhone, iPod touch with iOS.

icCube community edition is a free end-to-end BI solution covering reporting and analytics. The reporting is designed for business users and facilitates exploration of OLAP data via reports, graphs and dashboards. Analytics include an ETL layer and support for multidimensional analysis and MDX+. XMLA is also supported for connection to other reporting tools (eg Excel), and support for R and Java means other forms of analysis can be carried out.

Intellicus is a comprehensive suite of all BI & Reporting capabilities; Ad-Hoc & Standard reports, Dashboards, ETL, OLAP and more. Visualizations provide contemporary style of web controls allowing users to derive meaningful value from their data. Generate, format and analyze multiple Ad-hoc reports with ease. Use desktop and web studio to design complex & pixel perfect reports. Visualize your business data in dashboards. Free cut down version and full enterprise version.

Jaspersoft Community addresses the creation of reports and charts. The Community edition essentially provides a reporting and charting environment with supporting infrastructure. The report designer supports charts, images, crosstabs and sub-reports for sophisticated report layouts. Interactive report viewing is a browser based report viewer with sorting, filtering and formatting of report snapshot views. A centralized repository provides infrastructure for reporting and stores user profiles, reports, dashboards and analytic views.

JMagallanes is an open source end user application for Olap and Dynamic Reports written in Java/J2EE. It combines static reports (based on JasperReports), a Swing pivot table for OLAP analysis, and charts (based on JFreeChart). It reads from many data sources such as SQL, Excel, XML, and others, and produces many outputs such as PDF, XML, and application specific files for later off-line visualization of reports.

MyNeoReport library is a free and easy to use way to design and add reports to .NET applications. It supports a WYSIWYG report designer and the libraries have to be distributed with the application.

Pentaho Community (free BI suite) – main components are the reporting tools, the data integration platform, the ROLAP analytics platform and the data mining tools. With the Pentaho-Report-Designer you can create report-definitions in a graphical environment. Reports are usually published to the Pentaho-Platform, which allows you to manage, run and schedule the reports you created. If you are new to Pentaho-Reporting, you probably want to start with the Pentaho Report-Designer.

Internally, reports are executed by the Pentaho Reporting Classic Engine. Pentaho Reporting encompasses more than two dozen software projects that facilitate creating and publishing data-driven business reports.

Report Manager is a reporting application (Report Manager Designer) and a set of libraries and utilities to preview, export or print reports. It includes native .Net and Delphi/C++Builder libraries, an ActiveX component and also a standard dynamic link library for use in any language like GNU . The reporting engine includes a TCP enabled Report Server so thin clients can obtain reports processed in the server. Also a fully functional web report server application is available, generating Adobe PDF files on the fly.

This is a true net and web report server with no license fees and multiprocessor support. Report Manager is open source under the MPL license model (including a GPL allowed use clause), so you can use it in your commercial software.

ReportServer (free reporting tool) – This provides an extremely flexible open source reporting and dashboard environment. It supports Eclipse Birt, JasperReports and SAP Crystal Reports reporting engines in addition to its own ad-hoc oriented reporting tools. The user interface is web based and it supports a wide range of admin tools.

Central to ReportServer is the Dynamic List. This is the preferred method supporting a wide range of functions such as column selection, filtering, sorting, grouping, sub-totals, calculation and so on. JasperReports and Eclipse Birt tend to be used for ‘pixel perfect’ reporting with output to a pdf file. Finally Script Reports are used for particularly complex reports, and require programming skills to use. Interactive dashboards are supported and are generally constructed from items called dadgets (Dashboard Gadgets) – these can be anything from a report to an interactive HTML5 app.

Currently supported data sources include Oracle, Microsoft SQL Server, IBM Informix, IBM DB2, MySQL, PostgreSQL, h2 and of course csv files.


Seal Report is an open source tool for the Microsoft .Net Framework, written entirely in C#. It supports dynamic SQL sources, native pivot tables, HTML 5 charts and Microsoft charts, HTML rendering using the Razor engine, web report server and task scheduler.

SpagoBI (free BI suite) is essentially a very large collection of open source software brought together to create a broad business intelligence capability. In fact it goes beyond the traditional notion of BI to embrace domains such as data mining and BPM. It supports four engines: JasperReport, BIRT, Accessible report, BO. SpagoBI lets you build structured reports, using structured information views (e.g. lists, tables, crosstabs, reports), and export them in several formats (HTML, PDF, XLS, XML, TXT, CSV, RTF).

Zoho Reports is an online reporting and business intelligence service that helps you easily analyze your business data, and create insightful reports & dashboards for informed decision-making. It allows you to create and share powerful reports. The free service limits users (2) and data, and has cut down functionality.


MySQL Reporting Software saved or exported to other formats. (currently in alpha). These reporting tools tend to come as part of JasperReports (free BI suite) Server is a a more extensive BI suite or as stand-alone stand-alone and embeddable reporting tools. Many are free and some are server. It provides reporting and analytics that commercial products. In some cases the free can be embedded into a web or mobile products are pretty well unsupported, and application as well as operate as a central suppliers offer commercial versions with information hub for the enterprise by training, and support. delivering mission critical information on a BIRT (free BI suite) is an open source real-time or scheduled basis to the browser, software project that provides the BIRT mobile device, printer, or email inbox in a technology platform to create data variety of file formats. JasperReports Server is visualizations and reports that can be optimized to share, secure, and centrally embedded into rich client and web manage your Jaspersoft reports and analytic applications, especially those based on Java views. and Java EE. BIRT is a top-level software Jaspersoft ETL is a state-of-the-art data project within the Eclipse Foundation. integration engine, powered by Talend. DataVision (free reporting tool) is an Open Extract data from various sources, transform Source reporting tool similar to Crystal the data based on defined business rules, Reports. Reports can be designed using a and load into a centralized data warehouse or drag-and-drop GUI. They may be run, data mart for reporting and analysis. viewed, and printed from the application or A powerful eclipse-based report designer for exported as HTML, XML, PDF, Excel, JasperReports and JasperReports Server. LaTeX2e, DocBook, or tab- or comma- Build reports from any data source, format delimited text files. 
The output files produced the look and feel for print or on-screen by LaTeX2e and DocBook can in turn be reading, deploy to JasperReports Server or used to produce PDF, text, HTML, your own JasperReports implementation, and PostScript, and more. export to a wide range of formats. DataVision is written in Java and runs almost JReport (commercial BI tool) from Jinfonet anywhere. It can generate reports from excels in embeddable reporting and data databases or text data files. Any database visualisation. It is used by some of the world’s with an available JDBC driver should work: largest businesses, typically to provide online Oracle, PostgreSQL, MySQL, Informix, reporting facilities to customers, but is also hsqldb, Microsoft Access, Progress, and used in many businesses to provide more. Columns read from text files can be information directly to employees and separated by any character. partners. At the current time, around two Report descriptions are stored as XML files. thirds of Jinfonet’s business comes from This means you can not only use the OEM, ISVs and other third parties who DataVision GUI but you may also edit reports provide embedded intelligence in a variety of using your favorite text editor. services and products. JReport supports dashboards, charts, reporting, and connects dmyreports (free reporting tool) is a dynamic to a wide variety of data sources, including report generator for MySQL. End users will be , relational databases and cloud able to select tables, fields and set conditions based data. graphically using a web based interface to generate reports. These reports can then be

© butleranalytics.com 14 Business Analytics Directory

myDBR (commercial reporting tool) is a web-based reporting system for relational databases that allows you to build a feature rich, manageable and scalable reporting environment. myDBR manages the distribution and access rights automatically, so it also scales well to large scale installations. Users are able to access reports using just a web browser; no additional software installation is required.

With myDBR you can easily link reports together, allowing you to build a report network. This gives your users a natural path for data mining and drill-down reports using business concepts close to them. Moving from individual reports to a report network makes sure you are getting the most out of your data.

MySQL Pivot Table Generator creates concise and informative web based pivot tables (cross tabulation) for any type of data summary without writing any code or conducting any database queries, and will simply and seamlessly export your pivot table’s data into Excel. It automatically updates your web based pivot tables whenever the database has been updated with new information.

NextReports (free reporting tool) provides three free open source utilities that support report creation using a variety of database platforms.

• NextReports Designer is an application to design in-grid reports, using connections to most popular databases including Oracle, MySql, MSSQL, PostgreSQL, Firebird and Derby. NextReports Designer has an intuitive interface that makes report creation a quick and easy task.

• NextReports Engine is a lightweight (330k) Java platform development library which can be used to run NextReports inside applications. This library is very easy to use; reports can be integrated with just a few lines of code.

• NextReports Server supports the scheduling of NextReports and Jasper reports, and automatically delivers reports via E-mail, FTP, SSH, Windows Share or WebDAV in most popular file formats (HTML, EXCEL, PDF, etc) or in real time dashboards.

Pentaho Community (free BI suite) – main components are the reporting tools, the data integration platform, the ROLAP analytics platform and the data mining tools. With the Pentaho-Report-Designer you can create report-definitions in a graphical environment. Reports are usually published to the Pentaho-Platform, which allows you to manage, run and schedule the reports you created. If you are new to Pentaho-Reporting, you probably want to start with the Pentaho Report-Designer.

Internally, reports are executed by the Pentaho Reporting Classic Engine. Pentaho Reporting encompasses more than two dozen software projects that facilitate creating and publishing data-driven business reports.

ReportServer (free reporting tool) – This provides an extremely flexible open source reporting and dashboard environment. It supports Eclipse Birt, JasperReports and SAP Crystal Reports reporting engines in addition to its own ad-hoc oriented reporting tools. The user interface is web based and it supports a wide range of admin tools.

Central to ReportServer is the Dynamic List. This is the preferred method, supporting a wide range of functions such as column selection, filtering, sorting, grouping, sub-totals, calculation and so on. JasperReports and Eclipse Birt tend to be used for ‘pixel perfect’ reporting with output to a pdf file. Finally, Script Reports are used for particularly complex reports, and require programming skills to use. Interactive dashboards are supported and are generally constructed from items called dadgets (Dashboard Gadgets) – these can be anything from a report to an interactive HTML5 app.
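The cross tabulation behind a pivot table, as mentioned for MySQL Pivot Table Generator above, can be illustrated with a short Python sketch. This is not how any of the listed tools is implemented; the `pivot` function, the field names and the sales records are hypothetical and stand in for rows returned by a database query.

```python
from collections import defaultdict


def pivot(rows, row_key, col_key, value_key):
    """Cross-tabulate rows: sum value_key for every (row_key, col_key) pair."""
    table = defaultdict(lambda: defaultdict(float))
    for row in rows:
        table[row[row_key]][row[col_key]] += row[value_key]
    # Convert to plain dicts so the result prints and compares cleanly.
    return {r: dict(cols) for r, cols in table.items()}


# Hypothetical sales records (assumed data, not taken from any product).
sales = [
    {"region": "North", "quarter": "Q1", "amount": 100.0},
    {"region": "North", "quarter": "Q2", "amount": 150.0},
    {"region": "South", "quarter": "Q1", "amount": 80.0},
    {"region": "North", "quarter": "Q1", "amount": 20.0},
]

# One row per region, one column per quarter, summed amounts in the cells.
summary = pivot(sales, "region", "quarter", "amount")
```

Each distinct `region` becomes a row and each distinct `quarter` a column, which is the summary a pivot table presents; a reporting tool adds the query, refresh and export steps around this core.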


Currently supported data sources include Oracle, Microsoft SQL Server, IBM Informix, IBM DB2, mySQL, PostgreSQL, h2 and of course csv files.

Smart Report Maker (commercial reporting tool) – from MySQLReports provides three tools to generate reports, create charts and build forms:

• Smart Report Maker provides an environment to compile and manage an unlimited number of MySQL database reports based on tables and/or queries. It’s a quick and easy PHP MySQL report generator that gives you the ability to prepare the reports that you need at the click of a button.

• Smart Chart Maker is very similar to Smart Report Maker in terms of usability and functionality. It supports an unlimited number of dynamic charts that are fed directly from a MySQL database and based on tables and/or queries.

• PHP MySQL Form Maker allows web forms to be easily generated, typically without programming.

SpagoBI (free BI suite) is essentially a very large collection of open source software brought together to create a broad business intelligence capability. In fact it goes beyond the traditional notion of BI to embrace domains such as data mining and BPM. It supports 4 engines: JasperReport, BIRT, Accessible report and BO. SpagoBI allows structured reports to be produced using structured information views (e.g. lists, tables, crosstabs, reports) and exported in several formats (HTML, PDF, XLS, XML, TXT, CSV, RTF).

Stimulsoft (commercial BI) Reports Server is a complete Business Intelligence, client-server solution that provides reporting and analytics. The front end provides the facility to visualize data. The back end provides data processing and storing. Stimulsoft Reports Server can work as a standalone report server as well as an embeddable solution. The software provides all features appropriate for a reporting server – creating and editing reports, storing them, scheduling and delivering them to the client.

Ubiq (commercial report tool) is an easy to use but powerful web based reporting tool for MySQL data. You can use it to analyze your business data, create web based reports & dashboards to track signups, sales, operations, and other business information without any programming. Data can be queried using drag and drop, and dashboards are updated automatically to reflect the latest information.

In drag and drop mode charts can be created by selecting fields from a list reflecting data available in connected databases. Functions can also be applied to data by selecting dropdown options, and filters and joins can be created with changes reflected in real-time. There is a choice of most charting types – table, line, bar (stacked, group, percent), scatter plots, stacked area, column (grouped, percent, stacked), pie, donut and funnel. Charts can also be created using sql syntax, and regardless of creation method can be exported to PDF, PNG or CSV.

Windward Solution (commercial reporting platform) is ideal for commercial software (OEM), system integrator and enterprise developers who need to add Reporting and DocGen functionality to desktop, online, SaaS or mobile products. The .NET or Java report engines connect to virtually any data source and can be quickly integrated in 14 lines of code, meaning faster time to market. Windward enables design without restrictions through AutoTag, the Microsoft Office-based template design tool. Any user can create and rapidly revise templates, freeing up dev resources.


Machine Learning

Machine Learning as a Service

Machine learning is a nebulous term used to address capabilities such as face recognition, speech recognition, natural language processing, predictive analytics and specific techniques such as deep learning (deep neural networks). All the services listed below provide APIs so the various services can be used, usually with the aim of embedding some form of intelligence into an application. Free accounts are offered by IBM, Amazon and Microsoft - usually throttled in some way, but giving an opportunity to evaluate. Some of the platforms also provide algorithm development and solution building tools (e.g. Microsoft, FICO and Amazon).

Algorithmia provides a cloud based platform for algorithm developers to share their work, and for application developers to incorporate algorithms into their applications. Hundreds of algorithms are already available addressing most conceivable tasks including text analytics, computer vision, graphs, machine learning and others. Costing is based on the frequency of algorithm usage and compute time.

algorithms.io provides a cloud hosted service to collect data, generate classification models and score new data. Code is added to web and portable device applications which stream data to the algorithms.io service, where it is captured and processed using random forest, support vector machine, K-Means, decision tree, and neural network algorithms. The resulting model is then used to categorize new data. The results are passed back as a parsed data stream to power apps, or as reports and visualizations. A set of APIs is provided for developers to integrate machine learning into web and mobile applications. The algorithms are categorized as anomaly detection, clustering, classification and collaborative filtering.

Amazon Machine Learning is a service that makes it easy for developers of all skill levels to use machine learning technology. Amazon Machine Learning provides visualization tools and wizards that guide you through the process of creating machine learning (ML) models without having to learn complex ML algorithms and technology. Once your models are ready, Amazon Machine Learning makes it easy to get predictions for your application using simple APIs, without having to implement custom prediction generation code, or manage any infrastructure.

BigML is a cloud based machine learning platform with an easy to use graphical interface. It also provides simple mechanisms to incorporate predictive models into production applications through its REST API. The platform combines supervised learning (to build predictive models), unsupervised learning (to understand behavior), anomaly detection (used in fraud detection), data visualization tools (scatter-plots and Sunburst diagrams) and many mechanisms for exploring data. The modest pricing will make it attractive to medium and large businesses who want the benefits associated with machine learning without large upfront costs and implementation delays. BigML is a pragmatic, low cost, easy to use platform for building powerful predictive models.

FICO arguably has the most experience of any supplier in the application of statistical and machine learning technologies to business problems. FICO Analytic Cloud embraces machine learning, statistics, optimization and business rules management, in the context of a well managed environment. It also serves as a marketplace for developers of analytic solutions and users who have a need for them.

Google Prediction API can integrate with App Engine, and the RESTful API is available through libraries for many popular languages,


such as Python, JavaScript and .NET. The Prediction API provides pattern-matching and machine learning capabilities.

HPE Haven OnDemand provides more than 60 APIs and services that deliver deep learning analytics on a wide range of data, including text, audio, image, social, web and video.

IBM’s Watson Analytics offers predictive analytics and data visualization, and a conversational type interface. It automatically does the hard math to show the most relevant facts, patterns and relationships. A free version is offered with limitations on data volumes.

Microsoft Machine Learning Studio features a library of sample experiments, R and Python packages and best-in-class algorithms from Microsoft businesses like Xbox and Bing. Azure ML also supports R and Python custom code, which can be dropped directly into your workspace.

PurePredictive uses AI to automate the machine learning process. The platform automates the discovery of data transformations and higher order relationships between data features and automatically accommodates data drift. The cloud platform scales automatically for workloads so that data sets of virtually any size can be accommodated. The models are easily consumed through web services, and can be automatically maintained to deal with changes in business conditions.

Yottamine includes comprehensive capabilities for importing and applying models in a real-world setting. It is designed to allow users to take full advantage of scalable on-demand cloud computing, and eliminate the high costs of a dedicated infrastructure. The Yottamine Predictive Service allows for building models or making predictions in two simple steps. Via integration with scalable cloud computing it provides high speed and efficiency. It also conforms with the SSL industry security standard and can export to PMML those models that are supported by the standard. Data scientists can connect to and control Yottamine Predictive Web Services using the R programming language via the YottamineR package.


Commercial Predictive Analytics Platforms

Alteryx is an interesting product, filling a void that most other analytic platforms do not address. It allows skilled business users and analysts to analyze their data using a combination of data visualization and predictive analytics tools. It also supports spatial analytics where location is important.

Angoss provides a broad suite of analytical tools and solutions which cover predictive analytics, text analytics, document exploration, scorecards and advanced modeling in an integrated environment. It has recently expanded the capability of its products considerably to meet general model building needs and provide model management capabilities. It is truly an enterprise solution for analytical needs, providing the infrastructure and management controls necessary to deploy predictive models into a production environment. Angoss provides an analytics platform of considerable breadth and capability, and joins an elite group of no more than five suppliers who truly offer enterprise capability.

Dataiku Data Science Studio (DSS) provides a productive data-to-production analytics workbench. Many of the time consuming steps that slow down analytic model production have been automated and streamlined, enabling skilled business analysts and data scientists to quickly prepare and understand data, build a model, and quickly integrate it into a production environment. Data scientists and business analysts alike will find that Dataiku DSS provides a productive, flexible, analytics workbench that is capable of addressing virtually all analytical needs. A free community version is available that is limited to 100,000 rows of data.

Datameer provides the means to bring large data sets that display great diversity (text, relational, streaming data etc) into the Hadoop environment. Once there Datameer supplies the data wrangling tools necessary to profile and transform data into useable formats. Analysts and data scientists can then use the large set of algorithms provided by Datameer to create predictive models and perform other forms of quantitative analysis. Finally, business users can visualize data using a wide variety of charts and dashboards, and more advanced visualizations such as clustering and decision trees can be created via an easy-to-use interface.

FICO arguably has the most experience of any supplier in the application of statistical and machine learning technologies to business problems. FICO Analytic Cloud embraces machine learning, statistics, optimization and business rules management, in the context of a well managed environment. It also serves as a marketplace for developers of analytic solutions and users who have a need for them.

IBM SPSS provides statistical and data mining capability, and has associated predictive applications, particularly in marketing. Most likely to be of interest to existing IBM customers.

KNIME is arguably the premier open source platform for creating predictive models. It provides a drag and drop graphical interface for the creation of workflows of any complexity. The basic edition is free, and various extensions are available, along with support and training for business use.

RapidMiner is similar to KNIME, in that it delivers a graphical, drag and drop interface for the creation of predictive models. Like KNIME it provides hundreds of functions to prepare data, process data, find patterns, and visuals to display graphs and charts. The commercial edition comes with big data support and greater sophistication.


Salford Systems delivers a portfolio of products capable of traditional descriptive analytics and predictive analytics. What distinguishes this company is the lack of hype around the technology it offers and a willingness to discuss the pitfalls and traps associated with predictive analytics – which ironically is a prerequisite for successful analytics. The SPM Salford Predictive Modeler supports both traditional descriptive and predictive analytics. CART (Classification and Regression Tree) supports classification and the discovery of hidden relationships between attributes. It embodies a number of proprietary methods and patented extensions to the original work done in the eighties.

SAS predictive analytics is part of the very broad analytics capability offered by SAS. It not only offers the data preparation and data mining tools, but also a run-time environment. The main complaint is the cost.

Skytree will primarily appeal to large organizations with some experience in the use of machine learning technologies; and in fact Skytree positions itself as ‘The Machine Learning Company’. Its Infinity platform provides the tools for analysts and data scientists to create predictive models, in a manner that is both productive and effective. Productivity benefits come from the automation of many tedious tasks that typically require weeks of fine tuning, and the effectiveness of the resulting predictive models is due mainly to the extensive data exploration tools, model performance monitoring, and the fact that most of the algorithms have been designed from the ground up, specifically for big data analytics.

Statistica, now part of Dell, is an integrated predictive analytics tool, which includes text analytics and statistical analysis. It has recently gone through a makeover in Version 13, with support for streaming data. There’s a lot going on here and it is well worth a look.


Open Source Data Mining Platforms

Apache Mahout spark machine learning supports mainly three use cases: Recommendation mining takes users’ behavior and from that tries to find items users might like. Clustering takes e.g. text documents and groups them into groups of topically related documents. Classification learns from existing categorized documents what documents of a specific category look like and is able to assign unlabelled documents to the (hopefully) correct category.

Jubatus is the first open source platform for online distributed machine learning on the data streams of Big Data. Jubatus uses a loose model sharing architecture for efficient training and sharing of machine learning models, by defining three fundamental operations: Update, Mix, and Analyze, in a similar way to the Map and Reduce operations in Hadoop.

KEEL is an open source (GPLv3) Java software tool to assess evolutionary algorithms for Data Mining problems including regression, classification, clustering, pattern mining and so on. It contains a big collection of classical knowledge extraction algorithms, preprocessing techniques (training set selection, feature selection, discretization, imputation methods for missing values, etc.), Computational Intelligence based learning algorithms, including evolutionary rule learning algorithms based on different approaches (Pittsburgh, Michigan and IRL, …), and hybrid models such as genetic fuzzy systems, evolutionary neural networks, etc.

KNIME is arguably the premier open source platform for creating predictive models. It provides a drag and drop graphical interface for the creation of workflows of any complexity. The basic edition is free, and various extensions are available, along with support and training for business use.

MLlib is Spark’s machine learning (ML) library. Its goal is to make practical machine learning scalable and easy. It consists of common learning algorithms and utilities, including classification, regression, clustering, collaborative filtering, dimensionality reduction, as well as lower-level optimization primitives and higher-level pipeline APIs.

mlpy is a Python module for Machine Learning built on top of NumPy/SciPy and the GNU Scientific Libraries. mlpy provides a wide range of state-of-the-art machine learning methods for supervised and unsupervised problems and it is aimed at finding a reasonable compromise among modularity, maintainability, reproducibility, usability and efficiency.

Orange is a very capable open source visualization and set of data mining tools with an easy to use interface. Most analysis can be achieved through its visual programming interface (drag and drop of widgets) and most visual tools are supported including scatterplots, bar charts, trees, dendrograms and heatmaps. A large number (over 100) of widgets are supported.

R - is described as a project for statistical computing, but it might be more accurately described as the lingua franca of analytics. A large number of commercial analytics tools support R (Oracle, Microsoft, FICO, TIBCO, Angoss …), simply because it does pretty much everything. The out-of-the-box runtime environment is fairly slow, and so vendors such as TIBCO, Microsoft and Lavastorm provide speeded up runtime support. The great thing about R is that almost everything is possible; the downside is that it’s a language and needs to be programmed. However various packages exist to make the whole thing more productive and easier.

RapidMiner is similar to KNIME, in that it delivers a graphical, drag and drop interface for the creation of predictive models. Like KNIME it provides hundreds of functions to prepare data, process data, find patterns,


and visuals to display graphs and charts. This link is for the open source version. The commercial edition comes with big data support and greater sophistication.

Rattle (the R Analytical Tool To Learn Easily) presents statistical and visual summaries of data, transforms data into forms that can be readily modeled, builds both unsupervised and supervised models from the data, presents the performance of models graphically, and scores new datasets.

scikit learn provides many easy to use tools for data mining and analysis. It is built on Python and specifically NumPy, SciPy and matplotlib.

TANAGRA is a free set of data mining tools for academic and research purposes. It offers several data mining methods from the areas of exploratory data analysis, statistical learning, machine learning and databases.

WEKA is a set of data mining tools incorporated into many other products (Knime and Rapid Miner for example), but it is also a stand-alone platform for many data mining tasks including preprocessing, clustering, regression, classification and visualization. The support for data sources is extended through Java Database Connectivity, but the default format for data is the flat file.


Free Data Mining Clustering Tools

Data mining clustering methods are usually used for exploratory purposes and to gain insights into data. The most commonly used methods include k-Means, agglomerative hierarchical clustering and increasingly self organizing maps. The tools listed below support a wide variety of methods, some of which are particular to a given toolset.

Databionic ESOM Tools is a suite of programs to perform data mining tasks like clustering, visualization, and classification with Emergent Self-Organizing Maps (ESOM).

KEEL is an open source (GPLv3) Java software tool to assess evolutionary algorithms for Data Mining problems including regression, classification, clustering, pattern mining and so on. It contains a big collection of classical knowledge extraction algorithms, preprocessing techniques (training set selection, feature selection, discretization, imputation methods for missing values, etc.), Computational Intelligence based learning algorithms, including evolutionary rule learning algorithms based on different approaches (Pittsburgh, Michigan and IRL, …), and hybrid models such as genetic fuzzy systems, evolutionary neural networks, etc.

KNIME is a general purpose data mining platform with over 1000 different operators. Its support for clustering includes k-Means, k-Medoids, Hierarchical Clustering, Fuzzy c-Means and SOTA (self organizing tree algorithm).

mlpy is a Python module for Machine Learning built on top of NumPy/SciPy and the GNU Scientific Libraries. mlpy provides a wide range of state-of-the-art machine learning methods for supervised and unsupervised problems and it is aimed at finding a reasonable compromise among modularity, maintainability, reproducibility, usability and efficiency. Its clustering support includes k-Means.

Orange is a (relatively) easy to use data mining platform with support for hundreds of operators. The clustering methods it supports include k-Means, SOM (self organizing maps), Hierarchical Clustering, and MDS (multi-dimensional scaling).

PermutMatrix provides data visualizations with clustering and seriation analysis. It supports hierarchical clustering.

RapidMiner Community Edition is perhaps the most widely used visual data mining platform and supports Hierarchical Clustering, Support Vector Clustering, Top Down Clustering, k-Means and k-Medoids.

scikit learn provides many easy to use tools for data mining and analysis. It is built on Python and specifically NumPy, SciPy and matplotlib, and supports many clustering methods including k-Means, affinity propagation, spectral clustering, Ward hierarchical clustering, agglomerative clustering (hierarchical), Gaussian mixtures and Birch clustering.

Tanagra supports a variety of methods including k-Means, SOM, LVQ (Learning Vector Quantizers) and Hierarchical Clustering.

Weka is a platform whose algorithms are used in many other toolkits (RapidMiner for example). Clustering methods include DBSCAN, COBWEB (creates a classification tree), k-Means and the EM (expectation maximization) algorithm.
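Since k-Means appears in almost every tool on this page, a minimal sketch of the method in plain Python may help readers who have not met it. It is an illustration only and is not taken from any of the tools listed: the `k_means` function, its evenly spaced choice of starting centroids (real tools usually seed randomly or with k-means++) and the toy data are all assumptions made for the example.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two 2-D points."""
    return (a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2


def k_means(points, k, iterations=20):
    """Cluster 2-D points into k groups with plain k-Means (Lloyd's algorithm)."""
    # Simplified seeding (an assumption): take k evenly spaced input points.
    centroids = [points[i * len(points) // k] for i in range(k)]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach every point to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                new_centroids.append((
                    sum(m[0] for m in members) / len(members),
                    sum(m[1] for m in members) / len(members),
                ))
            else:
                # An empty cluster keeps its previous centroid.
                new_centroids.append(centroids[c])
        if new_centroids == centroids:
            break  # centroids have stopped moving
        centroids = new_centroids
    return labels, centroids


# Hypothetical toy data: two well separated groups of three points each.
data = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (8.0, 8.0), (8.2, 7.9), (7.8, 8.1)]
labels, centroids = k_means(data, k=2)
```

The first three points end up in one cluster and the last three in the other. The packaged tools add better seeding, multiple restarts and support for many dimensions, which is why they are preferable to hand-written code for real data.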


Text Analytics

Text Mining and Text Analytics Platforms

Virtually all the text analytics platforms listed below primarily address customer and social data. This is the signature application for text analytics in business, and so there are many options available. Many provide a cloud based service, and some support on-premises deployment. So instead of repeating on every entry in the list that customer based analytics is the focus, the information gives a summary of the text analytics capability offered.

AlchemyAPI (now part of IBM) provides cloud based text analytics services to support sentiment analysis, marketing, content discovery, business intelligence, and most tasks where natural language processing is needed. An on-site capability can also be provided if needed. The capabilities offered by AlchemyAPI go beyond those most large organizations could build in-house, not least because the training set used to model language is 250 times larger than Wikipedia. Innovative techniques using deep learning technologies (multi-layered neural networks) also go well beyond most of the competition, and AlchemyAPI distinguishes itself by using the technology for image recognition in addition to text analytics.

KnowledgeREADER from Angoss is part of a broad suite of analytics tools and specifically addresses text analytics in the context of customer oriented and marketing applications. It majors on visual representation including dashboards for sentiment and text analysis, and also provides a somewhat unique map of the results of association mining to display words that tend to occur together. Many of the advanced features make use of the embedded Lexalytics text analytics engine – widely recognized as one of the best in class. Entity, theme and topic extraction are supported along with decision and strategy trees for profiling, segmentation and predictive modeling. The visual graphing of sentiment trends is supported, and individual documents can be marked up for sentiment.

Attensity majors on social analytics, but also offers a general purpose text analytics engine. Four major components define the offering:

• Attensity Pipeline collects data from over one hundred million social sources as input for analysis.

• Attensity Respond provides a mechanism for responding to social comment.

• Attensity Analyze allows text in emails, call-center notes, surveys and other sources of text to be analyzed for sentiment and trend.

• Attensity Text Analytics provides an underlying engine that embraces several unique NLP technologies and a semantic annotation server for auto-classification, entity extraction and exhaustive extraction. It comes with good integration tools too so that the results of text analytics can be merged with structured data analytics.

Three horizontal solutions are offered for marketing, customer service and IT.

Basis Technology delivers a variety of products and services based on multilingual text analytics and digital forensics. The Rosette platform provides morphological analysis, entity extraction, name matching and name translation in fields such as information retrieval, government intelligence, e-discovery and financial compliance.

The Rosette search and text analytics technology comes in five distinct functional units:

• RLI – Rosette Language Identifier – automatic language and character encoding identification.


• RBL – Rosette Base Linguistics – many search engines have used RBL to provide essential linguistic services such as tokenization, lemmatization, decompounding, part-of-speech tagging, sentence boundary detection, and noun phrase detection. Currently supports 40 languages.

• REX – Rosette Entity Extractor – finds entities such as names, places, organizations and dates.

• RNI – Rosette Name Indexer – matches the names of people, places and organizations written in different languages against a single, universal index.

• RNT – Rosette Name Translator – provides multilingual name translation through a combination of dictionaries, linguistic algorithms and statistical inference.

A Rosette plug-in is available for Lucene and Solr search technologies and Basis Technology provides solutions for government, social media monitoring, financial compliance, e-discovery and enterprise search.

Buzzlogix provides cloud based natural language processing and machine learning APIs to support sentiment analysis, data mining, content discovery, business intelligence, and most tasks where natural language processing is leveraged. Buzzlogix provides a free version as well as commercial versions.

The various functions are called via a REST API and address the following types of data applications:

• Sentiment Analysis – classifies text as positive, negative or neutral.

• Twitter Sentiment Analysis – classifies Twitter tweets as positive, negative or neutral.

• Subjectivity Analysis – categorizes text as subjective or objective based on the content and the writing style.

• Topic Classification – you can automatically tag text into topic categories based on the IAB QAG Taxonomy Standards.

• Gender Detection – This NLP API identifies whether content is written by or targets a man or woman based on the words, context and idioms found in the content.

• Keyword Extraction – enables you to extract from an arbitrary document, webpage or data stream all the keywords and word-combinations along with their occurrences in the text.

• Entity Extraction – named entity recognition for identifying people, places, things, and other named items.

Clarabridge provides a text analytics solution with a customer experience focus. This embraces various sources of customer information including surveys, emails, social media and the call centre.

The technology addresses three essential steps in the analysis of textual information. It supports the aggregation of information from most sources imaginable, allows the information to be processed for linguistic content and the creation of categories, and finally provides a rich user interface so the results of analysis can be seen. There are three main areas of functionality:

• Clarabridge Analyze comes with the ability to tune classification models and the way sentiment is scored, and provides various reports and visualizations.

• Clarabridge Act provides a customer engagement environment for all customer facing employees by providing real-time dashboards and the mechanisms to address customer feedback.

• Clarabridge Intelligence Platform carries out analysis and is essentially a natural


language processing (NLP) engine. Connections to other applications in the organization can be facilitated by Clarabridge Connect, which includes out-of-the-box connectors for Radian 6, Lithium and other applications.

Clustify, used mainly by legal firms, groups related documents into clusters, providing an overview of the document set and aiding with categorization. This is done without preconceptions about keywords or taxonomies: the software analyzes the text and identifies the structure that arises naturally. Clustify can cluster millions of documents on a desktop computer in less than an hour, bringing organization to large projects.

Clustify identifies important keywords used for clustering and reports frequency information so that clusters which contain a set of specified keywords can be browsed. It also identifies a representative document for each cluster, allowing decisions to be made on other documents in the same cluster. Uses of Clustify include taxonomy development, search engine enhancement, litigation and ad targeting. The technology is built on proprietary mathematical models which measure the similarity of documents.

Connexor provides a suite of text analytics tools which embrace a wide variety of NLP methods. These include metadata discovery, name recognition, sentiment detection, language identification, automatic document summarization, document classification, text cleansing, language analysis (10 European languages) and machine translation.

Connexor’s Machinese libraries transform text into linguistically analyzed structured data. This includes Machinese Phrase Tagger which splits text into word units, Machinese Syntax which shows the relationship between words and concepts and Machinese Metadata which will extract information in 10 languages.

Solutions are offered for organizations operating in defence and security, life sciences and media, and Connexor works with a wide variety of organizations (software houses, businesses, systems integrators etc.) to deliver NLP capability. A free NLP web service is available.

DatumBox provides a cloud based machine learning platform with 14 separate areas of functionality, much of which is relevant to text analytics. The various functions are called via a REST API and address the following types of application:

• Sentiment Analysis – classifies documents as positive, negative or neutral.

• Twitter Sentiment Analysis – specifically targeted at Twitter data.

• Subjectivity Analysis – classifies documents as subjective (personal opinions) or objective.

• Topic Classification – documents assigned to 12 thematic categories.

• Spam Detection – documents labeled as spam or nospam.

• Adult Content Detection.

• Readability Assessment – based on terms and idioms.

• Language Detection.

• Commercial Detection – commercial or non-commercial based on keywords and expressions.

• Educational Detection – based on context.

• Gender Detection – written by or targeting men/women based on words and idioms.

• Keyword Extraction.

• Text Extraction – extraction of important information from a web page.

• Document Similarity – to detect web page duplicates and plagiarism.


Eaagle provides text mining technology to marketing and research professionals. Data is loaded into Eaagle and a variety of reports and charts are returned showing relevant topics and words, word clouds, and other statistics. Both online and Windows based software is offered. The Windows offering is called Full Text Mapper, with good visuals to explore topics and various word statistics.

ExpertSystem majors on semantic analysis, employing a semantic analysis engine and complete semantic network for a complete understanding of text, finding hidden relationships, trends and events, and transforming unstructured information into structured data. Its Cogito semantic technology offers a complete set of features including: semantic search and natural language search, text analytics, development and management of taxonomies and ontologies, automatic categorization, extraction of data and metadata, and natural language processing.

At the heart of Cogito is the Sensigrafo, a rich and comprehensive semantic network, which enables the disambiguation of terms, a major stumbling block in many text analytics technologies. Sensigrafo allows Cogito to understand the meaning of words and context (Jaguar: car or animal?; apple: the fruit or the company?) – a critical differentiator between semantic technology and traditional keyword and statistics based approaches.

Sensigrafo is available in different languages and contains more than 1 million concepts, more than 4 million relationships for the English language alone, and a rich set of attributes for each concept. The Cogito semantic network includes common words, which comprise 90% of all content, and rich vertical domain dictionaries including Corporate & Homeland Security, Finance, Media & Publishing, Oil & Gas, Life Sciences & Pharma, Government and Telecommunications, providing rich contextual understanding that improves precision and recall in the process of information retrieval and management.

The technology has found uses in CRM applications, product development, competitive intelligence, marketing and many activities where knowledge sharing is critical.

Fluent Editor 2014 from Cognitum is a comprehensive tool for editing and manipulating complex ontologies that uses Controlled Natural Language. Fluent Editor provides a more suitable alternative to XML-based OWL editors. Its main feature is the usage of Controlled English as a knowledge modeling language. Supported via Predictive Editor, it prohibits one from entering any sentence that is grammatically or morphologically incorrect, and actively helps the user during sentence writing. Controlled English is a subset of English with restricted grammar and vocabulary in order to reduce the ambiguity and complexity of the language.

Intellexer provides a family of tools for natural language search, document management, document comparison and the summarization and analysis of documents and web content. Nine solutions are offered, all reasonably priced:
• Name recognition – extracts names (named entities) and defines relations between them.
• Summarizer – extracts main ideas in a document and creates a short summary.
• Categorizer – for automatic document categorization.
• Comparator – compares documents and determines the degree of proximity between them.
• Question-answering – looks for documents which answer a natural language query.
• Natural language interface – generates Boolean queries for any application.


• Related Facts – an IE plugin for Google search that selects 5 main topics and supplements them with related facts.
• Summarizer plug-in for IE – summarizes web pages and extracts concepts.
• PDF Converter – to incorporate PDF documents into text processing.

KBSPortal provides an NLP capability which includes tagging and categorizing user submitted web site content, text summarization, document linking by entities, vulgarity detection, sentiment rating and association of sentiment with products and people. This functionality is available as a web service or through purchase of source code for in-house deployment.

Lexalytics is one of the forerunners in text analytics and its Salience text analytics engine is used in market research, social media monitoring, survey analysis/voice of customer, enterprise search and public policy applications. The functionality offered by Salience includes sentiment analysis, named entity extraction, theme extraction, entity-level sentiment analysis, summarization and facet and attribute extraction. The Salience engine can be integrated into other business applications via a flexible set of APIs, and can be tuned for very specific tasks and high levels of performance.

Another essential component in the Lexalytics approach is data directories. This effectively provides a parameter driven environment with files to set up relationship patterns, sentiment analysis, and the creation of themes. Non-English support is provided through this mechanism. Each directory can be configured to support a particular task, delivering considerable flexibility and power.

Leximancer uses ‘concepts’ as a primary analytic structure, and these are automatically identified by the software without need for existing structures such as taxonomies or ontologies. Analysis is presented through a variety of useful visualizations, with drilling down to individual documents. It is used in survey analysis, market research, social media monitoring, customer loyalty and forensic analysis.

Leximancer Enterprise runs on a multi-user server providing users with a browser interface, and also provides a REST web services interface for application integration. A desktop version is available as a stand-alone environment, or users can access the LexiPortal via a web browser for a web based service (charging based on usage). Moderately priced academic versions are also available.

Linguamatics provides an NLP capability with either in-house or cloud based implementation. A search engine approach to mining text comes with a good query interface and the ability to drill down to individual documents. A domain knowledge plug-in supports taxonomies, thesauri and ontologies.

The technology is widely used in life sciences and healthcare and the on-line service provides access to content in this domain. A web services API supports most programming languages.

Linguasys primarily satisfies the need to process text in multiple languages – and by multiple we mean English, Arabic, Chinese, German, French, Hebrew, Indonesian, Japanese, Korean, Malay, Spanish, Pashto, Persian, Portuguese, Russian, Thai, Vietnamese, Urdu and others under development. This may well be unique in the world of natural language processing, and is possible because all languages are transformed into a large collection of concepts, each with its own identifier. It is the concepts which link all the languages together. The concept ‘mobile phone’ for example has the same concept number in all languages and is given identifier 26300, along with all variants that mean the same thing – ‘cellular phone’ for example.
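The concept-identifier idea described for Linguasys can be illustrated with a small lookup table. The following is only a sketch under stated assumptions: the identifier 26300 for ‘mobile phone’ comes from the text, but the table layout, the non-English entries, the second concept and the function names are hypothetical and do not represent the vendor's actual data or API.

```python
# Toy concept table keyed by (language code, surface term).
# 26300 is the identifier quoted in the text for 'mobile phone';
# every other entry and identifier here is a hypothetical illustration.
CONCEPTS = {
    ("en", "mobile phone"): 26300,
    ("en", "cellular phone"): 26300,
    ("de", "handy"): 26300,               # hypothetical entry
    ("fr", "téléphone portable"): 26300,  # hypothetical entry
    ("en", "laptop"): 90001,              # hypothetical identifier
}


def concept_id(lang, term):
    """Return the concept identifier for a term, or None if unknown."""
    return CONCEPTS.get((lang, term.strip().lower()))


def same_concept(a, b):
    """True when two (lang, term) pairs map to the same known concept."""
    id_a, id_b = concept_id(*a), concept_id(*b)
    return id_a is not None and id_a == id_b


def variants(cid):
    """List every (lang, term) pair that shares a concept identifier."""
    return sorted(key for key, value in CONCEPTS.items() if value == cid)


print(concept_id("en", "Cellular Phone"))
print(same_concept(("en", "mobile phone"), ("de", "Handy")))
print(variants(26300))
```

Because matching happens on identifiers rather than on words, a query phrased in one language can retrieve documents written in another, which is the property the entry above highlights.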


Luminoso is a cloud based text analytics service that calls upon a multi-lingual capability. Many of the current problems associated with text analytics (ambiguity for example) are at least partly solved by Luminoso. A variety of useful reports and visualizations provide users with a particularly good interface.

MeaningCloud comes as an Excel add-in and a cloud based service. It provides feature level sentiment analysis, supports multiple languages, is easily integrated into other applications and automatically codes and classifies documents of any kind. It comes with a generous free plan; otherwise fees are based on usage.

NetOwl provides both text and entity analytics in the cloud and in private deployments. Text analytics includes Extractor to perform entity extraction, DocMatcher which compares and categorizes documents according to user defined concepts, and TextMiner for mining large amounts of text. Entity analytics is used to accurately match and identify names – important in many areas, including CRM, anti-fraud and national security. This includes NameMatcher to identify name variants from large multicultural and multilingual name databases. EntityMatcher performs identity resolution on similar databases.

PolyAnalyst from Megaputer is a data and text mining platform which embraces the complete analytics lifecycle. Megaputer provides two separate software packages for text analysis. PolyAnalyst performs linguistic and semantic text analysis and coding, clustering and categorization of documents, entity extraction, visualization of patterns, automated or manual taxonomy creation, text OLAP, and generation of interactive graphical reports on results. TextAnalyst provides a list of the most important keywords in a document, a set of related keywords for each word, and the ability to automatically summarize a document or perform natural language queries.

PolyVista provides easy-to-use software and services to improve customer experience, enable competitive analysis, and facilitate predictive analytics. PolyVista helps its customers extract actionable insights from social data. Without additional cost, PolyVista bundles its technology with professional services in a business model called Solution as a Service. PolyVista offers POC (proof of concept), as well as one-time, monthly, and multi-month contracts to meet its clients’ needs and budgets. Several intuitive user interfaces are offered, and the company has been in business since 2001.

Provalis provides a suite of text analytics tools, each of which facilitates a particular type of text analysis. QDA Miner (available in a free Lite version) supports qualitative analysis with coding, annotation, retrieval and analysis of document and image collections. WordStat on the other hand supports the extraction of themes and trends, taxonomy and ontology creation, clustering and proximity analysis, and machine learning tools for document classification. SimStat, as the name suggests, provides statistical analysis tools for text analysis. These three components can be purchased separately or as ProSuite, and all components are integrated with each other. WordStat now integrates with Stata – the statistical analysis platform.

SAS Text Analytics is part of the very broad analytics capability offered by SAS. Several modules are provided including:
• SAS Contextual Analysis – for the creation of document classification models.
• SAS Enterprise Content Categorization – for automated content categorization, and various add-on modules add extra capability as needed.
• SAS Ontology Management – to define semantic relationships.


• SAS Sentiment Analysis
• SAS Text Miner – use of various supervised and unsupervised techniques.

SIFT is a powerful cloud-based text analytics and reporting platform that can scale to process thousands of text comments in a matter of minutes. SIFT uses natural language processing and machine learning technology to analyze unstructured customer feedback such as customer comments, emails, product reviews, call center transcripts and open-ended survey responses. Core features include the ability to import and export data, sentiment analysis, topic detection, integration with Salesforce, data visualization and report collaboration. There is no need for hard-coded logic or complex IT integrations: SIFT comes ready out of the box, and can be used by marketing, product and support teams regardless of their level of natural language processing expertise.

Statistica Text Miner is part of the extensive Statistica statistical analysis and data mining product set. Extensive pre-processing options are available with stemming and stub lists for most European languages. ‘Bag of words’ type analysis can be carried out with input to the data mining capabilities of Statistica.


Free NLP Machine Learning APIs

AlchemyAPI (developer) provides advanced cloud-based and on-premise text analysis infrastructure that eliminates the expense and difficulty of integrating natural language processing systems into your application, service, or data processing pipeline.

Free Natural Language Processing Service - 100% free service including sentiment analysis, content extraction, and language detection.

Idilia brings artificial intelligence to the cloud for the developer community. Idilia’s core technology analyzes text (including social media streams and queries) and determines the meanings of words in context, including both common words and proper nouns. The company describes this core technology as a major breakthrough in AI and the most accurate in the world. Third party application developers can leverage three core components through its Web Services API. The free version is throttled to 30 day quotas - e.g. 100K sense analyses.

Jeannie (Voice Actions) is a virtual assistant with over three million downloads, now also available via API. The objective of this service is to provide you and your robot with the smartest response to any natural language question, just like Siri. Jeannie API provides an interface to the standard functions that users demand of modern voice assistants, like chatting, controlling your devices, looking up information, creating messages and much more.

MeaningCloud comes as an Excel add-in and a cloud based service. It provides feature level sentiment analysis, supports multiple languages, is easily integrated into other applications and automatically codes and classifies documents of any kind. It comes with a generous free plan; otherwise fees are based on usage.

ML Analyzer - Text Classification, Article Summarization, Sentiment Analysis, Stock symbol extraction, Person Names Extractor, Language Detection, Locations Extractor, Adult content Analyzer.

Sentiment works by examining individual words and short sequences of words (n-grams) and comparing them with a probability model. The probability model is built on a prelabeled test set of IMDb movie reviews. It can also detect negations in phrases, e.g. the phrase “not bad” will be classified as positive despite having two individual words with a negative sentiment.

Text-Processing - Sentiment analysis, stemming and lemmatization, part-of-speech tagging and chunking, phrase extraction and named entity recognition.

Tweet Sentiment API lets you send small chunks of text - for example Tweets or Facebook Posts - and returns information about whether the sentiment is probably positive, negative or neutral, along with a score indicating how strong that probability is. It does this using natural language processing, breaking the text down into component parts and comparing those parts to a massive database of human-rated historical data.

WebKnox REST API offers functionality such as Named Entity Recognition, Part-of-Speech Tagging, Question Answering, and Language Detection. WebKnox Keywords API gives you access to keyword data. You can find information about keywords such as how often they are searched and how many competing pages there are on Google. You may also get a list of keywords related to your given keywords. Several other APIs are available.

Wit.ai is a natural language processing platform recently acquired by Facebook. It allows developers to add voice interfaces to any application by converting speech or text into structured data.
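The n-gram approach described for the Sentiment service above, where “not bad” is read as positive even though each word alone is negative, can be sketched as follows. This is an illustration only: it swaps the trained probability model for a small hand-written score table, and every score, table name and function name is a hypothetical assumption rather than anything taken from the service itself.

```python
import re

# Hypothetical scores. A real service would learn probabilities from
# labelled reviews instead of using a hand-written table like this one.
UNIGRAM_SCORES = {"good": 1.0, "great": 1.5, "bad": -1.0, "not": -0.5, "boring": -1.2}
BIGRAM_SCORES = {("not", "bad"): 1.0, ("not", "good"): -1.0}


def tokenize(text):
    """Lower-case the text and keep only alphabetic word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def score(text):
    """Sum scores, letting a known bigram override its two unigrams."""
    tokens = tokenize(text)
    total = 0.0
    i = 0
    while i < len(tokens):
        pair = tuple(tokens[i:i + 2])
        if pair in BIGRAM_SCORES:
            total += BIGRAM_SCORES[pair]
            i += 2  # both words are consumed by the bigram
        else:
            total += UNIGRAM_SCORES.get(tokens[i], 0.0)
            i += 1
    return total


def classify(text):
    """Map the summed score to a positive, negative or neutral label."""
    s = score(text)
    if s > 0:
        return "positive"
    if s < 0:
        return "negative"
    return "neutral"


print(classify("Not bad"))
print(classify("The film was not good."))
```

Checking bigrams before unigrams is what lets the negation win: without the bigram table, “not bad” would sum two negative word scores and be labelled negative.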


Decision Support

Free and Open Source Decision Support Software

Decision Explorer is a proven tool for managing “soft” issues – the qualitative information that surrounds complex or uncertain situations. It allows you to capture thoughts and ideas in detail, to explore them, and to gain new understanding and insight. The result is a fresh perspective, and time saved through increased productivity, release of creativity and a better focus. A demo version that supports 30 concepts can be freely downloaded.

Lumina Analytica provides an intuitive interface to build influence diagrams, and its Intelligent Arrays provide easy management of multidimensional arrays. Monte Carlo analysis provides estimation of risk and uncertainty, and a free version, Analytica 101, can be downloaded for perpetual use, with support for 101 variables and other objects.

OpenRules is a general purpose Business Rules and Decision Management System available as an open source product. It allows subject matter experts and software developers to create, test, execute, and maintain enterprise-class decision support applications.

Paramount Decisions is a cloud based decision tool that breaks down the decision-making process into 8 easy-to-follow steps. After users complete a decision, they can generate reports to communicate the rationale of the decision to other stakeholders. It comes with strong collaboration features, version control and strong security. A free subscription is available for up to 5 decisions, but does not support collaboration.

PROSUITE Decision Support System provides a coherent and consistent software platform for assessing economic, environmental and social aspects of technologies within a life cycle approach. The PROSUITE DSS is a practical, ready-to-use software tool that is available for free as an open source application. It offers a consistent set of assessment tools, with a common user interface, that can be used either as stand-alone tools or integrated in the platform. It is designed in such a way that it can be updated on a regular basis within and beyond the project, in order to accommodate newly developed methods, and newly developed inventory data for current and future situations.


Social Network and Media Analysis

Free and Open Source Social Network Analysis Software

Social network analysis software generally uses network and graph theory to investigate social structures both analytically and visually. The main constructs are nodes (the entities we are interested in – typically people), and the ties or edges that connect them. Many of the products listed here are open source with a license that permits free use in commercial settings.

Cuttlefish is a network workbench application that visualizes networks with some of the best known layout algorithms. It allows detailed visualizations of the network data, interactive manipulation of the layout, graph editing and process visualization, as well as different input methods and TeX output using TikZ and PSTricks. It is developed by the Chair of Systems Design of ETH Zürich, a research group that applies a complex system approach to investigate economic and social networks.

Cytoscape is an open source software platform for visualizing complex networks and integrating these with any type of attribute data. Many Apps are available for various kinds of problem domains, including bioinformatics, social network analysis, and semantic web.

Gephi is an interactive visualization and exploration platform for all kinds of networks and complex systems, dynamic and hierarchical graphs. It runs on Windows, Linux and Mac OS X. Gephi is open-source and free. It supports all types of networks – directed, undirected and mixed – and is capable of handling very large network graphs of up to one million nodes. Various metrics are supported including betweenness, closeness, diameter, clustering coefficient, average shortest path, PageRank and HITS. Dynamic filtering allows edges and/or nodes to be selected based on network structure or data. Ideal for social network analysis, link analysis and biological network analysis. Perhaps the most advanced of the open source tools.

Graph-tool is an efficient Python module for manipulation and statistical analysis of graphs (a.k.a. networks). Contrary to most other Python modules with similar functionality, the core data structures and algorithms are implemented in C++, making extensive use of template metaprogramming, based heavily on the Boost Graph Library. This gives it a level of performance that is comparable (both in memory usage and computation time) to that of a pure C/C++ library. Graph-tool has its own layout algorithms and versatile, interactive drawing routines based on cairo and GTK+, but it can also work as a very comfortable interface to the excellent graphviz package.

GraphChi can run very large graph computations on just a single machine, by using a novel algorithm for processing the graph from disk (SSD or hard drive). Programs for GraphChi are written in the vertex-centric model, proposed by GraphLab and Google’s Pregel. GraphChi runs vertex-centric programs asynchronously (i.e. changes written to edges are immediately visible to subsequent computation), and in parallel. GraphChi also supports streaming graph updates and removal of edges from the graph. The promise of GraphChi is to make web-scale graph computation, such as analysis of social networks, available to anyone with a modern laptop.

JUNG, the Java Universal Network/Graph Framework, is a software library that provides a common and extendible language for the modeling, analysis, and visualization of data that can be represented as a graph or network. It is written in Java, which allows JUNG-based applications to make use of the


extensive built-in capabilities of the Java API, as well as those of other existing third-party Java libraries.

The JUNG architecture is designed to support a variety of representations of entities and their relations, such as directed and undirected graphs, multi-modal graphs, graphs with parallel edges, and hypergraphs. It provides a mechanism for annotating graphs, entities, and relations with metadata. This facilitates the creation of analytic tools for complex data sets that can examine the relations between entities as well as the metadata attached to each entity and relation.

libSNA is an open-source library for Social Network Analysis, licensed under the LGPL. This library is under active development by Abe Usher in hopes that it will serve as a catalyst for improving the field of Social Network Analysis.
• Easy to use Python API
• Flexible data import options
• Scalable
• Built-in reports
• Built-in data export capabilities
• Open source – easily extended
• Fast processing time (efficient use of graph algorithms)

MeerKat is suitable for many types of network analysis, including social networks. It provides filtering mechanisms, interactive editing, support for dynamic networks and various metrics, and automatically detects communities.
• network visualization with multiple layouts
• interactive network editing
• support for dynamic networks (multiple timeframes)
• network filtering
• computes different measures of centrality (network metrics and statistics)
• automatically detects communities (community mining)
• shows community dynamics in time (community event analysis and visualization)

Netlytic is a cloud-based text analyzer and social networks visualizer. Netlytic can automatically summarize large volumes of text and discover and visualize social networks from conversations on social media sites such as Twitter, YouTube, blog comments, online forums and chats. It is designed to help researchers and others to understand an online group’s operation, identify key and influential constituents, and discover how information and other resources flow in a network.

NetworKit is a growing open-source toolkit for high-performance network analysis. NetworKit is a Python module. It implements efficient graph algorithms, many of them parallel to utilise multicore architectures. These are meant to compute standard measures of network analysis, such as degree sequences, clustering coefficients and centrality. High-performance algorithms are written in C++ and exposed to Python via the Cython toolchain.

NetworkX is a Python language software package for the creation, manipulation, and study of the structure, dynamics, and functions of complex networks. Features include:
• Python language data structures for graphs, digraphs, and multigraphs.
• Many standard graph algorithms
• Network structure and analysis measures
• Generators for classic graphs, random graphs, and synthetic networks
• Nodes can be “anything” (e.g. text, images, XML records)


• Edges can hold arbitrary data (e.g. weights, time-series)

NodeXL is a free, open-source template for Microsoft® Excel® 2007, 2010 and 2013 that makes it easy to explore network graphs. With NodeXL, you can enter a network edge list in a worksheet, click a button and see your graph, all in the familiar environment of the Excel window.
• Flexible Import and Export – import and export graphs in GraphML, Pajek, UCINet, and matrix formats.
• Direct Connections to Social Networks – import social networks directly from Twitter, YouTube, Flickr and email, or use one of several available plug-ins to get networks from Facebook, Exchange, Wikis and WWW hyperlinks.
• Zoom and Scale – zoom into areas of interest, and scale the graph’s vertices to reduce clutter.
• Flexible Layout – use one of several “force-directed” algorithms to lay out the graph, or drag vertices around with the mouse. Have NodeXL move all of the graph’s smaller connected components to the bottom of the graph to focus on what’s important.
• Easily Adjusted Appearance – set the colour, shape, size, label, and opacity of individual vertices by filling in worksheet cells, or let NodeXL do it for you based on vertex attributes such as degree, betweenness centrality or PageRank.
• Dynamic Filtering – instantly hide vertices and edges using a set of sliders; for example, hide all vertices with degree less than five.
• Powerful Vertex Grouping – group the graph’s vertices by common attributes, or have NodeXL analyse their connectedness and automatically group them into clusters. Make groups distinguishable using shapes and colour, collapse them with a few clicks, or put each group in its own box within the graph. “Bundle” intergroup edges to make them more manageable.
• Graph Metric Calculations – easily calculate degree, betweenness centrality, closeness centrality, eigenvector centrality, PageRank, clustering coefficient, graph density and more.
• Task Automation – perform a set of repeated tasks with a single click.

Pajek – suite of programs for analysis and visualization of very large networks.

R is general purpose analytics software, but several libraries are available for social network analysis. These include degreenet, RSiena, PAFit, igraph, sna, network, tnet, ergm, Bergm, hergm, latentnet and networksis. Each provides specialized functionality, and for people familiar with R they represent a rich set of resources.

Social Networks Visualizer (SocNetV) is a cross-platform, user-friendly tool for the analysis and visualization of Social Networks. It lets you construct networks (mathematical graphs) with a few clicks on a virtual canvas, or load networks of various formats (GraphML, GraphViz, Adjacency, Pajek, UCINET, etc). SocNetV also enables you to modify the social networks, analyse their social and mathematical properties and apply visualization layouts for relevant presentation.

Furthermore, random networks (Erdos-Renyi, Watts-Strogatz, ring lattice, etc) and known social network datasets (e.g. Padgett’s Florentine families) can be easily recreated. SocNetV also offers a built-in web crawler, allowing you to automatically create networks from links found in a given initial URL.

The application computes basic graph properties, such as density, diameter, geodesics and distances (geodesic lengths), connectedness, eccentricity, etc. It also calculates advanced structural measures for social network analysis such as centrality and prestige indices (e.g. closeness centrality, betweenness centrality, information centrality, power centrality, proximity and rank prestige), triad census, cliques, clustering coefficient, etc.
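Several entries above mention graph metrics such as density, degree centrality and clustering coefficient. As a rough illustration of what these measures compute, here is a minimal pure-Python sketch. The toy edge list and all function names are hypothetical; the tools listed in this directory (NetworkX, igraph, SocNetV and others) provide their own tested implementations, which should be preferred for real work.

```python
from itertools import combinations

# Hypothetical toy friendship network, given as an undirected edge list.
EDGES = [("ann", "bob"), ("ann", "cat"), ("bob", "cat"),
         ("cat", "dan"), ("dan", "eve")]


def build_adjacency(edges):
    """Return {node: set(neighbours)} for an undirected edge list."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def density(adj):
    """Fraction of all possible undirected edges that are present."""
    n = len(adj)
    if n < 2:
        return 0.0
    m = sum(len(nbrs) for nbrs in adj.values()) / 2
    return 2 * m / (n * (n - 1))


def degree_centrality(adj):
    """Degree of each node divided by the maximum possible degree (n - 1)."""
    n = len(adj)
    if n < 2:
        return {node: 0.0 for node in adj}
    return {node: len(nbrs) / (n - 1) for node, nbrs in adj.items()}


def clustering_coefficient(adj, node):
    """Share of a node's neighbour pairs that are themselves connected."""
    nbrs = adj[node]
    if len(nbrs) < 2:
        return 0.0
    pairs = list(combinations(nbrs, 2))
    linked = sum(1 for a, b in pairs if b in adj[a])
    return linked / len(pairs)


adj = build_adjacency(EDGES)
print(density(adj))
print(degree_centrality(adj))
print(clustering_coefficient(adj, "cat"))
```

In this toy network "cat" has the highest degree centrality because it touches three of the four other nodes, while its clustering coefficient is low because only one of its three neighbour pairs is directly connected.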


SocNetV offers various layout algorithms based either on prominence indices or dynamic models (e.g. Spring-embedder) for meaningful visualizations of social networks. There is also comprehensive documentation, both online and while running the application, which explains each feature and algorithm of SocNetV in detail.

Socioviz is a web-based Twitter analytics platform powered by Social Network Analysis metrics. It allows users to query Twitter conversations and find the most influential people based on who replies to whom and who mentioned whom. Social Network graphs (user mention and hashtag copresence) are visualized and can be exported in Gephi format (gexf) for further analysis.

statnet is a suite of software packages for network analysis that implement recent advances in the statistical modeling of networks. The analytic framework is based on Exponential family Random Graph Models (ergm). statnet provides a comprehensive framework for ergm-based network modeling, including tools for model estimation, model evaluation, model-based network simulation, and network visualization. This broad functionality is powered by a central Markov chain Monte Carlo (MCMC) algorithm.

SUBDUE is a graph-based knowledge discovery system that finds structural, relational patterns in data representing entities and relationships. SUBDUE represents data using a labeled, directed graph in which entities are represented by labeled vertices or subgraphs, and relationships are represented by labeled edges between the entities. SUBDUE uses the minimum description length (MDL) principle to identify patterns that minimize the number of bits needed to describe the input graph after being compressed by the pattern. SUBDUE can perform several learning tasks, including unsupervised learning, supervised learning, clustering and graph grammar learning. SUBDUE has been successfully applied in a number of areas, including bioinformatics, web structure mining, counter-terrorism, social network analysis, aviation and geology.

Tulip is an information visualization framework dedicated to the analysis and visualization of relational data. Tulip aims to provide the developer with a complete library, supporting the design of interactive information visualization applications for relational data that can be tailored to the problems he or she is addressing. Written in C++, the framework enables the development of algorithms, visual encodings, interaction techniques, data models, and domain-specific visualizations. One of the goals of Tulip is to facilitate the reuse of components and allow developers to focus on programming their application. This development pipeline makes the framework efficient for research prototyping as well as the development of end-user applications.

Visone is a software tool intended for research and teaching in social network analysis. It is specifically designed to allow experts and novices alike to apply innovative and advanced visual methods with ease and accuracy. The main features include:
• interactive graphical user interface, tailored to social networks
• innovative network visualizations
• support of unconfirmed relations
• available in Java for Windows, Linux, and MacOS
• import and export of standard formats for social network data
• publication-quality export in JPEG, PDF, SVG, Metafile, and other formats


Commercial Social Network Analysis Tools

Social network analysis tools generally use network and graph theory to investigate social structures both analytically and visually. The main constructs are nodes (the entities we are interested in, typically people), and the ties or edges that connect them. Commercial products tend to be more targeted at the end-user, and are generally easier to use than open source products (but not always).

Centrifuge Analytics is a big data discovery technology that provides the power and flexibility to connect, visualize and collaborate without complex data integration, costly services or a data science degree. It combines sophisticated link-analysis, interactive visualizations and discovery features to dramatically simplify data pattern and connection recognition.

Commetrix is a Software Framework for Dynamic Network Visualization and Analysis that supports Community Moderators, Members, and Network Researchers. It provides easy exploratory yet comprehensive access to network data and allows for:
• Extracting virtual communities in electronic communication networks
• Analyzing dynamic network change, properties, lifecycles, and structures
• Creating rich expert network maps or recommendation systems from communication logs or other network data sources (including surveys)
• Searching, filtering, navigating social corpora, like e-mail, discussions
• Understanding and utilizing your social networks
• Tracing dissemination of topics or properties through the network
• Extending to all sources of network data (e.g. collaborative work on electronic documents or contents, electronic project collaboration, VoIP telephony/Contact Centers, Instant Messaging, E-Mail, Discussion, …)

InFlow performs network analysis and network visualization in one integrated product, with no passing of files back and forth between different programs as with other tools. What is mapped in one window is measured in the other window: what you see is what you measure. InFlow excels at what-if analysis - change the network, get new metrics - in just 2 clicks of the mouse. You do not need to be an expert in statistics to use InFlow. InFlow runs under Windows 7, 8, and 10, and also on the Mac.

Keynetiq is an innovative platform for Organizational Network Analysis. It’s designed to map, visualize and analyze networks of people and relations between them, revealing how organizations really operate in day-to-day business. By mapping and analyzing the patterns of information flow, knowledge exchange, collaboration, decision flow and other key interactions, Keynetiq gives customized, actionable insights. At an individual level, Keynetiq identifies employees with key network roles - informal opinion leaders, change agents, rising stars, natural mentors, connection brokers, top influencers and knowledge hubs. Understanding their strengths and network position allows optimal resource allocation and process improvement.

Netlytic is a cloud-based text analyzer and social networks visualizer. Netlytic can automatically summarize large volumes of text and discover and visualize social networks from conversations on social media sites such as Twitter, Youtube, blog comments, online forums and chats. It is designed to help researchers and others to understand an online group’s operation, identify key and influential constituents, and discover how information and other resources flow in a network.

NetMiner is a premium software tool for Exploratory Analysis and Visualization of Network Data. NetMiner allows users to explore network data visually and interactively, and helps detect underlying patterns and structures of the network. NetMiner has a comprehensive data model expressing various types of nodes, links, node attributes and link attributes. Through its data model, NetMiner is able to represent most social, natural and physical phenomena as network data.

Polinode has the ability to map, visualize and analyze network data. Targeted mainly at organizations and consultants, applications range from identifying change agents and finding critical links through to promoting diversity and improving workplace layouts. Features include:
• Easily upload any existing network data you have in either Excel, JSON or GEXF formats
• Completely customize the color and appearance of the network visualization, including node size, edge thickness, node colors, background color, label appearance and a lot more
• Animated and computationally efficient Force Atlas layout algorithm

Sentinel Visualizer provides users with insight into patterns and trends hidden in data. Its database driven data visualization platform lets users quickly see multi-level links among entities and model different relationship types. Advanced drawing and redrawing features generate optimized views to highlight the most important entities. Social Network Analysis (SNA) metrics reveal the most interesting suspects in complex webs. With advanced filtering, squelching, weighted relationship types, shortest path analysis, timelines, and integrated geospatial features, Sentinel Visualizer helps users maximize the value of their data.
• Integrated knowledgebase, link analysis, social network analysis, geospatial, and timelines
• Industry standard database format
• Supports network multi-user environments
• Laptop deployable
• Multiple monitor support
• Originally created for the intelligence community

SVAT is for data visualization, fraud investigation, and more. It provides user-friendly, cost-effective visualization of links and flows between subjects. A chronological overview of the visualized dataset is crucial in many cases. SVAT supports two different timeline views with a lot of options to choose from. It can mine data from structured or unstructured sources and crunch it to reveal hidden patterns.

TouchGraph Navigator is 100% Java technology. A range of different relationship types are supported and edges can be directed, undirected, and can show flow in both directions. Text and numerical attributes can be associated with nodes and edges. Tables display the attributes and allow sorting, and images can be associated with nodes. Advanced cluster computation reveals inherent groupings, and co-citation and co-occurrence analysis clarifies dense networks. Desktop and server editions available.

XANALYS specialises in providing powerful software capabilities, from threat assessment and investigative major case management to advanced crime and fraud analytics. It helps to manage multi-jurisdiction major crime investigations, evaluate and analyse suspicious financial transactions, capture and act upon intelligence reports, and disclose evidence in a court-ready format to ensure successful outcomes.
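
As a rough illustration of the node-and-edge model these tools share, the sketch below builds a small directed graph of who mentions whom and ranks people by how often they are mentioned (in-degree), one of the simplest influence measures. The names, the sample edges and the helper function are invented for illustration; none of the products listed here necessarily computes influence this way.

```python
from collections import Counter

# Hypothetical sample data: each pair is (author, person mentioned).
MENTIONS = [
    ("ann", "bob"),
    ("carl", "bob"),
    ("dee", "bob"),
    ("bob", "ann"),
    ("carl", "ann"),
    ("ann", "dee"),
]


def in_degree_ranking(edges):
    """Rank nodes by in-degree (number of incoming edges), highest first."""
    counts = Counter(target for _source, target in edges)
    # Include nodes that are never mentioned so every node appears in the result.
    for source, _target in edges:
        counts.setdefault(source, 0)
    # Sort by descending count, then by name so ties are deterministic.
    return sorted(counts.items(), key=lambda item: (-item[1], item[0]))


ranking = in_degree_ranking(MENTIONS)
for name, score in ranking:
    print(f"{name}: mentioned {score} time(s)")
```

Real products typically go well beyond in-degree, for example with betweenness or eigenvector-style measures, but the underlying data structure is the same list of nodes and edges.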


Developer and Operations

Free Python Data Visualization Tools

Several Python data visualization tools, some aimed at scientific work and others with a more commercial touch.

Bokeh is a Python interactive visualization library that targets modern web browsers for presentation. Its goal is to provide elegant, concise construction of novel graphics in the style of D3.js, but also deliver this capability with high-performance interactivity over very large or streaming datasets. Bokeh can help anyone who would like to quickly and easily create interactive plots, dashboards, and data applications.

ggplot is a plotting system for Python based on R’s ggplot2 and the Grammar of Graphics. It is built for making professional-looking plots quickly with minimal code. Making plots is very repetitive: draw this line, add these colored points, then add these, etc. Instead of re-using the same code over and over, ggplot implements them using a high-level but very expressive API. The result is less time spent creating your charts, and more time interpreting what they mean.

matplotlib is a Python 2D plotting library which produces publication quality figures in a variety of hardcopy formats and interactive environments across platforms. matplotlib can be used in Python scripts, the Python and IPython shell (à la MATLAB® or Mathematica®), web application servers, and six graphical user interface toolkits.

Mayavi2 is a general purpose, cross-platform tool for 3-D scientific data visualization. Its features include:
• Visualization of scalar, vector and tensor data in 2 and 3 dimensions.
• Easy scriptability using Python.
• Easy extendibility via custom sources, modules, and data filters.
• Reading several file formats: VTK (legacy and XML), PLOT3D, etc.
• Saving of visualizations.
• Saving rendered visualization in a variety of image formats.
• Convenient functionality for rapid scientific plotting via mlab.

Pygal is a Python SVG chart generator with support for bar charts, line charts, XY charts, pie charts, radar charts, box plots, dot charts, pyramid charts, funnel charts, gauge charts, worldmap charts and country charts. There are many options for chart customization.

Seaborn is a library for making attractive and informative statistical graphics in Python. It is built on top of matplotlib and tightly integrated with the PyData stack, including support for numpy and pandas data structures and statistical routines from scipy and statsmodels. Some of the features that seaborn offers are:
• Several built-in themes that improve on the default matplotlib aesthetics
• Tools for choosing color palettes to make beautiful plots that reveal patterns in your data
• Functions for visualizing univariate and bivariate distributions or for comparing them between subsets of data
• Tools that fit and visualize linear regression models for different kinds of independent and dependent variables
• Functions that visualize matrices of data and use clustering algorithms to discover structure in those matrices
• A function to plot statistical timeseries data with flexible estimation and representation of uncertainty around the estimate
• High-level abstractions for structuring grids of plots that let you easily build complex visualizations

Vispy is a high-performance interactive 2D/3D data visualization library. Vispy leverages the computational power of modern Graphics Processing Units (GPUs) through the OpenGL library to display very large datasets. Applications of Vispy include:
• High-quality interactive scientific plots with millions of points.
• Direct visualization of real-time data.
• Fast interactive visualization of 3D models (meshes, volume rendering).
• OpenGL visualization demos.
• Scientific GUIs with fast, scalable visualization widgets (Qt or IPython notebook with WebGL).


Free ETL Tools for MySQL

Apatar provides connectivity to many popular applications and data sources (Oracle, MS SQL, MySQL, Sybase, DB2, MS Access, PostgreSQL, XML, InstantDB, Paradox, BorlandJDataStore, Csv, MS Excel, Qed, HSQL, Compiere ERP, SalesForce.Com, SugarCRM, Goldmine, any JDBC data sources and more). It supports bi-directional integration, is platform independent and can be used without coding via the Visual Job Designer. An on-demand version supports Salesforce and QuickBooks.

CloverETL supports a wide range of data sources including CSV, Excel, databases via JDBC drivers, LDAP, Lotus Notes, Quickbase, Infobright, web services, XML and JSON. Functionality includes filters, joins, lookup, aggregate, sort, dedup, rollup, normalize, pivot and much more. The interface is primarily visual without coding. There is a free community edition and various commercial packages.

Jaspersoft ETL is easy to deploy and out-performs many proprietary ETL software systems. It is used to extract data from your transactional system to create a consolidated data warehouse or data mart for reporting and analysis.

KETL is a premier, open source ETL tool. The data integration platform is built with portable, Java-based architecture and open, XML-based configuration and job language. KETL features successfully compete with major commercial products available today. Highlights include:
• Support for integration of security and data management tools
• Proven scalability across multiple servers and CPUs and any volume of data
• No additional need for third party schedule, dependency, and notification tools
• KETL is open sourced under a combination of both the GNU Lesser General Public License (LGPL) and the GNU General Public License (GPL).

Pentaho’s Data Integration, also known as Kettle, delivers powerful extraction, transformation, and loading (ETL) capabilities. You can use this stand-alone application to visually design transforms and jobs that extract your existing data and make it available for easy reporting and analysis.

Talend Open Source Data Integrator provides multiple solutions for data integration, in both open source and commercial editions. Talend offers an Eclipse-based interface, drag-and-drop design flow, and broad connectivity with more than 400 pre-configured application connectors to bridge between databases, mainframes, file systems, web services, packaged enterprise applications, data warehouses, OLAP applications, Software-as-a-Service, Cloud-based applications, and more.
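
The extract, transform, load pattern that all of these tools implement can be sketched in a few lines. The example below uses only the Python standard library, with an in-memory SQLite database standing in for MySQL; the sample data, table name and column names are assumptions made for illustration and do not reflect how any listed product works internally.

```python
import csv
import io
import sqlite3

# Hypothetical source data; a real job would read a file or query a database.
RAW_CSV = """name,amount
 Alice ,10.50
BOB,3
carol,not-a-number
"""


def extract(text):
    """Extract: parse CSV text into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: tidy names, convert amounts, and skip rows that fail to parse."""
    clean = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # reject rows with a non-numeric amount
        clean.append((row["name"].strip().title(), amount))
    return clean


def load(rows, conn):
    """Load: write the cleaned rows into a target table."""
    conn.execute("CREATE TABLE IF NOT EXISTS sales (name TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales (name, amount) VALUES (?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")  # SQLite stands in for MySQL here
load(transform(extract(RAW_CSV)), conn)
loaded = conn.execute("SELECT name, amount FROM sales ORDER BY name").fetchall()
print(loaded)
```

The visual designers in the tools above generate or configure the equivalent of these three steps, adding scheduling, error handling and connectors that a hand-written script would otherwise need.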


Free Data Ingestion Tools

Chukwa is an open source data collection system for monitoring large distributed systems. Chukwa is built on top of the Hadoop Distributed File System (HDFS) and Map/Reduce framework and inherits Hadoop’s scalability and robustness. Chukwa also includes a flexible and powerful toolkit for displaying, monitoring and analysing results to make the best use of the collected data.

Cloudera Morphlines is an open source framework that reduces the time and skills necessary to build or change Search indexing applications. A morphline is a rich configuration file that simplifies defining an ETL transformation chain. These transformation chains support consuming any kind of data from any kind of data source, processing the data, and loading the results into Cloudera Search. Executing in a small embeddable Java runtime system, morphlines can be used for Near Real Time applications, as well as batch processing applications.

Databus is a source-agnostic distributed change data capture system, which is an integral part of LinkedIn’s data processing pipeline. The Databus transport layer provides latencies in the low milliseconds and handles throughput of thousands of events per second per server while supporting infinite look back capabilities and rich subscription functionality.

Fluentd is an open source data collector, which lets users unify data collection and consumption for a better use and understanding of data. Fluentd tries to structure data as JSON as much as possible: this allows Fluentd to unify all facets of processing log data: collecting, filtering, buffering, and outputting logs across multiple sources and destinations. 300+ community-contributed plugins connect dozens of data sources and data outputs.

Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. It uses a simple extensible data model that allows for online analytic application.

Gobblin is a universal data ingestion framework for extracting, transforming, and loading large volumes of data from a variety of data sources, e.g., databases, REST APIs, FTP/SFTP servers, filers, etc., onto Hadoop. Gobblin handles the common routine tasks required for all data ingestion ETLs, including job/task scheduling, task partitioning, error handling, state management, data quality checking, data publishing, etc. Gobblin ingests data from different data sources in the same execution framework, and manages metadata of different sources all in one place. This, combined with other features such as auto scalability, fault tolerance, data quality assurance, extensibility, and the ability to handle data model evolution, makes Gobblin an easy-to-use, self-serving, and efficient data ingestion framework.

Heka is a tool for collecting and collating data from a number of different sources, performing “in-flight” processing of collected data, and delivering the results to any number of destinations for further analysis. Heka is written in Go, but Heka plugins can be written in either Go or Lua. The easiest way to compile Heka is by sourcing the build script in the root directory of the project, which will set up a Go environment, verify the prerequisites, and install all required dependencies. The build process also provides a mechanism for easily integrating external plug-in packages into the generated …

A single Kafka broker can handle hundreds of megabytes of reads and writes per second from thousands of clients. Kafka is designed to allow a single cluster to serve as the central data backbone for a large organization. It can be elastically and transparently expanded without downtime. Data streams are partitioned and spread over a cluster of machines to allow data streams larger than the capability of any single machine and to allow clusters of coordinated consumers.

Apache Sqoop is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases. You can use Sqoop to import data from external structured datastores into Hadoop Distributed File System or related systems like Hive and HBase. Conversely, Sqoop can be used to extract data from Hadoop and export it to external structured datastores such as relational databases and enterprise data warehouses.
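
The idea of partitioning a stream across machines, described for Kafka above, can be illustrated without a broker. The sketch below assigns each message to a partition by hashing its key, so messages with the same key always land in the same partition and keep their relative order. The partition count, the sample messages and the use of CRC32 are assumptions for illustration; this is not Kafka's client API or its actual partitioner.

```python
import zlib
from collections import defaultdict

NUM_PARTITIONS = 3  # assumed value for the example


def partition_for(key, num_partitions=NUM_PARTITIONS):
    """Map a message key to a partition using a stable hash (CRC32)."""
    return zlib.crc32(key.encode("utf-8")) % num_partitions


def distribute(messages, num_partitions=NUM_PARTITIONS):
    """Group (key, value) messages by partition, preserving arrival order."""
    partitions = defaultdict(list)
    for key, value in messages:
        partitions[partition_for(key, num_partitions)].append((key, value))
    return dict(partitions)


# Hypothetical events keyed by user id.
events = [
    ("user-1", "login"),
    ("user-2", "login"),
    ("user-1", "click"),
    ("user-1", "logout"),
]
partitions = distribute(events)
for number, batch in sorted(partitions.items()):
    print(number, batch)
```

Because each partition can live on a different machine and be read by a different consumer, total throughput can grow with the number of partitions while per-key ordering is retained.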


IoT Platforms

These IoT platforms address a wide range of needs, from connection and management of consumer devices through to plant monitoring and management. Most address the development of apps, data analytics and connectivity, but with each product the emphasis is different.

Arrayent is a SaaS IoT platform primarily targeted at allowing manufacturers to create connected products. Customers can control products via mobile devices, and collected data is used for warranty and understanding customer behavior. The platform comprises an embedded software agent to connect to the Arrayent cloud, cloud software for analytics and toolkits for mobile and web application development.

Axeda provides an advanced Cloud-Based Service and Software for managing connected products and machines and implementing innovative Machine-to-Machine (M2M) and Internet of Things (IoT) applications. It supports the conversion of machine and sensor data into business information, machine and sensor data integration, and M2M and IoT application development.

B-Scada provides a software system used to automate and/or monitor industrial processes in various vertical markets: manufacturing, transportation, energy management, building automation, and any other field where real time operational data is used to make decisions. HMI/SCADA data visualization solutions work by giving operators access to real-time operational data in graphic form through a Human Machine Interface (HMI), allowing them to remotely monitor and control the processes from a centralized location.

Carriots is a platform as a service (PaaS) for M2M and IoT projects. It supports the collection of data from connected things, and the building of applications. It also provides a development environment, APIs and hosting for IoT projects. The functionality includes device management, listeners, rules, triggers, SDK application engine, custom alarms and custom control panels.

EVRYTHNG is an award-winning IoT cloud platform that connects any consumer product to the Web and manages real-time data to drive applications. Smart products don’t just deliver connected experiences and support services, they share data with enterprise systems and other device clouds for smarter ROI. EVRYTHNG manages digital identity data in an intelligent IoT ‘smart products’ cloud to connect consumer products to the Web and drive real-time applications.

Exosite platform supports the creation of connected devices, processing in the cloud and data management tools. It uses well-established, open Internet protocols that are thoroughly tested, widely supported, and continuously improved upon. It also provides example code for a variety of devices to help developers communicate with the platform. Exosite’s APIs allow users to easily get data to and from any existing platform or business system, so they can use data in the way they need to.

IoT Eclipse is an implementation of IoT standards such as MQTT, CoAP, LWM2M and OneM2M. The Eclipse IoT Working Group is an industry collaboration of companies that invest in and promote an open source community for IoT. Eclipse IoT provides building blocks that sit on top of open standards and protocols and provide additional services and frameworks for device management, wired/wireless communication and vertical solutions like home automation.

IoT-Ticket has a history going back to 1999 and has considerable experience in connecting devices and machines to the IoT. It can be used to create condition, maintenance or status reports to keep track of assets, and provides a web-based Desktop to monitor and control them in real time.

Jasper provides a cloud-based software platform for the Internet of Things (IoT) and, more specifically, to enable product businesses to become IoT service businesses. The platform is designed to help organizations launch, manage, and monetize the deployment of the Internet of Things worldwide.

MachineShop is an API-centric IoT platform company that simplifies the way organizations access, understand and act upon real world events in real time. MachineShop is a collection of hundreds of unique services with RESTful APIs - like building blocks - to create and integrate applications and solutions.

Muzzley is a platform that enables users to access connected devices via mobile devices. It’s a single entry point for consumers to interact, in one single app, with all their connected devices, access their activities and receive important notifications from them. IoT developers and/or businesses can get started with Muzzley by creating their own connected apps and widgets (or use the ones Muzzley provides off-the-shelf) and use them to engage with users and/or customers.

Raco Wireless supports the creation, deployment and management of M2M solutions. The Omega Management Suite is a powerful cloud-based dashboard that provides partners of Raco Wireless with custom control of their connected solutions. The Position Logic platform focuses on making it easy for businesses to address enterprise asset management, in-house asset tracking solutions, and GPS asset tracking in a fully customizable solution. With Omega DevCloud, web developers with common programming knowledge can transfer their skills to quickly build IoT/M2M applications in a matter of minutes.

Thingsquare is a software platform that product makers use to connect their products with smartphones. The wireless firmware, which runs on a wireless chip inside a product, automatically creates a self-healing wireless network that automatically detects and heals wireless problems. The Thingsquare platform has out-of-the-box support for Android and iOS apps that deal with device discovery and setup, user signup and login sessions. Apps can be either mobile web HTML5 apps, to hit the market quickly, or native apps, for a more in-depth investment.

Xively is an enterprise IoT platform and application solution for connected businesses. Xively simplifies the way companies securely and robustly CONNECT their products & users, MANAGE IoT data at scale, and ENGAGE more closely with their customers, users & partners.


Data Management only in high-end customized software solutions Data Quality Tools • Quickly combine customer, vendor, and sales lead information AB Initio provides significant data quality • Big data capability on data sets up to 100 tools as part of a broader suite of products million records for building, running and integrating enterprise applications. The end-to-end • Advanced record linking technology approach to data quality is based on design provides ability to create data warehouses patterns using Ab Initios coupled • Quick data profiling tool technologies. They are all architected • Scalable configurations for deduplication together including the Co>Operating System, and record linking, suppression, the Enterprise Meta>Environment (EME), the enhancement, extraction, and Business Rules Environment (BRE), and the standardization of business and customer Data Profiler. Using Ab Initio, a company can data implement a complete data quality program including detection, remediation, reporting, Link and consolidate customer data quickly and alerting. and easily. Data Ladder offers a free trial for users. Blazent’s 5-step data evolution process begins with data atomization, which breaks Data Manager is a program which allows you down IT data, regardless of its source, to its to process and manipulate your data in a most granular level. It then enriches the data easy and logical manner using a graphical with identity management, relationship interface. It reads and writes delimited files analysis, purification, and historicity. To create such as comma separated files (CSV) and the master source of truth, Blazent integrates also can read data from ODBC Data Sources. with more than 230 discrete data sources, It also allows you to construct a conceptual from ITSM systems like ServiceNow to design on how you are going to process your procurement, billing, operational tool stacks, data and transform it into another form. You or even shadow IT sources like spreadsheets. 
form your design by adding functional nodes Powered by high-performance technologies and linking them such that the links form the including Active MQ, Cassandra, Hadoop data flow through nodes on a graphical work and Spark, Blazent’s big data engine is area. optimized for scalability and near real-time Each node performs a single function on your data processing. data, once it completes it passes your data to Data Ladder’s data quality tools offer very the node it is linked to and the process high levels of matching speed and accuracy continues until the data encounters a output for the business user at an affordable price. node. You can form a simple design or a The company recently beat IBM and SAS in complicated design with hundreds of nodes matching accuracy and speed for enterprise and multiple input and output nodes. level data cleansing in an independent study. Datamartist is a fast, easy to use, visual data The company’s flagship software suite profiling and transformation tool. It includes a DataMatch includes the following features: data profiling tool for analyzing format, types, • Clean, deduplicate, and match data with completeness and value counts. Understand advanced technology previously available data quality issues clearly and quickly. Data can be transformed in a graphical ETL environment with a library of different data

© butleranalytics.com 46 Business Analytics Directory

blocks. Export out to files or directly to cleaning it; transforming it from one format databases. into another; extending it with web services; and linking it to databases like Freebase. DataPreparator is a free software tool designed to assist with common tasks of Paxata is an enterprise platform providing the data preparation(or data preprocessing) in tools to significantly speed up data cleansing, data analysis and data mining. and offers a contemporary solution that DataPreparator provides: employs a big data infrastructure and automated techniques which exploit machine • A variety of techniques for data cleaning, learning methods. The net result is a self- transformation, andexploration service data preparation platform that can be • Chaining of preprocessing operators into a used by business analysts and skilled flow graph (operator tree) business users to considerably speed up the • Handling of large volumes of data (since data preparation task. data sets are not stored in the computer The core capability of Paxata leverages memory) Hadoop and specifically Spark, so that large • Stand alone tool independent of any other scale in-memory processing is available for tools the machine learning algorithms that give Paxata much of its power. Paxata can be • User friendly graphical user interface deployed on premises or accessed as a DQ Cloud Services from Uniserv is unusual cloud service. The on-premises deployment in that it is a cloud based service. Connectors requires a Hadoop environment (either exist for a wide range of business dedicated or shared). applications, including Microsoft Dynamics Syncsort offers a variety of data processing CRM, Oracle Siebel, Salesforce and SAP tools including fast data sort, ETL, ETL Business Suite. It is able to enhance data in Optimization, SQL migration - and several applications (telephone numbers, location others. etc) by accessing public databases. 
These functions cover bank data checks, email Talend’s open source data quality tools are validation, entity titles (businesses or embedded in Talend Open Studio for Data individuals), and address correction. Quality, a popular open source data quality application. Main features include: DQGlobal provides a suite of data quality software including deduplication, data • Free to download and use under an migration, an API with a set of data quality Apache license. improvement functions, and specific utilities • Very easy to learn, with an Eclipse-based (e.g. formatting addresses). graphical workspace geared toward drag iManageData is an essential tool for ’n drop functionality. commercial data pre-processing. • Versatile enough to work in any IT iManageData helps you create cleaner, more architecture, with more than 400 built-in useful information from your data. With its connectors that enable easy access to comprehensive selection of data sources, major databases, file formats, and filtering, data conversions and mathematical package enterprise applications. transformations, iManageData provides • Comprehensive data quality improvement quality data for any analytical application. functionality, including support for data OpenRefine (formerly Google Refine) is a standardization, de-duplication, and powerful tool for working with messy data: enrichment.

© butleranalytics.com 47 Business Analytics Directory

• Support for the design and deployment of reusable enterprise data quality services, including real-time data cleansing services to keep up the quality of incoming data.
• A web-based data quality monitoring and reporting portal to help spread data quality awareness and a data quality culture across your enterprise.

The data quality tools in Talend Open Studio for Data Quality allow users, without having to write any code, to perform data quality analysis tasks ranging from simple statistical profiling, to analysis of text fields and numeric fields, to validation against standard patterns (email address syntax, credit card number formats) or custom patterns of a user's own creation.

Tamr provides a sophisticated, but easy to use, data mapping, cleansing, integration and unified data access platform. Underneath the hood there is some very smart technology. Support for big data and semi-structured data types will be welcomed by many users employing those data sources and types.

TS Quality is the data cleansing and standardization component of the Trillium Software System, a robust, scalable, highly available and easily deployable solution for mission-critical enterprise data quality. Trillium's data quality services deploy in batch or real-time through an on-site or hosted solution, using the same rule sets and standards across an unlimited number of applications and systems.

Uniserve Data Quality Service Hub augments and corrects data in many business applications. Correction of address information, email address checking, telephone number checking, and bank data verification are all included. The platform can operate in batch and/or real-time. Data Analyzer establishes the current state of data quality, Data Cleansing corrects data, Data Protection ensures updates are adequate and from authorised people, and Data Governance detects unusual activity.

WinPure's Clean & Match data quality software provides advanced data matching and sophisticated data cleansing at an affordable price for any business size. Clean & Match has been specially designed to be used by anyone, not just IT professionals. Combining a simple-to-use interface and powerful features, it is ideal for cleaning, correcting and deduplicating mailing lists, databases, spreadsheets and CRMs. Main features include:

• Fast fuzzy matching technology to identify duplicate records, previously only available in enterprise software.
• Real-time statistics with 3D charts and scoring system to help populate missing data.
• Complete suite of data quality tools including standardizing, profiling, filtering and de-duplication.
• Works with business and consumer data, local and international.
• Scalable editions for any business size.
• World-class customer support.

WinPure offer a free 21-day trial so you can try the software using your own data.
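
Fuzzy matching for duplicate detection, which several of the data quality tools above advertise, can be illustrated with a small sketch. This is not the algorithm used by WinPure or any other listed product. It is a minimal example built on the Python standard library's difflib, and the function names, the 0.85 threshold and the sample records are all assumptions made for illustration.

```python
from difflib import SequenceMatcher
from itertools import combinations


def normalize(record: str) -> str:
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    kept = "".join(ch if ch.isalnum() or ch.isspace() else " " for ch in record.lower())
    return " ".join(kept.split())


def similarity(a: str, b: str) -> float:
    """Similarity score in [0, 1] from difflib's ratio on the normalized text."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def find_duplicates(records, threshold=0.85):
    """Return index pairs whose similarity meets the threshold (an assumed value)."""
    return [
        (i, j)
        for (i, a), (j, b) in combinations(enumerate(records), 2)
        if similarity(a, b) >= threshold
    ]


# Hypothetical mailing list records, invented for this example.
mailing_list = [
    "John Smith, 12 High Street, London",
    "Jon Smith, 12 High St., London",
    "Mary Jones, 4 Mill Lane, Leeds",
]
duplicates = find_duplicates(mailing_list)
print(duplicates)
```

Comparing every pair of records is only practical for small lists. Larger data sets would normally be grouped first (for example by postcode) so that only records in the same group are compared.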

Solutions

Sales Recommendation Engines

The availability of large volumes of data, and the development of analytics technologies, mean there is every reason to add intelligence to sales and marketing processes. Here is a list of over 30 platforms that include recommendation engines.

4-Tell offers several products including Recs4 Web for ecommerce up-sell and cross-sell, Recs4 Mobile, Recs4 Email, Recs4 Ads and Target4 Ads and Lists. The 4-Tell Boost Dashboard is a powerful merchandising tool to view, manage and control the recommendations.

Apptus eSales' personalisation algorithms work across all your screen real-estate, and recommendations will select the right products to promote to customers.

Attraqt is an online visual merchandising tool that helps retailers deliver a step change in their conversion rate, and drive online sales success. It supports product sequencing, search technology that learns from customer behaviour, and product recommendation.

Barilliance creates a personalised shopping experience. Every step of the purchase journey is supported with real-time website personalisation, and the platform generates cart abandonment emails and personalised product recommendations.

Baynote supports collection of customer data, A/B testing, and reporting for online merchandising.

bitREC boosts eCommerce conversion rates by providing shoppers with a personalized and context-aware shopping experience.

blueknow is a SaaS solution that transforms a complex product into a simple and easy to use service. The recommendation engine is simple to put in place, and will yield results in the first month. Users also benefit from client support, which is free, immediate, multichannel and multilingual.

BrainSINS provides the right personalisation tools to increase eCommerce conversion and sales.

Celebrus collects, contextualizes and delivers data about how individuals are behaving across a brand's digital channels including websites, mobile apps, social and streaming media. The tagging-free software streams this online behavioral data in real-time, or near real-time, into the client's chosen target database or technology. Global blue-chip clients use the data captured by Celebrus to deepen their customer knowledge through digital intelligence, uncover hidden insight through big data analytics and power individualization through contextual marketing.

Certona delivers customer experiences tailored to each individual using continuous behavioural profiling and predictive technology, resulting in increased engagement and conversions.

Clerk increases sales and conversions with an intelligent search engine, and increases loyalty and customer lifetime value with personalized retention emails and offers that make customers return.

click2customer QuickStart Recommendations provides targeted recommendations to your customers. QuickStart Rescues allows users to save customers who are just about to abandon their shopping carts.

Commerce Sciences supports demand generation, e-commerce optimisation, A/B testing, and customer onboarding.

The DirectedEdge web services API allows users to add e-commerce recommendations to an existing shop. Language support for PHP, Ruby, Python and Java (along with a generic REST API) allows users to offer related-product and personalized recommendations to customers. DirectedEdge also offers a social API and recommendation engine.

Ezako offers a fully integrated recommendation engine that is embedded directly in web pages and blended into the site design.

FACT-Finder combines all sales channels in one solution: online shop, in-store, point of sales, terminals, mobile and social.

Hugefly incorporates best-in-class machine learning technology that helps make ecommerce engaging. It uses real-time search, recommendations and personalization to offer a relevant, trustworthy buying experience.

Magnetic brings together big data, patented algorithms, predictive intelligence and proprietary analytics. The platform is powered by a unique combination of in-market and in-store data, and the technology responds to shifts in people's behaviors in real-time: when they're on the web, on site, in their inbox, and on any device.

Monetate makes it easy to create and deploy super-relevant, multi-channel experiences for unlimited segments.

The Peerius personalization engine helps the world's leading brands increase revenue by creating a personalized experience for their customers, through a suite of powerful next-generation technology.

Prediggo provides intelligent search, semantic merchandising and 1 to 1 marketing. Ontology Filtering, a patented algorithm, integrates semantic product understanding with transactional analysis to support personalization strategies.

Prudsys allows organizations to personalize product and content recommendations as well as search result lists in real time. It optimizes the customer experience by generating relevant content at every touchpoint of the customer journey, and aims to increase sales and customer loyalty through personalized recommendations. It also supports marketing automation and dynamic pricing.

Qubit provides AB & MV Testing, Personalisation, Social Proof, Recommendations, Voice of Customer, Landing Page Optimisation, Merchandising, Mobile Apps, Digital Analytics, Mobile Analytics, Marketing Attribution and Customer Feedback.

RichRelevance enables personalized search and browse with the most relevant products. It also creates highly relevant content experiences for each customer.

Sajari adds search and recommendations to any app or web site.

SmartFocus supports personalization, market intelligence, social insights, email and mobile notifications.

Softcube provides personalization without any programming effort.

Sparkow is a SaaS solution that integrates with all the software on the market. It connects to all product and customer data sources: e-commerce platforms, CRM, ERP, PIM, DOM, WMS, social networks, web analytics… Users can create business rules that personalise the buying experience. Access to the application is via a simple web interface.

Steelhouse supports creation of unlimited audience segments based on purchase patterns, device, search terms, location, and more. First-party data is gathered in real time, allowing users to target any and all audience segments with relevant offers immediately.

Strands Retail makes intelligent product recommendations to customers, based on what other users bought, unique shopper behavior, and upsell and cross-sell opportunities.

Swogo provides automated bundles to sell more high margin products.

TargetingMantra Snowflake segments visitors at the push of a button, based on browsing behavior, purchase history, location and much more. Curated campaigns can be created on the fly.

UNBXD supports a personalized shopping experience with site search and recommendations.
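
Several of the platforms listed in this section base recommendations on what other shoppers bought. The sketch below shows the simplest form of that idea: counting how often products appear in the same order. It is not the method of any vendor named here. The function names and the order history are hypothetical, and real engines add many further signals such as browsing behavior, context and business rules.

```python
from collections import Counter, defaultdict
from itertools import permutations


def build_co_purchases(baskets):
    """Count, for each product, how often every other product shares a basket with it."""
    co_counts = defaultdict(Counter)
    for basket in baskets:
        # set() ignores repeated items within one order
        for item, other in permutations(set(basket), 2):
            co_counts[item][other] += 1
    return co_counts


def recommend(co_counts, product, top_n=3):
    """Return up to top_n products most often bought together with `product`."""
    counts = co_counts.get(product, {})
    # Highest count first; ties are broken alphabetically for a stable result
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [name for name, _ in ranked[:top_n]]


# Hypothetical order history, invented for this example.
orders = [
    ["camera", "memory card", "tripod"],
    ["camera", "memory card"],
    ["camera", "camera bag"],
    ["tripod", "camera bag"],
]
co_counts = build_co_purchases(orders)
print(recommend(co_counts, "camera"))
```

Raw co-purchase counts favour popular products, so practical systems usually normalise the counts or combine them with the individual shopper's own history.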
