Polyanalyst™ 6.5 TECHNICAL CAPABILITIES and SYSTEM REQUIREMENTS
Total Page:16
File Type:pdf, Size:1020Kb
PolyAnalyst™ 6.5 TECHNICAL CAPABILITIES AND SYSTEM REQUIREMENTS PolyAnalyst™ is comprehensive data and text mining system providing tools for handling all steps of a typical data analysis process: from data loading, integration, manipulation and cleansing, to advanced text analytics, machine learning and knowledge discovery, and to versatile results visualization and reporting. Tools for the analysis of structured and textual data in a single environment enables joint processing of all available data for better insights. © 2019 MEGAPUTER INTELLIGENCE. ALL RIGHTS RESERVES VERSION 20.19 R.1 FEATURES / PolyAnalyst™ Web Reports PolyAnalyst™ Server PolyAnalyst™ Clients DOCUMENTS / EMAILS DATA LOADING REPORTING DATA STORAGE ALERTING ANALYSIS SCHEDULING WEB / SOCIAL MEDIA OLAP Web Report Editor DATABASES Administrative Client ODBC / OLEDB Figure 1. PolyAnalyst architecture. Client/Server architecture Security Server architecture enables the implementation of Recognizing that data is one of the most valuable and PolyAnalyst™ 6.5 as an enterprise level analytical sensitive assets of a modern organization, PolyAnalyst™ system. It facilitates the collaboration between data provides solid mechanisms to ensure data security. analysts working on the same projects and sharing Communications between client and server are various related resources such as analysis scenarios, performed in a fully encrypted manner with a new dictionaries, taxonomies, and multi-dimensional encryption key generated by the server for every matrices. Server architecture helps enhance the communication session. PolyAnalyst supports secure performance of the system by performing calculations user login based on user rights and passwords and on the most powerful machines, reducing data transfer keeps track of individual and group rights and over the network, scheduling execution of tasks at a sequences of actions carried out by the users. In given time, and generating custom reports and condition addition, full compliance with the requirements of the based alerts for different groups of business users. It HIPAA legislation facilitates PolyAnalyst implementations provides the centralized management and audit of the at healthcare and insurance organizations. list of system users and their actions. POLYANALYST™ SOFTWARE 1 © 2019 MEGAPUTER INTELLIGENCE. ALL RIGHTS RESERVES / FEATURES Scalability and Performance Data manipulation PolyAnalyst provides industrial level scalability: it can Every data analysis project starts with data normalization, handle huge amounts of data within reasonable time manipulation, and exploratory analysis. In fact, the intervals. This scalability is ensured through a combination dominant part of the analyst’s time is spent on data of several factors. PolyAnalyst™ utilizes hard disk instead manipulations preceding the application of machine or RAM for holding all data and supporting meta- learning algorithms. PolyAnalyst™ provides the analyst information. Special scalable implementation of with a vast set of powerful manipulation tools for data analytical algorithms enables the system to process cleansing, aggregation and derivation of new attributes. large volumes of data. PolyAnalyst user interface allows Virtually any data transformation task can be solved the user to develop complex data analysis scenarios through sequential application of PolyAnalyst™ data without loading data in the system, thus saving analyst’s manipulation tools. time. The availability of PolyAnalyst as the first 64-bit analytical system and its server farm implementation Analytical algorithms help dramatically increase the performance of the system and allow numerous users to perform data The main quality the user expects in a knowledge analysis and report viewing simultaneously. discovery system is its ability to use mathematical algorithms to learn from historical data and predict outcomes of future situations. PolyAnalyst™ provides Reusable analysis scenarios a broad selection of analytical algorithms for clustering A typical data analysis project involves a sequence and categorization of data, solving prediction, link of steps of data loading, preparation, analysis and analysis and affinity grouping tasks, learning patterns reporting. Frequently, the same sequence of analytical and discovering anomalies in data. Ranging from neural steps has to be carried out repetitively on new batches networks and decision trees, to Bayesian Networks of data. PolyAnalyst™ is a self documenting system that and Support Vector Machines, to CHAID and logistic provides intuitive visual tools for developing and editing regression, and to Case-based Reasoning and reusable multi-step data analysis scenarios. The system Evolutionary Programming, PolyAnalyst™ scalable is easy to learn and fun to use. Data analysis and report algorithms enable the user to solve the analytical generation scenarios can be scheduled for re-execution task at hand. at any given time. This ensures that business users have timely access to up-to-date reports built on most Text Analysis recent data. The data might contain attributes holding free form text, like in incident reports, claims notes, or in survey Data loading and integration responses. Or the project might require the analysis Whatever your data sources are, PolyAnalyst™ of huge collections of documents in various formats, provides means for loading and integrating these possibly harvested from the Internet in real time. data. PolyAnalyst™ can load data from disparate data Whatever the task, PolyAnalyst™ offers a collection sources including all popular database, statistical, and of text analysis algorithms that enable the data analyst spreadsheet systems. In addition, it can load collections to solve it. Based on Megaputer linguistic platform of documents in html, doc, pdf and txt formats, as well and the incorporation of various semantic dictionaries, as load data from an internet source including websites, PolyAnalyst™ represents a powerful natural language RSS feeds and Social Media. PolyAnalyst™ offers visual processing tool. PolyAnalyst™ provides tools for handling on-the-fly integration and merging of data coming from both analyst driven analysis, such as taxonomy based disparate sources to create data marts for further categorization and Text OLAP, and data driven analysis, analysis. It supports incremental data appending such as intelligent spell checking, keyword, and entity and referencing data sets in previously created extraction, clustering, and taxonomy creation. PolyAnalyst™ projects. © 2019 MEGAPUTER INTELLIGENCE. ALL RIGHTS RESERVES 2 POLYANALYST™ SOFTWARE FEATURES / PolyAnalyst 6.5 Features LOADING CLEANSING + MANIPULATION TEXT ANALYSIS + MACHINE LEARNING REPORTING Report Databases Pattern Substitution Document Link Categorization Analysis Templates Anomaly Spreadsheets Intelligent Spellchecker Taxonomy GIS Generation Detection Statistical Report Dictionary Management Entity OLAP Systems Extraction Publishing Document Grammar Checker Text OLAP Trend Web Collections Analysis Reports Emails Semantic Data Sources Integration Categorization Scheduled Flat Files Search Execution Internet Aggregation Document Affinity Executive Clustering Grouping Reporting Trends & Hadoop Attribute Derivation Keyword Extraction Clustering Alerts Pattern Social Media Anonymize Prediction Drill-Down Detection Reporting Figure 2. PolyAnalyst™ key capabilites. Multi-dimensional analysis The development of multi-dimensional cubes based on the analyzed data allows the user to answer a variety of business questions by slicing data across various dimensions. This technology is widely known as OLAP (for on-line analytical processing). PolyAnalyst™ offers a robust OLAP engine as one of its data analysis algorithm. In addition, PolyAnalyst™ enhances standard OLAP by adding dimensions defined on fields containing free form text and offering a unique multi-dimensional OLAP interface. POLYANALYST™ SOFTWARE 3 © 2019 MEGAPUTER INTELLIGENCE. ALL RIGHTS RESERVES / FEATURES Figure 3. PolyAnalyst™ web reports. Interactive visualization Reporting PolyAnalyst provides the data analyst with an immediate PolyAnalyst enables the data analyst to create custom feedback on the results of their analysis. It offers interactive reports delivering key results of the analysis to business and visual user experience whenever possible. In fact, users across the organization in a clean, consistent and exploratory analysis represents a very important stage easy to comprehend format. Interactive reports include of the complete data analysis cycle. The user of a mixture of graphs, tables, numbers, text and links to PolyAnalyst™ commands a variety of tools for interactive other PolyAnalyst objects. Reports can be scheduled for visualization of the data and the results of the analysis: re-execution at a given time to provide business users one and two-dimensional histograms, pie charts, line with results based on the analysis of the most up-to-date charts, bubble charts, scatter plots, snake charts, data. Static snapshots of reports can be exported to statistical widgets, link charts, and trends graphs help PDF, HTML and RTF format. the user make sense of the data. © 2019 MEGAPUTER INTELLIGENCE. ALL RIGHTS RESERVES 4 POLYANALYST™ SOFTWARE TECHNICAL / 1. Data loading and integration a. Data Sources: ODBC, OLEDB, XML, CSV, MS Excel, Web, File System, FTP b. Document Formats: PDF, ASCII, HTML, MS Word, MS RTF, RSS Feeds c. Character Formats: ASCII, Latin-1, Double-byte, UTF d. Web Data sources