Vertica Enterprise Edition

Total Page:16

File Type:pdf, Size:1020Kb

Vertica Enterprise Edition Data Sheet Vertica Advanced Analytics Platform Vertica Enterprise Edition Micro Focus® Vertica is designed for use in data warehouses and other big data workloads where speed, scalability, simplicity, and openness are crucial to the success of analytics. With Vertica Enterprise Edition software, you get high-performance data analytics for all of your data to make better business decisions in real time. Packed with powerful features for harnessing the value from all your data, Vertica Enterprise Edition manages massive amounts of data quickly and reliably, giving you real-time insight for advanced, Big Data analytics. Product Highlights Extensible In-Database Quick View As part of the Vertica Analytics Platform, you Analytics Framework • Real-time big data analytics. Gain insights into can perform queries 50 to 1,000 times faster Achieve open access to in-database processing your data in real time. Consume, analyze, and make than with traditional databases, at a fraction of through a robust development framework for informed decisions at the speed of business. the cost and using a fraction of the hardware. procedural, user-defined analytics. In addition to using built-in SQL analytic and aggregate func- • Fast time to value. Monetize your data in a matter Unlike traditional RDBMS databases, which of minutes, not weeks or months. Prevent data blind are not designed for analytics and today’s com- tions, you can define your own custom functions spots from increasing your risk and uncertainty plex analytics workloads, the Vertica Analytics by using a software developers’ kit (SDK). The around regulatory compliance. Platform is built from the ground up to deliver SDK features secure sandboxing, and resulting • Agile deployment. Get maximum flexibility—deploy functions run in parallel for fast performance. breakneck performance on Exabyte volumes at Vertica Enterprise Edition on industry-standard nearly infinite scale. hardware, virtual machines, or in the cloud. In-Database Analytics Library Key Features and Benefits Vertica provides a number of machine learning functions for performing in-database analysis. Columnar Storage and Execution Out-of-the-box, in-database analytics include By eliminating costly disk I/O associated with event-series pattern matching, event-series traditional row-oriented SQL databases, you joins, clustering, linear regression, geospa- 1 can perform queries 50 to 1,000 times faster. tial support, advanced time series, and more. And you can store 10x–30x more data per You also get open source analytics libraries, server than traditional databases with patented including thousands of packages from CRAN 2 columnar compression. (Comprehensive R Archive Network). “Scale-Out” Massively Parallel Native Support for Hadoop, Processing (MPP) Kafka, and More Scale your solution as much as you need Vertica Enterprise Edition includes application by adding an unlimited number of industry- programming interfaces (APIs) for user-defined standard servers and storage to your analytics aggregates, analytics, and multi-phase trans- environment. form functions. These dynamically integrate __________ with Apache Hadoop, Apache Kafka, and 1 TechValidate reference: techvalidate.com/ MapReduce, so that you can analyze large sets product-research/hp-vertica/charts/B9F- of structured, semi-structured, and unstruc- BA0-073 tured data in real time. 2 A Vertica internal study Data Sheet Vertica Enterprise Edition Automatic High Availability Run non-stop with data replication, failover, and recovery. Vertica Enterprise Edition is optimized for performance and is transparent to your in- tegration and operations teams. Optimized System and Performance Management Management Console displays tuning recom- mendations from the Workload Analyzer tool, addressing a wide range of system concerns, including empty administrator passwords, high CPU usage, and recommendations for better data management. Broad Deployment and Consumption Models—All Based on the Same Vertica Enterprise Edition Offerings Vertica Core The Vertica Enterprise Edition has two options. The Express edition consists of the base functional- Available on-premises, in the Cloud, or on ity and the Premium edition has additional advanced capabilities as shown below. Hadoop, Vertica offers proven Big Data ana- lytics that can deliver unmatched speed and Express Premium scale. Depending on the situation, if you need Enterprise Capabilities (Price per TB) (Price per TB) extra capacity and have no time to stand up MPP Architecture for high scalability ✔ ✔ on-premises hardware, deploying Vertica in the High Availability provides tolerance for node failures ✔ ✔ Cloud is an attractive option. For organizations Role-Based Security ✔ ✔ that prefer to apply a sophisticated SQL query User Defined Extension (UDx) for filtering, parsing, scalar functions, ✔ ✔ engine to their growing data lakes, Vertica for aggregates, transforms and more SQL on Apache Hadoop is another popular Standard SQL (ANSI 99) ✔ ✔ Vertica consumption model. Flex Tables—Load semi structured data (JSON, Avro, etc.) into a table ✔ ✔ for querying with pre-built parsers Out-of-the-Box Productivity Workload Analyzer, DB Designer, Management Console for ease of ✔ ✔ management and optimal query performance Vertica Enterprise Edition was designed to Elastic Cluster for seamlessly adding or removing nodes in a cluster ✔ ✔ get your users up and running quickly, requir- Advanced SQL Analytics (Time Series, SQL Windowing, Gap Filling, ✔ ing fewer resources than traditional solutions. more)* and Text Search With real-time loading and querying, automated Routable Query (Key Value) Interface to achieve high throughput for ✔ database design, and familiar interfaces, your low latency and high-concurrency queries users are productive immediately. Fault Groups provides high availability and reduces risk of correlated ✔ failures Operating on industry-standard hardware and Advanced Analytics with Sentiment, Geospatial, R Extensions, and ✔ with an optimized architecture, Vertica En ter- In-database Machine learning prise Edition can deliver significant savings Advanced Relational OLAP functions (Rollup Aggregate, Grouping ✔ Sets Aggregate, Cube Aggregate, and Pivot) while providing you with industry-leading ana- Live Aggregate & Pre-Join Projections to optimize query performance ✔ lytics capabilities and performance. for frequently used queries containing aggregates 2 New Technology, Technical Specifications Not Retrofitted Solutions Supported 64-bit operating systems on x86, x64 architecture: operating Designed and developed in the 21st century, Red Hat Enterprise Linux 6.6, 6.7, 6.8, 6.9, 7.0, 7.3 systems Vertica Enterprise Edition powers some of the SUSE Linux Enterprise Server 11.0 SP3*, 12 SP2 largest organizations and most innovative busi- Oracle Enterprise Linux 6.7, 6.8, 7.3 (Red Hat compatible kernel only) ness models around the world. Debian Linux 7.6, 7.7, 8.5 CentOS6.6, 6.7, 6.8, 6.9, 7.0, 7.3 3 Real-Time Big Data Analytics Supported file Extensively tested on all supported Linux platforms running ext3 or ext4 file systems. systems Gain insights into your data in real time. Operating System Processor Driver Manager Consume, analyze, and make informed deci- ODBC drivers Microsoft Windows x86, x64 Microsoft ODBC MDAC 2.8 iODBC sions at the speed of business. (32-bit and 64-bit 3.52.6 or later unixODBC 2.3.0 or later Windows 7, 8 and 10 all variants drivers provided) Microsoft Windows Server 2012, DataDirect 5.3 and 6.1 or later Fast Time to Value 2012 R2, 2016 Monetize your data in a matter of minutes, not Red Hat Enterprise Linux 6.6–6.9, x86, x64 weeks or months. Prevent data blind spots from 7.0, 7.3 increasing your risk and uncertainty around SUSE Linux Enterprise 11.0 SP3*, x86, x64 regulatory compliance. 12 SP2 Oracle Enterprise Linux 6.7, 6.8, 7.3 x86, x64 Agile Deployment (Red Hat compatible kernel only) Get maximum flexibility—deploy Vertica Enter - Ubuntu 14.04 LTS x86, x64 prise Edition on industry-standard hardware, CentOS 6.6–6.9, 7.0, 7.3 x86, x64 virtual machines, or in the cloud. Solaris 10 x86, x64, SPARC You can get all of these benefits with a lead- AIX 7.1 PowerPC ing data analytics solution that scales without HPUX 11i V3 IA64 compromising performance. Mac OS X 10.10, 10.11, 10.12 x86, x64 Mac OS X Driver Manager (iODBC) JDCB drivers Supported on all operating environment platforms Try Vertica Today (JDBC 4.0 Get started today and download Vertica Com- compliant) munity Edition, a free version of Vertica Enter- Operating System Processor .NET Requirements prise Edition. Store up to 1 TB of data and ADO.NET driver Microsoft Windows x86, x64 Microsoft .NET—Framework 3.5 SP1 or later deploy Vertica on a three node cluster. Sign up Windows Server 2012, 2012 R2, for Vertica Community Edition at: www.vertica. 2016 com/try Windows 7, 8 and 10 all variants __________ vsql client Supported on all Vertica Enterprise Edition ODBC driver platforms Perl Driver 3 While other file systems have been successfully Perl Version Modules ODBC requirements deployed by some customers, Vertica cannot Perl 5.8 DBI driver See ODBC drivers. guarantee performance or stability of the product 5.10 version 1.609 on these file systems. In certain support situations, DBD: ODBC you may need to migrate from these untested file version 1.22 systems to help you troubleshoot or fix an issue. Continued on the next page Vertica Community Edition does not support Linux Logical Volume Manager. www.vertica.com 3 www.vertica.com Technical Specifications
Recommended publications
  • Finding Fraud in Large and Diverse Data Sets
    Business white paper Finding fraud in large and diverse data sets Applying real-time, next-generation analytics to fraud detection and prevention using the HP Vertica Analytics Platform Developments in data mining In the effort to identify and deter fraud, conventional wisdom still applies: Follow the money. That simple adage notwithstanding, the task of tracking fraud and its perpetrators continues to vex both private and public organizations. Clearly, advancements in information technology have made it possible to capture transaction data at the most granular level. For instance, in the retail trade alone, transmissions of up to 500 megabytes daily between individual point-of-sale sites and their data centers are typical.2 Logically, such detail should result in greater transparency and greater capacity to fight fraud. Yet, the sheer volume of data that organizations now maintain, pulled from so many sources and stored across a range of locations has made the same organizations more vulnerable.3 More points of entry amount to more opportunities for fraud. In its annual Global Fraud Report, The Economist found that 50% of all businesses surveyed acknowledged they were vulnerable to fraud; 35% of North American companies specifically cited IT complexity for increasing their exposure to risk. Accordingly, the application of data mining as a security measure A solution for real-time fraud detection has become increasingly germane to modern fraud detection. Historically, data mining as a means of identifying trends from raw Fraud saps hundreds of billions of dollars each year from the statistics can be traced back to the 1700s with the introduction of bottom line of industries such as banking, insurance, retail, Bayes’ theorem.
    [Show full text]
  • The Vertica Analytic Database: C-Store 7 Years Later
    The Vertica Analytic Database: C-Store 7 Years Later Andrew Lamb, Matt Fuller, Ramakrishna Varadarajan Nga Tran, Ben Vandiver, Lyric Doshi, Chuck Bear Vertica Systems, An HP Company Cambridge, MA {alamb, mfuller, rvaradarajan, ntran, bvandiver, ldoshi, cbear} @vertica.com ABSTRACT that they were designed for transactional workloads on late- This paper describes the system architecture of the Ver- model computer hardware 40 years ago. Vertica is designed tica Analytic Database (Vertica), a commercialization of the for analytic workloads on modern hardware and its success design of the C-Store research prototype. Vertica demon- proves the commercial and technical viability of large scale strates a modern commercial RDBMS system that presents distributed databases which offer fully ACID transactions a classical relational interface while at the same time achiev- yet efficiently process petabytes of structured data. ing the high performance expected from modern “web scale” This main contributions of this paper are: analytic systems by making appropriate architectural choices. 1. An overview of the architecture of the Vertica Analytic Vertica is also an instructive lesson in how academic systems Database, focusing on deviations from C-Store. research can be directly commercialized into a successful product. 2. Implementation and deployment lessons that led to those differences. 1. INTRODUCTION 3. Observations on real-world experiences that can in- The Vertica Analytic Database (Vertica) is a distributed1, form future research directions for large scale analytic massively parallel RDBMS system that commercializes the systems. ideas of the C-Store[21] project. It is one of the few new commercial relational database systems that is widely used We hope that this paper contributes a perspective on com- in business critical systems.
    [Show full text]
  • Database Software Market: Billy Fitzsimmons +1 312 364 5112
    Equity Research Technology, Media, & Communications | Enterprise and Cloud Infrastructure March 22, 2019 Industry Report Jason Ader +1 617 235 7519 [email protected] Database Software Market: Billy Fitzsimmons +1 312 364 5112 The Long-Awaited Shake-up [email protected] Naji +1 212 245 6508 [email protected] Please refer to important disclosures on pages 70 and 71. Analyst certification is on page 70. William Blair or an affiliate does and seeks to do business with companies covered in its research reports. As a result, investors should be aware that the firm may have a conflict of interest that could affect the objectivity of this report. This report is not intended to provide personal investment advice. The opinions and recommendations here- in do not take into account individual client circumstances, objectives, or needs and are not intended as recommen- dations of particular securities, financial instruments, or strategies to particular clients. The recipient of this report must make its own independent decisions regarding any securities or financial instruments mentioned herein. William Blair Contents Key Findings ......................................................................................................................3 Introduction .......................................................................................................................5 Database Market History ...................................................................................................7 Market Definitions
    [Show full text]
  • Vertica Advanced Analytics Platform
    Data Sheet Vertica Advanced Analytics Platform Vertica Advanced Analytics Platform The Vertica Advanced Analytics Platform is consciously designed with speed, scalability, simplicity, and openness at its core and architected to handle analytical workloads via a distributed compressed columnar architecture. Vertica Advanced Analytics Platform provides blazingly fast speed (queries run 10–50X faster), exabyte scale (store 10–30X more data per server), openness, and simplicity (use any business intelligence [BI]/ETL tools, Hadoop, etc.)—at a much lower cost than traditional data warehouse solutions and a better time to market than unproven open source solutions. Handling Today’s Massive What Are the Key Technology Quick View Requirements of a Big Data Data Volumes At the core of the Vertica Advanced Analytics In modern data infrastructures, data comes Analytics Platform? Platform is a column-oriented, relational database from everywhere: business systems like CRM So, just what should you look for in a data ana- built specifically to handle today’s analytic workloads. and ERP, IoT sensors, tweets and other social lytics solution to address today and tomorrow’s Unlike commercial and open-source row stores, media data, Web logs and data streams, gas data challenges? Consider the following: which were designed long ago to support small data, the Vertica Advanced Analytics Platform provides and electrical grids, and mobile networks to Analyze huge data volumes in a unified customers with: name a few. With all this data generated from so manner: You are likely looking to analyze many places, companies are turning to dispa- data at unlimited scale combined with the • Complete and advanced SQL-based analytical functions to provide powerful SQL analytics rate, lower-cost storage locations to store and need to store your data in the right place manage these volumes, adding complexity and at the right time.
    [Show full text]
  • Gartner Magic Quadrant for Data Management Solutions for Analytics
    16/09/2019 Gartner Reprint Licensed for Distribution Magic Quadrant for Data Management Solutions for Analytics Published 21 January 2019 - ID G00353775 - 74 min read By Analysts Adam Ronthal, Roxane Edjlali, Rick Greenwald Disruption slows as cloud and nonrelational technology take their place beside traditional approaches, the leaders extend their lead, and distributed data approaches solidify their place as a best practice for DMSA. We help data and analytics leaders evaluate DMSAs in an increasingly split market. Market Definition/Description Gartner defines a data management solution for analytics (DMSA) as a complete software system that supports and manages data in one or many file management systems, most commonly a database or multiple databases. These management systems include specific optimization strategies designed for supporting analytical processing — including, but not limited to, relational processing, nonrelational processing (such as graph processing), and machine learning or programming languages such as Python or R. Data is not necessarily stored in a relational structure, and can use multiple data models — relational, XML, JavaScript Object Notation (JSON), key-value, graph, geospatial and others. Our definition also states that: ■ A DMSA is a system for storing, accessing, processing and delivering data intended for one or more of the four primary use cases Gartner identifies that support analytics (see Note 1). ■ A DMSA is not a specific class or type of technology; it is a use case. ■ A DMSA may consist of many different technologies in combination. However, any offering or combination of offerings must, at its core, exhibit the capability of providing access to the data under management by open-access tools.
    [Show full text]
  • Presto: the Definitive Guide
    Presto The Definitive Guide SQL at Any Scale, on Any Storage, in Any Environment Compliments of Matt Fuller, Manfred Moser & Martin Traverso Virtual Book Tour Starburst presents Presto: The Definitive Guide Register Now! Starburst is hosting a virtual book tour series where attendees will: Meet the authors: • Meet the authors from the comfort of your own home Matt Fuller • Meet the Presto creators and participate in an Ask Me Anything (AMA) session with the book Manfred Moser authors + Presto creators • Meet special guest speakers from Martin your favorite podcasts who will Traverso moderate the AMA Register here to save your spot. Praise for Presto: The Definitive Guide This book provides a great introduction to Presto and teaches you everything you need to know to start your successful usage of Presto. —Dain Sundstrom and David Phillips, Creators of the Presto Projects and Founders of the Presto Software Foundation Presto plays a key role in enabling analysis at Pinterest. This book covers the Presto essentials, from use cases through how to run Presto at massive scale. —Ashish Kumar Singh, Tech Lead, Bigdata Query Processing Platform, Pinterest Presto has set the bar in both community-building and technical excellence for lightning- fast analytical processing on stored data in modern cloud architectures. This book is a must-read for companies looking to modernize their analytics stack. —Jay Kreps, Cocreator of Apache Kafka, Cofounder and CEO of Confluent Presto has saved us all—both in academia and industry—countless hours of work, allowing us all to avoid having to write code to manage distributed query processing.
    [Show full text]
  • Vertica in the Clouds
    Data Sheet Vertica Advanced Analytics Platform Vertica in the Clouds Vertica maximizes cloud economics for mission-critical big data analytical initiatives, delivering blazing performance and elastic scalability for just-in time deployments on major public clouds–AWS, Azure, and Google Cloud Platform. Packed with the most comprehensive set of features and functionality, Vertica manages massive amounts of data quickly and reliably to provide fast analytical insight. With Vertica, queries run much faster than other analytical databases and without breaking your budget. Product Highlight Vertica Eon Mode Beta Maximizes Cloud Quick View Economics Vertica performs queries 5 to 10 times Vertica is a blazingly fast, elastically scalable, faster than traditional databases, at a With Vertica 9, you can run Vertica in Eon advanced SQL analytics database based on a fraction of the cost and using a fraction of Mode (currently in Beta) on Amazon Web massively parallel processing architecture with the cloud compute and storage resources. Services to capitalize on cloud economics, in-database machine learning that supports the Unlike traditional RDBMS databases, while still enjoying the fast query processing entire predictive analytics process, allowing data which are not designed for analytics and of Vertica. Running Vertica in Eon mode Beta scientists and analysts to embrace the power today’s complex analytics workloads, the separates the computational processes of Big Data and accelerate business outcomes Vertica Analytics Platform is built from from the storage layer of your database. This with no limits and no compromises. the ground up for the cloud to elastically new architecture enables Vertica to scale and independently scale compute and elastically, adapting to varying dynamic • Choice of Cloud Deployment—Get storage resources to deliver breakneck workloads and linearly scaling throughput.
    [Show full text]
  • A Modern Optimizer for Real-Time Analytics in a Distributed Database
    The MemSQL Query Optimizer: A modern optimizer for real-time analytics in a distributed database Jack Chen, Samir Jindel, Robert Walzer, Rajkumar Sen, Nika Jimsheleishvilli, Michael Andrews MemSQL Inc. 534 4th Street San Francisco, CA, 94107, USA {jack, samir, rob, raj, nika, mandrews}@memsql.com ABSTRACT clusters with many nodes enables dramatic performance Real-time analytics on massive datasets has become a very improvements in execution times for analytical data workloads. common need in many enterprises. These applications require not Several other industrial database systems such as SAP HANA [3], only rapid data ingest, but also quick answers to analytical queries Teradata/Aster, Netezza [15], SQL Server PDW [14], Oracle operating on the latest data. MemSQL is a distributed SQL Exadata [20], Pivotal GreenPlum [17], Vertica [7], and database designed to exploit memory-optimized, scale-out VectorWise [21] have gained popularity and are designed to run architecture to enable real-time transactional and analytical analytical queries very fast. workloads which are fast, highly concurrent, and extremely scalable. Many analytical queries in MemSQL’s customer 1.1 Overview of MemSQL workloads are complex queries involving joins, aggregations, sub- MemSQL is a distributed memory-optimized SQL database which queries, etc. over star and snowflake schemas, often ad-hoc or excels at mixed real-time analytical and transactional processing produced interactively by business intelligence tools. These at scale. MemSQL can store data in two formats: an in-memory queries often require latencies of seconds or less, and therefore row-oriented store and a disk-backed column-oriented store. require the optimizer to not only produce a high quality Tables can be created in either rowstore or columnstore format, distributed execution plan, but also produce it fast enough so that and queries can involve any combination of both types of tables.
    [Show full text]
  • Vertica Overview Data Sheet
    Data Sheet Analytics and Big Data Vertica Overview The Vertica Analytics Platform delivers speed, scalability, and built-in machine learning that today’s most analytically intensive workloads demand, whether in the public clouds, on-premises, on Hadoop, or any hybrid combination. Key Benefits Key Features ■ Flexibility and scalability to easily ramp up when workloads increase. Step Up to the Fastest, Most Flexible At the core of the Vertica Advanced Analytics Platform is a column-oriented, relational data- Big Data Analytics Platform ■ Better load throughput and concurrency base built specifically to handle today’s analytic What should you look for in a data analyt- with querying. workloads. This powerful analytics platform ics warehouse to address today and tomor- ■ In-database machine learning addressing pro vides you with: row’s data challenges? Consider the following every step in the ML process—algorithms, Vertica capabilities: ■ Complete and advanced SQL-based R support, Python extensibility, and more. analytical functions to provide powerful ■ Unify your analytics, not the data: Vertica’s ■ Analyze data in place and in any format, SQL analytics. Unified Analytics Warehouse allows you including complex data types like Maps to combine data siloes that are growing ■ A clustered approach to storing big data, and Arrays, Structs in Parquet on S3, exponentially—without moving the data. offering superior query and analytic HDFS, open SQL-based analytics, and new performance. use cases. ■ Save on both storage and computational charges: While cloud-based data storage ■ Better compression, requiring less ■ Vertica in Eon Mode provisions dynamic is low cost, analyzing that data can lead to hardware and storage than comparable workloads as needed, separates storage prohibitively expensive compute charges.
    [Show full text]
  • Choosing a Cloud DBMS: Architectures and Tradeoffs
    Choosing A Cloud DBMS: Architectures and Tradeoffs Junjay Tan1, Thanaa Ghanem2,∗ , Matthew Perron3, Xiangyao Yu3, Michael Stonebraker3,6, David DeWitt3, Marco Serafini4, Ashraf Aboulnaga5, Tim Kraska3 1Brown University; 2Metropolitan State University (Minnesota), CSC; 3MIT CSAIL; 4University of Massachusetts Amherst, CICS; 5Qatar Computing Research Institute, HBKU; 6Tamr, Inc. [email protected], [email protected], fmperron,yxy,[email protected], [email protected], [email protected], [email protected], [email protected] ABSTRACT Query Executor Nodes As analytic (OLAP) applications move to the cloud, DBMSs have shifted from employing a pure shared-nothing design Local instance Local instance Local instance Local instance with locally attached storage to a hybrid design that com- storage storage storage storage bines the use of shared-storage (e.g., AWS S3) with the use of shared-nothing query execution mechanisms. This paper sheds light on the resulting tradeoffs, which have not been properly identified in previous work. To this end, it evaluates Remote object / block store the TPC-H benchmark across a variety of DBMS offerings running in a cloud environment (AWS) on fast 10Gb+ net- Database works, specifically database-as-a-service offerings (Redshift, Figure 1: Shared Disk Architecture Athena), query engines (Presto, Hive), and a traditional cloud agnostic OLAP database (Vertica). While these com- parisons cannot be apples-to-apples in all cases due to cloud are deploying servers by the millions; not by the thousands) configuration restrictions, we nonetheless identify patterns and specialization (cloud vendors' business priority is infras- and design choices that are advantageous. These include tructure management, whereas other organizations perform prioritizing low-cost object stores like S3 for data storage, this to support main lines of business).
    [Show full text]
  • Vertica Advanced Analytics Platform
    Data Sheet Analytics and Big Data Vertica Advanced Analytics Platform The Vertica Advanced Analytics Platform is consciously designed with speed, scalability, simplic- ity, and openness at its core and architected to handle analytical workloads via a distributed com- pressed columnar architecture. Product Overview traffic optimization, enterprises that embrace Key Features Vertica Advanced Analytics Platform provides Big Data analytics platforms are changing the At the core of the Vertica Advanced Analytics blazingly fast speed (queries run 10–50X dynamics of industries from retail to health care Platform is a column-oriented, relational database faster), exabyte scale (store 10–30X more data to telecommunications to energy and beyond. built specifically to handle today’s analytic per server), openness, and simplicity (use any workloads. Unlike commercial and open-source business intelligence [BI]/ETL tools, Hadoop, row stores, which were designed long ago to Key Benefits support small data, the Vertica Advanced Analytics etc.)—at a much lower cost than traditional What Are the Key Technology Platform provides customers with: data warehouse solutions and a better time to market than unproven open source solutions. Requirements of a Big Data • Complete and advanced SQL-based analytical Analytics Platform? functions to provide powerful SQL analytics Handling Today’s Massive Data Volumes So, just what should you look for in a data ana- • A clustered approach to storing Big Data, In modern data infrastructures, data comes lytics solution
    [Show full text]
  • In-Database Machine Learning with Vertica Analytics Platform
    Business white paper Unlock machine learning for the new speed and scale of business In-database machine learning with HPE Vertica Analytics Platform Business white paper Table of contents 3 Machine learning: A competitive advantage 3 What is machine learning, and why is it important? 4 Purpose of this white paper 5 Types of machine learning 6 Barriers to applying machine learning at scale 6 HPE Vertica Analytics Platform delivers predictive analytics at speed and scale 7 Implementing machine learning with the HPE Vertica Analytics Platform 7 In-database machine learning 7 User-defined extensions (UDxs) 7 HPE Vertica Analytics Platform in-database machine learning functions 8 K-means clustering 8 Logistic regression 9 Linear regression 9 Naïve Bayes 10 Data preparation and normalization 10 Real-world implementation of Vertica’s in-database machine learning features 12 Use cases for machine learning across different industries 12 Summary Business white paper Page 3 Machine learning: A competitive advantage In today’s data-driven world, creating a competitive advantage depends on your ability to transform massive volumes of data into meaningful insights. Companies that use advanced analytics and machine learning are twice as likely to be top quartile financial performers, and three times more likely to execute effective decisions.1 However, nearly 50% of business leaders admit to not knowing how to get value from their data, and only 30%2 are able to act upon their information in real time. Leveraging Big Data to the full can help provide an in-depth understanding of customer behavior, enabling you to personalize the consumer experience, prevent churn, detect fraud, and increase your bottom line.
    [Show full text]