Data Platforms in Financial Services: the Nosql Edge

Total Page:16

File Type:pdf, Size:1020Kb

Data Platforms in Financial Services: the Nosql Edge DATA PLATFORMS IN FINANCIAL SERVICES THE NOSQL EDGE WHITE PAPER Data Platforms in Financial Services: The NoSQL Edge Copying or distribution without written permission is prohibited 2 Contents Application and Criticality of Data Processing at Scale in Financial Services …………….. 06 • Fraud prevention, personalized customer experience, and risk management • Explosion of data sources, data sets, and formats • Shift from batch to near real-time to instantaneous • Growth of web-scale applications Common Challenges in Data Processing in Financial Services ………………………………….. 09 • Data silos caused by organizational structures • The increasing need for real-time processing • Always-on availability and performance • Consistency of data • Transition from mainframes to distributed workloads • Processing data for AI/ML in real-time Data Handling and Processing in Traditional Architectures ………………………………………. 12 • Not designed for extreme real-time workloads • Concentrates on data integrity over performance • Vertical scaling • Built for persistence and batch processing • Not in-memory • Slower response when implemented for real-time feedback • Caching layers built on top of operational layers NoSQL Databases in the Changing World of Financial Services …………………………..……. 15 • Digital transformation in financial services and the changes imposed by it • What is a NoSQL database? • Types of NoSQL databases • Growth of NoSQL databases Where NoSQL Databases Fit in Financial Services ……………………………………………………. 20 • NoSQL databases are better equipped to handle larger data sets • Ease of handling both structured and unstructured data formats • High-performance handling thereby designed for real-time feedback at web-scale • Built to connect legacy systems with newer and faster front-end systems • Built to drive two-paced development of modern architectures • Can scale better horizontally as the data grows • Easier and faster to implement compared to traditional databases • Support for better performance handling Opportunities and Business Benefits That Can Be Derived from NoSQL Databases …... 24 • Improved fraud prevention and financial crime monitoring • Real-time digital ID validation • Personalized offerings using improved real-time data analytics • Targeted marketing • Pre-trade assistance Conclusion ……………………………………………………………………………………………………….…… 28 The Aerospike Advantage …………………………………………………………………………………...…. 30 • Top 3 Global Online Payments System: Fraud detection for digital payments • Top 3 Global Brokerage Firm: Intra-day trading for financial services • LexisNexis Risk Solutions/ThreatMetrix: Tracking digital identity Bibliography ………………………………………………………………………………………………………….. 33 Copying or distribution without written permission is prohibited 3 Copying or distribution without written permission is prohibited 4 Executive Summary The financial services industry is a crucial cog absence of real-time data processing, have in the functioning of an economy. The an impact on performance and intended industry acts as a force multiplier of economic business outcomes. activity from storing to creating wealth and providing access to credit. It has always strived to evolve with the changing needs of The emphatic acceptance of modern the consumer. platforms and technologies such as NoSQL databases (which are powering tech giants like Facebook, Amazon, and Google) has The digitization of financial services has been provided the opportunity to tackle these an ongoing process for the past few decades. challenges effectively. This report explores The transition has not always been smooth the various facets of NoSQL databases and and easy from legacy mainframe systems of how their growth has played a crucial role in the 60s and 70s to today’s cloud-based transforming several financial services financial services. Customers are increasingly business areas. FinTech companies such as adopting digital channels to access banking neobanks, online trading platforms, and services and other offerings. Neobanks have payment platforms are already incorporating launched that are completely digital, NoSQL databases in their architectures and without having any branch banking. are reaping the benefits of being agile and Financial services can no longer offer generic resilient. solutions that target a wide range of consumers. Be it an individual working in a multinational corporation or a student, a This white paper also investigates the several neighborhood business, or a large-scale advantages that NoSQL offers over enterprise catering to a global populace, the traditional databases for a financial needs of each client are unique. To address institution. The ability to handle structured this diversity, institutions are consuming data and unstructured data formats, ease of from multiple sources inside and outside the connecting legacy systems with new, enterprise. These include structured data speedier front-end systems, horizontal from sources like enterprise systems, market scaling, and faster implementation are some systems, and government databases, as well of the points that are explored in detail. It also as unstructured data from social networks analyzes how each of these advantages and media. translates to better performance in the various functions of a financial institution ranging from fraud prevention, better This white paper explores the criticality of customer service and customer 360, targeted real-time data processing in financial marketing, data analytics, and real-time institutions and the common challenges they digital identity verification. face. The growth in online transactions and web applications is driving a focus on fraud prevention, customer experience, and a new Financial institutions are increasingly looking digital imperative that requires new to move operational and delivery models from thinking on how data is stored, managed, physical to digital platforms as more and and processed. However, the challenges more customers prefer these channels. posed by legacy infrastructure, such as data NoSQL will play a critical role in enabling inconsistency, difficulty in transitioning from this shift and in ensuring the high quality of mainframes to distributed workloads, and the services that their new, digital clientele expects from them. Copying or distribution without written permission is prohibited 5 01. Application and Criticality of Data Processing at Scale in Financial Services Copying or distribution without written permission is prohibited 6 Application and Criticality of Data Processing at Scale in Financial Services Fraud prevention, personalized customer experience, and risk management Financial services firms have multiple The collection of large sets of data on requirements that their data processing consumer spend and behavior enables firms architectures must meet. They process large to offer personalized services catering to their amounts of data ranging from customer requirements. Financial firms can cross-market records, transactional data, and records of various products and services based on the multiple interactions with third-party providers projected needs of the consumer and their such as regulators, vendors, and clients. The real- spending capabilities. time processing and analysis of this data enables faster decision-making and provides a rewarding Along with better customer service, real-time experience to the consumer. processing also enables for better risk management. For regulatory purposes, financial Fraud prevention is an important aspect both institutions must measure and manage the in terms of financial and reputational damage. riskiness of customer portfolios, requiring a move Since there is a transfer of wealth involved in to real-time transaction processing for a large these transactions, it is very crucial for these part of what a bank does. organizations to ensure that there are no attacks by unscrupulous entities. Explosion of data sources, data sets, and formats For financial services enterprises, data resides in The rise of FinTech firms has also put pressure on multiple siloed environments such as data traditional players in the financial services warehouses, core applications, finance and risk industry to transform themselves digitally. solutions, analytics solutions, mobile Digital-only banks, online trading platforms, applications, and social media interactions, to and blockchain-based payment systems are name a few. Since this data is not meaningfully some of the new business models adopted by linked across these silos, it has become less FinTechs in financial services. These accessible, thereby compromising useful developments have now led to the traditional insights into customers, products, partners, sales players converting themselves into data-driven channels, and financial performance. enterprises that can compete with the more agile, web-based FinTech startups. Figure. 1 Data from Multiple Sources Leading to Better Insights New Business Models Data & Value Chain Processing Participants Large Amounts New Data Insights of Data (Raw) (Structured) & Values To do this, they have to process large amounts of data arriving from various sources such as images, audio, and video generated through mobile applications and other devices. This new data, combined with newer business models and participants in the value chain that increase the digitization of financial services, provides enterprises with opportunities to gain additional insight and value. Copying or distribution without written permission is prohibited 7 Application
Recommended publications
  • Operational Database Offload
    Operational Database Offload Partner Brief Facing increased data growth and cost pressures, scale‐out technology has become very popular as more businesses become frustrated with their costly “Our partnership with Hortonworks is able to scale‐up RDBMSs. With Hadoop emerging as the de facto scale‐out file system, a deliver to our clients 5‐10x faster performance Hadoop RDBMS is a natural choice to replace traditional relational databases and over 75% reduction in TCO over traditional scale‐up databases. With Splice like Oracle and IBM DB2, which struggle with cost or scaling issues. Machine’s SQL‐based transactional processing Designed to meet the needs of real‐time, data‐driven businesses, Splice engine, our clients are able to migrate their legacy database applications without Machine is the only Hadoop RDBMS. Splice Machine offers an ANSI‐SQL application rewrites” database with support for ACID transactions on the distributed computing Monte Zweben infrastructure of Hadoop. Like Oracle and MySQL, it is an operational database Chief Executive Office that can handle operational (OLTP) or analytical (OLAP) workloads, while scaling Splice Machine out cost‐effectively from terabytes to petabytes on inexpensive commodity servers. Splice Machine, a technology partner with Hortonworks, chose HBase and Hadoop as its scale‐out architecture because of their proven auto‐sharding, replication, and failover technology. This partnership now allows businesses the best of all worlds: a standard SQL database, the proven scale‐out of Hadoop, and the ability to leverage current staff, operations, and applications without specialized hardware or significant application modifications. What Business Challenges are Solved? Leverage Existing SQL Tools Cost Effective Scaling Real‐Time Updates Leveraging the proven SQL processing of Splice Machine leverages the proven Splice Machine provides full ACID Apache Derby, Splice Machine is a true ANSI auto‐sharding of HBase to scale with transactions across rows and tables by using SQL database on Hadoop.
    [Show full text]
  • Product 360: Retail and Consumer Industries
    PRODUCT 360: RETAIL AND CONSUMER INDUSTRIES MARKLOGIC WHITE PAPER • NOVEMBER 2015 PRODUCT INFORMATION IS COMPLEX A major challenge for Retail and Consumer companies today is product proliferation and product complexity. An electronics retailer for example may have over 70,000 products in its catalog, while it is not uncommon for an industrial distributor to have over a million products and represent over 1,000 suppliers. Products HD typically have short shelf lives. In electronics for example it’s not uncommon for a new model to be released every year. And, a “Product” is not just a physical SKU (stock keeping unit) but a complex combination of structured HD and unstructured data that helps consumers search for, evaluate, compare, and choose their desired purchase. Product information includes a variety of data elements Product information includes a wide variety of data elements which are generated and stored in multiple locations, for example: WHY IS “PRODUCT 360” • Product descriptive information (e.g. size, color, IMPORTANT? material, nutritional information, usage, and other Creating, maintaining, and managing a 360 degree view elements that define it) of products is at the core of competitive differentiation • Digital images and videos – and in fact even survival – for retail and consumer • Customer ratings and reviews companies. • Dynamic pricing and promotions • Availability in-stock Key benefits of a Product 360 include: • Consumer loyalty information (who’s most likely to buy it) REVENUE GROWTH • Related products and accessories Today just 3% of on-line e-commerce transactions actually result in a sale. E-Commerce is the fastest These data elements often sit in different databases and growing channel for retailers, and sales via e-commerce legacy systems, making accessing them a challenge.
    [Show full text]
  • Operational Database Overview Date Published: 2020-02-29 Date Modified: 2021-02-04
    Cloudera Runtime 7.2.7 Operational Database Overview Date published: 2020-02-29 Date modified: 2021-02-04 https://docs.cloudera.com/ Legal Notice © Cloudera Inc. 2021. All rights reserved. The documentation is and contains Cloudera proprietary information protected by copyright and other intellectual property rights. No license under copyright or any other intellectual property right is granted herein. Copyright information for Cloudera software may be found within the documentation accompanying each component in a particular release. Cloudera software includes software from various open source or other third party projects, and may be released under the Apache Software License 2.0 (“ASLv2”), the Affero General Public License version 3 (AGPLv3), or other license terms. Other software included may be released under the terms of alternative open source licenses. Please review the license and notice files accompanying the software for additional licensing information. Please visit the Cloudera software product page for more information on Cloudera software. For more information on Cloudera support services, please visit either the Support or Sales page. Feel free to contact us directly to discuss your specific needs. Cloudera reserves the right to change any products at any time, and without notice. Cloudera assumes no responsibility nor liability arising from the use of products, except as expressly agreed to in writing by Cloudera. Cloudera, Cloudera Altus, HUE, Impala, Cloudera Impala, and other Cloudera marks are registered or unregistered trademarks in the United States and other countries. All other trademarks are the property of their respective owners. Disclaimer: EXCEPT AS EXPRESSLY PROVIDED IN A WRITTEN AGREEMENT WITH CLOUDERA, CLOUDERA DOES NOT MAKE NOR GIVE ANY REPRESENTATION, WARRANTY, NOR COVENANT OF ANY KIND, WHETHER EXPRESS OR IMPLIED, IN CONNECTION WITH CLOUDERA TECHNOLOGY OR RELATED SUPPORT PROVIDED IN CONNECTION THEREWITH.
    [Show full text]
  • P6 Reporting Database Planning and Sizing
    P6 Reporting Database Ver 3.0 Planning and Sizing An Oracle White Paper December 2011 Disclaimer The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle. Contents Introduction ...................................................................................................................................... 5 Critical Performance Factors ............................................................................................................ 5 Four Key Areas of the ETL ................................................................................................................. 6 Pulling Data between Servers ..................................................................................................... 6 Merging Updates into Target Database ...................................................................................... 6 PL/SQL-based Transformations .................................................................................................. 7 Planning Process .............................................................................................................................. 7 Why Planning is Key .........................................................................................................................
    [Show full text]
  • What Is Database? Types and Examples
    What is database? Types and Examples Visit our site for more information: www.examplanning.com Facebook Page: https://www.facebook.com/examplanning10/ Twitter: https://twitter.com/examplanning10 TABLE OF CONTENTS Sr. Description 1 What is database? 2 Different definitions of database 3 Growth of Database 4 Elements of Database 5 Components of database 6 Database System Environment 7 Types of Databas 8 Characteristics of database 9 Advantages of Database 10 Disadvantages of Database What is Database? A database is a collection of information or data which are organized in such a way that it can be easily accessed, managed and retrieved. Database is abbreviated ad DB. Different definitions of database. “a usually large collection of data organized especially for rapid search and retrieval (as by a computer) an online database” (merriam-webster) “a comprehensive collection of related data organized for convenient access, generally in a computer.” (dictionary) A database is an organized collection of data. (Wikipedia) What is data? It is used as both singular and plural form. It can be a quantity, symbol or character on which operations are performed. Data is information which are converted into digital form. Growth of Database Database was evolved in 1960's started with the hierarchical database. Relational database was invented by EF Codd in 1970s while object oriented database was invented in 1980s. In 1990s object oriented database rose with the growth of object oriented programming languages. Nowadays, databases with SQL and NoSQL are popular. Elements of Database Database elements are fields, rows, columns, tables. All these are building blocks of database.
    [Show full text]
  • Database Software Market: Billy Fitzsimmons +1 312 364 5112
    Equity Research Technology, Media, & Communications | Enterprise and Cloud Infrastructure March 22, 2019 Industry Report Jason Ader +1 617 235 7519 [email protected] Database Software Market: Billy Fitzsimmons +1 312 364 5112 The Long-Awaited Shake-up [email protected] Naji +1 212 245 6508 [email protected] Please refer to important disclosures on pages 70 and 71. Analyst certification is on page 70. William Blair or an affiliate does and seeks to do business with companies covered in its research reports. As a result, investors should be aware that the firm may have a conflict of interest that could affect the objectivity of this report. This report is not intended to provide personal investment advice. The opinions and recommendations here- in do not take into account individual client circumstances, objectives, or needs and are not intended as recommen- dations of particular securities, financial instruments, or strategies to particular clients. The recipient of this report must make its own independent decisions regarding any securities or financial instruments mentioned herein. William Blair Contents Key Findings ......................................................................................................................3 Introduction .......................................................................................................................5 Database Market History ...................................................................................................7 Market Definitions
    [Show full text]
  • Leanxcale's Technology in a Nutshell
    LeanXcale’s Technology in a Nutshell LeanXcale – technology is a nutshell 1 Gartner (HTAP), Forrester (translytical), LeanXcale technology in a and 451 research (HOAP). nutshell LeanXcale scales in all dimensions an enterprise needs: Introduction • Volume: to terabytes • Velocity: to 100s of millions of LeanXcale is a database designed for transactions per second fast-growing businesses and enterprise • Variety: natively supports SQL, companies who make intensive use of key-value, and soon JSON. It data. also supports polyglot queries across SQL and NoSQL (key- It is an ultra-scalable full SQL value data stores, graph operational database supporting full databases, document-oriented ACID transactions. It is possible thanks data stores, Hadoop data lakes) to a patented parallel-distributed and data streaming. transactional manager. It blends operational and analytical capabilities, enabling analytical queries over the operational data. All market analyst named this capability as the next future database technology: LeanXcale – technology is a nutshell 2 Architecture LeanXcale’s architecture has three distributed layers: Capacities LeanXcale was founded under the idea of the database technical excellence, trying to sort out all the problems that enterprise databases have. This philosophy is in the LeanXcale's DNA, and it is embodied in any database aspect, being the origin of more than ten disruptive technologies. • A distributed SQL query engine: It provides full SQL and a JDBC driver to access the database. It supports both scaling out OLTP workloads (distributing transactions across nodes) and OLAP workloads (using multiple nodes for a single large analytical query). Scalability • A distributed transaction Traditional ACID databases do not scale manager: It uses our patented out linearly or do not scale out at all.
    [Show full text]
  • Supply 360: Retail & Consumer Industries
    SUPPLY 360: RETAIL & CONSUMER INDUSTRIES MARKLOGIC WHITE PAPER • FEBRUARY 2016 The MarkLogic Supply 360 solution helps you better manage complexity, by providing a consolidated, operational view across the wide variety and sources of supply chain data – helping you grow revenue, manage risk and compliance, and address cost and operational concerns. MANAGING SUPPLY CHAIN COMPLEXITY A Supply Chain is a complex, dynamic environment. It connects all interactions from "Farm to Fork" in the Consumer Goods industry, and from the source of inputs or raw materials to the final point of sale at Retail. This process involves multiple players and the exchange of a wide variety of data across all the links in the supply chain – not only structured information but also unstructured documents including contracts, design objects, RFID sensors, carrier manifests, shipment notifications, and payments. The MarkLogic Supply 360 solution helps you better manage the complexity, by providing one consolidated view across the supply chain — design objects, bids and contracts, forecasts, inventory, procurement documents, and shipments. With MarkLogic, you can: Back-End Returns & Un-Saleables • Integrate and ingest multiple sources of data Figure 1: A simple version of a supply chain seamlessly into one operational database platform without upfront data modeling, saving you time and BENEFITS OF A SUPPLY 360 money Maintaining and managing a 360 degree view of your • Perform real-time search and query on all your data, supply chain and sources of supply delivers significant and get alerts on new items of importance benefits for consumer and retail companies. • Build applications to support forecasting, planning, GROW REVENUE and tracking especially for recalls and return of un- Plan and forecast the movement of products across saleables your supply chain and reduce out-of-stocks at retail.
    [Show full text]
  • HTAP WP.Indd
    Hybrid Transaction and Analytical Processing with NuoDB Whitepaper Core Tech Technical Whitepaper Hybrid Transaction and Analytical Processing with NuoDB Abstract NuoDB is a distributed SQL database management system that provides a simpler, more cost effective, and ultimately higher-scale approach to hybrid transaction and analytical processing (HTAP). In this paper, we explain the architectural basis for this assertion, and describe an HTAP example to illustrate these unique characteristics. What Is HTAP And Why Is It Important? The combination of simultaneous transactional workloads (“operational”) and analytics workloads (business intelligence, Big Data etc.) is a concept that is “ gaining traction among database vendors and industry analysts. Gartner will Data-driven businesses be publishing research on the topic in the next 12 months under the “HTAP” often need real-time moniker, HTAP meaning Hybrid Transaction/Analytical Processing. discovery and analysis HTAP is about spotting trends and being aware of leading indicators in order of transactional data to take immediate action. For example, an online retailer uses HTAP to find to make meaningful out what products are currently trending as best sellers in the last hour. changes immediately. He sees snow shovels are selling well and reacts by putting a snow shovel ” promotion on the home page to leverage the trend. Much of Big Data analytics has focused on information discovery, i.e., essentially using Hadoop for batch-oriented storing and querying of large amounts of data sometimes referred to as “data lakes.” However, data-driven businesses often need real-time discovery and analysis of transactional data to make meaningful changes immediately. Real-time discovery and analysis requires repeatable analytics on very recent data.
    [Show full text]
  • A New HOAP? Hybrid Operational- Analytic Processing and the Future
    451 RESEARCH REPRINT REPORT REPRINT A new HOAP? Hybrid operational- analytic processing and the future of the database market MATT ASLETT, JAMES CURTIS 04 DEC 2017 Data from 451 Research’s Total Data Market Monitor indicates that databases designed to support a combination of operational and analytical processing workloads will quickly become mainstream, at least for new application projects. THIS REPORT, LICENSED TO MEMSQL, DEVELOPED AND AS PROVIDED BY 451 RESEARCH, LLC, WAS PUBLISHED AS PART OF OUR SYNDICATED MARKET INSIGHT SUBSCRIPTION SERVICE. IT SHALL BE OWNED IN ITS ENTIRETY BY 451 RESEARCH, LLC. THIS REPORT IS SOLELY INTENDED FOR USE BY THE RECIPIENT AND MAY NOT BE REPRODUCED OR RE-POSTED, IN WHOLE OR IN PART, BY THE RE- CIPIENT WITHOUT EXPRESS PERMISSION FROM 451 RESEARCH. ©2017 451 Research, LLC | WWW.451RESEARCH.COM 451 RESEARCH REPRINT 451 Research has previously identified the emergence of a new breed of database providers with products that are positioned for a combination of operational and analytical workloads, as well as the systems of intelligence workloads that they are used for. Data from 451 Research’s Total Data Market Monitor suggests that these databases that are designed to support hybrid operational and analytic processing (HOAP) will quickly become mainstream in the coming years – at least for new application projects. THE 451 TAKE The blending of operational and analytical systems continues to add value for many organizations. And while hybrid systems may not be an ideal fit for every firm, there are many reasons they do make sense. Beyond the reduction in maintaining a separate transactional and analytical system, hybrid databases enable organizations to carry out analytics on incoming operational data, taking advantage of the ‘transaction window,’ which, if done right, could be incredibly lucrative.
    [Show full text]
  • Oracle's Converged Database: How to Make Developers and Data More
    Oracle’s Converged Database: How to Make Developers And Data More Productive As enterprises digitize more business processes and decision points, they face a seemingly impossible choice—improve developer productivity now or data productivity later. But a radically new approach, Oracle’s converged database, breaks this impasse. December 2020 | Version 1.2 Copyright © 2020, Oracle and/or its affiliates Purpose Statement This document is intended to help CTOs, enterprise architects, and development managers understand the benefits of converged databases compared to single-purpose databases. Intended Audience The intended audience of this paper is I.T. leaders making decisions about the future of enterprise computing architecture, including CTOs, enterprise architects, and development managers. Disclaimer This document in any form, software or printed matter, contains proprietary information that is the exclusive property of Oracle. Your access to and use of this confidential material is subject to the terms and conditions of your Oracle software license and service agreement, which has been executed and with which you agree to comply. This document and information contained herein may not be disclosed, copied, reproduced or distributed to anyone outside Oracle without prior written consent of Oracle. This document is not part of your license agreement nor can it be incorporated into any contractual agreement with Oracle or its subsidiaries or affiliates. This document is for informational purposes only and is intended solely to assist you in planning for the implementation and upgrade of the product features described. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions.
    [Show full text]
  • Big Data Integration
    BIG DATA INTEGRATION DATA SHEET VoltDB is an in-memory NewSQL operational database architected to act VoltDB is… on Fast Data streaming into applications at the rate of hundreds of thou- An in-memory, NewSQL sands to millions of events per second. Ideally suited as an operational da- Operational Database tabase to process Fast Data, VoltDB also has the ability to export data at for High-Performance, high speed to long-term analytics stores such as HP Vertica and Netezza, Fast Data Applications as well as Hadoop-based data warehouses. • Write and read millions of VoltDB Export continually and transactionally pushes data from VoltDB data events per second into another system, similar to an ETL (extract, transform, load) process. • In-memory performance and Unlike ETL, which pulls data and can stall when data changes at high rates on-disk persistence of speed, VoltDB Export uses a push model that exports data at the same • Relational, ACID-compliant rate at which it is ingested. SQL and JSON VoltDB Export lets application developers automate the export process by specifying certain tables in the schema as sources for export. At runtime, In-memory Analytics any data written to the specified tables is sent to the Export Connector, • Ability to make automated, which queues the data for export, then sends it to the selected output per-event decisions based target. VoltDB provides connectors for exporting to files, for exporting to on historical data other business processes, for exporting to Big Data’s Hadoop, for export- • Enables current data to be ing to a distributed message queue such as Kafka or RabbitMQ, or for factored into analytics exporting to other relational databases via JDBC.
    [Show full text]