A Survey of Cloud Database Systems

Total Page:16

File Type:pdf, Size:1020Kb

A Survey of Cloud Database Systems FEATURE: CLOUD COMPUTING A Survey of Cloud Database Systems Ganesh Chandra Deka, Ministry of Labor & Employment, Government of India This survey of 15 popular cloud databases provides an overview of each system and its storage platform, license type, and programming language used for writing the source code of the NoSQLs. It also considers features such as data-handling techniques and billing practices. he exponential growth of the Internet to meet the requirements of different user groups. has resulted in an explosion of data It’s not possible to discuss all of them here, so sources, creating storage and data-us- I selected 15 popular NoSQL databases that ability problems. Furthermore, an in- are representative (rather than inclusive), and I Tcrease in the number of data types has created review some of their interesting characteristics. challenges in terms of storing and manipulating unstructured data. Surveyed Systems These issues have lead companies and open NoSQL databases can be divided into two source communities to build new tools, known groups, depending on the elasticity level. The as NoSQL systems or “key-value-store” systems, first group contains truly elastic databases, such which aim to offer, on a massive scale, on- as MongoDB, which allows the addition of new demand services and simplified application nodes to a cluster without any observable down- development and deployment. NoSQL databases time for the clients. The second group contains are useful for applications that deal with very rigidly defined BigTable-based NoSQL databases large semistructured and unstructured data. (such as Cassandra and HBase), which have sig- The growing popularity of big data will compel nificant downtime when new nodes are added to many companies to use NoSQL databases1 the cluster. instead of traditional database, so you can expect Constant data availability when nodes are to see vendors offering simplified rollouts and added or removed from the cluster is made additional support for NoSQL solutions.2 possible by routing mechanisms and algorithms According to nosql-database.org, there are at that decide when to move data chunks that least 150 NoSQL databases, with various features are working together. For example, when data 50 IT Pro March/April 2014 Published by the IEEE Computer Society 1520-9202/14/$31.00 © 2014 IEEE itpro-16-02-deka.indd 50 07/03/14 3:00 PM must be moved to newly added nodes, during stored in tablets already looked for will be made the copying process, the data is served from the directly to the last level of the tree. original location. When the new node has an up- to-date version of the data, the routing processes HBase start to send requests to this node.1 HBase is an open source, distributed, versioned, column-oriented data store modeled after Google’s Cassandra BigTable. Basically, it’s a clone of BigTable, The Apache Cassandra database offers good providing a real-time, structured database on top scalability and high availability without com­­ of the Hadoop distributed file system. HBase is promising performance. Its demonstrated suitable for applications requiring random, real- fault-tolerance on commodity hardware (cloud time read/write access to big data. HBase’s goal infrastructures) and linear scalability make it is to host very large tables with billions of rows the ideal platform for mission-critical data. and millions of columns on top of clusters of Cassandra features allow replication across commodity hardware. multiple datacenters, offering lower latency HBase provides linear and modular scalability for data availability during regional outages. (ability of a database to handle a growing amount Cassandra’s ColumnFamily information model of data), consistent reads and writes, automatic offers the convenience of column indexes with and configurable sharding (a horizontal partition the performance of log-structured updates, in a database) of tables, and automatic failover strong support for materialize views (also known support between RegionServers. It also offers as snapshots), and powerful built-in caching. convenient base classes for backing Hadoop Netflix, Twitter, Urban Airship, Reddit, Cisco, MapReduce jobs with HBase tables, an easy-to- OpenX, Digg, CloudKick, and Ooyala are some use Java API for client access, block cache and of the companies that use Cassandra to deal Bloom Filters for real-time queries, as well as with huge, active, online interactive datasets. query-predicate pushdown via server-side filters. The largest known Cassandra cluster has over Finally, it has an extensible JRuby-based shell 300 Tbytes of information in over 400 machines and includes support via the Hadoop metrics (http://cassandra.apache.org). subsystem to files for exporting metrics via Java Management Extensions. BigTable Google’s BigTable maps two arbitrary string MongoDB values (row key and column key) and a time MongoDB is an open source, schema free, stamp (creating 3D mapping) into an associated document-oriented, scalable NoSQL database arbitrary byte array. BigTable can be characterized system. This high-performance, fault-tolerant, as a light, distributed, multidimensional sorted persistent system provides a complex query map. It was developed to scale to the petabyte language as well as an implementation of range among numerous machines to make it easy MapReduce. to add machines, automatically taking advantage MongoDB offers of those resources without any reconfiguration.3 When sizes threaten to grow beyond a specified • document-oriented storage—JavaScript Object limit, the tablets are compressed using the Notation (JSON)-style documents with dy- BMDiff 4,5 algorithm and the Zippy open source namic schemas offer simplicity and power; compression algorithm,6 publicly known as • full index support—that is, it can index any Snappy,5 which is a less space-optimal variation attribute; of the LZ77 algorithm but more efficient in terms • data availability—it can mirror across LANs of computing time (www.aosabook.org/en/posa/ and WANs for scalability; and infinispan.html). • autosharding—it scales horizontally without To get a specific row stored in BigTable, a new compromising functionality. client must connect to all levels of the tree, but the information obtained on the upper levels is It also supports rich, document-based queries; cached, meaning that further requests for data atomic modifiers for contention-free performance; computer.org/ITPro 51 itpro-16-02-deka.indd 51 07/03/14 3:00 PM FEATURE: CLOUD COMPUTING Requests Requests Requests Master copy of partition table/ tablet mapping reliability. Hypertable will be useful for organization that must administer rapidly evolving data support for online real-time applications. Modeled Tablet Routers controller after Google’s BigTable project, Hypertable is designed to manage information storage and processing on a large cluster of commodity servers, providing resilience to machine and component failures. CouchDB Tablets Apache CouchDB is a document- Tablet servers oriented database written using Erlang, a robust functional programming language ideal for building concurrent Figure 1. The Pnuts data storage architecture. Multiple applications distributed systems. CouchDB can be share this massive-scale centrally managed database system. queried and indexed using JavaScript in a MapReduce fashion. It also offers incremental replication with flexible aggregation and data processing; and bidirectional conflict detection and resolution. GridFS (a MongoDB file format for storing files It provides a RESTful JSON API that can be larger than 16 MB), so it can store files of any size accessed from any environment that allows without complicating your stack. HTTP requests. There are many third-party client libraries, making it easier to choose a programming Pnuts language. CouchDB’s built-in Web administration The Pnuts system is a massive-scale hosted, console speaks directly to the database using centrally managed database system shared by HTTP requests issued from the browser. multiple applications (see Figure 1). It supports Yahoo’s data-serving Web applications rather Voldemort than complex queries (such as offline analysis of Voldemort is a fully distributed key-value Web crawls).7 storage system. Each node is independent with Pnuts provides data management as a service. no central point of failure or coordination. This significantly reduces application development Voldemort is designed for use as a simple storage time, because developers don’t have to architect that’s fast enough to avoid needing a caching and implement their own scalable, reliable data- layer on top of it. The software architecture is management solutions.4 Consolidating multiple made of several layers, each of which implements applications into a single service lets users amortize the put, get, and delete operations. Each layer is operations costs over multiple applications and responsible for a specific function, such as apply the same best practices to the data man­­ TCP/IP communications, routing, or conflict agement of many different applications. Moreover, resolution. having a shared service means you can keep With Voldemort, data is automatically replicated resources (servers, disks, and so on) in reserve and over multiple servers as well as automatically quickly assign them to applications experiencing a partitioned, so each server contains only a sudden surge in popularity.6 subset of the total data. Server failure is handled transparently, and pluggable serialization
Recommended publications
  • Distributed, Concurrent, and Independent Access to Encrypted Cloud Databases
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, VOL. 25, NO. 2, FEBRUARY 2014 437 Distributed, Concurrent, and Independent Access to Encrypted Cloud Databases Luca Ferretti, Michele Colajanni, and Mirco Marchetti Abstract—Placing critical data in the hands of a cloud provider should come with the guarantee of security and availability for data at rest, in motion, and in use. Several alternatives exist for storage services, while data confidentiality solutions for the database as a service paradigm are still immature. We propose a novel architecture that integrates cloud database services with data confidentiality and the possibility of executing concurrent operations on encrypted data. This is the first solution supporting geographically distributed clients to connect directly to an encrypted cloud database, and to execute concurrent and independent operations including those modifying the database structure. The proposed architecture has the further advantage of eliminating intermediate proxies that limit the elasticity, availability, and scalability properties that are intrinsic in cloud-based solutions. The efficacy of the proposed architecture is evaluated through theoretical analyses and extensive experimental results based on a prototype implementation subject to the TPC-C standard benchmark for different numbers of clients and network latencies. Index Terms—Cloud, security, confidentiality, SecureDBaaS, database Ç 1INTRODUCTION N a cloud context, where critical information is placed in SecureDBaaS that supports the execution
    [Show full text]
  • Database Security in the Cloud
    Database security in the cloud I M A L S A K H I Examensarbete inom Datateknik Grundnivå, 15 hp 1 Stockholm 2012 Detta examensarbete har utförts i samarbete med Försvarsmakten Handledare på Försvarsmakten: Ingvar Ståhl Databassäkerhet i molnet Database security in the cloud I m a l S a k h i Examensarbete inom Datorteknik Grundnivå, 15 hp Handledare på KTH: Magnus Brenning Examinator: Thomas Lindh Skolan för teknik och hälsa TRITA-STH 2012:51 Kungliga Tekniska Högskolan Skolan för teknik och hälsa 136 40 Handen, Sweden http://www.kth.se/sth ii Abstract The aim of the thesis is to get an overview of the database services available in cloud computing environment, investigate the security risks associated with it and propose the possible countermeasures to minimize the risks. The thesis also analyzes two cloud database service providers namely; Amazon RDS and Xeround. The reason behind choosing these two providers is because they are currently amongst the leading cloud database providers and both provide relational cloud databases which makes the comparison useful. The focus of the analysis has been to provide an overview of their database services as well as the available security measurements. A guide has been appended at the end of the report to help with technical configurations of database migration and connecting applications to the databases for the two mentioned cloud database providers. The thesis has been conducted on behalf of the Swedish Armed Forces and after reviewing the security risks associated with cloud databases, it is recommended that the Armed Forces should refrain from public cloud database services.
    [Show full text]
  • 6-Th INTERNATIONAL SCIENTIFIC CONFERENCE
    International Conference on Applied Information and Communication Technologies (AICT2013), 25.-26. April, 2013, Jelgava, Latvia Different solutions of MySQL in the cloud – security and possibilities Zigmunds Bulins, Vjaceslavs Sitikovs Institute of Applied Software, Riga Technical University, Meza 1/3, Riga, LV-1048, Latvia [email protected], [email protected] Abstract: Cloud computing is a good way to raise productivity of offered service without investments into new infrastructure, training of the personnel or software acquisition. This technology expands potential possibilities of existing information systems. In recent years cloud computing grew from good business concept to one of the most demanded industry in information technologies. The paper contains a short review of different cloud database providers which uses MySQL as a basis. Technical nuances, potential problems and risks related to migration of the existing MySQL databases to the new environment are reviewed. In the paper we try to review the actual possibilities of the new platform and compare the cloud DBaaS (database as a service) solutions which are implemented with MySQL database management system widely used in the web. Keywords: MySQL, DBaaS, Heroku, Google Cloud SQL, ClearDB. Introduction Cloud computing is a dynamic method of increasing productivity of service or possibilities without investments in new infrastructure, training of the personnel or software licensing. This expands possibilities of existing information systems. In recent years cloud computing grew from good concept business to one of the most quickly developing industries of information technologies (Chandra, Mondal, 2011). In the last two years there was quite a lot of activity around the cloud databases on the stage – Google, Amazon and Xeround (Xeround, 2012a) companies announced their DBaaS based on the MySQL database.
    [Show full text]
  • OVERVIEW of DATABASES AS SERVICE PROVIDING SYSTEMS on the BASE of XEROUND, AMAZON RDS and SQL AZURE Yeleu Y.A
    Section VII: Informatics and Control in Engineering Systems CLOUD TECHNOLOGY: OVERVIEW OF DATABASES AS SERVICE PROVIDING SYSTEMS ON THE BASE OF XEROUND, AMAZON RDS AND SQL AZURE Yeleu Y.A. Scientific supervisor: Botygin I.A., associate professor Language supervisor: Pichugova I.L., senior teacher Tomsk Polytechnic University, 30, Lenin Avenue, Tomsk, 634050, Russia E-mail: [email protected] Cloud computing is the delivery provisioning often appear to be unlimited and can of computing as a service rather than a product, be appropriated in any quantity at any time. whereby shared resources, software, and 3. On-demand self-service. A consumer can information are provided to computers and other unilaterally get computing capabilities, such as devices as a metered service over server time and network storage, as needed a network (typically the Internet). [1] Cloud automatically without requiring human interaction computing is a model for delivering information with each service provider. technology services where resources are retrieved 4. Broad network access. Capabilities are available from the Internet through web-based tools and over the network and accessed through standard applications, rather than a direct connection to a mechanisms that promote use by heterogeneous server. Data and software packages are stored in thin or thick client platforms (e.g., mobile phones, servers. However, cloud computing structure tablets, laptops, and workstations). allows access to information as long as an 5. Economy of the process.Measured service is electronic device has access to the web. This type used in cloud technology. Cloud systems of system allows employees to work remotely. [2] automatically control and optimize resource use by There is a more common definition.
    [Show full text]
  • © 2011 by the 451 Group. All Rights Reserved Nosql, Newsql and Beyond Open Source Driving Innovation in Distributed Data Management
    © 2011 by The 451 Group. All rights reserved NoSQL, NewSQL and Beyond Open Source Driving Innovation in Distributed Data Management Matt hew As lett, The 451 Group [email protected] © 2011 by The 451 Group. All rights reserved Overview – Open source impact in the database market – NoSQL and NewSQL databases • Adoption/development drivers • Use cases – And beyond • Data grid/cache • Cloud databases • Total data © 2011 by The 451 Group. All rights reserved 451 Research is focused on the business of enterprise IT innovation. The company’s analysts provide critical and timely insight into the competitive dynamics of innovation in emerging technology segments. The 451 Group Tier1 Research is a single‐source research and advisory firm covering the multi‐tenant datacenter, hosting, IT and cloud‐computing sectors, blending the best of industry and financial research. The Uptime Institute is ‘The Global Data Center Authority’ and a pioneer in the creation and facilitation of end‐user knowledge communities to improve reliability and uninterruptible availability in datacenter facilities. TheInfoPro is a leading IT advisory and research firm that provides real‐world ppperspectives on the customer and market dynamics of the TOTAL DATA enterprise information technology landscape, harnessing the collective knowledge and insight of leading IT organizations worldwide. ChangeWave Research is a research firm that identifies and quantifies ‘change’ in consumer spending behavior, corporate purchasing, and industry, company and technology trends. © 2011 by The 451 Group. All rights reserved Coverage areas – Senior analyst, enterprise software – Commercial Adoption of Open Source (CAOS) • Adoption by enterprises • Adoption by vendors – Information Management • Databases • Data warehousing • Data caching © 2011 by The 451 Group.
    [Show full text]
  • Fluidcloud: an Open Framework for Relocation of Cloud Services
    FluidCloud: An Open Framework for Relocation of Cloud Services Andy Edmonds Thijs Metsch Zurcher¨ Hochschule fur¨ Angewandte Wissenschaften Intel Ireland Limited Dana Petcu Erik Elmroth Jamie Marshall Institute e-Austria Timisoara Umea˚ University Prologue Plamen Ganchosov CloudSigma Abstract and automated. From this work a number of research and engineering challenges arise including data optimisation, Cloud computing delivers new levels of being connected, runtime architecture adaptation, and goal-oriented ser- instead of the once disconnected PC-type systems. The vice instance relocation. proposal in this paper extends that level of connectedness in the cloud such that cloud service instances, hosted by providers, can relocate between clouds. This is key in 2 A Problem in the Cloud? order to provide economical and regulatory benefits but more importantly liberation and positive market disrup- Cloud service instances remain locked under the control tion. of the service provider. FluidCloud will liberate these in- While service providers want to lock in their cus- stances. Having the ability for a cloud service instance to tomer’s services, FluidCloud wants the liberation of easily and seamlessly move from one provider to another those and thereby allow the service owner to freely will bring advantages and freedom to any cloud service choose the best matching provider at any time. In the owner. Essentially, it will bring service instance “move- cloud world of competing cloud standards and software ment rights” to the cloud. However there is no encom- solutions, each only partially complete, the central re- passing means to accomplish this. search question which this paper intends to answer: How FluidCloud fits within the soon future cloud.
    [Show full text]
  • Effect of Information Flow Analysis in Different Cloud Computing Environments
    IPASJ International Journal of Computer Science (IIJCS) Web Site: http://www.ipasj.org/IIJCS/IIJCS.htm A Publisher for Research Motivation ........ Email: [email protected] Volume 4, Issue 9, September 2016 ISSN 2321-5992 Effect of Information Flow Analysis in different Cloud Computing Environments Mr. Ravindra Sharma Chandigarh College of Architecture, Chandigarh ABSTRACT Cloud computing provides variety of benefits to deliver services over the net. The services area unit categorised into 3 distinct environments Software-as-a-Service, Platform-as-aService, and Infrastructure-as-a-Service. Now-a-days, a replacement rising service known as Database-as-aService that could be a a part of Software-as-a-Service. This service is outstanding for database- driven applications. This paper lists necessary parameters of information. As a result, the performance analysis of information as MySQL varies considerably counting on the various cloud infrastructure like Amazon EC2 and Joyent Cloud. 1. INTRODUCTION Cloud computing is technology that uses net and management remote server to keep up information and application. rather than putting in set of computer code for every pc, you wish to load your application and use it while not installation when and access personal files at any pc with the assistance of net. Cloud computing is assessed into 3 services: Infrastructure-as- a-Service, Platform-as-a Service and Software-as-a-Service even Database-as-a-Service (DBaaS) is enclosed into Softwareas-a-Service. This technology provides America economical computing by centralized storage, memory, process and information measure. 2. AMAZON ELASTIC CLOUD FIGURE EC2 Amazon Cloud EC2 provides cloud computing answer on the idea of as per usage.
    [Show full text]
  • A Survey on Recent Developments in Cloud Based Systems
    [TYPE TEXT] Recent Developments in Cloud Based Systems: State of Art Mansaf Alam a and Kashish Ara Shakil b a, b Department of Computer Science , Jamia Millia Islamia, New Delhi 110025 b Corresponding Author Abstract— Cloud computing is the latest buzzword in the head of techies round the clock these days. The importance and the different applications of cloud computing are overwhelming and thus, it is a topic of huge significance. It provides several astounding features like Multitenancy, on demand service, pay per use etc. This manuscript presents an exhaustive survey on cloud computing technology and attempts to cover most of the developments that have taken place in the field of cloud computing. It discusses about the various available cloud computing platforms, Security in cloud, reference architectures for cloud and storage of data in cloud computing. Furthermore it gives an insight into the recent developments in cloud environment with the help of use cases and the author’s perspective about the future of the respective use cases. Finally it concludes by discussing the limitations of data management in cloud and potential research issues in cloud computing that needs to be addressed in future along with a proposed architecture for cloud. Index Terms—Cloud computing, CDBMS, database as a service, cloud platforms —————————— —————————— 1 INTRODUCTION Cloud computing is a newly emerging technology for the future with it roots based on the rapidly increasing demands on data centers that needs to be catered to. Cloud computing is defined as the use of computing resources to access data over the internet. It is a means or a mechanism to enhance the existing capabilities of Information technology by many folds [120].The terminology cloud comes from the fact that the data is not stored on your desktop or your device but is located far away similar to a cloud in literal terms, but despite of it being away its within your reach, you can access it irrespective of your geographical location using a computing device via an internet.
    [Show full text]
  • Mysql in the Hosted Cloud Scale13x
    MySQL in the Hosted Cloud Colin Charles, MariaDB Corporation Ab [email protected] | [email protected] http://mariadb.com/ | http://mariadb.org/ http://bytebot.net/blog/ | @bytebot on Twitter SCALE13x, Los Angeles, CA, USA 21 February 2015 1 whoami • Work on MariaDB at MariaDB Corporation (SkySQL Ab) • Merged with Monty Program Ab, makers of MariaDB • Formerly MySQL AB (exit: Sun Microsystems) • Past lives include Fedora Project (FESCO), OpenOffice.org 2 Agenda • MySQL as a service offering (DBaaS) • Choices • Considerations • MySQL versions & access • Costs • Deeper into RDS • Should you run this on EC2 or an equivalent? • Conclusion 3 MySQL as a service • Database as a Service (DBaaS) • MySQL available on-demand, without any installation/configuration of hardware/ software • Pay-per-usage based • Provider maintains MySQL, you don’t maintain, upgrade, or administer the database 4 New way of deployment • Enter a credit card number • call API (or use the GUI) ec2-run-instances ami- xxx -k ${EC2_KEYPAIR} -t m1.large nova boot --image centos6-x86_64 --flavor m1.large db1 credit: http://www.flickr.com/photos/68751915@N05/6280507539/ 5 Why DBaaS? • “Couldn’t we just have a few more servers to handle the traffic spike during the elections?” • Don’t have a lot of DBAs, optimise for operational ease • Rapid deployment & scale-out 6 Your choices today • Amazon Web Services Relational Database Service (RDS) • Rackspace Cloud Databases • Google Cloud SQL • HP Helion Public Cloud Relational DB 7 There are more • Jelastic - PaaS offering MySQL, MariaDB •
    [Show full text]
  • Effective Mysql Replication Techniques in Depth
    Oracle_Flash / Effective MySQL: Replication Techniques in Depth / Bradford / 186-8 ® Effective MySQL Replication Techniques in Depth 00-FM.indd 1 9/6/12 6:20 PM Oracle_Flash / Effective MySQL: Replication Techniques in Depth / Bradford / 186-8 About the Authors Ronald Bradford has worked in the relational database industry for over 20 years. His professional background began in 1989 with Ingres and Oracle RDBMS. His expertise includes a vast experience with database architec- ture, performance tuning, and management of large enterprise systems. Ronald has, for the past 13 years, worked primarily with MySQL, the world’s most popular open source database. His previous employment has includ- ed Oracle Corporation (1996–1999) as an Oracle Consultant and MySQL, Inc. (2006–2008) as a senior MySQL Consultant. His contributions to the MySQL community have included recognition as the all-time top individual MySQL blog contributor at Planet MySQL (2010), an Oracle ACE Director (2010), and MySQL Community Member of the Year (2009). Ronald combines his extensive consulting expertise with a passion to share the knowledge and benefits of using MySQL. He is the author of four books on MySQL, and his many public speaking engagements have in- cluded presentations at conferences in over 25 countries. The Effective MySQL series of books and presentations aim to provide practical educa- tion for DBAs, developers, and architects in MySQL performance, scalabil- ity, and business continuity. Chris Schneider is a long-time open source advocate and MySQL evange- list, administrator, and architect. Over the past decade Chris has devoted his professional career to open source technologies, with his primary focus being MySQL at scale.
    [Show full text]
  • Cloud Data Warehousing for Smes
    Cloud Data Warehousing for SMEs Sérgio Fernandes1 and Jorge Bernardino1,2 1ISEC – Superior Institute of Engineering of Coimbra, Polytechnic Institute of Coimbra, 3030-190 Coimbra, Portugal 2CISUC – Centre of Informatics and Systems of the University of Coimbra, University of Coimbra, 3030-290 Coimbra, Portugal Keywords: Database as a Service, Cloud Computing, Cloud Databases, Database Management System, DBaaS, DWaaS. Abstract: The emergence of cloud computing caused a revolution in the universe of Information Technology. With cloud computing solutions it is possible to access powerful features, hardware and software in less time and with considerably lower costs by using the model "pay-as-you-go". At the same time, this turnover increased information, and data warehouses must respond to this new reality. Small and medium enterprises (SMEs) were deprived of owning a traditional data warehouse due to the costs involved, but the cloud has made it possible to overcome this barrier. This paper provides an overview of Data Warehouse (DW) in the cloud and presents the main characteristics of the following solutions: Amazon Redshift, IBM dashDB, Snowflake, Teradata Active Data Warehouse Private Cloud, Treasure Data, and Microsoft Azure. 1 INTRODUCTION Cloud computing is presented as a medium that allows easy acquisition of computational resources, In recent years, the significant growth in data scalable infrastructure with minimal setup and it is a volume and its complexity makes organizations look service at a lower cost, using the model "pay-as-you- for ways to keep the focus on their business instead go" (Malliga, 2012). Many organizations have of in their IT infrastructure. Advances in chosen to implement or migrate their systems, infrastructure and technology have led companies to processes and data in order to reap the benefits of trust the cloud model, including Data Warehouses elasticity that this environment provides, so that IT (DW).
    [Show full text]
  • A Practical Guide to Big Data: Opportunities, Challenges & Tools 2 © 2012 Dassault Systèmes About EXALEAD
    A PRACTICAL GUIDE TO BIG DaTA Opportunities, Challenges & Tools 3DS.COM/EXALEAD “Give me a lever long enough and a fulcrum on which to place it, and I shall move the world.” 1 Archimedes ABOUT THE AUTHOR Laura Wilber is the former founder and CEO of California-based AVENCOM, Inc., a software development company specializing in online databases and database-driven Internet applications (acquired by Red Door Interactive in 2004), and she served as VP of Marketing for Kintera, Inc., a provider of SaaS software to the nonprofit and government sectors. She also developed courtroom tutorials for technology-related intellectual property litigation for Legal Arts Multimedia, LLC. Ms. Wilber earned an M.A. from the University of Maryland, where she was also a candidate in the PhD program, before joining the federal systems engineering division of Bell Atlantic (now Verizon) in Washington, DC. Ms. Wilber currently works as solutions ana- lyst at EXALEAD. Prior to joining EXALEAD, Ms. Wilber taught Business Process Reengineering, Management of Informa- tion Systems and E-Commerce at ISG (l’Institut Supérieur de Gestion) in Paris. She and her EXALEAD colleague Gregory Grefenstette recently co-authored Search-Based Applications: At the Confluence of Search and Database Technologies, published in 2011 by Morgan & Claypool Publishers. A Practical Guide to Big Data: Opportunities, Challenges & Tools 2 © 2012 Dassault Systèmes ABOUT EXALEAD Founded in 2000 by search engine pioneers, EXALEAD® is the leading Search-Based Application platform provider to business and government. EXALEAD’s worldwide client base includes leading companies such as PricewaterhouseCooper, ViaMichelin, GEFCO, the World Bank and Sanofi Aventis R&D, and more than 100 million unique users a month use EXALEAD’s technology for search and information access.
    [Show full text]