Succinct: Enabling Queries on Compressed Data

Rachit Agarwal, Anurag Khandelwal, and Ion Stoica
UC Berkeley

https://www.usenix.org/conference/nsdi15/technical-sessions/presentation/agarwal

In Proceedings of the 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI '15), May 4–6, 2015, Oakland, CA, USA.

Abstract

Succinct is a data store that enables efficient queries directly on a compressed representation of the input data. Succinct uses a compression technique that allows random access into the input, thus enabling efficient storage and retrieval of data. In addition, Succinct natively supports a wide range of queries including count and search of arbitrary strings, range and wildcard queries. What differentiates Succinct from previous techniques is that Succinct supports these queries without storing indexes — all the required information is embedded within the compressed representation.

Evaluation on real-world datasets shows that Succinct requires an order of magnitude lower memory than systems with similar functionality. Succinct thus pushes more data in memory, and provides low query latency for a larger range of input sizes than existing systems.

1 Introduction

High-performance data stores, e.g. document stores [1,6], key-value stores [5,9,23,24,26,38,39,43] and multi-attribute NoSQL stores [3,19,21,25,35,48], are the bedrock of modern cloud services. While existing data stores provide efficient abstractions for storing and retrieving data using primary keys, interactive queries on values (or, secondary attributes) remain a challenge.

To support queries on secondary attributes, existing data stores can use two main techniques. At one extreme, systems such as column-oriented stores simply scan the data [10,36]. However, data scans incur high latency for large data sizes, and have limited throughput since queries typically touch all machines¹. At the other extreme, one can construct indexes on queried attributes [3,6,35]. When stored in-memory, these indexes are not only fast, but can achieve high throughput since it is possible to execute each query on a single machine. The main disadvantage of indexes is their high memory footprint. Evaluation of popular open-source data stores [6,35] using real-world datasets (§6) shows that indexes can be as much as 8× larger than the input data size. Traditional compression techniques can reduce the memory footprint but suffer from degraded throughput since data needs to be decompressed even for simple queries. Thus, existing data stores either resort to using complex memory management techniques for identifying and caching "hot" data [5,6,26,35] or simply execute queries off-disk or off-SSD [25]. In either case, latency and throughput advantages of indexes drop compared to in-memory query execution.

¹Most data stores shard data by rows, and one needs to scan all rows. Even if data is sharded by columns, one needs to touch multiple machines to construct the row(s) in the query result.

We present Succinct, a distributed data store that operates at a new point in the design space: memory efficiency close to data scans and latency close to indexes. Succinct queries on secondary attributes, however, touch all machines; thus, Succinct may achieve lower throughput than indexes when the latter fit in memory. However, due to its low memory footprint, Succinct is able to store more data in memory, avoiding latency and throughput degradation due to off-disk or off-SSD query execution for a much larger range of input sizes than systems that use indexes.

Succinct achieves the above using two key ideas. First, Succinct stores an entropy-compressed representation of the input data that allows random access, enabling efficient storage and retrieval of data. Succinct's data representation natively supports count, search, range and wildcard queries without storing indexes — all the required information is embedded within this compressed representation. Second, Succinct executes queries directly on the compressed representation, avoiding data scans and decompression. What makes Succinct a unique system is that it not only stores a compressed representation of the input data, but also provides functionality similar to systems that use indexes along with the input data.

Specifically, Succinct makes three contributions:

• Enables efficient queries directly on a compressed representation of the input data. Succinct achieves this using (1) a new data structure, in addition to adapting data structures from theory literature [32,44–46], to compress the input data; and (2) a new query algorithm that executes random access, count, search, range and wildcard queries directly on the compressed representation (§3). In addition, Succinct provides applications the flexibility to tradeoff memory for faster queries and vice versa (§4).

• Efficiently supports data appends by chaining multiple stores, each making a different tradeoff between write, query and memory efficiency (§4): (1) a small log-structured store optimized for fine-grained appends; (2) an intermediate store optimized for query efficiency while supporting bulk appends; and (3) an immutable store that stores most of the data, and optimizes memory using Succinct's data representation.

• Exposes a minimal, yet powerful, API that operates on flat unstructured files (§2). Using this simple API, we have implemented many powerful abstractions for semi-structured data on top of Succinct including document store (e.g., MongoDB [6]), key-value store (e.g., [23]), and multi-attribute NoSQL store (e.g., Cassandra [35]), enabling efficient queries on both primary and secondary attributes.

We evaluate Succinct against MongoDB [6], Cassandra [35], HyperDex [25] and DB-X, an industrial columnar store that supports queries via data scans. Evaluation results show that Succinct requires 10–11× lower memory than data stores that use indexes, while providing similar or stronger functionality. In comparison to traditional compression techniques, Succinct's data representation achieves lower decompression throughput but supports point queries directly on the compressed representation. By pushing more data in memory and by executing queries directly on the compressed representation, Succinct achieves dramatically lower latency and higher throughput (sometimes an order of magnitude or more) compared to the above systems even for moderate size datasets.

2 Succinct Interface

    f = compress(file)
    append(f, buffer)
    buffer = extract(f, offset, len)
    cnt = count(f, str)
    [offset1, ...] = search(f, str)
    [offset1, ...] = rangesearch(f, str1, str2)
    [[offset1, len1], ...] = wildcardsearch(f, prefix, suffix, dist)

Figure 1: Interface exposed by Succinct (see §2).

Succinct exposes a simple interface for storing, retrieving and querying flat (unstructured) files; see Figure 1. We show in §2.1 that this simple interface already allows us to model many powerful abstractions including MongoDB [6], Cassandra [35] and BigTable [19], enabling efficient queries on semi-structured data.

The application submits and compresses a flat file using compress; once compressed, it can invoke a set of powerful primitives directly on the compressed file. In particular, the application can append new data using append, can perform random access using extract that returns an uncompressed buffer starting at an arbitrary offset in the original file, and count number of occurrences of any arbitrary string using count.

Arguably, the most powerful operation provided by Succinct is search, which takes as an argument an arbitrary string (i.e., not necessarily word-based) and returns offsets of all occurrences in the uncompressed file. For example, if file contains abbcdeabczabgz, invoking search(f, "ab") will return offsets [0, 6, 10]. While search returns an array of offsets, we provide a convenient iterator interface in our implementation. What makes Succinct unique is that search not only runs on the compressed representation but is also efficient, that is, does not require scanning the file.

Succinct provides two other search functions, again on arbitrary input strings. First, rangesearch returns the offsets of all strings between str1 and str2 in lexicographical order. Second, wildcardsearch(f, prefix, suffix, dist) returns an array of tuples. A tuple contains the offset and the length of a string with the given prefix and suffix, and whose distance between the prefix and suffix does not exceed dist, measured in number of input characters. Suppose again that file f contains abbcdeabczabgz; then wildcardsearch(f, "ab", "z", 2) will return tuples [6, 9] for abcz, and [10, 13] for abgz. Note that we do not return the tuple corresponding to abbcdeabcz as the distance between the prefix and suffix of this string is greater than 2.
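The snippet below is a plain-Python reference for the semantics of search and wildcardsearch from Figure 1, run on the uncompressed example string and following the (start, end) offsets of the worked example; it is only meant to pin down what the queries return, since Succinct itself answers them on the compressed representation without scanning the file.

    def search(data, s):
        """Offsets of all occurrences of s in data (brute force, for illustration only)."""
        hits, i = [], data.find(s)
        while i != -1:
            hits.append(i)
            i = data.find(s, i + 1)
        return hits

    def wildcardsearch(data, prefix, suffix, dist):
        """(start, end) offsets of substrings prefix...suffix whose gap is at most dist characters."""
        out = []
        for p in search(data, prefix):
            for q in search(data, suffix):
                gap = q - (p + len(prefix))          # characters between prefix end and suffix start
                if 0 <= gap <= dist:
                    out.append((p, q + len(suffix) - 1))
        return out

    data = "abbcdeabczabgz"
    print(search(data, "ab"))                        # [0, 6, 10]
    print(wildcardsearch(data, "ab", "z", 2))        # [(6, 9), (10, 13)], as in the text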
2.1 Extensions for semi-structured data

Consider a logical collection of records of the form (key, avpList), where key is a unique identifier, and avpList is a list of attribute value pairs, i.e., avpList = ((attrName1, value1), ... (attrNameN, valueN)). To enable queries using the Succinct API, we encode avpList within Succinct's data representation; see Figure 2. Specifically, we transform the semi-structured data into a flat file with each attribute value separated by a delimiter unique to that attribute. In addition, Succinct internally stores a mapping from each attribute to the corresponding delimiter, and a mapping from key to offset into the flat file where the corresponding avpList is encoded.

Figure 2: Succinct supports queries on semi-structured data by transforming the input data into flat files (see §2.1). In the example, attributes A1, A2, A3 map to delimiters ⋆, •, †, each record ends with an end-of-record delimiter (>), and key → offset pointers locate each encoded record (e.g., key1's record becomes V11⋆V12•V13†>).
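To make Figure 2 concrete, here is a minimal sketch of the flattening step, assuming one unique delimiter per attribute plus an end-of-record delimiter; the constants and helper names (DELIMS, END, encode) are illustrative, not the prototype's actual identifiers.

    DELIMS = {"A1": "\x01", "A2": "\x02", "A3": "\x03"}   # attribute -> delimiter (assumed)
    END = "\x04"                                          # end-of-record delimiter (assumed)

    def encode(records):
        """records: key -> {attribute: value}. Returns (flat file, key -> offset map)."""
        flat, key_to_offset, pos = [], {}, 0
        for key, avp in records.items():
            key_to_offset[key] = pos
            rec = "".join(avp[a] + d for a, d in DELIMS.items()) + END
            flat.append(rec)
            pos += len(rec)
        return "".join(flat), key_to_offset

    records = {"key1": {"A1": "V11", "A2": "V12", "A3": "V13"},
               "key2": {"A1": "V21", "A2": "V22", "A3": "V23"}}
    flat, offsets = encode(records)
    # A query on attribute A2 for value "V22" becomes search(flat, "V22" + DELIMS["A2"]);
    # the matching offsets are mapped back to record keys via the key -> offset pointers.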

Succinct executes get queries using the extract API along with the key → offset pointers, and put queries using the append API. The delete queries are executed lazily, similar to [8,10], using one explicit bit per record which is set upon record deletion; subsequent queries ignore records with the set bit. Applications can also query individual attributes; for instance, a search for string val along attribute A2 is executed as search(val•) using the Succinct API, and returns every key whose associated attribute A2 value matches val.

Flexible schema, record sizes and data types. Succinct, by mapping semi-structured data into a flat file and by using delimiters, does not impose any restriction on avpList. Indeed, Succinct supports single-attribute records (e.g., Dynamo [23]), multiple-attribute records (e.g., BigTable [19]), and even a collection of records with varying number of attributes. Moreover, using its key → offset pointers, Succinct supports the realistic case of records varying from a few bytes to a few kilobytes [17]. Succinct currently supports primitive data types (strings, integers, floats), and can be extended to support a variety of data structures and data types including composite types (arrays, lists, sets). See [16] for a detailed discussion.

3 Querying on Compressed Data

We describe the core techniques used in Succinct. We briefly recall techniques from theory literature that Succinct uses, followed by Succinct's entropy-compressed representation (§3.1) and a new algorithm that operates directly on the compressed representation (§3.2).

Existing techniques. Classical search techniques are usually based on tries or suffix trees [13,47]. While fast, even their optimized representations can require 10–20× more memory than the input size [33,34]. Burrows-Wheeler Transform (BWT) [18] and suffix arrays [12,40] are two memory-efficient alternatives, but still require 5× more memory than the input size [33]. FM-indexes [27–30] and Compressed Suffix Arrays [31,32,44–46] use compressed representations of BWT and suffix arrays, respectively, to further reduce the space requirement. Succinct adapts compressed suffix arrays due to their simplicity and relatively better performance for large datasets. We describe the basic idea behind Compressed Suffix Arrays.

Let Array of Suffixes (AoS) be an array containing all suffixes in the input file in lexicographically sorted order. AoS along with two other arrays, AoS2Input and Input2AoS², is sufficient to implement the search and the random access functionality without storing the input file. This is illustrated in Figure 3 and Figure 4. Note that for a file with n characters, AoS has size O(n²) bits, while AoS2Input and Input2AoS have size n⌈log n⌉ bits since the latter two store integers in range 0 to n−1. The space for AoS, AoS2Input and Input2AoS is reduced by storing only a subset of values; the remaining values are computed on the fly using a set of pointers, stored in the NextCharIdx array, as illustrated in Figure 5, Figure 6 and Figure 7, respectively.

²AoS2Input and Input2AoS, in this paper, are used as convenient names for the Suffix Array and the Inverse Suffix Array, respectively.

Figure 3: An example for input file banana$. AoS stores suffixes in the input in lexicographically sorted order. (a) AoS2Input maps each suffix in AoS to its location in the input (solid arrows). (b) Illustration of search using AoS and AoS2Input (dashed arrows); search("an") returns {1, 3}. Suffixes being sorted, AoS allows binary search to find the smallest AoS index whose suffix starts with the searched string (in this case "an"); the largest such index is found using another binary search. The result on the original input is shown on the right to aid illustration.
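The following is a toy, uncompressed version of the search in Figure 3: it materializes AoS and AoS2Input for banana$ and uses two binary searches over the sorted suffixes to find every input offset where the searched string occurs. Succinct never stores AoS explicitly like this; the sketch only illustrates the algorithm (and assumes the input alphabet stays below the \xff sentinel used to bound the upper binary search).

    import bisect

    text = "banana$"
    aos2input = sorted(range(len(text)), key=lambda i: text[i:])   # suffix array: [6, 5, 3, 1, 0, 4, 2]
    aos = [text[i:] for i in aos2input]                            # lexicographically sorted suffixes

    def sa_search(s):
        lo = bisect.bisect_left(aos, s)              # smallest AoS index whose suffix starts with s
        hi = bisect.bisect_right(aos, s + "\xff")    # one past the largest such AoS index
        return sorted(aos2input[lo:hi])              # map AoS indexes back to input offsets

    print(sa_search("an"))   # [1, 3], matching Figure 3(b)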


Figure 4: (a) The Input2AoS provides the inverse mapping of AoS2Input, from each index in the input to the index of the corresponding suffix in AoS (solid arrows). (b) Illustration of extract using AoS and Input2AoS (dashed arrows); extract(0, 3) returns "ban". The result on the original input is shown on the right to aid illustration.

Figure 5: Reducing the space usage of AoS: NextCharIdx stores pointers from each suffix S to the suffix S′ obtained after removing the first character from S. (a) For each suffix in AoS, only the first character is stored. NextCharIdx pointers allow one to reconstruct the suffix at any AoS index. For instance, starting from AoS[4] and following pointers, we get the original AoS entry "banana$". (b) Since suffixes are sorted, only the first AoS index at which each character occurs (e.g., {($,0), (a,1), (b,4), (n,5)}) need be stored; a binary search can be used to locate the character at any index.

The NextCharIdx array is compressed using a two-dimensional representation; see Figure 8. Specifically, the NextCharIdx values in each column of the two-dimensional representation constitute an increasing sequence of integers. Each column can hence be independently compressed using delta encoding [2,7,11].

3.1 Succinct data representation

Succinct uses the above data representation with three main differences. We give a high-level description of these differences; see [16] for a detailed discussion.

First, Succinct uses a more space-efficient representation of AoS2Input and Input2AoS by using a sampling by "value" strategy. In particular, for sampling rate α, rather than storing values at "indexes" {0, α, 2α, ...} as in Figure 6 and Figure 7, Succinct stores all AoS2Input values that are a multiple of α. This allows storing each sampled value val as val/α, leading to a more space-efficient representation. Using α = 2 for the example of Figure 6, for instance, the sampled AoS2Input values are {6,0,4,2}, which can be stored as {3,0,2,1}. Sampled Input2AoS then becomes {1,3,2,0}, with the i-th value being the index into sampled AoS2Input where i is stored. Succinct stores a small amount of additional information to locate sampled AoS2Input indexes.

Second, Succinct achieves a more space-efficient representation for NextCharIdx using the fact that values in each row of the two-dimensional representation constitute a contiguous sequence of integers. Succinct uses its own Skewed Wavelet Tree data structure, based on Wavelet Trees [32,44], to compress each row independently. Skewed Wavelet Trees allow looking up the NextCharIdx value at any index without any decompression. The data structure and lookup algorithm are described in detail in [16]. These ideas allow Succinct to achieve a 1.25–3× more space-efficient representation compared to existing techniques [7,11,31].

Finally, for semi-structured data, Succinct supports dictionary encoding along each attribute to further reduce the memory footprint. This is essentially orthogonal to Succinct's own compression; in particular, Succinct dictionary encodes the data along each attribute before constructing its own data structures.

3.2 Queries on compressed data

Succinct executes queries directly on the compressed representation from §3.1. We describe the query algorithm assuming access to uncompressed data structures; as discussed earlier, any value not stored in the compressed representation can be computed on the fly. Succinct executes an extract query as illustrated in Figure 7 on the Input2AoS representation from §3.1.

Figure 6: Reducing the space usage of AoS2Input. (left) Since AoS2Input stores locations of suffixes, NextCharIdx maps AoS2Input values to the next larger value. That is, NextCharIdx[idx] stores the AoS2Input index that stores AoS2Input[idx]+1⁵; (right) only a few sampled values need be stored; unsampled values can be computed on the fly. For instance, starting at AoS2Input[5] and following pointers twice, we get the next larger sampled value 6. Since each pointer increases the value by 1, the desired value is 6−2 = 4.

⁵Proof: Let S be a suffix and S′ be the suffix after removing the first character from S. If S starts at location loc, then S′ starts at loc+1. NextCharIdx stores pointers from S to S′. Since AoS2Input stores locations of suffixes in the input, NextCharIdx maps value loc in AoS2Input to the AoS2Input index that stores the next larger value (loc+1).

Figure 7: Reducing the space usage of Input2AoS. (a) Only a few sampled values need be stored; (b) the extract functionality of Figure 4 is achieved using sampled values and NextCharIdx. For instance, to execute extract(3, 3), we find the next smaller sampled index (Input2AoS[2]) and the corresponding suffix (AoS[2] = "nana$"). We then remove the first character since the difference between the desired index and the closest sampled index was 1; hence the result "ana$".

Figure 8: Two-dimensional NextCharIdx representation. Columns are indexed by all unique characters and rows are indexed by all unique t-length strings in the input file, both in sorted order. A value belongs to a cell (colID, rowID) if the corresponding suffix has colID as its first character and rowID as the following t characters. For instance, NextCharIdx[3]=5 and NextCharIdx[4]=6 are contained in cell (a, na), since both start with "a" and have "na" as the following two characters.

A strawman algorithm for search would be to perform two binary searches as in Figure 3. However, this algorithm suffers from two inefficiencies. First, it executes the binary searches along the entire AoS2Input array. Second, each step of the binary search requires computing the suffix at the corresponding AoS index for comparison purposes. Succinct uses a query algorithm that overcomes these inefficiencies by aggressively exploiting the two-dimensional NextCharIdx representation.

Recall that the cell (colID, rowID) in the two-dimensional NextCharIdx representation corresponds to suffixes that have colID as the first character and rowID as the following t characters. Succinct uses this to perform binary search in cells rather than the entire AoS2Input array. For instance, consider the query search("anan"); all occurrences of string "nan" are contained in the cell 〈n,an〉. To find all occurrences of string anan, our algorithm performs a binary search only in the cell 〈a,na〉 in the next step. Intuitively, after this step, the algorithm has the indexes for which suffixes start with "a" and are followed by "nan", the desired string. For a string of length m, the above algorithm performs 2(m − t − 1) binary searches, two per NextCharIdx cell [16], which is far more efficient than executing two binary searches along the entire AoS2Input array for practical values of m. In addition, the algorithm does not require computing any of the AoS suffixes during the binary searches. For a 16GB file, Succinct's query algorithm achieves a 2.3× speed-up on average and a 19× speed-up in the best case compared to the strawman algorithm.

Range and Wildcard Queries. Succinct implements rangesearch and wildcardsearch using the search algorithm. To implement rangesearch(f, str1, str2), we find the smallest AoS index whose suffix starts with string str1 and the largest AoS index whose suffix starts with string str2. Since suffixes are sorted, the returned range of indices necessarily contains all strings that are lexicographically contained between str1 and str2. To implement wildcardsearch(f, prefix, suffix, dist), we first find the offsets of all prefix and suffix occurrences, and return all possible combinations such that the difference between the suffix and prefix offsets is positive and no larger than dist (after accounting for the prefix length).
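Below is a toy rangesearch in the spirit of the description above, again over the uncompressed AoS/AoS2Input arrays for banana$ rather than over Succinct's compressed cells: it binary-searches for the smallest AoS index whose suffix starts with str1 and the largest whose suffix starts with str2, and returns the offsets in between. As before, \xff is assumed to be larger than any input character.

    import bisect

    text = "banana$"
    aos2input = sorted(range(len(text)), key=lambda i: text[i:])
    aos = [text[i:] for i in aos2input]

    def rangesearch(str1, str2):
        lo = bisect.bisect_left(aos, str1)               # smallest AoS index with suffix >= str1
        hi = bisect.bisect_right(aos, str2 + "\xff")     # one past the largest suffix starting with str2
        return sorted(aos2input[lo:hi])                  # offsets of suffixes lexicographically in range

    print(rangesearch("an", "na"))   # [0, 1, 2, 3, 4]: every suffix between "an..." and "na..."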
4 Succinct Multi-store Design

Succinct incorporates its core techniques into a write-friendly multi-store design that chains multiple individual stores, each making a different tradeoff between write, query and memory efficiency. This section describes the design and implementation of the individual stores and their synthesis to build Succinct.

Figure 9: Succinct uses a write-optimized LogStore that supports fine-grained appends, a query-optimized SuffixStore that supports bulk appends, and a memory-optimized SuccinctStore. New data is appended to the end of LogStore. The entire data in LogStore and SuffixStore constitutes a single partition of SuccinctStore. The properties of each of the stores are summarized in Table 1.

Table 1: Properties of individual stores. Data size estimated for 1TB original uncompressed data on a 10 machine 64GB RAM cluster. Memory estimated based on evaluation (§6).

                 SuccinctStore       SuffixStore         LogStore
  Stores         Comp. Data (§3.1)   Data + AoS2Input    Data + Inv. Index
  Appends        -                   Bulk                Fine
  Queries        §3.2                Index               Scans + Inv. Index
  #Machines      n−2                 1                   1
  %Data (est.)   > 99.98%            < 0.016%            < 0.001%
  Memory         ≈ 0.4×              ≈ 5×                ≈ 9×

Succinct design overview. Succinct chains three individual stores as shown in Figure 9; Table 1 summarizes the properties of the individual stores. New data is appended into a write-optimized LogStore that executes queries via in-memory data scans; the queries are further sped up using an inverted index that supports fast fine-grained updates. An intermediate store, SuffixStore, supports bulk appends and aggregates larger amounts of data before compression is initiated. Scans at this scale are simply inefficient. SuffixStore thus supports fast queries using uncompressed data structures from §3; techniques in place ensure that these data structures do not need to be updated upon bulk appends. SuffixStore raw data is periodically transformed into an immutable entropy-compressed store, SuccinctStore, that supports queries directly on the compressed representation. The average memory footprint of Succinct remains low since most of the data is contained in the memory-optimized SuccinctStore.

4.1 LogStore

LogStore is a write-optimized store that executes data append via main memory writes, and other queries via data scans. Memory efficiency is not a goal for LogStore since it contains a small fraction of the entire dataset.

One choice for LogStore design is to let cores concurrently execute read and write requests on a single shared partition and exploit parallelism by assigning each query to one of the cores. However, concurrent writes scale poorly and require complex techniques for data structure integrity [39,41,42]. Succinct uses an alternative design, partitioning LogStore data into multiple partitions, each containing a small amount of data. However, straightforward partitioning may lead to incorrect results if the query searches for a string that spans two partitions. LogStore thus uses overlapping partitions, each annotated with the starting and the ending offset corresponding to the data "owned" by the partition. The overlap size can be configured to the expected string search length (default is 1MB). New data is always appended to the most recent partition.

LogStore executes an extract request by reading the data starting at the offset specified in the request. While this is fast, executing search via data scans can still be slow, requiring tens of milliseconds even for 250MB partition sizes. Succinct avoids scanning the entire partition using an "inverted index" per partition that supports fast updates. This index maps short length (default is three character) strings to their locations in the partition; queries then need to scan characters starting only at these locations. The index is memory inefficient, requiring roughly 8× the size of LogStore data, but has little effect on Succinct's average memory since LogStore itself contains a small fraction of the entire data. The speed-up is significant, allowing Succinct to scan, in practice, up to 1GB of data within a millisecond. The index supports fast updates since, upon each write, only locations of short strings in the new data need to be appended to corresponding entries in the index.
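A minimal sketch of such a per-partition inverted index, assuming three-character index entries; the class and method names are illustrative rather than the prototype's. On each append, only the short strings introduced by the new data are added, and a search scans forward only from the candidate offsets.

    from collections import defaultdict

    class LogStorePartition:
        def __init__(self, ngram=3):
            self.ngram = ngram
            self.data = ""
            self.index = defaultdict(list)               # short string -> offsets in this partition

        def append(self, buf):
            start = len(self.data)
            self.data += buf
            # Index only the positions whose ngram-long window touches the new data.
            for i in range(max(0, start - self.ngram + 1), len(self.data) - self.ngram + 1):
                self.index[self.data[i:i + self.ngram]].append(i)

        def search(self, s):
            if len(s) >= self.ngram:
                candidates = self.index.get(s[:self.ngram], [])
            else:
                candidates = range(len(self.data))       # fall back to a scan for very short strings
            return [i for i in candidates if self.data.startswith(s, i)]

    p = LogStorePartition()
    p.append("GET /index.html\n")
    p.append("GET /about.html\n")
    print(p.search("GET"))    # [0, 16]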
4.2 SuffixStore

SuffixStore is an intermediate store between LogStore and the entropy-compressed SuccinctStore that serves two goals. First, to achieve good compression, SuffixStore accumulates and queries much more data than LogStore before initiating compression. Second, to ensure that LogStore size remains small, SuffixStore supports bulk data appends without updating any existing data.

Unfortunately, the LogStore approach of fast data scans with the support of an inverted index does not scale to the data sizes in SuffixStore due to high memory footprint and data scan latency. SuffixStore thus stores the uncompressed AoS2Input array (§3) and executes search queries via binary search (Figure 3). SuffixStore avoids storing AoS by storing the original data, which allows random access for comparison during binary search, as well as for extract queries; these queries are fast since AoS2Input is uncompressed. SuffixStore achieves the second goal using excessive partitioning, with overlapping partitions similar to LogStore. Bulk appends from LogStore are executed at partition granularity, with the entire LogStore data constituting a single partition of SuffixStore. AoS2Input is constructed per partition to ensure that bulk appends do not require updating any existing data.

4.3 SuccinctStore

SuccinctStore is an immutable store that contains most of the data, and is thus designed for memory efficiency. SuccinctStore uses the entropy-compressed representation from §3.1 and executes queries directly on the compressed representation as described in §3.2. SuccinctStore's design had to resolve two additional challenges.

First, Succinct's memory footprint and query latency depend on multiple tunable parameters (e.g., AoS2Input and Input2AoS sampling rate and string lengths for indexing NextCharIdx rows). While default parameters in SuccinctStore are chosen to operate on a sweet spot between memory and latency, Succinct will lose its advantages if input data is too large to fit in memory even after compression using default parameters. Second, LogStore being extremely small and SuffixStore being latency-optimized makes SuccinctStore a latency bottleneck. Hence, Succinct performance may deteriorate for workloads that are skewed towards particular SuccinctStore partitions.

Succinct resolves both these challenges by enabling applications to tradeoff memory for query latency. Specifically, Succinct enables applications to select the AoS2Input and Input2AoS sampling rate; by storing fewer sampled values, a lower memory footprint can be achieved at the cost of higher latency (and vice versa). This resolves the first challenge above by reducing the memory footprint of Succinct to avoid answering queries off-disk⁶. This also helps resolve the second challenge by increasing the memory footprint of overloaded partitions, thus disproportionately speeding up these partitions for skewed workloads.

We discuss data transformation from LogStore to SuffixStore and from SuffixStore to SuccinctStore in §5.

⁶Empirically, Succinct can achieve a memory footprint comparable to GZip. When even the GZip-compressed data does not fit in memory, the only option for any system is to answer queries off disk.

5 Succinct Implementation

We have implemented three Succinct prototypes along with extensions for semi-structured data (§2.1) — in Java running atop Tachyon [37], in Scala running atop Spark [51], and in C++. We discuss implementation details of the C++ prototype that uses roughly 5,200 lines of code. The high-level architecture of our Succinct prototype is shown in Figure 10. The system consists of a central coordinator and a set of storage servers, one server each for LogStore and SuffixStore, and the remaining servers for SuccinctStore. All servers have a similar architecture modulo the differences in the storage format and query execution, as described in §3.

The coordinator performs two tasks. The first task is membership management, which includes maintaining a list of active servers in the system by having each server send periodic heartbeats. The second task is data management, which includes maintaining an up-to-date collection of pointers to quickly locate the desired data during query execution. Specifically, the coordinator maintains two sets of pointers: one that maps file offsets to partitions that contain the data corresponding to the offsets, and the other one that maps partitions to machines that store those partitions. As discussed in §2.1, an additional set of key → offset pointers is also maintained for supporting queries on semi-structured data.

Clients connect to one of the servers via a light-weight Query Handler (QH) interface; the same interface is also used by the server to connect to the coordinator and to other servers in the system. Upon receiving a query from a client, the QH parses the query and identifies whether the query needs to be forwarded to a single server (for extract and append queries) or to all the other servers (for count and search queries).

In the case of an extract or append query, the QH needs to identify the server to which the query needs to be forwarded. One way to do this is to forward the query to the coordinator, which can then look up its sets of pointers and forward the query to the appropriate server. However, this leads to the coordinator becoming a bottleneck. To avoid this, the pointers are cached at each server. Since the number of pointers scales only in the number of partitions and servers, this has minimal impact on Succinct's memory footprint. The coordinator ensures that pointer updates are immediately pushed to each of the servers. Using these pointers, an extract query is redirected to the QH of the appropriate machine, which then locates the appropriate partition and extracts the desired data.
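As a small illustration of these pointer lookups (with made-up partition sizes and server names), an offset → partition map plus a partition → machine map is all a Query Handler needs in order to route an extract request to a single server:

    import bisect

    # Partition i covers file offsets [partition_starts[i], partition_starts[i+1]); hypothetical layout.
    partition_starts = [0, 1 << 30, 2 << 30, 3 << 30]
    partition_to_machine = {0: "server-1", 1: "server-2", 2: "server-3", 3: "server-3"}

    def route_extract(offset):
        pid = bisect.bisect_right(partition_starts, offset) - 1   # offset -> partition
        return pid, partition_to_machine[pid]                     # partition -> machine

    print(route_extract(5 * (1 << 29)))   # offset at 2.5 GiB -> (2, 'server-3')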

Figure 10: Succinct system architecture. Server and coordinator functionalities are described in §5. Each server uses a light-weight Query Handler interface to (1) interact with the coordinator; (2) redirect queries to appropriate partitions and/or servers; and (3) perform local and global result aggregation. P→M, O→P and K→O are the same pointers as stored at the coordinator (Partition→Machine, Offset→Partition, and Key→Offset, the last for the NoSQL interface only).

In the case of a search query, the QH that receives the query from the client forwards the query to all the other QHs in the system. In turn, each QH runs multiple tasks to search all local partitions in parallel, then aggregates the results, and sends these results back to the initiator, that is, to the QH that initiated the query (see Figure 10). Finally, the initiator returns the aggregated result to the client. While redirecting queries using QHs reduces the coordinator load, QHs connecting to all other QHs may raise some scalability concerns. However, as discussed earlier, due to its efficient use of memory, Succinct requires many fewer servers than other in-memory data stores, which helps scalability.

Data transformation between stores. LogStore aggregates data across multiple partitions before transforming it into a single SuffixStore partition. LogStore is neither memory nor latency constrained; we expect each LogStore partition to be smaller than 250MB even for clusters of machines with 128GB RAM. Thus, AoS2Input for LogStore data can be constructed at the LogStore server itself, using an efficient linear-time, linear-memory algorithm [50]. Transforming SuffixStore data into a SuccinctStore partition requires a merge sort of AoS2Input for each of the SuffixStore partitions, scanning the merged array once to construct Input2AoS and NextCharIdx, sampling AoS2Input and Input2AoS, and finally compressing each row of NextCharIdx. Succinct could use a single over-provisioned server for SuffixStore to perform this transformation at the SuffixStore server itself, but currently does this in the background.

Failure tolerance and recovery. The current Succinct prototype requires manually handling: (1) coordinator failure; (2) data failure and recovery; and (3) adding new servers to an existing cluster. Succinct could use traditional solutions for maintaining multiple coordinator replicas with a consistent view. Data failure and recovery can be achieved using standard replication-based techniques. Finally, since each SuccinctStore contains multiple partitions, adding a new server simply requires moving some partitions from existing servers to the new server and updating pointers at servers. We leave incorporation of these techniques and evaluation of associated overheads to future work.

6 Evaluation

We now perform an end-to-end evaluation of Succinct's memory footprint (§6.1), throughput (§6.2) and latency (§6.3).

Compared systems. We evaluate Succinct using the NoSQL interface extension (§2.1), since it requires strictly more space and operations than the unstructured file interface. We compare Succinct against several open-source and industrial systems that support search queries: MongoDB [6] and Cassandra [35] using secondary indexes; HyperDex [25] using hyperspace hashing; and an industrial columnar-store DB-X, using in-memory data scans⁷.

We configured each of the systems for the no-failure scenario. For HyperDex, we use the dimensionality as recommended in [25]. For MongoDB and Cassandra, we used the most memory-efficient indexes. These indexes do not support substring searches and wildcard searches. HyperDex and DB-X do not support wildcard searches. Thus, the evaluated systems provide slightly weaker functionality than Succinct. Finally, for Succinct, we disabled dictionary encoding to evaluate the performance of Succinct techniques in isolation.

⁷For HyperDex, we encountered a previously known bug [4] that crashes the system during query execution when inter-machine latencies are highly variable.
For DB-X, distributed experiments require access to the industrial version. To that end, we only perform micro-benchmarks for HyperDex and DB-X for Workloads A and C.

For datasets dexes) that each system fits across a distributed clus- with fewer number of attributes (SmallVal dataset), ter with 150GB main memory. Succinct supports in- MongoDB achieves high throughput due to caching be- memory queries on data sizes larger than the system ing more effective; for LargeVal dataset, MongoDB RAM; note that Succinct results do not use dictionary search throughput reduces significantly even when the encoding and also include pointers required for NoSQL entire index fits in memory. When MongoDB indexes do interface extensions (§2.1,§5). MongoDB and Cassan- not fit in memory, Succinct achieves 13–134× higher dra fit roughly 10–11× less data than Succinct due to throughput since queries are executed in-memory. storing secondary indexes along with the input data. As earlier, even with 10× increase in data size (for HyperDex not only stores large but also avoids both smallVal and LargeVal), Succinct throughput touching multiple machines by storing a copy of the en- reduces minimally. As a result, Succinct’s performance tire record with each subspace, thus fitting up to 126× for large datasets is comparable to the performance of less data than Succinct. MongoDB and Cassandra for much smaller datasets.

USENIX Association 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI ’15) 345 Table 2: (left) Datasets used in our evaluation; (right) Workloads used in our evaluation. All workloads use a query popularity that follows a Zipf distribution with skewness 0.99, similartoYCSB[20]. Size (Bytes) #Attr- #Records Workload Remarks Key Value ibutes (Millions) A 100% Reads YCSB workload C smallVal 8 ≈ 140 15 123–1393 B 95% Reads, 5% appends YCSB workload D LargeVal 8 ≈ 1300 98 19–200 C 100% Search - D 95% Search, 5% appends YCSB workload E

[Figure 12 bar charts: throughput in ops/second for Workloads A–D, for the SmallVal dataset at 17 GB (RAM), 62.5 GB (RAM+SSD), 62.5 GB (RAM+Disk) and 192 GB (RAM+Disk), and for the LargeVal dataset at 23 GB (RAM), 62.5 GB (RAM+SSD), 62.5 GB (RAM+Disk) and 242 GB (RAM+Disk), comparing MongoDB, Cassandra and Succinct.]
Figure 12: Succinct throughput against MongoDB and Cassandra for varying datasets, data sizes and workloads. MongoDB and Cassandra fit 17GB of the SmallVal dataset and 23GB of the LargeVal dataset in memory; Succinct fits 192GB and 242GB, respectively. DNF denotes that the experiment did not finish after 100 hours of data loading, mostly due to index construction time. Note that the top four figures have different y-scales.


Figure 13: Succinct's latency for get (left), put (center) and search (right) against MongoDB and Cassandra for the smallVal dataset when data and index fit in memory (best case for MongoDB and Cassandra). Discussion in §6.3.


Figure 14: Succinct's latency for get (left) and search (right) against HyperDex and DB-X for the smallVal 10GB dataset on a single machine. HyperDex uses subspace hashing and DB-X uses in-memory data scans for search. Discussion in §6.3.

Workload D. The search throughput for MongoDB and Cassandra becomes even worse as we introduce 5% appends, precisely due to the fact that indexes need to be updated upon each append. Unlike Workload B, Succinct search throughput does not reduce with appends, since writes are no more a bottleneck. As earlier, Succinct's throughput scales well with data size.

Note that the above discussion holds even when MongoDB and Cassandra use SSDs to store the data that does not fit in memory. When such is the case, the throughput reduction is lower compared to the case when data is stored on disk; nevertheless, the trends remain unchanged. Specifically, Succinct is able to achieve better or comparable performance than SSD-based systems for a much larger range of input values.

6.3 Latency

We now compare Succinct's latency against two sets of systems: (1) systems that use indexes to support queries (MongoDB and Cassandra) on a distributed 10 machine Amazon EC2 cluster; and (2) systems that perform data scans along with metadata to support queries (HyperDex and DB-X) using a single-machine system. To maintain consistency across all latency experiments, we only evaluate cases where all systems (except for HyperDex) fit the entire data in memory.

Succinct against Indexes. Figure 13 shows that Succinct achieves comparable or better latency than MongoDB and Cassandra even when all data fits in memory. Indeed, Succinct's latency will get worse if record sizes are larger. For writes, we note that both MongoDB and Cassandra need to update indexes upon each write, leading to higher latency. For search, MongoDB achieves good latency since MongoDB performs a binary search over an in-memory index, which is similar in complexity to Succinct's search algorithm. Cassandra requires high latencies for search queries due to much less efficient utilization of available memory.

Succinct against data scans. Succinct's latency against systems that do not store indexes is compared in Figure 14. HyperDex achieves comparable latency for get queries; search latencies are higher since, due to its high memory footprint, HyperDex is forced to answer most queries off-disk. DB-X, being a columnar store, is not optimized for get queries, thus leading to high latencies. For search queries, DB-X, despite optimized in-memory data scans, is around 10× slower at high percentiles because data scans are inherently slow.

11 USENIX Association 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI ’15) 347 4.5 1200 Queries using data scans. Point queries can also be 1100 4 1000 supported using data scans. These are memory efficient 3.5 900 but suffer from low latency and throughput for large 800 3 700 data sizes. Most related to Succinct is this space are 2.5 600 columnar stores [10,15,22,36,49].Themostadvanced 500 Latency (ms) 2 Latency (ms) 400 of these [10] execute queries either by scanning data 300 1.5 200 or by decompressing the data on the fly (if data com- 1 100 pressed [14]). As shown in §6,Succinctachievesbet- 10000 20000 30000 40000 50000 50 60 70 80 90 100 110 120 130 140 150 Throughput (Ops/second) Throughput (Ops/second) ter latency and throughput by avoiding expensive data scans and decompression. Figure 15: Throughput versus latency for Succinct, for get (left) and for search (right). Theory techniques. Compressed indexes has been an active area of research in theoretical computer science for a fully loaded 10 machine cluster with smallVal since late 90s [27–30, 32, 44–46].Succinctadaptsdata 192GB dataset. The plot shows that Succinct latency structures from above works, but improves both the and throughput results above are for the case of a fully memory and the latency by using new techniques (§3). loaded system. Succinct further resolves several challenges to realize 6.5 Sequential Throughput these techniques into a practical data store: (1) effi- Our evaluation results for workload A and B used ciently handling updates using a multi-store design; (2) records of sizes at most 1300bytes per query. We now achieving better scalability by carefully exploiting par- discuss Succinct’s performance in terms of through- allelism within and across machines; and (3) enabling put for long sequential reads. We ran a simple micro- queries on semi-structured data by encoding the struc- benchmark to evaluate the performance of Succinct ture within a flat file. over a single extract request for varying sizes of reads. Succinct achieves a constant throughput of 13Mbps us- 8 Conclusion ing a single core single thread implementation, irre- In this paper, we have presented Succinct, a distributed spective of the read size; the throughput increases lin- data store that supports a wide range of queries while early with number of threads and/or cores. This is es- operating at a new point in the design space between sentially a tradeoff that Succinct makes for achieving data scans (memory-efficient, but high latency and high throughput for short reads and for search queries low throughput) and indexes (memory-inefficient, low using a small memory footprint. For applications that latency, high throughput). Succinct achieves memory require large number of sequential reads, Succinct can footprint close to that of data scans by storing the in- overcome this limitation by keeping the original uncom- put data in an entropy-compressed representation that pressed data to support sequential reads, of course at supports random access, as well as a wide range of the cost of halving the amount of data that Succinct analytical queries. When indexes fit in memory, Suc- pushes into main memory. The results from Figure 11 cinct achieves comparable latency, but lower through- show that Succinct will still push 5-5.5× more data than put. However, due to its low memory footprint, Suc- popular open-source systems with similar functionality. 
cinct is able to store more data in memory, avoiding 7RelatedWork latency and throughput reduction due to off-disk or off- SSD query execution for a much larger range of input Succinct’s goals are related to three key research areas: sizes than systems that use indexes. Queries using secondary indexes. To support point queries, many existing data stores store in- Acknowledgments dexes/metadata [3,6,25,35] in addition to the original This research is supported in part by NSF CISE Expedi- data. While indexes achieve low latency and high tions Award CCF-1139158, LBNL Award 7076018, and DARPA throughput when they fit in memory, their performance XData Award FA8750-12-2-0331, and gifts from Amazon Web deteriorates significantly when queries are executed Services, , SAP,The Thomas and Stacey Siebel Founda- off-disk. Succinct requires more than 10× lower mem- tion, Adatao, Adobe, Apple, Inc., Blue Goji, Bosch, C3Energy, ory than systems that store indexes, thus achieving Cisco, Cray, Cloudera, EMC, Ericsson, , Guavus, higher throughput and lower latency for a much larger Huawei, Informatica, Intel, Microsoft, NetApp, Pivotal, Sam- range of input sizes than systems that store indexes. sung, Splunk, Virdata and VMware.

12 348 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI ’15) USENIX Association References [19] F.Chang, J. Dean, S. Ghemawat, W. C.Hsieh,D. A. Wallach, M. Burrows, T. Chandra, A. Fikes, and [1] CouchDB. http://couchdb.apache.org. R. E. Gruber. Bigtable: A Distributed Storage Sys- tem for Structured Data. In USENIX Symposium [2] Delta Encoding. http://en.wikipedia.org/ on Operating Systems Design and Implementation wiki/Delta_encoding. (OSDI),2006. [3] Elasticsearch. http://www.elasticsearch. [20] B. F. Cooper, A. Silberstein, E. Tam, R. Ramakrish- org. nan, and R. Sears. Benchmarking Cloud Serving [4] Hyperdex Bug. https : //groups.google. Systems with YCSB. In ACM Symposium on Cloud com/forum / #!msg/hyperdex-discuss/ Computing (SoCC),2010. PUIpjMPEiAI/I3ZImpU7OtkJ. [21] J. C. Corbett, J. Dean, M. Epstein, A. Fikes, [5] MemCached. http://www.memcached.org. C. Frost, J. Furman, S. Ghemawat, A. Gubarev, C. Heiser, P.Hochschild, et al. Spanner: Google’s [6] MongoDB. http://www.mongodb.org. Globally-distributed . In USENIX Sympo- sium on Operating Systems Design and Implemen- [7] Pizza&Chili Corpus: Compressed Indexes and tation (OSDI),2012. their Testbeds. http : //pizzachili.dcc. uchile.cl/indexes/Compressed_Suffix_ [22] Daniel J. Abadi and Samuel R. Madden and Nabil Array/. Hachem. Column-Stores vs. Row-Stores: How Dif- ferent Are They Really? In ACM International Con- [8] Presto. http://prestodb.io. ference on Management of Data (SIGMOD),2008. [9] . http://www.redis.io. [23] G. DeCandia, D. Hastorun, M. Jampani, G. Kakula- [10] SAP HANA. http://www.saphana.com/. pati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall, and W. Vogels. Dynamo: Amazon’s [11] SDSL. https : //github.com/simongog/ Highly Available Key-value Store. In ACM Sym- sdsl-lite. posium on Operating Systems Principles (SOSP), 2007. [12] Suffix Array. http://en.wikipedia.org/wiki/ Suffix_array. [24] A. Dragojevi´c, D. Narayanan, O. Hodson, and M. Castro. FaRM: Fast Remote Memory. In USENIX [13] Suffix Tree. http://en.wikipedia.org/wiki/ Symposium on Networked Systems Design and Im- Suffix_tree. plementation (NSDI),2014. [14] Vertica Does Not Compute on Compressed Data. [25] R. Escriva, B. Wong, and E. G. Sirer. HyperDex: A http://tinyurl.com/l36w8xs. Distributed, Searchable Key-value Store. In ACM [15] D. J. Abadi, S. R. Madden, and M. Ferreira. In- Conference on Applications, Technologies, Architec- tegrating Compression and Execution in Column- tures, and Protocols for Computer Communication Oriented Database Systems. In ACM International (SIGCOMM),2012. Conference on Management of Data (SIGMOD), [26] B. Fan, D. G. Andersen, and M. Kaminsky. 2006. MemC3: Compact and Concurrent MemCache [16] R. Agarwal, A. Khandelwal, and I. Stoica. Suc- with Dumber Caching and Smarter Hashing. In cinct: Enabling Queries on Compressed Data. In USENIX Symposium on Networked Systems Design Technical Report, UC Berkeley,2014. and Implementation (NSDI),2013.

[17] B. Atikoglu, Y. Xu, E. Frachtenberg, S. Jiang, and [27] P. Ferragina and G. Manzini. Opportunistic Data M. Paleczny. Workload Analysis of a Large-scale Structures with Applications. In IEEE Symposium Key-value Store. In ACM SIGMETRICS Performance on Foundations of Computer Science (FOCS),2000. Evaluation Review,volume40,pages53–64,2012. [28] P. Ferragina and G. Manzini. An Experimental [18] M. Burrows and D. J. Wheeler. A -sorting Study of a Compressed Index. Information Sci- lossless algorithm. 1994. ences,135(1):13–28,2001.


USENIX Association 12th USENIX Symposium on Networked Systems Design and Implementation (NSDI ’15) 349 [29] P. Ferragina and G. Manzini. An Experimental [41] Y. Mao, E. Kohler, and R. T. Morris. Crafti- Study of an Opportunistic Index. In ACM-SIAM ness for Fast Multicore Key-value Storage. In ACM Symposium on Discrete Algorithms (SODA),2001. European Conference on Computer Systems (Eu- roSys),2012. [30] P. Ferragina and G.Manzini. Indexing Compressed Text. Journal of the ACM (JACM),52(4):552–581, [42] M. M. Michael. High Performance Dynamic Lock- 2005. free Hash Tables and List-based Sets. In ACM Sym- posium on Parallel Algorithms and Architectures [31] R. Grossi, A. Gupta, and J. S. Vitter. High-order (SPAA),2002. Entropy-compressed Text Indexes. In ACM-SIAM Symposium on Discrete Algorithms (SODA),2003. [43] J. Ousterhout, P. Agrawal, D. Erickson, C. Kozyrakis, J. Leverich, D. Mazières, S. Mi- [32] R. Grossi and J. S. Vitter. Compressed Suffix Ar- tra, A. Narayanan, G. Parulkar, M. Rosenblum, rays and Suffix Trees with Applications to Text et al. The Case for RAMClouds: Scalable High- Indexing and String Matching. SIAM Journal on performance Storage Entirely in DRAM. ACM Computing,35(2):378–407,2005. SIGOPS Operating Systems Review,43(4):92–105, [33] W. - K . H o n , T. W. L a m , W. - K . S u n g , W. - L . Ts e , C . - K . 2010. Wong, and S.-M. Yiu. Practical aspects of Com- [44] K. Sadakane. Compressed Text with pressed Suffix Arrays and FM-Index in Search- Efficient Query Algorithms Based on the Com- ing DNA Sequences. In Workshop on Algorithm pressed Suffix Array. In International Conference Engineering and Experiments and the First Work- on Algorithms and Computation (ISAAC).2000. shop on Analytic Algorithmics and Combinatorics (ALENEX/ANALC),2004. [45] K. Sadakane. Succinct Representations of Lcp In- formation and Improvements in the Compressed [ ] 34 S. Kurtz. Reducing the Space Requirement of Suffix Arrays. In ACM-SIAM Symposium on Dis- Suffix Trees. Software: Practice and Experience, crete Algorithms (SODA),2002. 29(13):1149–1171, 1999. [46] K. Sadakane. New Text Indexing Functionalities [ ] 35 A. Lakshman and P. Malik. Cassandra: A Decen- of the Compressed Suffix Arrays. Journal of Algo- tralized Structured Storage System. ACM SIGOPS rithms,48(2):294–313,2003. Operating Systems Review,44(2):35–40,2010. [47] K. Sadakane. Compressed Suffix Trees with [ ] 36 A. Lamb, M. Fuller, R. Varadarajan, N. Tran, Full Functionality. Theory of Computing Systems, B. Vandiver, L. Doshi, and C. Bear. The Vertica An- 41(4):589–607, 2007. alytic Database: C-store 7 Years Later. Proceedings of the VLDB Endowment,5(12):1790–1801,2012. [48] S. Sivasubramanian. Amazon dynamoDB: A Seamlessly Scalable Non-relational Database Ser- [37] H. Li, A. Ghodsi, M. Zaharia, S. Shenker, and vice. In ACM International Conference on Manage- I. Stoica. Tachyon: Reliable, Memory Speed Stor- ment of Data (SIGMOD),2012. age for Cluster Computing Frameworks. In ACM Symposium on (SoCC),2014. [49] M. Stonebraker, D. J. Abadi, A. Batkin, X. Chen, M. Cherniack, M. Ferreira, E. Lau, A. Lin, S. R. [38] H. Lim, B. Fan, D. G. Andersen, and M. Kamin- Madden, E. J. O’Neil, P. E. O’Neil, A. Rasin, sky. SILT: A Memory-Efficient, High-Performance N. Tran, and S. B. Zdonik. C-Store: A Column- Key-Value Store. In ACM Symposium on Operating Oriented DBMS. In International Conference on Systems Principles (SOSP),2011. Very Large Data Bases (VLDB),2005. [39] H. Lim, D. Han, D. G. Andersen, and M. Kamin- [50] E. Ukkonen. 
On-Line Construction of Suffix Trees. sky. MICA: A Holistic Approach to Fast In- Algorithmica,14:249–260,1995. memory Key-value Storage. In USENIX Symposium on Networked Systems Design and Implementation [51] M. Zaharia, M. Chowdhury, M. J. Franklin, (NSDI),2014. S. Shenker, and I. Stoica. Spark: Cluster Comput- ing with Working Sets. In USENIX Workshop on [40] U. Manber and G. Myers. Suffix Arrays: A New Hot Topics in Cloud Computing (HotCloud),2010. Method for On-line String Searches. In ACM-SIAM Symposium on Discrete Algorithms (SODA),1993. 14
