VLDB2014 40th International Conference on Very Large Data Bases

Proceedings of the VLDB Endowment

Volume 7, No. 11 – July 2014 Proceedings of the 40th International Conference on Very Large Data Bases, Hangzhou, China

Program Chairs and Editors-in-Chief: H. V. Jagadish, Aoying Zhou

Associate Editors – Research and Innovative Systems Tracks: Shivnath Babu, Lei Chen, Graham Cormode, Bin Cui, Wynne Hsu, Martin Kersten, Donald Kossmann, Elke Rundensteiner, Kyuseok Shim, Wang-Chiew Tan, Letizia Tanca, Jeffrey Yu

Associate Editors – Experiments and Analysis Track: Gao Cong, Jens Dittrich

Associate Editors – Vision Track: Zachary Ives

Proceedings Chairs: Li Xiong, Cong Yu

PVLDB – Proceedings of the VLDB Endowment Volume 7, No. 11, July 2014. The 40th International Conference on Very Large Data Bases, Hangzhou, China.

Copyright 2014 VLDB Endowment

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/3.0/. Obtain permission prior to any use beyond those covered by the license. Contact copyright holder by emailing [email protected].

Volume 7, Number 11, July 2014: VLDB 2014 Pages ii - ix and 931 - 1022 ISSN 2150-8097

Additional copies only online at: portal.acm.org, arxiv.org/corr, and www.vldb.org

PVLDB Vol. 7 No. 11 ii VLDB2014 – Hangzhou, China

TABLE OF CONTENTS

Front Matter
Copyright Notice ...... ii
Table of Contents ...... iii
VLDB 2014 Organization and Review Board ...... iv

Letters

First VLDB in Mainland China ...... ix ...... Lidan Shou

Research Papers

Trekking Through Siberia: Managing Cold Data in a Memory-Optimized Database ...... 931 ...... Ahmed Eldawy, Justin Levandoski, Per-Åke Larson

The Case for Personal Data-Driven Decision Making ...... 943 ...... Jennie Duggan

ConfluxDB: Multi-Master Replication for Partitioned Snapshot Isolation Databases ...... 947 ...... Prima Chairunnanda, Khuzaima Daudjee, M. Tamer Ozsu

γ-DB: Managing scientific hypotheses as uncertain data ...... 959 ...... Bernardo Goncalves, Fabio Porto

Ibex — An Intelligent Storage Engine with Support for Advanced SQL Off-loading ...... 963 ...... Louis Woods, Zsolt Istvan, Gustavo Alonso

NOMAD: Nonlocking, stOchastic Multi-machine algorithm for Asynchronous and Decentralized matrix completion ...... 975 ...... Hyokun Yun, Hsiang-Fu Yu, Cho-Jui Hsieh, S V N Vishwanathan, Inderjit Dhillon

Repairing Vertex Labels under Neighborhood Constraints ...... 987 ...... Shaoxu Song, Hong Cheng, Jeffrey Xu Yu, Lei Chen

Progressive Approach to Relational Entity Resolution ...... 999 ...... Yasser Altowim, Dmitri V. Kalashnikov, Sharad Mehrotra

Concurrent Analytical Query Processing with GPUs ...... 1011 ...... Kaibo Wang, Kai Zhang, Yuan Yuan, Siyuan Ma, Rubao Lee, Xiaoning Ding, Xiaodong Zhang

VLDB 2014 ORGANIZATION AND REVIEW BOARD

Honorary Chair
Yunhe Pan, Chinese Academy of Engineering

General Chairs
Chun Chen, University
Sharad Mehrotra, University of California, Irvine

Program Chairs and Editors-in-Chief of PVLDB 7
H. V. Jagadish, University of Michigan
Aoying Zhou, East China Normal University, China

Research and Innovative Systems Tracks Associate Editors
Shivnath Babu, Duke University
Lei Chen, Hong Kong University of Science and Technology
Graham Cormode, University of Warwick
Bin Cui, Peking University, China
Wynne Hsu, NUS
Martin Kersten, CWI
Donald Kossmann, ETH
Elke Rundensteiner, WPI
Kyuseok Shim, Seoul National University
Wang-Chiew Tan, University of California, Santa Cruz
Letizia Tanca, Poli Milano
Jeffrey Yu, Chinese University of Hong Kong

Experiments and Analysis Track Associate Editors
Gao Cong, Nanyang Technological University
Jens Dittrich, Saarland

Visions Track Associate Editor
Zachary Ives, University of Pennsylvania

Industrial and Applications Track Associate Editors
Umeshwar Dayal, HP
C. Mohan, IBM
Ge Yu, Northeastern University, China

Demonstration Chairs Workshop Chairs Mong-Li Lee, NUS Anastasia Ailamaki, EPFL Feifei Li, University of Utah Kaushik Chakrabarti, Microsoft Sunil Prabhakar, Purdue Panel Chairs Tutorial Chairs Hakan Hacigumus, NEC Labs Xiaoyong Du, Renmin University Jignesh Patel, University of Wisconsin Murat Kantarcioglu, University of Texas, Dallas Xiaoyang Sean Wang, Fudan University Divesh Srivastava, AT&T Labs

Research Track Review Board
Sibel Adali, Rensselaer Polytechnic Institute
Foto Afrati, NTU Athens
Yanif Ahmad, JHU
Jose Luis Ambite, ISI - USC
Walid Aref, Purdue University
Claudia Bauzer Medeiros, University of Campinas
Srikanta Bedathur, IIIT Delhi
Michael Benedikt, Oxford University
Sonia Bergamaschi, Universita Modena
Laure Berti-Equille, IRD, France
Leopoldo Bertossi, Carleton University, Ottawa
Subhash Bhalla, University of Aizu, Japan
Peter Boncz, CWI
Angela Bonifati, University of Lille 1
Rajesh Bordawekar, IBM Watson Research Center
Omar Boucelma, Aix-Marseille University
Nico Bruno, Microsoft Research
Andrea Cali, University of London, Birkbeck College
Malu Castellanos, HP Labs
Badrish Chandramouli, Microsoft Research
Adriane Chapman, Mitre
Gang Chen, Zhejiang University
Yi Chen, New Jersey Institute of Technology
James Cheng, CUHK
Reynold Cheng, University of Hong Kong
Brian Cooper, Google, USA
Philippe Cudré-Mauroux, University of Fribourg
Carlo Curino, MIT
Gautam Das, UT Arlington and QCRI
Sudipto Das, Microsoft Research
Anish Das Sarma
Atish Das Sarma, eBay Research Labs
Khuzaima Daudjee, University of Waterloo
Antonios Deligiannakis, Technical University of Crete
Daniel Deutch, Ben Gurion University
Yanlei Diao, University of Massachusetts Amherst
Xin (Luna) Dong, Google, USA
Sameh Elnikety, Microsoft Research
Mohamed Eltabakh, Worcester Polytechnic Institute
Ihab F. Ilyas, QCRI
Hakan Ferhatosmanoglu, Bilkent University
Ada Wai-Chee Fu, Chinese University of Hong Kong
Minos Garofalakis, Technical University of Crete
Wolfgang Gatterbauer, Carnegie Mellon University
Tingjian Ge, University of Massachusetts, Lowell
Buğra Gedik, Bilkent University
Rainer Gemulla, Max-Planck-Institut Saarbrücken
Gabriel Ghinita, University of Massachusetts Boston
Parke Godfrey, York University
Lukasz Golab, University of Waterloo

Sergio Greco, University of Calabria
Le Gruenwald, University of Oklahoma
Giovanna Guerrini, Universita Genova
Krishna Gummadi, MPI-SWS
Rahul Gupta, Google Research
Rajeev Gupta, IBM Research
Shyam Gupta, IIT Delhi
Marios Hadjieleftheriou, AT&T Labs
Wook-Shin Han, KNU, Korea
Kuno Harumi, HP Labs
Bingsheng He, NTU Singapore
Sven Helmer, Free University of Bozen-Bolzano
Jan Hidders, TUDelft
Wei Hong, Cisco Systems Inc.
Katja Hose, Aalborg University
Zi Huang, University of Queensland
Jeong-Hyon Hwang, SUNY - Albany
Seung-won Hwang, POSTECH, Korea
Stratos Idreos, CWI
Yoshiharu Ishikawa, Nagoya University
Zachary Ives, University of Pennsylvania
Ricardo Jimenez-Peris, Technical University of Madrid
Cheqing Jin, East China Normal University
Ruoming Jin, Kent State University
Alekh Jindal, Saarland University/MIT
Ryan Johnson, University of Toronto
Dmitri V Kalashnikov, UC Irvine
Panos Kalnis, KAUST, Saudi Arabia
Ben Kao, Hong Kong University
Panagiotis Karras, Rutgers University
Yiping Ke, Institute of High Performance Computing
Bettina Kemme, McGill University
Daniel Kifer, PSU
Benny Kimelfeld, IBM
Hideaki Kimura, Microsoft Jim Gray Systems Lab
George Kollios, Boston University
Christian König, Microsoft Research
Tim Kraska, Brown University
Laks V. S. Lakshmanan, University of British Columbia
Mounia Lalmas, Yahoo Inc.
Mong-Li Lee, National University of Singapore
Wolfgang Lehner, Technische Universität Dresden
Justin Levandoski, Microsoft Research
Chengkai Li, The University of Texas at Arlington
Cuiping Li, Renmin University of China
Feifei Li, University of Utah
Guoliang Li, Tsinghua University
Jianzhong Li, Harbin Institute of Technology
Yunyao Li, IBM Almaden
Zhanhuai Li, Northwestern Polytechnical University
Dan Lin, Missouri S&T, USA
Xuemin Lin, University of New South Wales
Bin Liu, NEC Labs America
Ziyang Liu, NEC Labs America
Eric Lo, The Hong Kong Polytechnic University
Qiong Luo, HKUST
Shuai Ma, Beihang University
Ashwin Machanavajjhala, Duke University
Brad Malin, Duke University
Nikos Mamoulis, University of Hong Kong
Stefan Manegold, CWI
Murali Mani, University of Michigan
Ioana Manolescu, INRIA, France
Amélie Marian, Rutgers University
Volker Markl, TU Berlin
Marta Mattoso, Federal University of Rio de Janeiro
Frank McSherry, Microsoft
Alexandra Meliou, UMass Amherst
Marco Mesiti, University of Milano
Dan Miranker, The University of Texas at Austin
Mohamed Mokbel, University of Minnesota
Bongki Moon, Seoul National University
Yasuhiko Morimoto, Hiroshima University
Mirella Moro, Universidade Federal de Minas Gerais
Kyriakos Mouratidis, SMU, Singapore
Karin Murthy, IBM India

Arnab Nandi, Ohio State University
Wolfgang Nejdl, University of Hannover
Thomas Neumann, Technology University Munchen
Boris Novikov, St Petersburg University
Dan Olteanu, Oxford University
Gultekin Ozsoyoglu, Case Western Reserve University
Tamer Ozsu, University of Waterloo
Esther Pacitti, University of Montpellier
Themis Palpanas, University of Trento
Ippokratis Pandis, IBM Almaden
Stelios Paparizos, Microsoft Research
Aditya Parameswaran, Stanford University
Srinivasan Parthasarathy, The Ohio State University
Jignesh Patel, University of Wisconsin
Andrew Pavlo, Brown University
Peter Pietzuch, Imperial College London
Neoklis Polyzotis, University of California - Santa Cruz
Cecilia M. Procopiuc, AT&T Labs
Li Qian, University of Michigan
Jorge Quiané-Ruiz, QCRI
Elisa Quintarelli, Politecnico di Milano
Maya Ramanath, IIT Delhi
Louiqa Raschid, University of Maryland
Vibhar Rastogi, Yahoo
Matthias Renz, University of Munich
Kenneth Ross, Columbia University
Sourav S Bhowmick, NTU, Singapore
Dimitris Sacharidis, IMIS Athena, Greece
Kenneth Salem, University of Waterloo
Maria Sapino, University of Torino
Kai-Uwe Sattler, TU Ilmenau
Monica Scannapieco, ISTAT
Bernhard Seeger, University of Marburg
Lidan Shou, Zhejiang University
Adam Silberstein, Trifacta
Lisa Singh, Georgetown University
Radu Sion, Stony Brook University
Yufei Tao, Chinese University of Hong Kong
Nesime Tatbul, ETH Zurich
Arash Termehchy, Oregon State University
Evimaria Terzi, University of Boston
Martin Theobald, Max Planck Institute, Germany
Srikanta Tirthapura, Iowa State University
Riccardo Torlone, Roma Tre University
Anthony Tung, National University of Singapore
Kostas Tzoumas, Technical University of Berlin
Sergei Vassilvitskii, Google Research
Marcos Vaz Salles, University of Copenhagen (DIKU)
Stratis Viglas, University of Edinburgh
Hoang Tam Vo, National University of Singapore
Daisy Zhe Wang, University of Florida
Haixun Wang, Microsoft Research Asia
Ke Wang, Simon Fraser University
Wei Wang, University of New South Wales
Xiaoling Wang, East China Normal University
Ingmar Weber, Yahoo
Raymond Chi Wing Wong, HKUST
Sai Wu, Zhejiang University
Yuqing Wu, Indiana University
Xiaokui Xiao, NTU
Dong Xin, Google
Jianliang Xu, Hong Kong Baptist University
Jun (Jim) Xu, Georgia Institute of Technology
Xifeng Yan, University of Santa Barbara
Xiaoyan Yang, Advanced Digital Science Center
Ke Yi, HKUST
Ge Yu, Northeastern University, China
Hwanjo Yu, POSTECH, Korea
Meihui Zhang, National University of Singapore
Wenjie Zhang, The University of New South Wales
Ying Zhang, The University of New South Wales
Zhenjie Zhang, Advanced Digital Science Center
Wenzhao Zhou, Georgetown University
Xiaofang Zhou, University of Queensland

PhD Workshop Chairs
Erich Neuhold, University of Vienna
Yunyao Li, IBM

Proceedings Chairs
Li Xiong, Emory University
Cong Yu, Google Research

Sponsorship Chairs
Mike Carey, University of California, Irvine
Lizhu Zhou, Tsinghua University

Treasury Chair
Li (Eric) Qian, University of Michigan

Local Organization Chair
Lidan Shou, Zhejiang University

VLDB Endowment Liaison
Kyu-Young Whang, KAIST

Web Management Chair
Sai Wu, Zhejiang University

PVLDB Managing Editor
Divesh Srivastava, AT&T Labs

Conference and Registration Chairs
Ke Chen, Zhejiang University
Cuiping Li, Renmin University

PVLDB Information Director
Gerald Weber, University of Auckland

Publicity Chairs
Vasilis Vassalos, AUEB, Greece
Dunlu Peng, USST, China

PVLDB Advisory Committee
Philip Bernstein, Michael Böhlen, Peter Buneman, Susan Davidson, Z. Meral Ozsoyoglu, S. Sudarshan, Gerhard Weikum

Logo Design
Guanmin Guo

FIRST VLDB IN MAINLAND CHINA

As the Local Organization Chair of this year's VLDB, I would like to take this opportunity to introduce the wonderful city of Hangzhou, host of the upcoming VLDB 2014. Located at the southern tip of the Yangtze River Delta, Hangzhou is the capital of the highly developed Zhejiang province in eastern China. With a culturally rich history dating back to two centuries B.C., Hangzhou boasts an influential position in ancient poetry, textile design, calligraphy, and numerous other forms of traditional Chinese art. Besides all these man-made wonders, the city is most famous for the natural scenery around the mythical West Lake, a major tourist area surrounded by dozens of attractions. While boating on the lake is a must for most visitors, you may be more excited by a detour into the serenity of the HuPao Ancient Temple, a light hike at dawn towards the Baochu Pagoda, or cycling at dusk on the Bai Causeway.

While maintaining its historic, romantic name of "earthly paradise", Hangzhou is striving to earn the title of "IT paradise" of China today. The city is home to several major players in the Chinese ICT industry. It has also attracted tens of thousands of young technology entrepreneurs venturing out of its High-Tech District and University Incubators.

This year's VLDB is not only a fiesta for the world's database academics and industry, but also a landmark event for the Chinese database community. On behalf of the local organizing team at Zhejiang University, I would like to deliver my heartfelt thanks to all those who have helped and sponsored this event. Please join us in the picturesque city of Hangzhou!

Lidan Shou, Zhejiang University, Hangzhou, Zhejiang, China Local Organization Chair, VLDB 2014
