Exploiting Non-Uniform Query Distributions in Data Structuring Problems

Total Page:16

File Type:pdf, Size:1020Kb

Exploiting Non-Uniform Query Distributions in Data Structuring Problems Exploiting Non-Uniform Query Distributions in Data Structuring Problems by John Howat A thesis submitted to the Faculty of Graduate and Postdoctoral Affairs in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Computer Science Carleton University Ottawa, Ontario © 2012 John Howat Library and Archives Bibliotheque et Canada Archives Canada Published Heritage Direction du 1+1 Branch Patrimoine de I'edition 395 Wellington Street 395, rue Wellington Ottawa ON K1A0N4 Ottawa ON K1A 0N4 Canada Canada Your file Votre reference ISBN: 978-0-494-93655-9 Our file Notre reference ISBN: 978-0-494-93655-9 NOTICE: AVIS: The author has granted a non­ L'auteur a accorde une licence non exclusive exclusive license allowing Library and permettant a la Bibliotheque et Archives Archives Canada to reproduce, Canada de reproduire, publier, archiver, publish, archive, preserve, conserve, sauvegarder, conserver, transmettre au public communicate to the public by par telecommunication ou par I'lnternet, preter, telecommunication or on the Internet, distribuer et vendre des theses partout dans le loan, distrbute and sell theses monde, a des fins commerciales ou autres, sur worldwide, for commercial or non­ support microforme, papier, electronique et/ou commercial purposes, in microform, autres formats. paper, electronic and/or any other formats. The author retains copyright L'auteur conserve la propriete du droit d'auteur ownership and moral rights in this et des droits moraux qui protege cette these. Ni thesis. Neither the thesis nor la these ni des extraits substantiels de celle-ci substantial extracts from it may be ne doivent etre imprimes ou autrement printed or otherwise reproduced reproduits sans son autorisation. without the author's permission. In compliance with the Canadian Conformement a la loi canadienne sur la Privacy Act some supporting forms protection de la vie privee, quelques may have been removed from this formulaires secondaires ont ete enleves de thesis. cette these. While these forms may be included Bien que ces formulaires aient inclus dans in the document page count, their la pagination, il n'y aura aucun contenu removal does not represent any loss manquant. of content from the thesis. Canada Abstract This thesis examines data structures with query times that are a function of the distribution of queries made to them. When a query distribution exhibits non­ uniformity—as is often the case in many applications—the sequence of queries can often be executed faster. Several such problems are considered. For the dictionary problem, this thesis presents the first binary search tree to achieve the working-set property for individual queries in the worst case, a data structure that supports searching from arbitrary temporal fingers, and a data struc­ ture that supports a stronger version of the unified property. For the predecessor search problem in bounded universes, this thesis presents a data structure that answers queries in time that is a function of the distance between the query and its answers (which leads to several applications in the areas of nearest neighbour search and range searching), as well as several data structures that answer queries in time that is a function of the entropy of the query distribution and various space requirements. Acknowledgements A PhD thesis does not happen in isolation. I owe my supervisors, Prosenjit Bose and Pat Morin, a large debt of gratitude for their assistance during my studies. Their willingness to work with me and read countless drafts of papers has been incredible. I cannot imagine how many times I’ve knocked on their doors over the years and asked, “Got a second?” My thanks are also due to the other members of my thesis committee: Paola Flochinni, Ian Munro, Michiel Smid, and Brett Stevens. Their feedback was invalu­ able. I also gratefully acknowledge the generous funding I have received from Car- leton University, the School of Computer Science, the Computational Geometry Laboratory, and the Natural Sciences and Engineering Research Council of Canada. It has been a pleasure to be a part of the Computational Geometry Laboratory at Carleton University for the past five years. This lab would not be possible without Prosenjit Bose, Anil Maheshwari, Pat Morin, and Michiel Smid, as well as our many students and postdoctoral fellows (past and present). Our frequent seminars, open problem sessions, and social activities have all resulted in a happy and productive environment. The laboratory was lucky enough to host a number of visitors dur­ ing my time here. In particular, discussions with Rolf Fagerberg and John Iacono contributed to some parts of this thesis. Like any good graduate student at Carleton, I spent a completely reasonable amount of time at Mike’s Place. My sincere thanks are owed to my fellow graduate students in other departments for joining me there. In particular, I thank Matthew Meier, Tara Ogaick, and Elise Vist for helping me unwind after long days of research. I am fortunate to have an incredibly supportive family. I can only begin to offer my thanks: my father, Robert Howat, for starting my journey with an old 386; my late mother, Judy Howat, for being my biggest advocate; my sister, Laurie Howat, for being my number one fan; my stepmother, Debbie Howat, for letting me talk through everything; and more extended family than I can enumerate. I could not have asked for more support. Finally, I thank my wonderful wife, Bianca Howat, for helping me through grad­ uate school in more ways than I suspect either of us realize. I would not have been able to complete this thesis without her love and support (and occasional interest in the Fibonacci numbers). Contents Abstract ii Acknowledgements iii 1 Introduction 1 1.1 Motivation ................................................................................................ 2 1.2 Problem Statements .............................................................................. 3 1.3 Summary of Contributions ..................................................................... 5 1.4 Bibliographic Notes................................................................................. 7 1.5 Organization of the T hesis ..................................................... 8 2 Background 10 2.1 Optimum Binary Search T rees ............................................................... 10 2.2 Defining Non-Uniformity ........................................................................ 11 2.2.1 Static and Dynamic O ptim ality ................................................ 12 2.2.2 Key-Independent Optimality ...................................................... 13 2.2.3 The Working-Set Property ......................................................... 14 2.2.4 The Queueish Property ............................................................... 15 v 2.2.5 The Static and Dynamic Finger P roperties .............................. 16 2.2.6 The Unified Property ................................................................... 17 2.3 Splay Trees ................................... 18 2.4 Some Distribution-Sensitive Data Structures ............................ 19 3 The Working-Set Property in Binary Search TYees 26 3.1 Problem Definition ................................................................................. 26 3.1.1 Background ................................................................................. 27 3.1.2 The Working-Set Structure . ............................................. 27 3.1.3 M odel .......................................................................... 30 3.2 Definition of Layered Working-Set T re e s '. 31 3.2.1 Tree Decom position .................................................................. 31 3.2.2 Encoding the Linked L is ts ......................................................... 33 3.3 Operations on Layered Working-Set T r e e s ........................................... 34 3.3.1 Intra-Layer Operations ......................................... 35 3.3.2 Inter-Layer Operations ............................................................... 38 3.3.3 Tree Operations ........................................................................... 44 3.4 Conclusion and Open Problem s ............................................... 47 4 Searching with Temporal Fingers 48 4.1 Problem Definition ................................................................................. 48 4.1.1 Defining Temporal D istance ...................................................... 49 4.1.2 Background ................................................................................. 50 4.2 The Data Structure .............................................................................. 50 4.2.1 The O ld Data S tructu re................................ 51 vi 4.2.2 The Y oung Data Structure ..................................................... 52 4.2.3 Performing a Query ................................................................. 52 4.2.4 Access C o s t................................................................................ 53 4.3 Conclusion and Open Problem s ........................................................... 56 5 The Strong Unified Property 57 5.1 Problem Definition.............................................................................. 57 5.1.1 Defining the Strong Unified Property ......................... 58 5.1.2 Background ................... 60 5.2 Towards the Strong Unified P roperty .................................................. 61 5.2.1
Recommended publications
  • 1 Suffix Trees
    This material takes about 1.5 hours. 1 Suffix Trees Gusfield: Algorithms on Strings, Trees, and Sequences. Weiner 73 “Linear Pattern-matching algorithms” IEEE conference on automata and switching theory McCreight 76 “A space-economical suffix tree construction algorithm” JACM 23(2) 1976 Chen and Seifras 85 “Efficient and Elegegant Suffix tree construction” in Apos- tolico/Galil Combninatorial Algorithms on Words Another “search” structure, dedicated to strings. Basic problem: match a “pattern” (of length m) to “text” (of length n) • goal: decide if a given string (“pattern”) is a substring of the text • possibly created by concatenating short ones, eg newspaper • application in IR, also computational bio (DNA seqs) • if pattern avilable first, can build DFA, run in time linear in text • if text available first, can build suffix tree, run in time linear in pattern. • applications in computational bio. First idea: binary tree on strings. Inefficient because run over pattern many times. • fractional cascading? • realize only need one character at each node! Tries: • used to store dictionary of strings • trees with children indexed by “alphabet” • time to search equal length of query string • insertion ditto. • optimal, since even hashing requires this time to hash. • but better, because no “hash function” computed. • space an issue: – using array increases stroage cost by |Σ| – using binary tree on alphabet increases search time by log |Σ| 1 – ok for “const alphabet” – if really fussy, could use hash-table at each node. • size in worst case: sum of word lengths (so pretty much solves “dictionary” problem. But what about substrings? • Relevance to DNA searches • idea: trie of all n2 substrings • equivalent to trie of all n suffixes.
    [Show full text]
  • Managing Unbounded-Length Keys in Comparison-Driven Data
    Managing Unbounded-Length Keys in Comparison-Driven Data Structures with Applications to On-Line Indexing∗ Amihood Amir Gianni Franceschini Department of Computer Science Dipartimento di Informatica Bar-Ilan University, Israel Universita` di Pisa, Italy and Department of Computer Science John Hopkins University, Baltimore MD Roberto Grossi Tsvi Kopelowitz Dipartimento di Informatica Department of Computer Science Universita` di Pisa, Italy Bar-Ilan University, Israel Moshe Lewenstein Noa Lewenstein Department of Computer Science Department of Computer Science Bar-Ilan University, Israel Netanya College, Israel arXiv:1306.0406v1 [cs.DS] 3 Jun 2013 ∗Parts of this paper appeared as extended abstracts in [3, 20]. 1 Abstract This paper presents a general technique for optimally transforming any dynamic data struc- ture that operates on atomic and indivisible keys by constant-time comparisons, into a data structure that handles unbounded-length keys whose comparison cost is not a constant. Exam- ples of these keys are strings, multi-dimensional points, multiple-precision numbers, multi-key data (e.g. records), XML paths, URL addresses, etc. The technique is more general than what has been done in previous work as no particular exploitation of the underlying structure of is required. The only requirement is that the insertion of a key must identify its predecessor or its successor. Using the proposed technique, online suffix tree construction can be done in worst case time O(log n) per input symbol (as opposed to amortized O(log n) time per symbol, achieved by previously known algorithms). To our knowledge, our algorithm is the first that achieves O(log n) worst case time per input symbol.
    [Show full text]
  • Augmentation: Range Trees (PDF)
    Lecture 9 Augmentation 6.046J Spring 2015 Lecture 9: Augmentation This lecture covers augmentation of data structures, including • easy tree augmentation • order-statistics trees • finger search trees, and • range trees The main idea is to modify “off-the-shelf” common data structures to store (and update) additional information. Easy Tree Augmentation The goal here is to store x.f at each node x, which is a function of the node, namely f(subtree rooted at x). Suppose x.f can be computed (updated) in O(1) time from x, children and children.f. Then, modification a set S of nodes costs O(# of ancestors of S)toupdate x.f, because we need to walk up the tree to the root. Two examples of O(lg n) updates are • AVL trees: after rotating two nodes, first update the new bottom node and then update the new top node • 2-3 trees: after splitting a node, update the two new nodes. • In both cases, then update up the tree. Order-Statistics Trees (from 6.006) The goal of order-statistics trees is to design an Abstract Data Type (ADT) interface that supports the following operations • insert(x), delete(x), successor(x), • rank(x): find x’s index in the sorted order, i.e., # of elements <x, • select(i): find the element with rank i. 1 Lecture 9 Augmentation 6.046J Spring 2015 We can implement the above ADT using easy tree augmentation on AVL trees (or 2-3 trees) to store subtree size: f(subtree) = # of nodes in it. Then we also have x.size =1+ c.size for c in x.children.
    [Show full text]
  • Optimal Finger Search Trees in the Pointer Machine
    Alcom-FT Technical Report Series ALCOMFT-TR-02-77 Optimal Finger Search Trees in the Pointer Machine Gerth Stølting Brodal∗ George Lagogiannis† Christos Makris† Athanasios Tsakalidis† Kostas Tsichlas† Abstract We develop a new finger search tree with worst-case constant update time in the Pointer Machine (PM) model of computation. This was a major problem in the field of Data Structures and was tantalizingly open for over twenty years while many attempts by researchers were made to solve it. The result comes as a consequence of the innovative mechanism that guides the rebalancing operations combined with incremental multiple splitting and fusion techniques over nodes. Keywords balanced trees, update operations, finger search trees, data structures, complexity 1 Introduction The balanced search tree is one of the most common data structures used in algorithms. Assuming that the update position is known, balanced search trees with O(1) amortized update time have been presented long ago ([6, 14]). It has also been known ([6, 16]) that updates can be performed in O(1) structural changes, but the nodes to be changed have to be searched in Ω(log n) time. Levcopoulos and Overmars ([13]) presented an algorithm achieving O(1) worst case update time by using a global splitting lemma that is based on a pebble game combined with the bucketing technique of Overmars ([14]). Instead of storing single keys in the leaves of the search tree, each leaf can store a list of several keys. Unfortunately, the buckets in [13] have size O(log2 n), so they need a two level hierarchy of lists in order to guarantee O(log n) query time within the buckets.
    [Show full text]
  • Bulk Updates and Cache Sensitivity in Search Trees
    BULK UPDATES AND CACHE SENSITIVITY INSEARCHTREES TKK Research Reports in Computer Science and Engineering A Espoo 2009 TKK-CSE-A2/09 BULK UPDATES AND CACHE SENSITIVITY INSEARCHTREES Doctoral Dissertation Riku Saikkonen Dissertation for the degree of Doctor of Science in Technology to be presented with due permission of the Faculty of Information and Natural Sciences, Helsinki University of Technology for public examination and debate in Auditorium T2 at Helsinki University of Technology (Espoo, Finland) on the 4th of September, 2009, at 12 noon. Helsinki University of Technology Faculty of Information and Natural Sciences Department of Computer Science and Engineering Teknillinen korkeakoulu Informaatio- ja luonnontieteiden tiedekunta Tietotekniikan laitos Distribution: Helsinki University of Technology Faculty of Information and Natural Sciences Department of Computer Science and Engineering P.O. Box 5400 FI-02015 TKK FINLAND URL: http://www.cse.tkk.fi/ Tel. +358 9 451 3228 Fax. +358 9 451 3293 E-mail: [email protected].fi °c 2009 Riku Saikkonen Layout: Riku Saikkonen (except cover) Cover image: Riku Saikkonen Set in the Computer Modern typeface family designed by Donald E. Knuth. ISBN 978-952-248-037-8 ISBN 978-952-248-038-5 (PDF) ISSN 1797-6928 ISSN 1797-6936 (PDF) URL: http://lib.tkk.fi/Diss/2009/isbn9789522480385/ Multiprint Oy Espoo 2009 AB ABSTRACT OF DOCTORAL DISSERTATION HELSINKI UNIVERSITY OF TECHNOLOGY P.O. Box 1000, FI-02015 TKK http://www.tkk.fi/ Author Riku Saikkonen Name of the dissertation Bulk Updates and Cache Sensitivity in Search Trees Manuscript submitted 09.04.2009 Manuscript revised 15.08.2009 Date of the defence 04.09.2009 £ Monograph ¤ Article dissertation (summary + original articles) Faculty Faculty of Information and Natural Sciences Department Department of Computer Science and Engineering Field of research Software Systems Opponent(s) Prof.
    [Show full text]
  • Space-Efficient Data Structures, Streams, and Algorithms
    Andrej Brodnik Alejandro López-Ortiz Venkatesh Raman Alfredo Viola (Eds.) Festschrift Space-Efficient Data Structures, Streams, LNCS 8066 and Algorithms Papers in Honor of J. Ian Munro on the Occasion of His 66th Birthday 123 Lecture Notes in Computer Science 8066 Commenced Publication in 1973 Founding and Former Series Editors: Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen Editorial Board David Hutchison Lancaster University, UK Takeo Kanade Carnegie Mellon University, Pittsburgh, PA, USA Josef Kittler University of Surrey, Guildford, UK Jon M. Kleinberg Cornell University, Ithaca, NY, USA Alfred Kobsa University of California, Irvine, CA, USA Friedemann Mattern ETH Zurich, Switzerland John C. Mitchell Stanford University, CA, USA Moni Naor Weizmann Institute of Science, Rehovot, Israel Oscar Nierstrasz University of Bern, Switzerland C. Pandu Rangan Indian Institute of Technology, Madras, India Bernhard Steffen TU Dortmund University, Germany Madhu Sudan Microsoft Research, Cambridge, MA, USA Demetri Terzopoulos University of California, Los Angeles, CA, USA Doug Tygar University of California, Berkeley, CA, USA Gerhard Weikum Max Planck Institute for Informatics, Saarbruecken, Germany Andrej Brodnik Alejandro López-Ortiz Venkatesh Raman Alfredo Viola (Eds.) Space-Efficient Data Structures, Streams, and Algorithms Papers in Honor of J. Ian Munro on the Occasion of His 66th Birthday 13 Volume Editors Andrej Brodnik University of Ljubljana, Faculty of Computer and Information Science Ljubljana, Slovenia and University of Primorska,
    [Show full text]
  • Finger Search Trees
    11 Finger Search Trees 11.1 Finger Searching....................................... 11-1 11.2 Dynamic Finger Search Trees ....................... 11-2 11.3 Level Linked (2,4)-Trees .............................. 11-3 11.4 Randomized Finger Search Trees ................... 11-4 Treaps • Skip Lists 11.5 Applications............................................ 11-6 Optimal Merging and Set Operations • Arbitrary Gerth Stølting Brodal Merging Order • List Splitting • Adaptive Merging and University of Aarhus Sorting 11.1 Finger Searching One of the most studied problems in computer science is the problem of maintaining a sorted sequence of elements to facilitate efficient searches. The prominent solution to the problem is to organize the sorted sequence as a balanced search tree, enabling insertions, deletions and searches in logarithmic time. Many different search trees have been developed and studied intensively in the literature. A discussion of balanced binary search trees can e.g. be found in [4]. This chapter is devoted to finger search trees which are search trees supporting fingers, i.e. pointers, to elements in the search trees and supporting efficient updates and searches in the vicinity of the fingers. If the sorted sequence is a static set of n elements then a simple and space efficient representation is a sorted array. Searches can be performed by binary search using 1+⌊log n⌋ comparisons (we throughout this chapter let log x denote log2 max{2, x}). A finger search starting at a particular element of the array can be performed by an exponential search by inspecting elements at distance 2i − 1 from the finger for increasing i followed by a binary search in a range of 2⌊log d⌋ − 1 elements, where d is the rank difference in the sequence between the finger and the search element.
    [Show full text]
  • An..Time Algorithm for Triangulating
    SIAM J. COMPUT. (C) 1988 Society tbr Industrial and Applied Mathematics Voi. 17, No. I, February 1988 010 AN O(n log log n)-TIME ALGORITHM FOR TRIANGULATING A SIMPLE POLYGON* ROBERT E. TARJANf* AND CHRISTOPHER J. VAN WYK" Abstract. Given a simple n-vertex polygon, the triangulation problem is to partition the interior of the polygon into n-2 triangles by adding n-3 nonintersecting diagonals. We propose an O(n log logn)-time algorithm for this problem, improving on the previously best bound of O (n log n) and showing that triangu- lation is not as hard as sorting. Improved algorithms for several other computational geometry problems, including testing whether a polygon is simple, follow from our result. Key words, amortized time, balanced divide and conquer, heterogeneous finger search tree, homogene- ous finger search tree, horizontal visibility information, Jordan sorting with error-correction, simplicity test- ing AMS(MOS) subject classifications. 51M15, 68P05, 68Q25 1. Introduction. Let P be an n-vertex simple polygon, defined by a list Vo,V vn- of its vertices in clockwise order around the boundary. (The interior of the polygon is to the right as one walks clockwise around the boundary.) We denote the boundary of P by 0P. We assume throughout this paper (without loss of generality) that the vertices of P have distinct y-coordinates. For convenience we define vn V o. The edges of P are the open line segments whose endpoints are vi,vi+l for 0 < < n. The diagonals of P are the open line segments whose endpoints are vertices and that lie entirely in the interior of P.
    [Show full text]
  • Australian Journal
    ISSN 004-8917 AUSTRALIAN JOURNAL Volume 24, Number 2, May 1992 SPECIAL ISSUE ON ALGORITHMS 41 Guest Editorial 42 Algorithmic Issues in the Geometry of Planar Linkage Movement S WHITESIDES 51 Algorithms for Groups J CANNON and GEORGE HA VAS 61 Parallel Search Algorithms for Trees and Graphs PCHAUDHUR1 70 An Overview of Adaptive Sorting A MOFFA T and O PETERSSON SPECIAL FEATURES 50 Book Review Information 78 Book Reviews Published for Australian Computer Society Incorporated Registered by Australia Post, Publication No. NBG 1124 AUSTRALIAN Editor Computer R COOK, The University of Queensland ———■«™——«■—— iiiiiiiiii iiiiiiiiiiii Editorial Co-ordinator LOIS FORDHAM, The University of 77ie Australian Computer Journal is an official publication Queensland of the Australian Computer Society Incorporated. Associate Editors CONTRIBUTIONS to ensure uniformity of style. Referees may MJ LAWRENCE, University of NSW suggest major revisions to be performed by the All material for publication should be sent to: author. C ANDREWS, Mr Rob Cook, Editor Proofs and Reprints: Page proofs of Papers DN WILSON, University of Technology, Australian Computer Journal and Short Communications are sent to the Sydney Centre for Information Technology Research authors for correction prior to publication. Fifty K RAYMOND, The University of Queensland The University of Queensland, Qld 4072 copies of reprints will be supplied to authors J ZELEZNIKOW, Latrobe University without charge. Reprints of individual papers A MOFFAT, Melbourne University Prospective authors may wish to consult may be purchased from Associated Business E SONNENBERG, Melbourne University manuscript preparation guidelines published in Publications, PO Box 440, Broadway, NSW the February 1991 issue. The paragraphs below 2007.
    [Show full text]
  • Algorithms and Data Structures for Strings, Points and Integers Or, Points About Strings and Strings About Points
    Downloaded from orbit.dtu.dk on: Sep 23, 2021 Algorithms and Data Structures for Strings, Points and Integers or, Points about Strings and Strings about Points Vind, Søren Juhl Publication date: 2015 Document Version Publisher's PDF, also known as Version of record Link back to DTU Orbit Citation (APA): Vind, S. J. (2015). Algorithms and Data Structures for Strings, Points and Integers: or, Points about Strings and Strings about Points. Technical University of Denmark. DTU Compute PHD-2015 No. 366 General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. Users may download and print one copy of any publication from the public portal for the purpose of private study or research. You may not further distribute the material or use it for any profit-making activity or commercial gain You may freely distribute the URL identifying the publication in the public portal If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim. Algorithms and Data Structures for Strings, Points and Integers or, Points about Strings and Strings about Points Søren Vind Ph.D. Thesis PHD-2015-366 ISSN: 0909-3192 DTU Compute Department of Applied Mathematics and Computer Science Technical University of Denmark Richard Petersens Plads Building 324 2800 Kongens Lyngby, Denmark Phone +45 4525 3031 [email protected] www.compute.dtu.dk PHD-2015-366 ISSN: 0909-3192 Abstract This dissertation presents our research in the broad area of algorithms and data struc- tures.
    [Show full text]
  • Finger Search in Grammar-Compressed Strings
    View metadata,Downloaded citation and from similar orbit.dtu.dk papers on:at core.ac.uk Oct 23, 2019 brought to you by CORE provided by Online Research Database In Technology Finger Search in Grammar-Compressed Strings Bille, Philip; Christiansen, Anders Roy; Cording, Patrick Hagge; Gørtz, Inge Li Published in: Theory of Computing Systems Link to article, DOI: 10.1007/s00224-017-9839-9 Publication date: 2018 Document Version Peer reviewed version Link back to DTU Orbit Citation (APA): Bille, P., Christiansen, A. R., Cording, P. H., & Gørtz, I. L. (2018). Finger Search in Grammar-Compressed Strings. Theory of Computing Systems, 62(8), 1715-1735. https://doi.org/10.1007/s00224-017-9839-9 General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. Users may download and print one copy of any publication from the public portal for the purpose of private study or research. You may not further distribute the material or use it for any profit-making activity or commercial gain You may freely distribute the URL identifying the publication in the public portal If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim. Finger Search in Grammar-Compressed Strings Philip Bille Anders Roy Christiansen Patrick Hagge Cording [email protected] [email protected] [email protected] Inge Li Gørtz [email protected] Abstract Grammar-based compression, where one replaces a long string by a small context-free grammar that generates the string, is a simple and powerful paradigm that captures many popular compression schemes.
    [Show full text]
  • Worst Case Efficient Data Structures
    BRICS Basic Research in Computer Science BRICS DS-97-1 G. S. Brodal: Worst Case Efficient Data Structures Worst Case Efficient Data Structures Gerth Stølting Brodal BRICS Dissertation Series DS-97-1 ISSN 1396-7002 January 1997 Copyright c 1997, BRICS, Department of Computer Science University of Aarhus. All rights reserved. Reproduction of all or part of this work is permitted for educational or research use on condition that this copyright notice is included in any copy. See back inner page for a list of recent BRICS Dissertation Series publi- cations. Copies may be obtained by contacting: BRICS Department of Computer Science University of Aarhus Ny Munkegade, building 540 DK–8000 Aarhus C Denmark Telephone: +45 8942 3360 Telefax: +45 8942 3255 Internet: [email protected] BRICS publications are in general accessible through the World Wide Web and anonymous FTP through these URLs: http://www.brics.dk ftp://ftp.brics.dk This document in subdirectory DS/97/1/ Worst Case Efficient Data Structures Gerth Stølting Brodal Ph.D. Dissertation Department of Computer Science University of Aarhus Denmark Worst Case Efficient Data Structures A Dissertation Presented to the Faculty of Science of the University of Aarhus in Partial Fulfillment of the Requirements for the Ph.D. Degree by Gerth Stølting Brodal January 31, 1997 Abstract We study the design of efficient data structures. In particular we focus on the design of data structures where each operation has a worst case efficient implementations. The concrete prob- lems we consider are partial persistence, implementation of priority queues, and implementation of dictionaries. The first problem we consider is how to make bounded in-degree and out-degree data structures partially persistent, i.e., how to remember old versions of a data structure for later access.
    [Show full text]