Indexing Tree-Based Indexing

Total Page:16

File Type:pdf, Size:1020Kb

Indexing Tree-Based Indexing Indexing Chapter 8, 10, 11 Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Tree-Based Indexing The data entries are arranged in sorted order by search key value. A hierarchical search data structure (tree) is maintained that directs searches to the correct page of data entries. Tree-structured indexing techniques support both range searches and equality searches. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 2 Trees Tree A structure with a unique starting node (the root), in which each node is capable of having child nodes and a unique path exists from the root to every other node Root The top node of a tree structure; a node with no parent Leaf Node A tree node that has no children Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 3 Trees Level Distance of a node from the root Height The maximum level Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 4 Time Complexity of Tree Operations Time complexity of Searching: . O(height of the tree) Time complexity of Inserting: . O(height of the tree) Time complexity of Deleting: . O(height of the tree) Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 5 Multi-way Tree If we relax the restriction that each node can have only one key, we can reduce the height of the tree. A multi-way search tree is a tree in which the nodes hold between 1 to m-1 distinct keys Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 6 Range Searches ``Find all students with gpa > 3.0’’ . If data is in sorted file, do binary search to find first such student, then scan to find others. Cost of binary search can be quite high. Simple idea: Create an `index’ file. k1 k2 kN Index File Page 1 Page 2 Page 3 Page N Data File Can do binary search on (smaller) index file! Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 7 Property of Multi-way Tree The keys in each node are sorted. A node with k values has k+1 sub-trees, where the sub-trees may be empty. The i’th sub-tree of a node [v1, ..., vk], 0 < i < k, may hold only values v in the range vi < v < vi+1 (v0 is assumed to equal -∞, and vk+1is assumed to equal +∞). A m-way tree of height h has between h and mh - 1 keys. The height of a complete m-ary tree with n nodes is ceiling(logmn). Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 8 Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 9 Searching m-way Tree We make an m-way branching decision at each node according to the number of the node’s children. Searching is performed in a recursive way. Time complexity is O(h). Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 10 Two Popular Trees ISAM (Indexed Sequential Access Method) . A static structure; B+ tree: . A dynamic structure, adjusts gracefully under inserts and deletes. Leaf pages of both of them contain data entries. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 11 ISAM index entry P K P K P K P 0 1 1 2 2 m m Index file may still be quite large. But we can apply the idea repeatedly! Non-leaf Pages Leaf Pages Overflow page Primary pages Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 12 Comments on ISAM A multi-way tree Each tree node is a disk page; Data Pages Leaf pages contain data entries . Alternative 1 : Leaf pages are created with data record with key value k; All leaf pages are allocated sequentially and sorted on the search Index Pages key value. Alternative 2 or 3 : Data records are created and sorted in a separate file, and then storing <key, rid> in the leaf pages of ISAM index. Overflow pages . Then index pages allocated, then space for overflow pages. • Index entries: <search key value, page id>; they `direct’ search for data entries, which are in leaf pages. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 13 Data Comments on ISAM Pages Index Pages Search: Start at root; use key comparisons to go to leaf. Cost log F N ; . F = # entries/index pg, N = # leaf pgs Overflow pages Insert: Find leaf data entry belongs to, and put it there. Delete: Find and remove from leaf; if empty overflow page, de-allocate. Static tree structure: inserts/deletes affect only leaf pages. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 14 Example ISAM Tree Each node can hold 2 entries; no need for `next-leaf-page’ pointers. (Why? ->static) Example: search a record with the key value Root 27. 40 20 33 51 63 10* 15* 20* 27* 33* 37* 40* 46* 51* 55* 63* 97* Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 15 After Inserting 23*, 48*, 41*, 42* ... Root Index 40 Pages 20 33 51 63 Primary Leaf 10* 15* 20* 27* 33* 37* 40* 46* 51* 55* 63* 97* Pages Overflow 23* 48* 41* Pages 42* Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 16 ... Then Deleting 42*, 51*, 97* The number of primary leaf pages is fixed at file creation time – STATIC. Root 40 20 33 51 63 10* 15* 20* 27* 33* 37* 40* 46* 55* 63* 23* 48* 41* Static tree structure: inserts/deletes affect only leaf pages. Note that 51* appears in index levels, but not in leaf! Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 17 Comments on ISAM Static design leads to the problem that long overflow chains could develop. To alleviate this problem, the tree is initially created so that about 20% of each page is free. Static design has the advantage that locking step is not needed since index-level pages are never modified. Static tree structure: inserts/deletes affect only leaf pages. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 18 B+ Trees A Dynamic Index Structure Most Widely Used Index A balanced tree in which the internal nodes direct the search and the leaf nodes contain the data entries. Tree structure grows and shrinks dynamically, leaf pages are not sequentially allocated. Leaf pages are organized in doubly linked list. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 19 B+ Tree Indexes Non-leaf Pages Leaf Pages (Sorted by search key) Double linked list Leaf pages contain data entries, and are chained (prev & next) Non-leaf pages have index entries; only used to direct searches: index entry P K P K P 0 1 1 2 P 2 K m m Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 20 Fan-out of the Tree Fan-out of the tree: the average number of children for a non-leaf node. If every non-leaf node has n children, a tree of height h has nh leaf pages. A good approximation of the number of leaf pages, Fh ( F is the average # of children, which is at least 100). Example: . A tree of height 4 contains 100 million leaf pages. A binary search will take log2100,000,000 > 25 I/Os. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 21 B+ Trees Insert/delete at log F N cost; keep tree height-balanced. (F = fanout, N = # leaf pages) Minimum 50% occupancy (except for root). Each node contains m entries, where d <= m <= 2d . Except the root has m entries, where 1 <= m <= 2d. The parameter d is called the order of the tree. Supports equality and range-searches efficiently. Index Entries (Direct search) Data Entries ("Sequence set") Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 22 Example B+ Tree Root Note how data entries 17 in leaf level are sorted Entries <= 17 Entries > 17 5 13 27 30 2* 3* 5* 7* 8* 14* 16* 22* 24* 27* 29* 33* 34* 38* 39* Find 28*? 29*? All > 15* and < 30* Insert/delete: Find data entry in leaf, then change it. Need to adjust parent sometimes. And change sometimes bubbles up the tree Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 23 B+ Trees in Practice Typical order: d=100. Typical fill-factor: 67%. average fanout = 133 Typical capacities: . Height 4: 1334 = 312,900,700 records . Height 3: 1333 = 2,352,637 records Can often hold top levels in buffer pool: . Level 1 = 1 page = 8 Kbytes . Level 2 = 133 pages = 1 Mbyte . Level 3 = 17,689 pages = 133 MBytes Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 24 Inserting a Data Entry into a B+ Tree Find correct leaf node L. Put data entry onto L. If L has enough space, done! . Else, must split L (into L and a new node L2) • Redistribute entries evenly, copy up middle key. • Insert index entry pointing to L2 into parent of L. This can happen recursively . To split index node, redistribute entries evenly, but push up middle key. (Contrast with leaf splits.) Splits “grow” tree; root split increases height. Tree growth: gets wider or one level taller at top. Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 25 Inserting 8* entry into B+ Tree Root 13 17 24 30 2* 3* 5* 7* 14* 16* 19* 20* 22* 24* 27* 29* 33* 34* 38* 39* Entry to be inserted in parent node. Split the full leaf 5 (Note that 5 iss copied up and continues to appear in the leaf.) node, copy up the middle key. 2* 3* 5* 7* 8* Database Management Systems 3ed, R.
Recommended publications
  • Search Trees
    Lecture III Page 1 “Trees are the earth’s endless effort to speak to the listening heaven.” – Rabindranath Tagore, Fireflies, 1928 Alice was walking beside the White Knight in Looking Glass Land. ”You are sad.” the Knight said in an anxious tone: ”let me sing you a song to comfort you.” ”Is it very long?” Alice asked, for she had heard a good deal of poetry that day. ”It’s long.” said the Knight, ”but it’s very, very beautiful. Everybody that hears me sing it - either it brings tears to their eyes, or else -” ”Or else what?” said Alice, for the Knight had made a sudden pause. ”Or else it doesn’t, you know. The name of the song is called ’Haddocks’ Eyes.’” ”Oh, that’s the name of the song, is it?” Alice said, trying to feel interested. ”No, you don’t understand,” the Knight said, looking a little vexed. ”That’s what the name is called. The name really is ’The Aged, Aged Man.’” ”Then I ought to have said ’That’s what the song is called’?” Alice corrected herself. ”No you oughtn’t: that’s another thing. The song is called ’Ways and Means’ but that’s only what it’s called, you know!” ”Well, what is the song then?” said Alice, who was by this time completely bewildered. ”I was coming to that,” the Knight said. ”The song really is ’A-sitting On a Gate’: and the tune’s my own invention.” So saying, he stopped his horse and let the reins fall on its neck: then slowly beating time with one hand, and with a faint smile lighting up his gentle, foolish face, he began..
    [Show full text]
  • Lock-Free Search Data Structures: Throughput Modeling with Poisson Processes
    Lock-Free Search Data Structures: Throughput Modeling with Poisson Processes Aras Atalar Chalmers University of Technology, S-41296 Göteborg, Sweden [email protected] Paul Renaud-Goud Informatics Research Institute of Toulouse, F-31062 Toulouse, France [email protected] Philippas Tsigas Chalmers University of Technology, S-41296 Göteborg, Sweden [email protected] Abstract This paper considers the modeling and the analysis of the performance of lock-free concurrent search data structures. Our analysis considers such lock-free data structures that are utilized through a sequence of operations which are generated with a memoryless and stationary access pattern. Our main contribution is a new way of analyzing lock-free concurrent search data structures: our execution model matches with the behavior that we observe in practice and achieves good throughput predictions. Search data structures are formed of basic blocks, usually referred to as nodes, which can be accessed by two kinds of events, characterized by their latencies; (i) CAS events originated as a result of modifications of the search data structure (ii) Read events that occur during traversals. An operation triggers a set of events, and the running time of an operation is computed as the sum of the latencies of these events. We identify the factors that impact the latency of such events on a multi-core shared memory system. The main challenge (though not the only one) is that the latency of each event mainly depends on the state of the caches at the time when it is triggered, and the state of caches is changing due to events that are triggered by the operations of any thread in the system.
    [Show full text]
  • Persistent Predecessor Search and Orthogonal Point Location on the Word
    Persistent Predecessor Search and Orthogonal Point Location on the Word RAM∗ Timothy M. Chan† August 17, 2012 Abstract We answer a basic data structuring question (for example, raised by Dietz and Raman [1991]): can van Emde Boas trees be made persistent, without changing their asymptotic query/update time? We present a (partially) persistent data structure that supports predecessor search in a set of integers in 1,...,U under an arbitrary sequence of n insertions and deletions, with O(log log U) { } expected query time and expected amortized update time, and O(n) space. The query bound is optimal in U for linear-space structures and improves previous near-O((log log U)2) methods. The same method solves a fundamental problem from computational geometry: point location in orthogonal planar subdivisions (where edges are vertical or horizontal). We obtain the first static data structure achieving O(log log U) worst-case query time and linear space. This result is again optimal in U for linear-space structures and improves the previous O((log log U)2) method by de Berg, Snoeyink, and van Kreveld [1995]. The same result also holds for higher-dimensional subdivisions that are orthogonal binary space partitions, and for certain nonorthogonal planar subdivisions such as triangulations without small angles. Many geometric applications follow, including improved query times for orthogonal range reporting for dimensions 3 on the RAM. ≥ Our key technique is an interesting new van-Emde-Boas–style recursion that alternates be- tween two strategies, both quite simple. 1 Introduction Van Emde Boas trees [60, 61, 62] are fundamental data structures that support predecessor searches in O(log log U) time on the word RAM with O(n) space, when the n elements of the given set S come from a bounded integer universe 1,...,U (U n).
    [Show full text]
  • Lecture III BALANCED SEARCH TREES
    §1. Keyed Search Structures Lecture III Page 1 Lecture III BALANCED SEARCH TREES Anthropologists inform that there is an unusually large number of Eskimo words for snow. The Computer Science equivalent of snow is the tree word: we have (a,b)-tree, AVL tree, B-tree, binary search tree, BSP tree, conjugation tree, dynamic weighted tree, finger tree, half-balanced tree, heaps, interval tree, kd-tree, quadtree, octtree, optimal binary search tree, priority search tree, R-trees, randomized search tree, range tree, red-black tree, segment tree, splay tree, suffix tree, treaps, tries, weight-balanced tree, etc. The above list is restricted to trees used as search data structures. If we include trees arising in specific applications (e.g., Huffman tree, DFS/BFS tree, Eskimo:snow::CS:tree alpha-beta tree), we obtain an even more diverse list. The list can be enlarged to include variants + of these trees: thus there are subspecies of B-trees called B - and B∗-trees, etc. The simplest search tree is the binary search tree. It is usually the first non-trivial data structure that students encounter, after linear structures such as arrays, lists, stacks and queues. Trees are useful for implementing a variety of abstract data types. We shall see that all the common operations for search structures are easily implemented using binary search trees. Algorithms on binary search trees have a worst-case behaviour that is proportional to the height of the tree. The height of a binary tree on n nodes is at least lg n . We say that a family of binary trees is balanced if every tree in the family on n nodes has⌊ height⌋ O(log n).
    [Show full text]
  • Finger Search Trees
    11 Finger Search Trees 11.1 Finger Searching....................................... 11-1 11.2 Dynamic Finger Search Trees ....................... 11-2 11.3 Level Linked (2,4)-Trees .............................. 11-3 11.4 Randomized Finger Search Trees ................... 11-4 Treaps • Skip Lists 11.5 Applications............................................ 11-6 Optimal Merging and Set Operations • Arbitrary Gerth Stølting Brodal Merging Order • List Splitting • Adaptive Merging and University of Aarhus Sorting 11.1 Finger Searching One of the most studied problems in computer science is the problem of maintaining a sorted sequence of elements to facilitate efficient searches. The prominent solution to the problem is to organize the sorted sequence as a balanced search tree, enabling insertions, deletions and searches in logarithmic time. Many different search trees have been developed and studied intensively in the literature. A discussion of balanced binary search trees can e.g. be found in [4]. This chapter is devoted to finger search trees which are search trees supporting fingers, i.e. pointers, to elements in the search trees and supporting efficient updates and searches in the vicinity of the fingers. If the sorted sequence is a static set of n elements then a simple and space efficient representation is a sorted array. Searches can be performed by binary search using 1+⌊log n⌋ comparisons (we throughout this chapter let log x denote log2 max{2, x}). A finger search starting at a particular element of the array can be performed by an exponential search by inspecting elements at distance 2i − 1 from the finger for increasing i followed by a binary search in a range of 2⌊log d⌋ − 1 elements, where d is the rank difference in the sequence between the finger and the search element.
    [Show full text]
  • Data Structure for Language Processing
    Data Structure for Language Processing Bhargavi H. Goswami Assistant Professor Sunshine Group of Institutions INTRODUCTION: • Which operation is frequently used by a Language Processor? • Ans: Search. • This makes the design of data structures a crucial issue in language processing activities. • In this chapter we shall discuss the data structure requirements of LP and suggest efficient data structure to meet there requirements. Criteria for Classification of Data Structure of LP: • 1. Nature of Data Structure: whether a “linear” or “non linear”. • 2. Purpose of Data Structure: whether a “search” DS or an “allocation” DS. • 3. Lifetime of a data structure: whether used during language processing or during target program execution. Linear DS • Linear data structure consist of a linear arrangement of elements in the memory. • Advantage: Facilitates Efficient Search. • Dis-Advantage: Require a contagious area of memory. • Do u consider it a problem? Yes or No? • What the problem is? • Size of a data structure is difficult to predict. • So designer is forced to overestimate the memory requirements of a linear DS to ensure that it does not outgrow the allocated memory. • Disadvantage: Wastage Of Memory. Non Linear DS • Overcomes the disadvantage of Linear DS. HOW? • Elements of Non Linear DS are accessed using pointers. • Hence the elements need not occupy contiguous areas of memory. • Disadvantage: Non Linear DS leads to lower search efficiency. Linear & Non-Linear DS E F B E A H G F F H D C Linear Non Linear Search Data Structures • Search DS are used during LP’ing to maintain attribute information concerning different entities in source program.
    [Show full text]
  • Dynamic Interpolation Search Revisited
    Information and Computation 270 (2020) 104465 Contents lists available at ScienceDirect Information and Computation www.elsevier.com/locate/yinco ✩ Dynamic Interpolation Search revisited Alexis Kaporis a, Christos Makris b, Spyros Sioutas b, Athanasios Tsakalidis b, ∗ Kostas Tsichlas d, Christos Zaroliagis b,c, a Department of Information & Communication Systems Engineering, University of the Aegean, Karlovassi, Samos, 83200, Greece b Department of Computer Engineering and Informatics, University of Patras, 26500 Patras, Greece c Computer Technology Institute & Press “Diophantus”, N. Kazantzaki Str, Patras University Campus, 26504 Patras, Greece d Department of Informatics, Aristotle University of Thessaloniki, Greece a r t i c l e i n f o a b s t r a c t Article history: A new dynamic Interpolation Search (IS) data structure is presented that achieves Received 11 June 2018 O (log logn) search time with high probability on unknown continuous or even discrete Received in revised form 19 February 2019 input distributions with measurable probability of element collisions, including power law Accepted 15 March 2019 and Binomial distributions. No such previous result holds for IS when the probability of Available online 4 September 2019 element collisions is measurable. Moreover, our data structure exhibits O (1) search time Keywords: with high probability (w.h.p.) for a wide class of input distributions that contains all those Interpolation Search for which o(log logn) expected search time was previously known. © Dynamic predecessor search 2019 Elsevier Inc. All rights reserved. Dynamic search data structure 1. Introduction The dynamic predecessor search problem is one of the fundamental problems in computer science. In this problem we have to maintain a set of elements subject to insertions and deletions such that given a query element y we can retrieve the largest element in the set smaller or equal to y.
    [Show full text]
  • CMSC 420 Data Structures1
    CMSC 420 Data Structures1 David M. Mount Department of Computer Science University of Maryland Fall 2019 1 Copyright, David M. Mount, 2019, Dept. of Computer Science, University of Maryland, College Park, MD, 20742. These lecture notes were prepared by David Mount for the course CMSC 420, Data Structures, at the University of Maryland. Permission to use, copy, modify, and distribute these notes for educational purposes and without fee is hereby granted, provided that this copyright notice appear in all copies. Lecture Notes 1 CMSC 420 Lecture 1: Course Introduction and Background Algorithms and Data Structures: The study of data structures and the algorithms that ma- nipulate them is among the most fundamental topics in computer science. Most of what computer systems spend their time doing is storing, accessing, and manipulating data in one form or another. Some examples from computer science include: Information Retrieval: List the 10 most informative Web pages on the subject of \how to treat high blood pressure?" Identify possible suspects of a crime based on fingerprints or DNA evidence. Find movies that a Netflix subscriber may like based on the movies that this person has already viewed. Find images on the Internet containing both kangaroos and horses. Geographic Information Systems: How many people in the USA live within 25 miles of the Mississippi River? List the 10 movie theaters that are closest to my current location. If sea levels rise 10 meters, what fraction of Florida will be under water? Compilers: You need to store a set of variable names along with their associated types. Given an assignment between two variables we need to look them up in a symbol table, determine their types, and determine whether it is possible to cast from one type to the other).
    [Show full text]
  • Algorithms and Data Structures for IP Lookup, Packet Classification And
    Dissertation zur Erlangung des Doktorgrades der FakultÄatfÄurAngewandte Wissenschaften der Albert-Ludwigs-UniversitÄatFreiburg Algorithms and Data Structures for IP Lookup, Packet Classi¯cation and Conflict Detection Christine Maindorfer Betreuer: Prof. Dr. Thomas Ottmann Dekan der FakultÄatfÄurAngewandte Wissenschaften: Prof. Dr. Hans Zappe Betreuer: Prof. Dr. Thomas Ottmann Zweitgutachterin: Prof. Dr. Susanne Albers Tag der Disputation: 2.3.2009 Zusammenfassung Die Hauptaufgabe eines Internet-Routers besteht in der Weiterleitung von Paketen. Um den nÄachsten Router auf dem Weg zum Ziel zu bestimmen, wird der Header, welcher u.a. die Zieladresse enthÄalt,eines jeden Datenpaketes inspiziert und gegen eine Routertabelle abgeglichen. Im Falle, dass mehrere PrÄa¯xein der Routertabelle mit der Zieladresse Äubereinstimmen, wird in der Regel eine Strategie gewÄahlt,die als \Longest Pre¯x Matching" bekannt ist. Hierbei wird von allen mÄoglichen Aktio- nen diejenige ausgewÄahlt,die durch das lÄangstemit der Adresse Äubereinstimmende PrÄa¯xfestgelegt ist. Zur LÄosungdieses sogenannten IP-Lookup-Problems sind zahlreiche Algorithmen und Datenstrukturen vorgeschlagen worden. AnderungenÄ in der Netzwerktopologie aufgrund von physikalischen Verbindungsaus- fÄallen,der Hinzunahme von neuen Routern oder Verbindungen fÄuhrenzu Aktu- alisierungen in den Routertabellen. Da die Performanz der IP-Lookup-Einheit einen entscheidenden Einfluss auf die Gesamtperformanz des Internets hat, ist es entscheidend, dass IP-Lookup sowie Aktualisierungen so schnell wie mÄoglich durchgefÄuhrtwerden. Um diese Operationen zu beschleunigen, sollten Routerta- bellen so implementiert werden, dass Lookup und Aktualisierungen gleichzeitig ausgefÄuhrtwerden kÄonnen.Um zu sichern, dass auf SuchbÄaumenbasierte dynami- sche Routertabellen nicht durch Updates degenerieren, unterlegt man diese mit einer balancierten Suchbaumklasse. Relaxierte Balancierung ist ein gebrÄauchliches Konzept im Design von nebenlÄau¯gimplementierten SuchbÄaumen. Hierbei wer- den die Balanceoperationen ggf.
    [Show full text]
  • Data Structures and Algorithms (CS210A) Semester I – 2014-15
    Data Structures and Algorithms (CS210A) Semester I – 2014-15 Lecture 39 • Integer sorting : Radix Sort • Search data structure for integers : Hashing 1 Types of sorting algorithms In Place Sorting algorithm: A sorting algorithm which uses only O(1) extra space to sort. Example: Heap sort, Quick sort. Stable Sorting algorithm: A sorting algorithm which preserves the order of equal keys while sorting. 0 1 2 3 4 5 6 7 A 2 5 3 0 6.1 3 7.9 4 0 1 2 3 4 5 6 7 A 0 2 3 3 4 5 6.1 7.9 Example: Merge sort. 2 Integer Sorting algorithms Continued from last class Counting sort: algorithm for sorting integers Input: An array A storing 풏 integers in the range [0…풌 − ퟏ]. Output: Sorted array A. Running time: O(풏 + 풌) in word RAM model of computation. Extra space: O(풏 + 풌) Counting sort: a visual description 0 1 2 3 4 5 6 7 A 2 5 3 0 2 3 0 3 We couldWhy have did we used scan Count 0 1 2 3 4 5 arrayelements only ofto Aoutput in reverse the orderelements (from indexof A in 풏 sorted− ퟏ to ퟎ) Count 2 0 2 3 0 1 whileorder. placing Why did them we incompute the final sortedPlace andarray B B? ? 0 1 2 3 4 5 Place 21 2 4 67 7 8 Answer:Answer: TheTo inputensure might that beCounting an array sort of recordsis stable and. theThe aim reason to sort why these stability records is required according will tob someecome integer clear soonfield.
    [Show full text]
  • Comparison of Search Data Structures and Their Performance Evaluation with Detailed Analysis
    International Journal of Scientific & Engineering Research, Volume 4, Issue 4, April 2013 1437 ISSN 2229-5518 Comparison of search data structures and their performance evaluation with detailed analysis S.V.SRIDHAR, P.RAVINDER RAO, E.PRIYADARSHINI,N.VAMSI KRISHNA ABSTRACT: In the field of computer science and information technology , and many other related areas where large volumes of data are present we need to do many operations on those sets of data which are part of processing , analyzing and producing results and related conclusions . so in this paper we present one of the most commonly used operations – SEARCHING in the fields related to information processing. Along with a detailed study on sequential and binary search we also present performance analysis and a detailed technical view on the method of operation of the above said searching methods along with programmatic approach Index Terms— Minimum 7 keywords are mandatory, Keywords should closely reflect the topic and should optimally characterize the paper. Use about four key words or phrases in alphabetical order, separated by commas. I. INTRODUCTION: In computer science, a search data structure is any data structure that allows the efficient retrieval of specific items from a set of items, such as a specific record from a database or searching for the presence of an element among a list of elements. 23 14 8 55 71 31 11 43 12 13 Figure 1 The above picture represents a general list of elements that are saved in memory. Assume that in a particular instance we need to know whether element 71 is present in the above list or not, if present we need to know the position of the element among the above, thereby enabling to access the element if needed , because in the field of computer science a storage operation always occurs by presence of a memory address and a mechanism to access the value present at that location and also a mechanism and store back the value after usage if needed.
    [Show full text]
  • Exploring the Problem Space of Orthogonal Range Searching
    Exploring the Problem Space of Orthogonal Range Searching by Bryan T. Wilkinson A dissertation presented to the Faculty of Science and Technology at Aarhus University in fulfillment of a requirement for the degree of Doctor of Philosophy in Computer Science Aarhus, Denmark, 2015 c Bryan T. Wilkinson 2015 Abstract Orthogonal range searching is a fundamental problem in computational geometry: preprocess a set of points into a data structure such that we can efficiently answer ques- tions about the points that lie in axis-aligned rectangles called query ranges. There are many interesting variants of orthogonal range searching with differing complexities. We consider classic fundamental variants, new variants inspired by practical applications, and special cases that admit efficient solutions. In general, we explore the problem space of orthogonal range searching to refine our understanding of which problems can be solved efficiently. We thus become better informed of the feasibility of applying orthogonal range searching techniques to solve both practical and theoretical problems. We give improved bounds for dynamic range reporting, a fundamental variant in which points can be inserted and deleted and a query asks for a list of the points in the query range. We consider the problem in the word RAM model. The best previous data structure uses optimal space and query time, and O(logc n) update time for a positive constant c < 1. We reduce the exponent in the update time for the two- and three-sided special cases. We also obtain significantly more efficient data structures for the very special case of insertion-only range emptiness.
    [Show full text]