Merging String Sequences by Longest Common Prefixes

Total Page:16

File Type:pdf, Size:1020Kb

Merging String Sequences by Longest Common Prefixes Information and Media Technologies 3(2): 236-245 (2008) reprinted from: IPSJ Digital Courier 4: 69-78 (2008) © Information Processing Society of Japan Regular Paper Merging String Sequences by Longest Common Prefixes Waihong Ng †1 and Katsuhiko Kakehi†2 We present LCP Merge, a novel merging algorithm for merging two ordered sequences of strings. LCP Merge substitutes string comparisons with integer comparisons whenever possible to reduce the number of character-wise comparisons as well as the number of key accesses by utilizing the longest common prefixes (LCP) between the strings. As one of the applications of LCP Merge, we built a string merge sort based on recursive merge sort by replacing the merging algorithm with LCP Merge and we call it LCP Merge sort. In case of sorting strings, the computational complexity of recursive merge sort tends to be greater than O(n lg n) because string comparisons are generally not constant time and depend on the properties of the strings. However, LCP Merge sort improves recursive merge sort to the extent that its computational complexity remains O(n lg n) on average. We performed a number of experiments to compare LCP Merge sort with other string sorting algorithms to evaluate its practical performance and the experimental results showed that LCP Merge sort is efficient even in the real-world. lem discussed in this paper. We applied LCP 1. Introduction Merge to build a string merge sort (we call it Merging is a fundamental computational pro- LCP Merge sort) and compared it with recur- cess which combines two or more ordered se- sive merge sort to evaluate its improvements. quences of objects into a single ordered se- In addition, LCP Merge sort is compared with quence of the objects and each object has a key four other string sorting algorithms: Multikey which governs the ordering of the object. Merg- Quicksort 1), Burstsort 13), MSD radix sort 10) ing has many applications, for instance, merge and CRadix sort 11), in a number of experi- sort, external sorting, etc. Classical merging ments to evaluate its performance. algorithms (e.g., Section 5.2.4 Algorithm M, The primal form of LCP Merge first appeared two-way merge, in Ref. 8)) assume the compar- in our previous work 4) and was called Length of ison operation runs in constant time. However, Longest Common Prefix Merge (LLCP Merge). when the keys are strings, which are widely But we renamed LLCP Merge to LCP Merge, used in practice, the computational complex- taking into account that the word LCP is very ity of the comparison operation is not constant common and it connotes ‘length’. but depends on the properties of the strings. In 2. Preliminaries this paper, we present Longest Common Pre- fix Merge or LCP Merge for short, a novel We define some terminology for the discus- merging algorithm for merging two ordered se- sions in this paper. quences of strings in which their keys are the Definition 2.1 An alphabet is a finite to- strings themselves. LCP Merge has a fewer tally ordered set of symbols and the elements number of key accesses and character-wise com- of the alphabet are called characters. parisons than those of classical merging algo- For example, {a, b,...,z} is the alphabet of rithms. The reductions in the number of key all lower-case English letters. accesses and the number of character-wise com- Without ambiguity, an alphabet is assumed parisons are achieved by utilizing the LCPs be- where appropriate in our discussion without be- tween the strings. Approaches utilizing LCP in- ing mentioned. formation are well-known for constructing suf- Definition 2.2 A string is a list of char- fix trees or suffix arrays 3),7),9). But to the best acters. Given a string s = s[1]s[2] ...s[a], s[i] of our knowledge, there is no published LCP (1 ≤ i ≤ a)isthei th character of s.Thenum- based merging algorithm which solves the prob- ber of characters in s, denoted |s|, is the length of s.Theempty or null string is the string †1 Graduate School of Fundamental Science and Engi- neering, Waseda University which has zero length and is denoted . †2 Faculty of Science and Engineering, Waseda Univer- Definition 2.3 Concatenation of two sity strings is defined as the juxtaposition of them. 236 Information and Media Technologies 3(2): 236-245 (2008) reprinted from: IPSJ Digital Courier 4: 69-78 (2008) © Information Processing Society of Japan For example, given that two strings s = or monotonically increasing if si ≤ si+1 for s[1]s[2] ...s[a]andt = t[1]t[2] ...t[b], then i :1≤ i<n. the concatenation of s and t is st = 3. LCP Merging Algorithm s[1]s[2] ...s[a]t[1]t[2] ...t[b]. Definition 2.4 Astringp is a prefix of The classical merging algorithm(Section 5.2.4 string s if s = ps for some string s. Algorithm M (two-way merge) in Ref. 8)) is Note that any string is a prefix of the string basedoncomparisonoperationwhichisas- itself and is a prefix of any string. sumed to be run in constant time. When the Definition 2.5 Astringp is a common pre- keys are strings, a general algorithm for the fix of the strings s and t iff s = ps and t = pt comparison operation will be similar to for some strings s and t.Whenp is the longest int strcmp(const char *s1, one among the common prefixes of s and t, i.e., const char *s2) |p| > |p| for any common prefix p = p of s and { t,thenp is the longest common prefix (LCP) of char c1, c2; s and t. The length of the longest common pre- int r; fix (LLCP) of s and t is denoted by the notation do { λ(s, t). c1 = *s1++; c2 = *s2++; For example, if we let s and t be strings over r = (unsigned char)c1 the alphabet containing English letters and sup- - (unsigned char)c2; pose s = AACDCDAB and t = AACDACEF, } while ((r==0) && (c1!=’\0’)); then the strings , A, AA, AAC and AACD are return r; the common prefixes, AACD is the LCP of s } and t,andλ(s, t)=4. The above algorithm (program) has the same Definition 2.6 Let S be a set of strings functionality as the standard C library func- containing at least two strings. If s = ∈ S and tion strcmp. It may be the simplest one as it l is the maximum of λ(s, t) for all t = s ∈ S, repeatedly compares the strings given in the ar- then the string guments character by character until a different s, l = |s|; one is found or a null character is encountered, p = s[1] ...s[l +1],l<|s|, and there is no way to reduce those character- wise comparisons without prior knowledge of is the distinguishing prefix 12) of s in S. the input strings 10). However, it is obvious that In other words, the distinguishing prefix of a the running time of the algorithm is not con- string is the string itself if the string forms a stant since the number of character-wise com- prefix of any other strings in the set; otherwise parisons required depends on the strings being it is the shortest prefix of the string that is not compared. Further, despite that this algorithm a prefix of other strings in the set. explicitly gives us the relative order and implic- Forexample,ifweconsiderthesetS = itly the LLCP of the strings (the number of it- {AACDCDAB, AACDACEF, AACEBA, AAC}, erations of the while loop minus one), known then the distinguishing prefix of AACDCDAB merging algorithms do not use their LLCP. is AACDC and that of AAC is AAC itself. Therefore, there arises the question whether Definition 2.7 The ordering relation < on we can utilize the LLCP information to ease strings is defined as follows (s = ps and t = pt our computational process in some context. In are strings and p is the LCP of s and t). the next section, we describe our merging algo- s>tif s[1] >t[1] or when s = t ∧ t = rithm, Longest Common Prefix Merge or LCP (s is greater than t) Merge for short, which utilizes LLCP informa- tion between the strings, for merging two or- s<tif s [1] <t[1] or when s = t ∧ s = dered string sequences. (s is smaller than t) 3.1 LCP Merge s = t if s = t = LCP Merge is built on the concept of anno- (s is equal to t) tated strings. It utilizes the LCP information Definition 2.8 A string sequence, denoted between the strings to substitutes string com- by s1,s2,...,sn, is a list of finite number parisons with integer comparisons whenever ap- of strings where si (1 ≤ i ≤ n) are strings. plicable, which in turn reduces the running time A string sequence is said to be ascending of the comparison operation in such cases. As a 237 Information and Media Technologies 3(2): 236-245 (2008) reprinted from: IPSJ Digital Courier 4: 69-78 (2008) © Information Processing Society of Japan result, a significant amount of key accesses and ing merges X and Y to form a single ascending character-wise comparisons are reduced when string sequence Z = z1,z2,...,za+b so that Z compared with those of classical merging algo- is a permutation of X and Y. Instead of merg- rithms. ing X and Y directly, we investigate merging 3.1.1 Annotated String them in their annotated string sequence forms.
Recommended publications
  • Optimizing Sorting with Genetic Algorithms ∗
    Optimizing Sorting with Genetic Algorithms ∗ Xiaoming Li, Mar´ıa Jesus´ Garzaran´ and David Padua Department of Computer Science University of Illinois at Urbana-Champaign {xli15, garzaran, padua}@cs.uiuc.edu http://polaris.cs.uiuc.edu Abstract 1 Introduction Although compiler technology has been extraordinarily The growing complexity of modern processors has made successful at automating the process of program optimiza- the generation of highly efficient code increasingly difficult. tion, much human intervention is still needed to obtain high- Manual code generation is very time consuming, but it is of- quality code. One reason is the unevenness of compiler im- ten the only choice since the code generated by today’s com- plementations. There are excellent optimizing compilers for piler technology often has much lower performance than some platforms, but the compilers available for some other the best hand-tuned codes. A promising code generation platforms leave much to be desired. A second, and perhaps strategy, implemented by systems like ATLAS, FFTW, and more important, reason is that conventional compilers lack SPIRAL, uses empirical search to find the parameter values semantic information and, therefore, have limited transfor- of the implementation, such as the tile size and instruction mation power. An emerging approach that has proven quite schedules, that deliver near-optimal performance for a par- effective in overcoming both of these limitations is to use ticular machine. However, this approach has only proven library generators. These systems make use of semantic in- successful on scientific codes whose performance does not formation to apply transformations at all levels of abstrac- depend on the input data.
    [Show full text]
  • Space-Efficient Data Structures, Streams, and Algorithms
    Andrej Brodnik Alejandro López-Ortiz Venkatesh Raman Alfredo Viola (Eds.) Festschrift Space-Efficient Data Structures, Streams, LNCS 8066 and Algorithms Papers in Honor of J. Ian Munro on the Occasion of His 66th Birthday 123 Lecture Notes in Computer Science 8066 Commenced Publication in 1973 Founding and Former Series Editors: Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen Editorial Board David Hutchison Lancaster University, UK Takeo Kanade Carnegie Mellon University, Pittsburgh, PA, USA Josef Kittler University of Surrey, Guildford, UK Jon M. Kleinberg Cornell University, Ithaca, NY, USA Alfred Kobsa University of California, Irvine, CA, USA Friedemann Mattern ETH Zurich, Switzerland John C. Mitchell Stanford University, CA, USA Moni Naor Weizmann Institute of Science, Rehovot, Israel Oscar Nierstrasz University of Bern, Switzerland C. Pandu Rangan Indian Institute of Technology, Madras, India Bernhard Steffen TU Dortmund University, Germany Madhu Sudan Microsoft Research, Cambridge, MA, USA Demetri Terzopoulos University of California, Los Angeles, CA, USA Doug Tygar University of California, Berkeley, CA, USA Gerhard Weikum Max Planck Institute for Informatics, Saarbruecken, Germany Andrej Brodnik Alejandro López-Ortiz Venkatesh Raman Alfredo Viola (Eds.) Space-Efficient Data Structures, Streams, and Algorithms Papers in Honor of J. Ian Munro on the Occasion of His 66th Birthday 13 Volume Editors Andrej Brodnik University of Ljubljana, Faculty of Computer and Information Science Ljubljana, Slovenia and University of Primorska,
    [Show full text]
  • Sorting Algorithm 1 Sorting Algorithm
    Sorting algorithm 1 Sorting algorithm In computer science, a sorting algorithm is an algorithm that puts elements of a list in a certain order. The most-used orders are numerical order and lexicographical order. Efficient sorting is important for optimizing the use of other algorithms (such as search and merge algorithms) that require sorted lists to work correctly; it is also often useful for canonicalizing data and for producing human-readable output. More formally, the output must satisfy two conditions: 1. The output is in nondecreasing order (each element is no smaller than the previous element according to the desired total order); 2. The output is a permutation, or reordering, of the input. Since the dawn of computing, the sorting problem has attracted a great deal of research, perhaps due to the complexity of solving it efficiently despite its simple, familiar statement. For example, bubble sort was analyzed as early as 1956.[1] Although many consider it a solved problem, useful new sorting algorithms are still being invented (for example, library sort was first published in 2004). Sorting algorithms are prevalent in introductory computer science classes, where the abundance of algorithms for the problem provides a gentle introduction to a variety of core algorithm concepts, such as big O notation, divide and conquer algorithms, data structures, randomized algorithms, best, worst and average case analysis, time-space tradeoffs, and lower bounds. Classification Sorting algorithms used in computer science are often classified by: • Computational complexity (worst, average and best behaviour) of element comparisons in terms of the size of the list . For typical sorting algorithms good behavior is and bad behavior is .
    [Show full text]
  • Ankara Yildirim Beyazit University Graduate
    ANKARA YILDIRIM BEYAZIT UNIVERSITY GRADUATE SCHOOL OF NATURAL AND APPLIED SCIENCES DEVELOPMENT OF SORTING AND SEARCHING ALGORITHMS Ph. D Thesis By ADNAN SAHER MOHAMMED Department of Computer Engineering December 2017 ANKARA DEVELOPMENT OF SORTING AND SEARCHING ALGORITHMS A Thesis Submitted to The Graduate School of Natural and Applied Sciences of Ankara Yıldırım Beyazıt University In Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in Computer Engineering, Department of Computer Engineering By ADNAN SAHER MOHAMMED December 2017 ANKARA PH.D. THESIS EXAMINATION RESULT FORM We have read the thesis entitled “Development of Sorting and Searching Algorithms” completed by Adnan Saher Mohammed under the supervision of Prof. Dr. Fatih V. ÇELEBİ and we certify that in our opinion it is fully adequate, in scope and in quality, as a thesis for the degree of Doctor of Philosophy. Prof. Dr. Fatih V. ÇELEBİ (Supervisor) Prof. Dr. Şahin EMRAH Yrd. Doç. Dr. Hacer YALIM KELEŞ (Jury member) (Jury member) Yrd. Doç. Dr. Özkan KILIÇ Yrd. Doç. Dr. Shafqat Ur REHMAN (Jury member) (Jury member) Prof. Dr. Fatih V. ÇELEBİ Director Graduate School of Natural and Applied Sciences ii ETHICAL DECLARATION I hereby declare that, in this thesis which has been prepared in accordance with the Thesis Writing Manual of Graduate School of Natural and Applied Sciences, • All data, information and documents are obtained in the framework of academic and ethical rules, • All information, documents and assessments are presented in accordance with scientific ethics and morals, • All the materials that have been utilized are fully cited and referenced, • No change has been made on the utilized materials, • All the works presented are original, and in any contrary case of above statements, I accept to renounce all my legal rights.
    [Show full text]
  • Sorting Algorithm 1 Sorting Algorithm
    Sorting algorithm 1 Sorting algorithm A sorting algorithm is an algorithm that puts elements of a list in a certain order. The most-used orders are numerical order and lexicographical order. Efficient sorting is important for optimizing the use of other algorithms (such as search and merge algorithms) which require input data to be in sorted lists; it is also often useful for canonicalizing data and for producing human-readable output. More formally, the output must satisfy two conditions: 1. The output is in nondecreasing order (each element is no smaller than the previous element according to the desired total order); 2. The output is a permutation (reordering) of the input. Since the dawn of computing, the sorting problem has attracted a great deal of research, perhaps due to the complexity of solving it efficiently despite its simple, familiar statement. For example, bubble sort was analyzed as early as 1956.[1] Although many consider it a solved problem, useful new sorting algorithms are still being invented (for example, library sort was first published in 2006). Sorting algorithms are prevalent in introductory computer science classes, where the abundance of algorithms for the problem provides a gentle introduction to a variety of core algorithm concepts, such as big O notation, divide and conquer algorithms, data structures, randomized algorithms, best, worst and average case analysis, time-space tradeoffs, and upper and lower bounds. Classification Sorting algorithms are often classified by: • Computational complexity (worst, average and best behavior) of element comparisons in terms of the size of the list (n). For typical serial sorting algorithms good behavior is O(n log n), with parallel sort in O(log2 n), and bad behavior is O(n2).
    [Show full text]
  • Comparison Sorts Name Best Average Worst Memory Stable Method Other Notes Quicksort Is Usually Done in Place with O(Log N) Stack Space
    Comparison sorts Name Best Average Worst Memory Stable Method Other notes Quicksort is usually done in place with O(log n) stack space. Most implementations on typical in- are unstable, as stable average, worst place sort in-place partitioning is case is ; is not more complex. Naïve Quicksort Sedgewick stable; Partitioning variants use an O(n) variation is stable space array to store the worst versions partition. Quicksort case exist variant using three-way (fat) partitioning takes O(n) comparisons when sorting an array of equal keys. Highly parallelizable (up to O(log n) using the Three Hungarian's Algorithmor, more Merge sort worst case Yes Merging practically, Cole's parallel merge sort) for processing large amounts of data. Can be implemented as In-place merge sort — — Yes Merging a stable sort based on stable in-place merging. Heapsort No Selection O(n + d) where d is the Insertion sort Yes Insertion number of inversions. Introsort No Partitioning Used in several STL Comparison sorts Name Best Average Worst Memory Stable Method Other notes & Selection implementations. Stable with O(n) extra Selection sort No Selection space, for example using lists. Makes n comparisons Insertion & Timsort Yes when the data is already Merging sorted or reverse sorted. Makes n comparisons Cubesort Yes Insertion when the data is already sorted or reverse sorted. Small code size, no use Depends on gap of call stack, reasonably sequence; fast, useful where Shell sort or best known is No Insertion memory is at a premium such as embedded and older mainframe applications. Bubble sort Yes Exchanging Tiny code size.
    [Show full text]
  • Random Shuffling to Reduce Disorder in Adaptive Sorting Scheme Md
    Random Shuffling to Reduce Disorder in Adaptive Sorting Scheme Md. Enamul Karim and Abdun Naser Mahmood AI and Algorithm Research Lab Department of Computer Science, University of Dhaka [email protected] Abstract In this paper we present a random shuffling scheme to apply with adaptive sorting algorithms. Adaptive sorting algorithms utilize the presortedness present in a given sequence. We have probabilistically increased the amount of presortedness present in a sequence by using a random shuffling technique that requires little computation. Theoretical analysis suggests that the proposed scheme can improve the performance of adaptive sorting. Experimental results show that it significantly reduces the amount of disorder present in a given sequence and improves the execution time of adaptive sorting algorithm as well. Keywords: Adaptive Sorting, Random Shuffle. 1. Introduction Sorting is the process of arranging numbers in either increasing (non-decreasing) or decreasing (non- increasing) order. Due to its immense importance, sorting has never ceased to appear as research topic in computer science studies. The problem of sorting has been extensively studied and many important discoveries have been made in this respect. It has been observed [Estivill-Castro and Wood 1991] that the performance of sorting algorithms can be increased if the presortedness of the data is taken into account. This led to a new type of sorting approach, which is now known as adaptive sorting. Adaptive-sorting Algorithms take the advantage of existing order in the input. Intuitively ‘run time’ of an adaptive sorting algorithm varies smoothly from O(n) to O(n log n) as the disorder or “unsortedness” of the input varies [Mehlhorn 1984].
    [Show full text]
  • An Adaptive Sorting Algorithm for Almost Sorted List
    International Journal of Computer Sciences and Engineering Open Access Research Paper Volume-5, Issue-12 E-ISSN: 2347-2693 An Adaptive Sorting Algorithm for Almost Sorted List Rina Damdoo1*, Kanak Kalyani2 1* Department of CSE, RCOEM, Nagpur, India 2 Department of CSE, RCOEM, Nagpur, India *Corresponding Author: [email protected], Tel.: +91-9324425240 Available online at: www.ijcseonline.org Received: 20/Nov/2017, Revised: 29/Nov/2017, Accepted: 17/Dec/2017, Published: 31/Dec/2017 Abstract—Sorting algorithm has a great impact on computing and also attracts a grand deal of research. Even though many sorting algorithms are evolved, there is always a scope of for a new one. As an example, Bubble sort was first analyzed in 1956, but due to the complexity issues it was not wide spread. Although many consider Bubble sort a solved problem, new sorting algorithms are still being evolved as per the problem scenarios (for example, library sort was first published in 2006 and its detailed experimental analysis was done in 2015) [11]. In this paper new adaptive sorting method Paper sort is introduced. The method is simple for sorting real life objects, where the input list is almost but not completely sorted. But, one may find it complex to implement when time-and-space trade-off is considered. Keywords—Adaptive sort, Time complexity, Paper sort, Sorted list, Un-sorted list I. INTRODUCTION Now the third position element is compared with the A sorting algorithm is known as adaptive, if it takes benefit last (say L) number in the sorted list and is added at end to of already sorted elements in the list i.e.
    [Show full text]
  • Msc THESIS FPGA-Based High Throughput Merge Sorter
    Computer Engineering 2018 Mekelweg 4, 2628 CD Delft The Netherlands http://ce.et.tudelft.nl/ MSc THESIS FPGA-Based High Throughput Merge Sorter Xianwei Zeng Abstract As database systems have shifted from disk-based to in-memory, and the scale of the database in big data analysis increases significantly, the workloads analyzing huge datasets are growing. Adopting FP- GAs as hardware accelerators improves the flexibility, parallelism and power consumption versus CPU-only systems. The accelerators CE-MS-2018-02 are also required to keep up with high memory bandwidth provided by advanced memory technologies and new interconnect interfaces. Sorting is the most fundamental database operation. In multiple- pass merge sorting, the final pass of the merge operation requires significant throughput performance to keep up with the high mem- ory bandwidth. We study the state-of-the-art hardware-based sorters and present an analysis of our own design. In this thesis, we present an FPGA-based odd-even merge sorter which features throughput of 27.18 GB/s when merging 4 streams. Our design also presents stable throughput performance when the number of input streams is increased due to its high degree of parallelism. Thanks to such a generic design, the odd-even merge sorter does not suffer through- put drop for skewed data distributions and presents constant perfor- mance over various kinds of input distributions. Faculty of Electrical Engineering, Mathematics and Computer Science FPGA-Based High Throughput Merge Sorter THESIS submitted in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE in ELECTRICAL ENGINEERING by Xianwei Zeng born in GUANGDONG, CHINA Computer Engineering Department of Electrical Engineering Faculty of Electrical Engineering, Mathematics and Computer Science Delft University of Technology FPGA-Based High Throughput Merge Sorter by Xianwei Zeng Abstract As database systems have shifted from disk-based to in-memory, and the scale of the database in big data analysis increases significantly, the workloads analyzing huge datasets are growing.
    [Show full text]
  • Sorting N-Elements Using Natural Order: a New Adaptive Sorting Approach
    Journal of Computer Science 6 (2): 163-167, 2010 ISSN 1549-3636 © 2010 Science Publications Sorting N-Elements Using Natural Order: A New Adaptive Sorting Approach 1Shamim Akhter and 2M. Tanveer Hasan 1Aida Lab, National Institute of Informatics, Tokyo, Japan 2 CEO, DSI Software Company Ltd., Dhaka, Bangladesh Abstract: Problem statement: Researchers focused their attention on optimally adaptive sorting algorithm and illustrated a need to develop tools for constructing adaptive algorithms for large classes of measures. In adaptive sorting algorithm the run time for n input data smoothly varies from O(n) to O(nlogn), with respect to several measures of disorder. Questions were raised whether any approach or technique would reduce the run time of adaptive sorting algorithm and provide an easier way of implementation for practical applications. Approach: The objective of this study is to present a new method on natural sorting algorithm with a run time for n input data O(n) to O(nlogm), where m defines a positive value and surrounded by 50% of n. In our method, a single pass over the inputted data creates some blocks of data or buffers according to their natural sequential order and the order can be in ascending or descending. Afterward, a bottom up approach is applied to merge the naturally sorted subsequences or buffers. Additionally, a parallel merging technique is successfully aggregated in our proposed algorithm. Results: Experiments are provided to establish the best, worst and average case runtime behavior of the proposed method. The simulation statistics provide same harmony with the theoretical calculation and proof the method efficiency.
    [Show full text]
  • CSE 373: Master Method, Finishing Sorts, Intro to Graphs
    The tree method: precise analysis Problem: Need a rigorous way of getting a closed form We want to answer a few core questions: CSE 373: Master method, finishing sorts, How much work does each recursive level do? intro to graphs 1. How many nodes are there on level i?(i = 0 is “root” level) 2. At some level i, how much work does a single node do? (Ignoring subtrees) Michael Lee 3. How many recursive levels are there? Monday, Feb 12, 2018 How much work does the leaf level (base cases) do? 1. How much work does a single leaf node do? 2. How many leaf nodes are there? 1 2 The tree method: precise analysis The tree method: precise analysis n 1 node, n work per How many levels are there, exactly? Is it log2(n)? n n n Let’s try an example. Suppose we have T(4). What happens? 2 2 2 nodes, 2 work per 4 n n n n n 4 4 4 4 4 nodes, 4 work per 2 2 . 2i nodes, n work per . i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 2h nodes, 1 work per Height is log (4) = 2. 1. numNodes(i) = 2i 2 n For this recursive function, num recursive levels is same as height. 2. workPerNode(n; i) = 2i 3. numLevels(n) = ? Important: total levels, counting base case, is height + 1. 4. workPerLeafNode(n) = 1 Important: for other recursive functions, where base case doesn’t 5.
    [Show full text]
  • A Time-Based Adaptive Hybrid Sorting Algorithm on CPU and GPU with Application to Collision Detection
    Fachbereich 3: Mathematik und Informatik Diploma Thesis A Time-Based Adaptive Hybrid Sorting Algorithm on CPU and GPU with Application to Collision Detection Robin Tenhagen Matriculation No.2288410 5 January 2015 Examiner: Prof. Dr. Gabriel Zachmann Supervisor: Prof. Dr. Frieder Nake Advisor: David Mainzer Robin Tenhagen A Time-Based Adaptive Hybrid Sorting Algorithm on CPU and GPU with Application to Collision Detection Diploma Thesis, Fachbereich 3: Mathematik und Informatik Universität Bremen, January 2015 Diploma ThesisA Time-Based Adaptive Hybrid Sorting Algorithm on CPU and GPU with Application to Collision Detection Selbstständigkeitserklärung Hiermit erkläre ich, dass ich die vorliegende Arbeit selbstständig angefertigt, nicht anderweitig zu Prüfungszwecken vorgelegt und keine anderen als die angegebenen Hilfsmittel verwendet habe. Sämtliche wissentlich verwendete Textausschnitte, Zitate oder Inhalte anderer Verfasser wurden ausdrücklich als solche gekennzeichnet. Bremen, den 5. Januar 2015 Robin Tenhagen 3 Diploma ThesisA Time-Based Adaptive Hybrid Sorting Algorithm on CPU and GPU with Application to Collision Detection Abstract Real world data is often being sorted in a repetitive way. In many applications, only small parts change within a small time frame. This thesis presents the novel approach of time-based adaptive sorting. The proposed algorithm is hybrid; it uses advantages of existing adaptive sorting algorithms. This means, that the algorithm cannot only be used on CPU, but, introducing new fast adaptive GPU algorithms, it delivers a usable adaptive sorting algorithm for GPGPU. As one of practical examples, for collision detection in well-known animation scenes with deformable cloths, especially on CPU the presented algorithm turned out to be faster than the adaptive hybrid algorithm Timsort.
    [Show full text]