
Data Structures Libraries

By: Leonor Frias Moya
Advisors: Jordi Petit Silvestre and Salvador Roura Ferret

PhD Thesis
Departament de Llenguatges i Sistemes Informàtics
Universitat Politècnica de Catalunya
June 2010

This PhD Dissertation was publicly defended on June 8th, 2010, in Barcelona. According to the judgement of the Jury, this PhD Dissertation obtained the qualification of excellent cum laude, with the European mention.

In the public defense, it was my honor to have the following PhD Jury:

• Prof. Dr. Conrado Martínez, Universitat Politècnica de Catalunya, Spain
• Dr. Joaquim Gabarró, Universitat Politècnica de Catalunya, Spain
• Prof. Dr. Ulrich Meyer, Goethe Universität Frankfurt am Main, Germany
• Prof. Dr. Giuseppe Italiano, Università di Roma 'Tor Vergata', Italy
• Dr. Ricardo Baeza-Yates, Universitat Pompeu Fabra, Spain

To whom I am (and I will always be) very grateful.

Leonor Frias Moya

I dedicate this work to my parents, Eusebio and Leonor

Dedico este trabajo a mis padres, Eusebio y Leonor

Resum

This doctoral thesis studies several problems that arise when theory and practice are combined in algorithmics, in particular when the requirements of software libraries are taken into account. The Standard Template Library (STL) of the C++ programming language acts as the common thread in the study of fundamental problems in algorithmics, such as sequences, dictionaries, sorting and selection. To do so, we follow diverse but interrelated approaches, partially influenced by the recent and radical changes in computer architecture.

On the one hand, we present several algorithms and data structures that build on the capabilities of modern computers. Specifically, for STL lists we present cache-conscious implementations that, by exploiting the properties of memory hierarchies, achieve very efficient traversals. At the same time, as is also the case for STL implementations based on doubly linked lists, they comply with the cost requirements of the standard for the operations, as well as with the validity of iterators at all times.

We also present several practical algorithms for multi-core processors. Specifically, we describe parallel algorithms for the construction of red-black trees and for the insertion of many elements at a time. These algorithms rely on (a) constant-time access to the elements by index, as is also the case for algorithms in the PRAM model, (b) split and union operations on trees, and (c) dynamic parallel load-balancing techniques. Our implementation extends that of the GCC compiler for STL dictionaries. The experimental results show its practical convenience. Moreover, in order to preprocess the input of the previous operations efficiently, we have defined a general online algorithm to partition sequences of unknown size. This method is of both theoretical and practical interest.

On the other hand, we describe improvements to existing algorithms based on reusing part of the previous computations. First, we present parallel implementations of STL partition. These are the first practical algorithms that perform an optimal number of comparisons, that is, one per element. Later, we show the usefulness of this property when the elements are strings and partitioning is applied repeatedly and hierarchically, as in quicksort and quickselect. Indeed, we show large performance improvements when combining them with existing techniques to avoid redundant character comparisons. Our algorithms can be used to specialize STL sort and nth_element, respectively. We also note that some of the comparisons in this kind of algorithms and data structures do not access the character strings themselves. If, as is usual, strings are implemented via pointers, this fact is relevant when the usage of the cache memory is considered. In this sense, we analytically quantify the number of string accesses in binary search trees, and in quicksort and quickselect modified in this way. We also analyze the benefits of adding redundancy to these algorithms and data structures.

Finally, we present and analyze in detail the multikey quickselect algorithm, the analog of the multikey quicksort sorting algorithm for the selection problem on strings. This algorithm is efficient with respect to the number of character comparisons, needs no auxiliary memory, and is easy to implement. Moreover, it can be used to specialize STL nth_element. Our analysis considers both the number of comparisons and the number of exchanges under the hypothesis of a uniformly random key distribution. Additionally, we propose several variants to improve efficiency, which can also be applied to multikey quicksort.

Abstract

This thesis studies several problems that arise from combining theory and practice in algorithmics, in particular when considering the requirements of software libraries. The well-known Standard Template Library (STL) of the C++ programming language is the common thread of this research. We have tackled fundamental problems in algorithmics, such as sorting and selection, sequences and dictionaries. This aim has been achieved using varied, intertwined and evolving approaches, partially influenced by fast and radical changes in technology.

Most of the contributions concern generic algorithms and data structures that take advantage of the processing capabilities of modern computers. First of all, we present cache-conscious implementations of STL lists. Taking advantage of the properties of memory hierarchies, very good performance is obtained for traversal-based operations. Moreover, like typical STL doubly linked list implementations, they meet the required cost of the operations as well as the validity of iterators at all times.

Besides, we present several practical algorithms for multi-core computers. Specifically, we describe parallel algorithms for the construction of red-black trees and the insertion of many elements at a time. These algorithms use (a) constant-cost index computations to access the elements, as is the case for some existing algorithms in the PRAM model, (b) split and union operations on trees, and (c) parallel load-balancing techniques. The implementation extends the red-black tree implementation of STL dictionaries in the GCC compiler. The experiments show the practicability of this approach. In addition, motivated by efficiently preprocessing the input of the aforementioned bulk operations on dictionaries, we describe a general online algorithm for partitioning sequences whose size is unknown. Our algorithm is interesting from both a theoretical and a practical perspective.

Furthermore, we have devised new variants of algorithms by reusing computations. In particular, we consider the parallel partitioning of an array with respect to a pivot element. We present practical algorithms that can be used to implement STL partition. These are the first such algorithms that perform an optimal number of comparisons, i.e., one per element. Later, we show that this property is particularly useful when the elements are strings and partitioning is used repeatedly and hierarchically, as in quicksort and quickselect. Specifically, we show big performance improvements when combining parallel quicksort and quickselect with existing techniques to avoid redundant character comparisons. Furthermore, the resulting algorithms can be used to specialize STL sort and nth_element, respectively. Moreover, we highlight that some of the comparisons made in this kind of algorithms and data structures do not need to access the actual strings. Under the common assumption of a pointer string implementation, this is very relevant from a cache-conscious perspective. In this sense, we analytically quantify the number of string lookups in binary search trees, quicksort and quickselect modified in this way. Also, we analyze the benefits of adding some redundancy on top of these algorithms and data structures.

Finally, we present and analyze multikey quickselect, the analog of multikey quicksort for the selection problem for strings. This algorithm is efficient with respect to the number of character comparisons, it is in-place, and it is easy to implement. In particular, it can be used to specialize nth_element for strings. We present an analysis of its cost, measuring both the number of comparisons and the number of swaps, under a random uniform distribution. Last but not least, we propose several enhanced variants that can also be applied to multikey quicksort.

Acknowledgements

In the development of my PhD thesis, I have accumulated knowledge about several topics and fluency in giving talks, but also, and above all, an international vision of and experience with (research) work and life. To make this moment a reality, however, I have had to overcome several challenges of various kinds, and I could not have made it without the support of the people around me. These acknowledgements are dedicated to them.

Firstly, I thank my advisors, Jordi Petit and Salvador Roura, for being willing to advise my thesis. In particular, I am especially grateful for their useful advice and patience. Besides, I am also very grateful to Conrado Martínez for involving me in his research projects and for being very accessible. In addition, I thank the rest of the ALBCOM members, especially María J. Serna, for including me in their research activities. In particular, I enjoyed being part of several organizing committees, as well as attending several courses and conferences abroad. I had the chance to share some of them with Amalia Duch and María J. Blesa, to whom I am grateful for their constant moral support and trust. In particular, I thank María for giving me a copy of her dissertation with a moving dedication. Finally, I am also thankful to Kim Gabarró for his always very respectful and motivating views on my work.

Secondly, I am also very grateful to Peter Sanders for hosting me at Universität Karlsruhe during four months in 2007. In particular, I thank him for providing an inspiring and relaxed research environment. In this context, I thank Johannes Singler, with whom I worked directly, for all the enriching discussions.

Thirdly, many thanks are due to the secretaries and to the people from the computing lab for their valuable work and for always being very nice and accessible. A special mention goes to Mercè Juan for her help with the bureaucracy in these very last steps to becoming a PhD.

Fourthly, I am especially grateful to the friends and colleagues who volunteered to proof-read and give advice on some parts of this document. These include: Roberto Asín, Pere Comas, Cristina España, Edgar Gonzàlez, Meritxell Gonzàlez and Christoph Raupach.

Daily life in the Omega building also leaves some very special memories. In this context, I thank the previous and current inhabitants of the s109 office for all the good moments and, especially, for their patience with me in these last months. Besides, I will definitely miss the daily coffee with the inhabitants of s107, my second office. I really want to thank them for all the laughs, volleyball matches, freak excursions, etc., but above all, for their welcoming character.

Besides, I feel lucky for the good friends I have, and I thank them for their moral support during these years. In particular, I am especially grateful to Anna Farró, Alina García, Edgar Gonzàlez, Meritxell Gonzàlez, and Montse Manubens for all the talks about everything and nothing.

But nothing would have been possible without the infinite love of my parents, Eusebio Frias and Leonor Moya. They are proud of me and supportive, even though they might not understand some of my decisions. I am more than grateful to them for that, and I will never be able to repay them enough. Giving up was never an option, because they would never give up on me or on my brother.

Finally, I am very grateful to those people who expressed their availability to be a member of my jury, namely Ricardo Baeza-Yates, Gerth S. Brodal, Jordi Cortadella, Joaquim Gabarró, Giuseppe F. Italiano, Conrado Martínez, Ulrich Meyer and Peter Sanders.

Funding. The research comprising this PhD dissertation was partially supported by the Spanish technology project ALINEX (Algorithms: Engineering and Experiments) under contract number TIN2005-05446, and by the ALBCOM recognized research group under contract numbers CIRIT 2005SGR 00516 and 2009 SGR 1137. Besides, I was supported by the Agència de Gestió d'Ajuts Universitaris i de Recerca with funds from the European Social Fund. Specifically, I held pre-doctoral grant number 2005FI 00856 from 2005 to 2008, and grant number 2006BE-2 00016 for a 4-month research stay at Universität Karlsruhe in 2007. I thank all these institutions for their support, as well as all the people involved in these projects for making them a reality.

Contents

1 Introduction

2 Preliminaries
  2.1 Cache-conscious algorithms and data structures
  2.2 Parallel algorithms and data structures
  2.3 Algorithms and data structures for strings
    2.3.1 Ad hoc string algorithms and data structures
    2.3.2 Cache-efficiency considerations
  2.4 The Standard Template Library
    2.4.1 STL components
    2.4.2 STL specification
    2.4.3 STL implementations

3 Lists revisited: cache-conscious STL lists
  3.1 STL lists
  3.2 Cache behavior of doubly linked lists
  3.3 Design of cache-conscious STL lists
    3.3.1 Iterator concerns and hypotheses on common list usages
    3.3.2 Our data structure
  3.4 Reorganization of the buckets
    3.4.1 Notation
    3.4.2 Representation invariant
    3.4.3 Useful reorganization operations
    3.4.4 Reorganization algorithm for the general case
    3.4.5 Reorganization algorithm at the endpoints
    3.4.6 Reorganization algorithm for small lists
  3.5 An upper bound for the number of created and destroyed buckets
    3.5.1 Interleaved operations at the same point
    3.5.2 Arbitrary sequences of insertions and deletions
  3.6 Experimental analysis
    3.6.1 Basic operations with no iterator load
    3.6.2 Basic operations with iterator load
    3.6.3 Comparison with LEDA
    3.6.4 Other environments
  3.7 Conclusions and future work

4 On the number of string lookups in BSTs with digital access, and related algorithms
  4.1 Previous work
  4.2 Notation
  4.3 On the number of string lookups in ABSTs
  4.4 On the number of string lookups in CABSTs, an extension of ABSTs
  4.5 On the number of string lookups in AQSORT and AQSEL
  4.6 Conclusions and future work

5 Multikey quickselect
  5.1 MKQSEL algorithm
  5.2 Analysis of ternary MKQSEL
    5.2.1 Notation
    5.2.2 A recurrence for the cost of ternary MKQSEL
    5.2.3 Toll functions
    5.2.4 Solving the recurrence for ternary MKQSEL
  5.3 MKQSEL algorithm revisited
    5.3.1 Specializations for small k
    5.3.2 Binary MKQSEL and MKQSORT
  5.4 Analysis of binary MKQSEL
    5.4.1 Toll functions
    5.4.2 Solving the recurrence for binary MKQSEL
  5.5 Comparison of MKQSEL algorithms and implementation issues
  5.6 Conclusions and future work

6 Parallelization of bulk operations for STL dictionaries
  6.1 Algorithms
    6.1.1 Common setup
    6.1.2 Parallel tree construction
    6.1.3 Parallel bulk insertion
    6.1.4 Analysis
    6.1.5 Adding dynamic load-balancing
  6.2 Interface and implementation aspects
    6.2.1 (multi)set or (multi)map
    6.2.2 set/map or multiset/multimap
    6.2.3 Memory management
    6.2.4 Trade-offs for different input data and comparator types
    6.2.5 Task data structures
  6.3 Experimental analysis
    6.3.1 Sequential results
    6.3.2 Parallel results
  6.4 Conclusion and future work

7 Single-pass list partitioning
  7.1 Problem definition
  7.2 The SINGLEPASS algorithm
  7.3 Experimental analysis
    7.3.1 Comparison of list partitioning algorithms
    7.3.2 Parallelization using list partitioning
  7.4 Conclusions and future work

8 Parallel partition revisited
  8.1 Previous work and a variant
  8.2 The new parallel cleanup phase
    8.2.1 The data structure
    8.2.2 The algorithm
    8.2.3 Cost analysis
  8.3 Implementation
  8.4 Experimental analysis
  8.5 Conclusions and future work

9 Combining digital access and parallel partition for quicksort and quickselect
  9.1 Our solution
    9.1.1 Basic implementation
    9.1.2 Additional precautions
  9.2 Experimental analysis
    9.2.1 Tested implementation
    9.2.2 Data sets
    9.2.3 Sequential performance
    9.2.4 Parallel performance
  9.3 Conclusions and future work

10 Conclusions

A Implementations

Bibliography

Chapter 1

Introduction

Most software systems are built on top of predefined components. In this way, software architects can concentrate on high-level design aspects and, as a result, save development costs and time. Data structures libraries are an important kind of software component that define interfaces and implement fundamental data structures and algorithms. Nowadays, data structures libraries are very accessible because they are included as part of most programming languages.

The Standard Template Library (STL) is the algorithmic core of the C++ standard library [50]. Its design principles include generic programming, abstractness without loss of efficiency, value semantics and the Random Access Machine (RAM) computation model. The key components of the STL are containers, iterators and algorithms. Containers consist of basic data structures such as lists, vectors, maps or sets. Iterators can be seen as high-level pointers that are used to access and traverse the elements in a container. Algorithms are basic operations such as sort, rotate or find. The specification of the STL components describes not only their external behavior but also their cost.

From a theoretical point of view, the knowledge required to implement the STL is well laid down in basic textbooks on algorithms and data structures (see, e.g., [19, 69, 82, 104]). Indeed, the most widely used STL implementations offered by compiler and library vendors are based on these.

Computing needs have varied over time together with the features of ever-evolving technology. But in the latest years, the changes in computer architecture have departed more and more from the RAM model of computation. This fact has motivated the definition of new, more realistic models of computation, as well as new algorithms and data structures for them. For instance, as bigger, faster and cheaper secondary storage devices have become available, and as the gap between memory access time and arithmetic operation time has widened, the old RAM computation model has become more and more inaccurate. This fact has promoted the definition of I/O and cache-conscious models, that is, algorithms and data structures that take into consideration the underlying memory hierarchy (see, e.g., [2, 21, 42, 58, 83]).

Besides, in the latest five years, the hardware industry has decidedly shifted to parallel chips, due to the current practical impossibility of providing more processing power by further increasing clock rates (see, e.g., [76, Chapter 1]). In particular, most of today's cheap laptop computers are multi-core (multi)processors, i.e., they have several independent processors that share the memory system. As a result, designing general-purpose reusable parallel algorithms and data structures has become a must. A wide range of algorithms and data structures have been described under the Parallel Random Access Machine (PRAM) model (see, e.g., [52]). However, a PRAM bears low correspondence to a multi-core computer. Thus, adapting PRAM algorithms and data structures is not straightforward. On the other hand, some of the techniques that have been used in high-performance computing for big expensive parallel machines might be appropriate for multi-core computers.

Finally, it has become more and more important to efficiently manage textual data (i.e., strings) due to its growing applications in organizing the Web, in computational biology, in data mining, etc. Strings are represented as a sequence of symbols (digits, characters, ...) from a predetermined set or alphabet. Comparing strings lexicographically is a common operation, for instance when sorting and searching. Generic comparison-based data structures and algorithms can be used to solve these problems, but they make redundant symbol comparisons. In order to overcome this drawback, several algorithms and data structures have been defined ad hoc for strings. Specifically, tries [30] are the basis to implement efficient string dictionaries, and radixsort is their counterpart for string sorting. Moreover, comparison-based data structures and algorithms can be refined so that strings are efficiently compared (see, e.g., [45, 79]). Also, several cache-efficient algorithms and data structures for strings have been proposed (see, e.g., [7, 16, 48, 88]). Still, practical string algorithms and data structures for multi-core computers are scarce.

Anyhow, most common data structures libraries are still a step behind these novelties. Some exceptions are: Boost [13] (portable C++ libraries that extend the C++ standard library), STXXL [22] (an STL implementation for big data sets), the MCSTL [86] (a parallel STL implementation for multi-core machines), and STAPL [4] (a concurrent implementation of STL components).
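As a minimal illustration of the STL component model described earlier in this introduction (a container, its iterators, and an algorithm operating on an iterator range), consider the following self-contained sketch; it is illustrative code, not taken from the thesis:

    #include <algorithm>
    #include <iostream>
    #include <vector>

    int main() {
        int data[] = {3, 1, 2};
        std::vector<int> v(data, data + 3);   // container: a basic data structure
        std::sort(v.begin(), v.end());        // algorithm applied to an iterator range
        // iterators behave as high-level pointers into the container
        for (std::vector<int>::iterator it = v.begin(); it != v.end(); ++it)
            std::cout << *it << ' ';
        std::cout << std::endl;
        return 0;
    }

Note that the same std::sort works unchanged for any container offering random access iterators; this is the generic programming principle at work.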

Contributions of this thesis

This doctoral dissertation is aimed at building bridges between theory and practice in algorithmics. Overall, techniques from the analysis and design of algorithms and data structures, high-performance computing, experimental algorithmics, and algorithm engineering are used. The focus is on data structures libraries, because they are an effective way to transparently deliver novelties. In particular, we consider the STL because it is standard and widely used. This approach was also followed in my master's thesis; see [31] for the complete version and [32] for a reduced version presented at ALENEX. That piece of work dealt with extending STL dictionaries with rank operations, and it can be considered the starting point of my doctoral dissertation.

The contributions of this thesis concern three (non-exclusive) areas: cache-conscious algorithms and data structures, parallel algorithms for multi-core computers, and specialized algorithms and data structures for string keys. Chapter 2 gives some essential background on these topics. It also gives an overview of the STL.

All the algorithms and data structures proposed in this doctoral dissertation have been implemented (in C++) and are available. Appendix A gives pointers to the actual implementations.

The following seven chapters are devoted to the contributions of this thesis, including some additional background for each of the problems.

Chapter 3 presents cache-conscious implementations of STL lists. The main issue is combining cache-efficiency with the constraints on STL lists, in particular, the constant cost of operations and iterator validity under modifications. On the one hand, cache-efficient lists take advantage of memory hierarchies by packing elements together (typically, using arrays) so that the physical and logical order of elements is roughly the same. But this design conflicts with the STL lists constraints. On the other hand, usual implementations of STL lists based on doubly linked lists can easily cope with the STL lists constraints, but at the price of a poor usage of the cache memory. By contrast, we describe and analyze in detail three cache-conscious but standard-compliant implementations that combine features of both approaches (the bucketing idea they share is sketched after the publication list below). The experimental results show significant improvements in practice for common operations. The results in this chapter appeared in the following publications:

[36] L. Frias, J. Petit, and S. Roura. Lists Revisited: Cache Conscious STL Lists. In C. Àlvarez and M. J. Serna, editors, Experimental Algorithms, 5th International Workshop, WEA 2006, Cala Galdana, Menorca, Spain, May 24-27, 2006, Proceedings, volume 4007 of Lecture Notes in Computer Science, pages 121-133, Berlin, Heidelberg, May 2006. Springer.

[37] L. Frias, J. Petit, and S. Roura. Lists Revisited: Cache Conscious STL Lists. Journal of Experimental Algorithmics, 14:3.5-3.27, 2009.
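To make the bucketing idea concrete, the following is a hypothetical sketch of such a packed representation. The names, the fixed capacity B and the traversal are illustrative simplifications, not the actual data structure of Chapter 3:

    // hypothetical fixed-capacity bucket for a cache-conscious list of T;
    // several elements share each cache block instead of one node per element
    template <typename T, int B>
    struct Bucket {
        T elems[B];        // elements packed contiguously: good spatial locality
        int size;          // number of occupied slots
        Bucket* prev;      // the buckets themselves form a doubly linked list
        Bucket* next;
    };

    // a traversal touches roughly one bucket (a few cache blocks) per B elements
    template <typename T, int B, typename Visitor>
    void traverse(const Bucket<T, B>* first, Visitor visit) {
        for (const Bucket<T, B>* b = first; b != 0; b = b->next)
            for (int i = 0; i < b->size; ++i)
                visit(b->elems[i]);
    }

The difficulty, addressed in Chapter 3, is keeping such a layout while still guaranteeing constant-cost insertions and deletions and iterator validity.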

From that point on, we move to problems related to dictionaries, sorting and selection. We first present the analysis of several efficient string algorithms and data structures. Some of them are used later in this thesis.

Chapter 4 presents an analytical work on the number of string lookups in binary search trees (BSTs) and related algorithms with enhanced string comparisons. Enhancing string comparisons in comparison-based algorithms and data structures is achieved by taking advantage of the order and outcome of previous comparisons, as well as of the digital structure of strings. We highlight that, in some cases, this information suffices to decide the outcome of a comparison (a sketch of the underlying idea follows the reference below). Under the common assumption of a string implementation using pointers, this is relevant from a cache-conscious perspective. We analyze so-augmented BSTs using their relationship to ternary search trees (TSTs), a kind of trie representation. Moreover, we add some redundancy on top of these BSTs to avoid the string lookups due to binary searching for a character position. We also analytically quantify the benefits of this approach. These techniques for strings, when applied on top of a balanced BST, can be used to specialize STL dictionaries for strings. Furthermore, we extend our analysis to quicksort and quickselect, which in turn can be used to specialize STL sort and nth_element, respectively. Most of the results in this chapter are included in the following research report:

[33] L. Frias. On the number of string lookups in BSTs (and related algorithms) with digital access. Technical report LSI-09-14-R, Universitat Politècnica de Catalunya, Departament de Llenguatges i Sistemes Informàtics, 2009.
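The following sketch illustrates the underlying idea in its simplest form (the augmentation analyzed in Chapter 4 is more involved): if the length of the common prefix of two strings is already known from earlier comparisons, the comparison can resume at that position and update the prefix length for later reuse. The function name and signature are illustrative assumptions:

    #include <string>

    // compare s and t assuming their first lcp characters are known to be
    // equal; on return, lcp holds the (possibly longer) common prefix length
    int compare_from(const std::string& s, const std::string& t,
                     std::string::size_type& lcp) {
        std::string::size_type i = lcp;
        while (i < s.size() && i < t.size() && s[i] == t[i]) ++i;
        lcp = i;                          // remember for subsequent comparisons
        if (i == s.size() && i == t.size()) return 0;
        if (i == s.size()) return -1;     // s is a proper prefix of t
        if (i == t.size()) return +1;     // t is a proper prefix of s
        return s[i] < t[i] ? -1 : +1;
    }

When the stored prefix information already decides the outcome, no character of the string needs to be read at all, which is precisely what saves string lookups (and hence cache misses) in the pointer representation.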

Chapter 5 presents multikey quickselect, which is the analog of multikey quicksort for the selection problem. First, we present our algorithm, which combines partitioning, as in quickselect, with digital access to strings. The result is an in-place, easy-to-implement algorithm that can be used to specialize nth_element for strings. We analyze the expected number of comparisons and swaps considering a random uniform distribution. The results lead us to introduce variants of multikey quickselect that reduce the expected number of comparisons and swaps. Some of the enhancements presented in this chapter can be applied to multikey quicksort as well. The results in this chapter are included in the following research report:

[38] L. Frias and S. Roura. Multikey Quickselect. Technical report LSI-09-27-R, Universitat Politècnica de Catalunya, Departament de Llenguatges i Sistemes Informàtics, 2009.

We also present several practical parallel algorithms for multi-core computers. Chapters 6 and 7 present a collaborative work with the ITI Sanders research group at Universität Karlsruhe. This work was done in the context of the MCSTL, a parallel STL implementation for multi-core machines.

Chapter 6 considers the parallelization of some bulk operations for red-black trees. Red-black trees are a kind of balanced binary search tree and are typically used to implement STL dictionaries. The considered operations are the construction from many elements, and the insertion of many elements at a time. We describe and analyze the algorithms. The construction from many elements shares some similarities with existing algorithms under the PRAM model. The insertion of many elements uses split and union operations on red-black trees, plus dynamic load-balancing techniques. We also present the results of a thorough experimental study, which indicates that great speedups can be obtained. The results in this chapter appeared in the following publication:

[39] L. Frias and J. Singler. Parallelization of Bulk Operations for STL Dictionaries. In L. Bougé, M. Forsell, J. L. Träff, A. Streit, W. Ziegler, M. Alexander, and S. Childs, editors, Euro-Par 2007 Workshops: Parallel Processing, HPPC 2007, UNICORE Summit 2007, and VHPC 2007, Rennes, France, August 28-31, 2007, Revised Selected Papers, volume 4854 of Lecture Notes in Computer Science, pages 49-58, Berlin, Heidelberg, March 2008. Springer.

Chapter 7 presents a general technique to partition sequences of unknown size so that, later, they can be efficiently processed in parallel. In particular, this technique has been used to partition the input sequences of the bulk operations in Chapter 6. We describe and analyze the properties of the proposed algorithm. It is online (i.e., it uses only one traversal) and it requires sublinear additional space; a sketch of a simple strategy in this spirit follows the references below. Experiments show the practicability of this approach in parallelization. The results in this chapter appeared in the following publications:

[40] L. Frias, J. Singler, and P. Sanders. Single-Pass List Partitioning. In The Second International Conference on Complex, Intelligent and Software Intensive Systems, pages 817-821. IEEE Computer Society Press, March 2008.

[41] L. Frias, J. Singler, and P. Sanders. Single-pass list partitioning. Scalable Computing: Practice and Experience, 9(3):179-184, 2008.
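As a taste of the problem, the following is a simple single-traversal strategy in the same spirit; it is an illustrative sketch under simplifying assumptions, not the SINGLEPASS algorithm of Chapter 7. Every s-th position is remembered; when the buffer of remembered positions fills up, every other one is kept and the stride s is doubled, so the space stays bounded while the final marks remain regularly spaced:

    #include <cstddef>
    #include <vector>

    // record every s-th position of a sequence whose size is unknown;
    // the buffer never holds more than max_marks positions
    template <typename Iterator>
    std::vector<Iterator> one_pass_marks(Iterator first, Iterator last,
                                         std::size_t max_marks) {
        std::vector<Iterator> marks;          // positions 0, s, 2s, ...
        std::size_t s = 1, count = 0;
        for (Iterator it = first; it != last; ++it, ++count) {
            if (count % s != 0) continue;
            if (marks.size() == max_marks) {  // keep every other mark, double s
                for (std::size_t i = 0; 2 * i < marks.size(); ++i)
                    marks[i] = marks[2 * i];
                marks.resize((marks.size() + 1) / 2);
                s *= 2;
            }
            if (count % s == 0) marks.push_back(it);
        }
        return marks;  // piece boundaries can then be chosen among the marks
    }

Since consecutive marks are at most s apart and at least max_marks/2 marks survive, pieces cut at the marks are balanced up to roughly a factor of two, while only a single traversal and O(max_marks) extra space are used.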

Chapters 8 and 9 deal with the problem of partitioning an array in parallel. Partitioning an array is a basic building block of quicksort and quickselect. Besides, it is typically used in the implementation of STL partition, nth_element and sort.

Chapter 8 considers the parallel partition of an array of generic keys. We review some existing practical parallel partitioning algorithms for multi-core architectures. They consist of a main parallel step and a cleanup step. We highlight that, in all cases, some elements might need to be compared again during cleanup. We describe a novel enhanced parallel cleanup algorithm that achieves one comparison per element, which is optimal (the sequential baseline, which trivially achieves this bound, is sketched after the reference below). This cleanup is applied to the aforementioned partitioning algorithms, and then it is experimentally evaluated. Unfortunately, the experiments show that the new cleanup algorithm is not especially worthwhile in terms of time performance. Indeed, the improvement when measuring the total number of operations is small. But the properties of our cleanup algorithm regarding comparisons turn out to be crucial for the algorithms in Chapter 9. The results in this chapter appeared in the following publication:

[34] L. Frias and J. Petit. Parallel partition revisited. In C. C. McGeoch, editor, Experimental Algorithms, 7th International Workshop, WEA 2008, Provincetown, MA, USA, May 30-June 1, 2008, Proceedings, volume 5038 of Lecture Notes in Computer Science, pages 142-153, Berlin, Heidelberg, 2008. Springer.
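For reference, the sequential building block can be realized as follows. This Hoare-style scheme is an illustrative sketch in the spirit of typical STL partition implementations, not one of the parallel algorithms of Chapter 8; each element is examined by exactly one of the two scans, hence one comparison per element:

    #include <algorithm>

    // sequential partition: elements satisfying pred end up before the
    // returned iterator; requires bidirectional iterators
    template <typename Iterator, typename Pred>
    Iterator seq_partition(Iterator first, Iterator last, Pred pred) {
        while (true) {
            while (first != last && pred(*first)) ++first;  // skip good prefix
            if (first == last) return first;
            --last;
            while (first != last && !pred(*last)) --last;   // skip good suffix
            if (first == last) return first;
            std::iter_swap(first, last);                    // fix a mismatched pair
            ++first;
        }
    }

Matching this one-comparison-per-element bound is exactly what the parallel cleanup phase of Chapter 8 is designed to preserve.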

Chapter 9 considers parallel partition for string keys. We describe how to enhance string comparisons in parallel partitioning-based algorithms, namely quicksort and quickselect, to avoid redundant character comparisons. This is achieved by means of the techniques analyzed in Chapter 4. To the best of our knowledge, parallel algorithms are a novel application of these techniques for strings. The main issue in the case of parallel quicksort and quickselect is that each partitioning step must compare each element at most once. This is achieved using the cleanup algorithms presented in Chapter 8. The resulting implementations can be used to specialize STL nth_element and sort for strings. Moreover, the experiments indicate their practicability. The results in this chapter appeared in the following publication:

[35] L. Frias and J. Petit. Combining digital access and parallel partition for quicksort and quickselect. In IWMSE '09: Proceedings of the 2009 ICSE Workshop on Multicore Software Engineering, pages 33-40, Washington, DC, USA, 2009. IEEE Computer Society.

Finally, Chapter 10 sums up the contributions of this thesis and highlights some directions for future work.

Chapter 2

Preliminaries

The Random Access Machine (RAM) model [3] is a very common model for analyzing algorithms and data structures. This model essentially describes a 1-processor, 1-level-memory computer. Nonetheless, most modern computers have a memory hierarchy and several parallel processing units. Indeed, some problems can be solved much more efficiently in practice if these capabilities are taken into account. Not surprisingly, new computation models have been devised so that algorithms and data structures can be described and analyzed more accurately with respect to these features. Section 2.1 gives some background on algorithmic design that is conscious of memory hierarchies; the aim is to maximize the benefit from memory hierarchies, since cache memories come into play whether or not they are taken into account. By contrast, taking advantage of several processing units to solve one task can only be done explicitly. Section 2.2 gives some background on parallel algorithmic design.

Furthermore, many comparison-based problems can be solved more efficiently by taking into account peculiarities of the data type of the elements that are being compared. Section 2.3 gives an overview of existing efficient comparison-based algorithms and data structures for string elements. In order to ease the delivery of these specialized algorithms and data structures, they can be included in data structures libraries. We consider the Standard Template Library (STL) of the C++ programming language. Section 2.4 gives an overview of the STL, including its implementations.

2.1 Cache-conscious algorithms and data structures

Memory hierarchies were introduced by computer architects to minimize the gap between memory access time and arithmetic operation time. See, e.g., [76] for a detailed description of possible architectures. Each memory level may store copies of the actual data, the access time and the size of each level being inversely related. Whenever the processor needs to access a memory location, it starts checking from the innermost to the outermost level (with respect to the processing unit). Cache hierarchies constitute the innermost levels of a memory hierarchy, namely those between the processing unit(s) and the main memory. Whenever the processor tries to access a memory location in a cache level, a cache hit occurs if the memory location is present there, and a cache miss occurs otherwise.


The benefits of memory hierarchies rely on the locality of data and programs. However, locality is not a universal property of data and programs. This has motivated the search for alternative data structures that organize data in such a way that logical access patterns correspond to physically near memory locations. Several models have been devised to describe and analyze these data structures and algorithms.

The external memory or I/O model is a 2-level memory model introduced by Aggarwal and Vitter [2] to capture the existence of a fast internal memory and a slow external disk storage. Transfers between levels are done in blocks of consecutive elements. The internal level (main memory) has fixed size and is fully associative; fully associative means that blocks can be mapped anywhere in memory. The external level (disk) is considered to be infinite and is explicitly managed. In this model, the cost measure of an algorithm is the number of disk accesses (the computation cost is neglected).

Similarly, cache-conscious models have been described to capture the existence of cache hierarchies that mitigate the access time to main memory. Cache hierarchies are mainly characterized by the number of levels and the size of each of them. As in the I/O model, transfers between levels are done in blocks of consecutive elements. By contrast, the cost of computation is not negligible and memory is not managed explicitly. There are two main flavors of cache-conscious models, depending on whether they rely on specific cache hierarchy parameters or not. Cache-aware algorithms and data structures (see, e.g., [58, 83]) are explicitly described using cache parameters, and so the resulting algorithms are not portable. Instead, cache-oblivious algorithms (see, e.g., [21, 42]) use the parameters in the analysis but not in the actual algorithm description. They are very attractive from a theoretical point of view, but unfortunately they have some limitations. First, most cache-oblivious algorithms and data structures are not as efficient in practice as their cache-aware counterparts. In addition, for some problems it seems unlikely that a reasonable cache-oblivious solution can be found. Finally, there are problems for which an asymptotic gap has been proved between the cache-aware and the cache-oblivious solutions (see, e.g., [15]). In the following, the cache-oblivious model is presented in a bit more detail, together with the basic algorithmic building blocks.

Cache-oblivious algorithms and data structures

The cache-oblivious model is a 2-level memory model introduced by Frigo et al. [42] in 1999. The internal level (cache memory) has fixed size M and is fully associative. The external level (main memory) is considered to be infinite and it is not managed explicitly. Besides, cache replacement is assumed to be optimal, i.e., whenever a cache block is evicted, it is chosen such that the total number of evicted blocks is minimal. Optionally, the tall-cache assumption might be added to the model. This assumption states that the number of blocks fitting in the cache is larger than the block size B (which, to the best of our knowledge, is the case for every current cache configuration). In this model, the cost measure of an algorithm is two-fold. Let n be the size of the input. Both the number of computational steps and the number of accesses to main memory are considered as a function of n. The main strength of the model is that it allows reasoning about just two levels, while the results are asymptotically valid for any number of cache levels and for more realistic cache configurations (namely, not fully associative).

For instance, consider the problem of computing an aggregate of a sequence. In the RAM model, this problem can be solved using Θ(n) computational steps, as long as finding the next element and applying the aggregate function have cost Θ(1). In the I/O model or the cache-aware model, this can be done in ⌈n/B⌉ memory accesses if the elements are stored in ⌈n/B⌉ blocks of size B. In the cache-oblivious model, a similar bound can be obtained as follows: store the elements in any order in an array so that all elements are consecutive in memory, and traverse the array in storage order. Doing so, at most ⌈n/B⌉ + 1 main memory accesses are made. Note that, in contrast to the solution for the I/O model, the array data is not required to be aligned and no information about the cache parameters is used (a code sketch is given at the end of this subsection).

In order to solve more complex problems, more elaborate data structures are needed. Two popular building blocks for cache-oblivious data structures and algorithms are the van Emde Boas layout (a static structure) and the packed-memory structure (a dynamic structure). Both were introduced by Bender et al. [6] to implement cache-oblivious search trees.

The van Emde Boas layout is a cache-oblivious layout for trees that resembles the van Emde Boas data structure [101, 102]. It is defined as follows. Split the tree at the middle level of edges, resulting in one top recursive subtree and roughly √n bottom recursive subtrees, each of size roughly √n. Recursively, lay out the top recursive subtree, followed by each of the bottom recursive subtrees. Each recursive subtree is laid out in a single segment of memory, and these segments are stored together without gaps. Searching in a complete binary search tree with the van Emde Boas layout needs O(log_B n) memory accesses and O(log n) comparisons.

The packed-memory structure maintains an ordered sequence of elements in almost consecutive memory. It is an adaptation of the solution to the ordered-file maintenance problem in [51, 105]. The main idea is to distribute gaps in an array so that insertion and deletion take o(n) time but scanning needs only Θ(n/B) memory accesses. Specifically, the array is conceptually partitioned into blocks of size Θ(log n). In each block, elements are kept contiguously, and a constant number of gaps is left between two blocks. Each of these blocks is the leaf of a conceptual tree. Each node of such a tree must satisfy density constraints that become stronger further up the tree. Inserting or deleting an element in a packed-memory structure takes O(log² n) amortized moves and O((log² n)/B) amortized memory accesses.

The packed-memory structure can also be used to implement a cache-oblivious list. Besides, by splitting the array into smaller subarrays, the bounds for insertion and deletion can be improved. These bounds can be lowered to amortized constant if updates breaking the uniformity are allowed until the structure is reorganized when traversed. Using a similar approach, we present standard-compliant cache-conscious STL lists in Chapter 3. In particular, our implementations achieve constant costs for insertion and deletion operations.
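Returning to the aggregation example above, the cache-oblivious solution is just a scan over contiguous storage. The point of the following sketch (illustrative code, not from the thesis) is that no cache parameter appears in it:

    #include <cstddef>
    #include <vector>

    // cache-oblivious aggregation: the elements are stored consecutively and
    // visited in storage order, so at most about n/B + 1 blocks are
    // transferred, for whatever block size B the hardware happens to use
    double aggregate(const std::vector<double>& a) {
        double acc = 0.0;
        for (std::size_t i = 0; i < a.size(); ++i)
            acc += a[i];
        return acc;
    }

The same code is efficient on any machine in the hierarchy, which is exactly the appeal of the cache-oblivious approach.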

2.2 Parallel algorithms and data structures

Parallel computers provide several processing units that run in parallel. Additionally, communication and synchronization mechanisms are provided so that processing tasks can interact with each other. For an overview of parallel architectures, see, e.g., [76].

Operating systems and libraries provide high-level abstractions to facilitate programming. For instance, POSIX threads [49] are a widely used abstraction of a processing task. They are commonly used as the base for higher-level abstractions.

A parallel algorithm specifies how to solve a computational problem using several threads simultaneously. To do so, it is crucial to find a decomposition of the computation into tasks. Tasks may interact with each other, and may depend on the finalization of other tasks. Tasks are mapped to threads and, in turn, to processors. The aim is to maximize resource usage while minimizing (explicit and implicit) interaction between threads. In most cases, these are contradicting objectives, so a balance must be found. For an overview of parallel design techniques, see, e.g., [57].

Parallel data structures can be obtained in mainly two (not exclusive) ways. On the one hand, operations on data structures can be implemented in parallel. For instance, in Chapter 6 we consider the insertion of many elements at a time into STL dictionaries. On the other hand, data structures can provide mechanisms so that the data structure can be externally parallelized by executing concurrent, non-blocking operations. For instance, the STAPL library (see Section 2.4.3) offers such insertion operations for dictionaries.

The performance of a parallel program is measured by the parallel speedup: the execution time of the best sequential implementation divided by the execution time of the parallel implementation. The maximum achievable speedup is limited by the portion of the computation that is inherently sequential, i.e., tasks that cannot be divided further into concurrent tasks. Amdahl's Law states this relationship (see, e.g., [76]). Indeed, the initial decomposition into parallel tasks (the parallel setup) is in most cases inherently sequential. Specifically, it commonly deals with dividing the input into independent pieces to which essentially the same processing is applied (data parallelism). Doing so efficiently for non-trivial inputs is tackled in Chapter 7.

Until recently, the practicability of parallel computation had been constrained to high-performance computing, that is, to solving highly computationally-intensive (typically numerical) problems. First, writing parallel programs was, and still is, more involved and error-prone than writing sequential programs, due to concurrency. For an overview of concurrency and concurrency hazards, see, e.g., [62]. Second, in most existing parallel machines the cost of synchronization and communication did not pay off for small or even medium-size computations. Indeed, finding the best parameters for a (parallel) program is a research issue in itself (see, e.g., [81]). Third, the fast improvements in clock rates during the last three decades made parallelization efforts for general-purpose software unattractive.

But in the last 4-5 years, the hardware industry has had to shift to parallel chips to obtain significantly more processing power. Further increasing the clock rate was no longer a possibility because of the resulting power consumption and the need for ever more expensive cooling systems. This phenomenon is known as the power wall (see, e.g., [76, Chapter 1]).

These new parallel chips are mostly multi-core (multi)processors, which combine several independent processors (so-called cores) in a single socket. Each core alone may have several levels of cache memory. Additionally, all cores in a socket may share a higher level of cache memory. Besides, several sockets can be combined into a single chip so that all of them share the interconnection to main memory. Thus, a multi-core processor has a shared hierarchical memory structure.

The proliferation of multi-core computers has renewed the interest in parallel algorithms and data structures. Existing parallel algorithmic techniques also apply to multi-core computers. Nonetheless, the range of problems to be solved is much broader, because parallel processing units have become cheap and widely available.

Existing tools and abstractions to ease parallel programming, such as Cilk [11] and OpenMP [72], have gained strength. Besides, some new abstractions like Intel Threading Building Blocks [78] have been defined. All these tools completely abstract from thread management and the later mapping of threads to processors. In particular, we have used OpenMP for the parallel implementations in this thesis; a minimal example is shown below. The OpenMP API consists of a set of compiler directives, library routines, and environment variables that support multi-platform shared-memory parallel programming in a portable, scalable and flexible way.

Besides, some efforts have been made to automate parallel programming, but in most cases they are limited to highly structured programs. A popular and simple way to benefit from multi-core processors is to provide parallel implementations of data structures libraries. In particular, several efforts have been devoted to parallelizing the STL of the C++ programming language. Section 2.4.3 gives some examples of parallel STL-like libraries. In Chapters 8 and 9, we present parallel implementations of STL partition.

On the other hand, new parallel computational models have been defined. During the 80s and the 90s, the PRAM was the reference machine model for describing parallel algorithms and data structures. The PRAM model extends the RAM model with an arbitrarily large number of parallel processors, which share the memory and proceed synchronously. For an overview of algorithms under the PRAM model, see, e.g., [52]. However, the PRAM bears low correspondence to a multi-core computer. By contrast, recent models like the Multi-BSP [99] by Valiant mirror multi-core computers. This is a hierarchical model (from the perspective of both processing and memory units): an arbitrary number of levels is considered and, for each one, the number of processing components, the bandwidth, the synchronization cost and the memory size are defined. Also, the latest edition of an algorithmics textbook such as Cormen et al. [19] no longer uses the PRAM model, but a more realistic model with respect to current parallel computers.
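As a minimal illustration of the OpenMP style of parallelization (an illustrative sketch, not code from the thesis), the following directive splits the iterations of a loop among the available threads and combines their partial sums:

    #include <cstddef>

    // data-parallel aggregation with OpenMP: the directive distributes the
    // iterations among threads; reduction(+:s) merges the per-thread sums
    double parallel_sum(const double* a, long n) {
        double s = 0.0;
        #pragma omp parallel for reduction(+:s)
        for (long i = 0; i < n; ++i)
            s += a[i];
        return s;
    }

Note that thread creation, scheduling and the mapping to processors are entirely handled by the OpenMP runtime; the sequential code is recovered simply by ignoring the directive.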

2.3 Algorithms and data structures for strings

Strings, a common representation of textual data, are sequences of symbols (digits, characters, ...) from a predetermined set or alphabet. We consider the following common pointer representation: the string data itself is stored in an array, and this array is accessed through a pointer. This is the case for C-like strings, i.e., char* whose end is indicated by a zero character, and for C++ strings, i.e., a pointer to an array plus the array size. The C++ string implementation adds a level of indirection with respect to a raw char*, but in return it can offer more advanced features.

Comparing strings lexicographically is a common operation, for instance when sorting and searching. Generic comparison-based data structures and algorithms can be used to solve these problems. They treat keys as atomic when making comparisons. This is simple, and in many cases leads to good performance. However, many redundant symbol comparisons might be performed, in particular if the strings share long common prefixes. This source of inefficiency can be avoided by taking into consideration the digital structure of strings.

To compare strings efficiently, ad hoc string algorithms and data structures have been defined. Many of them are based on the same principle as a thumb index on an encyclopedia: from the first letter of a given word, the pages that contain all the words beginning with that letter can be located immediately. If this idea is pursued, a searching scheme based on repeated subscripting is obtained, which is optimal in the number of symbol comparisons. Section 2.3.1 gives an overview of string data structures and algorithms following this principle. We focus on string dictionaries, and on the sorting and selection problems. The description of the data structures and algorithms is first done in the RAM model of computation. Then, Section 2.3.2 discusses how to build cache-conscious string data structures and algorithms from them.

The behavior of ad hoc string algorithms and data structures greatly depends on the characteristics of the dataset (string length, how skewed the strings are, ...). In many common cases this leads to a better (or at least good) performance, but there is no guarantee. Besides, many of these algorithms and data structures need to know the specific value of the alphabet cardinality. A more robust approach consists in specializing comparison-based data structures and algorithms so that no redundant digit comparisons are made, while keeping the rest of their combinatorial properties (see Roura [79] and Grossi et al. [45, 28]). Comparisons in such specialized algorithms and data structures take advantage of the digital structure of strings, and of the order and the outcome of other comparisons. To efficiently reuse this information, some extra data must be kept per element (i.e., the data structures and algorithms are augmented). Figure 2.1(d) shows an example for a binary search tree (BST): each node additionally keeps the maximum common prefix of its string with its ancestors on the access path, shown in parentheses. The resulting implementations are amenable, and the data structures and algorithms are competitive in practice. In particular, this approach is convenient for long and skewed strings. See, e.g., [20] for an experimental study.

In this thesis, we explore further advantages of specializing comparison-based data structures and algorithms for strings. Chapter 4 deals with the analysis of some properties related to cache-efficiency. Chapter 9 combines these specializations with parallelization. Besides, our proposals aim to be generic with respect to the string type. Indeed, most existing C/C++ implementations of efficient string algorithms and data structures are keyed to C-like strings (actually, in most cases they are pure C implementations).

2.3.1 Ad hoc string algorithms and data structures

A trie [30] (a name that comes from retrieval) is an efficient dictionary data structure for strings. A trie is a σ-ary tree, where σ denotes the cardinality of the alphabet (tries are aware of the value of σ). Each trie node at depth ℓ represents the set of strings that begin with a certain prefix of length ℓ. A search branches in the tree until the searched string is completely matched or a leaf is reached. Thus, successfully searching for a string of length d requires Θ(d) symbol comparisons, which is optimal. Besides, for a given set of strings, there is only one associated trie. Figure 2.1 (a) shows an example of a trie. Trie nodes are typically implemented with an array of pointers, with null pointers indicating non-existing branches, as in the sketch below. This is particularly efficient for small alphabets, e.g., the English alphabet.
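A minimal sketch of such an array-based trie node and its search routine, assuming a lower-case English alphabet mapped to 0..25 (illustrative code, not from the thesis):

    #include <cstring>

    const int SIGMA = 26;                  // assumed alphabet: 'a'..'z'

    struct TrieNode {
        bool is_key;                       // a stored string ends at this node
        TrieNode* child[SIGMA];            // null pointers mark missing branches
        TrieNode() : is_key(false) { std::memset(child, 0, sizeof child); }
    };

    // repeated subscripting: a successful search costs Theta(d) symbol
    // inspections for a string of length d, independently of n
    bool contains(const TrieNode* root, const char* s) {
        const TrieNode* t = root;
        for (; *s != '\0'; ++s) {
            t = t->child[*s - 'a'];
            if (t == 0) return false;
        }
        return t->is_key;
    }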

Figure 2.1: Examples of string data structures for dictionaries. (a) Trie. (b) TST. (c) Patricia trie. (d) Specialized BST.

If the alphabet cardinality is big (e.g., the Unicode alphabet), implementing trie nodes with arrays is not efficient. An elegant alternative are ternary search trees [9] (TSTs). TSTs are tries in which BSTs are used to guide descent for each character. Figure 2.1 (b) shows an example of a TST. An internal TST node contains two possibly null comparison pointers (shown with a diagonal line) and a not null descent pointer (shown with a vertical line). A leaf contains a special character (shown with #). For each character position in the searched string, several comparison pointers might be followed (binary searching in their values) until the matching node or a null pointer is found. In case of success in the current character, the descent pointer is followed. All the nodes for a given character position and prefix can be placed in several ways. The actual TST shape depends on insertion order and whether balancing techniques are applied. In a not balanced TST that stores n string keys, a search needs Θ(n) symbol comparisons in the worst-case. But if the TST is kept balanced, only Θ(d + logn) symbol comparisons are needed. The main advantage of TSTs is their generality. In particular, no information on the alphabet is needed. However, as in plain tries, much more nodes than keys are needed. This causes a poor usage of memory. Patricia tries [71](name that comes from Practical algorithm to retrieve information coded in alphanumeric) reduce the total number of nodes to Θ(n). Patricia tries are tries in which all nodes that only have one possible follow-up (i.e., one not null pointer) are compacted. Thus, they are especially suitable for extremely long, variable-length keys with common prefixes. The compaction is done including the number of symbols to skip over before making the next comparison, and placing the keys in the leaves. An example of a Patricia trie is shown in Figure 2.1 (c). A search in a Patricia trie examines several nodes but only one string (at the leaf). Thus, Θ(d) symbol comparisons are needed. For a detailed analysis of trie-like data structures, see e.g., [17, 24]. Furthermore, there are several sorting algorithms that follow the same main ideas as tries. Specifically, Most Significant Digit (MSD) radixsort is a isomorph to constructing a trie. Radixsort (see e.g., [56]) is based on recursively distributing the elements into groups w.r.t the current character position. MSD radixsort needs O(dn) sym- bol comparisons, where d is the maximum string length. The main issue in implementing radixsort is dealing efficiently with groups. There are many variants. One main distinction is whether one or two passes on the data are done. One-pass algorithms typically use an array of groups, in which each group is implemented with a dynamic data structure. Some of the groups might be empty. If there are many of them, it turns out in a considerable slow- down (apart from the waste of space). Radixsort can be implemented in-place by using an extra pass to previously determine the size of the groups before partitioning the array. There are several proposals to mitigate the drawbacks of both approaches, as well as hybrids. For further reading on the tuning of radixsort, see e.g., [5, 54, 67]. For further details on the analysis of radixsort, see e.g., [63]. Multikey quicksort [9] is the sorting algorithm isomorph to constructing a TST. Accord- ing to [5], multikey quicksort can be seen as a variation of MSD radixsort with bucketing replaced by (three-way) quicksort. 
Furthermore, there are several sorting algorithms that follow the same main ideas as tries. Specifically, Most Significant Digit (MSD) radixsort is isomorphic to constructing a trie. Radixsort (see e.g., [56]) is based on recursively distributing the elements into groups w.r.t. the current character position. MSD radixsort needs O(dn) symbol comparisons, where d is the maximum string length. The main issue in implementing radixsort is dealing efficiently with the groups. There are many variants. One main distinction is whether one or two passes over the data are done. One-pass algorithms typically use an array of groups, in which each group is implemented with a dynamic data structure. Some of the groups might be empty; if there are many of them, this results in a considerable slowdown (apart from the waste of space). Radixsort can be implemented in place by using an extra pass to determine the size of the groups before partitioning the array. There are several proposals to mitigate the drawbacks of both approaches, as well as hybrids. For further reading on the tuning of radixsort, see e.g., [5, 54, 67]. For further details on the analysis of radixsort, see e.g., [63].

Multikey quicksort [9] is the sorting algorithm isomorphic to constructing a TST. According to [5], multikey quicksort can be seen as a variation of MSD radixsort with bucketing replaced by (three-way) quicksort. Multikey quicksort performs Θ((d + log n)n) symbol comparisons using a randomly chosen pivot. It is simple to implement, works in place, needs no knowledge of the alphabet cardinality, and has been shown to be relatively fast in practice (see e.g., [5]).
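A compact sketch of multikey quicksort over std::string keys follows (ours; the middle-element pivot is a simplification, and the tuned pivot selection and small-input cutoffs discussed in [5] are omitted). The helper at() returns 0 past the end of a string, acting as an end-of-string sentinel.

#include <algorithm>
#include <string>
#include <vector>

// Character at position d, with 0 as an end-of-string sentinel.
static int at(const std::string& s, std::string::size_type d) {
    return d < s.size() ? (unsigned char)s[d] : 0;
}

// Sort a[lo..hi) by the suffixes starting at character position d.
void mkqsort(std::vector<std::string>& a, int lo, int hi,
             std::string::size_type d) {
    if (hi - lo < 2) return;
    int p = at(a[lo + (hi - lo) / 2], d);     // pivot character
    int i = lo, j = lo, k = hi;               // three-way partition:
    while (j < k) {                           // [lo,i) < p, [i,j) == p, [k,hi) > p
        int c = at(a[j], d);
        if (c < p)      std::swap(a[i++], a[j++]);
        else if (c > p) std::swap(a[j], a[--k]);
        else            ++j;
    }
    mkqsort(a, lo, i, d);                     // smaller character
    if (p != 0) mkqsort(a, i, k, d + 1);      // equal: advance to next position
    mkqsort(a, k, hi, d);                     // greater character
}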

However, not much attention has been paid to adapting radixsort-like algorithms to selection. Selection consists in finding the element of a given rank in an unsorted sequence, using a certain order function. As far as we know, only [63] analyzes a very generic radixselect algorithm. In this thesis, we contribute multikey quickselect algorithms; Chapter 5 describes and analyzes them.

2.3.2 Cache-efficiency considerations

The algorithms and data structures described in the previous subsection are not cache-conscious. Cache-conscious algorithms and data structures for strings can be obtained as follows. On the one hand, trie-like building blocks can be combined in a cache-efficient way; the resulting data structures are (externally) not tries. On the other hand, trie-like data structures can be built using cache-efficient blocks. In the following, we describe some examples.

Several cache-conscious data structures and algorithms for strings use Patricia tries as a building block, because they perform only one string lookup per search. Patricia tries are used in IO-efficient string dictionaries, namely String B-trees [27], and their corresponding sorting algorithm (see e.g., [26]). Also, this approach has been used to build cache-oblivious string dictionaries in [7] and [16].

By contrast, burst tries [48] are cache-aware tries whose nodes are buckets. Buckets store a collection of strings that share a common prefix; cache-efficiency is attained inside the bucket. Whenever a bucket becomes too large, it bursts, i.e., it expands so that there is a new child node for each symbol. Buckets can be implemented in several ways. Burstsort [88, 89, 90, 87] is a sorting algorithm based on burst tries, where the buckets are kept unsorted until the strings are output. One of its variants uses multikey quicksort for sorting the buckets. Burstsort has been shown to be very fast in practice for common data sets.

In addition, cache-efficiency might be improved by means of redundancy or a superalphabet. Redundancy consists in storing some extra symbol(s) together with the string, so that these are looked up in the first instance; the gain is obtained by avoiding looking up the string in some cases. A superalphabet consists in changing the base alphabet by considering groups of more than one symbol instead of a single one. These techniques can be used on their own; for instance, in combination with radixsort [54]. Besides, they can be combined with a cache-conscious design, as is the case for some variants of burst tries.
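A minimal sketch of the redundancy idea (ours, with a hypothetical slot layout): cache a copy of the first character next to each string pointer, so that many comparisons are resolved without dereferencing the string at all.

#include <cstring>

// Hypothetical slot layout with a redundant first character.
struct Slot {
    char        first;    // copy of str[0] ('\0' for the empty string)
    const char* str;
};

int slot_cmp(const Slot& a, const Slot& b) {
    if (a.first != b.first)                 // resolved without touching the strings
        return a.first < b.first ? -1 : 1;
    return std::strcmp(a.str, b.str);       // rarer case: access the strings
}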

2.4 The Standard Template Library

The C++ language specification [50] includes the definition of a standard library. This specification dates back to 1998. A new Standard (unofficially known as C++0x) has been in preparation for a long time, but it is not finished yet. More details on the development of the new Standard can be found in [18]. The C++ standard library defines components to deal with input/output, numbers and strings; abstract data types and algorithms; support for the language itself (dynamic memory management, exceptions, . . . ); and internationalization and other utilities (allocators, date and time, . . . ).

The Standard Template Library (STL) is a widely used subset of the C++ standard library comprising basic algorithms and data structures. Besides, it has had, and continues to have, a great impact on the rest of the library (see e.g., [95]). Section 2.4.1 describes the main STL components. Section 2.4.2 outlines the main issues on STL compliance. Finally, Section 2.4.3 gives an overview of existing implementations, including existing efforts to provide advanced algorithmic features in the STL and similar C++ libraries.

2.4.1 STL components

The STL is based on the interaction of three key components: containers, iterators and algorithms. We describe them in the following. For examples on how to use STL components, see e.g., [53].

Containers

Containers store and manage collections of elements. Elements stored in a container are copies of the provided elements, and their lifetime depends on that of their container. The following two types of containers are defined, depending on how the relative order between the elements is determined:

• Sequences: Sequences are linear collections of elements. The relative order of their elements is determined by the time and place of insertion, not by the characteristics of the elements. The following sequences are provided: vector, deque and list. All of them provide sequential access to elements. In addition, vector and deque support efficient random access. Besides, given an iterator, elements can be inserted and deleted at any point. The cost depends on the modification point and the container: in particular, list provides efficient insertion and deletion at any point, deque provides these operations efficiently only at the beginning and end of the sequence, and vector only at the end of the sequence. Common implementations of deque and vector use arrays. By contrast, lists are typically implemented using doubly linked lists.

Further, the STL specifies three container adaptors that take sequences as default: queue, stack and priority_queue. Essentially, a container adaptor takes a container type and transforms it into another with restricted functionality. In particular, none of the previous adaptors allows iteration through its elements.

• Associative containers: Associative containers correspond to the notion of dictionary. The relative order of their elements is determined by their keys and a comparison function. Associative containers support efficient search, insertion and deletion of elements by key. Besides, given an iterator into the structure, associative containers also offer efficient traversal in sorted key order.

The following ordered associative containers are defined: set, map, multiset and multimap. They are differentiated according to the following two criteria. On the one hand, elements can be made up only of a key, which is the case of set types, or be a {key, value} pair, which is the case of map types. On the other hand, associative

containers can allow repeated keys (multi types) or not. Associative containers are usually implemented with balanced binary search trees. Additionally, many library vendors offer associative containers that do not keep an order among their elements, and whose aim is optimizing query operations by key. They are typically implemented with a hash table. This kind of associative container will probably be included in the forthcoming standard.
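As a brief illustration (ours, not from the Standard), the following contrasts a sequence, ordered by insertion, with an associative container, ordered by key:

#include <iostream>
#include <map>
#include <string>
#include <vector>

int main() {
    std::vector<int> seq;                       // a sequence: insertion order
    seq.push_back(3); seq.push_back(1);
    std::cout << seq[0] << ' ' << seq[1] << '\n';   // prints: 3 1

    std::map<std::string, int> dict;            // an associative container: key order
    dict["pear"] = 2; dict["apple"] = 5;
    std::cout << dict.begin()->first << '\n';   // prints: apple (sorted traversal)
}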

Iterators

Iterators offer a container-independent interface to traverse the elements of a collection. In particular, they offer a dereference operation to access the element 'pointed to' by the iterator. If the iterator is mutable, the element can also be modified.

The main iterator types are: forward, bidirectional and random access. Forward iterators provide one-by-one forward access in a sequence of values. Additionally, bidirectional iterators provide one-by-one backward access; list, (multi)map and (multi)set offer bidirectional iterators. Furthermore, random access iterators provide constant time access to any element of the collection. This is done using iterator arithmetic (in analogy to pointer arithmetic): specifically, a displacement can be added to and subtracted from an iterator, the difference between the positions of two iterators can be calculated, and iterators can be compared using the < and > relational operators. vector and deque offer random access iterators.

Algorithms

STL algorithms use iterators to process the elements of a collection in a generic way. A collection is described by means of a begin iterator and an end iterator. Algorithms in the STL can be classified into the following categories: non-modifying sequence operations (for_each, find, count, ...), modifying sequence operations (copy, swap, partition, ...), sorting-like (sort, nth_element, ...), binary search (lower_bound, binary_search, ...), merge (merge, set_union, ...), heap (push_heap, make_heap, ...), and min/max (min_element, next_permutation, ...).
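For instance (an illustrative example, ours), generic algorithms operate on whatever iterator range they are given:

#include <algorithm>
#include <iostream>
#include <vector>

bool is_odd(int x) { return x % 2 != 0; }

int main() {
    std::vector<int> v;
    for (int i = 0; i < 6; ++i) v.push_back((5 * i + 3) % 11);   // 3 8 2 7 1 6
    std::sort(v.begin(), v.end());                               // sorting-like
    std::vector<int>::iterator it =
        std::lower_bound(v.begin(), v.end(), 8);                 // binary search
    std::cout << "elements smaller than 8: " << (it - v.begin()) << '\n';
    std::cout << "odd elements: "
              << std::count_if(v.begin(), v.end(), is_odd) << '\n';
}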

2.4.2 STL specification

The specification of STL components comprises pre- and postconditions and complexity requirements. Besides, some general exception handling requirements are stated. Time and space complexity requirements are commonly described using asymptotic notation, but in some cases exactly a certain number of operations is required. The analysis is done under the RAM model of computation, considering elements as black boxes (i.e., the size of the input counts only the number of elements, not their length).

Error checking in the STL is minimal because efficiency has a higher priority. In general, if preconditions are violated, the behavior is unspecified (no exception is thrown). Nonetheless, objects that are managed by the STL can throw exceptions that, if not caught, can reach the user program. Unfortunately, recovering from an exception is not always possible or, if it is, compromises efficiency. In this sense, the Standard establishes that the STL will not lose resources or violate the container invariants when exceptions are thrown. Besides, in some cases, the Standard requires that if an exception occurs, the operation has no effect; that is, it requires these operations to be atomic with respect to exceptions. For further reading on exception handling, see e.g., [1, 94].

Finally, the library is not concerned with concurrency or parallelism. Specifically, it does not provide tools for building multi-threaded applications. Moreover, the expected behavior of concurrently accessing a component is unspecified, i.e., it is implementation specific. By contrast, the new Standard constrains the expected behavior. Up to now, most existing implementations follow the thread-safety criteria of the SGI STL [84]: concurrent query operations, and concurrent operations performed over different instances, are thread-safe. Thus, query operations must be reentrant, and containers must not share data among instances.

2.4.3 STL implementations

The first public STL implementation was released by HP [92] in 1994, i.e., even before the Standard was published. The SGI STL was one of its successors. Some features were corrected, and new features were added, such as thread safety. Besides, it became, and still is, especially popular for its comprehensive web-based documentation.

Nowadays, most known C++ compilers implement their own STL. For instance, the GNU Compiler Collection (GCC) [43] offers an STL implementation together with the C++ compiler. GCC is a free software project that also includes front ends for other programming languages, like Fortran. GCC's implementation of the STL is based on SGI's. In all the experiments in this thesis, we have used the GCC compiler.

Additionally, there are some STL implementations distributed on their own. In particular, Dinkum [60] and STLPort [93] put special emphasis on conformance with the Standard. The Dinkum STL is a proprietary implementation by DinkumWare Ltd.; customized versions of Dinkum are distributed with Microsoft and IBM compilers. By contrast, STLPort is freely available through SourceForge [91] and offers debugging support.

But the STL is not the only C++ data structure library. Boost [13] and the CPH-STL [25] libraries are strongly related to the STL. Boost provides STL-like extensions for graph, math and string algorithms, among others. In particular, the string library includes specific string searching algorithms, but does not include sorting or string containers. Boost libraries are free, peer-reviewed and portable. Besides, they are intended to be widely useful, to establish existing practice, and to provide reference implementations, so that they are suitable for eventual standardization. The CPH-STL is a research project that includes alternative implementations of and extensions to the STL. For instance, it includes cache-conscious STL dictionaries based on B-trees (though without supporting iterator functionality). Lately, the project has rather focused on alternative design approaches for the library and on defining stronger guarantees for some of the components.

Furthermore, LEDA [68] is a C++ library for algorithms and data structures that is completely independent from the STL. It has been especially popular in the research community, in which the project started. Among its features are a comprehensive library for graphs, as well as mechanisms to choose among different implementations of a component.

Finally, some STL implementations are designed to take advantage of particularities of modern computer architectures. For instance, STXXL [22] is an implementation of the STL for efficient external (out-of-core) memory computations.

Several STL-like C++ libraries aim to ease parallel computation. The Standard Template Adaptive Parallel Library (STAPL) [4] is a parallel and concurrent C++ library. It provides a collection of parallel algorithms (pAlgorithms), parallel containers (pContainers), and 'iterators' for parallel containers (views). Besides, it provides a run-time system. It is designed for both shared- and distributed-memory parallel computers. The focus is on providing a concurrent framework more than on the parallelizations themselves (i.e., their parallel algorithms are rather simple). For instance, pContainers provide mechanisms for efficient concurrent modifying operations. Unfortunately, there is no public release available.

By contrast, the Multi-Core Standard Template Library (MCSTL) [86] is a parallel STL implementation that focuses on the parallelization of algorithms. Specifically, the MCSTL considers parallel algorithms for shared-memory multiprocessors, namely multi-core multiprocessors. Under this assumption, some techniques like fine-grained load-balancing can be efficiently implemented. Fine-grained load-balancing techniques are especially useful to adapt to changes in resources and tasks, as well as to accommodate other forms of parallelism (multi-core computers are not typically used as dedicated machines, so several processes might run concurrently). Also, they scale down for small inputs, i.e., as the inputs become smaller, the number of used threads decreases. Scaling down makes sense under the assumption of a multi-core computer, but not on many other parallel platforms. The MCSTL is implemented using OpenMP 2.5 [73]. Furthermore, I collaborated with the MCSTL on the implementation of parallel bulk insertion operations for STL dictionaries (see Chapter 6) and on a helper algorithm for partitioning sequences whose size is unknown (see Chapter 7). Both pieces of code are included in the 0.8.0-beta version. Besides, the latter piece of code is also included in the successor of the MCSTL, the libstdc++ parallel mode, which is included with the GCC compiler from version 4.3 on. Nowadays, development is focused on the libstdc++ parallel mode; see [85] for further considerations on its design.

The OpenMP Multi-Threaded Template Library [74] has a similar aim and approach to those of the MCSTL, but the algorithms it uses are less competitive.

Chapter 3. Lists revisited: cache-conscious STL lists

In this chapter, we present three standard-compliant cache-conscious implementations of STL lists. list is one of the simplest but most essential objects in the STL. Doubly linked lists are typically used to implement it because they easily cope with the STL list requirements. However, they are not cache-efficient. On the other hand, the design of existing cache-conscious lists addresses rather different requirements than those of STL lists. This chapter merges both approaches, paying special attention to iterator constraints.

The remainder of this chapter is organized as follows. In Section 3.1, we present some basic facts on STL lists. Then, in Section 3.2, we consider the behavior of classic doubly linked list implementations with regard to the cache memory. In Section 3.3, we review previous work and present the main design issues of our data structure. Then, in Section 3.4, we present a specific reorganization algorithm and, in Section 3.5, we analyze it further. Finally, in Section 3.6, we give the results of a thorough experimental evaluation; in particular, great speedups are shown with respect to doubly linked list implementations. Section 3.7 sums up some conclusions.

3.1 STL lists

STL lists are sequences, which are a kind of container (see Section 2.4.1 for an overview of STL containers). As sequences, they provide insertion and deletion of elements at any point. In particular, lists provide all the variants of insertion and deletion efficiently, namely in time linear in the number of elements being inserted or deleted. Additionally, they provide efficient operations to sort, reverse or splice a list, and also to merge two lists or make one list unique, among other minor utilities. Specifically, the standard requires that on a list with n elements, sort() takes O(n log n) time and is stable, splice() takes only O(1) time, and reverse() and unique() take O(n) time. Besides, size() is allowed to take O(n) time.

list provides bidirectional iterators, i.e., lists can be traversed backward and forward. An iterator is valid if the element it points to is a valid element of a collection. List iterators can only be invalidated when the element they point to is deleted; any other operation does not affect their validity. Besides, the number of iterators per list and per element is unbounded.
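As a small illustration (ours) of these guarantees, splice() moves nodes between lists in constant time and leaves iterators to the moved elements valid:

#include <iostream>
#include <list>

int main() {
    std::list<int> a, b;
    for (int i = 1; i <= 3; ++i) a.push_back(i);    // a: 1 2 3
    for (int i = 4; i <= 6; ++i) b.push_back(i);    // b: 4 5 6
    std::list<int>::iterator four = b.begin();      // points to 4
    a.splice(a.end(), b);                           // O(1); a: 1..6, b empty
    std::cout << *four << '\n';                     // still valid: prints 4
}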

Figure 3.1 illustrates how to use an STL list to solve the Josephus problem: given a group of n men arranged in a circle under the edict that every k-th man will be executed going around the circle until only one remains, find the position in which one should stand in order to be the last survivor.

int survivor(int n, int k) {
    if (k == 1) return n;
    list<int> L;                                   // assumes 'using namespace std;'
    for (int i = 1; i <= n; ++i) L.push_back(i);
    list<int>::iterator it = L.begin();
    for (int i = 1; i < n; ++i) {
        int c = (k - 1) % (n - i + 1);             // n - i + 1 men remain in the circle
        for (int j = 0; j < c; ++j) {
            ++it;
            if (it == L.end()) it = L.begin();     // wrap around the circle
        }
        it = L.erase(it);                          // execute the counted man
        if (it == L.end()) it = L.begin();
    }
    return *L.begin();
}

Figure 3.1: Using STL lists to solve the Josephus problem.

3.2 Cache behavior of doubly linked lists

In order to implement a list fulfilling all the STL requirements, a classical doubly linked list suffices. Indeed, this is what all known STL implementations do. In this case, nodes store an element together with pointers to the previous and the next node in the list. The size of the list is not stored, in order to guarantee constant time splice() operations. Besides, sort() is typically implemented by bottom-up mergesort to guarantee stability and quasilinear performance. Moreover, list iterators are just pointers to these nodes. Because of the existence of the begin() and end() iterators, implementations include a fake node after the end of the list. Figure 3.2 depicts this classical solution and Figure 3.4 gives its simplified definition in C++.

Pointer-based data structures use memory allocators to allocate and deallocate their nodes. Once a node is allocated, its physical position remains unaffected until it is freed. The logical position of a node in the list, instead, can be changed by simply modifying the nodes to which it is linked. This is a key property of any pointer-based data structure. It also makes implementing iterators trivial: iterators are not affected by these logical movements and they always remain valid.


Figure 3.2: Classical doubly linked implementation.

Allocators typically answer consecutive memory requests with consecutive addresses of memory (whenever possible). This is particularly important in the case of list, because if elements are added at one end (and no other allocations are performed at the same time), there is a good chance that logically consecutive elements are also physically consecutive, which leads to good cache performance when the elements are traversed. However, if elements are inserted at random points, or if the list is shuffled, or if different lists are constructed, logically consecutive elements will rarely be at nearby physical locations. Therefore, a traversal may incur a cache miss per access, thus dramatically increasing the execution time.

In order to give evidence of the above statement, we have performed the following experiment with the GCC list implementation: given an empty list, n random integers are pushed back one by one. Then, we measure the time to fully traverse it. Afterwards, we sort the list (using the sort method of list) to randomly shuffle the links between nodes. Finally, we measure again the time to fully traverse it. Traversal times before and after sorting the list are shown in Figure 3.3. Except for very small lists, it can be seen that the traversal of the shuffled list is about ten times slower than the traversal of the original list; and the only difference can be in the memory layout (and so, in the number of cache misses).

Taking into account that list is used when elements are often reorganized (e.g., sorted) or inserted and deleted at arbitrary positions (e.g., the Josephus problem), the previous experiment shows that it is worth trying to improve the performance of list using a cache-conscious approach. Note that if a user only wished to perform operations at the ends, they would be better served by vector, stack, queue or deque rather than list.
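The experiment above can be sketched as follows (a simplified harness, ours; it times with std::clock rather than the wall-clock measurements used in Section 3.6):

#include <cstdlib>
#include <ctime>
#include <iostream>
#include <list>

long traverse(const std::list<int>& L) {
    long sum = 0;                                  // touch every element
    for (std::list<int>::const_iterator it = L.begin(); it != L.end(); ++it)
        sum += *it;
    return sum;
}

int main() {
    const int n = 1000000;
    std::list<int> L;
    for (int i = 0; i < n; ++i) L.push_back(std::rand());

    std::clock_t t0 = std::clock();
    long s1 = traverse(L);                         // mostly sequential layout
    std::clock_t t1 = std::clock();

    L.sort();                                      // shuffles the links between nodes

    std::clock_t t2 = std::clock();
    long s2 = traverse(L);                         // scattered layout
    std::clock_t t3 = std::clock();

    std::cout << "traversal before: " << (t1 - t0) << " clocks, after: "
              << (t3 - t2) << " clocks (checksums " << s1 << ' ' << s2 << ")\n";
}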

3.3 Design of cache-conscious STL lists

A good survey of previous work on cache-conscious lists can be found in [21]. That survey considers traversal (as a whole), insertion and deletion operations. Let n be the list size and B be the block size. The cache-aware solution consists of a sequence of Θ(n/B) pieces, each having between B/2 and B consecutive elements. In this way, a constant number of memory accesses is achieved for updates, and O(n/B) for traversals. The cache-oblivious solutions are based on the packed memory structure (see Section 2.1). Specifically, updates require O((log² n)/B) memory accesses using its basic form. By allowing more flexibility in the data structure, and then reorganizing it when traversing, updates only require an amortized constant number of memory accesses. However, this approach is not convenient in the case of STL lists, because they are not traversed as a whole but step by step via iterators. Therefore, theory shows that cache-conscious lists speed up scan-based operations and, hopefully, do not significantly raise update costs compared to traditional doubly linked lists.

[Plot omitted: "Traversal with libstdc++"; curves: no-modification, sort; y-axis: scaled time (in microsec); x-axis: list size (in 10^4).]
Figure 3.3: Time measurements for list traversal before and after shuffling it. The vertical axis is scaled to the list size.

However, none of the previous designs takes into account common requirements of software libraries. In particular, combining iterator requirements and cache consciousness rules out some of the most attractive choices. In this section, we describe precisely the challenges posed by keeping with the list iterator requirements. Besides, we present some hypotheses that have guided some of our choices. Finally, we present the key ideas of our approach.

3.3.1 Iterator concerns and hypotheses on common list usages

In cache-conscious data structures, the logical and physical locations of elements are heavily related, to take advantage of cache hierarchies. As a consequence, it is difficult to implement iterators trivially with pointers, because these would have to be kept coherent whenever a modification in the list affects the position of the pointed element. In fact, the main issue here is that an unbounded number of iterators can point to the same element, and any arbitrary restriction on their number would make the solution non-standard-compliant.

In order to overcome this difficulty, our design has been guided by some hypotheses on common list usages. In particular, from our experience as STL users, we can state that many uses of list are in keeping with the following:

1. The list is often modified at any position: insertions, deletions, splices.

2. A particular list instance has typically only a few iterators on it.

3. Many traversals are expected.

4. The stored elements are not very big (e.g., integers, doubles, . . . ).

Let us briefly justify these points. Point 1 applies because, if there were no updates at arbitrary points, the user would have selected another container rather than list

(stack, deque, vector, ...). Point 2 arises from the observation that iterators are mainly declared as local variables to perform traversals or to update the list. Admittedly, one can conceive creating arrays of (uninitialized) iterators, but this is very rare. Point 3 is a direct consequence of Point 2 and of the fact that list elements can only be accessed through iterators, which provide sequential access. Finally, Point 4 is a common assumption that appears, implicitly or explicitly, throughout the literature on cache-conscious data structures. Since this last condition can be checked at compilation time, a traditional implementation could be used instead; this could neatly be achieved with template specialization.

3.3.2 Our data structure

In the following, we present the main characteristics of our data structure. This data structure will prove to be especially convenient when the hypotheses on common list usages hold but, in any case, it always complies with the costs required by the STL, even when the above conditions do not apply.

The core of the data structure is inspired by the cache-aware solution previously mentioned in Section 3.3. Specifically, it consists of a doubly linked list of buckets. A bucket contains a small array of K elements, pointers to the next and previous buckets, and extra fields to manage the data in the array. This simple data structure ensures locality inside a bucket, but logically consecutive buckets are allowed to be physically far apart. From now on, K will denote the capacity of the buckets, that is, the number of elements a bucket can hold. Moreover, we will say that the occupancy of a bucket is its number of elements divided by its capacity. In addition, we must deal with the following issues:

1. Capacity of a bucket. The capacity of a bucket has been fixed according to the outcome of the experiments (see Section 3.6.1).

2. Element arrangement inside a bucket. We devise three possible ways to arrange the elements inside a bucket:

(a) Contiguous: The elements are stored contiguously from left to right. Consequently, insertions and deletions must shift some elements towards their left or their right.

(b) With gaps: Elements are stored from left to right, but gaps between elements are allowed. Gaps, though, do not improve the worst case with respect to the number of shifts. Besides, an extra bit per element is needed to distinguish real elements from gaps.

(c) Linked: The order of the elements inside the bucket is set by internal links rather than by the implicit left-to-right order. This requires extra space for the links, but avoids shifting inside the bucket. Thus, this solution is more scalable.

These three possibilities are depicted in Figure 3.7, and Figure 3.5 gives their simplified definitions in C++.

struct node {
    node* next;
    node* prev;
    elem  x;
};

struct iterator {
    node* p;
};

Figure 3.4: C++ definition for a doubly linked list implementation.

// (a) Contiguous
struct bucket {
    bucket* next;
    bucket* prev;
    elem    a[K];
    int     beg;
    int     end;
};

// (b) With gaps
struct bucket {
    bucket* next;
    bucket* prev;
    elem    a[K];
    bool    occupied[K];
};

// (c) Linked
struct bucket {
    bucket* next;
    bucket* prev;
    elem    a[K];
    int     nxt[K];   // intra-bucket links (renamed from next/prev
    int     prv[K];   // to avoid clashing with the bucket pointers)
};

Figure 3.5: C++ definition for bucket arrangements.

// (a) Bucket of pairs
struct bucket {
    bucket* next;
    bucket* prev;
    pair    a[K];     // each pair holds an element and a pointer to its relay
};

struct relay {
    bucket* b;
    int     index;
    int     count;
};

struct iterator {
    relay* r;
};

// (b) 2-level list
struct bucket {
    bucket* next;
    bucket* prev;
    elem    a[K];
};

struct relay {
    relay*  next;
    relay*  prev;
    bucket* b;
    int     index;
    int     count;
};

struct iterator {
    relay* r;
};

Figure 3.6: C++ definition for iterator implementations (obviating arrangement details).


Figure 3.7: Bucket arrangements: contiguous, with gaps, and linked.

3. Bucket reorganization when inserting or deleting elements. The algorithms involved in the reorganization of buckets preserve the data structure invariant after an insertion or deletion. Our actual invariant will be given later, in the next section. The main issue here is keeping a good balance between high occupancy, few bucket accesses per operation, and few element movements.

4. Iterator management. Our key idea is identifying all the iterators that refer to an element with a dynamic node, called a relay, that points to it. Thus, there are two levels of indirection: iterators point to a relay, and relays point to elements. The pointer from a relay to an element is actually a pointer to its bucket plus its index in that bucket; in this way, the rest of the bucket data can also be accessed. Besides, a relay keeps count of the number of iterators that refer to it, so that it can be destroyed when there are none. A problem remains: when the physical location of an element changes, its relay (if it exists) must be reached and updated accordingly in constant time. We devise two approaches for that:

(a) Bucket of pairs: This solution is depicted in Figure 3.8, and Figure 3.6(a) gives its simplified definition in C++ (arrangement details are omitted here). In this straightforward solution, a pointer to its relay is kept for each element. This technique is easy to implement, and still uses less space than a traditional doubly linked list because it needs one pointer per element rather than two.

(b) 2-level: This solution is depicted in Figure 3.9, and Figure 3.6(b) gives its simplified definition in C++ (arrangement details are omitted here). The key point is taking advantage of the fact that STL list elements can only be accessed by library users through iterators. Let it denote an iterator given as operation parameter, r the relay to which it points, and elem the element to which r points; our reorganization algorithm guarantees that if the location of another element other changes due to the operation on elem, then other is at constant (logical) distance from elem. Then, in order to find its relay also in constant time, we keep a doubly linked list of relays in the same relative order as their elements, and perform a sequential search starting at r. Note that if the relay of other does not exist, the search finishes when the first relay pointing to another bucket is found. This solution is more involved to implement, but uses less space than the previous one, provided there are not many relays.

Figure 3.8: Bucket of pairs.

Figure 3.9: 2-level list.

It is clear that, in both solutions, the locality of iterator accesses decreases with the number of elements that have iterators, because the locations of the relays are not related to their elements. However, assuming that there will not be many iterators in a single application, we expect to find the relays in the cache. In any case, our two approaches conform to the standard for an arbitrarily large number of iterators.

5. Relay allocation. Given that iterators are frequently incremented or decremented and typically move from/to places where there are no other iterators, our implementations avoid, whenever possible, destroying the relay (on leaving an element) and recreating it (on arriving at the next element). Moreover, our implementations make use of a pool of relay nodes to delay as much as possible the use of the allocator to get or release memory for them.
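To make the double indirection concrete, the following is a minimal, self-contained demo (ours; a hypothetical simplification with the contiguous arrangement, a single sole-owner iterator, and no relay list or pool):

#include <iostream>

const int K = 4;

struct Bucket {
    Bucket* next;
    int     size;                 // elements occupy a[0..size)
    int     a[K];
};

struct Relay {
    Bucket* b;                    // position = (bucket, index)
    int     index;
    int     count;                // iterators sharing this relay
};

struct Iter {
    Relay* r;
    int& operator*() const { return r->b->a[r->index]; }
    Iter& operator++() {          // sole-owner case: reuse the relay in place
        if (++r->index == r->b->size && r->b->next) {
            r->b = r->b->next;    // hop to the next bucket
            r->index = 0;
        }
        return *this;
    }
};

int main() {
    Bucket b2 = { 0, 2, { 30, 40 } };
    Bucket b1 = { &b2, 2, { 10, 20 } };
    Relay  rel = { &b1, 0, 1 };
    Iter   it = { &rel };
    std::cout << *it << ' '; ++it;            // 10
    std::cout << *it << ' '; ++it;            // 20
    std::cout << *it << '\n';                 // 30: crossed a bucket boundary
}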

3.4 Reorganization of the buckets

In this section, we present in some detail how we keep the buckets reorganized when insertion and deletion operations are applied to the list. For the sake of simplicity, we assume that the elements in the buckets are arranged using the linked strategy, so that we may ignore intra-bucket movements. This section is organized as follows. First, we present the notation that we will use. Then, we state the representation invariant of our data structure. Finally, we give the necessary reorganization algorithms to maintain this invariant when insertions and deletions are applied to the list. To do so, we first present some useful operations, then the algorithm for the general case and, finally, the algorithms for various particular cases.

3.4.1 Notation

In the following, we denote by b(i) the i-th bucket of the list, by ℓ(i) the bucket at its left, and by r(i) the bucket at its right, provided that they exist. We also denote by ℓ^j(i) the bucket j positions to the left of b(i), and by r^j(i) the bucket j positions to the right of b(i). On the other hand, we denote by |b(i)| the occupancy of bucket b(i), that is, its number of elements divided by its capacity (K). Moreover, we define the occupancy of buckets from i to j as |b(i...j)| = |b(i)| + ··· + |b(j)|, and s_i = |ℓ(i)| + |b(i)| + |r(i)|. From now on, x always denotes the new element to be inserted or deleted, and m denotes the number of buckets in the list (there is always at least one bucket).

3.4.2 Representation invariant

Roughly, our goal is to try to achieve an average occupancy of at least 2/3 for each bucket. This is a compromise between guaranteeing a very high occupancy for any pattern, which is costly, and very loose restrictions that could render the data structure useless. Besides, the endpoint bucket constraints are relaxed to achieve almost maximum occupancy when lists are built by adding elements at the back or the front, which is a common building pattern. Specifically, the representation invariant of our data structure is given by the following constraints, which are depicted in Figure 3.10.

• General constraint: For all b(i) s.t. 2 < i < m − 1, we have s_i ≥ 2.

• Endpoint constraints: When m ≥ 3, we have

|b(1)| = 0 ⇒ |b(2)| > 5/12, and |b(1)| ≠ 0 ⇒ |b(2)| = 1;

and symmetrically, |b(m)| = 0 ⇒ |b(m−1)| > 5/12, and |b(m)| ≠ 0 ⇒ |b(m−1)| = 1.

• Small lists constraints: We have

m = 4 ⇒ |b(1...4)| > 3/2,
m = 3 ⇒ |b(1...3)| > 1,
m = 2 ⇒ |b(1...2)| > 1/2.

Note that lists with a single bucket have no occupancy constraints.
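These constraints translate directly into code. The following hypothetical checker (ours, useful e.g. in debug assertions) receives the occupancies occ[0..m), where occ[i] stands for |b(i+1)|:

#include <vector>

bool invariant_holds(const std::vector<double>& occ) {
    const int m = (int)occ.size();
    if (m == 1) return true;                               // no constraints
    double total = 0;
    for (int i = 0; i < m; ++i) total += occ[i];
    if (m == 2) return total > 0.5;                        // small lists
    if (m == 3 && !(total > 1.0)) return false;
    if (m == 4 && !(total > 1.5)) return false;
    // endpoint constraints (m >= 3), at both ends
    if (occ[0] == 0 ? !(occ[1] > 5.0 / 12) : occ[1] != 1) return false;
    if (occ[m-1] == 0 ? !(occ[m-2] > 5.0 / 12) : occ[m-2] != 1) return false;
    // general constraint: s_i >= 2 for 2 < i < m - 1 (1-indexed)
    for (int j = 2; j <= m - 3; ++j)                       // j is 0-indexed
        if (occ[j-1] + occ[j] + occ[j+1] < 2) return false;
    return true;
}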

3.4.3 Useful reorganization operations

In order to present our reorganization algorithm, let us define the following basic reorganization operations:

• Transfer from b(i) to b(j): One or several elements are moved from bucket b(i) to bucket b(j), which typically corresponds to either ℓ(i) or r(i).

• Split of b(i): A new bucket is allocated to the left or right of b(i), and the elements are redistributed among ℓ(i), b(i), r(i) and the new bucket.

• Merge of b(i): The elements from b(i) are redistributed among ℓ(i) and r(i); then, b(i) is deallocated.
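As an illustration, a transfer under the contiguous arrangement could be sketched as follows (hypothetical code of ours with elem = int; updating the relays, as required by Section 3.3, is omitted):

const int K = 100;

struct Bucket {
    int size;                                  // elements occupy a[0..size)
    int a[K];
};

// Move cnt elements from the front of 'from' to the back of 'to'.
void transfer(Bucket& from, Bucket& to, int cnt) {
    for (int i = 0; i < cnt; ++i)              // append to 'to'
        to.a[to.size + i] = from.a[i];
    to.size += cnt;
    for (int i = cnt; i < from.size; ++i)      // shift 'from' towards the left
        from.a[i - cnt] = from.a[i];
    from.size -= cnt;
}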

[Diagrams omitted: (a) general constraint; (b) endpoint constraint, case 1; (c) endpoint constraint, case 2; (d) constraint for lists with two buckets; (e) constraint for lists with three buckets (in addition to the endpoint constraints); (f) constraint for lists with four buckets (in addition to the endpoint constraints).]
Figure 3.10: Occupancy constraints.

3.4.4 Reorganization algorithm for the general case

We first consider the reorganization algorithm on large lists for buckets far from the endpoints. Assume that a deletion or an insertion takes place in bucket b(i), with 2 < i < m − 1. There are three possible cases where reorganization is needed:

1. Deletion from b(i) which makes s_k = 2 for some k ∈ {ℓ(i), b(i), r(i)}: In this case, first x is removed and then b(k) is merged.

2. Insertion in b(i) such that |b(i)| = 1 and s_i < 3: In this case, an element must be transferred from b(i) to either ℓ(i) or r(i) to make room for x. It seems reasonable to choose the neighbor with the lowest occupancy, but this is not necessary for the forthcoming analysis.

3. Insertion in b(i) such that |b(i)| = 1, s_i = 3, and 3 < i < m − 2: In this case, either a split or a transfer may be performed. The algorithm is the following:

(a) General case: First, b(i) is split so that ℓ(i), b(i), r(i) and the new bucket each have occupancy at least 3/4. Afterwards, x is put in its place.

(b) When |ℓ²(i)| < 1/2: The elements in ℓ²(i) and ℓ(i) are equally distributed among these two buckets. In this way, s_{ℓ³(i)} increases, s_{ℓ²(i)} and s_{ℓ(i)} remain the same, and s_i decreases (never below 5/2). Finally, one element from b(i) is moved to ℓ(i), thus making room for x, which is put in its place.

[Diagrams omitted: (a) regular transfer; (b) split (before); (c) split (after); (d) fuse (before); (e) fuse (after).]

Figure 3.11: General reorganization operations.

(c) When |ℓ³(i)| + |ℓ²(i)| < 5/4: The elements in ℓ³(i), ℓ²(i) and ℓ(i) are equally distributed among these three buckets. In this way, s_{ℓ⁴(i)} and s_{ℓ³(i)} increase, s_{ℓ²(i)} remains the same, s_{ℓ(i)} decreases (never below 7/3), and s_i decreases (never below 8/3). Finally, we move one element from b(i) to ℓ(i), thus making room for x, which is put in its place.

The three most general cases (Cases 1, 2 and 3a) are shown in Figure 3.11. The two special transfer cases (Cases 3b and 3c) are shown in Figure 3.12.

3.4.5 Reorganization algorithm at the endpoints

We now consider the reorganization algorithm at the endpoints of large enough lists. Here, the rules change to achieve almost perfect occupancy for typical list building patterns at the endpoints. We wish to note that, if there were no special constraints for the endpoints and the list was built by adding elements at the ends, the general reorganization algorithm would lead to an average occupancy of about 3/4, which is already greater than the lower bound of 2/3. The rules are simple, and apply symmetrically at the right endpoint. First, we present the transfer rules and, second, the rules that allocate or deallocate buckets.

Transfer rules

1. Deletion in b(2) such that |b(1)| > 0: First, the element x is removed. Then, the last element from b(1) is transferred to b(2).

[Diagrams omitted: (a) special transfer, case 1 (before); (b) special transfer, case 1 (after); (c) special transfer, case 2 (before); (d) special transfer, case 2 (after).]
Figure 3.12: Special transfer operations.

2. Insertion in b(2) such that |b(2)| = 1 and |b(1)| < 1: The first element from b(2) is transferred to b(1). Then, x is put in its place.

Allocation and deallocation rules

1. Deletion in b(2) such that |b(2)| ≤ 5/12: First, the element of b(2) is deleted and b(1) is deallocated. Then, elements of b(2) are transferred to b(3) until the latter is full.

2. Insertion in b(1) such that |b(1)| = 1: First of all, an empty bucket b is allocated and placed as the new b(1). Then, if x was inserted at the front of the list, we just put x in b. Otherwise, the first element of b(2) is shifted to b to make room for x.

3. Insertion in b(2) such that s_2 = 3: We perform a split operation with final occupancies (in this order) of 1/2, 1, 3/4 and 3/4. Then, x is put in its place.

4. Insertion in b(3) such that s_3 = 3: First, we perform a split operation as in the general case. Second, elements from b(1) are transferred to b(2) until b(2) is full or b(1) is empty. (Note: if the list has only five buckets, this step must be repeated for the right endpoint, because the third bucket from the left is also the third bucket from the right.) Then, x is put in its place.

3.4.6 Reorganization algorithm for small lists

We now consider lists with four or fewer buckets (m ≤ 4). The rules are adapted as follows:

• Transfer rules in Section 3.4.5: All apply, except for lists s.t. m = 2, in which elements are inserted with no transfers when possible.

• Allocation of new buckets in lists s.t. m ≤ 3: A new bucket is allocated only when there is no room at all. Further, the elements are reorganized as follows: the endpoint buckets are at 1/2 or more of their capacity; the rest become full.

• Allocation of new buckets in lists s.t. m = 4: Here, we assume that the left endpoint is the one nearest to the insertion point.

– Insertion in b(1): The same algorithm as in Section 3.4.5.

– Insertion in b(2): If b(4) is not full, one element is transferred from b(3) to b(4) and then one from b(2) to b(3). If b(4) is full (i.e., all the buckets are full), a split operation must be performed, with final list occupancies of 5/8, 1, 3/4, 1 and 5/8.

• Deallocation of buckets: A bucket is deallocated when the corresponding minimum average occupancy is reached.

– m = 3 or m = 4: The endpoint bucket b that is empty is deallocated. If m = 4, the elements of b's neighbor are transferred to the following neighbor until the latter is full.

– m = 2: The elements from the bucket with the lowest occupancy are transferred to the other one. Then, the former is deallocated.

3.5 An upper bound for the number of created and destroyed buckets

In this section, we present and prove two asymptotically optimal properties of our reorganization algorithms, related to the number of created/destroyed buckets. In practice, this means that our bucket management strategy is robust whatever the applied sequence of list operations, which contributes to good (cache) performance.

3.5.1 Interleaved operations at the same point

Theorem 3.5.1. Let a list conform to the representation invariant given in Section 3.4. Consider an arbitrarily long alternating sequence of insertions and deletions at the same point. Then, the total number of allocated and deallocated buckets is at most two.

Proof. We have the following case analysis:

1. The first operation performs neither an allocation nor a deallocation. There are two cases:

(a) No element is transferred. In this subcase, the next operation returns the bucket to its initial state, and this happens again and again; hence, no allocations or deallocations (in fact, no transfers either) are performed.

(b) An element is transferred from bucket b_i to bucket b_j. If b_i or b_j is an endpoint bucket, then this operation is reversed at each step, and so no allocations or deallocations are performed. Otherwise, this operation must be an insertion (because deletions do not transfer elements except at the endpoint buckets),

and so the next operation is a deletion. In this case, a deallocation cannot occur either, because the value of s_i is the initial one.

2. The first operation is an insertion and allocates a bucket. Then, the next deletion cannot deallocate a bucket. For small lists, this is due to the fact that the allocation and deallocation thresholds do not coincide. For big lists, when the allocated bucket becomes an endpoint, its next bucket is full and, therefore, cannot be deallocated afterwards. Finally, in the most general case, a split guarantees that the two central buckets i, where the element is actually inserted, have s_i > 9/4, and so a deallocation right afterwards cannot occur either. Therefore, we turn to Case 1, for which it has been shown that no further allocations or deallocations are performed.

3. The first operation is a deletion and deallocates a bucket. There are two subcases:

(a) The next operation (insertion) allocates a bucket. Then, we are in Case 2.

(b) The next operation (insertion) does not allocate a bucket. Then, we are in Case 1.

In any case, we turn to a case for which it has been shown that no further allocations or deallocations are performed. Further, for Case 2, the total number of allocated and deallocated buckets is exactly two.

3.5.2 Arbitrary sequences of insertions and deletions

Theorem 3.5.2. Let L be an empty list. Consider a sequence of r insertions and/or deletions at arbitrary positions, conforming to the rules given in Section 3.4, applied to L. Then, the number of allocated and deallocated buckets is at most 6r/K.

Proof. The amortized cost is shown using the potential method. Let L_0 be the initial list. For each i = 1, ..., r, let c_i be the actual cost of the i-th operation, and let L_i be the list resulting after applying the i-th operation to L_{i−1}. Since we are interested in the number of created/destroyed buckets, we have

c_i = 1 if a split or a merge is performed during the i-th operation, and c_i = 0 otherwise.

Let Φ be a potential function (defined later) that assigns a real number to every list. We define:

• The increment of potential of the i-th operation: Δ_i = Φ(L_i) − Φ(L_{i−1}).

• The amortized cost of the i-th operation: a_i = c_i + Δ_i.

• The total actual cost: C = Σ_{i=1}^{r} c_i.

• The total amortized cost: A = Σ_{i=1}^{r} a_i.

An easy calculation shows that C = A + Φ(L_0) − Φ(L_r): telescoping the potential differences gives A = Σ_{i=1}^{r} (c_i + Φ(L_i) − Φ(L_{i−1})) = C + Φ(L_r) − Φ(L_0).

Every list update can be separated conceptually into two steps:

1. one normal split, or one normal merge, or the equal redistribution of the elements of some non-endpoint buckets, or one specialized operation at a bucket near an endpoint, or just nothing;

2. one element insertion or one element deletion.

Note that the actual cost c_i comes from the first step. Let us denote by Δ_i^(1) and Δ_i^(2) the increments of potential due to the first and to the second step, respectively; that is, Δ_i = Δ_i^(1) + Δ_i^(2). The rest of this section is devoted to defining an adequate potential function, and proving with it that:

1. Δ_i^(1) ≤ 0 when c_i = 0,

2. Δ_i^(1) ≤ −1 when c_i = 1,

3. and Δ_i^(2) ≤ 6/K.

Altogether, this will imply a_i ≤ 6/K for every i, and thus will prove an upper bound of 6r/K for C.

The potential function

To define the potential function, it is convenient to use the following definitions:

|L| = Σ_{b∈L} |b|.

f(x) = 6x − 5 if x ≥ 5/6, and f(x) = 5 − 6x if x ≤ 5/6.
f_1(x) = 4x − 2 if x ≥ 1/2, and f_1(x) = 2 − 4x if x ≤ 1/2.
f_2(x) = 4x − 4 if x ≥ 1, and f_2(x) = 4 − 4x if x ≤ 1.
f_3(x) = 4x − 6 if x ≥ 3/2, and f_3(x) = 6 − 4x if x ≤ 3/2.

Then, we define the potential of a bucket b as follows:

Φ(b) = 2|b| if b is an endpoint, and Φ(b) = f(|b|) if b is not an endpoint.

Finally, we define the potential of a list as:

Φ(L) = f_1(|L|) if L consists of only one bucket; Φ(L) = f_2(|L|) if L consists of two buckets; Φ(L) = f_3(|L|) if L consists of three buckets; and Φ(L) = Σ_{b∈L} Φ(b) if L consists of four or more buckets.
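As a worked example (ours, for illustration): for a five-bucket list with occupancies (1/2, 1, 3/4, 3/4, 1),

Φ(L) = 2·(1/2) + f(1) + f(3/4) + f(3/4) + 2·1 = 1 + 1 + 1/2 + 1/2 + 2 = 5,

where the first and last terms come from the endpoint rule Φ(b) = 2|b|.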

Computation of Δ_i^(1) and Δ_i^(2)

From the definitions in the previous sections, it is trivial that Δ_i^(2) ≤ 6/K. Let us now bound Δ_i^(1) for every case.

Large lists

1. Normal split: Δ_i^(1) = 4·f(3/4) − 3·f(1) = 4·(1/2) − 3·1 = −1.

2. Normal merge: Let b_i, b_j, b_k be the initial buckets. Denote x = |b_i|, y = |b_j|, z = |b_k| and φ = f(x) + f(y) + f(z). It holds that x + y + z = 2. Further, suppose w.l.o.g. that x ≤ y ≤ z. Then, Δ_i^(1) = 2·f(1) − φ = 2 − φ. We distinguish three cases according to the values of x, y and z: (a) x ≤ 5/6, y ≥ 5/6 and z ≥ 5/6; (b) x < 5/6, y < 5/6 and z ≥ 5/6; and (c) x < 5/6, y < 5/6 and z < 5/6. For any of them, the reader can easily check that φ ≥ 3, and so Δ_i^(1) ≤ −1.

3. Equal redistribution of elements when inserting in a bucket i s.t. |ℓ²(i)| < 1/2 (Case 3b in Section 3.4): Let x₂ = |ℓ²(i)|; then Δ_i^(1) = 2·f((1 + x₂)/2) − (f(1) + f(x₂)) = −2.

4. Equal redistribution of elements when inserting in a bucket i s.t. |ℓ³(i)| + |ℓ²(i)| < 5/4 (Case 3c in Section 3.4): Let x₂ = |ℓ²(i)| and x₃ = |ℓ³(i)|; then Δ_i^(1) = 3·f((1 + x₂ + x₃)/3) − (f(1) + f(x₂) + f(x₃)) = 8 − 6(x₂ + x₃) − (f(x₂) + f(x₃)). We distinguish two cases according to the value of x₂: (a) x₂ ≤ 5/6 and (b) x₂ ≥ 5/6. In both cases, the reader can easily check that Δ_i^(1) ≤ −2.

5. Removing the first bucket when the second bucket is at 5/12 or less: Let b_3 be the third bucket and x = |b_3|; then Δ_i^(1) = 2·(5/12 − (1 − x)) + f(1) − (2·0 + f(5/12) + f(x)) = −8/3 + 2x − f(x). We distinguish two cases according to the value of x: (a) x ≥ 5/6 and (b) x ≤ 5/6. In either of them, it can be checked that Δ_i^(1) ≤ −1.

6. Splitting at the first bucket: Δ_i^(1) = 2·0 + f(1) − 2·1 = −1.

7. Splitting at the second bucket: Δ_i^(1) = 2·(1/2) + f(1) + 2·f(3/4) − (2·1 + 2·f(1)) = −1.

8. Splitting at the third bucket: Let b_1 be the first bucket, x = |b_1|, y = min(x, 1/4), and z = x − y; then Δ_i^(1) = 2z + f(3/4 + y) + 3·f(3/4) − (2x + 3·f(1)) = −3/2 − 2y + f(3/4 + y). There are two cases according to the value of y: (a) y ≥ 1/12 and (b) y ≤ 1/12. In both cases, it can be checked that Δ_i^(1) ≤ −1.

Small lists

1. Split 1–2: Δ_i^(1) = f_2(1) − f_1(1) = −2.

2. Merge 2–1: Δ_i^(1) = f_1(1/2) − f_2(1/2) = −2.

3. Split 2–3: Δ_i^(1) = f_3(2) − f_2(2) = −2.

4. Merge 3–2: Δ_i^(1) = f_2(1) − f_3(1) = −2.

5. Split 3–4: Δ_i^(1) = 2·(2·(1/2)) + 2·f(1) − f_3(3) = −2.

6. Merge 4–3: Let b_2 and b_3 be the middle buckets of the original 4-bucket list. Let x = |b_2| and y = |b_3|, where y = 3/2 − x. Then, Δ_i^(1) = f_3(3/2) − (2·(2·0) + f(x) + f(y)) = −(f(x) + f(3/2 − x)). Let us assume that x ≥ y without loss of generality. There are two cases: (a) x ≥ 5/6 and (b) x ≤ 5/6. In both of them, it can be checked that Δ_i^(1) ≤ −1.

7. Splitting 4–5 at a non-endpoint when all buckets are full: Δ_i^(1) = 2·(2·(5/8)) + 2·f(1) + f(3/4) − (2·(2·1) + 2·f(1)) = −1.

8. Extra reorganization after splitting 5–6 at the third bucket: Let b_5 be the fifth bucket, x = |b_5|, y = min(x, 1/4), and z = x − y; then Δ_i^(1) = 2z + f(3/4 + y) − (2x + f(3/4)) = −1/2 − 2y + f(3/4 + y). There are two cases according to the value of y: (a) y ≥ 1/12 and (b) y ≤ 1/12. In either of them, it can be checked that Δ_i^(1) ≤ 0.

Note that the bound Δ_i^(2) ≤ 6/K is not necessarily tight. It remains an open problem to show that this bound is tight or that, by contrast, a tighter bound exists.

3.6 Experimental analysis

In this section, we experimentally analyze the performance of our implementations and show their competitiveness in several common settings. We developed three implementations. Two of them use the contiguous bucket arrangement: one uses the bucket-of-pairs iterator solution, and the other the 2-level solution. The third implementation uses the linked bucket arrangement and the 2-level iterator solution. Notice that, in contrast to a flat doubly linked list, our operations deal with several cases, each of them involving more instructions. This makes our code 3 to 4 times longer (in lines of code).

The results are shown for a Sun workstation with Linux and an AMD Opteron CPU at 2.4 GHz, 1 GB main memory, 64 KB + 64 KB 2-way associative L1 cache, 1024 KB 16-way associative L2 cache, and 64 bytes per cache block. The programs were compiled using the GCC 4.0.1 compiler with optimization flag -O3. Comparisons were made against the current GCC STL implementation and LEDA 4.0 (in the latter case the compiler was GCC 2.95, for compatibility reasons). All the experiments were carried out with lists of integers, considering several list sizes that fit in main memory. Besides, all the plotted measurements are scaled to the list size for better visualization. With regard to performance measures, we collected wall-clock times, repeating each measurement enough times to obtain significant averages (the variance was always observed to be very low).

[Plots omitted: (a) push_front; (b) traversal before shuffling; (c) traversal after shuffling. Curves: gcc, bucket-pairs, 2-level-cont, 2-level-link; y-axis: scaled time (in microsec); x-axis: list size (in 10^4); 0% iterator load, bucket capacity 100.]
Figure 3.13: Performance results for basic operations with no iterator load.

Furthermore, to get significant insight into the behavior of the cache, we used Pin [61], a collection of tools for the dynamic instrumentation of programs. Specifically, we used a Pin tool that simulates, and gives statistics of, the cache hierarchy (using typical values of the AMD Opteron). In the following, we present the most significant results. Firstly, we analyze the behavior of lists with no iterators, involving basic operations and common access patterns. Then, we consider lists with iterators. Finally, we compare our implementations against LEDA, and consider other hardware environments.

3.6.1 Basic operations with no iterator load

Insertion at the back and at the front

Given an initially empty list, we wish to compare the time to get a list of n elements by successively applying n calls to either the push_back or push_front methods. The results for push_front are shown in Figure 3.13(a); a similar behavior was observed for push_back. In these operations, we observe that our three implementations perform significantly better than GCC's. This must be due to managing memory more efficiently: firstly, the allocator is called only once for all the elements in a bucket, and not for every element; secondly, our implementations ensure that buckets get full or almost full in these operations, and so less total memory space is allocated.

Traversal

Consider the following experiment: first, build a list; then, create an iterator at its beginning and advance it up to its end four times, adding at each step the current element to a counter. We measure the time taken by all the traversals. Here, the way the list is constructed plays an important role.

If we just build the list as in the previous experiment, the traversal times are those summarized in Figure 3.13(b). These show that performance does not depend on the list size, and that our 2-level contiguous list implementation is especially efficient, even compared to the other 2-level implementation. Our linked bucket implementation is slower than the contiguous implementations because, firstly, its buckets are bigger for the same capacity, and so there are more memory accesses (and misses) and, secondly, the increment operation of the linked implementation requires more instructions.

By contrast, if we sort the list before doing the traversals and then measure the time, we obtain the results shown in Figure 3.13(c). Now, the difference between GCC's implementation and ours becomes very significant and increases with the list size (our implementation turns out to be more than 5 times faster). Notice also that there is a big slope beginning at lists with 20000 elements. The difference in performance is due to the different physical arrangement of the elements in memory (in relation to their logical positions). To prove this claim, we repeated the same experiment using the Pin tool, counting the number of instructions, as well as L1 and L2 cache accesses and misses. Some of these results are given in Figure 3.15(a). Firstly, they show that our implementations indeed incur fewer cache misses (both in L1 and L2). Secondly, the scaled ratio of L1 misses is almost constant, because even small lists do not fit in L1.

[Figure 3.14, 0% iterator load and bucket capacity 100 — scaled time (in microsec) vs. list size (in 10^4); series: gcc, bucket-pairs, 2-level-cont, 2-level-link. Panels: (a) Insertion before shuffling, (b) Insertion after shuffling, (c) Intensive insertion.]

Figure 3.14: Performance results for basic operations with no iterator load.

Besides, the big slope in time performance for the GCC implementation coincides with a sudden rise in the L2 cache miss ratio, which leads to a state in which almost every access to L2 is a miss. This transition also occurs in our implementations, but much more smoothly. Nevertheless, the L2 access ratio (that is, the L1 miss ratio) is much lower because logically close elements are in most cases in the same bucket and so already in the L1 cache (because the bucket capacity is not too big).

Insertion

In this experiment, we deal with insertions at arbitrary points. First, a list is built (in the two abovementioned ways). Then, it is traversed forward four times. At each step, with probability 1/2, an element is inserted before the current one. We measure the time of the traversal plus the insertions. Results are shown in Figures 3.14(a) and 3.14(b), whose horizontal axis corresponds to the initial list size. Analogously to plain traversal, performance depends on the way the list is built. However, as the computation cost is greater in this case, the differences are smoother. In fact, when the list has not been shuffled, the bucket of pairs list performs worse than GCC's, while our two other implementations perform similarly to GCC's. On the other hand, when the list has been shuffled, GCC's time increases sharply, while ours is almost unaffected. It is interesting to observe that the linked arrangement implementation does not outperform the contiguous ones even though it does not require shifting elements inside the bucket. This must be because more memory accesses (and misses) are performed, and this cost is still dominant; we confirmed it by repeating the experiment under the Pin environment. As more insertions per element are done, the gain of the linked arrangement starts to be rather noteworthy. This is shown in Figure 3.14(c).
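A sketch of the measured loop follows (hypothetical code); inserting before the current iterator does not invalidate it, and the new element is not visited later in the same pass:

    #include <cstdlib>
    #include <list>

    // Traverse the list four times; at each step, with probability 1/2,
    // insert a copy of the current element before it.
    void insertion_experiment(std::list<int>& l) {
        for (int pass = 0; pass < 4; ++pass)
            for (std::list<int>::iterator it = l.begin(); it != l.end(); ++it)
                if (std::rand() & 1) l.insert(it, *it);
    }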

Internal sort

The STL requires an O(n log n) sort method that preserves the iterators on the elements. Our implementations use a customized merge sort in order to preserve the validity of iterators. Results of executing the sort method are given in Figure 3.16. These show that our implementations are between 3 and 4 times faster than GCC's. Taking into account that GCC also implements a merge sort, we claim that this significant speedup is due to the locality of data accesses inside the buckets. To confirm this, Figure 3.15(b) shows the Pin results: indeed, GCC performs about 30 times more cache accesses and misses than our implementations.
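The iterator-validity requirement that the sort method must meet can be illustrated with a small (hypothetical) example:

    #include <cassert>
    #include <list>

    int main() {
        std::list<int> l;
        l.push_back(3); l.push_back(1); l.push_back(2);
        std::list<int>::iterator it = ++l.begin();  // points to the element 1
        l.sort();                                   // must not invalidate it
        assert(*it == 1 && it == l.begin());        // 1 is now at the front
    }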

Effect of bucket capacity

The previous results were obtained for buckets with a capacity of 100 elements. This choice did not appear to be critical, though. Specifically, we repeated the previous tests with other bucket capacities, and observed that provided the capacity is not very small (below 8-12 elements), a wide range of values behaved well. Note that a bucket of integers with capacity 8 is already 40-80 bytes long (depending on the implementation and address length), while a typical cache block is 64 or 128 bytes long.

[Figure 3.15, 0% iterator load and bucket capacity 100 — scaled number of L2 cache accesses (logarithmic scale) vs. list size (in 10^4); series: gcc, bucket-pairs, 2-level-cont, 2-level-link, each as total accesses and as misses. Panels: (a) Traversal, (b) Internal sort.]

Figure 3.15: Simulation results on the cache performance (the vertical axis is logarithmic).

[Figure 3.16, 0% iterator load and bucket capacity 100 — scaled time (in microsec) vs. list size (in 10^4); series: gcc, bucket-pairs, 2-level-cont, 2-level-link.]

Figure 3.16: Performance results for sort with no iterator load.

[Figure 3.17, list size 4.86·10^6 and 0% iterator load — scaled time (in microsec) vs. bucket capacity (10 to 1000); series: gcc, bucket-pairs, 2-level-cont, 2-level-link.]

Figure 3.17: Performance results for insertion after shuffling, with variable bucket capacity (list size 4860000).

To illustrate the previous claims, we show in Figure 3.17 insertion results on a shuffled list with initially roughly 5 million elements. These show that for the contiguous arrangement implementations, time decreases until a certain point and then starts to increase: increasing the bucket size increases the intrabucket movements, which eventually cost more than what the achieved locality of accesses saves. In contrast, the linked arrangement implementation seems unaffected, because no such movements are performed, accesses to different buckets do not interfere with one another, and our insertion reorganization algorithm considers at most three buckets at a time. If we run the previous test with several instances at the same time, a smooth rise in time for all implementations can be seen, in particular for big bucket capacities. In fact, dealing with several data structures at the same time is common. In that case, accesses to the different objects can interfere, and such interferences become more probable as the number of instances grows. Therefore, it is advisable to keep a relatively small bucket size.

3.6.2 Basic operations with iterator load

Now, we repeat the previous experiments on lists that do have iterators on their elements. We use the term iterator load to refer to the percentage of elements of a list that have one or more iterators on them. Given how our iterators are implemented, it is this percentage, and not the exact number of iterators, that can really affect performance. Results are shown for tests in which the elements have already been shuffled, iterator loads range from 0% to 100%, and a big list size is fixed (about 5 million elements), because it is then crucial to manage data in the cache efficiently.
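A given iterator load can be produced as in the following sketch (hypothetical code): one live iterator is kept on roughly p percent of the elements, which by the above is all that matters for performance:

    #include <cstdlib>
    #include <list>
    #include <vector>

    // Keep one external iterator on roughly p% of the elements (p in [0,100]).
    std::vector<std::list<int>::iterator>
    make_iterator_load(std::list<int>& l, int p) {
        std::vector<std::list<int>::iterator> its;
        for (std::list<int>::iterator it = l.begin(); it != l.end(); ++it)
            if (std::rand() % 100 < p) its.push_back(it);
        return its;
    }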

[Figure 3.18, list size 4.86·10^6 and bucket capacity 100 — scaled time (in microsec) vs. % of iterator load (0 to 100); series: gcc, bucket-pairs, 2-level-cont, 2-level-link. Panels: (a) Traversal after shuffling, (b) Insertion after shuffling, (c) Internal sort.]

Figure 3.18: Performance results depending on the iterator load.

Traversal

When there are no iterators on the list, our implementations traverse the list very fast because the increment operation is simple and elements are accessed with high locality. When there are iterators, however, traversal may become slower, because the increment operation depends on whether other iterators point to the current element or to its successor. In contrast, the increment operation on traditional doubly linked lists is independent of this, so their performance should not be affected. When the list has not been shuffled, this is exactly the case. By contrast, when the elements are shuffled, which changes the logical order of the iterators, iterator accesses may exhibit low locality. Results for this case are shown in Figure 3.18(a); they show that the memory overhead indeed becomes the most important factor in performance. Nevertheless, the good locality of accesses to the elements themselves makes our implementations more competitive than GCC's up to an 80% iterator load, even for relatively small lists (about 100000 elements).

Insertion

When an element is inserted in a bucket with several iterators, some extra operations must be done, but in relative terms these are much fewer than in the traversal case. Therefore, performance should be less affected. Insertion results after the elements have been shuffled are shown in Figure 3.18(b). The results are analogous to those of the traversal test, but with smoother slopes, as happened with no iterator load. Specifically, when the list has been shuffled, our implementations remain preferable up to an 80% iterator load.

Internal sort

Guaranteeing iterator consistency in our customized merge sort is not straightforward, especially in the case of the 2-level approaches, which need some (though small) auxiliary arrays. Performance results are shown in Figure 3.18(c). The results indeed show that the 2-level implementations are more sensitive to the iterator load. Even so, all our implementations are faster than GCC's for iterator loads lower than 90%.

3.6.3 Comparison with LEDA

We now compare our lists with LEDA lists. The interface of LEDA lists is very similar to that of STL lists, and they also use a doubly linked list implementation. In addition, LEDA lists use a customized memory allocator. In Figure 3.19, we show the results for the traversal operation after shuffling. These make evident the performance limitations of a doubly linked list compared to our cache-conscious approach: LEDA's times are just slightly better than GCC's, but remain worse than those of our implementations. We omit the remaining LEDA plots, because its results are just slightly better than GCC's. The only exception is its internal sort (a quicksort), which is very competitive.

[Figure 3.19, 0% iterator load and bucket capacity 100 — scaled time (in microsec) vs. list size (in 10^4); series: gcc, leda, 2-level-link.]

Figure 3.19: Performance results for traversal after shuffling (LEDA).

Nevertheless, it requires linear extra space, does not preserve iterators (items in LEDA jargon) and is not faster than ours.

3.6.4 Other environments

The previous experiments were run on an AMD Opteron machine. We have verified that the results we claim also hold in other environments, including an older AMD K6 3D processor at 450 MHz with a 32 KB + 32 KB L1 cache and 512 KB off-chip L2 (66 MHz), and a Pentium 4 CPU at 3.06 GHz with an 8 KB + 8 KB L1 cache and a 512 KB L2 cache. On both machines, similar results are obtained in relative terms. Indeed, the results were better on newer machines and compilers.

3.7 Conclusions and future work

In this chapter we have presented three cache-conscious list implementations that are compliant with the C++ standard library. Cache-conscious lists had been studied before, but without coping with the library requirements. Indeed, these goals conflict, particularly with regard to preserving both the constant costs and the iterator requirements.

This chapter shows that it is possible to combine cache consciousness with the STL requirements efficiently and effectively. On the one hand, our reorganization algorithm guarantees a minimum average occupancy of 2/3, and almost maximum occupancy when the list is built by adding elements at the endpoints. Besides, we have shown optimal bounds with respect to the number of created and destroyed buckets for some general sequences of operations. On the other hand, a wide range of experiments has shown that our implementations are useful in many situations. The experiments compare our implementations against doubly linked list implementations such as GCC's and LEDA's. They show, for instance, that our lists can offer 5-10 times faster traversals and 3-5 times faster internal sort, and still be competitive with an (unusually) big load of iterators. Besides, in contrast to doubly linked lists, the performance of our data structure does not degenerate when the list is shuffled. Further, the experiments show that the 2-level implementations are especially efficient. In particular, we would recommend the linked bucket implementation, although its benefits only show when modifying operations are really frequent, because it can profit more from bigger cache blocks.

Our approach to STL lists could be extended to more complicated data structures. For instance, several cache-conscious implementations of dictionaries use B-trees; see e.g. the implementations in the CPH STL [25]. As far as we know, though, they do not implement iterator functionality. To support iterators in B-trees, it seems that the 2-level implementation of iterators would also have to translate into a tree data structure in order to keep up with the costs. Thus, the bucket of pairs implementation of iterators might turn out to be more attractive in this context. Overall, the extension is not straightforward.

Chapter 4

On the number of string lookups in BSTs with digital access, and related algorithms

Efficient string algorithms and data structures are obtained in essentially two ways. On the one hand, ad hoc string algorithms and data structures can be defined. On the other hand, comparison-based data structures and algorithms can be specialized so that fewer (or no) redundant character comparisons are performed. See Section 2.3 for more details on these approaches.

In this chapter, we study the properties of the latter approach in more depth. Comparisons in such specialized algorithms and data structures take advantage of the digital structure of strings, and of the order and outcome of other comparisons. To reuse this information efficiently, some extra data must be kept per element. In some cases, the resulting comparison method does not access the actual strings, but only their extra data. Assuming the usual pointer implementation of strings, this is relevant from a performance perspective, because avoiding chasing a pointer saves potential cache misses.

We analyze the number of string lookups in so-enhanced BSTs, quicksort and quickselect. In the following, we call them respectively augmented BSTs (ABSTs), augmented quicksort (AQSORT) and augmented quickselect (AQSEL). In Section 4.1, we describe them in more detail. In order to compute the number of string lookups of ABSTs, AQSORT and AQSEL, it is key to relate them to TSTs, which are ad hoc string trees; see Section 2.3.1 for further details on ad hoc string trees. In Section 4.2, we introduce some useful notation. Then, in Section 4.3, we describe precisely the relationship between ABSTs and TSTs, and present a theoretical analysis of the number of string lookups. Following from this analysis, we add some redundancy to ABSTs, which avoids one specific kind of string lookup; its analysis is in Section 4.4. Then, in Section 4.5, we analyze AQSORT and AQSEL by relating them to ABSTs. Finally, we sum up some conclusions in Section 4.6.


4.1 Previous work

Combining fast string comparisons with comparison-based data structures and algorithms has been tackled in some previous works (see e.g., [79, 45, 28]). In this section, we review how to enhance string comparisons in BSTs, quicksort and quickselect.

ABSTs. We denote by augmented BST (ABST) a generic BST that is modified as described in [79, 45] for fast string comparisons. The name choice reflects the fact that the tree nodes must be augmented with some extra piece of information. Specifically, ABSTs are obtained from generic BSTs as follows. Consider a BST node storing a string y and pointers to its children (and possibly to its parent). Let ℓp and ℓs be, respectively, the lengths of the maximum common prefix of y with its predecessor and with its successor. Each node in an ABST additionally stores ℓ, defined as the maximum of ℓp and ℓs, and a boolean µ to indicate which of the two is the maximum. Searching for a string w works as follows: while going down the tree, two lengths are kept, lp and ls; they are the maximum common prefixes of w with the current predecessor and the current successor, respectively (thus, initially both are 0). The new comparison function works as shown in Figure 4.1. According to µ, ℓ is compared against lp or ls. Let l be the length to which ℓ is compared. If l = ℓ and l is the largest of lp and ls, the characters of w must be compared against those of y, starting at the (ℓ+1)-th position, which holds the first significant character of y, until a difference is found. Besides, if at least the first pair of characters is equal (i.e., y[ℓ+1] = w[ℓ+1]), lp or ls must be updated accordingly. If l ≠ ℓ, or l is the smallest of lp and ls, the next node to visit is determined merely from the values of ℓ, µ and l; there is no need to look up the string y, and there is no change in lp or ls. Using this method, searching for a string of length m in a BST of height h takes time O(m + h). Concerning storage, the overall increment becomes less important the longer the string pointed to from the node is. Besides, looking up the fields ℓ and µ in a search only increases the number of local memory accesses, which are handled efficiently by memory hierarchies.
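To fix ideas, the node layout implied by this description can be sketched as follows (hypothetical declarations; the field names key, l and b match those used by the code in Figure 4.1, with b playing the role of the boolean µ):

    // Supporting types assumed by Figure 4.1 (hypothetical names):
    enum resultComparison { SMALLER, EQUAL, GREATER };
    typedef char* stringType;

    // A BST node augmented for fast string comparisons.
    struct Node {
        stringType key;   // the stored string y
        int        l;     // max common prefix length with predecessor/successor
        bool       b;     // assumed convention: true if the maximum is with the predecessor
        Node*      left;
        Node*      right; // (possibly also a pointer to the parent)
    };
    typedef Node* nodePtr;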

AQSORT and AQSEL. We denote by augmented quicksort (AQSORT) a generic quicksort algorithm that is modified as described in [79] for fast string comparisons. Analogously, we denote by augmented quickselect (AQSEL) a generic quickselect algorithm modified in the same way (note that this case is not explicitly tackled in [79]). Similarly as with BSTs, some extra piece of information must be kept per element. The application of these techniques relies on the following properties of quicksort and quickselect. First, if the pivot is chosen without comparing elements, comparisons are made solely in the partitioning process. In every partition, each element is compared exactly once against the pivot. Besides, the recursive calls of quicksort have the implicit structure of a BST: each pivot choice (and partitioning) constitutes a node, and each recursion base case corresponds to a leaf. Note that in the case of quickselect, the structure of the recursive calls corresponds to a path in a BST. Given the correspondence of quicksort and quickselect with BSTs, we can enhance these algorithms with fast string comparisons analogously to BSTs. Specifically, generic quicksort and quickselect are modified as follows.

    // Forward declaration of the string-lookup routine used by compare.
    resultComparison look_string(nodePtr t, stringType& key, int& lp, int& ls);

    resultComparison compare(nodePtr t, stringType& key, int& lp, int& ls) {
        if (t->b) {
            if (lp > t->l) return SMALLER;
            if (lp < t->l or lp < ls) return GREATER;
            return look_string(t, key, lp, ls);
        } else {
            if (ls > t->l) return GREATER;
            if (ls < t->l or ls < lp) return SMALLER;
            return look_string(t, key, lp, ls);
        }
    }

    // Case stringType is char*
    resultComparison look_string(nodePtr t, stringType& key, int& lp, int& ls) {
        int l = t->l;
        while (key[l] == t->key[l] and key[l] != '\0') ++l;
        if (key[l] < t->key[l]) { ls = l; return SMALLER; }
        lp = l;
        if (key[l] > t->key[l]) return GREATER;
        return EQUAL;
    }

Figure 4.1: Specialized compare function for ABSTs.

For each string x, the length of the maximum common prefix of x with its predecessor and with its successor in the implicit BST structure is kept. We denote these lengths respectively γp(x) and γs(x). The following changes must be made while partitioning. Let v be the string acting as pivot, and let w be any string in the array to be partitioned. Then, the comparison of w against v is analogous to that in ABSTs: v acts as the string in the node and w as the string being searched for. Besides, the swap function on two given strings x1 and x2 must be specialized so that, in addition, γp(x1) is swapped with γp(x2), and γs(x1) with γs(x2). Note that the comparisons use the results of previous partitioning steps (comparisons inside each partitioning step are independent). The first partition must compare all the strings from the beginning, as in the non-specialized partitioning algorithm, and, in addition, it must update γp(x) and γs(x). From the second call on, however, the computed prefixes γp(x) and γs(x) are used and updated accordingly to avoid redundant character comparisons. Thus, this specialized partition is only beneficial if it is used repeatedly (as in quicksort and quickselect), because of the prefix information gathered in previous iterations. Finally, we present parallel versions of AQSORT and AQSEL in Chapter 9.
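For instance, the specialized swap can be sketched as follows (a hypothetical layout, keeping the prefix lengths next to each string so that they travel with it):

    #include <utility>

    struct AugString {
        const char* s;   // the string
        int gp, gs;      // longest common prefixes with the implicit-BST
                         // predecessor and successor (gamma_p, gamma_s)
    };

    // Swapping two array elements must exchange the prefix information too,
    // so the data gathered in previous partitioning steps stays attached.
    inline void augSwap(AugString& x1, AugString& x2) {
        std::swap(x1, x2);
    }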

4.2 Notation

In this section, we present the notation that will be used throughout the chapter. Let a be an array of strings. Then, S(a) denotes the set of strings in a. Analogously, let u be a search tree (e.g., a TST, an ABST). Then, S(u) denotes the set of strings stored in u. Let w be a string. Then, C(u,w) denotes the sequence of pairwise character comparisons made in u when searching for w. We say that each pairwise character comparison in u is matched to the node whose string is looked up to make the comparison. I(u,w) denotes the sequence of subsequences resulting from partitioning C(u,w) after each inequality comparison. Besides, the following notation on TST elements is used, in addition to the standard notation on TSTs (see Section 2.3.1). A node with at least one non-null comparison pointer is a decision node. A node that is not reached by chasing a descent pointer is a descent root. A path that begins in a descent root, ends either in a decision node or in a leaf, and is made exclusively of descent pointers, is called a descent path. It is proper if it contains two or more nodes (joined by one or more descent pointers). Note that intermediate nodes can also be decision nodes; thus, there may be several descent paths beginning at the same node, but with different lengths. For instance, in Figure 2.1(b) there are three descent paths beginning from the right child of the root: tr, tre and tree#. Let x be a decision node or a leaf in a TST; D(x) denotes the sequence of nodes in the descent path that ends in x. The following notation concerns the searching path for a string w in a TST. A decision taken node for w is a decision node in the searching path of w such that one of its comparison pointers is chased. For instance, when searching for tree in the TST in Figure 2.1(b), the root is a decision taken node, whilst its right child is a decision node but not a decision taken node. A search descent path for w is a descent path ending in a decision taken node or a leaf; alternatively stated, it is a maximal descent path in the searching path for w. It is proper if it contains at least two nodes. For instance, the searching path for tree in the TST in Figure 2.1(b) contains only one search descent path: tree#. The searching path for trie contains two search descent paths: tre and ie#. Note that the last characters in the search descent paths are not included in the searched word; for instance, trie is built up by concatenating tr and ie.

4.3 On the number of string lookups in ABSTs

In this section, we analyze the number of string lookups in ABSTs. First, we present some independent properties of TSTs and ABSTs. Then, we relate both kinds of search trees. Finally, we relate the number of string lookups in ABSTs to known properties of TSTs. This relationship with TSTs was already sketched in [79], in order to compute the number of digit comparisons.

Lemma 4.3.1. Let t be a TST and let w be a string. Each of the subsequences in I(t,w) is matched to a search descent path in t for w, and each search descent path in t for w is matched to a subsequence.

Proof. I(t,w) is obtained from C(t,w) by partitioning after each inequality comparison. In t, inequality happens only if a comparison pointer is chased. Thus, the nodes that are matched to a subsequence in I(t,w) must be joined by descent pointers. Moreover, the first must be a descent root, and the last must be a decision taken node. This is exactly a search descent path. On the other hand, each search descent path must be matched to a subsequence in I(t,w), because otherwise I(t,w) would not represent the sequence C(t,w).

From the previous lemma, we have the following corollary:

Corollary 4.3.2. Let t be a TST. The number of search descent paths in t when searching for a string w is $|I(t,w)|$. The cumulative number of search descent paths in t when searching for every string in S(t) is $\sum_{w \in S(t)} |I(t,w)|$.

Now, we analyze ABSTs.

Lemma 4.3.3. Let b be an ABST and let w be a string. Each of the subsequences in I(b,w) is matched to a node in b. Moreover, if a node in the searching path in b for w is not matched to any subsequence in I(b,w), then no character is compared at that node.

Proof. I(b,w) is obtained from C(b,w) by partitioning after each inequality comparison. In b, inequality happens only if a pointer is chased. Thus, each of the subsequences in I(b,w) must be matched to exactly one node. On the other hand, if a node is not matched to any subsequence, no character is compared at the node, because otherwise I(b,w) would not represent the sequence C(b,w).

From the previous lemma, we have the following corollary:

Corollary 4.3.4. Let b be an ABST. The number of string lookups when searching for a string w is $|I(b,w)|$. The cumulative number of string lookups in b when searching for every string in S(b) is $\sum_{w \in S(b)} |I(b,w)|$.

Using the previous properties, we relate TSTs and ABSTs.

Definition 4.3.5. A TST t and an ABST b are equivalent iff, for any string w, I(t,w) equals I(b,w).

Note that if t and b are equivalent, then S(t) and S(b) must be equal. Also, the number of digit comparisons in t and in b when searching for any w must be equal.

Lemma 4.3.6. Let S be a set of strings. If an ABST b and a TST t are built by inserting the strings in S in the same order (and applying no rotations), t and b are equivalent.

Proof. The proof is by induction on the number of strings. Before any string is inserted, searching for any string y in b and in t results in I(b,y) and I(t,y) being empty (i.e., no character comparison is made). Thus, t and b are equivalent. Before inserting the string $w_{i+1}$ in t and in b, i strings have already been inserted in both trees, which are equivalent; therefore, $I(t,w_{i+1})$ equals $I(b,w_{i+1})$. Let r be the leaf in t and let s be the leaf in b that is reached when searching for $w_{i+1}$. Let x be the length of the longest common prefix of $w_{i+1}$ with the strings already inserted. Inserting $w_{i+1}$ in t and in b produces the following modifications: in t, a descent path of $|w_{i+1}| - x + 1$ nodes is added as a child of r, containing each of the characters of $w_{i+1}$ starting at the (x+1)-th position (in sequence order), plus the special end character; in b, only one node is added as a child of s, containing $w_{i+1}$. In any case, given a searched string y, I(t,y) and I(b,y) change only if nodes r and s are reached, respectively. In particular, I(t,y) and I(b,y) contain one additional subsequence. Besides, the resulting new subsequence is the same in t and in b, because the same substring is reached from r and s, respectively.

Note that an ABST has a unique equivalent TST, but a TST may have several equivalent ABSTs. Using the previous lemmas, the following theorem relates the number of string lookups in ABSTs to properties of TSTs.

Theorem 4.3.7. Let t be a TST and let b be an equivalent ABST. Let w be any string. Then,
• The number of string lookups in b when searching for a string w coincides with the number of search descent paths in t when searching for w.
• The number of string lookups in b when searching for all the strings in b coincides with the cumulative number of search descent paths in t when searching for every string in S(t).

Proof. From Corollary 4.3.2, $|I(t,w)|$ is the number of search descent paths in t for w. From Corollary 4.3.4, $|I(b,w)|$ is the number of string lookups in b when searching for w. Given that t and b are equivalent, I(t,w) equals I(b,w). Thus, $|I(t,w)|$ equals $|I(b,w)|$, and $\sum_{w \in S(t)} |I(t,w)|$ equals $\sum_{w \in S(b)} |I(b,w)|$.

In particular, the number of string lookups in ABSTs is never greater than in plain BSTs. Recall that searching for a string in a plain BST requires as many string lookups as there are nodes in the searching path. In the case of TSTs, there are no string lookups at all (because nodes contain a character instead of a string). By contrast, the number of visited nodes in TSTs, which are also accessed through pointers, can be very large; specifically, it depends on the string length, the alphabet cardinality and the insertion order of the strings. Now, the problem of determining the number of string lookups in ABSTs is reduced to determining the number of search descent paths in TSTs. We do so using some known results on TSTs. First, we need some additional definitions on TSTs.

Definition 4.3.8 ([17]). Let t be a TST and let w be a string. The (comparison) search cost R(t,w) is the number of comparison pointers in the searching path of w in t. The (comparison) path length L(t) is the sum of the distances of all external nodes to the root of the tree, where distance is measured as the number of comparison pointers.

An exact mathematical expression for R(t,w) and L(t) is given in [17, Theorem 2.1]. An asymptotic expression for R(t,w) and L(t) is given in [17, Theorem 2.2]. Besides, R(t,w) and L(t) are strongly related to the number of search descent paths. Note that R(t,w) is equivalent to counting the number of search descent paths except for the last one. The following corollary gives the exact relationship.

Corollary 4.3.9. Let t be a TST and let w be a string. The number of search descent paths in t for w is equal to $R(t,w) + 1$. The cumulative number of search descent paths in t when searching for every string in S(t) is $L(t) + |S(t)|$.

Finally, from Theorem 4.3.7 and Corollary 4.3.9, we characterize in the following corollary the number of string lookups in ABSTs.

Corollary 4.3.10. Let t be a TST and let b be an equivalent ABST. The number of string lookups in b when searching for a string w is $R(t,w) + 1$. The number of string lookups in b when searching for every string in S(b) is $L(t) + |S(t)|$.

4.4 On the number of string lookups in CABSTs, an extension of ABSTs

In the following, we present character augmented BSTs (CABSTs), a variant of ABSTs in which some redundancy is added (see Section 2.3.2 for more examples of redundancy) in order to avoid the string lookups due to binary searching for a character position. We relate the number of string lookups in CABSTs to some properties of TSTs, and in turn, to some properties of Patricia tries. We define CABSTs as follows.

Definition 4.4.1. A CABST β is an extension of an ABST b in which a character field is added to each node. This field stores the first significant character of the string stored in the node (i.e., the character at the position right after the maximum common prefix). We say that β corresponds to b.

Comparisons are accordingly modified to take this character into account. The sequence of character comparisons, though, remains the same, and so, it can be related to TSTs analogously as ABSTs are related to TSTs.
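A CABST node thus extends the ABST node sketched in Section 4.1 with one character (hypothetical layout):

    // ABST node extended with the first significant character c = key[l],
    // i.e., the character right after the longest common prefix. Storing the
    // first k significant characters instead would replace c by a small array.
    struct CNode {
        const char* key;
        int         l;
        bool        b;
        char        c;      // first significant character of key
        CNode*      left;
        CNode*      right;
    };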

Definition 4.4.2. A character stored in a node r in a CABST is useful when searching for a string w iff it avoids looking up the string in r.

In the following, the benefits of CABSTs when searching for a string w are described precisely, taking into account the relationship between ABSTs and TSTs. We use the following definition.

Definition 4.4.3. Let t be a TST and let b be an equivalent (C)ABST (thus I(t,w) = I(b,w) for any string w). Let r be a node in b and let s be a node in t. We say that r and s are search related if, when searching for any string w, the same subsequence in I(t,w) is matched to s in t and to r in b.

Now, we can describe the benefits of CABSTs in terms of the previous property.

Lemma 4.4.4. Let t be a TST and let β be an equivalent CABST, and let r be a node in β search related to a node s in t. When searching for a string w, the character stored in r is useful iff D(s) is not a proper search descent path.

Proof. If the character stored in r is useful when searching for w, the string in r does not need to be looked up. Thus, exactly one pairwise character comparison is made at r. Given that s in t is search related to r, only one character is compared in D(s). This implies that D(s) is not a proper search descent path. The opposite implication is proved as follows. If D(s) is not a proper search descent path, only one character is compared in D(s). Given that s is search related to r, only one character must be compared at r as well. Besides, according to the definition of CABSTs, this character is stored in r. Given that comparing the stored character suffices, the character in r is useful.

That is, it follows from Lemma 4.4.4 that CABSTs need at most one string lookup to determine the matching value for a character position. Specifically, the looked-up string is the one with the coinciding character value (corresponding to a proper search descent path). We state this relationship more precisely in the following theorem:

Theorem 4.4.5. Let t be a TST and let β be an equivalent CABST. Let w be any string. The number of proper search descent paths in the searching path of t for w coincides with the number of strings looked up in β when searching for w. The cumulative number of proper search descent paths in t when searching for every string in S(t) coincides with the total number of strings looked up in β when searching for every string in S(β).

Proof. From Theorem 4.3.7, the number of nodes of the underlying ABST b whose string is looked up when searching for w coincides with the number of search descent paths in t. In β, that number of nodes is decremented by the number of nodes whose stored character is useful. According to Lemma 4.4.4, a node in β is useful iff it is search related to a node s in t such that D(s) is not a proper search descent path. Thus, a string in a node of β is looked up when searching for w only if the node is search related to a node s in t such that D(s) is a proper search descent path. The theorem follows.

However, as far as we know, the exact number of proper search descent paths in TSTs has not been analyzed. By contrast, we can give some straightforward upper bounds. The first one is obtained by relating CABSTs to Patricia tries, using the following fact and lemma. The fact follows from the definition of a proper descent path.

Fact 4.4.6. The number of proper descent paths in a TST t completely included in the searching path of t for any string w is greater than or equal to the number of proper search descent paths for w.

Lemma 4.4.7. The number of proper descent paths in a TST t completely included in the searching path of t for any string w corresponds to the search cost in a Patricia trie storing S(t). The cumulative number of descent paths in t when searching for every string in S(t) corresponds to the external path length of a Patricia trie storing S(t).

Proof. Let p be a Patricia trie and let q be a trie such that S(t) = S(p) = S(q). Given the aforementioned equality between the sets of strings, the nodes in t where a proper descent path ends are related to the nodes in q from which two or more pointers go out, and these in turn are exactly the nodes of p (p has no other nodes). Therefore, the number of descent paths traversed in t when searching for w corresponds to the search cost in p and, similarly, the cumulative number of descent paths corresponds to the external path length.

Useful references on the analysis of Patricia tries are [23, 14, 24]. Finally, from Theorem 4.4.5, Fact 4.4.6 and Lemma 4.4.7, we have the following corollary:

Corollary 4.4.8. The number of strings looked up in a CABST β when searching for any string w is upper bounded by the search cost in a Patricia trie storing the same set of strings. The total number of strings looked up in β when searching for every string in S(β) is upper bounded by the external path length of a Patricia trie storing the same set of strings.

In particular, the cost of searching in a Patricia trie is never greater than the length of the searched string, as is the case with tries. On the other hand, the number of string lookups in β and in b is upper bounded by the number of string lookups in a plain BST; specifically, the number of string lookups in a BST of height h is O(h). Putting it all together, we have the following corollary:

Corollary 4.4.9. The number of string lookups in a CABST of height h when searching for any string w of length m is O(min(m,h)).

Finally, we note that string lookups in CABSTs could mostly be avoided for many common datasets by storing the first k relevant digits, where k ≥ 1 is a small constant. This way, most (if not all) memory accesses are limited to accesses to the tree nodes.

4.5 On the number of string lookups in AQSORT and AQSEL

In this section, we analyze the number of string lookups in AQSORT and AQSEL by relating them to ABSTs. Besides, we present and analyze CAQSORT and CAQSEL, variants of AQSORT and AQSEL, respectively, in which some redundancy is added; they are obtained analogously as CABSTs are obtained from ABSTs. In the following, we assume a pivot selection method that does not need to compare elements. Besides, a denotes the array of strings used as input for the algorithms we analyze. First, we formally define CAQSORT and CAQSEL.

Definition 4.5.1. CAQSORT and CAQSEL are extensions of AQSORT and AQSEL, respectively, in which a character field is added for each element of a. This field stores the first significant character of the string.

Now, thanks to the relationship of quicksort and quickselect with BSTs, we characterize the number of string lookups. First, we consider (C)AQSORT.

Theorem 4.5.2. Let b be the ABST corresponding to the execution e of AQSORT on a, and let β be the CABST corresponding to the execution f of CAQSORT on a. The numbers of string lookups and digit comparisons during e and f coincide with the numbers of string lookups and digit comparisons in b and β, respectively, when searching for every string in S(a).

Proof. There is an isomorphism between a quicksort execution and a plain BST built by inserting the strings in the order in which they are used as pivots. Since augmenting the BST and quicksort does not change the order in which comparisons are made, the theorem follows.

In particular, the number of string lookups in (C)AQSORT is never greater than in plain quicksort. Recall that quicksort requires as many string lookups as comparisons (regarded as atomic). Besides, from Corollary 4.3.10 and Theorem 4.5.2, we have the following important result:

Corollary 4.5.3. Let t be a TST equivalent to the ABST corresponding to the execution e of AQSORT on a. The number of string lookups in e is $L(t) + |S(t)|$.

Furthermore, the following lemma relates the number of string lookups in AQSORT and in multikey quicksort. Multikey quicksort is a sorting algorithm isomorphic to a TST built by inserting the strings in the order in which they are used as pivots. See Section 2.3.1 for further details on multikey quicksort.

Lemma 4.5.4. Let t be a TST equivalent to the ABST corresponding to the execution e of AQSORT on a. Let m be an execution of multikey quicksort isomorphic to t. The number of string lookups in e is never greater than the number of string lookups in m.

Proof. Given the equivalence of m and e with t, exactly the same digit comparisons must be made in m and e. In multikey quicksort, the number of string lookups coincides with the number of digit comparisons. By contrast, in (C)AQSORT the number of string lookups is at most the number of digit comparisons. Thus, the lemma follows.

In return, multikey quicksort uses no (explicit) additional space (recall that (C)AQSORT needs linear additional space with respect to the number of elements being sorted). Now, we consider (C)AQSEL.

Theorem 4.5.5. Let p be an ABST built by inserting the strings in the order in which they are used as pivots in the execution e of AQSEL on a, and let π be a CABST built by inserting the strings in the order in which they are used as pivots in the execution f of CAQSEL on a. The numbers of string lookups and digit comparisons in e and f coincide with the numbers of string lookups and digit comparisons in p and π, respectively, when searching for every string in S(a).

Proof. There is an isomorphism between a quickselect execution and the sequence of comparisons resulting from searching for every string in S(a) in a BST built in the same order as pivots are selected (note that this BST is actually a path). The same happens with the augmented versions, because augmenting the BST and quickselect does not change the order in which comparisons are made. Thus, the theorem follows.

In particular, the number of string lookups in (C)AQSEL is never greater than in plain quickselect. Recall that quickselect requires as many string lookups as element comparisons. Besides, from Corollary 4.3.10 and Theorem 4.5.5, we have the following corollary:

Corollary 4.5.6. Let t be a TST equivalent to the ABST corresponding to an execution e of AQSEL on a. The number of string lookups in e is $\sum_{w \in S(a)} (R(t,w) + 1) = n + \sum_{w \in S(a)} R(t,w)$.

It does not seem straightforward, though, to obtain a closed formula for the number of string lookups in (C)AQSEL. Also, it is not possible to relate it to the number of string lookups in multikey quickselect (see Chapter 5 for a detailed description of multikey quickselect). Analogously to multikey quicksort, the number of string lookups in multikey quickselect coincides with the number of digit comparisons. However, the sequences of digit comparisons in (C)AQSEL and in multikey quickselect cannot be related. Specifically, we have seen in Theorem 4.5.5 that a (C)AQSEL execution is related to a (C)ABST built by inserting the strings in the order in which they are used as pivots, and thus, in turn, a (C)AQSEL execution can be related to a TST. By contrast, the obtained TST does not describe the sequence of digit comparisons of an execution of multikey quickselect on the same array.

4.6 Conclusions and future work

In this chapter, we have analyzed comparison-based data structures and algorithms enhanced for fast string comparisons, from a cache-conscious perspective. We have stressed that the outcome of some of the comparisons can be determined without looking up any string. This is relevant with regard to performance, because saving string lookups saves random memory accesses, which are not handled efficiently by modern memory hierarchies.

First, we have characterized the number of string lookups in so-enhanced BSTs, which we have called ABSTs. Then, we have added some redundancy (we have called the resulting trees CABSTs) to avoid the string lookups due to binary searching when determining the next value for a character position. In doing so, the worst-case number of string lookups per search is also reduced. The analysis of (C)ABSTs is done by relating them to some kinds of tries. Specifically, we can obtain the precise number of string lookups when searching in an ABST using already known analytical results for TSTs. For the number of string lookups when searching in a CABST, we have provided an upper bound, relating TSTs in turn, this time, to Patricia tries.

Additionally, we have analyzed the number of string lookups in so-enhanced comparison-based algorithms, namely quicksort and quickselect (which we have called respectively (C)AQSORT and (C)AQSEL), relating them to (C)ABSTs. However, neither the number of string lookups nor the number of digit comparisons in (C)AQSEL can be directly expressed in terms of properties of TSTs or of multikey quickselect.

The theoretical work in this chapter could be complemented with implementations and experimentation. In particular, the proposed enhancements for fast string comparisons are amenable to being coded on top of common comparison-based STL implementations for dictionaries, sorting and selection. For instance, the practicability of implementing CABSTs on top of a red-black tree implementation for STL dictionaries was shown in the master thesis of F. Martínez [66], directed by the author of this PhD thesis. That thesis contributed a thorough implementation and several performance experiments. The results there support those in [20] for ABSTs, in which performance always improves with respect to plain BSTs. In particular, CABSTs obtain further considerable speedups compared to ABSTs, in particular for big data sets. Overall, the results in [66] show that, as expected, avoiding string lookups does improve performance in practice.

The experimental work for CABSTs could also be extended to draw an experimental comparison against burst tries [48]. Burst tries have been shown to be very fast (the fastest) in practice for common data sets. Yet, like any kind of trie, they do not handle skewed strings well (in particular, long strings sharing long prefixes). For this comparison, CABSTs should ideally be implemented on top of a cache-conscious data structure as well (e.g., B-trees).

Finally, the ideas in this chapter could be combined with parallel algorithms and data structures. We follow this approach in Chapter 9.

Chapter 5

Multikey quickselect

Selection is an important problem closely related to sorting. Given an unsorted array of n elements, a rank r between 1 and n, and an order function, selection returns the r-th element in the given order. Quickselect, the counterpart of quicksort for selection, is a widely used efficient algorithm for generic keys. For a thorough analysis of its expected number of (atomic) comparisons and swaps see, e.g., [56, 64, 65].

In this chapter, we consider the selection problem on strings under lexicographical order. Note that quickselect is not very efficient in this particular case, because it treats keys as atomic, which implies making redundant character comparisons. For the expected number of character comparisons, see e.g. [100]. Note also that enhancing quickselect for strings, as shown in Chapter 4, needs extra space linear in the array size. Radixselect, the counterpart of radixsort for selection (see [63] for an analysis), also requires extra space or, alternatively, two passes over the data per iteration.

We propose another algorithm: multikey quickselect (MKQSEL in the following), the counterpart of multikey quicksort [9] (MKQSORT in the following) for selection. See Section 2.3 for an overview of MKQSORT and other efficient string sorting algorithms. Analogously to MKQSORT, MKQSEL benefits from the internal representation of strings to avoid redundant comparisons. Besides, it partitions its input into three sets according to a given pivot value, and then recursively proceeds using a divide-and-conquer approach. As a result, it makes almost as few character comparisons as radixselect, is in-place, and is as easy to implement as quickselect. Thus, like MKQSORT, it is expected to be rather fast in practice.

This chapter is organized as follows. In Section 5.1, we describe MKQSEL in more de- tail. Then, in Section 5.2, we analyze its average number of comparisons and swaps for two flavors of the selection problem. Furthermore, we propose some algorithmic enhancements that also apply to MKQSORT. We present them in Section 5.3, and we analyze them in Section 5.4. Finally, in Section 5.5, we compare all the algorithms presented in this chapter and provide some hints on implementation. Section 5.6 sums up some conclusions, and provides some hints on future work.


    // Pre: 1 <= r <= n
    int mkqsel(arrayType& v, int n, int j, int r) {
        if (n < N) {
            Select and return the r-th smallest string of v using any algorithm.
        } else {
            Pick a partitioning value p (the so-called pivot).
            Ternary partition v on the j-th character w.r.t. p to form v<, v= and v>.
            Let n<, n= and n> be respectively the sizes of v<, v= and v>.
            if (r <= n<)           return mkqsel(v<, n<, j, r);
            else if (r > n< + n=)  return mkqsel(v>, n>, j, r - n< - n=);
            else if (p != EOS)     return mkqsel(v=, n=, j + 1, r - n<);
            else                   return v[r];
        }
    }

Figure 5.1: Pseudocode description for ternary MKQSEL.

5.1 MKQSEL algorithm

We call ternary MKQSEL the natural derivation of (ternary) MKQSORT for the selection problem. A pseudocode description can be found in Figure 5.1. There, v is a linear array (or subarray) with n strings. In our description, we consider the common pointer representation in which the string end is marked with a special symbol (which we denote by EOS), as in C-like strings (i.e., char*). Assuming that all the strings in v are identical in the first j−1 characters, the algorithm returns the r-th lexicographically smallest string of v. Initially, j = 1.

We consider two variants of the selection problem. In the partitioned output variant, the final array must become binary partitioned with respect to v[r]: the elements in v[1..r−1] must be less than or equal to v[r], and the elements in v[r+1..n] must be greater than or equal to v[r] (this coincides with the definition of STL nth_element; see the C++ Standard [50] for further reference). In the only selection variant, we impose no condition on the final order of the elements in the array.

Furthermore, as with sorting, the pivot can be chosen in many ways. We can pick p as the j-th character of (say) the first of the remaining strings. Alternatively, the string from which to extract p could be chosen at random. The former choice is simpler, while the latter is safer when the input is biased. Also analogously to sorting, it might be worthwhile in practice to switch to another selection algorithm when n < N, for some constant N. Our analysis is valid for any such algorithm and choice of N.

We now adapt split-end partitioning, presented in [8] and used in [9], to MKQSEL. The first step of this method maintains the invariant shown in Figure 5.2. Initially, a = b = 1 and c = d = n. While a string v[b] with v[b][j] > p is not found, we increment b. Then, while a string v[c] with v[c][j] < p is not found, we decrement c. We then swap v[b] with v[c]. This is repeated while b < c.

Figure 5.2: Split-end partitioning.

Besides, whenever a string x with x[j] = p is found, it is swapped towards one of the ends, either incrementing a (if x is on the left side) or decrementing d (otherwise). At the beginning of the second step, the array consists of $v_=^\ell$, $v_<$, $v_>$ and $v_=^r$, in this order. What is left now differs from the sorting case, and depends on the variant of the selection problem we are considering. In the partitioned output variant, $v_=^\ell$ must be moved after $v_<$ only when selection follows either the branch with $v_<$ or the branch with $v_=$. Symmetrically, $v_=^r$ must be moved before $v_>$ only when selection follows either the branch with $v_>$ or the branch with $v_=$. In the only selection variant, we only ask for the value of the sought element, so fewer movements are required: $v_=^\ell$ and $v_=^r$ must be joined only when selection follows the branch with $v_=$; moreover, it is enough to move the smaller of the two subarrays next to the other. No other moves are needed.
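The first step can be sketched as follows (hypothetical, 0-based code following the invariant described above; the actual implementation may differ in details):

    #include <utility>

    // Split-end partitioning, first step: values equal to p (at character j)
    // are parked at both ends while smaller/greater values are exchanged.
    // On exit the array looks like [ = | < | > | = ].
    void splitEndStep1(char** v, int n, int j, char p,
                       int& a, int& b, int& c, int& d) {
        a = b = 0; c = d = n - 1;
        for (;;) {
            while (b <= c && v[b][j] <= p) {
                if (v[b][j] == p) std::swap(v[a++], v[b]);
                ++b;
            }
            while (c >= b && v[c][j] >= p) {
                if (v[c][j] == p) std::swap(v[c], v[d--]);
                --c;
            }
            if (b > c) break;
            std::swap(v[b++], v[c--]);
        }
    }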

5.2 Analysis of ternary MKQSEL

In this section, we compute the cost of MKQSEL. Here we assume that n infinitely long strings are drawn independently from a uniform random distribution on the universe of strings Γ∞, where Γ is the character alphabet and $C \ge 2$ is its cardinality. Considering infinite strings is a common technical convenience; e.g., it is used in the analysis of Ternary Search Trees (TSTs) [17], whose construction is isomorphic to MKQSORT. Specifically, we calculate the asymptotic expected number of comparisons and swaps for the partitioned output and the only selection cases, under the usual assumption that each rank is equally likely to be selected. We assume that all swap operations have the same cost, but the analysis could be smoothly adapted to batch-swap operations [55], which save roughly a third of the moves for consecutive ranges by interleaving the operations. Finally, we consider that the pivot is chosen as the first element of the remaining array; under our hypotheses, this pivot strategy leads to costs asymptotically equivalent to choosing the element at random from the remaining array.

5.2.1 Notation

Let $T_k(n)$ denote the expected cost of a call to MKQSEL with n strings, where k is the number of possible values left for the current character after the previous calls. We thus have $1 \le k \le C$, and k = C initially. Let $t_k(n)$ be the so-called toll function, i.e., the non-recursive cost of $T_k(n)$. It includes the constant cost of picking the pivot, plus the cost of partitioning. For every $0 \le \ell \le n$, let $P(n,\ell,p) = \binom{n}{\ell} p^{\ell} (1-p)^{n-\ell}$ be the probability that a binomial random variable with n Bernoulli trials, each with independent probability p of success, achieves exactly $\ell$ successes. In the following, we use i to denote the cardinal position (starting at 0) of the value of the pivot in a subalphabet of cardinality k. For instance, if we knew from previous calls that the current n strings could only have values from the range [‘e’,‘h’] in the current (j-th) character, then i would range from 0 to 3. For the proofs, the following notation is useful. Assume $T_0(n) = 0$. The expected contribution of the calls to $v_<$ when the character i is chosen as pivot is

$$S_1\{T\}(i,k,n) = \sum_{\ell=0}^{n} P(n,\ell,i/k) \cdot \frac{\ell}{n} \cdot T_i(\ell).$$

Also, $S_1\{Y\}(i,k,n)$ denotes the above expression with each $T_i(\ell)$ replaced by $Y_i(\ell)$, and $S_1\{|Z|\}(i,k,n)$ denotes the same expression with each $T_i(\ell)$ replaced by $|Z_i(\ell)|$. Similarly, for the calls to $v_>$ we have

$$S_2\{T\}(i,k,n) = \sum_{r=0}^{n} P(n,r,(k-i-1)/k) \cdot \frac{r}{n} \cdot T_{k-i-1}(r),$$

and for the calls to $v_=$ (which do not depend on i) we have

$$S_3\{T\}(k,n) = \sum_{m=0}^{n} P(n,m,1/k) \cdot \frac{m}{n} \cdot T_C(m).$$

Altogether, by linearity of expectation,

$$T_k(n) = t_k(n) + \frac{1}{k} \sum_{i=0}^{k-1} \Big( S_1\{T\}(i,k,n) + S_2\{T\}(i,k,n) + S_3\{T\}(k,n) \Big).$$

Let

$$S\{T\}(k,n) = S_3\{T\}(k,n) + \frac{2}{k} \sum_{i=1}^{k-1} S_1\{T\}(i,k,n).$$

Then, by symmetry, we get

$$T_k(n) = t_k(n) + S\{T\}(k,n)$$

for every $n \ge N$, with $T_k(n)$ equal to any value for $1 \le n < N$.

5.2.2 A recurrence for the cost of ternary MKQSEL

Let us consider the cost due to a recursive call on $v_<$ in Algorithm 5.1. Under our hypotheses, any i between 0 and k−1 is equally likely to be the pivot. For every $0 \le \ell \le n$, the probability that exactly $\ell$ strings have a j-th character smaller than i is $P(n,\ell,i/k)$. Moreover, the probability that the sought string belongs to $v_<$ is $\ell/n$. In that case, we must perform a recursive call with $n' = \ell$ (and implicitly $k' = i$; note that Algorithm 5.1 neither knows nor uses k). The cost of the second recursive call in Algorithm 5.1 is absolutely symmetrical. Regarding the third call, the probability that exactly m strings have a j-th character equal to i is $P(n,m,1/k)$, and the probability of following that branch is $m/n$. In that case, we must perform a recursive call with $n' = m$ (and $k' = C$). Putting all this together, and by linearity of expectation, we get

$$T_k(n) = t_k(n) + \sum_{m=0}^{n} P\left(n,m,\frac{1}{k}\right) \frac{m\,T_C(m)}{n} + \sum_{i=0}^{k-1} \sum_{\ell=0}^{n} P\left(n,\ell,\frac{i}{k}\right) \frac{2\,\ell\,T_i(\ell)}{kn} \qquad (5.1)$$

for every $n \ge N$, with $T_k(n)$ equal to some value for $1 \le n < N$.

5.2.3 Toll functions

Here we list and compute the expected values of the toll functions $t_k(n)$ used in (5.1) under several cost measures. We start by stating some useful properties of the resulting partitioning.

Lemma 5.2.1. Consider the pivot value at a fixed position i. The expected size of $v_<$ is $n_< = in/k$. The expected size of $v_>$ is $n_> = (k-i-1)n/k$. The expected size of $v_=^{\ell}$ is $n_=^{\ell} = in/(k(k-1))$ if $k > 1$, with $n_=^{\ell} = n$ for $k = 1$. The expected size of $v_=^{r}$ is $n_=^{r} = (k-i-1)n/(k(k-1))$ if $k > 1$, with $n_=^{r} = 0$ for $k = 1$.

Proof. The probabilities for a value to be equal to, smaller than, or greater than the pivot are $1/k$, $i/k$ and $(k-i-1)/k$ respectively, so we get the first two results. Moreover, we have $n_=^{\ell} = (n_< + n_=^{\ell})/k$ and $n_=^{r} = (n_> + n_=^{r})/k$. Solving the equations, we get the last two results.

Corollary 5.2.2. Fix the pivot position to i. The expected size of $v_=^{\ell}$ plus $v_<$ is $n_{\le} = in/(k-1)$ if $k > 1$, with $n_{\le} = n$ for $k = 1$. The expected size of $v_=^{\ell}$ plus $v_=^{r}$ is $n_= = n/k$.

Each element is compared exactly once in partitioning. The high-level description of the partitioning method uses ternary character comparisons, i.e., the outcome of a comparison is ‘less than’, ‘equal to’ or ‘greater than’. The next fact holds for any string distribution.

Fact 5.2.3. The number of ternary character comparisons in one partitioning is $n + O(1)$.

However, in practice, ternary comparisons are usually implemented with two binary comparisons. Therefore, for the sake of completeness, we count whenever a second binary comparison would be needed after a first binary comparison. This happens on the left side when a value is found to be less than or equal to the pivot, and analogously on the right side.
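A hedged illustration of this counting (not the thesis code; names are ours): one ternary character comparison implemented with at most two binary comparisons, as counted on the left side of the partitioning. The second binary comparison is executed exactly when the character is less than or equal to the pivot.

    enum Outcome { LESS, EQUAL, GREATER };

    // Left-side ternary comparison built from two binary comparisons.
    inline Outcome ternary_compare_left(char c, char pivot) {
        if (pivot < c) return GREATER;   // first binary comparison
        if (c < pivot) return LESS;      // second one, needed only when c <= pivot
        return EQUAL;
    }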

Lemma 5.2.4. The expected number of second binary character comparisons is $c_k(n) = 2(k+1)n/(3k) + O(1)$ if $k > 1$, with $c_1(n) = n$.

Proof. Consider a fixed pivot i. Second binary comparisons are performed on the left side with probability $(i+1)/k$. By symmetry, and using Corollary 5.2.2, $c_k(n) = \frac{1}{k} \sum_{i=0}^{k-1} 2 \cdot \frac{i+1}{k} \cdot \frac{in}{k-1}$, and the lemma follows.

The partitioning algorithm is made of two steps. Let $p_k(n)$ be the expected number of swaps of the first one, and let $v_k(n)$ be the expected number of swaps of the second one.

Lemma 5.2.5. $p_k(n) = (k+4)n/(6k) + O(1)$ if $k > 1$, with $p_1(n) = n + O(1)$.

Proof. Fix a pivot i. Each value greater than i that before partitioning had a rank less than $n_{\le}$ causes one swap. Using Corollary 5.2.2, the expected number of swaps due to this source is $\frac{k-i-1}{k} \cdot \frac{in}{k-1}$. On the other hand, every string in $v_=^{\ell}$ and $v_=^{r}$ causes one swap. Averaging over the i's, the lemma follows.

Note that $v_k(n)$ depends on whether the partitioned output or the only selection variant is considered, while $p_k(n)$ is independent of this choice.

Lemma 5.2.6. Assuming the partitioned output variant, $v_k(n) = 2(k+1)n/(3k^2)$ if $k > 1$, with $v_1(n) = 0$.

Proof. When $k = 1$ no swap is done. Otherwise, fix a pivot i. If the $v_<$ or the $v_=$ branch is followed, which happens with probability $(i+1)/k$, each string in $v_=^{\ell}$ must be swapped to the middle. By symmetry with $v_=^{r}$, using Lemma 5.2.1 for the expected size of $v_=^{\ell}$, and averaging over all i's, the lemma follows.

In the following, for any boolean expression b, let $[b]$ be the Iverson bracket for b, i.e., $[b]$ evaluates to 1 when b is true and to 0 when b is false.

Lemma 5.2.7. With the only selection variant, $v_k(n) = \frac{n}{4k^2} - \frac{n}{4k^2(k - [k \text{ is even}])}$.

Proof. When $k = 1$ no swap is done. Otherwise, swaps are performed only when the $v_=$ branch is followed, which happens with probability $1/k$. In that case, the smallest of $v_=^{\ell}$ and $v_=^{r}$ must be moved next to the other. Fix a pivot i. If k is even, we must sum (twice by symmetry) all $n_=^{\ell}$ corresponding to $i = 0..k/2-1$. If k is odd, we must sum (again twice by symmetry) all $n_=^{\ell}$ corresponding to $i = 0..(k-3)/2$, plus just once the $n_=^{\ell}$ corresponding to the middle $i = (k-1)/2$. In both cases we must multiply by $1/k$, the probability of every pivot. Using the value for $n_=^{\ell}$ in Lemma 5.2.1 and arranging things, the lemma follows.

Finally, putting the results in this section together, we obtain the toll functions corresponding to the expected number of swaps in one partitioning.

Corollary 5.2.8. The expected number of swaps for the partitioned output variant is $t_k(n) = \big( \frac{1}{6} + \frac{4}{3k} + \frac{2}{3k^2} \big)\, n + O(1)$ if $k > 1$, with $t_1(n) = n + O(1)$.

Corollary 5.2.9. The expected number of swaps for the only selection variant is $t_k(n) = \big( \frac{1}{6} + \frac{2}{3k} + \frac{1}{4k^2} - \frac{1}{4k^2(k - [k \text{ is even}])} \big)\, n + O(1)$, with $t_1(n) = n + O(1)$.

5.2.4 Solving the recurrence for ternary MKQSEL

In this section, we solve (5.1) for several toll functions. We first present a couple of technical lemmas needed later.

Lemma 5.2.10. Suppose $t_k(n) = O(n)$ for every $1 \le k \le C$. Then, $T_k(n) = O(n)$ for every $1 \le k \le C$.

Proof. Using the hypothesis, for every $1 \le k \le C$ there exist $n_k \ge N$ and $B_k$ large enough such that $|t_k(n)| \le B_k n$ for every $n \ge n_k$. Let $N' = \max\{2(C+1), n_1, n_2, \ldots, n_C\}$, let $B' = \max\{2k^2 B_k/(C-1) : 1 \le k \le C\}$, let $B'' = \max\{k\,|T_k(n)|/(Cn) : 1 \le k \le C,\ 1 \le n < N'\}$, and let $B = \max\{B', B''\}$. With all these definitions, we get that for all $1 \le k \le C$: if $n \ge N'$, then $|t_k(n)| \le B(C-1)n/(2k^2)$; and if $n < N'$, then $|T_k(n)| \le BCn/k$. These properties will be useful in some steps below.

Now let $Y_k(n) = |T_k(n)| - BCn/k$ for all $1 \le k \le C$. Substituting into (5.1) and using the definitions of $S\{T\}(k,n)$, $S_3\{T\}(k,n)$ and $S_1\{T\}(i,k,n)$, for $n \ge N'$ we have $Y_k(n) \le I_k(n) + S\{Y\}(k,n)$, where
$$I_k(n) = \frac{B(C-1)n}{2k^2} + \sum_{m=0}^{n} P(n,m,1/k) \cdot \frac{m}{n} \cdot \frac{BCm}{C} + \frac{2}{k} \sum_{i=1}^{k-1} \sum_{\ell=0}^{n} P(n,\ell,i/k) \cdot \frac{\ell}{n} \cdot \frac{BC\ell}{i} - \frac{BCn}{k} .$$

Therefore,
$$\frac{2k^2}{B}\, I_k(n) = (C-1)n + \frac{2k^2}{n} \sum_{m=0}^{n} P(n,m,1/k)\, m^2 + \frac{4Ck}{n} \sum_{i=1}^{k-1} \frac{1}{i} \sum_{\ell=0}^{n} P(n,\ell,i/k)\, \ell^2 - 2Ckn .$$
For the next step, we use the well-known identity

$$\sum_{c=0}^{n} P(n,c,p)\, c^2 = pn(pn + 1 - p) \qquad (5.2)$$
for any $0 \le p \le 1$ to get
$$\sum_{m=0}^{n} P(n,m,1/k)\, m^2 = \frac{n(n+k-1)}{k^2} ,$$
and also

$$\sum_{i=1}^{k-1} \frac{1}{i} \sum_{\ell=0}^{n} P(n,\ell,i/k)\, \ell^2 = \sum_{i=1}^{k-1} \frac{n(in+k-i)}{k^2} = \frac{(k-1)(n+1)n}{2k} .$$

Putting all this together produces

$$\frac{2k^2}{B}\, I_k(n) = (1-C)n + 2(C+1)(k-1) \le (C-1)\big(2(C+1) - n\big) \le 0 ,$$
which implies $I_k(n) \le 0$ and $Y_k(n) \le S\{Y\}(k,n)$ for every $n \ge N'$. Furthermore, we have $Y_k(n) \le 0$ for every $n < N'$. Hence, a trivial proof by induction on n shows that $Y_k(n) \le 0$ for every n, from which we deduce $|T_k(n)| \le BCn/k = O(n)$ for every n and every k.

Lemma 5.2.11. Suppose $t_k(n) = o(n)$ for every $1 \le k \le C$. Then, $T_k(n) = o(n)$ for every $1 \le k \le C$.

Proof. For a binomial random variable $R(n,p)$ with n trials and probability p, and for every $\varepsilon > 0$, a Chernoff bound yields

$$\Pr\{R(n,p) \ge (p+\varepsilon)n\} \le e^{-\varepsilon^2 n/2} .$$
For every $1 \le k \le C$ and every $1 \le i < k$, the probabilities in the expression for $S_1\{T\}(i,k,n)$ are those for $R(n,i/k)$. Let $f = 1 - 1/(2C)$. Substituting above with $\varepsilon = f - p$, we get the bound $\Pr\{R(n,p) \ge fn\} \le e^{-(f-p)^2 n/2} \le e^{-n/(8C^2)}$.

On the other hand, for every i we can split the recurrence for $S_1\{|T|\}(i,k,n)$ into two sets of recursive calls, like this:
$$S_1\{|T|\}(i,k,n) = \sum_{0 \le \ell \le fn} P(n,\ell,i/k) \cdot \frac{\ell}{n} \cdot |T_i(\ell)| + \sum_{fn < \ell \le n} P(n,\ell,i/k) \cdot \frac{\ell}{n} \cdot |T_i(\ell)| .$$

Taking into account that $\ell \le n$, and $|T_i(\ell)| = O(n)$ (by Lemma 5.2.10), the second sum above can be bounded by $e^{-n/(8C^2)}\, O(n) = o(n)$. The same bound can be proved for the recursive calls to $|T_C(m)|$ with $m \ge fn$ in the expression for $S_3\{|T|\}(k,n)$.

Now, define $M(n) = \max\{|T_k(n)| : 1 \le k \le C\}$. Let $a_n$ be any index between 1 and $fn$ where $M(a_n)$ is maximum. Then

$$M(n) \le \max\{|t_k(n)| + S\{|T|\}(k,n) : 1 \le k \le C\} \le o(n) + M(a_n) \max_{1 \le k \le C} \left\{ \sum_{0 \le m < fn} P(n,m,1/k) \cdot \frac{m}{n} + \frac{2}{k} \sum_{i=1}^{k-1} \sum_{0 \le \ell < fn} P(n,\ell,i/k) \cdot \frac{\ell}{n} \right\} ,$$
because $|T_C(m)| \le M(m) \le M(a_n)$ and also $|T_i(\ell)| \le M(\ell) \le M(a_n)$. Taking into account that the sum of weights for each k is bounded by 1, we get $M(n) \le o(n) + M(a_n)$. Now, the solution to the recurrence $M(n) = o(n) + M(a_n)$ is $M(n) = o(n)$ (see e.g., [80, Theorem 5.3 and Lemma 5.4]). Since $|T_k(n)| \le M(n)$, this finishes the proof.

Lemma 5.2.12. Suppose $t_k(n) = f_k \cdot n + o(n)$ for every $1 \le k \le C$. Then, $T_k(n) = A_k \cdot n + o(n)$ for every $1 \le k \le C$, where $A_k$ is defined by
$$A_k = f_k + \frac{A_C}{k^2} + \frac{2}{k^3} \sum_{i=1}^{k-1} i^2 A_i . \qquad (5.3)$$

Proof. Let $Z_k(n) = T_k(n) - A_k \cdot n$ for all $1 \le k \le C$. Substituting into (5.1) and using the definitions of $S\{T\}(k,n)$, $S_3\{T\}(k,n)$ and $S_1\{T\}(i,k,n)$, for $n \ge N'$ we have $Z_k(n) = J_k(n) + S\{Z\}(k,n)$, where
$$J_k(n) = f_k \cdot n + o(n) + \sum_{m=0}^{n} P(n,m,1/k) \cdot \frac{m}{n} \cdot A_C \cdot m + \frac{2}{k} \sum_{i=1}^{k-1} \sum_{\ell=0}^{n} P(n,\ell,i/k) \cdot \frac{\ell}{n} \cdot A_i \cdot \ell - A_k \cdot n .$$
Using (5.2), the first sum above is equal to $\frac{A_C}{k^2}(n+k-1)$, and the second sum above is equal to $\frac{2}{k^3} \sum_{i=1}^{k-1} i^2 A_i (n + k/i - 1)$. Therefore, by the definition of the $A_k$'s in (5.3),
$$J_k(n) = o(n) + \frac{A_C(k-1)}{k^2} + \frac{2}{k^3} \sum_{i=1}^{k-1} i A_i (k-i) = o(n) .$$

Thus, we have $Z_k(n) = o(n) + S\{Z\}(k,n)$, which is $o(n)$ by Lemma 5.2.11. The lemma follows.

Lemma 5.2.13. Let $C \ge 2$ be any integer constant, and let $f_k$ be any function over $[1..C]$. Then, the solution to (5.3) is

$$A_k = \frac{(k+1)E_k}{k^2} + \frac{(C+1)E_C}{C(C-1)k} , \qquad (5.4)$$
where $E_k$ is defined by the recurrence

$$E_k = \frac{k^3 f_k - (k-1)^3 f_{k-1}}{(k+1)k} + E_{k-1} \qquad (5.5)$$
for $k \ge 2$, with $E_1 = f_1/2$. In particular,
$$A_C = \frac{(C+1)E_C}{C(C-1)} . \qquad (5.6)$$
Proof. First, note that every $A_k$ depends on $A_C$. To remove this dependence, define $B_k = A_k - A_C/k$, which yields
$$B_k = f_k + \frac{A_C}{k^2} + \frac{2}{k^3} \sum_{i=1}^{k-1} i^2 \Big( B_i + \frac{A_C}{i} \Big) - \frac{A_C}{k} = f_k + \frac{2}{k^3} \sum_{i=1}^{k-1} i^2 B_i + \frac{A_C}{k^2} \Big( 1 - k + \frac{2}{k} \sum_{i=1}^{k-1} i \Big) = f_k + \frac{2}{k^3} \sum_{i=1}^{k-1} i^2 B_i .$$
This is a recurrence that can be solved by more or less standard manipulations. To begin with, define

$$D_k = k^3 B_k = k^3 f_k + 2 \sum_{i=1}^{k-1} \frac{D_i}{i} .$$

Then, we have $D_1 = f_1$, and for every $k \ge 2$,
$$D_k - D_{k-1} = k^3 f_k - (k-1)^3 f_{k-1} + \frac{2 D_{k-1}}{k-1} ,$$
that is,

$$D_k = k^3 f_k - (k-1)^3 f_{k-1} + \frac{(k+1) D_{k-1}}{k-1} .$$

As a last step, let
$$E_k = \frac{D_k}{(k+1)k} = \frac{k^3 f_k - (k-1)^3 f_{k-1}}{(k+1)k} + E_{k-1}$$

for $k \ge 2$, with $E_1 = f_1/2$. Now, we observe that $D_k = (k+1)k E_k$, $B_k = (k+1)E_k/k^2$, and $A_k = (k+1)E_k/k^2 + A_C/k$. Hence, we can deduce that $A_C = (C+1)E_C/C^2 + A_C/C$, which yields $A_C = (C+1)E_C/(C(C-1))$.

Let $H_k = \sum_{1 \le i \le k} 1/i$ denote as usual the k-th harmonic number.

Lemma 5.2.14. Let $C \ge 2$, and let $f_k = \alpha + \beta/k + \gamma/k^2$ for every $2 \le k \le C$, where $\alpha$, $\beta$ and $\gamma$ are any constants. Then, the solution to (5.3) is

$$A_C = 3\alpha + \frac{f_1 + 38\alpha - 10\beta + 2\gamma}{3(C-1)} + \frac{6(\beta - 3\alpha)(C+1)H_C + f_1 - \alpha - \beta - \gamma}{3C(C-1)} ,$$

$A_1 = f_1 + A_C$, and

$$A_k = 3\alpha + \frac{3A_C + f_1 + 29\alpha - 10\beta + 2\gamma}{3k} + \frac{6(\beta - 3\alpha)(k+1)H_k + f_1 - \alpha - \beta - \gamma}{3k^2}$$
for $2 \le k < C$. (This last expression also holds for $k = C$.)

Proof. We use Lemma 5.2.13 with $f_k = \alpha + \beta/k + \gamma/k^2$ for every $k \ge 2$. Substituting into (5.5), we get $E_2 = (f_1 + 4\alpha + 2\beta + \gamma)/3$, and
$$E_k = \frac{\alpha(3k^2 - 3k + 1) + \beta(2k-1) + \gamma}{(k+1)k} + E_{k-1} = 3\alpha + \frac{2\beta - 6\alpha}{k} + \frac{7\alpha - 3\beta + \gamma}{(k+1)k} + E_{k-1}$$
for $k \ge 3$. Iterating, the solution of this recurrence is
$$E_k = 3\alpha(k-2) + (2\beta - 6\alpha)\Big( H_k - \frac{3}{2} \Big) + (7\alpha - 3\beta + \gamma)\Big( \frac{1}{3} - \frac{1}{k+1} \Big) + E_2 .$$
Note that this last equality holds for $k \ge 2$. Now, it is enough to plug this expression and the corresponding one for $E_C$ into (5.4) and (5.6) and make some simplifications to finish the proof.

We also need the following specific lemma to compute the expected number of swaps of the only selection variant. Its proof is similar to that of Lemma 5.2.14.

Lemma 5.2.15. Let $C \ge 2$, $f_1 = 0$, $f_k = 1/(k^2(k-1))$ for all even $2 \le k \le C$, and $f_k = 1/k^3$ for all odd $2 \le k \le C$. Then, the solution of (5.3) for $A_C$ is
$$A_C = \frac{(C+1)\big( 4(H_C - H_{\lfloor C/2 \rfloor}) - 2 \big)}{3C(C-1)} + \frac{(2C-1)\,[C \text{ is even}]}{3C(C-1)^2} - \frac{(2C+1)\,[C \text{ is odd}]}{3C^2(C-1)} .$$
Proof. Again, we use Lemma 5.2.13, this time with $f_1 = 0$, $f_k = 1/(k^2(k-1))$ for even k, and $f_k = 1/k^3$ for odd $k > 1$. Substituting into (5.5), we get $E_1 = 0$, $E_2 = 1/3$, and
$$E_k = \frac{k/(k-1) - 1}{(k+1)k} + \frac{1 - (k-2)/(k-3)}{k(k-1)} + E_{k-2}$$
for even $k > 2$. From this, and after a lengthy manipulation, we can get

$$E_k = \frac{4(H_k - H_{k/2}) - 2}{3} + \frac{2k-1}{3(k+1)(k-1)}$$
for even $k > 2$. (Alternatively, this can be proved by induction.) Now, for odd $k > 1$, it is enough to use
$$E_k = \frac{1 - (k-1)/(k-2)}{(k+1)k} + E_{k-1}$$
and simplify the resulting expression to get

$$E_k = \frac{4(H_k - H_{\lfloor k/2 \rfloor}) - 2}{3} - \frac{2k+1}{3(k+1)k} .$$

From here, it is easy to obtain the result claimed for $A_C$.

Finally, the following theorem characterizes the cost of ternary MKQSEL, putting together the previous results.

Theorem 5.2.16. The cost of ternary MKQSEL is described by the following statements:

• The expected number of ternary comparisons is
$$\Big( 3 + \frac{13}{C-1} - \frac{6(C+1)H_C}{C(C-1)} \Big)\, n + o(n) .$$

• The expected number of second binary comparisons is
$$\Big( 2 + \frac{59}{9(C-1)} - \frac{24(C+1)H_C + 1}{9C(C-1)} \Big)\, n + o(n) .$$

• The expected number of swaps considering the partitioned output variant is
$$\Big( \frac{1}{2} - \frac{14}{9(C-1)} + \frac{30(C+1)H_C - 7}{18C(C-1)} \Big)\, n + o(n) .$$

• The expected number of swaps considering the only selection variant is
$$\Big( \frac{1}{2} + \frac{7}{18(C-1)} + \frac{12(C+1)H_{\lfloor C/2 \rfloor} + 6C + 5}{36C(C-1)} - \frac{(2C-1)\,[C \text{ is even}]}{12C(C-1)^2} + \frac{(2C+1)\,[C \text{ is odd}]}{12C^2(C-1)} \Big)\, n + o(n) .$$

Proof. $t_k(n)$ is given in Fact 5.2.3, Lemma 5.2.4 and Corollaries 5.2.8 and 5.2.9. In all cases, $t_k(n)$ is as required in Lemma 5.2.12. The resulting $f_k$'s are of the form in Lemma 5.2.14 or Lemma 5.2.15. Combining everything, the theorem follows.

5.3 MKQSEL algorithm revisited

In the previous analyses, k is the cardinality of the remaining alphabet for the current character. But Algorithm 5.1 does not actually use k. In this section, we show how multikey algorithms can benefit from knowing the value of k.

Let the global constants F and L be respectively the first and last values of the alphabet. Thus, $C = L - F + 1$. To keep track of k, we add two parameters, f and l, which are respectively the first and last possible alphabetic values for the $j$-th character. This way, we have $k = l - f + 1$. Initially, $f = F$ and $l = L$.

The recursive calls are modified as follows. If the $v_<$ branch is followed, then $f' = f$ and $l' = p - 1$. If the $v_>$ branch is followed, then $f' = p + 1$ and $l' = l$. If the $v_=$ branch is followed, then $f' = F$ and $l' = L$.
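A minimal sketch of this bookkeeping (illustrative names, not the thesis code): each branch either narrows the possible character range [f, l] or resets it to the full alphabet when moving to the next character.

    struct Range { int f, l; };           // possible values of the j-th character; k = l - f + 1

    Range after_less(Range cur, int p)    { return { cur.f, p - 1 }; }  // v< branch
    Range after_greater(Range cur, int p) { return { p + 1, cur.l }; }  // v> branch
    Range after_equal(int F, int L)       { return { F, L }; }          // v= branch: next character, full alphabet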

5.3.1 Specializations for small k

When $k = 1$, all the strings are equal with respect to the current character. Thus, we can avoid partitioning and directly proceed by the $v_=$ branch. This roughly saves n redundant ternary comparisons, and reduces the $A_C$ term in Lemma 5.2.12 by $\frac{C+1}{3C(C-1)}$. So, as intuition suggests, this specialization is of special interest when C is small. Note that this improvement also applies to MKQSORT.

When $k = 2$, one of the subarrays produced by the ternary partition will be empty. Therefore, using a binary exchange partition [67] would reduce the cost. But instead of computing the savings of this enhancement, we propose another algorithm for selecting strings which, in some way, directly incorporates those specific improvements. Certainly, ternary partitioning can be replaced by binary partitioning when k is known.

    // Pre: 1 ≤ r ≤ n
    int mkqsel_binary(arrayType& v, int n, int j, int r, int f, int l) {
        if (n < N) {
            Select and return the r-th smallest string of v using any algorithm.
        } else if (l − f > 0 and there exists a partitioning value p > f) {
            Binary partition v on the j-th character w.r.t. p to form v< and v≥.
            Let n< and n≥ be respectively the sizes of v< and v≥.
            if (r ≤ n<) return mkqsel_binary(v<, n<, j, r, f, p − 1);
            else return mkqsel_binary(v≥, n≥, j, r − n<, p, l);
        } else if (p ≠ EOS) {
            return mkqsel_binary(v, n, j + 1, r, F, L);
        } else return v[r];
    }

Figure 5.3: Pseudocode description for binary MKQSEL.

5.3.2 Binary MKQSEL and MKQSORT

Algorithm progress in MKQSEL and MKQSORT is trivially guaranteed by using ternary partitioning (i.e., there is at least one element equal to the pivot value). In return, ternary partitioning needs to swap some strings twice (those in $v_=$) and the comparisons are more expensive than with binary partitioning.

As stated above, ternary partitioning can be replaced by binary partitioning when k is known. We call the resulting algorithms binary MKQSEL (see Figure 5.3) and binary MKQSORT. Algorithm progress is guaranteed by choosing a pivot $p > f$, and incrementing j when $k = 1$. A possible pivot selection algorithm is: scan the array from left to right, until a value greater than f is found. If it is not found, proceed as when $k = 1$. Otherwise, partition the unexamined part of the array, as the strings that have been discarded as pivots are already in a correct place. Note that binary MKQSEL, as it is, produces a binary partitioned output.
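A hedged sketch of that pivot scan (illustrative interface, not the thesis code): it returns the index of the first string whose $j$-th character is strictly greater than f, or $-1$ if there is none. The strings skipped over all have character f at position j, so they are already correctly placed with respect to any pivot $p > f$, and only the remainder of the array needs partitioning.

    #include <string>

    int find_pivot(const std::string* v, int n, int j, char f) {
        for (int i = 0; i < n; ++i)
            if (v[i][j] > f) return i;   // v[i][j] can serve as the pivot p
        return -1;                       // not found: proceed as when k == 1
    }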

5.4 Analysis of binary MKQSEL

In this section, we analyze binary MKQSEL using very similar steps to those in Section 5.2. Therefore, we only point out the main differences.

Let $X_k(n)$ denote the expected cost of a call to binary MKQSEL with n elements, where k is the current number of possible characters. Let $x_k(n)$ be the toll function for $X_k(n)$. To start with, $X_1(n) = X_C(n)$. For $k \ge 2$, any i between 1 and $k-1$ is equally likely to be the pivot. A call into $v_<$ will have $k' = i$, and a call into $v_{\ge}$ will have $k' = k - i$. Taking into account symmetry, this time we get the recurrence
$$X_k(n) = x_k(n) + \sum_{m=0}^{n} P\Big(n,m,\frac{1}{k}\Big) \frac{2m\,X_C(m)}{(k-1)n} + \sum_{i=2}^{k-1} \sum_{\ell=0}^{n} P\Big(n,\ell,\frac{i}{k}\Big) \frac{2\ell\,X_i(\ell)}{(k-1)n} \qquad (5.7)$$
for $2 \le k \le C$ and every $n \ge N$, with $X_k(n)$ equal to some value for $1 \le n < N$.

5.4.1 Toll functions

Here, we compute the expected value of the toll functions $x_k(n)$ used in (5.7). Note that they hold for $k > 1$. As for ternary partitioning, the following fact holds for any string distribution and pivot picking method.

Fact 5.4.1. The number of (binary) character comparisons in one partitioning is n + O(1).

Lemma 5.4.2. The expected number of swaps is $x_k(n) = (k+1)n/(6k) + O(1)$.

Proof. Consider a fixed pivot $i > 0$. Each element that before partitioning had a rank less than $n_<$ and is greater than or equal to i causes one swap. The probability of this event is $(k-i)/k$. Besides, the probability that an element is smaller than i is $i/k$, so $n_< = in/k$. As a result, the expected number of swaps is $(k-i)in/k^2$. Averaging over all i's, the lemma follows.

5.4.2 Solving the recurrence for binary MKQSEL

Here we solve (5.7) for several toll functions. The proofs of the lemmas for the cost of binary MKQSEL are very similar to (in fact, a bit easier than) the corresponding proofs for the cost of ternary MKQSEL (see Section 5.2). Therefore, we only give (or sketch) the few proofs that have some interest.

Lemma 5.4.3. Suppose $x_k(n) = o(n)$ for every $1 \le k \le C$. Then, $X_k(n) = o(n)$ for every $1 \le k \le C$.

Lemma 5.4.4. Suppose $x_k(n) = f_k \cdot n + o(n)$ for every $1 \le k \le C$. Then, $X_k(n) = A_k \cdot n + o(n)$ for every $1 \le k \le C$, where $A_k$ is defined by
$$A_k = f_k + \frac{2 A_C}{k^2(k-1)} + \frac{2}{k^2(k-1)} \sum_{i=2}^{k-1} i^2 A_i \qquad (5.8)$$
for $k \ge 2$, with $A_1 = A_C$.

Lemma 5.4.5. Let $C \ge 2$ be any integer constant, and let $f_k$ be any function over $[2..C]$. Then, the solution to (5.8) is $A_C = E_C/(C-1)$, where $E_k$ is defined by the recurrence
$$E_k = k f_k - \frac{(k-1)(k-2) f_{k-1}}{k} + E_{k-1} \qquad (5.9)$$
for $k \ge 3$, with $E_2 = 2 f_2$.

Proof. We follow exactly the same steps as in Lemma 5.2.13, this time with $B_k = A_k - A_C/k$, which yields $B_k = f_k + \frac{2}{k^2(k-1)} \sum_{i=2}^{k-1} i^2 B_i$; $D_k = k^2(k-1) B_k$, which yields $D_k = k^2(k-1) f_k - (k-1)^2(k-2) f_{k-1} + k D_{k-1}/(k-2)$; and $E_k = D_k/(k(k-1))$, which yields (5.9).

Lemma 5.4.6. Let $C \ge 2$, and let $f_k = \alpha + \beta/k + \gamma/k^2$ for every $2 \le k \le C$, where $\alpha$, $\beta$ and $\gamma$ are any constants. Then, the solution of (5.8) is

$$A_C = 3\alpha + \frac{2(\beta - \alpha)(H_C - 1)}{C-1} + \frac{\gamma}{C} ,$$
and

$$A_k = 3\alpha + \frac{2(\beta - \alpha)H_k - \alpha - 2\beta}{k} + \frac{\gamma(k-1)}{k^2} + \frac{A_C}{k}$$
for $2 \le k < C$. (This last expression also holds for $k = C$.)

Proof. We use Lemma 5.4.5 with $f_k = \alpha + \beta/k + \gamma/k^2$ for every $k \ge 2$. Substituting into (5.9), we get $E_2 = 2\alpha + \beta + \gamma/2$, and
$$E_k = 3\alpha + \frac{2\beta - 2\alpha}{k} + \frac{\gamma}{k(k-1)} + E_{k-1}$$
for $k \ge 3$. Iterating, the solution of this recurrence is
$$E_k = 3\alpha(k-2) + (2\beta - 2\alpha)\Big( H_k - \frac{3}{2} \Big) + \gamma\Big( \frac{1}{2} - \frac{1}{k} \Big) + E_2 .$$
From this, it is easy to obtain the values for $A_k$.

Theorem 5.4.7. On the average, binary MKQSEL performs $n/2 + o(n)$ swaps and $\Big( 3 - \frac{2(H_C - 1)}{C-1} \Big)\, n + o(n)$ comparisons.

5.5 Comparison of MKQSEL algorithms and implementation issues

The analyses presented in this chapter, for uniformly random inputs, allow us to quantitatively compare ternary and binary MKQSEL against each other, and also against other algorithms. Figure 5.4 draws this comparison in graphical form. We also include the results for the number of character comparisons in binary quickselect (in particular, we use the expression in [100, Theorem 2 and Figure 1]).

First, we consider the number of character comparisons. As Figure 5.4 (left) shows, both the average number of ternary comparisons (regarded as atomic) in ternary MKQSEL and of binary comparisons in binary MKQSEL are smaller than 3n, and tend to it as the alphabet cardinality grows. Indeed, 3n is also the average number of element comparisons in quickselect (see e.g., [64] for an analysis). Also, not surprisingly, the average number of character comparisons in quickselect is a decreasing function of the alphabet cardinality. In quickselect, the number of character comparisons is directly related to the length of common prefixes. Indeed, as the alphabet cardinality grows, common prefixes become shorter and shorter, so for large alphabets the number of character comparisons tends to the number of element comparisons, that is, 3n.

With respect to swaps, as Figure 5.4 (right) shows, binary MKQSEL is clearly advantageous. This is specially the case for small alphabet cardinalities and when a partitioned output is required. The average number of swaps, for all variants, tends to $n/2$ as the alphabet cardinality grows. Analogously to comparisons, this coincides with the average number of swaps for quickselect (see e.g., [65] for an analysis).

The computations presented in this chapter assume a uniformly random input. But arguably, other sources should be considered, in order to make a better comparison between

[Figure 5.4 consists of two plots against the alphabet cardinality (2 to 1024), with values scaled to n: the expected number of character comparisons (left) and the expected number of swaps (right), for generic quickselect, for ternary MKQSEL (with and without using k; counting ternary and binary comparisons; partitioned output and only selection variants) and for binary MKQSEL.]

Figure 5.4: Summary of the analysis results for MKQSEL (the values are scaled to n).

selection algorithms for strings. However, note that a random source is among the least advantageous models for our algorithms, which are designed to adapt well to the long common prefixes present in many common real datasets.

Furthermore, some of our algorithms need the alphabet cardinality to compute k, i.e., the cardinality of the remaining alphabet for the current character. When the alphabet cardinality is not known, an upper bound can always be deduced from the value data type. As actual pivots are obtained from the input, algorithm progress is guaranteed. The number of comparisons and swaps in this case would be only slightly higher (unless the alphabet cardinality is very small) than the one computed in this chapter.

Also, alternative algorithms can be described by reconsidering the pivot choice. In our algorithms, it is picked at random from the input. By contrast, we could ignore the actual input and pick a randomly chosen character from the current range. Under our source model, this method would have the same cost. However, it would not be robust for general inputs. In fact, if the source was known to be random, better (but ad-hoc) selection algorithms for strings should be used.

Finally, we provide C++ implementations of the algorithms presented in this chapter, building on the MKQSORT implementation in [9].

5.6 Conclusions and future work

We have presented multikey quickselect, the analog of multikey quicksort for the selection problem. Similarly to multikey quicksort, multikey quickselect combines radix access with quickselect-like partitioning. As a result, it performs efficient string comparisons, it is in-place, and it is easy to implement. We have described several variants of the main algorithm, including ternary and binary partitioning. Also, we have provided a detailed analysis of those variants for the expected number of comparisons and swaps for uniformly random inputs. In general, binary algorithms perform fewer binary comparisons and fewer swaps than the original (ternary) variant. We have also shown how to specialize the basic algorithms to avoid useless operations. Overall, our results suggest that ternary partitioning should be replaced by binary partitioning when the character cardinality is known and small.

Multikey quicksort could also take advantage of the ideas presented in this chapter. Besides, the analysis of the number of swaps is also missing for the original description of (ternary) multikey quicksort in [9], and for the specialized ternary variant for many equal elements in [55].

Our implementation of multikey algorithms could be the starting point of an experimental analysis. In particular, it would be worth comparing all the variants of multikey quicksort and multikey quickselect, as well as other algorithms. For instance, in the case of selection, radixselect and (C)AQSEL (presented in Chapter 4) would be worth considering. However, as far as we know, no implementations of radixselect are available. Making a thorough experimental comparison on string selection would be a contribution in itself. We expect multikey quickselect to be rather competitive in practice, as multikey quicksort is for sorting. In the case of sorting, all the variants of multikey quicksort, radixsort (see e.g., [54]), Burstsort [87] and (C)AQSORT would be worth considering. Moreover, multikey quicksort and the binary partitioning variant of multikey quickselect could be used as specializations for STL sort and nth element, respectively.

Furthermore, redundancy could be added to multikey quickselect and multikey quicksort to boost their cache efficiency. Indeed, adding redundancy to radixsort has been shown to be useful in practice in [54]. Also, we have analyzed its advantages when combined with enhanced comparison-based algorithms and data structures for strings (see Chapter 4). In addition, cache efficiency could be improved by considering a superalphabet (i.e., considering several symbols at once).

Finally, it would be worth devising parallel versions of multikey quicksort and quickselect, and comparing them experimentally against the parallel counterparts of (C)AQSORT and (C)AQSEL (see Chapter 9). In [77], multikey quicksort is used as a building block of a parallel sorting algorithm, but multikey quicksort itself is not parallelized. Anyway, parallelizing the divide-and-conquer work in multikey quicksort is rather straightforward. The challenge is to parallelize the partition. Besides, the latter is the only way to parallelize multikey quickselect. In this thesis, we review, but also present, several efficient parallel partitioning algorithms. All of them concern binary partitioning; in fact, we are not aware of parallel ternary ones. The study of parallel ternary partitioning algorithms would be interesting from both a practical and a theoretical perspective. But without parallel ternary partitioning algorithms, our binary partitioning variants of multikey quicksort and multikey quickselect seem the most plausible candidates for eventual parallelization.

Chapter 6

Parallelization of bulk operations for STL dictionaries

In this chapter, we consider the parallelization of red-black tree based STL dictionaries, focusing on current widely available multi-core architectures.

Red-black trees are a kind of balanced binary search tree commonly used to implement STL dictionaries. In particular, they are used in the STL implementation of the GCC compiler. Fine-grained basic operations, i.e., insertion, query and deletion of a single element, are not worth parallelizing because the parallelization overhead would dominate their logarithmic running time. But often, many elements are inserted at a time, or a new dictionary is constructed and initialized from a large set of elements. Of course, the latter is a special case of the former, namely inserting into an empty dictionary. Nevertheless, we treat both cases explicitly, for best performance.

Specifically, we propose new parallel bulk construction and insertion algorithms for red-black trees. The implementation extends the GCC implementation, uses OpenMP, and is part of the MCSTL [86], a parallel implementation of the STL. See Section 2.4.3 for an overview of the MCSTL and other STL implementations. Our implementation is generic, comprises the four dictionary types available in the STL, and is engineered to provide the best performance for a variety of possible input characteristics. Besides, the performance measurements show its practicability on multi-core computers.

This chapter is organized as follows. We describe the proposed algorithms in Section 6.1, relating them to previous work. Section 6.2 considers the special requirements for STL compliance, and gives the rationale for the decisions taken. We state the results of our experiments in Section 6.3 and provide an analysis. Finally, Section 6.4 sums up some conclusions.

6.1 Algorithms

Red-black trees [46] are balanced binary search trees whose nodes are colored either red or black, the root always being black. The following invariant is maintained at all times: each path from the root to a leaf must contain the same number of black nodes. This limits the tree height to $2\lceil \log n \rceil$. By performing so-called rotations, the invariant can efficiently be held up for all operations.

Several pieces of work describe parallel algorithms for red-black trees. On the one hand, parallel red-black tree algorithms have been proposed for the PRAM model (see e.g., [75, 70]). They are highly theoretical, use fine-grained pipelining etc., and are thus not suitable for multi-core or similar processors. Nonetheless, some of the high-level ideas can be transferred to practical algorithms. On the other hand, the STAPL [4] library provides a parallel implementation of red-black trees. However, its implementation is based on a quite rigid partitioning of the tree (mostly meant for distributed-memory systems) that can lead, in the worst case, to all the elements being inserted by one processor.

In contrast, we present a practical implementation for multi-core computers. We implement parallel bulk operations for red-black trees on top of a sequential data structure core, which stays unaffected. Our partitioning of the data is temporary for one operation and flexible, e.g., multiple threads can work on a relatively small part of the tree. In particular, the cost of splitting the tree is negligible compared to the cost of creating/inserting the elements into the tree, in the common case.

Bulk construction can be seen as a special case of insertion. Indeed, we treat it the other way around here: bulk construction (of a subtree) is the base case for bulk insertion. This way, we also get good performance for the case where many elements are inserted between two already contained elements (or conceptually, $-\infty$ and $+\infty$ for bulk construction).

The rest of this section is organized as follows. First, in Section 6.1.1 we present the common setup for both bulk insertion and construction operations. Then, in Sections 6.1.2 and 6.1.3, we present the parallel bulk construction and insertion algorithms, respectively. Later, we analyze them in Section 6.1.4. Finally, we show in Section 6.1.5 how to enhance the parallel bulk insertion algorithm taking advantage of load-balancing techniques.

In the following, our description focuses on the set container, as it is at the same time one of the more general and more complicated ones. In Section 6.2, we give hints for the rest of the containers. We use the following notation: p denotes the number of threads used, n denotes the size of the existing tree, and k denotes the number of elements to insert or to construct the tree from, respectively.

6.1.1 Common setup

We present the common setup for both bulk insertion and construction operations, i.e., preprocessing the data to enable parallelization, and allocating node objects.

Preprocessing for parallelization

The first step is to make sure the input sequence S is sorted, which is also a precondition for the PRAM algorithms in [75]. If two wrongly ordered elements are found, checking the order is aborted, and the input sequence is sorted stably. Sorting stably is needed for unique dictionaries (i.e., set and map) to keep up with the operation semantics in the C++ Standard [50]. Sorting is done using the parallel sorting algorithms in the MCSTL. In particular, because the actual input sequence must not be modified, we need linear temporary storage here, namely an array.

Then, the resulting sorted sequence S′ is divided into p subsequences of (almost) equal size. If S′ is a random access sequence (in particular, it is so if it has been obtained from sorting S), dividing the sequence is straightforward, as sketched below. Otherwise, we use the SINGLEPASS algorithm presented in Chapter 7 to partition sequences whose size is unknown and for which random access is not provided. In fact, the required processing for the partitioning is done while the sequence is checked for being sorted.
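For the random access case, the division is just index arithmetic; a minimal sketch (illustrative, not the MCSTL code):

    #include <vector>

    // Boundary indices of p parts of (almost) equal size for a random access
    // sequence of length n. Part t is the index range [b[t], b[t+1]);
    // part lengths differ by at most one.
    std::vector<long> part_bounds(long n, long p) {
        std::vector<long> b(p + 1);
        for (long t = 0; t <= p; ++t)
            b[t] = t * n / p;
        return b;
    }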

Allocation and initialization

Each thread t allocates and constructs the nodes for its subsequence $S_t$, which are still unlinked. The tree nodes are stored in an array of pointers, shared among the threads. This allows the algorithm to easily locate the nodes for linking later on. If we wanted to avoid the array of pointers, we would need a purely recursive algorithm where allocation and initialization of the pointers are intertwined.

We also deal with equal elements in this step. In parallel, the surplus elements are removed from the sequence. A gap might emerge at the end of each $S_t$. Since we use stable sorting, we can guarantee that the first equal element becomes the one inserted, as demanded by the STL specification.

6.1.2 Parallel tree construction

Once the common setup has been performed, the tree is constructed as a whole by setting the pointers and the color of each node, in parallel. For each node, its pointers can be calculated independently, using index arithmetic. This takes into account the gaps stemming from removed equal elements, as well as the incompleteness of the tree. Each node is only written to by one thread, so there is no need for synchronization. Thus, this step is perfectly parallelized.

The algorithm constructs a tree that is complete except for the last level, which is filled partially from left to right. The last level is colored red, all other nodes are colored black. (We could omit this if the tree is complete, but this is a rare special case. For a tree containing only one element, the root node is colored black anyway.) Another option would be to also color every $\ell$-th level red. The smaller $\ell$, the more favored are deletions in the subsequent usage of the dictionary. The larger $\ell$, the more favored are insertions.
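The following is a hedged sketch of the idea, not the MCSTL code: a recursive, OpenMP-task stand-in for the index arithmetic described above (the actual implementation derives every node's pointers directly from its array index, fully independently and without recursion). The midpoint build places all leaves on two adjacent levels, so coloring the deepest level red and the rest black satisfies the red-black invariant.

    #include <omp.h>

    struct Node {
        Node *left = nullptr, *right = nullptr, *parent = nullptr;
        bool red = false;
        int key = 0;
    };

    // Link nodes[lo..hi) (already sorted by key) into a balanced search tree.
    Node* build(Node* nodes, long lo, long hi, Node* parent, int depth, int deepest) {
        if (lo >= hi) return nullptr;
        long mid = lo + (hi - lo) / 2;
        Node* n = &nodes[mid];
        n->parent = parent;
        n->red = (depth == deepest) && deepest > 0;   // last level red; lone root stays black
        #pragma omp task if(hi - lo > 1024)           // spawn tasks for large subranges only
        n->left = build(nodes, lo, mid, n, depth + 1, deepest);
        n->right = build(nodes, mid + 1, hi, n, depth + 1, deepest);
        #pragma omp taskwait
        return n;
    }

    Node* parallel_build(Node* nodes, long m) {       // requires m >= 1
        int deepest = 0;
        for (long s = m; s > 1; s /= 2) ++deepest;    // = floor(log2(m))
        Node* root = nullptr;
        #pragma omp parallel
        #pragma omp single
        root = build(nodes, 0, m, nullptr, 0, deepest);
        return root;
    }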

6.1.3 Parallel bulk insertion

The problem of inserting elements into an existing tree is much more involved than construction. The major question is how to achieve a good load balance between the threads. There are essentially two ways to divide the work into input subsequences $S_t$ and corresponding subtrees $T_t$:

1. We divide the tree according to the input sequence, i.e., we take elements from the sequence and split the tree into corresponding subtrees.
2. We divide the input sequence according to the tree, i.e., we split $S_t$ by the current root element, recursively.

Both approaches fail if very many elements are inserted at the end of the path to the same leaf. The first approach gives guarantees only on $|S_t|$, but not on $|T_t|$. The second approach does not guarantee anything about $|S_t|$, and bounds $|T_t|$ only very weakly: one of the subtrees might have twice the height of the other, so only $|T_1| < |T_2|^2$ holds. We use the first option in the initial step, and the second approach to compensate dynamically for greatly different $|T_t|$.

The split and concatenate operations on red-black trees [96, 103] are the major tools in our parallelization of the bulk insertion operation. Specifically, we consider the following operations.

Split into Multiple Subtrees. A red-black tree is split into p red-black trees, such that the elements are distributed according to $p-1$ pivots.

Concatenate. Two range-disjoint red-black trees and a node “in-between” are concatenated to form a single red-black tree. Rebalancing might be necessary.

The whole algorithm consists of four phases. For coordinating the work, insertion tasks and concatenation tasks are generated, and processed at a later stage.

Phase 0. Common setup, as described above.

Phase 1. Split the tree into p subtrees, the leftmost elements of the subsequences (except the first one) acting as splitters. This is performed in a top-down fashion, traversing at most $p-1$ paths, and generating $p-1$ concatenation tasks. With each concatenation task, we associate a node which is greater than all elements in the left subtree, but smaller than all elements in the right subtree. The algorithm chooses either the greatest element from the left subtree, or the least element from the subsequence to insert. This node will be used as the tentative new root when concatenating the subtrees again, but might end up in another place, due to rebalancing rotations. The resulting subtrees and the corresponding subsequences form p independent insertion tasks, one for each thread.

Phase 2. Process the insertion tasks. Each thread inserts the elements of its subsequence into its subtree, using an advanced sequential bulk insertion algorithm together with load-balancing techniques (see Section 6.1.5).

Phase 3. Process the concatenation tasks to rebuild and rebalance the tree. This phase is not actually temporally separated from Phase 2, but only conceptually. As soon as one thread has finished processing an insertion task, it tries to process its parent concatenation task, recursively. However, this can only happen if the sibling subtask is also already done. Otherwise, the thread quits this task and continues with another. It takes that one from its local queue, or steals one from another thread if its own queue is empty. The root concatenation task does not have any parent, so the algorithm terminates after having processed it.

6.1.4 Analysis

We now analyze the parallel cost of our algorithms. Assume for simplicity that the input sequence has already been preprocessed. Besides, as in the C++ Standard, we consider each element as atomic (i.e., the construction of one element has constant cost). To get rid of lower-order terms, we assume the number of elements in the tree to be relatively large with respect to the number of threads. Specifically, we assume that $n > p^2$.

Our construction algorithm takes $O(k/p)$ parallel time because each thread is responsible for constructing $O(k/p)$ elements and the construction of each element is independent.

Calculating the worst-case cost of the bulk insertion algorithm is more involved. Splitting the tree sequentially into p parts takes $O(p \log n)$ time. Note that, in the worst case, one of the partitions contains a tree of size (almost) n, and the others contain (almost) empty trees. Splitting the tree in parallel using a divide and conquer approach takes $O(\log p \log n)$ parallel time. However, preliminary experiments showed that our parallel algorithm was not faster than the sequential one (in our experiments, $p = 8$, which is relatively small), so we resorted to the sequential version.

Inserting a subsequence of size $k/p$ sequentially into a tree of constant size takes $O(k/p)$ time. Inserting a sequence of size $k/p$ sequentially into a tree of size n is upper bounded by the cost of inserting the elements one by one, i.e., $O(\frac{k}{p} \log n)$. Therefore, doing k insertions in p disjoint trees takes $O(\frac{k}{p} \log n)$ parallel time.

Finally, the p trees must be concatenated. Note that the concatenation operation itself is done sequentially, but concatenations of disjoint trees happen in parallel. A concatenation takes $O(\log \frac{n_1}{n_2})$ sequential time, where $n_1$ is the size of the larger tree and $n_2$ is the size of the smaller subtree. Besides, the trees to be concatenated can differ in size by at most n elements. Therefore, one concatenation takes at most $O(\log n)$ time. Given that there are $O(\log p)$ levels of concatenations in total, the cost of concatenating all the trees is $O(\log p \log n)$.

As a result, the total cost of the operation is dominated by the insertion itself and, therefore, it takes $O(\frac{k}{p} \log n)$ parallel time.
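Summarizing the three phases just analyzed (assuming additionally, for the last step, that $k \ge p^2$, so that the splitting term is dominated by the insertion term; this extra assumption is ours, the text simply notes that insertion dominates):
$$\underbrace{O(p \log n)}_{\text{split}} + \underbrace{O\Big(\frac{k}{p} \log n\Big)}_{\text{insert}} + \underbrace{O(\log p \log n)}_{\text{concatenate}} = O\Big(\frac{k}{p} \log n\Big) .$$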

6.1.5 Adding dynamic load-balancing

The sequence of elements to insert is perfectly divided among the threads. However, the corresponding subtrees might have very different sizes, which of course affects (wall-clock) running time negatively. Even worse, it is impossible to predict how elaborate the insertion will be, since this depends on the structure of the tree and the sequence. To counter this problem, we add dynamic load-balancing to Phase 2, using the randomized work-stealing [12] paradigm.

Each thread breaks down its principal insertion task into smaller ones. It goes down the subtree and divides the subsequence recursively, with respect to the current root. For each division, it creates the appropriate concatenation task, in order to reestablish the tree in Phase 3. The insertion task for the right subtree is pushed to the thread's work-stealing queue, while the recursion continues on the left subtree. However, the queue is only used if $S_t$ is still longer than a threshold s. Otherwise, the algorithm works on the problem in isolation (i.e., sequentially), to avoid the overhead of maintaining the queue. When there is nothing left to split, the bulk construction method is called for the remaining elements, and the new tree is concatenated to the leaf. If the subsequence has only a few elements left, the single-element insertion algorithm is called. As a side-effect, this gives us a more efficient sequential bulk-insertion algorithm, for s conceptually set to ∞.

To implement work-stealing, we use the lock-free double-ended queue provided by the MCSTL. It allows efficient synchronization of the work using hardware-supported atomic operations. On the downside, its size must be limited at construction time. Fortunately, the size of all queues is bounded by the height of the tree, namely $2\lceil \log n \rceil$. As a whole, the approach is similar to the parallelized quicksort in the MCSTL [86, p. 6].

6.2 Interface and implementation aspects

We provide an implementation for the four dictionary types in the STL: set, map, multiset and multimap. All of them are parameterized by a Key type, which must always be pro- vided, and Comparator and Allocator types, which are optional. In the following, we discuss implementation issues regarding the choice of dictionary type and its parameters.

6.2.1 (multi)set or (multi)map

The elements in set and multiset only consist of the dictionary key, whilst the elements in map and multimap consist of both the dictionary key and some additional data (the latter dictionary types are additionally parameterized by a Data type). By joining the dictionary key and its associated data into an appropriate helper data structure, the set data types can easily be generalized to the map data types. This approach is also taken in the libstdc++. However, some optimizations could be applied if we split the dictionary key from the additional data.

6.2.2 set/map or multiset/multimap

Our algorithms are built on the assumption of unique dictionaries, which is actually the most complicated case. They are extended to non-unique dictionaries by just introducing a new comparator object less_equal and using it when appropriate. The new comparator behaves like the class comparator for unique dictionaries, while for non-unique dictionaries it works as follows: if less is the class comparator, we define less_equal as less_equal(a,b) = not(less(b,a)). That is, it also returns true for equal elements.
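A minimal sketch of this comparator adaptation (illustrative naming, not the actual MCSTL identifiers):

    template <typename Less>
    struct LessEqual {
        Less less;
        template <typename T>
        bool operator()(const T& a, const T& b) const {
            return !less(b, a);   // also true when a and b are equal
        }
    };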

6.2.3 Memory management

For both bulk operations, allocating the node objects constitutes a considerable share of the work to be done. We cannot allocate several of them with one function call, since we need to be able to deallocate them one by one in case the user deletes elements from the data structure. The memory management concept of C++ does not allow such an asymmetric usage.

Tests have shown that allocating memory concurrently scales quite well, but imposes some work overhead compared to the strictly sequential implementation. There also exist memory managers designed for this specific problem, e.g., Hoard [10]. We successfully used it on the Sun machine. However, for our 64-bit Intel platform, Hoard could not even match the performance of the built-in allocator, so we did not use it there.

6.2.4 Trade-offs for different input data and comparator types

The cost of comparing and copying an element depends greatly on the type of the elements and its comparator. We also have to relate this to the cost of copying the node as a whole, which contains three pointers and the node color flag, in addition to the actual payload.

(a) Phase 0/1. The input sequence is divided equally, the tree is split accordingly.

(b) Phase 2. The two elements that will serve as the tentative roots in the future concatenation tasks are found to be 31 and 75. They are excluded from the subtree and not inserted into it, respectively. 58 is not inserted either, since it is a duplicate.

(c) Phase 3. Eventually, the concatenation tasks are processed. First, the right two subtrees are concatenated, using 75 as the new root. Then, the resulting tree is concatenated with the left one using 31 as the new root. In this case, a rotation is necessary for rebalancing purposes, so 75 ends up at the top position.

Figure 6.1: Inserting into a tree, using three threads. “Red” nodes are shown transparent.

In particular, we have taken this trade-off into account when sorting the elements during preprocessing. In general terms, for large elements it is better to sort pointers only, to avoid costly copying. For small elements, it is more cache-efficient to sort the elements directly. We contrasted this fact with some preliminary experiments. A fine-grained tuning is out of the scope of this study.

6.2.5 Task data structures

For distributing the subtasks among the threads, we need some extra data structures to describe them. An insertion task consists of the tree to be inserted into (represented by its root), the range of elements to insert, and a pointer to the parent concatenation task. A concatenation task description contains a reference to the root t of the new tree, and also its black height (the number of black nodes on each path down to a leaf). Furthermore, there are pointers to the children and the parent concatenation task of t.

Very importantly, a concatenation task must not be tackled before both of its child tasks are processed. To handle this, a boolean variable, initially false, is atomically set to true when the first child task is solved (so a race condition is avoided). The thread processing the second subtask then knows that it can continue up to the top.
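A hedged sketch of these descriptors (field names are illustrative, not the actual MCSTL identifiers):

    #include <atomic>

    struct Node;  // red-black tree node, defined elsewhere

    struct concat_task {
        Node* root;                      // tentative root t of the new tree
        int black_height;                // black nodes on each root-to-leaf path
        concat_task* children[2];        // child concatenation tasks of t
        concat_task* parent;             // enclosing concatenation task
        std::atomic<bool> one_child_done{false};
    };

    struct insert_task {
        Node* subtree;                   // subtree to insert into, by its root
        Node** first;                    // range of prepared, still unlinked nodes
        Node** last;
        concat_task* parent;             // task to try once this side is finished
    };

    // The thread finishing a child task second sees true and may proceed upward;
    // exchange() makes the check-and-set atomic, avoiding the race mentioned above.
    bool may_process_parent(concat_task* t) {
        return t->one_child_done.exchange(true);
    }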

6.3 Experimental analysis

We tested our program on two multi-processor machines with the following processors:

1. Sun T1 (1 socket, 8 cores, 1.0 GHz, 32 threads, 3 MB shared L2 cache),
2. Intel Xeon E5345 (2 sockets, 2 × 4 cores, 2.33 GHz, 2 × 2 × 4 MB L2 cache, shared among two cores each).

For the Intel machine, we used the Intel C++ compiler 10.0.25; on the Sun machine, we took GCC 4.2.0, always compiling with optimization (-O3) and OpenMP support. In both cases, we used the libstdc++ implementation coming with GCC 4.2.0. On the Xeon, compiling with the Intel compiler led to more consistent results, due to the better OpenMP implementation, which causes less overhead.

We have run each test at least 30 times and taken the average values for the plots, accompanied by the standard deviation range (very small in all cases). The following parameters concerning the input were considered:

Tree size/insertion sequence length ratio. Let n be the size of the existing dictionary/tree. Let r = n/k. Thus, r = 0 is equivalent to bulk construction.

Sortedness of the insertion sequence. The input sequence can be presorted or not. If the input is not presorted, our algorithms are much faster than the libstdc++ implementa- tion, which inserts the elements one after the other. We focus on presorted insertion sequences here because parallel sorting is a separate issue.

Key data type. We show experiments on 32-bit signed integers.

Randomness of the input sequence. The input sequence by default consists of values in the range {RAND_MIN, ..., RAND_MAX}, but can optionally be limited to a smaller range,

[Figure 6.2 consists of three plots of speedup versus the number of inserted elements (k), for the sequential algorithm and for 1 to 8 threads (up to 32 on the T1): (a) on the Xeon; (b) on the T1; (c) on the Xeon, fixing two cores (same die, same socket but different dice, or different sockets).]

Figure 6.2: Performance results for constructing a set of integers.

namely {RAND_MIN/100, ..., RAND_MAX/100}. This poses a challenge to the load-balancing mechanisms.

Both load-balancing mechanisms were switched on by default, but deactivated for some experiments, to show their influence on the running time.

For the insertion tests, we first build a tree of the appropriate size using the sequential algorithm, and then insert the elements using our parallel bulk insertion algorithm. This guarantees fair initial conditions for comparing the sequential and the parallel implementation. Actually, the shape of the existing tree does affect performance, though the sequential insertion is more affected than our algorithms.

In the following, we first consider the results for the sequential case, and then the parallel case.

6.3.1 Sequential results

Our sequential algorithm is never substantially slower, and in some cases substantially faster. In particular, our specialized algorithms are very competitive on random inputs limited to a certain range, being more than twice as fast as the standard implementation (see Figures 6.4(a) and 6.4(b)). This is because many comparisons in the upper levels of the tree are saved, taking advantage of the input characteristics.

6.3.2 Parallel results

Figures 6.2(a) and 6.2(b) show the results for dictionary construction. These show that our algorithms scale quite well. In particular, on the 8-core Sun T1, even the absolute speedup slightly exceeds the number of cores, culminating at about 11. This shows the usefulness of per-core multithreading in this case.

For insertion, our algorithm is most effective when the existing tree is smaller than the data to insert (see Figures 6.3(a) and 6.3(b)). This is also due to the fast (sequential) insertion procedure, compared to the original algorithm. Splitting the input sequence is most effective in terms of the number of comparisons because the subsequences left when reaching a leaf still consist of several elements. In all cases, remarkable speedups are achieved: a speedup of at least 3 for four threads, and of at least 5 for eight threads. The break-even is already reached for as few as 1000 elements; the maximum speedup is usually hit for 100000 or more elements.

The tree splitting step, used to make an initial balancing in the case of insertion, is shown to be really effective. For instance, compare Figures 6.3(b) and 6.3(c), which only differ in whether Phase 1 of the algorithm is run. We can see that both the speedup and the scalability are far better when the initial splitting is activated.

On top of that, the parallelization scales nicely, specifically when using dynamic load-balancing. Switching load-balancing off hurts performance for large inputs, while for small inputs it does not create considerable overhead. Dynamic load-balancing makes the algorithm more robust, as comparing Figures 6.4(a) and 6.4(b) shows.

As a by-product, we show the effects of mapping two threads to cores in different ways (see Figure 6.2(c)). For small inputs, the most profitable configuration is sharing the cache,

[Figure 6.3 consists of three plots of speedup versus the number of inserted elements (k), for the sequential algorithm and for 1 to 8 threads: (a) r = 0.1; (b) r = 10; (c) r = 10, without using the initial splitting of the tree.]

Figure 6.3: Performance results for inserting integers into a set on the Xeon.

[Figure 6.4 consists of two plots of speedup versus the number of inserted elements (k), for the sequential algorithm and for 1 to 8 threads: (a) r = 10; (b) r = 10, without using dynamic load-balancing.]

Figure 6.4: Performance results for inserting integers from a limited range into a set on the Xeon.

i.e., both threads running on the same die. As the input data gets larger, memory bandwidth becomes more important, so running the threads on different sockets wins. Running both threads on the same socket, but on different dice, is in fact worst, because it combines both drawbacks. For the other tests, however, we have refrained from explicitly binding threads to specific cores, because the algorithms should be allowed to run on a non-exclusive machine in general.

6.4 Conclusion and future work

In this chapter, we have shown that construction of, and bulk insertion into, a red-black tree can be effectively parallelized on multi-core machines, on top of a sequential implementation which remains unaffected. Our bulk construction operation shares some ideas with the theoretical algorithm in [75], giving a practical implementation. The code has been released in the MCSTL [86], version 0.8.0-beta.

Our bulk insertion operation divides the work using two approaches: first without considering the tree structure, then dynamically, depending on the subtree structure. This way, we can ensure good scalability even if only a limited part of the tree is affected. Also, we provide dynamic load-balancing on the subtrees to rule out external factors.

We discussed some of the main practical obstacles for the actual implementation, including memory allocation, which can take a large part of the running time if one is not careful. Our experiments take into account several kinds of input data distribution as well as different variants of our algorithms. The results of our experiments show that good speedups can be obtained, in particular for small input sizes, and if the existing tree is larger than the input sequence. Also, the sequential version of our insertion algorithm is usually much faster than the libstdc++ version, because we benefit from structure in the input.

To speed up programs that do not use bulk operations explicitly, we could use lazy updating. Each sequence of insertions, queries, and deletions could be split into consecutive subsequences of maximal length, consisting of only one of the operations. This approach could transparently be covered by the library methods. Then, for example, a loop inserting a single element in each iteration would also benefit from the parallelized bulk insertion.

Another enhancement to the functionality is bulk deletion of elements. Since it is easy to remove a consecutive range from a tree, it would be rather interesting to tackle the problem posed by a remove_if call, specializing it for the dictionary types.

Chapter 7

Chapter 7

Single-pass list partitioning

In many parallel algorithms, the input must first be divided into (independent) parts of similar size so that parallel computation is effective. Most parallel algorithm descriptions disregard how the input is actually divided (or assume that the input can be divided by index computations). If we want to use such algorithms in practice, we have to deal with sequences whose elements can only be accessed one by one and whose length is unknown, e.g., a singly linked list. This definition also corresponds to STL forward access sequences. See Section 2.4.1 for an overview on STL iterators and sequences. Indeed, most STL algorithms are defined on forward access sequences.

In this chapter, we consider the problem of dividing such sequences into parts of similar length to allow effective parallel computation. We call this problem list partitioning. Solving it efficiently is of utmost importance because the speedup of parallel programs is limited by the sequential portion (according to Amdahl's law).

Given that the length of the sequence is unknown, one could think of first traversing the sequence to determine its length, and then traversing it a second time to actually divide the sequence. We call this algorithm TRAVERSETWICE. However, traversing the sequence can be expensive, so we do not want to pay for it twice. The elements can be spread in memory in a cache-unfriendly way, and/or calculating the next element may be costly. To avoid this, one could also think of using a dynamic array, storing the pointers to the elements there, effectively converting the sequence into a randomly accessible one. We call this algorithm POINTERARRAY. However, this is very costly in terms of additional space. We subsume both algorithms as the trivial solutions (both are sketched below).

In contrast, we present a list partitioning algorithm using only sublinear additional space, and accessing each element exactly once. Besides, we provide a C++ implementation and draw an experimental comparison: we evaluate the algorithms by themselves as well as their impact on parallel performance. The experiments show that our list partitioning algorithm is effective and fast in practice.

This chapter is organized as follows. In Section 7.1, we formally define the list partitioning problem. Then, in Section 7.2 we present our single-pass list partitioning algorithm. Next, in Section 7.3 we present the experimental results. Finally, we sum up some conclusions in Section 7.4.
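For concreteness, a sketch of the two trivial solutions over an STL forward iterator range follows, assuming n >= p; traverse_twice and pointer_array are illustrative names:

    #include <cstddef>
    #include <vector>

    // TRAVERSETWICE: first pass counts the elements, second pass places the cuts.
    template <typename FwdIt>
    std::vector<FwdIt> traverse_twice(FwdIt first, FwdIt last, std::size_t p) {
        std::size_t n = 0;
        for (FwdIt it = first; it != last; ++it) ++n;     // first traversal
        std::vector<FwdIt> bounds{first};                 // p + 1 part boundaries
        std::size_t done = 0;
        for (std::size_t i = 1; i <= p; ++i) {
            while (done < i * n / p) { ++first; ++done; } // second traversal
            bounds.push_back(first);
        }
        return bounds;
    }

    // POINTERARRAY: one pass, but Theta(n) extra space for an iterator index.
    template <typename FwdIt>
    std::vector<FwdIt> pointer_array(FwdIt first, FwdIt last, std::size_t p) {
        std::vector<FwdIt> index;
        for (FwdIt it = first; it != last; ++it) index.push_back(it);
        std::vector<FwdIt> bounds;
        for (std::size_t i = 0; i < p; ++i)
            bounds.push_back(index[i * index.size() / p]);
        bounds.push_back(last);
        return bounds;
    }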


7.1 Problem definition

A linearly traversable sequence of unknown length n is to be divided into p parts of almost equal length. Let the ratio r be the quotient of the length of the longest part and the length of the shortest part. It is a good quality measure for the partitioning, since it correlates with the efficiency of processing the parts in parallel, given that processing time is proportional to a part's length. Thus, to guarantee good efficiency, r should be upper bounded at any time by a constant R that depends only on a tuning parameter, namely the oversampling factor σ ∈ ℕ \ {0}.

Without loss of generality, we assume that the input sequence has length at least σp, i.e., n ≥ σp. Otherwise, if p ≤ n, we can lower σ so that σp ≤ n. If p > n, we reduce p to n to avoid that any part is empty (and therefore, r = ∞), which would not be sensible for our purposes.

This can be considered as an online problem since the input is given one element at a time, without information about the whole problem. Thus, we can define a competitive ratio between the optimal offline algorithm and our online algorithm. For the optimal offline algorithm, the difference in part lengths is at most 1, which gives a ratio r_OPT = ⌈n/p⌉ / ⌊n/p⌋ → 1 as n → ∞.

7.2 The SINGLEPASS algorithm

Let L be a forward linearly traversable input sequence. Our single-pass algorithm, denoted SINGLEPASS, keeps a sequence of boundaries S[0...p], where [S[i−1], S[i]) defines the i-th subsequence of L. Inserting a subsequence into S means storing its boundaries in the appropriate places. A boundary is identified by its rank in L. The basic SINGLEPASS algorithm works as follows:

1. Let k := 1, S := {}.

2. Iteratively append to S at most 2σp 1-element consecutive subsequences from L.
   S := {0, 1, 2, ..., 2σp}

3. While L has more elements do:
   Invariant: |S| = 2σp, S[i+1] − S[i] = k

   (a) Merge each two consecutive subsequences into one subsequence.
       S[0, 1, 2, ..., σp] := S[0, 2, 4, ..., 2σp]
       This results in σp subsequences of length 2k.

   (b) Let k := 2k.

   (c) Iteratively append to S at most σp consecutive subsequences of length k from L.
       S := {0, k, ..., σpk, (σp+1)k, (σp+2)k, ..., l}, with σpk < l ≤ 2σpk

4. The σp ≤ |S| ≤ 2σp subsequences are divided into p parts of similar lengths as follows. The |S| mod p rightmost parts are formed by merging ⌈|S|/p⌉ consecutive subsequences each, from the right end. The remaining p − (|S| mod p) leftmost parts are formed by merging ⌊|S|/p⌋ consecutive subsequences each, from the left end.
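A condensed C++ sketch of the basic algorithm follows (not the MCSTL implementation). Boundaries are stored as iterators rather than ranks, and Step 4 is shown with simple proportional cuts, which gives the same one-subsequence guarantee:

    #include <cstddef>
    #include <vector>

    // Advance it by at most k steps; needed because the length is unknown.
    template <typename FwdIt>
    void advance_at_most(FwdIt& it, FwdIt last, std::size_t k) {
        std::size_t steps = 0;
        while (steps < k && it != last) { ++it; ++steps; }
    }

    template <typename FwdIt>
    std::vector<FwdIt> single_pass(FwdIt first, FwdIt last,
                                   std::size_t p, std::size_t sigma) {
        std::vector<FwdIt> S{first};        // boundary of rank 0
        std::size_t k = 1;
        // Step 2: append up to 2*sigma*p subsequences of length 1.
        while (S.size() <= 2 * sigma * p && first != last) {
            ++first;
            S.push_back(first);
        }
        while (first != last) {             // Step 3
            // (a) merge consecutive pairs: keep every second boundary
            for (std::size_t i = 1; 2 * i < S.size(); ++i) S[i] = S[2 * i];
            S.resize(sigma * p + 1);
            k *= 2;                         // (b)
            // (c) append up to sigma*p subsequences of length k
            while (S.size() <= 2 * sigma * p && first != last) {
                advance_at_most(first, last, k);
                S.push_back(first);
            }
        }
        // Step 4: merge the |S|-1 subsequences into p parts of similar length.
        std::size_t q = S.size() - 1;       // number of subsequences
        std::vector<FwdIt> bounds;
        for (std::size_t i = 0; i <= p; ++i) bounds.push_back(S[i * q / p]);
        return bounds;
    }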

Figure 7.1: Basic SINGLEPASS list partitioning algorithm scheme.

The algorithm (visualized in Figure 7.1) takes special care of the rightmost subsequence E. If L runs empty prematurely in Step 2 or 3c, the last subsequence has length at most k, i.e., |E| ≤ k, and might be shorter than the others. Let T be the part containing E; no part consists of more subsequences than T. So, if exactly one part is longer than all the others (i.e., |S| mod p = 1), this is specifically T. In this case, T differs from the other parts in |E| elements. As a whole, the algorithm guarantees that in the worst case, two parts differ in at most one complete subsequence (i.e., in at most k elements).

The basic SINGLEPASS algorithm needs Θ(σp) additional space to store S. The time complexity is Θ(n + σp log n). This is proved as follows: we need to traverse the whole sequence, taking Θ(n) time. In addition, Step 3 visits Θ(σp) elements of S in Θ(log n) iterations.

The worst-case ratio r is bounded by (σ+1)/σ. The worst case occurs when just one complete subsequence has been appended after reducing the list. Without loss of generality, to analyze the average ratio, we consider only complete subsequences, therefore σp ≤ |S| < 2σp. The average ratio (denoted by E_r) is upper bounded by

\[
\begin{aligned}
E_r &< \frac{1}{\sigma p} \sum_{\ell=\sigma p}^{2\sigma p-1} \frac{\lceil \ell/p \rceil}{\lfloor \ell/p \rfloor}
     = \frac{1}{\sigma p} \left( \sigma + \sum_{\ell=\sigma p,\; p \nmid \ell}^{2\sigma p-1} \frac{\lceil \ell/p \rceil}{\lfloor \ell/p \rfloor} \right) \\
    &= \frac{1}{\sigma p} \left( \sigma + (p-1) \sum_{\ell=0}^{\sigma-1} \frac{\sigma+\ell+1}{\sigma+\ell} \right)
     = \frac{1}{\sigma p} \left( \sigma p + (p-1) \sum_{\ell=0}^{\sigma-1} \frac{1}{\sigma+\ell} \right) \\
    &= 1 + \frac{1}{\sigma p} (p-1) \sum_{\ell=0}^{\sigma-1} \frac{1}{\sigma+\ell}
     = 1 + \frac{1}{\sigma p} (p-1) \bigl( \Psi(2\sigma) - \Psi(\sigma) \bigr) \\
    &\approx 1 + \frac{1}{\sigma p} (p-1) \bigl( \ln(2\sigma) - \ln \sigma \bigr)
     = 1 + \frac{(p-1) \ln 2}{\sigma p} \,.
\end{aligned}
\]

E.g., for σ = 10 and p = 32, using the previous expressions for the worst-case and average ratio, respectively, we get that the longest subsequence is at most 10% longer than the shortest one, and expectedly 7% longer.

A generalization of this algorithm performs Steps 3a and 3b only every m-th loop iteration. In the remaining iterations of the main loop, S is doubled in size, so that space for additional subsequences is needed. This is equivalent to increasing the oversampling factor to σn^γ with γ = 1 − 1/m.

The generalized SINGLEPASS algorithm needs as many iterations of Step 3 as the basic algorithm, i.e., Θ(log n) iterations. The additional space increases, but sublinearly, growing with O(σp n^γ). The time complexity of this algorithm is Θ(n + σp(n^γ + log n)).

Figure 7.2: Running times of the list partitioning algorithms for p = 4. The plot shows the running time in nanoseconds per element against the number of elements (n) for POINTERARRAY, TRAVERSETWICE, and SINGLEPASS with σ=1, m=1; σ=10, m=1; and σ=1, m=2.

In the worst case, the longest subsequence and the shortest subsequence have length (n^γ + 1)k and (n^γ)k, respectively. It holds that σp n^γ k ≈ n, so k ≈ n^{1/m}/(σp). Subsuming this, the lengths of the subsequences differ by at most n^{1/m}/(σp) elements, i.e., the difference decreases relative to n, as n grows. Therefore, the bound for r also converges to 1.

Generally speaking, the choice of m trades off time and space versus solution quality. The larger m, the more memory and time is used, but r becomes better. This is the same effect as would be caused by a dynamically growing σ. Choosing m = 2 appears to be a good compromise.

7.3 Experimental analysis

We have implemented our SINGLEPASS algorithm in its general form, so it subsumes the two variants. Besides, we have implemented the two naïve algorithms, namely the TRAVERSETWICE and POINTERARRAY algorithms. The implementation has been done in C++ and as part of the MCSTL [86], a parallel implementation of the STL. See Section 2.4.3 for an overview on the MCSTL and other STL implementations. Dynamic arrays have been implemented using STL vector.

We have performed two kinds of experiments. First, we present the evaluation of the list partitioning algorithms in isolation. Then, we present the impact of using list partitioning algorithms in the parallelization of two STL algorithms.

7.3.1 Comparison of list partitioning algorithms

We have compared all the algorithms, measuring both the running time and the quality of the results. Concerning quality, we have computed both the worst-case ratio r and its expectation.

Figure 7.3: Quality of the list partitioning algorithms for p = 4. We show the worst-case overhead ratio h = r − 1, as well as its expectation, plotted in % against the number of elements (n) for the trivial algorithms and SINGLEPASS with σ=1, m=1; σ=10, m=1; and σ=1, m=2. Note that the missing points are actually 0.

For better plot readability, we have rescaled these results using the overhead ratio h, where h is defined from the ratio r as h = r − 1.

It must be noted that the actual quality of the results is deterministic with respect to the problem size. That is, the quality of our solution does not depend on the specific input data but only on its size.

Setup. We have tested our program on an AMD Opteron 270 (2.0 GHz, 1 MB L2 cache). We have used GCC 4.2.0 as well as its libstdc++ implementation, compiling with optimization (-O3).

Parameters for testing. We have repeated our experiments at least 10 times and report the average running time (variance was observed to be small). The focus is on results for p = 4. Recall that as p grows, r becomes smaller. For SINGLEPASS, we consider the following parameter combinations:

1. σ = 1, m = 1
2. σ = 10, m = 1
3. σ = 1, m = 2

Therefore, it uses Θ(p), Θ(10p), and Θ(p√n) additional space, respectively.

Results. Figure 7.2 summarizes the performance results, and Figure 7.3 the quality results. They show that the performance of the SINGLEPASS algorithm is very good. In particular, it takes only half the time compared to the TRAVERSETWICE algorithm, and even less compared to the POINTERARRAY algorithm. That is, we can divide a sequence into parts using virtually the same time as for only traversing it once. Further, the varying running times for POINTERARRAY must be due to the amortization of the vector doubling cost (i.e., depending on how much of the allocated memory has actually been used by the vector).

The quality of the solution for our algorithm improves with the amount of additional space allowed for the auxiliary sequence S (i.e., increased σ or m). The simplest variant (σ = 1, m = 1) has a worst-case ratio of 2 and an average ratio of 1.5. In addition, the overhead r − 1 is divided by σ (in our case, σ = 10). When setting m to 2, our algorithm behaves very well, converging to zero overhead. The average overhead ratio decreases exponentially with the input size n. Note that the (optimal) worst-case ratio achieved by the naïve algorithms also decreases exponentially (and even at a faster rate than SINGLEPASS).

7.3.2 Parallelization using list partitioning

After having evaluated the isolated performance of the list partitioning algorithms, we investigate their impact in their intended domain, i.e., data-parallel algorithms. Two interesting examples are the STL accumulate and STL list::sort operations.

STL accumulate is an STL algorithm that computes the sum (or some other reduction based on an associative binary operation) of a sequence of elements, starting with some initial value. Here, we consider a forward access input sequence.

STL list::sort stably sorts a list instance. The implementation in the libstdc++ consists of a doubly linked list. See Section 3.2 for more details on this implementation.

In the following, we first present parallel implementations using list partitioning for both algorithms. Then, we evaluate the impact on performance of different list partitioning algorithms.

Parallel implementations. Parallelizing STL accumulate is straightforward. Let the desired number of threads be p. The list is partitioned into p parts, using one of the described algorithms. In parallel, each thread accumulates its part. After that, the p results are combined sequentially, as sketched below.

For STL list::sort, we partition the list as above and split it into the respective sublists. These are sorted by one thread each, in parallel. Recursive tree-shaped merging combines the results in log₂ p steps, each merge per se being done sequentially.
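A sketch of the parallel accumulate just described, assuming a boundary vector as produced by one of the list partitioning algorithms:

    #include <numeric>
    #include <vector>
    #include <omp.h>

    // bounds[i], bounds[i+1] delimit the part that thread i accumulates.
    template <typename FwdIt, typename T>
    T parallel_accumulate(const std::vector<FwdIt>& bounds, T init) {
        int p = static_cast<int>(bounds.size()) - 1;
        std::vector<T> partial(p, T{});
        #pragma omp parallel for num_threads(p)
        for (int i = 0; i < p; ++i)   // each thread reduces its own part
            partial[i] = std::accumulate(bounds[i], bounds[i + 1], T{});
        // combine the p partial results sequentially
        return std::accumulate(partial.begin(), partial.end(), init);
    }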

Parameters for testing. We have repeated our experiments at least 10 times and report the average running time. For SINGLEPASS, we have used the parameters σ = 1 and m = 2. Deduced from the results in Section 7.3.1, this choice produces a high-quality partitioning at a low overhead in running time and additional space. The data type for STL list::sort is a 16-byte element containing an 8-byte integer key. The values are randomly generated. As a use case for accumulate, we approximately multiply double-precision floating-point numbers by summing up their logarithms.
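The accumulate use case amounts to the following: approximating a product of doubles by exponentiating the sum of their logarithms.

    #include <cmath>
    #include <list>
    #include <numeric>

    // Approximate the product of the values via the sum of their logarithms.
    double approx_product(const std::list<double>& xs) {
        double log_sum = std::accumulate(xs.begin(), xs.end(), 0.0,
            [](double acc, double x) { return acc + std::log(x); });
        return std::exp(log_sum);
    }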

Setup. We have run these experiments on a dual-socket AMD Opteron 2350 (2 × 4 cores, 2.0 GHz, 2 × 2 MB L3 cache).

Figure 7.4: Performance results for STL list::sort. Both panels plot the speedup (sequential and 1–8 threads) against the number of elements (n): (a) using TRAVERSETWICE partitioning; (b) using SINGLEPASS partitioning.

Results. Figures 7.4(a) and 7.4(b) show the speedup for parallel STL list::sort using the TRAVERSETWICE and SINGLEPASS list partitioning algorithms, respectively. The POINTERARRAY variant is not compared here since it has a linear space overhead.

In both shown cases, the achieved speedups in absolute terms are not particularly good. This is probably mostly due to the bad collective memory bandwidth caused by the random accesses needed to traverse the more and more scattered sublists. Nonetheless, for a large number of elements, the SINGLEPASS list partitioning algorithm is significantly better than TRAVERSETWICE because the sequential fraction is sped up by a factor of 2, and the parts are of very similar size.

Figures 7.5(a) and 7.5(b) show the speedup for parallel STL accumulate using the TRAVERSETWICE and SINGLEPASS list partitioning algorithms, respectively. In this case, the achieved speedups in absolute terms are better because summing the logarithms of floating-point numbers is computationally intensive. Again, the performance using SINGLEPASS is much better than with TRAVERSETWICE, in particular for a large number of elements.

Overall, for large inputs, SINGLEPASS obtains much better speedups than TRAVERSETWICE both for STL list::sort and STL accumulate.

7.4 Conclusions and future work

We have presented a simple, though non-trivial, algorithm to solve the list partitioning problem using only one traversal and sublinear additional space. Our algorithm divides a linearly traversable sequence of unknown length n in time Θ(n + σp(n^{1−1/m} + log n)) using O(σp n^{1−1/m}) additional space, where σ and m are tuning parameters of the algorithm.

We have implemented all the algorithms presented in this chapter. The code was first released in the MCSTL [86], version 0.8.0-beta. Later, this code has been included in the libstdc++ parallel mode (see Section 2.4.3 for further details on these libraries), and thus, it is automatically installed as a standard component of major Linux distributions and other Unix-like operating systems.

Our experiments have shown that our algorithm is very efficient in practice. It takes almost the same time as if the list was just traversed, without any processing. Besides, very high quality solutions can be obtained. The larger m, the better the quality, trading off memory. If m = 1, the worst-case overhead ratio r − 1 is divided by the oversampling factor σ. If m > 1, the worst-case overhead ratio decreases exponentially with n. Therefore, for large input instances and in most practical situations, processing perfectly equal parts and almost equal parts should take about the same time, because the time to process each of the parts fluctuates in the same order of magnitude. In addition to this, our approach computes the partitioning twice as fast as the naïve approach. As a result, using our list partitioning algorithm as a substep of parallel algorithms, the overall performance is significantly improved compared to a naïve implementation.

Figure 7.5: Performance results for STL accumulate. Both panels plot the speedup (sequential and 1–8 threads) against the number of elements (n): (a) using TRAVERSETWICE partitioning; (b) using SINGLEPASS partitioning.

Chapter 8

Parallel partition revisited

The partitioning of an array is a basic building block of many key algorithms, such as quicksort and quickselect. Partitioning an array with respect to a pivot x consists of rearranging its elements such that, for some splitting position v, all elements to the left of v are smaller than x, and all other elements are greater than or equal to x. It is well known that an array of n elements can be partitioned sequentially and in-place using exactly n comparisons and m swaps, where m is the number of elements greater than x whose original position is smaller than v (a sketch of this sequential baseline is given below).

In this chapter, we consider the problem of partitioning an array in parallel. Several algorithms have been proposed for partitioning in parallel [47, 29, 98, 59, 86]. We focus on algorithms suitable for current widely available multi-core architectures. Specifically, we consider a simple algorithm by Francis and Pannan [29], a fetch-and-add based algorithm by Tsigas and Zhang [98], and a variation of the former in the MCSTL [86], a parallel STL implementation (see Section 2.4.3 for an overview on the MCSTL and other STL implementations). All these algorithms, though very different in nature, can be divided into three main phases:

1. A sequential setup of each processor's work.
2. A parallel main phase in which most of the partitioning is done.
3. A cleanup phase, which is usually sequential.

Besides, in order to avoid too much synchronization, they perform more than n comparisons and m swaps.

In this chapter, we show how the former parallel partitioning algorithms can be modified so that exactly n comparisons are made, which is optimal. The original algorithms disregard part of the work done in the main parallel phase when cleaning up. By contrast, we propose an alternative parallel cleanup phase that uses the whole comparison information of the parallel phase. A small static order-statistics tree is used to efficiently locate the elements to be swapped and to swap them in parallel. We provide a detailed analysis. Furthermore, we show that the resulting parallel partitioning algorithms are also scalable and competitive. To do so, we have implemented all the previous algorithms and drawn an experimental comparison between them, both with their original cleanup and with ours. Besides, the implementation is provided according to the specification of STL partition. Previously, only the MCSTL implementation was available.
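As a reference point, a sketch of the sequential in-place baseline in the spirit of Hoare's scheme (not necessarily the exact libstdc++ code); it compares each element against the pivot exactly once:

    #include <cstddef>
    #include <utility>

    template <typename T>
    std::size_t sequential_partition(T* a, std::size_t n, const T& pivot) {
        std::size_t left = 0, right = n;  // [left, right) is unprocessed
        while (left < right) {
            if (a[left] < pivot) { ++left; continue; }        // one comparison
            // a[left] >= pivot: scan from the right for a partner
            while (left < --right && !(a[right] < pivot)) {}  // one comparison each
            if (left < right) std::swap(a[left++], a[right]); // one swap per pair
        }
        return left;  // splitting position v
    }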


This chapter is organized as follows. In Section 8.1, we present the considered algorithms. Then, in Section 8.2, we present our cleanup algorithm. Our implementation of the previous algorithms follows in Section 8.3, and the experimental comparison in Section 8.4. Finally, we sum up some conclusions in Section 8.5.

8.1 Previous work and a variant

In this section, we give an overview of several partitioning algorithms suitable for multi-core implementation. In the following, the input consists of an array of n elements and a pivot. Besides, p processors are available and we assume that p ≪ n. Also, we disregard some details, such as rounding issues, for the sake of simplicity.

STRIDED algorithm

The STRIDED algorithm by Francis and Pannan [29] works as follows:

1. Setup: The input is (conceptually) divided into p pieces of size n/p. The pieces are not made of consecutive elements, but of one of every p elements instead. That is, the i-th piece is made up of elements i, i+p, i+2p, ...

2. Main phase: Each processor i, in parallel, gets a piece, applies sequential partitioning on it, and returns its splitting position v_i.

3. Cleanup: Let v_min = min{v_i : 1 ≤ i ≤ p} and v_max = max{v_i : 1 ≤ i ≤ p}. It holds that all the elements to the left of v_min and to the right of v_max are already well placed with respect to the pivot. In order to complete the partition, sequential partition is applied to the range (v_min, v_max).

The main phase takes Θ(n/p) parallel time. For random inputs, the cleanup phase is expected to take constant time. However, it was not stated in [29] that in the worst case the algorithm takes Θ(n) time and thus, there is no speedup. E.g., if the pieces are made exclusively of either smaller or greater elements than the pivot, and these alternate, then v_min ≤ p and v_max ≥ n − p, and |(v_min, v_max)| = Θ(n).

BLOCKED algorithm

Accessing elements with stride p as in the STRIDED algorithm can cause a high cache miss ratio. We propose the BLOCKED algorithm to overcome this problem. This algorithm uses blocks of b elements instead of individual elements. Each block in a piece is separated from the next by a stride of p blocks. Note that if b = 1, BLOCKED is equal to STRIDED.

F&A algorithms

Heidelberger et al. [47] proposed a parallel partitioning algorithm in the PRAM model in which elements from both ends of the array are taken using fetch-and-add instructions. Fetch-and-add instructions (which atomically increment a variable and return its original value) were introduced in [44] and are useful, for instance, to implement synchronization and mutual exclusion.

In a first approach, exactly one element is taken at a time, and so, at the end of the parallel phase, the array is already partitioned. In this case, n fetch-and-add operations are used. In a second approach, the algorithm is generalized to blocks: a block of b elements is acquired at each fetch-and-add instruction. So, the number of fetch-and-add instructions is n/b. However, in this case, some sequential cleanup remains to be applied after the parallel phase.

Later, Tsigas and Zhang [98] presented a variant of the second approach for multiprocessors. More recently, a further variant has been proposed as part of the MCSTL [86]. In the latter, the cleanup phase is partially done in parallel. Let us now briefly describe these F&A algorithms:

1. Setup: Each processor takes two blocks, one from each endpoint of the array. Namely, one left block and one right block.

2. Main phase: While there are blocks, each processor applies the so-called neutralize method to its two blocks (see the sketch after this list). The neutralize method consists of applying the sequential partitioning algorithm to the array made by (conceptually) concatenating the right block to the left one. However, the left and right pointers to the current elements cannot cross the borders of a block. When a left (right) block is completely processed (i.e., neutralized), a fresh left (right) block is acquired and processed.

3. Cleanup: At this point, at most p blocks remain unneutralized. Each author presents a different cleanup algorithm:

   — In [98], while unneutralized blocks remain, one block is taken from each endpoint and neutralization is applied to them. Then, the unneutralized blocks are placed between the neutralized blocks. At most p blocks need to be swapped, and this is done sequentially. Finally, sequential partition is applied to the range of blocks with unprocessed elements.

   — In [86], all unneutralized blocks are placed between the neutralized blocks. Then, the parallel partitioning algorithm is applied recursively to this range. The number of processors is divided by two in each call until there is only one processor or block. Finally, the remaining range is partitioned sequentially.
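A sketch of the neutralize method follows, assuming the two blocks are given as index ranges [lo, lo+b) and [hi, hi+b) over a shared array a; Neutralized and the helper itself are illustrative names. It partitions the virtual concatenation of the two blocks without crossing block borders and reports which block(s) became neutralized:

    #include <cstddef>
    #include <utility>

    enum class Neutralized { Left, Right, Both };

    template <typename T>
    Neutralized neutralize(T* a, std::size_t lo, std::size_t hi,
                           std::size_t b, const T& pivot) {
        std::size_t l = lo, r = hi + b;  // r is one past the right block
        while (true) {
            // advance inside the left block over correctly placed elements
            while (l < lo + b && a[l] < pivot) ++l;
            // retreat inside the right block over correctly placed elements
            while (r > hi && !(a[r - 1] < pivot)) --r;
            if (l == lo + b && r == hi) return Neutralized::Both;
            if (l == lo + b) return Neutralized::Left;   // left block done
            if (r == hi)     return Neutralized::Right;  // right block done
            std::swap(a[l++], a[--r]);  // both stuck: swap the misplaced pair
        }
    }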

The main parallel phase takes Θ(n/p) parallel time. The cleanup phase takes Θ(bp) sequential time in [98]; in [86], by contrast, it takes Θ(b log p) parallel time.

8.2 The new parallel cleanup phase

In this section, we present our cleanup algorithm. It avoids extra comparisons, and swaps the elements fully in parallel. We have applied it on top of STRIDED, BLOCKED, and F&A. First, we introduce the terminology. Then, we present the data structure on which the algorithm relies, followed by the cleanup algorithm itself. Finally, we analyze the resulting STRIDED, BLOCKED, and F&A algorithms.

Terminology

In the following, we shall use these terms to describe our algorithm. A subarray is the basic unit of our algorithm and data structure. The splitting position v of an array is the position that the pivot would occupy after partitioning. A frontier separates a subarray into two consecutive parts that have different properties. A misplaced element is an element that must be moved by our algorithm. We denote by m the total number of misplaced elements and by M the total number of subarrays that may have misplaced elements.

The Case of BLOCKED. In this case, each subarray corresponds to exactly one of the p pieces. Moreover, v can be easily known after the parallel phase. The frontier of a subarray corresponds to the position that the pivot would occupy after partitioning this subarray. Thus, a frontier defines a left and a right part. A misplaced element corresponds either to an element smaller than the pivot that is to the right of v (misplaced on the right) or to an element greater than the pivot that is to the left of v (misplaced on the left). The total number of subarrays that may have misplaced elements (M) is at most p.

The Case of F&A. In this case, a subarray corresponds to one block. The frontier separates a processed part from an unprocessed part. The processed part of left blocks is the left part, and the processed part of right blocks is the right part. Though v is unknown after the parallel phase, it holds that v is in some range V = [v_beg, v_end]. A misplaced element corresponds either to a processed element that is in V or to an unprocessed element that is not in V.

We consider now the array of elements made by left blocks and the array of elements made by right blocks. We denote by β the global border that separates the left and right blocks (i.e., the point where no more blocks could be obtained). In left blocks, a misplaced element on the left is an unprocessed element in a smaller position than v_beg, and a misplaced element on the right is an element smaller than the pivot in a position greater than or equal to v_beg. Let μ_l be the total number of misplaced elements in left blocks, and rank those misplaced elements counting from the leftmost towards the right. In right blocks, a misplaced element on the left is an element greater than the pivot in a position smaller than or equal to v_end, and a misplaced element on the right is an unprocessed element in a position greater than v_end. Let μ_r be the total number of misplaced elements in right blocks, and rank those misplaced elements counting from the rightmost towards the left. Thus, m = μ_l + μ_r. Besides, M is at most 2p: there are at most p blocks that may contain unprocessed elements and at most p blocks that may contain misplaced processed elements.

8.2.1 The data structure

We use a complete binary tree with M leaves (or the next power of two if M is not a power of two) to know which pairs of elements must be swapped. This tree is shared by all processors and is stored in an array (like a heap, which provides easy and efficient access to the nodes). Each leaf stores information about the i-th subarray: specifically, how many elements are misplaced to the left and to the right of its frontier (m_l^i and m_r^i) and how many elements are to the left and to the right of its frontier (n_l^i and n_r^i). The internal nodes accumulate the information of their children but do not add any new information. In particular, the root stores the information of the array made of all the subarrays in the leaves. So, our tree data structure can be considered as a special kind of order-statistics tree in which the internal nodes have no information by themselves. An order-statistics tree (see e.g., [19, Section 14]) performs rank operations efficiently using the information on the size of the subtrees. Figure 8.1 shows two instances of our tree data structure.
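A sketch of the heap-style layout and of a rank query on such a tree (Node, CleanupTree, and the field names are illustrative):

    #include <cstddef>
    #include <vector>

    struct Node {
        std::size_t nl = 0, nr = 0;  // elements left/right of the frontier
        std::size_t ml = 0, mr = 0;  // misplaced elements left/right of it
    };

    struct CleanupTree {
        std::size_t leaves;          // power of two, >= M
        std::vector<Node> t;         // heap layout: node i has children 2i+1, 2i+2

        explicit CleanupTree(std::size_t m) : leaves(1) {
            while (leaves < m) leaves *= 2;
            t.resize(2 * leaves - 1);       // leaves occupy t[leaves-1 ..]
        }
        // Locate the leaf holding the misplaced-left element of (0-based) rank;
        // on return, rank holds the element's rank within that leaf.
        std::size_t find_left_misplaced(std::size_t& rank) const {
            std::size_t i = 0;
            while (i < leaves - 1) {        // descend until a leaf is reached
                std::size_t l = 2 * i + 1;
                if (rank < t[l].ml) i = l;  // target is in the left subtree
                else { rank -= t[l].ml; i = l + 1; }
            }
            return i;
        }
    };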

8.2.2 The algorithm

Tree initialization phase. In this phase, the tree is initialized. Specifically, two bottom-up traversals of the tree are needed. Only the first initialization of the leaves depends on the partition algorithm used in the main parallel phase.

1. First initialization of the leaves. In the case of BLOCKED, the leaf values n_l^i and n_r^i for each subarray i can be trivially computed during the parallel phase. In the case of F&A, the left (right) blocks that contain unprocessed elements can be easily known after the parallel phase. The left (right) blocks that contain misplaced elements but have already been processed can only be located between the left (right) unprocessed blocks. In order to locate the latter efficiently, we sort the unneutralized blocks with respect to their block position in the array. Then, we iterate on left (right) blocks (sequentially) to the left (right) of the border β until p neutralized blocks have been found, or the leftmost (rightmost) unneutralized block has been reached.

2. First initialization of the non-leaves. Using a parallel reduce operation, each internal node computes its n_l^j and n_r^j values from its children. As a result, the root stores the number of left and right elements in the whole array. Thus, the splitting position v can be directly deduced.

3. Second initialization of the leaves. The leaves get the values m_l^i and m_r^i using n_l^i, n_r^i, and v. At this point, it may turn out that some subarrays have no misplaced elements. This fact does not disturb the correctness of our algorithm.

4. Second initialization of the non-leaves. The numbers of misplaced elements for the internal nodes are computed using a second parallel reduce operation on the m_l^j and m_r^j fields.

Parallel swapping phase. In this phase, our tree data structure is queried so that the misplaced elements can be swapped in parallel and no comparisons are needed. This phase is independent of the specific partitioning algorithm.

The total number of misplaced elements is used to distribute the work equally among the processors. A range [r_i, s_i) of ranks of misplaced elements to swap is assigned to each processor. The elements are swapped in ascending rank. Specifically, the j-th misplaced left element is swapped with the j-th misplaced right element. To locate the first pair of elements to swap, respective rank queries are made to the tree. That is, a query is made for the r_i-th left misplaced element and another for the r_i-th right misplaced element. Misplaced elements are swapped as long as the rank s_i is not reached. If the rank s_i has not yet been reached but the current subarray has no more misplaced elements, the next subarray is fetched. Let c_i be the position in the tree corresponding to the current subarray. Then, the next subarray of left misplaced elements is in c_i + 1 and the next subarray of right misplaced elements is in c_i − 1.

Figure 8.1: Example of our data structure.

Completion phase. This phase depends on the specific partitioning algorithm. In the case of BLOCKED, the whole array has already been partitioned and we are done. In the case of F&A, some unprocessed elements may remain. When this happens, V is not empty and contains exactly the unprocessed elements. In order to obtain a valid partition, we apply our parallel partitioning algorithm recursively to V, at most log p times. Specifically, the block size is divided by 2 in each recursive call and, as a result, the size of the problem in each recursive call is also at most half the previous one (because at least p blocks have been fully processed). Finally, sequential partition is applied to the remaining elements.

8.2.3 Cost analysis

In this section, we analyze the cost of our algorithms. The following theorem shows that our cleanup phase achieves an optimal number of comparisons:

Theorem 8.2.1. BLOCKED and F&A perform exactly n comparisons when using our cleanup algorithm.

Proof. The tree initialization and the parallel swapping phases perform no comparisons, for both BLOCKED and F&A.

In the case of BLOCKED, the completion phase is empty. Therefore, no comparisons are performed during cleanup for BLOCKED and thus, comparisons are only performed during the main parallel phase, which makes exactly n.

In the case of F&A, after the first main parallel phase, n − |V| elements have been compared and |V| have remained unprocessed. In the next recursive step, V is the input. Besides, elements can only be compared during a certain parallel phase, and at most once. All the elements must eventually be compared because our algorithm produces a valid partition. Thus, our cleanup algorithm makes exactly |V| comparisons, and n comparisons are made as a whole.

The following results bound the parallel time of BLOCKED and F&A:

Lemma 8.2.2. The tree initialization phase takes Θ(log p) parallel time for BLOCKED and F&A.

Proof. The algorithm-independent part takes Θ(log p) parallel time because all the work is done in parallel, and is dominated by the two parallel reduces, which can be performed in logarithmic parallel time (see e.g., [57]).

In the case of BLOCKED, the algorithm-dependent part takes constant parallel time because each leaf can be initialized trivially and in parallel.

In the case of F&A, the algorithm-dependent part takes Θ(log p) parallel time, because 2p elements are sorted and this takes Θ(log p) parallel time using p processors (see e.g., [52]).

                        BLOCKED                              F&A
                original            tree            original              tree
COMPS  main     n                   n               n − |V|               n − |V|
       cleanup  v_max − v_min       0               ≤ 2bp                 |V|
       total    n + v_max − v_min   n               ≤ n + 2bp             n
SWAPS  main     ≤ n/2               ≤ n/2           ≤ (n − |V|)/2         ≤ (n − |V|)/2
       cleanup  m/2                 m/2             ≤ 2bp                 m/2 + |V|
       total    ≤ (n + m)/2         ≤ (n + m)/2     ≤ (n − |V|)/2 + 2bp   ≤ (n + m)/2 + |V|
PTIME  main     Θ(n/p)              Θ(n/p)          Θ(n/p)                Θ(n/p)
       cleanup  Θ(v_max − v_min)    Θ(m/p + log p)  Θ(b log p)            Θ(log² p + b)
       total    O(n)                Θ(n/p + log p)  Θ(n/p + b log p)      Θ(n/p + log² p)

Table 8.1: Summary of costs for BLOCKED and F&A algorithms.

Thus, in both cases, the total cost is Θ(log p) parallel time.

Lemma 8.2.3. The parallel swapping phase performs exactly m/2 swaps and requires Θ(m/p) parallel time. In particular, in the case of F&A, the swapping phase takes O(b) parallel time, where b denotes the block size.

Proof. There are m misplaced elements after the main parallel phase. The parallel swapping phase swaps pairs of misplaced elements so that their final position is not misplaced. Therefore, m/2 swap operations are needed. Besides, the pairs are evenly divided among the p processors. Thus, swapping all of them takes Θ(m/p) parallel time. In the case of F&A, m ≤ 2bp, thus parallel swapping takes O(b) parallel time.

Theorem 8.2.4. The cleanup phase takes Θ(m/p + log p) parallel time for BLOCKED, and the whole partition takes Θ(n/p + log p) parallel time.

Proof. From Lemmas 8.2.2 and 8.2.3 it follows that the cleanup phase takes Θ(m/p + log p) parallel time for BLOCKED. Given that m = O(n), the whole BLOCKED algorithm takes Θ(n/p + log p) parallel time on the average and in the worst case.

Theorem 8.2.5. Consider p ≤ b. The cleanup phase takes Θ(log² p + b) parallel time for F&A, and the whole partition takes Θ(n/p + log² p + b) parallel time.

Proof. F&A takes T(n, p) = Θ(n/p) + C(b, p), where C(b, p) is the cost of our cleanup algorithm. C(b, p) = b + log p + T′(b/2, log p) parallel time, and T′ is defined by the following recurrence:

\[
T'(\beta, i) =
\begin{cases}
O(3\beta + \log p) + T'(\beta/2,\, i-1) & \text{if } i > 1, \\
O(2\beta) & \text{otherwise.}
\end{cases}
\]

There are at most 2p blocks of β elements each at the beginning of each recursive step, and log p − 1 recursive steps are needed. Thus, C(b, p) = O(log² p + b) parallel time.

We would like to highlight that Theorem 8.2.5 improves previous theoretical bounds for F&A (provided log p ≤ b, which is of practical relevance).

Finally, Table 8.1 summarizes worst-case results for the BLOCKED and F&A algorithms.

8.3 Implementation

Our parallel algorithms are implemented using OpenMP. Besides, they follow the specification of STL partition, so that they could eventually be used instead of the sequential implementation.

Since no implementation of STRIDED was available, we have implemented it ourselves. We have also implemented our BLOCKED variant, which improves the cache performance of STRIDED. In particular, we have implemented enhanced iterators on top of them (sketched below) so that (sequential) STL partition can transparently be applied to the virtual sequences into which the input sequence is divided. We say virtual because (logically) contiguous elements in these resulting sequences are not contiguous in the input sequence.

For F&A, we have taken its implementation from MCSTL 0.7.3-beta, but we have also implemented our own version. In particular, our F&A implementation calls STL partition when falling back to the sequential case. Our implementation of F&A and the one in the MCSTL differ in the following: a) ours statically assigns the initial work and therefore avoids mutual exclusion here; b) ours does not use volatile variables, and critical regions are slightly simpler; and c) ours avoids redundant comparisons using some book-keeping. The implementation in the libstdc++ parallel mode, the successor of the MCSTL, improves locking mechanisms and some other low-level details, but the algorithm itself remains the same.

Furthermore, we have implemented our cleanup algorithm on top of the previous four algorithms.
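The core of such an enhanced iterator for BLOCKED is the mapping from a logical position within a piece to a physical array position; a sketch of just that mapping follows (blocked_index is an illustrative name; a full STL random access iterator would wrap it):

    #include <cstddef>

    // Piece i of p consists of the blocks i, i+p, i+2p, ... of b elements each.
    inline std::size_t blocked_index(std::size_t j,  // position within the piece
                                     std::size_t i,  // piece number, 0 <= i < p
                                     std::size_t p, std::size_t b) {
        std::size_t block = j / b;   // which block of the piece
        std::size_t off   = j % b;   // offset inside that block
        return (block * p + i) * b + off;
    }

With b = 1 this degenerates to j·p + i, i.e., the plain STRIDED layout.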

8.4 Experimental analysis

We have analyzed STRIDED, BLOCKED, and F&A, with and without our cleanup algorithm. The tests have been run on a machine with 4 GB of main memory and two sockets, each one with an Intel Xeon quad-core processor at 1.66 GHz with an L2 cache of 4 MB, shared among two cores. Thus, there are 8 cores in total. We have used the GCC 4.2.0 compiler with the -O3 optimization flag. All tests have been repeated 100 times; figures show averages (variance was small).

Basic evaluation. We have first analyzed the speedup of partitioning a large number of random integers in parallel. The speedup is always measured with respect to the sequential STL partition algorithm. The block size b has been set to 10^4 (this decision will be justified later in this section). Figure 8.2(c) shows the results. In this figure, Strided refers to our implementation of STRIDED, BlockedStrided refers to our implementation of BLOCKED, F&A_MCSTL refers to the MCSTL implementation of F&A, and F&A refers to our own implementation of F&A. We add the suffix _tree to the previous labels to refer to the algorithm with our modified parallel cleanup phase.

These results show that F&A is better than BLOCKED, which is better than STRIDED. Whereas the speedup of STRIDED is nonexistent for more than two threads, BLOCKED performs reasonably well, and our F&A implementation achieves somewhat better results than the MCSTL F&A. However, it makes almost no difference whether our cleanup step is used or not.

Figure 8.2: Performance results for parallel partition for integers, n = 10^8. The panels plot the speedup of Strided, BlockedStrided, F&A_MCSTL, and F&A, each with and without our tree-based cleanup: (a) costly comparison operator, b = 10^4, against the number of threads; (b) varying block size, with 8 threads; (c) b = 10^4, against the number of threads.

Only in the case of F&A is the achieved speedup slightly better, making it almost perfect for up to 4 threads.

The awful performance of STRIDED is due to its high cache miss ratio; its behavior clearly contrasts with BLOCKED (which uses blocks of elements rather than individual elements). On the other hand, the shown experimental results do not correspond to worst-case instances (see Section 8.1). Indeed, we have also constructed and evaluated ad hoc worst-case instances for STRIDED and BLOCKED. As expected, the running time is significantly worse than for sequential partition.

Finally, in order to understand the loss of performance when using more than 5 threads, we have devised two new experiments. The first one reproduces the previous experiment but uses a slower comparison function. Its results are shown in Figure 8.2(a). In this case, all algorithms show similar behavior and excellent speedups with up to 8 threads. Again, there is not much of a difference whether our cleanup phase is used or not. Our second experiment consisted of measuring the speedup of a trivial parallel program that computes the sum of two arrays. The resulting speedups (not shown) are also not optimal for the largest number of threads. So, we can conclude that memory bandwidth is limiting the efficiency of the partitioning algorithms, which are demanding with regard to memory I/O.

Influence of block size. Several algorithms rely on a block size parameter b. In order to determine its optimal value, we have run various tests, varying the number of threads and the value of b. The results in Figure 8.2(b) (where the number of threads is fixed to 8) show that, except for very small block sizes, the performance is not much affected. Besides, given that big block sizes are not convenient for smaller inputs, we have fixed b to 10^4.

Operations count. In order to analyze the behavior of the cleanup phase in more detail, we have counted swap and comparison operations. Figures 8.3(b) and 8.3(c) show, respectively, the number of extra comparisons and swaps with respect to the sequential implementation. They are depicted divided by the block size.

Figure 8.3(b) gives an experimental confirmation of Theorem 8.2.1. Besides, combining our cleanup algorithm with the original MCSTL algorithm does not achieve optimality in the number of comparisons, because this implementation makes extra comparisons whenever a new block is fetched in the main parallel phase. Specifically, our experiments show that two comparisons are repeated per block on average.

Figure 8.3(c) shows that our cleanup algorithm does not need more swaps than the original cleanup algorithms; essentially, the same number of extra swaps is needed. In the case of F&A, we could not establish such an equality analytically.

As a by-product, these results show that the number of misplaced elements resulting from the parallel phase (given uniformly random inputs) is really small, no matter the partitioning algorithm. In particular, STRIDED is the algorithm that performs the fewest extra operations. However, its performance is the worst because of its bad cache usage.

Application: Quickselect. Quicksort and quickselect are typical applications of partitioning. Since quicksort offers two (not exclusive) ways to be parallelized (parallelizing the partitioning and parallelizing the independent work by divide and conquer), we found it more interesting to analyze quickselect. Quickselect is a common implementation of STL nth_element.

Figure 8.3: Performance results and operations count for integers, n = 10^8, b = 10^4. The panels show, for Strided, BlockedStrided, F&A_MCSTL, and F&A, each with and without our tree-based cleanup, against the number of threads: (a) parallel quickselect speedup; (b) the number of extra comparisons in parallel partition (divided by the block size); (c) the number of extra swaps in parallel partition (divided by the block size).

We have implemented STL nth_element using the parallel partitioning algorithms in this chapter. Only if the array is small is sequential partitioning called. The results are shown in Figure 8.3(a). These are coherent with those of partition, but they differ because the relative behavior of the algorithms changes slightly with the input size. First, the advantage of our F&A implementation increases. Second, our cleanup algorithm increases the runtime of the F&A-based quickselect. Indeed, in our experiments we have observed that for a big number of threads and as the input gets smaller, using our cleanup algorithm with F&A is counterproductive. Figure 8.3(a) also shows that the simple BLOCKED algorithm performs quite well.

8.5 Conclusions and future work

In this chapter, we have presented, implemented, and evaluated several parallel partitioning algorithms suitable for multi-core architectures.

From an algorithmic point of view, we have described a novel parallel cleanup algorithm that does not disregard comparisons made during the parallel phase. This cleanup has been successfully applied to three partitioning algorithms: STRIDED, BLOCKED (a cache-conscious implementation of the former), and F&A. In the case of STRIDED and BLOCKED, a benefit of our cleanup is reducing their worst-case parallel time from Θ(n) to Θ(n/p + log p). In the case of F&A, we have shown how to modify it to reduce its parallel time from Θ(n/p + b log p) to Θ(n/p + log² p). Unlike their original versions, these algorithms perform the minimal number of comparisons when using our cleanup phase.

From an engineering perspective, we have contributed carefully designed implementations of the aforementioned algorithms that are compliant with the specification of STL partition.

Finally, from an experimental point of view, we have conducted an evaluation to compare those algorithms and implementations. According to our experiments, the partitioning algorithm of choice is F&A, because it scales nicely. However, our results show that, in practice, the benefits of our cleanup algorithm are limited. This happens because the number of misplaced elements after the parallel phase is very small. Our experiments also show that I/O between the memory and the processor limits the performance achieved by parallel implementations as the number of threads increases. It remains to be further investigated how these results change if more cores and/or memory bandwidth are available.

Chapter 9

Combining digital access and parallel partition for quicksort and quickselect

Comparison-based data structures and algorithms, like quicksort and quickselect, can be specialized for strings to perform efficient digital comparisons. These techniques rely on the order and outcome of previous comparisons, as well as on the digital structure of strings. See Chapter 4 for further details.

In this chapter, we apply these existing techniques for strings on top of parallel algorithms. To the best of our knowledge, this is a novel application. Specifically, we describe the case of parallel partitioning-based algorithms, namely, parallel quicksort and quickselect. Parallelizing quicksort can be done in two (not exclusive) ways: parallelizing the partition and parallelizing the divide and conquer work. By contrast, parallelizing quickselect can only be done by parallelizing the partition. We reuse here the parallel partitioning algorithms of Chapter 8, whose properties regarding comparisons are crucial for the application of the aforementioned string techniques. The resulting algorithms are provided as specializations of the STL sort and nth_element algorithms for strings. Indeed, most common implementations of these STL algorithms are based on quicksort and quickselect, respectively. Besides, we draw an experimental comparison considering several string datasets. The results show the effectiveness of our approach in practice.

This chapter is organized as follows. In Section 9.1, we present our solution in detail, including some additional precautions while choosing the pivot and handling repeated elements. In Section 9.2 we present the experimental results for parallel quicksort and quickselect. Finally, we sum up some conclusions in Section 9.3.

9.1 Our solution

Combining efficient digital access to strings with partitioning-based algorithms relies on the following specific properties (see Section 4.1 for further details). First, each element is compared exactly once against the pivot during a partition. This fact is precisely guaranteed

by the parallel partitioning algorithms presented in Chapter 8. Second, the implicit structure defined by the order of the recursive calls corresponds to a BST. Since the parallel execution of recursive calls in the case of quicksort keeps up with the relative order in sequential quicksort, we can enhance string comparisons on top of the aforementioned parallel partitioning algorithms to build efficient parallel quicksort and quickselect implementations. In the following, we describe some of the most relevant implementation issues.

9.1.1 Basic implementation

We describe our implementation based on the parallel partitioning algorithms in Chapter 8. They are provided according to the specification of STL partition. In particular, their input consists of a random access input sequence, e.g., an array. See Section 2.4.1 for an overview on STL iterators and sequences.

Enhancing (parallel or sequential) partitioning with efficient digital access to strings requires storing some extra data per element. This data records the outcomes of previous comparisons, and it consists of the lengths of two prefixes. Besides, the comparison and swap algorithms must be specialized, so that this data is used and updated. See Section 4.1 for further details on these algorithms. Nonetheless, the specification of STL partition and the algorithms based on it, namely STL sort and nth_element, only considers providing a customized comparator. Given that the STL specification does not consider providing a customized swap functor, we need to provide a full replacement to specialize the parallel partitioning algorithms for strings. Similarly, every call to sequential STL partition must be replaced by a call to a specialized sequential partitioning algorithm for strings. See Section 8.3 for a detailed explanation of when falling back to sequential partitioning is required.

Furthermore, in order to provide a solution that is as little intrusive as possible (minimal changes in existing code), we store the prefixes apart from the data itself. Let A denote the array in which the prefixes are stored. Note that A must be created once at the beginning of parallel quicksort and quickselect, and it can be initialized in parallel. Then, A is accessed and updated by the specialized swap and comparison algorithms, which in our case have random access iterators as input parameters. Note that the relative position of an element in the sequence can be computed in constant time, given its random access iterator. In addition, we encapsulate the specialized comparison and swap algorithms, together with the pivot and the rest of the attributes, in a class that is passed as an input parameter to the partitioning algorithms (a sketch is given below). Thus, our approach provides a flexible framework to deal with different implementations of the comparison and swap algorithms. In particular, one could define a trivial implementation of them that considers keys as atomic.
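A simplified sketch of such a class follows. The actual technique of Chapter 4 stores two prefix lengths per element; here a single cached length, assumed to count characters already known to match the pivot, illustrates the mechanics (resuming a comparison where it left off, and swapping the cached data alongside the elements). StringPartitioner and its members are illustrative names:

    #include <cstddef>
    #include <utility>
    #include <vector>

    struct StringPartitioner {
        const char*  pivot;           // current pivot string
        const char** base;            // start of the array of strings
        std::vector<std::size_t>& A;  // cached prefix lengths, one per element

        // Compare *it against the pivot, resuming after the A[pos] characters
        // assumed to be already known to match the pivot.
        bool less_than_pivot(const char** it) {
            std::size_t pos = static_cast<std::size_t>(it - base);
            std::size_t k = A[pos];
            const char* s = *it;
            while (s[k] != '\0' && s[k] == pivot[k]) ++k;  // extend known prefix
            A[pos] = k;                                    // remember the outcome
            return s[k] < pivot[k];
        }
        // Swapping two elements must also swap their cached prefix data.
        void swap_elems(const char** x, const char** y) {
            std::swap(*x, *y);
            std::swap(A[x - base], A[y - base]);
        }
    };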

9.1.2 Additional precautions

Many implementations of sequential and parallel quicksort and quickselect exist. Crucial aspects in their performance are the pivot choice, and also how repeated elements are handled by the sequential partitioning algorithm. See e.g., [8, 19] for an overview of possible approaches. In the following, we consider the main precautions that must be taken when dealing with specialized partitioning algorithms for strings.

The main issue arises when choosing the pivot for partitioning: if elements need to be compared in the process (e.g., using the median of 3), these comparisons must keep the stored prefixes consistent. One possibility is to provide an additional comparison algorithm that does use the prefix information but does not update it. Alternatively, given that in some cases we are not that interested in the exact result, we can merely rely on the prefix information to make a decision (to the detriment of the quality of the partitioning). We call the latter technique approximate comparison. Similarly, we must not compare the pivot against itself, to avoid corrupting the prefix information. A typical approach to avoid this self-comparison is to place the pivot at an endpoint of the input.

With respect to handling repeated elements, popular approaches, such as always swapping repeated elements and 3-way partitioning (see e.g., [8]), guarantee that each element is compared exactly once in each partitioning, and that the implicit structure defined by the order of recursive calls corresponds to a BST.

9.2 Experimental analysis

In the following, we evaluate in practice our solution to enhance parallel partitioning-based algorithms with fast string comparisons. Specifically, we present performance results for parallel quicksort and quickselect. The tests have been run on a machine with 4 GB of main memory and two sockets, each one with an Intel Xeon quad-core processor at 1.66 GHz with an L2 cache of 4 MB, shared among two cores. Thus, there are 8 cores in total. We have used the GCC 4.2.0 compiler with -O3 optimization. All tests have been repeated 20 times; figures show averages.

This section is organized as follows. First, we present our implementation. Then, we describe the tested datasets, followed by an overview of sequential results. Finally, we discuss parallel results.

9.2.1 Tested implementation

We have specialized the parallel and sequential partitioning algorithms in a generic way with respect to the specific string type. In particular, our implementations can handle C-like strings, namely char*, and standard C++ strings. In this sense, applying our approach on top of C++ strings has less relative overhead in terms of the additional space needed. However, we present our results only for char* because most existing experimental studies and implementations are keyed to char*. Among these are the implementations in [20] on enhancing search trees with fast string comparisons, and burstsort (see e.g., [88]), a cache-efficient (burst)trie-based sorting algorithm.

The implementation is based on that of the parallel partitioning algorithms in Chapter 8. There, OpenMP is used for parallelization, and we stick to it. In particular, we have implemented a simple static distribution to parallelize the independent work in quicksort. Anyway, any further enhancement that does not involve comparisons, such as load-balancing techniques (see e.g., [86, 97]), is perfectly compatible. It is not our aim here to analyze how quicksort can best be parallelized. Besides, we have implemented sequential 3-way partitioning.

[Plots: speedup (with respect to atomic keys) vs. string length l, for the rand, prefix, lprefix and url datasets; (a) n = 10^6, (b) n = 10^7.]

Figure 9.1: Performance results for sequential quickselect with enhanced string comparisons.

9.2.2 Datasets

We consider string datasets that depend on several parameters (such as string length and alphabet size) and on the distribution of the string instances. We have run experiments with synthetic and real datasets. In all cases, once the data is generated, it is shuffled so that any accidental memory locality arising from the generation is broken. With respect to synthetic datasets, we have considered the following parameter combinations:

- string length (denoted by l): 10, 100, 1000
- alphabet: binary, A-Z (ASCII)
- distribution: uniformly random (denoted by rand), common prefix of uniformly random length (denoted by prefix), long prefix of fixed size (denoted by lprefix). In the case of prefix and lprefix the strings are made of two parts: a prefix of length u containing only one distinct character, and a suffix of length l - u generated randomly. In the case of prefix, u is chosen randomly; in the case of lprefix, u is the same for all strings (see the generator sketch after this list).
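The following is a minimal sketch of how such synthetic datasets could be generated (make_dataset and Dist are hypothetical names; this is not the thesis test harness):

    #include <algorithm>
    #include <random>
    #include <string>
    #include <vector>

    // n strings of length l over the given alphabet; for prefix/lprefix,
    // the first u characters are a single distinct character (here,
    // alphabet[0]) and the remaining l - u characters are random.
    enum class Dist { Rand, Prefix, LPrefix };

    std::vector<std::string> make_dataset(std::size_t n, std::size_t l,
                                          const std::string& alphabet,
                                          Dist dist, std::size_t fixed_u,
                                          std::mt19937& rng) {
        std::uniform_int_distribution<std::size_t> pick(0, alphabet.size() - 1);
        std::uniform_int_distribution<std::size_t> ulen(0, l);
        std::vector<std::string> data;
        data.reserve(n);
        for (std::size_t i = 0; i < n; ++i) {
            std::size_t u = 0;                        // constant-prefix length
            if (dist == Dist::Prefix)  u = ulen(rng); // random per string
            if (dist == Dist::LPrefix) u = fixed_u;   // same for all strings
            std::string s(u, alphabet[0]);
            for (std::size_t j = u; j < l; ++j)
                s += alphabet[pick(rng)];             // random suffix
            data.push_back(std::move(s));
        }
        // Shuffle so that accidental locality from generation is broken.
        std::shuffle(data.begin(), data.end(), rng);
        return data;
    }

For instance, make_dataset(1000000, 100, "ABCDEFGHIJKLMNOPQRSTUVWXYZ", Dist::Prefix, 0, rng) would produce 10^6 strings of length 100 following the prefix distribution.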

With respect to real datasets, we have considered the ones in [88] for burstsort. Among them, we focus on a URL dataset (denoted by url) because its average string length is the highest. Recall from Section 2.3 that enhancing digital access on top of comparison-based algorithms is especially useful for long strings and prefixes. Finally, we have considered input sizes of 1 and 10 million elements. In the latter case, the longest string length considered is 100 because, otherwise, the dataset does not fit in main memory (in the considered experimental setting).

[Plots: speedup (with respect to atomic keys) vs. string length l, for the rand, prefix, lprefix and url datasets; (a) n = 10^6, (b) n = 10^7.]

Figure 9.2: Performance results for sequential quicksort with enhanced digital comparisons.

9.2.3 Sequential performance

Figures 9.1 and 9.2 show, respectively, some performance results on sequential quickselect and quicksort for strings with enhanced digital comparisons. The results are shown as speedups with respect to the generic sequential quicksort and quickselect implementations in which keys are considered atomic. We consider several combinations of string length and number of elements. Specifically, we show results for words in the A-Z alphabet generated following the rand, prefix and lprefix distributions. Moreover, we show the results together with those for the url dataset (the assigned string length is arbitrary, just for plotting the results together). Finally, the pivot is selected using approximate comparisons (no significant performance differences were observed with respect to choosing the pivot using exact comparisons). In the case of quickselect (Figure 9.1), enhancing digital comparisons hurts performance if common prefixes are scarce (i.e., for random strings). Instead, in the case of quicksort (Figure 9.2), applying this technique is always beneficial. As we have seen in Section 4.1, the first partitioning always does more work than regular partitioning, and it is as we go deeper into the recursion that the information gathered from comparisons becomes really useful. In the case of quickselect, only part of the work done in one partitioning is used later, and so the overhead of the first recursive calls is more noticeable. Logically, the longer the common prefixes, the better the performance with enhanced digital comparisons. Also, the results for binary alphabets are generally better, because common prefixes are much more likely to arise when making random choices.

9.2.4 Parallel performance

In the following, we describe performance results for parallel quickselect (Figures 9.3(a) to 9.4(c)) and parallel quicksort (Figures 9.5(a) and 9.5(b)). The results are shown both for the generic versions and for the versions with enhanced digital comparisons. Besides, we analyze the effect on parallel performance of several combinations of string length and number of elements.

[Plots: speedup vs. number of threads (1 to 8), for the rand, prefix, lprefix and url datasets; (a) n = 10^6, l = 100; (b) n = 10^6, l = 1000; (c) n = 10^7, l = 100.]

Figure 9.3: Performance results for generic parallel quickselect.

[Plots: speedup vs. number of threads (1 to 8), for the rand, prefix, lprefix and url datasets; (a) n = 10^6, l = 100; (b) n = 10^6, l = 1000; (c) n = 10^7, l = 100.]

Figure 9.4: Performance results for parallel quickselect with enhanced digital comparisons.

We focus on parallel quickselect results because there the only source of parallelism is partitioning. In all cases, the results are shown as speedups with respect to the respective sequential counterpart. The words are generated in the A-Z alphabet following the rand, prefix and lprefix distributions. Moreover, we show the results together with those for the url dataset (the assigned string length is arbitrary, just for plotting the results together). Finally, the results are shown for the F&A partitioning algorithm. We also ran experiments for the BLOCKED partitioning algorithm; we do not discuss them here because they are very similar in relative terms, although slightly worse speedups are obtained. For a fixed set of string parameters, the greatest speedups are obtained when enhancing digital comparisons (compare Figures 9.3(a), 9.3(b), 9.3(c) and 9.5(a), which consider keys as atomic, against Figures 9.4(a), 9.4(b), 9.4(c) and 9.5(b), which consider digital keys, respectively). Therefore, enhancing parallel partitioning-based algorithms with fast digital comparisons not only does not hurt scalability, but generally improves it. Comparing our parallel quicksort and quickselect implementations, quickselect achieves higher parallelism. However, recall that our parallel quicksort distributes the divide-and-conquer work statically, which is definitely not an optimal strategy. Nonetheless, the absolute speedups for quickselect are worse than those obtained in Chapter 8 for integers. First of all, the number of tested elements is smaller (because otherwise the data does not fit in main memory) and thus parallelism is not so effective. Indeed, comparing performance results for quickselect (Figures 9.3(a) against 9.3(c), or 9.4(a) against 9.4(c)), we can see that for a fixed string length, the speedups improve as the number of elements grows. This is also the case for quicksort. On the other hand, partitioning strings is more memory intensive than partitioning integers, because pointers to arbitrarily far locations must be followed. Indeed, the best speedups, both for the generic versions and when enhancing fast digital comparisons, are obtained for the longest common prefixes, because the cost of arithmetic operations (comparing characters and comparing prefix lengths, respectively) is more balanced with the cost of memory accesses. This is particularly true for quickselect (see Figures 9.3(b) and 9.4(b)).

9.3 Conclusions and future work

In this chapter, we have shown how existing techniques to enhance string comparisons on top of comparison-based algorithms can be applied in combination with parallel algorithms. Specifically, we have described how to enhance parallel partitioning-based algorithms, namely quicksort and quickselect. This novel application of string techniques is independent of parallelism itself, as long as the properties on the relative order of comparisons are kept. In particular, in the case of parallel quicksort and quickselect, the main issue is that each element must be compared at most once in each partition. This can be achieved relying on our parallel partitioning algorithms presented in Chapter 8. From an engineering perspective, we have described how the C++ design and implementation of these parallel partitioning algorithms can be refined to provide a non-intrusive specialization for strings. In particular, we have considered both standard string and (C-like) char*. Besides, we have discussed additional precautions that must be taken, for instance, when choosing the pivot.

[Plots: speedup vs. number of threads (1 to 8), for the rand, prefix, lprefix and url datasets; (a) generic, (b) with enhanced digital comparisons.]

Figure 9.5: Performance results for parallel quicksort (n = 10^7, l = 100).

Finally, we have presented some experimental results that indicate the practicability of this approach. Indeed, the scalability of parallel quicksort and quickselect for strings is generally better when combined with efficient digital comparisons. However, the absolute speedup results are far from optimal. As a next step, our parallel partitioning algorithms enhanced for strings could be used together with highly-tuned and tested implementations of parallel quicksort and quickselect. We expect that doing so would not damage the scalability of the original algorithms. In particular, it could be interesting to integrate our parallel partitioning algorithms into some existing parallel implementations of the STL. However, to the best of our knowledge, no such string specialization exists for a sequential implementation, and so that should be done in the first place. Furthermore, the experimental work in this chapter could be extended by adding redundancy in the parallel partitioning algorithms so as to improve cache efficiency (see Section 4.5). Indeed, our implementations already support this feature. As further future work, the techniques for fast string comparisons could be combined with other parallel comparison-based algorithms and data structures. That is interesting from a practical perspective, because many parallelizations of comparison-based algorithms exist, while they are scarce for ad hoc algorithms. For instance, our work in Chapter 6 for parallel bulk operations in red-black trees could be specialized for fast string comparisons. The parallel algorithms should be modified as follows. The construction algorithm should compute the common prefix of each string key with its ancestors in the search path and store the result (recall that the nodes are stored in an array, and so the ancestors can be located in constant time by index computations); a sketch of this computation is given below. The common prefixes for an element can be computed most efficiently by also using this information from the ancestors. However, in a parallel context, that can only be done safely if both nodes are computed by the same thread. Besides, the rest of the modifying operations, including split and union, should be specialized to use and update the associated information regarding common prefixes.
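The following is a hypothetical sketch of that construction step (build, lcp_len and prefix are illustrative names; the incremental reuse of ancestor information mentioned above is omitted, so the two recursive calls are independent and could be assigned to different threads). Over a sorted array of keys, the nearest smaller and greater ancestors of a node are plain array indices, and the longest common prefix of a key with its whole search path is the larger of its prefixes with those two ancestors:

    #include <algorithm>
    #include <string>
    #include <vector>

    // Length of the longest common prefix of two strings.
    static std::size_t lcp_len(const std::string& a, const std::string& b) {
        std::size_t i = 0, m = std::min(a.size(), b.size());
        while (i < m && a[i] == b[i]) ++i;
        return i;
    }

    // keys is sorted; prefix[mid] receives the lcp of keys[mid] with the
    // keys on its search path. anc_left / anc_right are the indices of the
    // nearest smaller / greater ancestor of the subtree over [lo, hi]
    // (-1 if none).
    void build(const std::vector<std::string>& keys,
               std::vector<std::size_t>& prefix,
               long lo, long hi, long anc_left, long anc_right) {
        if (lo > hi) return;
        long mid = lo + (hi - lo) / 2;   // node index, computable in O(1)
        std::size_t l = anc_left  >= 0 ? lcp_len(keys[mid], keys[anc_left])  : 0;
        std::size_t r = anc_right >= 0 ? lcp_len(keys[mid], keys[anc_right]) : 0;
        prefix[mid] = std::max(l, r);
        build(keys, prefix, lo, mid - 1, anc_left, mid);   // mid: greater ancestor
        build(keys, prefix, mid + 1, hi, mid, anc_right);  // mid: smaller ancestor
    }
    // Initial call: build(keys, prefix, 0, (long)keys.size() - 1, -1, -1);

Since the two recursive calls write disjoint slots of prefix and only read keys, independent subtrees can safely be processed by different threads.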

Chapter 10

Conclusions

This thesis has studied several problems that arise from combining theory and practice in algorithmics, in particular when considering requirements in software libraries. The well-known Standard Template Library (STL) of the C++ programming language has been the common thread of this research. The main targets have been sequences, dictionaries, and the sorting and selection problems. Making improvements in such fundamental problems and in such a well-established library as the STL was a challenge. This goal has been achieved through a synergy of computational techniques and research approaches. Specifically, I have combined techniques from the analysis and design of algorithms and data structures, high performance computing, experimental algorithmics, and algorithm engineering.1 Besides, in all cases, I provide an implementation (Appendix A gives the exact links). With respect to the approaches, I have taken advantage of the particularities of string keys and the capabilities of modern computers, namely memory hierarchies and parallel cores. In the following, whilst summing up the contributions of this thesis, I highlight some of the future research lines and difficulties deriving from them.

Most of the contributions in this thesis concern generic data structures and algorithms that take advantage of hardware capabilities. First, three cache-conscious implementations of STL lists have been presented. In contrast to conventional cache-conscious approaches, these lists fully support iterator functionalities while providing worst-case guarantees about their cache efficiency. According to the experimental results, my implementations are much faster than a doubly linked list implementation, in particular for traversals. Obtaining big performance improvements for such a basic component as list was particularly rewarding. I want to stress that the actual implementation was rather involved, even requiring attention to low-level details. In particular, if iterator operations are not properly inlined by the compiler, any gain is foreclosed. Second, a parallel implementation of bulk operations in STL dictionaries practical for multi-core computers has been presented. This has been done by extending the GCC red-black tree implementation. The proposed parallel algorithms take advantage of constant-cost index computations to access the elements, as in algorithms described for the PRAM model, and of split and union operations in red-black trees, as well as of load-balancing techniques.

1 Throughout this document I have used an impersonal style; namely, the "we" pronoun should be understood as "the reader and me" or "my collaborators and me". By contrast, in this chapter I rather use "I" to emphasize my personal perspective on the work done in this thesis.


The experiments have shown that great speedups can be obtained. Nonetheless, the benefits are limited by suboptimal allocator performance (memory allocators have been considered as black boxes in this thesis). From my point of view, the explosion of practical parallel data structures requires effective and flexible parallel allocators to be available and integrated.

Third, an elegant, yet powerful, algorithm to partition linearly traversable sequences of unknown size into even-sized independent pieces has been presented. This problem arises, for instance, in preprocessing the input for the parallelization of STL algorithms (most of them require only forward access to the elements). In particular, its study was motivated by its application in parallelizing the aforementioned bulk operations for dictionaries. The proposed algorithm is online (i.e., it only needs one pass over the data) and uses sublinear additional space. Besides, experimental evidence of its practicability for parallelization has been provided. In addition, this algorithm has been included in the libstdc++ parallel mode, which is a parallel implementation in the GCC compiler of many STL algorithms.

Furthermore, new variants of algorithms that exploit the reuse of computations have been devised. Specifically, the first practical parallel partitioning algorithms for multi-core computers that perform an optimal number of comparisons, i.e., one per element, have been presented. That is, all the previously existing practical implementations needed to compare some elements twice. The resulting implementations can be used to parallelize STL partition. In turn, parallel quicksort and quickselect based on these parallel partitioning algorithms can be used to parallelize STL sort and nth element, respectively. However, when considering keys as atomic, no significant performance gains were obtained with respect to other implementations. By contrast, when considering string keys and their digital structure, these new algorithms can be neatly combined with existing techniques to avoid redundant character comparisons for strings. Indeed, it was especially gratifying to ascertain the feasibility of this novel combination, namely parallel algorithms and techniques to enhance digital comparisons for strings on top of comparison-based algorithms. Besides, these implementations are amenable to practical use. In particular, great speedups have been shown for parallel quicksort and quickselect. Overall, I consider that this is a powerful approach because there is a large body of parallel comparison-based algorithms, in contrast to the case of parallel ad hoc algorithms for strings.

In addition, so-enhanced comparison-based algorithms and data structures for strings have been considered from a cache-conscious perspective. The motivating observation was that the outcome of some comparisons can be determined by merely using information from other comparisons. Thus, under the common assumption of a string implementation through pointers, costly memory accesses are saved. The number of string lookups in ABSTs (that is, in so-enhanced BSTs for strings) has been analyzed by relating string lookups to known properties of tries. Besides, the benefits of adding redundancy to avoid one common case of string lookups have been analytically shown. Finally, the analysis has been extended to so-enhanced quicksort and quickselect for strings.
Once the correspondence between the properties of TSTs and ABSTs was established, obtaining the results was rather straightforward because existing results for TSTs could be reused. A priori, though, the analysis seemed rather involved. In addition, other authors have shown the practicability of so-enhanced red-black trees for strings (in particular, adding redundancy) to specialize STL dictionaries. Note that, in contrast, tries and the like cannot keep up with the requirements of STL dictionaries.

Finally, multikey quickselect, the analogue of multikey quicksort for the selection problem, has been presented, analyzed and implemented. In particular, it can be used to specialize STL nth element. The proposed algorithm is rather straightforward. In fact, it was rather surprising to find out that it had not been formally described before. Nonetheless, there were some decisions to be made which only arose when carefully defining and analyzing the algorithm. In particular, a variant has been proposed that uses binary partitioning instead of ternary partitioning. For all variants, a detailed analysis of the number of comparisons and swaps has been provided. Besides, enhanced algorithms have been described and analyzed that avoid a source of useless comparisons and swaps by using information from the alphabet. In addition, analogous variants can be obtained for multikey quicksort, for which implementations have also been provided.

From my point of view, an especially interesting target that arises from this thesis is taking advantage of the particularities of common data types, like strings, to obtain more efficient STL components. To the best of our knowledge, no widely known STL (or similar) implementation has specialized components for the string datatype. In fact, most efficient implementations of string algorithms and data structures in the literature are only provided for C-like char* strings and/or cannot cope with STL requirements. By contrast, generic string implementations have been supplied in this thesis. In particular, some of them take advantage of multi-core computers and memory hierarchies. Also, efficient string selection algorithms have been proposed, to which not much attention had been paid before. Still, some further work needs to be done to make the proposed implementations completely generic and STL compliant (e.g., considering character traits). Besides, in the forthcoming new C++ Standard, new character types for improved Unicode support are going to be introduced. Thus, it will be relevant, from a practical perspective, to reconsider existing ad hoc algorithms and data structures under the assumption of a big alphabet cardinality. In turn, this can lead to interesting theoretical problems.

This thesis has also raised some of the issues that hamper algorithmic research in the context of data structures libraries. First, some of the requirements in data structures libraries are very tied to a specific implementation. In some cases, proposing alternatives that keep up with all the requirements is not feasible. For instance, the definition of STL dictionaries is very tied to red-black tree properties. In this sense, I definitely favor more flexible libraries, in which a smaller set of tight requirements is stated component-wise and, besides, where mechanisms to choose among several alternatives are provided (as happens in LEDA or with STL container adaptors). In other cases, the limitations come from the language (or the associated tools) itself. For instance, memory allocators do not allow an asymmetric usage that would help in parallelization and cache efficiency. Additionally, OpenMP, which has been used to implement the parallel algorithms in this thesis, does not allow throwing exceptions over parallel region boundaries. Another issue is the lack of statistical data on the usage of the STL. We can easily argue that the STL is widely used. In particular, several other research projects take the STL as a focus (CPH-STL, STXXL, MCSTL...).
Still, I could not find any data on the actual usage of the components, i.e., how frequently each method is used, which methods are used in combination, etc. This piece of information would help in the design of more efficient data structures and algorithms, and would strengthen some of the possible decisions.

Overall, this thesis has aimed at narrowing the gap between theory and practice in algorithmics, providing new efficient algorithms and data structures in the context of the STL. My approaches have been diverse and evolving, also guided by fast and radical changes in the available technology. Namely, at the starting point of this thesis, parallel computation was not much of an issue in everyday programming. Nowadays, with multi-core computers all around, there is no doubt about the need for parallel computation to obtain more efficient programs, and about the importance of tools and libraries that ease this task.

Appendix A

Implementations

All the algorithms and data structures in this thesis have been implemented in C++ and in the context of the STL. Links to every implementation can be found through my webpage at the Departament de Llenguatges i Sistemes Informàtics of the Universitat Politècnica de Catalunya (www.lsi.upc.edu/~lfrias). The source code used for experimentation is also included. Typically, the latter consists of a test program in C++, plus several Perl scripts to run the experiments and plot the obtained results. In order to guarantee the availability and persistence of the code after the end of this thesis, all the implementations that are not hosted anywhere else will also be hosted on SourceForge. SourceForge.net is the world's largest open source software development web site and eases external collaboration. The following table gives the exact links for each implementation.

Contribution                                   Link to implementation
Cache-conscious STL list                       http://sourceforge.net/projects/cachelists
Parallelization of bulk operations             http://algo2.iti.kit.edu/singler/mcstl
  for STL dictionaries                           (0.8.0-beta version)
Single-pass list partitioning                  http://gcc.gnu.org (4.3 version on)
Generic parallel partition                     http://sourceforge.net/projects/parpartition
CAQSORT and CAQSEL (sequential and parallel)   http://sourceforge.net/projects/stringbsts
Multikey quicksort and multikey quickselect    http://sourceforge.net/projects/mkqsel

Bibliography

[1] D. Abrahams. Exception-safety in generic components. In Generic Programming, International Seminar on Generic Programming, Dagstuhl Castle, Germany, April 27 - May 1, 1998, Selected Papers, volume 1766 of Lecture Notes in Computer Science, pages 69–79, Berlin, Heidelberg, 2000. Springer.

[2] A. Aggarwal and J. S. Vitter. The input/output complexity of sorting and related problems. Communications of the ACM, 31(9):1116–1127, 1988.

[3] A. V. Aho, J. E. Hopcroft, and J. D. Ullman. The Design and Analysis of Computer Algorithms. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA, 1974.

[4] P. An, A. Jula, S. Rus, S. Saunders, T. Smith, G. Tanase, N. Thomas, N. M. Amato, and L. Rauchwerger. STAPL: An Adaptive, Generic Parallel C++ Library. In Languages and Compilers for Parallel Computing, 14th International Workshop, LCPC 2001, Cumberland Falls, KY, USA, August 1-3, 2001. Revised Papers, volume 2624 of Lecture Notes in Computer Science, pages 193–208, Berlin, Heidelberg, 2003. Springer.

[5] A. Andersson and S. Nilsson. Implementing radixsort. Journal on Experimental Algorithmics, 3(7), 1998.

[6] M. A. Bender, E. D. Demaine, and M. Farach-Colton. Cache-oblivious B-trees. SIAM Journal on Computing, 35(2):341–358, 2005.

[7] M. A. Bender, M. Farach-Colton, and B. C. Kuszmaul. Cache-oblivious string B-trees. In PODS '06: Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, pages 233–242, New York, NY, USA, 2006. ACM Press.

[8] J. L. Bentley and M. D. McIlroy. Engineering a sort function. Software: Practice and Experience, 23(11):1249–1265, 1993.

[9] J. L. Bentley and R. Sedgewick. Fast algorithms for sorting and searching strings. In SODA ’97: Proceedings of the eighth annual ACM-SIAM symposium on Discrete algorithms, pages 360–369, Philadelphia, PA, USA, 1997. Society for Industrial and Applied Mathematics.


[10] E. D. Berger, K. S. McKinley, R. D. Blumofe, and P. R. Wilson. Hoard: A scalable memory allocator for multithreaded applications. In ASPLOS-IX: Ninth International Conference on Architectural Support for Programming Languages and Operating Systems, ACM SIGPLAN Notices, 35(11), pages 117–128, New York, NY, USA, 2000. ACM Press.

[11] R. D. Blumofe, C. F. Joerg, B. C. Kuszmaul, C. E. Leiserson, K. H. Randall, and Y. Zhou. Cilk: An efficient multithreaded runtime system. Journal of Parallel and Distributed Computing, 37(1):55–69, 1996.

[12] R. D. Blumofe and C. E. Leiserson. Scheduling multithreaded computations by work stealing. Journal of the ACM, 46(5):720–748, 1999.

[13] Boost. Boost C++ libraries. http://www.boost.org.

[14] J. Bourdon. Size and path length of patricia tries: dynamical sources context. Random Structures and Algorithms, 19(3-4):289–315, 2001.

[15] G. S. Brodal and R. Fagerberg. On the limits of cache-obliviousness. In STOC’03: Proceedings of the thirty-fifth annual ACM Symposium on Theory of Computing, pages 307–315, New York, NY, USA, 2003. ACM Press.

[16] G. S. Brodal and R. Fagerberg. Cache-oblivious string dictionaries. In SODA’06: Proceedings of the 17th Annual ACM-SIAM Symposium on Discrete Algorithms, pages 581–590, New York, NY, USA, 2006. ACM Press.

[17] J. Clément, P. Flajolet, and B. Vallée. Dynamical sources in information theory: A general analysis of trie structures. Algorithmica, 29(1):307–369, 2001.

[18] The C++ Standards Committee. The C++ standards committee, 2009. http://www.open-std.org/JTC1/SC22/WG21/.

[19] T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein. Introduction to Algorithms. The MIT Press, 3rd edition, 2009.

[20] P. Crescenzi, R. Grossi, and G. F. Italiano. Search data structures for skewed strings. In Experimental and Efficient Algorithms, Second International Workshop, WEA 2003, Ascona, Switzerland, May 26-28, 2003, Proceedings, volume 2647 of Lecture Notes in Computer Science, pages 81–96, Berlin, Heidelberg, 2003. Springer.

[21] E. Demaine. Cache-oblivious algorithms and data structures. In EEF Summer School on Massive Data Sets. 2002.

[22] R. Dementiev, L. Kettner, and P. Sanders. STXXL: standard template library for XXL data sets. Software: Practice and Experience, 38(6):589–637, 2008.

[23] L. Devroye. A study of trie-like structures under the density model. The Annals of Applied Probability, 2(2):402–434, 1992.

[24] L. Devroye. Universal asymptotics for random tries and PATRICIA trees. Algorithmica, 42(1):11–29, 2005.

[25] DIKU (Performance Engineering Laboratory). CPH STL (Copenhagen STL). http://www.cphstl.dk.

[26] P. Ferragina and R. Grossi. Fast string searching in secondary storage: theoretical developments and experimental results. In SODA ’96: Proceedings of the seventh annual ACM-SIAM symposium on Discrete algorithms, pages 373–382, Philadelphia, PA, USA, 1996. Society for Industrial and Applied Mathematics.

[27] P. Ferragina and R. Grossi. The string B-tree: a new data structure for string search in external memory and its applications. Journal of the ACM, 46(2):236–280, 1999.

[28] G. Franceschini and R. Grossi. A general technique for managing strings in comparison-driven data structures. In ICALP'04: Proceedings of the 31st International Colloquium on Automata, Languages and Programming, volume 3142 of Lecture Notes in Computer Science, pages 606–617, Berlin, Heidelberg, 2004. Springer.

[29] R. S. Francis and L. J. H. Pannan. A parallel partition for enhanced parallel quicksort. Parallel Computing, 18(5):543–550, 1992.

[30] E. Fredkin. Trie memory. Communications of the ACM, 3(9):490–499, 1960.

[31] L. Frias. Implementació dels maps de la STL usant LBSTs. Master's thesis, Facultat d'Informàtica de Barcelona, July 2004. Codirected by J. Petit and S. Roura. In Catalan.

[32] L. Frias. Extending STL maps using LBSTs. In Proceedings of the Seventh Workshop on Algorithm Engineering and Experiments and the Second Workshop on Analytic Algorithmics and Combinatorics, ALENEX/ANALCO 2005, Vancouver, BC, Canada, 22 January 2005, pages 155–166. Society for Industrial and Applied Mathematics, 2005.

[33] L. Frias. On the number of string lookups in BSTs (and related algorithms) with digital access. Technical report LSI-09-14-R, Universitat Politècnica de Catalunya, Departament de Llenguatges i Sistemes Informàtics, 2009.

[34] L. Frias and J. Petit. Parallel partition revisited. In Experimental Algorithms, 7th International Workshop, WEA 2008, Provincetown, MA, USA, May 30-June 1, 2008, Proceedings, volume 5038 of Lecture Notes in Computer Science, pages 142–153, Berlin, Heidelberg, 2008. Springer.

[35] L. Frias and J. Petit. Combining digital access and parallel partition for quicksort and quickselect. In IWMSE ’09: Proceedings of the 2009 ICSE Workshop on Multicore Software Engineering, pages 33–40, Washington, DC, USA, 2009. IEEE Computer Society.

[36] L. Frias, J. Petit, and S. Roura. Lists Revisited: Cache Conscious STL Lists. In Experimental Algorithms, 5th International Workshop, WEA 2006, Cala Galdana, Menorca, Spain, May 24-27, 2006, Proceedings, volume 4007 of Lecture Notes in Computer Science, pages 121–133, Berlin, Heidelberg, May 2006. Springer.

[37] L. Frias, J. Petit, and S. Roura. Lists Revisited: Cache Conscious STL Lists. Journal of Experimental Algorithmics, 14:3.5–3.27, 2009.

[38] L. Frias and S. Roura. Multikey Quickselect. Technical report LSI-09-27-R, Universitat Politècnica de Catalunya, Departament de Llenguatges i Sistemes Informàtics, 2009.

[39] L. Frias and J. Singler. Parallelization of Bulk Operations for STL Dictionaries. In Euro-Par 2007 Workshops: Parallel Processing, HPPC 2007, UNICORE Summit 2007, and VHPC 2007, Rennes, France, August 28-31, 2007, Revised Selected Papers, volume 4854, pages 49–58, Berlin, Heidelberg, March 2008. Springer.

[40] L. Frias, J. Singler, and P. Sanders. Single-Pass List Partitioning. In The Second International Conference on Complex, Intelligent and Software Intensive Systems, pages 817–821. IEEE Computer Society Press, March 2008.

[41] L. Frias, J. Singler, and P. Sanders. Single-pass list partitioning. Scalable Computing: Practice and Experience, 9(3):179–184, 2008.

[42] M. Frigo, C. Leiserson, H. Prokop, and S. Ramachandran. Cache-oblivious algorithms. In FOCS '99: Proceedings of the 40th Annual Symposium on Foundations of Computer Science, page 285, Washington, DC, USA, 1999. IEEE Computer Society.

[43] GNU GCC. GCC C++ Standard Library, 2009. http://gcc.gnu.org/onlinedocs/libstdc++/.

[44] A. Gottlieb, R. Grishman, C. P. Kruskal, K. P. McAuliffe, L. Rudolph, and M. Snir. The NYU ultracomputer - designing a MIMD, shared-memory parallel machine. In ISCA ’98: 25 years of the international symposia on Computer architecture (selected papers), pages 239–254, New York, NY, USA, 1998. ACM Press.

[45] R. Grossi and G. F. Italiano. Efficient techniques for maintaining multidimensional keys in linked data structures. In Automata, Languages and Programming, 26th International Colloquium, ICALP'99, Prague, Czech Republic, July 11-15, 1999, Proceedings, volume 1644 of Lecture Notes in Computer Science, pages 372–381, Berlin, Heidelberg, 1999. Springer.

[46] L. Guibas and R. Sedgewick. A dichromatic framework for balanced trees. In 19th Annual Symposium on Foundations of Computer Science, 16-18 October, 1978, Ann Arbor, Michigan, USA, pages 8–21. IEEE, 1978.

[47] P. Heidelberger, A. Norton, and J. T. Robinson. Parallel quicksort using fetch-and-add. IEEE Transactions on Computers, 39(1):133–138, 1990.

[48] S. Heinz, J. Zobel, and H. E. Williams. Burst tries: a fast, efficient data structure for string keys. ACM Transactions on Information Systems, 20(2):192–223, 2002.

[49] IEEE. IEEE 1003.1c-1995: Information Technology — Portable Operating System Interface (POSIX) - System Application Program Interface (API) Amendment 2: Threads Extension (C Language). 1995.

[50] International Standard ISO/IEC 14882. Programming languages — C++. American National Standard Institute, 1st edition, 1998.

[51] A. Itai, A. G. Konheim, and M. Rodeh. A sparse table implementation of priority queues. In Automata, Languages and Programming, 8th Colloquium, Acre (Akko), Israel, July 13-17, 1981, Proceedings, volume 115 of Lecture Notes in Computer Science, pages 417–431, Berlin, Heidelberg, 1981. Springer.

[52] J. JáJá. An introduction to parallel algorithms. Addison-Wesley, Redwood City, CA, USA, 1992.

[53] N. Josuttis. The C++ Standard Library : A Tutorial and Reference. Addison-Wesley, 1999.

[54] J. Kärkkäinen and T. Rantala. Engineering radix sort for strings. In String Processing and Information Retrieval, 15th International Symposium, SPIRE 2008, Melbourne, Australia, November 10-12, 2008. Proceedings, volume 5280 of Lecture Notes in Computer Science, pages 3–14, Berlin, Heidelberg, 2008. Springer.

[55] E. Kim and K. Park. Improving multikey quicksort for sorting strings with many equal elements. Information Processing Letters, 109(9):454–459, 2009.

[56] D. E. Knuth. The Art of Computer Programming, volume 3 (3rd ed.): Sorting and Searching. Addison Wesley Longman Publishing Co., Inc., Redwood City, CA, USA, 1997.

[57] V. Kumar. Introduction to Parallel Computing. Addison-Wesley Longman Publish- ing Co., Inc., Boston, MA, USA, 2002.

[58] A. LaMarca. Caches and algorithms. PhD thesis, University of Washington, 1996.

[59] J. Liu, C. Knowles, and A. Davis. A cost optimal parallel quicksorting and its implementation on a shared memory parallel computer. Volume 3758 of Lecture Notes in Computer Science, pages 491–502, Berlin, Heidelberg, 2005. Springer.

[60] Dinkum Ware Ltd. Dinkum. http://www.dinkumware.com/.

[61] C. Luk, R. Cohn, R. Muth, H. Patil, A. Klauser, G. Lowney, S. Wallace, V. J. Reddi, and K. Hazelwood. Pin: Building customized program analysis tools with dynamic instrumentation. In PLDI ’05: Proceedings of the 2005 ACM SIGPLAN conference on Programming language design and implementation, pages 190–200, New York, NY, USA, 2005. ACM Press.

[62] J. Magee and J. Kramer. Concurrency: state models & Java programs. John Wiley & Sons, Inc., New York, NY, USA, 1999.

[63] H. Mahmoud, P. Flajolet, P. Jacquet, and M. Régnier. Analytic variations on bucket selection and sorting. Acta Informatica, 36(9-10):735–760, 2000.

[64] H. M. Mahmoud, R. Modarres, and R. T. Smythe. Analysis of quickselect: An algorithm for order statistics. ITA – Theoretical Informatics and Applications, 29(4):255–276, 1995.

[65] C. Martínez and S. Roura. Optimal sampling strategies in quicksort and quickselect. SIAM Journal on Computing, 31(3):683–705, 2001.

[66] F. Martínez. Especialització dels maps de la STL per strings. Master's thesis, Facultat d'Informàtica de Barcelona, 2009. Directed by L. Frias. In Catalan. http://hdl.handle.net/2099.1/7671.

[67] P. M. McIlroy, K. Bostic, and M. D. McIlroy. Engineering radix sort. Computing Systems, 6:5–27, 1993.

[68] K. Mehlhorn and S. Näher. LEDA: A library of efficient data types and algorithms. In Mathematical Foundations of Computer Science 1989, MFCS'89, Porabka-Kozubnik, Poland, August 28 - September 1, 1989, Proceedings, volume 379 of Lecture Notes in Computer Science, pages 88–106, Berlin, Heidelberg, 1989. Springer.

[69] K. Mehlhorn and P. Sanders. Algorithms and Data Structures: The Basic Toolbox. Springer, 2008.

[70] X. Messeguer and B. Valles. Synchronized parallel algorithms on red-black trees. In VECPAR'98: 3rd International Meeting on Vector and Parallel Processing, pages 699–704, 1997.

[71] D. R. Morrison. PATRICIA: A practical algorithm to retrieve information coded in alphanumeric. Journal of the ACM, 15(4):514–534, 1968.

[72] OpenMP. OpenMP webpage, 2009. http://www.openmp.org.

[73] OpenMP Architecture Review Board. OpenMP Application Program Interface v 2.5, 2005.

[74] OpenMP Multi-Threaded Template Library. OpenMP Multi-Threaded Template Library webpage, 2009. http://spc.unige.ch/mptl.

[75] H. Park and K. Park. Parallel algorithms for red-black trees. Theoretical Computer Science, 262:415–435, 2001.

[76] J. L. Hennessy and D. A. Patterson. Computer Organization and Design, Fourth Edition: The Hardware/Software Interface (The Morgan Kaufmann Series in Computer Architecture and Design). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 2008.

[77] R. Pereira, P. Crocket, and G. Dias. A parallel multikey quicksort algorithm for mining multiword units. In Workshop on Methodologies and Evaluation of Multiword Units in Real-world Applications (MEMURA Workshop) associated with the 4th International Conference On Language Resources and Evaluation (LREC 2004), Dias, G., Lopes, J.G.L. & Vintar, S. (eds), Lisbon, Portugal, May 25, 2004, Proceedings, pages 17–24, 2004.

[78] J. Reinders. Intel threading building blocks. O’Reilly & Associates, Inc., Sebastopol, CA, USA, 2007.

[79] S. Roura. Digital access to comparison-based tree data structures and algorithms. Journal of Algorithms, 40(1):1–23, 2001.

[80] S. Roura. Improved master theorems for divide-and-conquer recurrences. Journal of the ACM, 48(2):170–205, 2001.

[81] C. A. Schaefer. Reducing search space of auto-tuners using parallel patterns. In IWMSE '09: Proceedings of the 2009 ICSE Workshop on Multicore Software Engineering, pages 17–24, Washington, DC, USA, May 2009. IEEE Computer Society.

[82] R. Sedgewick. Algorithms in C++. Addison-Wesley, 3rd edition, 1998.

[83] S. Sen, S. Chatterjee, and N. Dumir. Towards a theory of cache-efficient algorithms. Journal of the ACM, 49(6):828–858, 2002.

[84] SGI. Thread-safety for SGI STL. http://sgi.com/tech/stl/thread_safety.html.

[85] J. Singler and B. Konsik. The GNU libstdc++ parallel mode: software engineering considerations. In IWMSE ’08: Proceedings of the 1st international workshop on Multicore software engineering, pages 15–22, New York, NY, USA, 2008. ACM Press.

[86] J. Singler, P. Sanders, and F. Putze. The Multi-Core Standard Template Library. In Euro-Par 2007, Parallel Processing, 13th International Euro-Par Conference, Rennes, France, August 28-31, 2007, Proceedings, volume 4641 of Lecture Notes in Computer Science, pages 682–694, Berlin, Heidelberg, 2007. Springer.

[87] R. Sinha and A. Wirth. Engineering burstsort: Towards fast in-place string sorting. In Experimental Algorithms, 7th International Workshop, WEA 2008, Provincetown, MA, USA, May 30-June 1, 2008, Proceedings, volume 5038 of Lecture Notes in Computer Science, pages 14–27, Berlin, Heidelberg, 2008. Springer.

[88] R. Sinha and J. Zobel. Cache-conscious sorting of large sets of strings with dynamic tries. Journal on Experimental Algorithmics, 9:1.5, 2004.

[89] R. Sinha and J. Zobel. Using random sampling to build approximate tries for efficient string sorting. Journal on Experimental Algorithmics, 10:2.10, 2005.

[90] R. Sinha, J. Zobel, and D. Ring. Cache-efficient string sorting using copying. Journal on Experimental Algorithmics, 11:1.2, 2006.

[91] SourceForge. Sourceforge.net, 2009. http://www.sourceforge.net.

[92] A. Stepanov and M. Lee. The standard template library. Technical Report HPL-95-11(R.1), HP, 1995.

[93] STLPort Consulting. STLPort. http://www.stlport.org.

[94] B. Stroustrup. The C++ Programming Language. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA, 2000.

[95] B. Stroustrup. Evolving a language in and for the real world: C++ 1991-2006. In HOPL III: Proceedings of the third ACM SIGPLAN conference on History of programming languages, pages 4–1–4–59, New York, NY, USA, 2007. ACM Press.

[96] R. E. Tarjan. Data structures and network algorithms, volume 44 of CBMS-NSF Regional Conference Series on Applied Mathematics. Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 1983.

[97] D. Traoré, J. L. Roch, N. Maillard, T. Gautier, and J. Bernard. Deque-free work-optimal parallel STL algorithms. In Euro-Par 2008 - Parallel Processing, 14th International Euro-Par Conference, Las Palmas de Gran Canaria, Spain, August 26-29, 2008, Proceedings, volume 5168 of Lecture Notes in Computer Science, pages 887–897, Berlin, Heidelberg, 2008. Springer.

[98] P. Tsigas and Y. Zhang. A simple, fast parallel implementation of quicksort and its performance evaluation on SUN enterprise 10000. In 11th Euromicro Workshop on Parallel, Distributed and Network-Based Processing (PDP 2003), 5-7 February 2003, Genova, Italy, pages 372–381. IEEE Computer Society, 2003.

[99] L. G. Valiant. A bridging model for multi-core computing. In Algorithms - ESA 2008, 16th Annual European Symposium, Karlsruhe, Germany, September 15-17, 2008. Proceedings, volume 5193 of Lecture Notes in Computer Science, pages 13–28, Berlin, Heidelberg, 2008. Springer.

[100] B. Vallée, J. Clément, J. A. Fill, and P. Flajolet. The number of symbol comparisons in quicksort and quickselect. In 36th International Colloquium on Automata, Languages and Programming (ICALP 2009), volume 5555 of Lecture Notes in Computer Science, pages 750–763, Berlin, Heidelberg, 2009. Springer.

[101] P. van Emde Boas. Preserving order in a forest in less than logarithmic time and linear space. Information Processing Letters, 6(3):80–82, 1977.

[102] P. van Emde Boas, R. Kaas, and E. Zijlstra. Design and implementation of an efficient priority queue. Mathematical Systems Theory, 10:99–127, 1977.

[103] R. Wein. Efficient implementation of red-black trees with split and catenate operations. Technical report, Tel-Aviv University, 2005.

[104] M. A. Weiss. Data Structures and Algorithm Analysis in C++. Addison-Wesley, 1998.

[105] D. E. Willard. A density control algorithm for doing insertions and deletions in a sequentially ordered file in a good worst-case time. Information and Computation, 97(2):150–204, 1992.