Lecture 08 Hashing
18.10.18 ADT – asscociative array • INSERT, SEARCH, DELETE Advanced Algorithmics (6EAP) • An associative array (also associative container, MTAT.03.238 map, mapping, dictionary, finite map, and in query- Hashing processing an index or index file) is an abstract data type composed of a collection of unique keys and a collection of values, where each key is associated Jaak Vilo with one value (or set of values). The operation of 2018 Fall finding the value associated with a key is called a lookup or indexing, and this is the most important operation supported by an associative array. Jaak Vilo 1 Why? • Speed! - O(1) • Space efficient 1 18.10.18 Keys • Integers • Strings • ID-codes • Floating point nr-s(?) • Usual assumption – keys are integers. • Step 1: Map keys to integers. 2 18.10.18 3 18.10.18 Primary clustering Implementation of open addressing How do we define h(k,i) ? Linear probing: Quadratic probing: Double hashing: 4 18.10.18 Quadratic Probing Quadratic Probing • Suppose that an element should appear in bin • If one of h + i2 falls into a cluster, this does not h: imply the next one will – if bin h is occupied, then check the following sequence of bins: h + 12, h + 22, h + 32, h + 42, h + 52, ... h + 1, h + 4, h + 9, h + 16, h + 25, ... • For example, with M = 17: • Secondary clustering – all k colliding in h(k) will cluster to same locations… 5 18.10.18 Birthday Paradox • In probability theory, the birthday problem, or birthday paradox[1] pertains to the probability that in a set of randomly chosen people some pair of them will have the same birthday.
[Show full text]