Deterministic Distributed Edge-Coloring Via Hypergraph Maximal Matching

Total Page:16

File Type:pdf, Size:1020Kb

Deterministic Distributed Edge-Coloring Via Hypergraph Maximal Matching Deterministic Distributed Edge-Coloring via Hypergraph Maximal Matching Manuela Fischer Mohsen Ghaffari Fabian Kuhn ETH Zurich ETH Zurich University of Freiburg manuela.fi[email protected] ghaff[email protected] [email protected] Abstract We present a deterministic distributed algorithm that computes a (2∆ 1)-edge-coloring, or even list-edge-coloring, in any n-node graph with maximum degree ∆, in O(log− 7 ∆ log n) rounds. This answers one of the long-standing open questions of distributed graph algorithms· from the late 1980s, which asked for a polylogarithmic-time algorithm. See, e.g., Open Problem 4 in the Distributed Graph Coloring book of Barenboim and Elkin. The previous best round complexities O(√log n) were 2 by Panconesi and Srinivasan [STOC’92] and O˜(√∆) + O(log∗ n) by Fraigniaud, Heinrich, and Kosowski [FOCS’16]. A corollary of our deterministic list-edge-coloring also improves the randomized complexity of (2∆ 1)-edge-coloring to poly(log log n) rounds. − The key technical ingredient is a deterministic distributed algorithm for hypergraph maximal matching, which we believe will be of interest beyond this result. In any hypergraph of rank r — where each hyperedge has at most r vertices — with n nodes and maximum degree ∆, this algorithm computes a maximal matching in O(r5 log6+log r ∆ log n) rounds. · This hypergraph matching algorithm and its extensions also lead to a number of other results. In particular, we obtain a polylogarithmic-time deterministic distributed maximal independent set (MIS) algorithm for graphs with bounded neighborhood independence, hence answering Open Problem 5 of Barenboim and Elkin’s book, a (log ∆/ε)O(log 1/ε) -round deterministic algorithm for (1 + ε)-approximation of maximum matching, and a quasi-polylogarithmic-time deterministic distributed algorithm for orienting λ-arboricity graphs with out-degree at most (1 + ε)λ , for any constant ε> 0, hence partially answering Open Problem 10 of Barenboim and Elkin’s⌈ book.⌉ arXiv:1704.02767v1 [cs.DS] 10 Apr 2017 1 Introduction and Related Work Distributed graph algorithms have been studied extensively over the past 30 years, since the seminal work of Linial [Lin87]. Despite this, determining whether there are efficient deterministic distributed algorithms for the most classic problems of the area remains a long-standing open question. Distributed graph algorithms are typically studied in a standard synchronous message passing model known as the LOCAL model [Lin87, Pel00]: the network is abstracted as an undirected graph G = (V, E), n = V , with maximum degree ∆, and where each node has a Θ(log n)-bit unique identifier. Initially,| nodes| only know their neighbors in G. At the end, each node should know its own part of the solution, e.g., the colors of its edges in edge-coloring. Communication happens in synchronous rounds, where in each round each node sends a message to each of its neighbors.1 The main complexity measure is the number of rounds needed for solving a given graph problem. The four classic local distributed graph problems are maximal independent set (MIS), (∆ + 1)- vertex-coloring, (2∆ 1)-edge-coloring, and maximal matching [PR01, BE13]. All of these problems − have trivial greedy sequential algorithms, as well as simple O(log n)-round randomized distributed algorithms [Lub86,ABI86], and even some faster ones [BEPS12,EPS15,Gha16,HSS16]. But the deter- ministic distributed complexity of these problems remains widely open, despite extensive interest (see, e.g., the first five open problems of [BE13]). Particularly, with regards to MIS — which is the hardest of the four problems, as the other three can be reduced to MIS locally [Lin87] — Linial [Lin92] asked “can it [MIS] always be found [deterministically] in polylogarithmic time?”. This remains the most well-known open question of the area. The best known round complexity is 2O(√log n), due to Panconesi and Srinivasan [PS92]. Panconesi and Rizzi pointed out in the year 2000 [PR01] that “while maximal matchings can be computed in polylogarithmic time, in n, in the distributed model [HKP98], it is a decade old open problem whether the same running time is achievable for the remaining 3 structures.” The status remains the same as of today, after almost two more decades. In particular, for edge-coloring which is our main target, Barenboim and Elkin stated the following problem in their recent Distributed Graph Coloring book [BE13]: Open Problem 11.4 [BE13] Devise or rule out a deterministic (2∆ 1)-edge-coloring algorithm that runs in polylogarithmic time. − 1.1 Our Contributions Improved Deterministic Edge-Coloring Algorithm: One of our main end results is a positive resolution of the above question. Theorem 1.1. There is a deterministic distributed algorithm that computes a (2∆ 1)-edge-coloring in − O(log7 ∆ log n) rounds, in any n-node graph with maximum degree ∆. Moreover, the same algorithm solves list-edge-coloring,· where each edge e E must get a color from an arbitrary given list L of ∈ e colors with L = d + 1, where d denotes the number of edges incident to e. | e| e e For list-edge-coloring, the previously best known round complexity was 2O(√log n), by a classic net- O(√log n log log n) work decomposition of Panconesi and Srinivasan [PS92], which itself improved on an 2 · - round algorithm of Awerbuch et al. [ALGP89]. For low-degree graphs, the best known is an (O˜(√∆)+ O(log∗ n))-round algorithm of Fraigniaud, Heinrich, and Kosowski [FHK16], which is more general and applies also to (∆ + 1)-list-vertex-coloring. There are some known poly log n-round determin- istic edge-coloring algorithms, but all have two major shortcomings: (A) they require more colors, and (B) they are quite restricted and do not work for list-coloring. These algorithms are as follows: 1In the LOCAL model, messages might be of arbitrary size. A variant of the model where the messages have to be of bounded size is known as the CONGEST model [Pel00]. Our edge-coloring algorithms and our MIS and vertex-coloring algorithms for graphs of bounded neighborhood independence in fact work with small O(log n)-bit size messages. Though, for the sake of the readability, we avoid explicitly discussing the details of this aspect. 1 O log ∆ (I) a ((2 + o(1))∆)-edge-coloring by Ghaffari and Su [GS17]; (II) a ∆ 2 log log ∆ -edge-coloring by · Barenboim and Elkin [BE11]—see also [GS17, Appendix B] for a short proof; and (III) an O(∆ log n)- edge-coloring by Czygrinow et al. [CHK01]. See also [BE11, Chapter 8]. We also note that a recent work of Barenboim, Elkin, and Maimon [BEM16] presents an efficient deterministic algorithm for computing a (∆ + o(∆))-edge-coloring in graphs with arboricity a ∆1 δ, ≤ − for some constant δ > 0. In Corollary 5.1, we sketch how a simple combination of the list-edge-coloring algorithm of Theorem 1.1 with H-partitionings [BE13, Chapter 5.1] significantly extends their result. Improved Randomized Edge-Coloring Algorithm: The deterministic list-edge-coloring algo- rithm of Theorem 1.1, in combination with some randomized edge-coloring algorithms of [EPS15, BEPS12], also improves the complexity of randomized algorithms for (2∆ 1)-edge-coloring, making it the first among the four classic problems whose randomized complexity falls− down to poly(log log n). Corollary 1.2. There is a randomized distributed algorithm that computes a (2∆ 1)-edge-coloring in O(log8 log n) rounds, with high probability, in any n-node graph with maximum degree− ∆. The previous (worst-case) complexity for randomized (2∆ 1)-edge-coloring was 2O(√log log n) − rounds2, due to Elkin, Pettie, and Su [EPS15]. By improving this, Corollary 1.2 widens the prov- able gap between the complexity of (2∆ 1)-edge-coloring, which is now in poly(log log n) rounds, and the complexity of maximal matching,− which needs Ω( log n/ log log n) rounds [KMW16]. Unified Formulation as Hypergraph Maximal Matchingp : Our first step towards proving Theorem 1.1 is a simple unification of all the aforementioned four classic problems: MIS, (∆ + 1)- vertex-coloring, (2∆ 1)-edge-coloring, and maximal matching. We can cast each of these problems − as a maximal matching problem on hypergraphs3 of some rank r, which depends on the problem, and increases as we move from maximal matching to maximal independent set. In other words, the hy- pergraph maximal matching problem can be used to obtain a smooth interpolation between maximal matching in graphs and maximal independent set in graphs. Recall that the rank of a hypergraph is the maximum number of vertices in any of its hyperedges. Moreover, a matching in a hypergraph is a set of hyperedges, no two of which share an endpoint. We present a reduction from (2∆ 1)-edge-coloring to maximal matching in rank-3 hypergraphs, − as we sketch next in Lemma 1.3. A similar reduction can be used for list-edge-coloring, as formalized in Lemma 2.13. We note that these reductions are inspired by the well-known reduction of Luby from (∆ + 1)-vertex-coloring to MIS [Lub86, Lin87]. Lemma 1.3. Given a deterministic distributed algorithm that computes a maximal matching in A N-vertex hypergraphs of rank 3 and maximum degree d in T (N, d) rounds, there is a deterministic distributed algorithm that computes a (2∆ 1)-edge-coloring of any n-node graph G = (V, E) B − with maximum degree ∆ in at most T (3n∆, 2∆ 1) rounds. − Proof Sketch. To edge-color G = (V, E), we generate a hypergraph H: Take 2∆ 1 copies of G. − For each edge e E, let e1 to e2∆ 1 be its copies.
Recommended publications
  • On Vertex Independence Number of Uniform Hypergraphs
    Acta Univ. Sapientiae, Informatica, 6, 1 (2014) 132{158 On vertex independence number of uniform hypergraphs Tariq A. CHISHTI Guofei ZHOU University of Kashmir Nanjing University Department of Mathematics Department of Mathematics Srinagar, India Nanjing, China email: [email protected] email: [email protected] Shariefuddin PIRZADA Antal IVANYI´ University of Kashmir E¨otv¨osLor´andUniversity Department of Mathematics Faculty of Informatics Srinagar, India Budapest, Hungary email: email: [email protected] [email protected] Abstract. Let H be an r-uniform hypergraph with r ≥ 2 and let α(H) be its vertex independence number. In the paper bounds of α(H) are given for different uniform hypergraphs: if H has no isolated vertex, then in terms of the degrees, and for triangle-free linear H in terms of the order and average degree. 1 Introduction to independence in graphs Let n be a positive integer. A graph G on vertex set V = fv1; v2; : : : ; vng is a pair (V; E), where the edge set E is a subset of V × V. n is the order of G and jEj is the size of G. Computing Classification System 1998: G.2.2 Mathematics Subject Classification 2010: 05C30, 05C50 Key words and phrases: uniform hypergraph, independence number, lower bound DOI:10.2478/ausi-2014-0022 132 On vertex independence number of uniform hypergraphs 133 Let v 2 V and N(v) be the neighborhood of v, namely, the set of vertices x so that there is an edge which contains both v and x. Let U be a subset of V, then the subgraph of G induced by U is defined as a graph on vertex set U and edge set EU = f(u; v)ju 2 u and v 2 Ug.
    [Show full text]
  • Qualitative Reasoning for Spatial Matching 1 [email protected] Qualitative Reasoning for Spatial Matching
    Qualitative Reasoning for Spatial Matching 1 [email protected] Qualitative Reasoning for Spatial Matching Diedrich Wolter Universit¨atBremen, Transregional Collaborative Research Center SFB/TR 8 Spatial Cognition Abstract. Matching two representations against one another is central to many applications, e.g., matching observation against models in ob- ject or scene recognition or matching a local perception against a global map in self-localization if a mobile robot. In both examples, spatial infor- mation structures are associated, i.e., the representation includes spatial arrangement information of individual parts, either in a relative or an absolute frame of reference. In this paper the utility of qualitative configuration information for matching is investigated. Qualitative configuration information expresses relative arrangement information. It is argued that such information is helpful for making side conditions of plausible matchings explicit, which can improve matching efficiency as well as quality of the matching. Keywords. Matching, Qualitative Spatial Information 1 Introduction Many applications involve the task of matching two representations against one another. For example scene recognition involves identifying a set of spatially distributed objects. Many objects bear rich information which can be exploited the recognition task, but it is information about the spatial arrangement of the involved objects that allows for important distinctions. Consider you are observing a scene that contains dishes, glasses, and sets of cutlery. These objects, when nicely arranged on a table, gives rise to the hypothesis that you are about to enjoy a nice meal, whereas piled up dishes, glasses, and clutter of cutlery suggests dish washing to be your destiny. This article investigates into matching tasks which are involved with match- ing spatially embedded representations, i.e.
    [Show full text]
  • Curriculum Vitae Prof. Dr. Anand Srivastav Personal Data Name Prof
    Curriculum Vitae Prof. Dr. Anand Srivastav Personal Data Name Prof. Dr. Anand Srivastav (Dipl. Math., Dipl. Phys.) Place/Date of Birth Agra (India), in 1958 Affiliation Institution Christian-Albrechts-Universität zu Kiel (CAU) Institute Department of Computer Science Address Christian-Albrechts-Platz 4, D-24118 Kiel, Germany Phone / Fax +49 431 880-7252 / -1725 E-Mail [email protected] Education 1996 Habilitation in Computer Science, Freie Universität zu Berlin, Germany 1988 Dr. rer. nat. (in Mathematics), Westfälische Wilhelms-Universität Münster 1988 Dipl. Phys. degree (Master of Physics), Westfälische Wilhelms-Universität Münster, Germany 1984 Dipl. Math. degree (Master of Mathematics), Westfälische Wilhelms- Universität Münster, Germany 1978 – 1984 Study of Physics and Mathematics at the Westfälische Wilhelms- Universität Münster, Germany 1965 – 1978 School Education in India and Germany Academic Appointments since 1997 Professor for Mathematics (Discrete Optimization), Christian-Albrechts- Universität zu Kiel 1995 – 1997 Assistent Professor at the Computer Science Department, Humboldt- Universität zu Berlin 1994 – 1995 Visiting Professor at the Computer Science Department, Freie Universität Berlin 1993 – 1994 Visiting Professor at IMA (University of Minnesota), Courant Institute for Mathematical Sciences (New York University) and Yale University, USA 1988 – 1993 Assistent Professor, Institute for Discrete Mathematics, Rheinische Friedrich- Wilhelms-Universität Bonn Important Functions since 2008 Co-Speaker of the Research
    [Show full text]
  • Matchings in Balanced Hypergraphs
    Matchings in balanced hypergraphs Von der Fakultat¨ fur¨ Mathematik, Informatik und Naturwissenschaften der RWTH Aachen University zur Erlangung des akademischen Grades eines Doktors der Naturwissenschaften genehmigte Dissertation vorgelegt von Diplom-Mathematiker Robert Berthold Scheidweiler aus Duren-Birkesdorf.¨ Berichter: Universitatsprofessor¨ Dr. Eberhard Triesch Universitatsprofessor¨ Dr. Arie M.C.A. Koster Tag der mundlichen¨ Prufung:¨ 19. April 2011 Diese Dissertation ist auf den Internetseiten der Hochschulbibliothek online verfugbar.¨ Danksagung In den letzten funf¨ Jahren habe ich am Lehrstuhl II fur¨ Mathematik der RWTH Aachen Uni- versity die vorliegende Dissertation verfasst. Einigen gebuhrt¨ fur¨ ihre Unterstutzung¨ und Hilfe wahrend¨ dieser Zeit besonderer Dank. Zuallererst mochte¨ ich mich bei dem Betreuer meiner Dissertation und meinem Chef, Eberhard Triesch, bedanken. Durch ihn habe ich das Thema dieser Arbeit erhalten, das mir sehr ans Herz gewachsen ist. Er hat mir bei meinen Forschungen immer mit Rat und Tat zur Seite gestanden und mich auch bei langer¨ andauern- den Durststrecken niemals unter Druck gesetzt. Seine positive Unterstutzung¨ und geduldige Hilfe haben mich motiviert, diese Arbeit zu vollenden. Weiterhin mochte¨ ich mich bei Arie Koster, meinem Zweitgutachter, bedanken. Mehrfach hat er im Verlauf meiner Promotion Anregungen gegeben, die dann in die Dissertation eingeflossen sind. Vor der endgultigen¨ Ab- gabe hat er durch seine Verbesserungsvorschlage,¨ fur¨ die ich sehr dankbar bin, zur jetzigen Form der Arbeit beigetragen. Danken mochte¨ ich außerdem Bert Randerath, der mir half, einige Startschwierigkeiten zu uberwinden,¨ als ich begann, die balancierten Hypergraphen zu erforschen. Hartmut Fuhr¨ hat sehr viel Zeit darauf verwendet, mir die harmonische Analysis naher¨ zu bringen. Seine Bemuhungen¨ haben meine Promotion weiter voran gebracht.
    [Show full text]
  • Learning from Networked Examples
    Learning from Networked Examples∗ Yuyi Wang ETH Zurich,¨ Switzerland Zheng-Chu Guo Zhejiang University, P.R. China Jan Ramon Inria, France & KULeuven, Belgium Editor: The correct dates will be entered by the editor Abstract Many machine learning algorithms are based on the assumption that training examples are drawn independently. However, this assumption does not hold anymore when learning from a networked sample because two or more training examples may share some common objects, and hence share the features of these shared objects. We show that the classic approach of ignoring this problem potentially can have a harmful effect on the accuracy of statistics, and then consider alternatives. One of these is to only use independent examples, discarding other information. However, this is clearly suboptimal. We analyze sample error bounds in this networked setting, providing signifi- cantly improved results. An important component of our approach is formed by efficient sample weighting schemes, which leads to novel concentration inequalities. Keywords: Networked Examples, Non-i.i.d. Sample, Concentration Inequality, Sample Error Bound, Generalization Error Bound 1. Introduction Recently, there has been an increasing interest in network-structured data, such as data in social net- works (Leinhardt, 2013), economic networks (Knoke, 2014), citation networks (Nykl et al., 2014) and chemical interaction networks (Szklarczyk et al., 2014). An important challenge is that data concerning related objects cannot be assumed to be independent. More precisely, a key assump- arXiv:1405.2600v4 [cs.AI] 3 Jun 2017 tion made by many approaches in the field of statistics and machine learning is that observations are drawn independently and identically (i.i.d.) from a fixed distribution.
    [Show full text]
  • Tur\'An Numbers for Forests of Paths in Hypergraphs
    TURAN´ NUMBERS FOR FORESTS OF PATHS IN HYPERGRAPHS NEAL BUSHAW AND NATHAN KETTLE Abstract. The Tur´an number of an r-uniform hypergraph H is the maximum number of edges in any r-graph on n vertices which does not contain H as a (r) subgraph. Let Pℓ denote the family of r-uniform loose paths on ℓ edges, F(k,l) (r) (r) denote the family of hypergraphs consisting of k disjoint paths from Pℓ , and Lℓ denote an r-uniform linear path on ℓ edges. We determine precisely exr(n; F(k,l)) (r) and exr(n; k · Lℓ ), as well as the Tur´an numbers for forests of paths of differing lengths (whether these paths are loose or linear) when n is appropriately large dependent on k,l,r, for r ≥ 3. Our results build on recent results of F¨uredi, Jiang, and Seiver who determined the extremal numbers for individual paths, and provide more hypergraphs whose Turan numbers are exactly determined. 1. Introduction and Background Extremal graph theory is that area of combinatorics which is concerned with find- ing the largest, smallest, or otherwise optimal structures with a given property. Often, the area is concerned with finding the largest (hyper)graph avoiding some subgraph. We build on earlier work of F¨uredi, Jiang, and Seiver [19], who deter- mined the extremal numbers when the forbidden hypergraph is a single linear path or a single loose path. In this paper, we determine precisely the exact Tur´an numbers when the forbidden hypergraph is a forest of loose paths, or a forest of linear paths; our main results appear in Section 2.
    [Show full text]
  • Distributed Algorithms for Covering, Packing and Maximum Weighted Matching
    Noname manuscript No. (will be inserted by the editor) Distributed Algorithms for Covering, Packing and Maximum Weighted Matching Christos Koufogiannakis · Neal E. Young Received: date / Accepted: date Abstract This paper gives poly-logarithmic-round, dis- 1 Background and results tributed δ-approximation algorithms for covering prob- lems with submodular cost and monotone covering con- Many distributed systems are composed of components straints (Submodular-cost Covering). The approx- that can only communicate locally, yet need to achieve a imation ratio δ is the maximum number of variables in global (system-wide) goal involving many components. any constraint. Special cases include Covering Mixed The general possibilities and limitations of such sys- Integer Linear Programs (CMIP), and Weighted tems are widely studied [42,49,54,38,39,12]. It is of Vertex Cover (with δ = 2). specific interest to see which fundamental combinato- Via duality, the paper also gives poly-logarithmic- rial optimization problems admit efficient distributed round, distributed δ-approximation algorithms for Frac- algorithms that achieve approximation guarantees that tional Packing linear programs (where δ is the max- are as good as those of the best centralized algorithms. imum number of constraints in which any variable oc- Research in this spirit includes works on Set Cover curs), and for Max Weighted c-Matching in hy- (Dominating Set) [26,40,41], capacitated domi- pergraphs (where δ is the maximum size of any of the nating set [37], Capacitated Vertex Cover [16, hyperedges; for graphs δ = 2). 17], and many other problems. This paper presents dis- The paper also gives parallel (RNC) 2-approximation tributed approximation algorithms for some fundamen- algorithms for CMIP with two variables per constraint tal covering and packing problems.
    [Show full text]
  • On Derandomizing Local Distributed Algorithms
    On Derandomizing Local Distributed Algorithms Mohsen Ghaffari David G. Harris Fabian Kuhn∗ ETH Zurich University of Maryland, College Park University of Freiburg [email protected] [email protected] [email protected] Abstract The gap between the known randomized and deterministic local distributed algorithms underlies ar- guably the most fundamental and central open question in distributed graph algorithms. In this paper, we develop a generic and clean recipe for derandomizing randomized LOCAL algorithms and transform- ing them into efficient deterministic LOCAL algorithm. We also exhibit how this simple recipe leads to significant improvements on a number of problems, in cases resolving known open problems. Two sample end-results are as follows: • An improved distributed hypergraph maximal matching algorithm, which improves on that of Fis- cher, Ghaffari, and Kuhn [FOCS’17], and leads to improved algorithms for edge-coloring, maxi- mum matching approximation, and low out-degree edge orientation. The first gives an improved algorithm for Open Problem 11.4 of the book of Barenboim and Elkin, and the last gives the first positive resolution of their Open Problem 11.10. • An improved distributed algorithm for the Lovasz´ Local Lemma, which gets closer to a conjec- ture of Chang and Pettie [FOCS’17], and moreover leads to improved distributed algorithms for problems such as defective coloring and k-SAT. ∗Supported by ERC Grant No. 336495 (ACDC). 1 Introduction and Related Work The gap between the best known deterministic and randomized local distributed algorithms constitutes a central open question in the area of distributed graph algorithms. For many of the classic problems (e.g., maximal independent set (MIS) and (∆ + 1)-vertex coloring), O(log n)-time randomized algorithms have been known since the pioneering work of Luby [Lub86] and Alon, Babai, and Itai [ABI86].
    [Show full text]
  • Matching Theory-A Sampler: from D&Es Kgnig to The
    Discrete Mathematics 100 (1992) 177-219 177 North-Holland Matching theory-a sampler: from D&es KGnig to the present Michael D. Plummer* Department of Mathematics, Vanderbilt University, Nashville, TN 37235, USA Received 20 March 1991 Revised 8 October 1991 Abstract Plummer, M.D., Matching theory-a sampler: from D&es Kijnig to the present, Discrete Mathematics 100 (1992) 177-219. 1. Introduction and terminology Let G be a finite undirected graph without loops or multiple lines. A set of lines M c E(G) is a matching if no two share a common endpoint. A matching is said to be perfect if it covers all points of G. One can take the point of view that a perfect matching is a special case of a more general concept in graph theory-an n-factor. An n-factor of graph G is a spanning subgraph which is regular of degree n; that is, each point has degree it. So a perfect matching is a l-factor. But we can generalize even more. Let f denote a function which assigns to each point u in V(G) a nonnegative integer f(v). Then an f-factor is any subgraph F of G in which deg F2r =f(v). The existence of these increasingly more general concepts for a fixed graph G are all instances of what have come to be called degree-constrained subgraph problems. The reader of these Proceedings may see in Mulder’s article [126] that these ideas were of great interest to Julius Petersen and in fact Petersen enjoyed some considerable success in his studies of such problems.
    [Show full text]
  • New Applications of the Nearest-Neighbor Chain Algorithm
    UC Irvine UC Irvine Electronic Theses and Dissertations Title New Applications of the Nearest-Neighbor Chain Algorithm Permalink https://escholarship.org/uc/item/6nt3g7d4 Author Mamano Grande, Nil Publication Date 2019 Peer reviewed|Thesis/dissertation eScholarship.org Powered by the California Digital Library University of California UNIVERSITY OF CALIFORNIA, IRVINE New Applications of the Nearest-Neighbor Chain Algorithm DISSERTATION submitted in partial satisfaction of the requirements for the degree of DOCTOR OF PHILOSOPHY in Computer Science by Nil Mamano Grande Dissertation Committee: Professor Michael T. Goodrich, Chair Professor David Eppstein Professor Sandy Irani 2019 c 2019 Nil Mamano Grande TABLE OF CONTENTS Page LIST OF FIGURES v LIST OF TABLES vii LIST OF ALGORITHMS viii ACKNOWLEDGMENTS ix CURRICULUM VITAE x ABSTRACT OF THE DISSERTATION xii 1 Introduction 1 1.1 Greedy algorithms . 3 1.2 Local greedy . 4 1.3 Global–local equivalence . 6 1.4 Background . 8 1.4.1 Hierarchical clustering . 8 1.4.2 Nearest-neighbor chain algorithm . 10 2 Global–Local Equivalence 13 2.1 Combinatorial optimization problems . 16 2.1.1 Maximum-weight independent set . 22 2.2 Multi-fragment TSP . 25 2.3 Shortest common superstring . 28 2.4 Conclusions . 36 3 Nearest-Neighbor Chain Algorithms 37 3.1 Preliminaries: nearest-neighbor data structures . 37 3.2 Soft nearest-neighbor data structure . 40 3.2.1 Implementation . 41 3.2.2 Choice of parameters . 43 3.2.3 The m-way soft nearest-neighbor data structure . 47 3.2.4 The closest pair problem . 47 3.3 Geometric TSP . 48 ii 3.3.1 Related work .
    [Show full text]
  • Workshop on Probabilistic and Extremal Combinatorics (19W5009)
    Workshop on Probabilistic and Extremal Combinatorics (19w5009) Penny Haxell (University of Waterloo) Michael Krivelevich (Tel Aviv University) Benny Sudakov (ETH Zurich) September 1-6, 2019 1 Overview of the Field Combinatorics, also known as Discrete Mathematics, is a fundamental mathematical discipline which studies the nature of discrete mathematical objects, such as permutations, graphs and set-families. Encompassing some of the most basic and natural mathematical questions, Combinatorics is probably as old as the human ability to count. During the twentieth century and the first two decades of the present century, the field has experienced remarkable growth, transitioning from a collection of mostly isolated results relying on ad hoc arguments, to a major branch of mathematics with its own theory, problems and techniques. Modern Combi- natorics has well-established and fruitful connections to numerous scientific fields, including Number Theory, Discrete Geometry, Statistical Physics, Evolutionary Biology and, most notably, Theoretical Computer Sci- ence, with which it is intimatively intertwined. Combinatorial methods and ideas have been exported to many of the above disciplines, in some cases leading to outstanding results. The workshop focused on two major branches of Combinatorics: Extremal Combinatorics and Proba- bilistic Combinatorics. These two highly-related fields have flourished in recent decades, producing several breakthroughs and many fascinating results, and leading to the creation of a vibrant community comprising both senior and young researchers. We now briefly overview each of these two disciplines. Extremal Combinatorics encompasses several important areas of research, such as Extremal Graph (and Hypergraph) Theory, Extremal Set Theory, and Ramsey Theory. Extremal Graph Theory typically asks to maximize/minimize some graph parameter over all graphs satisfying a given property.
    [Show full text]
  • Hypergraph Matchings and Designs
    P. I. C. M. – 2018 Rio de Janeiro, Vol. 4 (3131–3154) HYPERGRAPH MATCHINGS AND DESIGNS P K Abstract We survey some aspects of the perfect matching problem in hypergraphs, with particular emphasis on structural characterisation of the existence problem in dense hypergraphs and the existence of designs. 1 Introduction Matching theory is a rich and rapidly developing subject that touches on many areas of Mathematics and its applications. Its roots are in the work of Steinitz [1894], Egerváry [1931], Hall [1935] and König [1931] on conditions for matchings in bipartite graphs. After the rise of academic interest in efficient algorithms during the mid 20th century, three cornerstones of matching theory were Kuhn’s ‘Hungarian’ algorithm (Kuhn [1955]) for the Assignment Problem, Edmonds’ algorithm (Edmonds [1965]) for finding a maximum matching in general (not necessarily bipartite) graphs, and the Gale and Shapley [1962] algorithm for Stable Marriages. For an introduction to matching theory in graphs we refer to Lovász and Plummer [2009], and for algorithmic aspects to parts II and III of Schrijver [2003]. There is also a very large literature on matchings in hypergraphs. This article will be mostly concerned with one general direction in this subject, namely to determine condi- tions under which the necessary ‘geometric’ conditions of ‘space’ and ‘divisibility’ are sufficient for the existence of a perfect matching. We will explain these terms and discuss some aspects of this question in the next two sections, but first, for the remainder of this introduction, we will provide some brief pointers to the literature in some other directions.
    [Show full text]