Practical Approximation Algorithms for Clustering and Covering

Total Page:16

File Type:pdf, Size:1020Kb

Practical Approximation Algorithms for Clustering and Covering Practical Approximation Algorithms for Clustering and Covering a dissertation presented by Jessica May McClintock to The School of Engineering in total fulfillment of the requirements for the degree of Doctor of Philosophy in the subject of Computing and Information Systems Student Number: 268135 The University of Melbourne Melbourne, Australia June 2017 Thesis advisor: Associate Professor Tony Wirth Jessica May McClintock Practical Approximation Algorithms for Clustering and Covering Abstract The aim of this thesis is to provide solutions to computationally difficult network optimisation problems on modern architectures. There are three types of problem that we will be studying – clustering, covering and ordering – all which are computa- tionally hard to solve, and in practice often involve very large data sets. These models are used in range of real-world applications, and therefore we will investigate both the practical and theoretical aspects of solving these problems in big data contexts. The main approach we consider for solving such instances is to obtain polynomial- time approximation-algorithms, which efficiently solve these problems to within some constant factor of optimality. We also introduce several heuristics for a type of schedul- ing problem with graph-based constraints and demonstrate their performance in a practical setting, before providing an approximation algorithm and hardness res- ults for a formalised version of the problem. For instances on big data, where the computational bottleneck is the available RAM, we consider models for algorithm design that would allow such instances to be solved. For this purpose, we also design clustering algorithms using the MapReduce paradigm for parallelisation, giving ex- perimental evaluations of their performance in comparison to existing approaches. iv Contents 0 Introduction 1 0.1 Clustering and Parallel Algorithms..................... 2 0.2 Graph Covering Problems.......................... 4 0.3 Test-case Prioritisation with Precedences.................. 6 0.4 Min-Sum Set Cover.............................. 8 1 Related Work 11 1.1 Optimisation and Approximation...................... 11 1.2 Graph Theory ................................. 13 1.3 Parallelism and Clustering Problems.................... 16 1.4 Graph Covering Problems.......................... 32 1.5 The Min-Sum Set Cover Problem...................... 46 1.6 Scheduling and Prioritisation with Precedences.............. 53 1.7 Summary.................................... 61 2 Efficient Clustering with MapReduce 63 2.1 Clustering with MapReduce......................... 64 2.2 Parallel k-centre................................ 65 2.3 Analysis of EIM sampling .......................... 72 2.4 Runtime .................................... 80 2.5 Experiments.................................. 82 2.6 Results ..................................... 86 2.7 Conclusions .................................. 90 3 Parallel Coverings of Graphs 93 3.1 MapReduce for Covering Problems..................... 93 3.2 Tree Covering Problems ........................... 94 v 3.3 The Minimum Path Cover Problem..................... 96 3.4 The k-star Covering Problem......................... 98 3.5 Hardness of k-star Covering......................... 99 4 Scheduling with Precedences 105 4.1 Test-case Prioritisation with Precedences..................105 4.2 Our Contributions...............................112 4.3 Algorithms and Analysis...........................114 4.4 Special Cases..................................121 4.5 Experiments..................................122 4.6 Results .....................................129 4.7 Conclusions ..................................141 5 Min-Sum Set-Cover with Precedences 147 5.1 Our Contributions...............................149 5.2 Max-Density Precedence-Closed Subgraphs . 151 5.3 An Algorithm for precMSSC.........................161 5.4 Hardness Results ...............................165 5.5 Conclusions ..................................169 6 Summary 171 References 174 vi List of Figures 1.1 Gonzalez k-centre solution for k = 3 .................... 28 1.2 An example rooted k-tree covering ..................... 35 1.3 An example rooted k-path covering..................... 36 1.4 An example clustering solution....................... 37 1.5 An example k-star covering ......................... 38 1.6 A bad assignment for k-star ......................... 40 1.7 An improved assignment for k-star..................... 40 2.1 Flowchart for MRG .............................. 67 2.2 A bad assignment and seeding for MRG . 69 2.3 A bad sample for MRG ............................ 70 2.4 An example tight solution for MRG ..................... 71 2.5 A point being satisfied by the sample.................... 76 2.6 Satisfied and unsatisfied points....................... 76 2.7 Comparison of performance over k on real datasets............ 85 2.8 Runtimes over k ................................ 86 2.9 Average solution value over k on synthetic data.............. 86 2.10 Runtime for k = 25 over a range of n .................... 89 2.11 Runtime over k on synthetic data ...................... 90 3.1 Star covering on a line metric ........................ 99 4.1 Fault detection histogram for default ordering . 107 4.2 Fault detection histogram for fault-coverage based ordering . 108 4.3 Fault detection histogram for code-coverage based ordering . 110 4.4 An example precedence graph........................112 4.5 Fault detection histogram for precedence-constrained ordering . 112 4.6 Combined APFD and AF results for real and synthetic dependencies . 128 vii 4.7 APFD scores for real data sets........................129 4.8 AF scores for real data sets..........................130 4.9 APFD and AF for data sets with synthetic dependencies . 131 4.10 APFD for Siemens with synthetic dependencies . 135 4.11 AF for Siemens with synthetic dependencies . 136 4.12 Lookahead comparison for gsm2 . 138 4.13 Lookahead comparison for tot info . 140 4.14 Lookahead comparison for replace . 144 4.15 Lookahead comparison for print tokens2 . 145 5.1 A bad precedence graph when d < 1 . 153 5.2 An example in-tree ..............................155 5.3 An example out-tree .............................157 5.4 Histograms for greedy and OPT . 163 viii List of Tables 1.1 Current best-known solutions........................ 61 1.2 Our contributions............................... 62 2.1 Comparison of algorithms for k-centre................... 83 2.2 Average solution value over k on Gau (n = 1,000,000, k0 = 25) . 87 2.3 Average solution value over k on Unif (n = 100,000)........... 87 2.4 Average solution value over k on UnB (n = 200,000, k0 = 25) . 87 2.5 Average solution over f ............................ 88 2.6 Average runtime over f ............................ 88 4.1 An example defect-coverage matrix.....................106 4.2 An example code-coverage matrix......................109 4.3 Metrics for real systems............................123 4.4 Metrics for systems with synthetic precedences . 125 ix x List of Algorithms 1 MapReduce-MST(V, E) ............................. 21 2 GON(V, k) .................................... 27 3 k-means(V, k) .................................. 29 4 Greedy(S, U, w) ................................ 47 5 MRG(V, k, m) .................................. 67 6 EIM-MapReduce-Sample(V, E, k, #) ...................... 72 7 Select(H, S) .................................. 74 8 k-Tree-Cover(G, k, B) ............................. 95 9 Min-Path-Cover(G, B) ............................ 97 10 Minimum Star Cover............................. 101 11 Topological-sort based greedy........................ 115 12 Coverage-first greedy............................. 117 13 greedy-subgraph(P = (S, E), U) ...................... 151 14 greedy-subgraph+(P = (S, E), U) ..................... 154 15 subgraph-outtree(T = (S, E), v, Rv) .................... 158 16 mssc-greedy(S, P, U) ............................. 162 xi xii Acknowledgments Many thanks to Tony Wirth for these years of advice and assistance as my supervisor – without his support and encouragement I certainly would not be where I am today. His attention to detail and discerning advice have provided many lessons that will be long remembered and appreciated. Thanks to Tim Miller for introducing me to the fascinating problem of scheduling with precedences, and providing datasets for the experiments. Thanks to Julian Mestre for providing direction and insight on the min-sum set cover problem. And thanks to Andrew Turpin for providing diversions along the way. Many thanks to my family and friends for their support and tolerance along this journey, particularly to Robert Marshall for proof reading and encouragement, and to Helen and Diana for providing reassurance and distraction. And finally a special thank you to my mother, Karen McClintock, who has been endlessly supportive even in times of uncertainty. xiii Declaration This is to certify that: 1. The thesis comprises only their original work towards the PhD except where indicated in the preface; 2. Due acknowledgement has been made in the text to all other material used; 3. The thesis is fewer than the maximum word limit in length, exclusive of tables, maps, bibliographies and appendices. Signed, Jessica McClintock Date xv xvi Preface The original work in this thesis is related to the following papers, one of which has been peer reviewed and appeared in conference proceedings. • McClintock, J., and Wirth, A., Efficient Parallel Algorithms
Recommended publications
  • Independence Number, Vertex (Edge) ୋ Cover Number and the Least Eigenvalue of a Graph
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elsevier - Publisher Connector Linear Algebra and its Applications 433 (2010) 790–795 Contents lists available at ScienceDirect Linear Algebra and its Applications journal homepage: www.elsevier.com/locate/laa The vertex (edge) independence number, vertex (edge) ୋ cover number and the least eigenvalue of a graph ∗ Ying-Ying Tan a, Yi-Zheng Fan b, a Department of Mathematics and Physics, Anhui University of Architecture, Hefei 230601, PR China b School of Mathematical Sciences, Anhui University, Hefei 230039, PR China ARTICLE INFO ABSTRACT Article history: In this paper we characterize the unique graph whose least eigen- Received 4 October 2009 value attains the minimum among all graphs of a fixed order and Accepted 4 April 2010 a given vertex (edge) independence number or vertex (edge) cover Available online 8 May 2010 number, and get some bounds for the vertex (edge) independence Submitted by X. Zhan number, vertex (edge) cover number of a graph in terms of the least eigenvalue of the graph. © 2010 Elsevier Inc. All rights reserved. AMS classification: 05C50 15A18 Keywords: Graph Adjacency matrix Vertex (edge) independence number Vertex (edge) cover number Least eigenvalue 1. Introduction Let G = (V,E) be a simple graph of order n with vertex set V = V(G) ={v1,v2, ...,vn} and edge set E = E(G). The adjacency matrix of G is defined to be a (0, 1)-matrix A(G) =[aij], where aij = 1ifvi is adjacent to vj and aij = 0 otherwise. The eigenvalues of the graph G are referred to the eigenvalues of A(G), which are arranged as λ1(G) λ2(G) ··· λn(G).
    [Show full text]
  • 3.1 Matchings and Factors: Matchings and Covers
    1 3.1 Matchings and Factors: Matchings and Covers This copyrighted material is taken from Introduction to Graph Theory, 2nd Ed., by Doug West; and is not for further distribution beyond this course. These slides will be stored in a limited-access location on an IIT server and are not for distribution or use beyond Math 454/553. 2 Matchings 3.1.1 Definition A matching in a graph G is a set of non-loop edges with no shared endpoints. The vertices incident to the edges of a matching M are saturated by M (M-saturated); the others are unsaturated (M-unsaturated). A perfect matching in a graph is a matching that saturates every vertex. perfect matching M-unsaturated M-saturated M Contains copyrighted material from Introduction to Graph Theory by Doug West, 2nd Ed. Not for distribution beyond IIT’s Math 454/553. 3 Perfect Matchings in Complete Bipartite Graphs a 1 The perfect matchings in a complete b 2 X,Y-bigraph with |X|=|Y| exactly c 3 correspond to the bijections d 4 f: X -> Y e 5 Therefore Kn,n has n! perfect f 6 matchings. g 7 Kn,n The complete graph Kn has a perfect matching iff… Contains copyrighted material from Introduction to Graph Theory by Doug West, 2nd Ed. Not for distribution beyond IIT’s Math 454/553. 4 Perfect Matchings in Complete Graphs The complete graph Kn has a perfect matching iff n is even. So instead of Kn consider K2n. We count the perfect matchings in K2n by: (1) Selecting a vertex v (e.g., with the highest label) one choice u v (2) Selecting a vertex u to match to v K2n-2 2n-1 choices (3) Selecting a perfect matching on the rest of the vertices.
    [Show full text]
  • Catalogue of Graph Polynomials
    Catalogue of graph polynomials J.A. Makowsky April 6, 2011 Contents 1 Graph polynomials 3 1.1 Comparinggraphpolynomials. ........ 3 1.1.1 Distinctivepower.............................. .... 3 1.1.2 Substitutioninstances . ...... 4 1.1.3 Uniformalgebraicreductions . ....... 4 1.1.4 Substitutioninstances . ...... 4 1.1.5 Substitutioninstances . ...... 4 1.1.6 Substitutioninstances . ...... 4 1.2 Definabilityofgraphpolynomials . ......... 4 1.2.1 Staticdefinitions ............................... ... 4 1.2.2 Dynamicdefinitions .............................. 4 1.2.3 SOL-definablepolynomials ............................ 4 1.2.4 Generalizedchromaticpolynomials . ......... 4 2 A catalogue of graph polynomials 5 2.1 Polynomialsfromthezoo . ...... 5 2.1.1 Chromaticpolynomial . .... 5 2.1.2 Chromatic symmetric function . ...... 6 2.1.3 Adjointpolynomials . .... 6 2.1.4 TheTuttepolynomial . ... 7 2.1.5 Strong Tutte symmetric function . ....... 7 2.1.6 Tutte-Grothendieck invariants . ........ 8 2.1.7 Aweightedgraphpolynomial . ..... 8 2.1.8 Chainpolynomial ............................... 9 2.1.9 Characteristicpolynomial . ....... 10 2.1.10 Matchingpolynomial. ..... 11 2.1.11 Theindependentsetpolynomial . ....... 12 2.1.12 Thecliquepolynomial . ..... 14 2.1.13 Thevertex-coverpolynomial . ...... 14 2.1.14 Theedge-coverpolynomial . ..... 15 2.1.15 TheMartinpolynomial . 16 2.1.16 Interlacepolynomial . ...... 17 2.1.17 Thecoverpolynomial . 19 2.1.18 Gopolynomial ................................. 21 2.1.19 Stabilitypolynomial . ...... 23 1 2.1.20 Strong U-polynomial...............................
    [Show full text]
  • IEOR 269, Spring 2010 Integer Programming and Combinatorial Optimization
    IEOR 269, Spring 2010 Integer Programming and Combinatorial Optimization Professor Dorit S. Hochbaum Contents 1 Introduction 1 2 Formulation of some ILP 2 2.1 0-1 knapsack problem . 2 2.2 Assignment problem . 2 3 Non-linear Objective functions 4 3.1 Production problem with set-up costs . 4 3.2 Piecewise linear cost function . 5 3.3 Piecewise linear convex cost function . 6 3.4 Disjunctive constraints . 7 4 Some famous combinatorial problems 7 4.1 Max clique problem . 7 4.2 SAT (satisfiability) . 7 4.3 Vertex cover problem . 7 5 General optimization 8 6 Neighborhood 8 6.1 Exact neighborhood . 8 7 Complexity of algorithms 9 7.1 Finding the maximum element . 9 7.2 0-1 knapsack . 9 7.3 Linear systems . 10 7.4 Linear Programming . 11 8 Some interesting IP formulations 12 8.1 The fixed cost plant location problem . 12 8.2 Minimum/maximum spanning tree (MST) . 12 9 The Minimum Spanning Tree (MST) Problem 13 i IEOR269 notes, Prof. Hochbaum, 2010 ii 10 General Matching Problem 14 10.1 Maximum Matching Problem in Bipartite Graphs . 14 10.2 Maximum Matching Problem in Non-Bipartite Graphs . 15 10.3 Constraint Matrix Analysis for Matching Problems . 16 11 Traveling Salesperson Problem (TSP) 17 11.1 IP Formulation for TSP . 17 12 Discussion of LP-Formulation for MST 18 13 Branch-and-Bound 20 13.1 The Branch-and-Bound technique . 20 13.2 Other Branch-and-Bound techniques . 22 14 Basic graph definitions 23 15 Complexity analysis 24 15.1 Measuring quality of an algorithm .
    [Show full text]
  • New Approximation Algorithms for Minimum Weighted Edge Cover
    New Approximation Algorithms for Minimum Weighted Edge Cover S M Ferdous∗ Alex Potheny Arif Khanz Abstract The K-Nearest Neighbor graph is used to spar- We describe two new 3=2-approximation algorithms and a sify data sets, which is an important step in graph-based new 2-approximation algorithm for the minimum weight semi-supervised machine learning. Here one has a few edge cover problem in graphs. We show that one of the 3=2-approximation algorithms, the Dual Cover algorithm, labeled items, many unlabeled items, and a measure of computes the lowest weight edge cover relative to previously similarity between pairs of items; we are required to known algorithms as well as the new algorithms reported label the remaining items. A popular approach for clas- here. The Dual Cover algorithm can also be implemented to be faster than the other 3=2-approximation algorithms on sification is to generate a similarity graph between the serial computers. Many of these algorithms can be extended items to represent both the labeled and unlabeled data, to solve the b-Edge Cover problem as well. We show the and then to use a label propagation algorithm to classify relation of these algorithms to the K-Nearest Neighbor graph construction in semi-supervised learning and other the unlabeled items [23]. In this approach one builds applications. a complete graph out of the dataset and then sparsi- fies this graph by computing a K-Nearest Neighbor 1 Introduction graph [22]. This sparsification leads to efficient al- An Edge Cover in a graph is a subgraph such that gorithms, but also helps remove noise which can af- every vertex has at least one edge incident on it in fect label propagation [11].
    [Show full text]
  • Edge Guarding Plane Graphs
    Edge Guarding Plane Graphs Master Thesis of Paul Jungeblut At the Department of Informatics Institute of Theoretical Informatics Reviewers: Prof. Dr. Dorothea Wagner Prof. Dr. Peter Sanders Advisor: Dr. Torsten Ueckerdt Time Period: 26.04.2019 – 25.10.2019 KIT – The Research University in the Helmholtz Association www.kit.edu Statement of Authorship I hereby declare that this document has been composed by myself and describes my own work, unless otherwise acknowledged in the text. I also declare that I have read the Satzung zur Sicherung guter wissenschaftlicher Praxis am Karlsruher Institut für Technologie (KIT). Paul Jungeblut, Karlsruhe, 25.10.2019 iii Abstract Let G = (V; E) be a plane graph. We say that a face f of G is guarded by an edge vw 2 E if at least one vertex from fv; wg is on the boundary of f. For a planar graph class G the function ΓG : N ! N maps n to the minimal number of edges needed to guard all faces of any n-vertex graph in G. This thesis contributes new bounds on ΓG for several graph classes, in particular Γ Γ Γ on 4;stacked for stacked triangulations, on for quadrangulations and on sp for series parallel graphs. Specifically we show that • b(2n - 4)=7c 6 Γ4;stacked(n) 6 b2n=7c, b(n - 2)=4c Γ (n) bn=3c • 6 6 and • b(n - 2)=3c 6 Γsp(n) 6 bn=3c. Note that the bounds for stacked triangulations and series parallel graphs are tight (up to a small constant). For quadrangulations we identify the non-trivial subclass 2 Γ (n) bn=4c of -degenerate quadrangulations for which we further prove ;2-deg 6 matching the lower bound.
    [Show full text]
  • Vertex and Edge Covers with Clustering Properties: Complexity and Algorithms∗
    Vertex and Edge Covers with Clustering Properties: Complexity and Algorithms∗ Henning Fernau1;y and David F. Manlove2;z 1 FB IV|Abteilung Informatik, Universit¨at Trier, 54286 Trier, Germany 2 Department of Computing Science, University of Glasgow, Glasgow G12 8QQ, UK Abstract We consider the concepts of a t-total vertex cover and a t-total edge cover (t 1), which generalise the notions of a vertex cover and an edge cover, respectively. ≥A t- total vertex (respectively edge) cover of a connected graph G is a vertex (edge) cover S of G such that each connected component of the subgraph of G induced by S has at least t vertices (edges). These definitions are motivated by combining the concepts of clustering and covering in graphs. Moreover they yield a spectrum of parameters that essentially range from a vertex cover to a connected vertex cover (in the vertex case) and from an edge cover to a spanning tree (in the edge case). For various values of t, we present -completeness and approximability results (both upper and lower bounds) and N Palgorithms for problems concerned with finding the minimum size of a t-total vertexFPT cover, t-total edge cover and connected vertex cover, in particular improving on a previous algorithm for the latter problem. FPT 1 Introduction In graph theory, the notion of covering vertices or edges of graphs by other vertices or edges has been extensively studied (see [35] for a survey). For instance, covering vertices by other vertices leads to parameters concerned with vertex domination [30, 31]. When edges are to be covered by vertices we obtain parameters connected with the classical vertex covering problem [29, p.94].
    [Show full text]
  • On the Approximability of a Geometric Set Cover Problem
    Electronic Colloquium on Computational Complexity, Report No. 19 (2011) On the Approximability of a Geometric Set Cover Problem Valentin E. Brimkov1, Andrew Leach2,JimmyWu2, and Michael Mastroianni2 1 Mathematics Department, SUNY Buffalo State College, Buffalo, NY 14222, USA 2 Mathematics Department, University at Buffalo, Buffalo, NY 1426-2900, USA Abstract. Given a finite set of straight line segments S in R2 and some k ∈ N, is there a subset V of points on segments in S with |V |≤k such that each segment of S contains at least one point in V ? This is a special case of the set covering problem where the family of subsets given can be taken as a set of intersections of the straight line segments in S. Requiring that the given subsets can be interpreted geometrically this way is a major restriction on the input, yet we have shown that the problem is still strongly NP-complete. In light of this result, we studied the performance of two polynomial-time approximation algorithms which return segment coverings. We obtain certain theoretical results, and in particular we show that the performance ratio for each of these algorithms is unbounded, in general. Keyword: guarding set of segments, set cover, vertex cover, approximation algorithm, worst case perfor- mance 1 Introduction Given a finite set of straight line segments, we wish to find a minimum number of points so that every segment contains at least one chosen point. Consider any physical structure that can be modeled by a finite set of straight line segments. Some examples could be a network of streets in a city, tunnels in a mine, corridors in a building or pipes in a factory.
    [Show full text]
  • Approximability of Packing and Covering Problems
    Approximability of Packing and Covering Problems ATHESIS SUBMITTED IN PARTIAL FULFILMENT OF THE REQUIREMENTS FOR THE DEGREE OF Master of Technology IN Faculty of Engineering BY Arka Ray Computer Science and Automation Indian Institute of Science Bangalore – 560 012 (INDIA) June, 2021 Declaration of Originality I, Arka Ray, with SR No. 04-04-00-10-42-19-1-16844 hereby declare that the material presented in the thesis titled Approximability of Packing and Covering Problems represents original work carried out by me in the Department of Computer Science and Automation at Indian Institute of Science during the years 2019-2021. With my signature, I certify that: • I have not manipulated any of the data or results. • Ihavenotcommittedanyplagiarismofintellectualproperty.Ihaveclearlyindicatedand referenced the contributions of others. • Ihaveexplicitlyacknowledgedallcollaborativeresearchanddiscussions. • Ihaveunderstoodthatanyfalseclaimwillresultinseveredisciplinaryaction. • I have understood that the work may be screened for any form of academic misconduct. Date: Student Signature In my capacity as supervisor of the above-mentioned work, I certify that the above statements are true to the best of my knowledge, and I have carried out due diligence to ensure the originality of the thesis. etmindamthan 26 06 Advisor Name: Arindam Khan Advisor Signature2021 a © Arka Ray June, 2021 All rights reserved DEDICATED TO Ma and Baba for believing in me Acknowledgements My sincerest gratitude to Arindam Khan, who, despite the gloom and doom situation of the present times, kept me motivated through his constant encouragement while tolerating my idiosyncrasies. I would also like to thank him for introducing me to approximation algorithms, along with Anand Louis, in general, and approximation of packing and covering problems in particular.
    [Show full text]
  • 1 Matchings in Graphs
    1 Matchings in Graphs JJ1 2 J 3 J4J 5 JJJ67 8 C C C C C1 C2 3 4 5 C6 7 C8 JJ1 2 J 3 J4J 5 JJJ67 8 C C C C C1 C2 3 4 5 C6 7 C8 1 Definition 1 Two edges are called independent if they are not adjacent in the graph. A set of mutually independent edges is called a matching. A matching is called maximal if no other matching contains it. maximum if its cardinality is maximal among all matchings perfect if every vertex of the graph is incident to an edge of the matching. maximal vs maximum How to construct a maximal matching? 2 Given a matching M in G, a path is called M-alternating if its edges are alternatively in M and not in M. a vertex x is called weak if no edge of M is incident to x. Using alternating path connecting weak vertices 3 The symmetric difference of two sets X and Y is the set of all ele- ments that belong to one but not the other of the sets. X ⊗ Y = (X ∪ Y ) − (X ∩ Y ) Theorem 1 A matching M is maximum iff there exists no al- ternating path between any two distinct weak vertices of G. Proof. For a proof, we will try to answer the following questions: • If a graph M is a matching, what is the maximum degree of a vertex in such a graph? • If the edge set E of a graph F is the symmetric difference of two matchings M1 and M2, then what is the maximal vertex degree of F ? • Consider a component C of F above.
    [Show full text]
  • On the Triangle Clique Cover and $ K T $ Clique Cover Problems
    On the Triangle Clique Cover and Kt Clique Cover Problems Hoang Dau Olgica Milenkovic Gregory J. Puleo Abstract An edge clique cover of a graph is a set of cliques that covers all edges of the graph. We generalize this concept to Kt clique cover, i.e. a set of cliques that covers all complete subgraphs on t vertices of the graph, for every t ≥ 1. In particular, we extend a classical result of Erd¨os,Goodman, and P´osa(1966) on the edge clique cover number (t = 2), also known as the intersection number, to the case t = 3. The upper bound is tight, with equality holding only for the Tur´angraph T (n; 3). As part of the proof, we obtain new upper bounds on the classical intersection number, which may be of independent interest. We also extend an algorithm of Scheinerman and Trenk (1999) to solve a weighted version of the Kt clique cover problem on a superclass of chordal graphs. We also prove that the Kt clique cover problem is NP-hard. 1 Introduction A clique in a graph G is a set of vertices that induces a complete subgraph; all graphs considered in this paper are simple and undirected. A vertex clique cover of a graph G is a set of cliques in G that collectively cover all of its vertices. The vertex clique cover number of G, denoted θv(G), is the minimum number of cliques in a vertex clique cover of G. An edge clique cover of a graph G is a set of cliques of G that collectively cover all of its edges.
    [Show full text]
  • Hardness of Covering All the Triangles
    Hardness of covering all the triangles Thinh D. Nguyen∗ Moscow State University [email protected] July 17, 2018 Abstract Vertex Cover and Edge Cover are two classical examples that are often used to show the contrast of problem solvability. While Vertex Cover is hard, Edge Cover can be solved in polynomial time. We claim that the former remains intractable even if the objects to be covered are triangles instead of edges. Therefore, one more combinatorial optimization problem, namely Covering Triangles, is added to the decades-old list of the problems in this research area. I. Definitions and sketch of proof If G = (V, E) is a given graph, HG is a graph which has all of the vertices and edges of G, and also for each edge of G like e = (x, y), HG has a new vertex which is connected to both x and y. A triangle is a complete graph with 3 vertices, also denoted K3. Our goal is to prove that the problem of finding the minimum number of vertices to cover all the triangles in a given graph G is NP-complete. It is worth mentioning that: (i) the Vertex Cover problem asks for a vertex cover that covers all edges, (ii) our problem Covering Triangles problem also asks for a vertex cover but the covered objects are triangles, not edges. Hence, we call our problem Covering Triangle to avoid misleading notation. Sketch of Proof: We reduce from Vertex Cover of triangle-free cubic graph, Guruswami et al. prove the hardness of this problem in [2].
    [Show full text]