Inapproximability of Treewidth and Related Problems (Extended Abstract) ∗

Total Page:16

File Type:pdf, Size:1020Kb

Inapproximability of Treewidth and Related Problems (Extended Abstract) ∗ Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence (IJCAI 2015) Inapproximability of Treewidth and Related Problems (Extended Abstract) ∗ Yu (Ledell) Wu y, Per Austrin z, Toniann Pitassi and David Liu Department of Computer Science, University of Toronto, Ontario, Canada fwuyu,austrin,toni,[email protected] Abstract the distribution that allows one to express it more succinctly. A graphical model represents such a joint probability distribu- Graphical models, such as Bayesian Networks and tion by a graph where the vertices represent the random vari- Markov networks play an important role in artifi- ables, and the dependences are modeled by the graph struc- cial intelligence and machine learning. Inference is ture. Associated with each vertex of the graph is a conditional a central problem to be solved on these networks. probability table, which specifies the conditional probabili- This, and other problems on these graph models ties of this random variable, conditioned on its neighboring are often known to be hard to solve in general, vertices. The two most common types of graphical models but tractable on graphs with bounded Treewidth. are Bayesian networks (also called belief networks), where Therefore, finding or approximating the Treewidth the underlying graph is directed, and Markov networks (also of a graph is a fundamental problem related to in- called Markov random fields), where the underlying graph is ference in graphical models. In this paper, we study undirected. The most basic problem in graphical models is the approximability of a number of graph problems: the inference problem, which is the problem of computing Treewidth and Pathwidth of graphs, Minimum Fill- the posterior marginal distribution of a variable at some ver- In, and a variety of different graph layout problems tex. Unfortunately, inference in general is well-known to be such as Minimum Cut Linear Arrangement. We NP-hard to compute exactly as well as to approximate [Roth, show that, assuming Small Set Expansion Conjec- 1996]. ture, all of these problems are NP-hard to approx- Despite this intractability, an important class of bounded imate to within any constant factor in polynomial Treewidth instances of probabilistic inference has been iden- time. tified and shown to be exactly computable in polynomial This paper is an extended abstract of the Journal of time. The Treewidth of a graph [Robertson and Seymour, Artificial Intelligence Research [Wu et al., 2014] 1984] is a fundamental parameter of a graph that measures how close the graph is to being a tree. Treewidth is very 1 Introduction closely related to the other notions in machine learning such as Branch-width, Clique-width and Elimination-width (for an Graphical models provide a computational framework for ef- overview of Treewidth and related notions, see [Bodlaender ficiently manipulating probability distributions over high di- et al., 1995]). On graphs with small Treewidth and where mensional spaces, often involving hundreds of thousands of the tree decomposition is known, a dynamic programming al- variables. This framework has found applications in an enor- gorithm yields a polynomial-time algorithm. Particular al- mous range of domains including: medical and fault diag- gorithms for probabilistic inference on bounded Treewidth nosis, image understanding, speech recognition, web search, graphs are the junction-tree method, variable elimination and coding theory, and statistical physics [Koller and Friedman, clique trees (e.g. see [Koller and Friedman, 2009], ch. 9, 10). 2009]. A graphical model is an efficient representation of a The same ideas also yield polynomial-time algorithms and joint distribution over some set of n random variables. Even often even linear time algorithms for small Treewidth in- if the random variables are binary, it is well known that an ar- n stances for an astonishing variety of other NP-hard prob- bitrary joint distribution requires the specification of 2 prob- lems, including: satisfiability, counting satisfying assign- abilities. Luckily, in the real world, there is often structure in ments, constraint satisfaction, vertex cover, maximum inde- ∗This paper is an extended abstract of the Journal of Artificial pendent set, Hamiltonian circuit, matrix decomposition, and Intelligence Research [Wu et al., 2014]. Research supported by more generally all problems definable in monadic second- NSERC. order logic. (See the excellent survey [Bodlaender, 2005] for yPresent Affiliation: Facebook AI Research, 770 Broadway, New motivation, including theoretical as well as practical applica- York, USA. tions of Treewidth.) One catch is that for all of these prob- zPresent Affiliation: School of Computer Science and Commu- lems, the algorithm must begin by finding a tree decompo- nication, KTH Royal Institute of Technology, Stockholm, Sweden sition, and then use the decomposition to solve the problem. 4222 Given the tree decomposition, the algorithm is typically ex- optimal lower bound, since there is a simple algorithm that 7 ponential in the width of the underlying tree decomposition. achieves an approximation ratio of 8 . Despite this success, Thus there is a need for efficient algorithms to actually com- for many important problems, such as Vertex Cover, Max- pute the Treewidth of a given graph, and to find tree decom- Cut, and Kernel Clustering, the hardness of approximation positions with optimal or close to optimal width. results obtained through the PCP theorem have not matched Unfortunately, while there are many good heuristics for the best approximation algorithms known. finding a good tree decomposition, it is NP-hard in gen- The formulation of the Unique Games Conjecture (UGC) eral to determine the Treewidth of a graph [Arnborg et al., due to Khot [Khot, 2002] was intended to clarify the ap- 1987]. However, Bodlaender et al. [Bodlaender et al., proximability of many optimization problems. The conjec- 1995] obtained an O(log n) factor approximation algorithm ture postulates that the problem of determining the value of for Treewidth. In fact, they actually show that if there is a a certain type of game, known as a unique game, is NP-hard. factor c approximation algorithm for vertex separator, then Under UGC, many of the known algorithms in approximation there is an O(c) approximation algorithm for Treewidth. And are proven to be tight (for an excellent survey on this topic, if there is a factor b approximation algorithm for Treewidth see [Khot and Vishnoi, 2005]). For instance, under the UGC, then there is an O(b log n) approximation algorithm for the the Vertex Cover problem is NP-hard to approximate within related Pathwidth problem. The best currentlyp known approx- a factor of 2 − , for any > 0 [Khot and Regev, 2008]. imation factor for vertex separator is O( log n) [Feige et al., Perhaps most strikingly, Raghavendra [Raghavendra, 2008] 2005] and thus the best algorithm for Treewidth finds a tree proved that under the UGC, the semi-definite programming p (SDP) approximation algorithm for a large class of constraint decomposition that is withinp an O( log n) factor of the opti- mal width, and an O(( log n)(log n)) factor approximation satisfaction problems (CSP) are essentially the best one can algorithm for Pathwidth. hope for. Thus, the UGC has become the central open prob- It is a longstanding open question whether or not there is a lem in inapproximability and encapsulates the barrier of de- constant factor approximation algorithm for Treewidth. Such signing better polynomial time approximation algorithms for an algorithm would lead to faster algorithms to find good tree- a large class of problems. decompositions for all of the problems mentioned above. The Despite this tremendous progress, still there remain im- current best known algorithm that achieves a constant factor portant yet stubborn problems such as Treewidth, Balanced approximation for Treewidth runs in time 2O(w)O(n), where Separator, Minimum Linear Arrangement (MLA), and many w is the Treewidth of the underlying graph, and achieves a other graph layout problems whose approximation status re- factor 5 approximation [Bodlaender, 2007]. Similarly, the ap- mains unresolved even assuming the UGC. In the work of proximability of many related graph layout problems is also Raghavendra and Steurer [Raghavendra and Steurer, 2010], unresolved, including Minimum Cut Linear Arrangement and the Small Set Expansion (SSE) Conjecture was introduced, Interval Graph Completion. and it was shown that it implies the UGC, and that the In this paper, we make an important step to resolve this SSE Conjecture follows if one assumes that the UGC is problem by showing that Treewidth, Pathwidth, and a host true for somewhat expanding graphs. In follow-up work by of related graph layout problems are hard to approximate to Raghavendra et al. [Raghavendra et al., 2012], it was shown within any constant factor, under the Small Set Expansion that the SSE Conjecture is in fact equivalent to the UGC on (SSE) conjecture [Raghavendra and Steurer, 2010]. somewhat expanding graphs, and that the SSE Conjecture im- The SSE conjecture is a strengthened version of the con- plies NP-hardness of approximation for balanced separator jecture that P is different from NP and warrants some ex- and MLA. In this light, the Small Set Expansion conjecture planation. In the next subsection (Section 1.1), we explain serves as a natural unified conjecture that yields all of the im- the SSE conjecture, and how it relates to the P versus NP plications of UGC and also hardness for expansion-like prob- question and to related conjectures. We then state our main lems that could not be resolved with the UGC. hardness results for Treewidth, Pathwidth and graph layout Our main contribution in this paper is to prove that a wide problems (Sections 1.2, and 1.4), and discuss related results range of other graph layout problems are SSE-hard to approx- in Section 1.5. imate to within any constant factor. For these problems, no evidence of hardness of approximation was known prior to 1.1 The Small Set Expansion Conjecture our results.
Recommended publications
  • Chicago Journal of Theoretical Computer Science the MIT Press
    Chicago Journal of Theoretical Computer Science The MIT Press Volume 1997, Article 1 12 March 1997 ISSN 1073–0486. MIT Press Journals, 55 Hayward St., Cambridge, MA 02142 USA; (617)253-2889; [email protected], [email protected]. Published one article at a time in LATEX source form on the Internet. Pag- ination varies from copy to copy. For more information and other articles see: http://www-mitpress.mit.edu/jrnls-catalog/chicago.html • http://www.cs.uchicago.edu/publications/cjtcs/ • ftp://mitpress.mit.edu/pub/CJTCS • ftp://cs.uchicago.edu/pub/publications/cjtcs • Feige and Kilian Limited vs. Polynomial Nondeterminism (Info) The Chicago Journal of Theoretical Computer Science is abstracted or in- R R R dexed in Research Alert, SciSearch, Current Contents /Engineering Com- R puting & Technology, and CompuMath Citation Index. c 1997 The Massachusetts Institute of Technology. Subscribers are licensed to use journal articles in a variety of ways, limited only as required to insure fair attribution to authors and the journal, and to prohibit use in a competing commercial product. See the journal’s World Wide Web site for further details. Address inquiries to the Subsidiary Rights Manager, MIT Press Journals; (617)253-2864; [email protected]. The Chicago Journal of Theoretical Computer Science is a peer-reviewed scholarly journal in theoretical computer science. The journal is committed to providing a forum for significant results on theoretical aspects of all topics in computer science. Editor in chief: Janos Simon Consulting
    [Show full text]
  • (L ,J SEP 3 0 2009 LIBRARIES
    Nearest Neighbor Search: the Old, the New, and the Impossible by Alexandr Andoni Submitted to the Department of Electrical Engineering and Computer Science in partial fulfillment of the requirements for the degree of Doctor of Philosophy at the MASSACHUSETTS INSTITUTE OF TECHNOLOGY September 2009 © Massachusetts Institute of Technology 2009. All rights reserved. Author .............. .. .. ......... .... ..... ... ......... ....... Department of Electrical Engineering and Computer Science September 4, 2009 (l ,J Certified by................ Piotr I/ yk Associate Professor Thesis Supervisor Accepted by ................................ /''~~ Terry P. Orlando Chairman, Department Committee on Graduate Students MASSACHUSETTS PaY OF TECHNOLOGY SEP 3 0 2009 ARCHIVES LIBRARIES Nearest Neighbor Search: the Old, the New, and the Impossible by Alexandr Andoni Submitted to the Department of Electrical Engineering and Computer Science on September 4, 2009, in partial fulfillment of the requirements for the degree of Doctor of Philosophy Abstract Over the last decade, an immense amount of data has become available. From collections of photos, to genetic data, and to network traffic statistics, modern technologies and cheap storage have made it possible to accumulate huge datasets. But how can we effectively use all this data? The ever growing sizes of the datasets make it imperative to design new algorithms capable of sifting through this data with extreme efficiency. A fundamental computational primitive for dealing with massive dataset is the Nearest Neighbor (NN) problem. In the NN problem, the goal is to preprocess a set of objects, so that later, given a query object, one can find efficiently the data object most similar to the query. This problem has a broad set of applications in data processing and analysis.
    [Show full text]
  • Approximation, Randomization, and Combinatorial Optimization
    Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques 17th International Workshop, APPROX 2014, and 18th International Workshop, RANDOM 2014 September 4–6, 2014, Barcelona, Spain Edited by Klaus Jansen José D. P. Rolim Nikhil R. Devanur Cristopher Moore LIPIcs – Vol. 28 – APPROX/RANDOM’14 www.dagstuhl.de/lipics Editors Klaus Jansen José D. P. Rolim University of Kiel University of Geneva Kiel Geneva [email protected] [email protected] Nikhil R. Devanur Cristopher Moore Microsoft Research Santa Fe Institute Redmond New Mexico [email protected] [email protected] ACM Classification 1998 C.2.1 Network Architecture and Design, C.2.2 Computer-communication, E.4 Coding and Information Theory, F. Theory of Computation, F.1.0 Computation by Abstract Devices, F.1.1 Models of Computation – relations between models, F.1.2 Modes of Computation, F.1.3 Complexity Measures and Classes, F.2.0 Analysis of Algorithms and Problem Complexity, F.2.1 Numerical Algorithms and Problems, F.2.2 Nonnumerical Algorithms and Problems G.1.2 Approximation, G.1.6 Optimization, G.2 Discrete Mathematics, G.2.1 Combinatorics, G.2.2 Graph Theory, G.3 Probability and Statistics, I.1.2 Algorithms, J.4 Computer Applications – Social and Behavioral Sciences ISBN 978-3-939897-74-3 Published online and open access by Schloss Dagstuhl – Leibniz-Zentrum für Informatik GmbH, Dagstuhl Publishing, Saarbrücken/Wadern, Germany. Online available at http://www.dagstuhl.de/dagpub/978-3-939897-74-3. Publication date September, 2014 Bibliographic information published by the Deutsche Nationalbibliothek The Deutsche Nationalbibliothek lists this publication in the Deutsche Nationalbibliografie; detailed bibliographic data are available in the Internet at http://dnb.d-nb.de.
    [Show full text]
  • Uriel Feige, Publications, July 2017. Papers Are Sorted by Categories. for Papers That Have More Than One Version (Typically, Jo
    Uriel Feige, Publications, July 2017. Papers are sorted by categories. For papers that have more than one version (typically, journal version and conference proceedings), the different versions are combined into one entry in the list. STOC is shorthand for \Annual ACM Symposium on the Theory of Computing". FOCS is shorthand for \Symposium on Foundations of Computer Science". Journal publications 1. Uriel Feige, Amos Fiat, and Adi Shamir. \Zero Knowledge Proofs of Identity". Jour- nal of Cryptology, 1988, Vol. 1, pp. 77-94. Preliminary version appeared in Proc. of the 19th STOC, 1987, pp. 210-217. 2. Uriel Feige, David Peleg, Prabhakar Raghavan and Eli Upfal. \Randomized Broadcast in Networks". Random Structures and algorithms, 1(4), 1990, pp. 447-460. Preliminary version appeared in proceedings of SIGAL International Symposium on Algorithms, Tokyo, Japan, August 1990. 3. Uriel Feige and Adi Shamir. \Multi-Oracle Interactive Protocols with Constant Space Verifiers”. Journal of Computer and System Sciences, Vol. 44, pp. 259-271, April 1992. Preliminary version appeared in Proceedings of Fourth Annual Conference on Struc- ture in Complexity Theory, June 1989. 4. Uriel Feige. "On the Complexity of Finite Random Functions". Information Process- ing Letters 44 (1992) 295-296. 5. Uriel Feige, David Peleg, Prabhakar Raghavan and Eli Upfal. \Computing With Noisy Information". SIAM Journal on Computing, 23(5), 1001{1018, 1994. Preliminary version appeared in Proceedings of 22nd STOC, 1990, pp. 128-137. 6. Noga Alon, Uriel Feige, Avi Wigderson, and David Zuckerman. "Derandomized Graph Products". Computational Complexity 5 (1995) 60{75. 7. Uriel Feige. \A Tight Upper Bound on the Cover Time for Random Walks on Graphs".
    [Show full text]
  • László Lovász Avi Wigderson of Eötvös Loránd University of the Institute for Advanced Study, in Budapest, Hungary and Princeton, USA
    2021 The Norwegian Academy of Science and Letters has decided to award the Abel Prize for 2021 to László Lovász Avi Wigderson of Eötvös Loránd University of the Institute for Advanced Study, in Budapest, Hungary and Princeton, USA, “for their foundational contributions to theoretical computer science and discrete mathematics, and their leading role in shaping them into central fields of modern mathematics.” Theoretical Computer Science (TCS) is the study of computational lens”. Discrete structures such as the power and limitations of computing. Its roots go graphs, strings, permutations are central to TCS, back to the foundational works of Kurt Gödel, Alonzo and naturally discrete mathematics and TCS Church, Alan Turing, and John von Neumann, leading have been closely allied fields. While both these to the development of real physical computers. fields have benefited immensely from more TCS contains two complementary sub-disciplines: traditional areas of mathematics, there has been algorithm design which develops efficient methods growing influence in the reverse direction as well. for a multitude of computational problems; and Applications, concepts, and techniques from TCS computational complexity, which shows inherent have motivated new challenges, opened new limitations on the efficiency of algorithms. The notion directions of research, and solved important open of polynomial-time algorithms put forward in the problems in pure and applied mathematics. 1960s by Alan Cobham, Jack Edmonds, and others, and the famous P≠NP conjecture of Stephen Cook, László Lovász and Avi Wigderson have been leading Leonid Levin, and Richard Karp had strong impact on forces in these developments over the last decades. the field and on the work of Lovász and Wigderson.
    [Show full text]
  • How Well Do Random Walks Parallelize?
    How Well Do Random Walks Parallelize? Klim Efremenko ∗ Omer Reingold † August 24, 2009 Abstract A random walk on a graph is a process that explores the graph in a random way: at each step the walk is at a vertex of the graph, and at each step it moves to a uniformly selected neighbor of this vertex. Random walks are extremely useful in computer science and in other fields. A very natural problem that was recently raised by Alon, Avin, Koucky, Kozma, Lotker, and Tuttle (though it was implicit in several previous papers) is to analyze the behavior of k independent walks in comparison with the behavior of a single walk. In particular, Alon et al. showed that in various settings (e.g., for expander graphs), k random walks cover the graph (i.e., visit all its nodes), Ω(k)-times faster (in expectation) than a single walk. In other words, in such cases k random walks efficiently “parallelize” a single random walk. Alon et al. also demonstrated that, depending on the specific setting, this “speedup” can vary from logarithmic to exponential in k. In this paper we initiate a more systematic study of multiple random walks. We give lower and upper bounds both on the cover time and on the hitting time (the time it takes to hit one specific node) of multiple random walks. Our study revolves over three alternatives for the starting vertices of the random walks: the worst starting vertices (those who maximize the hitting/cover time), the best starting vertices, and starting vertices selected from the stationary distribution.
    [Show full text]
  • Classical Cryptographic Protocols in a Quantum World∗
    Classical Cryptographic Protocols in a Quantum World∗ Sean Hallgren †‡ Adam Smith †§ Fang Song ¶ Abstract Cryptographic protocols, such as protocols for secure function evaluation (SFE), have played a cru- cial role in the development of modern cryptography. The extensive theory of these protocols, however, deals almost exclusively with classical attackers. If we accept that quantum information processing is the most realistic model of physically feasible computation, then we must ask: what classical protocols remain secure against quantum attackers? Our main contribution is showing the existence of classical two-party protocols for the secure eval- uation of any polynomial-time function under reasonable computational assumptions (for example, it suffices that the learning with errors problem be hard for quantum polynomial time). Our result shows that the basic two-party feasibility picture from classical cryptography remains unchanged in a quantum world. ∗A preliminary version of this work appeared in Advances in Cryptology - CRYPTO 2011. †Department of Computer Science and Engineering, Pennsylvania State University, University Park, PA, U.S.A. ‡Partially supported by National Science Foundation award CCF-0747274 and by the National Security Agency (NSA) under Army Research Office (ARO) contract number W911NF-08-1-0298. §Partially supported by National Science Foundation award CCF-0747294. ¶Department of Combinatorics & Optimization and Institute for Quantum Computing, University of Waterloo, Canada. Partially supported by Cryptoworks21, NSERC, ORF and US ARO. Most work was conducted while at the Pennsylvania State University. 1 Contents 1 Introduction 3 1.1 Our Contributions . .3 1.2 Related work . .5 1.3 Future Directions . .6 2 Preliminaries 7 3 Modeling Security in the Presence of Quantum Attacks 9 3.1 A General Quantum Stand-Alone Security Model .
    [Show full text]
  • Lipics-APPROX-RANDOM-2019-0
    Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques APPROX/RANDOM 2019, September 20–22, 2019, Massachusetts Institute of Technology, Cambridge, MA, USA Edited by Dimitris Achlioptas László A. Végh L I P I c s – Vo l . 145 – APPROX/RANDOM 2019 www.dagstuhl.de/lipics Editors Dimitris Achlioptas UC Santa Cruz, California, USA [email protected] László A. Végh London School of Economics and Political Science, London, UK [email protected] ACM Classification 2012 Mathematics of computing; Theory of computation ISBN 978-3-95977-125-2 Published online and open access by Schloss Dagstuhl – Leibniz-Zentrum für Informatik GmbH, Dagstuhl Publishing, Saarbrücken/Wadern, Germany. Online available at https://www.dagstuhl.de/dagpub/978-3-95977-125-2. Publication date September, 2019 Bibliographic information published by the Deutsche Nationalbibliothek The Deutsche Nationalbibliothek lists this publication in the Deutsche Nationalbibliografie; detailed bibliographic data are available in the Internet at https://portal.dnb.de. License This work is licensed under a Creative Commons Attribution 3.0 Unported license (CC-BY 3.0): https://creativecommons.org/licenses/by/3.0/legalcode. In brief, this license authorizes each and everybody to share (to copy, distribute and transmit) the work under the following conditions, without impairing or restricting the authors’ moral rights: Attribution: The work must be attributed to its authors. The copyright is retained by the corresponding authors. Digital Object Identifier: 10.4230/LIPIcs.APPROX-RANDOM.2019.0 ISBN 978-3-95977-125-2 ISSN 1868-8969 https://www.dagstuhl.de/lipics 0:iii LIPIcs – Leibniz International Proceedings in Informatics LIPIcs is a series of high-quality conference proceedings across all fields in informatics.
    [Show full text]
  • Hardness of Approximation, NP ⊆ PCP[Polylog,Polylog]
    CS221: Computational Complexity Prof. Salil Vadhan Lecture 34: More Hardness of Approximation, NP ⊆ PCP[polylog; polylog] 12/16 Scribe: Vitaly Feldman Contents 1 More Tight Hardness of Approximation Results 1 2 Open Problems 2 3 Sketch of the Proof of the PCP Theorem 2 Additional Readings. Papadimitriou, Ch. 13 has a detailed treatment of approximability, with less emphasis on the PCP theorem (which is stated in different language in Thm 13.12). 1 More Tight Hardness of Approximation Results Last time we saw the tight results for hardness of approximation of Max3SAT and E3LIN2. Today we will survey some other tight hardness of approximation results. Definition 1 MaxClique — given a graph G = (V; E) find the largest clique. This problem is completely equivalent to finding the maximum independent set. Best known approximation algorithm gives n -approximation (which is not much better than the trivial log2 n n-approximation). Theorem 2 (H˚astad) It is NP-hard to approximate MaxClique within n1¡² factor for any positive ². The result is proven by tailoring the PCP Theorem for this specific problem (namely, optimizing the so-called “amortized free bit complexity” of the PCP). Definition 3 Set Cover problem: given a collection of sets S1;:::;Sm which are subsets of the set f1; : : : ; ng, find the smallest subcollection of sets Si1 ;:::;Sit whose union is equal to f1; : : : ; ng (that is, they cover the “universe”). The greedy algorithm gives H(n) ¼ ln n approximation to this problem (H(n) is the n-th harmonic number). 1 Theorem 4 (Feige) It is NP-hard to approximate Set Cover to within (1 ¡ ²) ln n factor for any positive ².
    [Show full text]
  • Open Main.Pdf
    The Pennsylvania State University The Graduate School EFFICIENT COMBINATORIAL METHODS IN SPARSIFICATION, SUMMARIZATION AND TESTING OF LARGE DATASETS A Dissertation in Computer Science and Engineering by Grigory Yaroslavtsev c 2014 Grigory Yaroslavtsev Submitted in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy May 2014 The dissertation of Grigory Yaroslavtsev was reviewed and approved∗ by the following: Sofya Raskhodnikova Associate Professor of Computer Science and Engineering Chair of Committee and Dissertation Advisor Piotr Berman Associate Professor of Computer Science and Engineering Jason Morton Assistant Professor of Mathematics and Statistics Adam D. Smith Associate Professor of Computer Science and Engineering Raj Acharya Head of Department of Computer Science and Engineering ∗Signatures are on file in the Graduate School. Abstract Increasingly large amounts of structured data are being collected by personal computers, mobile devices, personal gadgets, sensors, etc., and stored in data centers operated by the government and private companies. Processing of such data to extract key information is one of the main challenges faced by computer scientists. Developing methods for construct- ing compact representations of large data is a natural way to approach this challenge. This thesis is focused on rigorous mathematical and algorithmic solutions for sparsi- fication and summarization of large amounts of information using discrete combinatorial methods. Areas of mathematics most closely related to it are graph theory, information theory and analysis of real-valued functions over discrete domains. These areas, somewhat surprisingly, turn out to be related when viewed through the computational lens. In this thesis we illustrate the power and limitations of methods for constructing small represen- tations of large data sets, such as graphs and databases, using a variety methods drawn from these areas.
    [Show full text]
  • Federated Computing Research Conference, FCRC’96, Which Is David Wise, Steering Being Held May 20 - 28, 1996 at the Philadelphia Downtown Marriott
    CRA Workshop on Academic Careers Federated for Women in Computing Science 23rd Annual ACM/IEEE International Symposium on Computing Computer Architecture FCRC ‘96 ACM International Conference on Research Supercomputing ACM SIGMETRICS International Conference Conference on Measurement and Modeling of Computer Systems 28th Annual ACM Symposium on Theory of Computing 11th Annual IEEE Conference on Computational Complexity 15th Annual ACM Symposium on Principles of Distributed Computing 12th Annual ACM Symposium on Computational Geometry First ACM Workshop on Applied Computational Geometry ACM/UMIACS Workshop on Parallel Algorithms ACM SIGPLAN ‘96 Conference on Programming Language Design and Implementation ACM Workshop of Functional Languages in Introductory Computing Philadelphia Skyline SIGPLAN International Conference on Functional Programming 10th ACM Workshop on Parallel and Distributed Simulation Invited Speakers Wm. A. Wulf ACM SIGMETRICS Symposium on Burton Smith Parallel and Distributed Tools Cynthia Dwork 4th Annual ACM/IEEE Workshop on Robin Milner I/O in Parallel and Distributed Systems Randy Katz SIAM Symposium on Networks and Information Management Sponsors ACM CRA IEEE NSF May 20-28, 1996 SIAM Philadelphia, PA FCRC WELCOME Organizing Committee Mary Jane Irwin, Chair Penn State University Steve Mahaney, Vice Chair Rutgers University Alan Berenbaum, Treasurer AT&T Bell Laboratories Frank Friedman, Exhibits Temple University Sampath Kannan, Student Coordinator Univ. of Pennsylvania Welcome to the second Federated Computing Research Conference, FCRC’96, which is David Wise, Steering being held May 20 - 28, 1996 at the Philadelphia downtown Marriott. This second Indiana University FCRC follows the same model of the highly successful first conference, FCRC’93, in Janice Cuny, Careers which nine constituent conferences participated.
    [Show full text]
  • Program of the Thirty-Second Annual ACM Symposium on Theory of Computing
    Program of the Thirty-Second Annual ACM Symposium on Theory of Computing May 21-23, 2000, Portland, Oregon, USA Important Note There will be a workshop on “Challenges for Theoretical Computer Science” on Saturday, May 20, the day before the STOC conference starts. Details will be posted on the SIGACT web page as they become available. Sunday, May 21, 2000 Session 1A Chair: Frances Yao 8:30 am Extractors and pseudo-random generators with optimal seed length Russell Impagliazzo, Ronen Shaltiel, and Avi Wigderson 8:55 am Pseudo-random functions and factoring Moni Naor, Omer Reingold, and Alon Rosen 9:20 am Satisfiability of equations in free groups is in PSPACE Claudio Guti´errez 9:45 am Setting 2 variables at a time yields a new lower bound for random 3-SAT Dimitris Achlioptas Sessions 1B Chair: Ding-Zhu Du 8:30 am A new algorithmic approach to the general Lovasz Local Lemma with applications to scheduling and satisfiability problems Artur Czumaj and Christian Scheideler 8:55 am A deterministic polynomial-time algorithm for approximating mixed discriminant and mixed volume Leonid Gurvits and Alex Samorodnitsky 9:20 am Randomized metarounding Robert Carr and Santosh Vempala 9:45 am Isomorphism testing for embeddable graphs through definability Martin Grohe Coffee Break 10:10 am - 10:30 am Session 2A Chair: Frances Yao 10:30 am Circuit minimization problem Valentine Kabanets and Jin-Yi Cai 10:55 am On the efficiency of local decoding procedures for error-correcting codes Jonathan Katz and Luca Trevisan 11:20 am Statistical mechanics, three-dimensionality
    [Show full text]