The Strongish Planted Clique Hypothesis and Its Consequences
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Computational Lower Bounds for Community Detection on Random Graphs
JMLR: Workshop and Conference Proceedings vol 40:1–30, 2015 Computational Lower Bounds for Community Detection on Random Graphs Bruce Hajek [email protected] Department of ECE, University of Illinois at Urbana-Champaign, Urbana, IL Yihong Wu [email protected] Department of ECE, University of Illinois at Urbana-Champaign, Urbana, IL Jiaming Xu [email protected] Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, PA, Abstract This paper studies the problem of detecting the presence of a small dense community planted in a large Erdos-R˝ enyi´ random graph G(N; q), where the edge probability within the community exceeds q by a constant factor. Assuming the hardness of the planted clique detection problem, we show that the computational complexity of detecting the community exhibits the following phase transition phenomenon: As the graph size N grows and the graph becomes sparser according to −α 2 q = N , there exists a critical value of α = 3 , below which there exists a computationally intensive procedure that can detect far smaller communities than any computationally efficient procedure, and above which a linear-time procedure is statistically optimal. The results also lead to the average-case hardness results for recovering the dense community and approximating the densest K-subgraph. 1. Introduction Networks often exhibit community structure with many edges joining the vertices of the same com- munity and relatively few edges joining vertices of different communities. Detecting communities in networks has received a large amount of attention and has found numerous applications in social and biological sciences, etc (see, e.g., the exposition Fortunato(2010) and the references therein). -
Spectral Algorithms (Draft)
Spectral Algorithms (draft) Ravindran Kannan and Santosh Vempala March 3, 2013 ii Summary. Spectral methods refer to the use of eigenvalues, eigenvectors, sin- gular values and singular vectors. They are widely used in Engineering, Ap- plied Mathematics and Statistics. More recently, spectral methods have found numerous applications in Computer Science to \discrete" as well \continuous" problems. This book describes modern applications of spectral methods, and novel algorithms for estimating spectral parameters. In the first part of the book, we present applications of spectral methods to problems from a variety of topics including combinatorial optimization, learning and clustering. The second part of the book is motivated by efficiency considerations. A fea- ture of many modern applications is the massive amount of input data. While sophisticated algorithms for matrix computations have been developed over a century, a more recent development is algorithms based on \sampling on the fly” from massive matrices. Good estimates of singular values and low rank ap- proximations of the whole matrix can be provably derived from a sample. Our main emphasis in the second part of the book is to present these sampling meth- ods with rigorous error bounds. We also present recent extensions of spectral methods from matrices to tensors and their applications to some combinatorial optimization problems. Contents I Applications 1 1 The Best-Fit Subspace 3 1.1 Singular Value Decomposition . .3 1.2 Algorithms for computing the SVD . .7 1.3 The k-means clustering problem . .8 1.4 Discussion . 11 2 Clustering Discrete Random Models 13 2.1 Planted cliques in random graphs . -
Approximating Cumulative Pebbling Cost Is Unique Games Hard
Approximating Cumulative Pebbling Cost Is Unique Games Hard Jeremiah Blocki Department of Computer Science, Purdue University, West Lafayette, IN, USA https://www.cs.purdue.edu/homes/jblocki [email protected] Seunghoon Lee Department of Computer Science, Purdue University, West Lafayette, IN, USA https://www.cs.purdue.edu/homes/lee2856 [email protected] Samson Zhou School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA https://samsonzhou.github.io/ [email protected] Abstract P The cumulative pebbling complexity of a directed acyclic graph G is defined as cc(G) = minP i |Pi|, where the minimum is taken over all legal (parallel) black pebblings of G and |Pi| denotes the number of pebbles on the graph during round i. Intuitively, cc(G) captures the amortized Space-Time complexity of pebbling m copies of G in parallel. The cumulative pebbling complexity of a graph G is of particular interest in the field of cryptography as cc(G) is tightly related to the amortized Area-Time complexity of the Data-Independent Memory-Hard Function (iMHF) fG,H [7] defined using a constant indegree directed acyclic graph (DAG) G and a random oracle H(·). A secure iMHF should have amortized Space-Time complexity as high as possible, e.g., to deter brute-force password attacker who wants to find x such that fG,H (x) = h. Thus, to analyze the (in)security of a candidate iMHF fG,H , it is crucial to estimate the value cc(G) but currently, upper and lower bounds for leading iMHF candidates differ by several orders of magnitude. Blocki and Zhou recently showed that it is NP-Hard to compute cc(G), but their techniques do not even rule out an efficient (1 + ε)-approximation algorithm for any constant ε > 0. -
A New Point of NP-Hardness for Unique Games
A new point of NP-hardness for Unique Games Ryan O'Donnell∗ John Wrighty September 30, 2012 Abstract 1 3 We show that distinguishing 2 -satisfiable Unique-Games instances from ( 8 + )-satisfiable instances is NP-hard (for all > 0). A consequence is that we match or improve the best known c vs. s NP-hardness result for Unique-Games for all values of c (except for c very close to 0). For these c, ours is the first hardness result showing that it helps to take the alphabet size larger than 2. Our NP-hardness reductions are quasilinear-size and thus show nearly full exponential time is required, assuming the ETH. ∗Department of Computer Science, Carnegie Mellon University. Supported by NSF grants CCF-0747250 and CCF-0915893, and by a Sloan fellowship. Some of this research performed while the author was a von Neumann Fellow at the Institute for Advanced Study, supported by NSF grants DMS-083537 and CCF-0832797. yDepartment of Computer Science, Carnegie Mellon University. 1 Introduction Thanks largely to the groundbreaking work of H˚astad[H˚as01], we have optimal NP-hardness of approximation results for several constraint satisfaction problems (CSPs), including 3Lin(Z2) and 3Sat. But for many others | including most interesting CSPs with 2-variable constraints | we lack matching algorithmic and NP-hardness results. Take the 2Lin(Z2) problem for example, in which there are Boolean variables with constraints of the form \xi = xj" and \xi 6= xj". The largest approximation ratio known to be achievable in polynomial time is roughly :878 [GW95], whereas it 11 is only known that achieving ratios above 12 ≈ :917 is NP-hard [H˚as01, TSSW00]. -
Public-Key Cryptography in the Fine-Grained Setting
Public-Key Cryptography in the Fine-Grained Setting B B Rio LaVigne( ), Andrea Lincoln( ), and Virginia Vassilevska Williams MIT CSAIL and EECS, Cambridge, USA {rio,andreali,virgi}@mit.edu Abstract. Cryptography is largely based on unproven assumptions, which, while believable, might fail. Notably if P = NP, or if we live in Pessiland, then all current cryptographic assumptions will be broken. A compelling question is if any interesting cryptography might exist in Pessiland. A natural approach to tackle this question is to base cryptography on an assumption from fine-grained complexity. Ball, Rosen, Sabin, and Vasudevan [BRSV’17] attempted this, starting from popular hardness assumptions, such as the Orthogonal Vectors (OV) Conjecture. They obtained problems that are hard on average, assuming that OV and other problems are hard in the worst case. They obtained proofs of work, and hoped to use their average-case hard problems to build a fine-grained one-way function. Unfortunately, they proved that constructing one using their approach would violate a popular hardness hypothesis. This moti- vates the search for other fine-grained average-case hard problems. The main goal of this paper is to identify sufficient properties for a fine-grained average-case assumption that imply cryptographic prim- itives such as fine-grained public key cryptography (PKC). Our main contribution is a novel construction of a cryptographic key exchange, together with the definition of a small number of relatively weak struc- tural properties, such that if a computational problem satisfies them, our key exchange has provable fine-grained security guarantees, based on the hardness of this problem. -
A New Approach to the Planted Clique Problem
A new approach to the planted clique problem Alan Frieze∗ Ravi Kannan† Department of Mathematical Sciences, Department of Computer Science, Carnegie-Mellon University. Yale University. [email protected]. [email protected]. 20, October, 2003 1 Introduction It is well known that finding the largest clique in a graph is NP-hard, [11]. Indeed, Hastad [8] has shown that it is NP-hard to approximate the size of the largest clique in an n vertex graph to within 1 ǫ a factor n − for any ǫ > 0. Not surprisingly, this has directed some researchers attention to finding the largest clique in a random graph. Let Gn,1/2 be the random graph with vertex set [n] in which each possible edge is included/excluded independently with probability 1/2. It is known that whp the size of the largest clique is (2+ o(1)) log2 n, but no known polymomial time algorithm has been proven to find a clique of size more than (1+ o(1)) log2 n. Karp [12] has even suggested that finding a clique of size (1+ ǫ)log2 n is computationally difficult for any constant ǫ > 0. Significant attention has also been directed to the problem of finding a hidden clique, but with only limited success. Thus let G be the union of Gn,1/2 and an unknown clique on vertex set P , where p = P is given. The problem is to recover P . If p c(n log n)1/2 then, as observed by Kucera [13], | | ≥ with high probability, it is easy to recover P as the p vertices of largest degree. -
Research Statement
RESEARCH STATEMENT SAM HOPKINS Introduction The last decade has seen enormous improvements in the practice, prevalence, and usefulness of machine learning. Deep nets give our phones ears; matrix completion gives our televisions good taste. Data-driven articial intelligence has become capable of the dicult—driving a car on a highway—and the nearly impossible—distinguishing cat pictures without human intervention [18]. This is the stu of science ction. But we are far behind in explaining why many algorithms— even now-commonplace ones for relatively elementary tasks—work so stunningly well as they do in the wild. In general, we can prove very little about the performance of these algorithms; in many cases provable performance guarantees for them appear to be beyond attack by our best proof techniques. This means that our current-best explanations for how such algorithms can be so successful resort eventually to (intelligent) guesswork and heuristic analysis. This is not only a problem for our intellectual honesty. As machine learning shows up in more and more mission- critical applications it is essential that we understand the guarantees and limits of our approaches with the certainty aorded only by rigorous mathematical analysis. The theory-of-computing community is only beginning to equip itself with the tools to make rigorous attacks on learning problems. The sophisticated learning pipelines used in practice are built layer-by-layer from simpler fundamental statistical inference algorithms. We can address these statistical inference problems rigorously, but only by rst landing on tractable problem de- nitions. In their traditional theory-of-computing formulations, many of these inference problems have intractable worst-case instances. -
Lipics-CCC-2021-35.Pdf (1
Hardness of KT Characterizes Parallel Cryptography Hanlin Ren # Ñ Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing, China Rahul Santhanam # University of Oxford, UK Abstract A recent breakthrough of Liu and Pass (FOCS’20) shows that one-way functions exist if and only if the (polynomial-)time-bounded Kolmogorov complexity, Kt, is bounded-error hard on average to compute. In this paper, we strengthen this result and extend it to other complexity measures: We show, perhaps surprisingly, that the KT complexity is bounded-error average-case hard if and only if there exist one-way functions in constant parallel time (i.e. NC0). This result crucially relies on the idea of randomized encodings. Previously, a seminal work of Applebaum, Ishai, and Kushilevitz (FOCS’04; SICOMP’06) used the same idea to show that NC0-computable one-way functions exist if and only if logspace-computable one-way functions exist. Inspired by the above result, we present randomized average-case reductions among the NC1- versions and logspace-versions of Kt complexity, and the KT complexity. Our reductions preserve both bounded-error average-case hardness and zero-error average-case hardness. To the best of our knowledge, this is the first reduction between the KT complexity and a variant of Kt complexity. We prove tight connections between the hardness of Kt complexity and the hardness of (the hardest) one-way functions. In analogy with the Exponential-Time Hypothesis and its variants, we define and motivate the Perebor Hypotheses for complexity measures such as Kt and KT. We show that a Strong Perebor Hypothesis for Kt implies the existence of (weak) one-way functions of near-optimal hardness 2n−o(n). -
Spectral Barriers in Certification Problems
Spectral Barriers in Certification Problems by Dmitriy Kunisky A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy Department of Mathematics New York University May, 2021 Professor Afonso S. Bandeira Professor Gérard Ben Arous © Dmitriy Kunisky all rights reserved, 2021 Dedication To my grandparents. iii Acknowledgements First and foremost, I am immensely grateful to Afonso Bandeira for his generosity with his time, his broad mathematical insight, his resourceful creativity, and of course his famous reservoir of open problems. Afonso’s wonderful knack for keeping up the momentum— for always finding some new angle or somebody new to talk to when we were stuck, for mentioning “you know, just yesterday I heard a very nice problem...” right when I could use something else to work on—made five years fly by. I have Afonso to thank for my sense of taste in research, for learning to collaborate, for remembering to enjoy myself, and above all for coming to be ever on the lookout for something new to think about. I am also indebted to Gérard Ben Arous, who, though I wound up working on topics far from his main interests, was always willing to hear out what I’d been doing, to encourage and give suggestions, and to share his truly encyclopedic knowledge of mathematics and its culture and history. On many occasions I’ve learned more tracking down what Gérard meant with the smallest aside in the classroom or in our meetings than I could manage to get out of a week on my own in the library. -
Parameterized AC0 – Some Upper and Lower Bounds
Parameterized AC0 { Some upper and lower bounds Yijia Chen Fudan University September 19th, 2017 @ Shonan AC0 0 A family of Boolean circuits Cn are AC -circuits if for every n 2 n2N N n (i)C n computes a Boolean function from f0; 1g to f0; 1g; (ii) the depth of Cn is bounded by a fixed constant; (iii) the size of Cn is polynomially bounded in n. Remark 1. Without (ii), we get a family of polynomial-size circuits Cn , which n2N decides a language in P/ poly. 2. If Cn is computable by a TM in time O(log n), then Cn is n2N dlogtime-uniform, which corresponds to FO(<; +; ×) [Barrington, Immerman, and Straubing, 1990]. The k-clique problem Definition Let G be a graph and k 2 N. Then a subset C ⊆ V (G) is a k-clique if (i) for every two vertices u; v 2 V (G) either u = v or fu; vg 2 E(G), (ii) and jCj = k. 296 CHAPTER 7 / TIME COMPLEXITY FIGURE 7.23 A graph with a 5-cliqueA graph with a 5-clique. The clique problem is to determine whether a graph contains a clique of a specified size. Let CLIQUE = G, k G is an undirected graph with a k-clique . {h i| } THEOREM 7.24 CLIQUE is in NP. PROOF IDEA The clique is the certificate. PROOF The following is a verifier V for CLIQUE. V = “On input G, k , c : hh i i 1. Test whether c is a subgraph with k nodes in G. 2. -
Finding Planted Cliques in Sublinear Time
Finding Planted Cliques in Sublinear Time Jay Mardia∗ Hilal Asi∗ Kabir Aladin Chandrasekher∗ Abstract We study the planted clique problem in which a clique of size k is planted in an Erd}os-R´enyi graph G(n; 1 ) and one is interested in recovering this planted clique. It is well known that for p 2 k = Ω( n), polynomial time algorithms can find the planted clique. In fact, the fastest known algorithms in this regime run in time linear O(n2) (or nearly linear) in the size of the input [FR10, DGGP14, DM15a]. In this work, we initiate thep development of sublinear time algorithms that find the planted clique in the regime k = !( n log log n). Our algorithms can reliably recover the clique n 3 3 p 2 k2 in time Oe n + ( ) = Oe n 2 when k = Ω( n log n), and in time Oe n = exp for p k p 24n !( n log log n) = k = o( n log n). An Ω(n) running time lower bound for the planted clique recovery problem follows easily from the results of [RS19] and therefore our recovery algorithms 2 are optimal whenever k = Ω(n 3 ). As the lower bound of [RS19] builds on purely information theoretic arguments, it cannot n2 p provide a detection lower bound stronger than Ω(e k2 ). Since our algorithms for k = Ω( n log n) n3 run in time Oe k3 + n , we show stronger lower bounds based on computational hardness assumptions. Using a slightly different formalization of the planted clique problem, in which every vertex is included in the clique independently with probability k=n, we show that the Planted Clique Conjecture implies the following. -
Limits of Approximation Algorithms: Pcps and Unique Games (DIMACS Tutorial Lecture Notes)1
Limits of Approximation Algorithms: PCPs and Unique Games (DIMACS Tutorial Lecture Notes)1 Organisers: Prahladh Harsha & Moses Charikar arXiv:1002.3864v1 [cs.CC] 20 Feb 2010 1Jointly sponsored by the DIMACS Special Focus on Hardness of Approximation, the DIMACS Special Focus on Algorithmic Foundations of the Internet, and the Center for Computational Intractability with support from the National Security Agency and the National Science Foundation. Preface These are the lecture notes for the DIMACS Tutorial Limits of Approximation Algorithms: PCPs and Unique Games held at the DIMACS Center, CoRE Building, Rutgers University on 20-21 July, 2009. This tutorial was jointly sponsored by the DIMACS Special Focus on Hardness of Approximation, the DIMACS Special Focus on Algorithmic Foundations of the Internet, and the Center for Computational Intractability with support from the National Security Agency and the National Science Foundation. The speakers at the tutorial were Matthew Andrews, Sanjeev Arora, Moses Charikar, Prahladh Harsha, Subhash Khot, Dana Moshkovitz and Lisa Zhang. We thank the scribes { Ashkan Aazami, Dev Desai, Igor Gorodezky, Geetha Jagannathan, Alexander S. Kulikov, Darakhshan J. Mir, Alan- tha Newman, Aleksandar Nikolov, David Pritchard and Gwen Spencer for their thorough and meticulous work. Special thanks to Rebecca Wright and Tami Carpenter at DIMACS but for whose organizational support and help, this workshop would have been impossible. We thank Alantha Newman, a phone conversation with whom sparked the idea of this workshop. We thank the Imdadullah Khan and Aleksandar Nikolov for video recording the lectures. The video recordings of the lectures will be posted at the DIMACS tutorial webpage http://dimacs.rutgers.edu/Workshops/Limits/ Any comments on these notes are always appreciated.