Proof Verification and Hardness of Approximation Problems

Proof Verification and Hardness of Approximation Problems

Proof Verification and Hardness of Approximation Problems Sanjeev Arora∗ Carsten Lundy Rajeev Motwaniz Madhu Sudanx Mario Szegedy{ Abstract that finding a solution within any constant factor of optimal is also NP-hard. Garey and Johnson The class PCP(f(n); g(n)) consists of all languages [GJ76] studied MAX-CLIQUE: the problem of find- L for which there exists a polynomial-time probabilistic ing the largest clique in a graph. They showed oracle machine that uses O(f(n)) random bits, queries that if a polynomial time algorithm computes MAX- O(g(n)) bits of its oracle and behaves as follows: If CLIQUE within a constant multiplicative factor then x 2 L then there exists an oracle y such that the ma- MAX-CLIQUE has a polynomial-time approximation chine accepts for all random choices but if x 62 L then scheme (PTAS) [GJ78], i.e., for any c > 1 there ex- for every oracle y the machine rejects with high proba- ists a polynomial-time algorithm that approximates bility. Arora and Safra very recently characterized NP MAX-CLIQUE within a factor of c. They used graph as PCP(log n; (log log n)O(1)). We improve on their products to construct \gap increasing reductions" that result by showing that NP = PCP(log n; 1). Our re- mapped an instance of the clique problem into an- sult has the following consequences: other instance in order to enlarge the relative gap of 1. MAXSNP-hard problems (e.g., metric TSP, the clique sizes. Berman and Schnitger [BS92] used MAX-SAT, MAX-CUT) do not have polynomial the ideas of increasing gaps to show that if the clique time approximation schemes unless P=NP. size of graphs with bounded co-degree (degree of the complement) does not have a randomized PTAS then 2. For some > 0 the size of the maximal clique in there is an > 0 such that MAX-CLIQUE cannot a graph cannot be approximated within a factor of be approximated within a factor of n in randomized n unless P=NP. polynomial-time. Yet for the most part, not much could be said for a wide variety of problems until very recently. A con- 1 Introduction nection between two seemingly unrelated areas within theoretical computer science, established by Feige et The notion of NP-completeness [Coo71, Kar72, al. [FGLSS91], led to surprisingly strong hardness re- Lev73] has been used since the early seventies to show sults for approximating optimization problems. Feige the hardness of finding optimum solutions for a large et al. exploited a recent characterization of multi- variety of combinatorial optimization problems. The prover interactive proof systems by Babai, Fortnow apparent intractability of these problems motivated and Lund [BFL91] to obtain intractability results for the search for approximate solutions to these hard op- approximating MAX-CLIQUE under the assumption timization problems. For some problems this effort that NP 6⊆ DT IME(nO(log log n)). gave good approximation algorithms, but for others it Recently Arora and Safra [AS92] improved on this seemed hard even to find near optimal solutions. by showing that it is NP-hard to approximate MAX- The task of proving hardness of the approxima- CLIQUE within any constant factor (and even within tion versions of such problems met with limited suc- log n=(log log n)O(1) cess. For the traveling salesman problem without tri- a factor of 2 ). Their solution builds angle inequality Sahni and Gonzalez [SG76] showed on and further develops the techniques of Feige et al. and yields an elegant new characterization of the class ∗ Computer Science Division, U. C. Berkeley, Berkeley, CA NP in terms of probabilistically checkable proofs. 94720. Supported by NSF PYI Grant CCR 8896202. yAT&T Bell Labs, Murray Hill, NJ 07974. Relying on their work we further develop these tech- zDepartment of Computer Science, Stanford University, niques and show that there is no PTAS for a large Stanford, CA 94305. Supported by NSF Grant CCR-9010517, number of combinatorial optimization problems un- and grants from Mitsubishi and OTL. less P=NP. These problems, like traveling salesman xComputer Science Division, U. C. Berkeley, Berkeley, CA 94720. Supported by NSF PYI Grant CCR 8896202. problem with triangle inequality, minimal steiner tree, {AT&T Bell Labs, Murray Hill, NJ 07974. maximum directed cut, shortest superstring, etc. be- long to the class of MAXSNP-hard problems, defined A problem opt is MAXSNP-hard (complete) if ev- by Papadimitriou and Yannakakis [PY91] in terms of ery problem in MAXSNP L-reduces to it (and opt 2 logic and reductions that preserves approximability. MAXSNP ). Papadimitriou and Yannakakis showed Our result also improves the parameters for the that if any of the MAXSNP-hard problems have a MAX-CLIQUE result of Arora and Safra. We show, PTAS then every problem in MAXSNP has a PTAS. that there is an > 0 such that approximating MAX- MAX-3SAT is a typical MAXSNP-complete prob- CLIQUE within a factor of n in NP-hard. lem: Given a conjunction ' of clauses, each a disjunc- In the next section we elaborate on the definition tion of 3 variables or their negation, maximize the the of the class MAXSNP and then move on to give back- number of satisfied clauses over all possible assign- ground on probabilistically checkable proofs and for- ments to the variables of '. mulate our main lemma, which is a characterization of The following problems are also MAXSNP- NP that improves on the one in [AS92]. Then we re- complete or hard: MAX-SAT; MAX-2SAT; INDE- late this characterization to the non-approximability PENDENT SET-B; NODE COVER-B; MAX-CUT; of MAXSNP. The rest of the paper will be devoted to MAX-DIRECTED CUT; METRIC TSP; STEINER the proof of the main lemma. TREE; SHORTEST SUPERSTRING; MULTIWAY CUTS; THREE DIMENSIONAL MATCHING. The 1.1 The class MAXSNP exact definitions can be found in [PY91, PY92, BP89, BJLTY91, DJPSY92, Kan91]. Given a maximization problem, formulated as optF (x) = maxy F (x; y), an approximation algorithm 1.2 Probabilistically checkable proofs A is said to achieve worst-case performance ratio α(n) −1 if for every input x : F (x; A(x)) ≥ α(n) optF (x), The notion of probabilistically checkable proofs where n is the size of x. (PCP) was introduced by Arora and Safra [AS92], In 1988 Papadimitriou and Yannakakis [PY91] us- as a slight variation of the notions of randomized or- ing Fagin's definition of NP [Fag74] observed that acle machines due to Fortnow, Rompel and Sipser there is an approximation algorithm which has con- [FRS88] and transparent proofs due to Babai, Fort- stant performance ratio for any maximization problem now, Levin and Szegedy [BFLS91]. All these mod- that is defined by a quantifier free first order formula els are in turn variations of interactive proof systems ' as [Bab85, GMR89] and multiprover interactive proof systems [BGKW88]. opt'(X) = max : jfzj'(S; X; z)gj ; (1) S Definition 1.3 (Arora-Safra [AS92]) A language where z is a vector of constant number of first order L is in PCP(f(n); g(n)) if there is polynomial-time variables. randomized oracle machine M y(r; x) which works as Definition 1.1 (Papadimitriou-Yannakakis) An follows: optimization problem opt(x) belongs to MAXSNP if there is a polynomial-time algorithm that encodes in- 1. It takes input x and a (random) string r of length O(f(n)), where n = jxj. put x into X in 1 such that opt(x) becomes opt'(X). They further considered the following type of \con- 2. Generates a query set Q(r; x) = fq1; : : : ; qmg of stant gap preserving" reductions: size m = O(g(n)). Definition 1.2 (Papadimitriou-Yannakakis) Let 0 opt and opt be two optimization problems defined by 3. Reads the bits yq1 ; : : : ; yqm . 0 0 functions F and F . We say that opt L-reduces to opt 4. Makes a polynomial-time computation using the if there exist two polynomial time algorithms f and g y data r, x and yq1 ; : : : ; yqm and outputs M (r; x) 2 and constants α; β > 0 such that for each instance x f0; 1g. of opt: Moreover the following acceptance conditions hold for 0 0 1. Algorithm f produces an instance x of opt such some δ > 0 and for all x: that opt0(x0) ≤ αopt(x). 1. If x 2 L then there exists a y such that for every 0 2. For any y algorithm g outputs a y with the prop- r we have M y(r; x) = 1. erty: 2. If x 62 L then for every y we have 0 0 0 0 y jF (x; y) − opt(x)j ≤ βjF (x ; y ) − opt(x )j: P robr(M (r; x)) = 0) ≥ δ. Observe that the definition does not say anything Proof checking itself is a maximization problem about the length of y, but it is easy to see that from the point of view of the prover: the task is we can assume without loss of generality that jyj = to maximize the chance of acceptance of the verifier. 2O(f(n))g(n). This definition is a generalization of NP More formally, let M y(x; r) be a probabilistic oracle since it is clear that NP = PCP(0; nO(1)). machine and let R be the set of all possible values of Theorem 1 (Arora-Safra [AS92]) r for a given parameter n. The prover's intention is NP = PCP(log n; (log log n)O(1)). to solve the following optimization problem: We improve on the second parameter: y opt(x) = maxyjfr 2 Rj M (x; r) = 1gj: (2) Theorem 2 NP = PCP(log n; 1).

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    10 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us