Electronic Colloquium on Computational Complexity, Report No. 154 (2017)
The Relation between Polynomial Calculus, Sherali-Adams, and Sum-of-Squares Proofs
Christoph Berkholz Humboldt-Universit¨atzu Berlin [email protected]
October 12, 2017
Abstract We relate different approaches for proving the unsatisfiability of a system of real polynomial equations over Boolean variables. On the one hand, there are the static proof systems Sherali- Adams and sum-of-squares (a.k.a. Lasserre), which are based on linear and semi-definite programming relaxations. On the other hand, we consider polynomial calculus, which is a dynamic algebraic proof system that models Gr¨obner basis computations. Our first result is that sum-of-squares simulates polynomial calculus: any polynomial calculus refutation of degree d can be transformed into a sum-of-squares refutation of degree 2d and only polynomial increase in size. In contrast, our second result shows that this is not the case for Sherali-Adams: there are systems of polynomial equations that have polynomial calculus refutations√ of degree 3 and polynomial size, but require Sherali-Adams refutations of degree Ω( n/ log n) and exponential size.
1 Introduction
The area of proof complexity was founded in [6] and studies the complexity of proofs for co-NP complete problems. Traditionally, one considers proof systems for proving the unsatisfiability of (or refuting) a propositional formula in conjunctive normal form. If one faces a proof system, there are two important questions to ask:
1. Does the system always produce proofs of polynomial size?
2. How strong is the system compared to other proof systems?
If the answer to the first question is yes, in which case the system is called p-bounded, then NP = co-NP. Therefore, it is conjectured that no proof system is p-bounded and this has been proven for a number of weak proof systems. For the second question, one considers the notion of polynomial simulation: A proof system P polynomial simulates a proof system Q if for every Q-proof of size S there is a P-proof of size poly(S). Nowadays, a large part of proof complexity focuses on weak proof systems, for which the first question has already been answered negatively. One reason for this is that they often model algorithms for solving hard problems and understanding the complexity of proofs might shed light on the complexity of algorithmic approaches that implicitly or explicitly search for proofs in the underlying proof system. The (semi-)algebraic proof systems we consider in this paper also fall into this category and are used to prove the unsatisfiability of a system F of 1 real polynomial equations fi = 0 over n Boolean variables xj ∈ {0, 1}. On the one hand, we consider polynomial calculus, which is a dynamic algebraic proof system that allows to derive new
1Note that this subsumes the problem of refuting 3-CNF formulas, because a clause x ∨ y ∨ z can be encoded as polynomial equation (1 − x)y(1 − z) = 0.
1 ISSN 1433-8092 polynomial equations that follow from F line-by-line. This proof system was introduced in [5] to model Gr¨obnerbasis computations and proofs of degree d (where the degree of all polynomials in the derivation is bounded by d) can be found in time nO(d) by a bounded-degree variant of the Gr¨obnerbasis algorithm. On the other hand, we consider the semi-algebraic proof system Sherali-Adams and the stronger sum-of-squares proof system. They are based on the linear and semi-definite pro- gramming hierarchies of Sherali-Adams [19] and Lasserre [13] and can be used to prove the unsatisfiability of a system of polynomial equations and inequalities. Proofs of degree d can be found algorithmically by solving a linear program (for Sherali-Adams) or a semi-definite program (for sum-of-squares) of size nO(d). Contrary to polynomial calculus, both systems are static in the sense that they provide the whole proof at once. In order to compare these semi-algebraic proof systems with polynomial calculus, we first remark that it is known that both systems cannot be simulated by polynomial calculus. A simple Pn example is the linear equation i=1 xi = n + 1, which has a refutation of linear size and degree 2 in Sherali-Adams and sum-of-squares, but requires polynomial calculus refutations of degree Ω(n) and size 2Ω(n) [11]. Our first theorem states that sum-of-squares is strictly stronger than polynomial calculus. Theorem 1.1. Let F be a system of polynomial equations over the reals. If F has a polynomial calculus refutation of degree d and size S, then it has a sum-of-squares refutation of degree 2d and size poly(S). For the author of this paper, this theorem was highly unexpected. In fact, there has been some evidence that the converse might be true. First, in the non-Boolean setting there are systems of equations that are easier to refute for polynomial calculus than for sum-of-squares [10] (see Section 2.4 for a discussion). Second, even for systems of polynomial equations over Boolean variables, separations of polynomial calculus from its static version Nullstellensatz were known [4]. Since sum-of-squares extends Nullstellensatz, it follows that the semi-definite lifts in the sum- of-squares/Lasserre hierarchy are necessary for “flattening” a dynamic polynomial calculus proof into a static one, although polynomial calculus is a purely algebraic system without semi-definite components. Our second theorem concerns the question whether the weaker Sherali-Adams linear programming hierarchy is already able to simulate polynomial calculus. Here we have a negative answer (that we would have expected for sum-of-squares as well).
Theorem 1.2. There is a system F of polynomial equations over R[x1, . . . , xn] such that: 1. F has a polynomial calculus refutation of degree 3 and size O(n2). √ √ 2. Every Sherali-Adams refutation of F has degree Ω( n/ log n) and size 2Ω( n/ log n). The lower bound is based on a modified version of the pebbling contradictions. The original pebbling contradictions have already been used to separate Nullstellensatz degree from polynomial calculus degree [4], but it turns out that they are easy to refute in Sherali-Adams. To obtain contradictions that are hard for Sherali-Adams (and still easy for polynomial calculus), we apply a substitution trick twice: first to show that the resulting contradiction requires high degree in Sherali-Adams and second to obtain a size lower bound from a degree lower bound. We believe that both techniques are also helpful for future lower bound arguments for static proof systems.
Acknowledgements Part of this work was done during the Oberwolfach workshop 1733 Proof and complexity and Beyond and the author acknowledges helpful discussions with Albert Atserias, Edward Hirsch, Massimo Lauria, Joanna Ochremiak, and Iddo Tzameret on Theorem 1.1. The author thanks Edward Hirsch for providing helpful comments on an earlier version the paper. We acknowledge the financial support by the German Research Foundation DFG under grant SCHW 837/5-1.
2 2 Proof Systems
For this section we fix a system of real polynomial equations F = {f1 = 0, . . . , fm = 0} and a system of polynomial inequalities H = {h1 > 0, . . . , hs > 0} over variables x1, . . . , xn. As it is common in propositional proof complexity, we focus on the special case of polynomial equations (and inequalities) over Boolean variables and consider the task of proving that a system of polynomial equations (and/or inequalities) has no 0/1-solution. To enforce Boolean variables, 2 the axioms xj = xj are always included in the proof systems. In Section 2.4 we briefly discuss non-Boolean variants. Algebraic proof systems are used for proving the unsatisfiability of a system of multivariate polynomial equations over some field F. As we focus on real polynomials we set F = R, unless mentioned otherwise. Semi-algebraic proof systems are used to prove the unsatisfiability of a system of polynomial equations and/or polynomial inequalities (in this setting the polynomials are always real).
2.1 Algebraic Proof Systems: Nullstellensatz and Polynomial Calculus
Nullstellensatz [2] is a static algebraic proof system that is based on Hilbert’s Nullstellensatz. A Nullstellensatz proof of f = 0 from F is a sequence of polynomials (g1, . . . , gm; q1, . . . , qn) such that m n X X 2 gifi + qj(xj − xj) = f. (1) i=1 j=1
Note that the proof is sound in the sense every 0/1-assignment that satisfies F also satisfies f = 0. The degree of the Nullstellensatz proof is max {deg(gifi): i ∈ [m]} ∪ {deg(hj) + 2 : j ∈ [n]} . The size of the derivation is the sum of the sizes of the binary encoding of the polynomials f, gifi, 2 qj(xj − xj), each represented as a sum of monomials. A Nullstellensatz refutation of F is a proof of −1 = 0 from F, in which case F is unsatisfiable (i. e., has no 0/1-solution). The Nullstellensatz system is also complete: If F is an unsatisfiable system of multi-linear polynomials, then it has a refutation of degree at most n. Nullstellensatz is a static (or one-shot) proof system, as it provides the whole proof at once. The dynamic version of Nullstellensatz is polynomial calculus (PC) [5]. It consists of the following derivation rules for polynomial equations (fi = 0) ∈ F, polynomials f, g, variables xj, and numbers a, b ∈ R:
f = 0 g = 0 f = 0 , 2 , , . (2) fi = 0 xj − xj = 0 xjf = 0 ag + bf = 0
A polynomial calculus derivation of f = 0 from F is a sequence (r1 = 0, . . . , rL = 0) of polynomial equations that are iteratively derived using the rules (2) and lead to f = rL = 0. The degree of a derivation is the maximum degree of the polynomials in the derivation and the size is the sum of the sizes of the binary encoding of the polynomials in the derivation. A polynomial calculus refutation is a derivation of −1 = 0. It is straightforward to check that polynomial calculus simulates Nullstellensatz: If F has a Nullstellensatz refutation of degree d and size N, then it has a polynomial calculus refutation of degree d and size polynomial in N. In both systems proofs of bounded degree d can be found in time nO(d): for Nullstellensatz the coefficients of the polynomials can be computed by solving a system of linear equations of size nO(d), and for polynomial calculus this can be done by using a bounded degree variant of the Gr¨obnerbasis algorithm [5].
3 2.2 Semi-algebraic proof systems: Sherali-Adams, Sum-of-Squares, Positivstel- lensatz Sherali-Adams is a static proof system that models the Sherali-Adams lift-and-project hierarchy of linear programming relaxations [19]. It can also be viewed as an extension of the Nullstel- lensatz system. A Sherali-Adams proof of f > 0 from (F, H) is a sequence of polynomials (g1, . . . , gm; q1, . . . , qn; p0, . . . , ps) such that
m n s X X 2 X gifi + qj(xj − xj) + p0 + p`h` = f, (3) i=1 j=1 `=1
P ` Q Q and where every p` has the form p` = A,B aA,B j∈A xj j∈B(1 − xj) with non-negative ` 2 n n coefficients aA,B. Note that the polynomials p` : R → R are positive in [0, 1] and hence the proof is sound in the sense every 0/1-assignment that satisfies F and H also satisfies f > 0. The degree (sometime called rank) of a Sherali-Adams proof is the maximum degree of the polynomials 2 gifi, qj(xj − xj), p0, p`h` and the size is the sum of the sizes of their encoding. A Sherali-Adams refutation of (F, H) is a proof of −1 > 0 from (F, H). Note that every Nullstellensatz refutation of F is a Sherali-Adams refutation of (F, ∅) by choosing p0 = 0. Sum-of-squares (SOS) is a semi-algebraic proof system that extends Nullstellensatz and Sherali-Adams. It models the Lasserre hierarchy of semi-definite programming relaxations [13], for which reason it is sometimes called Lasserre, and also builds on Putinar’s Positivstellensatz [18]. The difference to Sherali-Adams is that the positive polynomials p` are now sums of squares. Formally, a sum-of-squares proof of f > 0 from (F, H) is a sequence of polynomials (g1, . . . , gm; q1, . . . , qn; p0, . . . , ps) such that
m n s X X 2 X gifi + qj(xj − xj) + p0 + p`h` = f, (4) i=1 j=1 `=1
Pt` 2 and where every p` has the form p` = c=1(p`,c) (and is encoded as such) for arbitrary polynomials p`,c (in standard monomial form). Again, the degree of a proof is the maximum 2 degree of the polynomials gifi, qj(xj − xj), p0, p`h`, the size is the sum of the sizes of their encoding. A sum-of-squares refutation is a proof of −1 > 0. It is not hard to see that the positive P Q Q polynomials p = A,B aA,B j∈A xj j∈B(1 − xj) in the Sherali-Adams proof system have a sum-of-squares proof (from F = H = ∅) of degree |A| + |B| + 1 and size poly(p). It immediately follows that sum-of-squares simulates Sherali-Adams.
Lemma 2.1. If (F, H) has a Sherali-Adams refutation of degree d and size N, then it has a sum-of-squares refutation of degree d + 1 and size poly(N).
Another semi-algebraic system that is related to sum-of-squares is Positivstellensatz. It builds on Stengle’s Positivstellensatz (independently proven by Krivine [12] and Stengle [20]), which has also been used to define a hierarchy of relaxations, see [17]. Our definition of the Positivstellensatz proof system follows the one introduced in [10], a different way of formalising Stengle’s Positivstellensatz as a proof system (without focusing on complexity) was presented in [14]. We remark that Stengle’s Positivstellensatz and the Positivstellensatz proof system as defined 2 in [10] do not necessarily include the Boolean axioms xj − xj and also work for polynomials over non-Boolean variables. To be precise, we will call the system that is named “Positivstellensatz” in [10] “non-Boolean Positivstellensatz” in this paper (see Section 2.4). To define the proof system, we consider for the system of polynomial inequalities H = {h1 > 0, . . . , hs > 0} the system Q Hb = { `∈I h` > 0 : I ⊆ [s]}, which extends H by taking products of polynomial inequalities.
2 We assume that the p` are explicitely provided in this form, whereas gi and qj are arbitrary polynomials encoded in the standard way as a sum of monomials.
4 Clearly, (F, H) is satisfiable if and only if (F, Hb) is satisfiable. A Positivstellensatz proof of f > 0 from (F, H) is a sum-of-squares proof of f > 0 from (F, Hb). Note that on systems of polynomial equations (where H = ∅) sum-of-squares and Positivstellensatz are the same. One way of combining polynomial calculus with semi-algebraic proof systems is as follows. Note that a Sherali-Adams, sum-of-squares, or Positivstellensatz proof of f > 0 can be decomposed to X g + p0 + p`h` = f, (5) ` m n X X 2 gifi + qj(xj − xj) = g, (6) i=1 j=1 where (6) is a Nullstellensatz proof of g = 0. By replacing this Nullstellensatz proof of g = 0 with a polynomial calculus proof of g = 0, we obtain dynamic versions of the static semi-algebraic proof systems. The dynamic version of Positivstellensatz is called Positivstellensatz calculus and was also introduced in [10]. However, the proof of Theorem 1.1 (in particular Lemma 3.1) implies that Positivstellensatz and Positivstellensatz calculus can simulate each other. Corollary 2.2. If (F, H) has a Positivstellensatz calculus refutation of degree d and size S, then it has a Positivstellensatz refutation of degree 2d and size poly(S). Proof. By definition, a Positivstellensatz calculus refutation of (F, H) is a polynomial calculus P derivation of −1 − p0 − ` p`h` from F, where h` ∈ Hb. By Lemma 3.1, there is a degree-2d, size poly(S) sum-of-squares proof of non-negativity of