Lower bounds for polynomial kernelization

Michal Pilipczuk

Institute of Informatics, University of Warsaw, Poland

Parameterized Complexity Summer School Vienna, September 2nd, 2017

Michal Pilipczuk Kernelization lower bounds 1/33 k P-time

instance of L instance of L

size 6 f (k)

Kernelization — recap

Michal Pilipczuk Kernelization lower bounds 2/33 k P-time

instance of L instance of L

size 6 f (k)

Kernelization — recap

instance of L

Michal Pilipczuk Kernelization lower bounds 2/33 k P-time

instance of L instance of L

size 6 f (k)

Kernelization — recap

k

instance of L

Michal Pilipczuk Kernelization lower bounds 2/33 k

instance of L instance of L

size 6 f (k)

Kernelization — recap

k P-time

instance of L

Michal Pilipczuk Kernelization lower bounds 2/33 k

instance of L

Kernelization — recap

k P-time

instance of L instance of L

size 6 f (k)

Michal Pilipczuk Kernelization lower bounds 2/33 Parameterized problems ⇔ Sets of pairs (x, k), where x ∈ Σ? and k is a nonnegative integer Unparameterized variant: k is appended to x in unary. Kernelization takes on input an instance (x, k), and outputs an instance (x 0, k0) such that 0 0 0 0 (x, k) ∈ L ⇔ (x , k ) ∈ L and |x | + k 6 f (k) for some computable function f .

Background in complexity theory

Unparameterized problems ⇔ Languages over Σ, for a finite alphabet Σ ⇔ Subsets of Σ?

Michal Pilipczuk Kernelization lower bounds 3/33 Unparameterized variant: k is appended to x in unary. Kernelization algorithm takes on input an instance (x, k), and outputs an instance (x 0, k0) such that 0 0 0 0 (x, k) ∈ L ⇔ (x , k ) ∈ L and |x | + k 6 f (k) for some computable function f .

Background in complexity theory

Unparameterized problems ⇔ Languages over Σ, for a finite alphabet Σ ⇔ Subsets of Σ?

Parameterized problems ⇔ Sets of pairs (x, k), where x ∈ Σ? and k is a nonnegative integer

Michal Pilipczuk Kernelization lower bounds 3/33 Kernelization algorithm takes on input an instance (x, k), and outputs an instance (x 0, k0) such that 0 0 0 0 (x, k) ∈ L ⇔ (x , k ) ∈ L and |x | + k 6 f (k) for some computable function f .

Background in complexity theory

Unparameterized problems ⇔ Languages over Σ, for a finite alphabet Σ ⇔ Subsets of Σ?

Parameterized problems ⇔ Sets of pairs (x, k), where x ∈ Σ? and k is a nonnegative integer Unparameterized variant: k is appended to x in unary.

Michal Pilipczuk Kernelization lower bounds 3/33 Background in complexity theory

Unparameterized problems ⇔ Languages over Σ, for a finite alphabet Σ ⇔ Subsets of Σ?

Parameterized problems ⇔ Sets of pairs (x, k), where x ∈ Σ? and k is a nonnegative integer Unparameterized variant: k is appended to x in unary. Kernelization algorithm takes on input an instance (x, k), and outputs an instance (x 0, k0) such that 0 0 0 0 (x, k) ∈ L ⇔ (x , k ) ∈ L and |x | + k 6 f (k) for some computable function f .

Michal Pilipczuk Kernelization lower bounds 3/33 Let (x, k) be the input instance. If |x| 6 f (k), then we already have a kernel. Otherwise f (k) · |x|c = O(|x|c+1).

Any FPT problem admits a kernelization algorithm:

Question of existence of any kernel is equivalent to being FPT. We are interested in polynomial kernels, where f is a polynomial. Before 2008, no tool to classify FPT problems wrt. whether they have polykernels or not.

Kernelization and FPT

If a decidable problem has a kernelization algorithm, then it is FPT.

Michal Pilipczuk Kernelization lower bounds 4/33 Let (x, k) be the input instance. If |x| 6 f (k), then we already have a kernel. Otherwise f (k) · |x|c = O(|x|c+1). Question of existence of any kernel is equivalent to being FPT. We are interested in polynomial kernels, where f is a polynomial. Before 2008, no tool to classify FPT problems wrt. whether they have polykernels or not.

Kernelization and FPT

If a decidable problem has a kernelization algorithm, then it is FPT. Any FPT problem admits a kernelization algorithm:

Michal Pilipczuk Kernelization lower bounds 4/33 If |x| 6 f (k), then we already have a kernel. Otherwise f (k) · |x|c = O(|x|c+1). Question of existence of any kernel is equivalent to being FPT. We are interested in polynomial kernels, where f is a polynomial. Before 2008, no tool to classify FPT problems wrt. whether they have polykernels or not.

Kernelization and FPT

If a decidable problem has a kernelization algorithm, then it is FPT. Any FPT problem admits a kernelization algorithm: Let (x, k) be the input instance.

Michal Pilipczuk Kernelization lower bounds 4/33 Otherwise f (k) · |x|c = O(|x|c+1). Question of existence of any kernel is equivalent to being FPT. We are interested in polynomial kernels, where f is a polynomial. Before 2008, no tool to classify FPT problems wrt. whether they have polykernels or not.

Kernelization and FPT

If a decidable problem has a kernelization algorithm, then it is FPT. Any FPT problem admits a kernelization algorithm: Let (x, k) be the input instance. If |x| 6 f (k), then we already have a kernel.

Michal Pilipczuk Kernelization lower bounds 4/33 Question of existence of any kernel is equivalent to being FPT. We are interested in polynomial kernels, where f is a polynomial. Before 2008, no tool to classify FPT problems wrt. whether they have polykernels or not.

Kernelization and FPT

If a decidable problem has a kernelization algorithm, then it is FPT. Any FPT problem admits a kernelization algorithm: Let (x, k) be the input instance. If |x| 6 f (k), then we already have a kernel. Otherwise f (k) · |x|c = O(|x|c+1).

Michal Pilipczuk Kernelization lower bounds 4/33 We are interested in polynomial kernels, where f is a polynomial. Before 2008, no tool to classify FPT problems wrt. whether they have polykernels or not.

Kernelization and FPT

If a decidable problem has a kernelization algorithm, then it is FPT. Any FPT problem admits a kernelization algorithm: Let (x, k) be the input instance. If |x| 6 f (k), then we already have a kernel. Otherwise f (k) · |x|c = O(|x|c+1). Question of existence of any kernel is equivalent to being FPT.

Michal Pilipczuk Kernelization lower bounds 4/33 Before 2008, no tool to classify FPT problems wrt. whether they have polykernels or not.

Kernelization and FPT

If a decidable problem has a kernelization algorithm, then it is FPT. Any FPT problem admits a kernelization algorithm: Let (x, k) be the input instance. If |x| 6 f (k), then we already have a kernel. Otherwise f (k) · |x|c = O(|x|c+1). Question of existence of any kernel is equivalent to being FPT. We are interested in polynomial kernels, where f is a polynomial.

Michal Pilipczuk Kernelization lower bounds 4/33 Kernelization and FPT

If a decidable problem has a kernelization algorithm, then it is FPT. Any FPT problem admits a kernelization algorithm: Let (x, k) be the input instance. If |x| 6 f (k), then we already have a kernel. Otherwise f (k) · |x|c = O(|x|c+1). Question of existence of any kernel is equivalent to being FPT. We are interested in polynomial kernels, where f is a polynomial. Before 2008, no tool to classify FPT problems wrt. whether they have polykernels or not.

Michal Pilipczuk Kernelization lower bounds 4/33 Suppose for a moment that k-Path admits a kernelization algorithm that, say, produces kernels with at most k3 vertices. 7 Take t = k instances (G1, k), (G2, k),..., (Gt , k).

Let H be a disjoint union of G1, G2,..., Gt . Then the answer to (H, k) is YES if and only if the answer to any (Gi , k) is YES. Apply kernelization to (H, k) obtaining an instance with k3 vertices, encodable in k6 bits.

Intuition The final number of bits is much less than the number input instances. Most of the instances have to be discarded completely.

Motivating intuition

Consider the k-Path problem: verify whether the input graph contains a simple path on k vertices.

Michal Pilipczuk Kernelization lower bounds 5/33 7 Take t = k instances (G1, k), (G2, k),..., (Gt , k).

Let H be a disjoint union of G1, G2,..., Gt . Then the answer to (H, k) is YES if and only if the answer to any (Gi , k) is YES. Apply kernelization to (H, k) obtaining an instance with k3 vertices, encodable in k6 bits.

Intuition The final number of bits is much less than the number input instances. Most of the instances have to be discarded completely.

Motivating intuition

Consider the k-Path problem: verify whether the input graph contains a simple path on k vertices. Suppose for a moment that k-Path admits a kernelization algorithm that, say, produces kernels with at most k3 vertices.

Michal Pilipczuk Kernelization lower bounds 5/33 Let H be a disjoint union of G1, G2,..., Gt . Then the answer to (H, k) is YES if and only if the answer to any (Gi , k) is YES. Apply kernelization to (H, k) obtaining an instance with k3 vertices, encodable in k6 bits.

Intuition The final number of bits is much less than the number input instances. Most of the instances have to be discarded completely.

Motivating intuition

Consider the k-Path problem: verify whether the input graph contains a simple path on k vertices. Suppose for a moment that k-Path admits a kernelization algorithm that, say, produces kernels with at most k3 vertices. 7 Take t = k instances (G1, k), (G2, k),..., (Gt , k).

Michal Pilipczuk Kernelization lower bounds 5/33 Apply kernelization to (H, k) obtaining an instance with k3 vertices, encodable in k6 bits.

Intuition The final number of bits is much less than the number input instances. Most of the instances have to be discarded completely.

Motivating intuition

Consider the k-Path problem: verify whether the input graph contains a simple path on k vertices. Suppose for a moment that k-Path admits a kernelization algorithm that, say, produces kernels with at most k3 vertices. 7 Take t = k instances (G1, k), (G2, k),..., (Gt , k).

Let H be a disjoint union of G1, G2,..., Gt . Then the answer to (H, k) is YES if and only if the answer to any (Gi , k) is YES.

Michal Pilipczuk Kernelization lower bounds 5/33 Intuition The final number of bits is much less than the number input instances. Most of the instances have to be discarded completely.

Motivating intuition

Consider the k-Path problem: verify whether the input graph contains a simple path on k vertices. Suppose for a moment that k-Path admits a kernelization algorithm that, say, produces kernels with at most k3 vertices. 7 Take t = k instances (G1, k), (G2, k),..., (Gt , k).

Let H be a disjoint union of G1, G2,..., Gt . Then the answer to (H, k) is YES if and only if the answer to any (Gi , k) is YES. Apply kernelization to (H, k) obtaining an instance with k3 vertices, encodable in k6 bits.

Michal Pilipczuk Kernelization lower bounds 5/33 Motivating intuition

Consider the k-Path problem: verify whether the input graph contains a simple path on k vertices. Suppose for a moment that k-Path admits a kernelization algorithm that, say, produces kernels with at most k3 vertices. 7 Take t = k instances (G1, k), (G2, k),..., (Gt , k).

Let H be a disjoint union of G1, G2,..., Gt . Then the answer to (H, k) is YES if and only if the answer to any (Gi , k) is YES. Apply kernelization to (H, k) obtaining an instance with k3 vertices, encodable in k6 bits.

Intuition The final number of bits is much less than the number input instances. Most of the instances have to be discarded completely.

Michal Pilipczuk Kernelization lower bounds 5/33 KERNELIZATION

k P-time

instance of L instance of L size 6 p(k)

COMPRESSION

k 01000100 P-time 01010101 01010000? 01000001

instance of L instance of R (any) bitsizesize 66p(pk()k)

Kernelization and Compression

Michal Pilipczuk Kernelization lower bounds 6/33 COMPRESSION

k 01000100 P-time 01010101 01010000? 01000001

instance of L instance of R (any) bitsizesize 66p(pk()k)

Kernelization and Compression

KERNELIZATION

k P-time

instance of L instance of L size 6 p(k)

Michal Pilipczuk Kernelization lower bounds 6/33 01000100 01010101 01010000 01000001

instance of R (any) bitsize 6 p(k)

Kernelization and Compression

KERNELIZATION

k P-time

instance of L instance of L size 6 p(k)

COMPRESSION

k P-time ?

instance of L instance of R (any) size 6 p(k)

Michal Pilipczuk Kernelization lower bounds 6/33 ?

instance of R (any) size 6 p(k)

Kernelization and Compression

KERNELIZATION

k P-time

instance of L instance of L size 6 p(k)

COMPRESSION

k 01000100 P-time 01010101 01010000 01000001

instance of L instance of R (any) bitsize 6 p(k)

Michal Pilipczuk Kernelization lower bounds 6/33 For instance, when R ∈ NP and L is NP-hard.

A polynomial kernelization is always a polynomial compression. A polynomial compression can be turned into a polynomial kernelization provided that there is a P-reduction from R to L.

Note: There are examples when a poly-compression is known but a poly-kernel is not known, because it is unclear whether R is in NP.

Kernelization and Compression

Intuition: In compression we only care about shrinking the size of the instance to a small size without mixing YES- and NO-instances.

Michal Pilipczuk Kernelization lower bounds 7/33 For instance, when R ∈ NP and L is NP-hard.

A polynomial compression can be turned into a polynomial kernelization provided that there is a P-reduction from R to L.

Note: There are examples when a poly-compression is known but a poly-kernel is not known, because it is unclear whether R is in NP.

Kernelization and Compression

Intuition: In compression we only care about shrinking the size of the instance to a small size without mixing YES- and NO-instances. A polynomial kernelization is always a polynomial compression.

Michal Pilipczuk Kernelization lower bounds 7/33 For instance, when R ∈ NP and L is NP-hard. Note: There are examples when a poly-compression is known but a poly-kernel is not known, because it is unclear whether R is in NP.

Kernelization and Compression

Intuition: In compression we only care about shrinking the size of the instance to a small size without mixing YES- and NO-instances. A polynomial kernelization is always a polynomial compression. A polynomial compression can be turned into a polynomial kernelization provided that there is a P-reduction from R to L.

Michal Pilipczuk Kernelization lower bounds 7/33 Note: There are examples when a poly-compression is known but a poly-kernel is not known, because it is unclear whether R is in NP.

Kernelization and Compression

Intuition: In compression we only care about shrinking the size of the instance to a small size without mixing YES- and NO-instances. A polynomial kernelization is always a polynomial compression. A polynomial compression can be turned into a polynomial kernelization provided that there is a P-reduction from R to L. For instance, when R ∈ NP and L is NP-hard.

Michal Pilipczuk Kernelization lower bounds 7/33 Kernelization and Compression

Intuition: In compression we only care about shrinking the size of the instance to a small size without mixing YES- and NO-instances. A polynomial kernelization is always a polynomial compression. A polynomial compression can be turned into a polynomial kernelization provided that there is a P-reduction from R to L. For instance, when R ∈ NP and L is NP-hard. Note: There are examples when a poly-compression is known but a poly-kernel is not known, because it is unclear whether R is in NP.

Michal Pilipczuk Kernelization lower bounds 7/33 OR-distillation of L into R

Input: Words x1, x2,..., xt , each of length at most k. Pt Time: poly(t + i=1 |xi |). Output: One word y such that (a) |y| = poly(k), and (b) y ∈ R if and only if xi ∈ L for at least one i.

OR-distillation

Let L, R be unparameterized languages.

Michal Pilipczuk Kernelization lower bounds 8/33 OR-distillation

Let L, R be unparameterized languages.

OR-distillation of L into R

Input: Words x1, x2,..., xt , each of length at most k. Pt Time: poly(t + i=1 |xi |). Output: One word y such that (a) |y| = poly(k), and (b) y ∈ R if and only if xi ∈ L for at least one i.

Michal Pilipczuk Kernelization lower bounds 8/33 t instances z }| { 6 k 6 k 6 k 6 k 6 k 6 k 6 k 6 k 6 k

P-time

6 poly(k)

Intuition: Necessary loss of information Contradiction for an NP-hard L

Define OR-L = {x1#x2# ... #xt : xi ∈ L for at least one i}.

OR-distillation L → R is a polynomial compression OR-L/max |xi | → R

OR-distillation on picture

Michal Pilipczuk Kernelization lower bounds 9/33 P-time

6 poly(k)

Intuition: Necessary loss of information Contradiction for an NP-hard L

Define OR-L = {x1#x2# ... #xt : xi ∈ L for at least one i}.

OR-distillation L → R is a polynomial compression OR-L/max |xi | → R

OR-distillation on picture

t instances z }| { 6 k 6 k 6 k 6 k 6 k 6 k 6 k 6 k 6 k

Michal Pilipczuk Kernelization lower bounds 9/33 6 poly(k)

Intuition: Necessary loss of information Contradiction for an NP-hard L

Define OR-L = {x1#x2# ... #xt : xi ∈ L for at least one i}.

OR-distillation L → R is a polynomial compression OR-L/max |xi | → R

OR-distillation on picture

t instances z }| { 6 k 6 k 6 k 6 k 6 k 6 k 6 k 6 k 6 k

P-time

Michal Pilipczuk Kernelization lower bounds 9/33 Intuition: Necessary loss of information Contradiction for an NP-hard L

Define OR-L = {x1#x2# ... #xt : xi ∈ L for at least one i}.

OR-distillation L → R is a polynomial compression OR-L/max |xi | → R

OR-distillation on picture

t instances z }| { 6 k 6 k 6 k 6 k 6 k 6 k 6 k 6 k 6 k

P-time

6 poly(k)

Michal Pilipczuk Kernelization lower bounds 9/33 Define OR-L = {x1#x2# ... #xt : xi ∈ L for at least one i}.

OR-distillation L → R is a polynomial compression OR-L/max |xi | → R

OR-distillation on picture

t instances z }| { 6 k 6 k 6 k 6 k 6 k 6 k 6 k 6 k 6 k

P-time

6 poly(k)

Intuition: Necessary loss of information Contradiction for an NP-hard L

Michal Pilipczuk Kernelization lower bounds 9/33 OR-distillation on picture

t instances z }| { 6 k 6 k 6 k 6 k 6 k 6 k 6 k 6 k 6 k

P-time

6 poly(k)

Intuition: Necessary loss of information Contradiction for an NP-hard L

Define OR-L = {x1#x2# ... #xt : xi ∈ L for at least one i}.

OR-distillation L → R is a polynomial compression OR-L/max |xi | → R

Michal Pilipczuk Kernelization lower bounds 9/33 Intuition: Verifying proofs in P-time cannot be turned into verifying counterexamples in P-time, even if we allow polynomial advice. P NP ⊆ coNP/poly implies PH = Σ3 . Not as bad as P = NP, but still considered very unlikely.

Corollary No NP-hard problem admits an OR-distillation algorithm into any language R, unless NP ⊆ coNP/poly.

Assumption NP ⊆ coNP/poly may seem mysterious.

The proof is very short, but very tricky.

Backbone theorem

OR-distillation theorem [Fortnow, Santhanam; 2008] SAT does not admit an OR-distillation algorithm into any language R, unless NP ⊆ coNP/poly.

Michal Pilipczuk Kernelization lower bounds 10/33 Intuition: Verifying proofs in P-time cannot be turned into verifying counterexamples in P-time, even if we allow polynomial advice. P NP ⊆ coNP/poly implies PH = Σ3 . Not as bad as P = NP, but still considered very unlikely.

Assumption NP ⊆ coNP/poly may seem mysterious.

The proof is very short, but very tricky.

Backbone theorem

OR-distillation theorem [Fortnow, Santhanam; 2008] SAT does not admit an OR-distillation algorithm into any language R, unless NP ⊆ coNP/poly.

Corollary No NP-hard problem admits an OR-distillation algorithm into any language R, unless NP ⊆ coNP/poly.

Michal Pilipczuk Kernelization lower bounds 10/33 Intuition: Verifying proofs in P-time cannot be turned into verifying counterexamples in P-time, even if we allow polynomial advice. P NP ⊆ coNP/poly implies PH = Σ3 . Not as bad as P = NP, but still considered very unlikely. The proof is very short, but very tricky.

Backbone theorem

OR-distillation theorem [Fortnow, Santhanam; 2008] SAT does not admit an OR-distillation algorithm into any language R, unless NP ⊆ coNP/poly.

Corollary No NP-hard problem admits an OR-distillation algorithm into any language R, unless NP ⊆ coNP/poly.

Assumption NP ⊆ coNP/poly may seem mysterious.

Michal Pilipczuk Kernelization lower bounds 10/33 P NP ⊆ coNP/poly implies PH = Σ3 . Not as bad as P = NP, but still considered very unlikely. The proof is very short, but very tricky.

Backbone theorem

OR-distillation theorem [Fortnow, Santhanam; 2008] SAT does not admit an OR-distillation algorithm into any language R, unless NP ⊆ coNP/poly.

Corollary No NP-hard problem admits an OR-distillation algorithm into any language R, unless NP ⊆ coNP/poly.

Assumption NP ⊆ coNP/poly may seem mysterious. Intuition: Verifying proofs in P-time cannot be turned into verifying counterexamples in P-time, even if we allow polynomial advice.

Michal Pilipczuk Kernelization lower bounds 10/33 Not as bad as P = NP, but still considered very unlikely. The proof is very short, but very tricky.

Backbone theorem

OR-distillation theorem [Fortnow, Santhanam; 2008] SAT does not admit an OR-distillation algorithm into any language R, unless NP ⊆ coNP/poly.

Corollary No NP-hard problem admits an OR-distillation algorithm into any language R, unless NP ⊆ coNP/poly.

Assumption NP ⊆ coNP/poly may seem mysterious. Intuition: Verifying proofs in P-time cannot be turned into verifying counterexamples in P-time, even if we allow polynomial advice. P NP ⊆ coNP/poly implies PH = Σ3 .

Michal Pilipczuk Kernelization lower bounds 10/33 The proof is very short, but very tricky.

Backbone theorem

OR-distillation theorem [Fortnow, Santhanam; 2008] SAT does not admit an OR-distillation algorithm into any language R, unless NP ⊆ coNP/poly.

Corollary No NP-hard problem admits an OR-distillation algorithm into any language R, unless NP ⊆ coNP/poly.

Assumption NP ⊆ coNP/poly may seem mysterious. Intuition: Verifying proofs in P-time cannot be turned into verifying counterexamples in P-time, even if we allow polynomial advice. P NP ⊆ coNP/poly implies PH = Σ3 . Not as bad as P = NP, but still considered very unlikely.

Michal Pilipczuk Kernelization lower bounds 10/33 Backbone theorem

OR-distillation theorem [Fortnow, Santhanam; 2008] SAT does not admit an OR-distillation algorithm into any language R, unless NP ⊆ coNP/poly.

Corollary No NP-hard problem admits an OR-distillation algorithm into any language R, unless NP ⊆ coNP/poly.

Assumption NP ⊆ coNP/poly may seem mysterious. Intuition: Verifying proofs in P-time cannot be turned into verifying counterexamples in P-time, even if we allow polynomial advice. P NP ⊆ coNP/poly implies PH = Σ3 . Not as bad as P = NP, but still considered very unlikely. The proof is very short, but very tricky.

Michal Pilipczuk Kernelization lower bounds 10/33 OR-composition algorithm for L

Input: Instances (x1, k), (x2, k),..., (xt , k). Pt Time: poly(t + i=1 |xi | + k). Output: One instance (y, k?) such that (a) k? = poly(k), and ? (b) (y, k ) ∈ L iff (xi , k) ∈ L for at least one i.

OR-composition

Let L be a parameterized language.

Michal Pilipczuk Kernelization lower bounds 11/33 OR-composition

Let L be a parameterized language.

OR-composition algorithm for L

Input: Instances (x1, k), (x2, k),..., (xt , k). Pt Time: poly(t + i=1 |xi | + k). Output: One instance (y, k?) such that (a) k? = poly(k), and ? (b) (y, k ) ∈ L iff (xi , k) ∈ L for at least one i.

Michal Pilipczuk Kernelization lower bounds 11/33 t instances z }| { k k k k k k k k k

P-time

poly(k)

OR-composition theorem [BDFH; 2008] Suppose a parameterized problem L admits an OR-composition algorithm, and the unparameterized version of L is NP-hard. Then L does not admit a polynomial kernel unless NP ⊆ coNP/poly.

OR-composition on picture

Michal Pilipczuk Kernelization lower bounds 12/33 P-time

poly(k)

OR-composition theorem [BDFH; 2008] Suppose a parameterized problem L admits an OR-composition algorithm, and the unparameterized version of L is NP-hard. Then L does not admit a polynomial kernel unless NP ⊆ coNP/poly.

OR-composition on picture

t instances z }| { k k k k k k k k k

Michal Pilipczuk Kernelization lower bounds 12/33 poly(k)

OR-composition theorem [BDFH; 2008] Suppose a parameterized problem L admits an OR-composition algorithm, and the unparameterized version of L is NP-hard. Then L does not admit a polynomial kernel unless NP ⊆ coNP/poly.

OR-composition on picture

t instances z }| { k k k k k k k k k

P-time

Michal Pilipczuk Kernelization lower bounds 12/33 OR-composition theorem [BDFH; 2008] Suppose a parameterized problem L admits an OR-composition algorithm, and the unparameterized version of L is NP-hard. Then L does not admit a polynomial kernel unless NP ⊆ coNP/poly.

OR-composition on picture

t instances z }| { k k k k k k k k k

P-time

poly(k)

Michal Pilipczuk Kernelization lower bounds 12/33 OR-composition on picture

t instances z }| { k k k k k k k k k

P-time

poly(k)

OR-composition theorem [BDFH; 2008] Suppose a parameterized problem L admits an OR-composition algorithm, and the unparameterized version of L is NP-hard. Then L does not admit a polynomial kernel unless NP ⊆ coNP/poly.

Michal Pilipczuk Kernelization lower bounds 12/33 OR-SAT OR-SAT NP NP NP NP NP NP NP NP NP -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd

1 1 1 2 2 2 k0 k0 k0 ˜ L L cmp cmp cmp

poly(k) poly(k) poly(k) L kern kern kern L L OR-

Proof

Michal Pilipczuk Kernelization lower bounds 13/33 NP NP NP NP NP NP NP NP NP -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd

1 1 1 2 2 2 k0 k0 k0 ˜ L L cmp cmp cmp

poly(k) poly(k) poly(k) L kern kern kern L L OR-

Proof OR-SAT OR-SAT

Michal Pilipczuk Kernelization lower bounds 13/33 NP NP NP NP NP NP NP NP NP -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd

1 1 1 2 2 2 k0 k0 k0 L cmp cmp cmp

poly(k) poly(k) poly(k) L kern kern kern L L OR-

Proof OR-SAT OR-SAT NP NP NP NP NP NP NP NP NP -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd ˜ L

Michal Pilipczuk Kernelization lower bounds 13/33 NP NP NP NP NP NP NP NP NP -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd ˜ L cmp cmp cmp

poly(k) poly(k) poly(k) L kern kern kern L L OR-

Proof OR-SAT OR-SAT NP NP NP NP NP NP NP NP NP -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd

1 1 1 2 2 2 k0 k0 k0 L

Michal Pilipczuk Kernelization lower bounds 13/33 NP NP NP NP NP NP NP NP NP -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd ˜ L kern kern kern L L OR-

Proof OR-SAT OR-SAT NP NP NP NP NP NP NP NP NP -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd

1 1 1 2 2 2 k0 k0 k0 L cmp cmp cmp

poly(k) poly(k) poly(k) L

Michal Pilipczuk Kernelization lower bounds 13/33 NP NP NP NP NP NP NP NP NP -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd ˜ L L OR-

Proof OR-SAT OR-SAT NP NP NP NP NP NP NP NP NP -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd

1 1 1 2 2 2 k0 k0 k0 L cmp cmp cmp

poly(k) poly(k) poly(k) L kern kern kern L

Michal Pilipczuk Kernelization lower bounds 13/33 NP NP NP NP NP NP NP NP NP -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd ˜ L L OR-

Proof OR-SAT OR-SAT NP NP NP NP NP NP NP NP NP -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd

1 1 1 2 2 2 k0 k0 k0 L cmp cmp cmp

poly(k) poly(k) poly(k) L kern kern kern L L OR-

Michal Pilipczuk Kernelization lower bounds 13/33 NP NP NP NP NP NP NP NP NP -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd ˜ L L OR-

Proof OR-SAT OR-SAT NP NP NP NP NP NP NP NP NP -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd -hrd

1 1 1 2 2 2 k0 k0 k0 L cmp cmp cmp

poly(k) poly(k) poly(k) L kern kern kern L L OR-

Michal Pilipczuk Kernelization lower bounds 13/33 A graph admits a k-path iff any of its connected components does.

Composition: Take the disjoint union of the input graphs and the same parameter.

Same for k-Cycle and many other problems. Today, investigating the existence of a polynomial kernel is often a secondary goal after showing that a problem is FPT.

Corollaries

k-Path does not admit a polykernel, unless NP ⊆ coNP/poly.

Michal Pilipczuk Kernelization lower bounds 14/33 A graph admits a k-path iff any of its connected components does. Same for k-Cycle and many other problems. Today, investigating the existence of a polynomial kernel is often a secondary goal after showing that a problem is FPT.

Corollaries

k-Path does not admit a polykernel, unless NP ⊆ coNP/poly. Composition: Take the disjoint union of the input graphs and the same parameter.

Michal Pilipczuk Kernelization lower bounds 14/33 Same for k-Cycle and many other problems. Today, investigating the existence of a polynomial kernel is often a secondary goal after showing that a problem is FPT.

Corollaries

k-Path does not admit a polykernel, unless NP ⊆ coNP/poly. Composition: Take the disjoint union of the input graphs and the same parameter. A graph admits a k-path iff any of its connected components does.

Michal Pilipczuk Kernelization lower bounds 14/33 Today, investigating the existence of a polynomial kernel is often a secondary goal after showing that a problem is FPT.

Corollaries

k-Path does not admit a polykernel, unless NP ⊆ coNP/poly. Composition: Take the disjoint union of the input graphs and the same parameter. A graph admits a k-path iff any of its connected components does. Same for k-Cycle and many other problems.

Michal Pilipczuk Kernelization lower bounds 14/33 Corollaries

k-Path does not admit a polykernel, unless NP ⊆ coNP/poly. Composition: Take the disjoint union of the input graphs and the same parameter. A graph admits a k-path iff any of its connected components does. Same for k-Cycle and many other problems. Today, investigating the existence of a polynomial kernel is often a secondary goal after showing that a problem is FPT.

Michal Pilipczuk Kernelization lower bounds 14/33 No, the composition algorithm can compose instances of any NP-hard language Q into one instance of L.

Yes, as long as we have polynomial number of buckets.

Well, not larger than |Σ|k+1, as we may remove duplicates of the input instances. Hence, we may assume that log t = O(k). Ergo, the parameter of the composed instance may depend polynomially on both k and log t. Observed also earlier via different arguments. (Dom, Lokshtanov, and Saurabh; 2009)

Sure, we will just end up with an instance of OR-R. Do we need to start the composition with the same language L as we apply the compression to?

Can we add more refined bucket sorting? For instance, also by the number of vertices in the graph?

How large can t be?

Adding features

Does the proof actually exclude even polynomial compression into any R, not just kernelization?

Michal Pilipczuk Kernelization lower bounds 15/33 No, the composition algorithm can compose instances of any NP-hard language Q into one instance of L.

Yes, as long as we have polynomial number of buckets.

Well, not larger than |Σ|k+1, as we may remove duplicates of the input instances. Hence, we may assume that log t = O(k). Ergo, the parameter of the composed instance may depend polynomially on both k and log t. Observed also earlier via different arguments. (Dom, Lokshtanov, and Saurabh; 2009)

Do we need to start the composition with the same language L as we apply the compression to?

Can we add more refined bucket sorting? For instance, also by the number of vertices in the graph?

How large can t be?

Adding features

Does the proof actually exclude even polynomial compression into any R, not just kernelization? Sure, we will just end up with an instance of OR-R.

Michal Pilipczuk Kernelization lower bounds 15/33 Yes, as long as we have polynomial number of buckets.

Well, not larger than |Σ|k+1, as we may remove duplicates of the input instances. Hence, we may assume that log t = O(k). Ergo, the parameter of the composed instance may depend polynomially on both k and log t. Observed also earlier via different arguments. (Dom, Lokshtanov, and Saurabh; 2009)

No, the composition algorithm can compose instances of any NP-hard language Q into one instance of L. Can we add more refined bucket sorting? For instance, also by the number of vertices in the graph?

How large can t be?

Adding features

Does the proof actually exclude even polynomial compression into any R, not just kernelization? Sure, we will just end up with an instance of OR-R. Do we need to start the composition with the same language L as we apply the compression to?

Michal Pilipczuk Kernelization lower bounds 15/33 Yes, as long as we have polynomial number of buckets.

Well, not larger than |Σ|k+1, as we may remove duplicates of the input instances. Hence, we may assume that log t = O(k). Ergo, the parameter of the composed instance may depend polynomially on both k and log t. Observed also earlier via different arguments. (Dom, Lokshtanov, and Saurabh; 2009)

Can we add more refined bucket sorting? For instance, also by the number of vertices in the graph?

How large can t be?

Adding features

Does the proof actually exclude even polynomial compression into any R, not just kernelization? Sure, we will just end up with an instance of OR-R. Do we need to start the composition with the same language L as we apply the compression to? No, the composition algorithm can compose instances of any NP-hard language Q into one instance of L.

Michal Pilipczuk Kernelization lower bounds 15/33 Well, not larger than |Σ|k+1, as we may remove duplicates of the input instances. Hence, we may assume that log t = O(k). Ergo, the parameter of the composed instance may depend polynomially on both k and log t. Observed also earlier via different arguments. (Dom, Lokshtanov, and Saurabh; 2009)

Yes, as long as we have polynomial number of buckets. How large can t be?

Adding features

Does the proof actually exclude even polynomial compression into any R, not just kernelization? Sure, we will just end up with an instance of OR-R. Do we need to start the composition with the same language L as we apply the compression to? No, the composition algorithm can compose instances of any NP-hard language Q into one instance of L. Can we add more refined bucket sorting? For instance, also by the number of vertices in the graph?

Michal Pilipczuk Kernelization lower bounds 15/33 Well, not larger than |Σ|k+1, as we may remove duplicates of the input instances. Hence, we may assume that log t = O(k). Ergo, the parameter of the composed instance may depend polynomially on both k and log t. Observed also earlier via different arguments. (Dom, Lokshtanov, and Saurabh; 2009)

How large can t be?

Adding features

Does the proof actually exclude even polynomial compression into any R, not just kernelization? Sure, we will just end up with an instance of OR-R. Do we need to start the composition with the same language L as we apply the compression to? No, the composition algorithm can compose instances of any NP-hard language Q into one instance of L. Can we add more refined bucket sorting? For instance, also by the number of vertices in the graph? Yes, as long as we have polynomial number of buckets.

Michal Pilipczuk Kernelization lower bounds 15/33 Well, not larger than |Σ|k+1, as we may remove duplicates of the input instances. Hence, we may assume that log t = O(k). Ergo, the parameter of the composed instance may depend polynomially on both k and log t. Observed also earlier via different arguments. (Dom, Lokshtanov, and Saurabh; 2009)

Adding features

Does the proof actually exclude even polynomial compression into any R, not just kernelization? Sure, we will just end up with an instance of OR-R. Do we need to start the composition with the same language L as we apply the compression to? No, the composition algorithm can compose instances of any NP-hard language Q into one instance of L. Can we add more refined bucket sorting? For instance, also by the number of vertices in the graph? Yes, as long as we have polynomial number of buckets. How large can t be?

Michal Pilipczuk Kernelization lower bounds 15/33 Hence, we may assume that log t = O(k). Ergo, the parameter of the composed instance may depend polynomially on both k and log t. Observed also earlier via different arguments. (Dom, Lokshtanov, and Saurabh; 2009)

Adding features

Does the proof actually exclude even polynomial compression into any R, not just kernelization? Sure, we will just end up with an instance of OR-R. Do we need to start the composition with the same language L as we apply the compression to? No, the composition algorithm can compose instances of any NP-hard language Q into one instance of L. Can we add more refined bucket sorting? For instance, also by the number of vertices in the graph? Yes, as long as we have polynomial number of buckets. How large can t be? Well, not larger than |Σ|k+1, as we may remove duplicates of the input instances.

Michal Pilipczuk Kernelization lower bounds 15/33 Ergo, the parameter of the composed instance may depend polynomially on both k and log t. Observed also earlier via different arguments. (Dom, Lokshtanov, and Saurabh; 2009)

Adding features

Does the proof actually exclude even polynomial compression into any R, not just kernelization? Sure, we will just end up with an instance of OR-R. Do we need to start the composition with the same language L as we apply the compression to? No, the composition algorithm can compose instances of any NP-hard language Q into one instance of L. Can we add more refined bucket sorting? For instance, also by the number of vertices in the graph? Yes, as long as we have polynomial number of buckets. How large can t be? Well, not larger than |Σ|k+1, as we may remove duplicates of the input instances. Hence, we may assume that log t = O(k).

Michal Pilipczuk Kernelization lower bounds 15/33 Observed also earlier via different arguments. (Dom, Lokshtanov, and Saurabh; 2009)

Adding features

Does the proof actually exclude even polynomial compression into any R, not just kernelization? Sure, we will just end up with an instance of OR-R. Do we need to start the composition with the same language L as we apply the compression to? No, the composition algorithm can compose instances of any NP-hard language Q into one instance of L. Can we add more refined bucket sorting? For instance, also by the number of vertices in the graph? Yes, as long as we have polynomial number of buckets. How large can t be? Well, not larger than |Σ|k+1, as we may remove duplicates of the input instances. Hence, we may assume that log t = O(k). Ergo, the parameter of the composed instance may depend polynomially on both k and log t.

Michal Pilipczuk Kernelization lower bounds 15/33 Adding features

Does the proof actually exclude even polynomial compression into any R, not just kernelization? Sure, we will just end up with an instance of OR-R. Do we need to start the composition with the same language L as we apply the compression to? No, the composition algorithm can compose instances of any NP-hard language Q into one instance of L. Can we add more refined bucket sorting? For instance, also by the number of vertices in the graph? Yes, as long as we have polynomial number of buckets. How large can t be? Well, not larger than |Σ|k+1, as we may remove duplicates of the input instances. Hence, we may assume that log t = O(k). Ergo, the parameter of the composed instance may depend polynomially on both k and log t. Observed also earlier via different arguments. (Dom, Lokshtanov, and Saurabh; 2009)

Michal Pilipczuk Kernelization lower bounds 15/33 As we’ll see later, there can be much more intricate compositions than just “disjoint union”. Examples: Max Leaf Subtree, Set Cover/m, Set Cover/n, Steiner Tree, Connected , Disjoint Paths, Directed Multiway Cut with 2 terminals, ... Most of the works use a subset of mentioned features. Later: a new formalism cross-composition gathers all the features. (Bodlaender, Jansen, and Kratsch; 2011)

Towards a unified methodology

After the invention of the technique of OR-compositions, there was a huge number of no-polykernel results.

Michal Pilipczuk Kernelization lower bounds 16/33 Examples: Max Leaf Subtree, Set Cover/m, Set Cover/n, Steiner Tree, Connected Vertex Cover, Disjoint Paths, Directed Multiway Cut with 2 terminals, ... Most of the works use a subset of mentioned features. Later: a new formalism cross-composition gathers all the features. (Bodlaender, Jansen, and Kratsch; 2011)

Towards a unified methodology

After the invention of the technique of OR-compositions, there was a huge number of no-polykernel results. As we’ll see later, there can be much more intricate compositions than just “disjoint union”.

Michal Pilipczuk Kernelization lower bounds 16/33 Most of the works use a subset of mentioned features. Later: a new formalism cross-composition gathers all the features. (Bodlaender, Jansen, and Kratsch; 2011)

Towards a unified methodology

After the invention of the technique of OR-compositions, there was a huge number of no-polykernel results. As we’ll see later, there can be much more intricate compositions than just “disjoint union”. Examples: Max Leaf Subtree, Set Cover/m, Set Cover/n, Steiner Tree, Connected Vertex Cover, Disjoint Paths, Directed Multiway Cut with 2 terminals, ...

Michal Pilipczuk Kernelization lower bounds 16/33 Later: a new formalism cross-composition gathers all the features. (Bodlaender, Jansen, and Kratsch; 2011)

Towards a unified methodology

After the invention of the technique of OR-compositions, there was a huge number of no-polykernel results. As we’ll see later, there can be much more intricate compositions than just “disjoint union”. Examples: Max Leaf Subtree, Set Cover/m, Set Cover/n, Steiner Tree, Connected Vertex Cover, Disjoint Paths, Directed Multiway Cut with 2 terminals, ... Most of the works use a subset of mentioned features.

Michal Pilipczuk Kernelization lower bounds 16/33 Towards a unified methodology

After the invention of the technique of OR-compositions, there was a huge number of no-polykernel results. As we’ll see later, there can be much more intricate compositions than just “disjoint union”. Examples: Max Leaf Subtree, Set Cover/m, Set Cover/n, Steiner Tree, Connected Vertex Cover, Disjoint Paths, Directed Multiway Cut with 2 terminals, ... Most of the works use a subset of mentioned features. Later: a new formalism cross-composition gathers all the features. (Bodlaender, Jansen, and Kratsch; 2011)

Michal Pilipczuk Kernelization lower bounds 16/33 partitioning with respect to the number of vertices of the graph; or with respect to (i) the number of vertices, (ii) the number of edges, (iii) size of the maximum , (iv) budget.

Examples, supposing some reasonable graph encoding:

Polynomial equivalence relation

Polynomial equivalence relation Equivalence relation ∼ on Σ? is a polynomial equivalence relation if: checking whether two words x, y ∈ Σ? are ∼-equivalent can be done in poly(|x| + |y|) time; and ∼ partitions words of length 6 n into poly(n) equivalence classes.

Michal Pilipczuk Kernelization lower bounds 17/33 partitioning with respect to the number of vertices of the graph; or with respect to (i) the number of vertices, (ii) the number of edges, (iii) size of the maximum matching, (iv) budget.

Examples, supposing some reasonable graph encoding:

Polynomial equivalence relation

Polynomial equivalence relation Equivalence relation ∼ on Σ? is a polynomial equivalence relation if: checking whether two words x, y ∈ Σ? are ∼-equivalent can be done in poly(|x| + |y|) time; and ∼ partitions words of length 6 n into poly(n) equivalence classes.

Michal Pilipczuk Kernelization lower bounds 17/33 partitioning with respect to the number of vertices of the graph; or with respect to (i) the number of vertices, (ii) the number of edges, (iii) size of the maximum matching, (iv) budget.

Polynomial equivalence relation

Polynomial equivalence relation Equivalence relation ∼ on Σ? is a polynomial equivalence relation if: checking whether two words x, y ∈ Σ? are ∼-equivalent can be done in poly(|x| + |y|) time; and ∼ partitions words of length 6 n into poly(n) equivalence classes.

Examples, supposing some reasonable graph encoding:

Michal Pilipczuk Kernelization lower bounds 17/33 or with respect to (i) the number of vertices, (ii) the number of edges, (iii) size of the maximum matching, (iv) budget.

Polynomial equivalence relation

Polynomial equivalence relation Equivalence relation ∼ on Σ? is a polynomial equivalence relation if: checking whether two words x, y ∈ Σ? are ∼-equivalent can be done in poly(|x| + |y|) time; and ∼ partitions words of length 6 n into poly(n) equivalence classes.

Examples, supposing some reasonable graph encoding: partitioning with respect to the number of vertices of the graph;

Michal Pilipczuk Kernelization lower bounds 17/33 Polynomial equivalence relation

Polynomial equivalence relation Equivalence relation ∼ on Σ? is a polynomial equivalence relation if: checking whether two words x, y ∈ Σ? are ∼-equivalent can be done in poly(|x| + |y|) time; and ∼ partitions words of length 6 n into poly(n) equivalence classes.

Examples, supposing some reasonable graph encoding: partitioning with respect to the number of vertices of the graph; or with respect to (i) the number of vertices, (ii) the number of edges, (iii) size of the maximum matching, (iv) budget.

Michal Pilipczuk Kernelization lower bounds 17/33 Cross-composition theorem [Bodlaender, Jansen, Kratsch] If some NP-hard problem Q cross-composes into L, then L has no polynomial compression into any language R, unless NP ⊆ coNP/poly.

Cross-composition

Cross-composition An unparameterized problem Q cross-composes into a parameterized problem L, if there exists a polynomial equivalence relation ∼ and an algorithm that, given ∼-equivalent strings x1, x2,..., xt , in time Pt  ? poly t + i=1 |xi | produces one instance (y, k ) such that ? (y, k ) ∈ L iff xi ∈ Q for at least one i = 1, 2,..., t, ? t k = poly (log t + maxi=1 |xi |).

Michal Pilipczuk Kernelization lower bounds 18/33 Cross-composition

Cross-composition An unparameterized problem Q cross-composes into a parameterized problem L, if there exists a polynomial equivalence relation ∼ and an algorithm that, given ∼-equivalent strings x1, x2,..., xt , in time Pt  ? poly t + i=1 |xi | produces one instance (y, k ) such that ? (y, k ) ∈ L iff xi ∈ Q for at least one i = 1, 2,..., t, ? t k = poly (log t + maxi=1 |xi |).

Cross-composition theorem [Bodlaender, Jansen, Kratsch] If some NP-hard problem Q cross-composes into L, then L has no polynomial compression into any language R, unless NP ⊆ coNP/poly.

Michal Pilipczuk Kernelization lower bounds 18/33 k = max |xi |, log t = O(k) Q Q compo compo compo

poly(k) poly(k) poly(k) L cmpr cmpr cmpr R R OR-

Proof

Michal Pilipczuk Kernelization lower bounds 19/33 compo compo compo

poly(k) poly(k) poly(k) L cmpr cmpr cmpr R R OR-

Proof

k = max |xi |, log t = O(k) Q Q

Michal Pilipczuk Kernelization lower bounds 19/33 compo compo compo

poly(k) poly(k) poly(k) L cmpr cmpr cmpr R R OR-

Proof

k = max |xi |, log t = O(k) Q Q

Michal Pilipczuk Kernelization lower bounds 19/33 cmpr cmpr cmpr R R OR-

Proof

k = max |xi |, log t = O(k) Q Q compo compo compo

poly(k) poly(k) poly(k) L

Michal Pilipczuk Kernelization lower bounds 19/33 R OR-

Proof

k = max |xi |, log t = O(k) Q Q compo compo compo

poly(k) poly(k) poly(k) L cmpr cmpr cmpr R

Michal Pilipczuk Kernelization lower bounds 19/33 R OR-

Proof

k = max |xi |, log t = O(k) Q Q compo compo compo

poly(k) poly(k) poly(k) L cmpr cmpr cmpr R R OR-

Michal Pilipczuk Kernelization lower bounds 19/33 R OR-

Proof

k = max |xi |, log t = O(k) Q Q compo compo compo

poly(k) poly(k) poly(k) L cmpr cmpr cmpr R R OR-

Michal Pilipczuk Kernelization lower bounds 19/33 In fact, cross-composition is a good framework to express also all the previous results. Plan for now: show some non-trivial cross-composition to give an intuition about basic tricks.

Applications

Original application of Bodlaender, Jansen and Kratsch was that of structural parameters.

Michal Pilipczuk Kernelization lower bounds 20/33 Plan for now: show some non-trivial cross-composition to give an intuition about basic tricks.

Applications

Original application of Bodlaender, Jansen and Kratsch was that of structural parameters. In fact, cross-composition is a good framework to express also all the previous results.

Michal Pilipczuk Kernelization lower bounds 20/33 Applications

Original application of Bodlaender, Jansen and Kratsch was that of structural parameters. In fact, cross-composition is a good framework to express also all the previous results. Plan for now: show some non-trivial cross-composition to give an intuition about basic tricks.

Michal Pilipczuk Kernelization lower bounds 20/33 We show a cross-composition of Set Splitting into itself. We may assume that the universes are of the same size, hence we think of them as of one, common universe. Assume that t is a power of 2 (by copying the instances).

Application 1: Set Splitting

Set Splitting I: Universe U and family of subsets F ⊆ 2U P: |U| Q: Is there a coloring C : U → {B, W} such that every set X ∈ F is split, i.e., contains a black and a white element?

Michal Pilipczuk Kernelization lower bounds 21/33 We may assume that the universes are of the same size, hence we think of them as of one, common universe. Assume that t is a power of 2 (by copying the instances).

Application 1: Set Splitting

Set Splitting I: Universe U and family of subsets F ⊆ 2U P: |U| Q: Is there a coloring C : U → {B, W} such that every set X ∈ F is split, i.e., contains a black and a white element?

We show a cross-composition of Set Splitting into itself.

Michal Pilipczuk Kernelization lower bounds 21/33 Assume that t is a power of 2 (by copying the instances).

Application 1: Set Splitting

Set Splitting I: Universe U and family of subsets F ⊆ 2U P: |U| Q: Is there a coloring C : U → {B, W} such that every set X ∈ F is split, i.e., contains a black and a white element?

We show a cross-composition of Set Splitting into itself. We may assume that the universes are of the same size, hence we think of them as of one, common universe.

Michal Pilipczuk Kernelization lower bounds 21/33 Application 1: Set Splitting

Set Splitting I: Universe U and family of subsets F ⊆ 2U P: |U| Q: Is there a coloring C : U → {B, W} such that every set X ∈ F is split, i.e., contains a black and a white element?

We show a cross-composition of Set Splitting into itself. We may assume that the universes are of the same size, hence we think of them as of one, common universe. Assume that t is a power of 2 (by copying the instances).

Michal Pilipczuk Kernelization lower bounds 21/33 INSTANCE SELECTOR 1 + log t pairs of vertices

Take any solution C ∗ X0 : X , left special vertex, Thereand binary is exactly encoding one index of i ini with IS monochromatic parts fromIS. |U∗| = |U| + 2 log t + 2 |U∗| = |U| + 2 log t + 2 (⇒): ∗ C onIS defines, which∗ instance must be X1 : reverse X0 on IS F∗ consists of: Fsolved∗ consists inPL of: 1 + log t 2-element sets for pairs, 1 + log t 2-element sets for pairs, i ∗ ∗ (⇐): i i ∗ ∗ ∀X ∈ F , two sets X0 , X1 If∀ (XU,∈F F) is, two solvable, sets X we0 , setX1IS accordingly, and solve this instance inPL. Remaining sets are split for free.

PLAYGROUND joint universe U

Cross-composing into Set Splitting

Input: Instances (U, Fi ) Input: Instances (U, Fi )

Output: Instance (U∗, F∗) Output: Instance (U∗, F∗)

Micha l Pilipczuk Kernelization lower bounds 22/33 INSTANCE SELECTOR 1 + log t pairs of vertices

Take any solution C ∗ X0 : X , left special vertex, Thereand binary is exactly encoding one index of i ini with IS monochromatic parts fromIS. |U∗| = |U| + 2 log t + 2 |U∗| = |U| + 2 log t + 2 (⇒): ∗ C onIS defines, which∗ instance must be X1 : reverse X0 on IS F∗ consists of: Fsolved∗ consists inPL of: 1 + log t 2-element sets for pairs, 1 + log t 2-element sets for pairs, i ∗ ∗ (⇐): i i ∗ ∗ ∀X ∈ F , two sets X0 , X1 If∀ (XU,∈F F) is, two solvable, sets X we0 , setX1IS accordingly, and solve this instance inPL. Remaining sets are split for free.

Cross-composing into Set Splitting

Input: Instances (U, Fi ) Input: Instances (U, Fi )

Output: Instance (U∗, F∗) Output: Instance (U∗, F∗)

PLAYGROUND joint universe U

Micha l Pilipczuk Kernelization lower bounds 22/33 Take any solution C ∗ X0 : X , left special vertex, Thereand binary is exactly encoding one index of i ini with IS monochromatic parts fromIS. |U∗| = |U| + 2 log t + 2 |U∗| = |U| + 2 log t + 2 (⇒): ∗ C onIS defines, which∗ instance must be X1 : reverse X0 on IS F∗ consists of: Fsolved∗ consists inPL of: 1 + log t 2-element sets for pairs, 1 + log t 2-element sets for pairs, i ∗ ∗ (⇐): i i ∗ ∗ ∀X ∈ F , two sets X0 , X1 If∀ (XU,∈F F) is, two solvable, sets X we0 , setX1IS accordingly, and solve this instance inPL. Remaining sets are split for free.

Cross-composing into Set Splitting

INSTANCE SELECTOR 1 + log t pairs of vertices Input: Instances (U, Fi ) Input: Instances (U, Fi )

Output: Instance (U∗, F∗) Output: Instance (U∗, F∗)

PLAYGROUND joint universe U

Micha l Pilipczuk Kernelization lower bounds 22/33 Take any solution C ∗ X0 : X , left special vertex, Thereand binary is exactly encoding one index of i ini with IS monochromatic parts fromIS.

(⇒): ∗ C onIS defines, which∗ instance must be X1 : reverse X0 on IS F∗ consists of: Fsolved∗ consists inPL of: 1 + log t 2-element sets for pairs, 1 + log t 2-element sets for pairs, i ∗ ∗ (⇐): i i ∗ ∗ ∀X ∈ F , two sets X0 , X1 If∀ (XU,∈F F) is, two solvable, sets X we0 , setX1IS accordingly, and solve this instance inPL. Remaining sets are split for free.

Cross-composing into Set Splitting

INSTANCE SELECTOR 1 + log t pairs of vertices Input: Instances (U, Fi ) Input: Instances (U, Fi )

Output: Instance (U∗, F∗) Output: Instance (U∗, F∗)

|U∗| = |U| + 2 log t + 2 |U∗| = |U| + 2 log t + 2

PLAYGROUND joint universe U

Micha l Pilipczuk Kernelization lower bounds 22/33 Take any solution C ∗ X0 : X , left special vertex, Thereand binary is exactly encoding one index of i ini with IS monochromatic parts fromIS.

(⇒): ∗ C onIS defines, which∗ instance must be X1 : reverse X0 on IS solved inPL 1 + log t 2-element sets for pairs, 1 + log t 2-element sets for pairs, i ∗ ∗ (⇐): i i ∗ ∗ ∀X ∈ F , two sets X0 , X1 If∀ (XU,∈F F) is, two solvable, sets X we0 , setX1IS accordingly, and solve this instance inPL. Remaining sets are split for free.

Cross-composing into Set Splitting

INSTANCE SELECTOR 1 + log t pairs of vertices Input: Instances (U, Fi ) Input: Instances (U, Fi )

Output: Instance (U∗, F∗) Output: Instance (U∗, F∗)

|U∗| = |U| + 2 log t + 2 |U∗| = |U| + 2 log t + 2

F∗ consists of: F∗ consists of:

PLAYGROUND joint universe U

Micha l Pilipczuk Kernelization lower bounds 22/33 Take any solution C ∗ X0 : X , left special vertex, Thereand binary is exactly encoding one index of i ini with IS monochromatic parts fromIS.

(⇒): ∗ C onIS defines, which∗ instance must be X1 : reverse X0 on IS solved inPL

i ∗ ∗ (⇐): i i ∗ ∗ ∀X ∈ F , two sets X0 , X1 If∀ (XU,∈F F) is, two solvable, sets X we0 , setX1IS accordingly, and solve this instance inPL. Remaining sets are split for free.

Cross-composing into Set Splitting

INSTANCE SELECTOR 1 + log t pairs of vertices Input: Instances (U, Fi ) Input: Instances (U, Fi )

Output: Instance (U∗, F∗) Output: Instance (U∗, F∗)

|U∗| = |U| + 2 log t + 2 |U∗| = |U| + 2 log t + 2

F∗ consists of: F∗ consists of: 1 + log t 2-element sets for pairs, 1 + log t 2-element sets for pairs,

PLAYGROUND joint universe U

Micha l Pilipczuk Kernelization lower bounds 22/33 Take any solution C ∗ X0 : X , left special vertex, Thereand binary is exactly encoding one index of i ini with IS monochromatic parts fromIS.

(⇒): ∗ C onIS defines, which∗ instance must be X1 : reverse X0 on IS solved inPL

(⇐): If (U, Fi ) is solvable, we setIS accordingly, and solve this instance inPL. Remaining sets are split for free.

Cross-composing into Set Splitting

INSTANCE SELECTOR 1 + log t pairs of vertices Input: Instances (U, Fi ) Input: Instances (U, Fi )

Output: Instance (U∗, F∗) Output: Instance (U∗, F∗)

|U∗| = |U| + 2 log t + 2 |U∗| = |U| + 2 log t + 2

F∗ consists of: F∗ consists of: 1 + log t 2-element sets for pairs, 1 + log t 2-element sets for pairs, i ∗ ∗ i ∗ ∗ ∀X ∈ F , two sets X0 , X1 ∀X ∈ F , two sets X0 , X1

PLAYGROUND joint universe U

Micha l Pilipczuk Kernelization lower bounds 22/33 Take any solution C

There is exactly one index i with monochromatic parts fromIS.

(⇒): ∗ C onIS defines, which∗ instance must be X1 : reverse X0 on IS solved inPL

(⇐): If (U, Fi ) is solvable, we setIS accordingly, and solve this instance inPL. Remaining sets are split for free.

Cross-composing into Set Splitting

INSTANCE SELECTOR 1 + log t pairs of vertices Input: Instances (U, Fi ) Input: Instances (U, Fi )

∗ ∗ ∗ ∗ ∗ Output: Instance (U , F ) X0 : OutputX ,: left Instance special ( vertex,U , F ) and binary encoding of i in IS

|U∗| = |U| + 2 log t + 2 |U∗| = |U| + 2 log t + 2

F∗ consists of: F∗ consists of: 1 + log t 2-element sets for pairs, 1 + log t 2-element sets for pairs, i ∗ ∗ i ∗ ∗ ∀X ∈ F , two sets X0 , X1 ∀X ∈ F , two sets X0 , X1

PLAYGROUND joint universe U

Micha l Pilipczuk Kernelization lower bounds 22/33 Take any solution C

There is exactly one index i with monochromatic parts fromIS.

(⇒): C onIS defines, which instance must be solved inPL

(⇐): If (U, Fi ) is solvable, we setIS accordingly, and solve this instance inPL. Remaining sets are split for free.

Cross-composing into Set Splitting

INSTANCE SELECTOR 1 + log t pairs of vertices Input: Instances (U, Fi ) Input: Instances (U, Fi )

∗ ∗ ∗ ∗ ∗ Output: Instance (U , F ) X0 : OutputX ,: left Instance special ( vertex,U , F ) and binary encoding of i in IS

|U∗| = |U| + 2 log t + 2 |U∗| = |U| + 2 log t + 2 ∗ ∗ X1 : reverse X0 on IS F∗ consists of: F∗ consists of: 1 + log t 2-element sets for pairs, 1 + log t 2-element sets for pairs, i ∗ ∗ i ∗ ∗ ∀X ∈ F , two sets X0 , X1 ∀X ∈ F , two sets X0 , X1

PLAYGROUND joint universe U

Micha l Pilipczuk Kernelization lower bounds 22/33 ∗ X0 : X , left special vertex, Thereand binary is exactly encoding one index of i ini with IS monochromatic parts fromIS.

(⇒): ∗ C onIS defines, which∗ instance must be X1 : reverse X0 on IS solved inPL

(⇐): If (U, Fi ) is solvable, we setIS accordingly, and solve this instance inPL. Remaining sets are split for free.

Cross-composing into Set Splitting

INSTANCE SELECTOR 1 + log t pairs of vertices Input: Instances (U, Fi ) Input: Instances (U, Fi ) Take any solution C Output: Instance (U∗, F∗) Output: Instance (U∗, F∗)

|U∗| = |U| + 2 log t + 2 |U∗| = |U| + 2 log t + 2

F∗ consists of: F∗ consists of: 1 + log t 2-element sets for pairs, 1 + log t 2-element sets for pairs, i ∗ ∗ i ∗ ∗ ∀X ∈ F , two sets X0 , X1 ∀X ∈ F , two sets X0 , X1

PLAYGROUND joint universe U

Micha l Pilipczuk Kernelization lower bounds 22/33 ∗ X0 : X , left special vertex, and binary encoding of i in IS

(⇒): ∗ C onIS defines, which∗ instance must be X1 : reverse X0 on IS solved inPL

(⇐): If (U, Fi ) is solvable, we setIS accordingly, and solve this instance inPL. Remaining sets are split for free.

Cross-composing into Set Splitting

INSTANCE SELECTOR 1 + log t pairs of vertices Input: Instances (U, Fi ) Input: Instances (U, Fi ) Take any solution C Output: Instance (U∗, F∗) Output: Instance (U∗, F∗) There is exactly one index i with monochromatic parts fromIS. |U∗| = |U| + 2 log t + 2 |U∗| = |U| + 2 log t + 2

F∗ consists of: F∗ consists of: 1 + log t 2-element sets for pairs, 1 + log t 2-element sets for pairs, i ∗ ∗ i ∗ ∗ ∀X ∈ F , two sets X0 , X1 ∀X ∈ F , two sets X0 , X1

PLAYGROUND joint universe U

Micha l Pilipczuk Kernelization lower bounds 22/33 ∗ X0 : X , left special vertex, and binary encoding of i in IS

(⇒): ∗ C onIS defines, which∗ instance must be X1 : reverse X0 on IS solved inPL

(⇐): If (U, Fi ) is solvable, we setIS accordingly, and solve this instance inPL. Remaining sets are split for free.

Cross-composing into Set Splitting

INSTANCE SELECTOR 1 + log t pairs of vertices Input: Instances (U, Fi ) Input: Instances (U, Fi ) Take any solution C Output: Instance (U∗, F∗) Output: Instance (U∗, F∗) There is exactly one index i with monochromatic parts fromIS. |U∗| = |U| + 2 log t + 2 |U∗| = |U| + 2 log t + 2

F∗ consists of: F∗ consists of: 1 + log t 2-element sets for pairs, 1 + log t 2-element sets for pairs, i ∗ ∗ i ∗ ∗ ∀X ∈ F , two sets X0 , X1 ∀X ∈ F , two sets X0 , X1

PLAYGROUND joint universe U

Micha l Pilipczuk Kernelization lower bounds 22/33 ∗ X0 : X , left special vertex, and binary encoding of i in IS

(⇒): ∗ C onIS defines, which∗ instance must be X1 : reverse X0 on IS solved inPL

(⇐): If (U, Fi ) is solvable, we setIS accordingly, and solve this instance inPL. Remaining sets are split for free.

Cross-composing into Set Splitting

INSTANCE SELECTOR 1 + log t pairs of vertices Input: Instances (U, Fi ) Input: Instances (U, Fi ) Take any solution C Output: Instance (U∗, F∗) Output: Instance (U∗, F∗) There is exactly one index i with monochromatic parts fromIS. |U∗| = |U| + 2 log t + 2 |U∗| = |U| + 2 log t + 2

F∗ consists of: F∗ consists of: 1 + log t 2-element sets for pairs, 1 + log t 2-element sets for pairs, i ∗ ∗ i ∗ ∗ ∀X ∈ F , two sets X0 , X1 ∀X ∈ F , two sets X0 , X1

PLAYGROUND joint universe U

Micha l Pilipczuk Kernelization lower bounds 22/33 ∗ X0 : X , left special vertex, and binary encoding of i in IS

∗ ∗ X1 : reverse X0 on IS

(⇐): If (U, Fi ) is solvable, we setIS accordingly, and solve this instance inPL. Remaining sets are split for free.

Cross-composing into Set Splitting

INSTANCE SELECTOR 1 + log t pairs of vertices Input: Instances (U, Fi ) Input: Instances (U, Fi ) Take any solution C Output: Instance (U∗, F∗) Output: Instance (U∗, F∗) There is exactly one index i with monochromatic parts fromIS. |U∗| = |U| + 2 log t + 2 |U∗| = |U| + 2 log t + 2 (⇒): C onIS defines, which instance must be F∗ consists of: Fsolved∗ consists inPL of: 1 + log t 2-element sets for pairs, 1 + log t 2-element sets for pairs, i ∗ ∗ i ∗ ∗ ∀X ∈ F , two sets X0 , X1 ∀X ∈ F , two sets X0 , X1

PLAYGROUND joint universe U

Micha l Pilipczuk Kernelization lower bounds 22/33 ∗ X0 : X , left special vertex, and binary encoding of i in IS

∗ ∗ X1 : reverse X0 on IS

Cross-composing into Set Splitting

INSTANCE SELECTOR 1 + log t pairs of vertices Input: Instances (U, Fi ) Input: Instances (U, Fi ) Take any solution C Output: Instance (U∗, F∗) Output: Instance (U∗, F∗) There is exactly one index i with monochromatic parts fromIS. |U∗| = |U| + 2 log t + 2 |U∗| = |U| + 2 log t + 2 (⇒): C onIS defines, which instance must be F∗ consists of: Fsolved∗ consists inPL of: 1 + log t 2-element sets for pairs, 1 + log t 2-element sets for pairs, i ∗ ∗ (⇐): i i ∗ ∗ ∀X ∈ F , two sets X0 , X1 If∀ (XU,∈F F) is, two solvable, sets X we0 , setX1IS accordingly, and solve this instance inPL. Remaining sets are split for free.

PLAYGROUND joint universe U

Micha l Pilipczuk Kernelization lower bounds 22/33 Model the choice of the instance to be solved. Idea: choose log t bits of its index on an appropriate gadget. Choice of the index makes the instance active, while the other instances are “switched off”.

Unparameterized Set Splitting is NP-hard. Hence, Set Splitting parameterized by |U| does not admit a polynomial kernel, unless NP ⊆ coNP/poly. Main lesson:

Set Splitting: wrap up

Unparameterized Set Splitting cross-composes into Set Splitting parameterized by |U|.

Michal Pilipczuk Kernelization lower bounds 23/33 Model the choice of the instance to be solved. Idea: choose log t bits of its index on an appropriate gadget. Choice of the index makes the instance active, while the other instances are “switched off”.

Hence, Set Splitting parameterized by |U| does not admit a polynomial kernel, unless NP ⊆ coNP/poly. Main lesson:

Set Splitting: wrap up

Unparameterized Set Splitting cross-composes into Set Splitting parameterized by |U|. Unparameterized Set Splitting is NP-hard.

Michal Pilipczuk Kernelization lower bounds 23/33 Model the choice of the instance to be solved. Idea: choose log t bits of its index on an appropriate gadget. Choice of the index makes the instance active, while the other instances are “switched off”.

Main lesson:

Set Splitting: wrap up

Unparameterized Set Splitting cross-composes into Set Splitting parameterized by |U|. Unparameterized Set Splitting is NP-hard. Hence, Set Splitting parameterized by |U| does not admit a polynomial kernel, unless NP ⊆ coNP/poly.

Michal Pilipczuk Kernelization lower bounds 23/33 Model the choice of the instance to be solved. Idea: choose log t bits of its index on an appropriate gadget. Choice of the index makes the instance active, while the other instances are “switched off”.

Set Splitting: wrap up

Unparameterized Set Splitting cross-composes into Set Splitting parameterized by |U|. Unparameterized Set Splitting is NP-hard. Hence, Set Splitting parameterized by |U| does not admit a polynomial kernel, unless NP ⊆ coNP/poly. Main lesson:

Michal Pilipczuk Kernelization lower bounds 23/33 Idea: choose log t bits of its index on an appropriate gadget. Choice of the index makes the instance active, while the other instances are “switched off”.

Set Splitting: wrap up

Unparameterized Set Splitting cross-composes into Set Splitting parameterized by |U|. Unparameterized Set Splitting is NP-hard. Hence, Set Splitting parameterized by |U| does not admit a polynomial kernel, unless NP ⊆ coNP/poly. Main lesson: Model the choice of the instance to be solved.

Michal Pilipczuk Kernelization lower bounds 23/33 Choice of the index makes the instance active, while the other instances are “switched off”.

Set Splitting: wrap up

Unparameterized Set Splitting cross-composes into Set Splitting parameterized by |U|. Unparameterized Set Splitting is NP-hard. Hence, Set Splitting parameterized by |U| does not admit a polynomial kernel, unless NP ⊆ coNP/poly. Main lesson: Model the choice of the instance to be solved. Idea: choose log t bits of its index on an appropriate gadget.

Michal Pilipczuk Kernelization lower bounds 23/33 Set Splitting: wrap up

Unparameterized Set Splitting cross-composes into Set Splitting parameterized by |U|. Unparameterized Set Splitting is NP-hard. Hence, Set Splitting parameterized by |U| does not admit a polynomial kernel, unless NP ⊆ coNP/poly. Main lesson: Model the choice of the instance to be solved. Idea: choose log t bits of its index on an appropriate gadget. Choice of the index makes the instance active, while the other instances are “switched off”.

Michal Pilipczuk Kernelization lower bounds 23/33 Polynomial parameter transformation (PPT) A polynomial parameter transformation from a parameterized problem P to a parameterized problem Q is a polynomial-time algorithm that transforms a given instance (x, k) of P into an equivalent instance (x 0, k0) of Q such that k0 = poly(k).

Observation If problem P PPT-reduces to Q, and P does not admit a polynomial compression algorithm (into any language R), then neither does Q.

Proof: Compose the PPT with the assumed compression for Q.

PPTs

Idea: Hardness of kernelization can be transferred via reductions, similarly to NP-hardness.

Michal Pilipczuk Kernelization lower bounds 24/33 Observation If problem P PPT-reduces to Q, and P does not admit a polynomial compression algorithm (into any language R), then neither does Q.

Proof: Compose the PPT with the assumed compression for Q.

PPTs

Idea: Hardness of kernelization can be transferred via reductions, similarly to NP-hardness.

Polynomial parameter transformation (PPT) A polynomial parameter transformation from a parameterized problem P to a parameterized problem Q is a polynomial-time algorithm that transforms a given instance (x, k) of P into an equivalent instance (x 0, k0) of Q such that k0 = poly(k).

Michal Pilipczuk Kernelization lower bounds 24/33 Proof: Compose the PPT with the assumed compression for Q.

PPTs

Idea: Hardness of kernelization can be transferred via reductions, similarly to NP-hardness.

Polynomial parameter transformation (PPT) A polynomial parameter transformation from a parameterized problem P to a parameterized problem Q is a polynomial-time algorithm that transforms a given instance (x, k) of P into an equivalent instance (x 0, k0) of Q such that k0 = poly(k).

Observation If problem P PPT-reduces to Q, and P does not admit a polynomial compression algorithm (into any language R), then neither does Q.

Michal Pilipczuk Kernelization lower bounds 24/33 PPTs

Idea: Hardness of kernelization can be transferred via reductions, similarly to NP-hardness.

Polynomial parameter transformation (PPT) A polynomial parameter transformation from a parameterized problem P to a parameterized problem Q is a polynomial-time algorithm that transforms a given instance (x, k) of P into an equivalent instance (x 0, k0) of Q such that k0 = poly(k).

Observation If problem P PPT-reduces to Q, and P does not admit a polynomial compression algorithm (into any language R), then neither does Q.

Proof: Compose the PPT with the assumed compression for Q.

Michal Pilipczuk Kernelization lower bounds 24/33 We show that Steiner Tree has no polykernel (unless...) using a PPT from a auxiliary problem.

Michal Pilipczuk Kernelization lower bounds 25/33

Application 2: Steiner Tree

Steiner Tree I: Graph G with terminals T ⊆ V (G), k ∈ N P: k + |T | Q: Is there a set X ⊆ V (G) \ T , such that |X | 6 k and G[T ∪ X ] is connected? Application 2: Steiner Tree

Steiner Tree I: Graph G with terminals T ⊆ V (G), k ∈ N P: k + |T | Q: Is there a set X ⊆ V (G) \ T , such that |X | 6 k and G[T ∪ X ] is connected?

We show that Steiner Tree has no polykernel (unless...) using a PPT from a auxiliary problem.

Michal Pilipczuk Kernelization lower bounds 25/33 Then design a PPT from P to the target problem. Idea: Move the weight of the proof to the transformation and the actual definition of P. High level: Extract the essence of the original problem into the auxiliary problem.

The auxiliary problem technique

Introduce a simpler problem P, which is almost trivially compositional.

Michal Pilipczuk Kernelization lower bounds 26/33 Idea: Move the weight of the proof to the transformation and the actual definition of P. High level: Extract the essence of the original problem into the auxiliary problem.

The auxiliary problem technique

Introduce a simpler problem P, which is almost trivially compositional. Then design a PPT from P to the target problem.

Michal Pilipczuk Kernelization lower bounds 26/33 High level: Extract the essence of the original problem into the auxiliary problem.

The auxiliary problem technique

Introduce a simpler problem P, which is almost trivially compositional. Then design a PPT from P to the target problem. Idea: Move the weight of the proof to the transformation and the actual definition of P.

Michal Pilipczuk Kernelization lower bounds 26/33 The auxiliary problem technique

Introduce a simpler problem P, which is almost trivially compositional. Then design a PPT from P to the target problem. Idea: Move the weight of the proof to the transformation and the actual definition of P. High level: Extract the essence of the original problem into the auxiliary problem.

Michal Pilipczuk Kernelization lower bounds 26/33 Colorful Graph Motif

Colorful Graph Motif I: Graph G and a coloring function φ: V (G) → {1, 2,..., k} P: k Q: Does there exists a connected subgraph of G that contains exactly one vertex of each color?

Michal Pilipczuk Kernelization lower bounds 27/33 Colorful Graph Motif

Colorful Graph Motif I: Graph G and a coloring function φ: V (G) → {1, 2,..., k} P: k Q: Does there exists a connected subgraph of G that contains exactly one vertex of each color?

Michal Pilipczuk Kernelization lower bounds 27/33 There is a connected colorful motif in the composed instance iff there is one in any of the input instances.

FPT for various variants using the algebraic approach. Composition: Take the disjoint union of instances, reuse colors.

Corollary: no polykernel for CGM unless NP ⊆ coNP/poly. Now: PPT from CGM to ST.

About CGM

The problem is NP-hard even on trees.

Michal Pilipczuk Kernelization lower bounds 28/33 There is a connected colorful motif in the composed instance iff there is one in any of the input instances.

Composition: Take the disjoint union of instances, reuse colors.

Corollary: no polykernel for CGM unless NP ⊆ coNP/poly. Now: PPT from CGM to ST.

About CGM

The problem is NP-hard even on trees. FPT algorithms for various variants using the algebraic approach.

Michal Pilipczuk Kernelization lower bounds 28/33 There is a connected colorful motif in the composed instance iff there is one in any of the input instances. Corollary: no polykernel for CGM unless NP ⊆ coNP/poly. Now: PPT from CGM to ST.

About CGM

The problem is NP-hard even on trees. FPT algorithms for various variants using the algebraic approach. Composition: Take the disjoint union of instances, reuse colors.

Michal Pilipczuk Kernelization lower bounds 28/33 Corollary: no polykernel for CGM unless NP ⊆ coNP/poly. Now: PPT from CGM to ST.

About CGM

The problem is NP-hard even on trees. FPT algorithms for various variants using the algebraic approach. Composition: Take the disjoint union of instances, reuse colors. There is a connected colorful motif in the composed instance iff there is one in any of the input instances.

Michal Pilipczuk Kernelization lower bounds 28/33 Now: PPT from CGM to ST.

About CGM

The problem is NP-hard even on trees. FPT algorithms for various variants using the algebraic approach. Composition: Take the disjoint union of instances, reuse colors. There is a connected colorful motif in the composed instance iff there is one in any of the input instances. Corollary: no polykernel for CGM unless NP ⊆ coNP/poly.

Michal Pilipczuk Kernelization lower bounds 28/33 About CGM

The problem is NP-hard even on trees. FPT algorithms for various variants using the algebraic approach. Composition: Take the disjoint union of instances, reuse colors. There is a connected colorful motif in the composed instance iff there is one in any of the input instances. Corollary: no polykernel for CGM unless NP ⊆ coNP/poly. Now: PPT from CGM to ST.

Michal Pilipczuk Kernelization lower bounds 28/33 Attach a terminal to every color class.

Give budget k for connecting nodes.

From CGM to ST

Michal Pilipczuk Kernelization lower bounds 29/33 From CGM to ST

Attach a terminal to every color class.

Give budget k for connecting nodes.

Michal Pilipczuk Kernelization lower bounds 29/33 From CGM to ST

Attach a terminal to every color class.

Give budget k for connecting nodes.

Michal Pilipczuk Kernelization lower bounds 29/33 CGM PPT-reduces to Steiner Tree par. by k + |T |. Hence Steiner Tree par. by k + |T | does not admit a polynomial kernel, unless NP ⊆ coNP/poly. Note: Composition for CGM is far simpler than trying to do this directly for Steiner Tree.

CGM: wrap up

CGM has no polynomial kernel, unless NP ⊆ coNP/poly.

Michal Pilipczuk Kernelization lower bounds 30/33 Hence Steiner Tree par. by k + |T | does not admit a polynomial kernel, unless NP ⊆ coNP/poly. Note: Composition for CGM is far simpler than trying to do this directly for Steiner Tree.

CGM: wrap up

CGM has no polynomial kernel, unless NP ⊆ coNP/poly. CGM PPT-reduces to Steiner Tree par. by k + |T |.

Michal Pilipczuk Kernelization lower bounds 30/33 Note: Composition for CGM is far simpler than trying to do this directly for Steiner Tree.

CGM: wrap up

CGM has no polynomial kernel, unless NP ⊆ coNP/poly. CGM PPT-reduces to Steiner Tree par. by k + |T |. Hence Steiner Tree par. by k + |T | does not admit a polynomial kernel, unless NP ⊆ coNP/poly.

Michal Pilipczuk Kernelization lower bounds 30/33 CGM: wrap up

CGM has no polynomial kernel, unless NP ⊆ coNP/poly. CGM PPT-reduces to Steiner Tree par. by k + |T |. Hence Steiner Tree par. by k + |T | does not admit a polynomial kernel, unless NP ⊆ coNP/poly. Note: Composition for CGM is far simpler than trying to do this directly for Steiner Tree.

Michal Pilipczuk Kernelization lower bounds 30/33 Example of problem admitting an AND-composition: Treewidth.

The proof of Fortnow and Santhanam fails for AND. The conjecture was proved by Drucker in 2012.

What about replacing it with, say, AND? AND-distillation, AND-(cross)-composition: Same as before, but with AND instead of OR.

AND-conjecture: If 3SAT has an AND-distillation, then NP ⊆ coNP/poly.

Corollary: The whole framework works for AND instead of OR.

AND-compositions

In the compositionality framework, we used the OR function to compose instances.

Michal Pilipczuk Kernelization lower bounds 31/33 Example of problem admitting an AND-composition: Treewidth.

The proof of Fortnow and Santhanam fails for AND. The conjecture was proved by Drucker in 2012.

AND-distillation, AND-(cross)-composition: Same as before, but with AND instead of OR.

AND-conjecture: If 3SAT has an AND-distillation, then NP ⊆ coNP/poly.

Corollary: The whole framework works for AND instead of OR.

AND-compositions

In the compositionality framework, we used the OR function to compose instances. What about replacing it with, say, AND?

Michal Pilipczuk Kernelization lower bounds 31/33 The proof of Fortnow and Santhanam fails for AND. The conjecture was proved by Drucker in 2012.

Example of problem admitting an AND-composition: Treewidth. AND-conjecture: If 3SAT has an AND-distillation, then NP ⊆ coNP/poly.

Corollary: The whole framework works for AND instead of OR.

AND-compositions

In the compositionality framework, we used the OR function to compose instances. What about replacing it with, say, AND? AND-distillation, AND-(cross)-composition: Same as before, but with AND instead of OR.

Michal Pilipczuk Kernelization lower bounds 31/33 The proof of Fortnow and Santhanam fails for AND. The conjecture was proved by Drucker in 2012.

AND-conjecture: If 3SAT has an AND-distillation, then NP ⊆ coNP/poly.

Corollary: The whole framework works for AND instead of OR.

AND-compositions

In the compositionality framework, we used the OR function to compose instances. What about replacing it with, say, AND? AND-distillation, AND-(cross)-composition: Same as before, but with AND instead of OR. Example of problem admitting an AND-composition: Treewidth.

Michal Pilipczuk Kernelization lower bounds 31/33 The proof of Fortnow and Santhanam fails for AND. The conjecture was proved by Drucker in 2012. Corollary: The whole framework works for AND instead of OR.

AND-compositions

In the compositionality framework, we used the OR function to compose instances. What about replacing it with, say, AND? AND-distillation, AND-(cross)-composition: Same as before, but with AND instead of OR. Example of problem admitting an AND-composition: Treewidth. AND-conjecture: If 3SAT has an AND-distillation, then NP ⊆ coNP/poly.

Michal Pilipczuk Kernelization lower bounds 31/33 The conjecture was proved by Drucker in 2012. Corollary: The whole framework works for AND instead of OR.

AND-compositions

In the compositionality framework, we used the OR function to compose instances. What about replacing it with, say, AND? AND-distillation, AND-(cross)-composition: Same as before, but with AND instead of OR. Example of problem admitting an AND-composition: Treewidth. AND-conjecture: If 3SAT has an AND-distillation, then NP ⊆ coNP/poly. The proof of Fortnow and Santhanam fails for AND.

Michal Pilipczuk Kernelization lower bounds 31/33 Corollary: The whole framework works for AND instead of OR.

AND-compositions

In the compositionality framework, we used the OR function to compose instances. What about replacing it with, say, AND? AND-distillation, AND-(cross)-composition: Same as before, but with AND instead of OR. Example of problem admitting an AND-composition: Treewidth. AND-conjecture: If 3SAT has an AND-distillation, then NP ⊆ coNP/poly. The proof of Fortnow and Santhanam fails for AND. The conjecture was proved by Drucker in 2012.

Michal Pilipczuk Kernelization lower bounds 31/33 AND-compositions

In the compositionality framework, we used the OR function to compose instances. What about replacing it with, say, AND? AND-distillation, AND-(cross)-composition: Same as before, but with AND instead of OR. Example of problem admitting an AND-composition: Treewidth. AND-conjecture: If 3SAT has an AND-distillation, then NP ⊆ coNP/poly. The proof of Fortnow and Santhanam fails for AND. The conjecture was proved by Drucker in 2012. Corollary: The whole framework works for AND instead of OR.

Michal Pilipczuk Kernelization lower bounds 31/33 Cor: A framework for lower bounds on kernel sizes. Weak cross-composition A weak cross-composition of dimension d from an unpar. problem Q to a par. problem L, is an algorithm that, given ∼-equivalent strings x1, x2,..., xt for some polynomial equivalence relation ∼, in time Pt  ? poly t + i=1 |xi | produces one instance (y, k ) such that ? (y, k ) ∈ L iff xi ∈ Q for at least one i = 1, 2,..., t, ? 1/d t k = t · poly (maxi=1 |xi |).

Weak cross-composition theorem Suppose NP * coNP/poly. If some NP-hard problem Q has a cross-composition of dimension d into L, then L does not admit a compression into any language R with bitsize O(kd−ε) for any ε > 0.

Ex: Vertex Cover has no compression into bitsize O(k2−ε). Note: The 2k-kernel for VC needs O(k2) bits for the encoding.

Weak compositions

Idea: Inspect the proof of FS to get precise estimates.

Michal Pilipczuk Kernelization lower bounds 32/33 Weak cross-composition A weak cross-composition of dimension d from an unpar. problem Q to a par. problem L, is an algorithm that, given ∼-equivalent strings x1, x2,..., xt for some polynomial equivalence relation ∼, in time Pt  ? poly t + i=1 |xi | produces one instance (y, k ) such that ? (y, k ) ∈ L iff xi ∈ Q for at least one i = 1, 2,..., t, ? 1/d t k = t · poly (maxi=1 |xi |).

Weak cross-composition theorem Suppose NP * coNP/poly. If some NP-hard problem Q has a cross-composition of dimension d into L, then L does not admit a compression into any language R with bitsize O(kd−ε) for any ε > 0.

Ex: Vertex Cover has no compression into bitsize O(k2−ε). Note: The 2k-kernel for VC needs O(k2) bits for the encoding.

Weak compositions

Idea: Inspect the proof of FS to get precise estimates. Cor: A framework for lower bounds on kernel sizes.

Michal Pilipczuk Kernelization lower bounds 32/33 Weak cross-composition theorem Suppose NP * coNP/poly. If some NP-hard problem Q has a cross-composition of dimension d into L, then L does not admit a compression into any language R with bitsize O(kd−ε) for any ε > 0.

Ex: Vertex Cover has no compression into bitsize O(k2−ε). Note: The 2k-kernel for VC needs O(k2) bits for the encoding.

Weak compositions

Idea: Inspect the proof of FS to get precise estimates. Cor: A framework for lower bounds on kernel sizes. Weak cross-composition A weak cross-composition of dimension d from an unpar. problem Q to a par. problem L, is an algorithm that, given ∼-equivalent strings x1, x2,..., xt for some polynomial equivalence relation ∼, in time Pt  ? poly t + i=1 |xi | produces one instance (y, k ) such that ? (y, k ) ∈ L iff xi ∈ Q for at least one i = 1, 2,..., t, ? 1/d t k = t · poly (maxi=1 |xi |).

Michal Pilipczuk Kernelization lower bounds 32/33 Ex: Vertex Cover has no compression into bitsize O(k2−ε). Note: The 2k-kernel for VC needs O(k2) bits for the encoding.

Weak compositions

Idea: Inspect the proof of FS to get precise estimates. Cor: A framework for lower bounds on kernel sizes. Weak cross-composition A weak cross-composition of dimension d from an unpar. problem Q to a par. problem L, is an algorithm that, given ∼-equivalent strings x1, x2,..., xt for some polynomial equivalence relation ∼, in time Pt  ? poly t + i=1 |xi | produces one instance (y, k ) such that ? (y, k ) ∈ L iff xi ∈ Q for at least one i = 1, 2,..., t, ? 1/d t k = t · poly (maxi=1 |xi |).

Weak cross-composition theorem Suppose NP * coNP/poly. If some NP-hard problem Q has a cross-composition of dimension d into L, then L does not admit a compression into any language R with bitsize O(kd−ε) for any ε > 0.

Michal Pilipczuk Kernelization lower bounds 32/33 Note: The 2k-kernel for VC needs O(k2) bits for the encoding.

Weak compositions

Idea: Inspect the proof of FS to get precise estimates. Cor: A framework for lower bounds on kernel sizes. Weak cross-composition A weak cross-composition of dimension d from an unpar. problem Q to a par. problem L, is an algorithm that, given ∼-equivalent strings x1, x2,..., xt for some polynomial equivalence relation ∼, in time Pt  ? poly t + i=1 |xi | produces one instance (y, k ) such that ? (y, k ) ∈ L iff xi ∈ Q for at least one i = 1, 2,..., t, ? 1/d t k = t · poly (maxi=1 |xi |).

Weak cross-composition theorem Suppose NP * coNP/poly. If some NP-hard problem Q has a cross-composition of dimension d into L, then L does not admit a compression into any language R with bitsize O(kd−ε) for any ε > 0.

Ex: Vertex Cover has no compression into bitsize O(k2−ε).

Michal Pilipczuk Kernelization lower bounds 32/33 Weak compositions

Idea: Inspect the proof of FS to get precise estimates. Cor: A framework for lower bounds on kernel sizes. Weak cross-composition A weak cross-composition of dimension d from an unpar. problem Q to a par. problem L, is an algorithm that, given ∼-equivalent strings x1, x2,..., xt for some polynomial equivalence relation ∼, in time Pt  ? poly t + i=1 |xi | produces one instance (y, k ) such that ? (y, k ) ∈ L iff xi ∈ Q for at least one i = 1, 2,..., t, ? 1/d t k = t · poly (maxi=1 |xi |).

Weak cross-composition theorem Suppose NP * coNP/poly. If some NP-hard problem Q has a cross-composition of dimension d into L, then L does not admit a compression into any language R with bitsize O(kd−ε) for any ε > 0.

Ex: Vertex Cover has no compression into bitsize O(k2−ε). Note: The 2k-kernel for VC needs O(k2) bits for the encoding.

Michal Pilipczuk Kernelization lower bounds 32/33 Composition framework does not apply. There are problems that have polynomial Turing kernels, but no polynomial kernel under NP * coNP/poly. Open: A technique for ruling out Turing kernels.

Message 1: What is hard for kernelization is unbounded choice. Message 2: Turning intuition into a lower bound via cross-composition often needs a good understanding of the problem. Turing kernelization: A Turing kernel is a poly-time algorithm that has oracle access to solving instances of size poly(k).

Complexity theory for kernelization: Using PPT as reductions, one can build a hierarchy of complexity classes [HKSWW]. Thank you for your attention!

Tikz faces based on a code by Raoul Kessels, http://www.texample.net/tikz/examples/emoticons/,

under Creative Commons Attribution 2.5 license (CC BY 2.5)

Conclusions

Composition: a versatile framework for proving lower bounds for polynomial kernelization.

Michal Pilipczuk Kernelization lower bounds 33/33 Composition framework does not apply. There are problems that have polynomial Turing kernels, but no polynomial kernel under NP * coNP/poly. Open: A technique for ruling out Turing kernels.

Message 2: Turning intuition into a lower bound via cross-composition often needs a good understanding of the problem. Turing kernelization: A Turing kernel is a poly-time algorithm that has oracle access to solving instances of size poly(k).

Complexity theory for kernelization: Using PPT as reductions, one can build a hierarchy of complexity classes [HKSWW]. Thank you for your attention!

Tikz faces based on a code by Raoul Kessels, http://www.texample.net/tikz/examples/emoticons/,

under Creative Commons Attribution 2.5 license (CC BY 2.5)

Conclusions

Composition: a versatile framework for proving lower bounds for polynomial kernelization. Message 1: What is hard for kernelization is unbounded choice.

Michal Pilipczuk Kernelization lower bounds 33/33 Composition framework does not apply. There are problems that have polynomial Turing kernels, but no polynomial kernel under NP * coNP/poly. Open: A technique for ruling out Turing kernels.

Turing kernelization: A Turing kernel is a poly-time algorithm that has oracle access to solving instances of size poly(k).

Complexity theory for kernelization: Using PPT as reductions, one can build a hierarchy of complexity classes [HKSWW]. Thank you for your attention!

Tikz faces based on a code by Raoul Kessels, http://www.texample.net/tikz/examples/emoticons/,

under Creative Commons Attribution 2.5 license (CC BY 2.5)

Conclusions

Composition: a versatile framework for proving lower bounds for polynomial kernelization. Message 1: What is hard for kernelization is unbounded choice. Message 2: Turning intuition into a lower bound via cross-composition often needs a good understanding of the problem.

Michal Pilipczuk Kernelization lower bounds 33/33 Composition framework does not apply. There are problems that have polynomial Turing kernels, but no polynomial kernel under NP * coNP/poly. Open: A technique for ruling out Turing kernels. Complexity theory for kernelization: Using PPT as reductions, one can build a hierarchy of complexity classes [HKSWW]. Thank you for your attention!

Tikz faces based on a code by Raoul Kessels, http://www.texample.net/tikz/examples/emoticons/,

under Creative Commons Attribution 2.5 license (CC BY 2.5)

Conclusions

Composition: a versatile framework for proving lower bounds for polynomial kernelization. Message 1: What is hard for kernelization is unbounded choice. Message 2: Turning intuition into a lower bound via cross-composition often needs a good understanding of the problem. Turing kernelization: A Turing kernel is a poly-time algorithm that has oracle access to solving instances of size poly(k).

Michal Pilipczuk Kernelization lower bounds 33/33 There are problems that have polynomial Turing kernels, but no polynomial kernel under NP * coNP/poly. Open: A technique for ruling out Turing kernels. Complexity theory for kernelization: Using PPT as reductions, one can build a hierarchy of complexity classes [HKSWW]. Thank you for your attention!

Tikz faces based on a code by Raoul Kessels, http://www.texample.net/tikz/examples/emoticons/,

under Creative Commons Attribution 2.5 license (CC BY 2.5)

Conclusions

Composition: a versatile framework for proving lower bounds for polynomial kernelization. Message 1: What is hard for kernelization is unbounded choice. Message 2: Turning intuition into a lower bound via cross-composition often needs a good understanding of the problem. Turing kernelization: A Turing kernel is a poly-time algorithm that has oracle access to solving instances of size poly(k). Composition framework does not apply.

Michal Pilipczuk Kernelization lower bounds 33/33 Open: A technique for ruling out Turing kernels. Complexity theory for kernelization: Using PPT as reductions, one can build a hierarchy of complexity classes [HKSWW]. Thank you for your attention!

Tikz faces based on a code by Raoul Kessels, http://www.texample.net/tikz/examples/emoticons/,

under Creative Commons Attribution 2.5 license (CC BY 2.5)

Conclusions

Composition: a versatile framework for proving lower bounds for polynomial kernelization. Message 1: What is hard for kernelization is unbounded choice. Message 2: Turning intuition into a lower bound via cross-composition often needs a good understanding of the problem. Turing kernelization: A Turing kernel is a poly-time algorithm that has oracle access to solving instances of size poly(k). Composition framework does not apply. There are problems that have polynomial Turing kernels, but no polynomial kernel under NP * coNP/poly.

Michal Pilipczuk Kernelization lower bounds 33/33 Complexity theory for kernelization: Using PPT as reductions, one can build a hierarchy of complexity classes [HKSWW]. Thank you for your attention!

Tikz faces based on a code by Raoul Kessels, http://www.texample.net/tikz/examples/emoticons/,

under Creative Commons Attribution 2.5 license (CC BY 2.5)

Conclusions

Composition: a versatile framework for proving lower bounds for polynomial kernelization. Message 1: What is hard for kernelization is unbounded choice. Message 2: Turning intuition into a lower bound via cross-composition often needs a good understanding of the problem. Turing kernelization: A Turing kernel is a poly-time algorithm that has oracle access to solving instances of size poly(k). Composition framework does not apply. There are problems that have polynomial Turing kernels, but no polynomial kernel under NP * coNP/poly. Open: A technique for ruling out Turing kernels.

Michal Pilipczuk Kernelization lower bounds 33/33 Thank you for your attention!

Tikz faces based on a code by Raoul Kessels, http://www.texample.net/tikz/examples/emoticons/,

under Creative Commons Attribution 2.5 license (CC BY 2.5)

Conclusions

Composition: a versatile framework for proving lower bounds for polynomial kernelization. Message 1: What is hard for kernelization is unbounded choice. Message 2: Turning intuition into a lower bound via cross-composition often needs a good understanding of the problem. Turing kernelization: A Turing kernel is a poly-time algorithm that has oracle access to solving instances of size poly(k). Composition framework does not apply. There are problems that have polynomial Turing kernels, but no polynomial kernel under NP * coNP/poly. Open: A technique for ruling out Turing kernels. Complexity theory for kernelization: Using PPT as reductions, one can build a hierarchy of complexity classes [HKSWW].

Michal Pilipczuk Kernelization lower bounds 33/33 Conclusions

Composition: a versatile framework for proving lower bounds for polynomial kernelization. Message 1: What is hard for kernelization is unbounded choice. Message 2: Turning intuition into a lower bound via cross-composition often needs a good understanding of the problem. Turing kernelization: A Turing kernel is a poly-time algorithm that has oracle access to solving instances of size poly(k). Composition framework does not apply. There are problems that have polynomial Turing kernels, but no polynomial kernel under NP * coNP/poly. Open: A technique for ruling out Turing kernels. Complexity theory for kernelization: Using PPT as reductions, one can build a hierarchy of complexity classes [HKSWW]. Thank you for your attention!

Tikz faces based on a code by Raoul Kessels, http://www.texample.net/tikz/examples/emoticons/,

under Creative Commons Attribution 2.5 license (CC BY 2.5)

Michal Pilipczuk Kernelization lower bounds 33/33