<<

POLARS AND DUAL

VERA ROSHCHINA

Abstract. The goal of this note is to remind the basic definitions of convex sets and their polars. For more details see the classic references [1, 2] and [3] for polytopes.

1. Convex sets and convex hulls A convex set has no indentations, i.e. it contains all segments connecting points of this set.

Definition 1 (Convex set). A set C ⊆ Rn is called convex if for any x, y ∈ C the point λx+(1−λ)y also belongs to C for any λ ∈ [0, 1]. The sets shown in Fig. 1 are convex: the first set has a smooth , the second set is a

Figure 1. Convex sets convex . Singletons and lines are also convex sets, and linear subspaces are convex. Some nonconvex sets are shown in Fig. 2.

Figure 2. Nonconvex sets

A connecting two points x, y ∈ Rn is denoted by [x, y], so we have [x, y] = {(1 − α)x + αy, α ∈ [0, 1]}. Likewise, we can consider open line segments (x, y) = {(1 − α)x + αy, α ∈ (0, 1)}. and half open segments such as [x, y) and (x, y]. Clearly any line segment is a convex set. The notion of a line segment connecting two points can be generalised to an arbitrary finite set n n by means of a . Given a finite set {x1, x2, . . . , xp} ⊂ R , a point x ∈ R is the convex combination of x1, x2, . . . , xp if it can be represented as p p X X x = αixi, αi ≥ 0 ∀i = 1, . . . , p, αi = 1. i=1 i=1 Given an arbitrary set S ⊆ Rn, we can consider all possible finite convex combinations of points from this set. 1 2 VERA ROSHCHINA

Definition 2 (). The convex hull co S of a set S ⊆ Rn is the set of all convex combinations of finite of S, ( p p )

X X co S = x = αixi p ∈ N, xi ∈ S, αi ≥ 0 ∀i = 1, . . . , p, αi = 1 . i=1 i=1 Example 1. A standard in Rn is the convex hull of the coordinate vectors (see Fig. 3).

x2 x3 (0,0,1) (0,1)

(0,1,0)

x2

(1,0,0) (1,0) x1

x1

Figure 3. Standard simplices ∆2 and ∆3.

We have explicitly

∆n = co {e1, . . . , en} ( n n )

X X = αiei αi ≥ 0 ∀ i = 1, . . . , n, αi = 1 i=1 i=1 ( n ) X = (x1, . . . , xn) | xi ≥ 0 ∀ i = 1, . . . , n; xi = 1 , i=1 where by ei, i ∈ {1, . . . , n} we denote the coordinate vectors:

e1 = (1, 0,..., 0), e2 = (0, 1, 0,..., 0), . . . , en = (0,..., 0, 1). It is not difficult to observe that the standard simplex is a convex set. Indeed, for every two points x and y in ∆n we have n n X X xi = 1, yi = 1, xi, yi ≥ 0 ∀ i ∈ {1, . . . , n}. i=1 i=1 Therefore, for any λ ∈ [0, 1] and z = (1 − λ)x + λy n n n n X X X X zi = [(1 − λ)xi + λyi] = (1 − λ) xi + λ yi = (1 − λ) + λ = 1, i=1 i=1 i=1 i=1 and at the same time zi = (1 − λ)xi + λyi ≥ 0, so z ∈ ∆n. This proof can be generalised to show that the convex hull of any set is convex.

Lemma 1. The convex hull co S of any set S ⊆ Rn is a convex set. Proof. Let x, y ∈ co S. By the definition of a convex hull, we have the representations r s X X x = αixi, y = βjyj, i=1 j=1 POLARS AND DUAL CONES 3 where αi’s and βj’s are the relevant convex coefficients, and xi, yj ∈ S for i = 1, . . . , r and j = 1, . . . , s. For z = µx + (1 − µ)y with µ ∈ [0, 1] we have r s r s X X X X (1) z = µ αixi + (1 − µ) βjyj = µαi xi + (1 − µ)βj yj, i=1 j=1 i=1 |{z} j=1 | {z } γi γj+r so we can define new coefficients γ1, . . . , γr+s as follows

γi := µαi i = 1, . . . , r; γj+r := (1 − µ)βj, j = 1, . . . , s. It is not difficult to observe that r+s X γi ≥ 0 ∀ i ∈ {1, . . . , r + s} and γi = 1, i=1 hence, (1) is a representation of z as the convex combination of r + s elements from S, and we therefore have z ∈ co S by the definition of a convex hull. The result now follows from the arbitrariness of our choice of x, y and µ.  The convex hull of a finite set is called a polytope. Platonic solids are classic examples of polytopes (see Fig. 4).

Figure 4. Platonic solids.

The convex hull of S ⊆ Rn can be defined equivalently as the smallest convex set containing S. To guarantee that such definition is correct, we need the following result. Theorem 1. Let S ⊆ Rn, then co S is the intersection of all convex sets in Rn containing S. Proof. First observe that S ⊆ co S, since for every point x in S we have x = 1 · x, a convex combination of one element. Furthermore, the set co S is convex by Lemma 1. We will show that any convex set that contains S also contains co S, and hence co S is the smallest convex set containing S. Assume the contrary. Then there exists a convex set C such that S ⊆ C and at the same time there is x ∈ co S \ C. Since x ∈ co S, we have

(2) x = α1x1 + α2x2 + ··· + αpxp, where xi ∈ S ⊆ C for i ∈ {1, . . . , p} and α1, . . . , αp are convex combination coefficients: p X αi = 1, αi ≥ 0 ∀ i ∈ {0, . . . , p}. i=1

Without loss of generality we can assume that αi 6= 0 for all i ∈ {1, . . . , p} (otherwise we discard the points with zero coefficients). If p = 1, then x = 1 · x ∈ S ⊆ C. If p = 2, then x = (1 − λ)x1 + λx2 ∈ C by the convexity of C. It remains to consider the case when p > 2. Let α α0 := p−1 ∈ (0, 1), αp−1 + αp and define a new quantity 0 0 0 αp−1 αp xp−1 := α xp−1 + (1 − α )xp = xp−1 + xp ∈ C αp−1 + αp αp−1 + αp 4 VERA ROSHCHINA by the definition of convexity. Also note that 0 αp−1xp−1 + αpxp = (αp−1 + αp)xp−1, 0 and letting αp−1 = αp−1 + αp we can rewrite (2) as a convex combination of p − 1 points from C: 0 0 x = α1x1 + α2x2 + ··· αp−1xp−1. We can continue this reduction process until the number of points in the representation is reduced to at most two, and conclude as before that x ∈ C, which contradicts the assumption. Therefore, there is no smaller convex set than co S that contains C.  Corollary 1. A set S is convex if and only if S = co S. Theorem 1 allows us to give two additional alternative definitions of a convex set. Definition 3 (Convex hull as the smallest convex set). Let S ⊆ Rn. The convex hull co S of the set S is the smallest convex set that contains S. Definition 4 (Convex hull as the intersection). Let S ⊆ Rn. The convex hull co S of the set S is the intersection of all convex sets that contain S. Separation theorem is one of the most important tools in convex . We give the statement of this result without proof for now. Theorem 2 (Separation). Let C ⊆ Rn be a nonempty closed convex set, and let z ∈ Rn be such that z∈ / C. Then there exists u ∈ Sn−1 and x0 ∈ C such that

(3) hu, z − x0i > 0; hu, x − x0i ≤ 0 ∀x ∈ C, i.e. the point z can be strictly separated from the set C by a . Here we use the standard inner product hx, yi = xT y.

2. Polars of convex sets Definition 5 (Polar). Let C ⊆ Rn be a convex set. Its polar is ◦ n C = {s ∈ R | hs, xi ≤ 1 ∀x ∈ C}. It is not difficult to observe that a polar of any set (not necessarily convex) is a closed convex set. Example 2. Consider the unit in the , B = {x | kxk ≤ 1}. The unit ball is self polar or self dual, i.e. B◦ = B. To see this, we can use the definition directly, so   ◦ n n (4) B = {y ∈ R | hy, xi ≤ 1 ∀x ∈ B} = y ∈ R | suphy, xi ≤ 1 . x∈B We have from the Cauchy-Schwarz inequality (5) hx, yi ≤ kxk · kyk ≤ 1 · kyk = kyk ∀x ∈ B, on the other hand, for y 6= 0 andx ¯ = y/kyk we have hy, yi (6) hx,¯ yi = = kyk kyk The equations (5) and (6) together yield suphy, xi = kyk, x∈B and from (4) we have   ◦ n B = y ∈ R | suphy, xi ≤ 1 = {y | kyk ≤ 1} = B. x∈B POLARS AND DUAL CONES 5

Example 3. The polar of a centred at zero is an octahedron. Consider the cube

C = {(x1, x2, x3) | − 1 ≤ xi ≤ 1 ∀ i ∈ 1 : 3}. We have explicitly

suphy, xi = sup (x1y1 + x2y2 + x3y3) = |y1| + |y2| + |y3| = kyk1, x∈C −1≤xi≤1 hence,   ◦ n C = y ∈ R | suphy, xi ≤ 1 x∈C n = {y ∈ R | kyk1 ≤ 1} , an octahedron bounded by the 8 planes ±y1 ± y2 ± y3 ≤ 1. Observe that if we make the cube larger or smaller, so

Cα := {(x1, x2, x3) | − α ≤ xi ≤ α ∀ i ∈ 1 : 3} for some positive α, the same kind of computation yields ◦  n −1 C = y ∈ R | kyk1 ≤ α . Note that the polars of other Platonic solids are also (appropriately positioned and scaled) Platonic solids. Next we will learn how to construct polars of arbitrary convex sets. Lemma 2. Let a closed convex set C be defined as the intersection of half-spaces, n C = {x ∈ R | hx, ati ≤ bt > 0, t ∈ T }, where T is an arbitrary index set, not necessarily finite. Then ◦ C = cl co ({at/bt, t ∈ T } ∪ {0}) , where cl co is the closed convex hull ( of the convex hull). Proof. Denote D := cl co ({at/bt, t ∈ T } ∪ {0}) . Observe that for every x ∈ C by the definition of C we have

hat/bt, xi ≤ 1, ◦ ◦ ◦ hence, at/bt ∈ C . Also obviously 0 ∈ C . Since C is convex and closed, D ⊆ C◦. ◦ To show the reverse inclusion, assume the contrary: there is z0 ∈ C \ D. By the separation theorem (Theorem 2) there exists u ∈ Rn such that

hu, z0i > suphu, xi ≥ 0, x∈D the last inequality due to 0 ∈ D. Without loss of generality (scaling u if necessary), we have

hu, z0i > 1 ≥ suphu, xi ≥ 0. x∈D Observe that 1 ≥ suphu, xi = max{0, suphat/bt, ui}, x∈D t∈T therefore hat, ui ≤ bt ∀t ∈ T, ◦ and hence u ∈ C. On the other hand, we have z0 ∈ C and hu, z0i > 1, which is impossible. This contradiction finishes the proof.  In view of Lemma 2 it is evident that dodecahederon and are polars of each other. We will consider another example. 6 VERA ROSHCHINA

Example 4. Let C ⊂ R3 be a cylinder, 2 2 C = {(x1, x2, x3 | x1 + x2 ≤ 1, −1 ≤ x3 ≤ 1}. We will show using Lemma 2 that ◦ 2 2 C = co{(0, 0, 1), (0, 0, −1), {(x1, x2, 0), x1 + x2 = 1}}, ◦ i.e. C is the convex hull of a circle located at x3 = 0 and two points (0, 0, 1) and (0, 0, −1), see Fig. 5.

Figure 5. A cylinder and its polar

First we need to obtain the explicit representation of the cylinder C in terms of at and bt. The last constraint −1 ≤ x3 ≤ 1 translates into two inequalities

x3 ≤ 1, −x3 ≤ 1, 0 0 0 0 so we can let a1 = (0, 0, 1), a2 = (0, 0, −1), b1 = b2 = 1. The inequality 2 2 x1 + x2 ≤ 1 can be equivalently written as the system of linear inequalities

x1y1 + x2y2 ≤ 1 ∀y, kyk = 1. From here we have the second family 2 2 (at, bt) = (t, 1), t = (t1, t2), t1 + t2 = 1. By Lemma 2 we have ◦ 0 0 C = cl co{03, a1, a2, {at, t ∈ T }} 2 2 = co{(0, 0, 1), (0, 0, −1), {(t1, t2, 0), t1 + t2 = 1}}. 2 2 This is the convex hull of the circle x1 + x2 = 1 and the two points (0, 0, 1) and (0, 0, −1). We can consider the operation of taking polars of polars, so we can have a bipolar C◦◦. We summarise the properties of polars in the next statement (see [3, Theorem 2.11]).

Theorem 3 (Elementary properties of polars). Let P,Q ⊆ Rn. We have the following properties of their polars: (i) P ⊆ Q implies P ◦ ⊇ Q◦ and P ◦◦ ⊆ Q◦◦, (ii) P ⊆ P ◦◦, (iii) P ◦ is convex and closed, (iv) 0 ∈ P ◦. POLARS AND DUAL CONES 7

Lemma 3 (Double Polar). For any C ⊆ Rn one has C◦◦ := cl co{{0} ∪ C}. Proof. Observe that C◦◦ is a closed convex set. We will first show that (7) cl co{{0} ∪ C} ⊆ C◦◦. Indeed, h0, yi ≤ 1 ∀y ∈ C◦, and for all x ∈ C hx, yi ≤ 1 ∀y ∈ C◦ by the definition of polar, and we deduce (7) first by observing that the inequalities hold for the convex hull, and then for the closure of this convex hull. It remains to show that C◦◦ ⊆ cl co{{0} ∪ C}. Assume that this is not true. Then there exists z ∈ C◦◦ \ cl co{{0} ∪ C}. We can separate z strictly from cl co{{0} ∪ C} by the separation theorem. There is a vector u such that hu, zi > sup hu, xi. x∈cl co{{0}∪C} Observe that the supremum on the right hand side is nonnegative (because it is taken over a set that contains zero), hence, w.l.o.g. we can assume (by scaling u) that hu, zi > 1 ≥ sup hu, xi. x∈cl co{{0}∪C} ◦ ◦◦ By the definition of polar we have u ∈ C , but then z∈ / C , which contradicts the assumption.  3. Dual cones Definition 6 (). A set K ⊆ Rn is called a cone if for any point x ∈ K the ray

R++x = {λx λ > 0} is a of K. Cones are collections of rays emanating from the origin (that may not include the origin). A is a cone that is also a convex set. When K is a cone, its polar is a cone as well, and we can write ◦ n (8) K = {s ∈ R | hs, xi ≤ 0 ∀x ∈ K}, i.e. in the definition one can be replaced by zero. The equivalence is not difficult to see from the fact that K is a cone. Let us note some straightforward properties. • The set K◦ is a closed convex cone. • If K is a subspace, then K◦ coincides with its orthogonal , K⊥ = K◦. • The only possible element in K ∩ K◦ is 0. We can compute the double polar of a cone K by taking the polar of the polar to obtain K◦◦. This set is called a bipolar cone of K. It does not make sense to consider any further polars of polars due to the following statement. Lemma 4. If K is a nonempty convex cone, then K◦◦ = cl K. Proof. Follows from Lemma 3 observing that 0 ∈ cl K.  8 VERA ROSHCHINA

Polars of polyhedral cones have a particularly convenient and explicit representation. We have the following result. Lemma 5. Let K be a polyhedral cone, i.e.

K = {x | hx, aii ≤ 0 ∀ i ∈ {1, . . . , m}}. Then ◦ K = cone{a1, . . . , am}. Proof. Follows directly from the representation ◦ n K = {s ∈ R | hs, xi ≤ 0 ∀x ∈ K} and the separation argument.  Definition 7. Let K ⊆ Rn be a closed convex cone. Its dual cone is defined as ∗ n K = {u ∈ R | hx, ui ≥ 0 ∀x ∈ K}. It follows directly from the definition that the dual of a linear subspace is its orthogonal com- plement. Also note that K∗ = −K◦ for any convex cone K. Definition 8. A closed convex cone K ∈ Rn is called self-dual if K = K∗. n Some notable examples of self-dual cones are the nonnegative orthant R+ = {x | xi ≥ 0 ∀ i ∈ {1, . . . , n}}, the second-order cone n Ln = {x = (¯x, x0) ∈ R × R | kx¯k ≤ x0} n and the cone of positive semidefinite matrices S+. References [1] Jean-Baptiste Hiriart-Urruty and Claude Lemar´echal. Fundamentals of . Grundlehren Text Edi- tions. Springer-Verlag, Berlin, 2001. Abridged version of ıt Convex analysis and minimization algorithms. I and II. [2] R. Tyrrell Rockafellar. Convex analysis. Princeton Mathematical Series, No. 28. Princeton University Press, Princeton, N.J., 1970. [3] G¨unter M. Ziegler. Lectures on polytopes, volume 152 of Graduate Texts in Mathematics. Springer-Verlag, New York, 1995.