Min Common/Max Crossing Duality: a Geometric View of Conjugacy
Total Page:16
File Type:pdf, Size:1020Kb
August 2008 LIDS-P-2796 Revised Jan. 2009 Min Common/Max Crossing Duality: A Geometric View of Conjugacy in Convex Optimization1 Dimitri P. Bertsekas2 Abstract We provide a unifying framework for the visualization and analysis of duality, and other issues in convex optimization. It is based on two simple optimization problems that are dual to each other: the min common point problem and the max crossing point problem. Within the insightful geometry of these problems, several of the core issues in convex analysis become apparent and can be analyzed in a unified way. These issues are related to conditions for strong duality in constrained optimization and zero sum games, existence of dual optimal solutions and saddle points, existence of subgradients, and theorems of the alternative. The generality and power of our framework is due to its close connection with the Legendre/Fenchel conjugacy framework. However, the two frameworks offer complementary starting points for analysis and provide alternative views of the geometric foundation of duality: conjugacy emphasizes functional/algebraic descriptions, while min common/max crossing emphasizes set/epigraph descriptions. The min common/max crossing framework is simpler, and seems better suited for visualizing and investigating questions of strong duality and existence of dual optimal solutions. The conjugacy framework, with its emphasis on functional descriptions, is more suitable when mathematical operations on convex functions are involved, and the calculus of conjugate functions can be brought to bear for analysis or computation. 1 Supported by NSF Grant ECCS-0801549. The MC/MC framework was initially developed in joint research with A. Nedic, and A. Ozdaglar. This research is described in the book by Bertsekas, Nedic, and Ozdaglar [BNO03]. The present account is an improved and more comprehensive development. In particular, it contains some more streamlined proofs and some new results, particularly in connection with minimax problems and separable problems. 2 Dimitri Bertsekas is with the Dept. of Electr. Engineering and Comp. Science, M.I.T., Cambridge, Mass., 02139. 1 Contents 1. Introduction . p. 3 2. General Results and Some Special Cases . p. 7 Connection to Conjugate Convex Functions . p. 8 General Optimization Duality . p. 9 Optimization with Inequality Constraints . p. 9 Fenchel Duality Framework . p. 10 Minimax Problems . p. 11 3. Strong Duality Theorems . p. 16 Conditions for Strong Duality . p. 16 Existence of Dual Optimal Solutions . p. 18 Special Cases Involving Convexity and/or Compactness . p. 21 MC/MC and Polyhedral Convexity . p. 23 4. Applications . p. 28 Minimax Theorems . p. 28 Saddle Point Theorems . p. 33 A Nonlinear Version of Farkas' Lemma . p. 39 Convex Programming Duality . p. 43 Theorems of the Alternative . p. 46 Subdifferential Theory . p. 49 5. Nonconvex Problems and Estimates of the Duality Gap . p. 51 Separable Optimization Problems and their Geometry . p. 51 Estimates of Duality Gap in Separable Problems . p. 55 Estimates of Duality Gap in Minimax . p. 58 6. References . p. 59 7. Appendix: Convex Analysis Background . p. 60 Relative Interior and Closure . p. 60 Recession Cones . p. 61 Partial Minimization of Convex Functions . p. 62 Closures of Convex Functions . p. 64 Hyperplane Theorems . p. 67 Conjugate Functions . p. 68 Minimax Theory . p. 69 2 1. Introduction 1. INTRODUCTION Duality in optimization is often viewed as a manifestation of a fundamental dual/conjugate description of a closed convex set as the intersection of all closed halfspaces containing the set. When specialized to the epigraph of a function f : n [ ; ], this description leads to the formalism of the conjugate convex < 7! −∞ 1 function of f, which is defined by h(y) = sup x0y f(x) ; (1.1) x n − 2< and permeates much of convex optimization theory. In this paper, we focus on a framework referred to as min common/max crossing (MC/MC for short). It is related to the conjugacy framework, but does not involve an algebraic definition such as Eq. (1.1), and it is structured to emphasize optimization duality aspects. For this reason it is simpler, and seems better suited for geometric visualization and analysis in many important convex optimization contexts. Our framework aims to capture the most essential optimization-related features of the preceding conjugate description of closed convex sets in two simple geometrical problems, defined by a nonempty subset M of n+1. < (a) Min Common Point Problem: Consider all vectors that are common to M and the (n + 1)st axis. We want to find one whose (n + 1)st component is minimum. (b) Max Crossing Point Problem: Consider nonvertical hyperplanes that contain M in their corresponding \upper" closed halfspace, i.e., the closed halfspace whose recession cone contains the vertical halfline (0; w) w 0 (see Fig. 1.1). We want to find the maximum crossing point of the (n + 1)st axis with j ≥ such a hyperplane. Figure 1.1 suggests that the optimal value of the max crossing problem is no larger than the optimal value of the min common problem, and that under favorable circumstances the two optimal values are equal. Our purpose in this paper is to formalize the analysis of the two problems, to provide conditions that guarantee equality of their optimal values and the existence of their optimal solutions, and to show that they can be used to develop much of the core theory of convex analysis and optimization in a unified way. Mathematically, the min common problem is minimize w subject to (0; w) M: 2 Its optimal value is denoted by w∗, i.e., w∗ = inf w: (0;w) M 2 3 ! Negative Halfspace {x | a!x ≥ b} Negative Halfspace {x | a x ≥ b} ! Negative Halfspace {x | a!x ≥ b} Neg! ative HalfsPpaocseiti{vxe |Haaxlfs≥pabc}e {x | a!x ≤ b} Positive Halfspace {x | a x ≤ b} ! Positive Halfspace {x | a!x ≤ b} Positive Halfspace {x | a x ≤ b} ⊥ aff(C) C C ∩ S⊥ d z x aff(C) C C ∩ S d z x ⊥ ! ⊥ aff(C) CNegCat∩ivSe Hadlfspzacxe {x | a x ≥affb(}C) C C ∩! S d z x ! ! ! ! ! !Hyp! erplane {x | a x = b} = {x | a x = a x} Negative HalHfsyppaecrep{laxn|eaP{oxsi≥|taivbex} H=ablf}sp=ac{ex{|xa|xa=x a≤! xb}} ! ! ! ! ! ! ! NHeygpa! etrivpelaHneal{fsxp|acaex{=x |ba} x=≥{xb}| a x = a x} Negative HalHfsyPppaoecsrietpi{lvaxen|Heaa{lxfs≥|pabcx}e ={xb|}a=x{≤x |b}a x = a x} ! ! !! Negative Hal∗fsPpaocsieti{vxe |Ha∗alxfs≥pabc}e {x | a x ≤ b} ∗ ∗ NPeogsaittiivvee HHaallffssppaaccee {{xx||aaxx≤≥bb}} x x f!αx! +! (1 − α)x" ⊥ x x f!αx + (1 − α)x" ! ∗ PNoseitgiavteivHe∗aHlfasplfaspceac{ex{|xaaff|x(aC≤x) ≥b}Cb}C∗ ∩ S d z∗ x Positive Halfspace {x | a x ≤ b}x x f αx + (1⊥− α)x ! x x f!αx + (1 − α)x" aff(PCo)sitC!iveCHa∩lfSspacde {zx"|xa x ≤ b} ⊥ ⊥ Hyperplane {x | a!x = b} = a{ffx(|Ca)!xC= aC!x∩} S d z x aff(C) !C C ∩! S d z x ! ! ∗ ⊥ ∗ Hyperplane {x | a x ⊥= b} = {x | a x =aaffx(Cx}) xC! C ∩ S d z x x x Negative Halfspacaeff{(xC|)aCx ≥Cb∩} S Nedgaztivxe Halfspace {x | a x ≥ b!} ⊥ ∗! ! ! ! ! ! ∗ Hyperapffl(aCne) {Cx | Ca x∩=S b} d= {zxxx| xa x = a x} PHoyspiteirvpelaHnaelf{sxpa|cae x{x=|ba}x=≤{xb}| a xP=osaitxivx}e Hxalfspac! e {xx|∗a!xx≤ fb}α! x∗ +!(1 − α)x ! x0 − d! Hx1ypxe2r! pxlaxn4e−{xd |xa5 −x =d db} = {x0| −a! xd =x1axx2}x x4 −"d x5 − d d Hyperplane {x | a x = b} = {x | a x∗ = a x} ∗ ! ! ! x Hxypefrp!lαaxne +{x(01|−a−xdα=x)x1"bx}2=x{xx4|−∗adxx=5 a−xd}d ∗ x0 − d ∗x1 x2 x x4 −∗ d x⊥5 − d d x x f!αx + (1 − α)x" axff(Cx) Cf!αCx∩+S (1 −d αz)xx" ∗aff(C) C C∗ ∩ S⊥ d z x ∗ ∗ x x f!αx + (1 ∗− α)x" x x f!αx + (1 − α)x" ∗ ∗x x ! ! ! !∗ !!x x f!αx +! (1 −!α)x" Hyperplane {x | a x = b} = N{xeg|aatixve=Haalxfs}pxacex{x | a x ≥ b} ∗ Negative HalfHspyap!ceerp{lxan| ea {xx≥| ab}x = b} = {x | a x = a x} !∗ x0 − d x!1 x2 x! x0 4 −1 d2x53− d d x x xˆ0 xˆ1 xˆ2 xˆ3 NegatNiveegHataivlfesPpHaoacsleiftsi{Ppvxxeaocs|Hieaxtai{vxlxfes≥p|Haabac}lxefs{≥pxabc|}ea{xx≤| ab}xˆ ≤xˆb}xˆ xˆ ∗ x0 − ∗d x1 x2 x x!4∗!−∗ d x!5xˆ!−0 xˆd1dxˆ2 xˆ3 x !x !xˆ0 xˆ1 xˆ2 xˆ3 NePgaotsiivteiPvNHeoseaHigltafxisvltpfeisvapHecaxeaHcl{efaxsf{lpf!x|saxαpac|axeaxcx{ex≥+x{≤x|(b1}ab|−}xa αx≤N)e≥xbg}"abt}ivexN0He−aglafdstp∗ixva1ecexH2{axlxfs|xpa4acx−∗e≥∗d{xbx}|5 a−xd≥d b} 0 1 2 4 5 x x f!αxxx+ (1 − α)x" x − d x x x x − d x − d d! ! ⊥ ⊥ ! ! Positive HPaolsfsitpivaceeH{axlf|spaaxce≤{xxb0}|−a dxaPx≤ffo1(sbCixt}2i)avffxeC(HPxC4oa)Csl−fistC∩pidvaSe!xcCe5H∩−{adxlSfds|pzdaacxde≤{zxb}|xa x ≤ b} x0 − d x1 x2 x x4 − d x5 − d d Negative Halfspac⊥e {x | a x ≥ b} aff(C)affC(Cx)0C−C∩dSCx⊥1∩xdS2 zx xxd4a−z0 dax1xa52−ad3 d a0 a1 a2 a3 ∗ ! ! !! !! ! ∗ HyperHpxylapPnxeoersp{iltxaivn|eeaH0{⊥xxal=1|fsapb2}axc==e3⊥{{bxx} ||=a{xx≤=| aba}xx=}xˆ0axˆx1}xˆ2 xˆa3⊥0 a1 a2 a⊥3 aff(C) C! C ∩ aS a daz!ax ! x x Hyperplanaeff!{(xC|) aCxaff=C(Cb∩})S=xCˆ!{0xxdˆC|1a∩zxˆ!x2Sxx=ˆ3adx}z xaff(C) CaffC(C∩) SC Cd ∩z Sx d z x Hyperplane {x | a x = b} = {x | a x = a x} 0 1 2 3 x0 − d x1 x2 x x4 − d x5 − d d ∗ ! ∗ ⊥ xˆ xˆ xˆ xˆ Hyperplane {x | a!x =xˆ0Nbx}ˆe1g=axˆ!t{2ivxxˆex|30Ha−!axlfd=s∗pxa1c!xex}2{!xxaffx|xa(4Cx∗−f!)!≥dαCxb!}5C+−∩(d1Sd−! αd)x"z !x ! ! ! HyperplaHneyp{exrp| laanxe={xb}| =a x{x=|ba}x=xH!=y{pxaex|xrpa}lxfaHn!=αye!px{aexrx+p|}la(1nxexˆ−=0{αxˆb)1}|xax="ˆ2x{xˆx=3 |ba} x=={xa|xa}x = a x} xˆ0∗ xˆ1 xˆ∗2 xˆ3 ∗ ∗ Negative HxaPlfsopsxiaxtcivefe{xHxαxa|lafs+!pxαa(≥xc1e−b+{}xα()1|xa−xα≤)xb"} 0 1 2 3 ! ! ! " xˆ xˆ!a0 xˆa1 xaˆ! 2 a3 f(z) Positive H∗ alfspaHcey∗p{exr∗p| laanxe≤{xb∗}| a x = b} f=(z{∗)x | a x =∗a x∗} ∗ x x fxαx x+ a(f10 −αa1xαa)+2x a(31 − α)x x ∗ x fx αxx+f(1αf−x(zα))+x(1 − α)x ! f! (z) " ∗ x"x ⊥ ! a0 !a1 a2 a"3 " z a0 a1 a2 a3 a∗ff(Cx) Cx Cz∩ S d z x xˆ0 xˆ1 xˆ2 xˆ3 ∗ x x ∗⊥ a0 ∗a1 a2 a3 z a0 a1 a2aaff3(xC)x C zC ∩ Sx dx zf!xαxˆ0 +xˆ1(1xˆ2−xˆα3)x" x0 − d x1 x2 x x4 !− d x5 − d d !a0 a1 !a2 a3 ! x0 − dHxy1pexr2pxlxaxnxe∗4