Convex Optimization Scai Lab Study Group

Total Page:16

File Type:pdf, Size:1020Kb

Convex Optimization Scai Lab Study Group Convex Optimization ScAi Lab Study Group Zhiping (Patricia) Xiao University of California, Los Angeles Winter 2020 Outline 2 Introduction: Convex Optimization Convexity Convex Functions' Properties Definition of Convex Optimization Convex Optimization General Strategy Learning Algorithms Convergence Analysis Examples Source of Materials 3 Textbook: I Convex Optimization and Intro to Linear Algebra by Prof. Boyd and Prof. Vandenberghe Course Materials: I ECE236B, ECE236C offered by Prof. Vandenberghe I CS260 Lecture 12 offered by Prof. Quanquan Gu Notes: I My previous ECE236B notes and ECE236C final report. I My previous CS260 Cheat Sheet. Related Papers: I Accelerated methods for nonconvex optimization I Lipschitz regularity of deep neural networks: analysis and efficient estimation Introduction: Convex Optimization q Notations 5 I iff: if and only if I R+ = fx 2 R j x ≥ 0g I R++ = fx 2 R j x > 0g I int K: interior of set K, not its boundary. I Generalized inequalities (textbook 2.4), based on a proper cone K (convex, closed, solid, pointed | if x 2 K and −x 2 K then x = 0): I x K y () y − x 2 K I x ≺K y () y − x 2 int K n n T I Positive semidefinite matrix X 2 S+, 8y 2 R , y Xy ≥ 0 () X 0. Convexity of Set 6 Set C is convex iff the line segment between any two points in C lies in C, i.e. 8x1; x2 2 C and 8θ 2 [0; 1], we have: θx1 + (1 − θ)x2 2 C Both convex and nonconvex sets have convex hull, which is defined as: k k X X conv C = f θixi j xi 2 C; θi ≥ 0; i = 1; 2; : : : ; k; θi = 1g i=1 i=1 Convexity of Set (Examples from Textbook) 7 Figure: Left: convex, middle & right: nonconvex. Figure: Left: convex hull of the points, right: convex hull of the kidney-shaped set above. Operations that Preserve Convexity of SetsI 8 The most common operations that preserve convexity of convex sets include: I Intersection I Image / inverse image under affine function I Cartesian Product, Minkowski sum, Projection I Perspective function I Linear-fractional functions Operations that Preserve Convexity of SetsII 9 Convexity is preserved under intersection: I S1;S2 are convex sets then S1 \ S2 is also convex set. I If Sα is convex for 8α 2 A, then \α2ASα is convex. Proof: Intersection of a collection of convex sets is convex set. If the intersection is empty, or consists of only a single point, then proved by definition. Otherwise, for any two points A, B in the intersection, line AB must lie wholly within each set in the collection, hence must lie wholly within their intersection. Operations that Preserve Convexity of Sets III 10 n m An affine function f : R ! R is a sum of a linear function and a constant, i.e., if it has the form f(x) = Ax + b, where m×n m A 2 R , b 2 R , thus f represents a hyperplane. n Suppose that S ⊆ R is convex and then the image of S under f is convex: f(S) = ff(x) j x 2 Sg m n Also, if f : R ! R is an affine function, the inverse image of S under f is convex: f −1(S) = fx j f(x) 2 Sg Examples include scaling αS = ff(x) j αx; x 2 Sg (α 2 R) and n translation S + a = ff(x) j x + a; x 2 Sg (a 2 R ); they are both convex sets when S is convex. Operations that Preserve Convexity of SetsIV 11 Proof: the image of convex set S under affine function f(x) = Ax + b is also convex. If S is empty or contains only one point, then f(S) is obviously convex. Otherwise, take xS; yS 2 f(S). xS = f(x) = Ax + b, xS = f(y) = Ay + b. Then 8θ 2 [0; 1], we have: θxS + (1 − θ)yS = A(θx + (1 − θ)y) + b = f(θx + (1 − θ)y) Since x; y 2 S, and S is convex set, then θx + (1 − θ)y 2 S, and thus f(θx + (1 − θ)y) 2 f(S). Operations that Preserve Convexity of SetsV 12 n m The Cartesian Product of convex sets S1 ⊆ R , S2 ⊆ R is obviously convex: S1 × S2 = f(x1; x2) j x1 2 S1; x2 2 S2g The Minkowski sum of the two sets is defined as: S1 + S2 = fx1 + x2 j x1 2 S1; x2 2 S2g and it is also obviously convex. The projection of a convex set onto some of its coordinates is also obviously convex. (consider the definition of convexity reflected on each coordinate) m n T = fx1 2 R j (x1; x2) 2 S for some x2 2 R g Operations that Preserve Convexity of SetsVI 13 n+1 n We define the perspective function P : R ! R , with domain n dom P = R × R++, as P (z; t) = z=t. The perspective function scales or normalizes vectors so the last component is one, and then drops the last component. We can interpret the perspective function as the action of a pin-hole camera. (x1; x2; x3) through a hold at (0; 0; 0) on plane x3 = 0 forms an image at −(x1=x3; x2=x3; 1) at x3 = −1. The last component could be dropped, since the image point is fixed. Proof: That this operation preserves convexity is already proved by affine function + projection preserve convexity. Operations that Preserve Convexity of Sets VII 14 n m A linear-fractional function f : R ! R is formed by composing the perspective function with an affine function. n m+1 Consider the following affine function g : R ! R : A b g(x) = x + cT d m×n m n where A 2 R , b 2 R , c 2 R , d 2 R. m+1 m Followed by a perspective function P : R ! R we have: f(x) = (Ax + b)=(cT x + d); dom f = fx j cT x + d > 0g And it naturally preserves convexity because both affine function and perspective function preserve convexity. Convexity of Function 15 Convex Functions Strict Convex Functions Strong Convex Functions Figure: The three commonly-seen types of convex functions and their relations. In brief, strong convex functions ) strict convex functions ) convex functions. Convexity of Function 16 n f : R ! R is convex iff it satisfies: I dom f is a convex set. I 8x; y 2 dom f, θ 2 [0; 1], we have the Jensen's inequality: f(θx + (1 − θ)y) ≤ θf(x) + (1 − θ)f(y) f is strictly convex iff when x 6= y and θ 2 (0; 1), strict inequality of the above inequation holds. f is concave when −f is convex, strictly concave when −f strictly convex, and vice versa. f is strong convex iff 9α > 0 such that f(x) − αkxk2 is convex. k·k is any norm. Convexity of Function 17 Proof: strong convex functions ) strict convex functions ) convex functions. That all strict convex functions are convex functions, and that convex functions are not necessarily strict convex. Strong convexity implies, 8x; y 2 dom f; θ 2 [0; 1], x 6= y, 9α > 0: f(θx + (1 − θ)y) − αkθx + (1 − θ)yk2 (1.1) ≤ θf(x) + (1 − θ)f(y) − θαkxk2 − (1 − θ)αkyk2 Something we didn't prove yet but is true: k·k2 is strictly convex. We need it for this proof. kθx + (1 − θ)yk2 < θkxk2 + (1 − θ)kyk2 Convexity of Function 18 (proof continues) αkθx + (1 − θ)yk2 < θαkxk2 + (1 − θ)αkyk2 t = −αkθx + (1 − θ)yk2 + θαkxk2 + (1 − θ)αkyk2 > 0 (1.1) is equivalent with: f(θx + (1 − θ)y) + t ≤ θf(x) + (1 − θ)f(y) where t > 0, thus: f(θx + (1 − θ)y) < θf(x) + (1 − θ)f(y) Convexity of Function 19 Figure: Convex function illustration from Prof. Gu's Slides. This figure shows a typical convex function f, and instead of our expression of x and y he used u & v instead. Convexity of Function 20 Commonly-seen uni-variate convex functions include: I Constant: C I Exponential function: eax I Power function: xa (a 2 (−∞; 0] [ [1; 1), otherwise it is concave) I Powers of absolute value: jxjp (p ≥ 1) I Logarithm: − log(x)(x 2 R++) I x log(x)(x 2 R++) I All norm functions kxk I \The inequality follows from the triangle inequality, and the equality follows from homogeneity of a norm." Affine functions are Convex & Concave 21 n m An affine function f : R ! R , f(x) = Ax + b, where m×n m A 2 R , b 2 R , is convex & concave (neither strict convex nor strict concave). Conversely, all functions that are both convex and concave are affine functions. Proof: 8θ 2 [0; 1]; x; y 2 dom f, we have: f(θx + (1 − θ)y) = A(θx + (1 − θ)y) + b = θ(Ax + b) + (1 − θ)(Ay + b) = θf(x) + (1 − θ)f(y) th 0 -Order Characterization 22 f is convex iff it is convex when restricted to any line that intersects its domain. n In other words, f is convex iff 8x 2 dom f and 8v 2 R , the function: g(t) = f(x + tv) is convex. dom g = ft j x + tv 2 dom fg This property allows us to check convexity of a function by restricting it to a line. st 1 -Order Characterization 23 Suppose f is differentiable (its gradient rf exists at each point in dom f, which is open). Then f is convex iff: I dom f is a convex set I 8x; y 2 dom f: f(y) ≥ f(x) + rf(x)T (y − x) It states that, for a convex function, the first-order Taylor approximation (f(x) + rf(x)T (y − x) is the first-order Taylor approximation of f near x) is in fact a global underestimator of the function.
Recommended publications
  • On Choquet Theorem for Random Upper Semicontinuous Functions
    CORE Metadata, citation and similar papers at core.ac.uk Provided by Elsevier - Publisher Connector International Journal of Approximate Reasoning 46 (2007) 3–16 www.elsevier.com/locate/ijar On Choquet theorem for random upper semicontinuous functions Hung T. Nguyen a, Yangeng Wang b,1, Guo Wei c,* a Department of Mathematical Sciences, New Mexico State University, Las Cruces, NM 88003, USA b Department of Mathematics, Northwest University, Xian, Shaanxi 710069, PR China c Department of Mathematics and Computer Science, University of North Carolina at Pembroke, Pembroke, NC 28372, USA Received 16 May 2006; received in revised form 17 October 2006; accepted 1 December 2006 Available online 29 December 2006 Abstract Motivated by the problem of modeling of coarse data in statistics, we investigate in this paper topologies of the space of upper semicontinuous functions with values in the unit interval [0,1] to define rigorously the concept of random fuzzy closed sets. Unlike the case of the extended real line by Salinetti and Wets, we obtain a topological embedding of random fuzzy closed sets into the space of closed sets of a product space, from which Choquet theorem can be investigated rigorously. Pre- cise topological and metric considerations as well as results of probability measures and special prop- erties of capacity functionals for random u.s.c. functions are also given. Ó 2006 Elsevier Inc. All rights reserved. Keywords: Choquet theorem; Random sets; Upper semicontinuous functions 1. Introduction Random sets are mathematical models for coarse data in statistics (see e.g. [12]). A the- ory of random closed sets on Hausdorff, locally compact and second countable (i.e., a * Corresponding author.
    [Show full text]
  • EE364 Review Session 2
    EE364 Review EE364 Review Session 2 Convex sets and convex functions • Examples from chapter 3 • Installing and using CVX • 1 Operations that preserve convexity Convex sets Convex functions Intersection Nonnegative weighted sum Affine transformation Composition with an affine function Perspective transformation Perspective transformation Pointwise maximum and supremum Minimization Convex sets and convex functions are related via the epigraph. • Composition rules are extremely important. • EE364 Review Session 2 2 Simple composition rules Let h : R R and g : Rn R. Let f(x) = h(g(x)). Then: ! ! f is convex if h is convex and nondecreasing, and g is convex • f is convex if h is convex and nonincreasing, and g is concave • f is concave if h is concave and nondecreasing, and g is concave • f is concave if h is concave and nonincreasing, and g is convex • EE364 Review Session 2 3 Ex. 3.6 Functions and epigraphs. When is the epigraph of a function a halfspace? When is the epigraph of a function a convex cone? When is the epigraph of a function a polyhedron? Solution: If the function is affine, positively homogeneous (f(αx) = αf(x) for α 0), and piecewise-affine, respectively. ≥ Ex. 3.19 Nonnegative weighted sums and integrals. r 1. Show that f(x) = i=1 αix[i] is a convex function of x, where α1 α2 αPr 0, and x[i] denotes the ith largest component of ≥ ≥ · · · ≥ ≥ k n x. (You can use the fact that f(x) = i=1 x[i] is convex on R .) P 2. Let T (x; !) denote the trigonometric polynomial T (x; !) = x + x cos ! + x cos 2! + + x cos(n 1)!: 1 2 3 · · · n − EE364 Review Session 2 4 Show that the function 2π f(x) = log T (x; !) d! − Z0 is convex on x Rn T (x; !) > 0; 0 ! 2π .
    [Show full text]
  • NP-Hardness of Deciding Convexity of Quartic Polynomials and Related Problems
    NP-hardness of Deciding Convexity of Quartic Polynomials and Related Problems Amir Ali Ahmadi, Alex Olshevsky, Pablo A. Parrilo, and John N. Tsitsiklis ∗y Abstract We show that unless P=NP, there exists no polynomial time (or even pseudo-polynomial time) algorithm that can decide whether a multivariate polynomial of degree four (or higher even degree) is globally convex. This solves a problem that has been open since 1992 when N. Z. Shor asked for the complexity of deciding convexity for quartic polynomials. We also prove that deciding strict convexity, strong convexity, quasiconvexity, and pseudoconvexity of polynomials of even degree four or higher is strongly NP-hard. By contrast, we show that quasiconvexity and pseudoconvexity of odd degree polynomials can be decided in polynomial time. 1 Introduction The role of convexity in modern day mathematical programming has proven to be remarkably fundamental, to the point that tractability of an optimization problem is nowadays assessed, more often than not, by whether or not the problem benefits from some sort of underlying convexity. In the famous words of Rockafellar [39]: \In fact the great watershed in optimization isn't between linearity and nonlinearity, but convexity and nonconvexity." But how easy is it to distinguish between convexity and nonconvexity? Can we decide in an efficient manner if a given optimization problem is convex? A class of optimization problems that allow for a rigorous study of this question from a com- putational complexity viewpoint is the class of polynomial optimization problems. These are op- timization problems where the objective is given by a polynomial function and the feasible set is described by polynomial inequalities.
    [Show full text]
  • Arxiv:2103.15804V4 [Math.AT] 25 May 2021
    Decorated Merge Trees for Persistent Topology Justin Curry, Haibin Hang, Washington Mio, Tom Needham, and Osman Berat Okutan Abstract. This paper introduces decorated merge trees (DMTs) as a novel invariant for persistent spaces. DMTs combine both π0 and Hn information into a single data structure that distinguishes filtrations that merge trees and persistent homology cannot distinguish alone. Three variants on DMTs, which empha- size category theory, representation theory and persistence barcodes, respectively, offer different advantages in terms of theory and computation. Two notions of distance|an interleaving distance and bottleneck distance|for DMTs are defined and a hierarchy of stability results that both refine and generalize existing stability results is proved here. To overcome some of the computational complexity inherent in these dis- tances, we provide a novel use of Gromov-Wasserstein couplings to compute optimal merge tree alignments for a combinatorial version of our interleaving distance which can be tractably estimated. We introduce computational frameworks for generating, visualizing and comparing decorated merge trees derived from synthetic and real data. Example applications include comparison of point clouds, interpretation of persis- tent homology of sliding window embeddings of time series, visualization of topological features in segmented brain tumor images and topology-driven graph alignment. Contents 1. Introduction 1 2. Decorated Merge Trees Three Different Ways3 3. Continuity and Stability of Decorated Merge Trees 10 4. Representations of Tree Posets and Lift Decorations 16 5. Computing Interleaving Distances 20 6. Algorithmic Details and Examples 24 7. Discussion 29 References 30 Appendix A. Interval Topology 34 Appendix B. Comparison and Existence of Merge Trees 35 Appendix C.
    [Show full text]
  • Linear Algebra Review
    CSE 203B: Convex Optimization Week 4 Discuss Session 1 Contents • Convex functions (Ref. Chap.3) • Review: definition, first order condition, second order condition, operations that preserve convexity • Epigraph • Conjugate function • Dual norm 2 Review of Convex Function • Definition ( often simplified by restricting to a line) • First order condition: a global underestimator • Second order condition • Operations that preserve convexity • Nonnegative weighted sum • Composition with affine function • Pointwise maximum and supremum • Composition • Minimization See Chap 3.2, try to prove why the convexity • Perspective is preserved with those operations. 3 Epigraph • 훼-sublevel set of 푓: 푅푛 → 푅 퐶훼 = 푥 ∈ 푑표푚 푓 푓 푥 ≤ 훼} sublevel sets of a convex function are convex for any value of 훼. • Epigraph of 푓: 푅푛 → 푅 is defined as 퐞퐩퐢 푓 = (푥, 푡) 푥 ∈ 푑표푚 푓, 푓 푥 ≤ 푡} ⊆ 푅푛+1 • A function is convex iff its epigraph is a convex set. 4 Relation between convex sets and convex functions • A function is convex iff its epigraph is a convex set. • Consider a convex function 푓 and 푥, 푦 ∈ 푑표푚 푓 푡 ≥ 푓 푦 ≥ 푓 푥 + 훻푓 푥 푇(푦 − 푥) epi 풇 First order condition for convexity • The hyperplane supports epi 풇 at (푥, 푓 푥 ), for any 푡 푥 푦, 푡 ∈ 퐞퐩퐢 푓 ⇒ 훻푓 푥 푇 푦 − 푥 + 푓 푥 − 푡 ≤ 0 훻푓 푥 푇 푦 푥 ⇒ − ≤ 0 −1 푡 푓 푥 Supporting hyperplane, derived from first order condition 5 Pointwise Supremum • If for each 푦 ∈ 푈, 푓(푥, 푦): 푅푛 → 푅 is convex in 푥, then function 푔(푥) = sup 푓(푥, 푦) 푦∈푈 is convex in 푥.
    [Show full text]
  • Lecture 3: Convex Sets and Functions 3.1 Convex Sets
    EE 227A: Convex Optimization and Applications January 24, 2012 Lecture 3: Convex Sets and Functions Lecturer: Laurent El Ghaoui Reading assignment: Chapters 2 (except x2.6) and sections 3.1, 3.2, 3.3 of BV. You can also look at section 3.1 of the web textbook. 3.1 Convex Sets Definition. A subset C of Rn is convex if and only if it contains the line segment between any two points in it: 8 x1; x2 2 C; 8 θ1 ≥ 0; θ2 ≥ 0; θ1 + θ2 = 1 : θ1x1 + θ2x2 2 C: Some important special cases of convex sets are the following. • The set is said to be an affine subspace if it contains the entire line passing through any two points. This corresponds to the condition above, with θ1; θ2 arbitrary. Subspaces and affine subspaces are convex. • The set is said to be a convex cone if the condition above holds, but with the restriction θ1 + θ2 = 1 removed. Examples. • The convex hull of a set of points fx1; : : : ; xmg is defined as ( m m ) X m X Co(x1; : : : ; xm) := λixi : λ 2 R+ ; λi = 1 ; i=1 i=1 and is convex. The conic hull: ( m ) X m λixi : λ 2 R+ i=1 is a convex cone. • For a 2 Rn, and b 2 R, the hyperplane H = fx : aT x = bg is affine. The half-space fx : aT x ≤ bg is convex. 3-1 EE 227A Lecture 3 | January 24, 2012 Sp'12 n×n n • For a square, non-singular matrix R 2 R , and xc 2 R , the ellipsoid fxc + Ru : kuk2 ≤ 1g is convex.
    [Show full text]
  • Convex Sets and Functions (Part III)
    Convex Sets and Functions (part III) Prof. Dan A. Simovici UMB 1 / 14 Outline 1 The epigraph and hypograph of functions 2 Constructing Convex Functions 2 / 14 The epigraph and hypograph of functions Definition Let (S; O) be a topological space and let f : S −! R^ be a function. Its epigraph is the set epi(f ) = f(x; y) 2 S × R j f (x) 6 yg: The hypograph of f is the set hyp(f ) = f(x; y) 2 S × R j y 6 f (x)g: 3 / 14 The epigraph and hypograph of functions 2 The epigraph of a function f : R −! R is the dotted area in R located above the graph of the function f ; the hypograph of f is the dotted area below the graph. y y x x (a) (b) 4 / 14 The epigraph and hypograph of functions Note that the intersection epi(f ) \ hyp(f ) = f(x; y) 2 S × R j y = f (x)g is the graph of the function f . If f (x) = 1, then (x; 1) 62 epi(f ). Thus, for the function f1 defined by f1(x) = 1 for x 2 S we have epi(f1) = ;. 5 / 14 The epigraph and hypograph of functions Theorem Let f : S −! R be a function defined on the convex subset S of a real linear space L. Then, f is convex on S if and only if its epigraph is a convex subset of S × R; f is concave if and only if its hypograph is a convex subset of S × R.
    [Show full text]
  • Convex Analysis
    Convex Analysis Taught by Professor Adrian Lewis Notes by Mateo D´ıaz [email protected] Cornell University Spring, 2018 Last updated May 18, 2018. The latest version is online here. Notes template by David Mehrle. Contents 1 Convex Sets .................................................. 4 2 Convex functions............................................... 6 2.1 Convexity and calculus......................................... 7 2.1.1 Gradients ............................................ 7 2.1.2 Recognizing smooth convexity................................ 8 2.2 Continuity of convex functions.................................... 8 2.3 Subgradients............................................... 9 3 Duality..................................................... 10 3.1 Fenchel Duality............................................. 10 3.2 Conic programs............................................. 12 3.3 Convex calculus............................................. 14 3.4 Lagrangian duality........................................... 15 4 Algorithms................................................... 19 4.1 Augmented Lagrangian Method ................................... 19 4.2 Proximal Point method and fixed points............................... 20 4.3 Smooth Minimization.......................................... 25 4.4 Splitting Problems and Alternating projections........................... 26 4.5 Proximal Gradient ........................................... 28 4.6 Douglas-Rachford............................................ 29 4.6.1 Consensus optimization...................................
    [Show full text]
  • On a Conic Approach to Convex Analysis 1 Introduction
    On a Conic Approach to Convex Analysis Jan Brinkhuis February 2008 Econometric Institute Report EI 2008-05 Abstract The aim of this paper is to make an attempt to justify the main results from Convex Analysis by one elegant tool, the conification method, which consists of three steps: conify, work with convex cones, deconify. It is based on the fact that the standard operations and facts (‘the calculi’) are much simpler for special convex sets, convex cones. By considering suitable classes of convex cones, we get the standard operations and facts for all situations in the complete generality that is required. The main advantages of this conification method are that the standard operations— linear image, inverse linear image, closure, the duality operator, the binary operations and the inf- operator—are defined on all objects of each class of convex objects—convex sets, convex functions, convex cones and sublinear functions—and that moreover the standard facts—such as the duality theorem—hold forall closed convex objects. This requires that the analysis is carried out in the context of convex objects over cosmic space, the space that is obtained from ordinary space by adding a horizon, representing the directions of ordinary space. 1 Introduction Reduction to conic calculus. This paper presents some progress on joint work with V.M.Tikhomirov that was published in [7]. It is an attempt to develop a general and universal tool for justifying the main results of Convex Analysis. The idea is as follows. Many standard operations and facts are much simpler and more complete for special convex objects, convex cones.
    [Show full text]
  • Generalized Convexity in Mathematical Programming
    BULL. AUSTRAL. MATH. SOC. 90C25, 90C30 VOL. 27 (1983), 185-202. GENERALIZED CONVEXITY IN MATHEMATICAL PROGRAMMING B. MOND The role of convexity in the development of mathematical programming is reviewed. Many recent generalizations of convexity and their applications to optimization theory are discussed. 1. Introduction As is well known, convexity plays a key role in mathematical programming. However, functions that are, in some sense, close to, but not quite, convex often have many of the important properties of convex functions. This, plus the fact that many practical problems involve functions that are 'almost1 convex has led to many generalizations of the notion of convex functions. Here we survey some of these extensions of convexity and indicate briefly the role they play in optimization theory. Most of the results in Sections 2 to 5 can be found in the books by Avriel [7], Mangasarian [76] and Martos [77] while the results in Section 6 on cone-convexity can be found in the book by Craven [JS]. In view of this, no additional references or original sources will be given for results in these sections except for the new dual for non-linear programs in Section 5 that does not appear in any of the above-mentioned books. More detailed references will be given in the later sections. Proofs will only be given where they are extremely brief and not generally available in the literature. It should be noted that most proofs Received 7 October 1982. An invited lecture delivered to the Annual Congress of the South African Mathematical Society, October 1982.
    [Show full text]
  • GEOMETRIC APPROACH to CONVEX SUBDIFFERENTIAL CALCULUS October 10, 2018
    GEOMETRIC APPROACH TO CONVEX SUBDIFFERENTIAL CALCULUS October 10, 2018 BORIS S. MORDUKHOVICH1 and NGUYEN MAU NAM2 Dedicated to Franco Giannessi and Diethard Pallaschke with great respect Abstract. In this paper we develop a geometric approach to convex subdifferential calculus in finite dimensions with employing some ideas of modern variational analysis. This approach allows us to obtain natural and rather easy proofs of basic results of convex subdifferential calculus in full generality and also derive new results of convex analysis concerning optimal value/marginal func- tions, normals to inverse images of sets under set-valued mappings, calculus rules for coderivatives of single-valued and set-valued mappings, and calculating coderivatives of solution maps to param- eterized generalized equations governed by set-valued mappings with convex graphs. Key words. convex analysis, generalized differentiation, geometric approach, convex separation, normal cone, subdifferential, coderivative, calculus rules, maximum function, optimal value function AMS subject classifications. 49J52, 49J53, 90C31 1 Introduction The notion of subdifferential (collection of subgradients) for nondifferentiable convex func- tions was independently introduced and developed by Moreau [15] and Rockafellar [19] who were both influenced by Fenchel [6]. Since then this notion has become one of the most central concepts of convex analysis and its various applications, including first of all convex optimization. The underlying difference between the standard derivative of a differentiable function and the subdifferential of a convex function at a given point is that the subdif- ferential is a set (of subgradients) which reduces to a singleton (gradient) if the function is differentiable. Due to the set-valuedness of the subdifferential, deriving calculus rules for it is a significantly more involved task in comparison with classical differential calculus.
    [Show full text]
  • Lecture 3 Convex Functions
    Lecture 3 Convex functions (Basic properties; Calculus; Closed functions; Continuity of convex functions; Subgradients; Optimality conditions) 3.1 First acquaintance Definition 3.1.1 [Convex function] A function f : M ! R defined on a nonempty subset M of Rn and taking real values is called convex, if • the domain M of the function is convex; • for any x; y 2 M and every λ 2 [0; 1] one has f(λx + (1 − λ)y) ≤ λf(x) + (1 − λ)f(y): (3.1.1) If the above inequality is strict whenever x 6= y and 0 < λ < 1, f is called strictly convex. A function f such that −f is convex is called concave; the domain M of a concave function should be convex, and the function itself should satisfy the inequality opposite to (3.1.1): f(λx + (1 − λ)y) ≥ λf(x) + (1 − λ)f(y); x; y 2 M; λ 2 [0; 1]: The simplest example of a convex function is an affine function f(x) = aT x + b { the sum of a linear form and a constant. This function clearly is convex on the entire space, and the \convexity inequality" for it is equality; the affine function is also concave. It is easily seen that the function which is both convex and concave on the entire space is an affine function. Here are several elementary examples of \nonlinear" convex functions of one variable: 53 54 LECTURE 3. CONVEX FUNCTIONS • functions convex on the whole axis: x2p, p being positive integer; expfxg; • functions convex on the nonnegative ray: xp, 1 ≤ p; −xp, 0 ≤ p ≤ 1; x ln x; • functions convex on the positive ray: 1=xp, p > 0; − ln x.
    [Show full text]