Convex Functions

Lecture 3: Convex Functions Xiugang Wu University of Delaware Fall 2019 1 Outline • Basic Properties and Examples • Operations that Preserve Convexity • The Conjugate Function • Quasiconvex Functions • Log-concave and Log-convex Functions • Convexity with respect to Generalized Inequalities 2 Outline • Basic Properties and Examples • Operations that Preserve Convexity • The Conjugate Function • Quasiconvex Functions • Log-concave and Log-convex Functions • Convexity with respect to Generalized Inequalities 3 Convex Function A function f : Rn R is convex if dom f is a convex set and ! f(✓x +(1 ✓)y) ✓f(x)+(1 ✓)f(y) − − for all x, y dom f and ✓ [0, 1] 2 2 - f is strictly convex if the above holds with “ ” replaced by “<” - f is concave if f is convex − -affine functions are both convex and concave; conversely, if a function is both convex and concave, then it is affine - f is convex i↵ it is convex when restricted to any line that intersects its domain, i.e. i↵ for all x dom f and all v, the function g(t)=f(x + tv) on its domain t x + tv dom2 f is convex { | 2 } 4 Extended-Value Extension - The extended-value extension f˜ of f is defined by f(x) x dom f f˜(x)= 2 x/dom f (1 2 - This simplifies notation as f is convex i↵ f˜(✓x +(1 ✓)y) ✓f˜(x)+(1 ✓)f˜(y), x, y Rn, ✓ [0, 1] − − 8 2 2 - We can recover the domain of f from the extension by taking dom f = x f˜(x) < { | 1} 5 First Order Condition - f is di↵erentiable if dom f is open and the gradient @f @f @f f(x)= , ,..., r @x @x @x ✓ 1 2 n ◆ exists at each x dom f 2 -di↵erentiable f with convex domain is convex i↵ f(y) f(x)+ f(x)T (y x), x, y dom f ≥ r − 8 2 - Note that the affine function g(y)=f(x)+ f(x)T (y x) is the first-order Taylor approximation of f near x. The abover says that− f is convex i↵ the first-order Taylor approximation is always an underestimator of f. 6 Second Order Condition - f is twice di↵erentiable if dom f is open and the Hessian 2f(x) Sn, r 2 2 2 @ f f(x)ij = ,i,j [1 : n] r @xi@xj 2 exists at each x dom f 2 -twicedi↵erentiable f with convex domain is convex i↵ the Hessian 2f(x)is PSD for all x dom f r 2 -twicedi↵erentiable f with convex domain is strictly convex if the Hessian 2f(x) is PD for all x dom f r 2 1 T T - Quadratic function f(x)= 2 x Px+ q x + r is convex i↵ its Hessian P is PSD [Note that (xT Px)=2Px and 2(xT Px)=2P ] r r 2 T - Least-squares objective f(x)= Ax b 2 is convex because f =2A (Ax b) and 2f =2AT A 0 for any Ak − k r − r ⌫ 7 Examples in One Dimension Convex -affine: ax + b on R, for any a, b R 2 - exponential: eax on R, for any a R 2 -powers:xa on R , for a 1 or a 0 ++ ≥ - powers of absolute value: x a on R, for a 1 | | ≥ - negative entropy: x log x on R++ Concave -affine: ax + b on R, for any a, b R 2 -powers:xa on R , for a [0, 1] ++ 2 - logarithm: log x on R++ 8 Examples in High Dimensions affine functions are convex and concave; all norms are convex Examples on Rn -affine function: f(x)=aT x + b n p 1/p - norms: e.g., x p =( i=1 xi ) for p 1; x = maxi [1:n] xi k k | | ≥ k k1 2 | | P m n Examples on R ⇥ -affine function: T f(X)=tr(A X)+b = AijXij + b i [1:m] j [1:n] 2X 2X - spectral (maximum singlar value) norm: f(X)= X = σ (X)=(λ (XT X))1/2 k k2 max max 9 Examples in High Dimensions - max function: f(x) = maxi xi is convex - Quadratic over linear function: f(x, y)=x2/y is convex for y>0 2 y2 xy 2 y 2f(x, y)= − = y x 0 r y3 xy x2 y3 x − ⌫ − − ⇥ ⇤ - Log-sum-exponential: f(x) = log(ex1 + +exn ) is convex; also called softmax ··· 1/n - Geometric mean: f(x)=( i xi) is concave Q n - Log determinant: f(x) = log detX, dom f = S++ is concave g(t) = log det(Z + tV ) 1/2 1/2 1/2 1/2 = log det(Z (I + tZ− VZ− )Z ) 1/2 1/2 = log detZ + log det(I + tZ− VZ− ) n = log detZ + log(1 + tλi) i=1 X 1/2 1/2 where λ ,i [1 : n] are the eigenvalues of Z− VZ− i 2 10 Sublevel Set and Epigraph -The↵-sublevel set of f is x domf f(x) ↵ .Iff is convex, then its sublevel set is convex. Converse{ 2 is not true:| ex is} concave, but its sublevel sets are convex. − - The graph of function f is defined as (x, f(x) x dom f) ,whichisasubset in Rn+1. The epigraph is defined as{epif =| (2x, t) x dom} f,t f(x) . Function f is convex i↵ its epigraph is a convex{ set. | 2 ≥ } 11 Jensen’s Inequality -Iff is convex, then f(Ex) Ef(x) for any random variable x with x dom f w.p. 1. If f is non-convex, then there is a random variable x with x2 domf w.p. 1, such that f(Ex) > Ef(x). 2 - An intepretation: Consider convex function f. For any x dom f and random variable z with Ez = 0, we have 2 Ef(x + z) f(E[x + z]) = f(x), ≥ which means that randomization or dithering (adding a zero mean random vector to the argument) only increases the value of convex function on average. 12 Outline • Basic Properties and Examples • Operations that Preserve Convexity • The Conjugate Function • Quasiconvex Functions • Log-concave and Log-convex Functions • Convexity with respect to Generalized Inequalities 13 A Calculus of Convex Functions Practical methods for establishing convexity of a function: - verify definition (often simplified by restricting to a line) - for twice di↵erentiable function, show its Hessian is PSD - show that f is obtained from simple convex functions by operations that preserve convexity – nonnegative weighted sum – composition with affine function – pointwise maximum and supremum – composition – minimization – perspective 14 Positive Weighted Sum and Composition with Affine Nonnegative weighted sum: If f ,...,f are convex then f = w f + +w f 1 m 1 1 ··· m m is also convex, where w1,...,wm are nonnegative Composition with affine function: f(Ax + b) is convex if f is convex - log barrier for linear inequalities: m f(x)= log(b aT x), dom f = x aT x<b,i [1 : m] − i − i { | i i 2 } i=1 X - any norm of affine function: f(x)= Ax + b k k 15 Pointwise Maximum If f ,...,f are convex then f(x) = max f (x),...,f (x) is also convex 1 m { 1 m } T - piecewise-linear function: f(x) = maxi [1:m](ai x + b) is convex 2 - sum of r largest components of x Rn: 2 f(x)=x[1] + x[2] + ...+ x[r] is convex, where x[i] is the ith largest component of x; this is because f(x) = max xi I [1:n], I =r ✓ | | i I X2 16 Pointwise Supremum If f(x, y) is convex in x for each y A then 2 g(x)=supf(x, y) y A 2 is also convex T - support function of a set C: SC (x)=supy C y x is convex 2 - distance to farthest point in a set C: f(x)=sup x y y C k − k 2 - maximum eigenvalue of symmetric matrix: for X Sn, 2 T λmax(X)= sup y Xy y =1 k k2 17 Composition with Scalar Functions Composition of h : R R with scalar function g : Rn R: ! ! f(x)=h(g(x)) f is convex if: - g convex, h convex, h˜ nondecreasing - g concave, h convex, h˜ nonincreasing Remark: - proof (for n = 1, di↵erentiable g, h) 2 f 00(x)=h00(g(x))g0(x) + h0(g(x))g00(x) - monotonicity must hold for extended-value extension h˜ Examples: -expg(x) is convex if g is convex -1/g(x) is convex if g is concave and positive 18 Composition with Vector Functions Composition of h : Rk R with scalar function g : Rn Rk: ! ! f(x)=h(g1(x),g2(x),...,gk(x)) f is convex if: - gi convex, h convex, h˜ nondecreasing in each argument - gi concave, h convex, h˜ nonincreasing in each argument proof (for n = 1, di↵erentiable g, h) T 2 T f 00(x)=g0(x) h(g(x))g0(x)+ h(g(x)) g00(x) r r Examples: m - i=1 log gi(x) is concave if gi are concave and positive m - log exp gi(x) is convex if g are convex P i=1 P 19 Minimization If f is convex in (x, y) and C is a convex nonempty set, then the function g(x)=infy C f(x, y) is also convex 2 E.g., distance to a set: d(x, S)=infy S x y is convex if S is convex [function x y is convex in (x, y)soifS is convex2 k − thenk d(x, S) is convex in x] k − k 20 Perspective The perspective of f : Rn R is the function g : Rn+1 R given by ! ! g(x, t)=tf(x/t) with domg = (x, t) x/t domf,t > 0 .

Convex Functions

Details

Download

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

Support