Chapter 2, Lecture 2: Convex Functions 1 Definition Of
Total Page:16
File Type:pdf, Size:1020Kb
Load more
										Recommended publications
									
								- 
												  CORE View Metadata, Citation and Similar Papers at Core.Ac.UkView metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Bulgarian Digital Mathematics Library at IMI-BAS Serdica Math. J. 27 (2001), 203-218 FIRST ORDER CHARACTERIZATIONS OF PSEUDOCONVEX FUNCTIONS Vsevolod Ivanov Ivanov Communicated by A. L. Dontchev Abstract. First order characterizations of pseudoconvex functions are investigated in terms of generalized directional derivatives. A connection with the invexity is analysed. Well-known first order characterizations of the solution sets of pseudolinear programs are generalized to the case of pseudoconvex programs. The concepts of pseudoconvexity and invexity do not depend on a single definition of the generalized directional derivative. 1. Introduction. Three characterizations of pseudoconvex functions are considered in this paper. The first is new. It is well-known that each pseudo- convex function is invex. Then the following question arises: what is the type of 2000 Mathematics Subject Classification: 26B25, 90C26, 26E15. Key words: Generalized convexity, nonsmooth function, generalized directional derivative, pseudoconvex function, quasiconvex function, invex function, nonsmooth optimization, solution sets, pseudomonotone generalized directional derivative. 204 Vsevolod Ivanov Ivanov the function η from the definition of invexity, when the invex function is pseudo- convex. This question is considered in Section 3, and a first order necessary and sufficient condition for pseudoconvexity of a function is given there. It is shown that the class of strongly pseudoconvex functions, considered by Weir [25], coin- cides with pseudoconvex ones. The main result of Section 3 is applied to characterize the solution set of a nonlinear programming problem in Section 4. The base results of Jeyakumar and Yang in the paper [13] are generalized there to the case, when the function is pseudoconvex.
- 
												  Convexity I: Sets and FunctionsConvexity I: Sets and Functions Ryan Tibshirani Convex Optimization 10-725/36-725 See supplements for reviews of basic real analysis • basic multivariate calculus • basic linear algebra • Last time: why convexity? Why convexity? Simply put: because we can broadly understand and solve convex optimization problems Nonconvex problems are mostly treated on a case by case basis Reminder: a convex optimization problem is of ● the form ● ● min f(x) x2D ● ● subject to gi(x) 0; i = 1; : : : m ≤ hj(x) = 0; j = 1; : : : r ● ● where f and gi, i = 1; : : : m are all convex, and ● hj, j = 1; : : : r are affine. Special property: any ● local minimizer is a global minimizer ● 2 Outline Today: Convex sets • Examples • Key properties • Operations preserving convexity • Same for convex functions • 3 Convex sets n Convex set: C R such that ⊆ x; y C = tx + (1 t)y C for all 0 t 1 2 ) − 2 ≤ ≤ In words, line segment joining any two elements lies entirely in set 24 2 Convex sets Figure 2.2 Some simple convexn and nonconvex sets. Left. The hexagon, Convex combinationwhich includesof x1; its : :boundary : xk (shownR : darker), any linear is convex. combinationMiddle. The kidney shaped set is not convex, since2 the line segment between the twopointsin the set shown as dots is not contained in the set. Right. The square contains some boundaryθ1x points1 + but::: not+ others,θkxk and is not convex. k with θi 0, i = 1; : : : k, and θi = 1. Convex hull of a set C, ≥ i=1 conv(C), is all convex combinations of elements. Always convex P 4 Figure 2.3 The convex hulls of two sets in R2.
- 
												  Circles ProjectCircles Project C. Sormani, MTTI, Lehman College, CUNY MAT631, Fall 2009, Project X BACKGROUND: General Axioms, Half planes, Isosceles Triangles, Perpendicular Lines, Con- gruent Triangles, Parallel Postulate and Similar Triangles. DEFN: A circle of radius R about a point P in a plane is the collection of all points, X, in the plane such that the distance from X to P is R. That is, C(P; R) = fX : d(X; P ) = Rg: Note that a circle is a well defined notion in any metric space. We say P is the center and R is the radius. If d(P; Q) < R then we say Q is an interior point of the circle, and if d(P; Q) > R then Q is an exterior point of the circle. DEFN: The diameter of a circle C(P; R) is a line segment AB such that A; B 2 C(P; R) and the center P 2 AB. The length AB is also called the diameter. THEOREM: The diameter of a circle C(P; R) has length 2R. PROOF: Complete this proof using the definition of a line segment and the betweenness axioms. DEFN: If A; B 2 C(P; R) but P2 = AB then AB is called a chord. THEOREM: A chord of a circle C(P; R) has length < 2R unless it is the diameter. PROOF: Complete this proof using axioms and the above theorem. DEFN: A line L is said to be a tangent line to a circle, if L \ C(P; R) is a single point, Q. The point Q is called the point of contact and L is said to be tangent to the circle at Q.
- 
												  Continuous Functions, Smooth Functions and the Derivative C 2012 Yonatan Katznelsonucsc supplementary notes ams/econ 11a Continuous Functions, Smooth Functions and the Derivative c 2012 Yonatan Katznelson 1. Continuous functions One of the things that economists like to do with mathematical models is to extrapolate the general behavior of an economic system, e.g., forecast future trends, from theoretical assumptions about the variables and functional relations involved. In other words, they would like to be able to tell what's going to happen next, based on what just happened; they would like to be able to make long term predictions; and they would like to be able to locate the extreme values of the functions they are studying, to name a few popular applications. Because of this, it is convenient to work with continuous functions. Definition 1. (i) The function y = f(x) is continuous at the point x0 if f(x0) is defined and (1.1) lim f(x) = f(x0): x!x0 (ii) The function f(x) is continuous in the interval (a; b) = fxja < x < bg, if f(x) is continuous at every point in that interval. Less formally, a function is continuous in the interval (a; b) if the graph of that function is a continuous (i.e., unbroken) line over that interval. Continuous functions are preferable to discontinuous functions for modeling economic behavior because continuous functions are more predictable. Imagine a function modeling the price of a stock over time. If that function were discontinuous at the point t0, then it would be impossible to use our knowledge of the function up to t0 to predict what will happen to the price of the stock after time t0.
- 
												  Convex FunctionsConvex Functions Prof. Daniel P. Palomar ELEC5470/IEDA6100A - Convex Optimization The Hong Kong University of Science and Technology (HKUST) Fall 2020-21 Outline of Lecture • Definition convex function • Examples on R, Rn, and Rn×n • Restriction of a convex function to a line • First- and second-order conditions • Operations that preserve convexity • Quasi-convexity, log-convexity, and convexity w.r.t. generalized inequalities (Acknowledgement to Stephen Boyd for material for this lecture.) Daniel P. Palomar 1 Definition of Convex Function • A function f : Rn −! R is said to be convex if the domain, dom f, is convex and for any x; y 2 dom f and 0 ≤ θ ≤ 1, f (θx + (1 − θ) y) ≤ θf (x) + (1 − θ) f (y) : • f is strictly convex if the inequality is strict for 0 < θ < 1. • f is concave if −f is convex. Daniel P. Palomar 2 Examples on R Convex functions: • affine: ax + b on R • powers of absolute value: jxjp on R, for p ≥ 1 (e.g., jxj) p 2 • powers: x on R++, for p ≥ 1 or p ≤ 0 (e.g., x ) • exponential: eax on R • negative entropy: x log x on R++ Concave functions: • affine: ax + b on R p • powers: x on R++, for 0 ≤ p ≤ 1 • logarithm: log x on R++ Daniel P. Palomar 3 Examples on Rn • Affine functions f (x) = aT x + b are convex and concave on Rn. n • Norms kxk are convex on R (e.g., kxk1, kxk1, kxk2). • Quadratic functions f (x) = xT P x + 2qT x + r are convex Rn if and only if P 0.
- 
												  A Brief Summary of Math 125 the Derivative of a Function F Is AnotherA Brief Summary of Math 125 The derivative of a function f is another function f 0 defined by f(v) − f(x) f(x + h) − f(x) f 0(x) = lim or (equivalently) f 0(x) = lim v!x v − x h!0 h for each value of x in the domain of f for which the limit exists. The number f 0(c) represents the slope of the graph y = f(x) at the point (c; f(c)). It also represents the rate of change of y with respect to x when x is near c. This interpretation of the derivative leads to applications that involve related rates, that is, related quantities that change with time. An equation for the tangent line to the curve y = f(x) when x = c is y − f(c) = f 0(c)(x − c). The normal line to the curve is the line that is perpendicular to the tangent line. Using the definition of the derivative, it is possible to establish the following derivative formulas. Other useful techniques involve implicit differentiation and logarithmic differentiation. d d d 1 xr = rxr−1; r 6= 0 sin x = cos x arcsin x = p dx dx dx 1 − x2 d 1 d d 1 ln jxj = cos x = − sin x arccos x = −p dx x dx dx 1 − x2 d d d 1 ex = ex tan x = sec2 x arctan x = dx dx dx 1 + x2 d 1 d d 1 log jxj = ; a > 0 cot x = − csc2 x arccot x = − dx a (ln a)x dx dx 1 + x2 d d d 1 ax = (ln a) ax; a > 0 sec x = sec x tan x arcsec x = p dx dx dx jxj x2 − 1 d d 1 csc x = − csc x cot x arccsc x = − p dx dx jxj x2 − 1 d product rule: F (x)G(x) = F (x)G0(x) + G(x)F 0(x) dx d F (x) G(x)F 0(x) − F (x)G0(x) d quotient rule: = chain rule: F G(x) = F 0G(x) G0(x) dx G(x) (G(x))2 dx Mean Value Theorem: If f is continuous on [a; b] and differentiable on (a; b), then there exists a number f(b) − f(a) c in (a; b) such that f 0(c) = .
- 
												  Secant Lines TEACHER NOTESTEACHER NOTES Secant Lines About the Lesson In this activity, students will observe the slopes of the secant and tangent line as a point on the function approaches the point of tangency. As a result, students will: • Determine the average rate of change for an interval. • Determine the average rate of change on a closed interval. • Approximate the instantaneous rate of change using the slope of the secant line. Tech Tips: Vocabulary • This activity includes screen • secant line captures taken from the TI-84 • tangent line Plus CE. It is also appropriate for use with the rest of the TI-84 Teacher Preparation and Notes Plus family. Slight variations to • Students are introduced to many initial calculus concepts in this these directions may be activity. Students develop the concept that the slope of the required if using other calculator tangent line representing the instantaneous rate of change of a models. function at a given value of x and that the instantaneous rate of • Watch for additional Tech Tips change of a function can be estimated by the slope of the secant throughout the activity for the line. This estimation gets better the closer point gets to point of specific technology you are tangency. using. (Note: This is only true if Y1(x) is differentiable at point P.) • Access free tutorials at http://education.ti.com/calculato Activity Materials rs/pd/US/Online- • Compatible TI Technologies: Learning/Tutorials • Any required calculator files can TI-84 Plus* be distributed to students via TI-84 Plus Silver Edition* handheld-to-handheld transfer. TI-84 Plus C Silver Edition TI-84 Plus CE Lesson Files: * with the latest operating system (2.55MP) featuring MathPrint TM functionality.
- 
												  Visual Differential CalculusProceedings of 2014 Zone 1 Conference of the American Society for Engineering Education (ASEE Zone 1) Visual Differential Calculus Andrew Grossfield, Ph.D., P.E., Life Member, ASEE, IEEE Abstract— This expository paper is intended to provide = (y2 – y1) / (x2 – x1) = = m = tan(α) Equation 1 engineering and technology students with a purely visual and intuitive approach to differential calculus. The plan is that where α is the angle of inclination of the line with the students who see intuitively the benefits of the strategies of horizontal. Since the direction of a straight line is constant at calculus will be encouraged to master the algebraic form changing techniques such as solving, factoring and completing every point, so too will be the angle of inclination, the slope, the square. Differential calculus will be treated as a continuation m, of the line and the difference quotient between any pair of of the study of branches11 of continuous and smooth curves points. In the case of a straight line vertical changes, Δy, are described by equations which was initiated in a pre-calculus or always the same multiple, m, of the corresponding horizontal advanced algebra course. Functions are defined as the single changes, Δx, whether or not the changes are small. valued expressions which describe the branches of the curves. However for curves which are not straight lines, the Derivatives are secondary functions derived from the just mentioned functions in order to obtain the slopes of the lines situation is not as simple. Select two pairs of points at random tangent to the curves.
- 
												  Sum of Squares and Polynomial ConvexityCONFIDENTIAL. Limited circulation. For review only. Sum of Squares and Polynomial Convexity Amir Ali Ahmadi and Pablo A. Parrilo Abstract— The notion of sos-convexity has recently been semidefiniteness of the Hessian matrix is replaced with proposed as a tractable sufficient condition for convexity of the existence of an appropriately defined sum of squares polynomials based on sum of squares decomposition. A multi- decomposition. As we will briefly review in this paper, by variate polynomial p(x) = p(x1; : : : ; xn) is said to be sos-convex if its Hessian H(x) can be factored as H(x) = M T (x) M (x) drawing some appealing connections between real algebra with a possibly nonsquare polynomial matrix M(x). It turns and numerical optimization, the latter problem can be re- out that one can reduce the problem of deciding sos-convexity duced to the feasibility of a semidefinite program. of a polynomial to the feasibility of a semidefinite program, Despite the relative recency of the concept of sos- which can be checked efficiently. Motivated by this computa- convexity, it has already appeared in a number of theo- tional tractability, it has been speculated whether every convex polynomial must necessarily be sos-convex. In this paper, we retical and practical settings. In [6], Helton and Nie use answer this question in the negative by presenting an explicit sos-convexity to give sufficient conditions for semidefinite example of a trivariate homogeneous polynomial of degree eight representability of semialgebraic sets. In [7], Lasserre uses that is convex but not sos-convex. sos-convexity to extend Jensen’s inequality in convex anal- ysis to linear functionals that are not necessarily probability I.
- 
												  Calculus TerminologyAP Calculus BC Calculus Terminology Absolute Convergence Asymptote Continued Sum Absolute Maximum Average Rate of Change Continuous Function Absolute Minimum Average Value of a Function Continuously Differentiable Function Absolutely Convergent Axis of Rotation Converge Acceleration Boundary Value Problem Converge Absolutely Alternating Series Bounded Function Converge Conditionally Alternating Series Remainder Bounded Sequence Convergence Tests Alternating Series Test Bounds of Integration Convergent Sequence Analytic Methods Calculus Convergent Series Annulus Cartesian Form Critical Number Antiderivative of a Function Cavalieri’s Principle Critical Point Approximation by Differentials Center of Mass Formula Critical Value Arc Length of a Curve Centroid Curly d Area below a Curve Chain Rule Curve Area between Curves Comparison Test Curve Sketching Area of an Ellipse Concave Cusp Area of a Parabolic Segment Concave Down Cylindrical Shell Method Area under a Curve Concave Up Decreasing Function Area Using Parametric Equations Conditional Convergence Definite Integral Area Using Polar Coordinates Constant Term Definite Integral Rules Degenerate Divergent Series Function Operations Del Operator e Fundamental Theorem of Calculus Deleted Neighborhood Ellipsoid GLB Derivative End Behavior Global Maximum Derivative of a Power Series Essential Discontinuity Global Minimum Derivative Rules Explicit Differentiation Golden Spiral Difference Quotient Explicit Function Graphic Methods Differentiable Exponential Decay Greatest Lower Bound Differential
- 
												  SubgradientsSubgradients Ryan Tibshirani Convex Optimization 10-725/36-725 Last time: gradient descent Consider the problem min f(x) x n for f convex and differentiable, dom(f) = R . Gradient descent: (0) n choose initial x 2 R , repeat (k) (k−1) (k−1) x = x − tk · rf(x ); k = 1; 2; 3;::: Step sizes tk chosen to be fixed and small, or by backtracking line search If rf Lipschitz, gradient descent has convergence rate O(1/) Downsides: • Requires f differentiable next lecture • Can be slow to converge two lectures from now 2 Outline Today: crucial mathematical underpinnings! • Subgradients • Examples • Subgradient rules • Optimality characterizations 3 Subgradients Remember that for convex and differentiable f, f(y) ≥ f(x) + rf(x)T (y − x) for all x; y I.e., linear approximation always underestimates f n A subgradient of a convex function f at x is any g 2 R such that f(y) ≥ f(x) + gT (y − x) for all y • Always exists • If f differentiable at x, then g = rf(x) uniquely • Actually, same definition works for nonconvex f (however, subgradients need not exist) 4 Examples of subgradients Consider f : R ! R, f(x) = jxj 2.0 1.5 1.0 f(x) 0.5 0.0 −0.5 −2 −1 0 1 2 x • For x 6= 0, unique subgradient g = sign(x) • For x = 0, subgradient g is any element of [−1; 1] 5 n Consider f : R ! R, f(x) = kxk2 f(x) x2 x1 • For x 6= 0, unique subgradient g = x=kxk2 • For x = 0, subgradient g is any element of fz : kzk2 ≤ 1g 6 n Consider f : R ! R, f(x) = kxk1 f(x) x2 x1 • For xi 6= 0, unique ith component gi = sign(xi) • For xi = 0, ith component gi is any element of [−1; 1] 7 n
- 
												  CHAPTER 3: DerivativesCHAPTER 3: Derivatives 3.1: Derivatives, Tangent Lines, and Rates of Change 3.2: Derivative Functions and Differentiability 3.3: Techniques of Differentiation 3.4: Derivatives of Trigonometric Functions 3.5: Differentials and Linearization of Functions 3.6: Chain Rule 3.7: Implicit Differentiation 3.8: Related Rates • Derivatives represent slopes of tangent lines and rates of change (such as velocity). • In this chapter, we will define derivatives and derivative functions using limits. • We will develop short cut techniques for finding derivatives. • Tangent lines correspond to local linear approximations of functions. • Implicit differentiation is a technique used in applied related rates problems. (Section 3.1: Derivatives, Tangent Lines, and Rates of Change) 3.1.1 SECTION 3.1: DERIVATIVES, TANGENT LINES, AND RATES OF CHANGE LEARNING OBJECTIVES • Relate difference quotients to slopes of secant lines and average rates of change. • Know, understand, and apply the Limit Definition of the Derivative at a Point. • Relate derivatives to slopes of tangent lines and instantaneous rates of change. • Relate opposite reciprocals of derivatives to slopes of normal lines. PART A: SECANT LINES • For now, assume that f is a polynomial function of x. (We will relax this assumption in Part B.) Assume that a is a constant. • Temporarily fix an arbitrary real value of x. (By “arbitrary,” we mean that any real value will do). Later, instead of thinking of x as a fixed (or single) value, we will think of it as a “moving” or “varying” variable that can take on different values. The secant line to the graph of f on the interval []a, x , where a < x , is the line that passes through the points a, fa and x, fx.