Derivative Characterization of Constrained Extrema of Functionals

DERIVATIVE CHARACTERIZATION OF CONSTRAINED EXTREMA OF FUNCTIONALS A SURVEY Mohamed El-Hodiri University of Kansas Research Paper No. 34 AUGUST 1970 RESEARCH PAPERS IN THEORETICAL AND APPLIED ECONOMICS DEPARTMENT OF ECONOMICS THE UNIVERSITY OF KANSAS LAURENCE, KANSAS INTRODUCTION One of the basic axioms of economic analysis is the axiom of ration-.: ality, idea,of postulating that economic behavior results from a process of optimization. As it is, at best, difficult to directly test the axiom, economists have been interested in characterization theorems of optirniza- tion theory. In this paper we review the theorems that characterize opti- mality by way of derivatives. First we formulate a very general optimization problem. Then we present characterization theorems for three types of problems: Finite dimensional, variational and problems in linear topological spaces. In each case we present theorems for equality - inequality constraints. The theorems in each case are: first order necessary conditions, first order sufficient conditions, second order necessary conditions and second or~er:,sufficienTy&osditions. The scheme of repre- sentation is as follows: Statements of theorems are followed by remarks. referring the reader to the earliest, known to us, proofs of the theorems. In some instances, slight generalizations of some theorems appear here for the first time, an indication of necessary modifications to existing proofs are indicated. A case is "solved" if proofs for all the four types of characterization theorems exist. The only "unsolved" case is that of problems in linear topological spaces with inequality and with equality - inequality constraints. For this case, we present two conjectures about second order conditions that are analcgousto the equality constraint case. 2 We now state the "general" optimization problem: Let A,B,C,D be real linear topological spaces. Consider the functions : Let P be a partial ordering of By let >- be defined on C - as usual - by way of a convex cone and let el e2 e3 and e4 be the neutral element of 33' addition - zero elements - of A,B,c and D respectively. The problem may (1) be stated as follows .- A a Find E A such that f (xf is P-maximal (2) subject to: In case B is the real line, P is the relation ">"- defined for real numbers and we have a problem of scaler optimization. In case B is finite dimensional we have..a finite-vector maximization problem. In general, characterizations of solutions to finite-vector maximization problems may be derived from characterizations of solutions of scaler maximization problems. We shall restrict our presentation to scaler maximization prob3ems. As an application of scaler maximization theorems we may "solve" a part- ticular finite-vector maximization problem, namely for the case where P is taken to be a aret to'^) ordering of B. FQ' infinite-vector maximization problems, a scalerization theorem appears to be the most appropriate inter- 3 mediate step to derive characterization theorems. Such a theorem was proved by Hurwicz [20]. The method, for finite-vector m3ximization, con- sists(') of simply observing that the problem is equivalent to an infinite number of scaler maximization problems. However, we shall only be concerned with scaler maximization problems. 2. FINITE DIMENSIONAL PROBLEMS In this part we let A = E", B = E', C = E~ and D = E', where E", E~,Em II. E Euclidian spaces of dimensions n, one, m and R respectively. For theis part we reformulate the scaler maximization problem as follows: Problem 1. Find x E E~ such that f (x) >- f for all satisfying B ga(x) = 0, a = 1,..., m, h (x) 20,- B =1,..., R . .- We further state some properties of the constraints that will be used in discussing the theorems in this section. Some of the nomenclature, designated by " " is Karush's C231. 0 D.1) Definition. Effective constraints. 'Let xbe a point that satisfies hB 0 B (x) >- 0, = 1,.. , m. Let r (R) be the set of indices B such that h ($1 = 0. The constraints with indices B E r i:) 'kill be referred to as the 0 effective constraint6 at x. D.2) Definition. "Admissible Diyectiop'! Let h be differentiable. We say that X is an admissible direction if X is a non-tridal. solution to the inequalities: 0 D.3) Definition. "A curve issuing from x in the direction X." By that we mean,. an n-vector valued function, §(t), of a real variable t such that d E(0) = $ and 5 '(0) = - = A'. dt S(t) Itzo 0 D.4) Definition. admissible arc issuing from x in the direction A" is 0 an arc issuing from x in the direction X such that h (S(t)) >- 0. 0 D.5) Definition. "Property Q" -for inequality constraints, is satisfied at x. iff: For each admissible direction A, there exists an admissible arc issuing from 8. D.6) Definition. The rank condition for inequality constraints. We say that the rank condition h is satisfied at $ iff 1.) is differentiable and 2) the B rank of the matrix cR. I, f3 E r ($) , equals the number of effective con- 1 straints. D.7) Definition. The rank condition for equality_- inequality constraints is -- - -- 0 satisfied at x iff 1) the functions g and h are differentiable and 2) the 0 ag" 9 constraints at x, where EOL.i = -ax. I x =x: L .> ,.: . ..) D.8) Definition. The rank condition for equality constraints is satisfied at if the rank of the matrix is a. [E?I1 Finally wa define local and global constraint maxima: D.9) Definition. x is said to be a local solution to problem 1. iff: A ,. ,. There exists a neighborhood of x, N(,x), such that f(x) >- f(x) for all ,. ? r N(;) satisfying g(*) = 0 and h(x) 2 0. ,. ,. ,. D.lO) Definition. x is said to -.be a global solution to.problem 1. iff f(x) 3 f(x) for all x satisfying g(x) = 0 and h(x) > 0. 2. 1. First order necessary conditions. +. Theorem -1. If f, g -and h -are contjnuously differentiable and if x --is a global solution to problem 1, then there exists a vector (A0' v,p) = (Ao, 1 m L v v ,e ,pa) 0 such that, 1 > 0 and ; ,. , ,. 0 - 1, eB>0,- ,OhB (;) =o.. 2) Fx- 0 = 0, where F0 = X f + w. g + p. h, Fx0 is the vector of partial 0 A derivatives of F with respect to the component's of x evaluated at k = x. n For the case of equality constraints, i .e. the set {x I h (x) 0 )=E , (6) Theorem 1 was proved by ~aratheodor~(~)C93 and Bliss C71. For the case n of inequality constraints, i.e. the set {x I g (y) = 0) = E , theorem 1 - except for the non-negativity of p in condition 1 of the conclusion - was proved by ~arush'~)[231. The non-negativity of p may be proved by using a separation theorem (8 . This was, essentially, the crux of Fritz John's [22] proof who has treated a more general problem of an infinite number of inequality constraints. For the equality-inequality constraints the proof involves writing equality constraints as inequality constraints (g lo and - g (x) >- 0) and applying Karush's or Fritz Jone's theorem. Theorem 2. -If, in addition to the assumptions of theorem 1 we have either.(9) . a) Property Q for inequality constraints, D. 4, and the rank condition for .. .. equality constraints, D. 7, are satisfied at x. I a ) --The rank condition ~-for .. equality -inequality constraints is satisfied A -at x. Then the conclusions of theorem 1 follow with X 0. ---L--I)- For equality constraints the theorem was proved as a corollary of theorem 1 by ~liss(lO' [7]. For the case of inequality constraints the theorem was proved by Karush (I1) C231. The proof, for equality-inequality constraints, may be accomplished by converting inequality constraints to equality constraints and obtaining the theorem as a corollary to theorem 1. This was presented by Pennisi (13) C271. A direct proof was presenied by Hestenes (l" [l9]. 2.2. First Order Sufficient Conditions. Theorem 3. If: 1) f. E and h are differentiable 2) The conclusions of A ,. A theorem 1 hold with X_ > J at a point x with g(x) = 0, h (x) > 0 3) " . Either (3.a) FO, of theorem 1, is concave or (3.b) G! is linear and h is ,. concave. Then x is a rloh31 solution to ~roblem1. The theorem follows from the fact that a concave function lies above its tangent plane. The implications of this fact was utilized by Kuhn- Tucker C241 in the proof of their equivalence theorem. The present theorem may be proved by applying Lemma 3 of Kuhn and Tucker C241 to FO. 2.3 Second Order Necessary Conditicns. Theorem 4. If: 1) --f, g and h have second order continuous partial de- A rivatives. 2: x is a solutLon-- to problem 1. 3) The rank condition for equality - inequality ccastraints , 9.5. I.... -- 5 .I--- . ..-!:. 1 m 1 R Thenthere exis-? multipliers ( v,p) = (v ,. :, v ; LI ,. , LI ) such that: For the equality constraints, thc! theorem was proved by Caratheodory (15) [9] and ~Liss(~~)['7].For inequality constraints the theorem was proved by Karush (17) [231. For the equality-inequality case, the theorem was (19) proved by Pennisi (lR) [27], under the, dispensible , stipulation that the number 02 no~l-~sromultipliers attached to effective inequality constraints is at most one. 2.4. Second order sxffi,.ent conditions. Theorem 5,. If: l? f, g c7d h have continuoas second order derivatives.

Derivative Characterization of Constrained Extrema of Functionals

Some History of the Conjugate Gradient and Lanczos Algorithms: 1948-1976

William Karush and the KKT Theorem

NBS-INA-The Institute for Numerical Analysis

Pi Mu Epsilon Journal the Official Publication of the Honorary Mathematical Fraternity

NBS-INA-The Institute for Numerical Analysis

Conjugate Gradient and Lancz'os Algorithmsi

Perspectives on American Mathematics

Program of the Sessions San Diego, California, January 10–13, 2018

Engineering Optimization: an Introduction with Metaheuristic Applications

Some History of Conjugate Gradients and Other Krylov Subspace Methods October 2009 SIAM Applied Linear Algebra Meeting 2009 Dianne P

Engineering Optimization