Derivative Characterization of Constrained Extrema of Functionals
Total Page:16
File Type:pdf, Size:1020Kb
DERIVATIVE CHARACTERIZATION OF CONSTRAINED EXTREMA OF FUNCTIONALS A SURVEY Mohamed El-Hodiri University of Kansas Research Paper No. 34 AUGUST 1970 RESEARCH PAPERS IN THEORETICAL AND APPLIED ECONOMICS DEPARTMENT OF ECONOMICS THE UNIVERSITY OF KANSAS LAURENCE, KANSAS INTRODUCTION One of the basic axioms of economic analysis is the axiom of ration-.: ality, idea,of postulating that economic behavior results from a process of optimization. As it is, at best, difficult to directly test the axiom, economists have been interested in characterization theorems of optirniza- tion theory. In this paper we review the theorems that characterize opti- mality by way of derivatives. First we formulate a very general optimiza- tion problem. Then we present characterization theorems for three types of problems: Finite dimensional, variational and problems in linear topo- logical spaces. In each case we present theorems for equality - inequal- ity constraints. The theorems in each case are: first order necessary conditions, first order sufficient conditions, second order necessary conditions and second or~er:,sufficienTy&osditions. The scheme of repre- sentation is as follows: Statements of theorems are followed by remarks. referring the reader to the earliest, known to us, proofs of the theorems. In some instances, slight generalizations of some theorems appear here for the first time, an indication of necessary modifications to existing proofs are indicated. A case is "solved" if proofs for all the four types of characterization theorems exist. The only "unsolved" case is that of problems in linear topological spaces with inequality and with equality - inequality constraints. For this case, we present two conjectures about second order conditions that are analcgousto the equality constraint case. 2 We now state the "general" optimization problem: Let A,B,C,D be real linear topological spaces. Consider the func- tions : Let P be a partial ordering of By let >- be defined on C - as usual - by way of a convex cone and let el e2 e3 and e4 be the neutral element of 33' addition - zero elements - of A,B,c and D respectively. The problem may (1) be stated as follows .- A a Find E A such that f (xf is P-maximal (2) subject to: In case B is the real line, P is the relation ">"- defined for real numbers and we have a problem of scaler optimization. In case B is finite dimen- sional we have..a finite-vector maximization problem. In general, charac- terizations of solutions to finite-vector maximization problems may be derived from characterizations of solutions of scaler maximization prob- lems. We shall restrict our presentation to scaler maximization prob3ems. As an application of scaler maximization theorems we may "solve" a part- ticular finite-vector maximization problem, namely for the case where P is taken to be a aret to'^) ordering of B. FQ' infinite-vector maximization problems, a scalerization theorem appears to be the most appropriate inter- 3 mediate step to derive characterization theorems. Such a theorem was proved by Hurwicz [20]. The method, for finite-vector m3ximization, con- sists(') of simply observing that the problem is equivalent to an in- finite number of scaler maximization problems. However, we shall only be concerned with scaler maximization problems. 2. FINITE DIMENSIONAL PROBLEMS In this part we let A = E", B = E', C = E~ and D = E', where E", E~,Em II. E Euclidian spaces of dimensions n, one, m and R respectively. For theis part we reformulate the scaler maximization problem as follows: Problem 1. Find x E E~ such that f (x) >- f for all satisfying B ga(x) = 0, a = 1,..., m, h (x) 20,- B =1,..., R . .- We further state some properties of the constraints that will be used in discussing the theorems in this section. Some of the nomenclature, designated by " " is Karush's C231. 0 D.1) Definition. Effective constraints. 'Let xbe a point that satisfies hB 0 B (x) >- 0, = 1,.. , m. Let r (R) be the set of indices B such that h ($1 = 0. The constraints with indices B E r i:) 'kill be referred to as the 0 effective constraint6 at x. D.2) Definition. "Admissible Diyectiop'! Let h be differentiable. We say that X is an admissible direction if X is a non-tridal. solution to the inequalities: 0 D.3) Definition. "A curve issuing from x in the direction X." By that we mean,. an n-vector valued function, §(t), of a real variable t such that d E(0) = $ and 5 '(0) = - = A'. dt S(t) Itzo 0 D.4) Definition. admissible arc issuing from x in the direction A" is 0 an arc issuing from x in the direction X such that h (S(t)) >- 0. 0 D.5) Definition. "Property Q" -for inequality constraints, is satisfied at x. iff: For each admissible direction A, there exists an admissible arc issuing from 8. D.6) Definition. The rank condition for inequality constraints. We say that the rank condition h is satisfied at $ iff 1.) is differentiable and 2) the B rank of the matrix cR. I, f3 E r ($) , equals the number of effective con- 1 straints. D.7) Definition. The rank condition for equality_- inequality constraints is -- - -- 0 satisfied at x iff 1) the functions g and h are differentiable and 2) the 0 ag" 9 constraints at x, where EOL.i = -ax. I x =x: L .> ,.: . ..) D.8) Definition. The rank condition for equality constraints is satis- fied at if the rank of the matrix is a. [E?I1 Finally wa define local and global constraint maxima: D.9) Definition. x is said to be a local solution to problem 1. iff: A ,. ,. There exists a neighborhood of x, N(,x), such that f(x) >- f(x) for all ,. ? r N(;) satisfying g(*) = 0 and h(x) 2 0. ,. ,. ,. D.lO) Definition. x is said to -.be a global solution to.problem 1. iff f(x) 3 f(x) for all x satisfying g(x) = 0 and h(x) > 0. 2. 1. First order necessary conditions. +. Theorem -1. If f, g -and h -are contjnuously differentiable and if x --is a global solution to problem 1, then there exists a vector (A0' v,p) = (Ao, 1 m L v v ,e ,pa) 0 such that, 1 > 0 and ; ,. , ,. 0 - 1, eB>0,- ,OhB (;) =o.. 2) Fx- 0 = 0, where F0 = X f + w. g + p. h, Fx0 is the vector of partial 0 A derivatives of F with respect to the component's of x evaluated at k = x. n For the case of equality constraints, i .e. the set {x I h (x) 0 )=E , (6) Theorem 1 was proved by ~aratheodor~(~)C93 and Bliss C71. For the case n of inequality constraints, i.e. the set {x I g (y) = 0) = E , theorem 1 - except for the non-negativity of p in condition 1 of the conclusion - was proved by ~arush'~)[231. The non-negativity of p may be proved by using a separation theorem (8 . This was, essentially, the crux of Fritz John's [22] proof who has treated a more general problem of an infinite number of inequality constraints. For the equality-inequality constraints the proof involves writing equality constraints as inequality constraints (g lo and - g (x) >- 0) and applying Karush's or Fritz Jone's theorem. Theorem 2. -If, in addition to the assumptions of theorem 1 we have either.(9) . a) Property Q for inequality constraints, D. 4, and the rank condition for .. .. equality constraints, D. 7, are satisfied at x. I a ) --The rank condition ~-for .. equality -inequality constraints is satisfied A -at x. Then the conclusions of theorem 1 follow with X 0. ---L--I)- For equality constraints the theorem was proved as a corollary of theorem 1 by ~liss(lO' [7]. For the case of inequality constraints the theorem was proved by Karush (I1) C231. The proof, for equality-inequality constraints, may be accomplished by converting inequality constraints to equality constraints and obtaining the theorem as a corollary to theorem 1. This was presented by Pennisi (13) C271. A direct proof was presenied by Hestenes (l" [l9]. 2.2. First Order Sufficient Conditions. Theorem 3. If: 1) f. E and h are differentiable 2) The conclusions of A ,. A theorem 1 hold with X_ > J at a point x with g(x) = 0, h (x) > 0 3) " . Either (3.a) FO, of theorem 1, is concave or (3.b) G! is linear and h is ,. concave. Then x is a rloh31 solution to ~roblem1. The theorem follows from the fact that a concave function lies above its tangent plane. The implications of this fact was utilized by Kuhn- Tucker C241 in the proof of their equivalence theorem. The present the- orem may be proved by applying Lemma 3 of Kuhn and Tucker C241 to FO. 2.3 Second Order Necessary Conditicns. Theorem 4. If: 1) --f, g and h have second order continuous partial de- A rivatives. 2: x is a solutLon-- to problem 1. 3) The rank condition for equality - inequality ccastraints , 9.5. I.... -- 5 .I--- . ..-!:. 1 m 1 R Thenthere exis-? multipliers ( v,p) = (v ,. :, v ; LI ,. , LI ) such that: For the equality constraints, thc! theorem was proved by Caratheodory (15) [9] and ~Liss(~~)['7].For inequality constraints the theorem was proved by Karush (17) [231. For the equality-inequality case, the theorem was (19) proved by Pennisi (lR) [27], under the, dispensible , stipulation that the number 02 no~l-~sromultipliers attached to effective inequality con- straints is at most one. 2.4. Second order sxffi,.ent conditions. Theorem 5,. If: l? f, g c7d h have continuoas second order derivatives.