Constrained Nonlinear Programming

149 CONSTRAINED NONLINEAR PROGRAMMING We now turn to methods for general constrained nonlinear programming. These may be broadly classified into two categories: 1. TRANSFORMATION METHODS: In this approach the constrained nonlinear program is transformed into an unconstrained problem (or more commonly, a series of unconstrained problems) and solved using one (or some variant) of the algorithms we have already seen. Transformation methods may further be classified as (1) Exterior Penalty Function Methods (2) Interior Penalty (or Barrier) Function Methods (3) Augmented Lagrangian (or Multiplier) Methods 2. PRIMAL METHODS: These are methods that work directly on the original problem. We shall look at each of these in turn; we first start with transformation methods. 150 Exterior Penalty Function Methods These methods generate a sequence of infeasible points whose limit is an optimal solution to the original problem. The original constrained problem is transformed into an unconstrained one (or a series of unconstrained problems).The constraints are incorporated into the objective by means of a "penalty parameter" which penalizes any constraint violation. Consider the problem Min f(x) n st gj(x) ≤ 0, j=1,2,…,m, hj(x) = 0, j=1,2,…,p, x∈R Define a penalty function p as follows: ( ) = Max{0, ( )} + ( ) ℎ where α is some positive integer. It is easy to see that If gj(x) ≤ 0 then Max{0, gj(x)} = 0 If gj(x) > 0 then Max{0, gj(x)} = gj(x) (violation) Now define the auxiliary function as P(x)= f(x) + µ (x),where µ >> 0. Thus if a constraint is violated, (x)>0 and a "penalty" of µ (x) is incurred. 151 The (UNCONSTRAINED) problem now is: Minimize P(x)= f(x) + µ (x) st x∈Rn EXAMPLE: Minimize f(x)=x, st g(x)=2-x ≤ 0, x ∈R (Note that minimum is at x* =2) Let (x)= Max[{0,g(x)}]2 , ) if < 2 i. e. , ( ) = 0 if (2 − If (x)=0, then the optimal solution to Minimize ( ) +≥ 2( ) is at x*= -∞ and this is infeasible; so ( ) + ( ) = + ) Minimize P( ) = ( ) + ( ) has(optimum2 − solution . ∗ 1 = 2 − : As (= ) 2 ∗ → ∞, 2 − → 2 f+µ2p µ1=0.5 µ1=0.5 µ2p µ2=1.5 µ2=1.5 p f+µ1p µ1p -3 -2 -1 0 1 2 -1 0 1 2 3 Penalty Function Auxiliary Function 152 In general: 1. We convert the constrained problem to an unconstrained one by using the penalty function. 2. The solution to the unconstrained problem can be made arbitrarily close to that of the original one by choosing sufficiently large µ. In practice, if µ is very large, too much emphasis is placed on feasibility. Often, algorithms for unconstrained optimization would stop prematurely because the step sizes required for improvement become very small. Usually, we solve a sequence of problems with successively increasing µ values; the optimal point from one iteration becomes the starting point for the next problem. PROCEDURE: Choose the following initially: 1. a tolerance ε, 2. an “increase” factor β, 3. a starting point x1, and 4. an initial µ1. At Iteration i n i 1. Solve the problem Minimize f(x)+µip(x); st x∈X (usually R ). Use x as the starting point and let the optimal solution be xi+1 i+1 2. If µip(x ) ≤ ε then STOP; else let µi+1 =(µi)β and start iteration (i+1) 153 2 2 EXAMPLE: Minimize f(x) = x1 + 2x2 2 st g(x) = 1–x1 –x2 ≤ 0; x∈R Define the penalty function ) if ( ) > 0 ( ) = 0 if ( ) (1 − − The unconstrained problem is ≤ 0 2 2 Minimize x1 + 2x2 + µ (x) If (x)=0, then the optimal solution is x* =(0,0) INFEASIBLE! 2 2 2 2 ∴ (x) = (1–x1 –x2) , ⇒ P(x) = x1 + 2x2 + µ [(1–x1 –x2) ], and the necessary conditions for the optimal solution (∇P(x)=0) yield the following: P P = 2 + 2 )( ) = 0, = 4 + 2 )( ) = 0 (1 − − −1 (1 − − −1 2 Thus = and = (2 + 3 ) (2 + 3 ) ∗ ∗ Starting with µ =0.1, β=10 and x1=(0,0) and using a tolerance of 0.001 (say), we have the following: i+1 i+1 i+1 Iter. (i) µi x g(x ) µip(x ) 1 0.1 (0.087, 0.043) 0.87 0.0757 2 1.0 (0.4, 0.2) 0.40 0.16 3 10 (0.625, 0.3125) 0.0625 0.039 4 100 (0.6622, 0.3311) 0.0067 0.00449 5 1000 (0.666, 0.333) 0.001 0.001 Thus the optimal solution is x*= (2/3, 1/3). 154 INTERIOR PENALTY (BARRIER) FUNCTION METHODS These methods also transform the original problem into an unconstrained one; however the barrier functions prevent the current solution from ever leaving the feasible region. These require that the interior of the feasible sets be nonempty, which is impossible if equality constraints are present. Therefore, they are used with problems having only inequality constraints. A barrier function B is one that is continuous and nonnegative over the interior of {x | g(x)≤0}, i.e., over the set {x | g(x)<0}, and approaches ∞ as the boundary is approached from the interior. Let ( ) ≥ 0 if y<0 and lim ( ) → = ∞ Then ( ) = [ ( )] Usually, ( ) or ( ) ( )) ( ) = − ∑ = − ∑ (− In both cases, note that lim ( ) ( ) → = ∞ The auxiliary function is now f(x) + µB(x) where µ is a SMALL positive number. 155 Q. Why should µ be SMALL? A. Ideally we would like B(x)=0 if gj(x)<0 and B(x)=∞ if gj(x)=0, so that we never leave the region{x | g(x) ≤ 0}. However, B(x) is now discontinuous. This causes serious computational problems during the unconstrained optimization. Consider the same problem as the one we looked at with (exterior) penalty functions: Minimize f(x) = x , st –x+2≤0 Define the barrier function B(x)= -1/(-x+2) So the auxiliary function is f(x) + µB(x) = x - µ/(-x+2), and has its optimum solution at x*=2 + , and as µ→0, x*→2. 8 √ 12 µ1=0.5 10 f+µ2B 6 µ1=0.5 µ2B 8 µ2=1.5 4 µ2=1.5 6 B 4 2 2 f+µ1B 0 µ1B 0 2 3 4 5 6 7 8 2 3 4 5 6 7 8 Barrier Function Auxiliary Function Similar to exterior penalty functions we don't just choose one small value for rather we start with some µ1 and generate a sequence of points. 156 PROCEDURE: Initially choose a tolerance ε, a “decrease” factor β, an interior 1 starting point x , and an initial µ1. At Iteration i n i 1. Solve the problem Minimize f(x)+µiB(x),st x∈X (usually R ). Use x as the starting point and let the optimal solution be xi+1 i+1 2. If µiB(x ) < ε then STOP; else let µi+1 =(µi)β and start iteration (i+1) Consider the previous example once again… 2 2 EXAMPLE: Minimize f(x) = x1 + 2x2 2 st g(x) = 1–x1 –x2 ≤ 0; x∈R Define the barrier function ( ) ( ) ( + The unconstrained problem = is− − = − − 1) 2 2 2 2 Minimize x1 + 2x2 + µ (x) = x1 + 2x2 - µ ( + The necessary conditions for the optimal solution (∇f( x)=0) yield− 1 the) following: = 2 /( + = 4 /( + − − 1) = 0, − − 1) = 0 157 1 ± 1 + 3 1 ± 1 + 3 Solving, we get = and = 3 6 Since the negative signs lead to infeasibility, we = and = ∗ ∗ Starting with µ =1, β=0.1 and x1=(0,0) and using a tolerance of 0.005 (say), we have the following: i+1 i+1 i+1 Iter. (i) µi x g(x ) µiB(x ) 1 1 (1.0, 0.5) -0.5 0.693 2 0.1 (0.714, 0.357) -0.071 0.265 3 0.01 (0.672, 0.336) -0.008 0.048 4 0.001 (0.6672, 0.3336) -0.0008 0.0070 5 0.0001 (0.6666, 0.3333) -0.0001 0.0009 Thus the optimal solution is x*= (2/3, 1/3). 158 Penalty Function Methods and Lagrange Multipliers Consider the penalty function approach to the problem below Minimize f(x) st gj(x) ≤ 0; j = 1,2,…,m k n hj(x) = 0; j = m+1,m+2,…,l x ∈R . Suppose we use the usual interior penalty function described earlier, with p=2. The auxiliary function that we minimize is then given by 2 2 P(x) = f(x) + µp(x) = f(x) + µ{∑j[Max(0, gj(x))] + ∑j[hj(x)] } 2 2 = f(x) + {∑jµ[Max(0, gj(x))] + ∑jµ[hj(x)] } The necessary condition for this to have a minimum is that ∇P(x) = ∇f(x) + µ∇p(x) = 0, i.e., ∇f(x) + ∑j2µ[Max(0, gj(x)]∇gj(x) + ∑j2µ[hj(x)]∇hj(x) = 0 (1) k Suppose that the solution to (1) for a fixed µ (say µk>0) is given by x . Let us also designate k 2µk[Max{0, gj(x )}] = λj(µk); j=1,2,…,m (2) k 2µk[hj(x )] = λj(µk); j=m+1,…,l (3) so that for µ=µk we may rewrite (1) as 159 ∇f(x) + ∑jλj(µk) ∇gj(x) + ∑jλj(µk) ∇hj(x) = 0 (4) Now consider the Lagrangian for the original problem: L(x,λ) = f(x) + ∑jλjgj(x) + ∑jλjhj(x) The usual K-K-T necessary conditions yield ∇f(x) + ∑j=1..m {λj∇gj(x)} + ∑j=m+1..l {λj∇hj(x)} = 0 (5) plus the original constraints, complementary slackness for the inequality constraints and λj≥0 for j=1,…,m. Comparing (4) and (5) we can see that when we minimize the auxiliary function using µ=µk, the λj(µk) values given by (2) and (3) estimate the Lagrange multipliers in (5).

Constrained Nonlinear Programming

Lecture 3 1 Geometry of Linear Programs

Extreme Points and Basic Solutions

The Feasible Region for Consecutive Patterns of Permutations Is a Cycle Polytope

Linear Programming

Math 112 Review for Exam Ii (Ws 12 -18)

The Feasible Region for Consecutive Patterns of Permutations Is a Cycle Polytope

Mathematical Modelling and Applications of Particle Swarm Optimization

Lesson . Geometry and Algebra of “Corner Points”

Geometry and Visualizations of Linear Programs (PDF)

B1 Optimization – Solutions

Load Feasible Region Determination by Using Adaptive Particle Swarm Optimization

Modeling with Nonlinear Programming