0.1 Tangent Spaces and Lagrange Multipliers

0.1. TANGENT SPACES AND LAGRANGE MULTIPLIERS 1 0.1 Tangent Spaces and Lagrange Multipliers ~ n+k k If a di®erentiable function G = (G1;:::;Gk): E ! E then the surface S de¯ned by S = f~x j G~ (~x) = ~vg is called the level surface for G~ (~x) = ~v. Note that each of the functions n+k Gi : E ! R. If we denote by Si the level surface for the equation Tk Gi(~x) = vi, then S = i=1 Si. 0 0 0 ~ 0 0 n+k k Suppose that ~x = (x1; : : : ; xn+k) 2 S and that G (~x ) 2 L(E ; E ) has rank k. Let ±i;j = 1 if i = j and 0 if i 6= j. With respect to the standard basis f~ej = (±1;j; : : : ; ±n+k;j) j j = 1; 2; : : : ; n + kg for En+k and the analogous smaller basis for Ek, we note that the matrix ~ 0 0 [G (~x )]k£(n+k) has its whole set of k row vectors linearly independent, and 0 0 0 these row vectors are the gradient vectors rG1(~x ); rG2(~x );:::; rGk(~x ). Let Á~ : R ! S be a di®erentiable function for which Á~(0) = ~x0. Then we call the vector ~v = Á~0(0) a tangent vector to S at ~x0. 0 De¯nition 0.1.1. The tangent space T~x0 (S) at ~x 2 S is the set of all tangent vectors to S at ~x0. The translate 0 0 ~x + T~x0 = f~x + ~v j ~v 2 T~x0 (S)g is called the tangent plane to the surface S, with point of tangency at ~x0. A translate ~a + V = f~a + ~x j ~x 2 V g of a vector subspace V of En is called an a±ne subspace of En. An a±ne subspace is a vector subspace if and only if ~a 2 V . (See Exercise 1.) Theorem 0.1.1. Let G~ : En+k ! Ek be a di®erentiable function. Let ~x0 2 S = f~x j G~ (~x) = ~vg: ~ 0 0 Suppose G (~x ) has rank k. Then the tangent space T~x0 (S) is the vector subspace ¡ 0 0 ¢? T~x0 (S) = spanRfrG1(~x );:::; rGk(~x )g of dimension n. In words, T~x0 (S) is the orthogonal complement of the span 0 0 of the k gradient vectors rG1(~x );:::; rGk(~x ). 2 ~0 ~ Proof. Suppose ¯rst that ~v = Á (0) 2 T~x0 (S). This implies that Á maps 0 into each level surface Gi(~x) = vi. We will show that ~v ? rGi(~x ) for ~ each i = 1; : : : ; k. In fact, Gi(Á(t)) ´ vi, a real constant. We di®erentiate 0 ~ ~0 using the Chain Rule to ¯nd that Gi(Á(0))Á (0) = 0. In terms of the matrix 0 ~0 representation of the left side of the latter equation, we have rGi(~x )¢Á (0) = 0 0 ? 0, so that ~v ? rGi(~x ). This shows that T~x0 (S) ⊆ rGi(~x ) for each i. This implies that ¡ 0 0 ¢? T~x0 (S) ⊆ spanRfrG1(~x );:::; rGk(~x )g : ~ 0 0 The hypothesis that rank(G (~x )) = k implies that dimT~x0 (S) · n. If we can show that the tangent space is at least n-dimensional, then it will have to be the entire orthogonal complement of the span of the gradient vectors as claimed. Thus it will su±ce to produce a linearly independent set of n vectors in the tangent space. Because the rank of a matrix is also the number of linearly independent column vectors, it follows that the matrix [G~ 0(~x0)] has k independent columns. We can rearrange the order of the n elements of the standard basis of En+k to arrange that the ¯rst k columns are linearly independent. By the Implicit Function Theorem, there exists an open set U ½ Ek containing 0 0 n 0 0 (x1; : : : ; xk) and an open set V ½ E containing (xk+1; : : : ; xk+n) such that there are unique di®erentiable functions x1 = Ã1(xk+1; : : : ; xk+n) . xk = Ãk(xk+1; : : : ; xk+n) solving the equation ~ G(Ã1(xk+1; : : : ; xk+n);:::;Ãk(xk+1; : : : ; xk+n); xk+1; : : : ; xk+n) = ~v: Next we de¯ne n di®erentiable curves on S by the equations ~ 0 0 0 0 0 0 0 0 0 Á1(t) = (Ã1(xk+1 + t; xk+2; : : : ; xk+n);:::;Ãk(xk+1 + t; xk+2; : : : ; xk+n); xk+1 + t; xk+2; : : : ; xk+n) . ~ 0 0 0 0 0 0 0 0 0 Án(t) = (Ã1(xk+1; : : : ; xn+k¡1; xk+n + t);:::;Ãk(xk+1; : : : ; xk+n¡1; xk+n + t); xk+1; : : : xk+n¡1; xk+n + t) ~0 In comparing the vectors Ái(0) for i = 1; : : : ; n, observe that for each of these vectors the ¯nal n entries are all 0 except for a single entry which is 1. The location of the 1 is di®erent for each of these vectors. Thus the n vectors are independent and the theorem is proved. 0.1. TANGENT SPACES AND LAGRANGE MULTIPLIERS 3 Corollary 0.1.1. Let G~ : Ek+n ! Ek be a di®erentiable function and let ~x0 2 S = f~x j G~ (~x) = ~vg: Suppose ~x0 is a local extreme point of a di®erentiable function f : S ! R ~ 0 0 and that G (~x ) has rank k. Then there exist numbers ¸1; : : : ; ¸k such that 0 0 0 rf(~x ) = ¸1rG1(~x ) + ¢ ¢ ¢ + ¸krGk(~x ) (1) The numbers ¸1; : : : ; ¸k are called Lagrange multipliers. Proof. If Á~ : R ! S is a di®erentiable curve on S with Á~(0) = ~x0, let Ã(t) = f(Á~(t)). Since this function has an extreme point at 0, we have Ã0(0) = rf(Á~(0)) ¢ Á~0(0) = 0: It follows from Theorem 0.1.1 that rf(~x0) is orthogonal to the tangent space 0 T~x0 (S). Since the co-dimension of T~x0 (S) is k, it follows that rf(~x ) lies in 0 0 the span of the k vectors rG1(~x );:::; rGk(~x ). This proves the corollary. The method of Lagrange multipliers permits an optimization problem to be replaced by a problem of solving a system of equations. From the k + n components of the vectors in Equation 1, we obtain a system of k + n equations in the n+2k unknowns x1; : : : ; xk+n; ¸1; : : : ; ¸k. We get k additional equations from the k components of the equation G~ (~x) = ~v. Thus we obtain a system of n+2k equations in n+2k unknowns. Although we have replaced a calculus problem with an algebraic problem, the algebraic problem can be challenging. Nevertheless, the method of Lagrange multipliers is a powerful tool for optimization problems. Example 0.1.1. We will begin with a three-dimensional example. Consider the surface S de¯ned by the equation x4 +y4 +z4 = 1 in E3, shown in Figure 1. We will ¯nd both the maximum and the minimum values of the function f(~x) = x2+y2+z2 on S. (In e®ect, we are determining the closest and furthest distances from the origin on S.) In this example, we denote ~x = (x; y; z). Observe that if we de¯ne G(~x) = x4 + y4 + z4 then S = G¡1(f1g). Hence S is closed because G is continuous. S is also bounded. (Why?) Hence the function f must achieve both a maximum and a minimum value somewhere on S. Since S is smooth at all points and since rG is non-vanishing on S, 4 1 y 0.5 0 -0.5 -1 1 0.5 z 0 -0.5 -1 -1 -0.5 0 x 0.5 1 Figure 1: x4 + y4 + z4 = 1 the extreme points must occur at those points for which rf(~x) = ¸rG(~x). This yields the following system of equations. x(1 ¡ 2¸x2) = 0 y(1 ¡ 2¸y2) = 0 z(1 ¡ 2¸z2) = 0 x4 + y4 + z4 = 1 The reader should check the following by making the necessary calculations. ² If none of the three variables is zero, then x2 = y2 = z2 = 1 showing p p 2¸ 3 that ¸ = § 2 . This implies that f(x; y; z) = 3. ² If exactly one of the three variables is zero, then atp a point satisfying the system of equations we must have f(x; y; z) = 2. ² If exactly two of the variables are zero, then at a point satisfying the system we must have f(x; y; z) = 1. 0.1. TANGENT SPACES AND LAGRANGE MULTIPLIERS 5 p It follows that the maximum value of f on S is 3. But the reader should be able to explain why at least one of the variables must be non-zero. Thus the minimum value is 1. There is also an easy way to explain even from the outset why f(x; y; z) ¸ 1 everywhere on S. Exercises 0.1. 0 n 1. Prove that the tangent plane ~x + T~x0 (S) is a vector subspace of E if 0 and only if ~x 2 T~x0 (S). 2. Describe both the tangent space and the tangent³ plane to the´ sphere n¡1 n 0 p1 p1 p1 S = f~x 2 E j k~xk = 1g at the point ~x = n ; n ;:::; n . 3. The sphere S3 ½ E4 is de¯ned by ( ) X4 3 2 S = ~x j xi = 1: i=1 3 P4 De¯ne f : S ! R by f(~x) = i=1 aixi where ai is a constant for each i 2 f1; 2; 3; 4g.

0.1 Tangent Spaces and Lagrange Multipliers

Details

Download

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

Support