Lecture 7 Gradient and Directional Derivative (Cont'd)

Total Page:16

File Type:pdf, Size:1020Kb

Lecture 7 Gradient and Directional Derivative (Cont'd) Lecture 7 Gradient and directional derivative (cont’d) In the previous lecture, we showed that the rate of change of a function f(x,y) in the direction of a vector u, called the directional derivative of f at a in the direction uˆ, is simply the dot product of the gradient vector ∇~ f(a) with the unit direction vector uˆ: ∂f ∂f Duf(a)= ∇~ f(a) · uˆ = (a)u + (a)u . (1) ˆ ∂x 1 ∂y 2 The gradient vector ∇~ f(a) contains all the information necessary to compute the directional derivative of f at a in any direction. We then considered the “hotplate temperature function” f(x,y) = 50 − x2 − 2y2 and computed the rate of change of temperature at the reference point (1, −1) – the location of an ant – in several directions. We found that the direction u = (1, −1) was a good direction if the ant wanted to cool itself, but the question remained: Is it the best direction? In order to answer this question, we should return to Eq. (1). Let’s rewrite the dot product in Eq. (1) as follows, Duˆ f(a) = k ∇~ f(a) kk uˆ k cos θ (2) = k ∇~ f(a) k cos θ, where θ is the angle between the unit vector uˆ and ∇~ f(a). We have expressed the directional derivative Duˆ f(a) in terms of the magnitude of the gradient vector ∇~ f(a) evaluated at a and the angle between the gradient vector and the direction vector uˆ. We have all that we need. The function cos θ can assume all values between 1 and −1. Its maximum value 1 corresponds to θ = 0. Its minimum value −1 corresponds to θ = π. It assumes the value of 0 at θ = ±π/2. This leads to the following three special cases: 1. θ = 0. Then uˆ points in the direction of ∇~ f(a). In this case, the rate of change Duˆ f assumes its maximum (or most positive) value, k ∇~ f(a) k≥ 0. This is the direction of steepest ascent of f at (a, b). 45 2. θ = π. Then uˆ points in the direction of −∇~ f(a). In this case, the rate of change Duˆ f assumes its minimum (or most negative) value, − k ∇~ f(a) k≤ 0. This is the direction of steepest descent of f at (a, b). 3. θ = ±π/2. Then uˆ points in a direction that is perpendicular to ∇~ f(a). In this case, the rate of change Duˆ f is zero. There are some noteworthy consequences of these consequences! 1. Directions in which the rate of change of f are zero must be tangent to the level curve of f that passes through (a, b). Why? If you travel on a level curve, the value of f does not change. And the instantaneous direction of motion at any point on this curve is the tangent vector to the curve at that point. 2. The gradient vector ∇~ f(a, b) must be perpendicular to the level curve of f that passes through (a, b). These results are sketched below. direction of steepest ascent of f ∇~ f −∇~ f level set of f(x, y) passing through (x, y) direction of steepest descent of f Example: We now return to the ant-hotplate problem f(x,y) = 50 − x2 − 2y2. Recall that the gradient vector field of f is ∇~ f(x,y)= −2xi − 4yj. (3) And recall that the ant was situated at (a, b) = (1, −1). The question that remained unanswered in the last lecture was, “In which direction should the ant start to travel in order to cool itself as quickly as possible?” We now know the answer to this question - the ant should travel in the direction of 46 −∇~ f(1, −1), the direction of steepest descent at (1, −1). This is the vector ∇~ f(1, −1) = −2i + 4j. (4) Note that this vector does not point directly at the origin, the hottest point on the plate, but this is because of the elliptical nature of the level curves, as we’ll see below. If we now consider all points (x,y) on the hotplate, the temperature function f(x,y) defines a scalar field on this plate – you need only one number to characterize the temperature at a point. Associated with this scalar field is the vector field defined by the gradient vector ∇~ f(x,y). Why is it a vector field? Because it is measuring rates of change of the scalar field – in particular, ∇~ f(x,y) defines (i) the direction of steepest ascent as well as (ii) the magnitude of the rate of change in that direction. Magnitude + direction = vector. The gradient field ∇~ f(x,y) defined by the temperature function f(x,y) is sketched roughly in the next figure. As expected, the vectors point “inward.” y x Sketch of the gradient vector field ∇~ f(x, y)= −2xi − 4yj. The gradient field points inward because f(x,y) is increasing as we move toward (0, 0), at which f achieves its global maximum: ∇~ f(0, 0) = (0, 0). Notice that the magnitudes of the gradient vectors decrease as we approach (0, 0) – this implies that the magnitudes of the rates of change are decreasing, indicating that the graph of f is flattening out as we approach the local maximum (0, 0). The relationship between the gradient vectors – as directions of steepest ascent – and level curves – as contours of equal value – is clearly illustrated in the second figure below, in which both are plotted for the hotplate temperature function. 47 y direction f of steepest ascent of f(x,y) x level set f(x,y)= C Sketch of gradient field vectors ∇~ f(x, y) and level curves for the hotplate function f(x, y)=50 − x2 − 2y2. Actually, the ant – and nature, as we’ll see below – is more interested in the vector field −∇~ f: the direction of maximum decrease, or steepest descent, of the temperature function f(x,y). At each point (x,y), the vector −∇~ f(x,y) gives the best direction for which the ant to travel in order to cool itself as quickly as possible. A sketch of this vector field, along with some level curves, is given below. y direction f of steepest descent of f(x,y) x level set f(x,y) = C Sketch of gradient field vectors −∇~ f(x, y) and level curves for the hotplate function f(x, y)=50 − x2 − 2y2. In fact, this vector field is quite relevant to the physical phenomenon of heat flow: Heat always travels from a region of higher temperature to one of lower temperature in roughly the most efficient manner. (We use the word “roughly” because the process of heat transfer involves the random collision of molecules that leads to transfer of kinetic and rotational energy.) A simplified version of Fourier’s “Law” of Cooling is as follows: 48 h = −κ∇~ T (5) Here, h is the “heat flux vector” that characterizes the heat flow, both in terms of direction and the amount of heat going through a unit volume per unit time. T (x,y,z) is the temperature function and κ is the thermal conductivity, a constant that is specific to the medium of interest. Once again, note that heat flows in the direction of the negative gradient, i.e., the direction of steepest descent of the temperature function T . And the greater the magnitude of ∇~ T , i.e., the greater the rate of change of T in this direction, the greater the flow of heat, which makes intuitive sense. Notes: 1. The word “Law” was put in quotes since it is not a law, but rather a mathematical model of a physical process. In the same way, as we’ll discuss later, Hooke’s “Law” for springs is not a law but a simplified mathematical model. 2. The thermal conductivity κ above was assumed to be constant in this simplified form of Fourier’s Law. In reality, κ may vary from region to region. As well, because of the microstructure of the medium, i.e., the way that atoms in the medium are bound to each other, the conductivity may be different in various directions – for example, it may be easier to flow in the x-direction than in the y- and z-directions. For this reason, κ may have to be represented by a tensor. You will encounter tensors in your third-year mathematical physics course.) A few words on heat transfer and transport processes in general In fact, heat transfer is a special case of a transport process – the movement of “something,” whether it be heat, a chemical in solution, or bacteria in air – from regions of higher concentration to regions of lower concentration. The transfer is described by a flux density vector field F that gives the direction of motion at a point as well as the rate of transfer of the “something.” Heat transfer is a special case of Fick’s Law of transport which states that the flux vector F points in the direction of steepest descent of the concentration f of the “something” concerned, i.e., F(x,y,z)= −k∇~ f(x,y,z), (6) where k > 0 is a constant specific to the process and material being studied. Once again, the direction 49 of the flow is away from regions of higher concentration. This idea will be important in your future studies of transport processes. The gradient vector and directional derivatives in higher dimensions The definition of the gradient vector given earlier for functions of two variables f(x,y) extends in a natural way to scalar valued functions f : Rn → R where n ≥ 2: ∂f ∂f ∇~ f(x)= (x)e1 + · · · + (x)en, (7) ∂x1 ∂xn n where x = (x1,x2, · · · ,xn) and the the ek, k = 1, 2, · · · ,n are unit vectors in R .
Recommended publications
  • The Directional Derivative the Derivative of a Real Valued Function (Scalar field) with Respect to a Vector
    Math 253 The Directional Derivative The derivative of a real valued function (scalar field) with respect to a vector. f(x + h) f(x) What is the vector space analog to the usual derivative in one variable? f 0(x) = lim − ? h 0 h ! x -2 0 2 5 Suppose f is a real valued function (a mapping f : Rn R). ! (e.g. f(x; y) = x2 y2). − 0 Unlike the case with plane figures, functions grow in a variety of z ways at each point on a surface (or n{dimensional structure). We'll be interested in examining the way (f) changes as we move -5 from a point X~ to a nearby point in a particular direction. -2 0 y 2 Figure 1: f(x; y) = x2 y2 − The Directional Derivative x -2 0 2 5 We'll be interested in examining the way (f) changes as we move ~ in a particular direction 0 from a point X to a nearby point . z -5 -2 0 y 2 Figure 2: f(x; y) = x2 y2 − y If we give the direction by using a second vector, say ~u, then for →u X→ + h any real number h, the vector X~ + h~u represents a change in X→ position from X~ along a line through X~ parallel to ~u. →u →u h x Figure 3: Change in position along line parallel to ~u The Directional Derivative x -2 0 2 5 We'll be interested in examining the way (f) changes as we move ~ in a particular direction 0 from a point X to a nearby point .
    [Show full text]
  • Notes for Math 136: Review of Calculus
    Notes for Math 136: Review of Calculus Gyu Eun Lee These notes were written for my personal use for teaching purposes for the course Math 136 running in the Spring of 2016 at UCLA. They are also intended for use as a general calculus reference for this course. In these notes we will briefly review the results of the calculus of several variables most frequently used in partial differential equations. The selection of topics is inspired by the list of topics given by Strauss at the end of section 1.1, but we will not cover everything. Certain topics are covered in the Appendix to the textbook, and we will not deal with them here. The results of single-variable calculus are assumed to be familiar. Practicality, not rigor, is the aim. This document will be updated throughout the quarter as we encounter material involving additional topics. We will focus on functions of two real variables, u = u(x;t). Most definitions and results listed will have generalizations to higher numbers of variables, and we hope the generalizations will be reasonably straightforward to the reader if they become necessary. Occasionally (notably for the chain rule in its many forms) it will be convenient to work with functions of an arbitrary number of variables. We will be rather loose about the issues of differentiability and integrability: unless otherwise stated, we will assume all derivatives and integrals exist, and if we require it we will assume derivatives are continuous up to the required order. (This is also generally the assumption in Strauss.) 1 Differential calculus of several variables 1.1 Partial derivatives Given a scalar function u = u(x;t) of two variables, we may hold one of the variables constant and regard it as a function of one variable only: vx(t) = u(x;t) = wt(x): Then the partial derivatives of u with respect of x and with respect to t are defined as the familiar derivatives from single variable calculus: ¶u dv ¶u dw = x ; = t : ¶t dt ¶x dx c 2016, Gyu Eun Lee.
    [Show full text]
  • Tensor Calculus and Differential Geometry
    Course Notes Tensor Calculus and Differential Geometry 2WAH0 Luc Florack March 10, 2021 Cover illustration: papyrus fragment from Euclid’s Elements of Geometry, Book II [8]. Contents Preface iii Notation 1 1 Prerequisites from Linear Algebra 3 2 Tensor Calculus 7 2.1 Vector Spaces and Bases . .7 2.2 Dual Vector Spaces and Dual Bases . .8 2.3 The Kronecker Tensor . 10 2.4 Inner Products . 11 2.5 Reciprocal Bases . 14 2.6 Bases, Dual Bases, Reciprocal Bases: Mutual Relations . 16 2.7 Examples of Vectors and Covectors . 17 2.8 Tensors . 18 2.8.1 Tensors in all Generality . 18 2.8.2 Tensors Subject to Symmetries . 22 2.8.3 Symmetry and Antisymmetry Preserving Product Operators . 24 2.8.4 Vector Spaces with an Oriented Volume . 31 2.8.5 Tensors on an Inner Product Space . 34 2.8.6 Tensor Transformations . 36 2.8.6.1 “Absolute Tensors” . 37 CONTENTS i 2.8.6.2 “Relative Tensors” . 38 2.8.6.3 “Pseudo Tensors” . 41 2.8.7 Contractions . 43 2.9 The Hodge Star Operator . 43 3 Differential Geometry 47 3.1 Euclidean Space: Cartesian and Curvilinear Coordinates . 47 3.2 Differentiable Manifolds . 48 3.3 Tangent Vectors . 49 3.4 Tangent and Cotangent Bundle . 50 3.5 Exterior Derivative . 51 3.6 Affine Connection . 52 3.7 Lie Derivative . 55 3.8 Torsion . 55 3.9 Levi-Civita Connection . 56 3.10 Geodesics . 57 3.11 Curvature . 58 3.12 Push-Forward and Pull-Back . 59 3.13 Examples . 60 3.13.1 Polar Coordinates in the Euclidean Plane .
    [Show full text]
  • Matrix Calculus
    Appendix D Matrix Calculus From too much study, and from extreme passion, cometh madnesse. Isaac Newton [205, §5] − D.1 Gradient, Directional derivative, Taylor series D.1.1 Gradients Gradient of a differentiable real function f(x) : RK R with respect to its vector argument is defined uniquely in terms of partial derivatives→ ∂f(x) ∂x1 ∂f(x) , ∂x2 RK f(x) . (2053) ∇ . ∈ . ∂f(x) ∂xK while the second-order gradient of the twice differentiable real function with respect to its vector argument is traditionally called the Hessian; 2 2 2 ∂ f(x) ∂ f(x) ∂ f(x) 2 ∂x1 ∂x1∂x2 ··· ∂x1∂xK 2 2 2 ∂ f(x) ∂ f(x) ∂ f(x) 2 2 K f(x) , ∂x2∂x1 ∂x2 ··· ∂x2∂xK S (2054) ∇ . ∈ . .. 2 2 2 ∂ f(x) ∂ f(x) ∂ f(x) 2 ∂xK ∂x1 ∂xK ∂x2 ∂x ··· K interpreted ∂f(x) ∂f(x) 2 ∂ ∂ 2 ∂ f(x) ∂x1 ∂x2 ∂ f(x) = = = (2055) ∂x1∂x2 ³∂x2 ´ ³∂x1 ´ ∂x2∂x1 Dattorro, Convex Optimization Euclidean Distance Geometry, Mεβoo, 2005, v2020.02.29. 599 600 APPENDIX D. MATRIX CALCULUS The gradient of vector-valued function v(x) : R RN on real domain is a row vector → v(x) , ∂v1(x) ∂v2(x) ∂vN (x) RN (2056) ∇ ∂x ∂x ··· ∂x ∈ h i while the second-order gradient is 2 2 2 2 , ∂ v1(x) ∂ v2(x) ∂ vN (x) RN v(x) 2 2 2 (2057) ∇ ∂x ∂x ··· ∂x ∈ h i Gradient of vector-valued function h(x) : RK RN on vector domain is → ∂h1(x) ∂h2(x) ∂hN (x) ∂x1 ∂x1 ··· ∂x1 ∂h1(x) ∂h2(x) ∂hN (x) h(x) , ∂x2 ∂x2 ··· ∂x2 ∇ .
    [Show full text]
  • Math 1320-9 Notes of 3/30/20 11.6 Directional Derivatives and Gradients
    Math 1320-9 Notes of 3/30/20 11.6 Directional Derivatives and Gradients • Recall our definitions: f(x0 + h; y0) − f(x0; y0) fx(x0; y0) = lim h−!0 h f(x0; y0 + h) − f(x0; y0) and fy(x0; y0) = lim h−!0 h • You can think of these definitions in several ways, e.g., − You keep all but one variable fixed, and differentiate with respect to the one special variable. − You restrict the function to a line whose direction vector is one of the standard basis vectors, and differentiate that restriction with respect to the corresponding variable. • In the case of fx we move in the direction of < 1; 0 >, and in the case of fy, we move in the direction for < 0; 1 >. Math 1320-9 Notes of 3/30/20 page 1 How about doing the same thing in the direction of some other unit vector u =< a; b >; say. • We define: The directional derivative of f at (x0; y0) in the direc- tion of the (unit) vector u is f(x0 + ha; y0 + hb) − f(x0; y0) Duf(x0; y0) = lim h−!0 h (if this limit exists). • Thus we restrict the function f to the line (x0; y0) + tu, think of it as a function g(t) = f(x0 + ta; y0 + tb); and compute g0(t). • But, by the chain rule, d f(x + ta; y + tb) = f (x ; y )a + f (x ; y )b dt 0 0 x 0 0 y 0 0 =< fx(x0; y0); fy(x0; y0) > · < a; b > • Thus we can compute the directional derivatives by the formula Duf(x0; y0) =< fx(x0; y0); fy(x0; y0) > · < a; b > : • Of course, the partial derivatives @=@x and @=@y are just directional derivatives in the directions i =< 0; 1 > and j =< 0; 1 >, respectively.
    [Show full text]
  • Derivatives Along Vectors and Directional Derivatives
    DERIVATIVES ALONG VECTORS AND DIRECTIONAL DERIVATIVES Math 225 Derivatives Along Vectors Suppose that f is a function of two variables, that is, f : R2 → R, or, if we are thinking without coordinates, f : E2 → R. The function f could be the distance to some point or curve, the altitude function for some landscape, or temperature (assumed to be static, i.e., not changing with time). Let P ∈ E2, and assume some disk centered at p is contained in the domain of f, that is, assume that P is an interior point of the domain of f.This allows us to move in any direction from P , at least a little, and stay in the domain of f. We want to ask how fast f(X) changes as X moves away from P , and to express it as some kind of derivative. The answer clearly depends on which direction you go. If f is not constant near P ,then f(X) increases in some directions, and decreases in others. If we move along a level curve of f,thenf(X) doesn’t change at all (that’s what a level curve means—a curve on which the function is constant). The answer also depends on how fast you go. Suppose that f measures temperature, and that some particle is moving along a path through P . The particle experiences a change of temperature. This happens not because the temperature is a function of time, but rather because of the particle’s motion. Now suppose that a second particle moves along the same path in the same direction, but faster.
    [Show full text]
  • Directional Derivatives
    Directional Derivatives The Question Suppose that you leave the point (a, b) moving with velocity ~v = v , v . Suppose further h 1 2i that the temperature at (x, y) is f(x, y). Then what rate of change of temperature do you feel? The Answers Let’s set the beginning of time, t = 0, to the time at which you leave (a, b). Then at time t you are at (a + v1t, b + v2t) and feel the temperature f(a + v1t, b + v2t). So the change in temperature between time 0 and time t is f(a + v1t, b + v2t) f(a, b), the average rate of change of temperature, −f(a+v1t,b+v2t) f(a,b) per unit time, between time 0 and time t is t − and the instantaneous rate of f(a+v1t,b+v2t) f(a,b) change of temperature per unit time as you leave (a, b) is limt 0 − . We apply → t the approximation f(a + ∆x, b + ∆y) f(a, b) fx(a, b) ∆x + fy(a, b) ∆y − ≈ with ∆x = v t and ∆y = v t. In the limit as t 0, the approximation becomes exact and we have 1 2 → f(a+v1t,b+v2t) f(a,b) fx(a,b) v1t+fy (a,b) v2t lim − = lim t 0 t t 0 t → → = fx(a, b) v1 + fy(a, b) v2 = fx(a, b),fy(a, b) v , v h i·h 1 2i The vector fx(a, b),fy(a, b) is denoted ~ f(a, b) and is called “the gradient of the function h i ∇ f at the point (a, b)”.
    [Show full text]
  • Lecture Notes of Thomas' Calculus Section 11.5 Directional Derivatives
    Lecture Notes of Thomas’ Calculus Section 11.5 Directional Derivatives, Gradient Vectors and Tangent Planes Instructor: Dr. ZHANG Zhengru Lecture Notes of Section 11.5 Page 1 OUTLINE • Directional Derivatives in the Plane • Interpretation of the Directional Derivative • Calculation • Properties of Directional Derivatives • Gradients and Tangents to Level Curves • Algebra Rules for Gradients • Increments and Distances • Functions of Three Variables • Tangent Panes and Normal Lines • Planes Tangent to a Surface z = f(x, y) • Other Applications Why introduce the directional derivatives? • Let’s start from the derivative of single variable functions. Consider y = f(x), its ′ ′ derivative f (x0) implies the rate of change of f. f (x0) > 0 means f increasing as x ′ becomes large. f (x0) < 0 means f decreasing as x becomes large. • Consider two-variable function f(x, y). The partial derivative fx(x0,y0) implies the rate of change in the direction of~i while keep y as a constant y0. fx(x0,y0) > 0 implies increasing in x direction. Another interpretation, the vertical plane passes through (x0,y0) and parallel to x-axis intersects the surface f(x, y) in the curve C. fx(x0,y0) means the slope of the curve C. Similarly, The partial derivative fy(x0,y0) implies the rate of change in the direction of ~j while keep x as a constant x0. • It is natural to consider the rate of change of f(x, y) in any direction ~u instead of ~i and ~j. Therefore, directional derivatives should be introduced. ... Lecture Notes of Section 11.5 Page 2 Definition 1 The derivative of f at P0(x0,y0) in the direction of the unit vector ~u = u1~i + u2~j is the number df f(x0 + su1,y0 + su2) − f(x0,y0) = lim , ds s→0 s ~u,P0 provided the limit exists.
    [Show full text]
  • Multivariable Calculus Review
    Outline Multi-Variable Calculus Point-Set Topology Compactness The Weierstrass Extreme Value Theorem Operator and Matrix Norms Mean Value Theorem Multivariable Calculus Review Multivariable Calculus Review Outline Multi-Variable Calculus Point-Set Topology Compactness The Weierstrass Extreme Value Theorem Operator and Matrix Norms Mean Value Theorem Multi-Variable Calculus Point-Set Topology Compactness The Weierstrass Extreme Value Theorem Operator and Matrix Norms Mean Value Theorem Multivariable Calculus Review n I ν(x) ≥ 0 8 x 2 R with equality iff x = 0. n I ν(αx) = jαjν(x) 8 x 2 R α 2 R n I ν(x + y) ≤ ν(x) + ν(y) 8 x; y 2 R We usually denote ν(x) by kxk. Norms are convex functions. lp norms 1 Pn p p kxkp := ( i=1 jxi j ) ; 1 ≤ p < 1 kxk1 = maxi=1;:::;n jxi j Outline Multi-Variable Calculus Point-Set Topology Compactness The Weierstrass Extreme Value Theorem Operator and Matrix Norms Mean Value Theorem Multi-Variable Calculus Norms: n n A function ν : R ! R is a vector norm on R if Multivariable Calculus Review n I ν(αx) = jαjν(x) 8 x 2 R α 2 R n I ν(x + y) ≤ ν(x) + ν(y) 8 x; y 2 R We usually denote ν(x) by kxk. Norms are convex functions. lp norms 1 Pn p p kxkp := ( i=1 jxi j ) ; 1 ≤ p < 1 kxk1 = maxi=1;:::;n jxi j Outline Multi-Variable Calculus Point-Set Topology Compactness The Weierstrass Extreme Value Theorem Operator and Matrix Norms Mean Value Theorem Multi-Variable Calculus Norms: n n A function ν : R ! R is a vector norm on R if n I ν(x) ≥ 0 8 x 2 R with equality iff x = 0.
    [Show full text]
  • Derivatives in N-Dimensional Spaces
    5 DERIVATIVES IN N-DIMENSIONAL VECTOR SPACES A Introduction , -' ,.- * -..: The approach of our textbook (as d&ioped in the previous two units) in emphasizing the geometrical aspects of partial derivatives, directional derivatives, and the gradient is, in many ways, an excellent approach. We get a picture of what appears to be happening, and, especially for the beginning student, this is much more meaningful than a more precise and general discussion that would apply to all vector spaces, regardless of dimension. In fact, if we want to use the latter approach, it is wise to use the former first. This is in accord with many such decisions which have already been made .in our previous treatment of mathematics. As an example, within the framework of our present course, notice how carefully we ex- ploited the geometric aspects of vectors as arrows before we even mentjoned n-dimensional vector spaces. And, in this respect, recall how sophisticated many of our "arrow" results were in their own right. That is,.granted that arrows were simpler than n-tuples, when all we had were arrows, there were still some rather difficult ideas for us to assimilate. In any event, our aim in this chapter is to continue within the spirit of the game of mathematics which, thus far, has served us so well. Namely, we have introduced n-dimensional vector spaces so that we could utilize the similarity in form between the previously studied functions of the form f(x) and our new functions of the form f(x). For example, when it came time to define continuity and limits for functions of several real variables, we rewrote f(xl, ...,xn) as f (5) and then mimicked the f (x) situation by copying definitions verbatim, subject only to making the appropriate "vectorizations." Rather than review this procedure, let us instead move to the next plateau.
    [Show full text]
  • Generalized Directional Derivatives and Subgradients of Nonconvex
    ( )i,\ , . t, ., \\\rt to.o j -i .1.. GENERT\LIZED DIRECTIO\AL DERI\TATIVES AND SUBGRADIENTS OF NONCONVEX FUNCTIONS R T. ROCKAFIJLLAI{ l lntroduction. Srudjes of optimjzation prollens and ccrlah kiD(ls of difierentiaL eqLrations hale led in rcccnt )enrs to thc dclelopnrent of a gen cralized rheor] of dift-erentiaiion qLrite distincr iD spirit ancl rarse of applicati,m from the onc base.l on L. Schwari:z's 'distriblriions. ' This thco( associares .l-r,,t-\. u" t, r. .. o ., ;r . oto,og: .r, .. ,. L point r a I certain elements of the clLral space E+ caLLed \tLy tints tit gtarlitnt js stneralircd s ot .f at '. 1 lcsc form A set dI(r) rhat a]lrat.s conre: and Ncak*-closed L'possiLl_v cmprl ). The nrlrifurctior df : r + bflx) is thc Rules that relate Dl to scncraljzed djrectional cterilatives of /, or:rlloN d,i 1o b€ expr.ssed or estimarcd ;n terns of thc sLrbcljlTerenrjaLs of other tunctjoos (\i hen.l : ,h + f'. f = ! o ,1, etc ), co prise the vrrd.r'fl./.rrill ar/.&/,s. Such rrLes arc used espcciall! ir anall,zing ihe con.Liijon 0 a D.f(s), Nhich i]picalt,v Deans that:f is some sort of'quasi-optlnaL'poinr ior,l The cxtcnded-real- laluccl nrtlrc of,i is essential in such a contcxr as a delicc for representins SulijlTerenrial caiclils bcgan rrith com-cx lLrrctions on R". Rockafeilar l43l defincd d,f ior su.h funcriorrs, shon'cd hor t.} characicrjze A_ft]j) nr rerms of one sidcd cLirectional rleriratiles _l'l:;r r), and prolecl that rndcr mild restrictions rLlles sllcI .s (1.1) d(.n +/,)(r) : d,no) + dj,ir) are valj.l.
    [Show full text]
  • Derivation of the Formula for a Directional Derivative
    Derivation of the Directional Derivative Formula We let ~u = u1~{ + u2~| be a fixed unit vector in the xy-plane, and as usual, we let z = f(x; y). We want to show that the simple formula for the directional derivative f~u = fxu1 + fyu2 is valid wherever f is a differentiable function. We fix a particular point (a; b) in the xy-plane where f is differentiable, and we recall that the formal definition of fu(a; b) is f(a + u1h; b + u2h) f(a; b) fu(a; b) = lim − ; h 0 h ! so we want to show that the limit simplifies to fx(a; b)u1 + fy(a; b)u2 : ( ) ∗ Now we can certainly rewrite the difference quotient in the definition in the following form, dividing it into one part where the x-value varies while y remains fixed and the other where the reverse holds: f(a + u1h; b + u2h) f(a; b) f(a + u1h; b + u2h) f(a; b + u2h) f(a; b + u2h) f(a; b) − = − u1 + − u2 : ( ) h u1h u2h ∗∗ Because the change in the x-value in the first term is u1h, we have arranged for the denominator to have that value, in hopes that it will approach the partial derivative of f with respect to x; and similarly for y in the second term. Now as h 0, we also have u2h 0, so the fraction [f(a; b + u2h) f(a; b)]=(u2h) ! ! − in the second term goes to fy(a; b). And the factors u1 and u2 are constants.
    [Show full text]