Lecture 30 : Maxima, Minima, Second Derivative Test
Total Page:16
File Type:pdf, Size:1020Kb
 
											Load more
										Recommended publications
									
								- 
												  The Mean Value Theorem Math 120 Calculus I Fall 2015The Mean Value Theorem Math 120 Calculus I Fall 2015 The central theorem to much of differential calculus is the Mean Value Theorem, which we'll abbreviate MVT. It is the theoretical tool used to study the first and second derivatives. There is a nice logical sequence of connections here. It starts with the Extreme Value Theorem (EVT) that we looked at earlier when we studied the concept of continuity. It says that any function that is continuous on a closed interval takes on a maximum and a minimum value. A technical lemma. We begin our study with a technical lemma that allows us to relate 0 the derivative of a function at a point to values of the function nearby. Specifically, if f (x0) is positive, then for x nearby but smaller than x0 the values f(x) will be less than f(x0), but for x nearby but larger than x0, the values of f(x) will be larger than f(x0). This says something like f is an increasing function near x0, but not quite. An analogous statement 0 holds when f (x0) is negative. Proof. The proof of this lemma involves the definition of derivative and the definition of limits, but none of the proofs for the rest of the theorems here require that depth. 0 Suppose that f (x0) = p, some positive number. That means that f(x) − f(x ) lim 0 = p: x!x0 x − x0 f(x) − f(x0) So you can make arbitrarily close to p by taking x sufficiently close to x0.
- 
												  Chapter 9 Optimization: One Choice VariableRS - Ch 9 - Optimization: One Variable Chapter 9 Optimization: One Choice Variable 1 Léon Walras (1834-1910) Vilfredo Federico D. Pareto (1848–1923) 9.1 Optimum Values and Extreme Values • Goal vs. non-goal equilibrium • In the optimization process, we need to identify the objective function to optimize. • In the objective function the dependent variable represents the object of maximization or minimization Example: - Define profit function: = PQ − C(Q) - Objective: Maximize - Tool: Q 2 1 RS - Ch 9 - Optimization: One Variable 9.2 Relative Maximum and Minimum: First- Derivative Test Critical Value The critical value of x is the value x0 if f ′(x0)= 0. • A stationary value of y is f(x0). • A stationary point is the point with coordinates x0 and f(x0). • A stationary point is coordinate of the extremum. • Theorem (Weierstrass) Let f : S→R be a real-valued function defined on a compact (bounded and closed) set S ∈ Rn. If f is continuous on S, then f attains its maximum and minimum values on S. That is, there exists a point c1 and c2 such that f (c1) ≤ f (x) ≤ f (c2) ∀x ∈ S. 3 9.2 First-derivative test •The first-order condition (f.o.c.) or necessary condition for extrema is that f '(x*) = 0 and the value of f(x*) is: • A relative minimum if f '(x*) changes its sign y from negative to positive from the B immediate left of x0 to its immediate right. f '(x*)=0 (first derivative test of min.) x x* y • A relative maximum if the derivative f '(x) A f '(x*) = 0 changes its sign from positive to negative from the immediate left of the point x* to its immediate right.
- 
												  Critical Points and Classifying Local Maxima and Minima Don Byrd, RevNotes on Textbook Section 4.1 Critical Points and Classifying Local Maxima and Minima Don Byrd, rev. 25 Oct. 2011 To find and classify critical points of a function f (x) First steps: 1. Take the derivative f ’(x) . 2. Find the critical points by setting f ’ equal to 0, and solving for x . To finish the job, use either the first derivative test or the second derivative test. Via First Derivative Test 3. Determine if f ’ is positive (so f is increasing) or negative (so f is decreasing) on both sides of each critical point. • Note that since all that matters is the sign, you can check any value on the side you want; so use values that make it easy! • Optional, but helpful in more complex situations: draw a sign diagram. For example, say we have critical points at 11/8 and 22/7. It’s usually easier to evaluate functions at integer values than non-integers, and it’s especially easy at 0. So, for a value to the left of 11/8, choose 0; for a value to the right of 11/8 and the left of 22/7, choose 2; and for a value to the right of 22/7, choose 4. Let’s say f ’(0) = –1; f ’(2) = 5/9; and f ’(4) = 5. Then we could draw this sign diagram: Value of x 11 22 8 7 Sign of f ’(x) negative positive positive ! ! 4. Apply the first derivative test (textbook, top of p. 172, restated in terms of the derivative): If f ’ changes from negative to positive: f has a local minimum at the critical point If f ’ changes from positive to negative: f has a local maximum at the critical point If f ’ is positive on both sides or negative on both sides: f has neither one For the above example, we could tell from the sign diagram that 11/8 is a local minimum of f (x), while 22/7 is neither a local minimum nor a local maximum.
- 
												  Optimization and Gradient Descent INFO-4604, Applied Machine Learning University of Colorado BoulderOptimization and Gradient Descent INFO-4604, Applied Machine Learning University of Colorado Boulder September 11, 2018 Prof. Michael Paul Prediction Functions Remember: a prediction function is the function that predicts what the output should be, given the input. Prediction Functions Linear regression: f(x) = wTx + b Linear classification (perceptron): f(x) = 1, wTx + b ≥ 0 -1, wTx + b < 0 Need to learn what w should be! Learning Parameters Goal is to learn to minimize error • Ideally: true error • Instead: training error The loss function gives the training error when using parameters w, denoted L(w). • Also called cost function • More general: objective function (in general objective could be to minimize or maximize; with loss/cost functions, we want to minimize) Learning Parameters Goal is to minimize loss function. How do we minimize a function? Let’s review some math. Rate of Change The slope of a line is also called the rate of change of the line. y = ½x + 1 “rise” “run” Rate of Change For nonlinear functions, the “rise over run” formula gives you the average rate of change between two points Average slope from x=-1 to x=0 is: 2 f(x) = x -1 Rate of Change There is also a concept of rate of change at individual points (rather than two points) Slope at x=-1 is: f(x) = x2 -2 Rate of Change The slope at a point is called the derivative at that point Intuition: f(x) = x2 Measure the slope between two points that are really close together Rate of Change The slope at a point is called the derivative at that point Intuition: Measure the
- 
												  Calculus TerminologyAP Calculus BC Calculus Terminology Absolute Convergence Asymptote Continued Sum Absolute Maximum Average Rate of Change Continuous Function Absolute Minimum Average Value of a Function Continuously Differentiable Function Absolutely Convergent Axis of Rotation Converge Acceleration Boundary Value Problem Converge Absolutely Alternating Series Bounded Function Converge Conditionally Alternating Series Remainder Bounded Sequence Convergence Tests Alternating Series Test Bounds of Integration Convergent Sequence Analytic Methods Calculus Convergent Series Annulus Cartesian Form Critical Number Antiderivative of a Function Cavalieri’s Principle Critical Point Approximation by Differentials Center of Mass Formula Critical Value Arc Length of a Curve Centroid Curly d Area below a Curve Chain Rule Curve Area between Curves Comparison Test Curve Sketching Area of an Ellipse Concave Cusp Area of a Parabolic Segment Concave Down Cylindrical Shell Method Area under a Curve Concave Up Decreasing Function Area Using Parametric Equations Conditional Convergence Definite Integral Area Using Polar Coordinates Constant Term Definite Integral Rules Degenerate Divergent Series Function Operations Del Operator e Fundamental Theorem of Calculus Deleted Neighborhood Ellipsoid GLB Derivative End Behavior Global Maximum Derivative of a Power Series Essential Discontinuity Global Minimum Derivative Rules Explicit Differentiation Golden Spiral Difference Quotient Explicit Function Graphic Methods Differentiable Exponential Decay Greatest Lower Bound Differential
- 
												  Maxima and MinimaBasic Mathematics Maxima and Minima R Horan & M Lavelle The aim of this document is to provide a short, self assessment programme for students who wish to be able to use differentiation to find maxima and mininima of functions. Copyright c 2004 [email protected] , [email protected] Last Revision Date: May 5, 2005 Version 1.0 Table of Contents 1. Rules of Differentiation 2. Derivatives of order 2 (and higher) 3. Maxima and Minima 4. Quiz on Max and Min Solutions to Exercises Solutions to Quizzes Section 1: Rules of Differentiation 3 1. Rules of Differentiation Throughout this package the following rules of differentiation will be assumed. (In the table of derivatives below, a is an arbitrary, non-zero constant.) y axn sin(ax) cos(ax) eax ln(ax) dy 1 naxn−1 a cos(ax) −a sin(ax) aeax dx x If a is any constant and u, v are two functions of x, then d du dv (u + v) = + dx dx dx d du (au) = a dx dx Section 1: Rules of Differentiation 4 The stationary points of a function are those points where the gradient of the tangent (the derivative of the function) is zero. Example 1 Find the stationary points of the functions (a) f(x) = 3x2 + 2x − 9 , (b) f(x) = x3 − 6x2 + 9x − 2 . Solution dy (a) If y = 3x2 + 2x − 9 then = 3 × 2x2−1 + 2 = 6x + 2. dx The stationary points are found by solving the equation dy = 6x + 2 = 0 . dx In this case there is only one solution, x = −1/3.
- 
												  Calculus 120, Section 7.3 Maxima & Minima of Multivariable FunctionsCalculus 120, section 7.3 Maxima & Minima of Multivariable Functions notes by Tim Pilachowski A quick note to start: If you’re at all unsure about the material from 7.1 and 7.2, now is the time to go back to review it, and get some help if necessary. You’ll need all of it for the next three sections. Just like we had a first-derivative test and a second-derivative test to maxima and minima of functions of one variable, we’ll use versions of first partial and second partial derivatives to determine maxima and minima of functions of more than one variable. The first derivative test for functions of more than one variable looks very much like the first derivative test we have already used: If f(x, y, z) has a relative maximum or minimum at a point ( a, b, c) then all partial derivatives will equal 0 at that point. That is ∂f ∂f ∂f ()a b,, c = 0 ()a b,, c = 0 ()a b,, c = 0 ∂x ∂y ∂z Example A: Find the possible values of x, y, and z at which (xf y,, z) = x2 + 2y3 + 3z 2 + 4x − 6y + 9 has a relative maximum or minimum. Answers : (–2, –1, 0); (–2, 1, 0) Example B: Find the possible values of x and y at which fxy(),= x2 + 4 xyy + 2 − 12 y has a relative maximum or minimum. Answer : (4, –2); z = 12 Notice that the example above asked for possible values. The first derivative test by itself is inconclusive. The second derivative test for functions of more than one variable is a good bit more complicated than the one used for functions of one variable.
- 
												  Summary of Derivative TestsSummary of Derivative Tests Note that for all the tests given below it is assumed that the function f is continuous. Critical Numbers Definition. A critical number of a function f is a number c in the domain of f such that either f 0(c) = 0 or f 0(c) does not exist. Critical numbers tell you where a possible max/min occurs. Closed Interval Method Use: To find absolute mins/maxes on an interval [a; b] Steps: 1. Find the critical numbers of f [set f 0(x) = 0 and solve] 2. Find the value of f at each critical number 3. Find the value of f at each endpoint [i.e., find f(a) and f(b)] 4. Compare all of the above function values. The largest one is the absolute max and the smallest one is the absolute min. First Derivative Test Use: To find local max/mins Statement of the test: Suppose c is a critical number. 1. If f 0 changes from positive to negative at c; then f has a local max at c 2. If f 0 changes from negative to positive at c, then f has a local min at c 3. If f 0 is positive to the left and right of c [or negative to the left and right of c], then f has no local max or min at c Trick to Remember: If you are working on a problem and forget what means what in the above test, draw a picture of tangent lines around a minimum or around a maximum 1 Steps: 1.
- 
												  Mean Value, Taylor, and All ThatMean Value, Taylor, and all that Ambar N. Sengupta Louisiana State University November 2009 Careful: Not proofread! Derivative Recall the definition of the derivative of a function f at a point p: f (w) − f (p) f 0(p) = lim (1) w!p w − p Derivative Thus, to say that f 0(p) = 3 means that if we take any neighborhood U of 3, say the interval (1; 5), then the ratio f (w) − f (p) w − p falls inside U when w is close enough to p, i.e. in some neighborhood of p. (Of course, we can’t let w be equal to p, because of the w − p in the denominator.) In particular, f (w) − f (p) > 0 if w is close enough to p, but 6= p. w − p Derivative So if f 0(p) = 3 then the ratio f (w) − f (p) w − p lies in (1; 5) when w is close enough to p, i.e. in some neighborhood of p, but not equal to p. Derivative So if f 0(p) = 3 then the ratio f (w) − f (p) w − p lies in (1; 5) when w is close enough to p, i.e. in some neighborhood of p, but not equal to p. In particular, f (w) − f (p) > 0 if w is close enough to p, but 6= p. w − p • when w > p, but near p, the value f (w) is > f (p). • when w < p, but near p, the value f (w) is < f (p). Derivative From f 0(p) = 3 we found that f (w) − f (p) > 0 if w is close enough to p, but 6= p.
- 
												  Math 105: Multivariable Calculus Seventeenth Lecture (3/17/10)Daily Summary Fast Taylor Series Critical Points and Extrema Lagrange Multipliers Math 105: Multivariable Calculus Seventeenth Lecture (3/17/10) Steven J Miller Williams College [email protected] http://www.williams.edu/go/math/sjmiller/ public html/341/ Bronfman Science Center Williams College, March 17, 2010 1 Daily Summary Fast Taylor Series Critical Points and Extrema Lagrange Multipliers Summary for the Day 2 Daily Summary Fast Taylor Series Critical Points and Extrema Lagrange Multipliers Summary for the day Fast Taylor Series. Critical Points and Extrema. Constrained Maxima and Minima. 3 Daily Summary Fast Taylor Series Critical Points and Extrema Lagrange Multipliers Fast Taylor Series 4 Daily Summary Fast Taylor Series Critical Points and Extrema Lagrange Multipliers Formula for Taylor Series Notation: f twice differentiable function ∂f ∂f Gradient: f =( ∂ ,..., ∂ ). ∇ x1 xn Hessian: ∂2f ∂2f ∂x1∂x1 ∂x1∂xn . ⋅⋅⋅. Hf = ⎛ . .. ⎞ . ∂2f ∂2f ⎜ ∂ ∂ ∂ ∂ ⎟ ⎝ xn x1 ⋅⋅⋅ xn xn ⎠ Second Order Taylor Expansion at −→x 0 1 T f (−→x 0) + ( f )(−→x 0) (−→x −→x 0)+ (−→x −→x 0) (Hf )(−→x 0)(−→x −→x 0) ∇ ⋅ − 2 − − T where (−→x −→x 0) is the row vector which is the transpose of −→x −→x 0. − − 5 Daily Summary Fast Taylor Series Critical Points and Extrema Lagrange Multipliers Example 3 Let f (x, y)= sin(x + y) + (x + 1) y and (x0, y0) = (0, 0). Then ( f )(x, y) = cos(x + y)+ 3(x + 1)2y, cos(x + y) + (x + 1)3 ∇ and sin(x + y)+ 6(x + 1)2y sin(x + y)+ 3(x + 1)2 (Hf )(x, y) = − − , sin(x + y)+ 3(x + 1)2 sin(x + y) − − so 0 3 f (0, 0)= 0, ( f )(0, 0) = (1, 2), (Hf )(0, 0)= ∇ 3 0 which implies the second order Taylor expansion is 1 0 3 x 0 + (1, 2) (x, y)+ (x, y) ⋅ 2 3 0 y 1 3y = x + 2y + (x, y) = x + 2y + 3xy.
- 
												  Lecture 13: ExtremaMath S21a: Multivariable calculus Oliver Knill, Summer 2013 Lecture 13: Extrema An important problem in multi-variable calculus is to extremize a function f(x, y) of two vari- ables. As in one dimensions, in order to look for maxima or minima, we consider points, where the ”derivative” is zero. A point (a, b) in the plane is called a critical point of a function f(x, y) if ∇f(a, b) = h0, 0i. Critical points are candidates for extrema because at critical points, all directional derivatives D~vf = ∇f · ~v are zero. We can not increase the value of f by moving into any direction. This definition does not include points, where f or its derivative is not defined. We usually assume that a function is arbitrarily often differentiable. Points where the function has no derivatives are not considered part of the domain and need to be studied separately. For the function f(x, y) = |xy| for example, we would have to look at the points on the coordinate axes separately. 1 Find the critical points of f(x, y) = x4 + y4 − 4xy + 2. The gradient is ∇f(x, y) = h4(x3 − y), 4(y3 − x)i with critical points (0, 0), (1, 1), (−1, −1). 2 f(x, y) = sin(x2 + y) + y. The gradient is ∇f(x, y) = h2x cos(x2 + y), cos(x2 + y) + 1i. For a critical points, we must have x = 0 and cos(y) + 1 = 0 which means π + k2π. The critical points are at ... (0, −π), (0, π), (0, 3π),.... 2 2 3 The graph of f(x, y) = (x2 + y2)e−x −y looks like a volcano.
- 
												  Just the Definitions, Theorem Statements, Proofs OnMATH 3333{INTERMEDIATE ANALYSIS{BLECHER NOTES 55 Abbreviated notes version for Test 3: just the definitions, theorem statements, proofs on the list. I may possibly have made a mistake, so check it. Also, this is not intended as a REPLACEMENT for your classnotes; the classnotes have lots of other things that you may need for your understanding, like worked examples. 6. The derivative 6.1. Differentiation rules. Definition: Let f :(a; b) ! R and c 2 (a; b). If the limit lim f(x)−f(c) exists and is finite, then we say that f is differentiable at x!c x−c 0 df c, and we write this limit as f (c) or dx (c). This is the derivative of f at c, and also obviously equals lim f(c+h)−f(c) by setting x = c + h or h = x − c. If f is h!0 h differentiable at every point in (a; b), then we say that f is differentiable on (a; b). Theorem 6.1. If f :(a; b) ! R is differentiable at at a point c 2 (a; b), then f is continuous at c. Proof. If f is differentiable at c then f(x) − f(c) f(x) = (x − c) + f(c) ! f 0(c)0 + f(c) = f(c); x − c as x ! c. So f is continuous at c. Theorem 6.2. (Calculus I differentiation laws) If f; g :(a; b) ! R is differentiable at a point c 2 (a; b), then (1) f(x) + g(x) is differentiable at c and (f + g)0(c) = f 0(c) + g0(c).