Lagrange Multipliers

Total Page:16

File Type:pdf, Size:1020Kb

Lagrange Multipliers Advanced Deep Learning Approximate Inference-2 U Kang Seoul National University U Kang 1 Outline Inference as Optimization Expectation Maximization MAP Inference Variational Inference and Learning U Kang 2 MAP Inference If we wish to develop a learning process based on maximizing 퐿(푣, ℎ, 푞), then it is helpful to think of MAP inference as a procedure that provides a value of 푞. What is MAP inference? Finds the most likely value of a variable U Kang 3 MAP Inference Exact inference consists of maximizing with respect to q over an unrestricted family of probability distributions, using an exact optimization algorithm. We restrict the family of distributions of q to take on a Dirac distribution U Kang 4 MAP Inference We can now control 푞 entirely via µ. Dropping terms of 퐿 that do not vary with µ, we are left with the optimization problem which is equivalent to the MAP inference problem ELBO: U Kang 5 MAP Inference Thus, we can think of the following algorithm for maximizing ELBO, which is similar to EM Alternate the following two steps Perform MAP inference to infer h* while fixing 휃 Update 휃 to increase log p(h*, v) ELBO: U Kang 6 Outline Inference as Optimization Expectation Maximization MAP Inference Variational Inference and Learning Discrete Latent Variables Calculus of Variations Continuous Latent Variables U Kang 7 Previously Lower bound L Inference = max L w.r.t. q Learning = max L w.r.t. θ EM algorithm -> allows us to make large learning steps with fixed q MAP inference enable us to learn using a point of estimate rather than inferring the entire distribution U Kang 8 Variational Methods We want to do the following: Given this surveillance footage X, did the suspect show up in it? Given this twitter feed X, is the author depressed? Problem: cannot compute P(z|x) U Kang 9 Variational Methods Idea: Allow us to re-write statistical inference problems as optimization problems Statistical inference = infer value of RV given another Optimization problem = find the parameter values that minimize cost function Bridge between optimization and statistical techniques U Kang 10 Variational Learning Key idea: We can maximize L over a restricted family of distributions q L is the lower bound of log 푝(푣; 휃) Chose family such that is easy to compute. Typically: impose that q is a factorial distribution U Kang 11 The Mean field approach q is a factorial distribution where ℎ푖 are independent (and thus q(h|v) cannot match the true distribution p(h|v)) Advantage: no need to specify parametric form for q. The optimization problem determines the optimal probability under the constraints U Kang 12 Discrete Latent Variable The goal is to maximize ELBO: In the mean field approach, we assume q is a factorial distribution We can parameterize q with a vector ℎ෠ whose entries are probabilities; then 푞 ℎ푖 = 1 푣) = ℎ෡푖 Then, we simply optimize the parameters ℎ෠ by any standard optimization technique (e.g., gradient descent) U Kang 13 Outline Inference as Optimization Expectation Maximization MAP Inference and Sparse Coding Variational Inference and Learning Discrete Latent Variables Calculus of Variations Continuous Latent Variables U Kang 14 Recap 푝(푣|ℎ)푝(ℎ) Inference is to compute 푝 ℎ 푣 = 푝(푣) However, exact inference requires an exponential amount of time in these models Computing 푝(푣) is intractable Approximate inference is needed U Kang 15 Recap We compute Evidence Lower Bound (ELBO) instead of 푝(푣). ELBO: After rearranging the equation, For any choice of 푞, 퐿 provides a lower bound. If we take 푞 equals to 푝, we can get 푝(푣) exactly. U Kang 16 Calculus of Variations In machine learning, minimizing a function 퐽(휃) by finding the input vector 휃 ∈ 푅푛 is the purpose. This can be accomplished by solving for the critical points where 훻휃퐽 휃 = 0. U Kang 17 Calculus of Variations But in some cases, we actually want to solve for a function 푓(푥). Calculus of variations is a method of finding the critical points w.r.t 푓(푥). A function of a function 푓 is functional 퐽[푓]. We can take functional derivatives (a.k.a. variational derivatives), of 퐽[푓] with respect to individual values of the function 푓(푥) at any specific value of 푥. 훿 Functional derivatives is denoted 퐽. 훿푓(푥) U Kang 18 Calculus of Variations Euler-Lagrange equation (simplified form) Consider a functional 퐽 푓 = ׬ 푔 푓 푥 , 푥 푑푥. Extreme point 휕 of 퐽 is given by the condition 푔 푓 푥 , 푥 = 0 휕푓(푥) (Proof) Assume we change f by the amount of 휖 ∙ 휂(푥) by an arbitrary function 휂(푥) . Then, 퐽 푓(푥) + 휖휂 푥 = 휕푔 = ׬ 푔 푓 푥 + 휖휂 푥 , 푥 푑푥 = ׬[푔 푓 푥 , 푥 + 휖휂 푥 ] 푑푥 휕푓 휕푔 퐽 + 휖 ׬ 휂 푥 푑푥 휕푓 퐷 휕푦 2 =: Use 푦 푥1 + 휖1, … , 푥퐷 + 휖퐷 = 푦 푥1, … , 푥퐷 + σ푖=1 휖푖 + 푂(휖 ) 휕푥푖 Note that at the extreme point, 퐽 푓 + 휖휂 푥 = 퐽, which 휕푔 휕푔 .implies ׬ 휂 푥 = 0. Since this is true for any 휂 푥 , = 0 휕푓 휕푓 U Kang 19 Calculus of Variations 퐻 푞 = − ׬ 푞(푥) log 푞(푥) 푑푥 We want to maximize 퐿. So we have to find the 푞 which becomes a critical point of L 훿 Find 퐿=0 훿푞(푥) U Kang 20 Example Consider the problem of finding the probability distribution function over 푥 ∈ 푅 that has the maximal differential entropy 퐻 푝 = ׬ 푝 푥 log 푝(푥) 푑푥 among the distribution with − 퐸 푥 = 휇 and 푉푎푟 푥 = 휎2 I.e., the problem is to find 푝 푥 to maximize 퐻 푝 = ׬ 푝 푥 log 푝(푥) 푑푥, such that − p(x) integrates to 1 퐸 푥 = 휇 2 푉푎푟 푥 = 휎 U Kang 21 Lagrange Multipliers How to maximize (or minimize) a function with equality constraint? Lagrange multipliers Problem: maximize f(x) when g(x)=0 Solution Maximize L(푥, 휆) = 푓(푥) + 휆 · 푔(푥) 휆 is called Lagrange multiplier 휕퐿 We find x and 휆 s.t. 훻 퐿 = 0 and = 0 푥 휕휆 훻푥푓(푥) and 훻푥푔(푥) are orthogonal to the surface g(x)=0; thus 훻푥푓 푥 = −휆 훻푥푔(푥) for some 휆 휕퐿 = 0 leads to g(x)=0 휕휆 U Kang 22 Calculus of Variations Goal: find 푝 푥 which maximizes 퐻 푝 = = ׬ 푝 푥 log 푝(푥) 푑푥, s.t. ׬ 푝(푥) 푑푥 = 1, 퐸 푥 − 휇, and 푉푎푟 푥 = 휎2 Using Lagrange multiplier, we maximize U Kang 23 Calculus of Variations We set the functional derivatives equal to 0: We obtain We are free to choose Lagrange multipliers as long as ׬ 푝(푥) 푑푥 = 1, 퐸 푥 = 휇, and 푉푎푟 푥 = 휎2 We may set the followings. Then, we obtain This is one reason for using the normal distribution when we do not know the true distribution. U Kang 24 Outline Inference as Optimization Expectation Maximization MAP Inference and Sparse Coding Variational Inference and Learning Discrete Latent Variables Calculus of Variations Continuous Latent Variables U Kang 25 Continuous Latent Variables When our model contains continuous latent variables, we can perform variational inference by maximizing 퐿 using calculus of variations. If we make the mean field approximation, 푞 ℎ 푣 = ς푖 푞 ℎ푖 푣 , and fix 푞 ℎ푖 푣 for all 푖 ≠ 푗, then the optimal 푞 ℎ푗 푣 can be obtained by normalizing the unnormalized distribution 푞෤ ℎ푗 푣 = exp(퐸 log 푝 ℎ, 푣 ) ℎ−푗~푞 ℎ−푗 푣 Thus, we apply the above equation iteratively for each value of j until convergence U Kang 26 Proof of Mean Field Approximation (claim) Assuming 푞 ℎ 푣 = ς푖 푞 ℎ푖 푣 , the optimal 푞 ℎ푗 푣 is given by normalizing the unnormalized distribution 푞෤ ℎ푗 푣 = exp(퐸 log 푝 ℎ, 푣 ) ℎ−푗~푞 ℎ−푗 푣 (proof) Note that ELBO = 퐸ℎ~푞 log 푝 ℎ, 푣 + 퐻(푞), and 푞 ℎ 푣 = ς푖 푞푖(ℎ푖|푣). = Thus, ELBO = ׬ ς푖 푞푖 (log 푝(ℎ, 푣)) 푑ℎ − ׬(ς푖 푞푖)(log ς푖 푞푖) 푑ℎ ׬ 푞푗 {׬ log 푝 ℎ, 푣 (ς푖≠푗 푞푖 푑ℎ푖)} 푑ℎ푗 − ׬(ς푖 푞푖)(σ푖 log 푞푖)푑ℎ If we take out terms related to 푞푗, then ELBO becomes ׬ 푞푗 퐸ℎ−푗(log 푝(ℎ, 푣))푑ℎ푗 − ׬ 푞푗 log 푞푗 푑ℎ푗 + 푐표푛푠푡 ∗ (׬ 푞푗 log 푝 (ℎ푗, 푣) 푑ℎ푗 − ׬ 푞푗 log 푞푗 푑ℎ푗 + 푐표푛푠푡 (1 = ∗ ∗ where 푝 (ℎ푗, 푣) is a prob. distribution and log 푝 (ℎ푗, 푣) = 퐸ℎ−푗 log 푝 ℎ, 푣 + 푐표푛푠푡 ∗ Note that (1) is negative KL divergence -퐷퐾퐿(푞푗||푝 (ℎ푗, 푣)); thus, the best 푞푗 ∗ maximizing ELBO is given by 푞푗 = 푝 (ℎ푗, 푣). ∗ In that case, log 푞푗 = log 푝 (ℎ푗, 푣) = 퐸ℎ−푗 log 푝 ℎ, 푣 + 푐표푛푠푡. Thus, 푞푗 ∝ exp 퐸ℎ log 푝 ℎ, 푣 = exp(퐸 log 푝 ℎ, 푣 ) −푗 ℎ−푗~푞 ℎ−푗 푣 U Kang 27 What you need to know Inference as Optimization Expectation Maximization MAP Inference and Sparse Coding Variational Inference and Learning U Kang 28 Questions? U Kang 29.
Recommended publications
  • Introduction to the Modern Calculus of Variations
    MA4G6 Lecture Notes Introduction to the Modern Calculus of Variations Filip Rindler Spring Term 2015 Filip Rindler Mathematics Institute University of Warwick Coventry CV4 7AL United Kingdom [email protected] http://www.warwick.ac.uk/filiprindler Copyright ©2015 Filip Rindler. Version 1.1. Preface These lecture notes, written for the MA4G6 Calculus of Variations course at the University of Warwick, intend to give a modern introduction to the Calculus of Variations. I have tried to cover different aspects of the field and to explain how they fit into the “big picture”. This is not an encyclopedic work; many important results are omitted and sometimes I only present a special case of a more general theorem. I have, however, tried to strike a balance between a pure introduction and a text that can be used for later revision of forgotten material. The presentation is based around a few principles: • The presentation is quite “modern” in that I use several techniques which are perhaps not usually found in an introductory text or that have only recently been developed. • For most results, I try to use “reasonable” assumptions, not necessarily minimal ones. • When presented with a choice of how to prove a result, I have usually preferred the (in my opinion) most conceptually clear approach over more “elementary” ones. For example, I use Young measures in many instances, even though this comes at the expense of a higher initial burden of abstract theory. • Wherever possible, I first present an abstract result for general functionals defined on Banach spaces to illustrate the general structure of a certain result.
    [Show full text]
  • Finite Elements for the Treatment of the Inextensibility Constraint
    Comput Mech DOI 10.1007/s00466-017-1437-9 ORIGINAL PAPER Fiber-reinforced materials: finite elements for the treatment of the inextensibility constraint Ferdinando Auricchio1 · Giulia Scalet1 · Peter Wriggers2 Received: 5 April 2017 / Accepted: 29 May 2017 © Springer-Verlag GmbH Germany 2017 Abstract The present paper proposes a numerical frame- 1 Introduction work for the analysis of problems involving fiber-reinforced anisotropic materials. Specifically, isotropic linear elastic The use of fiber-reinforcements for the development of novel solids, reinforced by a single family of inextensible fibers, and efficient materials can be traced back to the 1960s are considered. The kinematic constraint equation of inex- and, to date, it is an active research topic. Such materi- tensibility in the fiber direction leads to the presence of als are composed of a matrix, reinforced by one or more an undetermined fiber stress in the constitutive equations. families of fibers which are systematically arranged in the To avoid locking-phenomena in the numerical solution due matrix itself. The interest towards fiber-reinforced mate- to the presence of the constraint, mixed finite elements rials stems from the mechanical properties of the fibers, based on the Lagrange multiplier, perturbed Lagrangian, and which offer a significant increase in structural efficiency. penalty method are proposed. Several boundary-value prob- Particularly, these materials possess high resistance and lems under plane strain conditions are solved and numerical stiffness, despite the low specific weight, together with an results are compared to analytical solutions, whenever the anisotropic mechanical behavior determined by the fiber derivation is possible. The performed simulations allow to distribution.
    [Show full text]
  • A Variational Approach to Lagrange Multipliers
    JOTA manuscript No. (will be inserted by the editor) A Variational Approach to Lagrange Multipliers Jonathan M. Borwein · Qiji J. Zhu Received: date / Accepted: date Abstract We discuss Lagrange multiplier rules from a variational perspective. This allows us to highlight many of the issues involved and also to illustrate how broadly an abstract version can be applied. Keywords Lagrange multiplier Variational method Convex duality Constrained optimiza- tion Nonsmooth analysis · · · · Mathematics Subject Classification (2000) 90C25 90C46 49N15 · · 1 Introduction The Lagrange multiplier method is fundamental in dealing with constrained optimization prob- lems and is also related to many other important results. There are many different routes to reaching the fundamental result. The variational approach used in [1] provides a deep under- standing of the nature of the Lagrange multiplier rule and is the focus of this survey. David Gale's seminal paper [2] provides a penetrating explanation of the economic meaning of the Lagrange multiplier in the convex case. Consider maximizing the output of an economy with resource constraints. Then the optimal output is a function of the level of resources. It turns out the derivative of this function, if exists, is exactly the Lagrange multiplier for the constrained optimization problem. A Lagrange multiplier, then, reflects the marginal gain of the output function with respect to the vector of resource constraints. Following this observation, if we penalize the resource utilization with a (vector) Lagrange multiplier then the constrained optimization problem can be converted to an unconstrained one. One cannot emphasize enough the importance of this insight. In general, however, an optimal value function for a constrained optimization problem is nei- ther convex nor smooth.
    [Show full text]
  • Derivative, Gradient, and Lagrange Multipliers
    EE263 Prof. S. Boyd Derivative, Gradient, and Lagrange Multipliers Derivative Suppose f : Rn → Rm is differentiable. Its derivative or Jacobian at a point x ∈ Rn is × denoted Df(x) ∈ Rm n, defined as ∂fi (Df(x))ij = , i =1, . , m, j =1,...,n. ∂x j x The first order Taylor expansion of f at (or near) x is given by fˆ(y)= f(x)+ Df(x)(y − x). When y − x is small, f(y) − fˆ(y) is very small. This is called the linearization of f at (or near) x. As an example, consider n = 3, m = 2, with 2 x1 − x f(x)= 2 . x1x3 Its derivative at the point x is 1 −2x2 0 Df(x)= , x3 0 x1 and its first order Taylor expansion near x = (1, 0, −1) is given by 1 1 1 0 0 fˆ(y)= + y − 0 . −1 −1 0 1 −1 Gradient For f : Rn → R, the gradient at x ∈ Rn is denoted ∇f(x) ∈ Rn, and it is defined as ∇f(x)= Df(x)T , the transpose of the derivative. In terms of partial derivatives, we have ∂f ∇f(x)i = , i =1,...,n. ∂x i x The first order Taylor expansion of f at x is given by fˆ(x)= f(x)+ ∇f(x)T (y − x). 1 Gradient of affine and quadratic functions You can check the formulas below by working out the partial derivatives. For f affine, i.e., f(x)= aT x + b, we have ∇f(x)= a (independent of x). × For f a quadratic form, i.e., f(x)= xT Px with P ∈ Rn n, we have ∇f(x)=(P + P T )x.
    [Show full text]
  • Contactmethodsintegratingplasti
    Contact methodsintegrating plasticity B13/1 modelswith application to soil mechanics ChristianWeißenfels Leibniz Universität Hannover Contact methods integrating plasticity models with application to soil mechanics Von der Fakultät für Maschinenbau der Gottfried Wilhelm Leibniz Universität Hannover zur Erlangung des akademischen Grades Doktor-Ingenieur genehmigte Dissertation von Dipl.-Ing. Christian Weißenfels geboren am 30.01.1979 in Rosenheim 2013 Herausgeber: Prof. Dr.-Ing. Peter Wriggers Verwaltung: Institut für Kontinuumsmechanik Gottfried Wilhelm Leibniz Universität Hannover Appelstraße 11 30167 Hannover Tel: +49 511 762 3220 Fax: +49 511 762 5496 Web: www.ikm.uni-hannover.de © Dipl.-Ing. Christian Weißenfels Institut für Kontinuumsmechanik Gottfried Wilhelm Leibniz Universität Hannover Appelstraße 11 30167 Hannover Alle Rechte, insbesondere das der Übersetzung in fremde Sprachen, vorbehalten. Ohne Genehmigung des Autors ist es nicht gestattet, dieses Heft ganz oder teilweise auf photomechanischem, elektronischem oder sonstigem Wege zu vervielfältigen. ISBN 978-3-941302-06-8 1. Referent: Prof. Dr.-Ing. Peter Wriggers 2. Referent: Prof. Dr.-Ing. Karl Schweizerhof Tag der Promotion: 13.12.2012 i Zusammenfassung Der Schwerpunkt der vorliegenden Arbeit liegt in der Entwicklung von neuartigen Kon- zepten zur direkten Integration von drei-dimensionalen Plastizit¨atsmodellen in eine Kontaktformulierung. Die allgemeinen Konzepte wurden speziell auf Boden Bauwerk Interaktionen angewandt und im Rahmen der Mortar-Methode numerisch umgesetzt. Zwei unterschiedliche Strategien wurden dabei verfolgt. Die Erste integriert die Boden- modelle in eine Standard-Kontaktdiskretisierung, wobei die zweite Variante die Kon- taktformulierung in Richtung eines drei-dimensionalen Kontaktelements erweitert. Innerhalb der ersten Variante wurden zwei unterschiedliche Konzepte ausgearbeitet, wobei das erste Konzept das Bodenmodell mit dem Reibbeiwert koppelt. Dabei muss die H¨ohe der Lokalisierungszone entlang der Kontaktfl¨ache bestimmt werden.
    [Show full text]
  • Dual-Dual Formulations for Frictional Contact Problems in Mechanics
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Institutionelles Repositorium der Leibniz Universität... Dual-dual formulations for frictional contact problems in mechanics Von der Fakultät für Mathematik und Physik der Gottfried Wilhelm Leibniz Universität Hannover zur Erlangung des Grades Doktor der Naturwissenschaften Dr. rer. nat. genehmigte Dissertation von Dipl. Math. Michael Andres geboren am 28. 12. 1980 in Siemianowice (Polen) 2011 Referent: Prof. Dr. Ernst P. Stephan, Leibniz Universität Hannover Korreferent: Prof. Dr. Gabriel N. Gatica, Universidad de Concepción, Chile Korreferent: PD Dr. Matthias Maischak, Brunel University, Uxbridge, UK Tag der Promotion: 17. 12. 2010 ii To my Mum. Abstract This thesis deals with unilateral contact problems with Coulomb friction. The main focus of this work lies on the derivation of the dual-dual formulation for a frictional contact problem. First, we regard the complementary energy minimization problem and apply Fenchel’s duality theory. The result is a saddle point formulation of dual type involving Lagrange multipliers for the governing equation, the symmetry of the stress tensor as well as the boundary conditions on the Neumann boundary and the contact boundary, respectively. For the saddle point problem an equivalent variational inequality problem is presented. Both formulations include a nondiffer- entiable functional arising from the frictional boundary condition. Therefore, we introduce an additional dual Lagrange multiplier denoting the friction force. This procedure yields a dual-dual formulation of a two-fold saddle point structure. For the corresponding variational inequality problem we show the unique solvability. Two different inf-sup conditions are introduced that allow an a priori error analysis of the dual-dual variational inequality problem.
    [Show full text]
  • 18.02SC Notes: Lagrange Multipliers
    Lagrange Multipliers We will give the argument for why Lagrange multipliers work later. Here, we’ll look at where and how to use them. Lagrange multipliers are used to solve constrained optimization problems. That is, suppose you have a function, say f(x; y), for which you want to find the maximum or minimum value. But, you are not allowed to consider all (x; y) while you look for this value. Instead, the (x; y) you can consider are constrained to lie on some curve or surface. There are lots of examples of this in science, engineering and economics, for example, optimizing some utility function under budget constraints. Lagrange multipliers problem: Minimize (or maximize) w = f(x; y; z) constrained by g(x; y; z) = c. Lagrange multipliers solution: Local minima (or maxima) must occur at a critical point. This is a point where rf = λrg, and g(x; y; z) = c. Example: Making a box using a minimum amount of material. A box is made of cardboard with double thick sides, a triple thick bottom, single thick front and back and no top. It’s volume is fixed at 3. What dimensions use the least amount of cardboard? Answer: We did this problem once before by solving for z in terms of x and y and substi­ tuting for it. That led to an unconstrained optimization problem in x and y. Here we will do it as a constrained problem. It is important to be able to do this because eliminating one variable is not always easy. The box shown has dimensions x, y, and z.
    [Show full text]
  • ECON 301 Two Variable Optimization (With- and Without- Constraints)
    ECON 301 Two Variable Optimization (with- and without- constraints) Review of Some Derivative Rules 1. Partial Derivative Rules: U = xy ∂U/∂x = Ux = y ∂U/∂y = Uy = x a b a 1 b a b 1 U = x y ∂U/∂x = Ux = ax − y ∂U/∂y = Uy = bx y − a b xa a 1 b a b 1 U = x y− = b ∂U/∂x = Ux = ax − y− ∂U/∂y = Uy = bx y− − y − U = ax + by ∂U/∂x = Ux = a ∂U/∂y = Uy = b 1/2 1/2 1 1/2 1 1/2 U = ax + by ∂U/∂x = Ux = a 2 x− ∂U/∂y = Uy = b 2 y− 2. Logarithm (Natural log) ln x ¡ ¢ ¡ ¢ (a) Rules of natural log If Then y = AB ln y =ln(AB)=lnA +lnB y = A/B ln y =lnA ln B y = Ab ln y =ln(Ab−)=b ln A NOTE: ln(A + B) =lnA +lnB 6 (b) derivatives IF THEN dy 1 y =lnx dx = x dy 1 y =ln(f(x)) = f 0(x) dx f(x) · (c) Examples If Then 2 1 y =ln(x 2x) dy/dx = (x2 2x) (2x 2) 1/−2 1 1 − 1 −1 y =ln(x )= 2 ln xdy/dx= 2 x = 2x ¡ ¢¡ ¢ 3. The Number e dy if y = ex then = ex dx f(x) dy f(x) if y = e then = e f 0(x) dx · (a) Examples 3x dy 3x y = e dx = e (3) 7x3 dy 7x3 2 y = e dx = e (21x ) rt dy rt y = e dt = re 1 Using Calculus For Maximization Problems OneVariableCase If we have the following function y =10x x2 − we have an example of a dome shaped function.
    [Show full text]
  • Math 241: Multivariable Calculus, Lecture 15 Lagrange Multipliers
    Math 241: Multivariable calculus, Lecture 15 Lagrange Multipliers. go.illinois.edu/math241fa17 Wednesday, October 4th, 2017 go.illinois.edu/math241fa17. Math 241: Problem of the day Problem: Find the absolute maximum and minimum value of the function f (x; y) = x3 − x + y 2x + y + 2 on the unit disk D = f(x; y) j x2 + y 2 ≤ 1g. What guarantees that the max and min exist? go.illinois.edu/math241fa17. rf (x; y) = h3x2 − 1 + y 2; 2yx + 1i −1 If this is the zero vector we get 2xy + 1 = 0, or y = 2x . (Note that x nor y can be zero, so dividing by x or y is no problem.) Substitute in the first component to get 1 1 3x2 − 1 + = 0 =) 3x4 − x2 + = 0: 4x2 4 2 2 1 Put z = x , get 3z − z + 4 = 0. The discriminant b2 − 4ac = 1 − 3 = −2, so there are NO critical points. So we need to look at the boundary! Solution Part 1: Critical points If f (x; y) = x3 − x + y 2x + y + 2 the gradient vector is go.illinois.edu/math241fa17. −1 If this is the zero vector we get 2xy + 1 = 0, or y = 2x . (Note that x nor y can be zero, so dividing by x or y is no problem.) Substitute in the first component to get 1 1 3x2 − 1 + = 0 =) 3x4 − x2 + = 0: 4x2 4 2 2 1 Put z = x , get 3z − z + 4 = 0. The discriminant b2 − 4ac = 1 − 3 = −2, so there are NO critical points. So we need to look at the boundary! Solution Part 1: Critical points If f (x; y) = x3 − x + y 2x + y + 2 the gradient vector is rf (x; y) = h3x2 − 1 + y 2; 2yx + 1i go.illinois.edu/math241fa17.
    [Show full text]
  • Notes on Calculus and Optimization
    Economics 101A Section Notes GSI: David Albouy Notes on Calculus and Optimization 1 Basic Calculus 1.1 Definition of a Derivative Let f (x) be some function of x, then the derivative of f, if it exists, is given by the following limit df (x) f (x + h) f (x) = lim − (Definition of Derivative) dx h 0 h → df although often this definition is hard to apply directly. It is common to write f 0 (x),or dx to be shorter, or dy if y = f (x) then dx for the derivative of y with respect to x. 1.2 Calculus Rules Here are some handy formulas which can be derived using the definitions, where f (x) and g (x) are functions of x and k is a constant which does not depend on x. d df (x) [kf (x)] = k (Constant Rule) dx dx d df (x) df (x) [f (x)+g (x)] = + (Sum Rule) dx dx dy d df (x) dg (x) [f (x) g (x)] = g (x)+f (x) (Product Rule) dx dx dx d f (x) df (x) g (x) dg(x) f (x) = dx − dx (Quotient Rule) dx g (x) [g (x)]2 · ¸ d f [g (x)] dg (x) f [g (x)] = (Chain Rule) dx dg dx For specific forms of f the following rules are useful in economics d xk = kxk 1 (Power Rule) dx − d ex = ex (Exponent Rule) dx d 1 ln x = (Logarithm Rule) dx x 1 Finally assuming that we can invert y = f (x) by solving for x in terms of y so that x = f − (y) then the following rule applies 1 df − (y) 1 = 1 (Inverse Rule) dy df (f − (y)) dx Example 1 Let y = f (x)=ex/2, then using the exponent rule and the chain rule, where g (x)=x/2,we df (x) d x/2 d x/2 d x x/2 1 1 x/2 get dx = dx e = d(x/2) e dx 2 = e 2 = 2 e .
    [Show full text]
  • Unit #23 : Lagrange Multipliers Goals: • to Study Constrained Optimization
    Unit #23 : Lagrange Multipliers Goals: • To study constrained optimization; that is, the maximizing or minimizing of a function subject to a constraint (or side condition). Constrained Optimization - Examples - 1 In the previous section, we saw some of the difficulties of working with optimization when there are multiple variables. Many of those problems can be cast into an important class of problems called constrained optimization problems, which can be solved in an alternative way. Examples of Problems with Constraints 1. Show that the rectangle of maximum area that has a given perimeter p is a square. The function to be maximized: A(x; y) = xy The constraint: 2x + 2y = p Constrained Optimization - Examples - 2 2. Find the point on the sphere x2 + y2 + z2 = 4 that is closest to the point (3; 3; 5). Here we want to minimize the distance D(x; y; z) = p(x − 3)2 + (y − 3)2 + (z − 5)2 subject to the constraint x2 + y2 + z2 = 4 Lagrange Multipliers - 1 Lagrange Multipliers To solve optimization problems when we have constraints on our choice of x and y, we can use the method of Lagrange multipliers. Suppose we want to maximize the function f(x; y) subject to the constraint g(x; y) = k. We consider the relative positions of some sample level curves on the next page. Lagrange Multipliers - 2 Black lines: contours of f(x; y). Blue lines: the constraint g(x; y) = k. f = 10 f = 20 f = 30 g(x,y)= k f = 40 f = 50 f = 60 On the diagram, locate the maximum and minimum of f(x; y) (label with M and m respectively) on the level curve g(x; y) = k.
    [Show full text]
  • A Global Divergence Conforming DG Method for Hyperbolic Conservation Laws with Divergence Constraint
    A global divergence conforming DG method for hyperbolic conservation laws with divergence constraint Praveen Chandrashekar∗ Abstract We propose a globally divergence conforming discontinuous Galerkin (DG) method on Cartesian meshes for curl-type hyperbolic conservation laws based on directly evolving the face and cell moments of the Raviart-Thomas approximation polyno- mials. The face moments are evolved using a 1-D discontinuous Gakerkin method that uses 1-D and multi-dimensional Riemann solvers while the cell moments are evolved using a standard 2-D DG scheme that uses 1-D Riemann solvers. The scheme can be implemented in a local manner without the need to solve a global mass matrix which makes it a truly DG method and hence useful for explicit time stepping schemes for hyperbolic problems. The scheme is also shown to exactly pre- serve the divergence of the vector field at the discrete level. Numerical results using second and third order schemes for induction equation are presented to demonstrate the stability, accuracy and divergence preservation property of the scheme. Keywords: Hyperbolic conservation laws; curl-type equations; discontinuous Galerkin; constraint-preserving; divergence-free; induction equation. 1 Introduction Constraint-preserving approximations are important in the numerical simulation of prob- lems in computational electrodynamics (CED) and magnetohydrodynamics (MHD). The time domain Maxwell equations used in CED for the electric and magnetic fields may be written in non-dimensional units as @E @B B = J; + E = 0 @t − r × − @t r × with the constraint that @ρ B = 0; E = ρ, + J = 0 r · r · @t r · where ρ is the electric charge density and J is the current which can be related to E, arXiv:1809.03294v1 [math.NA] 10 Sep 2018 B through Ohm's Law.
    [Show full text]