On Detx , Logdetx and Logdetxtx

Total Page:16

File Type:pdf, Size:1020Kb

On Detx , Logdetx and Logdetxtx On det X , log det X and log det XT X Andersen Ang Math´ematiqueet de Recherche op´erationnelle Facult´epolytechnique de Mons UMONS Mons, Belgium email: [email protected] homepage: angms.science August 14, 2017 1 / 17 Overview 1 On det X Physical meaning of det X @ det X −T Jacobi's Formula and @X = det X · X 2 On log det X Physical meaning of log det X log det X is concave (Fast) computation of log det 3 On log det XT X Physical meaning of log det XT X Derivative of log det XT X + δI 4 Summary 2 / 17 What is det : the volume of parallelepiped n×n For a square matrix X = [x1 x2 ::: xn] 2 R with linearly independent columns, det X tells the volume of the parallelepiped spaned by the columns xi det X = vol(fx1 x2 ::: xng) 2 Why? Consider the case n = 2 : two vectors x1; x2 2 R , we can always perform Gram-Schmidt orthogonalization to get two vectors v1; v2 v1 = x1 ? v2 = c12v1 + v2 ? where vector v2 is orthogonal to v1. Then ? det(v1; v2) = det(v1; c12v1 + v2 ) ? = det(v1; v2 ) = signed volume(v1; v2) same argument applies to any n For detail, see Hannah, John. "A geometric approach to determinants." American Mathematical Monthly (1996): 401-409. 3 / 17 Derivative of detX - the Jacobi's Formula For a non-singular matrix X, recall that : adjugate-det-inverse relationship: adjX = det X · X−1 adjugate-cofactor relationship : adjX = CT Jacobi's Formula gives the derivative of det X with respect to (w.r.t.) scalar x @ det X @X = det X · Tr X−1 (1) @x @x @X = Tr det X · X−1 | {z } @x adjX @X = Tr adjX | {z } @x CT @X = C; @x F So the derivative of det X equals to the matrix inner product of cofactor and derivative of X 4 / 17 Derivative of detX - the Jacobi's Formula @ det X @X The equation = C; makes sense because : @x @x F The derivative of scalar value det X w.r.t. scalar x is a scalar @X C; is a scalar @x F The derivative of det X w.r.t. matrix X is a matrix. The expression is @ det X = det X · X−T @X For derivation, refer to previous document. 5 / 17 What is log det The log-determinant of a matrix X is log det X X has to be square (* det) X has to be positive definite (pd), because Q I det X = i λi I all eigenvalues of pd matrix are positive I domain of log has to be positive real number (log of negative number produces complex number which is out of context here) if X is positive semidefinite (psd), it is better to consider a regularized version log det(X + δI), where δ > 0 is a small positive number. As all eigenvalues of a psd matrix is non-negative (zero or positve), adding a small positive number δ removes all the zero eigenvalues, turns X + δI to pd log det is a concave function @ log det X = X−T @X 6 / 17 Why log det Q For a matrix X, the expression det X = λi is dominated by the leading eigenvalues. Such domination is not a problem if we only care about the leading eigenvalues. When all the eigenvalues are important, one way to suppress the leading eigenvalues is to use log. Hence we have X log det X = log λi = log λ1 + log λ2 + ::: Again, since the input of log should not be negative nor zero, the matrix X here should be pd to make λi > 0 7 / 17 log det X is concave n For f(X) = log det X, X 2 S++, f is concave on X. Proof. To show f is concave on X is equivalent to show g(t) = f(X) = f(Z + tV ) is concave n on t where Z; V 2 S , and domg = ftjZ + tV 0g \ ft = 0g g(t) = log det(Z + tV ) 1 − 1 − 1 1 = log det Z 2 (I + tZ 2 VZ 2 )Z 2 1 1 − 1 − 1 = log det Z 2 (I + tV^ )Z 2 let V^ = Z 2 VZ 2 h ^ i = log det I + tV det Z * det(AB) = det(A) det(B) = log det I + tV^ + log det Z = log det QQT + tQΛQT + log det Z V^ = QΛQT ; QQT = QT Q = I = log det Q(I + tΛ)QT + log det Z = log det(QQT ) + log det(I + tΛ) + log det Z = log det(I + tΛ) + log det Z QQT = I; det I = 1; log 1 = 0 Q Q = log (1 + tλi) + log det Z det Λ = λi P = log(1 + tλi) + log det Z @g λ @2g λ2 = P i and = − P i < 0, g(t) is concave, so f(X) is concave (and 2 2 @t 1 + tλi @t (1 + tλi) −f(X) is convex) Reference: appendix of the Convex Optimization book by Boyd 8 / 17 Application of concavity of log det X : Taylor bound Fact : first order Taylor approximation is a global over-estimator of a concave function. That is, f(x) ≤ f(y) + rf(y)T (x − y) As log det X is concave, so it is upper bounded by its first order Taylor approximation. −T log det X ≤ log det Y + Y ; (X − Y ) F Again, Y −T and X − Y are matrices while log det X and log det Y are scalars, so the matrix inner product h ; i has to be applied. log det X ≤ log det Y + Tr Y −1(X − Y ) Such expression can be used for minimization of log det X. 9 / 17 (Fast) computation of log det Ways to compute log det X : 1. By definition. Compute det X, then take log. For example, using Laplace cofactor expansion formula Q 2. Eigenvalues (det X = λi) P Compute the eigenvalues of X, then log det X = log λi 3. Cholesky factorization. For X being pd, apply Cholesky decomoposition on X = LLT , then with Q P det L = Lii , log det X = 2 log Lii There are many other methods , for example approximating det X for big matrix X. 10 / 17 On log det XT X : why log det XT X The det X requires X to be square matrix. For non-square X, one can try det XT X, where XT X is the Gram matrix T T 2 of X and it is always psd : y X Xy = kXyk2 ≥ 0. Again it is better to consider a regularied version log det(XX + δI) for removing the possibility of having det(XT X + δI) = 0. Note. log det XT X and log det(XT X + δI) is not conave nor covex in X 11 / 17 Derivative of log det XT X + δI Let matrix B = XT X + δI to shorten the notation. Again, the derivative of the scalar-valued function log det B w.r.t. to a scalar x is a scalar, so the following expression after using chain rule is correct : @ log det B @ log det B @ det B = (2) @x @ det B @x And the following expression after using chain rule is wrong : @ log det B @ log det B @B = @x @B @x The correct expression (with inner product operator) should be @ log det B h@ log det B T @B i = Tr (3) @x @B @x 12 / 17 Derivative of log det XT X + δI ... 2 @ log det B @ log det B @ det B Consider equation (2) : = . @x @ det B @x @ log det B @ det B is just a simple log differentiation as det B is a scalar @ log det B 1 = (4) @ det B det B @ det B For @x , apply Jacobi's Formula @ det B @B = det B Tr B−1 (5) @x @x Put (4),(5) to (2) : @ log det B @B = Tr B−1 @x @x 13 / 17 Derivative of log det XT X + δI ... 3 Put B = XT X + δI back ,we have @ log det(XT X + δI) @(XT X + δI) = Tr (XT X + δI)−1 @x @x @(XT X+δI) @XT X @δI @δI As @x = @x + @x and @x gives zero matrix so @ log det(XT X + δI) @XT X = Tr (XT X + δI)−1 (6) @x @x Again, note that log det(:) and x are scalars but (XT X + δI)−1 and @XT X are matrices. It is the trace operator turns the matrix @x @XT X (XT X + δI)−1 back to scalar so that equation (6) makes sense! @x 14 / 17 Derivative of log det XT X + δI ... 4 Now consider another way to show the same result. Consider (3) : @ log det B h@ log det B T @B i = Tr @x @B @x @ log det B −T Since @B = B (page5), so we have @ log det B h @B i = Tr B−1 @x @x @B @XT X From last page we have @x = @x , so @ log det B h @XT X i = Tr B−1 @x @x Put back B = XT X + δI we get the same result as last page. 15 / 17 Derivative of log det XT X + δI ... 5 @ log det(XT X + δI) h @XT X i So we have = Tr (XT X + δI)−1 @x @x T @X X T ij ji ij Put x = Xij we have = X J + J X where J is single entry @Xij matrix. We get @ log det(XT X + δI) h i = Tr (XT X + δI)−1(XT Jij + Jij X) @Xij h i h i = Tr (XT X + δI)−1XT Jij + Tr (XT X + δI)−1Jij X) ij T ji By Tr(AJ ) = [A ]ij and Tr(AJ B) = [BA]ij, we have @ log det(XT X + δI) h iT h i = (XT X + δI)−1XT + X(XT X + δI)−1 @Xij ij ij As XT X + δI is symmetric so @ log det(XT X + δI) = 2X(XT X + δI)−1 @X 16 / 17 Last page - summary Physical meanings of det X and log det X log det X is concave in X Jacobi Formula and derivatives w.r.t.
Recommended publications
  • SOME CONSEQUENCES of the MEAN-VALUE THEOREM Below
    SOME CONSEQUENCES OF THE MEAN-VALUE THEOREM PO-LAM YUNG Below we establish some important theoretical consequences of the mean-value theorem. First recall the mean value theorem: Theorem 1 (Mean value theorem). Suppose f :[a; b] ! R is a function defined on a closed interval [a; b] where a; b 2 R. If f is continuous on the closed interval [a; b], and f is differentiable on the open interval (a; b), then there exists c 2 (a; b) such that f(b) − f(a) = f 0(c)(b − a): This has some important corollaries. 1. Monotone functions We introduce the following definitions: Definition 1. Suppose f : I ! R is defined on some interval I. (i) f is said to be constant on I, if and only if f(x) = f(y) for any x; y 2 I. (ii) f is said to be increasing on I, if and only if for any x; y 2 I with x < y, we have f(x) ≤ f(y). (iii) f is said to be strictly increasing on I, if and only if for any x; y 2 I with x < y, we have f(x) < f(y). (iv) f is said to be decreasing on I, if and only if for any x; y 2 I with x < y, we have f(x) ≥ f(y). (v) f is said to be strictly decreasing on I, if and only if for any x; y 2 I with x < y, we have f(x) > f(y). (Can you draw some examples of such functions?) Corollary 2.
    [Show full text]
  • Advanced Calculus: MATH 410 Functions and Regularity Professor David Levermore 6 August 2020
    Advanced Calculus: MATH 410 Functions and Regularity Professor David Levermore 6 August 2020 5. Functions, Continuity, and Limits 5.1. Functions. We now turn our attention to the study of real-valued functions that are defined over arbitrary nonempty subsets of R. The subset of R over which such a function f is defined is called the domain of f, and is denoted Dom(f). We will write f : Dom(f) ! R to indicate that f maps elements of Dom(f) into R. For every x 2 Dom(f) the function f associates the value f(x) 2 R. The range of f is the subset of R defined by (5.1) Rng(f) = f(x): x 2 Dom(f) : Sequences correspond to the special case where Dom(f) = N. When a function f is given by an expression then, unless it is specified otherwise, Dom(f) will be understood to be all x 2 R forp which the expression makes sense. For example, if functions f and g are given by f(x) = 1 − x2 and g(x) = 1=(x2 − 1), and no domains are specified explicitly, then it will be understood that Dom(f) = [−1; 1] ; Dom(g) = x 2 R : x 6= ±1 : These are natural domains for these functions. Of course, if these functions arise in the context of a problem for which x has other natural restrictions then these domains might be smaller. For example, if x represents the population of a species or the amountp of a product being manufactured then we must further restrict x to [0; 1).
    [Show full text]
  • Matrix Convex Functions with Applications to Weighted Centers for Semidefinite Programming
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Research Papers in Economics Matrix Convex Functions With Applications to Weighted Centers for Semidefinite Programming Jan Brinkhuis∗ Zhi-Quan Luo† Shuzhong Zhang‡ August 2005 Econometric Institute Report EI-2005-38 Abstract In this paper, we develop various calculus rules for general smooth matrix-valued functions and for the class of matrix convex (or concave) functions first introduced by L¨ownerand Kraus in 1930s. Then we use these calculus rules and the matrix convex function − log X to study a new notion of weighted centers for semidefinite programming (SDP) and show that, with this definition, some known properties of weighted centers for linear programming can be extended to SDP. We also show how the calculus rules for matrix convex functions can be used in the implementation of barrier methods for optimization problems involving nonlinear matrix functions. ∗Econometric Institute, Erasmus University. †Department of Electrical and Computer Engineering, University of Minnesota. Research supported by National Science Foundation, grant No. DMS-0312416. ‡Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Shatin, Hong Kong. Research supported by Hong Kong RGC Earmarked Grant CUHK4174/03E. 1 Keywords: matrix monotonicity, matrix convexity, semidefinite programming. AMS subject classification: 15A42, 90C22. 2 1 Introduction For any real-valued function f, one can define a corresponding matrix-valued function f(X) on the space of real symmetric matrices by applying f to the eigenvalues in the spectral decomposition of X. Matrix functions have played an important role in scientific computing and engineering.
    [Show full text]
  • Lecture 1: Basic Terms and Rules in Mathematics
    Lecture 7: differentiation – further topics Content: - additional topic to previous lecture - derivatives utilization - Taylor series - derivatives utilization - L'Hospital's rule - functions graph course analysis Derivatives – basic rules repetition from last lecture Derivatives – basic rules repetition from last lecture http://www.analyzemath.com/calculus/Differentiation/first_derivative.html Lecture 7: differentiation – further topics Content: - additional topic to previous lecture - derivatives utilization - Taylor series - derivatives utilization - L'Hospital's rule - functions graph course analysis Derivatives utilization – Taylor series Definition: The Taylor series of a real or complex-valued function ƒ(x) that is infinitely differentiable at a real or complex number a is the power series: In other words: a function ƒ(x) can be approximated in the close vicinity of the point a by means of a power series, where the coefficients are evaluated by means of higher derivatives evaluation (divided by corresponding factorials). Taylor series can be also written in the more compact sigma notation (summation sign) as: Comment: factorial 0! = 1. Derivatives utilization – Taylor series When a = 0, the Taylor series is called a Maclaurin series (Taylor series are often given in this form): a = 0 ... and in the more compact sigma notation (summation sign): a = 0 graphical demonstration – approximation with polynomials (sinx) nice examples: https://en.wikipedia.org/wiki/Taylor_series http://mathdemos.org/mathdemos/TaylorPolynomials/ Derivatives utilization – Taylor (Maclaurin) series Examples (Maclaurin series): Derivatives utilization – Taylor (Maclaurin) series Taylor (Maclaurin) series are utilized in various areas: 1. Approximation of functions – mostly in so called local approx. (in the close vicinity of point a or 0). Precision of this approximation can be estimated by so called remainder or residual term Rn, describing the contribution of ignored higher order terms: 2.
    [Show full text]
  • Multivariate Calculus∗
    Chapter 3 - Multivariate Calculus∗ Justin Leducy These lecture notes are meant to be used by students entering the University of Mannheim Master program in Economics. They constitute the base for a pre-course in mathematics; that is, they summarize elementary concepts with which all of our econ grad students must be familiar. More advanced concepts will be introduced later on in the regular coursework. A thorough knowledge of these basic notions will be assumed in later coursework. Although the wording is my own, the definitions of concepts and the ways to approach them is strongly inspired by various sources, which are mentioned explicitly in the text or at the end of the chapter. Justin Leduc I have slightly restructured and amended these lecture notes provided by my predecessor Justin Leduc in order for them to suit the 2016 course schedule. Any mistakes you might find are most likely my own. Simona Helmsmueller ∗This version: 2018 yCenter for Doctoral Studies in Economic and Social Sciences. 1 Contents 1 Multivariate functions 4 1.1 Invertibility . .4 1.2 Convexity, Concavity, and Multivariate Real-Valued Functions . .6 2 Multivariate differential calculus 10 2.1 Introduction . 10 2.2 Revision: single-variable differential calculus . 12 2.3 Partial Derivatives and the Gradient . 15 2.4 Differentiability of real-valued functions . 18 2.5 Differentiability of vector-valued functions . 22 2.6 Higher Order Partial Derivatives and the Taylor Approximation Theorems . 23 2.7 Characterization of convex functions . 26 3 Integral theory 28 3.1 Univariate integral calculus . 28 3.2 The definite integral and the fundamental theorem of calculus .
    [Show full text]
  • MA 105 : Calculus Division 1, Lecture 06
    MA 105 : Calculus Division 1, Lecture 06 Prof. Sudhir R. Ghorpade IIT Bombay Prof. Sudhir R. Ghorpade, IIT Bombay MA 105 Calculus: Division 1, Lecture 6 Recap of the previous lecture Rolle's theorem and its proof (using Extreme Value Property) (Lagrange's) Mean Value Theorem and its proof Consequences of MVT Monotonicity. Relation with derivatives Convexity and Concavity. Prof. Sudhir R. Ghorpade, IIT Bombay MA 105 Calculus: Division 1, Lecture 6 Convexity and Concavity Let I be an interval. A function f : I R is called convex on I if the line segment joining any two points! on the graph of f lies on or above the graph of f . If x1; x2 I , then the equation 2 of the line passing through (x1; f (x1)) and (x2; f (x2)) is given by f (x2) f (x1) y = f (x1) + − (x x1). Thus f is convex on I if x2 x1 − − f (x2) f (x1) x1; x2; x I ; x1 < x < x2 = f (x) f (x1)+ − (x x1): 2 ) ≤ x2 x1 − − Similarly, a function f : I R is called concave on I if ! f (x2) f (x1) x1; x2; x I ; x1 < x < x2 = f (x) f (x1)+ − (x x1): 2 ) ≥ x2 x1 − − Remark: It is easy to see that f f ((1 λ)x1 + λx2) (1 λ)f (x1)+λf (x2) λ [0; 1] and x1; x2 I : − ≤ − 8 2 2 Likewise for concacity (with changed to ). ≤ ≥ Prof. Sudhir R. Ghorpade, IIT Bombay MA 105 Calculus: Division 1, Lecture 6 A function f : I R is called strictly convex on I if the inequality in the! definition of a convex function can be replaced by≤ the strict inequality <.
    [Show full text]
  • 2Nd Order Derivatives, Concavity
    Mathematics for Economics Anthony Tay 10. The Second-Order Derivative, Concavity and Convexity The second-order derivative (or simply ‘second derivative’) is the derivative of the derivative fa′′(+− h ) fa () fa′′( )= lim . h→0 h The objective of this section is to provide examples of their application in general, and in economics in particular. We can also compute ‘third-order derivatives’, ‘fourth-order derivatives’, etc. These are useful, although we will be using them less frequently. Example 10.1 If fx()= x3 , then fx′()= 3 x2 , fx′′()= 6 x, fx′′′()= 6. Notation If y = f (x) , then d 2 dy2 Second Derivative f ′′(x), f (2)(x), y′′ , fx(), dx2 dx2 d n dyn n-th ordered derivative f (n)(x), y(n) , fx() , dxn dxn If y is a function of time, then the second derivative is often written y , following the convention that the first-derivative in such applications is written y . The most straightforward interpretation of the second-derivative is that it is the rate of change of the rate of change of fx() as x increases. Geometrically, the second derivative can also help to describe the shape of a function. When used in specific disciplines, the first and second derivations can often be given an interpretation appropriate to the context of that discipline. You are probably familiar with the concepts of velocity and acceleration in physics. Example 10.2 If xt() gives you the position of an object relative to some reference point x(0) , and t is time, then x is the velocity of the object, and x is its acceleration.
    [Show full text]
  • Maxima and Minima
    Maxima and Minima 9 This unit is designed to introduce the learners to the basic concepts associated with Optimization. The readers will learn about different types of functions that are closely related to optimization problems. This unit discusses maxima and minima of simple polynomial functions and develops the concept of critical points along with the first derivative test and next the concavity test. This unit also discusses the procedure of determining the optimum (maximum or minimum) point with single variable function, multivariate function and constrained equations. Some relevant business and economic applications of maxima and minima are also provided in this unit for clear understanding to the learners. School of Business Blank Page Unit-9 Page-200 Bangladesh Open University Lesson-1: Optimization of Single Variable Function After studying this lesson, you should be able to: Describe the concept of different types of functions; Explain the maximum, minimum and point of inflection of a function; Describe the methodology for determining optimization conditions for mathematical functions with single variable; Determine the maximum and minimum of a function with single variable; Determine the inflection point of a function with single variable. Introduction Optimization is a predominant theme in management and economic analysis. For this reason, the classical calculus methods of finding free and constrained extrema and the more recent techniques of mathematical programming occupy an important place in management and economics. The most common The most common criterion of choice among alternatives in economics is criterion of choice the goal of maximizing something (i.e. profit maximizing, utility among alternatives in maximizing, growth rate maximizing etc.) or of minimizing something economics is the goal of maximizing (i.e.
    [Show full text]
  • Log-Concave Polynomials II: High-Dimensional Walks and an FPRAS for Counting Bases of a Matroid
    Log-Concave Polynomials II: High-Dimensional Walks and an FPRAS for Counting Bases of a Matroid Nima Anari1, Kuikui Liu2, Shayan Oveis Gharan2, and Cynthia Vinzant3 1Stanford University, [email protected] 2University of Washington, [email protected], [email protected] 3North Carolina State University, [email protected] February 11, 2019 Abstract We design an FPRAS to count the number of bases of any matroid given by an independent set oracle, and to estimate the partition function of the random cluster model of any matroid in the regime where 0 < q < 1. Consequently, we can sample random spanning forests in a graph and (approximately) compute the reliability polynomial of any matroid. We also prove the thirty year old conjecture of Mihail and Vazirani that the bases exchange graph of any matroid has expansion at least 1. Our algorithm and the proof build on the recent results of Dinur, Kaufman, Mass and Oppenheim [KM17; DK17; KO18] who show that high dimensional walks on simplicial complexes mix rapidly if the corresponding localized random walks on 1-skeleton of links of all complexes are strong spectral expanders. One of our key observations is a close connection between pure simplicial complexes and multiaffine homogeneous polynomials. Specifically, if X is a pure simplicial complex with positive weights on its maximal faces, we can associate with X a multiaffine homogeneous polynomial pX such that the eigenvalues of the localized random walks on X correspond to the eigenvalues of the Hessian of derivatives of pX. 1 Introduction [n] Let m : 2 ! R>0 be a probability distribution on the subsets of the set [n] = f1, 2, ..
    [Show full text]
  • On Hopital-Style Rules for Monotonicity and Oscillation
    On Hˆopital-style rules for monotonicity and oscillation MAN KAM KWONG∗ Department of Applied Mathematics The Hong Kong Polytechnic University, Hunghom, Hong Kong [email protected] Abstract We point out the connection of the so-called Hˆopital-style rules for monotonicity and oscillation to some well-known properties of concave/convex functions. From this standpoint, we are able to generalize the rules under no differentiability requirements and greatly extend their usability. The improved rules can handle situations in which the functions involved have non-zero initial values and when the derived functions are not necessarily monotone. This perspective is not new; it can be dated back to Hardy, Littlewood and P´olya. 2010 Mathematics Subject Classification. 26.70, 26A48, 26A51 Keywords. Monotonicity, Hˆopital’s rule, oscillation, inequalities. arXiv:1502.07805v1 [math.CA] 27 Feb 2015 ∗The research of this author is supported by the Hong Kong Government GRF Grant PolyU 5003/12P and the Hong Kong Polytechnic University Grants G-UC22 and G-YBCQ. 1 1 Introduction and historical remarks Since the 1990’s, many authors have successfully applied the so-called monotone L’Hˆopital’s† rules to establish many new and useful inequalities. In this article we make use of the connection of these rules to some well-known properties of concave/convex functions (via a change of variable) to extend the usability of the rules. We show how the rules can be formulated under no differentiability requirements, and how to characterize all possible situations, which are normally not covered by the conventional form of the rules. These include situations in which the functions involved have non-zero initial values and when the derived functions are not necessarily monotone.
    [Show full text]
  • Convexity Conditions and Intersections with Smooth
    TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY Volume 289. Number 2, June 1985 CONVEXITYCONDITIONS AND INTERSECTIONS WITH SMOOTH FUNCTIONS BY S. AGRONSKY, A. M. BRUCKNER1, M. LACZKOVICH AND D. PREISS Abstract. A continuous function that agrees with each member of a family J^of smooth functions in a small set must itself possess certain desirable properties. We study situations that arise when ^consists of the family of polynomials of degree at most n, as well as certain larger families and when the small sets of agreement are finite. The conclusions of our theorems involve convexity conditions. For example, if a continuous function / agrees with each polynomial of degree at most n in only a finite set, then/is (n + l)-convex or (n + l)-concave on some interval. We consider also certain variants of this theorem, provide examples to show that certain improve- ments are not possible and present some applications of our results. 1. Introduction. We shall examine the extent to which certain theorems related to monotonicity extend to convexity and, more generally, n-convexity (monotonicity itself can be viewed as 1-convexity). By a theorem of E. Cech [2], if a continuous function / defined in an interval takes every value in finitely many points, then / is monotonie in some subinterval. Our main result is the following generalization of Cech's theorem: If the graph of / intersects the graph of every polynomial of degree at most n in finitely many points, then /is (n + l)-convex or (n + l)-concave in some subinterval (Theorem 13). Cech's result is the case of n = 0.
    [Show full text]
  • Multi-Variable Calculus and Optimization
    Multi-variable Calculus and Optimization Dudley Cooke Trinity College Dublin Dudley Cooke (Trinity College Dublin) Multi-variable Calculus and Optimization 1 / 51 EC2040 Topic 3 - Multi-variable Calculus Reading 1 Chapter 7.4-7.6, 8 and 11 (section 6 has lots of economic examples) of CW 2 Chapters 14, 15 and 16 of PR Plan 1 Partial differentiation and the Jacobian matrix 2 The Hessian matrix and concavity and convexity of a function 3 Optimization (profit maximization) 4 Implicit functions (indifference curves and comparative statics) Dudley Cooke (Trinity College Dublin) Multi-variable Calculus and Optimization 2 / 51 Multi-variable Calculus Functions where the input consists of many variables are common in economics. For example, a consumer's utility is a function of all the goods she consumes. So if there are n goods, then her well-being or utility is a function of the quantities (c1, ... , cn) she consumes of the n goods. We represent this by writing U = u(c1, ... , cn) Another example is when a firm’s production is a function of the quantities of all the inputs it uses. So, if (x1, ... , xn) are the quantities of the inputs used by the firm and y is the level of output produced, then we have y = f (x1, ... , xn). We want to extend the calculus tools studied earlier to such functions. Dudley Cooke (Trinity College Dublin) Multi-variable Calculus and Optimization 3 / 51 Partial Derivatives Consider a function y = f (x1, ... , xn). The partial derivative of f with respect to xi is the derivative of f with respect to xi treating all other variables as constants and is denoted, ¶f or fxi ¶xi Consider the following function: y = f (u, v) = (u + 4)(6u + v).
    [Show full text]