Arithmetic Subderivatives: P-Adic Discontinuity and Continuity

Total Page:16

File Type:pdf, Size:1020Kb

Arithmetic Subderivatives: P-Adic Discontinuity and Continuity 1 2 Journal of Integer Sequences, Vol. 23 (2020), 3 Article 20.7.3 47 6 23 11 Arithmetic Subderivatives: p-adic Discontinuity and Continuity Pentti Haukkanen and Jorma K. Merikoski1 Faculty of Information Technology and Communication Sciences FI-33014 Tampere University Finland [email protected] [email protected] Timo Tossavainen Department of Arts, Communication and Education Lulea University of Technology SE-97187 Lulea Sweden [email protected] Abstract In a previous paper, we proved that the arithmetic subderivative DS is discontinuous at any rational point with respect to the ordinary absolute value. In the present paper, we study this question with respect to the p-adic absolute value. In particular, we show that DS is in this sense continuous at the origin if S is finite or p∈ / S. 1 Introduction Let 0 =6 x ∈ Q. There exists a unique sequence (νp(x))p∈P of integers (with only finitely many nonzero terms) such that 1Corresponding author. 1 x = (sgn x) pνp(x). (1) Y p∈P Here P stands for the set of primes, and sgn x = x/|x|. Define that sgn0 = 0 and νp(0) = ∞ for all p ∈ P. In addition to the ordinary axioms of ∞, we state that 0 · ∞ = 0. Then (1) holds also for x = 0. We recall the basic properties of the p-adic order νp. Proposition 1. For all x,y ∈ Q, (a) νp(x)= ∞ if and only if x = 0; (b) νp(xy)= νp(x)+ νp(y); (c) νp(x + y) ≥ min(νp(x),νp(y)); (d) νp(x + y) = min(νp(x),νp(y)) if νp(x) =6 νp(y). Proof. Properties (a) and (b) are trivial. For (c) and (d), see, e.g., [2, Proposition 2.4]. Throughout this paper, we let a ∈ Q, p,q ∈ P, p =6 q, and ∅= 6 S ⊆ P. The arithmetic subderivative [11, 8, 9] of x ∈ Q with respect to S, a.k.a. the arithmetic type derivative [4] is ν (x) D (x)= x p . S X p p∈S The arithmetic partial derivative [10, 7] of x with respect to p is Dp(x) = D{p}(x). The arithmetic derivative [12, 3, 13] of x is D(x)= DP(x). Clearly, D (x)= D (x),D(x)= D (x). S X p X p p∈S p∈P The function DS is very strongly discontinuous at any a [8, Theorem 4] with respect to the ordinary absolute value. But do we succeed better if we use the p-adic absolute value of x, defined by 1 |x|p = ? pνp(x) (In particular, |0|p =1/∞ = 0.) We recall the basic properties of |·|p. Proposition 2. For all x,y ∈ Q, (a) |x|p =0 if and only if x = 0; (b) |xy|p = |x|p|y|p; 2 (c) |x + y|p ≤ max(|x|p, |y|p); (d) |x + y|p = max(|x|p, |y|p) if |x|p =6 |y|p. Proof. This proposition is equivalent to Proposition 1. Let us write 1 νp(x) νp(x) νp(x) x = x p = x|x|pp = µp(x)p , pνp(x) where x µp(x)= x|x|p = . (2) pνp(x) Proposition 3. For all x,y ∈ Q, (a) νp(µp(x))=0 if x =06 , νp(µp(0)) = νp(0) = ∞; (b) |µp(x)|p =1 if x =06 , |µp(0)|p = 0; (c) µp(xy)= µp(x)µp(y); (d) Dp(µp(x))=0. Proof. Trivial. The p-adic distance |x − y|p is smaller the larger νp(x − y) is. We say that a function Q → Q is p-adically continuous, in short p-continuous, if it is continuous with respect to |·|p. So we ask: Is DS p-continuous at some a? We study this question by considering sequences (xi) of rational numbers. If |xi − a|p → 0, equivalently νp(xi − a) → ∞, then (xi) converges p-adically, in short p-converges, to a. Let xi →p a denote this convergence. Sections 2–3 are introductory. We present in Section 2 a “light version” of Dirichlet’s theorem on arithmetic progressions. We study p-convergence in Section 3. Sections 4–7 contain our main results. We prove in Section 4 that DS is p-continuous at a = 0 if S is finite or p∈ / S. We also prove that Dp is p-continuous also at a =6 0. On the other hand, we show in Section 5 that Dq can be (and conjecture that it always is) p-discontinuous at a =6 0. In Section 6, we extend the results of Section 5 to DS when S is finite. Although Section 5 is only a special case of Section 6, we find it instructive to present it separately. We complete our paper with the conclusion in Section 7. 2 “Poor man’s theorem on arithmetic progressions” Throughout this section, a,b ∈ Z with gcd(a,b) = 1. As suggested by Graham et al. [5], we let a ⊥ b denote that gcd(a,b)=1. See also [6, 4]. We recall Dirichlet’s theorem on arithmetic progressions. 3 Theorem 4. If b> 0, then the set T = {a + nb : n ∈ Z+} (3) contains infinitely many primes. Proof. See, e.g., [1, Theorem 7.9]. Corollary 5. If b =6 0, then the set (3) contains infinitely many primes or their additive inverses. Proof. Trivial. Theorem 4 is advanced, while our paper is elementary. We do not need the full force of this theorem, and we want to use only elementary methods. Therefore we apply, instead of Theorem 4, the following “poor man’s theorem on arithmetic progressions.” It is elementary but strong enough for us. (Remember that ∅= 6 S ⊆ P throughout.) Theorem 6. If S is finite and b =06 , then the set (3) contains infinitely many numbers that are not divisible by any element of S. Proof. If a = 0, then b = ±1, since otherwise a 6⊥ b, a contradiction. Therefore T = Z+ or T = Z−, and the claim is trivially true. Now assume that a =6 0. Let S = {p1,...,ph,q1,...,qk}, p1,...,ph ∤ a, q1,...,qk | a. (Either the pi or the qi can be missing. Clearly, pi =6 qj for all i, j.) We show that the numbers a + nb apply when n goes through the set N = {mp1 ··· ph : m ∈ Z+, q1,...,qk ∤ m}. Write c = p1 ··· ph. (If the pi are missing, then the “empty product” c = 1.) Let x ∈ T with n ∈ N, that is, x = a + mcb, q1,...,qk ∤ m. (4) Each pi | c but pi ∤ a, so pi ∤ x. Each qi | a but qi ∤ mcb. (Clearly, qi ∤ m,c. If qi | b, then a 6⊥ b, a contradiction.) Therefore also qi ∤ x. Consequently, s ∤ x for all s ∈ S. Because there are infinitely many numbers (4), the claim follows. 4 3 Convergence Continuity is usually proved by the “ε − δ technique”, while discontinuity is often proved using suitable sequences. For consistency, we use sequences also in proving continuity. To that end, we need a characterization of p-convergence. Proposition 7. Let (xi) be a sequence of rational numbers. If a =6 0, then the following conditions are equivalent. (a) xi →p a; (b) µp(xi) →p µp(a) and there is i0 ∈ Z+ such that νp(xi0 )= νp(xi0+1)= ··· = νp(a). If a =0, then (b) ⇒ (a) but not conversely. Proof. Case 1: a =6 0. (a) ⇒ (b): If i0 does not exist, then (xi) has a subsequence (xik ) whose each term satisfies νp(xik ) =6 νp(a). Consequently, Prop.1(d) a6=0 νp(xik − a) = min(νp(xik ),νp(a)) ≤ νp(a) < ∞, k ≥ 1. Hence νp(xik − a) 6→ ∞, implying xi 6→p a, a contradiction. Therefore i0 exists, i.e., νp(a) xi = µp(xi)p , i ≥ i0. Now, for i ≥ i0, we have νp(a) xi − a =(µp(xi) − µp(a))p , (5) and further (5),Prop. 1(b) (a) νp(µp(xi) − µp(a)) + νp(a) = νp(xi − a) → ∞, verifying µp(xi) →p µp(a). (b) ⇒ (a): Since (b) νp(a) νp(a) νp(a) xi − a = µp(xi)p − µp(a)p =(µp(xi) − µp(a))p , i ≥ i0, we have Prop. 1(b) (b) νp(xi − a) = νp(µp(xi) − µp(a)) + νp(a) → ∞, verifying (a). Case 2: a = 0. (b) ⇒ (a). Since νp(xi0 )= νp(xi0+1)= ··· = νp(0) = ∞, it follows that xi0 = xi0+1 = ··· = 0. Therefore xi →p 0. i (a) 6⇒ (b). If xi = p , then xi →p 0, but µp(xi)=1 →p 1 =0=6 µp(0). 5 Proposition 8. A function F : Q → Q is p-continuous at a if and only if any sequence (xi) of rational numbers satisfying xi →p a satisfies F (xi) →p F (a). Proof. Proceed as in proving the corresponding property of the ordinary continuity. There are three formally different ways to consider p-convergence. First, use νp every- where. Second, use |·|p everywhere. Third, use either νp or |·|p, depending on the situation. We follow the first way. 4 The cases of DS, a =0, and Dp, a arbitrary We begin with a lemma that may be interesting on its own. Lemma 9. Let S be finite and y ∈ Q. Assume that {q ∈ P | νq(y) =06 }⊆ S. Factorize y = qνq(y) = u(y)w(y), Y q∈S where u(y)= qνq(y)−1, w(y)= q. Y Y q∈S q∈S Then DS(y)= u(y)v(y), where v(y)= ν (y) r. X q Y q∈S r∈S\{q} Proof. We have D (y)= D (qνq(y)) rνr(y) = ν (y)qνq(y)−1 rνr(y) S X S Y X q Y q∈S r∈S\{q} q∈S r∈S\{q} = ν (y)qνq(y)−1 rνr(y)−1r = ν (y) rνr(y)−1 r , X q Y X q Y Y q∈S r∈S\{q} q∈S r∈S r∈S\{q} verifying the claim.
Recommended publications
  • Subderivative-Subdifferential Duality Formula
    Subderivative-subdifferential duality formula Marc Lassonde Universit´edes Antilles, BP 150, 97159 Pointe `aPitre, France; and LIMOS, Universit´eBlaise Pascal, 63000 Clermont-Ferrand, France E-mail: [email protected] Abstract. We provide a formula linking the radial subderivative to other subderivatives and subdifferentials for arbitrary extended real-valued lower semicontinuous functions. Keywords: lower semicontinuity, radial subderivative, Dini subderivative, subdifferential. 2010 Mathematics Subject Classification: 49J52, 49K27, 26D10, 26B25. 1 Introduction Tyrrell Rockafellar and Roger Wets [13, p. 298] discussing the duality between subderivatives and subdifferentials write In the presence of regularity, the subgradients and subderivatives of a function f are completely dual to each other. [. ] For functions f that aren’t subdifferentially regular, subderivatives and subgradients can have distinct and independent roles, and some of the duality must be relinquished. Jean-Paul Penot [12, p. 263], in the introduction to the chapter dealing with elementary and viscosity subdifferentials, writes In the present framework, in contrast to the convex objects, the passages from directional derivatives (and tangent cones) to subdifferentials (and normal cones, respectively) are one-way routes, because the first notions are nonconvex, while a dual object exhibits convexity properties. In the chapter concerning Clarke subdifferentials [12, p. 357], he notes In fact, in this theory, a complete primal-dual picture is available: besides a normal cone concept, one has a notion of tangent cone to a set, and besides a subdifferential for a function one has a notion of directional derivative. Moreover, inherent convexity properties ensure a full duality between these notions. [. ]. These facts represent great arXiv:1611.04045v2 [math.OC] 5 Mar 2017 theoretical and practical advantages.
    [Show full text]
  • Techniques of Variational Analysis
    J. M. Borwein and Q. J. Zhu Techniques of Variational Analysis An Introduction October 8, 2004 Springer Berlin Heidelberg NewYork Hong Kong London Milan Paris Tokyo To Tova, Naomi, Rachel and Judith. To Charles and Lilly. And in fond and respectful memory of Simon Fitzpatrick (1953-2004). Preface Variational arguments are classical techniques whose use can be traced back to the early development of the calculus of variations and further. Rooted in the physical principle of least action they have wide applications in diverse ¯elds. The discovery of modern variational principles and nonsmooth analysis further expand the range of applications of these techniques. The motivation to write this book came from a desire to share our pleasure in applying such variational techniques and promoting these powerful tools. Potential readers of this book will be researchers and graduate students who might bene¯t from using variational methods. The only broad prerequisite we anticipate is a working knowledge of un- dergraduate analysis and of the basic principles of functional analysis (e.g., those encountered in a typical introductory functional analysis course). We hope to attract researchers from diverse areas { who may fruitfully use varia- tional techniques { by providing them with a relatively systematical account of the principles of variational analysis. We also hope to give further insight to graduate students whose research already concentrates on variational analysis. Keeping these two di®erent reader groups in mind we arrange the material into relatively independent blocks. We discuss various forms of variational princi- ples early in Chapter 2. We then discuss applications of variational techniques in di®erent areas in Chapters 3{7.
    [Show full text]
  • Proximal Analysis and Boundaries of Closed Sets in Banach Space Part Ii: Applications
    Can. J. Math., Vol. XXXIX, No. 2, 1987, pp. 428-472 PROXIMAL ANALYSIS AND BOUNDARIES OF CLOSED SETS IN BANACH SPACE PART II: APPLICATIONS J. M. BORWEIN AND H. M. STROJWAS Introduction. This paper is a direct continuation of the article "Proximal analysis and boundaries of closed sets in Banach space, Part I: Theory", by the same authors. It is devoted to a detailed analysis of applications of the theory presented in the first part and of its limitations. 5. Applications in geometry of normed spaces. Theorem 2.1 has important consequences for geometry of Banach spaces. We start the presentation with a discussion of density and existence of improper points (Definition 1.3) for closed sets in Banach spaces. Our considerations will be based on the "lim inf ' inclusions proven in the first part of our paper. Ti IEOREM 5.1. If C is a closed subset of a Banach space E, then the K-proper points of C are dense in the boundary of C. Proof If x is in the boundary of C, for each r > 0 we may find y £ C with \\y — 3c|| < r. Theorem 2.1 now shows that Kc(xr) ^ E for some xr e C with II* - *,ll ^ 2r. COROLLARY 5.1. ( [2] ) If C is a closed convex subset of a Banach space E, then the support points of C are dense in the boundary of C. Proof. Since for any convex set C and x G C we have Tc(x) = Kc(x) = Pc(x) = P(C - x) = WTc(x) = WKc(x) = WPc(x\ the /^-proper points of C at x, where Rc(x) is any one of the above cones, are exactly the support points.
    [Show full text]
  • Variational Properties of Polynomial Root Functions and Spectral Max Functions
    Julia Eaton Workshop on Numerical Linear Algebra and Optimization [email protected] University of British Columbia University of Washington Tacoma Vancouver, BC Canada Variational properties of polynomial root functions and spectral max functions Abstract 1 Spectral Functions 4 Damped oscillator abscissa (see panel 3) 9 Applications 14 n×n Eigenvalue optimization problems arise in the control of continuous We say :C !R=R [ f+1g is a spectral function if it • Smooth everywhere but -2 and 2. Recall damped oscillator problem in panel 3. The characteristic and discrete time dynamical systems. The spectral abscissa (largest • depends only on the eigenvalues of its argument • Non-Lipschitz at -2 and 2 polynomial of A(x) is: p(λ, x) = λ(λ+x)+1 = λ2+xλ+1: real part of an eigenvalue) and spectral radius (largest eigenvalue • is invariant under permutations of those eigenvalues • Non-convex and abscissa graph is shown in panel 3. in modulus) are examples of functions of eigenvalues, or spectral • @h^ (2) = [−1=2; 1) = @h(2) A spectral max function ': n×n ! is a spectral fcn. of the form Consider the parameterization H : !P2, where functions, connected to these problems. A related class of functions C R • subdifferentially regular at 2 R '(A) = maxff (λ) j det(λI −A) = 0g H(x) = (λ−λ )2+h (x)(λ−λ )+h (x); λ 2 , h ; h : ! . are polynomial root functions. In 2001, Burke and Overton showed 0 1 0 2 0 C 1 2 R R where f : C ! R. The spectral abscissa and radius are spectral max 2 2 that the abscissa mapping on polynomials is subdifferentially Variational properties of spectral functions 10 Require p(λ, x) = λ +xλ+1 = (λ−λ0) +h1(x)(λ−λ0)+h2(x) so functions.
    [Show full text]
  • Convexity: an Introduction
    Convexity: an introduction Convexity: an introduction Geir Dahl CMA, Dept. of Mathematics and Dept. of Informatics University of Oslo 1 / 74 Convexity: an introduction 1. Introduction 1. Introduction what is convexity where does it arise main concepts and results Literature: Rockafellar: Convex analysis, 1970. Webster: Convexity, 1994. Gr¨unbaum: Convex polytopes, 1967. Ziegler: Lectures on polytopes, 1994. Hiriart-Urruty and Lemar´echal: Convex analysis and minimization algorithms, 1993. Boyd and Vandenberghe: Convex optimization, 2004. 2 / 74 Convexity: an introduction 1. Introduction roughly: a convex set in IR2 (or IRn) is a set \with no holes". more accurately, a convex set C has the following property: whenever we choose two points in the set, say x; y 2 C, then all points in the line segment between x and y also lie in C. a sphere (ball), an ellipsoid, a point, a line, a line segment, a rectangle, a triangle, halfplane, the plane itself the union of two disjoint (closed) triangles is nonconvex. 3 / 74 Convexity: an introduction 1. Introduction Why are convex sets important? Optimization: mathematical foundation for optimization feasible set, optimal set, .... objective function, constraints, value function closely related to the numerical solvability of an optimization problem Statistics: statistics: both in theory and applications estimation: \estimate" the value of one or more unknown parameters in a stochastic model. To measure quality of a solution one uses a \loss function" and, quite often, this loss function is convex. statistical decision theory: the concept of risk sets is central; they are convex sets, so-called polytopes. 4 / 74 Convexity: an introduction 1.
    [Show full text]
  • Arxiv:1703.03069V1 [Math.OC]
    Upper semismooth functions and the subdifferential determination property March 10, 2017 Marc Lassonde Universit´edes Antilles, BP 150, 97159 Pointe `aPitre, France; and LIMOS, Universit´eBlaise Pascal, 63000 Clermont-Ferrand, France E-mail: [email protected] Dedicated to the memory of Jon Borwein. Abstract. In this paper, an upper semismooth function is defined to be a lower semicon- tinuous function whose radial subderivative satisfies a mild directional upper semicontinuity property. Examples of upper semismooth functions are the proper lower semicontinuous convex functions, the lower-C1 functions, the regular directionally Lipschitz functions, the Mifflin semismooth functions, the Thibault-Zagrodny directionally stable functions. It is shown that the radial subderivative of such functions can be recovered from any subdiffer- ential of the function. It is also shown that these functions are subdifferentially determined, in the sense that if two functions have the same subdifferential and one of the functions is upper semismooth, then the two functions are equal up to an additive constant. Keywords: upper semismooth, Dini subderivative, radial subderivative, subdifferential, sub- differential determination property, approximately convex function, regular function. 2010 Mathematics Subject Classification: 49J52, 49K27, 26D10, 26B25. 1 Introduction Jon Borwein discussing generalisations in the area of nonsmooth optimisation [1, p. 4] writes: In his thesis Francis Clarke extended Moreau-Rockafellar max formula to all locally Lip- schitz functions. Clarke replaced the right Dini directional derivative Dhf(x) by c f(y + th) f(y) Dhf(x) = lim sup − . 0<t→0,y→x t arXiv:1703.03069v1 [math.OC] 8 Mar 2017 c Somewhat miraculously the mapping p sending h Dhf(x) is always continuous and C → ∗ c sublinear in h, and so if we define ∂ f(x)= ∂p(0) = y X : y,h Dhf(x), h X , Moreau-Rockafellar max formula leads directly to: { ∈ h i≤ ∀ ∈ } Theorem (Clarke).
    [Show full text]
  • For Lower Semicontinuous Functions
    TRANSACTIONSof the AMERICAN MATHEMATICALSOCIETY Volume 347, Number 10, October 1995 MEAN VALUE PROPERTY AND SUBDIFFERENTIAL CRITERIA FOR LOWER SEMICONTINUOUS FUNCTIONS DIDIER AUSSEL, JEAN-NOËL CORVELLEC, AND MARC LASSONDE Abstract. We define an abstract notion of subdifferential operator and an as- sociated notion of smoothness of a norm covering all the standard situations. In particular, a norm is smooth for the Gâteaux (Fréchet, Hadamard, Lipschitz- smooth) subdifferential if it is Gâteaux (Fréchet, Hadamard, Lipschitz) smooth in the classical sense, while on the other hand any norm is smooth for the Clarke-Rockafellar subdifferential. We then show that lower semicontinüous functions on a Banach space satisfy an Approximate Mean Value Inequality with respect to any subdifferential for which the norm is smooth, thus pro- viding a new insight on the connection between the smoothness of norms and the subdifferentiability properties of functions. The proof relies on an adapta- tion of the "smooth" variational principle of Bonvein-Preiss. Along the same vein, we derive subdifferential criteria for coercivity, Lipschitz behavior, cone- monotonicity, quasiconvexity, and convexity of lower semicontinüous functions which clarify, unify and extend many existing results for specific subdifferen- tials. 1. Introduction Let X be a Banach space. Since the pioneering observation of Asplund [ 1] that there is a close connection between Gâteaux-differentiability of the norm of X and Gâteaux-differentiability of convex continuous functions on dense
    [Show full text]
  • Twice Epi-Differentiability of Extended-Real-Valued Functions with Applications in Composite Optimization
    TWICE EPI-DIFFERENTIABILITY OF EXTENDED-REAL-VALUED FUNCTIONS WITH APPLICATIONS IN COMPOSITE OPTIMIZATION ASHKAN MOHAMMADI1 and M. EBRAHIM SARABI2 Abstract. The paper is devoted to the study of the twice epi-differentiablity of extended-real-valued functions, with an emphasis on functions satisfying a certain composite representation. This will be conducted under parabolic regularity, a second-order regularity condition that was recently utilized in [14] for second-order variational analysis of constraint systems. Besides justifying the twice epi-differentiablity of composite functions, we obtain precise formulas for their second subderivatives under the metric subregularity constraint qualification. The latter allows us to derive second-order optimality conditions for a large class of composite optimization problems. Key words. Variational analysis, twice epi-differentiability, parabolic regularity, composite optimization, second-order optimality conditions Mathematics Subject Classification (2000) 49J53, 49J52, 90C31 1 Introduction This paper aims to provide a systematic study of the twice epi-differentiability of extend-real- valued functions in finite dimensional spaces. In particular, we pay special attention to the composite optimization problem minimize ϕ(x)+ g(F (x)) over all x ∈ X, (1.1) where ϕ : X → IR and F : X → Y are twice differentiable and g : Y → IR := (−∞, +∞] is a lower semicontinuous (l.s.c.) convex function and where X and Y are two finite dimen- sional spaces, and verify the twice epi-differentiability of the objective function in (1.1) under verifiable assumptions. The composite optimization problem (1.1) encompasses major classes of constrained and composite optimization problems including classical nonlinear programming problems, second-order cone and semidefinite programming problems, eigenvalue optimizations problems [24], and fully amenable composite optimization problems [19], see Example 4.7 for arXiv:1911.05236v2 [math.OC] 13 Apr 2020 more detail.
    [Show full text]
  • A Course in Machine Learning
    A Course in Machine Learning Hal Daumé III Draft: Copyright © 2012 Hal Daumé III http://ciml.info Do Not This book is for the use of anyone anywhere at no cost and with almost no re- strictions whatsoever. You may copy it or re-use it under the terms of the CIML License online at ciml.info/LICENSE. You may not redistribute it yourself, but are encouraged to provide a link to the CIML web page for others to download for free. You may not charge a fee for printed versions, though you can print it for your own use. Distribute version 0.8 , August 2012 6|LinearModels The essence of mathematics is not to make simple things Learning Objectives: complicated, but to make complicated things simple. -- Stan- • Define and plot four surrogate loss functions: squared loss, logistic loss, ley Gudder exponential loss and hinge loss. • Compare and contrast the optimiza- tion of 0/1 loss and surrogate loss functions. In Chapter ??, you learned about the perceptron algorithm • Solve the optimization problem for linear classification. This was both a model (linear classifier) and for squared loss with a quadratic algorithm (the perceptron update rule) in one. In this section, we regularizer in closed form. will separate these two, and consider general ways for optimizing • Implement and debug gradient descent and subgradient descent. linear models. This will lead us into some aspects of optimization (aka mathematical programming), but not very far. At the end of this chapter, there are pointers to more literature on optimization for those who are interested. The basic idea of the perceptron is to run a particular algorithm until a linear separator is found.
    [Show full text]
  • CONVEX ANALYSIS and NONLINEAR OPTIMIZATION Theory and Examples
    CONVEX ANALYSIS AND NONLINEAR OPTIMIZATION Theory and Examples JONATHAN M. BORWEIN Centre for Experimental and Constructive Mathematics Department of Mathematics and Statistics Simon Fraser University, Burnaby, B.C., Canada V5A 1S6 [email protected] http://www.cecm.sfu.ca/ jborwein ∼ and ADRIAN S. LEWIS Department of Combinatorics and Optimization University of Waterloo, Waterloo, Ont., Canada N2L 3G1 [email protected] http://orion.uwaterloo.ca/ aslewis ∼ To our families 2 Contents 0.1 Preface . 5 1 Background 7 1.1 Euclidean spaces . 7 1.2 Symmetric matrices . 16 2 Inequality constraints 22 2.1 Optimality conditions . 22 2.2 Theorems of the alternative . 30 2.3 Max-functions and first order conditions . 36 3 Fenchel duality 42 3.1 Subgradients and convex functions . 42 3.2 The value function . 54 3.3 The Fenchel conjugate . 61 4 Convex analysis 78 4.1 Continuity of convex functions . 78 4.2 Fenchel biconjugation . 90 4.3 Lagrangian duality . 103 5 Special cases 113 5.1 Polyhedral convex sets and functions . 113 5.2 Functions of eigenvalues . 120 5.3 Duality for linear and semidefinite programming . 126 5.4 Convex process duality . 132 6 Nonsmooth optimization 143 6.1 Generalized derivatives . 143 3 6.2 Nonsmooth regularity and strict differentiability . 151 6.3 Tangent cones . 158 6.4 The limiting subdifferential . 167 7 The Karush-Kuhn-Tucker theorem 176 7.1 An introduction to metric regularity . 176 7.2 The Karush-Kuhn-Tucker theorem . 184 7.3 Metric regularity and the limiting subdifferential . 191 7.4 Second order conditions . 197 8 Fixed points 204 8.1 Brouwer's fixed point theorem .
    [Show full text]
  • A Generalized Variational Principle
    Canad. J. Math. Vol. 53 (6), 2001 pp. 1174–1193 A Generalized Variational Principle Philip D. Loewen and Xianfu Wang Abstract. We prove a strong variant of the Borwein-Preiss variational principle, and show that on As- plund spaces, Stegall’s variational principle follows from it via a generalized Smulyan test. Applications are discussed. 1 Introduction The Borwein-Preiss smooth variational principle [1] is an important tool in infinite dimensional nonsmooth analysis. Its statement, given a Banach space X,readsas follows. Theorem 1.1 Given f : X → (−∞, +∞] lower semicontinuous, x0 ∈ X, ε>0, λ>0,andp≥ 1, suppose f (x0) <ε+inff . X µ ≥ ∞ µ = Then there exist a sequence n 0,with n=1 n 1, and a point v in X, expressible as the (norm-) limit of some sequence (vn),suchthat ε ε (1.1) f (x)+ (x) ≥ f (v)+ (v) ∀x ∈ X, λp p λp p = ∞ µ − p − <λ ≤ ε where p(x): n=1 n x vn .Moreover, x0 v ,andf(v) +infX f. If, in addition, X has a β-smooth norm and p > 1,then∂β f (v) ∩ (ε/λ)pB∗ = ∅. In a subsequent development [4], Deville, Godefroy and Zizler improved Theo- rem 1.1 by considering a certain Banach space Y of bounded continuous functions g : X → R, and showing that the following subset of Y is residual: {g ∈ Y : f + g attains a strong minimum somewhere in X}. (See Definition 2.1.) However, their result gives no information about the location of the strong minimizer, and offers no way to identify explicitly a perturbation g with the desired property.
    [Show full text]
  • A Smooth Variational Principle with Applications to Subdifferentiability and to Differentiability of Convex Functions
    transactions of the american mathematical society Volume 303, Number 2, October 1987 A SMOOTH VARIATIONAL PRINCIPLE WITH APPLICATIONS TO SUBDIFFERENTIABILITY AND TO DIFFERENTIABILITY OF CONVEX FUNCTIONS J. M. BORWEIN AND D. PREISS ABSTRACT. We show that, typically, lower semicontinuous functions on a Ba- nach space densely inherit lower subderivatives of the same degree of smooth- ness as the norm. In particular every continuous convex function on a space with a Gâteaux (weak Hadamard, Fréchet) smooth renorm is densely Gâteaux (weak Hadamard, Fréchet) differentiable. Our technique relies on a more pow- erful analogue of Ekeland's variational principle in which the function is per- turbed by a quadratic-like function. This "smooth" variational principle has very broad applicability in problems of nonsmooth analysis. 1. Introduction. Ekeland's variational principle [11-13] has proved, along with its variants, to be a potent and flexible tool in analysis and in optimization theory [2, 7, 12, 13, 17]. One notable limitation on its application is that even when the original function is differentiable the perturbed function is not. A reasonable smooth variant has long been sought. In §2 we provide such a "smooth" variational principle. The geometric idea behind this proof is as follows. Given a fixed penalty function one cannot, in gen- eral, use penalization techniques to densely obtain minima for lower semicontinuous functions on a given Banach space (but see Theorem 5.2). One can however adap- tively adjust the penalty as one moves around the epigraph of the function, and the final cummulative penalty can be well enough controlled so as to inherit the differentiability properties of the underlying norm on the space.
    [Show full text]