4. Constrained Minimization

Total Page:16

File Type:pdf, Size:1020Kb

4. Constrained Minimization 4. CONSTRAINED MINIMIZATION n The minimization of a function f0 over a set C ⊂ IR can be much harder than n the minimization of f0 over all of IR , and it raises a host of issues. Some of these concern the ways a numerical method might be able to maneuver around in C or near to it, while others come up in characterizing optimality itself. Feasible directions: A vector w 6= 0 is said to give a feasible direction into C at a point x ∈ C if there’s an ε > 0 such that the line segment x + τw 0 ≤ τ ≤ ε lies in C. Descent methods with constraints: The general class of descent methods described for unconstrained optimization could be adapted to the minimization of f0 over C if the nature of C is such that feasible directions can readily be found. The rough idea is this. Starting from a point x0 ∈ C, a sequence of points is generated by ν ν the scheme that, when at x ∈ C, a descent vector w is chosen for f0 which at the same time gives a feasible direction into C at xν . (If no such wν exists, the method terminates.) Next, some sort of line search is executed to produce ν ν ν ν ν ν ν ν a value τ > 0 such that both x + τ w ∈ C and f0(x + τ w ) < f0(x ). The next point is taken then to be xν+1 := xν + τ ν wν . In particular, one can ν ν ν imagine choosing τ to be a value that minimizes ϕ(τ) = f0(x + τw ) over the ν ν set τ ≥ 0 x + τw ∈ C ; this would be the analogue of exact line search. Pitfalls: Many troubles can plague this scheme, unless the situation is safeguarded by rather special features. Finding a descent vector wν that gives a feasible direction may be no easy matter, and even if one is found, there may be difficulties in using it effectively because of the need to keep within the confines of C. A phenomenon called jamming is possible, where progress is stymied by frequent collisions with the boundary of C and the method “gets stuck in a corner” of C or makes too many little zigzagging steps. Lack of feasible directions at all: Of course, this kind of approach doesn’t make much sense unless one is content to regard, as a “quasi-solution” to the prob- lem, any point x ∈ C at which there is no descent vector for f0 giving a feasible direction into C. That may be acceptable for some sets C such as boxes, but not for sets C in which curvature dominates. For example, if C is a curved surface there may be no point x ∈ C at which there’s a feasible direction into C, because feasible directions refer only to movement along straight line segments embedded in C. Then there would be no point of C from which progress could be made by a modified descent method in minimizing f0. 49 Variational geometry: Whether or not some form of descent method might be made to work, it’s essential to have a solid grip on the geometry of the feasible set C. Classical geometry doesn’t meet the needs, so new concepts have to be brought in. To get rolling, we need a notion of tangency which includes the tangent spaces long associated with “linearization” of curved structures, but at the same time covers vectors giving feasible directions. Tangent vectors: For a closed set C ⊂ IRn and a pointx ¯ ∈ C, a vector w is said to be tangent to C atx ¯ if there is a sequence of vectors xν → x¯ in C along with scalars τ ν & 0 such that the vectors wν = (xν − x¯)/τ ν converge to w. Interpretation: The tangent vectors w to C atx ¯, apart from w = 0 (which corresponds in the definition to xν ≡ x¯, wν ≡ 0), are the vectors pointing in a possibly asymptotic direction from which a sequence of points xν ∈ C can converge tox ¯. The direction of xν as seen fromx ¯, which is the direction of wν , is not necessarily that of w, but gets closer and closer to it as ν → ∞. Relation to feasible directions: Every vector w 6= 0 giving a feasible direction into C atx ¯ is a tangent vector to C atx ¯. Indeed, for such a vector w one can take xν =x ¯ + τ ν w for any sequence of values τ ν & 0 sufficiently small and have for wν = (xν − x¯)/τ ν that wν ≡ w, hence trivially wν → w. Note from this that tangent vectors, in the sense defined here, can well point right into the interior of C, if that is nonempty; they don’t have to lie along the boundary of C. Relation to classical tangent spaces: When C is a “nice two-dimensional surface” in IR3, the tangent vectors w to C at a pointx ¯ form a two-dimensional linear subspace of IR3, which gives the usual tangent space to C atx ¯. When C is a “nice one-dimensional curve,” a one-dimensional linear subspace is obtained instead. Generalization can be made to tangent spaces to “nice curvilinear manifolds” of various dimensions d in IRn, with 0 < d < n. These comments are offered here on a heuristic, motivational level, but they will latter be supplied with rigor once such manifolds are identified with sets defined by equality constraints satisfying a constraint qualification that ensures a robust representation. Tangent cone at a point: In general, the set of all vectors w that are tangent to C at a pointx ¯ ∈ C is called the tangent cone to C atx ¯ and is denoted by TC (¯x). Basic properties: Always, the set TC (¯x) contains the vector 0. Further, for every vector w ∈ TC (¯x) and every scalar λ > 0, the vector λw is again in TC (¯x) (because of the arbitrary size of the scaling factors in the definition). But in 50 many situations one can have w ∈ TC (¯x), yet −w∈ / TC (¯x). In particular, TC (¯x) can well be something other than a linear subspace of IRn. Cones: A subset of IRn is called a cone if it contains the zero vector and contains with each of its vectors all positive multiples of that vector. Geometrically, this means that a cone, unless it consists of 0 alone, is a “bundle of rays.” ν Limits of tangent vectors: The limit of any sequence of vectors w ∈ TC (¯x) is another vector w ∈ TC (¯x). In other words, TC (¯x) is always a closed set. (This can readily be gleaned from the definition of TC (¯x) and the general properties of convergent sequences.) In particular, if the vectors wν give feasible directions to C atx ¯ and ν w → w, then w ∈ TC (¯x). n n Tangents in some extreme cases: If C = IR , then TC (¯x) = IR . Indeed, any time n x¯ lies in the interior of C, one has TC (¯x) = IR . If C is a one-element set {a} and x¯ = a, then TC (¯x) = {0}. More generally, the latter holds wheneverx ¯ is an isolated point of C in the sense that there is no sequence of points xν → x¯ in C apart from the constant sequence xν ≡ x¯. n Tangents to boxes: If X = I1 × · · · × In in IR with Ij a closed interval in IR, then at any pointx ¯ = (¯x1,..., x¯n) ∈ X (the componentx ¯j lying in Ij) one has TX (¯x) = TI1 (¯x1) × · · · × TIn (¯xn), where [0, ∞) ifx ¯j is the left endpoint (only) of Ij, (−∞, 0] ifx ¯j is the right endpoint (only) of Ij, TIj (¯xj) = (−∞, ∞) ifx ¯j lies in the interior of Ij, [0, 0] if Ij is a one-point interval, consisting just ofx ¯j. In other words, the condition w ∈ TX (¯x) for w = (w1, . , wn) amounts to restricting wj to lie in a one of the intervals (−∞, 0], [0, ∞), (−∞, ∞), or [0, 0] (the one-point interval consisting just of 0). The particular interval for an index j depends on the location ofx ¯j relative to the endpoints of Ij. n Tangents to the nonnegative orthant: The set X = IR+ is a box, the product of the intervals Ij = [0, ∞). In this case one has w ∈ TX (¯x) if and only if wj ≥ 0 for all indices j withx ¯j = 0. (For indices j withx ¯j > 0, wj can be any real number.) Feasible directions in a box: When X is a box, not only does every vector w 6= 0 giving a feasible direction in X atx ¯ belong to TX (¯x), but conversely, every vector w 6= 0 in TX (¯x) gives a feasible direction in X atx ¯. This important property generalizes as follows to any set specified by finitely many linear constraints. 51 Tangents to polyhedral sets: For a polyhedral set C, like a box, simple formulas describe all the tangent cones. As a specific case ripe for many applications, suppose ai·x ≤ bi for i ∈ [1, s], x ∈ C ⇐⇒ ai·x = bi for i ∈ [s + 1, m], x ∈ X, with X a box. Letx ¯ ∈ C and denote by I(¯x) the set of i ∈ [1, s] for which actually ai·x = bi. Then ai·w ≤ 0 for i ∈ I(¯x), w ∈ TC (¯x) ⇐⇒ ai·w = 0 for i ∈ [s + 1, m], w ∈ TX (¯x). Argument: Let K denote the set of vectors w described by the final conditions (on the right). The question is whether K = TC (¯x). For a vector w and scalar τ > 0, one hasx ¯ + τw ∈ C if and only ifx ¯ + τw ∈ X and ai·w ≤ [bi − ai·x¯]/τ for i ∈ [1, s], whereas ai·w = 0 for i ∈ [s + 1, m].
Recommended publications
  • Convergence Results for Projected Line-Search Methods on Varieties of Low-Rank Matrices Via Lojasiewicz Inequality∗
    SIAM J. OPTIM. c 2015 Society for Industrial and Applied Mathematics Vol. 25, No. 1, pp. 622–646 CONVERGENCE RESULTS FOR PROJECTED LINE-SEARCH METHODS ON VARIETIES OF LOW-RANK MATRICES VIA LOJASIEWICZ INEQUALITY∗ REINHOLD SCHNEIDER† AND ANDRE´ USCHMAJEW‡ Abstract. The aim of this paper is to derive convergence results for projected line-search meth- ods on the real-algebraic variety M≤k of real m×n matrices of rank at most k. Such methods extend Riemannian optimization methods, which are successfully used on the smooth manifold Mk of rank- k matrices, to its closure by taking steps along gradient-related directions in the tangent cone, and afterwards projecting back to M≤k. Considering such a method circumvents the difficulties which arise from the nonclosedness and the unbounded curvature of Mk. The pointwise convergence is obtained for real-analytic functions on the basis of aLojasiewicz inequality for the projection of the antigradient to the tangent cone. If the derived limit point lies on the smooth part of M≤k, i.e., in Mk, this boils down to more or less known results, but with the benefit that asymptotic convergence rate estimates (for specific step-sizes) can be obtained without an a priori curvature bound, simply from the fact that the limit lies on a smooth manifold. At the same time, one can give a convincing justification for assuming critical points to lie in Mk:ifX is a critical point of f on M≤k,then either X has rank k,or∇f(X)=0. Key words. convergence analysis, line-search methods, low-rank matrices, Riemannian opti- mization, steepest descent,Lojasiewicz gradient inequality, tangent cones AMS subject classifications.
    [Show full text]
  • On Tangent Cones in Wasserstein Space
    PROCEEDINGS OF THE AMERICAN MATHEMATICAL SOCIETY Volume 145, Number 7, July 2017, Pages 3127–3136 http://dx.doi.org/10.1090/proc/13415 Article electronically published on December 8, 2016 ON TANGENT CONES IN WASSERSTEIN SPACE JOHN LOTT (Communicated by Guofang Wei) Abstract. If M is a smooth compact Riemannian manifold, let P (M)denote the Wasserstein space of probability measures on M.IfS is an embedded submanifold of M,andμ is an absolutely continuous measure on S,thenwe compute the tangent cone of P (M)atμ. 1. Introduction In optimal transport theory, a displacement interpolation is a one-parameter family of measures that represents the most efficient way of displacing mass between two given probability measures. Finding a displacement interpolation between two probability measures is the same as finding a minimizing geodesic in the space of probability measures, equipped with the Wasserstein metric W2 [9, Proposition 2.10]. For background on optimal transport and Wasserstein space, we refer to Villani’s book [14]. If M is a compact connected Riemannian manifold with nonnegative sectional curvature, then P (M) is a compact length space with nonnegative curvature in the sense of Alexandrov [9, Theorem A.8], [13, Proposition 2.10]. Hence one can define the tangent cone TμP (M)ofP (M)atameasureμ ∈ P (M). If μ is absolutely continuous with respect to the volume form dvolM ,thenTμP (M)isaHilbert space [9, Proposition A.33]. More generally, one can define tangent cones of P (M) without any curvature assumption on M, using Ohta’s 2-uniform structure on P (M) [11].
    [Show full text]
  • Convergence of Complete Ricci-Flat Manifolds Jiewon Park
    Convergence of Complete Ricci-flat Manifolds by Jiewon Park Submitted to the Department of Mathematics in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Mathematics at the MASSACHUSETTS INSTITUTE OF TECHNOLOGY May 2020 © Massachusetts Institute of Technology 2020. All rights reserved. Author . Department of Mathematics April 17, 2020 Certified by. Tobias Holck Colding Cecil and Ida Green Distinguished Professor Thesis Supervisor Accepted by . Wei Zhang Chairman, Department Committee on Graduate Theses 2 Convergence of Complete Ricci-flat Manifolds by Jiewon Park Submitted to the Department of Mathematics on April 17, 2020, in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Mathematics Abstract This thesis is focused on the convergence at infinity of complete Ricci flat manifolds. In the first part of this thesis, we will give a natural way to identify between two scales, potentially arbitrarily far apart, in the case when a tangent cone at infinity has smooth cross section. The identification map is given as the gradient flow of a solution to an elliptic equation. We use an estimate of Colding-Minicozzi of a functional that measures the distance to the tangent cone. In the second part of this thesis, we prove a matrix Harnack inequality for the Laplace equation on manifolds with suitable curvature and volume growth assumptions, which is a pointwise estimate for the integrand of the aforementioned functional. This result provides an elliptic analogue of matrix Harnack inequalities for the heat equation or geometric flows. Thesis Supervisor: Tobias Holck Colding Title: Cecil and Ida Green Distinguished Professor 3 4 Acknowledgments First and foremost I would like to thank my advisor Tobias Colding for his continuous guidance and encouragement, and for suggesting problems to work on.
    [Show full text]
  • Tangent Cones, Tangent Spaces, Tangent Stars. Secant, Tangent, Tangent Star and Dual Varieties of an Algebraic Variety
    Chapter 1 Tangent cones, tangent spaces, tangent stars. Secant, tangent, tangent star and dual varieties of an algebraic variety 1.1 Tangent cones and tangent spaces of an algebraic variety and their associated varieties Let X be an algebraic variety, or more generally a scheme of finite type, over a fixed algebraically closed field K. Let x X be a closed point. We briefly recall the 2 definitions of tangent cone to X at x and of tangent space to X at x. For more details one can consult [150] or [189]. Definition 1.1.1. (Tangent cone at a point). Let U X be an open affine neighbor- ⇢ hood of x, let i : U AN be a closed immersion and let U be defined by the ideal ! N I K[X1,...,XN]. There is no loss of generality in supposing i(x)=(0,...,0) A . ⇢ 2 Given f K[X ,...,X ] with f (0,...,0)=0, we can define the leading form (or ini- 2 1 N tial form/term) f in of f as the non-zero homogeneous polynomial of lowest degree in its expression as a sum of homogenous polynomials in the variables Xi’s. Let Iin = the ideal generated by the leading form (or initial term) f in of all f I . { 2 } Then K[X ,...,X ] C X := Spec( 1 N ), (1.1) x Iin is called the affine tangent cone to X at x. The previous definition does not depend on the choice of U X or on the choice N ⇢ of i : U A .
    [Show full text]
  • Tangent Cones and Normal Cones in RCDD
    Tangent Cones and Normal Cones in RCDD Charles J. Geyer February 27, 2008 1 Introduction The tangent cone of a convex set C at a point x ∈ C is given by TC (x) = cl{ s(y − x): y ∈ C and s ≥ 0 } (Rockafellar and Wets, 2004, Theorem 6.9). Note that the closure is nec- essary (consider a closed ball). It is not necessary to consider arbitrary se- quences; as always in convex analysis it is enough to consider line segments. The normal cone of a convex set C at a point x ∈ C is given by ∗ NC (x) = TC (x) = { w : hw, y − xi ≤ 0, y ∈ C } (Rockafellar and Wets, 2004, Theorem 6.9). Specializing these to the polyhedral convex set C = { x : hai, xi ≤ bi, i ∈ I, and hai, xi = bi, i ∈ E } we have TC (x) = { x : hai, xi ≤ 0, i ∈ A, and hai, xi = 0, i ∈ E } where A = { i ∈ I : hai, xi = bi } is the active set of inequality constraints (in essence, we drop the inactive constraints and make x the new origin). We also have NC (x) = pos { ai : i ∈ A ∪ E } ∪ { −ai : i ∈ E } In the language of the rcdd package, we have an H-representation for the tangent cone and a V-representation of the normal cone. 1 2 Example: Cube Let us consider a very simple example, the unit cube in three dimensions. Let us start with a V-representation for it. > v <- rbind(c(0, 0, 0), c(0, 0, 1), c(0, 1, 0), c(0, + 1, 1), c(1, 0, 0), c(1, 0, 1), c(1, 1, 0), c(1, + 1, 1)) > m <- cbind(0, 1, v) > m [,1] [,2] [,3] [,4] [,5] [1,] 0 1 0 0 0 [2,] 0 1 0 0 1 [3,] 0 1 0 1 0 [4,] 0 1 0 1 1 [5,] 0 1 1 0 0 [6,] 0 1 1 0 1 [7,] 0 1 1 1 0 [8,] 0 1 1 1 1 Then we convert to the H-representation.
    [Show full text]
  • Tangent Spaces to Metric Spaces: Overview and Motivations
    Tangent Spaces to Metric Spaces: Overview and Motivations Marc Troyanov - EPFL, April 9, 2003 Abstract. This is the text of an introductory lecture I seen important developpments the last decade. This made at the S´eminaire Borel 2003 (III`eme Cycle Romand new analysis will obviously play an important role in de Math´ematiques). The general theme of that seminaire the study of tangent spaces to metric spaces. was Tangent Spaces of Metric Spaces. There are several motivations, even for a differential The goal of the Borel 03 seminar is to study the notion geometer, to study the field of metric geometry. of tangent spaces in various geometric contexts. This notion has its origin in the early developements of dif- The first motivation is that several important subjects ferential geometry and plays a central role in the whole such as Riemannian geometry, Sub-riemannian geome- theory. In fact, we can define, in a modern language, try or Finsler geometry belong to the intersection of dif- differential geometry as the study of smooth tensors liv- ferential geometry and metric geometry; and the metric ing on smooth manifolds. viewpoint can sometimes bring unification and a new insight in these subjects. Recall that, by definition, a manifold M of dimension n is a space which locally looks like Euclidean space Rn A second motivation is that various type of non Rie- in a small neighborhood of any of its point p ∈ M.A mannian metric spaces naturally appear as “boundary blowing up of M near the point p provides a better and at infinity” or as “limits” (in a sense to be made pre- better approximation of the geometry of M near p with cise) of Riemannian manifolds.
    [Show full text]
  • Local Polar Varieties in the Geometric Study of Singularities Arturo Giles Flores, Bernard Teissier
    Local polar varieties in the geometric study of singularities Arturo Giles Flores, Bernard Teissier To cite this version: Arturo Giles Flores, Bernard Teissier. Local polar varieties in the geometric study of singularities. 2016. hal-01348838v3 HAL Id: hal-01348838 https://hal.archives-ouvertes.fr/hal-01348838v3 Preprint submitted on 10 Dec 2017 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Distributed under a Creative Commons Attribution - NoDerivatives| 4.0 International License Local polar varieties in the geometric study of singularities Arturo Giles Flores and Bernard Teissier December 11, 2017 This paper is dedicated to those administrators of research who realize how much damage is done by the evaluation of mathematical research solely by the rankings of the journals in which it is published, or more generally by bibliomet- ric indices. We hope that their lucidity will become widespread in all countries. Abstract This text presents several aspects of the theory of equisingularity of com- plex analytic spaces from the standpoint of Whitney conditions. The goal is to describe from the geometrical, topological, and algebraic viewpoints a canonical locally finite partition of a reduced complex analytic space X into nonsingular strata with the property that the local geometry of X is constant on each stratum.
    [Show full text]
  • Flatness of Tangent Cones of a Family of Hypersurfaces
    CORE Metadata, citation and similar papers at core.ac.uk Provided by Elsevier - Publisher Connector JOURNAL OF ALGEBRA 128, 24&256 (1990) Flatness of Tangent Cones of a Family of Hypersurfaces GARY KENNEDY * Department of Mathematics, Duke University, Durham, North Carolina 27706 Communicated by J. Harris Received March 29, 1988 The notion of the “tangent cone” at a singular point of an algebraic variety ought to generalize that of the tangent space at a nonsingular point. In [S] Whitney proposed various candidates, including the usual one. The usual tangent cone stems from the geometric notion of a secant line; as a set the tangent cone at a point ;I; consists of the limiting directions of lines determined by fi and by a mobile point a, as a approaches fi. One obtains a different notion by allowing two points ai and pz to be mobile; following Johnson [3] we call the resulting set of limiting lines (as a, and a2 simultaneously approach #) the “tangent star” of the variety at the point. An even larger “tangent cone” is the Zariski tangent space. For a nonsingular variety X the tangent space at a point is the fiber of the tangent bundle TX. It is natural to ask whether the tangent cone at a point of a singular variety (or other algebraic scheme) is, in a similar fashion, the fiber of some variety (or scheme) over the base. For the usual tangent cone, the answer is unsatisfactory: for example, at the vertex of the cone on a plane curve the projectivized tangent cone is a copy of the curve, whereas by continuity it ought to be a projective plane.
    [Show full text]
  • Selected Mathematical Reviews Mr1375255
    BULLETIN (New Series) OF THE AMERICAN MATHEMATICAL SOCIETY Volume 48, Number 1, January 2011, Pages 91–100 S 0273-0979(2010)01324-0 Article electronically published on October 7, 2010 SELECTED MATHEMATICAL REVIEWS related to the paper in the previous section by SIMON BRENDLE AND RICHARD SCHOEN MR1375255 (97e:53075) 53C21; 58G30 Hamilton, Richard S. The formation of singularities in the Ricci flow. (English) Surveys in differential geometry, Vol. II (Cambridge, MA, 1993), 7–136, Int. Press, Cambridge, MA, 1995. In this paper the author surveys some of the basic geometrical properties of the Ricci flow with a view to considering what kind of singularities might form. The Ricci flow was introduced by the author in his celebrated work on deformation of metrics the by Ricci flow on compact 3-manifolds with positive Ricci curvature [J. Differential Geom. 17 (1982), no. 2, 255–306; MR0664497 (84a:53050)]. It has been found to be a useful tool in the study of Riemannian geometry, both for compact and for open manifolds. The idea is to study the evolution of a Riemannian metric along its Ricci curvature. Many interesting results have been obtained using the Ricci flow, and it is most desirable for the author to have written a survey in the field where he initiated and continues to make contributions. This article is a beautiful example of how to use techniques in differential equations and Riemannian geometry to study manifolds. The paper starts with some examples of how singularities might be formed under the Ricci flow. Short-time existence, derivative estimates and long-time existence are reviewed.
    [Show full text]
  • Seminar on Fundamentals of Algebraic Geometry I
    SEMINAR ON FUNDAMENTALS OF ALGEBRAIC GEOMETRY I HONGˆ VANˆ LEˆ ∗ AND PETR SOMBERG ∗∗ Abstract. Algebraic geometry is one of the central subjects of mathematics. Mathematical physicists, homotopy theorists, com- plex analysts, symplectic geometers, representation theorists speak the language of algebraic geometry. In this seminar we shall discuss some basic topics of algebraic geometry and their relation with current problems in mathematics. Recommended textbook: J. Harris, Algebraic Geometry: A First Course, I. R. Shafarevich: Basic Algebraic Geometry 1,2, Dol- gachev: Introduction to Algebraic Geometry (lecture notes) Contents 1. Overview and proposed topics 2 1.1. Overview of our plan 2 1.2. A brief history 3 1.3. Tentative plan of our seminar 4 2. Algebraic sets and the Hilbert basic theorem 4 2.1. Algebraic sets 5 2.2. The Hilbert basic theorem 5 3. Hilbert’s Nullstellensatz and Zariski topology 6 3.1. Hilbert’s Nullstellensatz 7 3.2. Zariski topology 9 4. Tangent space - a motivation + definition ... 10 4.1. Tangent space and the ring of dual numbers 11 4.2. Tangent space and the Hensel’s Lemma 12 4.3. Hensel’s lemma for algebraic varieties of dimension 0 13 5. The Hilbert basis theorem and Groebner bases 14 5.1. Groebner basis and Buchberger’s algorithm 14 5.2. Some applications of Groebner bases 16 ∗ Institute of Mathematics of ASCR, Zitna 25, 11567 Praha 1, email: [email protected] ∗∗ Mathematical Institute, Charles University, Sokolovska 83, 180 00 Praha 8, Czech Republic, email: [email protected]ff.cuni.cz. 1 2 HONGˆ VANˆ LEˆ ∗ AND PETR SOMBERG ∗∗ 6.
    [Show full text]
  • Towards Optimization Techniques on Diffeological Spaces by Generalizing
    Towards optimization techniques on diffeological spaces by generalizing Riemannian concepts Nico Goldammer∗ and Kathrin Welker† Abstract Diffeological spaces firstly introduced by J.M. Souriau in the 1980s are a natural generalization of smooth manifolds. However, optimization techniques are only known on manifolds so far. Generalizing these techniques to diffeo- logical spaces is very challenging because of several reasons. One of the main reasons is that there are various definitions of tangent spaces which do not co- incide. Additionally, one needs to deal with a generalization of a Riemannian space in order to define gradients which are indispensable for optimization methods. This paper is devoted to an optimization technique on diffeological spaces. Thus, one main aim of this paper is a suitable definition of a tangent space in view to optimization methods. Based on this definition, we present a diffeological Riemannian space and a diffeological gradient, which we need to formulate an optimization algorithm on diffeological spaces. Moreover, in order to be able to update the iterates in an optimization algorithm on diffeological spaces, we present a diffeological retraction and the Levi-Civita connection on diffeological spaces. We give examples for the novel objects and apply the presented diffeological algorithm to an optimization problem. Key words. Diffeological space, manifold, smooth space, Riemannian metric, tangent space, steepest descent method AMS subject classifications. 49Q10, 53C15, 57P99,54C56, 57R35, 65K05 1 Introduction The goal of all optimization processes is either to minimize effort or to maximize benefit. Optimization techniques are used for example in logistics, control engi- neering, economics and the finance market. Furthermore, the use of optimization in machine learning becomes of greater interest, see, e.g., [32].
    [Show full text]
  • Tangent Cones and Quasi-Interiorly Tangent Cones to the Graphs of Multifunctions Following the Way Opened by Rockafellar
    TRANSACTIONS of the AMERICAN MATHEMATICAL SOCIETY Volume 277, Number 2, June 1983 TANGENT CONES AND QUASI-INTERIORLYTANGENT CONES TO MULTIFUNCTIONS BY LIONEL THIBAULT Abstract. R. T. Rockaiellar has proved a number of rules of subdifferential calculus for nonlocally lipschitzian real-valued functions by investigating the Clarke tangent cones to the epigraphs of such functions. Following these lines we study in this paper the tangent cones to the sum and the composition of two multifunctions. This will be made possible thanks to the notion of quasi-interiorly tangent cone which has been introduced by the author for vector-valued functions in [29] and whose properties in the context of multifunctions are studied. The results are strong enough to cover the cases of real-valued or vector-valued functions. Introduction. Rockafellar has introduced in [23] the very important notion of directionally lipschitzian behaviour for extended real-valued functions, and with the aid of this notion he has proved in [24] a number of rules of subgradient calculus of nonconvex functions. If / is a function from a topological vector space E into R U {-oo, +00} with/(x) G R and if /(epi /; x, fix)) denotes the interiorly tangent cone to epi/={(x,^)G£XR:/(x)<j} at (x, fix)), that is, the set of all (ü, w) E E X R such that there exist a neighbour- hood X of (x, fix)), a neighbourhood Vof (v, w) in E X R and a real number e > 0 such that X n epi /+ ]0, e[VC epi /, then the proof of Theorem 3 of [23, p.
    [Show full text]