Using Dual Techniques to Derive Componentwise and Mixed Condition Numbers for a Linear Function of a Linear Least Squares Solution

Total Page:16

File Type:pdf, Size:1020Kb

Using Dual Techniques to Derive Componentwise and Mixed Condition Numbers for a Linear Function of a Linear Least Squares Solution BIT Numer Math (2009) 49: 3–19 DOI 10.1007/s10543-009-0213-4 Using dual techniques to derive componentwise and mixed condition numbers for a linear function of a linear least squares solution Marc Baboulin · Serge Gratton Received: 22 July 2008 / Accepted: 21 January 2009 / Published online: 12 February 2009 © Springer Science + Business Media B.V. 2009 Abstract We prove duality results for adjoint operators and product norms in the framework of Euclidean spaces. We show how these results can be used to derive condition numbers especially when perturbations on data are measured componen- twise relatively to the original data. We apply this technique to obtain formulas for componentwise and mixed condition numbers for a linear function of a linear least squares solution. These expressions are closed when perturbations of the solution are measured using a componentwise norm or the infinity norm and we get an upper bound for the Euclidean norm. Keywords Dual norm · Adjoint operator · Componentwise perturbations · Condition number · Linear least squares Mathematics Subject Classification (2000) 65F35 1 Introduction The condition number of a problem measures the effect on the solution of small changes in the data and this quantity depends on the norms chosen for measuring Communicated by Lars Eldén. M. Baboulin () Departamento de Matemática, Universidade de Coimbra, 3001-454 Coimbra, Portugal e-mail: [email protected] M. Baboulin University of Tennessee, Knoxville, TN 37996-3450, USA S. Gratton Centre National d’Etudes Spatiales and CERFACS, 31057 Toulouse Cédex 1, France e-mail: [email protected] 4 M. Baboulin, S. Gratton perturbations on data and solution. A commonly used approach consists of measuring these perturbations globally using classical norms (e.g. ·p, p = 1, 2, ∞ or · F ) resulting in so-called normwise condition numbers. But as mentioned in [20], using norms for measuring matrix perturbations has several drawbacks. First, it does not give information about how the perturbation is distributed among the data. Second, as pointed out in [7, p. 31], errors can be overestimated by a normwise sensitivity analysis when the problem is badly scaled. In the componentwise analysis, perturbations are measured using metrics that take into account the structure of the matrix like sparsity or scaling. With such metrics, we could expect to minimize the amplification of perturbations resulting in a mini- mal condition number. Componentwise metrics are well suited for that because the perturbations on data are measured relatively to a given tolerance for each compo- nent of the data. For instance, if E is the matrix whose nonnegative entries are the individual tolerance for errors in components of A, then the componentwise relative error bound ω for a perturbation ΔA will be such that |ΔA|≤ωE (here partial order- ing and absolute values must be understood component by component). A common choice for E is E =|A|. Componentwise error analysis that provide us with exact expressions or bounds for componentwise condition numbers can be found for example in [5, 8, 19, 26, 28–30] for linear systems and in [4, 6, 9, 19, 23] for linear least squares. In particular, com- ponentwise backward errors are commonly used as stopping criteria in iterative re- finement for solving linear systems (see e.g. [3]) or linear least squares (see e.g. [10]). For the full rank linear least squares problem (LLSP), generalizing [16], [2]gives exact formulas for the conditioning of a linear transformation of the LLSP solu- tion when the perturbations of data and solution are measured normwise. Our ob- jective in this paper is to obtain similar quantities when perturbations on data are, contrary to [2], measured componentwise and the perturbations on the solution are measured either componentwise or normwise resulting in respectively component- wise and mixed condition numbers. In [17], a technique is presented to compute or estimate condition numbers using adjoint formulas. The results are presented in Banach spaces, and make use of the corresponding duality results in order to derive normwise condition numbers. In our paper, we show that these dual techniques are easy and helpful when presented in the framework of Euclidean spaces. In particular, as also mentioned in [17], they enable us to derive condition numbers by maximizing a linear function over a space of smaller dimension than the data space. We show in this paper that dual techniques can be used to derive condition num- bers when perturbations on the data are measured componentwise and we apply this method to LLSP. We propose exact formulas for the conditioning of LT x, linear func- tion of the LLSP solution when perturbations on data are measured componentwise and perturbations on the solution are measured either componentwise or normwise. Studying the condition number of LT x is relevant for instance when there is a dispar- ity between the conditioning of the solution components or when the computation of the least squares solution involves auxiliary variables without physical significance. The situations of common interest correspond to the cases where L is the identity ma- trix (condition number of an LLSP solution), a canonical vector (condition number of Componentwise condition of linear least squares 5 one solution component), or a projector, when we are interested in the sensitivity of a subset of the solution components. The conditioning of a nonlinear function of an LLSP solution can also be obtained by replacing in the condition number expression LT by the Jacobian matrix at the solution. When L is the identity matrix and when perturbations on the solution are measured using the infinity norm or a component- wise norm, we obtain the exact formula given in [9]. By considering the special case where we have a residual equal to zero, we obtain componentwise and mixed condi- tion numbers for LT x where x is the solution of a consistent linear system. When L is the identity matrix, these quantities recover the expressions known from [21, p. 123] and [18]. 2 Deriving condition numbers using dual techniques 2.1 Preliminary results on dual norms and adjoint operators We consider a linear mapping J : E → G where the Euclidean spaces E and G are equipped respectively with any norms .E, .G and scalar products ., . E and ., . G. Note that the norms .E and .G may not be, and in general won’t be, the particular norms induced by the scalar products ., . E and ., . G. Definition 1 The adjoint operator of J , J ∗ : G → E is defined by = ∗ y,Jx G J y,x E , where (x, y) ∈ E × G. We also define the dual norm .E∗ of .E by x,u E xE∗ = max , u =0 uE and define similarly the dual norm .G∗. For the common vector norms, the dual norms with respect to the canonical scalar product in Rn are well-known, and are given by: ·1∗ = ·∞ and ·∞∗ =·1 and ·2∗ =·2. For the matrix norms in Rm×n with respect to the scalar product A,B = T trace(A B),wehaveA2∗ =σ(A)1 (Lemma 3.5 of [31, p. 78]) where σ(A) T = 2 is the vector containing the singular values of A, and since trace(A A) A F we also have that AF ∗ =AF . For the linear applications mapping E to G, we denote by .E,G the operator norm induced by the norms .E and .G. Likewise, the norm .G∗,E∗ is the op- erator norm for linear applications mapping G to E and induced by the dual norms .G∗ and .E∗. Then we have the following theorem. Theorem 1 ∗ J E,G =J G∗,E∗. 6 M. Baboulin, S. Gratton Proof JxG J E,G = max x∈E xE Jx,u = max G we use the “duality theorem” [22, p. 287] x∈E,u∈G uG∗xE 1 x,J∗u = max max E u∈G uG∗ x∈E xE ∗ J u ∗ = max E u∈G uG∗ ∗ =J G∗,E∗. ∗ As mentioned in [17], it may be interesting to compute J G∗,E∗ (instead of J E,G) when G∗ is an Euclidean space of lower dimension than E because it im- plies a maximization over a space of smaller dimension. We now consider a product space E = E1 ×···×Ep where each Euclidean space Ei is equipped with the norm . Ei and the scalar product ., . Ei . In E, we consider the following scalar product (u1,...,up), (v1,...,vp) = u1,v1 +···+ up,vp , E1 Ep and the product norm = (u1,...,up) ν ν( u1 E1 ,..., up Ep ), where ν is an absolute norm on Rp (i.e. such that ν(|x|) = ν(x) ∀x ∈ Rp,[24, p. 367]). We denote by ν∗ the dual of ν with respect to the canonical inner-product p of R and we are interested in determining the dual .ν∗ of the product norm .ν with respect to the scalar product of E. Then we have the following result. Theorem 2 The dual of the product norm can be expressed as = (u1,...,up) ν∗ ν∗( u1 E1∗,..., up Ep∗). ui ,vi E Proof From u ∗ = max = i ,wehave∀v ∈ E u ,v ≤ i Ei vi 0 vi E i i i i Ei i ui Ei ∗ vi Ei and then p (u1,...,up)ν∗ = max ui,vi E (v ,...,v ) =1 i 1 p ν i=1 p ≤ max ui Ei ∗ vi Ei (v ,...,v ) =1 1 p ν i=1 Componentwise condition of linear least squares 7 ⎛ ⎞ ⎛ ⎞ T u1 E1∗ v1 E1 ⎜ . ⎟ ⎜ . ⎟ = max ⎝ . ⎠ ⎝ . ⎠ ν(v ,...,v )=1 . 1 E1 p Ep up Ep∗ vp Ep = ν∗( u1 E1∗,..., up Ep∗). So, we have shown that ν∗( u1 E1∗,..., up Ep∗) is an upper-bound for the dual of the product norm.
Recommended publications
  • The Nonstandard Theory of Topological Vector Spaces
    TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY Volume 172, October 1972 THE NONSTANDARDTHEORY OF TOPOLOGICAL VECTOR SPACES BY C. WARD HENSON AND L. C. MOORE, JR. ABSTRACT. In this paper the nonstandard theory of topological vector spaces is developed, with three main objectives: (1) creation of the basic nonstandard concepts and tools; (2) use of these tools to give nonstandard treatments of some major standard theorems ; (3) construction of the nonstandard hull of an arbitrary topological vector space, and the beginning of the study of the class of spaces which tesults. Introduction. Let Ml be a set theoretical structure and let *JR be an enlarge- ment of M. Let (E, 0) be a topological vector space in M. §§1 and 2 of this paper are devoted to the elementary nonstandard theory of (F, 0). In particular, in §1 the concept of 0-finiteness for elements of *E is introduced and the nonstandard hull of (E, 0) (relative to *3R) is defined. §2 introduces the concept of 0-bounded- ness for elements of *E. In §5 the elementary nonstandard theory of locally convex spaces is developed by investigating the mapping in *JK which corresponds to a given pairing. In §§6 and 7 we make use of this theory by providing nonstandard treatments of two aspects of the existing standard theory. In §6, Luxemburg's characterization of the pre-nearstandard elements of *E for a normed space (E, p) is extended to Hausdorff locally convex spaces (E, 8). This characterization is used to prove the theorem of Grothendieck which gives a criterion for the completeness of a Hausdorff locally convex space.
    [Show full text]
  • BS II: Elementary Banach Space Theory
    BS II c Gabriel Nagy Banach Spaces II: Elementary Banach Space Theory Notes from the Functional Analysis Course (Fall 07 - Spring 08) In this section we introduce Banach spaces and examine some of their important features. Sub-section B collects the five so-called Principles of Banach space theory, which were already presented earlier. Convention. Throughout this note K will be one of the fields R or C, and all vector spaces are over K. A. Banach Spaces Definition. A normed vector space (X , k . k), which is complete in the norm topology, is called a Banach space. When there is no danger of confusion, the norm will be omitted from the notation. Comment. Banach spaces are particular cases of Frechet spaces, which were defined as locally convex (F)-spaces. Therefore, the results from TVS IV and LCVS III will all be relevant here. In particular, when one checks completeness, the condition that a net (xλ)λ∈Λ ⊂ X is Cauchy, is equivalent to the metric condition: (mc) for every ε > 0, there exists λε ∈ Λ, such that kxλ − xµk < ε, ∀ λ, µ λε. As pointed out in TVS IV, this condition is actually independent of the norm, that is, if one norm k . k satisfies (mc), then so does any norm equivalent to k . k. The following criteria have already been proved in TVS IV. Proposition 1. For a normed vector space (X , k . k), the following are equivalent. (i) X is a Banach space. (ii) Every Cauchy net in X is convergent. (iii) Every Cauchy sequence in X is convergent.
    [Show full text]
  • Dual-Norm Least-Squares Finite Element Methods for Hyperbolic Problems
    Dual-Norm Least-Squares Finite Element Methods for Hyperbolic Problems by Delyan Zhelev Kalchev Bachelor, Sofia University, Bulgaria, 2009 Master, Sofia University, Bulgaria, 2012 M.S., University of Colorado at Boulder, 2015 A thesis submitted to the Faculty of the Graduate School of the University of Colorado in partial fulfillment of the requirements for the degree of Doctor of Philosophy Department of Applied Mathematics 2018 This thesis entitled: Dual-Norm Least-Squares Finite Element Methods for Hyperbolic Problems written by Delyan Zhelev Kalchev has been approved for the Department of Applied Mathematics Thomas A. Manteuffel Stephen Becker Date The final copy of this thesis has been examined by the signatories, and we find that both the content and the form meet acceptable presentation standards of scholarly work in the above mentioned discipline. Kalchev, Delyan Zhelev (Ph.D., Applied Mathematics) Dual-Norm Least-Squares Finite Element Methods for Hyperbolic Problems Thesis directed by Professor Thomas A. Manteuffel Least-squares finite element discretizations of first-order hyperbolic partial differential equations (PDEs) are proposed and studied. Hyperbolic problems are notorious for possessing solutions with jump discontinuities, like contact discontinuities and shocks, and steep exponential layers. Furthermore, nonlinear equations can have rarefaction waves as solutions. All these contribute to the challenges in the numerical treatment of hyperbolic PDEs. The approach here is to obtain appropriate least-squares formulations based on suitable mini- mization principles. Typically, such formulations can be reduced to one or more (e.g., by employing a Newton-type linearization procedure) quadratic minimization problems. Both theory and numer- ical results are presented.
    [Show full text]
  • On the Duality of Strong Convexity and Strong Smoothness: Learning Applications and Matrix Regularization
    On the duality of strong convexity and strong smoothness: Learning applications and matrix regularization Sham M. Kakade and Shai Shalev-Shwartz and Ambuj Tewari Toyota Technological Institute—Chicago, USA fsham,shai,[email protected] Abstract strictly convex functions). Here, the notion of strong con- vexity provides one means to characterize this gap in terms We show that a function is strongly convex with of some general norm (rather than just Euclidean). respect to some norm if and only if its conjugate This work examines the notion of strong convexity from function is strongly smooth with respect to the a duality perspective. We show a function is strongly convex dual norm. This result has already been found with respect to some norm if and only if its (Fenchel) con- to be a key component in deriving and analyz- jugate function is strongly smooth with respect to its dual ing several learning algorithms. Utilizing this du- norm. Roughly speaking, this notion of smoothness (defined ality, we isolate a single inequality which seam- precisely later) provides a second order upper bound of the lessly implies both generalization bounds and on- conjugate function, which has already been found to be a key line regret bounds; and we show how to construct component in deriving and analyzing several learning algo- strongly convex functions over matrices based on rithms. Using this relationship, we are able to characterize strongly convex functions over vectors. The newly a number of matrix based penalty functions, of recent in- constructed functions (over matrices) inherit the terest, as being strongly convex functions, which allows us strong convexity properties of the underlying vec- to immediately derive online algorithms and generalization tor functions.
    [Show full text]
  • An Introduction to Some Aspects of Functional Analysis, 7: Convergence of Operators
    An introduction to some aspects of functional analysis, 7: Convergence of operators Stephen Semmes Rice University Abstract Here we look at strong and weak operator topologies on spaces of bounded linear mappings, and convergence of sequences of operators with respect to these topologies in particular. Contents I The strong operator topology 2 1 Seminorms 2 2 Bounded linear mappings 4 3 The strong operator topology 5 4 Shift operators 6 5 Multiplication operators 7 6 Dense sets 9 7 Shift operators, 2 10 8 Other operators 11 9 Unitary operators 12 10 Measure-preserving transformations 13 II The weak operator topology 15 11 Definitions 15 1 12 Multiplication operators, 2 16 13 Dual linear mappings 17 14 Shift operators, 3 19 15 Uniform boundedness 21 16 Continuous linear functionals 22 17 Bilinear functionals 23 18 Compactness 24 19 Other operators, 2 26 20 Composition operators 27 21 Continuity properties 30 References 32 Part I The strong operator topology 1 Seminorms Let V be a vector space over the real or complex numbers. A nonnegative real-valued function N(v) on V is said to be a seminorm on V if (1.1) N(tv) = |t| N(v) for every v ∈ V and t ∈ R or C, as appropriate, and (1.2) N(v + w) ≤ N(v) + N(w) for every v, w ∈ V . Here |t| denotes the absolute value of a real number t, or the modulus of a complex number t. If N(v) > 0 when v 6= 0, then N(v) is a norm on V , and (1.3) d(v, w) = N(v − w) defines a metric on V .
    [Show full text]
  • November 7, 2013 1 Introduction
    CS 229r: Algorithms for Big Data Fall 2013 Lecture 20 | November 7, 2013 Prof. Jelani Nelson Scribe: Yun William Yu 1 Introduction Today we're going to go through the analysis of matrix completion. First though, let's go through the history of prior work on this problem. Recall the setup and model: • Matrix completion setup: n ×n { Want to recover M 2 R 1 2 , under the assumption that rank(M) = r, where r is small. { Only some small subset of the entries (Mij)ij2Ω are revealed, where Ω ⊂ [n1]×[n2]; jΩj = m n1; n2 • Model: { m times we sample i; j uniformly at random + insert into Ω (so Ω is a multiset). { Note that the same results hold if we choose m entries without replacement, but it's easier to analyze this way. In fact, if you can show that if recovery works with replacement, then that implies that recovery works without replacement, which makes sense because you'd only be seeing more information about M. • We recover M by Nuclear Norm Minimization (NNM): { Solve the program min kXk∗ s.t. 8i; j 2 Ω;Xij = Mij • [Recht, Fazel, Parrilo '10] [RFP10] was first to give some rigorous guarantees for NNM. { As you'll see on the pset, you can actually solve this in polynomial time since it's an instance of what's known as a semi-definite program. • [Cand´es,Recht, '09] [CR09] was the first paper to show provable guarantees for NNM applied to matrix completion. • There were some quantitative improvements (in the parameters) in two papers: [Cand´es, Tao '09] [CT10] and [Keshavan, Montanari, Oh '09] [KMO10] • Today we're going to cover an even newer analysis given in [Recht, 2011] [Rec11], which has a couple of advantages.
    [Show full text]
  • Grothendieck's Theorem, Past and Present
    Grothendieck's Theorem, past and present by Gilles Pisier∗ Texas A&M University College Station, TX 77843, U. S. A. and Universit´eParis VI Equipe d'Analyse, Case 186, 75252 Paris Cedex 05, France June 7, 2011 Abstract Probably the most famous of Grothendieck's contributions to Banach space theory is the result that he himself described as \the fundamental theorem in the metric theory of tensor products". That is now commonly referred to as \Grothendieck's theorem" (GT in short), or sometimes as \Grothendieck's inequality". This had a major impact first in Banach space theory (roughly after 1968), then, later on, in C∗-algebra theory, (roughly after 1978). More recently, in this millennium, a new version of GT has been successfully developed in the framework of \operator spaces" or non-commutative Banach spaces. In addition, GT independently surfaced in several quite unrelated fields: in connection with Bell's inequality in quantum mechanics, in graph theory where the Grothendieck constant of a graph has been introduced and in computer science where the Grothendieck inequality is invoked to replace certain NP hard problems by others that can be treated by “semidefinite programming' and hence solved in polynomial time. This expository paper (where many proofs are included), presents a review of all these topics, starting from the original GT. We concentrate on the more recent developments and merely outline those of the first Banach space period since detailed accounts of that are already available, for instance the author's 1986 CBMS notes. arXiv:1101.4195v3 [math.FA] 4 Jun 2011 ∗Partially supported by NSF grant 0503688 Contents 1 Introduction 1 2 Classical GT 7 3 Classical GT with tensor products 11 4 The Grothendieck constants 18 5 The \little" GT 20 6 Banach spaces satisfying GT 22 7 Non-commutative GT 23 8 Non-commutative \little GT" 25 9 Non-commutative Khintchine inequality 26 10 Maurey factorization 33 11 Best constants (Non-commutative case) 34 12 C∗-algebra tensor products, Nuclearity 36 13 Operator spaces, c.b.
    [Show full text]
  • On a New Method for Defining the Norm of Fourier-Stieltjes Algebras
    Pacific Journal of Mathematics ON A NEW METHOD FOR DEFINING THE NORM OF FOURIER-STIELTJES ALGEBRAS MARTIN E. WALTER Vol. 137, No. 1 January 1989 PACIFIC JOURNAL OF MATHEMATICS Vol. 137, No. 1, 1989 ON A NEW METHOD FOR DEFINING THE NORM OF FOURIER-STIELTJES ALGEBRAS MARTIN E. WALTER This paper is dedicated to the memory of Henry Abel Dye P. Eymard equipped B(G), the Fourier-Stieltjes algebra of a locally compact group G, with a norm by considering it the dual Banach space of bounded linear functional on another Banach space, namely the universal C*-algebra, C*(G). We show that B(G) can be given the exact same norm if it is considered as a Banach subalgebra of 3f(C*(G))9 the Banach algebra of completely bounded maps of C* (G) into itself equipped with the completely bounded norm. We show here how the latter approach leads to a duality theory for finite (and, more generally, discrete) groups which is not available if one restricts attention to the "linear functional" [as opposed to the "completely bounded map99] approach. 1. Preliminaries. For most of this paper G will be a finite (discrete) group, although we consider countably infinite discrete groups in the last section. We now summarize certain aspects of duality theory that we will need. The discussion is for a general locally compact group G whenever no simplification is achieved by assuming G to be finite. The reader familiar with Fourier and Fourier-Stieltjes algebras may proceed to §2 and refer back to this section as might be necessary.
    [Show full text]
  • Math 554 Linear Analysis Autumn 2006 Lecture Notes
    Math 554 Linear Analysis Autumn 2006 Lecture Notes Ken Bube and James Burke October 2, 2014 ii Contents Linear Algebra and Matrix Analysis 1 Vector Spaces . 1 Linear Independence, Span, Basis . 3 Change of Basis . 4 Constructing New Vector Spaces from Given Ones . 5 Dual Vector Spaces . 6 Dual Basis in Finite Dimensions . 7 Linear Transformations . 8 Projections . 10 Nilpotents . 12 Dual Transformations . 14 Bilinear Forms . 16 Norms . 20 Equivalence of Norms . 22 Norms induced by inner products . 24 Closed Unit Balls in Finite Dimensional Normed Linear Spaces . 26 Completeness . 27 Completion of a Metric Space . 29 Series in normed linear spaces . 30 Norms on Operators . 31 Bounded Linear Operators and Operator Norms . 31 Dual norms . 33 Submultiplicative Norms . 36 Norms on Matrices . 37 Consistent Matrix Norms . 38 Analysis with Operators . 39 Adjoint Transformations . 41 Condition Number and Error Sensitivity . 42 Finite Dimensional Spectral Theory . 45 Unitary Equivalence . 49 Schur Unitary Triangularization Theorem . 50 Cayley-Hamilton Theorem . 51 Rayleigh Quotients and the Courant-Fischer Minimax Theorem . 51 Non-Unitary Similarity Transformations . 53 iii iv Jordan Form . 54 Spectral Decomposition . 56 Jordan Form over R ............................... 57 Non-Square Matrices . 59 Singular Value Decomposition (SVD) . 59 Applications of SVD . 62 Linear Least Squares Problems . 63 Linear Least Squares, SVD, and Moore-Penrose Pseudoinverse . 64 LU Factorization . 68 QR Factorization . 71 Using QR Factorization to Solve Least Squares Problems . 73 The QR Algorithm . 73 Convergence of the QR Algorithm . 74 Resolvent . 77 Perturbation of Eigenvalues and Eigenvectors . 80 Functional Calculus . 87 The Spectral Mapping Theorem . 91 Logarithms of Invertible Matrices . 93 Ordinary Differential Equations 95 Existence and Uniqueness Theory .
    [Show full text]
  • The Prox Map
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elsevier - Publisher Connector JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS 156. 42X-443 [ 1991) The Prox Map GEKALU BEEK Deparrmenl of Marhemarics, Calij&niu Srare University, Los Angeles, Cahforniu 90032 AND DEVIDAS PAI * Department of Mathematics, Indian Invrituk qf’ Technolog.y, Powai, Bombay, 400076, India Submirred by R. P. Boas Recetved March 20, 198Y Let B be a weak*-compact subset of a dual normcd space X* and let A be a weak*-closed subset of X*. With d(B, A) denoting the distance between B and A in the usual sense, Prox(B, A) is the nonempty weak*-compact subset of BX A consisting of all (b, a) for which lib-al/ =d(B, A). In this article we study continuity properties of (B, A) + d(B, A) and (B, A) --* Prox(B, A), obtaining a generic theorem on points of single valuedness of the Prox map for convex sets. Best approximation and fixed point theorems for convex-valued multifunctions are obtained as applications. x 1991 Acddrmc Prraa. Inc. 1. INTRODUCTION Let X be a normed linear space. Given nonempty subsets A and B of X we write d(B,A) for inf{lIb-all :beB and UEA}. Points bEB and UEA are called proximal points of the pair (B, A) provided )(b - a/I = d(B, A). We denote the (possibly void) set of ordered pairs (h, a) of proximal points for (B, A) by Prox(B, A). Note that (6, a) E Prox(B, A) if an only if a-b is a point nearest the origin in A ~ B.
    [Show full text]
  • Low-Rank Inducing Norms with Optimality Interpretations
    Low-Rank Inducing Norms with Optimality Interpretations Christian Grussler ∗ Pontus Giselsson y June 12, 2018 Abstract Optimization problems with rank constraints appear in many diverse fields such as control, machine learning and image analysis. Since the rank constraint is non-convex, these problems are often approximately solved via convex relax- ations. Nuclear norm regularization is the prevailing convexifying technique for dealing with these types of problem. This paper introduces a family of low-rank inducing norms and regularizers which includes the nuclear norm as a special case. A posteriori guarantees on solving an underlying rank constrained optimization problem with these convex relaxations are provided. We evaluate the performance of the low-rank inducing norms on three matrix completion problems. In all ex- amples, the nuclear norm heuristic is outperformed by convex relaxations based on other low-rank inducing norms. For two of the problems there exist low-rank inducing norms that succeed in recovering the partially unknown matrix, while the nuclear norm fails. These low-rank inducing norms are shown to be representable as semi-definite programs. Moreover, these norms have cheaply computable prox- imal mappings, which makes it possible to also solve problems of large size using first-order methods. 1 Introduction Many problems in machine learning, image analysis, model order reduction, multivari- ate linear regression, etc. (see [2,4,5,7, 22, 26, 27, 32, 33, 37]), can be posed as a low-rank estimation problems based on measurements and prior information about a arXiv:1612.03186v2 [math.OC] 11 Jun 2018 data matrix. These estimation problems often take the form minimize f0(M) M (1) subject to rank(M) ≤ r; ∗Department of Engineering, Control Group, Cambridge University, United Kingdom [email protected].
    [Show full text]
  • On Group Representations Whose C* Algebra Is an Ideal in Its Von
    ANNALES DE L’INSTITUT FOURIER EDMOND E. GRANIRER On group representations whoseC∗ algebra is an ideal in its von Neumann algebra Annales de l’institut Fourier, tome 29, no 4 (1979), p. 37-52 <http://www.numdam.org/item?id=AIF_1979__29_4_37_0> © Annales de l’institut Fourier, 1979, tous droits réservés. L’accès aux archives de la revue « Annales de l’institut Fourier » (http://annalif.ujf-grenoble.fr/) implique l’accord avec les conditions gé- nérales d’utilisation (http://www.numdam.org/conditions). Toute utilisa- tion commerciale ou impression systématique est constitutive d’une in- fraction pénale. Toute copie ou impression de ce fichier doit conte- nir la présente mention de copyright. Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques http://www.numdam.org/ Ann. Inst. Fourier, Grenoble 29, 4 (1979), 37-52. ON GROUP REPRESENTATIONS WHOSE C*-ALGEBRA IS AN IDEAL IN ITS VON NEUMANN ALGEBRA by Edmond E. GRANIRER Introduction. — Let T be a unitary continuous representation of the locally compact group G on the Hilbert space H^ and denote by L(H,)[LC(H^)] the algebra of all bounded [compact] linear operators on H^. T can be lifted in the usual way to a *-representation of L^G). Denote by C*(G) = C* the norm closure of rEL^G)] in L(H,) (with operator norm) and by VN,(G) = VN, the W*-algebra generated by ^L^G)] in L(H,). Let M,(C*) = {(peVN,; (pC* + C*(p c: C*} i.e. the two sided multipliers of C* in VN, (not in the bidual (C*)" of C*).
    [Show full text]