Riemannian Gradient Descent Methods for Graph-Regularized Matrix Completion
Shuyu Dong†, P.-A. Absil†, K. A. Gallivan‡∗
†ICTEAM Institute, UCLouvain, Belgium ‡Florida State University, U.S.A.
December 2, 2019
Abstract Low-rank matrix completion is the problem of recovering the missing entries of a data matrix by using the assumption that the true matrix admits a good low-rank approxi- mation. Much attention has been given recently to exploiting correlations between the column/row entities to improve the matrix completion quality. In this paper, we propose preconditioned gradient descent algorithms for solving the low-rank matrix completion problem with graph Laplacian-based regularizers. Experiments on synthetic data show that our approach achieves significant speedup compared to an existing method based on alternating minimization. Experimental results on real world data also show that our methods provide low-rank solutions of similar quality in comparable or less time than the state-of-the-art method.
Keywords: matrix completion; graph information; Riemannian optimization; low-rank op- timization MSC: 90C90, 53B21, 15A83
1 Introduction
Low-rank matrix completion arises in applications such as recommender systems, forecasting and imputation of data; see [38] for a recent survey. Given a data matrix with missing entries, the objective of matrix completion can be formulated as the minimization of an error function of a matrix variable with respect to the data matrix restricted to its revealed entries. In various applications, the data matrix either has a rank much lower than its dimensions or can be approximated by a low-rank matrix. As a consequence, restricting the rank of the matrix variable in the matrix completion objective not only corresponds to a reasonable assumption for successful recovery of the data matrix but also reduces the model complexity. In certain situations, besides the low-rank constraint, it is useful to add a regularization term to the error function, in order to favor other structural properties related to the true data matrix. This regularization term can often be built from side information, that is, any information associated with row and column indices of the data matrix. Recent efforts in ∗Email:{shuyu.dong, pa.absil}@uclouvain.be, [email protected]. This work was supported by the Fonds de la Recherche Scientifique – FNRS and the Fonds Wetenschappelijk Onderzoek – Vlaanderen under EOS Project no 30468160. The first author is supported by the FNRS through a FRIA scholarship. Part of this work was performed while the third author was a visiting professor at UCLouvain, funded by the Science and Technology Sector, and with additional support by the Netherlands Organization for Scientific Research.
1 exploiting side information for matrix completion include inductive low-rank matrix comple- tion [53, 23, 58] and graph-regularized matrix completion [61, 26, 59, 42, 55]. In particular, graph-regularized matrix completion involves designing the regularization term using graph Laplacian matrices that encode pairwise similarities between the row and/or column entities. In other words, the pairwise similarities between the entries of the data matrix are modeled using an undirected graph. Depending on the application, the graph information is available through the connections between the data entities or can be inferred from the data itself. Rao et al. [42] addressed the task of graph-regularized matrix completion by a low-rank factorization problem of the following form, 2 1 X T ? T r T c minimize (GH )ij − Mij + λrTr G Θ G + λcTr H Θ H , (1) (G,H)∈ m×k× n×k 2 R R (i,j)∈Ω
? m×n where M ∈ R is the data matrix to be completed, k is the maximal rank of the low-rank model, Ω is the set of revealed entries and λr ≥ 0 and λc ≥ 0 are parameters. The matrices r r c c r m×m Θ := Im + L and Θ := In + L with given graph Laplacian matrices L ∈ R and c n×n r c L ∈ R . The graph Laplacian matrices L and L incorporate the pairwise correlations or similarities between the columns or rows of the data matrix M ?. Indeed, observe that the graph Laplacian-based penalty terms in (1) in the form of Tr F T LF can be written as