<<

INTERPOLATION AND MODEL REDUCTION

RALF ZIMMERMANN∗

Abstract. One approach to parametric and adaptive model reduction is via the interpolation of orthogonal bases, subspaces or positive definite system matrices. In all these cases, the sampled inputs stem from sets that feature a geometric structure and thus form so-called matrix . This work will be featured as a chapter in the upcoming Handbook on Model Order Reduction, (P. Benner, S. Grivet-Talocia, A. Quarteroni, G. Rozza, W. H. A. Schilders, L. M. Silveira, eds, to appear on DE GRUYTER) and reviews the numerical treatment of the most important matrix manifolds that arise in the context of model reduction. Moreover, the principal approaches to data interpolation and Taylor-like extrapolation on matrix manifolds are outlined and complemented by algorithms in pseudo-code.

Key words. parametric model reduction, matrix manifold, Riemannian computing, interpolation, interpolation on manifolds, Grassmann manifold, , matrix Lie

AMS subject classifications. 15-01, 15A16, 15B10, 15B48, 53-04, 65F60, 41-01, 41A05, 65F99, 93A15, 93C30

1. Introduction & Motivation. This work addresses interpolation approaches for parametric model reduction. This includes techniques for • computing trajectories of parameterized subspaces, • computing trajectories of parameterized reduced orthogonal bases, • structure-preserving interpolation. Mathematically, this requires data processing on nonlinear matrix manifolds. The exposition at hand intends to be an introduction and a reference guide to numerical procedures with matrix manifold-valued data. As such it addresses practitioners and scientists new to the field. It covers the essentials of those matrix manifolds that arise most frequently in practical problems in model reduction. The main purpose is not to discuss concrete model reduction applications, but rather to provide the essential tools, building blocks and background to enable the reader to devise her/his own approaches for such applications. The text was designed such that it works as a commented formula collection, meanwhile giving sufficient context, explanations and, not least, precise references to enable the interested reader to immerse further in the topic. 1.1. Parametric model reduction via manifold interpolation: An intro- ductory example. The basic objective in model reduction is to emulate a large-scale with very few such that its input/output behav- ior is preserved as well as possible. While classical model reduction techniques aim at producing an accurate low-order approximation to the autonomous behavior of the arXiv:1902.06502v2 [math.NA] 11 Sep 2019 original system, parametric model reduction (pMOR) tries to account for additional system parameters. If we look for instance at aircraft aerodynamics, an important task is to solve the unsteady Navier-Stokes at various flight conditions, which are, amongst others, specified by the altitude, the viscosity of the fluid (i.e. the Reynolds number) and the relative velocity (i.e. the Mach number).We explain the objective of pMOR with the aid of a generic example in the context of proper orthogo- nal decomposition-based model reduction. Similar considerations apply to frequency domain approaches, Krylov subspace methods and balanced truncation, which are

∗Department of Mathematics and Computer Science, University of Southern Denmark (SDU) Odense, ([email protected]). 1 discussed in other chapters of the upcoming Handbook on Model Order Reduction. Consider a spatio-temporal dynamical system in semi-discrete form ∂ x(t, µ) = f(x(t, µ); µ), x(t , µ) = x , (1.1) ∂t 0 0,µ

where x(t, µ) ∈ Rn is the spatially discretized state vector of n, the vec- d tor µ = (µ1, . . . , µd) ∈ R accounts for additional system parameters and f( · ; µ): Rn → Rn is the (possibly nonlinear, parameter-dependent) right hand side . -based MOR starts with constructing a suitable low-dimensional subspace that acts as a of candidate solutions. Subspace construction. One way to construct the required projection subspace is the proper orthogonal decomposition (POD), [48].In its simplest form, the POD 1 can be summarized as follows. For a fixed system parameter µ = µ0, let x := m n x(t1, µ0), ..., x := x(tm, µ0) ∈ R be a set of state vectors satisfying (1.1) and let 1 m n×m i S := x , ..., x ∈ R . The state vectors x are called snapshots and the matrix S is called the associated snapshot matrix. POD is concerned with finding a subspace n×r V of dimension r ≤ m represented by a column-orthogonal matrix Vr ∈ R such that the error between the input snapshots and their orthogonal projection onto V = ran(Vr) is minimized:   X k T k 2 T 2 min kx − VV x k2 ⇔ min kS − VV SkF . V ∈ n×r ,V T V =I V ∈ n×r ,V T V =I R k R The main result of POD is that for any r ≤ m, the best r-dimensional approximation of ran(x1, ..., xm) in the above sense is V = ran(v1, ..., vr), where {v1, ..., vr} are the eigenvectors of the matrix SST corresponding to the r largest eigenvalues. The sub- 1 r space V is called the POD subspace and the matrix Vr = (v , ..., v ) is the POD matrix. The same subspace is obtained via a compact singular value decomposition (SVD) of the snapshot matrix S = VΣZT , truncated to the first r ≤ m columns of n×m V ∈ R by setting V := ran(Vr). For more details, see, e.g. [17, §3.3]. In the following, we drop the index r and assume that V is already the truncated matrix V = (v1, ..., vr) ∈ Rn×r. Since the input snapshots are supplied at a fixed system parameter vector µ0, the POD subspace is considered to be an appropriate space of solution candidates V(µ0) = ran(V(µ0)) at µ0. Projection. POD leads to a parameter decoupling

x˜(t, µ0) = V(µ0)xr(t). (1.2) In this way, the time trajectory of the reduced model is uniquely defined by the coef- r ficient vector xr(t) ∈ R that represents the reduced state vector with respect to the subspace ran(V(µ0)). Given a matrix W(µ0) such that the matrix pair V(µ0), W(µ0) T is bi-orthogonal, i.e. W(µ0) V(µ0) = I, the original system (1.1) can be reduced in T dimension as follows. Substituting (1.2) in (1.1) and multiplying with W(µ0) from the left leads to d x (t) = T (µ )f( (µ )x (t); µ ), x (t ) = T (µ )x . (1.3) dt r W 0 V 0 r 0 r 0 V 0 0,µ0

This approach goes by the name of Petrov-Galerkin projection, if W(µ0) 6= V(µ0) and Galerkin projection if W(µ0) = V(µ0). There are various ways to proceed from (1.3) 2 depending on the nature of the function f and many of them are discussed in other chapters of the upcoming Handbook on Model Order Reduction. 1 For illustration purposes, we proceed with W(µ0) = V(µ0) and assume that the right hand side function f splits into a linear and a nonlinear part: f(x; µ0) = n×n A(µ0)x + f(x; µ0), where A(µ0) ∈ R is, say, a symmetric and negative definite matrix to foster stability. Then, (1.3) becomes

d x (t) = T (µ )A(µ ) (µ )x (t) + T (µ )f (µ )x (t); µ . dt r V 0 0 V 0 r V 0 V 0 r 0 In the discrete empirical interpolation method (DEIM, [27]), the large-scale nonlinear n×s term f V(µ0)xr(t); µ0) is approximated via a mask matrix P = (ei1 , . . . , eis ) ∈ R , j T n where {i1, . . . , is} ⊂ {1, . . . , n} and ej = (..., 1,...) ∈ R is the jth canonical unit vector. The mask matrix P acts as an entry selector on a given n-vector via T T s n×s P v = (vi1 , . . . , vis ) ∈ R . In addition, another POD basis matrix U(µ0) ∈ R is used, which is obtained from snapshots of the nonlinear term. The matrices P and U(µ0) are combined to form an oblique projection of the non-linear term onto the subspace ran(U(µ0)). This leads to the reduced model d x (t) = T (µ )A(µ ) (µ )x (t) dt r V 0 0 V 0 r T T −1 T  +V (µ0)U(µ0)(P U(µ0)) P f V(µ0)xr(t); µ0 , (1.4) whose computational is formally independent of the full-order dimension T n, see [27] for details. Mind that by assumption, M(µ0) := −V (µ0)A(µ0)V(µ0) is symmetric positive definite and that both V(µ0) and U(µ0) are column-orthogonal. Moreover, for a fixed mask matrix P , coordinate changes of V(µ0) and U(µ0) do not affect the approximated statex ˜(t, µ0) = V(µ0)xr(t), so that essentially, the reduced system (1.4) depends only on the subspaces ran(V(µ0)) and ran(U(µ0)) rather than 2 the matrices V(µ0) and U(µ0). Solving (1.3), (1.4) constitutes the online stage of model reduction. The main focus of this exposition is not on the efficient solution of the reduced systems (1.3) or (1.4) at a fixed µ0, but on tackling parametric variations in µ. In view of the associated computational costs, it is important that this can be achieved without computing additional snapshots in the online stage. A straightforward way to achieve this is to extend the snapshot sampling to the µ- parameter range to produce POD basis matrices that are to cover all input parameters. This is usually referred to as the “global approach”. For nonlinear systems, the global approach may suffer from requiring a large number of snapshot samples. Moreover, the snapshot information is blurred in the global POD and features that occur only in a restricted regime affect the ROM predictions everywhere. Therefore, localized approaches are preferable, see e.g. [35, 75, 77, 91, 100].

1 T If f( · ; µ0) is linear, the reduced operator W (µ0) ◦ f( · ; µ0) ◦ V(µ0) can be computed a priori (‘offline’) and stays fixed throughout the time integration. If f( · ; µ0) is affine, the same approach can be carried over to the affine building blocks of f( · ; µ0), see e.g. [42]. For a nonlinear f( · ; µ0), an affine approximation can be constructed via the emperical interpolation method (EIM, [14]). Other approaches that address nonlinearities include the discrete empirical interpolation method (DEIM, [27]) and the missing estimation (MPE, [13, 105]). 2 s×s Replacing U with US, S ∈ R orthogonal, does not affect (1.4) at all. Replacing V with VR, r×r R ∈ R orthogonal, induces a coordinate change on the reduced state xr = Rxˆr but preserves the outputx ˜(t) = Vxr(t) = VRxˆr(t). 3 In this contribution, the focus is on constructing trajectories of functions in the system parameters µ on certain sets of structured matrix . In the above exam- ple, these are the symmetric positive definite matrices {M ∈ Rr×r|M T = M, vT Mv > 0 ∀v 6= 0}, the matrices {U ∈ Rn×s|U T U = I} or the associated s-dimensional subspaces U := ran(U) ⊂ Rn:

T r×r T T µ 7→ −V (µ)A(µ)V(µ) ∈ {M ∈ R |M = M, v Mv > 0 ∀v 6= 0}, n×s T µ 7→ U(µ) ∈ {U ∈ R |U U = I}, n µ 7→ U(µ) = ran(U(µ)) ∈ {U ⊂ R | U subspace, dim(U) = s}. We outline generic methods for constructing such trajectories via interpolation. All the special sets of matrices considered above feature a differentiable structure that allows to consider them as of some Euclidean matrix space, referred to as matrix manifolds. The above example is not exhaustive. Other matrix manifolds may arise in model reduction applications. To keep the exposition both general and modular, the interpolation techniques will be formulated for arbitrary submanifolds. Model reduction literature on manifold interpolation problems includes [8, 9, 17, 31, 71, 73, 94, 76, 100, 29, 65]. 1.2. Structure and organization. The text is constructed modular rather than consecutive, so that selected reading is enabled. Yet, this entails that the reader will encounter some repetition. Section 2 covers the essential background from differential . Section 3 con- tains generic methods for interpolation and extrapolation on matrix manifolds. In Section 4, the geometric and numerical aspects of the matrix manifolds that arise most frequently in the context of model reduction are discussed. A practitioner that faces a problem in matrix manifold interpolation may skim through the recap on elementary differential geometry in Section 2 and then move on to the appropriate subsection of Section 4 that corresponds to the matrix manifold in the application. This provides the specific ingredients and formulas for conducting the generic interpolation methods of Section 3. 1.3. Notation & Abbreviations. • w.r.t.: with respect to • EVD: eigenvalue decomposition • SVD: singular value decomposition • POD: proper orthogonal decomposition • LTI: linear time- (system) • ODE: ordinary differential • PDE: partial differential equation • ONB: orthonormal basis n×r • R : the set of real n-by-r matrices

• In: the n-by-n identity matrix; if are clear, written as I n×r • ran(A): the subspace spanned by the columns of A ∈ R • GL(n): the general of real, invertible n-by-n matrices n×n T • sym(n) = {A ∈ R |A = A}: the set of real, symmetric n-by-n matrices n×n T • skew(n) = {A ∈ R |A = −A}: the set of real, skew-symmetric n-by-n matrices 4 T n • SPD(n) = {A ∈ sym(n)|x Ax > 0∀x ∈ R \{0}}: the set of real, symmetric positive definite n-by-n matrices n×n T T • O(n) = {Q ∈ R |Q Q = In = QQ }: the • SO(n) = {Q ∈ O(n)| det(Q) = 1}: the special orthogonal group n×r T • St(n, r) = {U ∈ R |U U = Ir}: the (compact) Stiefel manifold, r ≤ n n • Gr(n, r): the Grassmann manifold of r-dimensional subspaces of R , r ≤ n •M : a differentiable manifold

•D p ⊂ M: an open domain around the point p on a manifold M n n • Dx ⊂ R : an open domain in the around a point x ∈ R

• TpM: the space of M at a location p ∈ M T n×r •h A, Bi0 = trace(A B): the standard (Frobenius) inner product on R M •h v, wip : the Riemannian on TpM (the superscript is often omitted)

• expm: standard matrix exponential

• logm: standard (principal) matrix M • Expp : the Riemmanian exponential of a manifold M at base point p ∈ M M • Logp : the Riemmanian logarithm of a manifold M at base point p ∈ M 2. Basic concepts of differential geometry. This section provides the essen- tials on elementary differential geometry. Established textbook references on differ- ential geometry include [32, 57, 58, 60, 62]; condensed introductions can be found in [46, Appendices C.3, C.4, C.5] and [36]. An account of differential geometry that is tailor-made to matrix manifold applications is given in [3]. The fundamental objects of study in differential geometry are differentiable mani- folds. Differentiable manifolds are generalizations of (one-dimensional) and sur- faces (two-dimensional) to arbitrary dimensions. Loosely speaking, an n-dimensional differentiable manifold M is a that ‘locally looks like Rn’ with cer- tain properties. This concept is rendered precisely by postulating that n for every point p ∈ M, there exists a so-called coordinate chart x : M ⊃ Dp → R that bijectively an open neighborhood Dp ⊂ M of a location p to an open n n neighborhood Dx(p) ⊂ R around x(p) ∈ R with the important additional property that the coordinate change

−1 x ◦ x˜ :x ˜(Dp ∩ D˜p) → x(Dp ∩ D˜p) of two such charts x, x˜ is a diffeomorphism, where their domains of definition overlap, see [36, Fig. 18.2, p. 496] or [46, Fig. 3.1, p. 342]. Note that the coordinate change x ◦ x˜−1 maps from an open domain of Rn to an open domain of Rn, so that the standard concepts of multivariate apply. For details, see [3, §3.1.1] or [36, §18.8]. Depending on the context, we will write x(p) for the value of a coordinate chart at p and also x ∈ Rn for a point in Rn. Of special importance to numerical applications are embedded submanifolds in the Euclidean space. Definition 2.1 (Submanifolds of Rn+d). A parameterization is an bijective differentiable function f : Rn ⊃ D → f(D) ⊂ Rn+d with continuous inverse such that (n+d)×n its Jacobi matrix Dfx ∈ R has full rank n at every point x ∈ D. 5 A subset M ⊂ Rn+d is called an n-dimensional embedded of Rn+d, if n+d for every p ∈ M, there exists an open neighborhood Ω ⊂ R such that Dp := M∩Ω is the of a parameterization

n n+d f : R ⊃ Dx → f(Dx) = Dp = M ∩ Ω ⊂ R .

One can show that if f : D → M ∩ Ω and f˜ : D˜ → M ∩ Ω˜ are two parameterizations, ˜ say with f(x0) = f(˜x0) = p ∈ M ∩ Ω ∩ Ω,˜ then   f −1 ◦ f˜ : f˜−1(Ω ∩ Ω)˜ → f −1(Ω ∩ Ω)˜ is a diffeomorphism (between open sets in Rn). In this sense, parameterizations f are the inverses of coordinate charts x. In addition to coordinate charts and param- eterizations, submanifolds can be characterized via equality constraints. This fact is due to the theorem of classical multivariate calculus [61, §I.5]. For details, see [36, Thm. 18.7, p. 497]. Theorem 2.2 ([36, Prop. 18.7, p. 500]). Let h : Rn+d ⊃ Ω → Rd be differentiable d d×(n+d) and c0 ∈ R be defined such that the differential Dhp ∈ R has maximum possible rank d at every point p ∈ Ω with h(p) = c0. Then, the preimage

−1 h (c0) = {p ∈ Ω| h(p) = c0} is an n-dimensional submanifold of Rn+d. An obvious application of Theorem 2.2 3 2 2 2 to the function h : R → R, (x1, x2, x3) 7→ x1 + x2 + x3 − 1 establishes the unit S2 = h−1(0) as a 2-dimensional submanifold of R2+1. As a more sophisticated example, we recognize the orthogonal group as a differentiable (sub)-manifold: 2 Example 1. Consider the orthogonal group O(n) ⊂ Rn×n ' Rn and the set of symmetric matrices sym(n) ' Rn(n+1)/2. Define h : Rn×n → sym(n),A 7→ AT A − I. T T Then DhA(B) = A B + B A. For Q ∈ O(n), the differential is indeed surjective: 1 1 T 1 T T For any M ∈ sym(n), it holds DhQ( 2 QM) = 2 Q QM + 2 M Q Q = M. As a 2 1 consequence, the orthogonal group O(n) is a submanifold of dimension n − 2 (n(n + 1 n×n 1)) = 2 (n(n − 1)) of the Euclidean matrix space R . 2.1. Intrinsic and extrinsic coordinates.. As a rule, numerical data pro- cessing on manifolds requires calculations in explicit coordinates. For differentiable submanifolds, we distinguish between two types: extrinsic and intrinsic coordinates. Extrinsic coordinates address points on a submanifold M ⊆ Rn with respect to their coordinates in the Rn, while intrinsic coordinates are with respect to the local parameterizations. Hence, extrinsic coordinates are what an outside observer would see, while intrinsic coordinates correspond to the perspective of an observer that resides on the manifold. Let’s exemplify these concepts on the two-dimensional S2, embedded in R3. As a point set, the sphere is defined by the equation

2 T 3 2 2 2 S = {(x1, x2, x3) ∈ R | x1 + x2 + x3 = 1}.

T 2 Any three-vector (x1, x2, x3) ∈ S specifies a point on the sphere in extrinsic co- ordinates. However, it is intuitively clear that S2 is intrinsically a two-dimensional object. Indeed, S2 can be parameterized via

sin(α) cos(β) 2 2 2 3 f : R ⊃ [0, 2π) → S ⊂ R , (α, β) 7→ sin(α) sin(β) . cos(α) 6 The parameter vector (α, β) ∈ R2 specifies a point on S2 in intrinsic coordinates. Even though intrinsic coordinates directly reflect the dimension of the manifold at hand, they often cannot be calculated explicitly and extrinsic coordinates are the preferred choice in numerical applications [33, §2, p. 305]. Turning back to Example 1 1, we recall that the intrinsic dimension of the orthogonal group is 2 n(n − 1). Yet, in practice, one uses the extrinsic representation with (n × n)-matrices Q, keeping the defining equation QT Q = I in mind. 2.2. Tangent spaces.. We need a few more fundamental concepts. Definition 2.3 ( of a differentiable submanifold). Let M ⊂ Rn+d be an n-dimensional submanifold of Rn+d. The tangent space of M at a point p ∈ M, in symbols TpM, is the space of velocity vectors of differentiable curves c : t 7→ c(t) passing through p, i.e.,

TpM = {c˙(t0)| c : J → M, c(t0) = p}.

Here, J ⊆ R is an arbitrarily small open with t0 ∈ J. It is straightforward to

Fig. 2.1. Visualization of a manifold (curved ) with the tangent space TpM attached. The v =c ˙(0) ∈ TpM is the velocity vector of a c : t 7→ c(t) ∈ M.

show that the tangent space is actually a . Moreover, the tangent space can be characterized both with respect to intrinsic and extrinsic coordinates. Theorem 2.4 (Tangent space, intrinsic characterization). Let M ⊂ Rn+d be an n-dimensional submanifold of Rn+d and let f : Rn ⊇ D → f(D) ⊆ M be a parameterization. Then, for x ∈ D with p = f(x) ∈ M, it holds

TpM = ran(Dfx).

Theorem 2.5 (Tangent space, extrinsic characterization). Let h : Rn+d ⊃ Ω → d d −1 n+d R and c0 ∈ R be as in Theorem 2.2 and let M := h (c0) ⊂ R . Then, for 7 p ∈ M, it holds

TpM = ker(Dhp). Note that both Theorem 2.4 and Theorem 2.5 immediately show that the tangent space TpM is a vector space of the same dimension n as the manifold M. Example 2. The tangent space of the orthogonal group O(n) at a point Q0 is

n×n T T TQ0 O(n) = {∆ ∈ R | ∆ Q0 = −Q0 ∆}. This fact can be established via considering a matrix curve Q : t 7→ Q(t) with Q(0) = ˙ Q0 and velocity vector ∆ = Q(0) ∈ TQ0 O(n). Then, d d 0 = | I = | QT (t)Q(t) = ∆T Q + QT ∆. dt t=0 dt t=0 0 0 T T (The claim follows by counting the dimension of the subspace {∆ Q0 = −Q0 ∆}.) As an alternative, we can consider h : Rn×n → sym,A 7→ AT A − I as in Example 1. T T Then DhQ0 (∆) = Q0 ∆ + ∆ Q0 and TQ0 O(n) = ker(DhQ0 ). 2.3. and the Riemannian function. One of the most important problems in both general differential geometry and data processing on manifolds is to determine the shortest between two points on a given manifold. This requires to measure the of curves. Recall that the of a n R b curve c :[a, b] → R in the Euclidean space is L(c) = a kc˙(t)kdt. In order to transfer this to the manifold setting, an inner product for tangent vectors is needed that is consistent with the manifold structure. Definition 2.6 (Riemannian metrics). Let M be a differentiable submanifold n+d of R .A Riemannian metric on M is a family (h·, ·ip)p∈M of inner products h·, ·ip : TpM × TpM → R that is smooth in variations of the base point p. p 3 The length of a tangent vector v ∈ TpM is kvkp := hv, vip. The length of a curve c :[a, b] → M is defined as

Z b Z b q L(c) = kc˙(t)kc(t)dt = hc˙(t), c˙(t)ic(t)dt. a a

A curve is said to be parameterized by the , if L(c|[a,t]) = t − a for all t ∈ [a, b]. Obviously, unit-speed curves with kc˙(t)kc(t) ≡ 1 are parameterized by the arc length. Constant-speed curves with kc˙(t)kc(t) ≡ ν0 are parameterized proportional to the arc length. The Riemannian distance between two points p, q ∈ M with respect to a given metric is

distM(p, q) = inf{L(c)|c :[a, b] → M piecewise smooth, c(a) = p, c(b) = q}, (2.1) where, by convention, inf{∅} = ∞. Hence, a shortest path between p, q ∈ M is a curve c that connects p and q such that L(c) = distM(p, q). In general, shortest paths on M do not exist.4 Yet, candidates for shortest curves between points that

3 pp P p This notation should not be confused with the classical p-norm i |vi| . 4 2,∗ 2 Consider R = R \{(0, 0)} with the Euclidean inner product. There is no shortest connection 2,∗ 2,∗ from (−1, 0) to (1, 0) on R . A sequence of curves that is in R and converges to the curve 2 c :[−1, 1] → R , t 7→ (t, 0) is readily constructed. Hence, the Riemannian distance between (−1, 0) and (1, 0) is 2. Yet, every curve connecting these points must go around the origin. The length- 2,∗ minimizing curve of length 2 crosses the origin and is thus not an admissible curve on R . 8 are sufficiently close to each other can be obtained via a variational principle: Given a parametric family of suitably regular curves cs : t 7→ cs(t) ∈ M, s ∈ (−ε, ε) that connect the same fixed endpoints cs(a) = p and cs(b) = q for all s, one can consider the length s 7→ L(cs). A curve c = c0 is a first-order candidate for a shortest path between p and q, if it is a critical point of the length functional, i.e., d if ds |s=0L(cs) = 0. Such curves are called geodesics. Differentiating the length func- tional leads to the so-called first variation formula [62, §6], which, in turn, leads to the characterizing equation for geodesics: Definition 2.7 (Geodesics). A differentiable curve c :[a, b] → M is called a geodesic (w.r.t. to a given Riemannian metric), if the covariant of its velocity vector field vanishes, i.e.,

Dc˙ (t) = 0 ∀t ∈ [a, b]. (2.2) dt

Remark 1. If a starting point c(0) = p ∈ M and a starting velocity c˙(0) = v ∈ TpM are specified, then the geodesic equation (2.2) translates to an initial value problem of second order with guaranteed existence and uniqueness of local solutions, [3, p. 102]. An immediate consequence of (2.2) is that geodesics are constant- D speed curves. A formal introduction of the dt along a curve is beyond the scope of this contribution, and the interested reader is referred to, e.g., [62, §4, §5]. To get some intuition, we introduce this concept for embedded Riemannian submanifolds M ⊂ Rn+d, where the metric is the Euclidean metric of Rn+d restricted to the , see also [36, §20.12]: A vector field along a curve c :[a, b] → M is a differentiable v :[a, b] → Rn+d 5 n+d such that v(t) ∈ Tc(t)M. For every p ∈ M, the ambient R decomposes into an orthogonal direct sum

n+d ⊥ R = TpM ⊕ TpM ,

⊥ where TpM is the orthogonal of TpM and orthogonality is w.r.t. the n+d n+d standard Euclidean inner product on R . Let Πp : R → TpM be the (base point-dependent) orthogonal projection onto the tangent space at p. In this setting (and only in this), the covariant derivative of a vector field v(t) along a curve c(t) is Dv the tangent component ofv ˙(t), i.e., dt (t) = Πc(t)(v ˙(t)). As a consequence,

Dc˙ (t) = Π (¨c(t)) (2.3) dt c(t)

and the geodesics on Riemannian submanifolds with the metric induced by the ambi- ent Euclidean inner product are precisely the constant-speed curves with acceleration ⊥ vectors orthogonal to the corresponding tangent spaces, i.e.,c ¨(t) ∈ Tc(t)M . Example: On the unit sphere S2 ⊂ R3, the geodesics are great . When con- sidered as curves in the ambient R3, their acceleration vector points directly to the origin and is thus orthogonal to the corresponding tangent space. When viewed as entities of S2, these curves do not experience any acceleration at all.

5The prime example for such a vector field is the curve’s own velocity field v(t) =c ˙(t). 9 c˙(t) c¨(t)

Mind that a constant-speed curve in Rn changes its direction only, when it experiences a non-zero acceleration. In this sense, geodesics on manifolds are the counterparts to straight lines in the Euclidean space. In general, a covariant derivative, also known as a linear connection, is a bilinear mapping (X,Y ) 7→ ∇X Y that maps two vector fields X,Y to a third vector field ∇X Y in such a way that it can be interpreted as the of Y in the direction of X. Of importance is the Riemannian connection or Levi-Civita connection that is compatible with a Riemannian metric [3, Thm 5.3.1], [62, Thm 5.4]. It is determined uniquely by the Koszul formula

2h∇X Y,Zi = X(hY,Zi) + Y (hZ,Xi) − Z(hX,Y i) −hX, [Y,Z]i − hY, [X,Z]i + hZ, [X,Y ]i and is used to define the Riemannian

6 (X,Y,Z) 7→ R(X,Y )Z = ∇X ∇Y Z − ∇Y ∇X Z − ∇[X,Y ]Z.

A is flat if and only if it is locally isometric to the Euclidean space, which holds if and only if the Riemannian curvature tensor vanishes identically [62, Thm. 7.3]. Hence, ‘flatness’ depends on the Riemannian metric. 2.4. coordinates.. The local uniqueness and existence of geodesics allows us to map a tangent vector v ∈ TpM to the endpoint of a geodesic that starts from p ∈ M with velocity v. Formalizing this principle gives rise to the Riemannian exponential

M M Expp : TpM ⊃ Bε(0) → M, v 7→ q := Expp (v) := cp,v(1). (2.4)

Here, t 7→ cp,v(t) is the geodesic that starts from p with velocity v and Bε(0) ⊂ TpM is the open with radius ε and center 0 in the tangent space7, see Fig. 2.2. Note that we can restrict the considerations to unit-speed geodesics via

 v  ExpM(v) := c (1) = c (t ) = ExpM t , p p,v p,v/kvk v p v kvk

where tv = kvk, see [62, §5., p. 72 ff.] for the details. For ε > 0 small enough, the Riemannian exponential is a smooth diffeomorphism between Bε(0) and an open domain on Dp ⊂ M around the point p. Hence, it is invertible. The smooth inverse map is called the Riemannian logarithm and is denoted by

M M −1 Logp : M ⊃ Dp → Bε(0) ⊂ TpM, q 7→ v := (Expp ) (q), (2.5)

6In these formulae, [X,Y ] = X(Y ) − Y (X) is the Lie bracket of two vector fields. 7 For technical reasons, ε > 0 must be chosen small enough such that cp,v(t) is defined on the [0, 1]. 10 Fig. 2.2. The Riemannian exponential sends tangent vectors to point of geodesic curves.

where v satisfies cp,v(1) = q. Thus, the Riemannian logarithm is associated with the geodesic endpoint problem: Given p, q ∈ M, find a geodesic that connects p and q. The Riemannian exponential map establishes a local parametrization of a small region around a location p ∈ M in terms of coordinates of the flat vector space TpM. This is referred to as representing the manifold in normal coordinates [57, §III.8], [62, Lem. 5.10]. Normal coordinates are radially isometric in the sense that the Riemannian distance between p and q = M Expp (v) is exactly the same as the length of the tangent vector kvkp as measured in the metric on TpM, provided that v is contained in a neighborhood of 0 ∈ TpM, where the exponential is invertible, [62, Lem. 5.10 & Cor. 6.11]. Mind that the definition of the Riemannian exponential depends on the geodesics, which, in turn, depend on the chosen Riemannian metric – via Definition 2.6. Different metrics lead to different geodesics and thus to different exponential and logarithm maps. 2.5. Matrix Lie groups and quotients by group actions. In general, a is a differentiable manifold G which also has a group structure, such that the group operations ‘multiplication’ and ‘inversion’,

G × G 3 (g, g˜) 7→ g · g˜ ∈ G and G 3 g 7→ g−1 ∈ G are both smooth [36, 43, 38]. A matrix Lie group G is a subgroup of GL(n, C) that is closed in GL(n, C).8 This definition already implies that G is an embedded sub- manifold of Cn×n [43, Corollary 3.45]. Not all matrix groups are Lie groups and not all Lie groups are matrix Lie groups, see [43, §1.1 and §4.8]. However, matrix Lie groups are arguably the most important class of Lie groups when it comes to practical applications and this exposition is restricted to this subclass. Let G be an arbitrary matrix Lie group. When endowed with the bracket operator or matrix commutator [V,W ] = VW − WV , the tangent space TI G at the identity

8 n×n but not necessarily in C . 11 is called the associated with the Lie group G, see [43, §3]. As such, it is denoted by g = TI G. For any A ∈ G, the function “left-multiplication with A” is a diffeomorphism LA : G → G,LA(B) = AB; its differential at a point M ∈ G is the isomporphism d(LA)M : TM G → TLA(M)G, d(LA)M (V ) = AV . Using this observation at M = I shows that the tangent space at an arbitrary location A ∈ G is given by the translates (by left-multiplication) of the tangent space at the identity:

 n×n TAG = TLA(I)G = Ag = ∆ = AV ∈ R | V ∈ g , (2.6)

[38, §5.6, p. 160]. The Lie algebra g = TI G of G can equivalently be characterized as the set of all matrices ∆ such that expm(t∆) ∈ G for all t ∈ R. The intuition behind this fact is that all tangent vectors are velocity vectors of smooth curves running on G (Definition 2.3) and that c(t) = expm(t∆) is a smooth curve starting from c(0) = I with velocityc ˙(0) = ∆, see [43, Def. 3.18 & Cor. 3.46] for the details. By definition, the exponential map9 for a matrix Lie group is the matrix exponential restricted to the corresponding Lie algebra, i.e. the tangent space at the identity g = TI G, [43, §3.7],

expm |g : g → G. In general, a Lie algebra is a vector space with a linear, skew-symmetric bracket operation, called Lie bracket [·, ·] that satisfies the Jacobi identity.

[X, [Y,Z]] + [Z, [X,Y ]] + [Y, [Z,X]] = 0.

Quotients of Lie groups by closed subgroups. In many settings, it is important or sometimes even necessary to consider certain points p, q on a given differentiable manifold M as equivalent. Consider the following example. n×r T Example 3. Let U ∈ R feature orthonormal columns so that U U = Ir. We may extend the columns of U = (u1, . . . , ur) to an orthogonal matrix Q = I 0   (u1, . . . , ur, ur+1, . . . , un) ∈ O(n). Let I × O(n − r) := r | R ∈ O(n − r) . r 0 R This is actually a closed subgroup of O(n), in symbols (Ir × O(n − r)) ≤ O(n). The action Q˜ = QΦ with any orthogonal matrix Φ ∈ Ir × O(n − r) preserves the first r columns of Q. Hence, we may identify U with the [Q] = {QΦ|Φ ∈ Ir × O(n − r)} ⊂ O(n). In Sections 4.4 and 4.5, we will see that this example estab- lishes the Stiefel manifold of ONBs and eventually also the Grassmann manifold of subspaces as quotients of the orthogonal group O(n). Note that in the example, the equivalence relation is induced by actions of the Lie group Ir × O(n − r). Quotients that arise from such group actions are important examples of quotient manifolds. The following Theorems 2.9 and 2.11 cover this example as well as all other cases of quotient manifolds that are featured in this work. First, group actions need to be formalized. Definition 2.8. (cf. [63, p. 162,163]) Let G be a Lie group, M be a smooth manifold, and let G × M → M, (g, p) 7→ g · p be a left action of G on M.10 The orbit relation on M induced by G is defined by

p ' q :⇔ ∃g ∈ G : g · p = q.

9The exponential map of a Lie group must not be confused with the Riemannian exponential. 10The theory for right actions is analogous. In all cases considered in this work, M is a matrix manifold so that “·” is the usual matrix product. 12 The equivalence classes are the G-orbits [p] := Gp := {g · p| g ∈ G}. The orbit space is denoted by M/G := {[p]| p ∈ M}. The quotient map sends a point to its G- orbit via Π: M → M/G, p 7→ [p]. The action is free, if every group Gp := {g ∈ G| g · p = p} is trivial, Gp = {e}. Theorem 2.9. (Quotient Manifold Theorem, cf. [63, Thm. 21.10]) Suppose G is a Lie group acting smoothly, freely, and properly on a smooth manifold M. Then the orbit space M/G is a manifold of dimension dim M − dim G, and has a unique such that the quotient map Π: M → M/G, p 7→ [p] is a smooth .11 In this context, M is called the total space and M/G is the quotient (space). A special case is Lie groups under actions of Lie subgroups. Definition 2.10. [63, §21, p. 551] Let G be a Lie group and H ≤ G be a Lie subgroup. For g ∈ G, a subset of G of the form [g] := gH = {g · h| h ∈ H} is called a left coset of H. The left cosets form a partition of G, and the quotient space determined by this partition is called the left coset space of G modulo H, and is denoted by G/H. Coset spaces of Lie groups are again smooth manifolds: Theorem 2.11. (cf. [63, Thm 21.17, p. 551]) Let G be a Lie group and let H be a closed subgroup of G. The left coset space G/H is a manifold of dimension dim G − dim H with a unique differentiable structure such that the quotient map Π: G → G/H, g 7→ [g] is a smooth submersion. In general, if π : M → N is a surjective submersion between two manifolds M and N , then for any q ∈ N , the −1 the preimage π (q) ⊂ M is called the fiber over q, and is denoted by Mq. Each fiber Mq is itself a closed, embedded submanifold by the theorem. M If M has a Riemannian metric h·, ·ip , then at each point p ∈ M, the tangent space ⊥ TpM decomposes into an orthogonal direct sum TpM = TpMπ(p) ⊕ (TpMπ(p)) . The tangent space of the fiber TpMπ(p) =: Vp is the called the vertical space, its ⊥ orthogonal complement Hp := Vp is the horizontal space. The vertical space is the kernel Vp = ker(dπp) of the differential dπp : TpM → Tπ(p)N ; the horizontal space is ∼ isomorphic to Tπ(p)N . This allows to identify Hp = Tπ(p)N , see [3, Fig. 3.8., p. 44] for an illustration. This construction helps to compute tangent spaces of quotients, if the tangent space of the total space is known. If G/H is a quotient as in Theorem 2.9 or 2.11 and if Π : G → G/H is the corresponding quotient map, then Π is a local diffeomorphism. A Riemannian metric on the quotient can be defined by

G/H −1 −1 G hv, wi[g] := h(dΠg) (v), (dΠg) (w)ig , v, w ∈ T[g](G/H). (2.7)

For this (and only this) metric, the quotient map is a local . In fact, Theorem 2.11 additionally establishes G/H as a , i.e. a smooth manifold M endowed with a transitive smooth action by a Lie group (cf. [63, §21, p. 550]). In the setting of the theorem, the is given by the left action of G on G/H given by g1 · [] := [g1 · g2]. A transitive action allows us to transport a location p ∈ M to any other location q ∈ M. 3. Interpolation on non-flat manifolds. When working with matrix mani- folds, the data is usually given in extrinsic coordinates, see Section 2. For example, n×r T data on the compact Stiefel manifold St(n, r) = {U ∈ R |U U = Ir}, r ≤ n, is given in form of n-by-r matrices. These matrices feature nr entries while the in- trinsic number of degrees of freedom, i.e., the intrinsic dimension is turns out to be

11i.e. a smooth surjective mapping such that the differential is surjective at every point. 13 1 nr − 2 r(r + 1), see Section 4.4. Essentially, the practical obstacle associated with data interpolation on matrix manifolds arises from this fact. Given, say, k matrices on St(n, r) in extrinsic coordinates, interpolating entry-by-entry will most certainly lead to interpolants that do not feature orthogonal columns and thus are not points on the Stiefel manifold. Likewise, entry-by-entry interpolation of positive definite matrices is not guaranteed to produce another positive definite matrix. There are essentially two different approaches to address this issue: Performing the interpolation on the tangent space of the manifold and using the Riemannian barycenter or Riemannian center of mass as an interpolant. Both will be explained in more detail in the next two subsections.12 3.1. Interpolation in normal coordinates. As outlined in Section 2, every location p ∈ M on an n-dimensional differentiable manifold features a small neigh- n borhood Dp that is the domain of a coordinate chart x : M ⊃ Dp → Dx(p) ⊂ R n that maps bijectively onto an Dx(p) ⊂ R . Therefore, for a sample data set {p1, . . . , pk} ⊂ Dp that is completely contained in the domain of a single coordinate chart x, interpolation can be performed as follows: 1. Map the data set to Dx(p): Calculate v1 = x(p1), . . . , vk = x(pk) ∈ Dx(p). ∗ 2. Interpolate in Dx(p) to produce the interpolant v ∈ Dx(p). ∗ −1 ∗ 3. Map back to manifold: compute p = x (v ) ∈ Dp. In principle, any coordinate chart may be applied. In practice, the challenge is to find a suitable coordinate chart that can be evaluated efficiently. Moreover, it is desirable that the chosen chart preserves the geometry of the original data set as well as possible.13 The standard choice is to use normal coordinates as introduced in Section 2.4. This means that the Riemannian logarithm is used as the coordinate chart

M Logp : M ⊃ Dp → Bε(0) ⊂ TpM with the Riemannian exponential

M Expp : TpM ⊃ Bε(0) → Dp ⊂ M as the corresponding parameterization. The general procedure of data interpolation via the tangent space is formulated as Algorithm 1.

Algorithm 1 Interpolation in normal coordinates.

Input: Data set {p1, . . . , pk} ⊂ M. 1: Choose pi ∈ {p1, . . . , pk} as a base point. M 2: Check that Log (p ) is well-defined for all j = 1, . . . , k. pi j 3: for j = 1, . . . , k do M 4: Compute v := Log (p ) ∈ T M. j pi j p 5: end for ∗ 6: Compute v via Euclidean interpolation of {v1, . . . , vk}. M 7: Compute p∗ := Exp (v∗) pi Output: p∗ ∈ M.

Remark 2. There are a few facts that the practitioner needs to be aware of:

12German speaking readers may find an introduction that addresses a general scientific audience in [89]. 13There are no isometric coordinate charts on a non-flat manifold, see [62, Thm 7.3]. 14 1. The interpolation procedure of Algorithm 1 depends on which sample point is selected to act as the base point. Different choices may lead to different interpolants.14 2. For matrix manifolds, the tangent space is often also given in extrinsic coor- dinates. This means that an entry-by-entry interpolation of the matrices that represent the tangent vectors may lead to an interpolant that is not in the tan- gent space. As an illustrative example, consider the Gr(n, r). T Matrices ∆1,..., ∆k ∈ T[U]Gr(n, r) are characterized by U ∆j = 0. Entry- by-entry interpolation in the tangent space may potentially result in a matrix ∆∗ that is not orthogonal to the base point U, i.e. U T ∆∗ 6= 0, see [100, §2.4]. In general, because of the vector space structure of the tangent space of any manifold M, it is sufficient to use an interpolation method that expresses the interpolant in TpM as a weighted linear combination of the sampled tangent vectors v1, . . . , vk ∈ TpM

k ∗ X v = ωjvj. j=1

Amongst others, linear interpolation, Lagrange and Hermite interpolation, spline interpolation and interpolation via radial basis functions fulfill this requirement. As an aside, the interpolation procedure is computationally less expensive, since it works on the weight coefficients ωj rather than on every single entry. Quasi-linear interpolation of trajectories via geodesics. In this paragraph, we ad- dress applications, where the sampled manifold data features a univariate parametric dependency. The setting is as follows. Let M be a Riemannian manifold and suppose that there is a trajectory

c :[a, b] → M, µ 7→ c(µ)

on M that is sampled at k instants µ1, . . . , µk ∈ [a, b]. Then, an interpolantc ˆ for c can be computed via Algorithm 2. The interpolants at µ ∈ [µj, µj+1] that are output

Algorithm 2 Geodesic interpolation

Input: Data set {c(µ1), . . . , c(µk)} ⊂ M sampled from a curve c : µ → c(µ), unsam- ∗ pled instant µ ∈ [µj, µj+1]. M 1: Compute v := Log (c(µ )) ∈ T M. j+1 c(µj ) j+1 c(µj ) ∗ ∗ M  µ −µj  2: Computec ˆ(µ ) := Exp vj+1 c(µj ) µj+1−µj Output: cˆ(µ∗) ∈ M interpolant of c(µ∗).

by Algorithm 2 lie on the unique geodesic connection between the points c(µj) and c(µj+1). Hence, it is the straightforward manifold analogue of linear interpolation and is base-point independent. The generic formulation of Algorithm 1 allows to employ higher-order interpola- tion methods. However, this does not necessarily lead to more accurate results: the overall error depends not only on the interpolation error within the tangent space but also on the distortion caused by mapping the data to a selected (fixed) tangent space, 15 Fig. 3.1. Illustration of the course of action of Algorithms 1 and 2. Algorithm 1 (right) first maps all data points to a selected fixed tangent space. In Algorithm 2 (left), two points pj = c(µj ) and pj+1 = c(µj+1) are connected by a geodesic , then the base is shifted to point pj+1 and the procedure is repeated. see Fig. 3.1. Algorithms 1 and 2 can be applied in practical applications, where the Riemannian exponential and logarithm mappings are known in explicit form. Applications in parametric model reduction that consider matrix manifolds include [31] (GL(n)-data), [8, 73, 100] (Grassmann-data), [104] (Stiefel data) and [9, 81] (SPD(n)-data). 3.2. Interpolation via the Riemannian center of mass. As pointed out in Remark 2, interpolation of manifold data via the back and forth mapping of a complete data set of sample points between the manifold and its tangent space depends on the chosen base point. As a consequence, sample points may experience an uneven distortion under the projection onto the tangent space, see Fig. 3.1 (right). An approach that avoids this issue is to interpret interpolation as the task of finding suitably weighted Riemannian centers of mass. This concept was introduced in the context of geodesic finite elements in [90, 41]. The idea is as follows: The Riemannian center of mass15 or Fr´echet mean of a sample data set {p1, . . . , pk} ∈ M on a manifold with respect to the weights Pk wi ≤ 0, i=0 wi = 1 is defined as the minimizer(s) of the Riemannian objective function k 1 X M 3 q 7→ f(q) = w dist(q, p )2, 2 i i i=1 where dist(q, pi) is the Riemannian distance of (2.1). This definition generalizes the notion of the barycentric mean in Euclidean spaces. However, on curved manifolds, the global center might not be unique. Moreover, local minimizers may appear. For more details, see [55] and [4], which also give uniqueness criteria. Interpolation is now performed by computing weighted Riemannian centers. More d precisely, let µ1, . . . , µk ⊂ R be sampled parameter locations and let pi = p(µi) ∈ M, i = 1, . . . , k be the corresponding sample locations on M. Interpolation is within the d convex hull conv{µ1, . . . , µk} ⊂ R of the samples. Let {ϕi : µ 7→ ϕi(µ)|i = 1, . . . , k} be a suitable set of interpolation functions with ϕi(µj) = δij, say Lagrangians [90], splines [41] or radial basis functions [23].

14In the practical applications considered in [8], it was observed that the base point selection has only a minor impact on the final result. 15Here, we introduce this for discrete data sets; for centers w.r.t. a general mass distribution, see the original paper [55], Section 1. 16 Then, the interpolant p∗ ≈ p(µ∗) ∈ M at an unsampled parameter location µ∗ ∈ conv{µ1, . . . , µk} is defined as the minimizer of

k 1 X p∗ = arg min f(q) = ϕ (µ∗) dist(q, p )2. (3.1) 2 i i q∈M i=1

At a sample location µj, one has indeed that

k k X 2 X 2 2 ϕi(µj) dist(q, pi) = δij dist(q, pi) = dist(q, pj) , i=1 i=1

which has the unique global minimum at q = pj. Computing p∗ requires to solve a Riemannian optimization problem. The simplest approach is a descent method [4, 3]. The gradient of the objective function f in (3.1) is

k X ∗ M ∇fq = − ϕi(µ ) Logq (pi) ∈ TqM. (3.2) i=1 see [55, Thm 1.2], [4, §2.1.5], [90, eq. (2.4)]. Hence, just like interpolation in the tangent space, the interpolation via the Riemannian center can be pursued only in applications, where the Riemannian logarithm can be computed. A generic gradient descent algorithm to compute the barycentric interpolant for a function p : Rd 3 µ 7→ p(µ) ∈ M reads as follows. An implementation of this (type of) method for finding

Algorithm 3 Interpolation via the weighted Riemannian center [83, 4].

Input: Sample data set {p1 = p(µ1), . . . , pk = p(µk)} ⊂ M, unsampled parameter ∗ d location µ ∈ conv(µ1, . . . , µk) ⊂ R , initial guess q0, convergence threshold τ. 1: k := 0

2: Compute ∇fqk according to (3.2)

3: while k∇fqk kq > τ do 4: select a step size αk M 5: q := Exp (−α ∇f ) k+1 qk k qk 6: k := k + 1 7: end while ∗ ∗ Output: p := qk ∈ M interpolant of p(µ ).

the Karcher mean in SO(3) is discussed in [83]. Of course, Riemannian analogues to more sophisticated nonlinear optimization methods may also be employed, see [3]. In the context of model reduction, the benefits of interpolation via weighted Rie- mannian centers and the computational costs of solving the associated Riemannian optimization problem must be juxtaposed. 3.3. Additional approaches. A large variety of sophistications and further manifold interpolation techniques exists in the literature: The acceleration-minimi- zing property of cubic splines in the Euclidean space can be generalized to Riemannian manifolds in form of a variational problem [74, 30, 24, 93, 21, 87, 54], see also [80] and references therein. Moreover, the construction concepts of B´eziercurves and the De Casteljau-algorithm [15] can be transferred to Riemannian manifolds [80, 59, 72, 1, 88]. 17 B´eziercurves in Euclidean spaces are polynomial splines that rely on a number of so- called control points. To obtain the value of a B´eziercurve at time t, a recursive sequence of straight-line convex combinations between pairs of control points must be computed. The transition of this technique to Riemannian manifolds is via replacing the inherent straight lines with geodesics [80]. Another option is to conduct the B´ezier/DeCasteljau-algorithm in the tangent space and to transfer the results to the manifold via a geodesic averaging of the spline arcs that were constructed in the tangent spaces at the first and the last control point, respectively, see [40]. Derivative information may also be incorporated in interpolation schemes on Rie- mannian manifolds. A Hermite-type method that is specifically tailored for interpola- tion problems on the Grassmann manifold is sketched in [7, §3.7.4]. General Hermitian manifold interpolation in compact, connected Lie groups with a bi-invariant metric has been considered in [52]. A practical approach to conduct first-order Hermite interpolation of data on arbitrary Riemannian manifolds is discussed in [103]. 3.4. Quasi-linear extrapolation on matrix manifolds. In application sce- narios, where both snapshot data of the full-order model and derivative information are at hand, various approaches have been suggested to exploit the latter. On the one hand, can be used for improving the ROMs accuracy and approximation quality by constructing POD bases that incorporate snapshots and snapshot deriva- tives [25, 48, 51, 99]. On the other hand, snapshot derivatives enable to parameterize the ROM bases and subspaces or to perform sensitivity analyses [97, 45, 44, 101]. In this section, we outline an approach to transfer the idea of extrapolation and param- eterization via local linearizations to manifold-valued functions. The underlying idea is comparable to the trajectory piece-wise linear (TPWL) method [84]. Yet, TPWL linearizes the full-order model prior to the ROM projection, whereas here, we consider linearizing ROM building blocks like the reduced orthogonal bases, reduced subspaces or reduced system matrices. A geometric first-order Taylor approximation. Any differentiable function f : Rn → Rn can be linearized via a first-order Taylor expansion. A step ahead of size t n 2 in direction d ∈ R gives f(x0 + td) = f(x0) + tDfx0 (d) + O(t ). When considering t 7→ c(t) := f(x0 + td) as a curve, then the first-order Taylor approximant is the straight line g : t 7→ c(0) +c ˙(0)t. 
Such first order linearization often serves for extrapolating a given nonlinear function in a neighborhood of a selected expansion point. For doing so, the starting point c(0) and the starting velocityc ˙(0) must be available. This procedure translates to the manifold setting, when straight lines are replaced with geodesics. Let µ ∈ R be a scalar parameter and let c : µ 7→ c(µ) ∈ M be a curve on a

submanifold M. For given initial values c(µ0) = p0 ∈ M andc ˙(µ0) = v0 ∈ Tp0 M,

the corresponding unique geodesic cp0,v0 is expressed via the Riemannian exponential as

c : µ → M, µ 7→ ExpM(µv ). p0,v0 p0 0

Example: Extrapolating POD basis matrices. As outlined in Section 1.1, snap- 1 m shot POD works by collecting state vector snapshots, x := x(t1, µ0), ..., x := n 1 m x(tm, µ0)} ∈ R followed by an SVD of the snapshot matrix x , ..., x (µ0) =: T n×m S(µ0) = U(µ0)Σ(µ0)Z (µ0). Here, the matrix dimensions are U(µ0) ∈ R , Σ(µ0) ∈ m×m m×m R , Z(µ0) ∈ R . The objective is to approximate U(µ0 + µ) for a small µ > 0 18 Algorithm 4 Geodesic extrapolation.

Input: Scalar parameter µ0 ∈ R, initial values c(µ0) ∈ M, c˙(µ0) ∈ Tc(µ0)M sampled from a curve c : µ → c(µ) ∈ M, parameter value µ∗ > 0. M 1: Computec ˆ(µ + µ∗) := Exp (µ∗c˙(µ )) 0 c(µ0) 0 ∗ ∗ Output: cˆ(µ0 + µ ) ∈ M extrapolant of c(µ0 + µ ).

˙ based on the data U(µ0), U(µ0), where U(µ0) is a point on the Stiefel manifold St(n, m) ˙ and U(µ0) is a tangent vector, see Section 4.4.1. Differentiating the SVD. If the snapshot matrix function µ 7→ S(µ) ∈ Rn×m is smooth in the neighborhood of µ0 ∈ R and if the singular values of S(µ0) are mutually distinct16, then the singular values and both the left and the right singular vectors are differentiable in µ ∈ [µ0 − δµ, µ0 + δµ] for δµ small enough. For brevity, let ˙ dS S = dµ (µ0) denote the derivative with respect to µ evaluated in µ0 and so forth. Let µ 7→ S(µ) = U(µ)Σ(µ)Z(µ)T ∈ Rn×m and let C(µ) = (ST S)(µ). Let uj and vj, j = 1, . . . , m denote the columns of U(µ0) and Z(µ0), respectively. It holds j T ˙ j σ˙ j = (u ) Sv , (j = 1, . . . , m), (3.3) ( j T i i T j σj (u ) ˙v +σi(u ) ˙v ˙ S S , i 6= j Z = ZA, where Aij = (σj +σi)(σj −σi) (i, j = 1, . . . , m), (3.4) 0, i = j

−1 −1 −1   −1 U˙ = SZ˙ Σ + SZ˙ Σ + SZΣ˙ = SZ˙ + U(ΣA − Σ)˙ Σ . (3.5)

T ˙ A proof can be found in [45]. Note that U (µ0)U(µ0) is skew-symmetric so that indeed ˙ (µ ) =: ∆(µ ) ∈ T St(n, m). The above equations hold in approximative U 0 0 U(µ0) form for the truncated SVD. For convenience, assume that U(µ0) ∈ St(n, r) is now the truncated to r ≤ m columns. ˙ Performing the Taylor extrapolation on St(n, r). With U(µ0), U(µ0) at hand, ˆ U(µ0 +µ) can be approximated using the Stiefel exponential: U(µ0 +µ) ≈ U(µ0 +µ) := St ˙ Exp (µ (µ0)), see Algorithm 7.The process is illustrated in Fig. 3.2. U0 U Note that when the µ-dependency is real-analytic, then the Euclidean Taylor expansion

µ2 (µ + µ) = (µ ) + µ ˙ (µ ) + ¨(µ ) + O(µ3) ∈ St(n, r) (3.6) U 0 U 0 U 0 2 U 0 converges to an orthogonal matrix U(µ0 + µ) ∈ St(n, r). Yet, when truncating the Taylor series, we leave the Stiefel manifold. In particular, the columns of the first ˙ order approximation are not orthonormal, i.e. U(µ0) + µU(µ0) ∈/ St(n, r) for µ 6= 0. ˙ By construction, the Stiefel geodesic features the same starting velocity U(µ0) and thus matches the Taylor series up to terms of second order. In addition, it respects the geometric structure of the Stiefel manifold and thus preserves column-orthonormality for every µ. 4. Matrix manifolds of practical importance. In this section, we discuss the matrix manifolds that feature most often in practical applications in the context of model reduction. For each manifold under consideration, we recap, if applicable • the representation of points/locations in numerical schemes.

16This condition can be relaxed, see the results of [5, §7]. 19 Fig. 3.2. Extrapolation of matrix manifold data. Sketched on the right is the sample matrix n×r data in R . The curved line on the left represents the nonlinear matrix manifold; the straight lines represent the tangent vectors in the tangent space. The matrix curve is linearized at U(q0), U(q1), etc.

• the representation of tangent vectors in numerical schemes. • the most common Riemannian metrics. • how to compute , geodesics and the Riemannian exponential and logarithm mappings. 4.1. The . This section is devoted to the general linear group GL(n) of invertible matrices. In model reduction, regular matrices ap- pear for example as (reduced) system matrices in LTI and discretized PDE systems [9, 31, 76] and parameterizations have to be such that matrix regularity is preserved. In addition, the discussion of the seemingly simple matrix manifold GL(n) is impor- tant, because it is the fundamental matrix Lie Group from which all other matrix Lie groups are derived. Moreover, it provides the background for understanding quo- tient spaces of GL(n), see Subsection 2.5 and also [20, 96]. A short summary on the of GL(n) is given in [82, §6]. 4.1.1. Introduction and data representation in numerical schemes. Be- −1 cause GL(n) = det (R \{0}) = {A ∈ Rn×n| det(A) 6= 0}, GL(n) is an open subset 2 of the n2-dimensional vector space Rn×n ' Rn and is thus an n2-dimensional dif- ferentiable manifold, see [63, Examples 1.22–1.27]. The matrix manifold GL(n) is disconnected as it decomposes into two connected components, namely the regular matrices of positive determinant and the regular matrices of negative determinant. Because GL(n) is an open subset of the vector space Rn×n, the tangent space n×n at a location A ∈ GL(n) is simply TAGL(n) = R . For GL(n), the Lie algebra is gl(n) = Rn×n, so that the Lie group exponential is the standard matrix exponential n×n expm : R = gl(n) → GL(n). From the Lie group perspective (2.6), the tangent space at an arbitrary point A ∈ GL(n) is to be considered as the set TAGL(n) = Agl(n) = A(Rn×n), even though this set coincides with Rn×n. 4.1.2. Distances and geodesics. The obvious choice for a Riemannian metric on GL(n) is to use the inner product from the ambient Euclidean matrix space, i.e.,

T h∆, ∆˜ iA = h∆, ∆˜ i0 = trace(∆ ∆)˜ ,

˜ n×n for A ∈ GL(n) and ∆, ∆ ∈ TAGL(n) = R . 20 In many applications, it is more appropriate to consider metrics with certain invariance properties.17 A left-invariant metric can be obtained from the standard metric via

−1 −1 h∆, ∆˜ iA = hA ∆,A ∆˜ i0,A ∈ GL(n), ∆, ∆˜ ∈ TAGL(n). (4.1)

When formally considering ∆ = AV, ∆˜ = AV˜ ∈ TAGL(n) = Agl(n) as left-translates of tangent vectors V, V˜ ∈ TI GL(n) = gl(n), then this metric satisfies h∆, ∆˜ iA = hV, V˜ i0. Alternatively, hV, V˜ i0 = hAV, AV˜ iA, which explains the name ‘left-invariant’. The Riemannian exponential and logarithm for the flat metric. When equipped with the Euclidean metric, GL(n) is flat: since the tangent space is the full matrix space Rn×n, the geodesic equation (2.3) requires the acceleration of a geodesic curve to vanish completely. Hence, the geodesic that starts from A ∈ GL(n) with velocity ∆ ∈ Rn×n is the straight line C(t) = A + t∆. Note that the curve t 7→ C(t) may leave the manifold GL(n) for some t ∈ R as it may hit a matrix with zero determinant. The formulae for the Riemannian exponential and logarithm mapping at a base point A ∈ GL(n) are

GL ˜ ExpA :TAGL(n) ⊃ Bε(0) → GL(n), ∆ 7→ A := A + ∆, (4.2) GL ˜ ˜ LogA : GL(n) → TAGL(n), A 7→ ∆ := (A − A). (4.3)

In (4.2), B_ε(0) denotes a suitably small open neighborhood around 0 ∈ T_A GL(n) ≅ R^{n×n} such that A + ∆ ∈ GL(n) for all ∆ ∈ B_ε(0).

The Riemannian exponential for the left-invariant metric on GL(n). The left-invariant metric induces a non-flat geometry on GL(n). Formulae for the covariant derivatives and the corresponding geodesics are derived in [10, Thm. 2.14]. The counterparts w.r.t. the right-invariant metrics can be found in [96]. Given a base point A ∈ GL(n) and a starting velocity ∆ = AV ∈ T_A GL(n) = A gl(n), the associated geodesic is

Γ_{A,∆} : t ↦ A expm(t V^T) expm(t(V − V^T)).  (4.4)

The Riemannian exponential is

Exp^GL_A(∆) = Γ_{A,∆}(1) = A expm(V^T) expm(V − V^T)
            = A expm((A^{-1}∆)^T) expm((A^{-1}∆) − (A^{-1}∆)^T).  (4.5)

The author is not aware of a closed formula for the inverse map, i.e., the Riemannian logarithm for the left-invariant metric; see also the discussion in [96, §4.5]. The thesis [82, §6.2] introduces a Riemannian shooting method for computing the Riemannian logarithm w.r.t. the left-invariant metric.

An important special case. For tangent vectors ∆ = AV ∈ T_A GL(n) with normal V ∈ R^{n×n}, i.e., V V^T = V^T V, the matrices V^T and (V − V^T) commute. Therefore, according to (A.2), A expm(V^T) expm(V − V^T) = A expm(V^T + V − V^T) = A expm(V), and the Riemannian exponential reduces to

Exp^GL_A : T_A GL(n) ∩ {∆ | A^{-1}∆ normal} → GL(n), ∆ ↦ Ã = A expm(A^{-1}∆).
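For concreteness, the exponential (4.5) can be evaluated with a few lines of Python. The following sketch is not part of the referenced works; the function name is made up for illustration and SciPy's expm is assumed to be available.

import numpy as np
from scipy.linalg import expm

def exp_gl_left_invariant(A, Delta):
    # Riemannian exponential (4.5) w.r.t. the left-invariant metric:
    # Exp_A(Delta) = A expm(V^T) expm(V - V^T) with V = A^{-1} Delta.
    V = np.linalg.solve(A, Delta)
    return A @ expm(V.T) @ expm(V - V.T)

# The result stays in GL(n): det(expm(X)) = e^{trace(X)} > 0, so the
# determinant of the output is a nonzero multiple of det(A).
rng = np.random.default_rng(0)
A = np.eye(3) + 0.1 * rng.standard_normal((3, 3))
Delta = rng.standard_normal((3, 3))
print(np.linalg.det(exp_gl_left_invariant(A, Delta)))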

¹⁷ “Eulerian motion of a rigid body can be described as motion along geodesics in the group of rotations of three-dimensional euclidean space provided with a left-invariant Riemannian metric. A significant part of Euler’s theory depends only upon this invariance, and therefore can be extended to other groups.” [11, Appendix 2, p. 318]

The Riemannian logarithm is

Log^GL_A : D_A ∩ {Ã | A^{-1}Ã normal} → T_A GL(n), Ã ↦ ∆ = A logm(A^{-1}Ã),

where D_A ⊂ GL(n) is a domain such that a suitable branch of the matrix logarithm is well-defined. These expressions are sometimes encountered in the literature as the Riemannian exponential and logarithm mappings. Yet, one should be aware that they hold only under these special circumstances.

4.2. The orthogonal group. This section is devoted to the orthogonal group O(n) ⊂ R^{n×n} of orthogonal n-by-n matrices. In parametric model reduction, such matrices may appear as eigenvector matrices in symmetric EVD problems.

4.2.1. Introduction and data representation in numerical schemes. The orthogonal group is O(n) = {Q ∈ R^{n×n} | QQ^T = I = Q^T Q}. The manifold structure of O(n) can be established via Theorem 2.2, see also Example 1. The orthogonal group decomposes into two connected components, namely the orthogonal matrices with determinant 1 and the orthogonal matrices with determinant −1. The former constitute the special orthogonal group SO(n) = {Q ∈ O(n) | det(Q) = 1}. The orthogonal group is a closed subgroup of the Lie group GL(n) and thus itself a Lie group (Section 2.5). The tangent space T_I O(n) at the identity forms the Lie algebra associated with the Lie group O(n). It coincides with the Lie algebra of SO(n) and as such is denoted by so(n) = T_I SO(n) = T_I O(n), [43, §3.3, 3.4]. The Lie algebra of SO(n) is precisely the vector space of skew-symmetric matrices, so(n) = skew(n). According to (2.6), the tangent space at an arbitrary location Q is given by the translates (by left-multiplication) of the Lie algebra

T_Q O(n) = Q so(n) = {∆ = QV ∈ R^{n×n} | V ∈ skew(n)},

which is the same as {∆ ∈ R^{n×n} | Q^T ∆ = −∆^T Q}. The Lie exponential is

expm |so(n) : so(n) → SO(n). (4.6)

This restriction is a surjective map, see Appendix A. The dimensions of both T_Q O(n) and O(n) are (1/2) n(n − 1).

4.2.2. Distances and geodesics. We follow up on the discussion in Section 4.1.1. For the orthogonal group, the Euclidean metric and the left-invariant metric coincide: Let ∆ = QV, ∆̃ = QṼ ∈ T_Q O(n) = Q so(n). Then,

⟨∆, ∆̃⟩_Q = ⟨Q^{-1}∆, Q^{-1}∆̃⟩_0 = ⟨V, Ṽ⟩_0 = trace(V^T Ṽ) = trace(V^T Q^T Q Ṽ) = ⟨∆, ∆̃⟩_0.

In fact, the metric is also right-invariant, which makes it a bi-invariant metric, see [6, §2]. Bi-invariant metrics are important, because for Lie groups endowed with bi-invariant metrics, the Lie exponential map and the Riemannian exponential map at the identity coincide [6, Thm. 2.27, p. 40].

The Riemannian exponential and logarithm maps on O(n). The Riemannian O(n)-exponential at a base point Q ∈ O(n) sends a tangent vector ∆ ∈ T_Q O(n) to the endpoint Q̃ ∈ O(n) of a geodesic that starts from Q with velocity vector ∆. Therefore, it provides at the same time an expression for the geodesic curves on O(n). A formula for computing the Riemannian O(n)-exponential was derived in [33, §2.2.2]. Given Q ∈ O(n), it holds

Exp^{O(n)}_Q : T_Q O(n) → O(n), ∆ ↦ Q̃ := Q expm(Q^T ∆).  (4.7)

This result is also immediate from abstract Lie theory, see [6, Eq. (2.2) & Thm. 2.27].¹⁸
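As a quick numerical illustration of (4.7), the following sketch (the function name is hypothetical; NumPy/SciPy are assumed) maps a tangent vector ∆ = QV with skew-symmetric V back to the orthogonal group:

import numpy as np
from scipy.linalg import expm

def exp_On(Q, Delta):
    # Riemannian O(n)-exponential (4.7): Q_tilde = Q expm(Q^T Delta),
    # where Q^T Delta is skew-symmetric for tangent vectors Delta = Q V.
    return Q @ expm(Q.T @ Delta)

rng = np.random.default_rng(1)
n = 4
Q, _ = np.linalg.qr(rng.standard_normal((n, n)))   # random orthogonal base point
V = rng.standard_normal((n, n)); V = V - V.T       # skew-symmetric direction
Q_new = exp_On(Q, Q @ V)
print(np.allclose(Q_new.T @ Q_new, np.eye(n)))     # True: Q_new is orthogonal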

The corresponding Riemannian logarithm on O(n) is

Log^{O(n)}_Q : O(n) ⊃ D_Q → T_Q O(n), Q̃ ↦ ∆ := Q logm(Q^T Q̃)  (4.8)

and is well defined on a neighborhood D_Q ⊂ O(n) around Q such that for all Q̃ ∈ D_Q, the orthogonal matrix Q^T Q̃ does not feature λ = −1 as an eigenvalue.

The Riemannian distance between orthogonal matrices. For given Q, Q̃ ∈ O(n) from the same connected component of O(n), consider the EVD Q^T Q̃ = ΨΛΨ^H. Because Q^T Q̃ is orthogonal, it holds that Λ = diag(e^{iθ_1}, ..., e^{iθ_n}), and we assume that θ_1, ..., θ_n ∈ (−π, π). The Riemannian distance is

dist_{O(n)}(Q, Q̃) = ‖Log^{O(n)}_Q(Q̃)‖_Q = ‖logm(Λ)‖_F = (Σ_{k=1}^n θ_k²)^{1/2}.

The compact Lie group SO(n) is a geodesically complete Riemannian manifold [6, Hopf–Rinow Theorem, p. 31], and each two points of SO(n) can be joined by a minimal geodesic.

4.3. The matrix manifold of symmetric positive definite matrices. This section is devoted to the matrix manifold SPD(n) of real, symmetric positive-definite n-by-n matrices. In model reduction, such matrices appear for example as (reduced) system matrices in second-order parametric ODEs. For example, in linear structural or electrical dynamical systems, mass, stiffness and damping matrices are usually in SPD(n), [9, §4.2]. Moreover, positive definite matrices arise as Gramians of reachable and observable LTI systems in the context of balanced truncation [17]. Related is the manifold of positive semi-definite matrices of fixed rank. It is investigated in [20, 96, 64]. An application in the context of model reduction is featured in [65].

4.3.1. Introduction and data representation in numerical schemes. The set

SPD(n) = {A ∈ sym(n) | x^T A x > 0 ∀ x ∈ R^n \ {0}}

is an open subset of the metric space (sym(n), ⟨·, ·⟩_0) of symmetric matrices. As such, it is a differentiable manifold [19, §6]. Moreover, it forms a convex cone [34, Example 2, p. 8], [68, §2.3], and can be realized as a quotient SPD(n) ≅ GL(n)/O(n). The latter is based on the fact that for A ∈ SPD(n), matrix factorizations A = ZZ^T

¹⁸ The Lie exponential is expm|_{so(n)} : so(n) → SO(n), which is in the case at hand the Riemannian exponential at the identity, Exp^SO_I = expm|_{so(n)}. This translates to any other location via [6, Eq. (2.2)] as follows: Pick any Q ∈ SO(n) and consider the mapping “left-multiplication by Q”, i.e., L_Q : SO(n) → SO(n), P ↦ QP. Then, the differential is d(L_Q)_I : T_I SO(n) → T_{L_Q(I)} SO(n), V ↦ ∆ := QV. Because L_Q is an isometry,

Exp^SO_Q(QV) = L_Q(Exp^SO_I(V)) = Exp^SO_{L_Q(I)}(d(L_Q)_I(V)),

which gives Exp^SO_Q(QV) = Q Exp^SO_I(V) = Q expm(Q^{-1}∆) and thus (4.7).

with Z ∈ GL(n) are invariant under orthogonal transformations Z ↦ ZQ, Q ∈ O(n), [20, §2, p. 3]. Since SPD(n) is an open subset of the vector space sym(n), the tangent space is simply

T_A SPD(n) = sym(n).  (4.9)

The dimensions of both T_A SPD(n) and SPD(n) are (1/2) n(n + 1). There is a smooth one-to-one correspondence between sym(n) and SPD(n). That is, every positive definite matrix can be written as the matrix exponential of a unique symmetric matrix, [36, Lem. 18.7, p. 472]. Put in different words, when restricted to sym(n), the standard matrix exponential

expm : sym(n) → SPD(n)

is a diffeomorphism; its inverse is the standard principal matrix logarithm

logm : SPD(n) → sym(n),

see also [12, Thm. 2.8]. The group GL(n) acts on SPD(n) via congruence transformations

g_X(A) = X^T A X,  X ∈ GL(n), A ∈ SPD(n).  (4.10)

For additional background on SPD(n), see [69, 70, 78]. Applications in computer vision are presented in [28, 56].

4.3.2. Distances and geodesics. The literature knows a large variety of distance measures on SPD(n), see [53, Table 3.1, p. 56]. Yet, there are essentially two choices that are associated with inner products on the tangent space of SPD(n) and thus induce Riemannian metrics on the manifold SPD(n): the so-called natural metric and the log-Euclidean metric. Let A ∈ SPD(n) and let ∆, ∆̃ ∈ sym(n) be two tangent vectors.
• The natural metric is

⟨∆, ∆̃⟩_A = ⟨A^{-1/2} ∆ A^{-1/2}, A^{-1/2} ∆̃ A^{-1/2}⟩_0 = trace(A^{-1} ∆ A^{-1} ∆̃),

see [19, §6, p. 201], [20]. It also goes by the name trace metric, [61, §XII.1, p. 322]. In statistical applications, it is usually called the affine-invariant metric [67, 79].¹⁹
• The log-Euclidean metric is

⟨∆, ∆̃⟩_A = ⟨D(logm)_A(∆), D(logm)_A(∆̃)⟩_0,

see [12, eq. (3.5)].

For the natural metric, it is more appropriate to consider sym(n) = T_I SPD(n) as the tangent space at the identity and the tangent space at an arbitrary location A ∈ SPD(n) as T_A SPD(n) = A^{1/2} (T_I SPD(n)) A^{1/2}, which, of course, is nothing

¹⁹ The motivation is as follows: if y = Ax + v_0, A ∈ GL(n), is an affine transformation of a random vector x, then the mean is transformed to ȳ := Ax̄ + v_0 and the covariance matrix undergoes a congruence transformation C_yy = E[(y − ȳ)(y − ȳ)^T] = A C_xx A^T.

but a reparameterization of sym(n). From this perspective, we have for tangent vectors ∆ = A^{1/2} V A^{1/2}, ∆̃ = A^{1/2} Ṽ A^{1/2} that

⟨∆, ∆̃⟩_A = ⟨V, Ṽ⟩_0.

The congruence transformations (4.10) are isometries of SPD(n) with respect to the natural metric, [61, Thm. XII.1.1, p. 324], [19, Lem. 6.1.1, p. 201]. See also the discussion in [79, §3]. By a standard pullback construction from differential geometry [32, Def. 2.2, Example 2.5], the log-Euclidean metric transfers the inner product ⟨·, ·⟩_0 on sym(n) to SPD(n) via the matrix logarithm logm : SPD(n) → sym(n). In [12, eq. (3.5)], the authors take this construction one step further and use the expm-logm-correspondence to define a multiplication that turns SPD(n) into a Lie group and, eventually, into a vector space. As such, it is a flat manifold, i.e. a Riemannian manifold with zero curvature. In this way, the computational challenges that come with dealing with data on nonlinear manifolds are circumvented.

Which metric is to be preferred is problem-dependent, see the various contributions in [92] and [66]. Since the natural metric arises canonically both from the geometric approach, [61, §XII.1], and the matrix-algebraic approach [19, §6], and since staying with the standard matrix multiplication is consistent with the setting of solving dynamical systems in model reduction applications, we restrict the discussion of the Riemannian exponential and logarithm to the geometry that is based on the natural metric.

The SPD(n) exponential. The Riemannian SPD(n)-exponential at a base point A ∈ SPD(n) sends a tangent vector ∆ to the endpoint Ã ∈ SPD(n) of a geodesic that starts from A with velocity vector ∆. Therefore, it provides at the same time an expression for the geodesic curves on SPD(n) with respect to the natural metric. Formulae for computing the SPD(n)-exponential can be found in [20], [79]. Readers preferring a matrix-analytic approach are referred to [19, §6].

Algorithm 5 Riemannian SPD(n)-exponential

Input: base point A ∈ SPD(n), tangent vector ∆ ∈ T_A SPD(n) = sym(n)
Output: Ã := Exp^SPD_A(∆) = A^{1/2} expm(A^{-1/2} ∆ A^{-1/2}) A^{1/2}.

Here, A^{1/2} denotes the matrix square root of A, see Appendix A.

The SPD(n) logarithm. The Riemannian SPD(n)-logarithm at a base point A ∈ SPD(n) finds for another point Ã ∈ SPD(n) an SPD(n)-tangent vector ∆ such that the geodesic that starts from A with velocity ∆ reaches Ã after an arc length of ‖∆‖_A = √⟨∆, ∆⟩_A. Therefore, it provides for two given data points A, Ã ∈ SPD(n)
• a solution to the geodesic endpoint problem: a geodesic that starts from A and ends at Ã,
• the Riemannian distance between the given points A, Ã.
Formulae for computing the SPD(n)-logarithm can be found in [20], [79].

Algorithm 6 Riemannian SPD(n)-logarithm
Input: base point A ∈ SPD(n), location Ã ∈ SPD(n)
Output: ∆ := Log^SPD_A(Ã) = A^{1/2} logm(A^{-1/2} Ã A^{-1/2}) A^{1/2}.
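Both algorithms translate directly into code. The following Python sketch (hypothetical function names; NumPy/SciPy assumed) realizes Algorithms 5 and 6 via the EVD-based square root from Appendix A; the round trip Exp_A(Log_A(B)) = B serves as a sanity check.

import numpy as np
from scipy.linalg import expm, logm

def sqrtm_spd(A):
    # Matrix square root of an SPD matrix via its EVD (see Appendix A).
    lam, Q = np.linalg.eigh(A)
    return (Q * np.sqrt(lam)) @ Q.T

def exp_spd(A, Delta):
    # Algorithm 5: Riemannian SPD(n)-exponential w.r.t. the natural metric.
    A_h = sqrtm_spd(A)
    A_ih = np.linalg.inv(A_h)
    return A_h @ expm(A_ih @ Delta @ A_ih) @ A_h

def log_spd(A, B):
    # Algorithm 6: Riemannian SPD(n)-logarithm w.r.t. the natural metric.
    A_h = sqrtm_spd(A)
    A_ih = np.linalg.inv(A_h)
    return A_h @ logm(A_ih @ B @ A_ih) @ A_h

# sanity check: the round trip Exp_A(Log_A(B)) recovers B
rng = np.random.default_rng(2)
X = rng.standard_normal((4, 4))
A = X @ X.T + 4 * np.eye(4)                 # SPD test matrices
B = np.eye(4) + 0.1 * np.ones((4, 4))
print(np.allclose(exp_spd(A, log_spd(A, B)), B))   # True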

Both Algorithms 5 and 6 require computing the spectral decomposition of n-by-n matrices. The computational effort is O(n³). In the context of parametric model reduction, the Riemannian exponential and logarithm maps are usually required for reduced matrix operators [9]. If n denotes the dimension of the full state vectors and r ≪ n denotes the dimension of the reduced state vectors, then matrix exponentials for r-by-r matrices are required, so that the computational effort reduces to O(r³).

4.4. The Stiefel manifold. This section is devoted to the Stiefel manifold St(n, r) ⊂ R^{n×r} of rectangular column-orthogonal n-by-r matrices, r ≤ n. Points U ∈ St(n, r) may be considered as orthonormal bases of cardinality r, or r-frames in R^n. In model reduction, such matrices appear as orthogonal coordinate systems for low-order ansatz spaces that usually stem from a proper orthogonal decomposition or a singular value decomposition of given input solution data. Modeling data on the Stiefel manifold corresponds to data processing for orthonormal bases and thus allows, for example, for the interpolation/parameterization of POD subspace bases. The most important use case in model reduction is where the Stiefel matrices are tall and skinny, i.e., r ≪ n.

Interpolation problems on the Stiefel manifold have not yet been considered in the model reduction context. The reference [59] discusses interpolation of Stiefel data, however using quasi-geodesics rather than geodesics. The work [103] includes numerical experiments for interpolating orthogonal frames on the Stiefel manifold that rely on the canonical Riemannian Stiefel logarithm [82, 102].

4.4.1. Introduction and data representation in numerical schemes. The Stiefel manifold is the compact, homogeneous matrix manifold of column-orthogonal matrices

St(n, r) := {U ∈ R^{n×r} | U^T U = I_r}.

The manifold structure can be directly established via Theorem 2.2 in a similar way as in Example 1. An alternative approach is via Example 3, where St(n, r) is identified with the quotient space St(n, r) ≅ O(n)/(I_r × O(n − r)) under actions of the closed subgroup

I_r × O(n − r) := { [ I_r  0 ; 0  R ] | R ∈ O(n − r) } ≤ O(n).

Two square orthogonal matrices in O(n) are identified as the same point on St(n, r) if their first r columns coincide, see [33, §2.4]. For any matrix representative U ∈ St(n, r), the tangent space of St(n, r) at U is represented by

T_U St(n, r) = {∆ ∈ R^{n×r} | U^T ∆ = −∆^T U} ⊂ R^{n×r}.

Every tangent vector ∆ ∈ T_U St(n, r) may be written as

∆ = UA + (I − UU^T)T,  A ∈ R^{r×r} skew, T ∈ R^{n×r} arbitrary,  (4.11)
∆ = UA + U^⊥ B,  A ∈ R^{r×r} skew, B ∈ R^{(n−r)×r} arbitrary,  (4.12)

where in the latter case, U^⊥ ∈ St(n, n − r) is such that (U, U^⊥) ∈ O(n) is a square orthogonal matrix. The dimension of both T_U St(n, r) and St(n, r) is nr − (1/2) r(r + 1). For additional background and applications, see [3, 18, 26, 33, 49, 95].

4.4.2. Distances and geodesics. Let U ∈ St(n, r) be a point and let ∆ = UA + U^⊥ B, ∆̃ = UÃ + U^⊥ B̃ ∈ T_U St(n, r) be tangent vectors. There are two standard metrics on the Stiefel manifold.
• The Euclidean metric on T_U St(n, r) is the one inherited from the ambient R^{n×r}:

⟨∆, ∆̃⟩_0 = trace(∆^T ∆̃) = trace(A^T Ã) + trace(B^T B̃).

• The canonical metric on T_U St(n, r)

⟨∆, ∆̃⟩_U = trace(∆^T (I − (1/2) UU^T) ∆̃) = (1/2) trace(A^T Ã) + trace(B^T B̃)

is derived from the quotient representation St(n, r) = O(n)/(I_r × O(n − r)) of the Stiefel manifold.

The canonical metric counts the independent coordinates²⁰ of a tangent vector equally when measuring the length √⟨∆, ∆⟩_U of a tangent vector ∆ = UA + U^⊥B, while the Euclidean metric disregards the skew-symmetry of A [33, §2.4]. Recall that different metrics entail different measures for the lengths of curves and thus different formulae for geodesics.

The Stiefel exponential. The Riemannian Stiefel exponential at a base point U ∈ St(n, r) sends a Stiefel tangent vector ∆ to the endpoint Ũ ∈ St(n, r) of a geodesic that starts from U with velocity vector ∆. Therefore, it provides at the same time an expression for geodesic curves on St(n, r). A closed-form expression for the Stiefel exponential w.r.t. the Euclidean metric is included in [33, §2.2.2],

Ũ = Exp^St_U(∆) = (U, ∆) expm( [ U^T∆  −∆^T∆ ; I_r  U^T∆ ] ) [ I_r ; 0 ] expm(−U^T∆).
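A direct transcription of this closed-form expression into Python might read as follows. This is a sketch under the assumption that SciPy is available; the function name is made up, and the tangent-space projection used to build a test vector is the standard one from (4.11), not part of the formula itself.

import numpy as np
from scipy.linalg import expm

def exp_st_euclidean(U, Delta):
    # Closed-form Stiefel exponential w.r.t. the Euclidean metric [33, §2.2.2].
    r = U.shape[1]
    A = U.T @ Delta                             # skew-symmetric for tangent vectors
    M = np.block([[A, -Delta.T @ Delta],
                  [np.eye(r), A]])
    return (np.hstack([U, Delta]) @ expm(M)[:, :r]) @ expm(-A)

rng = np.random.default_rng(3)
n, r = 8, 3
U, _ = np.linalg.qr(rng.standard_normal((n, r)))
T = rng.standard_normal((n, r))
Delta = T - U @ ((U.T @ T + T.T @ U) / 2)       # projection onto T_U St(n, r)
U_new = exp_st_euclidean(U, Delta)
print(np.allclose(U_new.T @ U_new, np.eye(r)))  # True: result is on St(n, r)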

In [50], an alternative formula is derived that features only matrix exponentials of skew-symmetric matrices. An efficient algorithm for computing the Stiefel exponential w.r.t. the canonical metric was derived in [33, §2.4.2]; it is stated as Algorithm 7.

Algorithm 7 Stiefel exponential [33].

Input: base point U ∈ St(n, r), tangent vector ∆ ∈ T_U St(n, r)
1: A := U^T ∆  # horizontal component, skew
2: QR := ∆ − UA  # (thin) qr-decomp. of normal component of ∆
3: [ A  −R^T ; R  0 ] = T Λ T^H ∈ R^{2r×2r}  # EVD of skew-symmetric matrix
4: [ M ; N ] := T expm(Λ) T^H [ I_r ; 0 ] ∈ R^{2r×r}
Output: Ũ := Exp^St_U(∆) = UM + QN ∈ St(n, r)

In applications where Exp^St_U(µ∆) needs to be evaluated for various parameters µ, as in the example of Section 3.4, steps 1–3 should be computed a priori (offline). Apart from elementary matrix multiplications, the algorithm requires computing the standard matrix exponential of a skew-symmetric matrix. This, however, is for a 2r-by-2r matrix and does not scale in the dimension n. With the usual assumption of model reduction that r ≪ n, the computational effort is O(nr²).
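In Python, Algorithm 7 can be sketched as follows (hypothetical naming; NumPy/SciPy assumed). For brevity, steps 3–4 are realized with a single call to SciPy's expm rather than via the EVD of the skew-symmetric block matrix; both routes compute the same quantity.

import numpy as np
from scipy.linalg import expm

def exp_st_canonical(U, Delta):
    # Algorithm 7: Stiefel exponential w.r.t. the canonical metric [33].
    r = U.shape[1]
    A = U.T @ Delta                          # step 1: horizontal component, skew
    Q, R = np.linalg.qr(Delta - U @ A)       # step 2: thin QR of normal component
    X = np.block([[A, -R.T],
                  [R, np.zeros((r, r))]])    # 2r x 2r skew-symmetric matrix
    MN = expm(X)[:, :r]                      # steps 3-4: [M; N] = expm(X) [I_r; 0]
    return U @ MN[:r, :] + Q @ MN[r:, :]

Realizing steps 3–4 through the EVD as in Algorithm 7 instead of a direct expm call is what enables the offline/online splitting mentioned above.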

²⁰ i.e., the upper triangular entries of the skew-symmetric A and the entries of B of ∆ = UA + U^⊥B

The Stiefel logarithm. The Riemannian Stiefel logarithm at a base point U ∈ St(n, r) finds for another point Ũ ∈ St(n, r) a Stiefel tangent vector ∆ such that the geodesic that starts from U with velocity ∆ reaches Ũ after an arc length of ‖∆‖_U = √⟨∆, ∆⟩_U. Therefore, it provides for two given data points U, Ũ ∈ St(n, r)
• a solution to the geodesic endpoint problem: a geodesic that starts from U and ends at Ũ,
• the Riemannian distance between the given points U, Ũ.
An efficient algorithm for computing the Stiefel logarithm w.r.t. the canonical metric was derived in [102]; it is stated as Algorithm 8.

Algorithm 8 Stiefel logarithm [102].
Input: base point U ∈ St(n, r), Ũ ∈ St(n, r) ‘close’ to base point, τ > 0 convergence threshold
1: M := U^T Ũ ∈ R^{r×r}
2: QN := Ũ − UM ∈ R^{n×r}  # (thin) qr-decomp. of normal component of Ũ
3: V_0 := [ M  X_0 ; N  Y_0 ] ∈ O(2r)  # compute orth. completion of the block [ M ; N ]
4: for k = 0, 1, 2, ... do
5:   [ A_k  −B_k^T ; B_k  C_k ] := logm(V_k)  # matrix log of orth. matrix
6:   if ‖C_k‖_2 ≤ τ then
7:     break
8:   end if
9:   Φ_k := expm(−C_k)  # matrix exp of skew matrix
10:  V_{k+1} := V_k W_k, where W_k := [ I_r  0 ; 0  Φ_k ]
11: end for
Output: ∆ := Log^St_U(Ũ) = U A_k + Q B_k ∈ T_U St(n, r)
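A compact Python sketch of Algorithm 8 follows (hypothetical function name; NumPy/SciPy assumed). Overwriting the first r columns of a complete QR factor with the block [M; N] is one simple way to realize the orthogonal completion in step 3, since both column blocks agree up to column signs.

import numpy as np
from scipy.linalg import expm, logm

def log_st_canonical(U, U_tilde, tau=1e-11, maxiter=100):
    # Algorithm 8: Stiefel logarithm w.r.t. the canonical metric [102].
    n, r = U.shape
    M = U.T @ U_tilde                                   # step 1
    Q, N = np.linalg.qr(U_tilde - U @ M)                # step 2: thin QR
    V, _ = np.linalg.qr(np.vstack([M, N]), mode='complete')
    V[:, :r] = np.vstack([M, N])                        # step 3: completion of [M; N]
    for _ in range(maxiter):
        L = logm(V)                                     # step 5: log of orth. matrix
        A_k, B_k, C_k = L[:r, :r], L[r:, :r], L[r:, r:]
        if np.linalg.norm(C_k, 2) <= tau:               # step 6: convergence test
            break
        W = np.block([[np.eye(r), np.zeros((r, r))],
                      [np.zeros((r, r)), expm(-C_k)]])  # steps 9-10
        V = V @ W
    return U @ A_k + Q @ B_k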

The analysis in [102] shows that the algorithm is guaranteed to converge if the input data points U, Ũ are at most a Euclidean distance of d = ‖U − Ũ‖_2 ≤ 0.09 apart. In this case, the algorithm exhibits a linear rate of convergence that depends on d but is smaller than 1/2. In practice, the algorithm seems to converge whenever the initial V_0 is such that its standard matrix logarithm logm(V_0) is well-defined. Note that two points on St(n, r) can at most be a Euclidean distance of 2 away from each other. Apart from elementary matrix multiplications, the algorithm requires computing the standard matrix logarithm of an orthogonal 2r-by-2r matrix and the standard matrix exponential of a skew-symmetric r-by-r matrix at every iteration k. Yet, these operations are independent of the dimension n. With the usual assumption of model reduction that r ≪ n, the computational effort is O(nr²).

For the Stiefel manifold equipped with the Euclidean metric, methods for calculating the Stiefel logarithm are introduced in [22].

4.5. The Grassmann manifold. This section is devoted to the Grassmann manifold Gr(n, r) of r-dimensional subspaces of R^n for r ≤ n. Every point 𝒰 ∈ Gr(n, r), i.e., every subspace, may be represented by selecting a basis {u_1, ..., u_r} with ran(u_1, ..., u_r) = 𝒰. In numerical schemes, we work exclusively with orthonormal bases. In this way, points 𝒰 on the Grassmann manifold are represented by points U ∈ St(n, r) on the Stiefel manifold via 𝒰 = ran(U). For details and theoretical background, see the references [2, 3, 33]. Subspaces and Grassmann manifolds play an important role in projection-based parametric model reduction [8, 73, 100, 86] and in Krylov subspace approaches [17]. Modeling data on the Grassmann manifold corresponds to data processing for subspaces and thus allows, for example, for the interpolation/parameterization of POD subspaces. The most important use case in model reduction is where the subspaces are of low dimension compared to the surrounding state space, i.e., r ≪ n.

4.5.1. Introduction and data representation in numerical schemes. The set of all r-dimensional subspaces 𝒰 ⊂ R^n forms the Grassmann manifold

Gr(n, r) := {𝒰 ⊂ R^n | 𝒰 subspace, dim(𝒰) = r}.

The Grassmann manifold is a quotient of O(n) under the action of the Lie subgroup

O(r) × O(n − r) = { [ S  0 ; 0  R ] | S ∈ O(r), R ∈ O(n − r) } ≤ O(n).

Two matrices Q, Q̃ ∈ O(n) are in the same (O(r) × O(n − r))-orbit if and only if the first r columns of Q and Q̃ span the same subspace and the trailing n − r columns span the corresponding orthogonal complement subspace. Theorem 2.11 applies and shows that Gr(n, r) = O(n)/(O(r) × O(n − r)) is a homogeneous manifold. Alternatively, the Grassmann manifold can be realized as a quotient manifold of the Stiefel manifold with the help of Theorem 2.9,

Gr(n, r) = St(n, r)/O(r) = {[U] | U ∈ St(n, r)},  (4.13)

where the O(r)-orbits are [U] = {UR | R ∈ O(r)}. A matrix U ∈ St(n, r) is called a matrix representative of a subspace 𝒰 ∈ Gr(n, r), if 𝒰 = ran(U). The orbit [U] and the subspace 𝒰 = ran(U) are to be considered as the same object. For any matrix representative U ∈ St(n, r) of 𝒰 ∈ Gr(n, r), the tangent space of Gr(n, r) at 𝒰 is represented by

T_𝒰 Gr(n, r) = {∆ ∈ R^{n×r} | U^T ∆ = 0} ⊂ R^{n×r}.

Every tangent vector ∆ ∈ T_𝒰 Gr(n, r) may be written as

∆ = (I − UU^T)T,  T ∈ R^{n×r} arbitrary, or  (4.14)
∆ = U^⊥ B,  B ∈ R^{(n−r)×r} arbitrary,  (4.15)

where in the latter case, U^⊥ ∈ St(n, n − r) is such that (U, U^⊥) ∈ O(n) is a square orthogonal matrix. The dimension of both T_𝒰 Gr(n, r) and Gr(n, r) is nr − r².

4.5.2. Distances and geodesics. A metric on T_𝒰 Gr(n, r) can be obtained by making use of the fact that the Grassmannian is a quotient of the Stiefel manifold. Alternatively, one can restrict the standard inner matrix product ⟨A, B⟩_0 = trace(A^T B) to the Grassmann tangent space. In the case of the Grassmannian, both approaches lead to the same metric

⟨∆, ∆̃⟩_𝒰 = trace(∆^T ∆̃) = ⟨∆, ∆̃⟩_0,

see [33, §2.5].

Algorithm 9 Grassmann exponential [33].
Input: base point 𝒰 = [U] ∈ Gr(n, r), where U ∈ St(n, r), tangent vector ∆ ∈ T_𝒰 Gr(n, r)
1: QΣV^T := ∆, with Q ∈ St(n, r)  # (thin) SVD of tangent vector
2: Ũ := UV cos(Σ)V^T + Q sin(Σ)V^T  # cos and sin act only on diag. entries
Output: 𝒰̃ := Exp^Gr_𝒰(∆) = [Ũ] ∈ Gr(n, r).
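In Python, Algorithm 9 amounts to a few lines (a sketch; the function name is hypothetical, NumPy assumed):

import numpy as np

def exp_gr(U, Delta):
    # Algorithm 9: Grassmann exponential [33]; returns a Stiefel representative
    # of the subspace Exp_[U](Delta).
    Q, s, Vt = np.linalg.svd(Delta, full_matrices=False)         # thin SVD
    return (U @ Vt.T) @ (np.cos(s)[:, None] * Vt) + Q @ (np.sin(s)[:, None] * Vt)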

The Grassmann exponential. The Riemannian Grassmann exponential at a base point 𝒰 ∈ Gr(n, r) sends a Grassmann tangent vector ∆ to the endpoint 𝒰̃ ∈ Gr(n, r) of a geodesic that starts from 𝒰 with velocity vector ∆. Therefore, it provides at the same time an expression for the geodesic curves on Gr(n, r). An efficient algorithm for computing the Grassmann exponential was derived in [33, §2.5.1]; it is stated as Algorithm 9. Apart from elementary matrix multiplications, the algorithm requires computing the singular value decomposition of an n-by-r matrix. The computational effort is O(nr²).

The Grassmann logarithm. The Riemannian Grassmann logarithm at a base point 𝒰 ∈ Gr(n, r) finds for another point 𝒰̃ ∈ Gr(n, r) a Grassmann tangent vector ∆ such that the geodesic that starts from 𝒰 with velocity ∆ reaches 𝒰̃ after an arc length of ‖∆‖_𝒰 = √⟨∆, ∆⟩_𝒰. Therefore, it provides for two given data points 𝒰, 𝒰̃ ∈ Gr(n, r)
• a solution to the geodesic endpoint problem: a geodesic that starts from 𝒰 and ends at 𝒰̃,
• the Riemannian distance between the given points 𝒰, 𝒰̃.
An algorithm for computing the Grassmann logarithm is stated implicitly in [2, §3.8, p. 210]. The reference [37] features expressions for the Grassmann exponential and the corresponding logarithm that formally work with Grassmann representatives in SO(n)/(SO(r) × SO(n − r)) but also keep the computational effort at O(nr²). The reference [81, §4.3] gives the corresponding mappings after identifying subspaces with orthoprojectors, see also [16].

Algorithm 10 Grassmann logarithm.
Input: base point 𝒰 = [U] ∈ Gr(n, r) with U ∈ St(n, r), 𝒰̃ = [Ũ] ∈ Gr(n, r) with Ũ ∈ St(n, r)
1: M := U^T Ũ
2: L := (I − UU^T)Ũ M^{-1} = Ũ M^{-1} − U
3: QΣV^T := L  # (thin) SVD
4: ∆ := Q arctan(Σ)V^T  # arctan acts only on diag. entries
Output: ∆ = Log^Gr_𝒰(𝒰̃) ∈ T_𝒰 Gr(n, r)

The composition Exp^Gr_[U] ∘ Log^Gr_[U] is the identity on Gr(n, r), wherever it is defined. Yet, on the level of the actual matrix representatives, the operation

(Exp^Gr_[U] ∘ Log^Gr_[U])([Ũ_in]) = [Ũ_out]

produces a matrix Ũ_out ≠ Ũ_in. Directly recovering the input matrix can be achieved via a Procrustes-type preprocessing step, where Ũ is replaced with Ũ_* := ŨΦ, Φ = arg min_{Φ∈O(r)} ‖U − ŨΦ‖. This leads to Algorithm 11.

Algorithm 11 Grassmann logarithm: modified version.
Input: base point 𝒰 = [U] ∈ Gr(n, r) with U ∈ St(n, r), 𝒰̃ = [Ũ] ∈ Gr(n, r) with Ũ ∈ St(n, r)
1: ΨSR^T := Ũ^T U  # SVD
2: Ũ_* := Ũ(ΨR^T)  # ‘transition to Procrustes representative’
3: L := (I − UU^T)Ũ_*
4: QΣV^T := L  # (thin) SVD
5: ∆ := Q arcsin(Σ)V^T  # arcsin acts only on diag. entries
Output: ∆ = Log^Gr_𝒰(𝒰̃) ∈ T_𝒰 Gr(n, r)
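A Python sketch of Algorithm 11 (hypothetical naming; NumPy assumed; the clip merely guards arcsin against singular values pushed slightly above 1 by round-off):

import numpy as np

def log_gr(U, U_tilde):
    # Algorithm 11: modified Grassmann logarithm.
    Psi, _, Rt = np.linalg.svd(U_tilde.T @ U)        # step 1: SVD of Utilde^T U
    U_star = U_tilde @ (Psi @ Rt)                    # step 2: Procrustes representative
    L = U_star - U @ (U.T @ U_star)                  # step 3: normal component
    Q, sigma, Vt = np.linalg.svd(L, full_matrices=False)
    theta = np.arcsin(np.clip(sigma, 0.0, 1.0))      # step 5, diagonal entries only
    return Q @ (theta[:, None] * Vt)

The Riemannian subspace distance (4.16) below then comes for free as the Frobenius norm of the returned tangent vector, ‖∆‖ = (Σ_k arcsin(σ_k)²)^{1/2}.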

An additional advantage of the modified Grassmann logarithm is that the matrix inversion M^{-1} = (U^T Ũ)^{-1} is avoided. In fact, it is replaced by the SVD ΨSR^T = Ũ^T U that is used to solve the Procrustes problem min_{Φ∈O(r)} ‖U − ŨΦ‖. The SVD exists also if Ũ^T U does not have full rank.

Distances between subspaces. The Riemannian logarithm provides the distance between two subspaces 𝒰 = [U], 𝒰̃ = [Ũ] ∈ Gr(n, r) as follows: first, compute ∆ = Log^Gr_𝒰(𝒰̃), then compute ‖∆‖_𝒰 = dist_Gr(𝒰, 𝒰̃). In practice, however, this boils down to computing the singular values of the matrix M = U^T Ũ, which can be seen as follows. By Algorithm 11, ‖∆‖²_𝒰 = trace(∆^T ∆) = Σ_{k=1}^r arcsin(σ_k)², where the σ_k's are the singular values of L = (I − UU^T)Ũ_*. These match precisely the square roots of the eigenvalues of L^T L. Using the SVD of the square matrix Ũ^T U = ΨSR^T as in steps 1 & 2 of Algorithm 11, the eigenvalues of L^T L can be read off from

L^T L = Ũ_*^T (I − UU^T) Ũ_* = I − R S² R^T = R(I − S²) R^T,

so that σ_k² = 1 − s_k² when consistently ordered. As a consequence, s_k = √(1 − σ_k²) = cos(arcsin(σ_k)), which implies

dist_Gr(𝒰, 𝒰̃) = (Σ_{k=1}^r arcsin(σ_k)²)^{1/2} = (Σ_{k=1}^r arccos(s_k)²)^{1/2},  (4.16)

where σ_1, ..., σ_r and s_1, ..., s_r are the singular values of L and Ũ^T U, respectively.

The numerical literature knows a variety of distance measures for subspaces. Essentially, all of them are based on the principal angles [33, §2.5.1, §4.3]. The principal angles (or canonical angles) θ_1, ..., θ_r ∈ [0, π/2] between two subspaces [U], [Ũ] ∈ Gr(n, r) are defined recursively by

cos(θ_k) := u_k^T v_k := max { u^T v | u ∈ [U], ‖u‖ = 1, u ⊥ u_1, ..., u_{k−1};  v ∈ [Ũ], ‖v‖ = 1, v ⊥ v_1, ..., v_{k−1} }.

The principal angles can be computed via θ_k := arccos(s_k) ∈ [0, π/2], where s_k is the kth singular value of U^T Ũ ∈ R^{r×r} [39, §6.4.3]. Hence, the Riemannian subspace distance (4.16) expressed in terms of the principal angles is precisely

dist([U], [Ũ]) := ‖Θ‖_2,  Θ = (θ_1, ..., θ_r) ∈ R^r.  (4.17)

In particular, (4.17) shows that any two points on Gr(n, r) can be connected by a geodesic of length at most (√r/2)π, see also [98, Thm 8(b)].

Appendix A. Appendix.

The matrix exponential and logarithm. The standard matrix exponential and matrix logarithm are defined via the series

expm(X) := Σ_{j=0}^∞ X^j/j!,  logm(X) := Σ_{j=1}^∞ (−1)^{j+1} (X − I)^j/j.  (A.1)

For X ∈ R^{n×n}, expm(X) is invertible with inverse expm(−X). The following restrictions of the exponential map are important:

expm|_{sym(n)} : sym(n) → SPD(n),  expm|_{skew(n)} : skew(n) → SO(n).

The former is a diffeomorphism [78, Thm. 2.8]; the latter is a differentiable, surjective map [38, §3.11, Thm. 9]. For additional properties and efficient methods for numerical computation, see [47, §10, 11]. A few properties of the exponential function for real or complex numbers carry over to the matrix exponential. However, since matrices do not commute, the standard exponential law is replaced by

expm(Z(X, Y)) = expm(X) expm(Y),  (A.2)
Z(X, Y) = X + Y + (1/2)[X, Y] + (1/12)([X, [X, Y]] + [Y, [Y, X]]) − (1/24)[Y, [X, [X, Y]]] + ...,

where [X, Y] = XY − YX is the commutator bracket, or Lie bracket. This is Dynkin's formula for the Baker–Campbell–Hausdorff series, see [85, §1.3, p. 22]. From a theoretical point of view, it is important that all terms in this series can be expressed in terms of the Lie bracket. A special case is

expm(X + Y) = expm(X) expm(Y), if [X, Y] = 0.

Matrix square roots and the polar decomposition. Every S ∈ SPD(n) has a unique matrix square root in SPD(n), i.e., a matrix denoted by S^{1/2} with the property S^{1/2} S^{1/2} = S. This square root can be obtained via an EVD S = QΛQ^T by setting

S^{1/2} := Q Λ^{1/2} Q^T,

where Q ∈ O(n), Λ = diag(λ_1, ..., λ_n), and λ_i > 0 are the eigenvalues of S. Every A ∈ GL(n) can be uniquely decomposed into an orthogonal matrix times a symmetric positive definite matrix,

A = QP = Q expm(X),  Q ∈ O(n), P ∈ SPD(n), X ∈ sym(n).

The polar factors can be constructed by taking the square root of the assuredly positive definite matrix A^T A and subsequently setting P := (A^T A)^{1/2} and Q := AP^{-1}. Because the restriction of expm to the symmetric matrices is a diffeomorphism onto SPD(n), there is a unique X ∈ sym(n) with P = expm(X). For details, see [43, Thm. 2.18].
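Both constructions translate directly into code; the following is a sketch under the assumption of NumPy, with hypothetical function names:

import numpy as np

def sqrtm_spd(S):
    # Unique SPD square root via the EVD S = Q diag(lam) Q^T.
    lam, Q = np.linalg.eigh(S)
    return (Q * np.sqrt(lam)) @ Q.T

def polar(A):
    # Polar decomposition A = Q P with Q orthogonal and P in SPD(n),
    # via P = (A^T A)^{1/2} and Q = A P^{-1}.
    P = sqrtm_spd(A.T @ A)
    return A @ np.linalg.inv(P), P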

The Procrustes problem. Let A, B ∈ R^{n×r}. The Procrustes problem aims at finding an orthogonal transformation R_* ∈ O(r) such that R_* is the minimizer of

min_{R∈O(r)} ‖A − BR‖_F.

The optimal R_* is R_* = UV^T, where B^T A = UΣV^T ∈ R^{r×r} is an SVD, see [39].
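In code, the Procrustes solution is a single SVD (a sketch, assuming NumPy; the function name is hypothetical):

import numpy as np

def procrustes(A, B):
    # Optimal rotation R* = U V^T from the SVD B^T A = U Sigma V^T,
    # minimizing ||A - B R||_F over R in O(r).
    U, _, Vt = np.linalg.svd(B.T @ A)
    return U @ Vt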

REFERENCES

[1] P.-A. Absil, P.-Y. Gousenbourger, P. Striewski, and B. Wirth, Differentiable piecewise-Bézier surfaces on Riemannian manifolds, SIAM Journal on Imaging Sciences, 9 (2016), pp. 1788–1828.
[2] P.-A. Absil, R. Mahony, and R. Sepulchre, Riemannian geometry of Grassmann manifolds with a view on algorithmic computation, Acta Applicandae Mathematica, 80 (2004), pp. 199–220.
[3] ——, Optimization Algorithms on Matrix Manifolds, Princeton University Press, Princeton, New Jersey, 2008.
[4] B. Afsari, R. Tron, and R. Vidal, On the convergence of gradient descent for finding the Riemannian center of mass, SIAM Journal on Control and Optimization, 51 (2013), pp. 2230–2260.
[5] D. Alekseevsky, A. Kriegl, P. W. Michor, and M. Losik, Choosing roots of polynomials smoothly, Israel Journal of Mathematics, 105 (1998), pp. 203–233.
[6] M. M. Alexandrino and R. G. Bettiol, Lie Groups and Geometric Aspects of Isometric Actions, Springer International Publishing, Cham, 2015.
[7] D. Amsallem, Interpolation on Manifolds of CFD-based Fluid and Finite Element-based Structural Reduced-order Models for On-line Aeroelastic Prediction, PhD thesis, Stanford University, 2010.
[8] D. Amsallem and C. Farhat, Interpolation method for adapting reduced-order models and application to aeroelasticity, AIAA Journal, 46 (2008), pp. 1803–1813.
[9] ——, An online method for interpolating linear parametric reduced-order models, SIAM Journal on Scientific Computing, 33 (2011), pp. 2169–2198.
[10] E. Andruchow, G. Larotonda, L. Recht, and A. Varela, The left invariant metric in the general linear group, Journal of Geometry and Physics, 86 (2014), pp. 241–257.
[11] V. Arnol'd, Mathematical Methods of Classical Mechanics, Graduate Texts in Mathematics, Springer, New York, 1997.
[12] V. Arsigny, P. Fillard, X. Pennec, and N. Ayache, Geometric means in a novel vector space structure on symmetric positive-definite matrices, SIAM Journal on Matrix Analysis and Applications, 29 (2006), pp. 328–347.
[13] P. Astrid, S. Weiland, K. Willcox, and T. Backx, Missing points estimation in models described by proper orthogonal decomposition, IEEE Transactions on Automatic Control, 53 (2008), pp. 2237–2251.
[14] M. Barrault, Y. Maday, N. Nguyen, and A. Patera, An “empirical interpolation” method: Application to efficient reduced-basis discretization of partial differential equations, Comptes Rendus Mathématique. Académie des Sciences. Paris, I 339 (2004), pp. 667–672.
[15] R. H. Bartels, J. C. Beatty, and B. A. Barsky, An Introduction to Splines for Use in Computer Graphics and Geometric Modeling, Morgan Kaufmann Series in Computer Graphics, Elsevier Science, 1995.
[16] E. Batzies, K. Hüper, L. Machado, and F. Silva Leite, Geometric mean and geodesic regression on Grassmannians, Linear Algebra and its Applications, 466 (2015), pp. 83–101.
[17] P. Benner, S. Gugercin, and K. Willcox, A survey of projection-based model reduction methods for parametric dynamical systems, SIAM Review, 57 (2015), pp. 483–531.
[18] A. V. Bernstein and A. P. Kuleshov, Tangent bundle manifold learning via Grassmann & Stiefel eigenmaps, arXiv preprint arXiv:1212.6031, (2012).
[19] R. Bhatia, Positive Definite Matrices, Princeton Series in Applied Mathematics, Princeton University Press, Princeton, New Jersey, 2007.
[20] S. Bonnabel and R. Sepulchre, Riemannian metric and geometric mean for positive semidefinite matrices of fixed rank, SIAM Journal on Matrix Analysis and Applications, 31 (2009), pp. 1055–1070.
[21] N. Boumal and P.-A. Absil, A discrete regression method on manifolds and its application to data on SO(n), IFAC Proceedings Volumes, 44 (2011), pp. 2284–2289. 18th IFAC World Congress.
[22] D. Bryner, Endpoint geodesics on the Stiefel manifold embedded in Euclidean space, SIAM Journal on Matrix Analysis and Applications, 38 (2017), pp. 1139–1159.
[23] M. D. Buhmann, Radial Basis Functions, vol. 12 of Cambridge Monographs on Applied and Computational Mathematics, Cambridge University Press, Cambridge, UK, 2003.
[24] M. Camarinha, F. S. Leite, and P. Crouch, On the geometry of Riemannian cubic polynomials, Differential Geometry and its Applications, 15 (2001), pp. 107–135.
[25] K. Carlberg and C. Farhat, A low-cost, goal-oriented ‘compact proper orthogonal decomposition’ basis for model reduction of static systems, International Journal for Numerical Methods in Engineering, 86 (2011), pp. 381–402.
[26] R. Chakraborty and B. C. Vemuri, Statistics on the (compact) Stiefel manifold: Theory and applications. arXiv:1708.00045v1, 2017.
[27] S. Chaturantabut and D. Sorensen, Nonlinear model reduction via discrete empirical interpolation, SIAM Journal on Scientific Computing, 32 (2010), pp. 2737–2764.
[28] A. Cherian and S. Sra, Positive definite matrices: Data representation and applications in computer vision, in Algorithmic Advances in Riemannian Geometry and Applications: For Machine Learning, Computer Vision, Statistics, and Optimization, H. Q. Minh and V. Murino, eds., Springer International Publishing, Cham, 2016, pp. 93–114.
[29] Y. Choi, D. Amsallem, and C. Farhat, Gradient-based constrained optimization using a database of linear reduced-order models, arXiv, arXiv:1506.07849v1 (2015), pp. 1–28.
[30] P. Crouch and F. S. Leite, The dynamic interpolation problem: On Riemannian manifolds, Lie groups, and symmetric spaces, Journal of Dynamical and Control Systems, 1 (1995), pp. 177–202.
[31] J. Degroote, J. Vierendeels, and K. Willcox, Interpolation among reduced-order matrices to obtain parameterized models for design, optimization and probabilistic analysis, International Journal for Numerical Methods in Fluids, 63 (2010), pp. 207–230.
[32] M. P. do Carmo, Riemannian Geometry, Mathematics: Theory & Applications, Birkhäuser Boston, 1992.
[33] A. Edelman, T. A. Arias, and S. T. Smith, The geometry of algorithms with orthogonality constraints, SIAM Journal on Matrix Analysis and Applications, 20 (1998), pp. 303–353.
[34] J. Faraut and A. Koranyi, Analysis on Symmetric Cones, Oxford Mathematical Monographs, Oxford University Press, New York, 1994.
[35] T. Franz, R. Zimmermann, S. Görtz, and N. Karcher, Interpolation-based reduced-order modeling for steady transonic flows via manifold learning, International Journal of Computational Fluid Dynamics, Special Issue on Reduced Order Modeling, 28 (2014), pp. 106–121.
[36] J. H. Gallier, Geometric methods and applications: for computer science and engineering, Texts in Applied Mathematics, Springer, New York, 2011.
[37] K. A. Gallivan, A. Srivastava, X. Liu, and P. Van Dooren, Efficient algorithms for inferences on Grassmann manifolds, in IEEE Workshop on Statistical Signal Processing, 2003, pp. 315–318.
[38] R. Godement and U. Ray, Introduction to the Theory of Lie Groups, Universitext, Springer International Publishing, 2017.
[39] G. H. Golub and C. F. Van Loan, Matrix Computations, The Johns Hopkins University Press, Baltimore, 4th ed., 2013.
[40] P.-Y. Gousenbourger, E. Massart, and P.-A. Absil, Data fitting on manifolds with composite Bézier-like curves and blended cubic splines, Journal of Mathematical Imaging and Vision, online (2018), pp. 1–27.
[41] P. Grohs, Quasi-interpolation in Riemannian manifolds, IMA Journal of Numerical Analysis, 33 (2013), pp. 849–874.
[42] B. Haasdonk and M. Ohlberger, Efficient reduced models and a-posteriori error estimation for parametrized dynamical systems by offline/online decomposition, Mathematical and Computer Modelling of Dynamical Systems, 17 (2011), pp. 145–161.
[43] B. C. Hall, Lie Groups, Lie Algebras, and Representations: An Elementary Introduction, Springer Graduate Texts in Mathematics, Springer-Verlag, New York – Berlin – Heidelberg, 2nd ed., 2015.
[44] A. Hay, J. Borggaard, I. Akhtar, and D. Pelletier, Reduced-order models for parameter dependent geometries based on shape sensitivity analysis, Journal of Computational Physics, 229 (2010), pp. 1327–1352.
[45] A. Hay, J. T. Borggaard, and D. Pelletier, Local improvements to reduced-order models using sensitivity analysis of the proper orthogonal decomposition, Journal of Fluid Mechanics, 629 (2009), pp. 41–72.
[46] U. Helmke and J. B. Moore, Optimization and Dynamical Systems, Communications & Control Engineering, Springer-Verlag, London, 1994.
[47] N. J. Higham, Functions of Matrices: Theory and Computation, Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 2008.
[48] M. Hinze and S. Volkwein, Proper orthogonal decomposition surrogate models for nonlinear dynamical systems: Error estimates and suboptimal control, in Dimension Reduction of Large-Scale Systems, vol. 45 of Lecture Notes in Computational Science and Engineering, Springer, Berlin–Heidelberg, 2005, pp. 261–306.

[49] K. Hüper, M. Kleinsteuber, and F. Silva Leite, Rolling Stiefel manifolds, International Journal of Systems Science, 39 (2008), pp. 881–887.
[50] K. Hüper and F. Ullrich, Real Stiefel manifolds: An extrinsic point of view, in 2018 13th APCA International Conference on Automatic Control and Soft Computing (CONTROLO), June 2018, pp. 13–18.
[51] K. Ito and S. S. Ravindran, A reduced-order method for simulation and control of fluid flows, Journal of Computational Physics, 143 (1998), pp. 403–425.
[52] J. Jakubiak, F. S. Leite, and R. Rodrigues, A two-step algorithm of smooth spline generation on Riemannian manifolds, Journal of Computational and Applied Mathematics, 194 (2006), pp. 177–191.
[53] S. Jayasumana, R. Hartley, and M. Salzmann, Kernels on Riemannian manifolds, in Riemannian Computing in Computer Vision, A. Srivastava and P. K. Turaga, eds., Springer International Publishing, 2015, pp. 45–67.
[54] K. R. Kim, I. L. Dryden, and H. Le, Smoothing splines on Riemannian manifolds, with applications to 3D shape space. arXiv:1801.04978v2, 2018.
[55] H. Karcher, Riemannian center of mass and mollifier smoothing, Communications on Pure and Applied Mathematics, 30 (1977), pp. 509–541.
[56] H. J. Kim, N. Adluru, B. B. Bendlin, S. C. Johnson, B. C. Vemuri, and V. Singh, Canonical correlation analysis on SPD(n) manifolds, in Riemannian Computing in Computer Vision, A. Srivastava and P. K. Turaga, eds., Springer International Publishing, 2015, pp. 69–100.
[57] S. Kobayashi and K. Nomizu, Foundations of Differential Geometry, vol. I of Interscience Tracts in Pure and Applied Mathematics no. 15, John Wiley & Sons, New York – London – Sidney, 1963.
[58] ——, Foundations of Differential Geometry, vol. II of Interscience Tracts in Pure and Applied Mathematics no. 15, John Wiley & Sons, New York – London – Sidney, 1969.
[59] K. A. Krakowski, L. Machado, F. Silva Leite, and J. Batista, Solving interpolation problems on Stiefel manifolds using quasi-geodesics, in Pré-Publicações do Departamento de Matemática, no. 15–36, Universidade de Coimbra, 2015.
[60] W. Kühnel, Differential Geometry: Curves – Surfaces – Manifolds, Student Mathematical Library, American Mathematical Society, 2006.
[61] S. Lang, Fundamentals of Differential Geometry, Graduate Texts in Mathematics, Springer New York, 2001.
[62] J. M. Lee, Riemannian Manifolds: an Introduction to Curvature, Springer-Verlag, New York – Berlin – Heidelberg, 1997.
[63] ——, Introduction to Smooth Manifolds, Graduate Texts in Mathematics, Springer New York, 2012.
[64] E. Massart and P.-A. Absil, Quotient geometry with simple geodesics for the manifold of fixed-rank positive-semidefinite matrices, Tech. Rep. UCL-INMA-2018.06, University of Louvain, 2018.
[65] E. Massart, P.-Y. Gousenbourger, T. S. Nguyen, T. Stykel, and P.-A. Absil, Interpolation on the manifold of fixed-rank positive-semidefinite matrices for parametric model order reduction: preliminary results, Tech. Rep. UCL-INMA-2018.13, University of Louvain, 2018.
[66] H. Q. Minh and V. Murino, Algorithmic Advances in Riemannian Geometry and Applications: For Machine Learning, Computer Vision, Statistics, and Optimization, Advances in Computer Vision and Pattern Recognition, Springer International Publishing, Cham, 2016.
[67] ——, From covariance matrices to covariance operators: Data representation from finite to infinite-dimensional settings, in Algorithmic Advances in Riemannian Geometry and Applications: For Machine Learning, Computer Vision, Statistics, and Optimization, H. Q. Minh and V. Murino, eds., Springer International Publishing, Cham, 2016, pp. 115–143.
[68] M. Moakher, A differential geometric approach to the geometric mean of symmetric positive-definite matrices, SIAM Journal on Matrix Analysis and Applications, 26 (2005), pp. 735–747.
[69] M. Moakher and P. G. Batchelor, Symmetric positive-definite matrices: From geometry to applications and visualization, in Visualization and Processing of Tensor Fields, J. Weickert and H. Hagen, eds., Mathematics and Visualization, Springer, Berlin – Heidelberg, 2006, pp. 285–298.
[70] M. Moakher and M. Zeraï, The Riemannian geometry of the space of positive-definite

matrices and its applications to the regularization of positive-definite matrix-valued data, Journal of Mathematical Imaging and Vision, 40 (2011), pp. 171–187.
[71] M. Morzyński, W. Stankiewicz, B. R. Noack, R. King, F. Thiele, and G. Tadmor, Continuous mode interpolation for control-oriented models of fluid flow, in Active Flow Control, R. King, ed., Springer, Berlin – Heidelberg, 2007, pp. 260–278.
[72] E. Nava-Yazdani and K. Polthier, De Casteljau's algorithm on manifolds, Computer Aided Geometric Design, 30 (2013), pp. 722–732.
[73] T. S. Nguyen, A real time procedure for affinely dependent parametric model order reduction using interpolation on Grassmann manifolds, International Journal for Numerical Methods in Engineering, 93 (2013), pp. 818–833.
[74] L. Noakes, G. Heinzinger, and B. Paden, Cubic splines on curved spaces, IMA Journal of Mathematical Control and Information, 6 (1989), pp. 465–473.
[75] M. Ohlberger and F. Schindler, Error control for the localized reduced basis multi-scale method with adaptive on-line enrichment, SIAM Journal on Scientific Computing, 37 (2015), pp. A2865–A2895.
[76] H. Panzer, J. Mohring, R. Eid, and B. Lohmann, Parametric model order reduction by matrix interpolation, Automatisierungstechnik, 58 (2010), pp. 475–484.
[77] B. Peherstorfer, D. Butnaru, K. Willcox, and H.-J. Bungartz, Localized discrete empirical interpolation method, SIAM Journal on Scientific Computing, 36 (2014), pp. A168–A192.
[78] X. Pennec, Intrinsic statistics on Riemannian manifolds: Basic tools for geometric measurements, Journal of Mathematical Imaging and Vision, 25 (2006), p. 127.
[79] X. Pennec, P. Fillard, and N. Ayache, A Riemannian framework for tensor computing, International Journal of Computer Vision, 66 (2006), pp. 41–66.
[80] T. Popiel and L. Noakes, Bézier curves and C2 interpolation in Riemannian manifolds, Journal of Approximation Theory, 148 (2007), pp. 111–127.
[81] I. U. Rahman, I. Drori, V. C. Stodden, D. L. Donoho, and P. Schröder, Multiscale representations for manifold-valued data, SIAM Journal on Multiscale Modeling and Simulation, 4 (2005), pp. 1201–1232.
[82] Q. Rentmeesters, Algorithms for data fitting on some common homogeneous spaces, PhD thesis, Université Catholique de Louvain, Louvain, Belgium, 2013.
[83] Q. Rentmeesters and P.-A. Absil, Algorithms comparison for Karcher mean computation of rotation matrices and diffusion tensors, in Proceedings of the 19th European Signal Processing Conference (EUSIPCO 2011), Barcelona, Spain, Aug. 29 – Sept. 2 2011.
[84] M. Rewienski and J. White, Model order reduction for nonlinear dynamical systems based on trajectory piecewise-linear approximations, Linear Algebra and its Applications, 415 (2006), pp. 426–454. Special Issue on Order Reduction of Large-Scale Systems.
[85] W. Rossmann, Lie Groups: An Introduction Through Linear Groups, Oxford Graduate Texts in Mathematics, Oxford University Press, 2006.
[86] N. T. S., P.-Y. Gousenbourger, E. Massart, and P.-A. Absil, Online balanced truncation for linear time-varying systems using continuously differentiable interpolation on Grassmann manifold, Tech. Rep. UCL-INMA-2019.01, University of Louvain, 2019.
[87] C. Samir, P.-A. Absil, A. Srivastava, and E. Klassen, A gradient-descent method for curve fitting on Riemannian manifolds, Foundations of Computational Mathematics, 12 (2012), pp. 49–73.
[88] C. Samir and I. Adouani, C1 interpolating Bézier path on Riemannian manifolds, with applications to 3D shape space, Applied Mathematics and Computation, 348 (2019), pp. 371–384.
[89] O. Sander, Interpolation und Simulation mit nichtlinearen Daten, GAMM Rundbriefe, 1 (2015), pp. 6–12.
[90] ——, Geodesic finite elements of higher order, IMA Journal of Numerical Analysis, 36 (2016), pp. 238–266.
[91] S. Sargsyan, S. L. Brunton, and J. N. Kutz, Online interpolation point refinement for reduced-order models using a genetic algorithm, SIAM Journal on Scientific Computing, 40 (2018), pp. B283–B304.
[92] A. Srivastava and P. K. Turaga, Riemannian Computing in Computer Vision, Springer International Publishing, 2015.
[93] F. Steinke, M. Hein, J. Peters, and B. Schoelkopf, Manifold-valued thin-plate splines with applications in computer graphics, Computer Graphics Forum, (2008).
[94] G. Tadmor, O. Lehmann, B. R. Noack, and M. Morzyński, Galerkin models enhancements for flow control, in Reduced-Order Modelling for Flow Control, B. R. Noack, M. Morzyński, and G. Tadmor, eds., Springer, Vienna, 2011, pp. 151–252.

[95] P. K. Turaga, A. Veeraraghavan, and R. Chellappa, Statistical analysis on Stiefel and Grassmann manifolds with applications in computer vision, in 2008 IEEE Conference on Computer Vision and Pattern Recognition, June 2008, pp. 1–8.
[96] B. Vandereycken, P.-A. Absil, and S. Vandewalle, A Riemannian geometry with complete geodesics for the set of positive semidefinite matrices of fixed rank, IMA Journal of Numerical Analysis, 33 (2012), pp. 481–514.
[97] G. Weickum, M. S. Eldred, and K. Maute, Multi-point extended reduced order modeling for design optimization and uncertainty analysis, in Proceedings of the 2nd AIAA Multidisciplinary Design Optimization Specialist Conference, no. AIAA 2006-2145, Newport, RI, May 1–4 2006.
[98] Y.-C. Wong, Differential geometry of Grassmann manifolds, Proceedings of the National Academy of Sciences of the United States of America, 57 (1967), pp. 589–594.
[99] R. Zimmermann, Gradient-enhanced surrogate modeling based on proper orthogonal decomposition, Journal of Computational and Applied Mathematics, 237 (2013), pp. 403–418.
[100] ——, A locally parametrized reduced order model for the linear frequency domain approach to time-accurate computational fluid dynamics, SIAM Journal on Scientific Computing, 36 (2014), pp. B508–B537.
[101] ——, Local parametrization of subspaces on matrix manifolds via derivative information, in Numerical Mathematics and Advanced Applications ENUMATH 2015, B. Karasözen, M. Manguoğlu, M. Tezer-Sezgin, S. Göktepe, and Ö. Uğur, eds., Springer International Publishing, Cham, 2016, pp. 379–387.
[102] ——, A matrix-algebraic algorithm for the Riemannian logarithm on the Stiefel manifold under the canonical metric, SIAM Journal on Matrix Analysis and Applications, 38 (2017), pp. 322–342.
[103] ——, Hermite interpolation and data processing errors on Riemannian matrix manifolds. arXiv:1908.05875, 2019.
[104] R. Zimmermann and K. Debrabant, Parametric model reduction via interpolating orthonormal bases, in Numerical Mathematics and Advanced Applications ENUMATH 2017, F. A. Radu, K. Kumar, I. Berre, D. N. Nordbotten, and I. S. Pop, eds., Springer International Publishing, Cham, 2018.
[105] R. Zimmermann and K. Willcox, An accelerated greedy missing point estimation procedure, SIAM Journal on Scientific Computing, 38 (2016), pp. A2827–A2850.
