A Spectral Nonlocal Block for Deep Neural Networks

Total Page:16

File Type:pdf, Size:1020Kb

A Spectral Nonlocal Block for Deep Neural Networks A Spectral Nonlocal Block for Deep Neural Networks Lei Zhu* 1 Qi She* 1 Lidan Zhang 1 Ping Guo 1 Intel Research Lab China, Beijing, China [email protected] Abstract Sun, 2015). Secondly, stacking more layers cannot always The nonlocal-based blocks are designed for cap- increase the effective receptive fields (Luo et al., 2016), turing long-range spatial-temporal dependencies which indicates the convolutional layers may still lack the in computer vision tasks. Although having shown mechanism to efficiently model these dependencies. excellent performances, they lack the mechanism To address these issues, inspired by the classical “nonlocal to encode the rich, structured information among mean” method in image denoising field, Wang et al.(2018) elements in an image. In this paper, to theoreti- proposes the nonlocal block to encode the “full-range” de- cally analyze the property of these nonlocal-based pendencies in one module by exploring the relation between blocks, we provide a unified approach to interpret- pairwise positions. For each central position, the nonlocal ing them, where we view them as a graph filter block firstly computes the pairwise relations between the generated on a fully-connected graph. When the central position and all positions to form an attention map, graph filter is approximated by Chebyshev poly- and then aggregates the features of all positions by weighted nomials, a generalized formulation can be derived mean according to the attention map. The aggregated fea- for explaining the existing nonlocal-based blocks tures are filtered by the convolutional operator and finally (e:g:; nonlocal block, nonlocal stage, double at- added to the features of each central position to form the tention block). Furthermore, we propose an effi- output. Due to its simplicity and effectiveness, the nonlo- cient and robust spectral nonlocal block, which cal neural network1 has been widely applied in image and can be flexibly inserted into deep neural networks video classification (Wang et al., 2018; Yue et al., 2018; Tao to catch the long-range dependencies between spa- et al., 2018; Chen et al., 2018), image segmentation (Huang tial pixels or temporal frames. Experimental re- et al., 2019; Yue et al., 2018; Wang et al., 2018) and person sults demonstrate the clear-cut improvements and re-identification (Liao et al., 2018; Zhang et al., 2019) tasks. practical applicabilities of the spectral nonlocal 2 block on image classification (Cifar-10/100, Ima- The nonlocal (mean) operator in the nonlocal block is re- geNet), fine-grained image classification (CUB- lated to the spatial-based graph convolution, which takes the 200), action recognition (UCF-101), and person aggregation of the central position/node and its neighbor po- re-identification (ILID-SVID, Mars, Prid-2011) sitions/nodes in the image/graph to get a new representation. tasks. Battaglia et al.(2018) shows the relation of the nonlocal block and the spatial-based graph convolutions in Message Passing Network (Gilmer et al., 2017). However, the nonlo- arXiv:1911.01059v4 [cs.CV] 10 Feb 2020 1. Introduction cal block focuses on a fully-connected graph, considering all the other positions in the image. Instead, the spatial based Capturing the long-range spatial-temporal dependencies be- graph convolution focuses on a sparse graph, which natu- tween spatial pixels or temporal frames plays a crucial role rally eliminates the redundant information. Specifically, the in the computer vision tasks. Convolutional neural net- affinity matrix (the similarity metric of pairwise positions) works (CNNs) are inherently limited by their convolution computed in the nonlocal block is obtained from all the fea- operators which are devoted to capture local features and re- tures of the upper layer, thus leading to the interference of lations, e:g:; a 7 × 7 region, and are inefficient in modeling feature aggregations. Therefore, the current nonlocal block long-range dependencies. Deep CNNs model these depen- needs an elaborate arrangement for its position, number, and dencies, which commonly refers to enlarge receptive fields, channel number to eliminate this negative effect. via stacking multiple convolution operators. However, two 1 unfavorable issues are raised in practice. Firstly, repeating The nonlocal neural network is the deep CNNs with nonlocal block inserted 2 convolutional operations comes with higher computation The nonlocal block consists a nonlocal operator and a residual connection and memory cost as well as the risk of over-fitting (He & Spectral Nonlocal Block Figure 1. The spatial (A) and spectral (B) view of a nonlocal block. The pink dots indicate each patch in the feature map and the “Aggregation” means calculating the weighted mean as the numerator of Eq. (1). The dotted arrows mean “copy” and full arrows mean “feed forward”. The green bars are the node features and the length means their strength (best view in color). To increase the robustness and applicability of the nonlocal ing nonlocal blocks in multiple vision tasks, including block in real-world applications, from the spectral-based image classification, fine-grained image classification, graph convolution views (Defferrard et al., 2016b;a; Levie action recognition, and person re-identification. et al., 2018), we reformulate the nonlocal block based on the property of graph spectral domain. As shown in Fig.1, the 2. Preliminary input image is fed into the convolutional layers to extract discriminative features such as the wing, the head, the claw In this paper, we use bold uppercase characters to denote and the neck. These features can be seen as the input of the matrix-valued random variable and italic bold uppercase the nonlocal block. Different from the spatial view which to denote the matrix. Vectors are denoted with lowercase. firstly aggregates the input features by weighted mean and The Nonlocal Block (NL) follows the nonlocal operator that then uses convolutional operator to filter as in Fig.1 A, our calculates a weighted mean between the features of each spectral view constructs a fully-connected graph based on position and all possible positions as shown in Fig.1 A. The their similarity and then directly filters the input features in nonlocal operator is defined as: a global view profited by the graph filter shown in Fig.1 B. P h i In practice, the Chebyshev polynomials is utilized to ap- j f(Xi;:; Xj;:)g(Xj;:) proximate the graph filter for reducing the number of pa- F (Xi;:) = ; (1) P f(X ; X ) rameters and computational cost (Phillips, 2003). This ap- j i;: j;: proximated formulation has successfully filled the gap be- where X 2 RN×C1 is the input feature map, i; j are the tween the spectral view and the spatial view of the nonlocal position indexes in the feature map, f(·) is the affinity kernel block (Wu et al., 2019). Thus other extended nonlocal-based which can adopt the “Dot Product”, “Traditional Gaussian”, e:g:; blocks ( nonlocal block, nonlocal stage, double atten- “Embedded Gaussian” or other kernel metrics with a finite tion block) can be further theoretically interpreted in the Frobenius norm. g(·) is a linear embedding that is defined as: spectral view. C1×Cs g(Xj;:) = Xj;:WZ with WZ 2 R . Here N is the Based on the points above, we propose Spectral Nonlocal total positions of each features and C1;Cs are the number of Block (SNL) that concerns the rich, structured information channels for the input and the transferred features. j When in an image via encoding the graph structure. The SNL inserting the NL block into the network structure, a linear guarantees the existence of the graph spectral domain and transformation and a residual connection are added: has more mathematical guarantees to improve its robustness and accuracy. In a nutshell, our contributions are threefold: Yi;: = Xi;: + F (Xi;:)W; (2) where W 2 RCs×C1 is the weight matrix. • We have theoretically bridged the gap between non- local block (spatial-based approaches) and the graph spectral filter method (spectral-based approach) 3. Spectral Nonlocal (SNL) Block • We propose a spectral nonlocal block as an efficient, The nonlocal operator can be explained under the graph simple, and generic component of deep neural net- spectral domain. It can be briefly divided into two steps: works, which captures long-range spatial-temporal de- generating a fully-connected graph to model the relation pendencies between spatial pixels or temporal frames. between the position pairs; converting the input features into the graph domain and learning a graph filter. In this section, • The SNL achieves a clear-cut improvement over exist- we firstly give the definition of the spectral view for the Spectral Nonlocal Block nonlocal operator. Then, we interpret other nonlocal-based Property 1. Theorem.1 requires the graph Laplacian L operators from this spectral view. Finally, we propose the has non-singular eigenvalues and eigenvectors. Thus, the Spectral Nonlocal (SNL) Block and highlight its properties. affinity matrix A should be non-negative and symmetric. Remark 1. Based on Theorem.1, new nonlocal operators 3.1. Nonlocal block in the spectral view can be theoretically designed by using different types of graph filter such as the Chebyshev filter (Defferrard et al., The matrix form of the nonlocal operator in Eq. (1) is: 2016a;b), the graph wavelet filter (Hammond et al., 2011) −1 F (X) = (DM M)ZW = AZW; (3) and the Caylet filter (Levie et al., 2018). In this work, we utilize the Chebyshev filter. where M = (M ), M = f(X ; X ) and Z = XW . ij ij i;: j;: Z Different from the spectral-based graph con- The matrix M 2 N×N is composed by pairwise similari- Remark 2. R volutions that focus on the sparse graph with fixed struc- ties between pixels. Z 2 N×Cs is the transferred feature R ture (Defferrard et al., 2016a), Theorem.1 focuses on the map that compresses the channels of X by a linear trans- fully-connected graph structure in which affinity matrix A formation with W 2 C1×Cs .
Recommended publications
  • A Posteriori Error Analysis of Finite Element Method for Linear Nonlocal Diffusion and Peridynamic Models
    MATHEMATICS OF COMPUTATION Volume 82, Number 284, October 2013, Pages 1889–1922 S 0025-5718(2013)02708-1 Article electronically published on May 8, 2013 A POSTERIORI ERROR ANALYSIS OF FINITE ELEMENT METHOD FOR LINEAR NONLOCAL DIFFUSION AND PERIDYNAMIC MODELS QIANG DU, LILI JU, LI TIAN, AND KUN ZHOU Abstract. In this paper, we present some results on a posteriori error analysis of finite element methods for solving linear nonlocal diffusion and bond-based peridynamic models. In particular, we aim to propose a general abstract frame work for a posteriori error analysis of the peridynamic problems. A posteriori error estimators are consequently prompted, the reliability and efficiency of the estimators are proved. Connections between nonlocal a posteriori error estimation and classical local estimation are studied within continuous finite element space. Numerical experiments (1D) are also given to test the theoret- ical conclusions. 1. Introduction Since peridynamic (PD) models were first introduced by Silling in [34], their effectiveness has been demonstrated in a number of applications, e.g., fracture and failure of composites, fracture of polycrystals, nanofiber networks and so on, where numerical simulations of singular behavior such as crack growth or damage are involved. Peridynamics is becoming an attractive emerging tool for the multiscale material simulations. On one hand, it is an integral-type nonlocal continuum theory which provides a more general set-up than the classical partial differential equation (PDE) based elasticity theories for physical problems with discontinuities. On the other hand, it can also be viewed as a continuum version of molecular dynamics. Along with research efforts on the modeling and application to practical prob- lems, there have been some recent studies concerning the mathematical and numer- ical analysis of PD models.
    [Show full text]
  • The Calderón Problem for Variable Coefficients Nonlocal Elliptic Operators
    The Calder´on problem for variable coefficients nonlocal elliptic operators Tuhin Ghosh,∗ Yi-Hsuan Lin,† Jingni Xiao‡ Abstract In this paper, we introduce an inverse problem of a Schr¨odinger type s variable nonlocal elliptic operator (−∇ · (A(x)∇)) + q), for 0 <s< 1. We determine the unknown bounded potential q from the exterior partial measurements associated with the nonlocal Dirichlet-to-Neumann map for any dimension n ≥ 2. Our results generalize the recent initiative [17] of introducing and solving inverse problem for fractional Schr¨odinger operator ((−∆)s + q) for 0 <s< 1. We also prove some regularity results of the direct problem corresponding to the variable coefficients fractional differential operator and the associated degenerate elliptic operator. Key words. The Calder´on problem, nonlocal Schr¨odinger equation, anisotropic, unique continuation principle, Runge approximation property, degenerate ellip- tic equations, Ap weight, Almgren’s frequency function, doubling inequality Mathematics Subject Classification: 35R30, 26A33, 35J10, 35J70 1 Introduction Let L be an elliptic partial differential operator. We consider in this paper an inverse problem associated to the nonlocal fractional operator Ls with the power s ∈ (0, 1). We introduce the corresponding Calder´on problem of deter- mining the unknown bounded potentials q(x) from the exterior measurements on the Dirichlet-to-Neumann (DN) map of the nonlocal Schr¨odinger equation (Ls +q)u = 0. It intends to generalize the recent study on the Calder´on problem for the fractional Schr¨odinger equation [17]. The study of the nonlocal opera- tors is currently an active research area in mathematics and often covers vivid problems coming from different fields including mathematical physics, finance, biology, geology.
    [Show full text]
  • Theoretical Foundations of Incorporating Local Boundary Conditions Into Nonlocal Problems
    Vol. XX (XXXX) No. X THEORETICAL FOUNDATIONS OF INCORPORATING LOCAL BOUNDARY CONDITIONS INTO NONLOCAL PROBLEMS Burak Aksoylu ∗ Department of Mathematics, TOBB University of Economics and Technology Sogutozu Caddesi No:43, Ankara, 06560, Turkey & Department of Mathematics, Wayne State University 656 W. Kirby, Detroit, MI 48202, USA. email: [email protected] Horst Reinhard Beyer Department of Mathematics, TOBB University of Economics and Technology Sogutozu Caddesi No:43, Ankara, 06560, Turkey & Universidad Polit´ecnicade Uruapan Carretera Carapan, 60120 Uruapan Michoac´an,M´exico& Theoretical Astrophysics, IAAT, Eberhard Karls University of T¨ubingen T¨ubingen72076, Germany. email: [email protected] Fatih Celiker y Department of Mathematics, Wayne State University 656 W. Kirby, Detroit, MI 48202, USA. email: [email protected] (Received 2016) We study nonlocal equations from the area of peridynamics on bounded domains. We present four main results. In our recent paper, we discover that, on R, the governing operator in peridynamics, which involves a convolution, is a bounded function of the classical (local) governing operator. Building on this, as main result 1, we construct an abstract convolution operator on bounded domains which is a generalization of the standard convolution based on integrals. The abstract convolution operator is a function of the classical operator, defined by a Hilbert basis available due to the purely discrete spectrum of the latter. As governing operator of the nonlocal equation we use a function of the classical operator, this allows us to incorporate local boundary conditions into nonlocal theories. As main result 2, we prove that the solution operator can be uniquely decomposed into a Hilbert-Schmidt operator and a multiple of the identity operator.
    [Show full text]
  • A Comparative Study on Nonlocal Diffusion Operators Related to the Fractional Laplacian
    DISCRETE AND CONTINUOUS doi:10.3934/dcdsb.2018110 DYNAMICAL SYSTEMS SERIES B Volume 24, Number 1, January 2019 pp. 231{256 A COMPARATIVE STUDY ON NONLOCAL DIFFUSION OPERATORS RELATED TO THE FRACTIONAL LAPLACIAN Siwei Duo Department of Mathematics and Statistics Missouri University of Science and Technology Rolla, MO 65409-0020, USA Hong Wang Department of Mathematics, University of South Carolina Columbia, SC 29208, USA Yanzhi Zhang Department of Mathematics and Statistics Missouri University of Science and Technology Rolla, MO 65409-0020, USA (Communicated by Erik Van Vleck) Abstract. In this paper, we study four nonlocal diffusion operators, includ- ing the fractional Laplacian, spectral fractional Laplacian, regional fractional Laplacian, and peridynamic operator. These operators represent the infinitesi- mal generators of different stochastic processes, and especially their differences on a bounded domain are significant. We provide extensive numerical exper- iments to understand and compare their differences. We find that these four operators collapse to the classical Laplace operator as α ! 2. The eigenvalues and eigenfunctions of these four operators are different, and the k-th (for k 2 N) eigenvalue of the spectral fractional Laplacian is always larger than those of the fractional Laplacian and regional fractional Laplacian. For any α 2 (0; 2), the peridynamic operator can provide a good approximation to the fractional Laplacian, if the horizon size δ is sufficiently large. We find that the solution of the peridynamic model converges to that of the fractional Laplacian model at a rate of O(δ−α). In contrast, although the regional fractional Laplacian can be used to approximate the fractional Laplacian as α ! 2, it generally provides inconsistent result from that of the fractional Laplacian if α 2.
    [Show full text]
  • A Nonlocal Vector Calculus with Application to Nonlocal Boundary-Value Problems
    A NONLOCAL VECTOR CALCULUS WITH APPLICATION TO NONLOCAL BOUNDARY-VALUE PROBLEMS Max Gunzburger Department of Scientific Computing Florida State University Collaboration with: Richard Lehoucq Sandia National Laboratories Last updated in 2009 LOCAL AND NONLOCAL DIFFUSION • The elliptic equation −∇ · D(x) · ∇w(x) = b(x) in Ω ⊂ Rd models (steady state) diffusion • The “nonlocal” equation 2 u(x′) − u(x) µ(x, x′) dx′ = b(x) in Ω ⊂ Rd Z Ω models (steady state) nonlocal diffusion – to see this, consider the nonlocal diffusion equation u = u(x′) − u(x) µ(x, x′) dx′ t Z Ω – suppose that µ(x, x′) dx′ dx =1 ZΩ ZΩ and µ(x, x′)= µ(x′, x) ≥ 0 so that µ(x, x′) can be interpreted as the joint probability density of moving between x and x′ – then u(x′) − u(x) µ(x, x′) dx′ Z Ω = u(x′)µ(x, x′) dx′ − u(x) µ(x, x′) dx′ ZΩ ZΩ is the rate at which u “enters” x less the rate at which u “departs” x – the nonlocal diffusion equation can be derived from a “nonlocal” random walk – any skew-symmetry of µ represents drift – the nonlocal diffusion equation is an example of a differential Chapman-Kolmogorov equation Motivations and goals u(x′) − u(x) µ(x, x′) dx′ vs. ∇· D(x) · ∇w(x) Z Ω • The nonlocal operator always contains length scales it is a multiscale operator – the local operator contains length scales only when the diffusion tensor D does • The nonlocal operator has lower regularity requirements u may be discontinuous • The nonlocal diffusion equation does not necessarily smooth discontinuous initial conditions • Extension to the vector case is (formally)
    [Show full text]
  • Fractional Calculus: Theory and Applications
    Fractional Calculus: Theory and Applications Edited by Francesco Mainardi Printed Edition of the Special Issue Published in Mathematics www.mdpi.com/journal/mathematics Fractional Calculus: Theory and Applications Fractional Calculus: Theory and Applications Special Issue Editor Francesco Mainardi MDPI • Basel • Beijing • Wuhan • Barcelona • Belgrade Special Issue Editor Francesco Mainardi University of Bologna Italy Editorial Office MDPI St. Alban-Anlage 66 Basel, Switzerland This is a reprint of articles from the Special Issue published online in the open access journal Mathematics (ISSN 2227-7390) from 2017 to 2018 (available at: http://www.mdpi.com/journal/ mathematics/special issues/Fractional Calculus Theory Applications) For citation purposes, cite each article independently as indicated on the article page online and as indicated below: LastName, A.A.; LastName, B.B.; LastName, C.C. Article Title. Journal Name Year, Article Number, Page Range. ISBN 978-3-03897-206-8 (Pbk) ISBN 978-3-03897-207-5 (PDF) Articles in this volume are Open Access and distributed under the Creative Commons Attribution (CC BY) license, which allows users to download, copy and build upon published articles even for commercial purposes, as long as the author and publisher are properly credited, which ensures maximum dissemination and a wider impact of our publications. The book taken as a whole is c 2018 MDPI, Basel, Switzerland, distributed under the terms and conditions of the Creative Commons license CC BY-NC-ND (http://creativecommons.org/licenses/by-nc-nd/4.0/). Contents About the Special Issue Editor ...................................... vii Francesco Mainardi Fractional Calculus: Theory and Applications Reprinted from: Mathematics 2018, 6, 145, doi: 10.3390/math6090145 ...............
    [Show full text]
  • Sandia National Labs SAND 2010-8353J
    Archive for Rational Mechanics and Analysis manuscript No. (will be inserted by the editor) Q. Du · M. D. Gunzburger · R. B. Lehoucq · K. Zhou A nonlocal vector calculus, nonlocal volume-constrained problems, and nonlocal balance laws DRAFT date 15 May 2011 Received: date / Accepted: date 2010-8353J Q. Du and K. Zhou were supported in part by the U.S. Department of Energy Office of Science under grant DE-SC0005346. M. Gunzburger was supported in part by the U.S. Department of Energy Office of Science under grant DE-SC0004970. Sandia is a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the U.S. Department of Energy under contract DE-AC04-94AL85000. SAND Q. Du Department of Mathematics Penn State University Tel.: +1-814-865-3674 State College, PA 16802, USA Labs Fax: +1-814-865-3735 E-mail: [email protected] M. D. Gunzburger Department of Scientific Computing Florida State University Tallahassee FL 32306-4120, USA Tel.: +1-850-644-7060 Fax: +1-850-644-0098 E-mail: [email protected] National R. B. Lehoucq Sandia National Laboratories P.O. Box 5800, MS 1320 Albuquerque NM 87185-1320, USA Tel.: +1-505-845-8929 Fax: +1-505-845-7442 E-mail: [email protected] K. Zhou Sandia Department of Mathematics Penn State University State College, PA 16802, USA Tel.: +1-814-865-3674 Fax: +1-814-865-3735 E-mail: [email protected] 2 Q. Du et al. Abstract A vector calculus for nonlocal operators is developed, including the definition of nonlocal generalized divergence, gradient, and curl operators and the derivation of their adjoint operators.
    [Show full text]
  • Properties and Convergence of State-Based Laplacians Kelsey Wells [email protected]
    University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Dissertations, Theses, and Student Research Papers Mathematics, Department of in Mathematics 5-2018 Properties and Convergence of State-based Laplacians Kelsey Wells [email protected] Follow this and additional works at: https://digitalcommons.unl.edu/mathstudent Part of the Analysis Commons, and the Partial Differential Equations Commons Wells, Kelsey, "Properties and Convergence of State-based Laplacians" (2018). Dissertations, Theses, and Student Research Papers in Mathematics. 83. https://digitalcommons.unl.edu/mathstudent/83 This Article is brought to you for free and open access by the Mathematics, Department of at DigitalCommons@University of Nebraska - Lincoln. It has been accepted for inclusion in Dissertations, Theses, and Student Research Papers in Mathematics by an authorized administrator of DigitalCommons@University of Nebraska - Lincoln. PROPERTIES AND CONVERGENCE OF STATE-BASED LAPLACIANS by Kelsey Wells A DISSERTATION Presented to the Faculty of The Graduate College at the University of Nebraska In Partial Fulfilment of Requirements For the Degree of Doctor of Philosophy Major: Mathematics Under the Supervision of Professor Petronela Radu Lincoln, Nebraska May, 2018 PROPERTIES AND CONVERGENCE OF STATE-BASED LAPLACIANS Kelsey Wells, Ph.D. University of Nebraska, 2018 Adviser: Petronela Radu The classical Laplace operator is a vital tool in modeling many physical behaviors, such as elasticity, diffusion and fluid flow. Incorporated in the Laplace operator is the requirement of twice differentiability, which implies continuity that many physical processes lack. In this thesis we introduce a new nonlocal Laplace-type operator, that is capable of dealing with strong discontinuities. Motivated by the state-based peri- dynamic framework, this new nonlocal Laplacian exhibits double nonlocality through the use of iterated integral operators.
    [Show full text]
  • Mathematical Analysis on an Age-Structured SIS Epidemic Model with Nonlocal Diffusion
    Journal of Mathematical Biology (2021) 83:5 https://doi.org/10.1007/s00285-021-01634-x Mathematical Biology Mathematical analysis on an age-structured SIS epidemic model with nonlocal diffusion Hao Kang1 · Shigui Ruan1 Received: 4 August 2020 / Revised: 2 June 2021 / Accepted: 13 June 2021 / Published online: 26 June 2021 © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2021 Abstract In this paper we propose an age-structured susceptible-infectious-susceptible epidemic model with nonlocal (convolution) diffusion to describe the geographic spread of infec- tious diseases via long-distance travel. We analyze the well-posedness of the model, investigate the existence and uniqueness of the nontrivial steady state corresponding to an endemic state, and study the local and global stability of this nontrivial steady state. Moreover, we discuss the asymptotic properties of the principal eigenvalue and nontrivial steady state with respect to the nonlocal diffusion rate. The analysis is car- ried out by using the theory of semigroups and the method of monotone and positive operators. The spectral radius of a positive linear operator associated to the solution flow of the model is identified as a threshold. Keywords Age-structure · Nonlocal diffusion · SIS model · Semigroup theory · Principal eigenvalue · Monotone and positive operators · Global dynamics Mathematics Subject Classification 35L60 · 92D25 · 47H07 1 Introduction Age structure and spatial movement are two important features on the geographic spread of infectious diseases. The age could be chronological age (the age of the host), infection age (the time elapsed since infection), recovery age (the time elapsed since the last infection), class age (the length of time in the present group), etc.
    [Show full text]
  • Asymptotically Compatible Discretization of Multidimensional
    IMA Journal of Numerical Analysis (2019) 39, 607–625 doi:10.1093/imanum/dry011 Advance Access publication on 5 April 2018 Asymptotically compatible discretization of multidimensional nonlocal diffusion May 2019 02 on user University Columbia by https://academic.oup.com/imajna/article-abstract/39/2/607/4959864 from Downloaded models and approximation of nonlocal Green’s functions Qiang Du, Yunzhe Tao and Xiaochuan Tian Department of Applied Physics and Applied Mathematics, Columbia University, New York, NY 10027 and Jiang Yang∗ Department Mathematics, Southern University of Science and Technology, Shenzhen, 518055, China and Department of Applied Physics and Applied Mathematics, Columbia University, New York, NY 10027 ∗Corresponding author: [email protected] [Received on 2 September 2017; revised on 17 January 2018] Nonlocal diffusion equations and their numerical approximations have attracted much attention in the literature as nonlocal modeling becomes popular in various applications. This paper continues the study of robust discretization schemes for the numerical solution of nonlocal models. In particular, we present quadrature-based finite difference approximations of some linear nonlocal diffusion equations in multidimensions. These approximations are able to preserve various nice properties of the nonlocal continuum models such as the maximum principle and they are shown to be asymptotically compatible in the sense that as the nonlocality vanishes, the numerical solutions can give consistent local limits. The approximation errors are proved to be of optimal order in both nonlocal and asymptotically local settings. The numerical schemes involve a unique design of quadrature weights that reflect the multidimensional nature and require technical estimates on nonconventional divided differences for their numerical analysis.
    [Show full text]
  • Fredholm Properties of Nonlocal Differential Operators Via Spectral Flow
    Fredholm properties of nonlocal differential operators via spectral flow Gr´egoryFaye1 and Arnd Scheel2 1,2University of Minnesota, School of Mathematics, 206 Church Street S.E., Minneapolis, MN 55455, USA September 16, 2013 Abstract We establish Fredholm properties for a class of nonlocal differential operators. Using mild convergence and localization conditions on the nonlocal terms, we also show how to compute Fredholm indices via a generalized spectral flow, using crossing numbers of generalized spatial eigenvalues. We illustrate possible applications of the results in a nonlinear and a linear setting. We first prove the existence of small viscous shock waves in nonlocal conservation laws with small spatially localized source terms. We also show how our results can be used to study edge bifurcations in eigenvalue problems using Lyapunov-Schmidt reduction instead of a Gap Lemma. Keywords: Nonlocal operator; Fredholm index; Spectral flow; Nonlocal conservation law; Edge bifurca- tions. 1 Introduction 1.1 Motivation Our aim in this paper is the study of the following class of nonlocal linear operators: 1 n 2 n d T : H ( ; ) −! L ( ; );U 7−! U − Keξ ∗ U (1.1) R R R R dξ where the matrix convolution kernel Keξ(ζ) = Ke(ζ; ξ) acts via Z 0 0 0 Keξ ∗ U(ξ) = Ke(ξ − ξ ; ξ)U(ξ )dξ : R Operators such as (1.1) appear when linearizing at coherent structures such as traveling fronts or pulses in nonlinear nonlocal differential equations. One is interested in properties of the linearization when analyzing robustness, stability or interactions of these coherent structures. A prototypical example are neural field equations which are used in mathematical neuroscience to model cortical traveling waves.
    [Show full text]
  • Data-Driven Learning of Robust Nonlocal Physics from High-Fidelity
    Data-driven learning of robust nonlocal physics from high-fidelity synthetic data Huaiqian Youa, Yue Yua,, Nathaniel Traskb, Mamikon Gulianb, Marta D'Eliac aDepartment of Mathematics, Lehigh University, Bethlehem, PA bCenter for Computing Research, Sandia National Laboratories, Albuquerque, NM cComputational Science and Analysis, Sandia National Laboratories, Livermore, CA Abstract A key challenge to nonlocal models is the analytical complexity of deriving them from first principles, and frequently their use is justified a posteriori. In this work we extract nonlocal models from data, circumventing these challenges and providing data-driven jus- tification for the resulting model form. Extracting provably robust data-driven surrogates is a major challenge for machine learning (ML) approaches, due to nonlinearities and lack of convexity. Our scheme allows extraction of provably invertible nonlocal models whose kernels may be partially negative. To achieve this, based on established nonlocal theory, we embed in our algorithm sufficient conditions on the non-positive part of the kernel that guarantee well-posedness of the learnt operator. These conditions are imposed as inequality constraints and ensure that models are robust, even in small-data regimes. We demonstrate this workflow for a range of applications, including reproduction of manufactured nonlocal kernels; numerical homogenization of Darcy flow associated with a heterogeneous periodic microstructure; nonlocal approximation to high-order local transport phenomena; and ap- proximation of globally supported fractional diffusion operators by truncated kernels. Keywords: nonlocal models, data-driven learning, machine learning, optimization, homogenization, fractional models arXiv:2005.10076v1 [math.OC] 17 May 2020 1. Background In contrast to partial differential equation (PDE) models which typically govern classical continuum mechanics and standard diffusion, nonlocal models describe systems in terms of integro-differential equations (IDEs).
    [Show full text]