Parareal in Julia with Singularity

Total Page:16

File Type:pdf, Size:1020Kb

Parareal in Julia with Singularity Fachhochschule Aachen Campus Jülich Department: Medizintechnik und Technomathematik Degree program: Angewandte Mathematik und Informatik Parareal in Julia with Singularity Masterthesis from Isabel Heisters Jülich, December 2020 Declaration of independence This work is made and written by myself. No other sources and tools have been used than those indicated. Place and date Signature The thesis was supervised by: First examiner: Prof. Dr. Johannes Grotendorst Second examiner: Dr. Robert Speck This thesis was done at : Jülich Supercomputing Centre Forschungszentrum Jülich GmbH Abstract In this thesis a Julia version of the parallel-in-time method Parareal is in- troduced. Parareal decomposes the time as an approach for parallelization. Parareal is an hierarchical, iterative algorithm that uses a coarse, cheap inte- grator to propagate information quickly forward in time in order to provide initial values for the parallelized original time integration scheme. The prob- lems here used to test the Parareal algorithm are the Lorenz equation, the heat equation and the Allen-Cahn equation. Julia is a programming language that was specifically designed to be used for numerical applications and par- allelization. Julia is becoming more popular due to the fact that it is easier to implement than C but has a better runtime than Python. Julia is a new language and not available on most host systems. Singularity is a container solution to create the necessities for scientific application-driven workloads. By using a container the user can configure the environment in which the applica- tion can run independently of the host system and its software specifications. This thesis shows how a Singularity container for the Parareal algorithm implemented in Julia can be built. The container is portable to different hosts like HPC and cloud systems without having a runtime overhead in comparison to the runtime without a container. V Zusammenfassung In dieser Arbeit wird der Parareal-Algorithmus als Werkzeug zur Paralleli- sierung vorgestellt. Parareal zerlegt die Zeit als Ansatz zur Parallelisierung. Parareal ist ein hierarchischer, iterativer Algorithmus, der einen günstigen In- tegrator verwendet, um Informationen schnell in der Zeit vorwärts zu senden, um Anfangswerte für das parallelisierte ursprüngliche Zeitintegrationsschema bereitzustellen. Die Probleme, die hier zum Testen des Parareal-Algorithmus verwendet werden, sind die Lorenz-Gleichung, die Wärmeleitungsgleichung und die Allen-Cahn-Gleichung. Julia ist eine Programmiersprache, die speziell für numerische Anwendungen und Parallelisierung entwickelt wurde. Julia wird immer beliebter, da sie einfacher zu implementieren ist als C, aber eine bes- sere Laufzeit als Python hat. Julia ist eine neue Sprache und auf den meisten Host-Systemen nicht verfügbar. Singularity ist eine Containerlösung, mit dem Ziel die Vorraussetzung für wissenschaftliche Projekte zu schaffen. Durch die Verwendung eines Containers kann der Benutzer die Umgebung konfigurieren, in der die Anwendung unabhängig vom Hostsystem und dessen Softwarespe- zifikationen laufen kann. Diese Arbeit zeigt, wie ein Singularity-Container für den in Julia implemen- tierten Parareal-Algorithmus gebaut werden kann. Der Container ist auf ver- schiedene Hosts wie HPC- und Cloud-Systeme portierbar, ohne einen Laufzeit- Overhead im Vergleich zur Laufzeit ohne Container zu haben. VI Contents 1. Motivation1 2. The Parareal algorithm3 2.1. Classification in literature . .3 2.2. Mathematical approach . .4 2.3. Parallelization . .7 3. A testbed 11 3.1. Preliminary remarks . 11 3.2. Lorenz system . 13 3.3. Heat equation in 1D with periodic boundary conditions . 15 3.4. Allen-Cahn Equation . 18 4. Containerization 21 4.1. Introduction to virtualization . 21 4.2. Singularity . 24 4.3. Example: Singularity container for Julia and OpenMPI . 26 5. Testing with Singularity 31 5.1. Singularity on Jureca . 31 5.2. Portability to other hosts . 34 5.3. Singularity with a different MPI version . 36 6. Summary and outlook 39 A. Appendix 41 A.1. Singularity Definition File . 41 Bibliography 48 VII List of Figures 2.1. Visualization of the Parareal algorithm with c=1 . .7 2.2. Visualization of the Parareal algorithm with c=2 . .9 3.1. Solution of the Lorenzsystem over time t = (0, 100) with stepsize ∆t = 0.01 and explicit Euler as solver . 13 3.2. Error and Iteration of the Lorenz system over time solved with Euler as coarse and fine solver with nf = 10 for different nc run on 8 cores . 14 3.3. Runtime of the Lorenz equation with different number of cores and number of time steps nt .................... 14 3.4. Error of the heat equation over nx and IMEX-Euler method as coarse and fine solver for different nc and nf = 10 ........ 16 3.5. Number of iterations of the heat equation with periodic boundry conditions over time and IMEX-Euler method as coarse and fine solver with nf = 10 for different nc and nx = 256 ........ 17 3.6. Runtime of the heat equation for nx = 256 with different number of cores and number of time steps nc and nf = 10 ........ 17 3.7. Calculated and exact Radius of the Allen-Cahn equation with IMEX-Euler and stabilized IMEX-Euler . 19 3.8. Calculations with Parareal of the radius, error of the radius and iteration number against time with stabilized IMEX-Euler for different nc, different nx and nf = 10 ............... 20 3.9. Runtime of the Allen-Cahn equation for nx = 128 with different number of cores and number of time steps nt and . 20 5.1. Run time in seconds of the Lorenz equation run on Jureca with Open MPI comparing a native run and a run with a container . 31 5.2. Run time in seconds of the heat equation run on Jureca with Open MPI comparing a native run and a run with a container . 32 5.3. Run time in seconds of the Allen-Cahn equation run on Jureca with Open MPI comparing a native run and a run with a container 33 5.4. Run time in seconds of the Allen-Cahn equation run on Jureca on multiple nodes with Open MPI comparing a native run and a run with a container . 33 5.5. Run time in seconds of the Allen-Cahn equation run on Jusuf with Open MPI compared to the runtime on Jureca run with a container . 35 VIII 5.6. Run time in seconds of the Allen-Cahn equation run on Jureca with Intel MPI comparing a native run and a run with a con- tainer . 37 5.7. Run time in seconds of the Allen-Cahn equation run on Jureca on multiple nodes with Intel MPI comparing a native run and a run with a container . 38 5.8. Run time in seconds of the Allen-Cahn equation run on Jureca on multiple nodes comparing Open MPI and Intel MPI as well a native run and a run with a container . 38 IX List of Tables 5.1. Runtime in seconds of the Allen-Cahn equation with Parareal and IMEX Euler as integrator, nc = 128 and nf = 10 compared for different hosts . 36 X Listings 4.1. Header of the definition file . 27 4.2. Environment of the definition file of the Julia Open MPI container 27 4.3. Installation of Julia into the container . 28 4.4. Installation of Open MPI into the Container . 29 4.5. Binding MPI into Julia . 30 4.6. Building of Julia packages . 30 5.1. Specifications of the container with Intel MPI . 37 A.1. Definition file Open MPI . 41 A.2. Definition file Intel MPI . 44 XI 1. Motivation Classical numerical methods to solve initial value problems run serial in time, which means the time is divided into individual time steps and one step is pro- cessed after the next. To speed up solving the problems mostly parallelization in the spatial domain is used especially since supercomputers are easier to use parallelization as a tool to reduce the run time is very common. The fast devel- opment of supercomputers leads to limitations where more computing power cannot save you run time. An idea to solve this problem is a parallelization in the time domain. Most of the time when working on parallelization C or Python is used. C is popular for achieving a high execution speed. In Python on the other hand it is easy to write code. A new language that is trying to combine these two ben- efits is Julia. Julia is a programming language that was specifically designed to be used for numerical applications and high-performance computing. Ju- lia [Bez+14] is a dynamic high-performance programming language. The first stable open-source version was released in February 2012, the development of the language began in 2009. Julia is mainly used in scientific computing. The language Julia is in contrast to other programming languages almost ex- clusively written in Julia itself. Only basic functions, such as integer opera- tions, for loops, recursion or float operations, use C operations or work with C structures. Julia works with a Just-in-Time (JIT) compiler. The code is converted to machine code during compilation. Just-in-time compilation dif- fers from so-called Ahead-of-Time (AOT) compilations in that these programs compile during run time, whereas AOT compilers compile before run time. The parser, which converts the Julia code to machine code, is implemented in Scheme [Dyb09] and is converted to optimized machine code using the LLVM compiler framework [LA04], a virtual machine. For this purpose, the frame- work uses the compiler backend of Clang for C/C++, which provides excel- lent auto-SIMD vectorization and can lead to high performance. This happens even though Julia is a dynamic language since this usually only occurs in static programming languages. Julia was also developed to make parallel implemen- tation, whether shared memory computing or distributed memory computing, easier.
Recommended publications
  • MPI–Openmp Algorithms for the Parallel Space–Time Solution of Time Dependent Pdes
    MPI–OpenMP algorithms for the parallel space–time solution of Time Dependent PDEs Ronald D. Haynes1 and Benjamin W. Ong2 1 Introduction Modern high performance computers offer hundreds of thousands of processors that can be leveraged, in parallel, to compute numerical solutions to time dependent par- tial differential equations (PDEs). For grid-based solutions to these PDEs, domain decomposition (DD) is often employed to add spatial parallelism [19]. Parallelism in the time variable is more difficult to exploit due to the inherent causality. Recently, researchers have explored this issue as a means to improve the scalability of existing parallel spatial solvers applied to time dependent problems. There are several general approaches to combine temporal parallelism with spatial parallelism. Waveform relaxation [15] is an example of a “parallel across the prob- lem” method. The “parallel across the time domain” approaches include the parareal method [11, 17, 16]. The parareal method decomposes a time domain into smaller temporal subdomains and alternates between applying a coarse (relatively fast) se- quential solver to compute an approximate (not very accurate) solution, and apply- ing a fine (expensive) solver on each temporal subdomain in parallel. Alternatively, one can consider “parallel across the step” methods. Examples of such approaches include the computation of intermediate Runge–Kutta stage values in parallel [18], and Revisionist Integral Deferred Correction (RIDC) methods, which are the family of parallel time integrators considered in this paper. Parallel across the step meth- ods allow for “small scale” parallelism in time. Specifically, we will show that if a DD implementation scales to Nx processors, a RIDC-DD parallelism will scale to Nt × Nx processors, where Nt < 12 in practice.
    [Show full text]
  • Numerical Investigation of Parallel-In-Time Methods for Dominantly Hyperbolic Equations
    Numerical Investigation of Parallel-in-Time Methods for Dominantly Hyperbolic Equations Vom Fachbereich Maschinenbau an der Technischen Universität Darmstadt zur Erlangung des akademischen Grades eines Doktor-Ingenieurs (Dr.-Ing.) genehmigte Dissertation vorgelegt von Andreas Schmitt, M.Sc. aus Düsseldorf Berichterstatter: Prof. Dr. rer. nat. Michael Schäfer Mitberichterstatter: Prof. Dr. rer. nat. Sebastian Schöps Tag der Einreichung: 23.07.2018 Tag der mündlichen Prüfung: 10.10.2018 Darmstadt 2019 D 17 Schmitt, Andreas Numerical Investigation of Parallel-in-Time Methods for Dominantly Hyperbolic Equations Technische Universität Darmstadt Tag der mündlichen Prüfung: 10.10.2018 Jahr der Veröffentlichung auf TUprints: 2019 URN: urn:nbn:de:tuda-tuprints-83286 Veröffentlicht unter CC BY-NC-ND 4.0 International https://creativecommons.org/licences/by-nc-nd/4.0/ Vorwort Die vorliegende Arbeit “Numerical Investigation of Parallel-in-Time Methods for Dominantly Hyperbolic Equations” ist mein Beitrag zum Forschungsgebiet der zeitparallelen Methoden. Diese Dissertation wurde als Abschlussarbeit meines Promotionsvorhabens an der Technischen Universität Darmstadt am Fachgebiet für Numerische Berechnungsverfahren im Maschinenbau und der Graduiertenschule Computational Engineering verfasst. Mit den Inhalten der Dissertation habe ich mich von Mai 2015 bis Juli 2018 beschäftigt. Ich möchte mich bei Prof. Michael Schäfer für die Möglichkeit zur Pro- motion, den Vorschlag des spannenden Themengebietes und die Betreuung bedanken. Insbesondere danke ich Ihm auch für die gegebenen Freiheiten, so dass ich das Thema nach meinen Interessen erforschen konnte. Des Weiteren danke ich Prof. Sebastian Schöps für die Übernahme der Mitberichterstatter- schaft und die konstruktiven Gespräche. Für viele konstruktive Gespräche möchte ich mich auch bei den Mitentwick- lern von SWEET bedanken.
    [Show full text]
  • A Parareal Algorithm for Optimality Systems Martin Gander, Félix Kwok, Julien Salomon
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Archive Ouverte en Sciences de l'Information et de la Communication PARAOPT: A parareal algorithm for optimality systems Martin Gander, Félix Kwok, Julien Salomon To cite this version: Martin Gander, Félix Kwok, Julien Salomon. PARAOPT: A parareal algorithm for optimality systems. 2019. hal-02346535 HAL Id: hal-02346535 https://hal.archives-ouvertes.fr/hal-02346535 Submitted on 5 Nov 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. PARAOPT: A PARAREAL ALGORITHM FOR OPTIMALITY SYSTEMS MARTIN J. GANDER, FELIX KWOK, AND JULIEN SALOMON Abstract. The time parallel solution of optimality systems arising in PDE constraint optimization could be achieved by simply applying any time paral- lel algorithm, such as Parareal, to solve the forward and backward evolution problems arising in the optimization loop. We propose here a different strat- egy by devising directly a new time parallel algorithm, which we call ParaOpt, for the coupled forward and backward non-linear partial differential equations. ParaOpt is inspired by the Parareal algorithm for evolution equations, and thus is automatically a two-level method.
    [Show full text]
  • Asynchronous Iterations of Parareal Algorithm for Option Pricing Models Frédéric Magoulès, Guillaume Gbikpi-Benissan, Qinmeng Zou
    Asynchronous Iterations of Parareal Algorithm for Option Pricing Models Frédéric Magoulès, Guillaume Gbikpi-Benissan, Qinmeng Zou To cite this version: Frédéric Magoulès, Guillaume Gbikpi-Benissan, Qinmeng Zou. Asynchronous Iterations of Parareal Algorithm for Option Pricing Models. Mathematics , MDPI, 2018, 6 (4), pp.45. 10.3390/math6040045. hal-01741114v2 HAL Id: hal-01741114 https://hal.archives-ouvertes.fr/hal-01741114v2 Submitted on 27 Jun 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Asynchronous Iterations of Parareal Algorithm for Option Pricing Models Frédéric Magoulès∗ Guillaume Gbikpi-Benissany Qinmeng Zouz Abstract Spatial domain decomposition method has been largely investigated in the last several decades, while time decomposition seems against intuition, which is not as popular as the former. However, there are still many attractive methods being proposed, espe- cially the parareal algorithm, which shows both theoretical and experimental efficiency in the context of parallel computing. In this paper, we present an original model of asynchronous variant based on parareal scheme, applied to the European option pricing problem. Some numerical experiments are given to illustrate the convergence perfor- mance and computational efficiency of such method. Keywords: Parallel computing; asynchronous iterations; parareal method; European options; domain decomposition; time-dependent problems 1 Introduction Today’s dominating high-performance computer architecture is parallel.
    [Show full text]
  • Θ-Parareal Schemes
    θ-parareal schemes Gil Ariel,∗ Hieu Nguyen†and Richard Tsai‡ October 17, 2018 Abstract A weighted version of the parareal method for parallel-in-time computation of time dependent problems is presented. Linear stability analysis for a scalar weighing strat- egy shows that the new scheme may enjoy favorable stability properties with marginal reduction in accuracy at worse. More complicated matrix-valued weights are analyzed and applied in numerical examples. The weights are optimized using information from past iterations, providing a systematic framework for using the parareal iterations as an approach to multiscale coupling. The advantage of the method is demonstrated using numerical examples, including some well-studied nonlinear Hamiltonian systems. 1 Introduction Parallelization of computation for spatial domain, such as the standard domain decomposi- tion methods, has been extensively developed and successfully applied to many important applications. Due to causality, parallel-in-time computations have not been as successful as parallel computations in space. However, numerical simulations will not benefit from available exa-scale computing power unless parallelization-in-time can be performed. De- spite recent advances, the presence of strong causalities in the sense that local perturbations arXiv:1704.06882v2 [math.NA] 7 Feb 2018 are not damped out by the system’s dissipation, e.g. in hyperbolic problems and fast os- cillations in the solutions, typically hinders the efficiency of such types of algorithms. For example, approaches involving shooting and Newton’s solvers may become virtually unus- able. It is widely recognized that robust and convergent numerical computation using such parallel-in-time algorithms still remains a main challenge.
    [Show full text]
  • Accelerating Chemical Simulation Through Model Modification
    Masaryk University Faculty of Informatics Accelerating Chemical Simulation through Model Modification Doctoral Thesis Jana Pazúriková Brno, Fall 2017 Masaryk University Faculty of Informatics Accelerating Chemical Simulation through Model Modification Doctoral Thesis Jana Pazúriková Brno, Fall 2017 Declaration Hereby I declare that this paper is my original authorial work, which I have worked out on my own. All sources, references, and literature used or excerpted during elaboration of this work are properly cited and listed in complete reference to the due source. Jana Pazúriková Advisor: Luděk Matyska i Acknowledgement The years of my doctoral studies have been filled with life lessons and people who helped me to get through them. Let’s tackle those lessons first. Working on such a huge project for years, mostly alone, as I have, often can be (and sure was) overwhelming. I have fought self-doubt and the imposter syndrome almost every week and I have not won. However, if you open up and share, you find out that everyone has its doubts. And suddenly, you are not alone. I have invested enourmous time and energy to the research of one of the problems and the results did not correspond with the effort. I ended up with more problems to solve than I had at the beginning and little to show for years of focus. Science can be like that sometimes, even the most elegant solutions might not work despite reasonable expectations. Leaving behind so much of my work, so much of my time has been the most challenging and at the same time the most freeing thing I have done.
    [Show full text]
  • Modified Parareal Method for Solving the Two-Dimensional Nonlinear
    Modified parareal method for solving the two-dimensional nonlinear shallow water equations using finite volumes Joao Guilherme Caldas Steinstraesser, Vincent Guinot, Antoine Rousseau To cite this version: Joao Guilherme Caldas Steinstraesser, Vincent Guinot, Antoine Rousseau. Modified parareal method for solving the two-dimensional nonlinear shallow water equations using finite volumes. In press. hal-02894841 HAL Id: hal-02894841 https://hal.inria.fr/hal-02894841 Preprint submitted on 9 Jul 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Journal of Computational Mathematics Vol. 00, Pages 000{000 (XXXX) Modified parareal method for solving the two-dimensional nonlinear shallow water equations using finite volumes Joao G. Caldas Steinstraesser 1 Vincent Guinot 2 Antoine Rousseau 3 1 Inria, IMAG, Univ Montpellier, CNRS, Montpellier, France Email address: [email protected] 2 Univ Montpellier, HSM, CNRS, IRD, Inria, Montpellier, France Email address: [email protected] 3 Inria, IMAG, Univ Montpellier, CNRS, Montpellier, France Email address: [email protected]. Abstract. In this work, the POD-DEIM-based parareal method introduced in [6] is implemented for the resolution of the two-dimensional nonlinear shallow water equations using a finite volume scheme.
    [Show full text]
  • A Hybrid Parareal Spectral Deferred Corrections Method Michael L.Minion
    Communications in Applied Mathematics and Computational Science A HYBRID PARAREAL SPECTRAL DEFERRED CORRECTIONS METHOD MICHAEL L. MINION vol. 5 no. 2 2010 mathematical sciences publishers COMM. APP. MATH. AND COMP. SCI. Vol. 5, No. 2, 2010 msp A HYBRID PARAREAL SPECTRAL DEFERRED CORRECTIONS METHOD MICHAEL L. MINION The parareal algorithm introduced in 2001 by Lions, Maday, and Turinici is an iterative method for the parallelization of the numerical solution of ordinary differential equations or partial differential equations discretized in the temporal direction. The temporal interval of interest is partitioned into successive domains which are assigned to separate processor units. Each iteration of the parareal algorithm consists of a high accuracy solution procedure performed in parallel on each domain using approximate initial conditions and a serial step which propagates a correction to the initial conditions through the entire time interval. The original method is designed to use classical single-step numerical methods for both of these steps. This paper investigates a variant of the parareal algorithm first outlined by Minion and Williams in 2008 that utilizes a deferred correction strategy within the parareal iterations. Here, the connections between parareal, parallel deferred corrections, and a hybrid parareal-spectral deferred correction method are further explored. The parallel speedup and efficiency of the hybrid methods are analyzed, and numerical results for ODEs and discretized PDEs are presented to demonstrate the performance of the hybrid approach. 1. Introduction The prospect of parallelizing the numerical solution of ordinary differential equations (ODEs) in the temporal direction has been the topic of research dating back at least to the early work of Nievergelt[55] and Miranker and Liniger[54].
    [Show full text]
  • Real Time Computing with the Parareal Algorithm
    Florida State University Libraries Electronic Theses, Treatises and Dissertations The Graduate School 2008 Realtime Computing with the Parareal Algorithm Christopherr.Harden Follow this and additional works at the FSU Digital Library. For more information, please contact [email protected] FLORIDA STATE UNIVERSITY COLLEGE OF ARTS AND SCIENCES REALTIME COMPUTING WITH THE PARAREAL ALGORITHM By CHRISTOPHERR.HARDEN A Thesis submitted to the School of Computational Science in partial fulfillment of the requirements for the degree of Master of Science Degree Awarded: Spring Semester, 2008 The members of the Committee approve the Masters Thesis of Christopher R. Harden defended on April 8, 2008. Janet Peterson Professor Directing Masters Thesis Max Gunzburger Committee Member Robert Van Engelen Committee Member Approved: Max Gunzburger, Director Department of School of Computational Science The Office of Graduate Studies has verified and approved the above named committee members. ii This thesis is dedicated to all of the people who have helped and guided me throughout my research including but not limited to Janet Peterson, Max Gunzburger, John Burkardt, Robert Van Engelen, and the many other professors who have provided me with an excellent level of instruction throughout all of my course work here at FSU. Also, I would like to dedicate this work to my wife, Jennifer Alligood, and my son, Youth, for their infinite depth of understanding of my situation as a graduate student and for their unending support throughout this endeavor. iii ACKNOWLEDGEMENTS I would like to acknowledge my gratitude to my committee members, who have taken the time to review my work and to pose the difficult questions which have kept me honest and thus have allowed me to grow as an academic throughout this process.
    [Show full text]
  • Parareal Neural Networks Emulating a Parallel-In-Time Algorithm∗
    Parareal Neural Networks Emulating a Parallel-in-time Algorithm∗ Chang-Ock Leey , Youngkyu Leey , and Jongho Park z Abstract. As deep neural networks (DNNs) become deeper, the training time increases. In this perspective, multi-GPU parallel computing has become a key tool in accelerating the training of DNNs. In this paper, we introduce a novel methodology to construct a parallel neural network that can utilize multiple GPUs simultaneously from a given DNN. We observe that layers of DNN can be interpreted as the time steps of a time-dependent problem and can be parallelized by emulating a parallel-in-time algorithm called parareal. The parareal algorithm consists of fine structures which can be implemented in parallel and a coarse structure which gives suitable approximations to the fine structures. By emulating it, the layers of DNN are torn to form a parallel structure, which is connected using a suitable coarse network. We report accelerated and accuracy-preserved results of the proposed methodology applied to VGG-16 and ResNet-1001 on several datasets. Key words. deep neural network, parallel computing, time-dependent problem, parareal AMS subject classifications. 68T01, 68U10, 68W10 1. Introduction. Deep neural networks (DNNs) have demonstrated success for many classification and regression tasks such as image recognition [15, 22] and natural language processing [5, 19]. A principal reason for why DNN performs well is the depth of DNN, i.e., the number of sequential layers of DNN. Each layer of DNN is composed of a combination of an affine transformation and a nonlinear activation function, e.g., a rectified linear unit (ReLU).
    [Show full text]
  • A Parareal Algorithm for Optimality Systems Martin Gander, Félix Kwok, Julien Salomon
    PARAOPT: A parareal algorithm for optimality systems Martin Gander, Félix Kwok, Julien Salomon To cite this version: Martin Gander, Félix Kwok, Julien Salomon. PARAOPT: A parareal algorithm for optimality systems. SIAM Journal on Scientific Computing, Society for Industrial and Applied Mathematics, 2020, 42(5), pp.A2773–A2802. 10.1137/19M1292291. hal-02346535v2 HAL Id: hal-02346535 https://hal.archives-ouvertes.fr/hal-02346535v2 Submitted on 10 Jul 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. PARAOPT: A PARAREAL ALGORITHM FOR OPTIMALITY SYSTEMS MARTIN J. GANDER, FELIX KWOK, AND JULIEN SALOMON Abstract. The time parallel solution of optimality systems arising in PDE constrained optimization could be achieved by simply applying any time par- allel algorithm, such as Parareal, to solve the forward and backward evolution problems arising in the optimization loop. We propose here a different strat- egy by devising directly a new time parallel algorithm, which we call ParaOpt, for the coupled forward and backward nonlinear partial differential equations. ParaOpt is inspired by the Parareal algorithm for evolution equations, and thus is automatically a two-level method. We provide a detailed convergence analysis for the case of linear parabolic PDE constraints.
    [Show full text]
  • Shared Memory Pipelined Parareal
    This is a repository copy of Shared Memory Pipelined Parareal. White Rose Research Online URL for this paper: http://eprints.whiterose.ac.uk/115836/ Version: Accepted Version Proceedings Paper: Ruprecht, D orcid.org/0000-0003-1904-2473 (2017) Shared Memory Pipelined Parareal. In: Lecture Notes in Computer Science. Euro-Par 2017: 23rd International European Conference on Parallel and Distributed Computing, 28 Aug - 01 Sep 2017, Santiago de Compostela, Spain. Springer Verlag , pp. 669-681. ISBN 978-3-319-64202-4 https://doi.org/10.1007/978-3-319-64203-1_48 © 2017, Springer International Publishing AG. This is an author produced version of a paper published in Lecture Notes in Computer Science. The final publication is available at Springer via https://doi.org/10.1007/978-3-319-64203-1_48. Uploaded in accordance with the publisher's self-archiving policy. Reuse Items deposited in White Rose Research Online are protected by copyright, with all rights reserved unless indicated otherwise. They may be downloaded and/or printed for private study, or other acts as permitted by national copyright laws. The publisher or other rights holders may allow further reproduction and re-use of the full text version. This is indicated by the licence information on the White Rose Research Online record for the item. Takedown If you consider content in White Rose Research Online to be in breach of UK law, please notify us by emailing [email protected] including the URL of the record and the reason for the withdrawal request. [email protected] https://eprints.whiterose.ac.uk/ Shared memory pipelined Parareal Daniel Ruprecht1,2 1 School of Mechanical Engineering, LS2 9JT, Leeds, United Kingdom [email protected] 2 Institute of Computational Science, Universit`adella Svizzera italiana, Via Giuseppe Buffi 13, CH-6900 Lugano, Switzerland Abstract.
    [Show full text]