Arxiv:1911.09220V2 [Cs.MS] 13 Jul 2020
Total Page:16
File Type:pdf, Size:1020Kb

Load more
Recommended publications
-
Accelerating the LOBPCG Method on Gpus Using a Blocked Sparse Matrix Vector Product
Accelerating the LOBPCG method on GPUs using a blocked Sparse Matrix Vector Product Hartwig Anzt and Stanimire Tomov and Jack Dongarra Innovative Computing Lab University of Tennessee Knoxville, USA Email: [email protected], [email protected], [email protected] Abstract— the computing power of today’s supercomputers, often accel- erated by coprocessors like graphics processing units (GPUs), This paper presents a heterogeneous CPU-GPU algorithm design and optimized implementation for an entire sparse iter- becomes challenging. ative eigensolver – the Locally Optimal Block Preconditioned Conjugate Gradient (LOBPCG) – starting from low-level GPU While there exist numerous efforts to adapt iterative lin- data structures and kernels to the higher-level algorithmic choices ear solvers like Krylov subspace methods to coprocessor and overall heterogeneous design. Most notably, the eigensolver technology, sparse eigensolvers have so far remained out- leverages the high-performance of a new GPU kernel developed side the main focus. A possible explanation is that many for the simultaneous multiplication of a sparse matrix and a of those combine sparse and dense linear algebra routines, set of vectors (SpMM). This is a building block that serves which makes porting them to accelerators more difficult. Aside as a backbone for not only block-Krylov, but also for other from the power method, algorithms based on the Krylov methods relying on blocking for acceleration in general. The subspace idea are among the most commonly used general heterogeneous LOBPCG developed here reveals the potential of eigensolvers [1]. When targeting symmetric positive definite this type of eigensolver by highly optimizing all of its components, eigenvalue problems, the recently developed Locally Optimal and can be viewed as a benchmark for other SpMM-dependent applications. -
Elmer Finite Element Software for Multiphysical Problems
Elmer finite element software for multiphysical problems Peter Råback ElmerTeam CSC – IT Center for Science 10/04/2014 PRACE Spring School, 15-17 April, 2014 1 Elmer track in PRACE spring school • Wed 16th, 10.45-12.15=1.5 h Introduction to Elmer finite element software • Wed 16th, 13.30-15.00=1.5 h Hands-on session using ElmerGUI • Wed 16th, 15.15-16.45=1.5 h OpenLab • Thu 17th, 10.45-12.15=1.5 h Advanced use of Elmer • Thu 17th, 13.30-14.30=1 h Hands-on session on advanced features and parallel computation • Thu 17th, 14.45-15.45=1h Hands-on & improviced 10/04/2014 2 Elmer finite element software for multiphysical problems Figures by Esko Järvinen, Mikko Lyly, Peter Råback, Timo Veijola (TKK) & Thomas Zwinger Short history of Elmer 1995 Elmer development was started as part of a national CFD program – Collaboration of CSC, TKK, VTT, JyU, and Okmetic Ltd. 2000 After the initial phase the development driven by number of application projects – MEMS, Microfluidics, Acoustics, Crystal Growth, Hemodynamics, Glaciology, … 2005 Elmer published under GPL-license 2007 Elmer version control put under sourceforge.net – Resulted to a rapid increase in the number of users 2010 Elmer became one of the central codes in PRACE project 2012 ElmerSolver library published under LGPL – More freedom for serious developers Elmer in numbers ~350,000 lines of code (~2/3 in Fortran, 1/3 in C/C++) ~500 code commits yearly ~280 consistency tests in 3/2014 ~730 pages of documentation in LaTeX ~60 people participated on Elmer courses in 2012 9 Elmer related visits -
MFEM: a Modular Finite Element Methods Library
MFEM: A Modular Finite Element Methods Library Robert Anderson1, Andrew Barker1, Jamie Bramwell1, Jakub Cerveny2, Johann Dahm3, Veselin Dobrev1,YohannDudouit1, Aaron Fisher1,TzanioKolev1,MarkStowell1,and Vladimir Tomov1 1Lawrence Livermore National Laboratory 2University of West Bohemia 3IBM Research July 2, 2018 Abstract MFEM is a free, lightweight, flexible and scalable C++ library for modular finite element methods that features arbitrary high-order finite element meshes and spaces, support for a wide variety of discretization approaches and emphasis on usability, portability, and high-performance computing efficiency. Its mission is to provide application scientists with access to cutting-edge algorithms for high-order finite element meshing, discretizations and linear solvers. MFEM also enables researchers to quickly and easily develop and test new algorithms in very general, fully unstructured, high-order, parallel settings. In this paper we describe the underlying algorithms and finite element abstractions provided by MFEM, discuss the software implementation, and illustrate various applications of the library. Contents 1 Introduction 3 2 Overview of the Finite Element Method 4 3Meshes 9 3.1 Conforming Meshes . 10 3.2 Non-Conforming Meshes . 11 3.3 NURBS Meshes . 12 3.4 Parallel Meshes . 12 3.5 Supported Input and Output Formats . 13 1 4 Finite Element Spaces 13 4.1 FiniteElements....................................... 14 4.2 DiscretedeRhamComplex ................................ 16 4.3 High-OrderSpaces ..................................... 17 4.4 Visualization . 18 5 Finite Element Operators 18 5.1 DiscretizationMethods................................... 18 5.2 FiniteElementLinearSystems . 19 5.3 Operator Decomposition . 23 5.4 High-Order Partial Assembly . 25 6 High-Performance Computing 27 6.1 Parallel Meshes, Spaces, and Operators . 27 6.2 Scalable Linear Solvers . -
Overview of Elmer
Overview of Elmer Peter Råback and Mika Malinen CSC – IT Center for Science October 19, 2012 1 2 Copyright This document is licensed under the Creative Commons Attribution-No Derivative Works 3.0 License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nd/3.0/. 1 Introduction This exposition gives an overview of the Elmer software package. General information on the capabilities of the software, its usage, and how the material of the package is organized is presented. More detailed information is given in the other Elmer manuals, the scopes of which are described in this document. What is Elmer Elmer is a finite element software package for the solution of partial differential equations. Elmer can deal with a great number of different equations, which may be coupled in a generic manner making Elmer a versatile tool for multiphysical simulations. As an open source software, Elmer also gives the user the means to modify the existing solution procedures and to develop new solvers for equations of interest to the user. History of Elmer The development of Elmer was started in 1995 as part of a national CFD technology program funded by the Finnish funding agency for technology and innovation, Tekes. The original development consortia included partners from CSC – IT Center for Science (formely known as CSC – Scientific Computing). Helsinki University of Technology TKK, VTT Technical Research Centre of Finland, University of Jyväskylä, and Okmetic Ltd. CSC is a governmental non-profit company fully owned by the Ministry of Education. After the five years initial project ended the development has been continued by CSC in different application fields. -
FELICITY: a MATLAB/C++ TOOLBOX for DEVELOPING FINITE ELEMENT METHODS and SIMULATION MODELING\Ast
SIAM J. SCI.COMPUT. \bigcircc 2018 Society for Industrial and Applied Mathematics Vol. 40, No. 2, pp. C234{C257 FELICITY: A MATLAB/C++ TOOLBOX FOR DEVELOPING FINITE ELEMENT METHODS AND SIMULATION MODELING\ast SHAWN W. WALKERy Abstract. This paper describes a MATLAB/C++ finite element toolbox, called FELICITY, for simulating various types of systems of partial differential equations (e.g., coupled elliptic/parabolic problems) using the finite element method. It uses MATLAB in an object-oriented way for high-level manipulation of data structures in finite element codes, while utilizing a domain-specific language (DSL) and code generation to automate low-level tasks such as matrix assembly (via the MATLAB mex interface). We describe the fundamental functionality of the toolbox's MATLAB interface, such as using higher order Lagrange (simplicial) meshes, defining finite element spaces, allocating degrees- of-freedom, assembling discrete bilinear and linear forms, and interpolation over meshes. Moreover, we describe in-depth how automatic code generation is implemented in FELICITY. Two example problems and their implementation are provided to demonstrate the ability of FELICITY to solve coupled problems with interacting subdomains of different co-dimension. Future improvements are also discussed. Key words. finite elements, coupled systems, geometric flows, code generation, MATLAB, open source software AMS subject \bfc \bfl \bfa \bfs \bfifi\bfc\bfa\bft\bfo\bfn. 68N30, 65N30, 65M60, 68N19, 68N20 DOI. 10.1137/17M1128745 1. Introduction. The development of numerical methods to solve partial dif- ferential equations (PDEs) continues to advance to address new domain areas and problems of increasing complexity. With this, the number of software packages has increased to address the needs for researching new methods and simulating large prob- lems. -
Mofem: an Open Source, Parallel Finite Element Library
MoFEM: An open source, parallel finite element library Kaczmarczyk, Ł., Ullah, Z., Lewandowski, K., Meng, X., Zhou, X-Y., Athanasiadis, I., Nguyen, H., Chalons- Mouriesse, C-A., Richardson, E. J., Miur, E., Shvarts, A. G., Wakeni, M., & Pearce, C. J. (2020). MoFEM: An open source, parallel finite element library. The Journal of Open Source Software, 5(45). https://doi.org/doi.org/10.21105/joss.01441 Published in: The Journal of Open Source Software Document Version: Publisher's PDF, also known as Version of record Queen's University Belfast - Research Portal: Link to publication record in Queen's University Belfast Research Portal Publisher rights © 2020 The Authors. This is an open access article published under a Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium, provided the author and source are cited. General rights Copyright for the publications made accessible via the Queen's University Belfast Research Portal is retained by the author(s) and / or other copyright owners and it is a condition of accessing these publications that users recognise and abide by the legal requirements associated with these rights. Take down policy The Research Portal is Queen's institutional repository that provides access to Queen's research output. Every effort has been made to ensure that content in the Research Portal does not infringe any person's rights, or applicable UK laws. If you discover content in the Research Portal that you believe breaches copyright or violates any law, please contact [email protected]. Download date:06. -
Open-Source Automatic Nonuniform Mesh Generation for FDTD Simulation
This is a repository copy of Structured Mesh Generation : Open-source automatic nonuniform mesh generation for FDTD simulation. White Rose Research Online URL for this paper: https://eprints.whiterose.ac.uk/100155/ Version: Accepted Version Article: Berens, Michael, Flintoft, Ian David orcid.org/0000-0003-3153-8447 and Dawson, John Frederick orcid.org/0000-0003-4537-9977 (2016) Structured Mesh Generation : Open- source automatic nonuniform mesh generation for FDTD simulation. IEEE Antennas and Propagation Magazine. pp. 45-55. ISSN 1045-9243 https://doi.org/10.1109/MAP.2016.2541606 Reuse Items deposited in White Rose Research Online are protected by copyright, with all rights reserved unless indicated otherwise. They may be downloaded and/or printed for private study, or other acts as permitted by national copyright laws. The publisher or other rights holders may allow further reproduction and re-use of the full text version. This is indicated by the licence information on the White Rose Research Online record for the item. Takedown If you consider content in White Rose Research Online to be in breach of UK law, please notify us by emailing [email protected] including the URL of the record and the reason for the withdrawal request. [email protected] https://eprints.whiterose.ac.uk/ 1 Open Source Automatic Non-uniform Mesh Generation for FDTD Simulation Michael K. Berens, Ian D. Flintoft, Senior Member, IEEE, and John F. Dawson, Member, IEEE, Abstract—This article describes a cuboid structured mesh generator suitable for 3D numerical modelling using techniques such as finite-difference time-domain (FDTD) and transmission-line matrix (TLM). -
Aparallel Implementation of a Femsolver in Scilab
powered by A PARALLEL IMPLEMENTATION OF A FEM SOLVER IN SCILAB Author: Massimiliano Margonari Keywords. Scilab; Open source software; Parallel computing; Mesh partitioning, Heat transfer equation. Abstract: In this paper we describe a parallel application in Scilab to solve 3D stationary heat transfer problems using the finite element method. Two benchmarks have been done: we compare the results with reference solutions and we report the speedup to measure the effectiveness of the approach. Contacts [email protected] This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. EnginSoft SpA - Via della Stazione, 27 - 38123 Mattarello di Trento | P.I. e C.F. IT00599320223 A parallel FEM solver in Scilab 1. Introduction Nowadays many simulation software have the possibility to take advantage of multi- processors/cores computers in the attempt to reduce the solution time of a given task. This not only reduces the annoying delays typical in the past, but allows the user to evaluate larger problems and to do more detailed analyses and to analyze a greater number of scenarios. Engineers and scientists who are involved in simulation activities are generally familiar with the terms “high performance computing” (HPC). These terms have been coined to indicate the ability to use a powerful machine to efficiently solve hard computational problems. One of the most important keywords related to the HPC is certainly parallelism. The total execution time can be reduced if the original problem can be divided into a given number of subtasks. Total time is reduced because these subtasks can be tackled concurrently, that means in parallel, by a certain number of machines or cores. -
XAMG: a Library for Solving Linear Systems with Multiple Right-Hand
XAMG: A library for solving linear systems with multiple right-hand side vectors B. Krasnopolsky∗, A. Medvedev∗∗ Institute of Mechanics, Lomonosov Moscow State University, 119192 Moscow, Michurinsky ave. 1, Russia Abstract This paper presents the XAMG library for solving large sparse systems of linear algebraic equations with multiple right-hand side vectors. The library specializes but is not limited to the solution of linear systems obtained from the discretization of elliptic differential equations. A corresponding set of numerical methods includes Krylov subspace, algebraic multigrid, Jacobi, Gauss-Seidel, and Chebyshev iterative methods. The parallelization is im- plemented with MPI+POSIX shared memory hybrid programming model, which introduces a three-level hierarchical decomposition using the corre- sponding per-level synchronization and communication primitives. The code contains a number of optimizations, including the multilevel data segmen- tation, compression of indices, mixed-precision floating-point calculations, arXiv:2103.07329v1 [cs.MS] 12 Mar 2021 vector status flags, and others. The XAMG library uses the program code of the well-known hypre library to construct the multigrid matrix hierar- chy. The XAMG’s own implementation for the solve phase of the iterative methods provides up to a twofold speedup compared to hypre for the tests ∗E-mail address: [email protected] ∗∗E-mail address: [email protected] Preprint submitted to Elsevier March 15, 2021 performed. Additionally, XAMG provides extended functionality to solve systems with multiple right-hand side vectors. Keywords: systems of linear algebraic equations, Krylov subspace iterative methods, algebraic multigrid method, multiple right-hand sides, hybrid programming model, MPI+POSIX shared memory Nr. -
Customizing Paraview
Customizing ParaView Utkarsh A. Ayachit∗ David E. DeMarle† Kitware Inc. ABSTRACT VTK developers must overcome in order to extend ParaView. For- ParaView is an Open Source Visualization Application that scales, tunately the ServerManager includes both Module [2] and server via data parallel processing, to massive problem sizes. The nature side Plugin [3] facilities. The two facilities differ in that Modules of Open Source software means that ParaView has always been well are statically linked into ParaView at compile time, whereas Plug- suited to customization. However having access to the source code ins are dynamically linked into ParaView at run time. Either facility does not imply that it is trivial to extend or reuse the application. An standardizes, through CMake[6] configuration macros, the process ongoing goal of the ParaView project is to make the code simple to by which new types of VTK objects can be added to ParaView. Plu- extend, customize, and change arbitrarily. gin and Module macros turn the build environment definition into In this paper we present the ways by which the application can be a simple procedure call. A standard list of arguments is populated modified to date. We then describe in more detail the latest efforts with information such as the names of the C++ files for the new rou- to simplify the task of reusing the most visible part of ParaView, the tines, and the macro creates the correct platform independent build client GUI application. These efforts include a new CMake config- environment to make a library that is compatible with the applica- uration macro that makes it simple to assemble the major functional tion. -
Usr/Bin/Cmake # You Can Edit This File to Change Values Found and Used by Cmake
# This is the CMakeCache file. # For build in directory: /home/build2 # It was generated by CMake: /usr/bin/cmake # You can edit this file to change values found and used by cmake. # If you do not want to change any of the values, simply exit the editor. # If you do want to change a value, simply edit, save, and exit the editor. # The syntax for the file is as follows: # KEY:TYPE=VALUE # KEY is the name of a variable in the cache. # TYPE is a hint to GUIs for the type of VALUE, DO NOT EDIT TYPE!. # VALUE is the current value for the KEY. ######################## # EXTERNAL cache entries ######################## //Build shared libraries (so/dylib/dll). BUILD_SHARED_LIBS:BOOL=ON //Build testing BUILD_TESTING:BOOL=OFF //Path to a program. BZRCOMMAND:FILEPATH=BZRCOMMAND-NOTFOUND //Path to a program. CMAKE_AR:FILEPATH=/usr/bin/ar //The build mode CMAKE_BUILD_TYPE:STRING=Release //The build type for the paraview project. CMAKE_BUILD_TYPE_paraview:STRING=Release //Enable/Disable color output during build. CMAKE_COLOR_MAKEFILE:BOOL=ON //CXX compiler CMAKE_CXX_COMPILER:FILEPATH=/usr/bin/c++ //A wrapper around 'ar' adding the appropriate '--plugin' option // for the GCC compiler CMAKE_CXX_COMPILER_AR:FILEPATH=/usr/bin/gcc-ar //A wrapper around 'ranlib' adding the appropriate '--plugin' option // for the GCC compiler CMAKE_CXX_COMPILER_RANLIB:FILEPATH=/usr/bin/gcc-ranlib //Flags used by the CXX compiler during all build types. CMAKE_CXX_FLAGS:STRING= //Flags used by the CXX compiler during DEBUG builds. CMAKE_CXX_FLAGS_DEBUG:STRING=-g //Flags used by the CXX compiler during MINSIZEREL builds. CMAKE_CXX_FLAGS_MINSIZEREL:STRING=-Os -DNDEBUG //Flags used by the CXX compiler during RELEASE builds. CMAKE_CXX_FLAGS_RELEASE:STRING=-O2 -DNDEBUG //Flags used by the CXX compiler during RELWITHDEBINFO builds. -
Understanding the Relationships Between Aesthetic Properties Of
Understanding the relationships between aesthetic properties of shapes and geometric quantities of free-form curves and surfaces using Machine Learning Techniques Aleksandar Petrov To cite this version: Aleksandar Petrov. Understanding the relationships between aesthetic properties of shapes and ge- ometric quantities of free-form curves and surfaces using Machine Learning Techniques. Mechanical engineering [physics.class-ph]. Ecole nationale supérieure d’arts et métiers - ENSAM, 2016. English. NNT : 2016ENAM0007. tel-01344873 HAL Id: tel-01344873 https://pastel.archives-ouvertes.fr/tel-01344873 Submitted on 12 Jul 2016 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. N°: 2009 ENAM XXXX 2016-ENAM-0007 PhD THESIS in cotutelle To obtain the degree of Docteur de l’Arts et Métiers ParisTech Spécialité “Mécanique – Conception” École doctorale n° 432: “Science des Métiers de l’ingénieur” and Dottore di Ricerca della Università degli Studi di Genova Specialità “Ingeneria Meccanica” Scuola di Dottorato: “Scienze e tecnologie per l’ingegneria” ciclo XXVI Presented and defended publicly by Aleksandar PETROV January 25th, 2016 Understanding the relationships between aesthetic properties of shapes and geometric quantities of free-form curves and surfaces using Machine Learning Techniques Director of thesis: Philippe VÉRON Co-director of thesis: Franca GIANNINI T Co-supervisors of the thesis: Jean-Philippe PERNOT, Bianca FALCIDIENO H È Jury M.