The US DOE Exascale Computing Project (ECP) Perspective for the HEP Community
Total Page:16
File Type:pdf, Size:1020Kb
The US DOE Exascale Computing Project (ECP) Perspective for the HEP Community Douglas B. Kothe (ORNL), ECP Director Lori Diachin (LLNL), ECP Deputy Director Erik Draeger (LLNL), ECP Deputy Director of Application Development Tom Evans (ORNL), ECP Energy Applications Lead Blueprint Workshop on A Coordinated Ecosystem for HL-LHC Computing R&D Washington, DC October 23, 2019 DOE Exascale Program: The Exascale Computing Initiative (ECI) Three Major Components of the ECI ECI US DOE Office of Science (SC) and National partners Nuclear Security Administration (NNSA) Exascale Selected program Computing office application Project development ECI Accelerate R&D, acquisition, and deployment to (BER, BES, (ECP) deliver exascale computing capability to DOE NNSA) mission national labs by the early- to mid-2020s Exascale system ECI Delivery of an enduring and capable exascale procurement projects & computing capability for use by a wide range facilities focus of applications of importance to DOE and the US ALCF-3 (Aurora) OLCF-5 (Frontier) ASC ATS-4 (El Capitan) 2 ECP Mission and Vision Enable US revolutions in technology development; scientific discovery; healthcare; energy, economic, and national security ECP ECP mission vision Develop exascale-ready applications Deliver exascale simulation and and solutions that address currently data science innovations and intractable problems of strategic solutions to national problems importance and national interest. that enhance US economic competitiveness, change our quality Create and deploy an expanded and of life, and strengthen our national vertically integrated software stack on security. DOE HPC exascale and pre-exascale systems, defining the enduring US exascale ecosystem. Deliver US HPC vendor technology advances and deploy ECP products to DOE HPC pre-exascale and exascale systems. 3 Vision: Exascale Computing Project (ECP) Lifts all U.S. High Performance Computing to a New Trajectory 10X Capability 5X 2016 2021 2022 2023 2024 2025 2026 2027 Time 4 Relevant US DOE Pre-Exascale and Exascale Systems for ECP 5 The three technical areas in ECP have the necessary components to meet national goals Performant mission and science applications @ scale Aggressive RD&D Mission apps & Deployment to DOE Hardware tech Project integrated S/W stack HPC Facilities advances Application Software Hardware Development (AD) Technology (ST) and Integration (HI) Deliver expanded and vertically Integrated delivery of ECP Develop and enhance the integrated software stack to products on targeted systems at predictive capability of achieve full potential of exascale leading DOE HPC facilities applications critical to the DOE computing 6 US HPC vendors focused on 24 applications including 70 unique software products exascale node and system national security, to energy, earth spanning programming models design; application integration systems, economic security, and run times, math libraries, and software deployment to materials, and data data and visualization facilities 6 ECP is a large, complex project Effective project management with three technical focus areas designed to deliver a capable exascale ecosystem Distinctive characteristics A capable exascale computing ecosystem made • RD&D and software development in nature Hardware and possible by integrating ECP applications, software • Two sponsoring DOE programs Integration (HI) and hardware innovations within DOE facilities • Numerous participating institutions Build a comprehensive, coherent software stack that enables the productive development of • Decentralized cost system Software highly parallel applications that effectively target • External project dependence Technology (ST) diverse exascale architectures • Broad and qualitative Develop and enhance predictive capability of mission need requirements applications critical to DOE across science, Application • Outcomes: Products and solutions Development (AD) energy, and national security mission space • Key performance parameters require innovation Measure progress and ensure execution Project within scope, schedule, and budget • Application of scope contingency Management (PM) • End of project transition 7 ECP by the Numbers 7 A seven-year, $1.8 B R&D effort that launched in 2016 YEARS $1.7B Six core DOE National Laboratories: Argonne, Lawrence 6 CORE DOE Berkeley, Lawrence Livermore, Oak Ridge, Sandia, Los Alamos LABS • Staff from most of the 17 DOE national laboratories take part in the project 4 FOCUS Four focus areas: Hardware and Integration, Software Technology, AREAS Application Development, Project Management 81 More than 80 top-notch R&D teams R&D TEAMS 1000 Hundreds of consequential milestones delivered on RESEARCHERS schedule and within budget since project inception 8 Barb Helland Thuc Hoang ECP Organization ASCR Program Manager ASC Program Manager Board of Directors Bill Goldstein, Chair (Director, LLNL) Thomas Zacharia, Vice Chair (Director, ORNL) Dan Hoag Federal Project Director DOE HPC Facilities Laboratory Operations Task Force (LOTF) Exascale Computing Project Core Laboratories Doug Kothe, ORNL Al Geist, ORNL Industry Council Project Director Chief Technology Officer Dave Kepczynski, GE, Chair Lori Diachin, LLNL Deputy Project Director Julia White, ORNL Mike Bernhardt, ORNL Technical Operations Communications Project Management Project Office Support Kathlyn Boudwin, ORNL Megan Fielden, Human Resources Director Willy Besancenez, Procurement Manuel Vigil, LANL Sam Howard, Export Control Analyst Deputy Director Mike Hulsey, Business Management Doug Collins, ORNL Kim Milburn, Finance Officer Associate Director Susan Ochs, Partnerships Michael Johnson, Legal and Points of Contacts at the Monty Middlebrook Doug Collins Project Controls & Risk IT & Quality Core Laboratories Application Software Technology Hardware & Integration Development Mike Heroux, SNL Terri Quinn, LLNL Andrew Siegel, ANL Director Director Director Jonathan Carter, LBNL Susan Coghlan, ANL Erik Draeger, LLNL Deputy Director Deputy Director Deputy Director 9 81 WBS L4 subprojects have set their FY20-23 performance baseline with ECP Work Breakdown Structure (WBS) scope and technical plans to execute on Key leaders at WBS Level 1, 2, 3 RD&D objectives in ECP’s Final Design Exascale Computing Project 2.0 Kothe (ORNL) Project Management Application Development Software Technology Hardware and Integration Application Development Application Development 2.1 2.2 2.3 2.4 2.2 2.2 Boudwin (ORNL) Siegel (ANL) Heroux (SNL) Quinn (LLNL) Project Planning and Chemistry and Materials Programming Models and Chemistry and Materials ChemistryPathForward and Materials Management Applications Runtimes Applications Applications2.4.1 2.1.1 2.2.1 2.3.1 2.2.1 de Supinski2.2.1 (LLNL) Boudwin (ORNL) Deslippe (LBL) Thakur (ANL) Project Controls and Risk Energy Applications Development Tools Hardware Evaluation Management Energy Applications Energy Applications 2.2.2 2.3.2 2.4.2 2.1.2 2.2.2 2.2.2 Evans (ORNL) Vetter (ORNL) Pakin (LANL) Middlebrook (ORNL) Earth and Space Science Application Integration Business Management Earth and Space Science Mathematical Libraries Earth and Space Science Applications at Facilities 2.1.3 Applications 2.3.3 Applications 2.2.3 2.4.3 Hulsey (ORNL) 2.2.3 McInnes (ANL) 2.2.3 Dubey (ANL) Hill (ORNL) Data Analytics and Optimization Software Deployment Procurement Management Data and Visualization Applications at Facilities 2.1.4 2.3.4 2.2.4 2.4.4 Besancenez (ORNL) Ahrens (LANL) Hart (SNL) Adamson (ORNL) Information Technology and National Security Applications Software Ecosystem and Delivery Facility Resource Utilization Quality Management National Security Applications National Security Applications 2.2.5 2.3.5 2.4.5 2.1.5 2.2.5 2.2.5 Francois (LANL) Munson (ANL) White (ORNL) Collins (ORNL) Communications and Outreach Co-Design NNSA Software Technologies Training and Productivity Co-Design Co-Design 2.1.6 2.2.6 2.3.6 2.4.6 2.2.6 2.2.6 Bernhardt (ORNL) Germann (LANL) Neely (LLNL) Barker (ORNL) 10 ECP High Level Schedule and Access to Systems 11 ECP applications target national problems in DOE mission areas National security Energy security Economic security Scientific discovery Earth system Health care Next-generation, Turbine wind plant Additive Cosmological probe Accurate regional Accelerate stockpile efficiency manufacturing of the standard model impact assessments and translate stewardship codes of qualifiable of particle physics in Earth system cancer research Design and metal parts models (partnership with NIH) Reentry-vehicle- commercialization Validate fundamental environment of SMRs Reliable and laws of nature Stress-resistant crop simulation efficient planning analysis and catalytic Nuclear fission of the power grid Plasma wakefield conversion Multi-physics science and fusion reactor accelerator design of biomass-derived simulations of high- materials design Seismic hazard alcohols energy density risk assessment Light source-enabled physics conditions Subsurface use analysis of protein Metagenomics for carbon capture, and molecular for analysis of petroleum extraction, structure and design biogeochemical waste disposal cycles, climate Find, predict, change, High-efficiency, and control materials environmental low-emission and properties remediation combustion engine and gas turbine Predict and control design magnetically confined fusion Scale up of clean plasmas fossil fuel combustion Demystify origin of chemical elements Biofuel catalyst design 12 Co-design Subprojects • Co-design centers address computational