Jet Propulsion Laboratory California Instute of Technology

Mulcore in Space NASA and AFRL Invest in the Future of Flight Compung

Richard Doyle and Raphael Some Jet Propulsion Laboratory, California Instute of Technology

Gabriel Mounce, Air Force Research Laboratory, Kirtland AFB

Wesley Powell, NASA Goddard Space Flight Center

Montgomery Goforth, NASA Johnson Space Center

Stephen Horan, NASA Langley Research Center

Michael Lowry, NASA Ames Research Center

presented at Future In-Space Operaons Working Group September 17, 2014

© 2014 California Instute of Technology. Government sponsorship acknowledged. The NASA Need There are important mission scenarios that today cannot be accomplished robustly and cost-effecvely • Space-based compung has not kept up with the needs of current and future NASA missions • Government and industry are developing high- performance space-qualifiable processors • NASA connues to have unique requirements – Deep space, long duraon, roboc and human missions – Higher performance, smaller spacecra and lower cost • Onboard science data processing • Autonomous operaons and greater resilience – Extreme needs for low power and energy management, efficiency, fault tolerance and resilience

9/16/14 2 NASA Space Technology Mission Directorate The Game Changing Development Program (GCDP) commissioned a High Performance Spaceflight Compung (HPSC) Study

Idenfy Driving Capture Applicaons Desirements (Human) (HEOMD) Derive Requirements Perform Trade: Idenfy Driving Capture Applicaons Desirements Architectures vs. (Roboc) (SMD) KPPs

Perform Gap Analysis Derive Results Review and Evaluate each Analyze Architecture for Compung each Key Make Architectures Performance Recommendaons Parameter (KPP) 9/16/14 3 HPSC Formulaon Phase Study Summary General-purpose mul-core provides opmum ROI for NASA

The Assignment The Results

Idenfy relevant NASA use cases Developed 9 human spaceflight (HEOMD) and 10 science mission (SMD) What are the paradigm-shiing NASA space-based use cases for future flight compung, spanning crical mission applicaons that will drive next generaon flight funcons, high data rate instruments, and autonomy ulizing model- compung? based reasoning techniques Derive requirements 100X performance increase, low power (down to 7W) with scaling, What are the future onboard compung support for a range of fault tolerance, common programming requirements? languages, avoidance of addional V&V effort, interoperable with co- processors Perform a gap analysis No exisng or emerging spaceflight processors possess all necessary How/where do commercial and defense industry performance, power efficiency, reliability, and programmability developments in compung fall short of NASA’s unique aributes requirements and architectural needs? Trade architectures against defined Key Performance Rad-hard general-purpose mul-core best addresses the future flight Parameters (KPPs) compung requirements and presents the most affordable gap against Which compung architecture will make the most the KPPs difference? Make a recommendaon Competed/directed program plan for rad-hard general-purpose mul- How can NASA best invest limited resources to meet core, with soluons for power/energy, fault tolerance and other NASA the future needs of its space systems? requirements, leveraging other agency and industry investments

9/16/14 Specifics appear in the following charts 4 NASA Applicaons for High Performance Spaceflight Compung Human Spaceflight Science Mission (HEOMD) Use Cases (SMD) Use Cases 1. Cloud Services 1. Extreme Terrain Landing 2. Advanced Vehicle Health 2. Proximity Operaons / Formaon Management Flying 3. Crew Knowledge Augmentaon 3. Fast Traverse Systems 4. New Surface Mobility Methods 4. Improved Displays and Controls 5. Imaging Spectrometers 5. Augmented Reality for 6. Radar Recognion and Cataloging 7. Low Latency Products for Disaster 6. Tele-Presence Response 7. Autonomous & Tele-Roboc 8. Space Weather Construcon 9. Science Event Detecon and 8. Automated Guidance, Navigaon, Response and Control (GNC) 10. Immersive Environments for 9. Human Movement Assist Science Ops / Outreach High value and mission crical applicaons idenfied by 9/16/14 NASA sciensts and engineers 5 Science Mission Applicaons 10X improvement for exisng applicaons Enables new science and mission capabilies on future missions

SMD Use Cases Benefits to Missions 1. Extreme Terrain Landing • Extreme Terrain Landing 2. Proximity Operaons / – Enables reliable and safe landing in Formaon Flying hazardous terrain: TRN and HDA 3. Fast Traverse algorithms benchmarked by Mars 4. New Surface Mobility Methods Program - required six (6) dedicated RAD750s 5. Imaging Spectrometers 6. Radar • Fast Traverse 7. Low Latency Products for – Remove computaon as a liming factor Disaster Response to mobility – drive 10X faster and more, safely (wheel slip, obstacle detecon) 8. Space Weather 9. Science Event Detecon and • Science Event Detecon and Response Response 10. Immersive Environments for – Increase capture rate for dynamic, Science Ops / Outreach transient events from ~10% to >75%, with <5% false posives, for increased and more mely science return

No longer need to size science / mission scope 9/16/14 to flight compung capability 6 Deriving Requirements Requirements Template

• Space Environment(s) – Radiaon environment at the me of applicaon; e.g., geosynchronous (GEO), low-Earth orbit (LEO), deep space? • Spacecra Power Environment(s) / Constraint(s) – Available power for avionics and compung, e.g., small spacecra or rover with limited power availability (6 Was processor power, 10-15 Was computer power), medium sized spacecra (7-12 Was processor power, 15-30 Was computer power), or large spacecra with large power budget (>12-25 Was processor power, >30-100 Was computer power)? • Cricality/Fault Tolerance – Is this applicaon life or mission crical, must it operate through faults, can it tolerate errors if detected? • Real-Time – Does the applicaon have a hard real-me deadline; if so, what is the required ming? • Type(s) of Processing – Algorithm kernel(s). Is it primarily e.g., digital signal processing (DSP), data base query, is it parallelizable, is it amenable to a data flow approach? • Memory Access Paern – What is the primary data movement paern, e.g., does it fetch data from memory once and then flow through a processing chain, or does it access data in a connuous random access paern, or does it access sequenal data in a connuous and regular paern? • Duty Cycle – What is the paern of execuon, e.g., is the applicaon called connuously over a long period of me, is it called once and operate for only a short duraon, is the applicaon execuon profile spiky and/or unpredictable? • Data Rate 9/16/14 – What are the I/O and memory access data rates? 7 NASA Flight Compung High-Level Requirements as derived from the NASA use cases

Computaon Mission Need Objecve of Flight Architecture Processor Type and Category Computaon Aribute Requirements

Vision-based • Terrain Relave • Conduct safe proximity • Severe fault tolerance • Hard real me / mission crical Algorithms Navigaon (TRN) operaons around and real-me • Connuous digital signal processing with Real-Time • Hazard Avoidance primive bodies requirements (DSP) + sequenal control processing Requirements • Entry, Descent & • Land safely and • Fail-operaonal (fault protecon) Landing (EDL) accurately • High peak power • High I/O rate • Pinpoint Landing • Achieve robust results needs • Irregular memory use within available • General-purpose (GP) processor meframe as input to (10’s – 100’s GFLOPS) + high I/O rate, control decisions augmented by co-processor(s) Model-Based • Mission planning, • Conngency planning to • High computaonal • So real me / crical Reasoning scheduling & migate execuon complexity • Heurisc search, data base Techniques for resource failures • Graceful degradaon operaons, Bayesian inference Autonomy management • Detect, diagnose and • Memory usage (data • Extreme intensive & irregular • Fault management recover from faults movement) impacts memory use (mul-GB/s) in uncertain energy management • > 1GOPS GP processor arrays with environments low latency interconnect High Rate High resoluon • Downlink images and • Distributed, dedicated • So real me Instrument sensors, e.g., SAR, products rather than raw processors at sensors • DSP/Vector processing with Data Hyper-spectral data • Less stringent fault 10-100’s GOPS (high data flow) Processing • Opportunisc science tolerance • GP array (10-100’s GFLOPS) required for feature ID / triage

9/16/14 Future NASA use cases require dramac improvement over RAD750 (~400 MOPS) 8 Mapping of Use Cases to the Computaon Categories

Crew Knowledge Extreme Terrain Landing Improved Displays & Augmentaon Systems Controls Automated Guidance, Immersive Environments Vision Based Algorithms with Augmented reality for Nav & Controls (GNC) for Science Ops Outreach Real Time Requirements Recognion & Cataloging Human Movement Assist Fast Transverse

Autonomous Mission Planning Advanced Vehicle Health Management Proximity Operaons / Model Based Reasoning Autonomous & Tele- Formaon Flying Techniques Roboc Construcon New Surface Mobility Methods

Imaging Spectrometers Tele-Presence Low Latency Products for High Rate Instrument Data Radar Disaster Response Processing Cloud Services Space Weather 9 HEOMD SMD Compung Architectures Candidates evaluated under the HPSC task

• General-purpose mul-core – Rad-hardened – COTS • DSP mul-core – Rad-hardened – COTS • Reconfigurable compung (e.g., FPGAs) – Rad-hardened – COTS • Graphics processing units (GPUs) – Rad-hardened – COTS • Also: Hybrid architectures ulizing co-processors – General Purpose Mulcore + Reconfigurable Compung – General Purpose Mulcore + GPU – General Purpose Mulcore + DSP Mulcore 9/16/14 10 Key Performance Parameters

Applicaon-referenced KPPs – Computaonal performance – Radiaon and fault tolerance – Power and energy management – Soware verficaon and validaon Architecture-referenced KPPs – Soware verificaon and validaon (this is the single cross-over KPP) – Programmability and flight soware applicability – Interoperability – Extensibility and evolveability Addional KPPs – Non-recurring cost – Recurring cost – Cross-cung applicability across the NASA mission set

Other Constraints

– TRL 5/6 in <=3 years within available resources 11 9/16/14 Evaluaon of the Compung Architectures

Key Performance Rad-hard Parameters Rad-hard Rad-hard Graphics (KPPs) – General Purpose Rad-hard DSP Reconfigurable COTS-based Processing Scoring Multicore Multicore Computing Multicore Units

Total KPP 1 5 2 3 4 score (rank)

#KPP scores 12/12 4/12 7/12 6/12 5/12 above mean

9/16/14 12 Study Recommendaon Rad-hard General Purpose Mul-core

Rad-hard General Purpose Mulcore • Best overall fit to applicaon requirements 3000 – Provides both general purpose and some DSP BEST Existence Proof – (44 GOPS) capability as well as interoperability with co- 2500 processors (DSP, reconfigurable) 2000 • Conducive to Power Scaling at core-level – Power Dissipaon issues to address fit within 1500 SOA available investment resource envelope 1000 • Conducive to thread-based Fault Tolerance 500 SOP

– Fault detecon/correcon/isolaon Power Efficiency (MOPS/W) RAD750 (0.2 GOPS) WORST – Ability to segregate failed cores from the pool of 0 available cores in support of graceful degradaon 0 5 10 15 20 25 • Scalable to 10x increase in the number of cores Power (W) – Combined with power scaling, allows the increased Performance cores to consume power only as thread-load (GOPS) dictates

Computaonal Performance, Efficiency, and Scalability of Mul-core 9/16/14 redefines general-purpose compung for space systems 13 HPSC Formulaon Task Recommendaons Investment Focus and Approach

• Focus on Rad-hard General-purpose Mul-core – Leverage government and industry investments • Issue a BAA for hardware architecture designs in FY13 – Solicit flight compung system concepts – Prepare NASA requirements and benchmarks for early evaluaon of architectures – A compeve inial phase, seeking innovave soluons and early risk rerement • Product of the investment – Mul-core hardware chip with bundled real-me operang system (RTOS) and FSW development environment, integrated on an evaluaon board • Include a directed soware investment – Middleware elements for allocang/managing cores for varying operaonal objecves, working closely with the FSW community, driven by knowledge of the NASA applicaons

Challenge the community to develop an innovave, extremely high performance, low power, flexible, rad-hard GP mul-core processor within available budget and schedule

9/16/14 Enter AFRL! 14 Future Flight Compung AFRL vs. NASA Requirements

Venn Diagram Next Generation Space Processor NASA vs AFRL AFRL NASA

Segregation and Routing around failed/depowered cores

Performance Throughput Memory Power Scalability Radiation Hardened Memory Scheme and Interface Core Power Scalability Radiation Tolerant Replication Based F.T. Fault Tolerant Interconnects Middleware Software Strategy Interconnect Strategy

There was sufficient common ground to jusfy a joint AFRL-NASA program 9/16/14 15 Joint NASA-AFRL Program Concept Next Generaon Space Processor (NGSP)

FY13 FY14 FY15 FY16 FY17 FY18

AFRL Development Phase

NASA

NASA

Inial joint program plan was approved and funding allocated for Innovaon Phase BAA issued, vendors selected, and benchmark development iniated 9/16/14 16 NGSP / HPSC Program Innovaon Phase – Top Level Objecves Summary

• 24 Cores providing 20 GOPS/10GFLOPS, mulple 10Gb/s I/O and DDR 3 Memory Ports at 7 Was • Dynamically power scalable at core level granularity by powering and depowering cores in real me without disrupng system operaon, with very low idle power load (<<1W) • Fault tolerant interconnects between cores and to external I/O and memory devices • Built in self test with faulty core bypass and power up into known good state • Navely supports N-Modular Redundancy where N=2, 3 • Segregaon and roung around failed and depowered cores • Radiaon tolerant to 1MRad, Latch-up Immune, with Single Event Upset (SEU) rate of not greater than 0.01 event/day in Adams 90% worst case GEO environment • Interoperable with other high performance compung architectures, e.g., reconfigurable compung FPGAs • Real me support • System soware and applicaon development environment

**Proposers were encouraged to offer alternaves ** 9/16/14 17 NGSP / HPSC Program Selected Vendors

Objecve (HPSC Innovaon Phase) • Reduce technical risk via the development of mulple hardware architecture designs for a rad-hard general-purpose mulcore flight compung system • Evaluate those designs against a set of benchmarks representave of the future NASA applicaons

Selected Vendors • Vendor Kick-Off meeng held at Aerospace Corp, Nov 6-7, 2013 • Briefings provided by the three selected vendors • BAE Systems (Manassas, VA) • Boeing (Seale, WA) • Honeywell (Clearwater, FL) • Aended on government side by • NASA (GCDP, JPL, GSFC) • USAF (AFRL, SMC, Aerospace) Kick-Off Meeng Summary: Shown: MAESTRO, Rad-hard by • Official start to effort to finalize USAF use cases / requirements design (RHBD) 7X7 led mulcore • To be reintegrated with NASA requirements by March 2014 chip (proof-of-concept for space- • NASA benchmark acvity, with technical and programmac objecves, qualified mulcore compung) described to vendors 18 • Early insight on architectural approaches being taken by vendors NGSP / HPSC Program Use of Benchmarks

• Without benchmarking, there is considerable risk that a computer will not live up to its expected performance • Prospecve users typically assess performance based on processor , word width, and instrucon set • However, design features can introduce bolenecks that can limit performance for real-world applicaons to a fracon of the expected performance • Oen these bolenecks are difficult to idenfy via design reviews • Benchmarks provide the most reliable means to assess real-world performance • Case in point – RAD750 • Original processor had serious performance limitaons due to memory performance • Modificaons were necessary in the implementaon of the L2 cache and DMA • The risk may be greater • For mul-core processors, because applicaon performance can be sensive to architecture (i.e., topology, interconnect methodology, memory implementaon), it is essenal that the opmal architecture be established up front for the broad array of space-based applicaons

19 NGSP / HPSC – Development Phase Alignment and Infusion Strategy 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022

Board-level Flight Compung HPSC (NASA) $1M $13M Product Multi-core Processor Board Non-Flt Ethernet UART If dual port DDR/DDR2 is available, should consider I/F connecting the Actel FPGA to the DDR/DDR2 for optimal I/F data transfer to/from the host bus 10/100/1G RS232 DDDDRR/D/DDDRR22 ( s(sizizee??)) Companion Board PHY XCVR DDRD/RD/DRD2R 24 G(sbizyete?s)

s • e Rad-hard general-purpose mulcore chip c a f

r Core Ref Clk e

s PLL t t n I

n

c DDR Ref Clk i c e

f PLL i i f i c OSC c m e

c XAUI XAUI e i XAUI XAUI XAUI Ref Clk p u p g PLL S r Maestro o

S t

L n p s

o Processor GBE Ref Clk i p t Cfg PLL n A a I c

i I/F l

p Clock Generation Logic p XAUI • A Virtex5 Real-me operang system Innovaon Phase Development Phase CPU_RST Housekeeping TLM Ckt

POR CLK_ENA XAUI TLM I/F WD_TO • SDRAM Flight soware development 5.0V ASIC 128M x 48bits 3.3V 2.5V V5 Cfg WD Mem L1 Boot Ctlr TImer Ctlr Code (x3) 1.8V Virtex5 Cfg 1.2V Host Bus Bridge File 1.0V (With DMA capability) NGSP (AFRL) Power Dist Unit NOR Flash $1M $9M 32M x 48 bits environment Host Bus • Middleware for core management, opmized for NASA applicaons

Testbed Evaluaon A promising area of discussion is a technology AAE Testbed Mars 2020 payload concept for Mars 2020 to provide HPSC Technology The Advanced Exploraon Systems (AES) Avionic Architectures for capability for landed funcons, which would be phased in gradually over the course of the surface Payload Exploraon (AAE) project is considering the use of a SW simulaon during 2014 and an FPGA implementaon in late 2015. The AAE mission. A spare board slot on the Rover Compute testbed would also evaluate an HPSC implementaon as it becomes Element (RCE) is available but needs to be available. reserved soon.

Fast Surface Mission Traverse Science Infusion The HPSC team is engaged with NASA SMD and HEOMD mission customers on the use cases documented in the HPSC Formulaon Phase Study Report. Via the partnership with AFRL, a similar engagement is occurring with SMC for USAF customers. In addion, HPSC team members are engaging with other potenal naonal-level customers and programmac partners (e.g., NRL/USN). Human- 20 AFRL High Data Rate Roboc Qualificaon Instruments Teaming NGSP / HPSC Program Product Descripon

• Rad-Hard, General-Purpose, Mulcore Processor – Enabling new flight capabilies and missions for the next 15+ years • 100X computaonal performance / efficiency vs. RAD750 – MIPS to 10s of GOPS/GFLOPS at <<1Wa to 7Was • Extensible by cascading, ling, paralleling – For performance and/or fault tolerance • Dynamically configurable over power/performance/fault tolerance space • Applicable from Cubesats/Smallsats to flagship missions to 3000 crewed missions BEST Existence Proof – (44 GOPS) – Available at chip level as well as board level 2500 2000 • System Soware, Runme and Development Environments – OS, Hypervisor, Debugger, C/C++ 1500 SOA 1000 • Middleware Layer – Dynamic System Configuraon 500 SOP

– Resource Allocator, Power Manager, Fault Tolerance Manager Power Efficiency (MOPS/W) RAD750 (0.2 GOPS) WORST 0 0 5 10 15 20 25 • Evaluaon board and reference design kit Power (W)

Performance (GOPS) 21 Summary

• Future NASA mission scenarios call out for significantly improved flight compung capability • Improved flight compung means enhanced computaonal performance, energy efficiency, and fault tolerance • AFRL independently idenfies the need for improved flight compung • The NGSP program was iniated to address this joint need • NASA and AFRL have defined a set of objecves to guide the definion of processor architectures • Benchmarks have been developed to evaluate those architectures against candidate use cases • The study results will guide a Development Phase to follow

It is me to move beyond the 1990’s technology of the RAD750 and 9/16/14 Redefine the role of compung in space systems 22 Acknowledgements

• This research was carried out in part at the Jet Propulsion Laboratory, California Instute of Technology, under a contract with the Naonal Aeronaucs and Space Administraon (NASA), funded by the Game Changing Development Program of the NASA Space Technology Mission Directorate.

9/16/14 23