Fast Virtual Prototyping for Embedded Computing Systems Design and Exploration Amir Charif, Gabriel Busnot, Rania Mameesh, Tanguy Sassolas, Nicolas Ventroux

Fast Virtual Prototyping for Embedded Computing Systems Design and Exploration Amir Charif, Gabriel Busnot, Rania Mameesh, Tanguy Sassolas, Nicolas Ventroux To cite this version: Amir Charif, Gabriel Busnot, Rania Mameesh, Tanguy Sassolas, Nicolas Ventroux. Fast Virtual Pro- totyping for Embedded Computing Systems Design and Exploration. RAPIDO2019 - 11th Workshop on Rapid Simulation and Performance Evaluation: Methods and Tools, Jan 2019, Valence, Spain. pp.1-8, 10.1145/3300189.3300192. hal-02023805 HAL Id: hal-02023805 https://hal.archives-ouvertes.fr/hal-02023805 Submitted on 18 Feb 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Fast Virtual Prototyping for Embedded Computing Systems Design and Exploration Amir Charif, Gabriel Busnot, Rania Mameesh, Tanguy Sassolas and Nicolas Ventroux Computing and Design Environment Laboratory CEA, LIST Gif-sur-Yvette CEDEX, France [email protected] ABSTRACT 2.0 standard [1], which, in addition to offering interoper- Virtual Prototyping has been widely adopted as a cost- ability and reusability of SystemC models, provides several effective solution for early hardware and software co-validation. abstraction levels to cope with varying needs in accuracy and However, as systems grow in complexity and scale, both the speed. More recently, in response to an increasing demand time required to get to a correct virtual prototype, and for simulation speed, the use Dynamic Binary Translation the time required to run real software on it can quickly be- (DBT) for CPU modelling has gained in relevance [17], [20], come unmanageable. This paper introduces a feature-rich [10], and has effectively set a new standard for simulation integrated virtual prototyping solution, designed to meet performance in early prototypes. industrial needs not only in terms of performance, but also in However, as systems grow in complexity and scale, it is terms of ease, rapidity and automation of modelling and ex- now more vital than ever that virtual prototyping solutions ploration. It introduces novel methods to leverage the QEMU be able to wisely exploit these technologies to offer proper dynamic binary translator and the abstraction levels offered balance between simulation representativeness and execution by SystemC/TLM 2.0 to provide the best possible trade-offs speed throughout the design process. Moreover, at this level between accuracy and performance at all steps of the design. of complexity, the time required to model and explore new The solution also ships with a dynamic platform composition architectures can quickly become unmanageable, especially infrastructure that makes it possible to model and explore a at the earliest stages, where it is often necessary to make myriad of architectures using a compact high-level descrip- heavy alterations before reaching a stable prototype. tion. Results obtained simulating a RISC-V SMP architecture To cope with these new difficulties, virtual prototyping running the PARSEC benchmark suite reveal that simulation solutions need to meet new requirements not only in terms speed can range from 30 MIPS in accurate simulation mode of performance, but also in terms of ease, rapidity and au- to 220 MIPS in fast functional validation mode. tomation of system modelling and design space exploration. This paper introduces VPSim, a feature-rich integrated virtual prototyping solution that addresses the aforementioned 1 INTRODUCTION challenges based on three major contributions: To keep up with a fast-evolving and highly competitive embedded system industry, designers are compelled to deliver ∙ A new way of integrating the QEMU emulator, making complete working solutions under tight delay and budget its rich CPU and peripheral model portfolio available constraints. To make this possible, hardware architectures, as a collection of SystemC modules. Our method can as well as the full software stacks that drive them should be leverage the technologies used in QEMU, such as Dy- validated and optimized as early as possible in the design namic Binary Translation (DBT) and paravirtualiza- process. In this context, Virtual Prototyping has been widely tion to bring outstanding simulation speeds into the adopted as a cost-effective solution for early hardware and more deterministic and standardized SystemC domain. software co-validation. ∙ An Accuracy Control framework, that can be used to The rapid adoption of virtual prototyping solutions was define regions of interest in both the software and the greatly facilitated by the emergence of the SystemC/TLM simulated hardware dynamically. A system designer can therefore enable accurate simulation only on parts Permission to make digital or hard copies of all or part of this work of the design and portions of code that are of inter- for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage est, guaranteeing the best possible trade-off between and that copies bear this notice and the full citation on the first page. performance and accuracy given specific needs. Copyrights for components of this work owned by others than the au- ∙ A generic and powerful infrastructure for dynamic thor(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, platform composition and design space exploration requires prior specific permission and/or a fee. Request permissions (DSE). It allows a single compiled executable to be used from [email protected]. to model an infinity of architectures. Combined with RAPIDO '19, January 21{23, 2019, Valencia, Spain © 2019 Copyright held by the owner/author(s). Publication rights a Python scripting front-end, it makes it possible to licensed to the Association for Computing Machinery. simulate, explore and optimize highly complex systems ACM ISBN 978-x-xxxx-xxxx-x/YY/MM. $15.00 in a single compact script. https://doi.org/10.1145/3300189.3300192 RAPIDO '19, January 21{23, 2019, Valencia, Spain A.Charif, G.Busnot, R.Mameesh, T.Sassolas, N.Ventroux The remainder of this paper is organized as follows: In Sec- or on a periodic basis, the simulation speed varies from tion 2, we review some related tools and methods. Section 3 to 60 MIPS. However, the cumbersome annotation is to 3 provides a complete high-level view of the features and be performed for every ISA and possible accuracy settings capabilities of the VPSim tool. In Section 4, details on the are limited to a few predefined configurations. GreenSocs underlying implementation challenges are presented. The [10, 12, 13] propose a QEMU-based framework that exploits performance of our tool is evaluated in Section 5, before QEMU MMIO callback mechanisms to access external Sys- concluding in Section 6. temC peripherals. This allows for very fast simulation for applications with few IO communications, as the execution 2 RELATED WORKS mostly takes place in the context of QEMU. Unfortunately, this solution runs QEMU and SystemC in separate kernel The first generation of Virtual Prorotyping (VP) solutions threads, requiring frequent synchronization and precluding used functional Instruction Set Simulators (ISSs) with more determinism. or less internal low-level details such as pipeline stages. The solution we propose integrates QEMU by execut- Among such solutions, GEM5 [7, 16] is a discrete-event ing its CPU and peripheral models in the context of Sys- simulator able to dynamically switch between different ab- temC threads, thereby preserving the predictability of Single- straction levels. Detailed CPU models with full pipelining Threaded SystemC simulation. The accuracy of memory description can be simulated at 0.1 Million Instructions Per accesses, including instruction fetches, can be configured at Second (MIPS), whereas instruction-accurate CPU models a very fine granularity, and may change dynamically during can reach 1 MIPS. Other MPSoC modeling environments, simulation. such as SESAM [24] or Unisim [3] used instruction-based ISS generation librairies to support the modeling of various CPUs, reaching approximately 10 MIPS. However, this accuracy and 3 A USER-LEVEL VIEW OF VPSIM ISA flexibility come at the cost of limited simulation speed, VPSim is a key asset in charge of virtual prototyping and DSE which hampers their capacity to address complex systems within SESAM, an integrated EDA framework for complex embedding several CPUs and running full-fledged OSes. electronic systems ranging from Cyber-Physical Systems to To address this complexity, a recent trend is the use of Microservers. SESAM provides a holistic environment to ad- Dynamic Binary Translation (DBT). DBT consists in dy- dress HW/SW co-design, exploration and validation through namically translating instructions of the modeled ISA (guest Virtual Prototyping, HW prototyping and emulation while instructions) to host ones (usually x86), whenever needed taking into account power, temperature and reliability factors. during guest code execution, yielding

Fast Virtual Prototyping for Embedded Computing Systems Design and Exploration Amir Charif, Gabriel Busnot, Rania Mameesh, Tanguy Sassolas, Nicolas Ventroux

Imperas Tools Overview

Ibex Documentation Release 0.1.Dev50+G873e228.D20211001

OVP Debugging Applications with Eclipse User Guide

Virtualized Hardware Environments for Supporting Digital I&C Verification

OVP Guide to Using Processor Models

Imperas Platform User Guide

QEMU As a Platform for PLC Virtualization an Analysis from a Cyber Security Perspective

OVP Guide to Using Processor Models

An OVP Simulator for the Evaluation of Cluster

Imperas Installation and Getting Started Guide

Imperas Platform User Guide

OVP Debugging Applications with GDB User Guide