Reverse Engineering Power Management on NVIDIA Gpus - Anatomy of an Autonomic-Ready System Martin Peres
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
High Speed Visualization in the Jetos Aviation Operating System Using Hardware Acceleration*
High Speed Visualization in the JetOS Aviation Operating System Using Hardware Acceleration* Boris Barladian[0000-0002-2391-2067], Nikolay Deryabin[0000-0003-1248-6047], Alexey Voloboy[0000-0003-1252-8294], Vladimir Galaktionov[0000-0001-6460-7539], and Lev Shapiro[0000-0002-6350-851X] The Keldysh Institute of the Applied Mathematics of RAS, Moscow, Russia [email protected],{voloboy, vlgal, pls}@gin.keldysh.ru Abstract. The paper discusses details of the pilot display visualization that uses the hardware acceleration capabilities of the Vivante graphics processor in the JetOS aviation operating system. Previously the OpenGL Safety Critical library was implemented without hardware acceleration. This was done in such a way because software library is easier to certify in accordance with the avionics re- quirements. But usage of the software OpenGL does not provide acceptable visualization speed for modern Flight Display and 3D relief applications. So more complex visualization approach utilized the GPU acceleration capabilities was elaborated. Although the OpenGL library was implemented for a specific GPU and took into account its specificity, the described approach to adapt the MESA open source library can be used for other GPUs. An effective algorithm for multi-window visualization using the implemented library with hardware acceleration is present. The described approach allows you to achieve the visu- alization speed acceptable for the pilot display of the aircraft. Keywords: Pilot Display, Embedded Systems, Real-time Operating System, OpenGL Safety Critical, Multi-windowing. 1 Introduction In [1] requirements were formulated for a real-time operating system (RTOS) de- signed to work with integrated modular avionics. In particular, the RTOS should comply with the ARINC 653 standard [2]. -
The Linux Graphics Stack Attributions
I - Hardware : Anatomy of a GPU II - Host : The Linux graphics stack Attributions Introduction to GPUs and to the Linux Graphics Stack Martin Peres CC By-SA 3.0 Nouveau developer Ph.D. student at LaBRI November 26, 2012 1 / 36 I - Hardware : Anatomy of a GPU II - Host : The Linux graphics stack Attributions General overview Outline 1 I - Hardware : Anatomy of a GPU General overview Driving screens Host < − > GPU communication 2 II - Host : The Linux graphics stack General overview DRM and libdrm Mesa X11 Wayland X11 vs Wayland 3 Attributions Attributions 2 / 36 I - Hardware : Anatomy of a GPU II - Host : The Linux graphics stack Attributions General overview General overview of a modern GPU's functions Display content on a screen Accelerate 2D operations Accelerate 3D operations Decode videos Accelerate scientific calculations 3 / 36 I - Hardware : Anatomy of a GPU II - Host : The Linux graphics stack Attributions General overview CPU Clock Front-side Graphics Generator bus card slot Chipset Memory Slots High-speed graphics bus (AGP or PCI Northbridge Memory Express) bus (memory controller hub) Internal Bus PCI Bus Onboard Southbridge graphics PCI (I/O controller controller Bus hub) IDE SATA USB Cables and Ethernet ports leading Audio Codec CMOS Memory off-board PCI Slots LPC Bus Super I/O Serial Port Parallel Port Flash ROM Floppy Disk Keyboard (BIOS) Mouse 4 / 36 I - Hardware : Anatomy of a GPU II - Host : The Linux graphics stack Attributions General overview Hardware architecture GPU: Where all the calculations are made VRAM: Stores -
GPU4S: Embedded Gpus in Space
© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes,creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. “The final publication is available at: DOI: 10.1109/DSD.2019.00064 GPU4S: Embedded GPUs in Space Leonidas Kosmidis∗,Jer´ omeˆ Lachaizey, Jaume Abella∗ Olivier Notebaerty, Francisco J. Cazorla∗;z, David Steenarix ∗Barcelona Supercomputing Center (BSC), Spain yAirbus Defence and Space, France zSpanish National Research Council (IIIA-CSIC), Spain xEuropean Space Agency, The Netherlands Abstract—Following the same trend of automotive and avion- in space [1][2]. Those studies concluded that although their ics, the space domain is witnessing an increase in the on-board energy efficiency is high, their power consumption is an order computing performance demands. This raise in performance of magnitude higher than the limited power budget of a space needs comes from both control and payload parts of the space- craft and calls for advanced electronics able to provide high system, which is limited to a couple of Watts. computational power under the constraints of the harsh space Interestingly, GPUs entered in the embedded domain to environment. On the non-technical side, for strategic reasons it is satisfy the increasing demand for multimedia-based hand- mandatory to get European independence on the used computing held and consumer devices such as smartphones, in-vehicle technology. In this project, which is still in its early phases, we entertainment systems, televisions, set-top boxes etc. -
Mechdyne-TGX-2.1-Installation-Guide
TGX Install Guide Version 2.1.3 Mechdyne Corporation March 2021 TGX INSTALL GUIDE VERSION 2.1.3 Copyright© 2021 Mechdyne Corporation All Rights Reserved. Purchasers of TGX licenses are given limited permission to reproduce this manual, provided the copies are for their use only and are not sold or distributed to third parties. All such copies must contain the title page and this notice page in their entirety. The TGX software program and accompanying documentation described herein are sold under license agreement. Their use, duplication, and disclosure are subject to the restrictions stated in the license agreement. Consistent with FAR 12.211 and 12.212, Commercial Computer Software, Computer Software Documentation, and Technical Data for Commercial Items are licensed to the U.S. Government under vendor's standard commercial license. This publication is provided “as is” without warranty of any kind, either express or implied, including, but not limited to, the implied warranties of merchantability, fitness for a particular purpose, or non- infringement. Any Mechdyne Corporation publication may include inaccuracies or typographical errors. Changes are periodically made to these publications, and changes may be incorporated in new editions. Mechdyne may improve or change its products described in any publication at any time without notice. Mechdyne assumes no responsibility for and disclaims all liability for any errors or omissions in this publication. Some jurisdictions do not allow the exclusion of implied warranties, so the above exclusion may not apply. TGX is a trademark of Mechdyne Corporation. Windows® is registered trademarks of Microsoft Corporation. Linux® is registered trademark of Linus Torvalds. -
Thermal Covert Channels Leveraging Package-On-Package DRAM
Thermal Covert Channels Leveraging Package-On-Package DRAM Shuai Chen∗, Wenjie Xiongy, Yehan Xu∗, Bing Li∗ and Jakub Szefery ∗Southeast University, Nanjing, China fchenshuai ic,220174472, bernie [email protected] yYale University, New Haven, CT, USA fwenjie.xiong, [email protected] Abstract—Package-on-Package (PoP) is an intergraded circuit observation of execution time [2], [3], or on observation packaging technique where multiple separate packages are of physical emanation, such as heat [4] or electromagnetic mounted vertically one on top of the other, allowing for more (EM) field [5], for example. This work focuses on thermal compact system design and reduction in the distance between channels, and shows how the heat, or temperature, can modules. However, this can introduce security vulnerabilities. be measured without special measurement equipment or In particular, this work shows that due to the close physical physical access in commodity SoC devices. Especially, this proximity of a System-on-a-Chip (SoC) package and a PoP work focuses on an SoC DRAM Package-on-Package (PoP) DRAM that is on top of it, a thermal covert channel exists configuration [6] where two chips (SoC and DRAM) are between the SoC and the PoP DRAM. The thermal covert stacked on top of each other. The heat transfers between the channel can allow multiple cores in the SoC to communicate two can be observed by measuring decay rate of DRAM by modulating the temperature of the PoP DRAM. Especially, cells, which is the basis for this work. it is possible for one core in the SoC to generate heat patterns, Previously, heat-based or thermal covert channels have which encode data that is to be transmitted, and another been demonstrated in data centers [7], or in multicore core to observe the transmitted pattern by measuring the processors [8], but these require dedicated thermal sensors decay rate of DRAM cells. -
Linux, Yocto and Fpgas
Embedded Open Source Experts Linux, Yocto and FPGAs Integrating Linux and Yocto builds into different SoCs From a Linux software perspective: ➤ Increased demand for Linux on FPGAs ➤ Many things to mange, both technical and practical ➤ FPGAs with integrated CPU cores – very similar many other SoCs Here are some experiences and observations... © Codiax 2019 ● Page 2 Why use Linux? ➤ De-facto standard ➤ Huge HW support ➤ FOSS ➤ Flexible ➤ Adaptable ➤ Stable ➤ Scalable ➤ Royalty free ➤ Vendor independent ➤ Large community ➤ Long lifetime Why not Linux? ➤ Too big ➤ Real-time requirements ➤ Certification ➤ Boot time ➤ Licensing ➤ Too open? Desktop Shells: Desktop Display server: Display BrailleDisplay Touch-Screen Mouse & Keyboard Wayland Compositor Wayland + development tools = a lot code!of source Linux system example weston, clayton,mutter,KWin evdev libinput GNOME Shell D radeon nouveau lima etna_viv freedreno tegra-re lima nouveau radeon freedreno etna_viv e libwayland-server libwayland-server s Cinnamon k t o kms p Linux kernel, Linux kernel, Plasma 2 w i (Kernel Mode Setting) Mode (Kernel d g Cairo-Dock e t s drm (Direct Rendering Manager) Rendering (Direct drm cache coherent L2-Caches L2-Caches cache coherent CPU &GPU Enlight. DR19 System libraries: System oflibraries): form (in the Toolkits Interface User µClibc Pango glibc glibc main memory possibly adaptations to Wayland/Mir libwayland / COGL libwayland Cairo Cairo (Xr) GTK+ Clutter 2D Application 2D GModule GThread GThread GLib GObject Glib GIO ATK devicedrivers other& modules System -
Valorisation Du Logiciel Open Source Via HAL Et Software Heritage
Valorisation du logiciel open source via HAL et Software Heritage Morane Gruenpeter Estelle Nivault Jozefina Sadowska Software Heritage, Inria Inria Inria Journées CasuHAL 07-11 juin 2021 Agenda ★ Contexte ★ Qu’est-ce qu’un logiciel ? ★ Pourquoi partager les codes sources de recherche ? ★ Comment déposer le logiciel dans Hal ? ★ Comment modérer ? ★ Comment citer ? Speaker’s name Journées CasuHAL 07-11 juin 2021 Presentation – city – date 2/10 Contexte - un nouveau type de dépôt HAL Les dates clés Les acteurs ★ 2017 - Collaboration débute ★ mars 2018 - Phase de test sur HAL-Inria ★ septembre 2018 - Ouverture sur HAL ★ avril 2020 - BibLaTeX @software Prochainement: ★ été 2021 - Dépôt avec SWHID Speaker’sMorane name GruenpeterJournées -Journées Journée CasuHAL CasuHAL de la reproducibilité 07-11 juin juin 2021 2021 - 10-05-2021Presentation – city – date 3/10 Qu’est-ce qu’un logiciel ? https://www.reddit.com/r/ProgrammerHumor/comments/70fump/prog ramming_is_magic/ Speaker’sMorane name GruenpeterJournées -Journées Journée CasuHAL CasuHAL de la 07-11 reproducibilité 07-11 juin juin 2021 2021 - 10-05-2021Presentation – city – date 4/10 Le logiciel - un objet digital multiforme “Ensemble des programmes, procédés et règles, et éventuellement de la documentation, relatifs au fonctionnement d'un ensemble de traitement de données.” Le Larousse (date d'accès: 21.5.2021) Le concept logiciel L’objet logiciel ● projet ou entité ● chaque version du code source ● la communauté autour du projet (fichiers textes) ● l’idée / algorithmes / solutions ● les -
Applications and Implementations
Applications and Implementations Hwanyong LEE CTO and Technical Marketing Director HUONE © Copyright Khronos Group, 2010 - Page 1 Khronos Family of Standards 3D Digital Asset Plugin-free Mobile OS accessibility Exchange format 3D Web Content Abstraction Authoring and Authoring A coordinated ecosystem of compute, graphics and media Cross platform Parallel Context and Surface Embedded 3D desktop 3D Computing Management standards and APIs Application Acceleration Acceleration Safety Critical 3D Steaming Media Advanced Audio Vector 2D Video, Audio and Window System System System Codec Creation Image Acceleration Acceleration Integration Integration Hundreds of man years invested by industry experts to create a coordinated visual computing ecosystem for accelerated parallel computation, 3D, video, audio and image processing on desktop, embedded and mobile systems © Copyright Khronos Group, 2010 - Page 2 OpenVG • Royalty-free open standard API • Low-level 2D vector graphics rendering API • OpenGL-style programming model • Advanced feature set enables - SVG, - Flash, - PDF, Postscript, Applications and UI - Java (JSR 287, 271, 226) SVG, Vector and - etc. Font Packages etc.. • Portable content • Map Applications • Hardware Acceleration Hardware Acceleration © Copyright Khronos Group, 2010 - Page 3 OpenVG Rendering Pipeline © Copyright Khronos Group, 2010 - Page 4 OpenVG with Native Graphics Processor • CPU sending data and commands to OpenVG hardware • OpenVG rendering pipeline is in the hardware CPU Native Vector Graphics Hardware © Copyright -
Tizen* IVI 3.0 M2 Kernel 3.10 LTSI Linux* Support Package for the Intel® Atom™ Processor E3800 Series
Tizen* IVI 3.0 M2 Kernel 3.10 LTSI Linux* Support Package for the Intel® Atom™ Processor E3800 Series Getting Started Guide January 2015 Document Number: 546069 Introduction INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL'S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. A "Mission Critical Application" is any application in which failure of the Intel Product could result, directly or indirectly, in personal injury or death. SHOULD YOU PURCHASE OR USE INTEL'S PRODUCTS FOR ANY SUCH MISSION CRITICAL APPLICATION, YOU SHALL INDEMNIFY AND HOLD INTEL AND ITS SUBSIDIARIES, SUBCONTRACTORS AND AFFILIATES, AND THE DIRECTORS, OFFICERS, AND EMPLOYEES OF EACH, HARMLESS AGAINST ALL CLAIMS COSTS, DAMAGES, AND EXPENSES AND REASONABLE ATTORNEYS' FEES ARISING OUT OF, DIRECTLY OR INDIRECTLY, ANY CLAIM OF PRODUCT LIABILITY, PERSONAL INJURY, OR DEATH ARISING IN ANY WAY OUT OF SUCH MISSION CRITICAL APPLICATION, WHETHER OR NOT INTEL OR ITS SUBCONTRACTOR WAS NEGLIGENT IN THE DESIGN, MANUFACTURE, OR WARNING OF THE INTEL PRODUCT OR ANY OF ITS PARTS. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked "reserved" or "undefined". -
Fun with the Linux Desktop
Fun with the Linux Desktop 3D-Desktop with XGL/AIGLX and compiz come2linux, Essen, 9. + 10.9.2006 Hanno Böck, http://www.hboeck.de/ Why we want fancy desktops? ● Can improve usability (virtual desktop -> side of a cube) ● A nicer system »feels« more usable ● People like MacOS X, there must be a reason Linux-Desktop in the past ● Xfree86 - very slow develop-ment, rarely new features ● »Nice« stuff only with strange workarounds (kompose, 3ddesk) ● E17 - the default desktop for GNU Hurd? Xorg ● Spring 2004 - license change at xfree86 ● Xorg-project forks last version ● Lot's of new development: composite, modularization, xgl, aiglx Composite Extension ● Renders window-content offscreen ● xcompmgr lets you have shadows and transparent windows ● Not very spectacular, but base for future stuff First look on XGL and friends ● Summer 2005, first impressions of XGL, rotating cube ● Around the same time, Luminocity presents wobbly windows XGL ● David Reveman (Novell) ● Xserver based on OpenGL ● (currently) Xserver on another Xserver ● All cards with DRI EGL/XEGL ● Problem: XGL needs already running Xserver ● EGL: direct interface to GL, no Xserver needed ● XEGL: XGL running on EGL ● Not working yet Compiz ● Composite and Windowmanager ● Very modular ● Lot's of plugins - rotating cube, wobbly windows AIGLX ● Kristian Høgsberg, Fedora ● Extension to already existing Xserver ● Same functionality as XGL ● All cards with free drivers We need better drivers ● Fast ● Stable ● Full-Featured ● Free! Binary drivers suck ● Security, Bugs ● Functionality (Composite, -
Lico 5.2.1 Installation Guide (For SLES) Second Edition (January 2019)
LiCO 5.2.1 Installation Guide (for SLES) Second Edition (January 2019) © Copyright Lenovo 2018, 2019. LIMITED AND RESTRICTED RIGHTS NOTICE: If data or software is delivered pursuant to a General Services Administration (GSA) contract, use, reproduction, or disclosure is subject to restrictions set forth in Contract No. GS-35F- 05925. Reading instructions • To ensure that you get correct command lines using the copy/paste function, open this Guide with Adobe Acrobat Reader, a free PDF viewer. You can download it from the official Web site https://get.adobe.com/ reader/. • Replace values in angle brackets with the actual values. For example, when you see <*_USERNAME> and <*_PASSWORD>, enter your actual username and password. • Between the command lines and in the configuration files, ignore all annotations starting with #. © Copyright Lenovo 2018, 2019 ii iii LiCO 5.2.1 Installation Guide (for SLES) Contents Reading instructions. ii Check the OS installation . 23 Check NFS . 23 Chapter 1. Overview. 1 Check Slurm . 24 Introduction to LiCO . 1 Check Ganglia . 24 Typical cluster deployment . 1 Check MPI . 25 Operating environment . 2 Check Singularity . 25 Supported servers and chassis models . 3 Check OpenHPC installation . 25 Prerequisites . 3 List of LiCO dependencies to be installed. 25 Install RabbitMQ . 26 Chapter 2. Deploy the cluster Install PostgreSQL. 26 environment . 5 Install InfluxDB . 26 Install an OS on the management node . 5 Install Confluent. 27 Deploy the OS on other nodes in the cluster. 5 Configure user authentication . 28 Configure environment variables . 5 Install OpenLDAP-server . 28 Create a local repository . 8 Install libuser . 29 Install Lenovo xCAT . -
GPU Scheduling for Real-Time Multi-Tasking Environments
TimeGraph: GPU Scheduling for Real-Time Multi-Tasking Environments Shinpei Kato† ‡, Karthik Lakshmanan†, and Ragunathan (Raj) Rajkumar†, Yutaka Ishikawa‡ † Department of Electrical and Computer Engineering, Carnegie Mellon University ‡ Department of Computer Science, The University of Tokyo Abstract transition. Especially recent trends on 3-D browser and desktop applications, such as SpaceTime, Web3D, 3D- The Graphics Processing Unit (GPU) is now commonly Desktop, Compiz Fusion, BumpTop, Cooliris, and Win- used for graphics and data-parallel computing. As more dows Aero, are all intriguing possibilities for future user and more applications tend to accelerate on the GPU in interfaces. GPUs are also leveraged in various domains multi-tasking environments where multiple tasks access of general-purpose GPU (GPGPU) processing to facili- the GPU concurrently, operating systems must provide tate data-parallel compute-intensive applications. prioritization and isolation capabilities in GPU resource Real-time multi-tasking support is a key requirement management, particularly in real-time setups. for such emerging GPU applications. For example, users We present TimeGraph, a real-time GPU scheduler could launch multiple GPU applications concurrently in at the device-driver level for protecting important GPU their desktop computers, including games, video play- workloads from performance interference. TimeGraph ers, web browsers, and live messengers, sharing the same adopts a new event-driven model that synchronizes the GPU. In such a case, quality-aware soft real-time ap- GPU with the CPU to monitor GPU commands issued plications like games and video players should be pri- from the user space and control GPU resource usage in oritized over live messengers and any other applications a responsive manner.