NIC-Based Offload of Dynamic User-Defined Modules for Myrinet

Total Page:16

File Type:pdf, Size:1020Kb

NIC-Based Offload of Dynamic User-Defined Modules for Myrinet NIC-Based Offload of Dynamic User-Defined Modules for Myrinet Clusters Adam Wagner, Hyun-Wook Jin and Dhabaleswar K. Panda Rolf Riesen Network-Based Computing Laboratory Scalable Computing Systems Dept. Dept. of Computer Science and Engineering Sandia National Laboratories The Ohio State University [email protected] ¡ wagnera, jinhy, panda ¢ @cse.ohio-state.edu Abstract The common approach to NIC-based offload is to hard- code an optimization into the control program which runs Many of the modern networks used to interconnect nodes on the NIC in order to achieve the highest possible perfor- in cluster-based computing systems provide network inter- mance gain. While such approaches have proved successful face cards (NICs) that offer programmable processors. Sub- in improving performance, they suffer from several draw- stantial research has been done with the focus of offloading backs. First, NIC-based coding is quite complex and error processing from the host to the NIC processor. However, the prone due to the specialized nature of the NIC firmware research has primarily focused on the static offload of spe- and the difficulty of validating and debugging code on the cific features to the NIC, mainly to support the optimization NIC. Because of the level of difficulty involved in making of common collective and synchronization-based communi- such changes and the potential consequences of erroneous cations. In this paper, we describe the design and implemen- code, these sorts of optimizations may only be performed by tation of a new framework based on MPICH-GM to sup- system experts. Second, hard-coding features into the NIC port the dynamic NIC-based offload of user-defined mod- firmware is inflexible. The resources available on the NIC ules for Myrinet clusters. We evaluate our implementation are typically an order of magnitude less than those on the on a 16-node cluster using a NIC-based version of the com- host. This means that only a limited number of features may mon broadcast operation and we find a maximum factor of be compiled into the firmware at a given time. Furthermore, improvement of 1.2 with respect to total latency as well as frequent changes may be impractical on production systems a maximum factor of improvement of 2.2 with respect to av- which demand high levels of stability, availability and secu- erage CPU utilization under conditions of process skew. In rity. addition, we see that these improvements increase with sys- tem size, indicating that our NIC-based framework offers enhanced scalability when compared to a purely host-based approach. Message Passing Library Message Passing Library Feature 1 ... Feature n Add Remove Delegate 1. Introduction Communication Library Communication Library Feature 1 ... Feature n Add Remove Delegate Many of the interconnection networks used in cur- rent cluster-based computing systems include network in- NIC Firmware NIC Firmware terface cards (NICs) with programmable processors. Much Feature 1 ... Feature n Add Remove Execute research has been done toward utilizing these CPUs to pro- vide various benefits by offloading processing from the (a) Static Of¯oad (b) Dynamic Of¯oad host. These works have mainly focused on customiza- Figure 1. Static, hard-coded, ad-hoc offload tions to enhance the performance of specific operations of features to NIC vs. flexible framework for including collective communications [6, 11] like multi- dynamic offload of user modules to NIC. cast [2] and reduce [14] and synchronization operations such as barrier [4]. The potential benefits of NIC-based of- fload include lowered communication latency, reduced host CPU utilization, improved tolerance of process skew [3, 17] and better overlap of computation and communica- Figure 1 illustrates the difference between a hard-coded, tion. static approach and a more flexible, dynamic approach to NIC-based offload. We can see that in the static approach, we are limited to a fixed number of features, while in the £ This research is supported in part by a grant from Sandia National Lab- dynamic approach features may be added and removed as oratory (a multiprogram laboratory operated by Sandia Corporation, a Lockheed Martin Company, for the United States Department of needed. When a feature is added it is propagated down Energy under contract DE-AC004-94AL85000.), Department of En- through the software layers to the NIC, where it is com- ergy's Grant #DE=FC02-01ER25506, and National Science Founda- piled and stored for later use. The upper layers may then tion's grants #EIA-9986052 and #CCR-0204429. delegate tasks down to the NIC for execution and incom- ing messages may be handled by the code on the NIC with- MPI [13] is a standard interface for message passing in out host involvement. When a feature is no longer needed, it parallel programs. MPICH [10] is the reference implemen- may be purged from the NIC to free up resources for other tation of MPI and has been ported to a variety of hardware uses. platforms including GM over Myrinet. The standard imple- This paper describes our design and implementation of mentation of MPICH over GM (MPICH-GM) does not in- a new framework to support the offload of user code to the clude support for NIC-based offload techniques. NIC in Myrinet [1] clusters. Our approach addresses many of the negative aspects associated with hard-coding features 3. Design Challenges into the NIC. We accomplish this by introducing a flexible framework which we refer to as NICVM (NIC-based Vir- tual Machine). This framework allows users to dynamically This section discusses the design challenges we encoun- add and remove code modules from the NIC. The code is tered while implementing our NICVM framework. The added by the user in source form and compiled into an in- specifics regarding our solutions to each issue will be ad- termediate format which is later interpreted by a special- dressed in detail in the next section. purpose virtual machine embedded in the NIC firmware. By interpreting the code we have the benefit of complete con- 3.1. Performance of User Code trol, and perhaps counter-intuitively we can still realize the performance benefits associated with offload. We have im- One of our main challenges was designing the frame- plemented the common broadcast operation on the NIC as work so that the user code could be efficiently executed. a user module and measured performance with respect to There are two different areas where performance of user both latency and host CPU utilization. When compared to code is critical. The first is the startup latency required to ac- a similar host-based implementation on 16 nodes, we ob- tivate a given user module on the NIC. This latency includes serve a maximum factor of improvement of 1.2 with re- the time to determine which module should be activated as spect to latency, and under conditions of process skew we well as the time to perform any sort of environmental setup observe a maximum factor of improvement of 2.2 with re- required for module execution. The second area where per- spect to CPU utilization. Furthermore, we find that in both formance is critical is the actual time required to execute a cases these performance benefits increase with system size. given module of user-code once it has been located and its The remainder of this paper is organized as follows. In execution environment has been initialized. If the startup la- the next section, we provide brief background information. tency is too high, then performance will be poor regardless In section 3 we discuss the design challenges we encoun- of the time taken to perform the actual work associated with tered while implementing our framework and in section 4 the module. Such startup latencies could easily outweigh the we detail our implementation. In section 5 we evaluate the positive effect of offload-related benefits like avoiding PCI performance of our implementation and in section 6 we dis- bus traffic. Of course, the complete time taken to execute cuss related work. Finally, in section 7 we present our con- the user code is important as well. The MCP is structured clusions and discuss future work. as a state machine with different states for sending, receiv- ing and performing DMAs to and from host memory. The transitions between states are highly tuned and adding any 2. Background extra delay to process user code can have a negative impact on overall performance. For example, if a user code mod- ule takes too long to execute it may cause temporary re- GM [15] is a user-level message-passing subsys- ceive queue buffers on the NIC to overflow, which will re- tem for Myrinet networks. Myrinet [1] is a low-latency, sult in packets being dropped and potentially even a reset of high-bandwidth interconnection network that employs pro- the associated communication port. grammable network interface cards (NICs), cut-through crossbar switches and operating-system-bypass techniques to achieve full-duplex 2 Gbps data rates. GM consists of a 3.2. Support for Multiple Reliable NIC-Based lightweight kernel-space driver, a user-space library and a Sends control program (MCP) which executes on the NIC proces- sor. The kernel-space code is only used for housekeeping Providing an infrastructure to allow user code to initi- purposes like allocating and registering memory. After tak- ate multiple reliable NIC-based sends proved to be another ing care of such initialization tasks, the user-space library challenge. It’s relatively straightforward to initiate a send can communicate directly with the NIC-based control pro- from the NIC, especially if reliability is not a requirement. gram, removing the operating system from the critical However, we imagined that a common scenario for user path.
Recommended publications
  • Embedded Systems Building and Programming Embedded Devices
    Embedded Systems Building and Programming Embedded Devices PDF generated using the open source mwlib toolkit. See http://code.pediapress.com/ for more information. PDF generated at: Tue, 29 May 2012 01:04:04 UTC Contents Articles Wikibooks:Collections Preface 1 Embedded Systems/Embedded Systems Introduction 3 Embedded Systems/Terminology 7 Microprocessor Basics 10 Embedded Systems/Microprocessor Introduction 10 Embedded Systems/Embedded System Basics 11 Embedded Systems/Microprocessor Architectures 13 Embedded Systems/Programmable Controllers 16 Embedded Systems/Floating Point Unit 18 Embedded Systems/Parity 20 Embedded Systems/Memory 21 Embedded Systems/Memory Units 24 Programming Embedded Systems 25 Embedded Systems/C Programming 25 Embedded Systems/Assembly Language 31 Embedded Systems/Mixed C and Assembly Programming 34 Embedded Systems/IO Programming 42 Embedded Systems/Serial and Parallel IO 43 Embedded Systems/Super Loop Architecture 44 Embedded Systems/Protected Mode and Real Mode 46 Embedded Systems/Bootloaders and Bootsectors 47 Embedded Systems/Terminate and Stay Resident 48 Real Time Operating Systems 49 Embedded Systems/Real-Time Operating Systems 49 Embedded Systems/Threading and Synchronization 51 Embedded Systems/Interrupts 54 Embedded Systems/RTOS Implementation 55 Embedded Systems/Locks and Critical Sections 57 Embedded Systems/Common RTOS 60 Embedded Systems/Common RTOS/Palm OS 63 Embedded Systems/Common RTOS/Windows CE 64 Embedded Systems/Common RTOS/DOS 64 Embedded Systems/Linux 65 Interfacing 68 Embedded Systems/Interfacing
    [Show full text]
  • Optimisation Des Performances Et De La Consommation De Puissance Électrique Pour Architecture Intel Ltanium/EPIC
    Université de Valenciennes et du Hainaut-Cambrésis Numéro d'ordre : 08/40 Optimisation des performances et de la consommation de puissance électrique pour architecture Intel ltanium/EPIC THÈSE Présentée et soutenue publiquement le : 25 novembre 2008 pour l'obtention du Doctorat de l'université de Valenciennes et du Hainaut-Cambrésis Spécialité Automatique et Informatique des Systèmes Industriels et Humains Discipline : Informatique par JamelTAYEB Rapporteurs : Dr. Albert COHEN, INRlA, École Polytechnique de Paris Pr. William JALBY, Université de Versailles-Saint-Quentin-en­ Yvelines Examinateurs : Pr. Jean-Luc DEKEYSER, Laboratoire d'Informatique Fondamentale à Lille Pr. Pierre MANNEBACK Faculté Polytechnique de Mons, Belgique Invité : M. Gwenolé BEAUCHESNE, Ingénieur, Mandriva S.A. Paris Directeur: Pr. Sm ail N IAR, LAMIH, Université de Valenciennes Laboratoire d'Automatique, de Mécanique et d'Informatique Industrielles et Humaines - UMR 8530 i Remerciements Merci tout d'abord à Smail Niar qui il y a tous juste quatre ans m 'a offert, en plus de son amitié, l'opportunité d'effectuer ma thèse au sein du laboratoire LAMll-1. Je le remercie pour cette chance inespérée, de la confiance et de son soutien de tous les moments, les plus heureux comme les plus difficiles. Son aide et ses conseils rn 'ont aidé à garder le cap. J'exprime ma profonde gratitude à Albert Cohen et William Jalby qui m'ont fait l'honneur d'être rapporteurs de cette thèse et pour avoir pris le temps de rédiger un rapport sur celle-ci. Je les remercie pour 1' intérêt sincère qu ' ils ont porté à mon travail. Je tiens à remercier ici les examinateurs : Jean-Luc Dekeyser et Pierre Manneback qui m'ont fait le plaisir d'examiner ce travail et de faire partie du Jury.
    [Show full text]
  • 4Th, the Friendly Forth Compiler
    4tH, the friendly Forth compiler J.L. Bezemer May 3, 2012 Contents 1 What's new 16 1.1 What's new in version 3.61.5 . 16 1.2 What's new in version 3.61.4 . 17 1.3 What's new in version 3.61.3 . 19 1.4 What's new in version 3.61.2 . 21 1.5 What's new in version 3.61.1 . 24 1.6 What's new in version 3.61.0 . 25 1.7 What's new in version 3.60.1 . 28 1.8 What's new in version 3.60.0 . 29 1.9 What's new in version 3.5d, release 3 . 31 1.10 What's new in version 3.5d, release 2 . 33 1.11 What's new in version 3.5d . 34 1.12 What's new in version 3.5c, release 3 . 35 1.13 What's new in version 3.5c, release 2 . 36 1.14 What's new in version 3.5c . 38 1.15 What's new in version 3.5b, release 2 . 39 1.16 What's new in version 3.5b . 40 1.17 What's new in version 3.5a, release 2 . 40 1.18 What's new in version 3.5a . 41 1.19 What's new in version 3.3d, release 2 . 46 1.20 What's new in version 3.3d . 47 1.21 What's new in version 3.3c . 49 1.22 What's new in version 3.3a . 51 1.23 What's new in version 3.2e .
    [Show full text]
  • Pipenightdreams Osgcal-Doc Mumudvb Mpg123-Alsa Tbb
    pipenightdreams osgcal-doc mumudvb mpg123-alsa tbb-examples libgammu4-dbg gcc-4.1-doc snort-rules-default davical cutmp3 libevolution5.0-cil aspell-am python-gobject-doc openoffice.org-l10n-mn libc6-xen xserver-xorg trophy-data t38modem pioneers-console libnb-platform10-java libgtkglext1-ruby libboost-wave1.39-dev drgenius bfbtester libchromexvmcpro1 isdnutils-xtools ubuntuone-client openoffice.org2-math openoffice.org-l10n-lt lsb-cxx-ia32 kdeartwork-emoticons-kde4 wmpuzzle trafshow python-plplot lx-gdb link-monitor-applet libscm-dev liblog-agent-logger-perl libccrtp-doc libclass-throwable-perl kde-i18n-csb jack-jconv hamradio-menus coinor-libvol-doc msx-emulator bitbake nabi language-pack-gnome-zh libpaperg popularity-contest xracer-tools xfont-nexus opendrim-lmp-baseserver libvorbisfile-ruby liblinebreak-doc libgfcui-2.0-0c2a-dbg libblacs-mpi-dev dict-freedict-spa-eng blender-ogrexml aspell-da x11-apps openoffice.org-l10n-lv openoffice.org-l10n-nl pnmtopng libodbcinstq1 libhsqldb-java-doc libmono-addins-gui0.2-cil sg3-utils linux-backports-modules-alsa-2.6.31-19-generic yorick-yeti-gsl python-pymssql plasma-widget-cpuload mcpp gpsim-lcd cl-csv libhtml-clean-perl asterisk-dbg apt-dater-dbg libgnome-mag1-dev language-pack-gnome-yo python-crypto svn-autoreleasedeb sugar-terminal-activity mii-diag maria-doc libplexus-component-api-java-doc libhugs-hgl-bundled libchipcard-libgwenhywfar47-plugins libghc6-random-dev freefem3d ezmlm cakephp-scripts aspell-ar ara-byte not+sparc openoffice.org-l10n-nn linux-backports-modules-karmic-generic-pae
    [Show full text]
  • A Framework for Developing Finite Element Codes for Multi- Disciplinary Applications
    A Framework for Developing Finite Element Codes for Multi- Disciplinary Applications Pooyan Dadvand Eugenio Oñate Monograph CIMNE Nº-109, January 2008 A Framework for Developing Finite Element Codes for Multi-Disciplinary Applications Pooyan Dadvand Eugenio Oñate Monograph CIMNE Nº-109, January 2008 Centro Internacional de Métodos Numéricos en Ingeniería Gran Capitán s/n, 08034 Barcelona, España CENTRO INTERNACIONAL DE MÉTODOS NUMÉRICOS EN INGENIERÍA Edificio C1, Campus Norte UPC Gran Capitán s/n 08034 Barcelona, Spain Primera edición: Januray 2008 A FRAMEWORK FOR DEVELOPING FINITE ELEMENT CODES FOR MULTI-DISCIPLINARY APPLICATIONS Monografía CIMNE M109 Los autores To Hengameh Abstract The world of computing simulation has experienced great progresses in recent years and requires more exigent multidisciplinary challenges to satisfy the new upcoming demands. Increasing the importance of solving multi-disciplinary problems makes developers put more attention to these problems and deal with difficulties involved in developing software in this area. Conventional finite element codes have several difficulties in dealing with multi-disciplinary problems. Many of these codes are designed and implemented for solving a certain type of problems, generally involving a single field. Extending these codes to deal with another field of analysis usually consists of several problems and large amounts of modifications and implementations. Some typical difficulties are: predefined set of degrees of freedom per node, data structure with fixed set of defined variables, global list of variables for all entities, domain based interfaces, IO restriction in reading new data and writing new results and algorithm definition inside the code. A common approach is to connect different solvers via a master program which implements the interaction algorithms and also transfers data from one solver to another.
    [Show full text]
  • PFORTH for MVS
    PFORTH for MVS Port of pforth to MVS 3.8j PFORTH A portable forth written in ANSI-C Public Domain http://www.softsynth.com/pforth/ MVS 3.8j MVS is “Multiple Virtual Storage” IBM operating system for the S/370 architecture MVS 3.8j is a public domain version of the ‘80 Can be run under Linux using Hercules Hercules Available at http://www.conmicro.cx/hercules/ Emulator for the IBM mainframe hardware (System/370, System/390 and z/Series) Runs under Linux and is released under the open source software license QPL. Dignus C/C++ A cross compiler targeting the 390 architecture Also available under an hobbist license. Objective A native pforth running on MVS under TSO and under JES2 A cross pforth running on Linux and able to generate full resident load modules (statically linked binary) Hacking guidelines Free-time project Minimize the work needed to have a version of pforth running on MVS (just minimalist a port) As much ANS as possible for an EBCDIC platform Start doing some forth on this platform and decide about the next steps. Make and install (1) The base dictionary is generated on Linux using a big-endian version of pforth This produce a .h containing the dictionary But this .h contains hard coded ASCII character Make and install (2) Compile with the Python wrapper Dcc.py (from a make –n) u.s.w. This produce a load module in XMIT format that can be binary transferred via 3270 and instantiated NPA100.TEST.BIN(PFORTH) NPA100.TEST.CLIST(PFORTH) Run EX 'NPA100.TEST.CLIST(PFORTH)‘ (from TSO) /* ALLOCATE NEEDED DD AND RUN /* PFORTH ALLOC F(STDOUT) DA(*) ALLOC F(STDERR) DA(*) ALLOC F(STDIN) DA(*) CALL 'NPA100.TEST.BIN(PFORTH)‚ FREE F(STDIN) FREE F(STDERR) FREE F(STDOUT) Status „Half working“ EBCDIC related problems: for ex.
    [Show full text]
  • Forth and the Neuron Chip
    ISS 0265-5195 March 2003 Issue 120 news people reviews projects programming Forth and the Neuron Chip FIGUK magazine: Sorting a List nnCron Across the Big Teich F11 UK Hardware Project 1 March 2003 Issue 120 events euroFORTH 2003 ................... 24 news Forth News ............................. 3 Feedback on Forth Code Index ................................................ 20 reviews nnCron ................................... 26 Across the Big Teich ............. 29 programming Sorting a List ......................... 11 people Presenting The FIG UK Awards of 2002 ................................................ 28 projects Forth and the Neuron Chip ... 5 FIG Hardware Project ............ 9 1 Editorial Welcome to new members Stephen Hayes, Anton Mans and Alan York. There's lots going on in Forth right now and FIG UK continues to play a substantial role. euroFORTH is back in the UK this year and this time FIG UK is getting involved – see details in this issue. As well as the usual free Forths, Forth News includes a healthy list of new Forth Resources in this issue such as tools for accessing the Internet. Look out for the first items in our new Forth Inside series – In the first one, we've managed to pull together some inside information on a well-hidden Forth story. There's quite a backlog of potential candidates for this series. Sorting A List is a tutorial article. The original was written using MS Word as an experiment to combine code, comment and graphics in the same document. Our A5 format is a bit tight but it works well at A4. Expect to see more about this in due course. Finally, congratulations to the winners of our FIG UK Awards 2002.
    [Show full text]
  • Vierte Dimension 2/2007 Das Forth–Magazin
    Vierte Dimension 2/2007 Das Forth–Magazin für Wissenschaft und Technik, für kommerzielle EDV, für MSR-Technik, für den interessierten Hobbyisten In dieser Ausgabe: Zentraler Grenzwertsatz Factor, Postscript, und Forth Named Bits Der Forth-Stammbaum Forth-Tagung 2007 Geburtstage T4 Embedded–Forth–Experiment 23. Jahrgang — Ausg. 2/2007 — 4,– e Organ der Forth–Gesellschaft e.V. Dienstleistungen und Produkte fördernder Mitglieder des Vereins tematik GmbH LEGO RCX–Verleih Technische Informatik Seit unserem Gewinn (VD 1/2001 S.30) verfügt unsere Feldstrasse 143 Schule über so ausreichend viele RCX–Komponenten, D–22880 Wedel dass ich meine privat eingebrachten Dinge nun An- Fon 04103 – 808989 – 0 deren, vorzugsweise Mitgliedern der Forth–Gesellschaft Fax 04103 – 808989 – 9 e. V., zur Verfügung stellen kann. ma li@t e mait k .d e Angeboten wird: Ein komplettes LEGO–RCX–Set, so www.tematik.de wie es für ca. 230,– e im Handel zu erwerben ist. Gegründet 1985 als Partnerinstitut der FH–Wedel be- Inhalt: schäftigten wir uns in den letzten Jahren vorwiegend 1 RCX, 1 Sendeturm, 2 Motoren, 4 Sensoren und ca. mit Industrieelektronik und Präzisionsmeßtechnik und 1.000 LEGO Steine. bauen z. Z. eine eigene Produktpalette auf. Anfragen bitte an Know–How Schwerpunkte liegen in den Berei- Martni.Bit ter @t- o.ln i n e d e chen Industriewaagen SWA & SWW, Differential– Dosierwaagen, DMS–Messverstärker, 68000 und Letztlich enthält das Ganze auch nicht mehr als einen 68HC11 Prozessoren, Sigma–Delta A/D. Wir program- Mikrocontroller der Familie H8/300 von Hitachi, ein mieren in Pascal, C und Forth auf SwiftX86k und seit paar Treiber und etwas Peripherie.
    [Show full text]
  • Debian-Paketmanagement
    Debian-Paketmanagement Axel Beckert und Frank Hofmann Onyx Neon Debian-Paketmanagement ii Copyright © 2012, 2013, 2014, 2015, 2016, 2017, 2018, 2019, 2020, 2021 Axel Beckert, Frank Hofmann Das Buch "Debian-Paketmanagement" von Frank Hofmann und Axel Beckert ist lizenziert unter einer Creative Commons Na- mensnennung - Weitergabe unter gleichen Bedingungen 4.0 International Lizenz. Debian-Paketmanagement iii VERSIONSGESCHICHTE NUMMER DATUM BESCHREIBUNG NAME debian/0_2021.03.01- 2021-09- 42-g5ca0d33 17T23:18:06+00:00 Debian-Paketmanagement iv Inhaltsverzeichnis I Konzepte 1 1 Willkommen im Linux-Dschungel! 2 1.1 Was ist Debian?....................................................2 1.2 Debian-Architekturen.................................................3 1.2.1 Debian-Ports-Projekt.............................................3 1.2.2 Pakete für alle Architekturen.........................................4 1.2.3 Multiarch: Mehrere Architekturen gleichzeitig auf einem System......................4 1.2.4 Bevor es Multiarch gab............................................5 1.3 Vom tar.gz zur Linux-Distribution........................................5 1.4 Debians Paketsystem.................................................5 1.5 Welche UNIX-artigen Betriebssysteme verwenden das Paketformat und das APT-Paketmanagement.......6 2 Software in Paketen organisieren 7 2.1 Was ist Paketmanagement...............................................7 2.2 Varianten und Formate für Softwarepakete......................................8 2.3 Softwarestapel und Ebenen..............................................9
    [Show full text]
  • Adventures in the Forth Dimension
    Adventures in the Forth Dimension Julian V. Noble Forth is a language that, for most programmers, is like the back side of the Mo on: they know it is there but have never laid eyes on it. Yet Forth is mature (ab out as old as Lisp or C); ubiquitous in everyday applications such as automated tellers, package tracking, airline reservations, telephone switching systems, and micro controllers everywhere; and built into millions of p ersonal computers and workstations in the guise of IEEE Op en Firmware. The page description language PostScript also has its ro ots in Forth. Since Forth is, like the game of Go, simple yet deep, scientists and engi- neers should b e b etter acquainted with it. This article provides an overview 1 2 of the language with emphasis on availability , ease of use , structure, read- ability,and features considered imp ortantor essential in mo dern languages. I will also discuss execution sp eed and co de size. To comp ensate for the omissions imp osed by brevity I include an App endix of Web links to sources of further information. Forth is usually considered a language for embedded programming and control. However it is frequently applied in the less familiar contexts of rob otics, data pro cessing, security systems, music, and number crunching. Though not often taught in academic settings, Forth is a go o d p edagogical language since it o ers access to all the computer's facilities and few prede- ned amenities. A student learns intimately how a heap or linked list works 1 Taygeta Scienti c Inc.
    [Show full text]
  • My History with Forth & Stack Machines
    My history with Forth & stack machines 9/30/12 9:49 PM Proper Fixation a substitute for anaesthesia blog about yosefk.com ← The Internet age/reputation paradox The Iron Fist Coding Standard → My history with Forth & stack machines September 10th, 2010 | hardware, software My VLSI tools take a chip from conception through testing. Perhaps 500 lines of source code. Cadence, Mentor Graphics do the same, more or less. With how much source/object code? – Chuck Moore, the inventor of Forth This is a personal account of my experience implementing and using the Forth programming language and the stack machine architecture. "Implementing and using" - in that order, pretty much; a somewhat typical order, as will become apparent. It will also become clear why, having defined the instruction set of a processor designed to run Forth that went into production, I don't consider myself a competent Forth programmer (now is the time to warn that my understanding of Forth is just that - my own understanding; wouldn't count on it too much.) Why the epigraph about Chuck Moore's VLSI tools? Because Forth is very radical. Black Square kind of radical. An approach to programming seemingly leaving out most if not all of programming: …Forth does it differently. There is no syntax, no redundancy, no typing. There are no errors that can be detected. …there are no parentheses. No indentation. No hooks, no compatibility. …No files. No operating system. I've never been a huge fan of suprematism or modernism in general. However, a particular modernist can easily get my attention if he's a genius in a traditional sense, with superpowers.
    [Show full text]
  • A Basis for Ubiquitous Virtual Machines
    Mite a basis for ubiquitous virtual machines Reuben Rhys Thomas St John's College A dissertation submitted for the Ph.D. degree November 2000 Ars est celare artem To Basil Rose, who did it first, and Tony Thomas, who hooked me. Preface Mite is a virtual machine intended to provide fast language and machine-neutral just-in-time translation of binary-portable object code into high quality native code, with a formal foundation. Chapter 1 discusses the need for fast good quality translation of portable code at run time, and, having considered what functionality is needed, sets out a list of goals that Mite should reach, then states the contribution of the thesis. Chapter 2 discusses related work, concentrating on the extent to which the various systems examined fulfil Mite's goals. Chapter 3 elaborates Mite's design, and chapter 4 anal- yses the choices that it makes. Chapter 5 examines the implementation, consisting of a C compiler back end targeting Mite and a virtual code translator for the ARM processor, and shows how it could be extended to other languages and processors. Chapter 6 describes and analyses a series of tests performed on the implementation, then assesses both the design and implementation in their light. Chapter 7 describes future work; finally, chapter 8 concludes the thesis with an appraisal of how well Mite meets its goals, and a final perspective. Appendices A–C give Mite's definition, appendix 6.1 lists two of the benchmark programs used in chapter 6, and appendix E gives the data collected from the tests.
    [Show full text]