Clustered VLIW Architectures : a Quantitative Approach

Total Page:16

File Type:pdf, Size:1020Kb

Clustered VLIW Architectures : a Quantitative Approach Clustered VLIW architectures : a quantitative approach Citation for published version (APA): Terechko, A. S. (2007). Clustered VLIW architectures : a quantitative approach. Technische Universiteit Eindhoven. https://doi.org/10.6100/IR617141 DOI: 10.6100/IR617141 Document status and date: Published: 01/01/2007 Document Version: Publisher’s PDF, also known as Version of Record (includes final page, issue and volume numbers) Please check the document version of this publication: • A submitted manuscript is the version of the article upon submission and before peer-review. There can be important differences between the submitted version and the official published version of record. People interested in the research are advised to contact the author for the final version of the publication, or visit the DOI to the publisher's website. • The final author version and the galley proof are versions of the publication after peer review. • The final published version features the final layout of the paper including the volume, issue and page numbers. Link to publication General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. • Users may download and print one copy of any publication from the public portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain • You may freely distribute the URL identifying the publication in the public portal. If the publication is distributed under the terms of Article 25fa of the Dutch Copyright Act, indicated by the “Taverne” license above, please follow below link for the End User Agreement: www.tue.nl/taverne Take down policy If you believe that this document breaches copyright please contact us at: [email protected] providing details and we will investigate your claim. Download date: 01. Oct. 2021 Clustered VLIW Architectures: a Quantitative Approach Andrei Sergeevich Terechko Doctorate committee: prof.dr.ir. A.C.P.M. Backx Technical University Eindhoven, chairman prof.dr. H. Corporaal Technical University Eindhoven, Netherlands dr.ir. J.T.J. van Eijndhoven NXP Semiconductors, Netherlands prof.dr. A. González Intel and Universitat Politècnica de Catalunya, Spain prof.dr.ir. J.L. van Meerbergen Royal Philips Electronics, Netherlands prof.dr.ir. R.H.J.M. Otten Technical University Eindhoven, Netherlands dr.ir. P. Stravers NXP Semiconductors, Netherlands The cover design by Henny Herps, Floris van den Haar, and Andrei Terechko. The front cover shows a layout of a tiny synchronous digital IC with 24 standard cell gates and 26 nets in the Philips CMOS 65 nm standard Vth technology. The reader is challenged to guess the ASCII string with an extra proposition of this PhD thesis that this IC generates on the 8 out- put pins. Note that the clock and reset are not shown to simplify the fig- ure; the colors for polysilicon, diffusion, etc. are non-standard; metal vias are not visible. The reverse engineering associated with this chal- lenge is a “walk in the park” compared to what Soviet Union engineers did in the 1980s to clone DEC and Intel processors with thousands of gates. If no solution is found before 06 May 2007, I will publish hints to the solution on http://terechko.net/andrei/phd. Clustered VLIW Architectures: a Quantitative Approach PROEFSCHRIFT ter verkrijging van de graad van doctor aan de Technische Universiteit Eindhoven, op gezag van de Rector Magnificus, prof.dr.ir. C.J. van Duijn, voor een commissie aangewezen door het College voor Promoties in het openbaar te verdedigen op dinsdag 6 februari 2007 om 16.00 uur door Andrei Sergeevich Terechko geboren te Minsk, Wit-Rusland Dit proefschrift is goedgekeurd door de promotoren: prof.dr. H. Corporaal en prof.dr.ir. R.H.J.M. Otten Copromotor: dr.ir. P. Stravers CIP-DATA LIBRARY TECHNISCHE UNIVERSITEIT EINDHOVEN Terechko, Andrei S. Clustered VLIW architectures : a quantitative approach / by Andrei Sergeevich Terechko. – Eindhoven : Technische Universiteit Eindhoven, 2007. Proefschrift. – ISBN-10: 90-386-1953-7 ISBN-13: 978-90-386-1953-8 NUR 959 Trefw.: ingebedde systemen / computerontwerp ; microprocessoren / compilers / multimedia. Subject headings: embedded systems / computer architecture / program compilers / multimedia systems. The work described in this thesis has been carried out at the Philips Research Laboratories Eindhoven and NXP Semiconductors as part of the Research pro- gram. © NXP Semiconductors 2007 All rights reserved. Reproduction or dissemination in whole or in part is prohibited without the prior written consent of the copyright holder. Acknowledgments According to T. Kuhn's “The Structure of Scientific Revolutions” a scientific work can be successfully carried out only through intensive interactions among the members of a scientific community. First, I'm deeply grateful to my PhD su- pervisor Professor Henk Corporaal. Henk's mighty creativity and dedication to Science continuously inspired and motivated me in my winding quest for under- standing processor architecture. Furthermore, sheer and extravagant ingenuity of Paul Stravers hugely influenced my scientific and cultural existence. No doubt I'll be writing more open-source code, reading Maarten Biesheuvel and listening to Jacques Brel thanks to Paul. I also like to acknowledge Professor Ralph Otten who helped me better understand technological aspects in my re- search. On top of that, I thank Erwan Le Thénaff, Manish Garg, and Evert-Jan Pol, who supported (and bore with) my first steps in our joint scientific publica- tions. My life got a surrealistic kick in 1999, when I moved from Belarus to The Netherlands accepting a research position at Philips Research laboratories in Eindhoven. Luckily, this step allowed me to submerge into a cutting-edge Com- puter Science community and to hang out with a mind-boggling international crowd. I'm deeply grateful to Jos van Eijndhoven, who not only withstood me for 6 years in the same office at Research, but also dedicated many vivid hours to brainstorming with me about processor architecture, neat programming and the intricate Dutch society. Furthermore, I would like to acknowledge the ele- gant programming style of Jan Hoogerbrugge, whose VLIW instruction sched- uler formed the basis for my quantitative exploration. I will always have great memories of Philips, especially, of my intense scientific and engineering argu- ments with the Indian genius of Jayram Nageswaran and the Dutch genius of Jan-Willem van de Waerdt; let alone the fun of being part of an international micro-society with Giovanni Nisato, Catherine Nisato, Natalino Busa, Enith Vlooswijk, Clara Otero-Perez, Eugenio Cantatore, Manvi Agarwal, Otto Stein- busch, Martijn Rutten, Orlando Moreira and the guitarist Mathias Lang. My very first scientific experience in 1995-1998 was guided by Professor Gen- nadiy Shpackovsky at the Belarusian State University, who introduced me to the most intellectual game invented by the human society – Science. Finally, I ac- knowledge the contribution of many (anonymous) people to the open-source software (GNU Linux, GNU emacs, Mediabench, Putty, OOo, sWiki, WP) and free resources (e.g. http://wikipedia.org) that I used for my research. My love goes to Masha, Polina, and my parents Sergei and Tamara Terechko. Andrei Terechko Eindhoven, The Netherlands, 2007 v vi Table of Contents ACKNOWLEDGMENTS.......................................................................................... V 1. INTRODUCTION.............................................................................................. 1 1.1. Embedded computing........................................................................................... 2 1.1.1. Embedded multimedia workload................................................................. 2 1.1.2. Instruction-Level Parallelism....................................................................... 5 1.1.3. Embedded multimedia System-on-a-Chip................................................... 7 1.2. Deep sub-micron VLSI technology trends......................................................... 10 1.2.1. Poor scalability of interconnect delay........................................................ 11 1.2.2. Power consumption constraints..................................................................12 1.3. Problem statement of the thesis.......................................................................... 14 1.4. Thesis organization............................................................................................. 15 2. ARCHITECTURE OF CLUSTERED PROCESSORS.................................................. 17 2.1. Unicluster VLIW processor architecture............................................................ 17 2.2. From the unicluster to a clustered architecture................................................... 19 2.3. Clustered microarchitecture................................................................................ 23 2.3.1. Inter-cluster communication networks.......................................................23 2.3.1.1. Networks with non-uniform latency..................................................25 2.3.2. Clustering memory hierarchy.....................................................................26 2.3.3. Control for clustered processors.................................................................29
Recommended publications
  • Dynamic Helper Threaded Prefetching on the Sun Ultrasparc® CMP Processor
    Dynamic Helper Threaded Prefetching on the Sun UltraSPARC® CMP Processor Jiwei Lu, Abhinav Das, Wei-Chung Hsu Khoa Nguyen, Santosh G. Abraham Department of Computer Science and Engineering Scalable Systems Group University of Minnesota, Twin Cities Sun Microsystems Inc. {jiwei,adas,hsu}@cs.umn.edu {khoa.nguyen,santosh.abraham}@sun.com Abstract [26], [28], the processor checkpoints the architectural state and continues speculative execution that Data prefetching via helper threading has been prefetches subsequent misses in the shadow of the extensively investigated on Simultaneous Multi- initial triggering missing load. When the initial load Threading (SMT) or Virtual Multi-Threading (VMT) arrives, the processor resumes execution from the architectures. Although reportedly large cache checkpointed state. In software pre-execution (also latency can be hidden by helper threads at runtime, referred to as helper threads or software scouting) [2], most techniques rely on hardware support to reduce [4], [7], [10], [14], [24], [29], [35], a distilled version context switch overhead between the main thread and of the forward slice starting from the missing load is helper thread as well as rely on static profile feedback executed, minimizing the utilization of execution to construct the help thread code. This paper develops resources. Helper threads utilizing run-time a new solution by exploiting helper threaded pre- compilation techniques may also be effectively fetching through dynamic optimization on the latest deployed on processors that do not have the necessary UltraSPARC Chip-Multiprocessing (CMP) processor. hardware support for hardware scouting (such as Our experiments show that by utilizing the otherwise checkpointing and resuming regular execution). idle processor core, a single user-level helper thread Initial research on software helper threads is sufficient to improve the runtime performance of the developed the underlying run-time compiler main thread without triggering multiple thread slices.
    [Show full text]
  • Chapter 2 Java Processor Architectural
    INFORMATION TO USERS This manuscript has been reproduced from the microfilm master. UMI films the text directly from the original or copy submitted. Thus, some thesis and dissertation copies are in typewriter face, while others may be from any type of computer printer. The quality of this reproduction is dependent upon the quality of the copy submitted. Broken or indistinct print, colored or poor quality illustrations and photographs, print bleedthrough, substandard margins, and improper alignment can adversely affect reproduction. In the unlikely event that the author did not send UMI a complete manuscript and there are missing pages, these will be noted. Also, if unauthorized copyright material had to be removed, a note will indicate the deletion. Oversize materials (e.g., maps, drawings, charts) are reproduced by sectioning the original, beginning at the upper left-hand comer and continuing from left to right in equal sections with small overlaps. ProQuest Information and Leaming 300 North Zeeb Road, Ann Arbor, Ml 48106-1346 USA 800-521-0600 UMI" The JAFARDD Processor: A Java Architecture Based on a Folding Algorithm, with Reservation Stations, Dynamic Translation, and Dual Processing by Mohamed Watheq AH Kamel El-Kharashi B. Sc., Ain Shams University, 1992 M. Sc., Ain Shams University, 1996 A Dissertation Submitted in Partial Fulfillment of the Requirements for the Degree of D o c t o r o f P h il o s o p h y in the Department of Electrical and Computer Engineering We accept this dissertation as conforming to the required standard Dr. F. Gebali, Supervisor (Department of Electrical and Computer Engineering) Dr.
    [Show full text]
  • Debugging Multicore & Shared- Memory Embedded Systems
    Debugging Multicore & Shared- Memory Embedded Systems Classes 249 & 269 2007 edition Jakob Engblom, PhD Virtutech [email protected] 1 Scope & Context of This Talk z Multiprocessor revolution z Programming multicore z (In)determinism z Error sources z Debugging techniques 2 Scope and Context of This Talk z Some material specific to shared-memory symmetric multiprocessors and multicore designs – There are lots of problems particular to this z But most concepts are general to almost any parallel application – The problem is really with parallelism and concurrency rather than a particular design choice 3 Introduction & Background Multiprocessing: what, why, and when? 4 The Multicore Revolution is Here! z The imminent event of parallel computers with many processors taking over from single processors has been declared before... z This time it is for real. Why? z More instruction-level parallelism hard to find – Very complex designs needed for small gain – Thread-level parallelism appears live and well z Clock frequency scaling is slowing drastically – Too much power and heat when pushing envelope z Cannot communicate across chip fast enough – Better to design small local units with short paths z Effective use of billions of transistors – Easier to reuse a basic unit many times z Potential for very easy scaling – Just keep adding processors/cores for higher (peak) performance 5 Parallel Processing z John Hennessy, interviewed in the ACM Queue sees the following eras of computer architecture evolution: 1. Initial efforts and early designs. 1940. ENIAC, Zuse, Manchester, etc. 2. Instruction-Set Architecture. Mid-1960s. Starting with the IBM System/360 with multiple machines with the same compatible instruction set 3.
    [Show full text]
  • Ultrasparc-III Ultrasparc-III Vs Intel IA-64
    UltraSparc-III UltraSparc-III vs Intel IA-64 vs • Introduction Intel IA-64 • Framework Definition Maria Celeste Marques Pinto • Architecture Comparition Departamento de Informática, Universidade do Minho • Future Trends 4710 - 057 Braga, Portugal [email protected] • Conclusions ICCA’03 ICCA’03 Introduction Framework Definition • UltraSparc-III (US-III) is the third generation from the • Reliability UltraSPARC family of Sun • Instruction level Parallelism (ILP) • Is a RISC processor and uses the 64-bit SPARC-V9 architecture – instructions per cycle • IA-64 is Intel’s extension into a 64-bit architecture • Branch Handling • IA-64 processor is based on a concept known as EPIC (Explicitly – Techniques: Parallel Instruction Computing) • branch delay slots • predication – Strategies: • static •dynamic ICCA’03 ICCA’03 Framework Definition Framework Definition • Memory Hierarchy • Pipeline – main memory and cache memory – increase the speed of CPU processing – cache levels location – several stages that performs part of the work necessary to execute an instruction – cache organization: • fully associative - every entry has a slot in the "cache directory" to indicate • Instruction Set (IS) where it came from in memory – is the hardware "language" in which the software tells the processor what to do • one-way set associative - only a single directory entry be searched – can be divided into four basic types of operations such as arithmetic, logical, • two-way set associative - two entries per slot to be searched (and is extended to program-control
    [Show full text]
  • Sun Blade 1000 and 2000 Workstations
    Sun BladeTM 1000 and 2000 Workstations Just the Facts Copyrights 2002 Sun Microsystems, Inc. All Rights Reserved. Sun, Sun Microsystems, the Sun logo, Sun Blade, PGX, Solaris, Ultra, Sun Enterprise, Starfire, SunPCi, Forte, VIS, XGL, XIL, Java, Java 3D, SunVideo, SunVideo Plus, Sun StorEdge, SunMicrophone, SunVTS, Solstice, Solstice AdminTools, Solstice Enterprise Agents, ShowMe, ShowMe How, ShowMe TV, Sun Workstation, StarOffice, iPlanet, Solaris Resource Manager, Java 2D, OpenWindows, SunCD, Sun Quad FastEthernet, SunFDDI, SunATM, SunCamera, SunForum, PGX32, SunSpectrum, SunSpectrum Platinum, SunSpectrum Gold, SunSpectrum Silver, SunSpectrum Bronze, SunSolve, SunSolve EarlyNotifier, and SunClient are trademarks, registered trademarks, or service marks of Sun Microsystems, Inc. in the United States and other countries. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. in the United States and other countries. Products bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems, Inc. UNIX is a registered trademark in the United States and in other countries, exclusively licensed through X/Open Company, Ltd. FireWire is a registered trademark of Apple Computer, Inc., used under license. OpenGL is a trademark of Silicon Graphics, Inc., which may be registered in certain jurisdictions. Netscape is a trademark of Netscape Communications Corporation. PostScript and Display PostScript are trademarks of Adobe Systems, Inc., which may be registered in
    [Show full text]
  • Robert Garner
    San José State University The Department of Computer Science and The Department of Computer Engineering jointly present THE HISTORY OF COMPUTING SPEAKER SERIES Robert Garner Tales of CISC and RISC from Xerox PARC and Sun Wednesday, December 7 6:00 – 7:00 PM Auditorium ENGR 189 Engineering Building CISCs emerged in the dawn of electronic digital computers, evolved flourishes, and held sway over the first half of the computing revolution. As transistors shrank and memories became faster, larger, and cheaper, fleet- footed RISCs dominated in the second half. In this talk, I'll describe the paradigm shift as I experienced it from Xerox PARC's CISC (Mesa STAR) to Sun's RISC (SPARC Sun-4). I'll also review the key factors that affected microprocessor performance and appraise how well I predicted its growth for the next 12 years back in 1996. I'll also share anecdotes from PARC and Sun in the late 70s and early 80s. Robert’s Silicon Valley career began with a Stanford MSEE in 1977 as his bike route shifted slightly eastward to Xerox’s Palo Alto Research facility to work with Bob Metcalfe on the 10-Mbps Ethernet and the Xerox STAR workstation, shipped in 1981. He subsequently joined Lynn Conway’s VLSI group at Xerox PARC. In 1984, he jumped to the dazzling start-up Sun Microsystems, where with Bill Joy and David Patterson, he led the definition of Sun’s RISC (SPARC) while co-designing the Sun-4/200 workstation, shipped in 1986. He subsequently co-managed several engineering product teams: the SunDragon/SPARCCenter multiprocessor, the UltraSPARC-I microprocessor, the multi-core/threaded Java MAJC microprocessor, and the JINI distributed devices project.
    [Show full text]
  • A Novel Dynamic Scan Low Power Design for Testability Architecture for System-On-Chip Platform
    International Journal of Applied Engineering Research ISSN 0973-4562 Volume 13, Number 7 (2018) pp. 5256-5259 © Research India Publications. http://www.ripublication.com A Novel Dynamic Scan Low Power Design for Testability Architecture for System-On-Chip Platform John Bedford Solomon1, D Jackuline Moni2 and Y. Amar Babu3 1Research Scholar, Karunya University, Coimbatore, Tamil Nadu, India. 2Professor, Karunya University, Coimbatore, Tamil Nadu, India. 3Associate Professor , LBRCE, Mylavaram, Andhra Pradesh, India. Abstract acceleration modules and hardware debug module. The MicroBlaze based SoC platform uses Xilinx microkernel and Design for Testability and Low power design issues are UART custom peripherals. It also support third party demanding requirements for System-On-Chip plat form to RTOS[6-14]. The MicroBlaze required 525 slices in Spartan-3 provide high performance and highly reliable chips. In this FPGA and executes 65 D-MIPS at 80 MHz. The PicoBlaze is paper a novel dynamic scan low power design for testability a 8 bit RISC core, requires 192 logic cells and 212 macrocells architecture has been proposed for reducing test power on in Spartan-3 FPGA[14], executes 37 MIPS at 74MHz. The System-On-Chip platform. The area analysis of proposed Sun Microsystems UltraSPARC T2 microprocessor is also architecture for soft core Microblaze based System-On-chip used to study the performance of proposed dynamic scan low observes 5% reduction of area and test power also reduced by power DFT architecture. 10% approximately. The area and power analysis of proposed architecture for Microblaze based SoC platform, Picoblaze The dynamic scan low power DFT architecture has been based SoC platform and Ultrasparc T2 architectures has been implemented on above mentioned architectures to analyze observed for optimized area and power design metrics.
    [Show full text]
  • Hotchips '99 Hotchips
    MAJCTM Microprocessor Architecture for Java Computing MarcMarc Tremblay,Tremblay, ChiefChief ArchitectArchitect SunSun MicrosystemsMicrosystems IncInc.. HotChipsHotChips ‘99‘99 MicroprocessorMicroprocessor ArchitectureArchitecture forfor JavaJava ComputingComputing Four-year program to develop a new microprocessor family based on the following four assumptions: 1) Current and future compute tasks are/will be very different than benchmarks used to develop CISC and RISC processors – Algorithms - compute intensive, ratio of compute operations to memory operations is much higher • e.g.: IIR filters, VOIP, MPEG2: 3-16 ops/flops per memory op. – Data types - digitized analog signals, lots of data – I/O - streaming, mostly linearly AssumptionsAssumptions (continued)(continued) 2) The dominant platform for networked devices, networked computers, terminals, application servers, etc. will be the JavaTM platform. – > 1 million Java developers, 72% of Fortune 1000 companies using Java (80% client, 100% server) by 2000, present in Navigator, Explorer, AOL browser, PalmPilots, 2-way pagers, cellular phones, etc. – Multi-threaded applications • Observing Java applications with 10’s and 100’s of threads – Run time optimizations is the prime focus – Architecture must consider virtual functions, exception model, garbage collection, etc. AssumptionsAssumptions (continued)(continued) 3) Parallelism is available at multiple levels – ILP improves through: •Data and control speculation – non-faulting loads, explicit MP predication, and traditional methods Thread
    [Show full text]
  • Sun Fire V880z Server and Sun XVR-4000 Graphics Accelerator
    Sun Fire™ V880z Server and Sun™ XVR-4000 Graphics Accelerator Installation and User’s Guide Sun Microsystems, Inc. 4150 Network Circle Santa Clara, CA 95054 U.S.A. 650-960-1300 Part No. 817-2400-10(v2) May 2003, Revision A Submit comments about this document at: http://www.sun.com/hwdocs/feedback Copyright 2003 Sun Microsystems, Inc., 4150 Network Circle, Santa Clara, California 95054, U.S.A. All rights reserved. Sun Microsystems, Inc. has intellectual property rights relating to technology embodied in the product that is described in this document. In particular, and without limitation, these intellectual property rights may include one or more of the U.S. patents listed at http://www.sun.com/patents and one or more additional patents or pending patent applications in the U.S. and in other countries. This document and the product to which it pertains are distributed under licenses restricting their use, copying, distribution, and decompilation. No part of the product or of this document may be reproduced in any form by any means without prior written authorization of Sun and its licensors, if any. Third-party software, including font technology, is copyrighted and licensed from Sun suppliers. Parts of the product may be derived from Berkeley BSD systems, licensed from the University of California. UNIX is a registered trademark in the U.S. and in other countries, exclusively licensed through X/Open Company, Ltd. Sun, Sun Microsystems, the Sun logo, AnswerBook2, docs.sun.com, Sun Fire, Java3D, Java, OpenBoot, and Solaris are trademarks or registered trademarks of Sun Microsystems, Inc.
    [Show full text]
  • Power4 Focuses on Memory Bandwidth IBM Confronts IA-64, Says ISA Not Important
    Power4 Focuses on Memory Bandwidth IBM Confronts IA-64, Says ISA Not Important by Keith Diefendorff Looking for Parallelism in All the Right Places With Power4, IBM is targeting the high-reliability servers Not content to wrap sheet metal around that will power future e-businesses. The company has said Intel microprocessors for its future server that Power4 was designed and optimized primarily for business, IBM is developing a processor it servers but that it will be more than adequate for workstation hopes will fend off the IA-64 juggernaut. Speaking at this duty as well. The market IBM apparently seeks starts just week’s Microprocessor Forum, chief architect Jim Kahle de- above small PC-based servers and runs all the way up scribed IBM’s monster 170-million-transistor Power4 chip, through the high-end high-availability enterprise servers which boasts two 64-bit 1-GHz five-issue superscalar cores, a that run massive commercial and technical workloads for triple-level cache hierarchy, a 10-GByte/s main-memory corporations and governments. interface, and a 45-GByte/s multiprocessor interface, as Much to IBM’s chagrin, Intel and HP have also aimed Figure 1 shows. Kahle said that IBM will see first silicon on IA-64 at servers and workstations. IA-64 system vendors Power4 in 1Q00, and systems will begin shipping in 2H01. such as HP and SGI have their sights set as high up the server scale as IBM does, so there is clearly a large overlap between No Holds Barred the markets all these companies covet.
    [Show full text]
  • Java Microarchitectures
    Book Review / Revue de livre Java Microarchitectures he need for an architectural neutral language that could be T used to produce code that would run on a variety of CPUs by: Purnendu Sinha, Nikhil Varma & Vasudevan Janarthanan, under differing environments led to the emergence of the Concordia University, Montreal, QC. Java programming language. With the rise of the World Wide Web, Java has propelled to the forefront of computer language design, because the web, too, demands portable programs. The environ- Book Edited by: Vijaykrishnan Narayanan, Mario I.Wolczko mental change that prompted Java was the need for platform- independent programs destined for distribution on the Internet. The Java Published by: Kluwer Academic Publishers platform with its target platform neutrality, simplified object model, ISBN: 1-4020-7034-9 strong notions of security and portability, as well as multithreading sup- 2002 port, provides many advantages for a new generation of networked, embedded and real-time systems. All those features would not have been possible without appropriate hardware support. This book delves in-depth into the various hardware requirements (with suitable case- processor architecture for the hardware execution of the bytecodes and studies and examples) for realizing the advantages of Java. resolves the issue of stack dependency by the use of a hardware byte- code folding algorithm. The architecture provides a dual processing Java’s portability is attained by compiling Java programs to Java byte- capability of execution of bytecodes and native binaries. This chapter codes (JBCs) and interpreting them on the platform-independent Java discusses and analyzes the bytecode processing in various stages of the Virtual Machine (JVM).
    [Show full text]
  • Highly Threaded SPARC Architectures
    Highly Threaded SPARC Architectures James Hughes Sun Fellow Sun Microsystems Note: future dates and features within this presentation reflect Sun's current plans but may change without notice. Driving Volume to Value Open Source Free Access Revenue Making SW Development Choice Increases Volume Business Deployment Solaris • Free RTU Solaris 10 NetBeans (unsupported) Java ES Glassfish • Extended Try and Buy ID Management Program Single sign-on Java CAPS • Connected to Sun All Middleware Mobile '(")* +,)$%-.,*/%01"#)"2 : %&#;< %=147>?4@A=;&34?B;#=@4 C $D;EF;%GH;I8F;J@=31H;KLM0N;(O C DPL3=47N;Q>JJNL011A?@04@R3;%*'F *'# %&# : !"#9,$;< %=43S37;!53?>4@A=;#=@41 C P;4B730T1;1B073;30?B;>=@4 C !53?>431;A=3;@=43S37; @=147>?4@A=,?N?J3 !"#9 !"#$ : '(#;< 'A0T,(4A73;#=@4 C KEF;UGH;$DF;J@=31H;PLM0N;(O ++#, C $8KL3=47N;Q>JJNL011A?@04@R3; -.*. U*'F : &/#;< &JA04@=S,/70VB@?1;#=@4 : ()#;< (4730W;)7A?311@=S;#=@4 ()# &/# '(# C X7NV4AS70VB@?;0??3J3704@A= : *'#;< *70V;'AS@?;#=@4 C #VT0431;W0?B@=3;14043H;B0=TJ31; 35?3V4@A=1;0=T;@=4377>V41 /01234 : ++#;< +3WA7N;+0=0S3W3=4;#=@4 C -07TM073;406J3M0J2;Y-.*.Z 5607,'8 C KEFH;DPEFH;P+FH;8[D+F;V0S31 !"#$%%%& ()*% K !7,,GLH(H&,(E&5$M&.I&H*$ !"#$%&'()*&+$!(,& BCD5 $$$$$$-./0.123$ 5(>(5&?)@+*N 0%?0J O =(>(5&?)@+* (E$HD+D,,&,$ +)A +)0 P(*Q$D55$C+$R7,*(H,G$ CH&+D*(CE)$CS$C*Q&+$ :*C+& !"# *Q+&D5) 9E*&'&+?F+GH*C :C7+I&) !"$ K !"#$H&+SC+R)$(E*&'&+$ 455 89:$0;< =(>? R7,*(H,GT$5(>(5&T$HCH7,D*(CE$ !"% 67, :@+* IC7E* !"& K 67,*(H,(&+$&EQDEI&R&E*)$ 9E*&'&+?F+GH*C SC+$RC57,D+$D+(*QR&*(I$ %&)7,* !"' CH&+D*(CE) !( O U7(,*L(E$DII7R7,D*C+
    [Show full text]