Multi-Threading in Electric™, a Java VLSI CAD Tool


Gilda Garretón
Sun Microsystems Laboratories
JUG.cl, May 2008

Sun Microsystems Laboratories @ 2008 Copyrights

Agenda
• Who we are
• Research activities
• Electric, a Java VLSI framework
• Multi-Threading: 2 projects
• Lessons learned

Sun Microsystems Laboratories
• Center for innovation
• Mission
• Research Strategy
• Collaboration

Our Technology Changes the World
• Fortress
• Sun SPOT
• Darkstar
• Digital Rights
• Tools for Demand Forecasting
• Search
• Solaris on Power PC
• Honeycomb
• Elliptic Curve Cryptography
• UltraSPARC® V9
• Electric
• Sun Cluster
• Sun Ray
Developed in Sun Labs, transferred to Sun products and to the outside world

Sun Labs Project Focus
• System Hardware
• System Software
• System Science
• Network Clients

VLSI Research Group
• Diverse group (~20)
  > Hardware (14)
  > Software (4)
  > Both (2)
• Education
  > BS, MS, PhD
• Rank
  > 1 PE, 4 DEs and 1 Sun Fellow
• Collaboration aims at innovation
  > Research: circuit (HW) and CAD (SW)
[Image label: Moscow]

VLSI Circuit Research
• Design circuits to enable novel architectures
• Low-power circuits
• High-speed circuits
• Asynchronous circuits
• Communication Links
  > Proximity Communication
[Figure: Chip 1 and Chip 2 in Proximity Communication, with paired Transmit/Receive labels]

VLSI CAD Research
• Challenges
  > Size and performance
  > Hierarchical representation
  > Shrinking technologies
• Research topics
  > Handling of large data sets
  > Geometric algorithms
  > Visualization and collaboration
• Software Research
  > Client/server model
  > 2D/3D visualization
  > Multi-Threading
• Collaboration with Harvey Mudd College
[Figure: Metal Cage Structure]

Making Test Chips: Custom VLSI Chip Design Flow
[Figure: Chip Building Process, CAD, Testing Process]
1. Schematic Entry and Transistor Sizing
  > Logical Effort support
2. Simulation
  > Interface to CAD Tools of Our Choice
  > Waveform Viewer and Cross-Probing
3. Layout
  > Network Consistency Checker (NCC)
  > Design Rules Checker (DRC)
  > Automatic fill generator
  > Routing, auto stitching tool
4. Fabrication
  > Foundry Input, Foundry Output
5. Testing
  > PxIO Testing Board

Electric VLSI Design System
• Java development platform
• Technical benefits for HW/SW
• External/Open source community
  > www.staticfreesoft.com
  > www.opensparc.net/
  > Google VLSI forum
• Dynamic development cycle
[Figure: Electric Framework]

Multi-Threading in Electric
• Used in schematics/layout phases
  > Analysis tools
  > Route and placement
• 2 Multi-threading projects
  > Using JDK threading classes directly: routing tool
  > Using the threading capabilities in Electric (integrated solution): DRC tool
• Available in next GNU release: June 2008

Multi-Threading a Routing Tool
• Routing using cost functions
  > Based on Dijkstra search
  > Two parallel techniques
    • Tech 1: two directions (A<->B)
    • Tech 2: multiple routes
• Implementation
  > Runtime.getRuntime().availableProcessors()
  > Class RouteInThread extends Thread
  > Class Semaphore
    • acquireUninterruptibly()
    • release()

Multi-Threaded Routing Results
Example: 682 segments to route
[Figure: two charts of routing time (s) versus number of CPUs, each with series for a 2-CPU box, a 4-CPU box, and an 8-CPU box. 2-direction technique: 1 CPU and 2 CPUs. Multiple-routes technique: 1 CPU and max. # CPUs.]

Multi-Threading a DRC Tool
• Electric's core database
• Jobs Manager
  > Pools of jobs
  > Java threads
• Client/server model
  > Serializable interface
    • ByteArrayOutputStream
    • ObjectInputStream
• Class MultiTaskJob<Key,Task,Result> extends Job
[Figure: Jobs, JobManager, Threads]

DRC Introduction
• Verify correctness
  > Recommended rules
  > Geometrical conditions
    • w(l1), f(l1, l1), g(l1, l2)
• Single-Threaded (STD) algorithms
[Figure: geometry labeled with spacing, enclosure, width, extension, area, length]

Single-Threaded DRC Results
Time-consuming in layouts
[Figure: two charts for Test A, Test B, and Test C: DRC Execution Time* (min.) and # Transistors in Electric]
* AMD64, 2.8 GHz, 16 GB

Multi-Threaded DRC Results
• Simplest approach, given the existing STD code
  > Less than 1 week to implement
• Best results in N2 with 16T and in Sun Fire with 4T
[Figure: two charts of MTD DRC time (min.) for 1, 2, 4, 16, 32, and 64 threads. MTD DRC Time in Sun Fire: Sun Fire X4600 (4 CPUs, 2.6 GHz) and Sun Fire X2100 (4 CPUs, 2.6 GHz, 16 GB). MTD DRC Time in N2: SPARC Enterprise T5220 (N2, 1.4 GHz, 8 CPUs).]

Lessons Learned (1/2)
• Applications will require changes to utilize multi-threading
  > Changes need not be significant
  > More than one way to multi-thread
• Porting to MTD Java is feasible
  > Application-specific multi-threading strategies
• Software infrastructure matters!
  > Multi-threading using existing job mechanism
  > Profiling tools help understand results
    • SunStudio Collect/Analyzer

Lessons Learned (2/2)
• MTD Routing Tool
  > Speedup of over 4X on an 8-CPU machine
• MTD DRC Tool
  > Results per layer
    • Up to 41% in Sun Fire X4600
    • Up to 37% in N2 with 16 threads
  > Future work: geometry partition

Questions?
Gilda Garretón
Sun Microsystems Laboratories
JUG.cl, May 2008
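
Sketch: routing threads limited by a semaphore
The routing-tool implementation slide names three JDK pieces: Runtime.getRuntime().availableProcessors(), a RouteInThread class that extends Thread, and Semaphore with acquireUninterruptibly() and release(). The slides do not show how Electric combines them, so the following is only a minimal sketch of one plausible arrangement, under the assumption that the semaphore caps the number of concurrently running route threads at the CPU count. RoutingSketch, routeAll, and routeSegment are hypothetical names, and routeSegment is an empty placeholder for the cost-function search.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Semaphore;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical sketch; this is not Electric's actual routing code.
class RoutingSketch {

    /** One thread per segment, named after the slide's "RouteInThread extends Thread". */
    static class RouteInThread extends Thread {
        private final int segment;          // hypothetical segment id
        private final Semaphore permits;    // caps the number of running routes
        private final AtomicInteger routed; // shared count of finished routes

        RouteInThread(int segment, Semaphore permits, AtomicInteger routed) {
            this.segment = segment;
            this.permits = permits;
            this.routed = routed;
        }

        @Override
        public void run() {
            try {
                routeSegment(segment);
                routed.incrementAndGet();
            } finally {
                permits.release(); // free the slot even if routing fails
            }
        }
    }

    /** Placeholder (assumption): a real tool would run its cost-based search here. */
    static void routeSegment(int segment) {
        // intentionally empty
    }

    /** Routes segmentCount segments and returns how many finished. */
    static int routeAll(int segmentCount) {
        int cpus = Runtime.getRuntime().availableProcessors();
        Semaphore permits = new Semaphore(cpus); // one permit per CPU
        AtomicInteger routed = new AtomicInteger();
        List<Thread> threads = new ArrayList<>();

        for (int s = 0; s < segmentCount; s++) {
            permits.acquireUninterruptibly(); // wait until a slot is free
            Thread t = new RouteInThread(s, permits, routed);
            threads.add(t);
            t.start();
        }
        for (Thread t : threads) {
            try {
                t.join(); // wait for every route to finish
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new IllegalStateException("interrupted while routing", e);
            }
        }
        return routed.get();
    }

    public static void main(String[] args) {
        // 682 is the segment count from the routing results slide.
        System.out.println(routeAll(682) + " segments routed");
    }
}
```

The permit is acquired in the submitting thread before start() and released in the worker's finally block, so at most availableProcessors() routes run at once. A thread pool from java.util.concurrent would be the more common choice today; the sketch keeps to the classes the slide lists.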