Asynchronous Logic: a Computer Systems Perspective

Total Page:16

File Type:pdf, Size:1020Kb

Asynchronous Logic: a Computer Systems Perspective Asynchronous Logic: A Computer Systems Perspective Rajit Manohar Computer Systems Lab, Cornell http://vlsi.cornell.edu/ Approaches to computation Discrete Time Continuous Time Digital, Digital, synchronous Discrete Value asynchronous logic logic Switched-capacitor General analog Continuous Value analog computation Asynchronous logic clock receiver sender data signal value time sampling window receiver receiver sender data 0 sender data req data 1 ack ack data 1 X X X X data data 0 request acknowledge acknowledge Explicit versus implicit synchronization A B C A B C C1 C1 A1 B1 A1 B1 C2 A2 A2 B2 C2 B2 time time C3 C3 B3 A3 B3 A3 C4 B4 B4 C4 A4 A4 Synchronous Asynchronous computation computation Differences at a glance Asynchronous Synchronous Continuous time computation. Discrete time computation. Observed delay is the average over Observed delay is the maximum over possible circuit paths. possible circuit paths. Local switching from data-driven Global activity from clock-driven computation. Can lead to low-power computation. Low power requires operation. careful clock gating. Throughput determined by device Throughput determined by device speed. (Margins needed for some speed and additional operating circuit families.) margins. Data must be encoded, requiring Unencoded data acceptable due to additional wires for signals. global clock network. A little bit of (biased) history… Macro modular systems Binary addition Switching Illiac II (UIUC/Muller) (Clarke, Molnar, Stucki,…) (von Neumann) theory (Miller) Atlas, MU5 (Manchester) 1940 1950 1960 1970 Flow tables System Timing (Seitz, in (Huffman) Mead/Conway) Trace theory (Snepscheut) HiAER (Cauwenberghs) Syntactic compilation ARM with Pipelined FPGAs FACETS TrueNorth (Martin, Rem) caches (Furber) (Manohar) (Heidelberg) (IBM/Cornell) 1980 1990 2000 2010 Micro pipelines Asynchronous 17FO4 UltraSparc IIIi 10G Ethernet (Sutherland) CPU (Martin) MIPS CPU mem controller switch (Martin) (Sun) (Fulcrum) General hazard-free synthesis Commercial Event-driven SpiNNaker AER (Mead lab) 80C51 (Phillips) CPU (Manohar) (Furber) Sensory systems (retina, cochlea, etc.) Neurogrid (Boahen) I. Exploiting the gap between average and max delay 1 1 0 0 0 1 0 1 1 0 0 + 0 1 1 0 1 0 1 0 0 0 1 1 0 • Slow part: computing carries ❖ … but only in the worst-case! Theorem [von Neumann, 1946]. The average-case latency for a ripple carry binary adder is O(log N) for i.i.d. inputs A.W. Burks, H.H. Goldstein, J. von Neumann. Preliminary discussion of the logical design of an electronic computing instrument. (1946) I. Exploiting the gap between average and max delay • Standard adder tree ❖ O(log N) latency • Hybrid adder tree ❖ Tree adder + ripple adder Theorem [Winograd, 1965]. The worst-case latency for binary addition is Ω(log N) Theorem. The average-case latency of the hybrid asynchronous adder is O(log log N) for i.i.d. inputs Theorem. The average-case latency of the hybrid asynchronous adder is optimal for any input distribution S.O. Winograd. On the time required to perform addition. JACM (1965) R. Manohar and J. Tierno. Asynchronous parallel prefix computation. IEEE Transactions on Computers (1998) II. Exploiting data-driven power management • Low power FIR filter ❖ External, synchronous I/O ❖ Monitor occupancy of FIFOs ❖ Speed up/slow down using DC/DC adaptive power supply converter Vdd • Break-point add/multiply State ❖ Detect when top half of operand is required FIR filter ❖ Dynamically switch between full width and half width arithmetic L. Nielsen and J. Sparso. A Low-power Asynchronous Datapath for a FIR filter bank. Proc. ASYNC (1996) 0 1 0 1 0 1 II. Exploiting data-driven power management 000 0001 • Width-adaptive numbers 8−bits per VDW block 0 1 0 1 0 1 +000··· 0001 ❖ Compress leading 0’s and 1’s000··· 0010 ··· 128 − bit datapath receiver 000 0001 0 1 sender +000··· 0001 v/s +01 000··· 0010 0 10 ··· SPEC: 124.m88ksim SPEC: 126.gcc 100 0 100 14 47 0 1 Number: 1.4 10 2 90 +01 90 ⇥ ⇤ Average:0 8.4 Average: 10.0 80 0 10 80 70 70 60 60 0 receiver 50 50 sender 0 Average: 10.9 Average: 11.8 40 Average: 8.8 40 Average: 10.5 30 30 20 20 Cumulative Distribution Function (%) 10 no compaction Cumulative Distribution Function (%) 10 no compaction simple compaction simple compaction full compaction 5 full compaction 0 0 Number: 29 2 5 10 15 20 25 30 5 10 15 20 25 30 Operand Widths − ⇤Operand Widths SPEC: 129.compress R. Manohar. Width-Adaptive Data Word SPEC: 132.ijpeg 100 Architectures. Proc. ARVLSI100 (2001) 80 Average: 10.8 80 Average: 10.3 60 60 40 Average: 16.3 40 Average: 12.6 20 Average: 13.4 20 Average: 10.8 Cumulative Distribution Function (%) no compaction Cumulative Distribution Function (%) no compaction simple compaction simple compaction full compaction full compaction 0 0 5 10 15 20 25 30 5 10 15 20 25 30 Operand Widths Operand Widths III. Exploiting elasticity of pipelines f g h Asynchronous computation is* robust to changes in circuit- level pipelining! f g h *R. Manohar, A. J. Martin. Slack Elasticity in Concurrent Computing. Proc. Mathematics of Program Construction (1998) III. Exploiting elasticity of pipelines • Asynchronous FIR filter • Data arrives at variable rates • Data rates are predictable • Fixed amount of time for async computation 1 2 3 9 • Variable number of clock cycles for the fixed time budget M. Singh, J.A. Tierno, A. Rylyakov, S. Rylov, S.M. Nowick. An adaptively pipelined mixed synchronous-asynchronous digital FIR filter chip operating at 1.3 GHz. Proc. ASYNC (2002) III. Exploiting elasticity of pipelines • FPGA throughput low due to overhead of LB CB LB CB programmable interconnect CB SB CB SB • ~3x improvement in peak throughput by CB CB transparent interconnect pipelining LB LB CB SB CB SB J. Teifel and R. Manohar. Highly pipelined asynchronous FPGAs. Proc. FPGA (2004) IV. Exploiting timing robustness Asynchronous FPGA Test Data 1200 Process: TSMC 0.18um 1000 Nominal Vdd: 1.8V 77K 12K 800 294K 600 400 400K Throughput (MHz) 200 Xilinx Virtex 0 0.5 1 1.5 2 2.5 Voltage (V) D. Fang, J. Teifel, and R. Manohar. A High-Performance Asynchronous FPGA: Test Results. Proc. FCCM (2005) IV. Exploiting timing robustness • “GasP” logic family Self-reset Track • 6-10 FO4 reset Single-track w control wire • No margins on control RA(i) RA(i+1) signals Track set Wait for • Standard datapath L set, R reset • Latest 40nm chip operates Datapath >6 GHz driver w I. Sutherland, S. Fairbanks. GasP: a minimal FIFO control. Proc. ASYNC (2001) V. Exploiting low noise and low EMI time domain frequency domain 0 100 200 300 400 MHz 0 100 200 300 400 MHz Synchronous 80C51 Asynchronous 80C51 (Phillips) (Phillips) J. Kessels, T. Kramer, G. den Besten, A. Peeters, V. Timm. Applying Asynchronous Circuits in Contactless Smart Cards. Proc. ASYNC (2000) VI. Exploiting continuous-time information processing • Continuous-time digital signal processing • Automatic adaptation to input bandwidth Nyquist sampling Level-crossing sampling • No aliasing from sampling process B. Schell, Y. Tsividis. A clockless adc/dsp/dac system with activity- dependent power dissipation and no aliasing. Proc. ISSCC (2008) VI. Exploiting continuous-time information processing • Address-event representation in neuromorphic systems ❖ “Time represents itself” ❖ Spike timing and coincidence for information processing • Asynchronous logic ❖ Temporal resolution is decoupled from throughput ❖ Automatic power management • Projects ❖ Silicon retinas, cochleas, … ❖ More recent: FACETS, HiAER, Neurogrid, neuroP, SpiNNaker, TrueNorth A note on oscillations in asynchronous logic • Well-known result in asynchronous circuit theory ❖ Signal transitions: look at the ith occurrence of transition t ❖ time ( t, i ) = the actual time of this event ❖ time 0(t, i)=f(t)+p⇤ i ⇥ • Then: time(t, i) time0(t, i) <B ❖ Bounded,| independent− of i, and t | ❖ If the circuit is strongly connected, this result holds • More recently: stronger periodicity under more general conditions J. Magott. Performance evaluation of concurrent S.M. Burns, A.J. Martin. Performance Analysis and systems using petri nets. IPL (1984) Optimization of Asynchronous Circuits. Proc. ARVLSI (1990) W. Hua and R. Manohar. Exact timing analysis for concurrent systems. To appear, work-in-progress session, DAC (2016) Asynchronous logic in TrueNorth • Spikes trigger activity… ❖ … so they should be asynchronous Token Control Token • Neurons leak periodically… Scheduler ❖ … so add a periodic event to the system (the clock) At the external timing “tick”: SRAM • Update each neuron, delivering spikes plus any local state update Neuron Router • If the neuron spikes, send output spike to destinations, with specified delay Spikes travel through asynchronous network • Receiver buffers spikes until it is time to deliver them Digital, asynchronous logic Asynchronous logic in TrueNorth • Spike communication network ❖ Links are a shared resource ❖ Links might be contended • Spike delivery times can differ based on traffic ❖ Different runs of the same network might produce different spike patterns • Instead, we use a spike scheduler ❖ Guarantees deterministic computation ❖ … and hence, repeatability Summary • Asynchronous logic has a long history • Some key properties exploited in projects ❖ Gap between average and maximum delay for a computation ❖ Automatic data-driven power management ❖ Elastic pipelining ❖ Robustness to timing uncertainty/variation ❖ Reduced EMI and noise ❖ Continuous-time information representation Acknowledgments • The “async” community • Students: ❖ Ned Bingham, Wenmian Hua, Sean Ogden, Praful Purohit, Tayyar Rzayev, Nitish Srivastava ❖ John Teifel, Clinton Kelly, Virantha Ekanayake, Song Peng, David Biermann, David Fang, Chris LaFrieda, Filipp Akopyan, Basit Riaz Sheikh, Benjamin Tang, Nabil Imam, Sandra Jackson, Carlos Otero, Benjamin Hill, Stephen Longfield, Jonathan Tse, Robert Karmazin • Collaborators: ❖ General: David Albonesi, Sunil Bhave, Francois Guimbretiere, Amit Lal, Yoram Moses, Ivan Sutherland, Yannis Tsividis ❖ Neuromorphic: John Arthur, Kwabena Boahen, Tobi Delbruck, Chris Eliasmith, Giacomo Indiveri, Paul Merolla, Dharmendra Modha • Support: AFRL, DARPA, IARPA, NSF, ONR, IBM, Lockheed, Qualcomm, Samsung, SRC .
Recommended publications
  • TFT Asynchronous Microprocessor on Plastic
    7.2 / M. Miyasaka Current Status of Flexible Microelectronics; TFT Asynchronous Microprocessor on Plastic Mitsutoshi MIYASAKA, Nobuo KARAKI 1) and Tatsuya SHIMODA 2) Seiko Epson Corporation Flexible Microelectronics Project 1) Technology Platform Research Center 2) Corporate R&D Tel:81-266-61-1211, Fax:81-266-61-0674, E-mail: [email protected] Thin film transistors (TFTs) are field-effect accordance with the scaling rule for nearly half a transistors that can be used to create large-scale century. Though we hope for further advancement, the integrated (LSI) circuits. The combination of Suftla technology seems to be reaching its limits. technology and high-performance TFTs has the Theoretically the silicon band-gap limits the lowest potential to foster the rise of a new flexible voltage to approximately 0.5 V. In addition, there microelectronics industry. This paper reviews the appear to be serious technical difficulties in achieving history of Suftla technology and discusses the current finer patterning and thinner gate-dielectric. From a status of flexible microelectronics, a TFT business perspective, the advantages gained by further asynchronous microprocessor. We are sure that there scaling are spoiled by the high research, development is a best fit between the advantages of the and fabrication costs. Taking these facts into account, asynchronous circuit and the electrical properties we propose improving semiconductor devices this TFTs possess, because the asynchronous circuit century by eradicating the notion that LSI devices overcomes the deviation of transistor characteristics, must be made on silicon wafers. is free from the signal delay problem and does not Suftla (surface-free technology by laser generate a large amount of heat.
    [Show full text]
  • One Shots and Alternatives in Synchronous Digital System Design
    Lawrence Berkeley National Laboratory Lawrence Berkeley National Laboratory Title ONE SHOTS AND ALTERNATIVES IN SYNCHRONOUS DIGITAL SYSTEM DESIGN Permalink https://escholarship.org/uc/item/7tq8t8hn Author Hui, S. Publication Date 1979 Peer reviewed eScholarship.org Powered by the California Digital Library University of California LBL-8722 UC-37 7"> ONE SHOTS AND ALTERNATIVES IN SYNCHRONOUS DIGITAL SYSTEM DESIGN Steve Hui and John B. Meng January 1979 Prepared for the U. S. Department of Enemy under Contract W-7405-ENG-48 UtitAUigim LBL 8722 ONE SHOTS AND ALTERNATIVES IN SYNCHRONOUS DIGITAL SYSTEM DESIGN Steve Hui 8 John B. Meng January 1979 Prepared for the U.S. Department of Energy under Contract W-7405-ENG-48 NOTICE This re poll was pit pa ted is an account or work sponsored by the United States Government. Neither the United Sulci ooi the Umled Slates Depigment of Energy, nor any of their employees, nor any of their contractor*, lube on Ira clou, oi the If employees, nukes any warranty, express or implied, or auumei any legal liability •( response ill ly for (he accuracy, completene or usefulness of my information, anptiatui, product < process disclosed, c; represents thai ill me would no) infringe privately owned right*. !>TV»U;3 LBL 8722 (i) ONE SHOTS AND ALTERNATIVES IN SYNCHRONOUS DIGITAL SYSTEM DESIGN Steve Hui & John D. Meng The one shot or monostable multivibrator is quite often regarded as a "black sheep" in ditiqai integrated circuits. Its distinctions and versatility are well known and do not necessitate mentioning. Some of the potential problems with this black sheep are summarized as follows: 1.
    [Show full text]
  • An Asynchronous Circuit Design Language System
    Scholars' Mine Doctoral Dissertations Student Theses and Dissertations 1972 An asynchronous circuit design language system Gregory Martin Bednar Follow this and additional works at: https://scholarsmine.mst.edu/doctoral_dissertations Part of the Electrical and Computer Engineering Commons Department: Electrical and Computer Engineering Recommended Citation Bednar, Gregory Martin, "An asynchronous circuit design language system" (1972). Doctoral Dissertations. 194. https://scholarsmine.mst.edu/doctoral_dissertations/194 This thesis is brought to you by Scholars' Mine, a service of the Missouri S&T Library and Learning Resources. This work is protected by U. S. Copyright Law. Unauthorized use including reproduction for redistribution requires the permission of the copyright holder. For more information, please contact [email protected]. AN ASYNCHRONOUS CIRCUIT DESIGN. LANGUAGE SYSTEM by GREGORY MARTIN BEDNAR, 1944- A DISSERTATION Presented to the Faculty of the Graduate School of the UNIVERSITY OF MISSOURI-ROLLA In Partial Fulfillment of the Requirements for the Degree DOCTOR OF PHILOSOPHY T2781 in 157 pages ELECTRICAL ENGINEERING c. I 1972 23?261 ii ABSTRACT This paper presents a system for specifying the behavior of asynchronous sequential circuits. The system consists of a special purpose Asynchronous Circuit Design Language (ACDL), a translator and a flow table generation algorithm. The language includes many special features which permit quick and precise specification of terminal behavior. It is best suited for problems originating from a word description of the circuit's operation. The translator is written with the XPL Translator Writing System and is a syntax-directed compilation method. From the translated ACDL specifications, the flow table algorithm generates a primitive flow table which is the required input for the conventional synthesis procedures of asynchronous sequential circuits.
    [Show full text]
  • Test and Testability of Asynchronous Circuits
    Test and Testability of Asynchronous Circuits Nastaran Nemati A thesis submitted for the degree of Doctor of Philosophy University of New South Wales June 2017 To Mum and Dad ORIGINALITY STATEMENT ‘I hereby declare that this submission is my own work and to the best of my knowl- edge it contains no materials previously published or written by another person, or substantial proportions of material which have been accepted for the award of any other degree or diploma at UNSW or any other educational institution, ex- cept where due acknowledgement is made in the thesis. Any contribution made to the research by others, with whom I have worked at UNSW or elsewhere, is explicitly acknowledged in the thesis. I also declare that the intellectual content of this thesis is the product of my own work, except to the extent that assistance from others in the project’s design and conception or in style, presentation and linguistic expression is acknowledged.’ Signed ................................................ Date .................................................... COPYRIGHT STATEMENT ‘I hereby grant the University of New South Wales or its agents the right to archive and to make available my thesis or dissertation in whole or part in the University libraries in all forms of media, now or here after known, subject to the provisions of the Copyright Act 1968. I retain all proprietary rights, such as patent rights. I also retain the right to use in future works (such as articles or books) all or part of this thesis or dissertation. I also authorise University Microfilms to use the 350 word abstract of my thesis in Dissertation Abstract International (this is applicable to doctoral theses only).
    [Show full text]
  • Impact of C-Elements in Asynchronous Circuits
    Impact of C-Elements in Asynchronous Circuits Matheus Moreira, Bruno Oliveira, Fernando Moraes, Ney Calazans Faculdade de Informática Pontifícia Universidade Católica do Rio Grande do Sul Porto Alegre, Brazil {matheus.moreira, bruno.scherer}@acad.pucrs.br {fernando.moraes, ney.calazans}@pucrs.br Abstract single cell level and the application, or core, level. For the Asynchronous circuits are a potential solution to address former scope, delay and power consumption were measured some of the obstacles in deep submicron (DSM) design. One through electrical simulations at the standard cell level after of the most frequently used devices to build asynchronous electrical extraction of the physical layout. For the latter circuits is the C-element, a device present as a basic building scope, the different C-elements were used to build block in several asynchronous design styles. This work asynchronous implementations of an oscillator ring and an measures the impact of three different C-element types. The RSA cryptographic core. A comparison of performance and paper compares the use of each implementation to build a real required silicon area of the case studies made it possible to case asynchronous circuit, an RSA cryptographic core, and scrutinize the systemic effect of using different C-element reports results of precise electrical simulations of each C- implementations when designing asynchronous circuits in element. Findings in this work show that previous results in DSM technologies. The rest of the paper is organized in seven sections. the literature about C-element implementation types must be Section II describes basic concepts on asynchronous circuits re-evaluated when using C-elements in DSM technologies.
    [Show full text]
  • Asynchronous Control Circuit Design and Hazard Generation: Inertial Delay and Pure Delay Models
    oF a9'3-A *l* It Asynchronous Control Circuit Design and Hazard Generation: Inertial Delay and Pure Delay Models by Nozar Tabrizi B.S.E.E (Sharif University of Technology) 1980 M.S.E.E (Sharif University of Technology) 1988 A thesis submitted for the degree of Doctor of Philosophy in the Centre for High Performance Integrated Technologies and Systems (CHiPTec) Department of Electrical and Electronic Engineering The University of Adelaide June 1997 Table of Contents 1 Motivation for Asynchronous Circuits I 1.1 Introduction. .. 1 1.1.1 Clock skew ,.2 1.I.2 Power consumption............. ..5 Ll.3 Variable computation time.. ..6 1.1.4 Modularity and upgradiblity ..8 t.2 Organization ............... 10 2 Delay constraints and Design Techniques of Asynchronous Control Circuits ..................... L3 2.t Introduction t3 2.1.1 Huffman classical method..... 14 2.1.2 Speed independent circuits 14 2.1.3 Delay insensitive circuits 15 2.2 State based techniques .16 2.2.1 Classical Huffman method............ .16 2.2.2 One-hot coding..... t9 2.2.3 Timing requirements in the Huffman methodology.............. ..... 22 2.2.4 Friedman and Menon's methods to design multiple input change asynchronous circuits. 23 2.3 Burst mode or self clocked circuits ....................26 2.3.1 Burst mode circuits using controlled excitation and edge triggered flip-flops..... .26 2.3.2 Locally clocked asynchronous state machines 29 2.3.3 Q-Modules 32 2.3.4 3D Asynchronous circuits...... 35 2.4 Muller's speed independent circuit theory 36 2.4.1 Introduction... 36 2.4.2 Two restricted types of speed independent circuits .39 2.4.3 A flow table based speed independent circuit realization..
    [Show full text]
  • An Open-Source Design Flow for Asynchronous Circuits
    An Open-Source Design Flow for Asynchronous Circuits Rajit Manohar Computer Systems Lab Yale University New Haven, CT 06520 Email: [email protected] Abstract—There have been a number of small-scale and the clocked paradigm a poorer and poorer abstraction for chip large-scale technology demonstrations of asynchronous circuits, design. Modern application-specific integrated chips (ASICs) showing that they have benefits in performance and power- are designed as a collection of small clocked “islands” that efficiency in a variety of application domains. Most recently, asyn- chronous circuits were used in the TrueNorth neuromorphic chip communicate via interfaces that break the clocking abstraction. to achieve unprecedented energy-efficiency for neuromorphic We are developing a collection of electronic design automa- systems. However, these circuits cannot be easily adopted, because tion (EDA) tools that isolate the designer from the details commercially available design tools do not support asynchronous of the physical implementation technology, especially when logic. As part of the DARPA ERI effort, we are addressing this it comes to delays and timing uncertainty.1 The approach is challenge by developing a set of open-source design tools for asynchronous circuits. based on an asynchronous, modular and hierarchical design methodology for complex chips,and it permits component re- Keywords—asynchronous circuits; open-source EDA tools use from one technology to another with little or no modifi- cation. While individual (small) modules of the chip could be I. INTRODUCTION clocked, the overall system uses an asynchronous integration Scalable computer systems are designed as a collection of approach to achieve modular composition. modular components that communicate through well-defined II.
    [Show full text]
  • Teaching Asynchronous Digital Design in the Undergraduate Computer Engineering Curriculum
    Teaching Asynchronous Digital Design in the Undergraduate Computer Engineering Curriculum Scott C. Smith1, Senior Member, IEEE, and Waleed K. Al-Assadi2, Senior Member, IEEE University of Missouri – Rolla, Department of Electrical and Computer Engineering Emerson Electric Co. Hall, 1870 Miner Circle, Rolla, MO 65409 1Phone: (573) 341-4232, 1E-mail: [email protected], 2Phone: (573) 341-4836, 2E-mail: [email protected] Abstract—As demand continues for circuits with higher Computer Engineering courses; Section IV depicts the performance, higher complexity, and decreased feature size, original VHDL and VLSI course outlines and shows how asynchronous (clockless) paradigms will become more widely these courses have been augmented to include the used in the semiconductor industry, as evidenced by the asynchronous materials; and Section V presents the outcomes International Technology Roadmap for Semiconductors’ (ITRS) prediction of a likely shift from synchronous to asynchronous of the first offerings of the VHDL and VLSI courses with the design styles in order to increase circuit robustness, decrease asynchronous materials included, and provides conclusions power, and alleviate many clock-related issues. ITRS predicts and directions for future work. that asynchronous circuits will account for 19% of chip area within the next 5 years, and 30% of chip area within the next 10 years. To meet this growing industry need, students in Computer II. OVERVIEW OF ASYNCHRONOUS PARADIGMS Engineering should be introduced to asynchronous circuit design to make them more marketable and more prepared for the Asynchronous circuits can be grouped into two main challenges faced by the digital design community for years to categories: bounded-delay and delay-insensitive models.
    [Show full text]
  • Asynchronous Sequential Circuits
    Chapter 22 Asynchronous Sequential Circuits Asynchronous sequential circuits have state that is not synchronized with a clock. Like the synchronous sequential circuits we have studied up to this point they are realized by adding state feedback to combinational logic that imple- ments a next-state function. Unlike synchronous circuits, the state variables of an asynchronous sequential circuit may change at any point in time. This asynchronous state update – from next state to current state – complicates the design process. We must be concerned with hazards in the next state function, as a momentary glitch may result in an incorrect final state. We must also be concerned with races between state variables on transitions between states whose encodings differ in more than one variable. In this chapter we look at the fundamentals of asynchronous sequential cir- cuits. We start by showing how to analyze combinational logic with feedback by drawing a flow table. The flow table shows us which states are stable, which are transient, and which are oscillatory. We then show how to synthesize an asynchronous circuit from a specification by first writing a flow table and then reducing the flow table to logic equations. We see that state assignment is quite critical for asynchronous sequential machines as it determines when a potential race may occur. We show that some races can be eliminated by introducing transient states. After the introduction of this chapter, we continue our discussion of asyn- chronous circuits in Chapter 23 by looking at latches and flip-flops as examples of asynchronous circuits. 22.1 Flow Table Analysis Recall from Section 14.1 that an asynchronous sequential circuit is formed when a feedback path is placed around combinational logic as shown in Figure 22.1(a).
    [Show full text]
  • Hybrid Synchronous / Asynchronous Design
    HYBRID SYNCHRONOUS / ASYNCHRONOUS DESIGN A Dissertation Presented to the Faculty of the Graduate School of Cornell University in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy by Filipp A Akopyan May 2011 c 2011 Filipp A Akopyan ALL RIGHTS RESERVED HYBRID SYNCHRONOUS / ASYNCHRONOUS DESIGN Filipp A Akopyan, Ph.D. Cornell University 2011 In this new era of high-speed and low-power VLSI circuits, the question of which circuit family is best for a given application has become much more relevant. From a designer's point of view, process technology scaling continues to reveal undesirable device behavior. Thus, a designer has to make decisions not only at the micro-architectural scale, but also at a lower, circuit-level scale. However, common circuit families are not sufficient to solve modern engineering problems in many cases. Our goal is to provide resources that will allow designers to select the circuit family that yields the best results in terms of power, area, and performance metrics for each application, with minimal human input. Using our techniques, this choice can be made in a timely manner without in-depth knowledge of each circuit family. We propose an improved hybrid synchronous / asynchronous toolflow that offers significant reduction in the design cycle time and we advise on how our work can be extended to various types of circuit families for any given technology node. We describe tools that we have developed to allow designers to implement their projects using both synchronous and asynchronous circuit families. We also present the cosimulation environment that we have developed to allow designers to run complex digital and analog simulations of various circuits at different levels of abstraction with minimal setup effort.
    [Show full text]
  • Minimal Energy Asynchronous Dynamic Adders
    Accepted for publication (shorter version), IEEE Trans. On VLSI, 2006 Minimal Energy Asynchronous Dynamic Adders Ilya Obridko and Ran Ginosar VLSI Systems Research Center, Technion—Israel Institute of Technology, Haifa 32000, Israel [[email protected], [email protected]] Abstract In battery-operated portable or implantable digital devices, where battery life needs to be maximized, it is necessary to minimize not only power consumption but also energy dissipation. Typical energy optimization measures include voltage reduction and operating at the slowest possible speed. We employ additional methods, including hybrid asynchronous dynamic design to enable operating over a wide range of battery voltage, aggregating large combinational logic blocks, and transistor sizing and reordering. We demonstrate the methods on simple adders, and discuss extension to other circuits. Three novel adders are proposed and analyzed: A two-bit PTL adder and two dynamic two-bit adders. Circuit simulations on a 0.18μm process at low voltage show that leakage energy is below 1% and that short-circuit energy depends on circuit topology and can be as high as 50% of total energy when operating at low voltage and low fanout. The proposed adders achieve up to 40% energy savings relative to previously published results, while also operating faster. 1 Introduction Certain digital battery-operated portable or implanted systems require maximum battery life. To achieve that goal, the designer needs to minimize energy dissipation (rather than optimizing for low power). One useful means is to allow operating at the slowest possible computational speed, and this is possible in devices such as hearing aids in which the computational requirements are limited.
    [Show full text]
  • United States Patent (19) 11 Patent Number: 6,058,439 Devereux (45) Date of Patent: May 2, 2000
    US006058439A United States Patent (19) 11 Patent Number: 6,058,439 Devereux (45) Date of Patent: May 2, 2000 54 ASYNCHRONOUS FIRST-IN-FIRST-OUT 5,469,398 11/1995 Scott et al. .............................. 365/221 BUFFER CIRCUIT BURST MODE CONTROL 5,471,638 11/1995 Keeley .................................... 395/800 5,499.384 3/1996 Lentz et al. ............................. 395/821 75 Inventor: Ian Victor Devereux, Cherry Hinton, S. lyg By s al - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 33. 2Y- - -2 IIl Clal. United Kingdom 5,745,732 4/1998 Cherukuri et al. ...................... 395/495 5,761,533 6/1998 Alderguia et al. .. ... 395/84 73 ASSignee: "inited, Cambridge, United 5,860,119 1/1999 DockSer .................................. 711/156 21 Appl. No.: 08/828,501 EC.SSIS EXfile- I. &WO Sheikh 22 Filed: Mar. 31, 1997 Attorney, Agent, or Firm Nixon & Vanderhye P.C. (51) Int. Cl. ................................................ G06F 13/00 57 ABSTRACT 52 U.S. Cl. .................. 710/52; 710/35; 710/53 58 Field of Search ..................................... 395/821, 824, A data processing system comprising a first circuit block 6 395/840, 841, 845, 855, 872, 873, 877; and a Second circuit block 8 linked via an asynchronous 710f 4 20 21 25 35 52 53 57 first-in-first-out buffer circuit 12 is provided with a burst s is a Yu’s a u-s a as - as a as a -1s marker that identifies the first word in a burst transfer or an 56) References Cited empty Stage. The Second circuit block 8 uses the burst marker to identify the last data word in a burst as being that U.S.
    [Show full text]