Survey on High Performance Reconfigurable Soft - Core Processor for SIMD Applications

Total Page:16

File Type:pdf, Size:1020Kb

Load more

PROTEUS JOURNAL ISSN/eISSN: 0889-6348 Survey on High Performance Reconfigurable Soft - Core Processor for SIMD Applications Maheswari. R Pattabiraman. V Kanagaraj Venusamy Associate Professor Professor Lecturer SCOPE, Vellore Institute of SCOPE, Vellore Institute of University of Technology and Technology, Chennai, Tamilnadu, Technology, Chennai, Tamilnadu, Applied Sciences - AlMusanaah, India. India. Muscat, Sultanate of Oman [email protected] [email protected] [email protected] ABSTRACT—The prospective need of SIMD (Single requirement in most of the modern embedded systems. Instruction and Multiple Data) applications like video and The modern processor comes in a wide spectrum of image processing systems requires a processor with greater architecture ranging from hard-core to soft-core processor flexibility and computation to deliver high quality real time with CISC (Complex Instruction Set) and RISC (Reduced output. The main goal of work is to offer a wider survey over high performance processor through various reconfigurable Instruction Set) architecture. The general purpose hard- techniques targeting on real time SIMD dataset. The real- core processor like Intel, Motorola 68000, etc., falls on its time multimedia streaming with extensive parallel data one side and reconfigurable FPGA based soft-core demands high computational processors with low power processor like MicroBlaze, SPARC, etc., falls on the other consumption. The modern processors with flexible computation supports on-the-fly system redesign with side of the spectrum. Each core contains various ranges of reconfiguration techniques at reduced cost. It also functional sub-blocks implemented either in FPGA (Field- accommodates new functionalities sustaining the increasing Programmable Gate Array) or in an ASIC (Application demand for processor with less NREs (Non-Recurring Specific Integrated Circuits). The major advantage of Engineering) cost and shorter time-to-market. The processors synthesizing the processor in FPGA rather than ASIC is with flexible platform permit the designer to incorporate the demanding changes in the existing standards of any that it provides re-configurability when altering the design application like wireless communication, telecommunication with no significant cost [1]. etc. Adaptive computing is the new processor paradigm evolved in early 90s to bridge the space which exists between A.MOTIVATION the generic processor and application specific processor. It In this embedded era, 90% of applications (mobile phones, also supports the inclusion of specific hardware accelerator in the existing RISC (Reduced Instruction Set Computer) based video surveillance systems, medical equipment like architecture to improve the performance of particular medical imaging systems, patient monitoring systems) application. In the last decade, processor based embedded target high performance computing at low power systems like SoC (System–on–Chip) have become more consumption. To accomplish this multimedia data prevalent due to their high performance at low power. Many embedded application designers like mobile and smart phone processing, there is a huge demand for high performance developers namely Philips N-Experia, Intel PXA etc., and low power processor. The two major solutions to explored SoC as computational devices. Thus this work improve the performance of the processor are through summarizes a literature survey of all the related works such hardware and software. Hardware solutions always ensure as RC (Reconfigurable Computing) , HPRC (High high performance and low power than software solutions, Performance Reconfigurable Computing), FPGA based processors, open source soft-core processor, reconfigurable but with very less flexibility to reconfigurability. Here architecture, parallel computing and SIMD. The detailed comes the need for processor architecture, literature survey analyzing the impact of current techniques targeting processor performance is illustrated. Which has to balance between the power as well as performance of multimedia dataset. There exists a KEYWORDS—Reconfigurable Computing, High Performance significant performance gap between commercial SCP like Computing, FPGA, Soft-core Processor MicroBlaze (Xilinx), Nios (Altera) and open source SCP like OR1200, SPARC. As an example MicroBlaze and I . INTRODUCTION Nios have 31-37% lower run times than OR1200 (Gartner Modern embedded applications are ironic in Dataquest). To fill this gap, there is a need for multimedia data involving audio and video processing. reconfigurable high performance soft-core processor This real time streaming with extensive parallel data architecture. To support DLP and ILP in multimedia demands for high computational devices with low power applications, many techniques such as SIMD, SMP consumption. These high performance processors work as (Symmetric Multi-Processor), VLIW (Very Long the driving tool exploring the resource and design relevant 1 VOLUME 11 ISSUE 9 2020 http://www.proteusresearch.org/ Page No: 713 PROTEUS JOURNAL ISSN/eISSN: 0889-6348 Instruction Word), super-scalar instruction sets are being utilized in the processor. Out of these techniques, this work focused on the wider study on SIMD instruction set along with super-scalar architecture as it provides greater flexibility in the reconfigurable architecture with improved performance. B.Processor Processor architecture works as fundamental component Fig.1. when exploring the resource and design relevant Impact of embedded systems in various application domains requirement in most of the modern embedded systems. As a result, most research investigations aim to achieve good D.FPGA improvement in processor performance at reasonably low An FPGA is a device with large collection of power dissipation. The hard-core processors provide high programmable logic cells interconnected through a performance for real-time embedded applications, but they network. The high density interconnections across the are characterized by non-reconfigurability feature. Fig.1 logic cells provides higher flexibility to implement any represents the trade-off between flexibility and computational model at low power consumption [2]. Since performance of processors based on its implementation. It these logic cells are independent blocks, exploiting is implicit that General Purpose Processor (GPP) proves to temporal and spatial parallelism is very efficient, i.e. it provide more flexibility adapting itself to configure in allows many processing elements to execute concurrently. many fields, but with degradation in performance. An They are typically described by register transfer level Application Specific Processor gives better performance (RTL) abstractions using an HDL (Hardware Description than GPP with optimized instruction set. Whereas ASIC Language). The building blocks of any FPGA are CLB gives higher performance than other implementation, (Configurable Logic Block), input and output blocks and altering its feature after fabrication is not feasible one. interconnection network. Through direct interconnect, all Thus reconfigurable processor lies in between the GPP and the adjacent CLBs are interconnected whereas through ASIC provides greater flexibility with better performance. programmable switching matrix (PSM), the horizontal and Almost all digital circuits are classically partitioned into vertical CLBs are intercomnected. various functional modules called as cores. Each core contains various ranges of functional sub-blocks implemented either in FPGA or in an ASIC. C .EMBEDDED SYSTEMS Embedded system is a computational system with dedicated function to perform specific job. It integrates three components into functional units such as: • Hardware specific for the Application• Embedded software • RTOS (Real Time Operating Systems). Most of the functionality of embedded systems will be implemented using software. Unlike GPP (General Purpose Processor), embedded systems demand variety of hardware design to quench the Figure 2: FPGA functional view need of various applications. The major challenge in The internal architecture of FPGA is shown in Fig. 2. Each designing an embedded system is that it needs to work and slice of FPGA contains flip-flops, multiplexers and sustain in a diverse environment to handle various types of collections of digital logic gates called LUTs (Look-Up tasks and applications. Thus, the embedded designer needs Table). The LUTs are alsocalled functional generator of to address various issues such as high-computation, low FPGA. These FPGAs are reprogrammed dynamically power, reliability, flexibility, predictability and memory during compile time or execution time using virtual management. The occupancy of embedded systems in hardware. The CLB functional view of FPGA is shown in various fields is given in Fig 1. Fig.3. They are typically described by register transfer level (RTL) abstractions using HDL. 2 VOLUME 11 ISSUE 9 2020 http://www.proteusresearch.org/ Page No: 714 PROTEUS JOURNAL ISSN/eISSN: 0889-6348 Fig.3. CLB functional view The FPGA 7 allows the user to program the hardware even electronics, video and image processing, etc. were drifting during post-fabrication. In reality, FPGA exploits dynamic to the FPGA. reconfigurable fabric design methods to improve Fig.4.FPGA with Embedded Processor and without Embedded Processor performance of the targeted architecture (Wang et al. Survey (2009)). The major benefit of synthesizing the design in D.Embedded Processors FPGA is that
Recommended publications
  • Latticemico32 Development Kit User's Guide for Latticeecp

    Latticemico32 Development Kit User's Guide for Latticeecp

    LatticeMico32 Development Kit User’s Guide for LatticeECP Lattice Semiconductor Corporation 5555 NE Moore Court Hillsboro, OR 97124 (503) 268-8000 May 2007 Copyright Copyright © 2007 Lattice Semiconductor Corporation. This document may not, in whole or part, be copied, photocopied, reproduced, translated, or reduced to any electronic medium or machine- readable form without prior written consent from Lattice Semiconductor Corporation. Trademarks Lattice Semiconductor Corporation, L Lattice Semiconductor Corporation (logo), L (stylized), L (design), Lattice (design), LSC, E2CMOS, Extreme Performance, FlashBAK, flexiFlash, flexiMAC, flexiPCS, FreedomChip, GAL, GDX, Generic Array Logic, HDL Explorer, IPexpress, ISP, ispATE, ispClock, ispDOWNLOAD, ispGAL, ispGDS, ispGDX, ispGDXV, ispGDX2, ispGENERATOR, ispJTAG, ispLEVER, ispLeverCORE, ispLSI, ispMACH, ispPAC, ispTRACY, ispTURBO, ispVIRTUAL MACHINE, ispVM, ispXP, ispXPGA, ispXPLD, LatticeEC, LatticeECP, LatticeECP-DSP, LatticeECP2, LatticeECP2M, LatticeMico8, LatticeMico32, LatticeSC, LatticeSCM, LatticeXP, LatticeXP2, MACH, MachXO, MACO, ORCA, PAC, PAC-Designer, PAL, Performance Analyst, PURESPEED, Reveal, Silicon Forest, Speedlocked, Speed Locking, SuperBIG, SuperCOOL, SuperFAST, SuperWIDE, sysCLOCK, sysCONFIG, sysDSP, sysHSI, sysI/O, sysMEM, The Simple Machine for Complex Design, TransFR, UltraMOS, and specific product designations are either registered trademarks or trademarks of Lattice Semiconductor Corporation or its subsidiaries in the United States and/or other countries. ISP, Bringing the Best Together, and More of the Best are service marks of Lattice Semiconductor Corporation. Other product names used in this publication are for identification purposes only and may be trademarks of their respective companies. Disclaimers NO WARRANTIES: THE INFORMATION PROVIDED IN THIS DOCUMENT IS “AS IS” WITHOUT ANY EXPRESS OR IMPLIED WARRANTY OF ANY KIND INCLUDING WARRANTIES OF ACCURACY, COMPLETENESS, MERCHANTABILITY, NONINFRINGEMENT OF INTELLECTUAL PROPERTY, OR FITNESS FOR ANY PARTICULAR PURPOSE.
  • Latticemico32 Software Developer User Guide

    Latticemico32 Software Developer User Guide

    LatticeMico32 Software Developer User Guide May 2014 Copyright Copyright © 2014 Lattice Semiconductor Corporation. This document may not, in whole or part, be copied, photocopied, reproduced, translated, or reduced to any electronic medium or machine-readable form without prior written consent from Lattice Semiconductor Corporation. Trademarks Lattice Semiconductor Corporation, L Lattice Semiconductor Corporation (logo), L (stylized), L (design), Lattice (design), LSC, CleanClock, Custom Mobile Device, DiePlus, E2CMOS, ECP5, Extreme Performance, FlashBAK, FlexiClock, flexiFLASH, flexiMAC, flexiPCS, FreedomChip, GAL, GDX, Generic Array Logic, HDL Explorer, iCE Dice, iCE40, iCE65, iCEblink, iCEcable, iCEchip, iCEcube, iCEcube2, iCEman, iCEprog, iCEsab, iCEsocket, IPexpress, ISP, ispATE, ispClock, ispDOWNLOAD, ispGAL, ispGDS, ispGDX, ispGDX2, ispGDXV, ispGENERATOR, ispJTAG, ispLEVER, ispLeverCORE, ispLSI, ispMACH, ispPAC, ispTRACY, ispTURBO, ispVIRTUAL MACHINE, ispVM, ispXP, ispXPGA, ispXPLD, Lattice Diamond, LatticeCORE, LatticeEC, LatticeECP, LatticeECP-DSP, LatticeECP2, LatticeECP2M, LatticeECP3, LatticeECP4, LatticeMico, LatticeMico8, LatticeMico32, LatticeSC, LatticeSCM, LatticeXP, LatticeXP2, MACH, MachXO, MachXO2, MachXO3, MACO, mobileFPGA, ORCA, PAC, PAC-Designer, PAL, Performance Analyst, Platform Manager, ProcessorPM, PURESPEED, Reveal, SensorExtender, SiliconBlue, Silicon Forest, Speedlocked, Speed Locking, SuperBIG, SuperCOOL, SuperFAST, SuperWIDE, sysCLOCK, sysCONFIG, sysDSP, sysHSI, sysI/O, sysMEM, The Simple Machine for Complex
  • Openpiton: an Open Source Manycore Research Framework

    Openpiton: an Open Source Manycore Research Framework

    OpenPiton: An Open Source Manycore Research Framework Jonathan Balkind Michael McKeown Yaosheng Fu Tri Nguyen Yanqi Zhou Alexey Lavrov Mohammad Shahrad Adi Fuchs Samuel Payne ∗ Xiaohua Liang Matthew Matl David Wentzlaff Princeton University fjbalkind,mmckeown,yfu,trin,yanqiz,alavrov,mshahrad,[email protected], [email protected], fxiaohua,mmatl,[email protected] Abstract chipset Industry is building larger, more complex, manycore proces- sors on the back of strong institutional knowledge, but aca- demic projects face difficulties in replicating that scale. To Tile alleviate these difficulties and to develop and share knowl- edge, the community needs open architecture frameworks for simulation, synthesis, and software exploration which Chip support extensibility, scalability, and configurability, along- side an established base of verification tools and supported software. In this paper we present OpenPiton, an open source framework for building scalable architecture research proto- types from 1 core to 500 million cores. OpenPiton is the world’s first open source, general-purpose, multithreaded manycore processor and framework. OpenPiton leverages the industry hardened OpenSPARC T1 core with modifica- Figure 1: OpenPiton Architecture. Multiple manycore chips tions and builds upon it with a scratch-built, scalable uncore are connected together with chipset logic and networks to creating a flexible, modern manycore design. In addition, build large scalable manycore systems. OpenPiton’s cache OpenPiton provides synthesis and backend scripts for ASIC coherence protocol extends off chip. and FPGA to enable other researchers to bring their designs to implementation. OpenPiton provides a complete verifica- tion infrastructure of over 8000 tests, is supported by mature software tools, runs full-stack multiuser Debian Linux, and has been widespread across the industry with manycore pro- is written in industry standard Verilog.
  • Network on Chip for FPGA Development of a Test System for Network on Chip

    Network on Chip for FPGA Development of a Test System for Network on Chip

    Network on Chip for FPGA Development of a test system for Network on Chip Magnus Krokum Namork Master of Science in Electronics Submission date: June 2011 Supervisor: Kjetil Svarstad, IET Norwegian University of Science and Technology Department of Electronics and Telecommunications Problem Description This assignment is a continuation of the project-assignment of fall 2010, where it was looked into the development of reactive modules for application-test and profiling of the Network on Chip realization. It will especially be focused on the further development of: • The programmability of the system by developing functionality for more advanced surveillance of the communication between modules and routers • Framework that will be used to test and profile entire applications on the Network on Chip The work will primarily be directed towards testing and running the system in a way that resembles a real system at run time. The work is to be compared with relevant research within similar work. Assignment given: January 2011 Supervisor: Kjetil Svarstad, IET i Abstract Testing and verification of digital systems is an essential part of product develop- ment. The Network on Chip (NoC), as a new paradigm within interconnections; has a specific need for testing. This is to determine how performance and prop- erties of the NoC are compared to the requirements of different systems such as processors or media applications. A NoC has been developed within the AHEAD project to form a basis for a reconfigurable platform used in the AHEAD system. This report gives an outline of the project to develop testing and benchmarking systems for a NoC.
  • FPGA Design Guide

    FPGA Design Guide

    FPGA Design Guide Lattice Semiconductor Corporation 5555 NE Moore Court Hillsboro, OR 97124 (503) 268-8000 September 16, 2008 Copyright Copyright © 2008 Lattice Semiconductor Corporation. This document may not, in whole or part, be copied, photocopied, reproduced, translated, or reduced to any electronic medium or machine- readable form without prior written consent from Lattice Semiconductor Corporation. Trademarks Lattice Semiconductor Corporation, L Lattice Semiconductor Corporation (logo), L (stylized), L (design), Lattice (design), LSC, E2CMOS, Extreme Performance, FlashBAK, flexiFlash, flexiMAC, flexiPCS, FreedomChip, GAL, GDX, Generic Array Logic, HDL Explorer, IPexpress, ISP, ispATE, ispClock, ispDOWNLOAD, ispGAL, ispGDS, ispGDX, ispGDXV, ispGDX2, ispGENERATOR, ispJTAG, ispLEVER, ispLeverCORE, ispLSI, ispMACH, ispPAC, ispTRACY, ispTURBO, ispVIRTUAL MACHINE, ispVM, ispXP, ispXPGA, ispXPLD, LatticeEC, LatticeECP, LatticeECP-DSP, LatticeECP2, LatticeECP2M, LatticeMico8, LatticeMico32, LatticeSC, LatticeSCM, LatticeXP, LatticeXP2, MACH, MachXO, MACO, ORCA, PAC, PAC-Designer, PAL, Performance Analyst, PURESPEED, Reveal, Silicon Forest, Speedlocked, Speed Locking, SuperBIG, SuperCOOL, SuperFAST, SuperWIDE, sysCLOCK, sysCONFIG, sysDSP, sysHSI, sysI/O, sysMEM, The Simple Machine for Complex Design, TransFR, UltraMOS, and specific product designations are either registered trademarks or trademarks of Lattice Semiconductor Corporation or its subsidiaries in the United States and/or other countries. ISP, Bringing the Best Together, and More
  • Embedded Networks on Chip for Field-Programmable Gate Arrays by Mohamed Saied Abdelfattah a Thesis Submitted in Conformity With

    Embedded Networks on Chip for Field-Programmable Gate Arrays by Mohamed Saied Abdelfattah a Thesis Submitted in Conformity With

    Embedded Networks on Chip for Field-Programmable Gate Arrays by Mohamed Saied Abdelfattah A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy Graduate Department of Electrical and Computer Engineering University of Toronto © Copyright 2016 by Mohamed Saied Abdelfattah Abstract Embedded Networks on Chip for Field-Programmable Gate Arrays Mohamed Saied Abdelfattah Doctor of Philosophy Graduate Department of Electrical and Computer Engineering University of Toronto 2016 Modern field-programmable gate arrays (FPGAs) have a large capacity and a myriad of embedded blocks for computation, memory and I/O interfacing. This allows the implementation of ever-larger applications; however, the increase in application size comes with an inevitable increase in complexity, making it a challenge to implement on-chip communication. Today, it is a designer's burden to create a customized communication circuit to interconnect an application, using the fine-grained FPGA fab- ric that has single-bit control over every wire segment and logic cell. Instead, we propose embedding a network-on-chip (NoC) to implement system-level communication on FPGAs. A prefabricated NoC improves communication efficiency, eases timing closure, and abstracts system-level communication on FPGAs, separating an application's behaviour and communication which makes the design of complex FPGA applications easier and faster. This thesis presents a complete embedded NoC solution, includ- ing the NoC architecture and interface, rules to guide its use with FPGA design styles, application case studies to showcase its advantages, and a computer-aided design (CAD) system to automatically interconnect applications using an embedded NoC. We compare NoC components when implemented hard versus soft, then build on this component-level analysis to architect embedded NoCs and integrate them into the FPGA fabric; these NoCs are on average 20{23× smaller and 5{6× faster than soft NoCs.
  • An Early Performance Evaluation of Many Integrated Core Architecture Based SGI Rackable Computing System

    An Early Performance Evaluation of Many Integrated Core Architecture Based SGI Rackable Computing System

    NAS Technical Report: NAS-2015-04 An Early Performance Evaluation of Many Integrated Core Architecture Based SGI Rackable Computing System Subhash Saini,1 Haoqiang Jin,1 Dennis Jespersen,1 Huiyu Feng,2 Jahed Djomehri,3 William Arasin,3 Robert Hood,3 Piyush Mehrotra,1 Rupak Biswas1 1NASA Ames Research Center 2SGI 3Computer Sciences Corporation Moffett Field, CA 94035-1000, USA Fremont, CA 94538, USA Moffett Field, CA 94035-1000, USA {subhash.saini, haoqiang.jin, [email protected] {jahed.djomehri, william.f.arasin, dennis.jespersen, piyush.mehrotra, robert.hood}@nasa.gov rupak.biswas}@nasa.gov ABSTRACT The GPGPU approach relies on streaming multiprocessors and Intel recently introduced the Xeon Phi coprocessor based on the uses a low-level programming model such as CUDA or a high- Many Integrated Core architecture featuring 60 cores with a peak level programming model like OpenACC to attain high performance of 1.0 Tflop/s. NASA has deployed a 128-node SGI performance [1-3]. Intel’s approach has a Phi serving as a Rackable system where each node has two Intel Xeon E2670 8- coprocessor to a traditional Intel processor host. The Phi has x86- core Sandy Bridge processors along with two Xeon Phi 5110P compatible cores with wide vector processing units and uses coprocessors. We have conducted an early performance standard parallel programming models such as MPI, OpenMP, evaluation of the Xeon Phi. We used microbenchmarks to hybrid (MPI + OpenMP), UPC, etc. [4-5]. measure the latency and bandwidth of memory and interconnect, Understanding performance of these heterogeneous computing I/O rates, and the performance of OpenMP directives and MPI systems has become very important as they have begun to appear functions.
  • A Comparison of Four Series of Cisco Network Processors

    A Comparison of Four Series of Cisco Network Processors

    Advanced Computational Intelligence: An International Journal (ACII), Vol.3, No.4, October 2016 A COMPARISON OF FOUR SERIES OF CISCO NETWORK PROCESSORS Sadaf Abaei Senejani 1, Hossein Karimi 2 and Javad Rahnama 3 1Department of Computer Engineering, Pooyesh University, Qom, Iran 2 Sama Technical and Vocational Training College, Islamic Azad University, Yasouj Branch, Yasouj, Iran 3 Department of Computer Science and Engineering, Sharif University of Technology, Tehran, Iran. ABSTRACT Network processors have created new opportunities by performing more complex calculations. Routers perform the most useful and difficult processing operations. In this paper the routers of VXR 7200, ISR 4451-X, SBC 7600, 7606 have been investigated which their main positive points include scalability, flexibility, providing integrated services, high security, supporting and updating with the lowest cost, and supporting standard protocols of network. In addition, in the current study these routers have been explored from hardware and processor capacity viewpoints separately. KEYWORDS Network processors, Network on Chip, parallel computing 1. INTRODUCTION The expansion of science and different application fields which require complex and time taking calculations and in consequence the demand for fastest and more powerful computers have made the need for better structures and new methods more significant. This need can be met through solutions called “paralleled massive computers” or another solution called “clusters of computers”. It should be mentioned that both solutions depend highly on intercommunication networks with high efficiency and appropriate operational capacity. Therefore, as a solution combining numerous cores on a single chip via using Network on chip technology for the simplification of communications can be done.
  • MYTHIC MULTIPLIES in a FLASH Analog In-Memory Computing Eliminates DRAM Read/Write Cycles

    MYTHIC MULTIPLIES in a FLASH Analog In-Memory Computing Eliminates DRAM Read/Write Cycles

    MYTHIC MULTIPLIES IN A FLASH Analog In-Memory Computing Eliminates DRAM Read/Write Cycles By Mike Demler (August 27, 2018) ................................................................................................................... Machine learning is a hot field that attracts high-tech The startup plans by the end of this year to tape out investors, causing the number of startups to explode. As its first product, which can store up to 50 million weights. these young companies strive to compete against much larg- At that time, it also aims to release the alpha version of its er established players, some are revisiting technologies the software tools and a performance profiler. The company veterans may have dismissed, such as analog computing. expects to begin volume shipments in 4Q19. Mythic is riding this wave, using embedded flash-memory technology to store neural-network weights as analog pa- Solving a Weighty Problem rameters—an approach that eliminates the power consumed Mythic uses Fujitsu’s 40nm embedded-flash cell, which it in moving data between the processor and DRAM. plans to integrate in an array along with digital-to-analog The company traces its origin to 2012 at the University converters (DACs) and analog-to-digital converters (ADCs), of Michigan, where CTO Dave Fick completed his doctoral as Figure 1 shows. By storing a range of analog voltages in degree and CEO Mike Henry spent two and a half years as the bit cells, the technique uses the cell’s voltage-variable a visiting scholar. The founders worked on a DoD-funded conductance to represent weights with 8-bit resolution. This project to develop machine-learning-powered surveillance drones, eventually leading to the creation of their startup.
  • Small Soft Core up Inventory ©2019 James Brakefield Opencore and Other Soft Core Processors Reverse-U16 A.T

    Small Soft Core up Inventory ©2019 James Brakefield Opencore and Other Soft Core Processors Reverse-U16 A.T

    tool pip _uP_all_soft opencores or style / data inst repor com LUTs blk F tool MIPS clks/ KIPS ven src #src fltg max max byte adr # start last secondary web status author FPGA top file chai e note worthy comments doc SOC date LUT? # inst # folder prmary link clone size size ter ents ALUT mults ram max ver /inst inst /LUT dor code files pt Hav'd dat inst adrs mod reg year revis link n len Small soft core uP Inventory ©2019 James Brakefield Opencore and other soft core processors reverse-u16 https://github.com/programmerby/ReVerSE-U16stable A.T. Z80 8 8 cylcone-4 James Brakefield11224 4 60 ## 14.7 0.33 4.0 X Y vhdl 29 zxpoly Y yes N N 64K 64K Y 2015 SOC project using T80, HDMI generatorretro Z80 based on T80 by Daniel Wallner copyblaze https://opencores.org/project,copyblazestable Abdallah ElIbrahimi picoBlaze 8 18 kintex-7-3 James Brakefieldmissing block622 ROM6 217 ## 14.7 0.33 2.0 57.5 IX vhdl 16 cp_copyblazeY asm N 256 2K Y 2011 2016 wishbone extras sap https://opencores.org/project,sapstable Ahmed Shahein accum 8 8 kintex-7-3 James Brakefieldno LUT RAM48 or block6 RAM 200 ## 14.7 0.10 4.0 104.2 X vhdl 15 mp_struct N 16 16 Y 5 2012 2017 https://shirishkoirala.blogspot.com/2017/01/sap-1simple-as-possible-1-computer.htmlSimple as Possible Computer from Malvinohttps://www.youtube.com/watch?v=prpyEFxZCMw & Brown "Digital computer electronics" blue https://opencores.org/project,bluestable Al Williams accum 16 16 spartan-3-5 James Brakefieldremoved clock1025 constraint4 63 ## 14.7 0.67 1.0 41.1 X verilog 16 topbox web N 4K 4K N 16 2 2009
  • Using Inspiration from Synaptic Plasticity Rules to Optimize Traffic Flow in Distributed Engineered Networks Arxiv:1611.06937V1

    Using Inspiration from Synaptic Plasticity Rules to Optimize Traffic Flow in Distributed Engineered Networks Arxiv:1611.06937V1

    1 Using inspiration from synaptic plasticity rules to optimize traffic flow in distributed engineered networks Jonathan Y. Suen1 and Saket Navlakha2 1Duke University, Department of Electrical and Computer Engineering. Durham, North Carolina, 27708 USA 2The Salk Institute for Biological Studies. Integrative Biology Laboratory. 10010 North Torrey Pines Rd., La Jolla, CA 92037 USA Keywords: synaptic plasticity, distributed networks, flow control arXiv:1611.06937v1 [cs.NE] 21 Nov 2016 Abstract Controlling the flow and routing of data is a fundamental problem in many dis- tributed networks, including transportation systems, integrated circuits, and the Internet. In the brain, synaptic plasticity rules have been discovered that regulate network activ- ity in response to environmental inputs, which enable circuits to be stable yet flexible. Here, we develop a new neuro-inspired model for network flow control that only de- pends on modifying edge weights in an activity-dependent manner. We show how two fundamental plasticity rules (long-term potentiation and long-term depression) can be cast as a distributed gradient descent algorithm for regulating traffic flow in engineered networks. We then characterize, both via simulation and analytically, how different forms of edge-weight update rules affect network routing efficiency and robustness. We find a close correspondence between certain classes of synaptic weight update rules de- rived experimentally in the brain and rules commonly used in engineering, suggesting common principles to both. 1. Introduction In many engineered networks, a payload needs to be transported between nodes without central control. These systems are often represented as weighted, directed graphs. Each edge has a fixed capacity, which represents the maximum amount of traffic the edge can carry at any time.
  • Mico32 White Paper 2008-02

    Mico32 White Paper 2008-02

    OPEN AND EASY MICROPROCESSOR DESIGNS USING THE LatticeMico32 A Lattice Semiconductor White Paper February 2008 Lattice Semiconductor 5555 Northeast Moore Ct. Hillsboro, Oregon 97124 USA Telephone: (503) 268-8000 www.latticesemi.com 1 Open and Easy Microprocessor Designs Using the LatticeMico32 A Lattice Semiconductor White Paper Embedded Microprocessor Trends and Challenges The last few years have witnessed a growing trend to use embedded microprocessors in FPGA designs. Figure 1 illustrates this trend. 120,000 Without Embedded µP 100,000 With Embedded µP 80,000 60,000 40,000 20,000 Number ofDesign FPGA/PLD Starts 0 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 Figure 1 – FPGA Design Starts With Embedded µµµP Source: Gartner, August 9, 2005 As illustrated in the graph, this trend is not showing any signs of slowing down. Three important benefits of the embedded microprocessor approach are driving this trend. The first is that a soft processor provides the preferred way to implement control-plane functionality in an FPGA while leaving the datapath functionality to the programmable hardware. Control plane functionality that can be implemented in software rather than hardware allows the designer much greater freedom to make changes. Also, many control plane functions are just too difficult to reasonably implement in hardware. The second benefit is that, with software-based processing, it is possible for the hardware logic to remain stable because functional upgrades can be made through software modification. The third benefit is that FPGA embedded soft processors will not become obsolete. One of the risks that any user faces when designing with an off-the-shelf microprocessor is obsolescence.