Embedded-DSP Superh Family and Its Applications

Total Page:16

File Type:pdf, Size:1020Kb

Embedded-DSP Superh Family and Its Applications Hitachi Review Vol. 47 (1998), No. 4 121 Embedded-DSP SuperH Family and Its Applications Toru Baji ABSTRACT: Recently sales have grown markedly in the field of mobile Hiroshi Takeyama communication terminals such as global system for mobile communications (GSM), personal digital cellular telecommunication system (PDC), personal Tetsuya Nakagawa handyphone system (PHS), and in digital consumer equipment such as car navigation systems and digital still cameras. These systems are configured with a CPU and a general-purpose digital signal processor (DSP). The CPU executes communication protocol control and system control; the general-purpose DSP performs signal processing such as speech compression and image processing. For mobile communication systems and digital consumer products for the near future, integration of the CPU and DSP is required to achieve a low-price, small-size, low-power consumption system. Hitachi, Ltd., has introduced a new-generation microcomputer SH- DSP into this market. The SH-DSP consists of a SuperH-RISC (reduced instruction set computer) microcomputer and a high-performance DSP engine. With the SH-DSP, Hitachi has enhanced its product line for a new wave of applications, such as digital still cameras, mobile communication terminals, and speech processing systems. INTRODUCTION applications are mobile communication terminals, THE SH-DSP incorporates a 32-bit RISC microcom- digital consumer equipment, and multimedia systems. puter, which is completely upward compatible with One example of digital consumer equipment is the the SH microcomputers (SH-1 and SH-2), and includes digital still camera. Hitachi has developed a digital high performance DSP functions. Examples of target camera system based on the SH-DSP (Fig. 1). SH-DSP HD6437410F Digital still camera system CPU+ DSP board Control AMP CODEC Shutter SH-DSP RS-232C PCMCIA 3.3,5 V slot IrDA PC FLASH cards ROM MP-VRAM DRAM (512 kbyte) × 4 (1 Mbyte) (16 Mbyte) Lens Camera board Y CDS Camera Y/C CCD ADC DAC AGC TG DSP mix C Monitor (TV) Video driver Sub controller CPU: central processing unit DSP: digital signal processor PCMCIA: Personal Computer Memory Card International Association IrDA: Infrared Data Association ROM: read only memory MP-VRAM: multiport video RAM CCD: charge coupled device CDS: correlated double sampling AGC: automatic gain control ADC: analog-to-digital converter TG: timing generator DAC: digital-to-analog converter Y: illuminance signal C: chrominance signal Fig. 1—SH-DSP Used in a Digital Still Camera System. SH-DSP is a compact reduced instruction set computer (RISC) microcomputer with an embedded high-performance DSP engine. Digital still cameras using this processor are also able to perform speech compression and decompression efficiently. Embedded-DSP SuperH Family and Its Applications 122 3-bus configuration SuperH RISC engine Parallel execution of 1 program Can generate three addresses for DSP fetch and 2 data accesses instructions or execute RISC insructions. Address buses DSP engine SHARED IAB General-purpose-register file, single-cycle external memories YAB 16 × 32-bit multiply-accumulate, and other DSP functions Directly connected XAB general-purpose to DRAM, EDO, register file RAM, flash, and Data register file RISC RISC (burst) ROM or or × DSP (X) 8 32 bit and DSP (V) ALU 2 × 40 bit memory memory SHARED Interrupt controller, Auxilliary ALU Control Flexible memory debugger, and clock signal Multiplier Can be accessed by both pulse generator Decoder RISC and DSP instructions. 16/32-bit prefetch ALU/Burrel shifter SHARED Peripheral module XDB YDB IDB Data buses : Enhanced functions in SH-DSP EDO: extended data output RAM: random access memory ALU: arithmetical and logical unit IAB: I address bus YAB: Y address bus XAB: X address bus IDB: I data bus YDB: Y data bus XDB: X data bus Fig. 2—SH Block Diagram and Features. SH-DSP incorporates a high-performance DSP engine and employs a 3-bus configuration for the DSP enhancement of the conventional SuperH RISC microcomputer. There are big advantages in chip size reduction and lower power consumption because memories and peripheral functions can be shared compared with a discrete CPU and DSP integrated in one chip. In the SH-DSP, in addition to CPU operation, the between CPU and DSP reduces the number of circuits RISC engine is also used to generate three addresses on the LSI. and control the DSP when executing DSP instructions. This mode of operation achieves high DSP SH-DSP PERFORMANCE performance with minimum increase in circuitry. A Fig. 3 shows a comparison of CPU and DSP 3-bus configuration is employed that enables parallel performance in each processor. SuperH has higher DSP execution of two data accesses and one program fetch performance than general RISC microcomputers for DSP operation. Fig. 2 shows the SH-DSP block because it has a 3-cycle multiply-accumulator. diagram and features. A general-purpose DSP, on the other hand, has the Besides 1-cycle multiply-accumulate, a total of 10 highest DSP performance because of its 1-cycle general-purpose data registers are incorporated in the multiply-accumulate and 3-bus configuration, but has DSP engine. For making the most of these hardware lower CPU performance because it does not support resources, 32-bit pseudo VLIW (very long instruction 8-/16-/32-bit data access and does not include so many word) instructions are introduced. versatile CPU instructions. Furthermore, functions such as zero-overhead repeat Unlike the SuperH alone or a general-purpose DSP, loop and module addressing, which are indispensable the SH-DSP has higher performance for both the DSP for DSPs, are also included. Memories and peripherals and CPU because it is perfectly upward compatible can be shared between the DSP and CPU. This enables with the SH-1 and SH-2 and executes all of the general- the use of SuperH DRAM (dynamic random access purpose DSP’s functions. CPU performance of the SH- memory) direct interface to the DSP, a function which DSP is 60 million instructions per second (MIPS), is not seen in other DSPs. This is especially effective which is the same as that of a 60-MHz SH-DSP. The for digital still cameras, which require large-capacity DSP performance is 120 million operations per second DRAM image memory. Also sharing of various circuits (MOPS), which is almost the same as that of a general- Hitachi Review Vol. 47 (1998), No. 4 123 purpose DSP. The actual benchmark test shows that has been developed. Still image compression by Joint SH-DSP performance is at the top level among general- Photographic Experts Group (JPEG) processing was purpose processors 1). executed in just 0.55 second for a Video Graphics Array (VGA) full-size image (640 × 480 pixels, Y : U : V = SH-DSP DIGITAL STILL CAMERA SYSTEM 4 : 2 : 2). For a quarter-VGA (QVGA) size image, A digital still camera system based on the SH-DSP four to seven frames can be processed in one second 200 MOPS SH-DSP 120 100 60 RISC + DSP SH-3 40 General-purpose DSP SH-2 20 SH-1 DSP performance 10 × 2 RISC + 3 multiply-accumulate × 2/3 5 RISC MOPS = MIPS Fig. 3—DSP and CPU Performance 3 of SH-DSP. MOPS = MIPS 2 SuperH shows higher DSP 1 2 3 5 10 20 30 60 100 200 performance due to a 3-cycle CPU performance multiply-accumulator. The SH-DSP MOPS (million operations per second) is a DSP performance measure, which shows the number of has 60-MIPS CPU performance as operations such as multiplications executed in one second. well as 120-MOPS DSP MIPS (million instructions per second) is a CPU performance measure, which shows the number of performance. Dhrystone benchmark instructions executed in one second. Control System controller board (DC-DS1) AMP Speaker CODEC Microphone Shutter SH-DSP 115 kbps HD64E7410 RS-232C 68 PCMCIA slot 3.3, 5 V 3.3 V IrDA 115 kbps PCMCIA ATA PC • Flash ATA card (75 Mbyte) • Compact flash card (15 Mbyte) × 16, 1 wait × 32, 0 wait × 32, 0 wait ROM MP-VRAM DRAM HN27C4096 HM538253 HB56TW433 (512 kbyte) (× 4 Mbyte) (16 Mbyte) 5 V 5 V 3.3 V Camera board (MV-DS10) Digital I/F (Y, R-Y/B-Y) Lens HA118184F HD49319F HD49811TFA Y Y/C CCD CDS/AGC ADC DAC DSP mix TG C Monitor (TV) V. Driver Microcontroller H8/300 series ATA: advanced technology attachment bus ADC: analog-to-digital converter Y: Illuminance signal C: chrominance signal Fig. 4—Block Diagram of an SH-DSP Digital Still Camera System. This digital still camera system consists of the camera board (MV-DS10) which captures the still image and converts it to a digital signal, and the system controller board (DC-DS1) that performs both image and speech compression/decompression and system control. The entire camera system can be controlled by a single SH-DSP. Embedded-DSP SuperH Family and Its Applications 124 in parallel with speech compression/decompression JPEG processing time 0.51.0 1.5 (s) processing. SH-1 Encode 1.30 20 MHz Decode 1.60 Digital Still Camera System Concept In a digital still camera system using SH-DSP, SH-2 0.83 28 MHz functions are shared between the DSP engine and the 1.02 compact RISC microcomputer. The DSP engine SH-3 0.35 executes digital signal processing, still image and 60 MHz 0.42 Operating conditions Picture size (QVGA):320×240 speech compression; the compact RISC microcomputer SH-DSP 0.13 Y:Cr:Cb=4:2:2, 24-bit full color executes the disc operating system (DOS) file 60 MHz 0.15 Compression ratio:1/10 management, communications, and camera system control. Although a dedicated LSI is also required for Code size 510 15 (kbyte) a conventional digital still camera systems, only the Com Decode SH-DSP and its software are required to configure a SH JPEG library (1) Encode (8) (6) 15 H-1,2,3 Work digital still camera system using an SH-DSP.
Recommended publications
  • SH-1/SH-2/SH-DSP Programming Manual
    Hitachi SuperH™ RISC Engine SH-1/SH-2/SH-DSP Programming Manual ADE-602-063C Rev. 4.0 5/13/99 Hitachi ,Ltd Cautions 1. Hitachi neither warrants nor grants licenses of any rights of Hitachi’s or any third party’s patent, copyright, trademark, or other intellectual property rights for information contained in this document. Hitachi bears no responsibility for problems that may arise with third party’s rights, including intellectual property rights, in connection with use of the information contained in this document. 2. Products and product specifications may be subject to change without notice. Confirm that you have received the latest product standards or specifications before final design, purchase or use. 3. Hitachi makes every attempt to ensure that its products are of high quality and reliability. However, contact Hitachi’s sales office before using the product in an application that demands especially high quality and reliability or where its failure or malfunction may directly threaten human life or cause risk of bodily injury, such as aerospace, aeronautics, nuclear power, combustion control, transportation, traffic, safety equipment or medical equipment for life support. 4. Design your application so that the product is used within the ranges guaranteed by Hitachi particularly for maximum rating, operating supply voltage range, heat radiation characteristics, installation conditions and other characteristics. Hitachi bears no responsibility for failure or damage when used beyond the guaranteed ranges. Even within the guaranteed ranges, consider normally foreseeable failure rates or failure modes in semiconductor devices and employ systemic measures such as fail-safes, so that the equipment incorporating Hitachi product does not cause bodily injury, fire or other consequential damage due to operation of the Hitachi product.
    [Show full text]
  • Superh RISC Engine SH-1/SH-2
    SuperH RISC Engine SH-1/SH-2 Programming Manual September 3, 1996 Hitachi America Ltd. Notice When using this document, keep the following in mind: 1. This document may, wholly or partially, be subject to change without notice. 2. All rights are reserved: No one is permitted to reproduce or duplicate, in any form, the whole or part of this document without Hitachi’s permission. 3. Hitachi will not be held responsible for any damage to the user that may result from accidents or any other reasons during operation of the user’s unit according to this document. 4. Circuitry and other examples described herein are meant merely to indicate the characteristics and performance of Hitachi’s semiconductor products. Hitachi assumes no responsibility for any intellectual property claims or other problems that may result from applications based on the examples described herein. 5. No license is granted by implication or otherwise under any patents or other rights of any third party or Hitachi, Ltd. 6. MEDICAL APPLICATIONS: Hitachi’s products are not authorized for use in MEDICAL APPLICATIONS without the written consent of the appropriate officer of Hitachi’s sales company. Such use includes, but is not limited to, use in life support systems. Buyers of Hitachi’s products are requested to notify the relevant Hitachi sales offices when planning to use the products in MEDICAL APPLICATIONS. Introduction The SuperH RISC engine family incorporates a RISC (Reduced Instruction Set Computer) type CPU. A basic instruction can be executed in one clock cycle, realizing high performance operation. A built-in multiplier can execute multiplication and addition as quickly as DSP.
    [Show full text]
  • Superh Family Catalog 3A
    SuperH® Family of Microcontrollers and Microprocessors ® 2006.4 Renesas Technology America SuperH® Family of Microcontrollers & Microprocessors 2006 Catalog Renesas — Your Best Decision for Microcontrollers and Microprocessors Choose for your next design; use for all your future ones. Renesas Technology, the leading global supplier of microcontrollers and microprocessors for embedded systems, especially flash microcomputers, makes it easy to select the right device for any application. The hundreds of devices in our H8®, M16C™ and SuperH® families, among others, have upwardly compatible architectures and standard platforms covering wide spans of performance, integration, power efficiency and price points. Your choices extend from cost-effective 8-bit microcontrollers to high-performance 16-bit and high-end 32-bit devices. These easy-to-use chips can enable your design innovations now and in the future. Our advanced silicon solutions simplify system optimizations, facilitate design enhancements and diversifications, and help you meet tight cost budgets. Key markets for Renesas microcontrollers and microprocessors include the mobile, automotive and PC/AV fields. For example, we are a leader in devices for car information systems, the technology forerunner in application processors for multimedia mobile phones, and an innovator in devices for power inverter applications. By choosing world-class Renesas solutions, you get highly reliable designs produced in high-quality fabrication facilities, and benefit from ongoing development defined by solid technology roadmaps. You can shorten your system design cycles and improve your time-to-market by taking advantage of our total system support, including software solutions, starter kits, reference platforms, reference designs, and middleware. Expert third-party support is available as well.
    [Show full text]
  • Low Power Asynchronous Digital Signal Processing
    LOW POWER ASYNCHRONOUS DIGITAL SIGNAL PROCESSING A thesis submitted to the University of Manchester for the degree of Doctor of Philosophy in the Faculty of Science & Engineering October 2000 Michael John George Lewis Department of Computer Science 1 Contents Chapter 1: Introduction ....................................................................................14 Digital Signal Processing ...............................................................................15 Evolution of digital signal processors ....................................................17 Architectural features of modern DSPs .........................................................19 High performance multiplier circuits .....................................................20 Memory architecture ..............................................................................21 Data address generation .........................................................................21 Loop management ..................................................................................23 Numerical precision, overflows and rounding .......................................24 Architecture of the GSM Mobile Phone System ...........................................25 Channel equalization ..............................................................................28 Error correction and Viterbi decoding ...................................................29 Speech transcoding ................................................................................31 Half-rate and enhanced
    [Show full text]
  • Embedded Operating Systems
    7 Embedded Operating Systems Claudio Scordino1, Errico Guidieri1, Bruno Morelli1, Andrea Marongiu2,3, Giuseppe Tagliavini3 and Paolo Gai1 1Evidence SRL, Italy 2Swiss Federal Institute of Technology in Zurich (ETHZ), Switzerland 3University of Bologna, Italy In this chapter, we will provide a description of existing open-source operating systems (OSs) which have been analyzed with the objective of providing a porting for the reference architecture described in Chapter 2. Among the various possibilities, the ERIKA Enterprise RTOS (Real-Time Operating System) and Linux with preemption patches have been selected. A description of the porting effort on the reference architecture has also been provided. 7.1 Introduction In the past, OSs for high-performance computing (HPC) were based on custom-tailored solutions to fully exploit all performance opportunities of supercomputers. Nowadays, instead, HPC systems are being moved away from in-house OSs to more generic OS solutions like Linux. Such a trend can be observed in the TOP500 list [1] that includes the 500 most powerful supercomputers in the world, in which Linux dominates the competition. In fact, in around 20 years, Linux has been capable of conquering all the TOP500 list from scratch (for the first time in November 2017). Each manufacturer, however, still implements specific changes to the Linux OS to better exploit specific computer hardware features. This is especially true in the case of computing nodes in which lightweight kernels are used to speed up the computation. 173 174 Embedded Operating Systems Figure 7.1 Number of Linux-based supercomputers in the TOP500 list. Linux is a full-featured OS, originally designed to be used in server or desktop environments.
    [Show full text]
  • The Digital Signal Processor Derby
    SEMICONDUCTORS The newest breeds trade off speed, energy consumption, and cost to vie for The an ever bigger piece of the action Digital Signal Processor BY JENNIFER EYRE Derby Berkeley Design Technology Inc. pplications that use digital signal-processing purpose processors typically lack these specialized features and chips are flourishing, buoyed by increasing per- are not as efficient at executing DSP algorithms. formance and falling prices. Concurrently, the For any processor, the faster its clock rate or the greater the market has expanded enormously, to an esti- amount of work performed in each clock cycle, the faster it can mated US $6 billion in 2000. Vendors abound. complete DSP tasks. Higher levels of parallelism, meaning the AMany newcomers have entered the market, while established ability to perform multiple operations at the same time, have companies compete for market share by creating ever more a direct effect on a processor’s speed, assuming that its clock novel, efficient, and higher-performing architectures. The rate does not decrease commensurately. The combination of range of digital signal-processing (DSP) architectures available more parallelism and faster clock speeds has increased the is unprecedented. speed of DSP processors since their commercial introduction In addition to expanding competition among DSP proces- in the early 1980s. A high-end DSP processor available in sor vendors, a new threat is coming from general-purpose 2000 from Texas Instruments Inc., Dallas, for example, is processors with DSP enhancements. So, DSP vendors have roughly 250 times as fast as the fastest processor the company begun to adapt their architectures to stave off the outsiders.
    [Show full text]
  • Superh TM RISC Engine High-Performance Embedded Workshop 3 User's Manual
    To our customers, Old Company Name in Catalogs and Other Documents On April 1st, 2010, NEC Electronics Corporation merged with Renesas Technology Corporation, and Renesas Electronics Corporation took over all the business of both companies. Therefore, although the old company name remains in this document, it is a valid Renesas Electronics document. We appreciate your understanding. Renesas Electronics website: http://www.renesas.com April 1st, 2010 Renesas Electronics Corporation Issued by: Renesas Electronics Corporation (http://www.renesas.com) Send any inquiries to http://www.renesas.com/inquiry. Notice 1. All information included in this document is current as of the date this document is issued. Such information, however, is subject to change without any prior notice. Before purchasing or using any Renesas Electronics products listed herein, please confirm the latest product information with a Renesas Electronics sales office. Also, please pay regular and careful attention to additional and different information to be disclosed by Renesas Electronics such as that disclosed through our website. 2. Renesas Electronics does not assume any liability for infringement of patents, copyrights, or other intellectual property rights of third parties by or arising from the use of Renesas Electronics products or technical information described in this document. No license, express, implied or otherwise, is granted hereby under any patents, copyrights or other intellectual property rights of Renesas Electronics or others. 3. You should not alter, modify, copy, or otherwise misappropriate any Renesas Electronics product, whether in whole or in part. 4. Descriptions of circuits, software and other related information in this document are provided only to illustrate the operation of semiconductor products and application examples.
    [Show full text]
  • On Performance of GPU and DSP Architectures for Computationally Intensive Applications
    University of Rhode Island DigitalCommons@URI Open Access Master's Theses 2013 On Performance of GPU and DSP Architectures for Computationally Intensive Applications John Faella University of Rhode Island, [email protected] Follow this and additional works at: https://digitalcommons.uri.edu/theses Recommended Citation Faella, John, "On Performance of GPU and DSP Architectures for Computationally Intensive Applications" (2013). Open Access Master's Theses. Paper 2. https://digitalcommons.uri.edu/theses/2 This Thesis is brought to you for free and open access by DigitalCommons@URI. It has been accepted for inclusion in Open Access Master's Theses by an authorized administrator of DigitalCommons@URI. For more information, please contact [email protected]. ON PERFORMANCE OF GPU AND DSP ARCHITECTURES FOR COMPUTATIONALLY INTENSIVE APPLICATIONS BY JOHN FAELLA A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF SCIENE IN ELECTRICAL ENGINEERING UNIVERSITY OF RHODE ISLAND 2013 MASTER OF SCIENCE THESIS OF JOHN FAELLA APPROVED: Thesis Committee: Major Professor Dr. Jien-Chung Lo Dr. Resit Sendag Dr. Lutz Hamel Nasser H. Zawia DEAN OF THE GRADUATE SCHOOL UNIVERSITY OF RHODE ISLAND 2013 ABSTRACT This thesis focuses on the implementations of a support vector machine (SVM) algorithm on digital signal processor (DSP), graphics processor unit (GPU), and a common Intel i7 core architecture. The purpose of this work is to identify which of the three is most suitable for SVM implementation. The performance is measured by looking at the time required by each of the architectures per prediction. This work also provides an analysis of possible alternatives to existing implementations of computationally intensive algorithms, such as SVM.
    [Show full text]
  • Solaris, HP9000 Series 700
    To our customers, Old Company Name in Catalogs and Other Documents On April 1st, 2010, NEC Electronics Corporation merged with Renesas Technology Corporation, and Renesas Electronics Corporation took over all the business of both companies. Therefore, although the old company name remains in this document, it is a valid Renesas Electronics document. We appreciate your understanding. Renesas Electronics website: http://www.renesas.com April 1st, 2010 Renesas Electronics Corporation Issued by: Renesas Electronics Corporation (http://www.renesas.com) Send any inquiries to http://www.renesas.com/inquiry. Notice 1. All information included in this document is current as of the date this document is issued. Such information, however, is subject to change without any prior notice. Before purchasing or using any Renesas Electronics products listed herein, please confirm the latest product information with a Renesas Electronics sales office. Also, please pay regular and careful attention to additional and different information to be disclosed by Renesas Electronics such as that disclosed through our website. 2. Renesas Electronics does not assume any liability for infringement of patents, copyrights, or other intellectual property rights of third parties by or arising from the use of Renesas Electronics products or technical information described in this document. No license, express, implied or otherwise, is granted hereby under any patents, copyrights or other intellectual property rights of Renesas Electronics or others. 3. You should not alter, modify, copy, or otherwise misappropriate any Renesas Electronics product, whether in whole or in part. 4. Descriptions of circuits, software and other related information in this document are provided only to illustrate the operation of semiconductor products and application examples.
    [Show full text]
  • Embedded Configurable Operating System
    eCos Embedded Configurable Operating System Embedded Track – FOSDEM 2003 Nick Garnett – Copyright © 2003 eCosCentric Features • Open Source • Royalty Free • Highly Configurable • Very Portable • Real Time • Small Footprint • Standards • Networking Embedded Track – FOSDEM 2003 Nick Garnett – Copyright © 2003 eCosCentric History • Cygnus Solutions – 1997 Project started – 1998 First release • Red Hat – 1999 Cygnus acquired by Red Hat – 2000 RedBoot added – 2002 Red Hat lay off eCos team • eCosCentric – 2002 eCos team form eCosCentric Embedded Track – FOSDEM 2003 Nick Garnett – Copyright © 2003 eCosCentric Licensing • RHEPL – Red Hat eCos Public License – Allowed proprietary code to be linked with eCos – Required all changes to be returned to Red Hat • GPL + exception – Standard GPL – Exception allows non-GPL code to be linked with eCos – No change requirement – No “plain” GPL code allowed in eCos Embedded Track – FOSDEM 2003 Nick Garnett – Copyright © 2003 eCosCentric Configuration • Packages – Separate package for each subsystem – Independently selectable – Self describing • Component Description Language – Describes features provided by package – Describes features required by package • Inference Engine – Automatic resolution of constraints Embedded Track – FOSDEM 2003 Nick Garnett – Copyright © 2003 eCosCentric Configuration Tools • GUI Configuration Tool – wxWindows based – Linux and Windows NT/2000/XP – Includes build and test systems – Large and small scale configuration • Command line Tool – Linux and Windows (Cygwin) – Automation
    [Show full text]
  • General Purpose Processors And
    Benchmarking Microprocessors for High-End Signal Processing Stephen Paavola SKY Computers Phone: 978-250-1920 Email Address: [email protected] There are a number of general-purpose methodology was chosen because the microprocessor architectures which, while not benchmark is intended to measure bandwidth, designed for high-end signal processing, might not computational performance. provide the processing performance required for complex radars, signal intelligence and other As might be expected, the 800 MHz Broadcom demanding applications. But how well does each BCM1250, with the lowest operating frequency really perform as a digital signal processor? of the group, also has the lowest bandwidth, whether the access is to L1 or L2 cache. Despite To answer this question, some simple the fact that this dual-processor chip has benchmarks were run on a 1GHz Freescale integrated memory controllers, it still lags behind 7447 PowerPC, 1.8 GHz IBM 970 PowerPC, 1.8 the other processors when accessing DRAM. GHz AMD Opteron and 800 MHz Broadcom MIPS-based 1250 chip. Memory Read Bandwidth The bottom line of this set of benchmarks is that 20.0 the PowerPC with AltiVec produces impressive 18.0 computational performance compared to the 16.0 14.0 other processors considered. Now that IBM is s) / B 12.0 1.8 GHz Opteron shipping its PowerPC 970 with AltiVec, there is a G 1 GHz 7447 h ( t 10.0 d 1.8 GHz 970 processor alternative that addresses the i w 8.0 800 MHz Broadcom d memory bandwidth limitations of the 7447. n 6.0 Ba Yet, despite the strengths of AltiVec, the 4.0 benchmarks revealed that the alternative 2.0 0.0 1 2 4 8 6 2 4 8 6 2 2 processors offer some interesting capabilities for 1 3 6 2 24 48 96 9 1 25 51 0 0 0 1 2 4 81 particular types of signal processing.
    [Show full text]
  • CORDIC Co-Processor Architecture for Digital Signal Processing Applications
    A Fast CORDIC Co-Processor Architecture for Digital Signal Processing Applications Javier O. Giacomantone, Horacio Villagarcía Wanza, Oscar N. Bria CeTADΗ – Fac. de Ingeniería – UNLP [email protected] Abstract The coordinate rotational digital computer (CORDIC) is an arithmetic algorithm, which has been used for arithmetic units in the fast computing of elementary functions and for special purpose hardware in programmable logic devices. This paper describes a classification method that can be used for the possible applications of the algorithm and the architecture that is required for fast hardware computing of the algorithm. Keywords: Computer Architectures, CORDIC, Computer Arithmetic, Hardware Algorithms, Digital Signal Processing Applications. Η Centro de Técnicas Analógico-Digitales. Director: Ing. Antonio A. Quijano. A Fast CORDIC Co-Processor Architecture for Digital Signal Processing Applications Javier O. Giacomantone, Horacio Villagarcía Wanza, Oscar N. Bria CeTADΗ – Fac. de Ingeniería – UNLP [email protected] Abstract The coordinate rotational digital computer (CORDIC) is an arithmetic algorithm, which has been used for arithmetic units in the fast computing of elementary functions and for special purpose hardware in programmable logic devices. This paper describes a classification method that can be used for the possible applications of the algorithm and the architecture that is required for fast hardware computing of the algorithm. Keywords: Computer Architectures, CORDIC, Computer Arithmetic, Hardware Algorithms, Digital Signal Processing Applications. I. Introduction The Coordinate Rotation Digital Computer (CORDIC) is an arithmetic technique, which makes it possible to perform two dimensional rotations using simple hardware components. The algorithm can be used to evaluate elementary functions such as cosine, sine, arctangent, sinh, cosh, tanh, ln and exp.
    [Show full text]