Архитектура Intel От I386 До Xeon Phi: Процессоры, Производительность, Энергопотребление

Total Page:16

File Type:pdf, Size:1020Kb

Архитектура Intel От I386 До Xeon Phi: Процессоры, Производительность, Энергопотребление Нижегородский государственный университет им. Н.И.Лобачевского Факультет Вычислительной математики и кибернетики Архитектура Intel от i386 до Xeon Phi: процессоры, производительность, энергопотребление Линёв А.В. 2014 Нижний Новгород Содержание Эволюция архитектур центральных процессоров. История появления новых концепций и их реализации в процессорах архитектур Intel. Изменения основных характеристик процессоров. Современные направления развития архитектуры процессоров. Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 2 из 66 Архитектура и микроархитектура CPU Фон Неймановская модель компьютера Разделение программируемой вычислительной машины на компоненты: Центральный обрабатывающий блок (Central Processing Unit, CPU) блок управления (Control Unit ) (декодирование инструкций, порядок операций) тракт данных (Datapath) (регистры, арифметико-логическое устройство, шины) Память: Хранение инструкций и их операндов Подсистема ввода/вывода (Input/Output, I/O sub- system): шина I/O, интерфейсы, устройства Концепция хранения программ: Инструкции из набора команд выбираются из общей памяти и исполняются последовательно Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 4 из 66 Фон Неймановская модель компьютера Компьютерная CPU I/O Devices система Control Memory Datapath Input Registers, (instructions, ALU, buses data) Главное ограничение производительности CPU: фон- Одно из ограничений Неймановская модель вычислений Output производительности: подразумевает последовательное общая память команд и исполнение инструкций по одной данных (потенциальное «узкое место») Процессор - программируемый вычислительный элемент, выполняющий программы, написанные с использованием предопределенного набора инструкций. Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 5 из 66 Шаги обработки инструкций в CPU Выборка Выбрать инструкцию программы из памяти инструкции Программный счетчик (Program Counter, PC / Instruction Pointer, IP) указывает на следующую для обработки инструкцию Декодирование Определить требуемые действия и размер инструкции инструкции Выборка операндов Найти и получить данные операндов Исполнение Вычислить значение результата или статус Сохранение Записать результаты в запоминающее устройство результатов для последующего использования Главное ограничение производительности CPU: фон- Неймановская модель вычислений подразумевает последовательное исполнение инструкций по одной Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 6 из 66 Intel 4004 Первый в мире процессор общего назначения 1971 г. 740 кГц 0,06 MIPS 2250 транзис- торов 12 мм2 10 мкм техпроцесс http://ru.wikipedia.org/wiki/Intel_4004 Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 7 из 66 Характеристики процессоров Процессор 4004 Год выпуска 1971 Тактовая частота 740 кГц Производительность (MIPS) 0,06 Количество транзисторов 2250 Площадь кристалла (кв.мм) 12 Техпроцесс 10 мкм Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 8 из 66 Intel 8008, 8080 Intel 8008 (1972) Первый 8-битный процессор Тактовая частота – 500 (800) кГц Производительность – 0,05 MIPS Количество транзисторов – 3500 Техпроцесс – 10 мкм Поддержка прерываний Intel 8080 (1974) Тактовая частота – 2 (2.5, 3) МГц Производительность – 0,64 MIPS Количество транзисторов – 6000 Техпроцесс – 6 мкм Порты ввода/вывода, Stack Pointer Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 9 из 66 Intel 8086/87 Intel 8086 (1978) Первый процессор архитектуры x86 Тактовая частота – 4 (10) МГц Производительность – 0,33 (0,75) MIPS Количество транзисторов – 29 000 Площадь кристалла – 33 кв.мм Техпроцесс – 3 мкм Потребляемая мощность – 0,65 Вт Тепловыделение – 1,75 Вт Intel 8087 (1980) Математический сопроцессор Производительность – ~50 000 FLOPS Количество транзисторов – 45 000 Техпроцесс – 3 мкм Потребляемая мощность – 2,4 Вт Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 10 из 66 http://faculty.etsu.edu/tarnoff/ntes2150/uproc/arch8088.htm Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 11 из 66 Конвейер команд… Конвейерная обработка инструкций – это метод реализации CPU, при котором множество операции над несколькими инструкциями перекрываются. Конвейерная обработка инструкций использует программный параллелизм уровня инструкций (Instruction- Level Parallelism, ILP) Конвейеризация увеличивает пропускную способность CPU - среднее число инструкций, завершенных за такт. В идеальном случае происходит завершение одной инструкции за машинный такт Конвейеризация не сокращает время выполнения отдельной инструкции (также называемое временем задержки завершения инструкции). Минимальное время задержки завершения инструкции - n тактов, где n – число ступеней конвейера Конвейер, описанный здесь, называется упорядоченным (in-order) конвейером так как инструкции обрабатываются или исполняются в порядке, указанном в исходной программе Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 12 из 66 Однопортовый конвейер MIPS с упорядоченной обработкой целочисленных операций Число тактов до заполнения = время разгона = число ступеней -1 Номер такта Номер инструкции 1 2 3 4 5 6 7 8 9 Время в тактах Инструкция I IF ID EX MEM WB Первая инструкция, I Инструкция I+1 IF ID EX MEM WB завершена Инструкция I+2 IF ID EX MEM WB Инструкция I+3 IF ID EX MEM WB Инструкция I +4 IF ID EX MEM WB Время разгона = 4 такта Последняя инструкция, Ступени конвейера MIPS: I+4 завершена IF = Выборка инструкции (Instruction Fetch) ID = Декодирование инструкции (Instruction Decode) EX = Исполнение (Execution) MEM = Обращение к памяти (Memory Access) WB = Запись результата (Write Back) Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 13 из 66 Intel 80286 Intel 80286 (1982) Тактовая частота – 6 (12,5) МГц Производительность – 0,9 (2,66) MIPS Количество транзисторов – 134 000 Площадь кристалла – 49 кв.мм Техпроцесс – 1,5 мкм 0.21 Instructions Per Clock Конвейер команд (длина - 4) Защищенный режим Linear Memory Management Unit (MMU) Intel 80287 (1983) Математический сопроцессор Производительность – ~65 000 FLOPS Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 14 из 66 Intel 80286 http://en.wikipedia.org/wiki/File:Intel_i80286_arch.svg Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 15 из 66 Intel 80386 Intel 80386DX (1985) Тактовая частота – 12 (33) МГц Производительность – 5 (11,4) MIPS Количество транзисторов – 275 000 Площадь кристалла – 104 (39) кв.мм Техпроцесс – 1,5 (1) мкм Страничное преобразование Аппаратная отладка Встроенный математический сопроцессор 80387 Производительность – ~300 000 FLOPS Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 16 из 66 Intel 80386 http://en.wikipedia.org/wiki/File:80386DX_arch.png Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 17 из 66 Страничная адресация Управляющий Виртуальное Регистр Адресное Таблица страниц Пространство RAM MAX Physical frame number N MAX (4 Гб) … Physical frame number 1 Physical frame number 0 0 HDD 0 Позволяет организовывать ВАП большого размера с линейной адресацией на основе разрывных блоков физической памяти Нижний Новгород 2013 Архитектура Intel Xeon Phi 18 из 45 Конвейер команд – Конфликты Структурные конфликты Возникают из-за недостатков аппаратных ресурсов когда доступное аппаратное обеспечение не в состоянии поддерживать все возможные комбинации инструкций Конфликты данных Возникают когда инструкция зависит от результата выполнения предыдущей инструкции так, что это проявляется при перекрытии инструкций в конвейере Конфликты управления Возникают при конвейеризации условных переходов и других инструкций, которые изменяют PC Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 19 из 66 Единая разделяемая память для инструкций и данных Процессор с блоком памяти, вызывающим структурные конфликты В машине с единственным портом памяти будет возникать конфликт при любом обращении к памяти. Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 20 из 66 Единая разделяемая память для данных и инструкций Разрешение структурного конфликта при помощи тактов простоя Структурный конфликт приводит к необходимости вставки «пузырей» в конвейер. Нижний Новгород Архитектура Intel от i386 до Xeon Phi: процессоры, 2014 производительность, энергопотребление 21 из 66 Многотактовый конвейер вещественных операций Задержка = 6 Целочисленный блок (Задержка= 0 Период запуска = 1) Конфликты: Период запуска = 1 RAW, WAW Конвейеризуемое возможны WAR невозможен Структурные: Вещественное (FP)/целочисленное умножение возможны Управления: возможны EX IF ID FP сумматор MEM WB Задержка = 3 FP/целочисленное деление Период запуска = 1 Задержка = 24 Конвейеризуемое
Recommended publications
  • Intel® IA-64 Architecture Software Developer's Manual
    Intel® IA-64 Architecture Software Developer’s Manual Volume 1: IA-64 Application Architecture Revision 1.1 July 2000 Document Number: 245317-002 THIS DOCUMENT IS PROVIDED “AS IS” WITH NO WARRANTIES WHATSOEVER, INCLUDING ANY WARRANTY OF MERCHANTABILITY, NONINFRINGEMENT, FITNESS FOR ANY PARTICULAR PURPOSE, OR ANY WARRANTY OTHERWISE ARISING OUT OF ANY PROPOSAL, SPECIFICATION OR SAMPLE. Information in this document is provided in connection with Intel products. No license, express or implied, by estoppel or otherwise, to any intellectual property rights is granted by this document. Except as provided in Intel's Terms and Conditions of Sale for such products, Intel assumes no liability whatsoever, and Intel disclaims any express or implied warranty, relating to sale and/or use of Intel products including liability or warranties relating to fitness for a particular purpose, merchantability, or infringement of any patent, copyright or other intellectual property right. Intel products are not intended for use in medical, life saving, or life sustaining applications. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked "reserved" or "undefined." Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. Intel® IA-64 processors may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Copies of documents which have an order number and are referenced in this document, or other Intel literature, may be obtained by calling 1-800- 548-4725, or by visiting Intel’s website at http://developer.intel.com/design/litcentr.
    [Show full text]
  • Appendix D an Alternative to RISC: the Intel 80X86
    D.1 Introduction D-2 D.2 80x86 Registers and Data Addressing Modes D-3 D.3 80x86 Integer Operations D-6 D.4 80x86 Floating-Point Operations D-10 D.5 80x86 Instruction Encoding D-12 D.6 Putting It All Together: Measurements of Instruction Set Usage D-14 D.7 Concluding Remarks D-20 D.8 Historical Perspective and References D-21 D An Alternative to RISC: The Intel 80x86 The x86 isn’t all that complex—it just doesn’t make a lot of sense. Mike Johnson Leader of 80x86 Design at AMD, Microprocessor Report (1994) © 2003 Elsevier Science (USA). All rights reserved. D-2 I Appendix D An Alternative to RISC: The Intel 80x86 D.1 Introduction MIPS was the vision of a single architect. The pieces of this architecture fit nicely together and the whole architecture can be described succinctly. Such is not the case of the 80x86: It is the product of several independent groups who evolved the architecture over 20 years, adding new features to the original instruction set as you might add clothing to a packed bag. Here are important 80x86 milestones: I 1978—The Intel 8086 architecture was announced as an assembly language– compatible extension of the then-successful Intel 8080, an 8-bit microproces- sor. The 8086 is a 16-bit architecture, with all internal registers 16 bits wide. Whereas the 8080 was a straightforward accumulator machine, the 8086 extended the architecture with additional registers. Because nearly every reg- ister has a dedicated use, the 8086 falls somewhere between an accumulator machine and a general-purpose register machine, and can fairly be called an extended accumulator machine.
    [Show full text]
  • Programmable Digital Microcircuits - a Survey with Examples of Use
    - 237 - PROGRAMMABLE DIGITAL MICROCIRCUITS - A SURVEY WITH EXAMPLES OF USE C. Verkerk CERN, Geneva, Switzerland 1. Introduction For most readers the title of these lecture notes will evoke microprocessors. The fixed instruction set microprocessors are however not the only programmable digital mi• crocircuits and, although a number of pages will be dedicated to them, the aim of these notes is also to draw attention to other useful microcircuits. A complete survey of programmable circuits would fill several books and a selection had therefore to be made. The choice has rather been to treat a variety of devices than to give an in- depth treatment of a particular circuit. The selected devices have all found useful ap• plications in high-energy physics, or hold promise for future use. The microprocessor is very young : just over eleven years. An advertisement, an• nouncing a new era of integrated electronics, and which appeared in the November 15, 1971 issue of Electronics News, is generally considered its birth-certificate. The adver• tisement was for the Intel 4004 and its three support chips. The history leading to this announcement merits to be recalled. Intel, then a very young company, was working on the design of a chip-set for a high-performance calculator, for and in collaboration with a Japanese firm, Busicom. One of the Intel engineers found the Busicom design of 9 different chips too complicated and tried to find a more general and programmable solu• tion. His design, the 4004 microprocessor, was finally adapted by Busicom, and after further négociation, Intel acquired marketing rights for its new invention.
    [Show full text]
  • APPLICATION NOTE Ap·113
    APPLICATION Ap·113 NOTE February 1981 Intel Corporation makes no warranty for the use of its products and assumes no responsibility for any errors which may appear in this document nor does it make a commitment to update the information contained herein. Intel software products are copyrighted by and shall remain the property of Intel Corporation. Use, duplication or disclosure is subjectto restrictions stated in Intel's software license, or as defined in ASPR 7-104.9 (a) (9). Intel Corporation a!=;sumes no resDonsibilitv for the use of anv circuitry other than circuitry embodied in an Intel product. No other circuit patent licenses are implied. No part of this document may be copied or reproduced in any form or by any means without the prior written consent of Intel Corporation. The following are trademarks of Intel Corporation and may only be used to identify Intel products: BXP Intelevision MULTIBUS* CREDIT Intellec MULTIMODULE i iSBC Plug-A-Bubble ICE iSBX PROMPT ICS Library Manager Promware im MCS RMX Insite Megachassis UPI Intel Micromap ~Scope System 2000 and the combinations of ICE, iCS, iSBC, MCS or RMX and a numerical suffix. MDS is an ordering code only and is not used as a product name or trademark. MDS® is a registered trademark of Mohawk Data Sciences Corporation. *MULTIBUS is a patented Intel bus. Additional copies of this manual or other Intel literature may be obtained from: Intel Corporation Literature Department SV3-3 3065 Bowers Avenue Santa Clara, CA 95051 © INTEL CORPORATION, 1981 AFN-013008-1 Ap·113 Getting Started With Contents the Numeric Data INTRODUCTION Processor iAPX 86,88 Base .............................
    [Show full text]
  • Introduction to Cpu
    microprocessors and microcontrollers - sadri 1 INTRODUCTION TO CPU Mohammad Sadegh Sadri Session 2 Microprocessor Course Isfahan University of Technology Sep., Oct., 2010 microprocessors and microcontrollers - sadri 2 Agenda • Review of the first session • A tour of silicon world! • Basic definition of CPU • Von Neumann Architecture • Example: Basic ARM7 Architecture • A brief detailed explanation of ARM7 Architecture • Hardvard Architecture • Example: TMS320C25 DSP microprocessors and microcontrollers - sadri 3 Agenda (2) • History of CPUs • 4004 • TMS1000 • 8080 • Z80 • Am2901 • 8051 • PIC16 microprocessors and microcontrollers - sadri 4 Von Neumann Architecture • Same Memory • Program • Data • Single Bus microprocessors and microcontrollers - sadri 5 Sample : ARM7T CPU microprocessors and microcontrollers - sadri 6 Harvard Architecture • Separate memories for program and data microprocessors and microcontrollers - sadri 7 TMS320C25 DSP microprocessors and microcontrollers - sadri 8 Silicon Market Revenue Rank Rank Country of 2009/2008 Company (million Market share 2009 2008 origin changes $ USD) Intel 11 USA 32 410 -4.0% 14.1% Corporation Samsung 22 South Korea 17 496 +3.5% 7.6% Electronics Toshiba 33Semiconduc Japan 10 319 -6.9% 4.5% tors Texas 44 USA 9 617 -12.6% 4.2% Instruments STMicroelec 55 FranceItaly 8 510 -17.6% 3.7% tronics 68Qualcomm USA 6 409 -1.1% 2.8% 79Hynix South Korea 6 246 +3.7% 2.7% 812AMD USA 5 207 -4.6% 2.3% Renesas 96 Japan 5 153 -26.6% 2.2% Technology 10 7 Sony Japan 4 468 -35.7% 1.9% microprocessors and microcontrollers
    [Show full text]
  • Numerical Computation Guide
    Numerical Computation Guide Sun Microsystems, Inc. 901 San Antonio Road Palo Alto, CA 94303 U.S.A. 650-960-1300 Part No. 806-3568-10 May 2000, Revision A Send comments about this document to: [email protected] Copyright © 2000 Sun Microsystems, Inc., 901 San Antonio Road • Palo Alto, CA 94303-4900 USA. All rights reserved. This product or document is distributed under licenses restricting its use, copying, distribution, and decompilation. No part of this product or document may be reproduced in any form by any means without prior written authorization of Sun and its licensors, if any. Third-party software, including font technology, is copyrighted and licensed from Sun suppliers. Parts of the product may be derived from Berkeley BSD systems, licensed from the University of California. UNIX is a registered trademark in the U.S. and other countries, exclusively licensed through X/Open Company, Ltd. For Netscape™, Netscape Navigator™, and the Netscape Communications Corporation logo™, the following notice applies: Copyright 1995 Netscape Communications Corporation. All rights reserved. Sun, Sun Microsystems, the Sun logo, docs.sun.com, AnswerBook2, Solaris, SunOS, JavaScript, SunExpress, Sun WorkShop, Sun WorkShop Professional, Sun Performance Library, Sun Performance WorkShop, Sun Visual WorkShop, and Forte are trademarks, registered trademarks, or service marks of Sun Microsystems, Inc. in the U.S. and other countries. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. in the U.S. and other countries. Products bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems, Inc. The OPEN LOOK and Sun™ Graphical User Interface was developed by Sun Microsystems, Inc.
    [Show full text]
  • Intel Architecture Software Developer's Manual
    Intel Architecture Software Developer’s Manual Volume 2: Instruction Set Reference NOTE: The Intel Architecture Software Developer’s Manual consists of three volumes: Basic Architecture, Order Number 243190; Instruction Set Reference, Order Number 243191; and the System Programming Guide, Order Number 243192. Please refer to all three volumes when evaluating your design needs. 1999 Information in this document is provided in connection with Intel products. No license, express or implied, by estoppel or otherwise, to any intellectual property rights is granted by this document. Except as provided in Intel's Terms and Conditions of Sale for such products, Intel assumes no liability whatsoever, and Intel disclaims any express or implied warranty, relating to sale and/or use of Intel products including liability or warranties relating to fitness for a particular purpose, merchantability, or infringement of any patent, copyright or other intellectual property right. Intel products are not intended for use in medical, life saving, or life sustaining applications. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked “reserved” or “undefined.” Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. Intel’s Intel Architecture processors (e.g., Pentium®, Pentium® II, Pentium® III, and Pentium® Pro processors) may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing your product order.
    [Show full text]
  • Microprocessors: from Basic Chips to Complete Systems
    - 237 - MICROPROCESSORS: FROM BASIC CHIPS TO COMPLETE SYSTEMS R. W. Dobinson,*) University of Illinois, Urbana, Illinois, USA. "Good-morning, good morning!", the General said When we met him last week on our way to the line. Now the soldiers he smiled at are most of them dead, And we're cursing his staff for Incompetent swine. "He's a cheery old card," grunted Harry to Jack As they slogged up to Arras with rifle and pack. * * * * But he did for them both by his plan of attack. Siegfried Sassoon April 1917 1. AIMS OF THESE LECTURES Microprocessor technology has, since its conception and birth in the early 1970's, entered very many areas of our lives. No end to its growth is in sight, and new uses appear almost daily. The semiconductor industry continues to produce ever more powerful integrated circuits (known far and wide as chips); more functionality and speed at lower cost is every salesman's cry. These lectures aim to present and explain in general terms some of the characteristics of microprocessor chips and associated components. They will show how systems are synthesized from the basic integrated circuit building blocks which are currently available; processor, memory, input-output (I/O) devices, etc. It is not my intention to discuss in detail the many different microprocessors now available on the market, nor will a complete catalogue of support chips be presented. Time will not permit this. Instead, emphasis will be placed on explaining the basic principles of different types of chip. As far as possible X will avoid talking too much about any specific devices; thus I will spend some time discussing a generic microprocessor accessing generic memory and talking to the outside world via generic I/O devices.
    [Show full text]
  • Coprocessors: Failures and Successes
    COPROCESSORS : FAILURES AND SUCCESSES APREPRINT Daniel Etiemble Paris Sud University, Computer Science Laboratory (LRI) 91405 Orsay - France [email protected] July 25, 2019 English version of the paper presented in the French Conference COMPAS 2019. ABSTRACT The appearance and disappearance of coprocessors by integration into the CPU, the success or failure of coprocessors are examined by summarizing their characteristics from the mainframes of the 1960s. The coprocessors most particularly reviewed are the IBM 360 and CDC-6600 I/O processors, the Intel 8087 math coprocessor, the Cell processor, the Intel Xeon Phi coprocessors, the GPUs, the FPGAs, and the coprocessors of manycores SW26010 and Pezy SC-2 used in high-ranked supercomputers in the TOP500 or Green500. The conditions for a coprocessor to be viable in the medium or long-term are defined. Keywords Coprocessor · 8087 · Cell · Xeon Phi · GPU 1 Introduction Since the early days of computers, coprocessors have been used to relieve the main processor (CPU) of certain “ancillary” tasks. These coprocessors have been or are being used for different tasks: • I/O coprocessors • Floating-point coprocessors • Graphic coprocessors arXiv:1907.06948v1 [cs.AR] 16 Jul 2019 • Coprocessors for accelerating computation The history of these coprocessors is diverse. Some have disappeared in the medium or short-term, following a technological breakthrough such as the invention of semiconductors, or the evolution of integrated circuit density. This is particularly the case of I/O coprocessors or floating-point coprocessors. Others like graphic coprocessors have a tumultuous history depending on their use for graphics or for high-performance computing or artificial intelligence.
    [Show full text]
  • INF5063: Programming Heterogeneous Multi-Core Processors
    INF5063: Programming heterogeneous multi-core processors Introduction Håkon Kvale Stensland August 25th, 2015 INF5063 University of Oslo INF5063 Overview § Course topic and scope § Background For the use and parallel processing using heterogeneous multi-core processors § Examples oF heterogeneous architectures § Vector Processing University of Oslo INF5063 INF5063: The Course People § Håkon Kvale Stensland email: haakonks @ ifi § Carsten Griwodz email: griFF @ ifi § ProFessor Pål Halvorsen email: paalh @ ifi § Guest lectures From Dolphin Interconnect Solutions Hugo Kohmann & Roy Nordstrøm University of Oslo INF5063 Time and place § Lectures: Tuesday 09:00 - 16:00 Storstua (Simula Research Laboratory) − August 25th − September 22nd − October 20th − November 24st § Group exercises: No group exercises in this course! University of Oslo INF5063 Plan For Today (Session 1) 10:15 – 11:00: Course Introduction 11:00 – 11:15: Break 11:15 – 12:00: Introduction to SSE/AVX 12:00 – 13:00: Lunch (Will be provided by Simula) 13:00 – 13:45: Introduction to Video Processing 13:45 – 14:00: Break 14:00 – 14:45: Using SIMD For Video Processing 14:45 – 15:00: Break 15:00 – 15:45: Codec 63 (c63) Home Exam Precode 15:45 – 16:00: Home Exam 1 is presented University of Oslo INF5063 About INF5063: Topic & Scope § Content: The course gives … − … an overview oF heterogeneous multi-core processors in general and three variants in particular and a modern general-purpose core (architectures and use) − … an introduction to working with heterogeneous multi-core processors
    [Show full text]
  • Intel® Itanium® Architecture Software Developer's Manual, Volume 1
    Intel® Itanium® Architecture Software Developer’s Manual Volume 1: Application Architecture Revision 2.3 May 2010 Document Number: 245317 THIS DOCUMENT IS PROVIDED “AS IS” WITH NO WARRANTIES WHATSOEVER, INCLUDING ANY WARRANTY OF MERCHANTABILITY, FITNESS FOR ANY PARTICULAR PURPOSE, OR ANY WARRANTY OTHERWISE ARISING OUT OF ANY PROPOSAL, SPECIFICATION OR SAMPLE. Information in this document is provided in connection with Intel® products. No license, express or implied, by estoppel or otherwise, to any intellectual property rights is granted by this document. Except as provided in Intel's Terms and Conditions of Sale for such products, Intel assumes no liability whatsoever, and Intel disclaims any express or implied warranty, relating to sale and/or use of Intel products including liability or warranties relating to fitness for a particular purpose, merchantability, or infringement of any patent, copyright or other intellectual property right. Intel products are not intended for use in medical, life saving, or life sustaining applications. Intel may make changes to specifications and product descriptions at any time, without notice. Designers must not rely on the absence or characteristics of any features or instructions marked “reserved” or “undefined.” Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them. Intel® processors based on the Itanium architecture may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request. Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing your product order.
    [Show full text]
  • Instruction Set Reference, A-M
    IA-32 Intel® Architecture Software Developer’s Manual Volume 2A: Instruction Set Reference, A-M NOTE: The IA-32 Intel Architecture Software Developer’s Manual consists of four volumes: Basic Architecture, Order Number 253665; Instruction Set Reference A-M, Order Number 253666; Instruction Set Reference N-Z, Order Number 253667; and the System Programming Guide, Order Number 253668. Refer to all four volumes when evaluating your design needs. Order Number: 253666-017 September 2005 INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EX- PRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. EXCEPT AS PROVIDED IN INTEL’S TERMS AND CONDITIONS OF SALE FOR SUCH PRODUCTS, INTEL ASSUMES NO LIABILITY WHATSOEVER, AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RE- LATING TO SALE AND/OR USE OF INTEL PRODUCTS INCLUDING LIABILITY OR WARRANTIES RELATING TO FIT- NESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. INTEL PRODUCTS ARE NOT INTENDED FOR USE IN MEDICAL, LIFE SAVING, OR LIFE SUSTAINING APPLICATIONS. Intel may make changes to specifications and product descriptions at any time, without notice. Developers must not rely on the absence or characteristics of any features or instructions marked “reserved” or “undefined.” Improper use of reserved or undefined features or instructions may cause unpredictable behavior or failure in developer's software code when running on an Intel processor. Intel reserves these features or instructions for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from their unauthorized use.
    [Show full text]