The Supercomputer in Your Pocket
										Recommended publications
									
								- 
NVIDIA Tesla Personal Supercomputer
NVIDIA TESLA PERSONAL SUPERCOMPUTER. TESLA DATASHEET.
Get your own supercomputer. Experience cluster level computing performance, up to 250 times faster than standard PCs and workstations, right at your desk. The NVIDIA® Tesla™ Personal Supercomputer is based on the revolutionary NVIDIA CUDA™ parallel computing architecture and powered by up to 960 parallel processing cores.
TESLA C1060 COMPUTING PROCESSORS ARE THE CORE OF THE TESLA PERSONAL SUPERCOMPUTER
YOUR OWN SUPERCOMPUTER: Get nearly 4 teraflops of compute capability and the ability to perform computations 250 times faster than a multi-CPU core PC or workstation.
ACCESSIBLE TO EVERYONE: Available from OEMs and resellers worldwide, the Tesla Personal Supercomputer operates quietly and plugs into a standard power strip so you can take advantage of cluster level performance anytime you want, right from your desk.
NVIDIA CUDA UNLOCKS THE POWER OF GPU PARALLEL COMPUTING: The CUDA™ parallel computing architecture enables developers to utilize C programming with NVIDIA GPUs to run the most complex computationally-intensive applications. CUDA is easy to learn and has become widely adopted by thousands of application developers worldwide to accelerate the most performance demanding applications.
FEATURES AND BENEFITS
• Your own Supercomputer: Dedicated computing resource for every computational researcher and technical professional.
• Cluster Performance on your Desktop: The performance of a cluster in a desktop system. Four Tesla computing processors deliver nearly 4 teraflops of performance.
• Designed for Office Use: Plugs into a standard office power socket and quiet enough for use at your desk.
• Massively Parallel Many Core GPU Architecture: 240 parallel processor cores per GPU that can execute thousands of concurrent threads.
TESLA PERSONAL SUPERCOMPUTER | DATASHEET | MAR09 | FINAL
- 
EEMBC and the Purposes of Embedded Processor Benchmarking
Markus Levy, President. ISPASS 2005. Certified Performance Analysis for Embedded Systems Designers.
EEMBC: A Historical Perspective
• Began as an EDN Magazine project in April 1997
• Replace Dhrystone
• Have meaningful measure for explaining processor behavior
• Developed business model
• Invited worldwide processor vendors
• A consortium was born
EEMBC Membership
• Board Member
  • Membership Dues: $30,000 (1st year); $16,000 (subsequent years)
  • Access and Participation on ALL Subcommittees
  • Full Voting Rights
• Subcommittee Member
  • Membership Dues Are Subcommittee Specific
  • Access to Specific Benchmarks
  • Technical Involvement Within Subcommittee
  • Help Determine Next Generation Benchmarks
• Special Academic Membership
EEMBC Philosophy: Standardized Benchmarks and Certified Scores
• Member derived benchmarks
  • Determine the standard, the process, and the benchmarks
  • Open to industry feedback
  • Ensures all processor/compiler vendors are running the same tests
• Certification process ensures credibility
  • All benchmark scores officially validated before publication
  • The entire benchmark environment must be disclosed
Embedded Industry: Represented by Very Diverse Applications
• Networking
  • Storage, low- and high-end routers, switches
• Consumer
  • Games, set top boxes, car navigation, smartcards
• Wireless
  • Cellular, routers
• Office Automation
  • Printers, copiers, imaging
• Automotive
  • Engine control, Telematics
Traditional Division of Embedded Applications (figure; axis labels: Low, High, Power)
- 
Mobile Phones and Cloud Computing
A quantitative research paper on mobile phone application offloading by cloud computing utilization. Oskar Hamrén, Department of informatics, Human Computer Interaction Master’s programme. Master thesis, 2-year level, 30 credits. SPM 2012.07
Abstract
The development of the mobile phone has been rapid. From being a device mainly used for phone calls and writing text messages, the mobile phone of today, commonly referred to as the smartphone, has become a multi-purpose device. Because of its size and thermal constraints there are certain limitations in areas of battery life and computational capabilities. Some say that cloud computing is just another buzzword, a way to sell already existing technology. Others claim that it has the potential to transform the whole IT industry. This thesis covers the intersection of these two fields by investigating whether it is possible to increase the speed of mobile phones by offloading computationally heavy mobile phone application functions using cloud computing. A mobile phone application was developed that conducts three computationally heavy tests. The tests were run twice, once without cloud computing offloading and once with it. The time taken to carry out the tests was saved and later compared to see whether it is faster to use cloud computing than not to use it. The results showed that it is not beneficial to use cloud computing to carry out these types of tasks; it is faster to use the mobile phone.
Table of Contents
Abstract
Table of Contents
1. Introduction
1.1 Previous research
- 
45-Year CPU Evolution: One Law and Two Equations
Daniel Etiemble, LRI-CNRS, University Paris Sud, Orsay, France. [email protected]
Abstract: Moore’s law and two equations allow to explain the main trends of CPU evolution since MOS technologies have been used to implement microprocessors.
Keywords: Moore’s law, execution time, CMOS power dissipation.
I. INTRODUCTION
A new era started when MOS technologies were used to build microprocessors. After pMOS (Intel 4004 in 1971) and nMOS (Intel 8080 in 1974), CMOS became quickly the leading technology, used by Intel since 1985 with 80386 CPU. MOS technologies obey an empirical law, stated in 1965 and known as Moore’s law: the number of transistors integrated on a chip doubles every N months. Fig. 1 presents the evolution for DRAM memories, processors (MPU) and three types of read-only memories [1]. The growth rate decreases with years, from [...]
a) IC is the instruction count.
b) CPI is the clock cycles per instruction and IPC = 1/CPI is the instruction count per clock cycle.
c) Tc is the clock cycle time and F = 1/Tc is the clock frequency.
The power dissipation of CMOS circuits is the second equation (2). CMOS power dissipation is decomposed into static and dynamic powers. For dynamic power, Vdd is the power supply, F is the clock frequency, ΣCi is the sum of gate and interconnection capacitances and α is the average percentage of switching capacitances: α is the activity factor of the overall circuit.
$P_d = P_{dstatic} + \alpha \times \sum C_i \times V_{dd}^2 \times F$ (2)
II. CONSEQUENCES OF MOORE LAW
A. [...]
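As a rough illustration of equation (2) in the excerpt above, the sketch below evaluates the dynamic term for two supply voltages. The function names and all numeric values are hypothetical and chosen only to show the quadratic dependence on Vdd; they are not taken from the paper.

```python
def dynamic_power(alpha, c_total, vdd, freq):
    """Dynamic term of equation (2): alpha * sum(C_i) * Vdd^2 * F."""
    return alpha * c_total * vdd ** 2 * freq


def total_power(p_static, alpha, c_total, vdd, freq):
    """Equation (2): static power plus the dynamic term."""
    return p_static + dynamic_power(alpha, c_total, vdd, freq)


# Hypothetical circuit: 1 nF of switched capacitance, 20% activity, 2 GHz clock.
full_vdd = dynamic_power(alpha=0.2, c_total=1e-9, vdd=1.0, freq=2e9)
half_vdd = dynamic_power(alpha=0.2, c_total=1e-9, vdd=0.5, freq=2e9)

# Halving Vdd at a fixed frequency divides the dynamic term by four.
print(f"dynamic power ratio: {full_vdd / half_vdd:.1f}")
```

The static term is kept as a separate input because the excerpt gives no formula for it.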
- 
Uses and Effects of Mobile Computing Devices in K–8 Classrooms
Karen Swan, Mark van ‘t Hooft, Annette Kratcoski (Kent State University); Darlene Unger (Virginia Commonwealth University)
Abstract
This preliminary study employed mixed methodologies to explore students’ use of mobile computing devices and its effects on their motivation to learn, engagement in learning activities, and support for learning processes. Data collected from students in four elementary and two seventh grade science classes in Northeast Ohio included usage logs, student work samples, student and teacher interviews, and classroom observations. Findings highlight the personalization of learning afforded by such devices both in terms of individuals and individual classroom cultures, as well as their usefulness in extending learning beyond the classroom. They also suggest that increased motivation due to mobile device use leads to increases in the quality and quantity of student work. (Keywords: mobile computing, motivation, writing.)
BACKGROUND
More than a decade ago, Mark Weiser (1991) wrote that we live in a society in which technology is so pervasive that we do not notice it anymore when used for everyday tasks such as information retrieval, communication, and entertainment. Defining this environment as ubiquitous computing, he described it more as a state of mind, as “a new way of thinking about computers in the world . . . [that] allows the computers themselves to vanish into the background . . . [and] become indistinguishable from everyday life” (p. 94). As a result, the current generation of K–12 students is growing up more technologically literate than children their age were a decade ago, with access to an increasing number of devices and services such as video game consoles, mobile gaming devices, cell phones, the Internet, and instant messaging.
- 
On Energy Consumption of Mobile Cloud Gaming Using GamingAnywhere
Thesis no.: MSEE-2016-54
Suren Musinada, Faculty of Computing, Blekinge Institute of Technology, SE–371 79 Karlskrona, Sweden
This thesis is submitted to the Faculty of Computing at Blekinge Institute of Technology in partial fulfillment of the requirements for the degree of Masters in Electrical Engineering with Emphasis on Telecommunication Systems. The thesis is equivalent to 20 weeks of full time studies.
Contact Information:
Author(s): Suren Musinada, E-mail: [email protected]
University advisor: Dr. Yong Yao, Department of communication systems, E-mail: [email protected]
Faculty of Computing, Blekinge Institute of Technology, SE–371 79 Karlskrona, Sweden. Internet: www.bth.se. Phone: +46 455 38 50 00. Fax: +46 455 38 50 57
Abstract
In the contemporary world, there has been a great proliferation of smartphone devices and broadband wireless networks, and the mobile gaming market among the young generation is increasing tremendously because of its enormous entertainment features. Mobile cloud gaming is a promising technology that overcomes implicit restrictions such as computational capacity and limited battery life. GamingAnywhere is an open source cloud gaming system which is used in this thesis to calculate the energy consumption of a mobile device when using GamingAnywhere.
The aim of the thesis is to measure the power consumption of the mobile device when the game is streamed from the GamingAnywhere server to the GamingAnywhere client. Total power consumption is calculated for four resolutions by using the hardware Monsoon power monitoring tool, and the power of individual components of the mobile device such as CPU, LCD and audio is calculated by the software PowerTutor.
- 
The von Neumann Computer Model
5/30/17, 10:03 PM
CIS-77 Home http://www.c-jump.com/CIS77/CIS77syllabus.htm
1. The von Neumann Computer Model
2. Components of the Von Neumann Model
3. Communication Between Memory and Processing Unit
4. CPU data-path
5. Memory Operations
6. Understanding the MAR and the MDR
7. Understanding the MAR and the MDR, Cont.
8. ALU, the Processing Unit
9. ALU and the Word Length
10. Control Unit
11. Control Unit, Cont.
12. Input/Output
13. Input/Output Ports
14. Input/Output Address Space
15. Console Input/Output in Protected Memory Mode
16. Instruction Processing
17. Instruction Components
18. Why Learn Intel x86 ISA ?
19. Design of the x86 CPU Instruction Set
20. CPU Instruction Set
21. History of IBM PC
22. Early x86 Processor Family
23. 8086 and 8088 CPU
24. 80186 CPU
25. 80286 CPU
26. 80386 CPU
27. 80386 CPU, Cont.
28. 80486 CPU
29. Pentium (Intel 80586)
30. Pentium Pro
31. Pentium II
32. Itanium processor
1. The von Neumann Computer Model
Von Neumann computer systems contain three main building blocks: the central processing unit (CPU), memory, and input/output devices (I/O). These three components are connected together using the system bus. The most prominent items within the CPU are the registers: they can be manipulated directly by a computer program.
The following block diagram shows major relationship between CPU components: (figure)
http://www.c-jump.com/CIS77/CPU/VonNeumann/lecture.html Page 1 of 15 IPR2017-01532 FanDuel, et al.
- 
Imagination Technologies Group plc Annual Report 2006
The data and projections shown on pages 1-24 are the product of consolidated partner data, analyst information and Imagination Technologies research. Imagination Technologies, the Imagination Technologies logo, PowerVR, the PowerVR logo, Metagence, the Metagence logo, Ensigma, the Ensigma logo, PURE Digital, IMGworks, CodeScape, META, MTX, MBX, MBX Lite, SGX, UCC, MiniEngine, PocketDAB, ReVu, the Bug, the Bug logo, Legato, EVOKE-1, TEMPUS, CHRONOS, OASIS, PURE ONE, SONUS-1XT and Élan are trademarks or registered trademarks of Imagination Technologies Limited. All other logos, products, trademarks and registered trademarks are the property of their respective owners. Copyright © 2006 Imagination Technologies Limited, an Imagination Technologies Group plc company.
Introduction
Improvements in technology mean that more and more functions can be integrated onto just one silicon chip. Whilst in the past, half a dozen companies might each have made a specialised chip to go into an advanced product like a TV or mobile phone, now it is more likely that one large semiconductor company will make a single chip that contains most, if not all, of that specialised knowledge. That’s where silicon IP, or ‘intellectual property’, comes in. Semiconductor companies can license parts of a chip design from companies that specialise in those advanced technologies, instead of developing the technology themselves. They are especially likely to do this in new market areas.
- 
Lecture Notes
Lecture #4-5: Computer Hardware (Overview and CPUs). CS106E Spring 2018, Young
In these lectures, we begin our three-lecture exploration of Computer Hardware. We start by looking at the different types of computer components and how they interact during basic computer operations. Next, we focus specifically on the CPU (Central Processing Unit). We take a look at the Machine Language of the CPU and discover it’s really quite primitive. We explore how Compilers and Interpreters allow us to go from the High-Level Languages we are used to programming to the Low-Level machine language actually used by the CPU. Most modern CPUs are multicore. We take a look at when multicore provides big advantages and when it doesn’t. We also take a short look at Graphics Processing Units (GPUs) and what they might be used for. We end by taking a look at Reduced Instruction Set Computing (RISC) and Complex Instruction Set Computing (CISC). Stanford President John Hennessy won the Turing Award (Computer Science’s equivalent of the Nobel Prize) for his work on RISC computing.
Hardware and Software: Hardware refers to the physical components of a computer. Software refers to the programs or instructions that run on the physical computer.
- We can entirely change the software on a computer, without changing the hardware, and it will transform how the computer works. I can take an Apple MacBook for example, remove the Apple Software and install Microsoft Windows, and I now have a Windows computer.
- In the next two lectures we will focus entirely on Hardware.
- 
Mobile Supercomputers
EMBEDDED COMPUTING
Todd Austin, David Blaauw, Scott Mahlke, and Trevor Mudge, University of Michigan; Chaitali Chakrabarti, Arizona State University; Wayne Wolf, Princeton University
Current trends in computer architecture and power cannot meet the demands of mobile supercomputing. Significant innovation is required.
Moore’s law has held sway over the past several decades, with the number of transistors per chip doubling every 18 months. As a result, a fairly inexpensive CPU can perform hundreds of millions of operations per second, performance that would have cost millions of dollars two decades ago. We should be proud of our achievements and rest on our laurels, right? Unfortunately, no. The human appetite for computation has grown even faster than the processing power that Moore’s law pre- [...]
[...] puters. Rather than worrying solely about performance, with the occasional nod to power consumption and cost, we need to judge computers by their performance-power-cost product. This new way of looking at proces- [...]
[...] ing. We also anticipate the emergence of relatively simple, disposable devices that support the pervasive computing infrastructure, for example, sensor network nodes. The requirements of low-end devices are increasing exponentially, and computer architectures must adapt to keep up. Some elements of high-end devices are already present in 3G cell phones from the major manufacturers. High-end PDAs also include an amazing range of features, such as networking and cameras.
SUPERCOMPUTING REQUIREMENTS
A mobile supercomputer will employ natural I/O interfaces to the mobile user. For example, input could come through a continuous real-time speech-processing component.
- 
Comparing the Power and Performance of Intel’s SCC to State-of-the-Art CPUs and GPUs
Ehsan Totoni, Babak Behzad, Swapnil Ghike, Josep Torrellas. Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA. E-mail: {totoni2, bbehza2, ghike2, [email protected]
Abstract: Power dissipation and energy consumption are becoming increasingly important architectural design constraints in different types of computers, from embedded systems to large-scale supercomputers. To continue the scaling of performance, it is essential that we build parallel processor chips that make the best use of exponentially increasing numbers of transistors within the power and energy budgets. Intel SCC is an appealing option for future many-core architectures. In this paper, we use various scalable applications to quantitatively compare and analyze the performance, power consumption and energy efficiency of different cutting-edge platforms that differ in architectural build. These platforms include the Intel Single-Chip Cloud Computer (SCC) many-core, the Intel Core i7 general-purpose multi-core, the Intel Atom low-power processor, and the Nvidia ION2 [...]
A key architectural challenge now is how to support increasing parallelism and scale performance, while being power and energy efficient. There are multiple options on the table, namely “heavy-weight” multi-cores (such as general purpose processors), “light-weight” many-cores (such as Intel’s Single-Chip Cloud Computer (SCC) [1]), low-power processors (such as embedded processors), and SIMD-like highly-parallel architectures (such as General-Purpose Graphics Processing Units (GPGPUs)).
The Intel SCC [1] is a research chip made by Intel Labs to explore future many-core architectures. It has 48 Pentium (P54C) cores in 24 tiles of two cores each.
- 
Technology & Energy
CIS 501: Computer Architecture. Unit 3: Technology & Energy
Slides developed by Milo Martin & Amir Roth at the University of Pennsylvania with sources that included University of Wisconsin slides by Mark Hill, Guri Sohi, Jim Smith, and David Wood
CIS 501: Comp. Arch. | Prof. Milo Martin | Technology & Energy
This Unit: Technology & Energy
• Technology basis
• Fabrication (manufacturing) & cost
• Transistors & wires
• Implications of transistor scaling (Moore’s Law)
• Energy & power
Readings
• MA:FSPTCM
• Section 1.1 (technology) + 4
• Section 9.1 (power & energy)
• Paper
• G. Moore, “Cramming More Components onto Integrated Circuits”
Review: Simple Datapath
(figure; labels: PC, Insn Mem, Register File with s1, s2, d, Data Mem)
• How are instructions executed?
• Fetch instruction (Program counter into instruction memory)
• Read registers
• Calculate values (adds, subtracts, address generation, etc.)
• Access memory (optional)
• Calculate next program counter (PC)
• Repeat
• Clock period = longest delay through datapath
Recall: Processor Performance
• Programs consist of simple operations (instructions)
• Add two numbers, fetch data value from memory, etc.
• Program runtime = “seconds per program” = (instructions/program) * (cycles/instruction) * (seconds/cycle)
• Instructions per program: “dynamic instruction count”
• Runtime count of instructions executed by the program
• Determined by program, compiler, instruction set architecture (ISA)
• Cycles per instruction: “CPI” (typical range: 2 to 0.5)
• On average, how many cycles does an instruction take to execute?
• Determined by program, compiler, ISA, micro-architecture
• Seconds per cycle: clock period, length of each cycle
• Inverse metric: cycles per second (Hertz) or cycles per ns (GHz)
• Determined by micro-architecture, technology parameters
• This unit: transistors & semiconductor technology
Technology & Fabrication
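The runtime decomposition quoted in the slide excerpt above can be tried out with a minimal sketch. The function name and the workload figures are hypothetical; only the formula (instructions/program) * (cycles/instruction) * (seconds/cycle) and the quoted CPI range of 2 to 0.5 come from the excerpt.

```python
def runtime_seconds(instructions, cpi, clock_hz):
    """Program runtime = instructions * (cycles/instruction) * (seconds/cycle)."""
    seconds_per_cycle = 1.0 / clock_hz  # clock period is the inverse of frequency
    return instructions * cpi * seconds_per_cycle


# Hypothetical workload: one billion dynamic instructions on a 1 GHz clock,
# evaluated at both ends of the quoted CPI range.
slow = runtime_seconds(1_000_000_000, cpi=2.0, clock_hz=1e9)
fast = runtime_seconds(1_000_000_000, cpi=0.5, clock_hz=1e9)
print(f"CPI 2.0: {slow:.2f} s, CPI 0.5: {fast:.2f} s")
```

With the instruction count and clock held fixed, runtime scales linearly with CPI, so the two ends of the quoted range differ by a factor of four.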