FPGA-Acceleration on COTS X86 Platforms University of Mannheim, 16 Feb 2007

FPGA-Acceleration on COTS x86 Platforms University of Mannheim, 16 Feb 2007 XtremeData, Inc.: Confidential Slide 1 Slide 1: XtremeData Inc.: Confidential Information TodayToday’’’’ss Agenda XtremeData Corporate & Team background Why FPGAs in COTS x86? Issues and XDI Solution FPGA acceleration markets FPGAs in HPC Summary XtremeData, Inc.: Confidential Slide 2 Slide 2: XtremeData Inc.: Confidential Information XtremeDataXtremeData:: Corporate HistoryHistory………… 2004 Incorporated 2003, Seed funds raised. Jan Market research & POC completed: target markets identified. Apr SeriesA raised and development started with two teams: Jul hardware in Chicago and software in Bangalore, India. Oct System architecture defined: commodity hardware platform, Jan accelerator and database engine. Apr Jul FPGA Module offered as a stand-alone product, press releases; 2006 2005 strategic partnerships made, shipments started… Oct Jan Apr SeriesB fund raise closing Jan 2007 for Go-To-Market financing 2007 XtremeData, Inc.: Confidential Slide 3 Slide 3: XtremeData Inc.: Confidential Information Team Background Ravi Chandran , CEO BE Electronics, India, MS EE, University of Texas, Arlington, MBA, Kellogg School, Northwestern University, IL President, Binary Machines, Inc., Schaumburg, IL COO, VP of Engineering., Bio-Imaging Research, Inc., Lincolnshire, IL (www.bio-imaging.com) 20+ years of product development & design services in medical & industrial (NDT) imaging markets. 20+ years experience with Toshiba Medical Systems – 20% of worldwide CT scanner installed base. 1 in 5 CT scanners worldwide have an imaging system designed by our team Images courtesy of: Toshiba Medical Systems, Philips Medical Systems & BIR Inc. Images courtesy of: Toshiba Medical Systems, Philips Medical Systems & BIR Inc. XtremeData, Inc.: Confidential Slide 4 Slide 4: XtremeData Inc.: Confidential Information Vision XtremeData’s vision is to build “Accelerated Computing Appliances ” Easy installation – “plug and use” “Appliance” implies No disruption to existing process “Accelerated Computing” implies x86 CPU (“PC”) + FPGA XtremeData, Inc.: Confidential Slide 5 Slide 5: XtremeData Inc.: Confidential Information Strategy Our strategy is to enable “Accelerated Computing Appliances” by: 1. coupling off-the-shelf x86 hardware (“PC’s”) 2. with FPGA accelerators High Volume / Low Cost 3. via a software middleware layer that enables ease-of-use. High Performance We believe that the combination of these 3 key concepts gives us a compelling and sustainable price/performance advantage over the long term. XtremeData, Inc.: Confidential Slide 6 Slide 6: XtremeData Inc.: Confidential Information FPGAs in Computing: Market Environment & Challenges XtremeData, Inc.: Confidential Slide 7 Slide 7: XtremeData Inc.: Confidential Information Market EnvironmentEnvironment………….~2002.~2002 High-Performance Embedded Systems Commodity “PC” systems Specialized CPUs & DSPs + FPGA x86 CPUs Specialized interconnect (Myrinet, Race++, Standard interconnect (PCI-X, GigE) RapidIO..) Custom boards, backplanes Low Volume / High Cost Low Performance High Performance High Volume / Low Cost XtremeData, Inc.: Confidential Slide 8 Slide 8: XtremeData Inc.: Confidential Information Market EnvironmentEnvironment………….today.today High-Performance Embedded Systems Commodity “PC” systems Outperformed by ~3Ghz x86 CPUs Outperformed by FPGAs at high-end Outperformed by IB Specialized CPUs & DSPs + FPGA x86 CPUs – multi-core, 3Ghz Specialized interconnect (Myrinet, Race++, Standard interconnect (PCIe, IB, 10GigE) RapidIO..) Custom boards, backplanes Best choice: x86 CPU+FPGA. High Performance Low Volume / High Cost How to do this? High Performance High Volume / Low Cost Take x86 CPU back to embedded world Bring FPGA forward to x86-COTS world XtremeData, Inc.: Confidential Slide 9 Slide 9: XtremeData Inc.: Confidential Information FPGA in COTS x86: Challenges Physical factors: form factor, power supply, cooling System Architect issues External interfaces: I/O, Memory • How to integrate FPGA into x86-COTS? Communication & Data exchange between FPGA & CPU FPGA interconnect between Computing Blocks Radisys, Gidel, Nallatech FPGA Computing Design engineer (s/w & h/w) issues PHYSICAL Blocks EXTERNAL I/F EXTERNAL Annapolis Microsystems • How to transition from DSP world to FPGAs? COMMUNICATION Soft-CPU, IP, Mercury Computer INTERNAL I/F INTERNAL INNER LOOP ESL tools XtremeData, Inc.: Confidential Slide 10 Slide 10: XtremeData Inc.: Confidential Information Our Solution : build a simple, minimalist board Idea with interfaces to HyperTransport and memory: (Patent Pending)changes to no drop-in replacement for an AMD Opteron with motherboard! Dual-socket AMD Opteron Motherboard: Simply remove Opteron & replace with FPGA Module ! y, heat-sink erver systems ( including FPGA uses all motherboard resources meant for CPU: HyperTransport Links, Memory interface, power suppl XtremeData, Inc.: Confidential Slide 11 Usable with any AMD Opteron (or future Intel CPU) s Mix & match FPGAs, CPUs on quad-socket systems Usable in rack-mount or high-density “blade” server ATCA), where plug-in boards are not feasible Slide 11: XtremeData Inc.: Confidential Information TodayToday’’’’ss 940940----pinpin solutionsolution………… Mechanical Monitoring • Plugs directly into socket-940 • FPGA mastered I2C bus • Fits within AMD-specified retention frame • Voltage monitoring • 68 x 60 mm form factor • Temperature monitoring • Can use off-the-shelf Opteron™ heat sink Test Support HyperTransport Interfaces (HT) • JTAG test port • Multiple HT interfaces • 4 programmable LEDs • 16 bits wide @ 800 M Transfers/s • 8 programmable test pads • Bridging to additional XD1000™modules Flash ROM Memory Interface • 32 MB of CFI FLASH • 128 bits wide DDR-333 memory • Use for FPGA configuration files, or • 5.4 GBytes/s bandwidth application data • Up to four 4GB DIMMs of ECC memory Development Package SRAM • HyperTransport core • 8 MB of Zero Bus Turn-around (ZBT) SRAM • Memory controller core • 800 Mbytes/s bandwidth • Linux device driver • 32 bits wide with parity • FPGA messaging infrastructure • 5 clock cycle latency for reads @ 200MHz FPGA Configuration • Auto FPGA configuration on power-up • Host triggered FPGA reconfiguration Newer Opteron socket solutions on the roadmap… XtremeData, Inc.: Confidential Slide 12 Slide 12: XtremeData Inc.: Confidential Information FPGA-Acceleration Markets XtremeData, Inc.: Confidential Slide 13 Slide 13: XtremeData Inc.: Confidential Information FPGA acceleration markets “Embedded” High Performance Emerging market: Computing Computing Video Medical Imaging Telecom Financial, Database, Scientific Broadcast Consumer • Toshiba Geoscience BioInformatics • Motorola • Set Top Box • DVD creation • GE Medical • Nokia • Cray • Video on Demand • HD Video • Siemens • TimeLogic • Ericcson • SGI • IPTV • Phillips • Mercury • Tarari • Linux NetworX • Progeniq Some examples of companies / applications that are using FPGA acceleration today XtremeData, Inc.: Confidential Slide 14 Slide 14: XtremeData Inc.: Confidential Information FPGAs in High-Performance Computing (HPC) XtremeData, Inc.: Confidential Slide 15 Slide 15: XtremeData Inc.: Confidential Information HPC: “““Burning“BurningBurning””””Issues “ … more than 80% of data centers are already constrained by electrical power, physical space, or cooling capacity. Simply adding more of the same kinds of systems is clearly no solution, …” [Sun Whitepaper on Throughput Computing, Nov 2005] “ … IBM Fellow Bernard Meyerson told the crowd at the Hot Chips conference yesterday that he expects a power crisis of sorts to occur in the server market come 2007. That's when the overall cost of powering and cooling all the servers in the US will outpace the amount of money spent on new server sales.…” [The Register, 23 Aug 2006] “… Google is rumored to have a million servers around the world and, according to a knowledgeable source, is already the top electricity user in at least one large U.S. state. .…” [Fortune, 1 May 2006] "Just think about where there are windmills, dams , and other natural power sources around the world, and that's where you're going to see server farms ,” Ray Ozzie, Chief Software Architect, Microsoft [Fortune, 1 May 2006] Performance/Watt is key…..FPGAs are a viable alternative. XtremeData, Inc.: Confidential Slide 16 Slide 16: XtremeData Inc.: Confidential Information FPGAs in HPC: Challenges HDL-based design flow NOT acceptable! A software-oriented programming model is NECESSARY High-level FPGA design tool ( ESL ) is a must: C-based, MatLab, etc.,? We have ongoing efforts to enable high-level design flow… XtremeData, Inc.: Confidential Slide 17 Slide 17: XtremeData Inc.: Confidential Information ESL: Our first step: PSP for ImpulseC C Source Code Single and double-precision IEEE 754 floating-point arithmetic supported in Compile C to HDL FPGA and inferred from code All Components Connected via Avalon Calls SOPC Builder drives Impulse C Fabric it with scripts Module XDI Memory Controllers SOPC Builder XDI Hypertransport Interface 1) All RTL for FPGA on XD1000 2) Complete Quartus II Project 3) Complete Quartus II Compile Script No user-supplied HDLsource code required XtremeData, Inc.: Confidential Slide 18 Slide 18: XtremeData Inc.: Confidential Information Some FPGA Demo applications… XtremeData, Inc.: Confidential Slide 19 Slide 19: XtremeData Inc.: Confidential Information HPC: Financial Analytics Key Applications Derivatives Trading Black Scholes Model Black-Scholes

FPGA-Acceleration on COTS X86 Platforms University of Mannheim, 16 Feb 2007

Exploring Weak Scalability for FEM Calculations on a GPU-Enhanced Cluster

HDAMA Rev.G User's Guide

Cray XT and Cray XE Y Y System Overview

Product Change Notification

System Design for Telecommunication Gateways

Lewis University Dr. James Girard Summer Undergraduate Research Program 2021 Faculty Mentor - Project Application

Virtualization: Comparision of Windows and Linux

Archana Subramanian Nikhat Farha

A Survey of Reconfigurable Processors

AMD Athlon™ 64 Processor Product Brief

AMD's Early Processor Lines, up to the Hammer Family (Families K8

Lista Sockets.Xlsx