The Evolution of Mobile Technology Series

Part 3: The Evolution of Mobile Processing Architectures

June 24, 2009

Moderated by Jim McGregor Chief Technology Strategist In-Stat

1 Introduction

‹ Welcome to the Evolution of Mobile Technology webinar series featuring: • Designing of High-Performance and All-Day Battery life (Completed: Download replay now) • Design Challenges of Supporting Multiple Connectivity Technologies Architectures (Completed: Download replay now) • The Evolution of Mobile Processing Architectures • Creating Flexible Designs for Future Features and Applications • The Impact of the Cloud on Mobile Devices • The Future of Wireless Technologies ‹ Today’s host: • Jim McGregor, Chief Technology Strategist, In-Stat ‹ Agenda • 5-minute overview • 40-minute discussion by panelists • 15-minute live Q&A ‹ Archive of webinar available at: • www.ti.com/wirelesspresenations • www.instat.com 2 Panelists

‹ Brian Carlson • OMAP Platform Marketing Manager, TI’s OMAP Platform Business • Responsible for definition and management of future mobile application platforms • Serves on Mobile Industry Interface Alliance Board of Directors • 25 years in technology marketing, business development, and engineering in DSP, communications, and multimedia applications • Experience at DNA Enterprises, E-Systems, Hyperception, and LSI Logic • Recognized by TI as Member of Group Technical Staff ‹ John Goodacre • Director of program management for ARM’s processor division • Responsible for application processor’s technology roadmap including definition and market development of the ARM MPCore processor technology • More than 20 years experience of realizing new technologies in the engineering industry • Previously worked for Microsoft specializing in enterprise software

3 Overview

• The Increasing requirements & options for mobile performance (Jim McGregor)

• Mobile processor technology trends (Brian Carlson)

• Leveraging symmetric in mobile devices (John Goodacre)

4 Recap

Future of Mobility Performance is Inherent • Communications • Improvements –Voice – Communication speeds – Messaging – Display resolution – Social Networking – Graphics – Navigation – Storage capacity • Entertainment – Applications Processing –Audio • Additional Features – Video – Touch screens –Gaming – Wi-Fi connectivity – Internet – Motion control • Computing – Enhanced I/O – Content creation & manipulation – Productivity Apps

5 Increased performance requirements

Improvements

• Applications 120 – HD media content – 3D graphics 100 – Content creation & manipulation 80 – Interactive web – Computing 60

• Communication Link % Increase – Advanced signals & 40 algorithms – Increased data rates 20 – Multiple streams 0 • I/O EV-DO HSPA+ LTE 802.11b 802.11g 802.11n

– Increased display Source: In-Stat, 6/09 resolution – Increased interfaces & bandwidth

6 Mobile processor technologies

• Processor cores

•DSP cores Processing Memory Core(s) •GPUs DPS Graphics •Memory Core(s) • Dedicated Functional IP Baseband Audio & –Audio Processing Video

– Video RF Dedicated Accelerators –I/O – Accelerators –RF

7 Multiple chip options

Monolithic die

Multi-Chip Module (MCM)

System in Package (SiP)

Micro- GPU/ Accelerator processor Chipset Multiple chips

8 Decision factors

Market Considerations Design Considerations • Solutions • Time-to-Market – Space • Differentiation – Power/Battery Life •Cost – Thermal Limits – Features – Performance

• Product – Support – Tools – Software – Roadmaps – Scalability/Reusability

9 Multiple mobile platforms

3,000

2,500 Edutainment Toys Handheld Games 2,000 Personal Navigation Device Personal Media Player 1,500 Cellphone Mini-notebook PC Million Units of 1,000 Notebook PC UMPC 500 MID

- 2007 2008 2009 2010 2011 2012 2013

Source: In-Stat, 6/09 10 Mobile Processor Technology Trends

Brian Carlson Platform Marketing Manager, TI’s OMAP Platform Business Unit

11 Mobile processor trends

• The platforms for future applications growth – paradigm shift from PC to mobile devices with higher volumes – new experiences enabled by mobility and “greater than the sum of parts”

• Highly-integrated SoCs becoming even more integrated and leveraging multicore technology aggressively for higher performance – with low-power design being paramount

• Dramatic increase in processing performance and ubiquity of 3G data networks enable new applications and use cases – always connected with laptop-like performance, HD multimedia, sensors…

• Expanded market beyond and into consumer electronics and into mobile computing – leverage platform investment across multiple markets

12 Evolution of mobile processors Technology 2007-8 2009-10 2011-12 Improvement ARM® Processor ARM11 Cortex-A8 Dual Cortex-A9 10x + 470-700 DMIPS 1,200-2,000 DMIPS 5,000+ DMIPS SMP Ext. Display VGA XGA WUXGA + HDMI 8x + HDMI Video VGA-30fps 720p-30fps 1080p-30fps 7x 3D Graphics 2 Mtri/s 10+ Mtri/s 20+ Mtri/s 10x + OpenGL ES 1.1 OpenGL ES 2.0 OpenGL ES 2.0 Pgm. shaders Imaging 3-5 MP 8-12 MP 16-20 MP 7x Audio 15 hrs 40 hrs 140+ hrs 10x DDR Memory 128-256 MB 256-512 MB 1-2GB 8x Mass Storage 8-16 GB 16-32 GB 64-128 GB 8x 90 nm 65/45 nm 45 nm / beyond 3+ nodes

Notes: 1. Dates shown are approximate mobile handset availability dates. 2. Features are capabilities of high-end mobile devices in timeframes; not necessarily specific product specifications. ~10x improvement over a four year period! 13 Mobile processor architectures • Architecture and integration of a mobile processor are driven by primary factors related to market and end-product Performance Low-power, highly-integrated D Power B SoC required to meet battery B A Cost life, cost and form factor R P Size F • There are also manufacturer preferences – Thin modem -vs- fat modem; RF integration; system components • Typically lower-tier products offer higher integration, but with less processing performance to reduce cost • Standalone applications processors dominant in higher-tier for highest performance and choice of separate modem/RF – Don’t lock modem and application processing technologies to same development schedule – different innovation cycles

– Combine best-in-class of each technology 14 Modem integration • TI is focused on offering the best system solution combining modems from multiple vendors with our industry-leading OMAP™ mobile applications processors – Stacked die in one package (e.g., ST-E U380…) • Enabled by TI OMAP innovative die-to-die / chip-to-chip interfaces – Competitive side-by-side system solutions – Supported by multiple mobile operating systems – TI differentiator

• This approach broadens our engagements and allows us to meet wide variety of customer needs – Custom joint solutions – major OEMS / modem partners – Customer preferences for certain modem vendors – Integration with wide variety of module vendors – Multi-standard support • HSPA+, CDMA EV-DO, TD-SCDMA, LTE, WiMAX… 15 Future mobile processor challenges

• Need for higher CPU performance to provide enhanced mobile computing user experience – Faster boot, applications and responsiveness – New use cases and features that will transform mobile devices and innovate into new products • Ability to create and play growing HD multimedia content – 1080p-30 video and beyond – 3D stereoscopic, multi-megapixel images and MV video – Console-quality graphics for advanced user interfaces and games – Output on multiple HD displays (internal/external) • Ability to handle multiple, high-speed data streams simultaneously – 4G modem, USB 3.0, MIPI® Alliance M-PHY interfaces And doing all this within a mobile power budget of a few hundred milliwatts ! Multiple, high-speed streams

16 Multicore approach is needed • performance has hit wall as process migration no longer provides huge increase, delivering higher IPC is expensive and using overdrive voltage increases power exponentially – Symmetric Multi-Processing (SMP) enables multiple processor cores to work together to provide scalable performance on demand • Application-specific cores are more efficient and required to meet the high-performance and quality-of-service needs moving forward • Multicore provides ability to turn cores on/off as needed as low power is critical for mobile devices – Inactive cores burn power if they are just in standby

A multicore approach with SMP cores for general-purpose processing combined with application-specific cores meets mobile processor performance/power needs.

17 Value of application-specific core

• Video is a good example of how an application-specific core meets mobile performance/power constraints • Video implementation evolution over three generations of mobile processors –DSP → DSP + hardware acceleration → Dedicated video core

DSP only DSP+HWA Video Core Area 1.0x 1.2x 0.5x Performance 1.0x 2.0x 6.0x Power 1.0x 0.8x 0.5x Flexibility 1.0x 0.5x 0.0x • ~300 MHz DSP supports H.264 BL D1 @ 30fps – Same size/frequency video core can do 1080p @ 30fps 18 Ref: “Media Processor Architecture for Video and Imaging on Camera Phones” – ICASSP-2008, Meehan, et al. TI OMAP™ mobile processor evolution

• Breakthrough computing and multimedia performance • Extending multicore with SMP • Headroom and flexibility to address tomorrow’s applications • Aggressive for extended battery life

www.ti.com/omap4_platform

19 Multicore mobile processor

OMAP™ 4 Platform Dual-core App-specific SMP Multicore

Optimal mix of multiple cores working together for best performance/power 20 Summary

Multicore technology is critical for mobile processors to meet demanding performance and low-power requirements

SMP is next step in mobile processor CPU evolution – enabling scalable power and performance on demand

Application-specific cores provide performance, quality-of-service and power-efficient offload of the main CPU

21 Leveraging symmetric multiprocessing in mobile devices John Goodacre Director, Program Management ARM Processor Division

22 SMP benefits for mobile devices

Advanced Processing • Multicore solutions provide flexibility to meet processing demands • SMP addresses mobile performance and power challenges • Combination of multicore technologies is foundation for future processing needs

Benefits • Scalability with multiple cores to deliver right mix of performance/power • Ability to adjust dynamically to use case • Leverage platform to scale across tiers of products and into the future • smartphone → MID → mobile consumer → smartbook products • Solid foundation for supporting future applications that is transparent to software

23 Symmetric Multi Processing (SMP)

SMP is a load-distribution software architecture that determines the roles of CPU cores dynamically

Homogeneous Scalable Shared Memory Task Task Task Task Task Task Task Task Flexible Symmetric Task Task Task Task Operating System Easy to program coherent CPU CPU CPU CPU Abstracted Tightly coupled MPCore processor

24 Mass adoption of SMP capabilities

Vendor OS SMP WindRiver VxWorks 6.6 SMP Y eSol eT-kernel Multicore Edition Y Express Logic ThreadX Y QNX Neutrino RTOS Y Green Hills INTEGRITY 10 Y Montavista MobiLinux 5.0 Y kernel.org Linux 2.6+ Y Symbian Symbian OS 9+ Y Microsoft WinCE (*) Mentor Graphics Nucleus PLUS RTOS (*) Solaris Open Solaris Y etc etc (*)

(*) Contact vendor for further details Vendors are at different stages of ARM MPCore support Large ecosystem of SMP aware OSs and RTOSs

25 Concurrency within today’s software

RichRich andand completecomplete cataloguecatalogue ofof applicationsapplications availableavailable

NoNo softwaresoftware changeschanges requiredrequired

The SMP Operating System automatically FullFull backwardsbackwards The SMP Operating System automatically andand forwardsforwards time-slices tasks across available CPUs compatibilitycompatibility

26 Browsers are good candidates for SMP Windows Internet Explorer

SMP load balancing across dualcore

100% utilization when running on single CPU

SMP vs 1CPU (affinity applied to CPU0 for key apps) • SMP browsing: Linux Firefox MAC OS Safari – Improved UI responsiveness – Faster rendering of complex pages – Ability to lower power consumption – Available today

27

27 Creating explicit parallelism ANALYSE Analyse the UP application/problem to • Required if single find parallelisable areas application needs more performance than a single CPU DECOMPOSE – The single performance Decompose problems into independent – Cortex-A9 has the highest single tasks for parallel execution: thread performance of any embedded • Task Decomposition processor • Functional Block Partitioning • Data Decomposition • A workload well balanced across multiple CPU can operate at lower voltages therefore consume less PARALLELISE energy for the required Parallelise using threading industry performance standard APIs, libraries and models such as POSIX Threads and OpenMP 28 SMP programming resources SMP supported by Wide spread industry all main OS/RTOSs standards support for SMP programming

431 hits for books on pthreads !

Great number of freely Great number of available SMP resources on MP programming tutorials and parallelism Vast amount of technical resources available

29 Performance scalability • Must understand memory system to understand if multicore will scale with number of CPU • If DDR can only satisfy a single CPU, then a memory bound application will not scale with multicore • Applications which are cache bound scale linear with ARM MPCore technology • Multicore performance gain is related to cache miss rate and DDR performance

30 CoreMarks: Absolute performance

¾¾Today’sToday’s Cortex-A8Cortex-A8 providesprovides comparablecomparable performanceperformance toto IntelIntel AtomAtom ¾¾FirstFirst GenerationGeneration Cortex-A9Cortex-A9 devicesdevices willwill provideprovide comparablecomparable performanceperformance toto dual-dual- corecore AtomsAtoms ¾¾High-speedHigh-speed implementationimplementation ofof Cortex-A9Cortex-A9 cancan provideprovide desktopdesktop levellevel performanceperformance

EEMBC CoreMark™ at device speed 31 ARM MPCore for low power operation

32 Summary: Benefits of MPCore solutions PERFORMANCE ARM11 MPCore: 650 DMIPS Æ 2600 DMIPS

Cortex-A9 MPCore: 2000 DMIPS Æ 8000 DMIPS

LOWER POWER Less power consumption than UP per equivalent performance throughput More CPUs at lower frequency with ability of individual power-off

SCALABILITY Add/enable additional CPUs for on demand performance increase Scalable system expansion to leverage next-generation system requirements

PORTABILITY Flexible, ready-available, support existing software, with programming models to suite application requirements FLEXIBILITY Isolate real-time requirements from high-performance application deployment through advanced configurations

33 Summary

• Mobile processor requirements are scaling exponentially – Applications (HD media content, creation & manipulation) – Features (HD & multi-display options) – Higher bandwidth & multiple data streams – Usage models (multiple applications, entertainment, device consolidation) • Technology creates diversity – There are many ways to increase performance – Market & design considerations must be considered • Multicore solutions are required – To meet current and future performance levels & power limitations – Heterogeneous for increased performance & efficiency – SMP required for improved core performance & parallelism

34 Q & A

• To participate, click on the Ask a Question link on the left side of the interface; enter your question in the box on the screen; hit “Submit.” We’ll answer them during the Q&A session or after the webcast.

www.ti.com/wirelesspresentations community.ti.com/blogs/mobilemomentum

35 Contact information

Brian Carlson Platform Marketing Manager, OMAP Platform Business Unit Texas Instruments [email protected] John Goodacre Director of Program Management, ARM’s processor division ARM [email protected] Jim McGregor Chief Technology Strategist In-Stat [email protected]

36