Embree High Performance Ray Tracing Kernels 3.13.1

Total Page:16

File Type:pdf, Size:1020Kb

Embree High Performance Ray Tracing Kernels 3.13.1 Intel® Embree High Performance Ray Tracing Kernels 3.13.1 August 10, 2021 Contents 1 Embree Overview 2 1.1 Supported Platforms ......................... 3 1.2 Version History ............................ 3 2 Installation of Embree 22 2.1 Windows MSI Installer ........................ 22 2.2 Windows ZIP File .......................... 22 2.3 Linux tar.gz Files ........................... 22 2.4 macOS PKG Installer ........................ 23 2.5 macOS ZIP file ............................ 23 3 Compiling Embree 24 3.1 Linux and macOS .......................... 24 3.2 Windows ............................... 26 3.3 CMake Configuration ......................... 28 4 Using Embree 32 5 Embree API 33 5.1 Device Object ............................. 34 5.2 Scene Object ............................. 34 5.3 Geometry Object ........................... 35 5.4 Ray Queries .............................. 35 5.5 Point Queries ............................. 36 5.6 Collision Detection .......................... 36 5.7 Miscellaneous ............................. 36 6 Upgrading from Embree 2 to Embree 3 37 6.1 Device ................................. 38 6.2 Scene ................................. 38 6.3 Geometry ............................... 38 6.4 Buffers ................................. 40 6.5 Miscellaneous ............................. 40 1 7 Embree API Reference 43 7.1 rtcNewDevice ............................. 43 7.2 rtcRetainDevice ............................ 46 7.3 rtcReleaseDevice ........................... 47 7.4 rtcGetDeviceProperty ........................ 48 7.5 rtcGetDeviceError .......................... 51 7.6 rtcSetDeviceErrorFunction ...................... 52 7.7 rtcSetDeviceMemoryMonitorFunction ............... 53 7.8 rtcNewScene ............................. 55 7.9 rtcGetSceneDevice .......................... 56 7.10 rtcRetainScene ............................ 57 7.11 rtcReleaseScene ............................ 58 7.12 rtcAttachGeometry .......................... 59 7.13 rtcAttachGeometryByID ....................... 60 7.14 rtcDetachGeometry .......................... 61 7.15 rtcGetGeometry ........................... 62 7.16 rtcCommitScene ........................... 63 7.17 rtcJoinCommitScene ......................... 64 7.18 rtcSetSceneProgressMonitorFunction ................ 66 7.19 rtcSetSceneBuildQuality ....................... 68 7.20 rtcSetSceneFlags ........................... 69 7.21 rtcGetSceneFlags ........................... 70 7.22 rtcGetSceneBounds .......................... 71 7.23 rtcGetSceneLinearBounds ...................... 72 7.24 rtcNewGeometry ........................... 73 7.25 RTC_GEOMETRY_TYPE_TRIANGLE ............. 76 7.26 RTC_GEOMETRY_TYPE_QUAD ................ 78 7.27 RTC_GEOMETRY_TYPE_GRID ................ 80 7.28 RTC_GEOMETRY_TYPE_SUBDIVISION ........... 82 7.29 RTC_GEOMETRY_TYPE_CURVE ............... 85 7.30 RTC_GEOMETRY_TYPE_POINT ............... 90 7.31 RTC_GEOMETRY_TYPE_USER ................ 92 7.32 RTC_GEOMETRY_TYPE_INSTANCE ............. 94 7.33 RTCCurveFlags ............................ 96 7.34 rtcRetainGeometry .......................... 97 7.35 rtcReleaseGeometry ......................... 98 7.36 rtcCommitGeometry ......................... 99 7.37 rtcEnableGeometry .......................... 100 7.38 rtcDisableGeometry ......................... 101 7.39 rtcSetGeometryTimeStepCount ................... 102 7.40 rtcSetGeometryTimeRange ..................... 103 7.41 rtcSetGeometryVertexAttributeCount ............... 104 7.42 rtcSetGeometryMask ......................... 105 7.43 rtcSetGeometryBuildQuality .................... 106 7.44 rtcSetGeometryBuffer ........................ 107 7.45 rtcSetSharedGeometryBuffer .................... 108 2 7.46 rtcSetNewGeometryBuffer ...................... 110 7.47 RTCFormat .............................. 111 7.48 RTCBufferType ............................ 113 7.49 rtcGetGeometryBufferData ..................... 115 7.50 rtcUpdateGeometryBuffer ...................... 116 7.51 rtcSetGeometryIntersectFilterFunction ............... 117 7.52 rtcSetGeometryOccludedFilterFunction .............. 120 7.53 rtcFilterIntersection ......................... 121 7.54 rtcFilterOcclusion .......................... 122 7.55 rtcSetGeometryUserData ...................... 123 7.56 rtcGetGeometryUserData ...................... 124 7.57 rtcSetGeometryUserPrimitiveCount ................ 125 7.58 rtcSetGeometryBoundsFunction ................... 126 7.59 rtcSetGeometryIntersectFunction .................. 128 7.60 rtcSetGeometryOccludedFunction .................. 130 7.61 rtcSetGeometryPointQueryFunction ................ 132 7.62 rtcSetGeometryInstancedScene ................... 135 7.63 rtcSetGeometryTransform ...................... 136 7.64 rtcSetGeometryTransformQuaternion ................ 137 7.65 rtcGetGeometryTransform ...................... 138 7.66 rtcSetGeometryTessellationRate ................... 139 7.67 rtcSetGeometryTopologyCount ................... 140 7.68 rtcSetGeometrySubdivisionMode .................. 141 7.69 rtcSetGeometryVertexAttributeTopology .............. 143 7.70 rtcSetGeometryDisplacementFunction ............... 144 7.71 rtcGetGeometryFirstHalfEdge .................... 146 7.72 rtcGetGeometryFace ......................... 148 7.73 rtcGetGeometryNextHalfEdge .................... 150 7.74 rtcGetGeometryPreviousHalfEdge ................. 152 7.75 rtcGetGeometryOppositeHalfEdge ................. 154 7.76 rtcInterpolate ............................. 156 7.77 rtcInterpolateN ............................ 158 7.78 rtcNewBuffer ............................. 160 7.79 rtcNewSharedBuffer ......................... 161 7.80 rtcRetainBuffer ............................ 162 7.81 rtcReleaseBuffer ........................... 163 7.82 rtcGetBufferData ........................... 164 7.83 RTCRay ................................ 165 7.84 RTCHit ................................ 167 7.85 RTCRayHit .............................. 168 7.86 RTCRayN ............................... 169 7.87 RTCHitN ............................... 170 7.88 RTCRayHitN ............................. 171 7.89 rtcInitIntersectContext ........................ 172 7.90 rtcIntersect1 .............................. 174 7.91 rtcOccluded1 ............................. 176 3 7.92 rtcIntersect4/8/16 .......................... 178 7.93 rtcOccluded4/8/16 .......................... 180 7.94 rtcIntersect1M ............................ 182 7.95 rtcOccluded1M ............................ 184 7.96 rtcIntersect1Mp ............................ 186 7.97 rtcOccluded1Mp ........................... 187 7.98 rtcIntersectNM ............................ 188 7.99 rtcOccludedNM ............................ 190 7.100rtcIntersectNp ............................ 192 7.101rtcOccludedNp ............................ 194 7.102rtcInitPointQueryContext ...................... 196 7.103rtcPointQuery ............................. 198 7.104rtcCollide ............................... 200 7.105rtcNewBVH .............................. 202 7.106rtcRetainBVH ............................ 203 7.107rtcReleaseBVH ............................ 204 7.108rtcBuildBVH ............................. 205 7.109RTCQuaternionDecomposition ................... 209 7.110rtcInitQuaternionDecomposition .................. 211 8 Performance Recommendations 212 8.1 MXCSR control and status register ................. 212 8.2 Thread Creation and Affinity Settings ............... 212 8.3 Fast Coherent Rays .......................... 213 8.4 Huge Page Support .......................... 213 8.5 Avoid store-to-load forwarding issues with single rays ....... 214 9 Embree Tutorials 216 9.1 Minimal ................................ 217 9.2 Triangle Geometry .......................... 218 9.3 Dynamic Scene ............................ 219 9.4 Multi Scene Geometry ........................ 220 9.5 User Geometry ............................ 221 9.6 Viewer ................................. 222 9.7 Stream Viewer ............................ 223 9.8 Intersection Filter .......................... 224 9.9 Instanced Geometry ......................... 225 9.10 Multi Level Instancing ........................ 226 9.11 Path Tracer .............................. 227 9.12 Hair .................................. 228 9.13 Curve Geometry ........................... 229 9.14 Subdivision Geometry ........................ 230 9.15 Displacement Geometry ....................... 231 9.16 Grid Geometry ............................ 232 9.17 Point Geometry ............................ 233 9.18 Motion Blur Geometry ........................ 234 4 9.19 Quaternion Motion Blur ....................... 235 9.20 Interpolation ............................. 236 9.21 Closest Point ............................. 237 9.22 Voronoi ................................ 238 9.23 Collision Detection .......................... 239 9.24 BVH Builder ............................. 239 9.25 BVH Access .............................. 240 9.26 Find Embree ............................. 240 9.27 Next Hit ................................ 240 5 Chapter 1 Embree Overview Intel® Embree is a collection of high-performance ray tracing kernels, developed at Intel. The target users of Intel® Embree are graphics application engineers who want to improve the performance of their photo-realistic rendering appli- cation by leveraging Embree’s performance-optimized ray tracing kernels. The kernels are optimized for the latest Intel® processors with support for SSE, AVX, AVX2, and AVX-512 instructions. Intel® Embree supports runtime code selection to
Recommended publications
  • Apple US Education Price List
    US Education Institution – Hardware and Software Price List April 30, 2021 For More Information: Please refer to the online Apple Store for Education Institutions: www.apple.com/education/pricelists or call 1-800-800-2775. Pricing Price Part Number Description Date iMac iMac with Intel processor MHK03LL/A iMac 21.5"/2.3GHz dual-core 7th-gen Intel Core i5/8GB/256GB SSD/Intel Iris Plus Graphics 640 w/Apple Magic Keyboard, Apple Magic Mouse 2 8/4/20 1,049.00 MXWT2LL/A iMac 27" 5K/3.1GHz 6-core 10th-gen Intel Core i5/8GB/256GB SSD/Radeon Pro 5300 w/Apple Magic Keyboard and Apple Magic Mouse 2 8/4/20 1,699.00 MXWU2LL/A iMac 27" 5K/3.3GHz 6-core 10th-gen Intel Core i5/8GB/512GB SSD/Radeon Pro 5300 w/Apple Magic Keyboard & Apple Magic Mouse 2 8/4/20 1,899.00 MXWV2LL/A iMac 27" 5K/3.8GHz 8-core 10th-gen Intel Core i7/8GB/512GB SSD/Radeon Pro 5500 XT w/Apple Magic Keyboard & Apple Magic Mouse 2 8/4/20 2,099.00 BR332LL/A BNDL iMac 21.5"/2.3GHz dual-core 7th-generation Core i5/8GB/256GB SSD/Intel IPG 640 with 3-year AppleCare+ for Schools 8/4/20 1,168.00 BR342LL/A BNDL iMac 21.5"/2.3GHz dual-core 7th-generation Core i5/8GB/256GB SSD/Intel IPG 640 with 4-year AppleCare+ for Schools 8/4/20 1,218.00 BR2P2LL/A BNDL iMac 27" 5K/3.1GHz 6-core 10th-generation Intel Core i5/8GB/256GB SSD/RP 5300 with 3-year AppleCare+ for Schools 8/4/20 1,818.00 BR2S2LL/A BNDL iMac 27" 5K/3.1GHz 6-core 10th-generation Intel Core i5/8GB/256GB SSD/RP 5300 with 4-year AppleCare+ for Schools 8/4/20 1,868.00 BR2Q2LL/A BNDL iMac 27" 5K/3.3GHz 6-core 10th-gen Intel Core i5/8GB/512GB
    [Show full text]
  • Holiday Catalog
    Brilliant for what’s next. With the power to achieve anything. AirPods Pro AppleCare+ Protection Plan†* $29 Key Features • Active Noise Cancellation for immersive sound • Transparency mode for hearing and connecting with the world around you • Three sizes of soft, tapered silicone tips for a customizable fit • Sweat and water resistant1 • Adaptive EQ automatically tunes music to the shape of your ear • Easy setup for all your Apple devices2 • Quick access to Siri by saying “Hey Siri”3 • The Wireless Charging Case delivers more than 24 hours of battery life4 AirPods Pro. Magic amplified. Noise nullified. Active Noise Cancellation for immersive sound. Transparency mode for hearing what’s happening around you. Sweat and water resistant.1 And a more customizable fit for all-day comfort. AirPods® AirPods AirPods Pro with Charging Case with Wireless Charging Case with Wireless Charging Case $159 $199 $249 1 AirPods Pro are sweat and water resistant for non-water sports and exercise and are IPX4 rated. Sweat and water resistance are not permanent conditions. The charging case is not sweat or water resistant. 2 Requires an iCloud account and macOS 10.14.4, iOS 12.2, iPadOS, watchOS 5.2, or tvOS 13.2 or later. 3Siri may not be available in all languages or in all areas, and features may vary by area. 4 Battery life varies by use and configuration. See apple.com/batteries for details. Our business is part of a select group of independent Apple® Resellers and Service Providers who have a strong commitment to Apple’s Mac® and iOS platforms and have met or exceeded Apple’s highest training and sales certifications.
    [Show full text]
  • Apple Accessories & Prices Input Devices Apple Magic Mouse 2 (APPX333) £65.50 Imac, Mac Mini and Accessories Magic Mouse 2 - Space Grey (APPX015) £82.50
    Apple Accessories & Prices Input Devices Apple Magic Mouse 2 (APPX333) £65.50 iMac, Mac Mini and Accessories Magic Mouse 2 - Space Grey (APPX015) £82.50 Magic Trackpad 2 (APPX335) £107.50 Magic Trackpad 2 - Space Grey (APPX016) £124.00 Magic Keyboard (APPK005) £79.95 Magic Keyboard - Numeric (APPK006) £105.00 Magic Keyboard - Numeric - Space Grey (APPK007) £124.00 Adapters and Network Mini DisplayPort to DVI Adapter (APPX117) £23.95 Mini DisplayPort to VGA Adapter (APPX142) £23.95 USB-C to USB-A (APPX281) £14.95 USB-C to Digital AV (APPX099) £62.50 Misc. Accessories USB Retina Superdrive (APPX228) £65.50 HomePod Mini Smart Speakers - Available in Space Grey and Silver (APPX137/138) £82.50 HomePod Smart Speakers - Available in Space Grey and Silver (APPX013/014) £232.50 Many other adapters and accessories also available! Guernsey33 Commercial Arcade, Computers St. Peter Port Tel 01481-728738 Ltd E. & O. E. 18th August 2021 Mac Products & Specifications Mac Products & Specifications Mac Mini - i5 2.6GHz (APPC022) £915.00 iMac 27” i5 3.1GHz - 5K Retina Display (APPC028) £1499.00 3.0GHz 6-Core i5 Processor w/ 9MB shared L3 cache - Turbo Boost 3.1GHz 6-Core i5 Processor (Turbo Boost up to 4.5GHz), 8GB up to 4.1GHz, 8GB DDR4 RAM, 256GB Solid State Drive, Intel UHD DDR4 RAM, 256GB Solid State Drive, Radeon Pro 5300 Graphics Graphics 630, 802.11ac Wi-Fi (802.11 a/b/g/n compatible) & Bluetooth 5.0 (4GB), 802.11ac Wi-Fi (802.11 a/b/g/n compatible) & Bluetooth 5.0 Mac Mini - M1 8-Core Processor (APPC031) £582.00 iMac 27” i5 3.3GHz - 5K Retina Display
    [Show full text]
  • In-Datacenter Performance Analysis of a Tensor Processing Unit
    In-Datacenter Performance Analysis of a Tensor Processing Unit Presented by Josh Fried Background: Machine Learning Neural Networks: ● Multi Layer Perceptrons ● Recurrent Neural Networks (mostly LSTMs) ● Convolutional Neural Networks Synapse - each edge, has a weight Neuron - each node, sums weights and uses non-linear activation function over sum Propagating inputs through a layer of the NN is a matrix multiplication followed by an activation Background: Machine Learning Two phases: ● Training (offline) ○ relaxed deadlines ○ large batches to amortize costs of loading weights from DRAM ○ well suited to GPUs ○ Usually uses floating points ● Inference (online) ○ strict deadlines: 7-10ms at Google for some workloads ■ limited possibility for batching because of deadlines ○ Facebook uses CPUs for inference (last class) ○ Can use lower precision integers (faster/smaller/more efficient) ML Workloads @ Google 90% of ML workload time at Google spent on MLPs and LSTMs, despite broader focus on CNNs RankBrain (search) Inception (image classification), Google Translate AlphaGo (and others) Background: Hardware Trends End of Moore’s Law & Dennard Scaling ● Moore - transistor density is doubling every two years ● Dennard - power stays proportional to chip area as transistors shrink Machine Learning causing a huge growth in demand for compute ● 2006: Excess CPU capacity in datacenters is enough ● 2013: Projected 3 minutes per-day per-user of speech recognition ○ will require doubling datacenter compute capacity! Google’s Answer: Custom ASIC Goal: Build a chip that improves cost-performance for NN inference What are the main costs? Capital Costs Operational Costs (power bill!) TPU (V1) Design Goals Short design-deployment cycle: ~15 months! Plugs in to PCIe slot on existing servers Accelerates matrix multiplication operations Uses 8-bit integer operations instead of floating point How does the TPU work? CISC instructions, issued by host.
    [Show full text]
  • Apple Branded 20210331
    Apple Inc. Please visit the Apple NASPO online store for current product pricing, availability and product information. Consumer MNWNC- Band Part Number Description (MSRP) 102 iMac iMac 21.5"/2.3GHz dual-core 7th-gen Intel Core i5/8GB/256GB SSD/Intel Iris Plus Graphics 640 w/Apple Magic Keyboard, Apple Magic Mouse 1 MHK03LL/A 2 1,099.00 1,049.00 1 MHK23LL/A iMac 21.5" 4K/3.6GHz quad-core 8th-gen Intel Core i3/8GB/256GB SSD/Radeon Pro 555X w/Apple Magic Keyboard and Apple Magic Mouse 2 1,299.00 1,249.00 1 MHK33LL/A iMac 21.5" 4K/3.0GHz 6-core 8th-gen Intel Core i5/8GB/256GB SSD/Radeon Pro 560X w/Apple Magic Keyboard and Apple Magic Mouse 2 1,499.00 1,399.00 1 MXWT2LL/A iMac 27" 5K/3.1GHz 6-core 10th-gen Intel Core i5/8GB/256GB SSD/Radeon Pro 5300 w/Apple Magic Keyboard and Apple Magic Mouse 2 1,799.00 1,699.00 1 MXWU2LL/A iMac 27" 5K/3.3GHz 6-core 10th-gen Intel Core i5/8GB/512GB SSD/Radeon Pro 5300 w/Apple Magic Keyboard & Apple Magic Mouse 2 1,999.00 1,899.00 1 MXWV2LL/A iMac 27" 5K/3.8GHz 8-core 10th-gen Intel Core i7/8GB/512GB SSD/Radeon Pro 5500 XT w/Apple Magic Keyboard & Apple Magic Mouse 2 2,299.00 2,099.00 1 BR332LL/A BNDL iMac 21.5"/2.3GHz dual-core 7th-generation Core i5/8GB/256GB SSD/Intel IPG 640 with 3-year AppleCare+ for Schools - 1,168.00 1 BR342LL/A BNDL iMac 21.5"/2.3GHz dual-core 7th-generation Core i5/8GB/256GB SSD/Intel IPG 640 with 4-year AppleCare+ for Schools - 1,218.00 1 BR3G2LL/A BNDL iMac 21.5" 4K/3.6GHz quad-core 8th-gen Intel Core i3/8GB/256GB SSD/Radeon Pro 555X with 3-year AppleCare+ for Schools - 1,368.00
    [Show full text]
  • The American Express® Card Membership Rewards® 2021 Catalogue WHAT's INSIDE Select Any Icon to Go to Desired Section
    INCENTIVO The American Express® Card Membership Rewards® 2021 Catalogue WHAT'S INSIDE Select any icon to go to desired section THE MEMBERSHIP REWARDS® PROGRAM FREQUENT TRAVELER OPTION FREQUENT GUEST FREQUENT FLYER PROGRAM PROGRAM NON-FREQUENT TRAVELER OPTION GADGETS & HOME & TRAVEL HEALTH & ENTERTAINMENT KITCHEN ESSENTIALS WELLNESS DINING HEALTH & WELLNESS LUXURY HOTEL SHOPPING VOUCHERS VOUCHERS VOUCHERS VOUCHERS FINANCIAL REWARDS TERMS & CONDITIONS THE MEMBERSHIP REWARDS PROGRAM THE MEMBERSHIP REWARDS PROGRAM TAKES YOU FURTHER NON-EXPIRING MEMBERSHIP REWARDS® POINTS Your Membership Rewards® Points do not expire, allowing you to save your Membership Rewards® Points for higher 1 value rewards. ACCELERATED REWARDS REDEMPTION Membership Rewards® Points earned from your Basic and Supplementary Cards are automatically pooled to enable 2 accelerated redemption. COMPREHENSIVE REWARDS PROGRAM The Membership Rewards® Points can be redeemed for a wide selection of shopping gift certificates, dining vouchers, 3 gadgets, travel essential items and more. The Membership Rewards® Points can also be transferred to other loyalty programs such as Airline Frequent Flyer and Hotel Frequent Guest Rewards programs which will allow you to enjoy complimentary flights or hotel stays. *Select any item below to go to the desired section. THE FREQUENT NON- MEMBERSHIP FREQUENT FINANCIAL TERMS & TRAVELER CONDITIONS REWARDS OPTION TRAVELER REWARDS PROGRAM OPTION THE MEMBERSHIP REWARDS PROGRAM PROGRAM OPTIONS AND STRUCTURE Earn one (1) Membership Rewards® Point for every
    [Show full text]
  • Apple US Education Price List
    US Education Institution – Hardware and Software Price List November 10, 2020 For More Information: Please refer to the online Apple Store for Education Institutions: www.apple.com/education/pricelists or call 1-800-800-2775. Pricing Price Part Number Description Date iMac MHK03LL/A iMac 21.5"/2.3GHz dual-core 7th-gen Intel Core i5/8GB/256GB SSD/Intel Iris Plus Graphics 640 w/Apple Magic Keyboard, Apple Magic Mouse 2 8/4/20 1,049.00 MHK23LL/A iMac 21.5" 4K/3.6GHz quad-core 8th-gen Intel Core i3/8GB/256GB SSD/Radeon Pro 555X w/Apple Magic Keyboard and Apple Magic Mouse 2 8/4/20 1,249.00 MHK33LL/A iMac 21.5" 4K/3.0GHz 6-core 8th-gen Intel Core i5/8GB/256GB SSD/Radeon Pro 560X w/Apple Magic Keyboard and Apple Magic Mouse 2 8/4/20 1,399.00 MXWT2LL/A iMac 27" 5K/3.1GHz 6-core 10th-gen Intel Core i5/8GB/256GB SSD/Radeon Pro 5300 w/Apple Magic Keyboard and Apple Magic Mouse 2 8/4/20 1,699.00 MXWU2LL/A iMac 27" 5K/3.3GHz 6-core 10th-gen Intel Core i5/8GB/512GB SSD/Radeon Pro 5300 w/Apple Magic Keyboard & Apple Magic Mouse 2 8/4/20 1,899.00 MXWV2LL/A iMac 27" 5K/3.8GHz 8-core 10th-gen Intel Core i7/8GB/512GB SSD/Radeon Pro 5500 XT w/Apple Magic Keyboard & Apple Magic Mouse 2 8/4/20 2,099.00 BR332LL/A BNDL iMac 21.5"/2.3GHz dual-core 7th-generation Core i5/8GB/256GB SSD/Intel IPG 640 with 3-year AppleCare+ for Schools 8/4/20 1,168.00 BR342LL/A BNDL iMac 21.5"/2.3GHz dual-core 7th-generation Core i5/8GB/256GB SSD/Intel IPG 640 with 4-year AppleCare+ for Schools 8/4/20 1,218.00 BR3G2LL/A BNDL iMac 21.5" 4K/3.6GHz quad-core 8th-gen Intel Core i3/8GB/256GB
    [Show full text]
  • Importance of New Apple Computers
    Importance of New Apple Computers Lorrin R. Garson OPCUG & PATACS December 12, 2020 © 2020 Lorrin R. Garson Rapidly Changing Scene •Some information will have changed within the past few days and even hours •Expect new developments over the next several months 2 A Short Prologue: Computer Systems I’ve Worked On •Alpha Microsystems* (late 1970s ➜ 1990s) •Various Unix systems (1980s ➜ 2000s) Active hypertext •Microsoft Windows (~1985 ➜ 2013) links •Apple Computers (~1986 ➜ 2020) * Major similarities to DEC PDP/11 3 Not me in disguise! No emotional attachment to any computer system 4 Short History of Apple CPUs •1976 Apple I & II; MOS 6502 •1977 Apple III; Synertek 6502B •1985 Macintosh; Motorola 68000 ✓ 68020, 68030 and 68030 •1994 Macintosh; PowerPC 601 ✓ 603, 604, G3, G4 and G5 5 History of Apple Hardware (CPUs) (cont.) •2006 Macintosh; Intel x86 ✓ Yonah, Core Penryn, Nehalem, Westmere, Sandy Bridge, Ivy Bridge, Haswell, Broadwell, Skylake, Kaby Lake, Coffee Lake, Ice Lake, Tiger Lake ✓ 2009 Apple dropped support for PowerPC •2020 Mac Computers; Apple Silicon 6 Terminology •“Apple Silicon” refers to Apple’s proprietary ARM- based hardware •Apple Silicon aka “System* on a Chip” aka “SoC” •“M1” name of the chip implementing Apple Silicon** * Not silicon on a chip ** The M1 is a “superset” of the iPhone A14 chip 7 ARM vs. x86 •ARM uses RISC architecture (Reduced Instruction Set Computing) ✓ Fugaku supercomputer (world’s fastest computer) •x86 uses CISC architecture (Complex Instruction Set Computing) ✓ Intel-based computers •ARM focuses
    [Show full text]
  • Faster Dilithium, Kyber, and Saber on Cortex-A72 and Apple M1
    Neon NTT: Faster Dilithium, Kyber, and Saber on Cortex-A72 and Apple M1 Hanno Becker1, Vincent Hwang2,3, Matthias J. Kannwischer3, Bo-Yin Yang3 and Shang-Yi Yang4 1 Arm Research, Cambridge, UK [email protected] 2 National Taiwan University, Taipei, Taiwan [email protected] 3 Academia Sinica, Taipei, Taiwan [email protected], [email protected] 4 Chelpis Co. Ltd., Taipei, Taiwan [email protected] Abstract. We present new speed records on the Armv8-A architecture for the lattice- based schemes Dilithium, Kyber, and Saber. The core novelty in this paper is the combination of Montgomery multiplication and Barrett reduction resulting in “Barrett multiplication” which allows particularly efficient modular one-known-factor multiplication using the Armv8-A Neon vector instructions. These novel techniques combined with fast two-unknown-factor Montgomery multiplication, Barrett reduction sequences, and interleaved multi-stage butterflies result in significantly faster code. We also introduce “asymmetric multiplication” which is an improved technique for caching the results of the incomplete NTT, used e.g. for matrix-to-vector polynomial multiplication. Our implementations target the Arm Cortex-A72 CPU, on which our speed is 1.7× that of the state-of-the-art matrix-to-vector polynomial multiplication in Kyber [Nguyen–Gaj 2021]. For Saber, NTTs are far superior to Toom–Cook multi- plication on the Armv8-A architecture, outrunning the matrix-to-vector polynomial multiplication by 2.1×. On the Apple M1, our matrix-vector products run 2.1× and 1.9× faster for Kyber and Saber respectively. Keywords: NIST PQC · Armv8-A · Neon · Dilithium · Kyber · Saber 1 Introduction When large quantum computers arrive, Shor’s algorithm [Sho97] will break almost all currently deployed public-key cryptography by solving the integer factorization and the discrete logarithms problems.
    [Show full text]
  • Macbook Air for Business Supercharged. Super Light. Super Productive
    MacBook Air for Business Supercharged. Super light. Super productive. MacBook Air completely redefines what a notebook can do in a work-from-anywhere world. With the groundbreaking efficiency of the Apple M1 chip, the power of macOS Big Sur, and stunning performance in a fanless design, MacBook Air also offers incredible value. Starting at only $999, it’s a smarter investment for business than ever before. Powered by the Apple M1 chip Better everyday work experience. Small chip. Giant leap for business. Apps for work with new possibilities. MacBook Air is powered by the revolutionary M1, Apple’s first chip designed MacBook Air with the M1 chip and macOS runs common third-party business apps specifically for the Mac. With its industry-leading performance per watt, M1 like Microsoft Word, PowerPoint and Excel, Slack and Zoom. So employees can do the delivers up to 3.5x faster CPU, up to 5x faster GPU, up to 9x faster machine work they need to do in the apps they already use. When upgrading to M1, Rosetta 2 learning capabilities, and up to 18 hours of battery life.1,2 M1 transforms the ensures that existing Mac apps work out of the box. And for the first time, employees Mac experience. can run popular iOS and iPadOS apps directly on Mac. All-day battery life in a fanless design. Works seamlessly with iPhone and iPad. MacBook Air has advanced energy-saving technologies that help extend battery MacBook Air is the perfect companion to iPhone and iPad. With Continuity, employees life — with up to 15 hours of wireless web browsing and up to 18 hours of video can move easily between devices throughout their workday.
    [Show full text]
  • Apple M1 Foreshadows Rise of RISC-V | by Erik Engheim | Dec, 2020 | Medium
    12/23/2020 Apple M1 foreshadows Rise of RISC-V | by Erik Engheim | Dec, 2020 | Medium Get started Open in app Erik Engheim Follow 5.1K Followers About You have 1 free member-only story left this month. Sign up for Medium and get an extra one Apple M1 foreshadows Rise of RISC-V The M1 is the beginning of a paradigm shift, which will benefit RISC-V microprocessors, but not the way you think. Erik Engheim 4 days ago · 15 min read By now it is pretty clear that Apple’s M1 chip is a big deal. And the implications for the rest of the industry is gradually becoming clearer. In this story I want to talk about a https://erik-engheim.medium.com/apple-m1-foreshadows-risc-v-dd63a62b2562 1/17 12/23/2020 Apple M1 foreshadows Rise of RISC-V | by Erik Engheim | Dec, 2020 | Medium connection to RISC-V microprocessors which may not be obvious to most readers. Get started Open in app Let me me give you some background first: Why Is Apple’s M1 Chip So Fast? In that story I talked about two factors driving M1 performance. One was the use of massive number of decoders and Out-of-Order Execution (OoOE). Don’t worry it that sounds like technological gobbledegook to you. This story will be all about the other part: Heterogenous computing. Apple is aggressively pursued a strategy of adding specialized hardware units, I will refer to as coprocessors throughout this article: GPU (Graphical Processing Unit) for graphics and many other tasks with a lot of data parallelism (do the same operation on many elements at the same time).
    [Show full text]
  • Apple M1 Chip Small Chip, Huge Leap
    Apple M1 Chip Small chip, huge leap Every once in a while the world of technology makes a giant leap forward. The first personal computers, wide adoption of the internet and the first iPhones were not only fascinating and useful technology in and of themselves, but they also transformed an entire industry, spurring competitors and collaborators to new heights. The Apple M1 chip heralds one of those leaps. Reviewers love the M1 Apple's ARM-based M1 chip will mean astounding efficiency, speed and performance for Mac and Mac-based programs and frameworks, and reviews were . gushing might not be too strong a word for it. One reviewer for engadget.com positively swooned over his experience with the new M1-based MacBook Air1: “Apple's new MacBook Air is stunningly fast. It's raring to go the instant you open its lid. Want to browse the web? Watch it load bloated sites faster than you've ever seen on a laptop. Want to play some games? Step back as it blows away every ultraportable, with no fan noise to get in the way. And if you need to take a break, don't worry. It's got enough battery life to last you all day. Using the new MacBook Air is like stepping into a new world where we can demand much more from ultraportables.” 1 “MacBook Air M1 review: Faster than most PCs, no fan required,” engadget.com, November 17, 2020 One gobsmacked developer pointed out on his Twitter feed that Xcode 12.3 beta unzips in five minutes on an M1 Apple Silicon-based machine versus 13 minutes and 22 seconds on And Jamf’s own CEO Dean Hager, after ordering his own a device using Intel i9.
    [Show full text]