Artificial Intelligence and Machine Learning General Definitions and Fraud/AML Applications

Total Page:16

File Type:pdf, Size:1020Kb

Artificial Intelligence and Machine Learning General Definitions and Fraud/AML Applications Artificial Intelligence and Machine Learning general definitions and fraud/AML applications Frankfurt, April 2019 Group Financial Crime Eric WAGNER Agenda „difference between artificial intelligence and machine learning?“ 1 Current progress? Different types of machine 2 learning 3 approach 4 examples 5 Erste Group applications “By far, the greatest danger of Artificial Intelligence is that and AML synergies people conclude too early that they understand it.” Eliezer Yudkowsky1 6 challenges “Nobody phrases it this way, but I think that artificial intelligence is almost a humanities discipline. It's really an attempt to understand human intelligence and human 7 cognition.” conclusio 2 Sebastian Thrun 8 1 US researche and author. Decision theory and longterm social and philosophical impacts of artificial intelligence 2 German computer scientist and robotic specialist. Former professor for artificial intelligence at Stanford University and vice president of Google. Unfortunately no common definition, but converging opinions and papers EXEMPLARY General Artificial Intelligence Specific Artificial Intelligence „Humanity“ „nothing less than build a machine, a robot • empathy / mood • pass through its childhood, • opinion (having / reasoning) • learns languages like a child, • culture / faith / love • gains knowledge about the world by observing it with its own organs and „visual nature“ • ultimately contemplates about the whole • “human look” human knowledge and intellectual world” • Body movements and -functions • Mimic/gesture Joseph Weizenbaum, MIT AI Laboratory „interaction“ • perception (5+ senses) „Turing Test: a test person tries to identify if • intervention the unknown counterpart is human or a • context (spacial / logic) machine based on interactions [originally via keyboard and screen only] …„ „problem solving“ • recognize/deviate problem (situation) Alan Turing, 1950 • problem resolution (general and specific) • (Machine) Learning ca. 20 years to go for comprehensive artificial intelligence, but in specific areas „superhuman“ performance have already been achieved. Estimation current status AI/ML No „average“ „Humanity“ performance human • empathy / mood • opinion (having / reasoning) • culture / faith / love „visual nature“ • “human look” • Body movements and -functions • Mimic/gesture „interaction“ • perception (5+ senses) • intervention • context (spacial / logic) „problem solving“ • recognize/deviate problem (situation) general specific • problem resolution (general and specific) • (Machine) Learning further focus In general Machine Learning can be separated in three categories … Supervised Learning Unsupervised Learning Reinforcement Learning regression (cont. output) clustering status optimization by rewards classification (discrete output) outlier/anomaly detection Types • labeled training data • no labeled training data • decision process • direct feedback • no feedback • delayed feedback via “reward • predict result • find the “hidden” structure system” • agent evaluates (new) status and initiates actions to further optimize Characteristics own status • recognition: handwriting, speech and • system diagnosis • autonomous cars, robots, elevators, pictures • security-/event detection etc. • recommendation: Spamfilter, Online- • analysis: social network, astronomy • (computer-)games (AlphaGo) ads, recommender systems • market segmentation • marketing strategy optimization • analysis: brainsignals, genes, share Examples prices, weather, … … which can be realized either by statistical machine learning methods (can be re-calculated) or neural networks (cannot be re-cacluated). EXEMPLARY Supervised Learning Unsupervised Learning • Lineare/Ridge Regression1 • LVQ - Linear Vector Quantization (videocodecs: Apple Quicktime and audiocodecs: DTS, Ogg Vorbis) • K-Nearest Neighbor (regression/classification) • K-Means (fast, but heuristic for market segmentation, computer • Decision Trees2 (regression/classification) vision, geo statistics, agriculture) • Logistic Regression3 • HBOS - Histogram Based Outlier Scoring (fraud, structural learning • SVM – Support Vector Machine (binary classification) defects) • Naive Bayes (document-/text classification) • One-Class SVM Statistical machine (learning via „back propagation“) • NN (perceptron, RBM, autoencoder) Clustering • Deep NN (recurrent/LSTM, convolutional, • Kohonen SOM (Self-Organizing Map) • generative adversarial) • Neural Gas • DeepQ/Hierarchical Reinforcement NN neural networks Reinforcement Learning • rarely used an autonomous learning-method, • but as specific evaluation-algorithm of learning successes/-failures • Within supervised/unsupervised learning methods 1 avoidance of overfitting: recognize connections, separation of signal and noise incl. error estimation 2 Decision Trees, incl. Random Forest, Monte-Carlo Decision Tree etc. can be either regression (numerical output) or classification (discrete output) 3 binary problems and analysis of context probability with multiple features: market research, impact analysis Numerous successful Machine Learning implementations (realtime and faster/better than human) … (1/2) Autonomous vehicles Picture recognition and description as well (NVIDIA DRIVE PX) as context queries Object recognition and classification Object recognition and description https://www.youtube.com/embed/0rc4RqYLtEU?end=125&fs= 0&modestbranding=1&showinfo=0&autoplay=1 Computer model generation of environment Natural language question analysis and -answering for object recognition incl. context/relation https://www.youtube.com/embed/PjH_1hEoIDs?end=100&fs= 0&modestbranding=1&showinfo=0&autoplay=1 Numerous successful Machine Learning implementations (realtime and faster/better than human) … (2/2) Autonomous learning of games and Medical Diagnostics simulation of intuitive moves Autonomous learning of computer games (2014) Deep Learning via Computer Vision leads to Deep Q-Network (DQN) + Access to Care and Diagnostic Accuracy Reinforcement Learning https://www.youtube.com/embed/n_- xKr3vF3M?fs=0&modestbranding=1&showinfo=0&autoplay=1 Simulation of „intuitive“ moves (AlphaGo 2016) Monte Carlo Tree Search and for each Branch/Leaf Evaluation … 4 CNNs (3 Policy + 1 Value CNN) IEEE Spectrum … and advanced Machine Learning based approaches for fraudulent use (1/5) Spoofing / Disguising / Phishing everything Spoofcard.com: SNAP_R | Black Hat | blackhat.com • Disguise Caller ID • Disguise voice (sound as man SNAP_R (Social Network Automated Phishing with Reconnaissance) or woman) and add background AI came out in an experiment to get more Twitter users to click malicious links sounds than a human competitor by • Call straight to voicemail • studied how Twitter users behave, • Send spoof texts • then designed and implemented its own phishing bait. https://www.spoofcard.com • The results of the experiment showed that the artificial hacker was able to compose and distribute more phishing tweets, and with a more substantial conversion rate. Authentication Factor Description Key Vulnerabilities Security Key A compact device that Can be hacked/manipulated contains a secure IC chip Ownership which leverages public-key infrastructure Fingerprint Compares fingerprint on Usage of Master-Fingerprint Scanning record with new scans captured optically or electrically Vein Scanning Compares veins on record Can be „stolen“ and forged/faked with new scans captured Inherence optically 3D Facial Compares 3D characteristics 3D-print (mostly Android and Win-Hello, iOS less vulnerable) Recognition of a face on record with new scans captured optically 10 … and advanced Machine Learning based approaches for fraudulent use (2/5) Fake Sound based on written Text: Tacotron 2 - … and even trained to sound like an existing human Generating Human-like Speech from Text (VoCo/Lyrebird.ai) https://google.github.io/tacotron/publications/tacotron2/index.html https://arxiv.org/abs/1711.10433https://arxiv.org/abs/1711.10433https://arxiv.org/abs/ 1711.10433 Generate human-like speech from text using neural networks trained using only speech examples and corresponding text transcripts. Sequence of features (i.e. an 80-dimensional audio spectrogram with frames computed every 12.5 milliseconds) encoding an audio, capture not only • pronunciation of words, but also • various subtleties of human speech, including volume, speed and intonation. Deepminds WaveNet (Parallel WaveNet: Fast High-Fidelity Speech Synthesis), deployed already in Google Assistant Fake Video based on Audio (in Realtime): Adapting Lip-Sync from Audio https://grail.cs.washington.edu/projects/AudioToObama/ Input: Given audio of President Barack Obama Output: Synthesize a photorealistic video of Obama speaking with accurate lip sync, composited into a target video clip. Training: Trained on many hours of his weekly address footage, • a recurrent neural network learns the mapping from raw audio features to mouth shapes. • Given the mouth shape at each time instant, high quality mouth texture was synthesized and • composite with proper 3D pose matching to change what he appears to be saying in a target video to match the input audio track. https://www.youtube.com/embed/9Yq67CjDqvw?start=45&e nd=62&autoplay=1 11 … and advanced Machine Learning based approaches for fraudulent use (3/5) Fake Video based on swapped face (NOT Realtime): DeepFake (IN Realtime): FaceSwap https://www.fakeapp.org/ various implementations, but not fully convincing ... yet https://hackernoon.com/exploring-deepfakes- 20c9947c22d9 http://faceswaplive.com/ https://github.com/arturoc/FaceSubstit
Recommended publications
  • Artificial Intelligence in Health Care: the Hope, the Hype, the Promise, the Peril
    Artificial Intelligence in Health Care: The Hope, the Hype, the Promise, the Peril Michael Matheny, Sonoo Thadaney Israni, Mahnoor Ahmed, and Danielle Whicher, Editors WASHINGTON, DC NAM.EDU PREPUBLICATION COPY - Uncorrected Proofs NATIONAL ACADEMY OF MEDICINE • 500 Fifth Street, NW • WASHINGTON, DC 20001 NOTICE: This publication has undergone peer review according to procedures established by the National Academy of Medicine (NAM). Publication by the NAM worthy of public attention, but does not constitute endorsement of conclusions and recommendationssignifies that it is the by productthe NAM. of The a carefully views presented considered in processthis publication and is a contributionare those of individual contributors and do not represent formal consensus positions of the authors’ organizations; the NAM; or the National Academies of Sciences, Engineering, and Medicine. Library of Congress Cataloging-in-Publication Data to Come Copyright 2019 by the National Academy of Sciences. All rights reserved. Printed in the United States of America. Suggested citation: Matheny, M., S. Thadaney Israni, M. Ahmed, and D. Whicher, Editors. 2019. Artificial Intelligence in Health Care: The Hope, the Hype, the Promise, the Peril. NAM Special Publication. Washington, DC: National Academy of Medicine. PREPUBLICATION COPY - Uncorrected Proofs “Knowing is not enough; we must apply. Willing is not enough; we must do.” --GOETHE PREPUBLICATION COPY - Uncorrected Proofs ABOUT THE NATIONAL ACADEMY OF MEDICINE The National Academy of Medicine is one of three Academies constituting the Nation- al Academies of Sciences, Engineering, and Medicine (the National Academies). The Na- tional Academies provide independent, objective analysis and advice to the nation and conduct other activities to solve complex problems and inform public policy decisions.
    [Show full text]
  • AI Computer Wraps up 4-1 Victory Against Human Champion Nature Reports from Alphago's Victory in Seoul
    The Go Files: AI computer wraps up 4-1 victory against human champion Nature reports from AlphaGo's victory in Seoul. Tanguy Chouard 15 March 2016 SEOUL, SOUTH KOREA Google DeepMind Lee Sedol, who has lost 4-1 to AlphaGo. Tanguy Chouard, an editor with Nature, saw Google-DeepMind’s AI system AlphaGo defeat a human professional for the first time last year at the ancient board game Go. This week, he is watching top professional Lee Sedol take on AlphaGo, in Seoul, for a $1 million prize. It’s all over at the Four Seasons Hotel in Seoul, where this morning AlphaGo wrapped up a 4-1 victory over Lee Sedol — incidentally, earning itself and its creators an honorary '9-dan professional' degree from the Korean Baduk Association. After winning the first three games, Google-DeepMind's computer looked impregnable. But the last two games may have revealed some weaknesses in its makeup. Game four totally changed the Go world’s view on AlphaGo’s dominance because it made it clear that the computer can 'bug' — or at least play very poor moves when on the losing side. It was obvious that Lee felt under much less pressure than in game three. And he adopted a different style, one based on taking large amounts of territory early on rather than immediately going for ‘street fighting’ such as making threats to capture stones. This style – called ‘amashi’ – seems to have paid off, because on move 78, Lee produced a play that somehow slipped under AlphaGo’s radar. David Silver, a scientist at DeepMind who's been leading the development of AlphaGo, said the program estimated its probability as 1 in 10,000.
    [Show full text]
  • Communications
    Oracle Enterprise Session Border Controller with Zoom Phone Premise Peering ( BYOC) and Verizon Business SIP Trunk Technical Application Note COMMUNICATIONS Disclaimer The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle’s products remains at the sole discretion of Oracle. 2 | P a g e Contents 1 RELATED DOCUMENTATION ............................................................................................................................... 5 1.1 ORACLE SBC ........................................................................................................................................................................ 5 1.2 ZOOM PHONE ....................................................................................................................................................................... 5 2 REVISION HISTORY ................................................................................................................................................. 5 3 INTENDED AUDIENCE ............................................................................................................................................ 5 3.1 VALIDATED ORACLE VERSIONS .......................................................................................................................................
    [Show full text]
  • THE FUTURE of IDEAS This Work Is Licensed Under a Creative Commons Attribution-Noncommercial License (US/V3.0)
    less_0375505784_4p_fm_r1.qxd 9/21/01 13:49 Page i THE FUTURE OF IDEAS This work is licensed under a Creative Commons Attribution-Noncommercial License (US/v3.0). Noncommercial uses are thus permitted without any further permission from the copyright owner. Permissions beyond the scope of this license are administered by Random House. Information on how to request permission may be found at: http://www.randomhouse.com/about/ permissions.html The book maybe downloaded in electronic form (freely) at: http://the-future-of-ideas.com For more permission about Creative Commons licenses, go to: http://creativecommons.org less_0375505784_4p_fm_r1.qxd 9/21/01 13:49 Page iii the future of ideas THE FATE OF THE COMMONS IN A CONNECTED WORLD /// Lawrence Lessig f RANDOM HOUSE New York less_0375505784_4p_fm_r1.qxd 9/21/01 13:49 Page iv Copyright © 2001 Lawrence Lessig All rights reserved under International and Pan-American Copyright Conventions. Published in the United States by Random House, Inc., New York, and simultaneously in Canada by Random House of Canada Limited, Toronto. Random House and colophon are registered trademarks of Random House, Inc. library of congress cataloging-in-publication data Lessig, Lawrence. The future of ideas : the fate of the commons in a connected world / Lawrence Lessig. p. cm. Includes index. ISBN 0-375-50578-4 1. Intellectual property. 2. Copyright and electronic data processing. 3. Internet—Law and legislation. 4. Information society. I. Title. K1401 .L47 2001 346.04'8'0285—dc21 2001031968 Random House website address: www.atrandom.com Printed in the United States of America on acid-free paper 24689753 First Edition Book design by Jo Anne Metsch less_0375505784_4p_fm_r1.qxd 9/21/01 13:49 Page v To Bettina, my teacher of the most important lesson.
    [Show full text]
  • Intelligibility of Selected Speech Codecs in Frame-Erasure Conditions
    NTIA Report 17-522 Intelligibility of Selected Speech Codecs in Frame-Erasure Conditions Andrew A. Catellier Stephen D. Voran report series U.S. DEPARTMENT OF COMMERCE • National Telecommunications and Information Administration NTIA Report 17-522 Intelligibility of Selected Speech Codecs in Frame-Erasure Conditions Andrew A. Catellier Stephen D. Voran U.S. DEPARTMENT OF COMMERCE November 2016 DISCLAIMER Certain commercial equipment and materials are identified in this report to specify adequately the technical aspects of the reported results. In no case does such identification imply recommendation or endorsement by the National Telecommunications and Information Administration, nor does it imply that the material or equipment identified is the best available for this purpose. iii PREFACE The work described in this report was performed by the Public Safety Communications Research Program (PSCR) on behalf of the Department of Homeland Security (DHS) Science and Technology Directorate. The objective was to quantify the speech intelligibility associated with selected digital speech coding algorithms subjected to erased data frames. This report constitutes the final deliverable product for this project. The PSCR is a joint effort of the National Institute of Standards and Technology and the National Telecommunications and Information Administration. v CONTENTS Preface..............................................................................................................................................v Figures.........................................................................................................................................
    [Show full text]
  • In-Datacenter Performance Analysis of a Tensor Processing Unit
    In-Datacenter Performance Analysis of a Tensor Processing Unit Presented by Josh Fried Background: Machine Learning Neural Networks: ● Multi Layer Perceptrons ● Recurrent Neural Networks (mostly LSTMs) ● Convolutional Neural Networks Synapse - each edge, has a weight Neuron - each node, sums weights and uses non-linear activation function over sum Propagating inputs through a layer of the NN is a matrix multiplication followed by an activation Background: Machine Learning Two phases: ● Training (offline) ○ relaxed deadlines ○ large batches to amortize costs of loading weights from DRAM ○ well suited to GPUs ○ Usually uses floating points ● Inference (online) ○ strict deadlines: 7-10ms at Google for some workloads ■ limited possibility for batching because of deadlines ○ Facebook uses CPUs for inference (last class) ○ Can use lower precision integers (faster/smaller/more efficient) ML Workloads @ Google 90% of ML workload time at Google spent on MLPs and LSTMs, despite broader focus on CNNs RankBrain (search) Inception (image classification), Google Translate AlphaGo (and others) Background: Hardware Trends End of Moore’s Law & Dennard Scaling ● Moore - transistor density is doubling every two years ● Dennard - power stays proportional to chip area as transistors shrink Machine Learning causing a huge growth in demand for compute ● 2006: Excess CPU capacity in datacenters is enough ● 2013: Projected 3 minutes per-day per-user of speech recognition ○ will require doubling datacenter compute capacity! Google’s Answer: Custom ASIC Goal: Build a chip that improves cost-performance for NN inference What are the main costs? Capital Costs Operational Costs (power bill!) TPU (V1) Design Goals Short design-deployment cycle: ~15 months! Plugs in to PCIe slot on existing servers Accelerates matrix multiplication operations Uses 8-bit integer operations instead of floating point How does the TPU work? CISC instructions, issued by host.
    [Show full text]
  • Understanding & Generalizing Alphago Zero
    Under review as a conference paper at ICLR 2019 UNDERSTANDING &GENERALIZING ALPHAGO ZERO Anonymous authors Paper under double-blind review ABSTRACT AlphaGo Zero (AGZ) (Silver et al., 2017b) introduced a new tabula rasa rein- forcement learning algorithm that has achieved superhuman performance in the games of Go, Chess, and Shogi with no prior knowledge other than the rules of the game. This success naturally begs the question whether it is possible to develop similar high-performance reinforcement learning algorithms for generic sequential decision-making problems (beyond two-player games), using only the constraints of the environment as the “rules.” To address this challenge, we start by taking steps towards developing a formal understanding of AGZ. AGZ includes two key innovations: (1) it learns a policy (represented as a neural network) using super- vised learning with cross-entropy loss from samples generated via Monte-Carlo Tree Search (MCTS); (2) it uses self-play to learn without training data. We argue that the self-play in AGZ corresponds to learning a Nash equilibrium for the two-player game; and the supervised learning with MCTS is attempting to learn the policy corresponding to the Nash equilibrium, by establishing a novel bound on the difference between the expected return achieved by two policies in terms of the expected KL divergence (cross-entropy) of their induced distributions. To extend AGZ to generic sequential decision-making problems, we introduce a robust MDP framework, in which the agent and nature effectively play a zero-sum game: the agent aims to take actions to maximize reward while nature seeks state transitions, subject to the constraints of that environment, that minimize the agent’s reward.
    [Show full text]
  • AI for Broadcsaters, Future Has Already Begun…
    AI for Broadcsaters, future has already begun… By Dr. Veysel Binbay, Specialist Engineer @ ABU Technology & Innovation Department 0 Dr. Veysel Binbay I have been working as Specialist Engineer at ABU Technology and Innovation Department for one year, before that I had worked at TRT (Turkish Radio and Television Corporation) for more than 20 years as a broadcast engineer, and also as an IT Director. I have wide experience on Radio and TV broadcasting technologies, including IT systems also. My experience includes to design, to setup, and to operate analogue/hybrid/digital radio and TV broadcast systems. I have also experienced on IT Networks. 1/25 What is Artificial Intelligence ? • Programs that behave externally like humans? • Programs that operate internally as humans do? • Computational systems that behave intelligently? 2 Some Definitions Trials for AI: The exciting new effort to make computers think … machines with minds, in the full literal sense. Haugeland, 1985 3 Some Definitions Trials for AI: The study of mental faculties through the use of computational models. Charniak and McDermott, 1985 A field of study that seeks to explain and emulate intelligent behavior in terms of computational processes. Schalkoff, 1990 4 Some Definitions Trials for AI: The study of how to make computers do things at which, at the moment, people are better. Rich & Knight, 1991 5 It’s obviously hard to define… (since we don’t have a commonly agreed definition of intelligence itself yet)… Lets try to focus to benefits, and solve definition problem later… 6 Brief history of AI 7 Brief history of AI . The history of AI begins with the following article: .
    [Show full text]
  • Input Formats & Codecs
    Input Formats & Codecs Pivotshare offers upload support to over 99.9% of codecs and container formats. Please note that video container formats are independent codec support. Input Video Container Formats (Independent of codec) 3GP/3GP2 ASF (Windows Media) AVI DNxHD (SMPTE VC-3) DV video Flash Video Matroska MOV (Quicktime) MP4 MPEG-2 TS, MPEG-2 PS, MPEG-1 Ogg PCM VOB (Video Object) WebM Many more... Unsupported Video Codecs Apple Intermediate ProRes 4444 (ProRes 422 Supported) HDV 720p60 Go2Meeting3 (G2M3) Go2Meeting4 (G2M4) ER AAC LD (Error Resiliant, Low-Delay variant of AAC) REDCODE Supported Video Codecs 3ivx 4X Movie Alaris VideoGramPiX Alparysoft lossless codec American Laser Games MM Video AMV Video Apple QuickDraw ASUS V1 ASUS V2 ATI VCR-2 ATI VCR1 Auravision AURA Auravision Aura 2 Autodesk Animator Flic video Autodesk RLE Avid Meridien Uncompressed AVImszh AVIzlib AVS (Audio Video Standard) video Beam Software VB Bethesda VID video Bink video Blackmagic 10-bit Broadway MPEG Capture Codec Brooktree 411 codec Brute Force & Ignorance CamStudio Camtasia Screen Codec Canopus HQ Codec Canopus Lossless Codec CD Graphics video Chinese AVS video (AVS1-P2, JiZhun profile) Cinepak Cirrus Logic AccuPak Creative Labs Video Blaster Webcam Creative YUV (CYUV) Delphine Software International CIN video Deluxe Paint Animation DivX ;-) (MPEG-4) DNxHD (VC3) DV (Digital Video) Feeble Files/ScummVM DXA FFmpeg video codec #1 Flash Screen Video Flash Video (FLV) / Sorenson Spark / Sorenson H.263 Forward Uncompressed Video Codec fox motion video FRAPS:
    [Show full text]
  • Efficiently Mastering the Game of Nogo with Deep Reinforcement
    electronics Article Efficiently Mastering the Game of NoGo with Deep Reinforcement Learning Supported by Domain Knowledge Yifan Gao 1,*,† and Lezhou Wu 2,† 1 College of Medicine and Biological Information Engineering, Northeastern University, Liaoning 110819, China 2 College of Information Science and Engineering, Northeastern University, Liaoning 110819, China; [email protected] * Correspondence: [email protected] † These authors contributed equally to this work. Abstract: Computer games have been regarded as an important field of artificial intelligence (AI) for a long time. The AlphaZero structure has been successful in the game of Go, beating the top professional human players and becoming the baseline method in computer games. However, the AlphaZero training process requires tremendous computing resources, imposing additional difficulties for the AlphaZero-based AI. In this paper, we propose NoGoZero+ to improve the AlphaZero process and apply it to a game similar to Go, NoGo. NoGoZero+ employs several innovative features to improve training speed and performance, and most improvement strategies can be transferred to other nonspecific areas. This paper compares it with the original AlphaZero process, and results show that NoGoZero+ increases the training speed to about six times that of the original AlphaZero process. Moreover, in the experiment, our agent beat the original AlphaZero agent with a score of 81:19 after only being trained by 20,000 self-play games’ data (small in quantity compared with Citation: Gao, Y.; Wu, L. Efficiently 120,000 self-play games’ data consumed by the original AlphaZero). The NoGo game program based Mastering the Game of NoGo with on NoGoZero+ was the runner-up in the 2020 China Computer Game Championship (CCGC) with Deep Reinforcement Learning limited resources, defeating many AlphaZero-based programs.
    [Show full text]
  • AI Chips: What They Are and Why They Matter
    APRIL 2020 AI Chips: What They Are and Why They Matter An AI Chips Reference AUTHORS Saif M. Khan Alexander Mann Table of Contents Introduction and Summary 3 The Laws of Chip Innovation 7 Transistor Shrinkage: Moore’s Law 7 Efficiency and Speed Improvements 8 Increasing Transistor Density Unlocks Improved Designs for Efficiency and Speed 9 Transistor Design is Reaching Fundamental Size Limits 10 The Slowing of Moore’s Law and the Decline of General-Purpose Chips 10 The Economies of Scale of General-Purpose Chips 10 Costs are Increasing Faster than the Semiconductor Market 11 The Semiconductor Industry’s Growth Rate is Unlikely to Increase 14 Chip Improvements as Moore’s Law Slows 15 Transistor Improvements Continue, but are Slowing 16 Improved Transistor Density Enables Specialization 18 The AI Chip Zoo 19 AI Chip Types 20 AI Chip Benchmarks 22 The Value of State-of-the-Art AI Chips 23 The Efficiency of State-of-the-Art AI Chips Translates into Cost-Effectiveness 23 Compute-Intensive AI Algorithms are Bottlenecked by Chip Costs and Speed 26 U.S. and Chinese AI Chips and Implications for National Competitiveness 27 Appendix A: Basics of Semiconductors and Chips 31 Appendix B: How AI Chips Work 33 Parallel Computing 33 Low-Precision Computing 34 Memory Optimization 35 Domain-Specific Languages 36 Appendix C: AI Chip Benchmarking Studies 37 Appendix D: Chip Economics Model 39 Chip Transistor Density, Design Costs, and Energy Costs 40 Foundry, Assembly, Test and Packaging Costs 41 Acknowledgments 44 Center for Security and Emerging Technology | 2 Introduction and Summary Artificial intelligence will play an important role in national and international security in the years to come.
    [Show full text]
  • ELF Opengo: an Analysis and Open Reimplementation of Alphazero
    ELF OpenGo: An Analysis and Open Reimplementation of AlphaZero Yuandong Tian 1 Jerry Ma * 1 Qucheng Gong * 1 Shubho Sengupta * 1 Zhuoyuan Chen 1 James Pinkerton 1 C. Lawrence Zitnick 1 Abstract However, these advances in playing ability come at signifi- The AlphaGo, AlphaGo Zero, and AlphaZero cant computational expense. A single training run requires series of algorithms are remarkable demonstra- millions of selfplay games and days of training on thousands tions of deep reinforcement learning’s capabili- of TPUs, which is an unattainable level of compute for the ties, achieving superhuman performance in the majority of the research community. When combined with complex game of Go with progressively increas- the unavailability of code and models, the result is that the ing autonomy. However, many obstacles remain approach is very difficult, if not impossible, to reproduce, in the understanding of and usability of these study, improve upon, and extend. promising approaches by the research commu- In this paper, we propose ELF OpenGo, an open-source nity. Toward elucidating unresolved mysteries reimplementation of the AlphaZero (Silver et al., 2018) and facilitating future research, we propose ELF algorithm for the game of Go. We then apply ELF OpenGo OpenGo, an open-source reimplementation of the toward the following three additional contributions. AlphaZero algorithm. ELF OpenGo is the first open-source Go AI to convincingly demonstrate First, we train a superhuman model for ELF OpenGo. Af- superhuman performance with a perfect (20:0) ter running our AlphaZero-style training software on 2,000 record against global top professionals. We ap- GPUs for 9 days, our 20-block model has achieved super- ply ELF OpenGo to conduct extensive ablation human performance that is arguably comparable to the 20- studies, and to identify and analyze numerous in- block models described in Silver et al.(2017) and Silver teresting phenomena in both the model training et al.(2018).
    [Show full text]