Safety Engineering for Artificial General Intelligence
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Safety of Artificial Superintelligence
Environment. Technology. Resources. Rezekne, Latvia Proceedings of the 12th International Scientific and Practical Conference. Volume II, 180-183 Safety of Artificial Superintelligence Aleksejs Zorins Peter Grabusts Faculty of Engineering Faculty of Engineering Rezekne Academy of Technologies Rezekne Academy of Technologies Rezekne, Latvia Rezekne, Latvia [email protected] [email protected] Abstract—The paper analyses an important problem of to make a man happy it will do it with the fastest and cyber security from human safety perspective which is usu- cheapest (in terms of computational resources) without ally described as data and/or computer safety itself with- using a common sense (for example killing of all people out mentioning the human. There are numerous scientific will lead to the situation that no one is unhappy or the predictions of creation of artificial superintelligence, which decision to treat human with drugs will also make him could arise in the near future. That is why the strong neces- sity for protection of such a system from causing any farm happy etc.). Another issue is that we want computer to arises. This paper reviews approaches and methods already that we want, but due to bugs in the code computer will presented for solving this problem in a single article, analy- do what the code says, and in the case of superintelligent ses its results and provides future research directions. system, this may be a disaster. Keywords—Artificial Superintelligence, Cyber Security, The next sections of the paper will show the possible Singularity, Safety of Artificial Intelligence. solutions of making superintelligence safe to humanity. -
Liability Law for Present and Future Robotics Technology Trevor N
Liability Law for Present and Future Robotics Technology Trevor N. White and Seth D. Baum Global Catastrophic Risk Institute http://gcrinstitute.org * http://sethbaum.com Published in Robot Ethics 2.0, edited by Patrick Lin, George Bekey, Keith Abney, and Ryan Jenkins, Oxford University Press, 2017, pages 66-79. This version 10 October 2017 Abstract Advances in robotics technology are causing major changes in manufacturing, transportation, medicine, and a number of other sectors. While many of these changes are beneficial, there will inevitably be some harms. Who or what is liable when a robot causes harm? This paper addresses how liability law can and should account for robots, including robots that exist today and robots that potentially could be built at some point in the near or distant future. Already, robots have been implicated in a variety of harms. However, current and near-future robots pose no significant challenge for liability law: they can be readily handled with existing liability law or minor variations thereof. We show this through examples from medical technology, drones, and consumer robotics. A greater challenge will arise if it becomes possible to build robots that merit legal personhood and thus can be held liable. Liability law for robot persons could draw on certain precedents, such as animal liability. However, legal innovations will be needed, in particular for determining which robots merit legal personhood. Finally, a major challenge comes from the possibility of future robots that could cause major global catastrophe. As with other global catastrophic risks, liability law could not apply, because there would be no post- catastrophe legal system to impose liability. -
Classification Schemas for Artificial Intelligence Failures
Journal XX (XXXX) XXXXXX https://doi.org/XXXX/XXXX Classification Schemas for Artificial Intelligence Failures Peter J. Scott1 and Roman V. Yampolskiy2 1 Next Wave Institute, USA 2 University of Louisville, Kentucky, USA [email protected], [email protected] Abstract In this paper we examine historical failures of artificial intelligence (AI) and propose a classification scheme for categorizing future failures. By doing so we hope that (a) the responses to future failures can be improved through applying a systematic classification that can be used to simplify the choice of response and (b) future failures can be reduced through augmenting development lifecycles with targeted risk assessments. Keywords: artificial intelligence, failure, AI safety, classification 1. Introduction Artificial intelligence (AI) is estimated to have a $4-6 trillion market value [1] and employ 22,000 PhD researchers [2]. It is estimated to create 133 million new roles by 2022 but to displace 75 million jobs in the same period [6]. Projections for the eventual impact of AI on humanity range from utopia (Kurzweil, 2005) (p.487) to extinction (Bostrom, 2005). In many respects AI development outpaces the efforts of prognosticators to predict its progress and is inherently unpredictable (Yampolskiy, 2019). Yet all AI development is (so far) undertaken by humans, and the field of software development is noteworthy for unreliability of delivering on promises: over two-thirds of companies are more likely than not to fail in their IT projects [4]. As much effort as has been put into the discipline of software safety, it still has far to go. Against this background of rampant failures we must evaluate the future of a technology that could evolve to human-like capabilities, usually known as artificial general intelligence (AGI). -
Artificial General Intelligence and the Future of the Human Race Bryon Pavlacka
ARTIFICIAL GENERAL INTELLIGENCE AND THE FUTURE OF THE HUMAN RACE Bryon Pavlacka Artificial Intelligence is all around us. It manages be used against humanity. Of course, such threats are not your investments, makes the subway run on time, imaginary future possibilities. Narrow AI is already used diagnoses medical conditions, searches the internet, solves by the militaries of first world countries for war purposes. enormous systems of equations, and beats human players Consider drones such as the Northrop Grumman X-47B, at chess and Jeopardy. However, this “narrow AI,” designed an Unmanned Combat Aerial Vehicle that is being tested for solving specific, narrow problems, is something by the US Navy (DefenseTech.org, 2011). That’s right, there distinctly different from Artificial General Intelligence, or is no pilot. Of course, the drone can be given orders, but “AGI”, true thinking machines with human-like general the exact way in which those orders are carried out will intelligence (Wang, Goertzel, & Franklin, 2008, p. v). While be left up to the drone’s Narrow AI system. Whether such AGI is not rigidly defined, it is often envisioned as being systems will ever be extended toward general intelligence self-aware and capable of complex thought, and has is currently unknown. However, the US military has shown BSJ been a staple of science fiction, appearing prominently in interest in producing and controlling generally intelligent popular films such as 2001: A Space Odyssey, Terminator, killing machines as well, as made evident by a paper called and I, Robot. In each of these films, the machines go “Governing Lethal Behavior” by Ronald C. -
Resume / CV Prof. Dr. Hugo De GARIS
Resume / CV Prof. Dr. Hugo de GARIS Full Professor of Computer Science Supervisor of PhD Students in Computer Science and Mathematical Physics Director of the Artificial Brain Laboratory, Department of Cognitive Science, School of Information Science and Technology, Xiamen University, Xiamen, Fujian Province, CHINA [email protected] http://www.iss.whu.edu.cn/degaris Education PhD, 1992, Brussels University, Brussels, Belgium, Europe. Thesis Topic : Artificial Intelligence, Artificial Life. Who’s Who Prof. Dr. Hugo de Garis, biographical entry in Marquis’s “Who’s Who in Science and Engineering”, USA, 2008. Editorial Board Memberships of Academic Journals (5) 1. Executive Editor of the “Journal of Artificial General Intelligence” (JAGI), an online journal at http://journal.agi-network.org (from 2008). 2. “Engineering Letters ” journal (International Association of Engineers, IAENG) 3. “Journal of Evolution and Technology ” (Institute for Ethics and Emerging Technologies, IEET) 4. “Evolutionary Computation ” journal (MIT Press) (former editorial board member, 1992-2007). 5. “Neurocomputing ” journal (Elsevier) (former editorial board member, 2000- 2006) Publications Summary Movie 1 Books 4 Journal Articles 27 Conference Papers 100 Book Chapters 15 Total 147 Movie (1) 6 minutes in the movie “Transcendent Man : The Life and Ideas of Ray Kurzweil”, Ptolemaic Productions, 2009. The movie discusses the rise of massively intelligent machines. Books (4) Hugo de Garis, “The Artilect War : Cosmists vs. Terrans : A Bitter Controversy Concerning Whether Humanity Should Build Godlike Massively Intelligent Machines”, ETC Publications, ISBN 0-88280-153-8, 0-88280-154-6, 2005. (Chinese version) “History of Artificial Intelligence –The Artilect War”, Tsinghua University Press, 2007. Hugo de Garis, “Multis and Monos : What the Multicultured Can Teach the Monocultured : Towards the Growth of a Global State”, ETC Publications, ISBN 978- 088280162-9, 2009. -
Leakproofing the Singularity Artificial Intelligence Confinement Problem
Roman V.Yampolskiy Leakproofing the Singularity Artificial Intelligence Confinement Problem Abstract: This paper attempts to formalize and to address the ‘leakproofing’ of the Singularity problem presented by David Chalmers. The paper begins with the definition of the Artificial Intelli- gence Confinement Problem. After analysis of existing solutions and their shortcomings, a protocol is proposed aimed at making a more secure confinement environment which might delay potential negative effect from the technological singularity while allowing humanity to benefit from the superintelligence. Keywords: AI-Box, AI Confinement Problem, Hazardous Intelligent Software, Leakproof Singularity, Oracle AI. ‘I am the slave of the lamp’ Genie from Aladdin 1. Introduction With the likely development of superintelligent programs in the near future, many scientists have raised the issue of safety as it relates to such technology (Yudkowsky, 2008; Bostrom, 2006; Hibbard, 2005; Chalmers, 2010; Hall, 2000). A common theme in Artificial Intelli- gence (AI)1 safety research is the possibility of keeping a super- intelligent agent in a sealed hardware so as to prevent it from doing any harm to humankind. Such ideas originate with scientific Correspondence: Roman V. Yampolskiy, Department of Computer Engineering and Computer Sci- ence University of Louisville. Email: [email protected] [1] In this paper the term AI is used to represent superintelligence. Journal of Consciousness Studies, 19, No. 1–2, 2012, pp. 194–214 Copyright (c) Imprint Academic 2011 For personal use only -- not for reproduction LEAKPROOFING THE SINGULARITY 195 visionaries such as Eric Drexler, who has suggested confining transhuman machines so that their outputs could be studied and used safely (Drexler, 1986). -
Perfection: United Goal Or Divisive Myth? a Look Into the Concept of Posthumanism and Its Theoretical Outcomes in Science Fiction
PERFECTION: UNITED GOAL OR DIVISIVE MYTH? A LOOK INTO THE CONCEPT OF POSTHUMANISM AND ITS THEORETICAL OUTCOMES IN SCIENCE FICTION by Rebecca McCarthy B.A., Southern Illinois University, 2009 M.A., Southern Illinois University, 2013 A Thesis Submitted in Partial Fulfillment of the Requirements for the Masters of Arts Department of English in the Graduate School Southern Illinois University Carbondale December 2013 Copyright by Rebecca McCarthy, 2013 All Rights Reserved THESIS APPROVAL PERFECTION: UNITED GOAL OR DIVISIVE MYTH? A LOOK INTO THE CONCEPT OF POSTHUMANISM AND ITS THEORETICAL OUTCOMES IN SCIENCE FICTION By Rebecca McCarthy A Thesis Submitted in Partial Fulfillment of the Requirements for the Degree of Masters in the field of English Literature Approved by: Dr. Robert Fox, Chair Dr. Elizabeth Klaver Dr. Tony Williams Graduate School Southern Illinois University Carbondale 8/09/2013 AN ABSTRACT OF THE THESIS OF Rebecca McCarthy, for the Masters of English degree in Literature, presented on August 9, 2013, at Southern Illinois University Carbondale. TITLE: PERFECTION: UNITED GOAL OR DIVISIVE MYTH? A LOOK INTO THE CONCEPT OF POSTHUMANISM AND ITS THEORETICAL OUTCOMES IN SCIENCE FICTION MAJOR PROFESSOR: Dr. Robert Fox As science races to keep up with science fiction, many scientists are beginning to believe that the next step in human evolution will be a combination of human and machine and look a lot like something out of Star Trek. The constant pursuit of perfection is a part of the human condition, but if we begin to stretch beyond the natural human form can we still consider ourselves human? Transhumanism and posthumanism are only theories for now, but they are theories that threaten to permanently displace the human race, possibly pushing it into extinction. -
03 Garbowski.Pdf
ZESZYTY NAUKOWE POLITECHNIKI ŚLĄSKIEJ 2017 Seria: ORGANIZACJA I ZARZĄDZANIE z. 113 Nr kol. 1991 1 Marcin GARBOWSKI 2 John Paul II Catholic University of Lublin 3 Methodology of Science Department at the Faculty of Philosophy 4 [email protected] 5 A CRITICAL ANALYSIS OF THE ASILOMAR AI PRINCIPLES 6 Streszczenie. The article focuses on the analysis of a set of principles 7 concerning the development of artificial intelligence, which were agreed upon a 8 the Asilomar international conference in January 2017. The declaration was 9 signed by leading scholar in the field of transhumanism and representatives of the 10 hi-tech enterprises. The purpose of the article is to lay out the core assumptions 11 behind this established set of norms that aim at the creation of so call „friendly 12 AI” as well as their practical implications. 13 Słowa kluczowe: Artificial intelligence, philosophy of technology, trans- 14 humanism 15 KRYTYCZNA ANALIZA ZASAD DOTYCZĄCYCH SZTUCZNEJ 16 INTELIGENCJI Z ASILOMAR 17 Abstract. Przedmiotem artykułu jest analiza zestawu zasad uzgodnionych na 18 międzynarodowej konferencji w Asilomar ze stycznia 2017 roku dotycząca 19 rozwoju sztucznej inteligencji. Sygnatariuszami porozumienia są czołowi badacze 20 tematyki związanej z transhumanizmem, ale także przedstawiciele koncernów z 21 sektora hi-tech. Celem artykułu jest przedstawienie założeń stojących za 22 deklarowanym szkicem norm, mających na celu wytworzenie tzw. Przyjaznej 23 sztucznej inteligencji oraz ich praktycznych konsekwencji. 24 Keywords: Sztuczna inteligencja, filozofia techniki, transhumanism 25 1. Introduction 26 When we consider the various technological breakthroughs which in the past decades 27 have had the greatest impact on a broad scope of human activity and which are bound to 28 increase it in the coming years, artificial intelligence certainly occupies a prominent place. -
Intelligence Explosion FAQ
MIRI MACHINE INTELLIGENCE RESEARCH INSTITUTE Intelligence Explosion FAQ Luke Muehlhauser Machine Intelligence Research Institute Abstract The Machine Intelligence Research Institute is one of the leading research institutes on intelligence explosion. Below are short answers to common questions we receive. Muehlhauser, Luke. 2013. “Intelligence Explosion FAQ.” First published 2011 as “Singularity FAQ.” Machine Intelligence Research Institute, Berkeley, CA Contents 1 Basics 1 1.1 What is an intelligence explosion? . 1 2 How Likely Is an Intelligence Explosion? 2 2.1 How is “intelligence” defined? . 2 2.2 What is greater-than-human intelligence? . 2 2.3 What is whole-brain emulation? . 3 2.4 What is biological cognitive enhancement? . 3 2.5 What are brain-computer interfaces? . 4 2.6 How could general intelligence be programmed into a machine? . 4 2.7 What is superintelligence? . 4 2.8 When will the intelligence explosion happen? . 5 2.9 Might an intelligence explosion never occur? . 6 3 Consequences of an Intelligence Explosion 7 3.1 Why would great intelligence produce great power? . 7 3.2 How could an intelligence explosion be useful? . 7 3.3 How might an intelligence explosion be dangerous? . 8 4 Friendly AI 9 4.1 What is Friendly AI? . 9 4.2 What can we expect the motivations of a superintelligent machine to be? 10 4.3 Can’t we just keep the superintelligence in a box, with no access to the Internet? . 11 4.4 Can’t we just program the superintelligence not to harm us? . 11 4.5 Can we program the superintelligence to maximize human pleasure or desire satisfaction? . -
Hello, World: Artificial Intelligence and Its Use in the Public Sector
Hello, World: Artificial intelligence and its use in the public sector Jamie Berryhill Kévin Kok Heang Rob Clogher Keegan McBride November 2019 | http://oe.cd/helloworld OECD Working Papers on Public Governance No. 36 Cover images are the European Parliament and the dome of Germany’s Reichstag building processed through Deep Learning algorithms to match the style of Van Gogh paintings. tw Hello, World: Artificial Intelligence and its Use in the Public Sector Authors: Jamie Berryhill, Kévin Kok Heang, Rob Clogher, Keegan McBride PUBE 2 This document and any map included herein are without prejudice to the status of or sovereignty over any territory, to the delimitation of international frontiers and boundaries and to the name of any territory, city or area. 1. Note by Tukey: The information in this document with reference to ‘Cyprus’ relates to the southern part of the island. There is no single authority representing both Turkish and Greek Cypriot people on the island. Turkey recognises the Turkish Republic of Northern Cyprus (TRNC). Until a lasting and equitable solution is found within the context of the United Nations, Turkey shall preserve its position concerning the ‘Cyprus issue’. 2. Note by all the European Union Member States of the OECD and the European Commission: The Republic of Cyprus is recognised by all members of the United Nations with the exception of Turkey. The information in this document relates to the area under the effective control of the Government of the Republic of Cyprus. HELLO, WORLD: ARTIFICIAL INTELLIGENCE AND ITS USE IN THE PUBLIC SECTOR © OECD 2019 3 Foreword Artificial Intelligence (AI) is an area of research and technology application that can have a significant impact on public policies and services in many ways. -
A Survey of Research Questions for Robust and Beneficial AI
A survey of research questions for robust and beneficial AI 1 Introduction Artificial intelligence (AI) research has explored a variety of problems and approaches since its inception, but for the last 20 years or so has been focused on the problems surrounding the construction of intelligent agents|systems that perceive and act in some environment. In this context, the criterion for intelligence is related to statistical and economic notions of rationality|colloquially, the ability to make good decisions, plans, or inferences. The adoption of probabilistic representations and statistical learning methods has led to a large degree of integration and cross-fertilization between AI, machine learning, statistics, control theory, neuroscience, and other fields. The establishment of shared theoretical frameworks, combined with the availability of data and processing power, has yielded remarkable successes in various component tasks such as speech recognition, image classification, autonomous vehicles, machine translation, legged locomotion, and question-answering systems. As capabilities in these areas and others cross the threshold from laboratory research to economically valuable technologies, a virtuous cycle takes hold whereby even small improvements in performance are worth large sums of money, prompting greater investments in research. There is now a broad consensus that AI research is progressing steadily, and that its impact on society is likely to increase. The potential benefits are huge, since everything that civilization has to offer is a product of human intelligence; we cannot predict what we might achieve when this intelligence is magnified by the tools AI may provide, but the eradication of disease and poverty are not unfathomable. Because of the great potential of AI, it is valuable to investigate how to reap its benefits while avoiding potential pitfalls. -
Artificial Intelligence Safety and Cybersecurity: a Timeline of AI Failures
Artificial Intelligence Safety and Cybersecurity: a Timeline of AI Failures Roman V. Yampolskiy M. S. Spellchecker Computer Engineering and Computer Science Microsoft Corporation University of Louisville One Microsoft Way, Redmond, WA [email protected] [email protected] Abstract In this work, we present and analyze reported failures of artificially intelligent systems and extrapolate our analysis to future AIs. We suggest that both the frequency and the seriousness of future AI failures will steadily increase. AI Safety can be improved based on ideas developed by cybersecurity experts. For narrow AIs safety failures are at the same, moderate, level of criticality as in cybersecurity, however for general AI, failures have a fundamentally different impact. A single failure of a superintelligent system may cause a catastrophic event without a chance for recovery. The goal of cybersecurity is to reduce the number of successful attacks on the system; the goal of AI Safety is to make sure zero attacks succeed in bypassing the safety mechanisms. Unfortunately, such a level of performance is unachievable. Every security system will eventually fail; there is no such thing as a 100% secure system. Keywords: AI Safety, Cybersecurity, Failures, Superintelligence. 1. Introduction A day does not go by without a news article reporting some amazing breakthrough in artificial intelligence1. In fact progress in AI has been so steady that some futurologists, such as Ray Kurzweil, project current trends into the future and anticipate what the headlines of tomorrow will bring us. Consider some developments from the world of technology: 2004 DARPA sponsors a driverless car grand challenge. Technology developed by the participants eventually allows Google to develop a driverless automobile and modify existing transportation laws.