AGI Safety Literature Review Arxiv:1805.01109V2 [Cs.AI] 21 May 2018
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
1 COPYRIGHT STATEMENT This Copy of the Thesis Has Been
University of Plymouth PEARL https://pearl.plymouth.ac.uk 04 University of Plymouth Research Theses 01 Research Theses Main Collection 2012 Life Expansion: Toward an Artistic, Design-Based Theory of the Transhuman / Posthuman Vita-More, Natasha http://hdl.handle.net/10026.1/1182 University of Plymouth All content in PEARL is protected by copyright law. Author manuscripts are made available in accordance with publisher policies. Please cite only the published version using the details provided on the item record or document. In the absence of an open licence (e.g. Creative Commons), permissions for further reuse of content should be sought from the publisher or author. COPYRIGHT STATEMENT This copy of the thesis has been supplied on condition that anyone who consults it is understood to recognize that its copyright rests with its author and that no quotation from the thesis and no information derived from it may be published without the author’s prior consent. 1 Life Expansion: Toward an Artistic, Design-Based Theory of the Transhuman / Posthuman by NATASHA VITA-MORE A thesis submitted to the University of Plymouth in partial fulfillment for the degree of DOCTOR OF PHILOSOPHY School of Art & Media Faculty of Arts April 2012 2 Natasha Vita-More Life Expansion: Toward an Artistic, Design-Based Theory of the Transhuman / Posthuman The thesis’ study of life expansion proposes a framework for artistic, design-based approaches concerned with prolonging human life and sustaining personal identity. To delineate the topic: life expansion means increasing the length of time a person is alive and diversifying the matter in which a person exists. -
Understanding Agent Incentives Using Causal Influence Diagrams
Understanding Agent Incentives using Causal Influence Diagrams∗ Part I: Single Decision Settings Tom Everitt Pedro A. Ortega Elizabeth Barnes Shane Legg September 9, 2019 Deepmind Agents are systems that optimize an objective function in an environ- ment. Together, the goal and the environment induce secondary objectives, incentives. Modeling the agent-environment interaction using causal influ- ence diagrams, we can answer two fundamental questions about an agent’s incentives directly from the graph: (1) which nodes can the agent have an incentivize to observe, and (2) which nodes can the agent have an incentivize to control? The answers tell us which information and influence points need extra protection. For example, we may want a classifier for job applications to not use the ethnicity of the candidate, and a reinforcement learning agent not to take direct control of its reward mechanism. Different algorithms and training paradigms can lead to different causal influence diagrams, so our method can be used to identify algorithms with problematic incentives and help in designing algorithms with better incentives. Contents 1. Introduction 2 2. Background 3 3. Observation Incentives 7 4. Intervention Incentives 13 arXiv:1902.09980v6 [cs.AI] 6 Sep 2019 5. Related Work 19 6. Limitations and Future Work 22 7. Conclusions 23 References 24 A. Representing Uncertainty 29 B. Proofs 30 ∗ A number of people have been essential in preparing this paper. Ryan Carey, Eric Langlois, Michael Bowling, Tim Genewein, James Fox, Daniel Filan, Ray Jiang, Silvia Chiappa, Stuart Armstrong, Paul Christiano, Mayank Daswani, Ramana Kumar, Jonathan Uesato, Adria Garriga, Richard Ngo, Victoria Krakovna, Allan Dafoe, and Jan Leike have all contributed through thoughtful discussions and/or by reading drafts at various stages of this project. -
The Romantic Economist
This page intentionally left blank THE ROMANTIC ECONOMIST Since economies are dynamic processes driven by creativity, social norms and emotions, as well as rational calculation, why do econo- mists largely study them through the prism of static equilibrium models and narrow rationalistic assumptions? Economic activity is as much a function of imagination and social sentiments as of the rational optimisation of given preferences and goods. Richard Bronk argues that economists can best model and explain these creative and social aspects of markets by using new structuring assumptions and metaphors derived from the poetry and philosophy of the Romantics. By bridging the divide between literature and science, and between Romanticism and narrow forms of rationalism, economists can access grounding assumptions, models and research methods suitable for comprehending the creativity and social dimensions of economic activity. This is a guide to how economists and other social scientists can broaden their analytical repertoire to encompass the vital role of sentiments, language and imagination. Educated at Merton College, Oxford, Richard Bronk gained a first class degree in Classics and Philosophy. He spent the first seventeen years of his career working in the City of London, where he acquired a wide expertise in international economics, business and politics. His first book, Progress and the Invisible Hand (1998) was well received critically, and anticipated millennial angst about the increasingly strained relationship between economic growth and progress in wel- fare. Having returned to academic life in 2000, Bronk is now a writer and part-time academic. richard bronk is currently a Visiting Fellow in the European Institute at the London School of Economics and Political Science. -
Safety of Artificial Superintelligence
Environment. Technology. Resources. Rezekne, Latvia Proceedings of the 12th International Scientific and Practical Conference. Volume II, 180-183 Safety of Artificial Superintelligence Aleksejs Zorins Peter Grabusts Faculty of Engineering Faculty of Engineering Rezekne Academy of Technologies Rezekne Academy of Technologies Rezekne, Latvia Rezekne, Latvia [email protected] [email protected] Abstract—The paper analyses an important problem of to make a man happy it will do it with the fastest and cyber security from human safety perspective which is usu- cheapest (in terms of computational resources) without ally described as data and/or computer safety itself with- using a common sense (for example killing of all people out mentioning the human. There are numerous scientific will lead to the situation that no one is unhappy or the predictions of creation of artificial superintelligence, which decision to treat human with drugs will also make him could arise in the near future. That is why the strong neces- sity for protection of such a system from causing any farm happy etc.). Another issue is that we want computer to arises. This paper reviews approaches and methods already that we want, but due to bugs in the code computer will presented for solving this problem in a single article, analy- do what the code says, and in the case of superintelligent ses its results and provides future research directions. system, this may be a disaster. Keywords—Artificial Superintelligence, Cyber Security, The next sections of the paper will show the possible Singularity, Safety of Artificial Intelligence. solutions of making superintelligence safe to humanity. -
Liability Law for Present and Future Robotics Technology Trevor N
Liability Law for Present and Future Robotics Technology Trevor N. White and Seth D. Baum Global Catastrophic Risk Institute http://gcrinstitute.org * http://sethbaum.com Published in Robot Ethics 2.0, edited by Patrick Lin, George Bekey, Keith Abney, and Ryan Jenkins, Oxford University Press, 2017, pages 66-79. This version 10 October 2017 Abstract Advances in robotics technology are causing major changes in manufacturing, transportation, medicine, and a number of other sectors. While many of these changes are beneficial, there will inevitably be some harms. Who or what is liable when a robot causes harm? This paper addresses how liability law can and should account for robots, including robots that exist today and robots that potentially could be built at some point in the near or distant future. Already, robots have been implicated in a variety of harms. However, current and near-future robots pose no significant challenge for liability law: they can be readily handled with existing liability law or minor variations thereof. We show this through examples from medical technology, drones, and consumer robotics. A greater challenge will arise if it becomes possible to build robots that merit legal personhood and thus can be held liable. Liability law for robot persons could draw on certain precedents, such as animal liability. However, legal innovations will be needed, in particular for determining which robots merit legal personhood. Finally, a major challenge comes from the possibility of future robots that could cause major global catastrophe. As with other global catastrophic risks, liability law could not apply, because there would be no post- catastrophe legal system to impose liability. -
The Transhumanist Reader Is an Important, Provocative Compendium Critically Exploring the History, Philosophy, and Ethics of Transhumanism
TH “We are in the process of upgrading the human species, so we might as well do it E Classical and Contemporary with deliberation and foresight. A good first step is this book, which collects the smartest thinking available concerning the inevitable conflicts, challenges and opportunities arising as we re-invent ourselves. It’s a core text for anyone making TRA Essays on the Science, the future.” —Kevin Kelly, Senior Maverick for Wired Technology, and Philosophy “Transhumanism has moved from a fringe concern to a mainstream academic movement with real intellectual credibility. This is a great taster of some of the best N of the Human Future emerging work. In the last 10 years, transhumanism has spread not as a religion but as a creative rational endeavor.” SHU —Julian Savulescu, Uehiro Chair in Practical Ethics, University of Oxford “The Transhumanist Reader is an important, provocative compendium critically exploring the history, philosophy, and ethics of transhumanism. The contributors anticipate crucial biopolitical, ecological and planetary implications of a radically technologically enhanced population.” M —Edward Keller, Director, Center for Transformative Media, Parsons The New School for Design A “This important book contains essays by many of the top thinkers in the field of transhumanism. It’s a must-read for anyone interested in the future of humankind.” N —Sonia Arrison, Best-selling author of 100 Plus: How The Coming Age of Longevity Will Change Everything IS The rapid pace of emerging technologies is playing an increasingly important role in T overcoming fundamental human limitations. The Transhumanist Reader presents the first authoritative and comprehensive survey of the origins and current state of transhumanist Re thinking regarding technology’s impact on the future of humanity. -
THE IMPACT of the FOURTH INDUSTRIAL REVOLUTION on the JOBS MARKET Senate of the Italian Republic – 11Th Labour and Social Secu
THE IMPACT OF THE FOURTH INDUSTRIAL REVOLUTION ON THE JOBS MARKET Senate of the Italian Republic – 11th Labour and Social Security Committee For the victims of terrorism fallen in the line of work Introduction The issue of the relationship between technology and work has returned to the heart of the public debate. This is by no means a new discussion: fear of jobs being destroyed by the emergence of new tools for producing goods and services, and their new processes, crisscrosses the history of the industrial economy, leaving us with memories of Luddites, Keynesian technological unemployment, and the “the end of work” alarm raised in the early 1990s. The new Industrial Revolution appears to have been enabled by technologies that are increasingly available at low cost to companies and individuals. These technologies are destined to evolve at an unpredictable pace, in unpredictable ways and with unforeseeable content. Their consequences may impact business models and production processes; above all, they are ushering in new ways of relating to consumers and the markets through more efficient, personalized and immediate channels of coordination enabled by technology. The hallmark feature of this technology is its way of integrating physical processes and digital technologies as a means of renewing organizational models. Another way of expressing this is to say that production is “smartening up” along a number of different paths, either making a break or evolving from the past. Large factories are looking to go beyond assembly lines and replace them with autonomous “islands”, manned by people and machines – teams of workers and robots. Small companies are leveraging a typically Italian feature of the economy – the fact that they are concentrated in districts, and are specialized in niche output – as they work to combine classical artisanal with digital skills. -
Between Ape and Artilect Createspace V2
Between Ape and Artilect Conversations with Pioneers of Artificial General Intelligence and Other Transformative Technologies Interviews Conducted and Edited by Ben Goertzel This work is offered under the following license terms: Creative Commons: Attribution-NonCommercial-NoDerivs 3.0 Unported (CC-BY-NC-ND-3.0) See http://creativecommons.org/licenses/by-nc-nd/3.0/ for details Copyright © 2013 Ben Goertzel All rights reserved. ISBN: ISBN-13: “Man is a rope stretched between the animal and the Superman – a rope over an abyss.” -- Friedrich Nietzsche, Thus Spake Zarathustra Table&of&Contents& Introduction ........................................................................................................ 7! Itamar Arel: AGI via Deep Learning ................................................................. 11! Pei Wang: What Do You Mean by “AI”? .......................................................... 23! Joscha Bach: Understanding the Mind ........................................................... 39! Hugo DeGaris: Will There be Cyborgs? .......................................................... 51! DeGaris Interviews Goertzel: Seeking the Sputnik of AGI .............................. 61! Linas Vepstas: AGI, Open Source and Our Economic Future ........................ 89! Joel Pitt: The Benefits of Open Source for AGI ............................................ 101! Randal Koene: Substrate-Independent Minds .............................................. 107! João Pedro de Magalhães: Ending Aging .................................................... -
Classification Schemas for Artificial Intelligence Failures
Journal XX (XXXX) XXXXXX https://doi.org/XXXX/XXXX Classification Schemas for Artificial Intelligence Failures Peter J. Scott1 and Roman V. Yampolskiy2 1 Next Wave Institute, USA 2 University of Louisville, Kentucky, USA [email protected], [email protected] Abstract In this paper we examine historical failures of artificial intelligence (AI) and propose a classification scheme for categorizing future failures. By doing so we hope that (a) the responses to future failures can be improved through applying a systematic classification that can be used to simplify the choice of response and (b) future failures can be reduced through augmenting development lifecycles with targeted risk assessments. Keywords: artificial intelligence, failure, AI safety, classification 1. Introduction Artificial intelligence (AI) is estimated to have a $4-6 trillion market value [1] and employ 22,000 PhD researchers [2]. It is estimated to create 133 million new roles by 2022 but to displace 75 million jobs in the same period [6]. Projections for the eventual impact of AI on humanity range from utopia (Kurzweil, 2005) (p.487) to extinction (Bostrom, 2005). In many respects AI development outpaces the efforts of prognosticators to predict its progress and is inherently unpredictable (Yampolskiy, 2019). Yet all AI development is (so far) undertaken by humans, and the field of software development is noteworthy for unreliability of delivering on promises: over two-thirds of companies are more likely than not to fail in their IT projects [4]. As much effort as has been put into the discipline of software safety, it still has far to go. Against this background of rampant failures we must evaluate the future of a technology that could evolve to human-like capabilities, usually known as artificial general intelligence (AGI). -
Artificial General Intelligence and the Future of the Human Race Bryon Pavlacka
ARTIFICIAL GENERAL INTELLIGENCE AND THE FUTURE OF THE HUMAN RACE Bryon Pavlacka Artificial Intelligence is all around us. It manages be used against humanity. Of course, such threats are not your investments, makes the subway run on time, imaginary future possibilities. Narrow AI is already used diagnoses medical conditions, searches the internet, solves by the militaries of first world countries for war purposes. enormous systems of equations, and beats human players Consider drones such as the Northrop Grumman X-47B, at chess and Jeopardy. However, this “narrow AI,” designed an Unmanned Combat Aerial Vehicle that is being tested for solving specific, narrow problems, is something by the US Navy (DefenseTech.org, 2011). That’s right, there distinctly different from Artificial General Intelligence, or is no pilot. Of course, the drone can be given orders, but “AGI”, true thinking machines with human-like general the exact way in which those orders are carried out will intelligence (Wang, Goertzel, & Franklin, 2008, p. v). While be left up to the drone’s Narrow AI system. Whether such AGI is not rigidly defined, it is often envisioned as being systems will ever be extended toward general intelligence self-aware and capable of complex thought, and has is currently unknown. However, the US military has shown BSJ been a staple of science fiction, appearing prominently in interest in producing and controlling generally intelligent popular films such as 2001: A Space Odyssey, Terminator, killing machines as well, as made evident by a paper called and I, Robot. In each of these films, the machines go “Governing Lethal Behavior” by Ronald C. -
Powering Fast Forward Thinking
1 Powering Fast Forward Thinking THE AXA 2019 FORESIGHT TRENDBOOK 2 Editorial 3 AXA Foresight: Who are we? 4 Environment 5 Health 21 New Tech 37 Socio-Economics 53 Appendix 69 Acknowledgement & Credits 70 3 EDITORIAL Editorial Using past experience to anticipate probable future is at the core of insurance. At the same time, as an investor and asset manager, we know that past performance is no guarantee of future results. Societies’, companies’ and individuals’ fates may be overturned, for better or worse, by an unexpected combination of existing trends, by major scientific breakthrough or by a “black swan” event. Foresight is not about the extrapolation of existing trends, it is about the identification of potential disruptions. At AXA, day-to-day management matters but we also have the conviction that true achievements are founded on long term visions. We feel that a foresight effort is important to build our strategy but might also be useful to tackle many public debates. This first Trendbook is a modest contribution to answer this need to broaden our horizons. With this publication, we hope to provide more than a simple compilation of “trending topics”: exploring “ Foresight is not about the substance beyond buzzwords and, based on latest scientific evidence, sharing visions and options for decision-making. extrapolation of existing Indeed, we believe that concepts such as “slashers”, “affective computing” or “genetic engineering” will change trends, it is about the the way we think, live and work. identification of potential Our research initiatives, notably through the AXA Research Fund, are focused around four pillars: Environment & disruptions. -
Approximate Universal Artificial Intelligence And
APPROXIMATEUNIVERSALARTIFICIAL INTELLIGENCE AND SELF-PLAY LEARNING FORGAMES Doctor of Philosophy Dissertation School of Computer Science and Engineering joel veness supervisors Kee Siong Ng Marcus Hutter Alan Blair William Uther John Lloyd January 2011 Joel Veness: Approximate Universal Artificial Intelligence and Self-play Learning for Games, Doctor of Philosophy Disserta- tion, © January 2011 When we write programs that learn, it turns out that we do and they don’t. — Alan Perlis ABSTRACT This thesis is split into two independent parts. The first is an investigation of some practical aspects of Marcus Hutter’s Uni- versal Artificial Intelligence theory [29]. The main contributions are to show how a very general agent can be built and analysed using the mathematical tools of this theory. Before the work presented in this thesis, it was an open question as to whether this theory was of any relevance to reinforcement learning practitioners. This work suggests that it is indeed relevant and worthy of future investigation. The second part of this thesis looks at self-play learning in two player, determin- istic, adversarial turn-based games. The main contribution is the introduction of a new technique for training the weights of a heuristic evaluation function from data collected by classical game tree search algorithms. This method is shown to outperform previous self-play training routines based on Temporal Difference learning when applied to the game of Chess. In particular, the main highlight was using this technique to construct a Chess program that learnt to play master level Chess by tuning a set of initially random weights from self play games.