Value Lock-In Notes 2021 (Public Version)
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Infinite Ethics
INFINITE ETHICS Nick Bostrom Faculty of Philosophy Oxford University [Published in Analysis and Metaphysics, Vol. 10 (2011): pp. 9-59] [This is the final version. Earlier versions: 2003, 2005, 2008, 2009] www.nickbostrom.com ABSTRACT Aggregative consequentialism and several other popular moral theories are threatened with paralysis: when coupled with some plausible assumptions, they seem to imply that it is always ethically indifferent what you do. Modern cosmology teaches that the world might well contain an infinite number of happy and sad people and other candidate value-bearing locations. Aggregative ethics implies that such a world contains an infinite amount of positive value and an infinite amount of negative value. You can affect only a finite amount of good or bad. In standard cardinal arithmetic, an infinite quantity is unchanged by the addition or subtraction of any finite quantity. So it appears you cannot change the value of the world. Modifications of aggregationism aimed at resolving the paralysis are only partially effective and cause severe side effects, including problems of “fanaticism”, “distortion”, and erosion of the intuitions that originally motivated the theory. Is the infinitarian challenge fatal? 1. The challenge 1.1. The threat of infinitarian paralysis When we gaze at the starry sky at night and try to think of humanity from a “cosmic point of view”, we feel small. Human history, with all its earnest strivings, triumphs, and tragedies can remind us of a colony of ants, laboring frantically to rearrange the needles of their little ephemeral stack. We brush such late-night rumination aside in our daily life and analytic 1 philosophy. -
Apocalypse Now? Initial Lessons from the Covid-19 Pandemic for the Governance of Existential and Global Catastrophic Risks
journal of international humanitarian legal studies 11 (2020) 295-310 brill.com/ihls Apocalypse Now? Initial Lessons from the Covid-19 Pandemic for the Governance of Existential and Global Catastrophic Risks Hin-Yan Liu, Kristian Lauta and Matthijs Maas Faculty of Law, University of Copenhagen, Copenhagen, Denmark [email protected]; [email protected]; [email protected] Abstract This paper explores the ongoing Covid-19 pandemic through the framework of exis- tential risks – a class of extreme risks that threaten the entire future of humanity. In doing so, we tease out three lessons: (1) possible reasons underlying the limits and shortfalls of international law, international institutions and other actors which Covid-19 has revealed, and what they reveal about the resilience or fragility of institu- tional frameworks in the face of existential risks; (2) using Covid-19 to test and refine our prior ‘Boring Apocalypses’ model for understanding the interplay of hazards, vul- nerabilities and exposures in facilitating a particular disaster, or magnifying its effects; and (3) to extrapolate some possible futures for existential risk scholarship and governance. Keywords Covid-19 – pandemics – existential risks – global catastrophic risks – boring apocalypses 1 Introduction: Our First ‘Brush’ with Existential Risk? All too suddenly, yesterday’s ‘impossibilities’ have turned into today’s ‘condi- tions’. The impossible has already happened, and quickly. The impact of the Covid-19 pandemic, both directly and as manifested through the far-reaching global societal responses to it, signal a jarring departure away from even the © koninklijke brill nv, leiden, 2020 | doi:10.1163/18781527-01102004Downloaded from Brill.com09/27/2021 12:13:00AM via free access <UN> 296 Liu, Lauta and Maas recent past, and suggest that our futures will be profoundly different in its af- termath. -
Robin Hanson Statement on Teaching
Robin Hanson Statement on Teaching Long ago I was one of the top teaching assistants at the University of Chicago physics department. I had to learn to translate this ability into teaching economics here. For example, I learned that students are far more willing to believe physicists about physics, than to believe economists about economics. To help overcome this skepticism, I run some classroom experiments in my microeconomics courses. These experiments give students a good initial hook for the concepts of marginal value and cost, and they show that standard theory predicts experimental outcomes reasonably well. One physics practice that translates well to economics is “back of the envelope” analysis. For example, I ask why poets are paid less than romance novelists, and teachers are paid less than garbage men (hint: note supply side effects). And I ask whether a subsidy or tax would best correct for law mower externalities (hint: neighbors can negotiate easily). While many undergraduates prefer teachers to give them the one true answer on policy questions, I try to avoid taking sides. In class discussions, I ask students to take and defend policy positions, and then I defend any undefended major positions, and mention any unmentioned major criticisms of these positions. The important thing is for students to understand the sorts of arguments that an economist would respect. I am big on normative analysis. I review how normative economics differs from normative analysis elsewhere, and the basic types of market failure. I cover counter- arguments that most teachers don’t cover, such as consistency checks (e.g., if this were the failure then we should also see that) and minimal interventions (e.g., this milder alternative should work just as well). -
FROM LONGITUDE to ALTITUDE: INDUCEMENT PRIZE CONTESTS AS INSTRUMENTS of PUBLIC POLICY in SCIENCE and TECHNOLOGY Clayton Stallbau
FROM LONGITUDE TO ALTITUDE: INDUCEMENT PRIZE CONTESTS AS INSTRUMENTS OF PUBLIC POLICY IN SCIENCE AND TECHNOLOGY Clayton Stallbaumer* I. INTRODUCTION On a foggy October night in 1707, a fleet of Royal Navy warships ran aground on the Isles of Scilly, some twenty miles southwest of England.1 Believing themselves to be safely west of any navigational hazards, two thousand sailors discovered too late that they had fatally misjudged their position.2 Although the search for an accurate method to determine longitude had bedeviled Europe for several centuries, the “sudden loss of so many lives, so many ships, and so much honor all at once” elevated the discovery of a solution to a top national priority in England.3 In 1714, Parliament issued the Longitude Act,4 promising as much as £20,0005 for a “Practicable and Useful” method of determining longitude at sea; fifty-nine years later, clockmaker John Harrison claimed the prize.6 Nearly three centuries after the fleet’s tragic demise, another accident, also fatal but far removed from longitude’s terrestrial reach, struck a nation deeply. Just four days after its crew observed a moment’s silence in memory of the Challenger astronauts, the space shuttle Columbia disintegrated upon reentry, killing all on board.7 Although * J.D., University of Illinois College of Law, 2006; M.B.A., University of Illinois College of Business, 2006; B.A., Economics, B.A., History, Lake Forest College, 2001. 1. DAVA SOBEL, LONGITUDE: THE TRUE STORY OF A LONE GENIUS WHO SOLVED THE GREATEST SCIENTIFIC PROBLEM OF HIS TIME 12 (1995). -
The Transhumanist Reader Is an Important, Provocative Compendium Critically Exploring the History, Philosophy, and Ethics of Transhumanism
TH “We are in the process of upgrading the human species, so we might as well do it E Classical and Contemporary with deliberation and foresight. A good first step is this book, which collects the smartest thinking available concerning the inevitable conflicts, challenges and opportunities arising as we re-invent ourselves. It’s a core text for anyone making TRA Essays on the Science, the future.” —Kevin Kelly, Senior Maverick for Wired Technology, and Philosophy “Transhumanism has moved from a fringe concern to a mainstream academic movement with real intellectual credibility. This is a great taster of some of the best N of the Human Future emerging work. In the last 10 years, transhumanism has spread not as a religion but as a creative rational endeavor.” SHU —Julian Savulescu, Uehiro Chair in Practical Ethics, University of Oxford “The Transhumanist Reader is an important, provocative compendium critically exploring the history, philosophy, and ethics of transhumanism. The contributors anticipate crucial biopolitical, ecological and planetary implications of a radically technologically enhanced population.” M —Edward Keller, Director, Center for Transformative Media, Parsons The New School for Design A “This important book contains essays by many of the top thinkers in the field of transhumanism. It’s a must-read for anyone interested in the future of humankind.” N —Sonia Arrison, Best-selling author of 100 Plus: How The Coming Age of Longevity Will Change Everything IS The rapid pace of emerging technologies is playing an increasingly important role in T overcoming fundamental human limitations. The Transhumanist Reader presents the first authoritative and comprehensive survey of the origins and current state of transhumanist Re thinking regarding technology’s impact on the future of humanity. -
Are Disagreements Honest?
Are Disagreements Honest? Tyler Cowen Robin Hanson* Department of Economics George Mason University August 18, 2004 (First version April 16, 2001.) *The authors wish to thank Curt Adams, Nick Bostrom, Geoff Brennan, James Buchanan, Bryan Caplan, Wei Dai, Hal Finney, Mark Grady, Patricia Greenspan, Kevin Grier, Robin Grier, Hans Haller, Claire Hill, Mary Hirschfeld, Dan Houser, Stephen Hsu, Michael Huemer, Maureen Kelley, Arnold Kling, Peter McCluskey, Tom Morrow, William Nelson, Mark Notturno, David Schmidtz, Susan Snyder, Aris Spanos, Damien Sullivan, Daniel Sutter, Alexander Tabarrok, William Talbott, Nicolaus Tideman, Eleizer Yudkowsky, and participants of the Virginia Tech economics department seminar for useful comments and discussion. We thank the Center for Study of Public Choice and the Mercatus Center for financial support. * Correspondence to Robin Hanson, [email protected], MSN 1D3, Carow Hall, Fairfax VA 22030-4444, 703-993-2326 FAX: 703-993-2323 Are Disagreements Honest? ABSTRACT We review literatures on agreeing to disagree and on the rationality of differing priors, in order to evaluate the honesty of typical disagreements. A robust result is that honest truth-seeking agents with common priors should not knowingly disagree. Typical disagreement seems explainable by a combination of random belief influences and by priors that tell each person that he reasons better than others. When criticizing others, however, people seem to uphold rationality standards that disapprove of such self- favoring priors. This suggests that typical disagreements are dishonest. We conclude by briefly considering how one might try to become more honest when disagreeing. KEYWORDS: agreeing, disagree, common, prior, truth-seeking, Bayesian 2 I. Introduction People disagree all of the time, especially about politics, morality, religion, and relative abilities. -
Joseph Gao EAS 499: Senior Thesis Advisor: Brett Hemenway [DRAFT]
Joseph Gao EAS 499: Senior Thesis Advisor: Brett Hemenway [DRAFT] Introduction to Cryptocurrencies for Retail Investors 1. ABSTRACT In the year 1998, roughly seven years before the smartphone revolution, (2007 marked the release of the iPhone, which arguably kicked off the ‘always online’ era that we live in today), Wei Dai published a proposal for an “anonymous, distributed electronic cash system”.(http://www.weidai.com/bmoney.txt). In the paper, Dai introduced two protocols to achieve his proposal - the first using a proof of work function as a means of creating money, and the second defining a set of servers responsible for keeping accounts, which must be regularly published, and verify balances for the other participants in the decentralized system. While B-money never took off, in part due to the impractical requirement of the first protocol that asks for a broadcast channel that is synchronous and unjammable, Wei Dai’s proposal planted the seeds of an idea that would later inspire Satoshi Nakamoto to publish the Bitcoin: A Peer-to-Peer Electronic Cash System white paper. This publication sparked a renewed wave of interest in distributed payment systems and resulted in thousands and thousands of new proposals and protocols to be developed in the next ten years. 2. INTRODUCTION The year of 2017 showed immense mainstream adoption in a number of cryptocurrencies. However, while mainstream chatter of these various cryptocurrencies has become nearly impossible to avoid, many retail investors are unfamiliar with the underlying technology powering each coin, according to studies performed by CoinDesk, Blockchain Capital, and The 1 University of Cambridge . -
Una Historia Del Pensamiento Transhumanista 1 a History of a Transhumanist Thought
UNA HISTORIA DEL PENSAMIENTO TRANSHUMANISTA 1 A HISTORY OF A TRANSHUMANIST THOUGHT NICK BOSTROM Universidad de Oxford [email protected] RECIBIDO: 24/05/2011 ACEPTADO: 04/07/2011 Resumen: Este artículo repasa algunos de los antecedentes e hitos del pensamiento transhumanista. Lo hace recordando narrativas y pensadores –principalmente occidentales- que han exhibido ideas o deseos convergentes con, anticipadores, o inspiradores de los del transhumanismo, tales como la milenaria empresa de mejorar la condición humana a través del desarrollo tecnológico. También se lleva a cabo una recapitulación de asuntos y debates surgidos en torno al transhumanismo desde finales del siglo pasado hasta 2005. Esta recapitulación concluye con una llamada al entendimiento entre bandos enfrentados –especialmente, transhumanistas y bioconservadores. Asimismo, se incluye una presentación de la gestación e ideas básicas de la World Transhumanist Association (Asociación Transhumanista Mundial), cuya declaración fundacional (en su versión de 2009) se incluye al final del artículo. Palabras clave: transhumanismo, posthumano, humanismo, posthumanismo, perfeccionamiento humano, singularidad. Abstract: This paper traces some of the historic antecedents and landmarks of transhumanist thought. It does so by recalling narratives and thinkers –primarily Western- that have exhibited ideas or desires that are convergent with, anticipations of, or inspirations for those that characterize transhumanism, such as the age-old quest for improving the human condition through technical development. There is also a recapitulation of topics and debates merging or emerging around transhumanism during XXth century and up to 2005. This recapitulation concludes with a call to the quarrelling sides -primarily, transhumanists and bioconservatives- for mutual understanding. It also includes a brief account of the historic gestation and basic ideas of the World Transhumanist Association (WTA), whose foundational declaration (in its 2009 version) is included at the end of the paper. -
Comments to Michael Jackson's Keynote on Determining Energy
Comments to Michael Jackson’s Keynote on Determining Energy Futures using Artificial Intelligence Prof Sirkka Heinonen Finland Futures Research Centre (FFRC) University of Turku ENERGIZING FUTURES 13–14 June 2018 Tampere, Finland AI and Energy • Concrete tools: How Shaping Tomorrow answers the question How can AI help? • Goal of foresight crystal clear: Making better decisions today Huge Challenge – The Challenge The world will need to cut energy-related carbon dioxide emissions by 60 percent by 2050 -even as the population grows by more than two billion people Bold Solution on the Horizon The Renewable Energy Transition Companies as pioneers on energy themes, demand, supply, consumption • Google will reach 100% RE for its global operations this year • GE using AI to boost different forms of energy production and use tech-driven data reports to anticipate performance and maintenance needs around the world BUT …also the role of governments, cities and citizens …NGOs, media… new actors should be emphasised AI + Energy + Peer-to-Peer Society • The transformation of the energy system aligns with the principles of a Peer-to-Peer Society. • Peer-to-peer practices are based on the active participation and self-organisation of citizens. Citizens share knowledge, skills, co-create, and form new peer groups. • Citizens will use their capabilities also to develop energy-related products and services Rethinking Concepts Buildings as Power stations – global (economic) opportunity to construct buildings that generate, store and release solar energy -
What Is the Upper Limit of Value?
WHAT IS THE UPPER LIMIT OF VALUE? Anders Sandberg∗ David Manheim∗ Future of Humanity Institute 1DaySooner University of Oxford Delaware, United States, Suite 1, Littlegate House [email protected] 16/17 St. Ebbe’s Street, Oxford OX1 1PT [email protected] January 27, 2021 ABSTRACT How much value can our decisions create? We argue that unless our current understanding of physics is wrong in fairly fundamental ways, there exists an upper limit of value relevant to our decisions. First, due to the speed of light and the definition and conception of economic growth, the limit to economic growth is a restrictive one. Additionally, a related far larger but still finite limit exists for value in a much broader sense due to the physics of information and the ability of physical beings to place value on outcomes. We discuss how this argument can handle lexicographic preferences, probabilities, and the implications for infinite ethics and ethical uncertainty. Keywords Value · Physics of Information · Ethics Acknowledgements: We are grateful to the Global Priorities Institute for highlighting these issues and hosting the conference where this paper was conceived, and to Will MacAskill for the presentation that prompted the paper. Thanks to Hilary Greaves, Toby Ord, and Anthony DiGiovanni, as well as to Adam Brown, Evan Ryan Gunter, and Scott Aaronson, for feedback on the philosophy and the physics, respectively. David Manheim also thanks the late George Koleszarik for initially pointing out Wei Dai’s related work in 2015, and an early discussion of related issues with Scott Garrabrant and others on asymptotic logical uncertainty, both of which informed much of his thinking in conceiving the paper. -
Artificial Intelligence As a Positive and Negative Factor in Global Risk
The Singularity Institute Artificial Intelligence as a Positive and Negative Factor in Global Risk Eliezer Yudkowsky The Singularity Institute Yudkowsky, Eliezer. 2008. Artificial intelligence as a positive and negative factor in global risk. In Global catastrophic risks, ed. Nick Bostrom and Milan M. Cirkovic, 308–345. New York: Oxford University Press. This version contains minor changes. Eliezer Yudkowsky 1. Introduction By far the greatest danger of Artificial Intelligence is that people conclude too early that they understand it. Of course this problem is not limited to the field of AI. Jacques Monod wrote: “A curious aspect of the theory of evolution is that everybody thinks he understands it” (Monod 1975). My father, a physicist, complained about people making up their own theories of physics; he wanted to know why people did not make up their own theories of chemistry. (Answer: They do.) Nonetheless the problem seems tobe unusually acute in Artificial Intelligence. The field of AI has a reputation formaking huge promises and then failing to deliver on them. Most observers conclude that AI is hard; as indeed it is. But the embarrassment does not stem from the difficulty. It is difficult to build a star from hydrogen, but the field of stellar astronomy does not have a terrible reputation for promising to build stars and then failing. The critical inference is not that AI is hard, but that, for some reason, it is very easy for people to think they know far more about Artificial Intelligence than they actually do. In my other chapter for Global Catastrophic Risks, “Cognitive Biases Potentially Affect- ing Judgment of Global Risks” (Yudkowsky 2008), I opened by remarking that few people would deliberately choose to destroy the world; a scenario in which the Earth is destroyed by mistake is therefore very worrisome. -
A R E S E a R C H Agenda
Cooperation, Conflict, and Transformative Artificial Intelligence — A RESEARCH AGENDA Jesse Clifton — LONGTERMRISK.ORG March 2020 First draft: December 2019 Contents 1 Introduction 2 1.1 Cooperation failure: models and examples . .3 1.2 Outline of the agenda . .5 2 AI strategy and governance 8 2.1 Polarity and transition scenarios . .8 2.2 Commitment and transparency . .8 2.3 AI misalignment scenarios . 10 2.4 Other directions . 10 2.5 Potential downsides of research on cooperation failures . 11 3 Credibility 12 3.1 Commitment capabilities . 13 3.2 Open-source game theory . 13 4 Peaceful bargaining mechanisms 16 4.1 Rational crisis bargaining . 16 4.2 Surrogate goals . 18 5 Contemporary AI architectures 21 5.1 Learning to solve social dilemmas . 21 5.2 Multi-agent training . 24 5.3 Decision theory . 25 6 Humans in the loop 27 6.1 Behavioral game theory . 27 6.2 AI delegates . 28 7 Foundations of rational agency 30 7.1 Bounded decision theory . 30 7.2 Acausal reasoning . 31 8 Acknowledgements 35 1 1 Introduction Transformative artificial intelligence (TAI) may be a key factor in the long-run trajec- tory of civilization. A growing interdisciplinary community has begun to study how the development of TAI can be made safe and beneficial to sentient life (Bostrom, 2014; Russell et al., 2015; OpenAI, 2018; Ortega and Maini, 2018; Dafoe, 2018). We present a research agenda for advancing a critical component of this effort: preventing catastrophic failures of cooperation among TAI systems. By cooperation failures we refer to a broad class of potentially-catastrophic inefficiencies in interactions among TAI-enabled actors.