arXiv:1808.03644v1 [cs.AI] 11 Aug 2018

Building Safer AGI by Introducing Artificial Stupidity

Michaël Trazzi¹, Roman V. Yampolskiy²
¹ Sorbonne Université, Paris, France
² Computer Engineering and Computer Science, Speed School of Engineering, University of Louisville

Abstract

Artificial Intelligence (AI) has achieved super-human performance in a broad variety of domains. We say that an AI is made Artificially Stupid on a task when some limitations are deliberately introduced to match a human's ability to do the task. An Artificial General Intelligence (AGI) can be made safer by limiting its computing power and memory, or by introducing Artificial Stupidity on certain tasks. We survey human intellectual limits and give recommendations for which limits to implement in order to build a safe AGI.

Introduction

The Turing Test [1] was designed to replace the philosophical question "Can machines think?" with a more pragmatic one: "Can digital computers imitate human behaviors in answering text questions?"

Since 1990, the Loebner Prize competition has awarded the AI with the most human-like behaviour in a five-minute Turing Test. This contest is controversial within the field of AI for encouraging low-quality interactions and simplistic chatbots [2].

Programmers force chatbots to make mistakes, such as typing errors, to be more human-like. Because computers achieve super-human performance in some tasks, such as arithmetic [1] or video games [3], their ability may need to be artificially constrained. Those deliberate mistakes are called Artificial Stupidity in the media [4] [5].

Solving the Turing Test problem in its general form implies building an AI capable of delivering human-like replies to any question of human interest. As such, it is known to be AI-Complete [6], because by solving this problem we would be able to solve most problems of interest in Artificial Intelligence by reformulating them as questions during a Turing Test.

To appear human, an AI will need to fully understand human limits and biases. Thus, the Turing Test can be used to test whether an AI is capable of understanding human stupidity.

By deliberately limiting an AI's ability to achieve a task, to better match humans' ability, an AI can be made safer, in the sense that its capabilities will not exceed humans' abilities by several orders of magnitude. Upper bounds on the number of operations per second performed by a human brain have been estimated [7] [8]. To obtain an AI that does not far exceed humans' abilities, for instance in arithmetic, the computing power allowed for mathematical capabilities must be artificially diminished. Besides, humans exhibit cognitive biases, which result in systematic errors in judgment and decision making [9]. In order to build a safe AGI, some of those biases may need to be replicated.

We will start by introducing the concept of Artificial Stupidity. Then, we will recommend limitations to build a safer AGI.

Artificial Stupidity

Passing the Turing Test

Programs that are deliberately simplistic perform better at Turing Test contests such as the Loebner Prize. The computer program A.L.I.C.E. (or Artificial Linguistic Internet Computer Entity) [10] won the Loebner Prize in 2000, 2001 and 2004, even though "there is no representation of knowledge, no common-sense reasoning, no inference engine to mimic human thought. Just a very long list of canned answers, from which it picks the best option" [11]. A.L.I.C.E. has an approach similar to ELIZA [12]: it identifies some relevant keywords and gives appropriate answers without learning anything about the interrogator [11].

A general trend for computer programs written for the Loebner Prize is to avoid being asked questions they cannot answer, by directing the conversation towards a simpler conversational context. For A.L.I.C.E. and ELIZA, that means focusing mainly on what has been said in the last few sentences (stateless context). Another example of an AI performing well at Turing Test contests is Eugene Goostman, which convinced 33% of the judges that it was human [13]. Goostman is portrayed as a thirteen-year-old Ukrainian boy who does not speak English well. Thus, Goostman makes typing mistakes and its interrogators are more inclined to forgive its grammatical errors or lack of general knowledge. Introducing deliberate mistakes, what we call Artificial Stupidity, is necessary to cover up an even greater gap in intelligence during a Turing Test.

Interacting with Humans

Outside those very specific Turing Test contests, Artificial Stupidity is being increasingly introduced to interact with humans. In "Artificial Stupidity: The Art of Intentional Mistakes" [3], Liden describes the design choices an AI programmer must make in the context of video games. He gives general principles that a Non-Player Character (NPC) must follow to make the game playable. For instance, NPCs must "move before firing" (e.g. by rolling when the player enters the room) so that the player has additional time to understand that a fight is happening, or "miss the first time" to indicate the direction of the attack without hurting the player. In video games, because computer programs can be much more capable than human beings (for instance because of their perfect aim in First Person Shooters (FPS)), developers force NPCs to make mistakes to make life easier for the human player.

This tendency to make AI deliberately stupid can be observed across multiple domains. For example, at Google I/O 2018, Sundar Pichai introduced Google Duplex, "a new technology for conducting natural conversations to carry out 'real world' tasks over the phone" [14]. In the demo, the Google Assistant used this Google Duplex technology to successfully make an appointment with a human. To that end, it used the interjection "uh" to imitate the space fillers humans use in day-to-day conversations. This interjection was not necessary to make the appointment. More precisely, when humans use this kind of space filler, it is a sign of poor communication skills. The developers introduced this type of Artificial Stupidity to make the call more fluid and more human-like.

Exploiting Human Vulnerabilities

When interacting with AI, humans want to fulfill some basic needs and desires. The AI can exploit these cravings by giving people what they want. Accordingly, the AI may want to appear vulnerable to make the human feel better about themselves. For instance, Liden [3] suggests giving NPCs "horrible aim", so that humans feel superior because they think they are dodging gunshots. Similarly, he encourages "kung-fu style" fights, where amongst dozens of enemies, only two are effectively attacking the player at each moment. Thus, the player feels powerful (because they believe they are fighting multiple enemies). Instead of unleashing the AI's full potential, the designers diminish its power to please the players.

The same mechanism applies to the computer program A.L.I.C.E. The program delivers a simple verbal behavior, and because "a great deal of our interactions with others involves verbal behavior, and many people are interested

Woebot, a Facebook chatbot developed by Stanford researchers, improves users' mental health through Cognitive Behavioral Therapy (CBT) [15]. In addition to CBT, Woebot uses two therapeutic process-oriented features. First, it provides empathic responses to the user, according to the mood the user reports. Second, content tailored to that mood is presented to the user [16]. The chatbot delivers specifically what will help the user best, without describing the details of the user's mental health. It gives the very simple answer the user wanted, masking the intelligence of its algorithm. Chatbots are designed to help humans by giving appropriate, simple responses. They are not designed to appear smarter than humans.

Avoiding Superintelligence requires Superintelligence

In "Computing Machinery and Intelligence" [1], Turing exposes common fallacies in arguments that a machine cannot pass the Turing Test. In particular, he explains why the belief that "the interrogator could distinguish the machine from the man simply by setting them a number of problems in arithmetic" because "the machine would be unmasked because of its deadly accuracy" is false. Indeed, the machine "would not attempt to give the right answers to the arithmetic problems. It would deliberately introduce mistakes in a manner calculated to confuse the interrogator." Thus, the machine would hide its super-human ability by giving a wrong answer, or simply by saying that it could not compute it. Similarly, in a video game, AI designers artificially make the AI not omniscient, so that it does not miraculously guess where each and every weapon of the game is located [3].

The general trend here is that AI tends to quickly achieve super-human performance after having achieved human-level performance. For instance, for the game of Go, in a few months, the state of the art went from strong amateur, to weak professional player, to super-human performance [17]. From that point onwards, to make the AI pass a Turing Test, or make it behave in a human-like way to satisfy human desires, AI designers must deliberately limit its capabilities.

We now take the point of view of algorithmic complexity. We call AI-problems the problems that can be solved (i.e. for which the correct output for a given input can be computed) by the union of all humans [6]. The Turing Test is said to be AI-complete [6], because it is itself an AI-problem and all AI-problems can be reduced to it via a polynomial-time reduction (by framing the problem as a question during a Turing Test).
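The definition of AI-completeness given above can be written compactly. As a sketch, write $\mathcal{AI}$ for the set of AI-problems and $\le_p$ for polynomial-time reducibility; both symbols are notation introduced here for illustration and are not taken from [6]:

```latex
\[
  C \text{ is AI-complete}
  \iff
  C \in \mathcal{AI}
  \ \text{ and } \
  \forall P \in \mathcal{AI},\ P \le_p C .
\]
```

Under this reading, the Turing Test plays the role of $C$: it belongs to $\mathcal{AI}$, and any other AI-problem $P$ reduces to it by posing an instance of $P$ as a question to the interrogated party.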
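Turing's remark that a machine would hide its arithmetic ability, either by giving a wrong answer or by saying it cannot compute the result, can be illustrated with a small sketch. The function name `humanlike_sum` and the parameters `max_digits` and `error_rate` are hypothetical choices made for this example; they are not values or methods proposed in the paper.

```python
import random


def humanlike_sum(a: int, b: int, rng: random.Random,
                  max_digits: int = 4, error_rate: float = 0.2) -> str:
    """Answer 'a + b' the way a limited, fallible human might.

    max_digits and error_rate are illustrative assumptions, not
    limits recommended by the paper.
    """
    # Decline operands too long for plausible mental arithmetic.
    if max(len(str(abs(a))), len(str(abs(b)))) > max_digits:
        return "I can't compute that in my head."

    answer = a + b
    # With probability error_rate, slip by one unit in a random decimal place.
    if rng.random() < error_rate:
        place = 10 ** rng.randrange(len(str(abs(answer))))
        answer += rng.choice((-1, 1)) * place
    return str(answer)


if __name__ == "__main__":
    rng = random.Random(0)  # seeded so that runs are repeatable
    print(humanlike_sum(12, 30, rng))
    print(humanlike_sum(34957, 70764, rng))
```

Passing the random generator in explicitly keeps the deliberate mistakes reproducible, which makes it possible to check that the limitation behaves as intended.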
