Is It Enough to Get the Behaviour Right? Hector J

Is it Enough to Get the Behaviour Right? Hector J. Levesque Dept. of Computer Science University of Toronto Toronto, Ontario Canada M5S 3A6 [email protected] Abstract to be enough. In short: if the behaviour in the long run is what it should be, we should be prepared to ascribe the same This paper deals with the relationship between in- mental qualities we would to a person. Searle, on the other telligent behaviour, on the one hand, and the men- hand, imagines himself in a room called the Chinese Room tal qualities needed to produce it, on the other. We where there is a large book. People give him messages writ- consider two well-known opposing positions on this ten in Chinese, which he does not understand. However, the issue: one due to Alan Turing and one due to John book in the room tells him what to do with this message, cul- Searle (via the Chinese Room). In particular, we ar- minating in him writing characters on a piece of paper, the gue against Searle, showing that his answer to the meaning of which he does not understand. He hands the pa- so-called System Reply does not work. The argu- per back outside the room, and the people there find these ment takes a novel form: we shift the debate to a dif- responses quite congenial, and in fact indistinguishable from ferent and more plausible room where the required those of a native Chinese speaker. But Searle does not know conversational behaviour is much easier to charac- Chinese. He is producing linguistic behaviour that is indistin- terize and to analyze. Despite being much simpler guishable from a native speaker’s without any understanding. than the Chinese Room, we show that the behaviour So getting the behaviour right does not justify the ascription there is still complex enough that it cannot be pro- of the mental qualities. duced without appropriate mental qualities. So Turing and Searle take opposite positions on the issue above. But one aspect that they both would agree on (one In this paper, we will consider the issue of the relationship imagines) is this: when we talk about getting the behaviour between external behaviours and mental qualities. The exter- right, and in a way that is indistinguishable from someone nal behaviours we have in mind are the linguistic responses with the appropriate mental qualities, we are not talking about in an intelligent conversation. The mental qualities we have doing so in some limited context. All parties agree (or would in mind are things like knowing how to speak a language, or likely agree) that it is possible to use trickery and other un- understanding what is being said, or even being intelligent interesting means to get the behaviour right in conversations (none of which we will need to distinguish for now). The that are restricted enough. For example, a conversant that fundamental issue we intend to investigate is this: says nothing but “I love the Yankees!” over and over might When can we justifiably draw conclusions about be producing conversation that is indistinguishable from that mental qualities like these, given external behaviour of fanatical baseball fan (possessing mental abilities beyond that is indistinguishable from that of a person? the four words, one still presumes), but nothing interesting In a sense, this question is not really part of AI. One definition follows from this. Similarly, a test that is limited in advance of AI is that it is “the study of intelligent behaviour achieved to a certain number of words may not be enough. What mat- through computational means” [Brachman and Levesque, ters to both Searle and Turing and what concerns us here are 2004]. From this point of view, AI research is about getting the conclusions that we would draw about the mental prop- the behaviour right and nothing more. The question above erties of a conversant given that the conversation is natural, goes beyond this and asks what conclusions we can draw cooperative, unrestricted, and as long as necessary. should we ever get the behaviour right. The Turing Test [Turing, 1950] and the Chinese Room 1 The AI perspective and the Systems Reply [Searle, 1980] are two thought experiments designed to help So who is correct here, Turing or Searle? Much of the debate us understand this issue. To recap the positions very briefly, within AI has not really attempted to answer the question. we have Turing who says (roughly) that the mental vocabu- Regarding the Turing Test, the main discussion has been lary above is too vague and open to interpretation to be worth on whether linguistic behaviour is enough, or whether a more arguing about. If we are unable to distinguish the responses comprehensive notion of behaviour would be a better test of an entity from that of a person in an unrestricted conver- [Harnad, 1989]. For example, we might want a notion of sation (in what Turing calls the Imitation Game), that ought behaviour that encompasses broader notions of perception 1439 and action in the world. There has also been discussion on start would say “The marks on the paper you will receive whether passing the Turing Test is a suitable long term goal will be a question or statement in Chinese. Use the rest of for AI research [Cohen, 2004]. This is especially germane this book to translate it into English. Then formulate a re- given the Loebner competition [Shieber, 1994], a restricted sponse, translate it back into Chinese using the rest of this version of the test that has attracted considerable publicity. book, print your Chinese response on the paper, and hand it By general consensus, the programs that do well in this com- back.” The rest of the book would be an elaborate English- petition do not tell us much about intelligence, for the reasons Chinese-English manual, with plenty of pictures of Chinese mentioned above. But they do tell us something about fooling characters, vocabulary, grammar, and examples of usage. A people. It appears to be more of a case like ELIZA [Weizen- small book might lead to stilted Chinese like that of a first- baum, 1966], where a program using very simple means was time tourist, perhaps. But a larger book, with extensive ex- able to fool people into believing they were conversing with amples of usage, ought to lead to fairly natural Chinese. a psychiatrist. All this simply reflects the fact that it is some- Of course, AI supporters get no comfort from a Type 2 times possible to simulate linguistic behaviour that has been book like this one. It suffices to teach Chinese, but it teaches limited in some way (like that of a Rogerian psychiatrist, or a Chinese as a second language. It tells us how to answer ques- fanatical baseball fan or, for that matter, a person with autism) tions about dogs, say, by connecting the Chinese symbol for by philosophically uninteresting means. None of this reflects dog to the English word “dog,” relying on the fact that we directly on the (unrestricted) Turing Test itself. already understand what the word “dog” means. In a sense, Regarding the Chinese Room, much of the discussion the challenge of AI is to come up with some sort of book for within AI has been to dismiss it (sometimes quite impatiently) Chinese as a first language. as concentrating on the wrong question: what matters is not I do not intend to argue from the philosophical armchair whether or not intelligent behaviour is evidence for mental about the prospects of AI eventually achieving this goal. The qualities, but rather how or even if the behaviour can be pro- intention here is more modest; I want to argue for this: duced at all, that is, the AI question. There are no Type 1 books for Chinese! When the former question is addressed, it is typically along the lines of the Systems Reply [Searle, 1980]. The argument Note that the truth of this claim will still be sufficient to refute is that although Searle himself does not know Chinese, the Searle’s answer to the Systems Reply: if there are no Type 1 system consisting of Searle together with the book does. For books for Chinese, then Searle’s assertion that he would not computer scientists, this is a natural notion: Searle is the ex- understand Chinese after learning the book is just wrong. ecutor of a program (written in the book), and although the But how could we possibly “prove” such a claim? Without executor does not have a certain ability, it can execute pro- knowing what a book for Chinese would need to be like, we grams that give it that ability. Searle had already anticipated are in no position to assess what it would be like to learn it. the Systems Reply and had a ready answer: He asks us to All we can do is wave our hands. This is maybe why direct extend the thought experiment somewhat, and imagine that philosophical arguments in the past about the Chinese Room he memorizes the contents of the book and then discards it. have been so stupendously unconvincing: your views about He still does not understand the Chinese, yet can generate the what such a book would have to be like may not coincide with same linguistic behaviour. But now there is no longer a sys- mine. To get beyond these obstacles and see the issues more tem consisting of him and the book; there is just him. He is clearly, we propose moving away from the Chinese Room to generating behaviour that is indistinguishable from that of a a related but simpler thought experiment.

Is It Enough to Get the Behaviour Right? Hector J

Much Has Been Written About the Turing Test in the Last Few Years, Some of It

Turing Test Does Not Work in Theory but in Practice

THE TURING TEST RELIES on a MISTAKE ABOUT the BRAIN For

The Turing Test and Other Design Detection Methodologies∗

A Review on Neural Turing Machine

The Future Just Arrived

(GPT-3) in Healthcare Delivery ✉ Diane M

Does the Turing Test Demonstrate Intelligence Or Not?

An Introduction to Bayesian Artificial Intelligence in Medicine

Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation

The Chinese Room: Just Say “No!” to Appear in the Proceedings of the 22Nd Annual Cognitive Science Society Conference, (2000), NJ: LEA

IEEE Paper Template in A4 (V1)