On How to Build a Moral Machine

Paul Bello
[email protected]
Human & Bioengineered Systems Division - Code 341, Office of Naval Research, 875 N. Randolph St., Arlington, VA 22203 USA

Selmer Bringsjord
[email protected]
Depts. of Cognitive Science, Computer Science & the Lally School of Management, Rensselaer Polytechnic Institute, Troy, NY 12180 USA

Abstract

Herein we make a plea to machine ethicists for the inclusion of constraints on their theories consistent with empirical data on human moral cognition. As philosophers, we clearly lack widely accepted solutions to issues regarding the existence of free will, the nature of persons, and firm conditions on moral agency/patienthood, all of which are indispensable concepts to be deployed by any machine able to make moral judgments. No agreement seems forthcoming on these matters, and we don’t hold out hope for machines that can both always do the right thing (on some general ethic) and produce explanations for their behavior that would be understandable to a human confederate. Our tentative solution involves understanding the folk concepts associated with our moral intuitions regarding these matters, and how they might be dependent upon the nature of human cognitive architecture. It is in this spirit that we begin to explore the complexities inherent in human moral judgment via computational theories of the human cognitive architecture, rather than under the extreme constraints imposed by rational-actor models assumed throughout much of the literature on philosophical ethics. After discussing the various advantages and challenges of taking this particular perspective on the development of artificial moral agents, we computationally explore a case study of human intuitions about the self and causal responsibility.