Reinforcement Learning: Chap1
This excerpt from Reinforcement Learning. Richard S. Sutton and Andrew G. Barto. © 1998 The MIT Press. is provided in screen-viewable form for personal use only by members of MIT CogNet. Unauthorized use or dissemination of this information is expressly forbidden. If you have any questions about this material, please contact [email protected]. 1 Introduction
The idea that we learn by interacting with our environmentis probably the first to occur to us when we think about the natureof learning. When an infant plays, wavesits arms, or looks about, it has no explicit teacher, but it doeshave a direct sensorimotorconnection to its environment. Exercisingthis connectionproduces a wealthof informationabout cause and effect , aboutthe consequencesof actions, and about what to do in order to achievegoals . Throughoutour lives, such interactions are undoubtedlya major sourceof knowledgeabout our environmentand ourselves. Whether we are learning to drive a car or to hold a conversation, we are acutely awareof how our environmentresponds to what we do, and we seekto influence what happensthrough our behavior. Learningfrom interactionis a foundationalidea underlyingnearly all theoriesof learningand intelligence. In this book we explore a computationalapproach to learning from interaction. - Ratherthan directly theorizing abouthow peopleor animalslearn , we explore ide alized learningsituations and evaluate the effectivenessof variouslearning methods . That is, we adoptthe perspectiveof an artificial intelligenceresearcher or engineer. We explore designsfor machinesthat are effective in solving learning problemsof scientificor economicinterest , evaluatingthe designsthrough mathematical analysis or computationalexperiments . The approachwe explore, called reinforcementlearning , is much more focusedon goal-directedlearning from interactionthan are other approach es to machinelearning .