Introduction
Total Page:16
File Type:pdf, Size:1020Kb
15-780: Graduate AI Introduction Instructors: Tuomas Sandholm and Manuela Veloso TAs: John Dickerson and Prateek Tandon http://www.cs.cmu.edu/~15780 Carnegie Mellon University Evaluation n 4 Homeworks 40% n Midterm Test 20% n Final Project 30% n Class participation 10% n Homeworks are done individually n 8 total “mercy” days (no penalty) for late homeworks - cannot use more than 3 mercy days in a single homework. No credit for late homeworks with no mercy days. n Final projects can be done in groups of up to 2 people 1 Resources n Lectures n Presentation and discussion in class n Lecture slides rd n Textbook: Russell and Norvig, 3 edition n Instructors – office hours by appointment n TAs – office hours will be announced More Information n TAs will hold recitations as needed n Doodle poll for best time n Audits n Register by fill out audit form n Must do final project n Waitlist n Let us know if you are on it n Mailing list n 15780students @ cs.cmu.edu n We will send a test message 2 What is Artificial Intelligence? What is “intelligence” ? Can we emulate intelligent behavior in machines ? How far can we take it ? Time Line (NYTimes 2011) 3 n Performed for 84 years n Defeated Napoleon and Franklin n Amazon Mechanical Turk: “artificial artificial intelligence” 7 Amazon Mechanical Turk 8 4 n Can a general machine think? n Can machines “pass” the imitation game? n Turing Test: Evaluator communicates via text channel with computer and human, must reliably identify the computer 9 Views of AI Think like humans Think rationally Cognitive Science Formalize inference into laws of thought Act like humans Act rationally Turing test Act according to laws 5 The Dartmouth Conference “We propose that a two-month, ten-man study of artificial intelligence be carried out during the summer of 1956 at Dartmouth College in Hanover, NH. The study is to proceed on the basis of the conjecture that every aspect of learning or any other feature of intelligence can in principle be so precisely described that a machine can be made to simulate it.” The Proponents n John McCarthy, assistant professor of mathematics at Dartmouth (2011) n Marvin Minsky, Harvard junior fellow in mathematics and neurology (MIT) n Nathaniel Rochester, manager of information research at IBM, NY (2001) n Claude Shannon, information theory, mathematician at Bell Labs (2001) 6 The Invited n Trenchard More, IBM n Arthur Samuel, IBM n Oliver Selfridge, Lincoln Labs, MIT n Ray Solomoff, MIT And “two vaguely known persons from RAND and Carnegie Tech… a significant afterthought.” (Pamela McCorduck, “Machines Who Think”, page 94) Herbert A. Simon and Allen Newell 7 Computer Chess at CMU n A long history n HiTech – Hans Berliner, Murray Campbell n ChipTest – Feng-Hsiung Hsu n Deep Thought – Hsu, Campbell, Anantharaman n Deep Blue – CMU team hired by IBM Research 8 Problem Solving n Allen Newell and Herb Simon – 1950s n Given: n an initial state n a set of actions n a goal statement n Search for a plan, a sequence of actions that transform the initial state into a state where the goal is satisfied 9 Intelligent Systems Three key steps (Craik, 1943): 1. the stimulus must be translated into an internal representation 2. the representation is manipulated by cognitive processes to derive new internal representations 3. internal representations are translated into action “agent” perception cognition action 10 Intelligent Agents n Sensing: vision, hearing, touch, smell n Cognition: think, reason, plan, learn n Action: motion, speech, manipulation 11 Perception – Sensors to State n Sensors – “signal” (data) collectors from the physical world: n Vision, sound, touch, sonar, laser, infrared, GPS, temperature,…. n Signal-to-symbol challenge: n Recognize the state of the environment n …wall at 2m… door on the left… green light… person in front… personX entering the room… ball at 1m and 30o East… Learning n Automatically generate strategies to classify or predict from training examples Classification: Is the Training data: Example object present in the images of object input image, yes/no? 12 Learning n Automatically generate strategies to classify or predict from training examples Mpg good/bad Predict mpg on new data Training data: good/bad mpg for example cars Web Search 13 Statistical Learning Recommender Systems 14 Question Answering – Watson! Multiagent Systems and Learning n How can an agent learn from experience in a world that contains other agents too ? n Other agents’ learning makes the world nonstationary for the former agent n Games n Learn to play Nash equilibrium n Learn to play optimally against static opponents 15 Games n Multiple agents maybe competing or cooperating to achieve a task n Capabilities for finding strategies, equilibrium between agents, auctioning, bargaining, negotiating. n Business n E-commerce n Multi-robot systems n Investment management n ….. Kidney Exchange n In US, ≥ 50,000/yr are diagnosed with lethal kidney disease n Cure = transplantation, but cadaver kidney have a long waiting list (2-5 yrs) n Potential donors may be incompatible with patient 32 16 Kidney Exchange Pair 1 Pair 2 Patient 1 Patient 2 Donor 1 Donor 2 Kidneys usually travel inside donors Optimizing kidney exchange n Cycle cover = optimize matched vertices under cycles 1 2 3 4 of length ≤ L n Problem is NP-hard 5 6 34 17 RoboCup n Robot soccer competition and symposium n Challenge: By 2050, a team of fully autonomous humanoid robot soccer players shall win the soccer game, complying with the official rule of the FIFA, against the winner of the most recent World Cup n Pioneering multi-robot domain, since the early 90s, adversarial, with fully autonomous robots n www.robocup.org 35 CoBot – Collaborave Robot n Tasks n Go to a locaon n Deliver a message n Escort a visitor n Transport object between locaons n Semi-autonomous telepresence n Visitor companion n Deliver mail n Tour guide 18 Autonomy but Limitaons n Perceptual n No universal object recognizer, object finder n No universal speech understanding n Cognive n No complete certainty of posi+on n No certainty about all decisions, what to learns n No stac op+mal knowledge representaon n Actuaon n No ability to manipulate all objects n No constant mobility in all surfaces Symbio+c Autonomy Autonomous Robots Tasked Autonomous Robots Tasked ““To DO MORETo DO MORE”” than than what they actually what they actually ““CAN DOCAN DO””: : Intelligent ProacAve Intelligent ProacAve ““Ask for HelpAsk for Help”” 19 AI of the 21st century n The integration of MANY pieces n Use of unlimited resources n Own built-in and compiled knowledge n Search, infer, process information, remember, adapt n All external digital partners (sensors, robots, cell phones, toasters, vaccuum cleaners, cars, roads) n Digital talk, digital translate n The Web n Mine human-understandable data n People n Ask, learn from, be taught, observe, copy Main Technical Threads n Problem / knowledge representation n Search, optimization, constraint thinking n Reasoning under uncertainty n Modeling and planning n Multiple agents, complex decision making n Hidden information n Learning 20 Part I – No Uncertainty n Symbolic knowledge representation n Pure search n Deterministic planning Part II – Uncertainty n Probabilistic knowledge representation n Planning under uncertainty n Learning n Other agents 21 Schedule n (2) Jan 14 – Jan 16 – Introduction and agents n Knowledge representation – inference n (8) Jan 21 – Feb 13 – Search n Uninformed – SAT, CSP n Informed – Heuristics, MIP n (2) Feb 18 – Feb 20 – Planning n Classical planning and path planning n (3) Feb 25 – Mar 4 – Probabilistic reasoning n Representation n MDPs/POMDPs n (1) Wednesday, March 6 – Midterm Exam Schedule n (2) Mar 18 – Mar 20 – Learning n Supervised and RL n (2) Mar 25 – Mar 27 – Introduction to games n Representation, solving, complexity n (4) Apr 1 – Apr 10 – Multi-robot systems n Adversarial, playbook planning and learning n Learning from demonstration, reuse, correction n (5) Apr 15 – Apr 29 – Game solving n Normal-form games, sequential complete- and incomplete-information games n (1) May 1 – Discussion, open problems 22 .