Universal Algorithmic Intelligence a Mathematical Top→Down Approach

Universal Algorithmic Intelligence A Mathematical Top!Down Approach Marcus Hutter Canberra, ACT, 0200, Australia http://www.hutter1.net/ ANU RSISE NICTA Marcus Hutter - 2 - Universal Arti¯cial Intelligence Abstract The dream of creating arti¯cial devices that reach or outperform human intelligence is many centuries old. Nowadays most research is more modest, focussing on solving narrower, speci¯c problems, associated with only some aspects of intelligence, like playing chess or natural language translation, either as a goal in itself or as a bottom-up approach. The dual top down approach investigates theories of general-purpose intelligent agents: the power of such theoretical agents, how to scale them down, and the involved key concepts. Necessary ingredients seem to be Occam's razor; Turing machines; Kolmogorov complexity; probability theory; Solomono® induction; Bayesian sequence prediction; minimum description length principle; agent framework; sequential decision theory; adaptive control theory; reinforcement learning; Levin search and extensions, which are all important subjects in their own right. From a mathematical point of view these concepts also seem to be su±cient. Marcus Hutter - 3 - Universal Arti¯cial Intelligence Contents ² What is (Arti¯cial) Intelligence. ² Philosophical Foundations. (Ockham, Epicurus, Induction) ² Mathematical Foundations. (Information, Complexity, Bayesian & Algorithmic Probability, Solomono® induction, Sequential Decision) ² Framework: Rational Agents. (in Known and Unknown Environments) ² AIXI: Universal Arti¯cial Intelligence. ² Approximations & Applications. (Universal Search, AIXItl, AI», MC-AIXI-CTW, ©MDP) ² Comparison to Other Approaches. ² Discussion. (Summary, Criticism, Questions, Next Steps, Literature) Marcus Hutter - 4 - Universal Arti¯cial Intelligence What is (Arti¯cial) Intelligence? Intelligence can have many faces ) formal de¯nition di±cult ² reasoning What is AI? Thinking Acting ² creativity ² association humanly Cognitive Turing test, ² generalization Science Behaviorism ² pattern recognition ² problem solving rationally Laws Doing the ² memorization Thought Right Thing ² planning ² achieving goals ² learning Collection of 70+ Defs of Intelligence ² optimization http://www.vetta.org/ ² self-preservation definitions-of-intelligence/ ² vision ² language processing ² classi¯cation Real world is nasty: partially unobservable, ² induction ² deduction uncertain, unknown, non-ergodic, reactive, ² ... vast, but luckily structured, ... Marcus Hutter - 5 - Universal Arti¯cial Intelligence Informal De¯nition of (Arti¯cial) Intelligence Intelligence measures an agent's ability to achieve goals in a wide range of environments. [S. Legg and M. Hutter] Emergent: Features such as the ability to learn and adapt, or to understand, are implicit in the above de¯nition as these capacities enable an agent to succeed in a wide range of environments. The science of Arti¯cial Intelligence is concerned with the construction of intelligent systems/artifacts/agents and their analysis. What next? Substantiate all terms above: agent, ability, utility, goal, success, learn, adapt, environment, ... === ===== Never trust experimenta theory if it is not supported by an experiment theory Marcus Hutter - 6 - Universal Arti¯cial Intelligence Induction!Prediction!Decision!Action Having or acquiring or learning or inducing a model of the environment an agent interacts with allows the agent to make predictions and utilize them in its decision process of ¯nding a good next action. Induction infers general models from speci¯c observations/facts/data, usually exhibiting regularities or properties or relations in the latter. Example Induction: Find a model of the world economy. Prediction: Use the model for predicting the future stock market. Decision: Decide whether to invest assets in stocks or bonds. Action: Trading large quantities of stocks influences the market. Marcus Hutter - 7 - Universal Arti¯cial Intelligence Science ¼ Induction ¼ Occam's Razor ² Grue Emerald Paradox: Hypothesis 1: All emeralds are green. Hypothesis 2: All emeralds found till y2020 are green, thereafter all emeralds are blue. ² Which hypothesis is more plausible? H1! Justi¯cation? ² Occam's razor: take simplest hypothesis consistent with data. is the most important principle in machine learning and science. Marcus Hutter - 8 - Universal Arti¯cial Intelligence Information Theory & Kolmogorov Complexity ² Quanti¯cation/interpretation of Occam's razor: ² Shortest description of object is best explanation. ² Shortest program for a string on a Turing machine T leads to best extrapolation=prediction. KT (x) = minfl(p): T (p) = xg p ² Prediction is best for a universal Turing machine U. Kolmogorov-complexity(x) = K(x) = KU (x) · KT (x) + cT ² Recent impressive applications: universal similarity metric "[LV08]. [Clustering by Compression, Cilibrasi & Vitanyi 2005] Marcus Hutter - 9 - Universal Arti¯cial Intelligence Bayesian Probability Theory Given (1): Models P (DjHi) for probability of observing data D, when Hi is true. Given (2): Prior probability over hypotheses P (Hi). Goal: Posterior probability P (HijD) of Hi, after having seen data D. Solution: P (DjHi) ¢ P (Hi) Bayes' rule: P (HijD) = P i P (DjHi) ¢ P (Hi) (1) Models P (DjHi) usually easy to describe (objective probabilities) (2) But Bayesian prob. theory does not tell us how to choose the prior P (Hi) (subjective probabilities) Marcus Hutter - 10 - Universal Arti¯cial Intelligence Algorithmic Probability Theory ² Epicurus: If more than one theory is consistent with the observations, keep all theories. ² ) uniform prior over all Hi? ² Re¯nement with Occam's razor quanti¯ed in terms of Kolmogorov complexity: ¡KT =U (Hi) P (Hi) := 2 ² Fixing T we have a complete theory for prediction. Problem: How to choose T . ² Choosing U we have a universal theory for prediction. Observation: Particular choice of U does not matter much. Problem: Incomputable. Marcus Hutter - 11 - Universal Arti¯cial Intelligence Inductive Inference & Universal Forecasting ² Solomono® combined Occam, Epicurus, Bayes, and Turing into one formal theory of sequential prediction. ² M(x) = probability that a universal Turing machine outputs x when provided with fair coin flips on the input tape. ² A posteriori probability of y given x is M(yjx) = M(xy)=M(x). ² Given x1; ::; xt¡1, the probability of xt is M(xtjx1:::xt¡1). ² Immediate \applications": - Weather forecasting: xt 2 fsun,raing. - Stock-market prediction: xt 2 fbear,bullg. - Continuing number sequences in an IQ test: xt 2 IN. ² Optimal universal inductive reasoning system! Marcus Hutter - 12 - Universal Arti¯cial Intelligence Sequential Decision Theory Setup: For t = 1; 2; 3; 4; ::: Given sequence x1; x2; :::; xt¡1 (1) predict/make decision yt, (2) observe xt, (3) su®er loss Loss(xt; yt), (4) t ! t + 1, goto (1) Goal: Minimize expected Loss. Greedy minimization of expected loss is optimal if: Important: Decision yt does not influence env. (future observations). Loss function is known. Problem: Expectation w.r.t. what? Solution: W.r.t. universal distribution M if true distr. is unknown. Marcus Hutter - 13 - Universal Arti¯cial Intelligence Example: Weather Forecasting Observation xt 2 X = fsunny, rainyg Decision yt 2 Y = fumbrella, sunglassesg Loss sunny rainy umbrella 0.1 0.3 sunglasses 0.0 1.0 Taking umbrella/sunglasses does not influence future weather (ignoring butterfly e®ect) Marcus Hutter - 14 - Universal Arti¯cial Intelligence Agent Model with Reward if actions/decisions a influence the environment q r1 j o1 r2 j o2 r3 j o3 r4 j o4 r5 j o5 r6 j o6 ... © HYH ©© H ©¼© H H Agent Environ- work tape ... work tape ... p ment q P ³1 PP ³³ PP ³³ PPq³³ a1 a2 a3 a4 a5 a6 ... Marcus Hutter - 15 - Universal Arti¯cial Intelligence Rational Agents in Known Environment ² Setup: Known deterministic or probabilistic environment ² Greedy maximization of reward r (=¡Loss) no longer optimal. Example: Chess ² Exploration versus exploitation problem. ) Agent has to be farsighted. ² Optimal solution: Maximize future (expected) reward sum, called value. ² Problem: Things drastically change if environment is unknown Marcus Hutter - 16 - Universal Arti¯cial Intelligence Rational Agents in Unknown Environment Additional problem: (probabilistic) environment unknown. Fields: reinforcement learning and adaptive control theory Bayesian approach: Mixture distribution. 1. What performance does Bayes-optimal policy imply? It does not necessarily imply self-optimization (Heaven&Hell example). 2. Computationally very hard problem. 3. Choice of horizon? Immortal agents are lazy. Universal Solomono® mixture ) universal agent AIXI. Represents a formal (math., non-comp.) solution to the AI problem? Most (all?) problems are easily phrased within AIXI. Marcus Hutter - 17 - Universal Arti¯cial Intelligence The AIXI Model in one Line complete & essentially unique & limit-computable X X X ¡`(q) AIXI: ak := arg max ::: max [rk + ::: + rm] 2 ak am okrk omrm q : U(q;a1::am)=o1r1::omrm action, reward, observation, Universal TM, qrogram, k=now AIXI is an elegant mathematical theory of AI Claim: AIXI is the most intelligent environmental independent, i.e. universally optimal, agent possible. Proof: For formalizations, quanti¯cations, and proofs, see [Hut05]. Potential Applications: Agents, Games, Optimization, Active Learning, Adaptive Control, Robots. Marcus Hutter - 18 - Universal Arti¯cial Intelligence Computational Issues: Universal Search ² Levin search: Fastest algorithm for inversion and optimization

Universal Algorithmic Intelligence a Mathematical Top→Down Approach

Details

Download

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

Support