Logic and Probability Lecture 1: Probability As Logic

Total Page:16

File Type:pdf, Size:1020Kb

Logic and Probability Lecture 1: Probability As Logic Logic and Probability Lecture 1: Probability as Logic Wesley Holliday & Thomas Icard UC Berkeley & Stanford August 11, 2014 ESSLLI, T¨ubingen Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 1 Overview Logic as a theory of: I truth-preserving inference I proof / deduction I consistency I rationality I definability ... Probability as a theory of: I uncertain inference I induction I learning I rationality I information ... Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 2 Some questions and points of contact: I In what ways might probability be said to extend logic? I How do probability and various logical systems differ on what they say about rational inference? I Can logic be used to gain a better understanding of probability? Say, through standard techniques of formalization, definability, questions of completeness, complexity, etc.? I Can the respective advantages of logical representation and probabilistic learning and inference be combined to create more powerful reasoning systems? Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 3 Course Outline I Day 1: Probability as (Extended) Logic I Day 2: Probability, Nonmonotonicity, and Graphical Models I Day 3: Beyond Boolean Logic I Day 4: Qualitative Probability I Day 5: Logical Dynamics Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 4 Interpretations of Probability I Frequentist: Probabilities are about `limiting frequencies' of events. I Propensity: Probabilities are about physical dispositions, or propensities, of events. I Logical: Probabilities are determined objectively via a logical language and some additional principles, e.g., of `symmetry'. I Bayesian: Probabilities are subjective and reflect an agent's degree of confidence concerning some event. We will mostly remain neutral, focusing on mathematical questions, but will note when the interpretation is critical. Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 5 A measurable space is a pair (W , E) with I W is an arbitrary set I E is a s-algebra over W , i.e., a subset of }(W ) closed under complement and infinite union. A probability space is a triple (W , E, m), with (W , E) a measurable space and m : E! [0, 1] a measure function satisfying: I m(W ) = 1 ; I m(E [ F ) = m(E ) + m(F ), whenever E \ F = Æ . A random variable from (W , E, m) to some other space (V , F) is a function X : W ! V such that: whenever X (E ) = F and F 2 F, we have E 2 E . Then m induces a natural distribution for X , typically written P(X = v) = m(fw 2 W : X (w) = vg) . Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 6 Suppose we have a propositional logical language L generated as follows j ::= A j B j ... j j ^ j j :j We can define a probability P : L! [0, 1] directly on L, requiring I P(j) = 1, for any tautology j ; I P(j _ y) = P(j) + P(y), whenever :(j ^ y) . Equivalent set of requirements: I P(j) = 1 for any tautology ; I P(j) ≤ P(y) whenever j ! y ; I P(j) = P(j ^ y) + P(j ^ :y) . Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 7 It is then easy to show: I P(j) = 0, for any contradiction j ; I P(:j) = 1 − P(j) ; I P(j _ y) = P(j) + P(y) − P(j ^ y) ; I A propositional valuation sending atoms to 1 or 0 is a special case of a probability function ; I A probability on L gives rise to a standard probability measure over `world-states', i.e., maximally consistent sets of formulas from L . Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 8 With this setup we can easily define, e.g., conditional probability: P(j ^ y) P(jjy) = , P(y) provided P(y) > 0. With this it is easy to prove Bayes' Theorem: Theorem (Bayes) P(yjj)P(j) P(jjy) = . P(y) Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 9 Today's Theme: Probability as Logic Three senses in which probability has been claimed as part of logic: 1. Logical Deduction and Probability Preservation 2. Consistency and Fair Odds 3. Patterns of Plausible Reasoning Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 10 PART I: Logical Deduction and Probability Preservation Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 11 Logical Deduction and Probability Preservation Suppose we can show G ` j in propositional logic, but the assumptions g 2 G themselves are uncertain. Knowing about the probabilities of formulas in G, what can we conclude about the probability of j? Example (Adams & Levine) A telephone survey is made of 1,000 people as to their political party affiliation. Supposing 629 people report being Democrat, with some possibility of error in each case. What can we say about the probability of the following statement? At least 600 people are Democrats. Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 12 Logical Deduction and Probability Preservation Proposition (Suppes 1966) If G ` j, then for all P we have P(:j) ≤ ∑g2G P(:g). Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 13 Logical Deduction and Probability Preservation Example (Kyburg's 1965 `Lottery Paradox') Imagine a fair lottery with n tickets. Let g1, ... , gn be sentences: gi : ticket i is the winner. n−1 Each sentence :gi might seem reasonable, since it has probability n . However, putting these all together rapidly decreases probability. Indeed, ^ P :gi = 0 . i≤n This demonstrates that the (logical) conclusion of all these individually reasonable premises|that none of the tickets will win the lottery|has probability 0. Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 14 Logical Deduction and Probability Preservation Definition (Suppes Entailment) G s j, iff for all P: P(:j) ≤ ∑ P(:g) . g2G Definition (Adams Entailment) G a j, iff for all e > 0, there is a d > 0, such that for all P: if P(:g) < d for all g 2 G, then P(:j) < e . Lemma G s j, iff G a j. Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 15 Logical Deduction and Probability Preservation Theorem (Adams 1966) Classical logic is sound and complete for Suppes/Adams entailment. Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 16 Logical Deduction and Probability Preservation The following are equivalent: (a) G ` j (b) G j (c) G s j (d) G a j . Proof. (a) , (b) is just standard completeness. Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 17 Logical Deduction and Probability Preservation The following are equivalent: (a) G ` j (b) G j (c) G s j (d) G a j . Proof. For (b) ) (c), suppose G 2s j, so there is a P, such that 1 − P(j) > ∑ P(:g) . g2G From this it follows that 1 > ∑ P(:g) + P(j) . g2G Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 18 Logical Deduction and Probability Preservation The following are equivalent: (a) G ` j (b) G j (c) G s j (d) G a j . Proof. For (b) ) (c), suppose G 2s j, so there is a P, such that 1 − P(j) > ∑ P(:g) . g2G From this it follows that 1 > ∑ P(:g) + P(j) ≥ P(g1 ^ · · · ^ gn ! j) , g2G for any subset fg1, ... , gng ⊆ G. Since all tautologies must have probability 1, no such implication g1 ^ · · · ^ gn ! j is a tautology. Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 19 Logical Deduction and Probability Preservation The following are equivalent: (a) G ` j (b) G j (c) G s j (d) G a j . Proof. For (c) ) (d), suppose G s j, and e > 0. If G is finite, choose e d = . jGj Then we have P(:j) ≤ ∑ P(:g) < ∑ d = e . g g Hence P(:j) < e, and G a j. The case of G infinite is left as exercise. Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 20 Logical Deduction and Probability Preservation The following are equivalent: (a) G ` j (b) G j (c) G s j (d) G a j . Proof. Finally, for (d) ) (b), suppose G 2 j. For any e > 0, we can simply take a valuation witnessing G 2 j, which is itself a probability function P making P(j) = 0 and P(g) = 1, for all g 2 G. Hence G 2a j. (Note that essentially the same proof works to show (c) ) (b).) Thus we have shown (a) , (b) ) (c) ) (d) ) (b). a Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 21 Logical Deduction and Probability Preservation Example In the telephone survey example, suppose each person's statement has probability 0.999. Then the best upper bound we can get on probability for the negation of the conclusion is 629 ∗ 0.001 = 0.629 . That is, our best lower bound on the probability that there are at least 600 Democrats is 0.371. This seems weak. Wesley Holliday & Thomas Icard: Logic and Probability, Lecture 1: Probability as Logic 22 Logical Deduction and Probability Preservation Definition 0 0 Where G j, we say G ⊆ G is essential if G − G 2 j. Definition Given G j, the degree of essentialness of g 2 G is given by 1 E (g) = , jSgj where jSgj is the size of the smallest essential set containing g.
Recommended publications
  • Creating Modern Probability. Its Mathematics, Physics and Philosophy in Historical Perspective
    HM 23 REVIEWS 203 The reviewer hopes this book will be widely read and enjoyed, and that it will be followed by other volumes telling even more of the fascinating story of Soviet mathematics. It should also be followed in a few years by an update, so that we can know if this great accumulation of talent will have survived the economic and political crisis that is just now robbing it of many of its most brilliant stars (see the article, ``To guard the future of Soviet mathematics,'' by A. M. Vershik, O. Ya. Viro, and L. A. Bokut' in Vol. 14 (1992) of The Mathematical Intelligencer). Creating Modern Probability. Its Mathematics, Physics and Philosophy in Historical Perspective. By Jan von Plato. Cambridge/New York/Melbourne (Cambridge Univ. Press). 1994. 323 pp. View metadata, citation and similar papers at core.ac.uk brought to you by CORE Reviewed by THOMAS HOCHKIRCHEN* provided by Elsevier - Publisher Connector Fachbereich Mathematik, Bergische UniversitaÈt Wuppertal, 42097 Wuppertal, Germany Aside from the role probabilistic concepts play in modern science, the history of the axiomatic foundation of probability theory is interesting from at least two more points of view. Probability as it is understood nowadays, probability in the sense of Kolmogorov (see [3]), is not easy to grasp, since the de®nition of probability as a normalized measure on a s-algebra of ``events'' is not a very obvious one. Furthermore, the discussion of different concepts of probability might help in under- standing the philosophy and role of ``applied mathematics.'' So the exploration of the creation of axiomatic probability should be interesting not only for historians of science but also for people concerned with didactics of mathematics and for those concerned with philosophical questions.
    [Show full text]
  • There Is No Pure Empirical Reasoning
    There Is No Pure Empirical Reasoning 1. Empiricism and the Question of Empirical Reasons Empiricism may be defined as the view there is no a priori justification for any synthetic claim. Critics object that empiricism cannot account for all the kinds of knowledge we seem to possess, such as moral knowledge, metaphysical knowledge, mathematical knowledge, and modal knowledge.1 In some cases, empiricists try to account for these types of knowledge; in other cases, they shrug off the objections, happily concluding, for example, that there is no moral knowledge, or that there is no metaphysical knowledge.2 But empiricism cannot shrug off just any type of knowledge; to be minimally plausible, empiricism must, for example, at least be able to account for paradigm instances of empirical knowledge, including especially scientific knowledge. Empirical knowledge can be divided into three categories: (a) knowledge by direct observation; (b) knowledge that is deductively inferred from observations; and (c) knowledge that is non-deductively inferred from observations, including knowledge arrived at by induction and inference to the best explanation. Category (c) includes all scientific knowledge. This category is of particular import to empiricists, many of whom take scientific knowledge as a sort of paradigm for knowledge in general; indeed, this forms a central source of motivation for empiricism.3 Thus, if there is any kind of knowledge that empiricists need to be able to account for, it is knowledge of type (c). I use the term “empirical reasoning” to refer to the reasoning involved in acquiring this type of knowledge – that is, to any instance of reasoning in which (i) the premises are justified directly by observation, (ii) the reasoning is non- deductive, and (iii) the reasoning provides adequate justification for the conclusion.
    [Show full text]
  • The Interpretation of Probability: Still an Open Issue? 1
    philosophies Article The Interpretation of Probability: Still an Open Issue? 1 Maria Carla Galavotti Department of Philosophy and Communication, University of Bologna, Via Zamboni 38, 40126 Bologna, Italy; [email protected] Received: 19 July 2017; Accepted: 19 August 2017; Published: 29 August 2017 Abstract: Probability as understood today, namely as a quantitative notion expressible by means of a function ranging in the interval between 0–1, took shape in the mid-17th century, and presents both a mathematical and a philosophical aspect. Of these two sides, the second is by far the most controversial, and fuels a heated debate, still ongoing. After a short historical sketch of the birth and developments of probability, its major interpretations are outlined, by referring to the work of their most prominent representatives. The final section addresses the question of whether any of such interpretations can presently be considered predominant, which is answered in the negative. Keywords: probability; classical theory; frequentism; logicism; subjectivism; propensity 1. A Long Story Made Short Probability, taken as a quantitative notion whose value ranges in the interval between 0 and 1, emerged around the middle of the 17th century thanks to the work of two leading French mathematicians: Blaise Pascal and Pierre Fermat. According to a well-known anecdote: “a problem about games of chance proposed to an austere Jansenist by a man of the world was the origin of the calculus of probabilities”2. The ‘man of the world’ was the French gentleman Chevalier de Méré, a conspicuous figure at the court of Louis XIV, who asked Pascal—the ‘austere Jansenist’—the solution to some questions regarding gambling, such as how many dice tosses are needed to have a fair chance to obtain a double-six, or how the players should divide the stakes if a game is interrupted.
    [Show full text]
  • 1 Stochastic Processes and Their Classification
    1 1 STOCHASTIC PROCESSES AND THEIR CLASSIFICATION 1.1 DEFINITION AND EXAMPLES Definition 1. Stochastic process or random process is a collection of random variables ordered by an index set. ☛ Example 1. Random variables X0;X1;X2;::: form a stochastic process ordered by the discrete index set f0; 1; 2;::: g: Notation: fXn : n = 0; 1; 2;::: g: ☛ Example 2. Stochastic process fYt : t ¸ 0g: with continuous index set ft : t ¸ 0g: The indices n and t are often referred to as "time", so that Xn is a descrete-time process and Yt is a continuous-time process. Convention: the index set of a stochastic process is always infinite. The range (possible values) of the random variables in a stochastic process is called the state space of the process. We consider both discrete-state and continuous-state processes. Further examples: ☛ Example 3. fXn : n = 0; 1; 2;::: g; where the state space of Xn is f0; 1; 2; 3; 4g representing which of four types of transactions a person submits to an on-line data- base service, and time n corresponds to the number of transactions submitted. ☛ Example 4. fXn : n = 0; 1; 2;::: g; where the state space of Xn is f1; 2g re- presenting whether an electronic component is acceptable or defective, and time n corresponds to the number of components produced. ☛ Example 5. fYt : t ¸ 0g; where the state space of Yt is f0; 1; 2;::: g representing the number of accidents that have occurred at an intersection, and time t corresponds to weeks. ☛ Example 6. fYt : t ¸ 0g; where the state space of Yt is f0; 1; 2; : : : ; sg representing the number of copies of a software product in inventory, and time t corresponds to days.
    [Show full text]
  • Probability and Logic
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elsevier - Publisher Connector Journal of Applied Logic 1 (2003) 151–165 www.elsevier.com/locate/jal Probability and logic Colin Howson London School of Economics, Houghton Street, London WC2A 2AE, UK Abstract The paper is an attempt to show that the formalism of subjective probability has a logical in- terpretation of the sort proposed by Frank Ramsey: as a complete set of constraints for consistent distributions of partial belief. Though Ramsey proposed this view, he did not actually establish it in a way that showed an authentically logical character for the probability axioms (he started the current fashion for generating probabilities from suitably constrained preferences over uncertain options). Other people have also sought to provide the probability calculus with a logical character, though also unsuccessfully. The present paper gives a completeness and soundness theorem supporting a logical interpretation: the syntax is the probability axioms, and the semantics is that of fairness (for bets). 2003 Elsevier B.V. All rights reserved. Keywords: Probability; Logic; Fair betting quotients; Soundness; Completeness 1. Anticipations The connection between deductive logic and probability, at any rate epistemic probabil- ity, has been the subject of exploration and controversy for a long time: over three centuries, in fact. Both disciplines specify rules of valid non-domain-specific reasoning, and it would seem therefore a reasonable question why one should be distinguished as logic and the other not. I will argue that there is no good reason for this difference, and that both are indeed equally logic.
    [Show full text]
  • A FIRST COURSE in PROBABILITY This Page Intentionally Left Blank a FIRST COURSE in PROBABILITY
    A FIRST COURSE IN PROBABILITY This page intentionally left blank A FIRST COURSE IN PROBABILITY Eighth Edition Sheldon Ross University of Southern California Upper Saddle River, New Jersey 07458 Library of Congress Cataloging-in-Publication Data Ross, Sheldon M. A first course in probability / Sheldon Ross. — 8th ed. p. cm. Includes bibliographical references and index. ISBN-13: 978-0-13-603313-4 ISBN-10: 0-13-603313-X 1. Probabilities—Textbooks. I. Title. QA273.R83 2010 519.2—dc22 2008033720 Editor in Chief, Mathematics and Statistics: Deirdre Lynch Senior Project Editor: Rachel S. Reeve Assistant Editor: Christina Lepre Editorial Assistant: Dana Jones Project Manager: Robert S. Merenoff Associate Managing Editor: Bayani Mendoza de Leon Senior Managing Editor: Linda Mihatov Behrens Senior Operations Supervisor: Diane Peirano Marketing Assistant: Kathleen DeChavez Creative Director: Jayne Conte Art Director/Designer: Bruce Kenselaar AV Project Manager: Thomas Benfatti Compositor: Integra Software Services Pvt. Ltd, Pondicherry, India Cover Image Credit: Getty Images, Inc. © 2010, 2006, 2002, 1998, 1994, 1988, 1984, 1976 by Pearson Education, Inc., Pearson Prentice Hall Pearson Education, Inc. Upper Saddle River, NJ 07458 All rights reserved. No part of this book may be reproduced, in any form or by any means, without permission in writing from the publisher. Pearson Prentice Hall™ is a trademark of Pearson Education, Inc. Printed in the United States of America 10987654321 ISBN-13: 978-0-13-603313-4 ISBN-10: 0-13-603313-X Pearson Education, Ltd., London Pearson Education Australia PTY. Limited, Sydney Pearson Education Singapore, Pte. Ltd Pearson Education North Asia Ltd, Hong Kong Pearson Education Canada, Ltd., Toronto Pearson Educacion´ de Mexico, S.A.
    [Show full text]
  • History of Probability (Part 4) - Inverse Probability and the Determination of Causes of Observed Events
    History of Probability (Part 4) - Inverse probability and the determination of causes of observed events. Thomas Bayes (c1702-1761) . By the early 1700s the work of Pascal, Fermat, and Huygens was well known, mainly couched in terms of odds and fair bets in gambling. Jacob Bernoulli and Abraham DeMoivre had made efforts to broaden the scope and interpretation of probability. Bernoulli had put probability measures on a scale between zero and one and DeMoivre had defined probability as a fraction of chances. But then there was a 50 year lull in further development in probability theory. This is surprising, but the “brains” of the day were busy applying the newly invented calculus to problems of physics – especially astronomy. At the beginning of the 18 th century a probability exercise like the following one was not too difficult. Suppose there are two boxes with white and black balls. Box A has 4 white and 1 black, box B has 2 white and 3 black. You pick one box at random, with probability 1/3 it will be Box A and 2/3 it will be B. Then you randomly pick one ball from the box. What is the probability you will end up with a white ball? We can model this scenario by a tree diagram. We also have the advantage of efficient symbolism. Expressions like = had not been developed by 1700. The idea of the probability of a specific event was just becoming useful in addition to the older emphasis on odds of one outcome versus another. Using the multiplication rule for independent events together with the addition rule, we find that the probability of picking a white ball is 8/15.
    [Show full text]
  • Unifying Logic and Probability
    review articles DOI:10.1145/2699411 those arguments: so one can write rules Open-universe probability models show merit about On(p, c, x, y, t) (piece p of color c is on square x, y at move t) without filling in unifying efforts. in each specific value for c, p, x, y, and t. Modern AI research has addressed BY STUART RUSSELL another important property of the real world—pervasive uncertainty about both its state and its dynamics—using probability theory. A key step was Pearl’s devel opment of Bayesian networks, Unifying which provided the beginnings of a formal language for probability mod- els and enabled rapid progress in rea- soning, learning, vision, and language Logic and understanding. The expressive power of Bayes nets is, however, limited. They assume a fixed set of variables, each taking a value from a fixed range; Probability thus, they are a propositional formal- ism, like Boolean circuits. The rules of chess and of many other domains are beyond them. What happened next, of course, is that classical AI researchers noticed the perva- sive uncertainty, while modern AI research- ers noticed, or remembered, that the world has things in it. Both traditions arrived at the same place: the world is uncertain and it has things in it. To deal with this, we have to unify logic and probability. But how? Even the meaning of such a goal is unclear. Early attempts by Leibniz, Bernoulli, De Morgan, Boole, Peirce, Keynes, and Carnap (surveyed PERHAPS THE MOST enduring idea from the early by Hailperin12 and Howson14) involved days of AI is that of a declarative system reasoning attaching probabilities to logical sen- over explicitly represented knowledge with a general tences.
    [Show full text]
  • Philosophy of Probability
    Philosophy of Probability Aidan Lyon Abstract In the philosophy of probability there are two central questions we are concerned with. The first is: what is the correct formal theory of proba- bility? Orthodoxy has it that Kolmogorov’s axioms are the correct axioms of probability. However, we shall see that there are good reasons to con- sider alternative axiom systems. The second central question is: what do probability statements mean? Are probabilities “out there”, in the world as frequencies, propensities, or some other objective feature of reality, or are probabilities “in the head”, as subjective degrees of belief? We will sur- vey some of the answers that philosophers, mathematicians and physicists have given to these questions. 1. Introduction The famous mathematician Henri Poincaré once wrote of the probability calcu- lus: “if this calculus be condemned, then the whole of the sciences must also be condemned” (Poincaré [1902], p. 186). Indeed, every branch of science makes extensive use of probability in some form or other. Quantum mechanics is well–known for making heavy use of probability. The second law of thermodynamics is a statistical law and, for- mulated one way, states that the entropy of a closed system is most likely to increase. In statistical mechanics, a probability distribution known as the micro–canonical distribution is used to make predictions concerning the macro- properties of gases. In evolutionary theory, the concept of fitness is often de- fined in terms of a probability function (one such definition says that fitness is expected number of offspring). Probability also plays central roles in natural se- lection, drift, and macro-evolutionary models.
    [Show full text]
  • A Brief Overview of Probability Theory in Data Science by Geert
    A brief overview of probability theory in data science Geert Verdoolaege 1Department of Applied Physics, Ghent University, Ghent, Belgium 2Laboratory for Plasma Physics, Royal Military Academy (LPP–ERM/KMS), Brussels, Belgium Tutorial 3rd IAEA Technical Meeting on Fusion Data Processing, Validation and Analysis, 27-05-2019 Overview 1 Origins of probability 2 Frequentist methods and statistics 3 Principles of Bayesian probability theory 4 Monte Carlo computational methods 5 Applications Classification Regression analysis 6 Conclusions and references 2 Overview 1 Origins of probability 2 Frequentist methods and statistics 3 Principles of Bayesian probability theory 4 Monte Carlo computational methods 5 Applications Classification Regression analysis 6 Conclusions and references 3 Early history of probability Earliest traces in Western civilization: Jewish writings, Aristotle Notion of probability in law, based on evidence Usage in finance Usage and demonstration in gambling 4 Middle Ages World is knowable but uncertainty due to human ignorance William of Ockham: Ockham’s razor Probabilis: a supposedly ‘provable’ opinion Counting of authorities Later: degree of truth, a scale Quantification: Law, faith ! Bayesian notion Gaming ! frequentist notion 5 Quantification 17th century: Pascal, Fermat, Huygens Comparative testing of hypotheses Population statistics 1713: Ars Conjectandi by Jacob Bernoulli: Weak law of large numbers Principle of indifference De Moivre (1718): The Doctrine of Chances 6 Bayes and Laplace Paper by Thomas Bayes (1763): inversion
    [Show full text]
  • Basic Probability
    Mathematics Learning Centre Basic concepts in probability Sue Gordon c 2005 University of Sydney Mathematics Learning Centre, University of Sydney 1 1 Set Notation You may omit this section if you are familiar with these concepts. A set is a collection of objects. We often specify a set by listing its members, or elements, in parentheses like this {}. For example A = {2, 4, 6, 8} means that A is the set consisting of numbers 2,4,6,8. We could also write A ={even numbers less than 9}. The union of A and B is the set of elements which belong to A or to B (or both) and can be written as A ∪ B. The intersection of A and B is the set of elements which belong to both A and B, and can be written as A ∩ B. The complement of A, frequently denoted by A, is the set of all elements which do not belong to A. In making this definition we assume that all elements we are thinking about belong to some larger set U, which we call the universal set. The empty set, written ∅ or {}, means the set with no elements in it. A set C is a subset of A if all the elements in C are also in A. For example, let U = {all positive numbers ≤ 10} A = {2, 4, 6, 8} B = {1, 2, 3} C = {6, 8} Sets A, B and U may be represented in a Venn Diagram as follows: 5 AB 7 4 1 U 2 9 6 8 3 10 Mathematics Learning Centre, University of Sydney 2 A intersection B, A ∩ B, is shown in the Venn diagram by the overlap of the sets A and B, A ∩ B = {2}.
    [Show full text]
  • The Principle of Sufficient Reason and Probability
    THE PRINCIPLE OF SUFFICIENT REASON AND PROBABILITY ALEXANDER R. PRUSS Abstract. I shall argue that considerations about frequency-to-chance inferences make very plausible some a localized version of the Principle of Sufficient Reason (PSR). But a localized version isn't enough, and so we should accept a full PSR. 1. The Principle of Sufficient Reason I will take the Principle of Sufficient Reason (PSR) to be the claim that necessarily every contingent truth has an explanation (Pruss 2006). It has been often noted that many of our ordinary epistemic practices presuppose the existence of explanations. Rescher, for instance, talks of how the investigators of an airplane crash do not conclude that there is no explanation from their inability to find an explanation (Rescher 1995, p. 2). But it is one thing to agree that the PSR holds for propositions about everyday localized matters, and another to generalize to cosmic cases, such as the origination of the universe. (Of course it would be difficult to precisely define what counts as \localized".) Still, we have a rough grasp of localization: an LED lighting up is localized, while an infinite regress of events or the complete state of our large universe presumably are not. I will argue, however, that in order to make sense of our scientific epistemic practices, we need a principle like the PSR that applies to global matters. In doing so, I will draw on recent mathematical work on laws of large numbers for nonmeasurable processes (Pruss 2013). The argument begins with the local cases and generalizes to the global ones.
    [Show full text]