The Probability Set-Up.Pdf
Total Page:16
File Type:pdf, Size:1020Kb
CHAPTER 2 The probability set-up 2.1. Basic theory of probability We will have a sample space, denoted by S (sometimes Ω) that consists of all possible outcomes. For example, if we roll two dice, the sample space would be all possible pairs made up of the numbers one through six. An event is a subset of S. Another example is to toss a coin 2 times, and let S = fHH;HT;TH;TT g; or to let S be the possible orders in which 5 horses nish in a horse race; or S the possible prices of some stock at closing time today; or S = [0; 1); the age at which someone dies; or S the points in a circle, the possible places a dart can hit. We should also keep in mind that the same setting can be described using dierent sample set. For example, in two solutions in Example 1.30 we used two dierent sample sets. 2.1.1. Sets. We start by describing elementary operations on sets. By a set we mean a collection of distinct objects called elements of the set, and we consider a set as an object in its own right. Set operations Suppose S is a set. We say that A ⊂ S, that is, A is a subset of S if every element in A is contained in S; A [ B is the union of sets A ⊂ S and B ⊂ S and denotes the points of S that are in A or B or both; A \ B is the intersection of sets A ⊂ S and B ⊂ S and is the set of points that are in both A and B; ; denotes the empty set; Ac is the complement of A, that is, the points in S that are not in A. We extend this denition to have n is the union of sets , and similarly n . [i=1Ai A1; ··· ;An \i=1Ai An exercise is to show that De Morgan's laws n !c n n !c n [ \ c and \ [ c Ai = Ai Ai = Ai : i=1 i=1 i=1 i=1 © Copyright 2013 Richard F. Bass, 2020 Masha Gordina Typesetting date: March 31, 2020 21 22 2. THE PROBABILITY SET-UP We will also need the notion of pairwise disjoint sets 1 which means that fEigi=1 Ei \ Ej = ; unless i = j. There are no restrictions on the sample space S. The collection of events, F, is assumed to be a σ-eld, which means that it satises the following. Denition ( σ-eld) A collection F of sets in S is called a σ-eld if (i) Both ;;S are in F, (ii) if A is in F, then Ac is in F, (iii) if are in , then S1 and T1 are in . A1;A2;::: F i=1 Ai i=1 Ai F Typically we will take F to be all subsets of S, and so (i)-(iii) are automatically satised. The only times we won't have F be all subsets is for technical reasons or when we talk about conditional expectation. 2.1.2. Probability axioms. So now we have a sample space S, a σ-eld F, and we need to talk about what a probability is. Probability axioms (1) 0 6 P(E) 6 1 for all events E 2 F. (2) P(S) = 1. (3) If the 1 are pairwise disjoint, S1 P1 . fEigi=1 ;Ei 2 F P( i=1 Ei) = i=1 P(Ei) Note that probabilities are probabilities of subsets of S, not of points of S. However, it is common to write P(x) for P(fxg). Intuitively, the probability of E should be the number of times E occurs in n experiments, taking a limit as n tends to innity. This is hard to use. It is better to start with these axioms, and then to prove that the probability of E is the limit as we hoped. Below are some easy consequences of the probability axioms. Proposition 2.1 (Properties of probability) (1) P(;) = 0. (2) If are pairwise disjoint, then Sn Pn . E1;:::;En 2 F P( i=1 Ei) = i=1 P(Ei) (3) P(Ec) = 1 − P(E) for any E 2 F. (4) If E ⊂ F , then P(E) 6 P(F ), for any E; F 2 F. (5) for any E; F 2 F (2.1.1) P(E [ F ) = P(E) + P(F ) − P(E \ F ): The last property is sometimes called the inclusion-exclusion identity. Proof. To show (1), choose for each . These are clearly disjoint, so Ei = ; i P(;) = 1 P1 P1 . If were strictly positive, then the last term would P([i=1Ei) = i=1 P(Ei) = i=1 P(;) P(;) 2.1. BASIC THEORY OF PROBABILITY 23 be innity, contradicting the fact that probabilities are between 0 and 1. So the probability of ; must be zero. Part (2) follows if we let . Then 1 are still pairwise En+1 = En+2 = ··· = ; fEigi=1 ;Ei 2 F disjoint, and S1 Sn , and by (1) we have i=1 Ei = i=1 Ei 1 n X X P(Ei) = P(Ei): i=1 i=1 To prove (3), use S = E [ Ec. By (2), P(S) = P(E) + P(Ec). By axiom (2), P(S) = 1, so (1) follows. c c To prove (4), write F = E [ (F \ E ), so P(F ) = P(E) + P(F \ E ) > P(E) by (2) and axiom (1). Similarly, to prove (5), we have P(E[F ) = P(E)+P(Ec\F ) and P(F ) = P(E\F )+P(Ec\F ). Solving the second equation for P(Ec \ F ) and substituting in the rst gives the desired result. It is common for a probability space to consist of nitely many points, all with equally likely probabilities. For example, in tossing a fair coin, we have S = fH; T g, with P(H) = P(T ) = 1 . Similarly, in rolling a fair die, the probability space consists of , each point 2 f1; 2; 3; 4; 5; 6g having probability 1 . 6 Example 2.1. What is the probability that if we roll 2 dice, the sum is 7? Solution: There are 36 possibilities, of which 6 have a sum of 7: (1; 6); (2; 5); (3; 4), (4; 3), , . Since they are all equally likely, the probability is 6 1 . (5; 2) (6; 1) 36 = 6 Example 2.2. What is the probability that in a poker hand (5 cards out of 52) we get exactly 4 of a kind? Solution: we have four suits: clubs, diamonds, hearts and spades. Each suit includes an ace, a king, queen and jack, and ranks two through ten. For example, the probability of 4 aces and 1 king is 44 4 1 52 : 5 The probability of 4 jacks and one 3 is the same. There are 13 ways to pick the rank that we have 4 of, and then 12 ways to pick the rank we have one of, so the answer is 44 4 1 13 · 12 · 52 : 5 Example 2.3. What is the probability that in a poker hand we get exactly 3 of a kind (and the other two cards are of dierent ranks)? 24 2. THE PROBABILITY SET-UP Solution: for example, the probability of 3 aces, 1 king and 1 queen is 444 3 1 1 52 : 5 We have 13 choices for the rank we have 3 of, and 12 choices for the other two ranks, so 2 the answer is 444 12 3 1 1 13 2 52 : 5 Example 2.4. In a class of 30 people, what is the probability everyone has a dierent birthday? (We assume each day is equally likely.) Solution: we assume that it is not a leap year. Let the rst person have a birthday on some day. The probability that the second person has a dierent birthday will be 364 . The 365 probability that the third person has a dierent birthday from the rst two people is 363 . So 365 the answer is 364 363 336 · · ::: · : 365 365 365 2.2. FURTHER EXAMPLES AND APPLICATIONS 25 2.2. Further examples and applications 2.2.1. Sets revisited. A visual way to represent set operations is given by the Venn diagrams. A picture of Venn diagrams from http://www.onlinemathlearning.com/shading-venn-diagrams.html Example 2.5. Roll two dice. We can describe the sample set S as ordered pairs of numbers 1; 2; :::; 6, that is, S has 36 elements. Examples of events are E = the two dice come up equal and even = f(2; 2) ; (4; 4) ; (6; 6)g ; F = the sum of the two dice is 8 = f(2; 6) ; (3; 5) ; (4; 4) ; (5; 3) ; (6; 2)g ; E [ F = f(2; 2) ; (2; 6) ; (3; 5) ; (4; 4) ; (5; 3) ; (6; 2) ; (6; 6)g ; E \ F = f(4; 4)g ; F c = all 31 pairs that do not include f(2; 6) ; (3; 5) ; (4; 4) ; (5; 3) ; (6; 2)g : Example 2.6. Let S = [0; 1) be the space of all possible ages at which someone can die. Possible events are A = person dies before reaching 30 = [0; 30). Ac = [30; 1) = person dies after turning 30. A [ Ac = S, B = a person lives either less than 15 or more than 45 years = (15; 45]. 2.2.2. Axioms of probability revisited. Example 2.7 (Coin tosses). In this case S = fH; T g, where H stands for heads, and T stands for tails.