Cornell University, Physics Department Fall 2013 PHYS-3341 Statistical Physics Prof. Itai Cohen

Solutions to Problem Set 1 David C. Tsang, Woosong Choi, Philip Kidd, Igor Segota, Yariv Yanay

1.3 Birthday Problem Suppose there are N people in a room. What is the probability that at least two of them share the same birthday - the same day of the same month?

It’s easiest to begin by calculating the probability $p(N)$ that $N$ people in a room all have different birthdays. The probability of at least two people sharing a birthday is then $1 - p(N)$. Assuming that birthdays are evenly distributed throughout the year, the probability that, in some arbitrary order, the second person in the room does not have the same birthday as the first is $364/365$. The probability that a third person does not share a birthday with the first two is then $363/365$. Continuing in this vein, we find
$$p(N) = \frac{364}{365}\times\frac{363}{365}\times\cdots\times\frac{365-N+1}{365} = \frac{365!}{365^N\,(365-N)!}$$

1.4 Russian Roulette Reif §1.5: In the game of Russian roulette, one inserts a single cartridge into the drum of a revolver, leaving the other five chambers of the drum empty. One then spins the drum, aims at one’s head and pulls the trigger.

(a) What is the probability of still being alive after playing the game N times?
(b) What is the probability of surviving (N − 1) turns in this game and then being shot the Nth time one pulls the trigger?
(c) What is the mean number of times a player gets the opportunity of pulling the trigger in this macabre game?

(a) The probability of surviving one round of the game is 5/6. Assuming that the chamber is re-spun after each round, the probability of surviving N rounds is then (5/6)N .

(b) The probability of dying immediately after having survived some number of turns is $1/6$, so the probability of dying on the $N$th turn is $(1/6)(5/6)^{N-1}$.
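As a quick numerical sanity check (our own addition, not part of Reif's problem), the sketch below verifies that the survival and death probabilities of parts (a) and (b) account for every outcome:

```python
# Sanity check of parts (a) and (b): survival after N pulls, and the
# geometric distribution of the pull on which the player dies.
p_die = 1/6

def p_alive(N):
    # Part (a): probability of surviving N independent pulls.
    return (1 - p_die)**N

def p_die_on(n):
    # Part (b): survive n - 1 pulls, then die on the n-th.
    return p_die * (1 - p_die)**(n - 1)

# Dying on some pull n <= N and surviving all N pulls exhaust all outcomes.
N = 50
total = sum(p_die_on(n) for n in range(1, N + 1)) + p_alive(N)
print(round(total, 12))  # 1.0
```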

(c) To find the mean number of turns one can play before death, we compute the sum $\sum_n n\,p(n)$, with $p(n)$ the probability of the game lasting exactly $n$ turns:
$$\bar{n} = \sum_{n=1}^{\infty} n\,\frac{1}{6}\left(\frac{5}{6}\right)^{n-1}$$
Using the derivative trick, we note that $\sum_n n q^{n-1} = \frac{d}{dq}\sum_n q^n$. Also noting that the sum of a geometric series is given by $\sum_n q^n = \frac{1}{1-q}$, we find
$$\bar{n} = \sum_{n=1}^{\infty} n\,\frac{1}{6}\left(\frac{5}{6}\right)^{n-1} = \frac{1}{6}\left.\frac{d}{dq}\,\frac{1}{1-q}\right|_{q=5/6} = \frac{1}{6}\cdot\frac{1}{(1-5/6)^2} = 6$$

1.5 1-D Random Walk

Reif §1.6: Consider the random walk problem with $p = q$ and let $m = n_1 - n_2$ denote the net displacement to the right. After a total of $N$ steps, calculate the following mean values: $\overline{m}$, $\overline{m^2}$, $\overline{m^3}$, and $\overline{m^4}$.

From the text (§1.4.1) we have the probability of $n_1$ steps to the right and $n_2 = N - n_1$ steps to the left as
$$W(n_1) = \frac{N!}{n_1!\,(N-n_1)!}\,p^{n_1}q^{N-n_1}$$
First we note that, by the binomial theorem,
$$\sum_{n_1=0}^{N} W(n_1) = (p+q)^N = 1$$
Thus we have, for any function $f(n_1)$,
$$\overline{f(n_1)} = \frac{\sum_{n_1=0}^{N} W(n_1)\,f(n_1)}{\sum_{n_1=0}^{N} W(n_1)} = \sum_{n_1=0}^{N} f(n_1)\,W(n_1)$$

For the mean displacement $\overline{m}$ we have
$$\overline{m} = \overline{n}_1 - \overline{n}_2 = \sum_{n_1=0}^{N}(n_1-n_2)\,W(n_1) = \left(p\frac{\partial}{\partial p} - q\frac{\partial}{\partial q}\right)\sum_{n_1=0}^{N}W(n_1) = \left(p\frac{\partial}{\partial p} - q\frac{\partial}{\partial q}\right)(p+q)^N$$
Taking these derivatives, we see
$$\overline{m} = pN(p+q)^{N-1} - qN(p+q)^{N-1} = N(p+q)^{N-1}(p-q)$$
which, with $p = q = 1/2$, gives $\overline{m} = 0$. Similarly, for the mean square displacement $\overline{m^2}$ we have

$$\overline{m^2} = \sum_{n_1=0}^{N}(n_1-n_2)^2\,W(n_1) = \left(p\frac{\partial}{\partial p} - q\frac{\partial}{\partial q}\right)^2\sum_{n_1=0}^{N}W(n_1) = \left(p\frac{\partial}{\partial p} - q\frac{\partial}{\partial q}\right)\left[N(p+q)^{N-1}(p-q)\right]$$
$$= pN(p+q)^{N-1} + pN(N-1)(p+q)^{N-2}(p-q) - qN(N-1)(p+q)^{N-2}(p-q) + qN(p+q)^{N-1}.$$
Simplifying, we see the suggestive form
$$\overline{m^2} = N(p+q)^N + N(N-1)(p+q)^{N-2}(p-q)^2 = N\sum_{n_1}W(n_1) + N(N-1)(p+q)^{N-2}(p-q)^2.$$
Setting $p = q = 1/2$ gives $\overline{m^2} = N$.

For $\overline{m^3}$ we can similarly construct:
$$\overline{m^3} = \sum_{n_1=0}^{N}(n_1-n_2)^3\,W(n_1) = \left(p\frac{\partial}{\partial p} - q\frac{\partial}{\partial q}\right)\overline{m^2} = \left(p\frac{\partial}{\partial p} - q\frac{\partial}{\partial q}\right)\left[N\sum_{n_1}W(n_1) + N(N-1)(p+q)^{N-2}(p-q)^2\right]$$
$$= N^2(p+q)^{N-1}(p-q) + N(N-1)(N-2)(p+q)^{N-3}(p-q)^3 + 2N(N-1)(p+q)^{N-1}(p-q)$$
$$= [N + 2(N-1)]\,\overline{m} + N(N-1)(N-2)(p+q)^{N-3}(p-q)^3$$
where we've rewritten the expression in terms of $\overline{m}$ in order to simplify the computation of higher-order powers. Applying $p = q = 1/2$ again, we see $\overline{m^3} = 0$. Finally, for $\overline{m^4}$ we have

$$\overline{m^4} = \left(p\frac{\partial}{\partial p} - q\frac{\partial}{\partial q}\right)\overline{m^3} = \left(p\frac{\partial}{\partial p} - q\frac{\partial}{\partial q}\right)\left[(3N-2)\,\overline{m} + N(N-1)(N-2)(p+q)^{N-3}(p-q)^3\right]$$
$$= (3N-2)\,\overline{m^2} + 3N(N-1)(N-2)(p+q)^{N-2}(p-q)^2 + N(N-1)(N-2)(N-3)(p+q)^{N-4}(p-q)^4$$
Note that we can reduce $\overline{m^4}$ further in terms of the previous mean values by utilizing the relation $N(N-1)(p+q)^{N-2}(p-q)^2 = \overline{m^2} - N\sum W(n_1)$. This makes it easier to determine the means of higher-order powers of the net displacement:
$$\overline{m^4} = (6N-8)\,\overline{m^2} - (3N-6)N\sum W(n_1) + N(N-1)(N-2)(N-3)(p+q)^{N-4}(p-q)^4$$
Evaluating with $p = q = 1/2$ we get

$$\overline{m^4} = (3N-2)N$$ ✷
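As a cross-check on these closed forms, a short Monte Carlo sketch (our own addition, assuming $p = q = 1/2$) estimates the second and fourth moments of the net displacement:

```python
import random

# Monte Carlo estimate of the moments of m for p = q = 1/2:
# the mean of m^2 should approach N, and the mean of m^4 should
# approach (3N - 2)N.
random.seed(0)
N, trials = 20, 200_000

m2_sum = m4_sum = 0.0
for _ in range(trials):
    m = sum(random.choice((1, -1)) for _ in range(N))
    m2_sum += m**2
    m4_sum += m**4

print(m2_sum / trials)  # close to N = 20
print(m4_sum / trials)  # close to (3*20 - 2)*20 = 1160
```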

1.6 Alternative Analysis of the 1-D Random Walk In lecture and in the text, we evaluated the probability distribution $P_N(n_+)$ for taking $n_+$ of $N$ total steps in the $+x$ direction and, by substituting $m = 2n_+ - N$, the distribution of the net number of steps $m$ in the $+x$ direction. Using this distribution, we calculated the mean number of steps and the standard deviation. Another approach is to consider the probability distribution of the individual steps, as follows. Assume that all steps have the same length $l$, and that the probabilities of taking steps in the $+x$ and $-x$ directions are $p$ and $q$, respectively.

(a) Sketch the probability distribution of a single step $s_i$ versus $x$. Does this correspond to any of the standard probability distributions we have considered so far?

(b) What are the mean and standard deviation of the distribution of si?

(c) The total displacement $x_N = ml$ after $N$ steps can be expressed as a sum of $N$ statistically independent random variables $s_i$. Evaluate the mean number of steps taken in the $+x$ direction. (Hint: What is the mean of a sum of independent random variables?)
(d) Evaluate the standard deviation of $m$. (Hint: What is the mean of a product of statistically independent random variables?)
(e) Similarly, evaluate the expectation values of $m^3$ and $m^4$. Compare your answers with the previous question.
(f) Arguing based upon the Central Limit Theorem, what would you expect the probability distribution of $m$ to look like in the limit of large $N$, i.e., when you add up a very large number of statistically independent random variables, each with the distribution sketched in (a)? What should be the mean and standard deviation of this distribution?

(a) $s_i(x)$ is zero for all $x$ except $x = l$ and $x = -l$: it is a probability density that is non-zero only at two points. Therefore $s_i(x)$ is a sum of two delta functions,
$$s_i(x) = p\,\delta(x-l) + q\,\delta(x+l).$$
Note that with $p + q = 1$, we have $\int s_i(x)\,dx = 1$.

(b) The mean is
$$\bar{s}_i = \int_{-\infty}^{\infty} x\,s_i(x)\,dx = lp + (-l)q = l(p-q)$$
while the mean of the squares is

$$\overline{s_i^2} = \int_{-\infty}^{\infty} x^2\,s_i(x)\,dx = l^2 p + (-l)^2 q = l^2.$$
Therefore the average is $l(p-q)$ and the standard deviation is
$$\sigma = \sqrt{\overline{s_i^2} - \bar{s}_i^2} = l\sqrt{1-(p-q)^2} = 2l\sqrt{pq}.$$

(c) The total displacement is a sum of individual displacements, $x_N = \sum_{i=1}^{N} s_i$. Therefore

$$\overline{m} = \frac{1}{l}\,\overline{x}_N = \frac{1}{l}\sum_{i=1}^{N}\bar{s}_i = N(p-q).$$

(d) To calculate the standard deviation, we first find $\overline{x_N^2}$:

$$\overline{x_N^2} = \overline{\left(\sum_{i=1}^{N} s_i\right)\left(\sum_{j=1}^{N} s_j\right)} = \sum_{i,j=1}^{N}\overline{s_i s_j} = \sum_{i\neq j}\overline{s_i s_j} + \sum_{i=1}^{N}\overline{s_i^2}.$$
Now, since individual steps are statistically independent, $\overline{s_i s_j} = \bar{s}_i\,\bar{s}_j$ for $i \neq j$, and we get

$$\overline{x_N^2} = \sum_{i\neq j} l^2(p-q)^2 + \sum_{i=1}^{N} l^2 = N(N-1)\,l^2(p-q)^2 + Nl^2$$
and so $\overline{m^2} = N(N-1)(p-q)^2 + N$, and the standard deviation is
$$\sigma_m = \sqrt{\overline{m^2} - \overline{m}^2} = \sqrt{N - N(p-q)^2} = 2\sqrt{Npq}.$$

(e) First we must calculate

$$\overline{s_i^3} = \int_{-\infty}^{\infty} x^3\,s_i(x)\,dx = l^3 p + (-l)^3 q = l^3(p-q),$$
$$\overline{s_i^4} = \int_{-\infty}^{\infty} x^4\,s_i(x)\,dx = l^4 p + (-l)^4 q = l^4.$$
Then, as before, we must separate $\overline{s_i s_j s_k}$ or $\overline{s_i s_j s_k s_p}$ into cases where there are one, two, three, or four distinct indices.

$$\overline{x_N^3} = \overline{\left(\sum_{i=1}^{N} s_i\right)^3} = \sum_{i,j,k=1}^{N}\overline{s_i s_j s_k} = \sum_{i\neq j\neq k}\bar{s}_i\bar{s}_j\bar{s}_k + 3\sum_{i\neq j}\overline{s_i^2}\,\bar{s}_j + \sum_{i}\overline{s_i^3}$$
$$\overline{x_N^4} = \sum_{i\neq j\neq k\neq p}\bar{s}_i\bar{s}_j\bar{s}_k\bar{s}_p + 6\sum_{i\neq j\neq k}\overline{s_i^2}\,\bar{s}_j\bar{s}_k + 3\sum_{i\neq j}\overline{s_i^2}\,\overline{s_j^2} + 4\sum_{i\neq j}\overline{s_i^3}\,\bar{s}_j + \sum_{i}\overline{s_i^4}$$
leading to
$$\overline{m^3} = N(N-1)(N-2)(p-q)^3 + 3N(N-1)(p-q) + N(p-q)$$
$$\overline{m^4} = N(N-1)(N-2)(N-3)(p-q)^4 + 6N(N-1)(N-2)(p-q)^2 + 3N(N-1) + 4N(N-1)(p-q)^2 + N.$$
These are indeed the same results as in the previous question.
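These closed forms can also be verified by direct summation over the binomial distribution $W(n_1)$; the sketch below (our own check, with an arbitrarily chosen $p = 0.7$) compares exact moments of $m = 2n_1 - N$ against the formulas above:

```python
from math import comb

# Exact moments of m = 2*n1 - N under the binomial W(n1), compared with
# the closed forms above (p = 0.7 is an arbitrary illustrative choice).
N, p = 12, 0.7
q = 1 - p
d = p - q

def exact_moment(k):
    return sum((2*n - N)**k * comb(N, n) * p**n * q**(N - n)
               for n in range(N + 1))

m1 = N*d
m2 = N*(N-1)*d**2 + N
m3 = N*(N-1)*(N-2)*d**3 + 3*N*(N-1)*d + N*d
m4 = (N*(N-1)*(N-2)*(N-3)*d**4 + 6*N*(N-1)*(N-2)*d**2
      + 3*N*(N-1) + 4*N*(N-1)*d**2 + N)

for k, predicted in zip((1, 2, 3, 4), (m1, m2, m3, m4)):
    assert abs(exact_moment(k) - predicted) < 1e-8
print("all four moments agree")
```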

(f) $x_N = ml$ is the sum of $N$ statistically independent variables with well-defined means and standard deviations. Then, according to the Central Limit Theorem, the probability distribution of $m$ approaches, for large $N$, a Gaussian with mean $N(p-q)$ and standard deviation $2\sqrt{Npq}$:
$$P(m) = \frac{1}{\sqrt{2\pi N\,4pq}}\,e^{-\frac{(m-N(p-q))^2}{2N\,4pq}}.$$
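As a concrete illustration of this limit (our own addition, taking $p = q = 1/2$ and $N = 100$), one can compare the exact binomial probabilities of $m$ with the Gaussian; since $m$ changes in steps of 2, each allowed value carries a weight of 2 times the continuous density:

```python
from math import comb, exp, sqrt, pi

# Exact distribution of m = 2*n1 - N versus the CLT Gaussian for
# p = q = 1/2 (mean 0, variance 4Npq = N). Since m steps by 2, each
# allowed value carries a weight of 2 times the continuous density.
N, p = 100, 0.5
q = 1 - p

def P_exact(m):
    if (m + N) % 2:
        return 0.0  # m must have the same parity as N
    n1 = (m + N) // 2
    return comb(N, n1) * p**n1 * q**(N - n1)

def P_gauss(m):
    var = 4 * N * p * q
    return 2 / sqrt(2 * pi * var) * exp(-(m - N*(p - q))**2 / (2 * var))

for m in (0, 4, 10, 20):
    assert abs(P_exact(m) - P_gauss(m)) < 0.005
print("Gaussian approximation matches the binomial")
```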

1.7 Telephone Problem Reif §1.15: A set of telephone lines is to be installed so as to connect town A to town B. Town A has 2000 telephones. If each of the telephone users of A were to be guaranteed instant access to make calls to B, 2000 telephone lines would be needed. This would be rather extravagant. Suppose that during the busiest hour of the day each subscriber in A requires, on the average, a telephone connection to B for two minutes, and that these telephone calls are made at random. Find the minimum number M of telephone lines to B which must be installed so that at most only 1 percent of the callers of town A will fail to have immediate access to a telephone line to B. (Suggestion: approximate the distribution by a Gaussian distribution to facilitate the arithmetic.)

At any instant, the probability that a given subscriber in A is on a call to B is
$$p = \frac{2}{60} = \frac{1}{30}$$
and, obviously, the subscriber is not on a call to B with probability $q = 1 - p$. Assuming all the subscribers have independent probabilities of being on the phone, the probability of exactly $n$ subscribers being on the line is given by:

$$W(n) = \frac{N!}{n!\,(N-n)!}\,p^n q^{N-n}$$
where the factor in front counts the ways of choosing which subscribers are on the line, and this is in fact a binomial distribution! Now, suppose we had $m$ phone lines in service. Then, when more than $m$ subscribers try to make a call at the same time, they will fail to connect. Thus, summing over the possibility of more than $m$ subscribers simultaneously making calls, we get the probability of calls being dropped:

$$P(m) = \sum_{n=m+1}^{N} W(n)$$

This summation is cumbersome to evaluate for large $N$, so we use the Gaussian approximation, which should work well for large $N$, as in (§1.5.19):
$$P(m) \approx \int_{m+1}^{N} \frac{dn}{\sqrt{2\pi N p(1-p)}}\;e^{-\frac{(n-Np)^2}{2Np(1-p)}}$$
We want to find a number $m$ for which our probability $P$ of dropping calls does not exceed 1 percent. Calculating numerically for some values of $m$, we get

P(67) = 0.4340426842
P(70) = 0.2946689944
P(75) = 0.1224884633
P(80) = 0.0370919232
P(83) = 0.0154180065
P(84) = 0.0111930934
P(85) = 0.0080130872

Thus, the minimum m value leading to call drop probability less than 1 percent is

m = 85 ✷
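The table above is easy to reproduce: the Gaussian tail integral is $\frac{1}{2}\,\mathrm{erfc}$ of the standardized threshold. A minimal sketch (our own check of the arithmetic):

```python
from math import erfc, sqrt

# Gaussian-tail approximation to P(m) = sum_{n > m} W(n) for N = 2000
# subscribers, each busy with probability p = 1/30 (the upper limit N
# is effectively infinity for this tail).
N, p = 2000, 1/30
mu = N * p
sigma = sqrt(N * p * (1 - p))

def P(m):
    # Upper Gaussian tail beyond n = m + 1, via the complementary error function.
    return 0.5 * erfc((m + 1 - mu) / (sigma * sqrt(2)))

# Smallest m with drop probability below 1 percent.
m = next(m for m in range(N) if P(m) < 0.01)
print(m)  # 85
```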

1.8 3-D Isotropic Scattering Reif §1.18: A molecule of gas moves equal distances $l$ between collisions, with equal probability in any direction. After a total of $N$ displacements, what is the mean square displacement $\overline{R^2}$ of the molecule from its starting point?

The total displacement $\vec{R}$ can be expressed as:

$$\vec{R} = \vec{r}_1 + \vec{r}_2 + \cdots + \vec{r}_N$$

The squared distance $R^2 = \vec{R}\cdot\vec{R}$ thus has mean
$$\overline{R^2} = \overline{r_1^2} + \overline{r_2^2} + \cdots + \overline{r_N^2} + 2\,\overline{\vec{r}_1\cdot\vec{r}_2} + \cdots + 2\,\overline{\vec{r}_1\cdot\vec{r}_N} + \cdots + 2\,\overline{\vec{r}_{N-1}\cdot\vec{r}_N}$$
The $\overline{r_n^2}$ terms each have mean $l^2$, whereas the cross terms are given by

$$\vec{r}_m\cdot\vec{r}_n = l^2\cos\theta_{mn}$$
where $\theta_{mn}$ is the angle between the direction of the $m$th scattering and the direction of the $n$th scattering. Since the scattering directions are isotropic, positive and negative values of $\cos\theta_{mn}$ are equally likely, and we see

$$\overline{\vec{r}_m\cdot\vec{r}_n} = l^2\,\overline{\cos\theta_{mn}} = 0.$$


Figure 1: For a given $\vec{r}_m$ we see that for each possible vector $\vec{r}_n$ there is another, equally probable $\vec{r}_n{}'$ for which $\vec{r}_m\cdot\vec{r}_n = -\vec{r}_m\cdot\vec{r}_n{}'$; thus, when we average over the isotropic angular distribution, we must have $\overline{\vec{r}_m\cdot\vec{r}_n} = 0$.

This gives us
$$\overline{R^2} = N l^2$$
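A quick Monte Carlo sketch (our own addition) with isotropically distributed steps confirms this scaling:

```python
import random
from math import sqrt, pi, cos, sin

# Monte Carlo check that N isotropic steps of length l give <R^2> = N l^2.
random.seed(1)

def random_step(l=1.0):
    # Isotropic direction: cos(theta) uniform on [-1, 1], phi uniform on [0, 2*pi).
    ct = random.uniform(-1, 1)
    st = sqrt(1 - ct*ct)
    phi = random.uniform(0, 2*pi)
    return (l*st*cos(phi), l*st*sin(phi), l*ct)

def r2_after(N):
    x = y = z = 0.0
    for _ in range(N):
        dx, dy, dz = random_step()
        x += dx; y += dy; z += dz
    return x*x + y*y + z*z

N, trials = 50, 20_000
mean_r2 = sum(r2_after(N) for _ in range(trials)) / trials
print(mean_r2)  # close to N = 50 (with l = 1)
```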

The root-mean-square displacement is therefore $\sqrt{\overline{R^2}} = \sqrt{N}\,l$. ✷

1.9 Uniform Distributions on Circles and Spheres Reif §1.24:

(a) A particle is equally likely to lie anywhere on the circumference of a circle. Consider as the z-axis any straight line in the plane of the circle and passing through its center. Denote by θ the angle between this z-axis and the straight line connecting the center of the circle to the particle. What is the probability that this angle lies between θ and θ + dθ?
(b) A particle is equally likely to lie anywhere on the surface of a sphere. Consider any line through the center of this sphere as the z-axis. Denote by θ the angle between this z-axis and the straight line connecting the center of the sphere to the particle. What is the probability that this angle lies between θ and θ + dθ?


Figure 2: The probabilities for the particle to be located between θ and θ + dθ on a circle and on a sphere are proportional to the arc length and the area subtended, respectively.

(a) Since we have a uniform distribution of probability on the circumference of the circle, the infinitesimal probability is proportional to the infinitesimal arc length:
$$dP \propto dS = R\,d\theta$$
where $R$ is the radius of the circle. Remembering that the probability that the particle is somewhere on the circle must be 1, we see that for some constant $C$
$$\int dP = C\int R\,d\theta = 2\pi R\,C = 1$$
giving us $C = 1/2\pi R$.

Thus the probability that the particle will lie between θ and θ + dθ is

$$p(\theta)\,d\theta = \frac{R\,d\theta}{2\pi R} = \frac{d\theta}{2\pi}$$

(b) Similarly for the sphere, we see that the probability is proportional to the area

$$dP \propto dA$$
The area covered by particles between θ and θ + dθ is given by the area of the ribbon shown, which has width $R\,d\theta$ and circumference $2\pi R\sin\theta$. The total surface area of a sphere is of course $4\pi R^2$. Thus we have
$$p(\theta)\,d\theta = \frac{dA}{4\pi R^2} = \frac{2\pi R\sin\theta\;R\,d\theta}{4\pi R^2} = \frac{2\pi R^2\sin\theta\,d\theta}{4\pi R^2}$$

giving
$$p(\theta)\,d\theta = \frac{1}{2}\sin\theta\,d\theta.$$
Integrating from θ = 0 to π gives a total probability $\int_0^\pi p(\theta)\,d\theta = 1$.

✷
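This density can be checked by sampling genuinely uniform points on the sphere (a normalized 3-D Gaussian vector is uniform on the sphere, a standard trick; this check is our own addition) and comparing the empirical distribution of θ with the cumulative distribution $(1-\cos\theta_0)/2$:

```python
import random
from math import acos, cos, pi, sqrt

# Sample points uniformly on a sphere (a normalized 3-D Gaussian vector
# is uniform on the sphere) and check that the polar angle theta has
# cumulative distribution (1 - cos(theta0)) / 2, i.e. density (1/2) sin(theta).
random.seed(2)

def uniform_sphere_theta():
    x, y, z = (random.gauss(0, 1) for _ in range(3))
    r = sqrt(x*x + y*y + z*z)
    return acos(max(-1.0, min(1.0, z / r)))

thetas = [uniform_sphere_theta() for _ in range(100_000)]
for theta0 in (pi/6, pi/3, pi/2, 2*pi/3):
    empirical = sum(t < theta0 for t in thetas) / len(thetas)
    assert abs(empirical - (1 - cos(theta0)) / 2) < 0.01
print("theta follows (1/2) sin(theta)")
```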

1.10 Waiting Times (a) The probability density for observing a car is $p(t)\,dt = dt/\tau$, where $\tau = 5$ min. Hence the average number of cars in the time interval $dt$ is $dt/\tau$. Then, in one hour, the average number is
$$\int_0^{60\ \mathrm{min}} \frac{dt}{\tau} = 12\ \text{cars}.$$

(b) In a randomly chosen ten-minute interval, exactly 2 buses will be seen: $P_{\rm bus}(n) = \delta_{n,2}$. In order to calculate the number of cars arriving in an interval $T = 10$ min, we divide this interval into $N$ shorter intervals of length $dt = T/N$. The probability of observing a car in one of the short intervals is $p = dt/\tau$, and if $dt$ is infinitesimally small, we can ignore the probability of two cars arriving in the same interval. So we have a binomial probability distribution ($N$ trials, probability $p$ for success and $1-p$ for failure); if we take $N\to\infty$, $p\to 0$, as above, we get a Poisson distribution with parameter $\lambda = Np = T/\tau = 2$:
$$P_{\rm car}(10\ {\rm min}, n) = \frac{2^n}{n!}\,e^{-2}.$$
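The binomial-to-Poisson limit invoked here is easy to check numerically; in the sketch below (our own addition), the binomial probabilities with $\lambda = Np$ held fixed at 2 approach the Poisson values:

```python
from math import comb, exp, factorial

# Binomial(N, p = lam/N) approaches Poisson(lam) as N grows, with lam = 2.
lam = 2.0

def binom_pmf(N, n):
    p = lam / N
    return comb(N, n) * p**n * (1 - p)**(N - n)

def poisson_pmf(n):
    return lam**n / factorial(n) * exp(-lam)

N = 100_000
for n in range(6):
    assert abs(binom_pmf(N, n) - poisson_pmf(n)) < 1e-4
print("binomial converges to Poisson")
```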

(c) The bus probability is a delta function, thus $\bar{n}_{\rm bus} = 2$ and ${\rm var}(n_{\rm bus}) = 0$. In the case of the car, rather than calculate the mean and variance of the Poisson distribution, we can just take the limit $N\to\infty$, $p\to 0$, $Np\to T/\tau = 2$ in the binomial distribution. Thus
$$\bar{n}_{\rm car} = Np = 2,\qquad {\rm var}(n_{\rm car}) = Np(1-p)\to 2.$$

(d) Clearly, $p_{\rm bus}(\Delta t) = \delta(\Delta t - 5)$. Thus $\overline{\Delta t} = 5$ min and ${\rm var}(\Delta t) = 0$. For the car, we need to calculate the probability density for a time interval $\Delta t - dt$ with no cars at all, followed by a time interval $dt$ with just one car. Since these two are uncorrelated, the joint probability is just the product of the two:

$$p_{\rm car}(\Delta t)\,dt = P_{\rm car}(\Delta t, n=0)\,P_{\rm car}(dt, n=1) = \frac{(\Delta t/\tau)^0}{0!}\,e^{-\Delta t/\tau}\cdot\frac{(dt/\tau)^1}{1!}\,e^{-dt/\tau} = \frac{dt}{\tau}\,e^{-\Delta t/\tau}$$

This is the exponential probability distribution.

$$\overline{\Delta t} = \frac{1}{5}\int_0^\infty t\,e^{-t/5}\,dt = 5\ {\rm min},\qquad {\rm var}(\Delta t) = \frac{1}{5}\int_0^\infty (t-5)^2\,e^{-t/5}\,dt = 25\ {\rm min}^2.$$
(e) In the case of a bus, since the observer came at a random time, there is a uniform probability density
$$p_{\rm bus}(\Delta t) = \begin{cases} 1/5 & \Delta t < 5 \\ 0 & {\rm otherwise} \end{cases}$$
$$\overline{\Delta t} = \frac{1}{5}\int_0^5 t\,dt = 2.5\ {\rm min},\qquad {\rm var}(\Delta t) = \frac{1}{5}\int_0^5 (t-2.5)^2\,dt \approx 2.08\ {\rm min}^2.$$
In the case of the car, the probability density is the same exponential density that we calculated in (d). The probability of observing a car at any given moment does not depend on what happened before, and therefore $p_{\rm car}(\Delta t)$ does not depend on the starting time of the measurement. Hence $\overline{\Delta t} = 5$ min and ${\rm var}(\Delta t) = 25\ {\rm min}^2$.

(f) In a dirty metal or dilute gas, the time between collisions is random. The probability to collide at a certain time interval does not depend on past collisions, and therefore the scenario of exponentially distributed collision times (like the cars) is more relevant.

(g) The probability density of collision times is exponential

$$p(\Delta t)\,dt = \frac{dt}{\tau}\,e^{-\Delta t/\tau}$$

The mean time between collisions is $\overline{\Delta t} = \tau$.

(h) As we have seen, this process has no memory and therefore

$$p(\Delta t_{\rm prev})\,dt = \frac{dt}{\tau}\,e^{-\Delta t_{\rm prev}/\tau},\qquad p(\Delta t_{\rm next})\,dt = \frac{dt}{\tau}\,e^{-\Delta t_{\rm next}/\tau},$$

and $\overline{\Delta t_{\rm next}} = \overline{\Delta t_{\rm prev}} = \tau$.

(i) From (g) we get that the average time between collisions is $\tau$. In (h), we get that for a randomly chosen time $t$, the mean time between the two consecutive collisions surrounding $t$ is $\overline{\Delta t_{\rm next}} + \overline{\Delta t_{\rm prev}} = 2\tau$. This appears to be a paradox. However, the results are consistent! If an observer arrives at a random time $t$, she is more likely to fall in one of the long time intervals, and therefore the interval length measured by such an observer does not reflect the actual distribution of collision times.
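This inspection paradox can be demonstrated directly (our own addition, with $\tau = 5$ as in the car example): simulate a Poisson process, then compare the average gap between events with the average length of the gap containing a uniformly random observation time:

```python
import bisect
import random

# Inspection paradox: for a Poisson process with mean gap tau, the average
# gap is tau, but the gap containing a random observation time averages 2*tau.
random.seed(3)
tau, horizon = 5.0, 1_000_000.0

# Generate arrival times of the process.
arrivals, t = [], 0.0
while t < horizon:
    t += random.expovariate(1.0 / tau)
    arrivals.append(t)

gaps = [b - a for a, b in zip(arrivals, arrivals[1:])]
mean_gap = sum(gaps) / len(gaps)

def observed_gap():
    # Length of the gap containing a uniformly random observation time.
    u = random.uniform(arrivals[0], arrivals[-1])
    i = min(bisect.bisect(arrivals, u), len(arrivals) - 1)
    return arrivals[i] - arrivals[i - 1]

mean_observed = sum(observed_gap() for _ in range(100_000)) / 100_000
print(mean_gap)       # close to tau = 5
print(mean_observed)  # close to 2*tau = 10
```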
