<<

Chapter 6: Random Variables and the

 6.1 Discrete Random Variables

 6.2 Binomial Distribution

 6.3 Continuous Random Variables and the Normal

6.1 Discrete Random Variables

Objectives: By the end of this section, I will be able to…

1) Identify random variables. 2) Explain what a discrete probability distribution is and construct probability distribution tables and graphs. 3) Calculate the , , and standard of a discrete random . Random Variables

 A variable whose values are determined by chance

 Chance in the definition of a is crucial Example 6.2 - Notation for random variables

Suppose our is to toss a single fair die, and we are interested in the rolled. We define our random variable X to be the of a single die roll. a. Why is the variable X a random variable? b. What are the possible values that the random variable X can take? c. What is the notation used for rolling a 5? d. Use random variable notation to express the probability of rolling a 5. Example 6.2 continued Solution a) We don’t know the value of X before we toss the die, which introduces an element of chance into the experiment b) Possible values for X: 1, 2, 3, 4, 5, and 6. c) When a 5 is rolled, then X equals the outcome 5, or X = 5. d) Probability of rolling a 5 for a fair die is 1/6, thus P(X = 5) = 1/6. Types of Random Variables

 Discrete random variable - either a finite number of values or countable number of values, where “countable” refers to the fact that there might be infinitely many values, but they result from a counting process

 Continuous random variable infinitely many values, and those values can be associated with on a continuous without gaps or interruptions Example

Identify each as a discrete or continuous random variable. (a) Total amount in ounces of soft drinks you consumed in the past year. (b) The number of cans of soft drinks that you consumed in the past year. Example

ANSWER: (a) continuous (b) discrete Example

Identify each as a discrete or continuous random variable. (a) The number of movies currently playing in U.S. theaters. (b) The running time of a randomly selected movie (c) The cost of making a randomly selected movie. Example

ANSWER (a) discrete (b) continuous (c) continuous Discrete Probability Distributions

 Provides all the possible values that the random variable can assume

 Together with the probability associated with each value

 Can take the form of a table, graph, or formula

 Describe populations, not samples Example

Table 6.2 in your textbook The probability distribution table of the number of heads observed when tossing a twice Probability Distribution of a Discrete Random Variable

 The sum of the of all the possible values of a discrete random variable must equal 1.

 That is, ΣP(X) = 1.

 The probability of each value of X must be between 0 and 1, inclusive.

 That is, 0 ≤ P(X ) ≤ 1. Example

Let the random variable x represent the number of girls in a family of four children. Construct a table describing the probability distribution.

Example

Determine the outcomes with a :

Example

 Total number of outcomes is 16  Total number of ways to have 0 girls is 1 P(0 girls) 1/16 0.0625  Total number of ways to have 1 girl is 4

P(1 girl) 4/16 0.2500  Total number of ways to have 2 girls is 6

P(2 girls) 6/16 0.375 Example

 Total number of ways to have 3 girls is 4 P(3 girls) 4/16 0.2500  Total number of ways to have 4 girls is 1 P(4 girls) 1/16 0.0625 Example

Distribution is:

x P(x)

0 0.0625 1 0.2500 2 0.3750 3 0.2500 4 0.0625 NOTE:

P(x) 1 Mean of a Discrete Random Variable

 The mean μ of a discrete random variable represents the mean result when the experiment is repeated an indefinitely large number of times

 Also called the or expectation of the random variable X.

 Denoted as E(X )

 Holds for discrete and continuous random variables

Finding the Mean of a Discrete Random Variable

 Multiply each possible value of X by its probability.

 Add the resulting products.

XPX Variability of a Discrete Random Variable Formulas for the Variance and of a Discrete Random Variable

Definition Formulas 2 XPX2

XPX2

Computational Formulas 2XPX 2 2

XPX22 Example

x P(x) 2 x P(x) x x2 P(x) 0 0.0625 0 0 0 1 0.2500 0.25 1 0.2500 2 0.3750 0.75 4 1.5000 3 0.2500 0.75 9 2.2500 4 0.0625 0.25 16 1.0000

xP(x) 2.0 Example

x P(x) 2 2 x P(x) x x P(x) 0 0.0625 0 0 0 1 0.2500 0.25 1 0.2500 2 0.3750 0.75 4 1.5000 3 0.2500 0.75 9 2.2500 4 0.0625 0.25 16 1.0000 2 x2 P(x) 2 5.0000 4.0000 1.0000

1.0000 1.0 Discrete Probability Distribution as a Graph  Graphs show all the contained in probability distribution tables

 Identify patterns more quickly

FIGURE 6.1 Graph of probability distribution for Kristin’s financial gain. Example

 Page 270 Example

 Probability distribution (table)

x P(x) 0 0.25 1 0.35 2 0.25 3 0.15

 Omit graph Example

 Page 270 Example

 ANSWER X number of goals scored

(a) Probability X is fewer than 3

P(X 0 X 1 X 2) P(X 0) P(X 1) P(X 2) 0.25 0.35 0.25 0.85 Example

 ANSWER (b) The most likely number of goals is the expected value (or mean) of X

x P(x) x P(x)

0 0.25 0 1 0.35 0.35 xP(x) 1.3 1 2 0.25 0.50 3 0.15 0.45

She will most likely one goal Example

 ANSWER (c) Probability X is at least one

P(X 1 X 2 X 3) P(X 1) P(X 2) P(X 3) 0.35 0.25 0.15 0.75 Summary

 Section 6.1 introduces the idea of random variables, a crucial concept that we will use to assess the behavior of variable processes for the remainder of the text.  Random variables are variables whose value is determined at least partly by chance.  Discrete random variables take values that are either finite or countable and may be put in a list.  Continuous random variables take an infinite number of possible values, represented by an on the number line. Summary

 Discrete random variables can be described using a probability distribution, which specifies the probability of observing each value of the random variable.

 Such a distribution can take the form of a table, graph or formula.

 Probability distributions describe populations, not samples.

 We can find the mean μ, standard deviation σ, and variance σ2 of a discrete random variable using formulas.

 6.2 Binomial Probability Distribution

6.2 Binomial Probability Distribution Objectives: By the end of this section, I will be able to…

1) Explain what constitutes a binomial experiment. 2) Compute probabilities using the binomial probability formula. 3) Find probabilities using the binomial tables. 4) Calculate and interpret the mean, variance, and standard deviation of the binomial random variable. Factorial symbol

 For any integer n ≥ 0, the factorial symbol n! is defined as follows:

 0! = 1

 1! = 1

 n! = n(n - 1)(n - 2) · · · 3 · 2 · 1 Example

Find each of the following 1. 4! 2. 7!

Example

ANSWER

1. 4! 4 3 2 1 24 2. 7! 7 6 5 4 3 2 1 5040 Factorial on Calculator

Calculator 7 MATH PRB 4:! which is 7! Enter gives the result 5040 Combinations

An arrangement of items in which  r items are chosen from n distinct items.  repetition of items is not allowed (each item is distinct).  the order of the items is not important.

Example of a Combination

The number of different possible 5 card poker hands. Verify this is a combination by checking each of the three properties. Identify r and n. Example

 Five cards will be drawn at random from a deck of cards is depicted below

Example

An arrangement of items in which  5 cards are chosen from 52 distinct items.  repetition of cards is not allowed (each card is distinct).  the order of the cards is not important.

Combination Formula

The number of combinations of r items chosen from n different items is denoted as nCr and given by the formula:

n! C nr r!! n r Example

Find the value of

7 C4 Example

ANSWER: 7! 7 C4 4!(7 4)! 7!

4!3! 7 6 5 4 3 2 1

(4 3 2 1) (3 2 1) 7 6 5

3 2 1 7 5 35 Combinations on Calculator

Calculator 7 MATH PRB 3:nCr 4

To get: 7 C4 Then Enter gives 35

Example of a Combination

Determine the number of different possible 5 card poker hands. Example

ANSWER:

52C5 2,598,960 Motivational Example

Genetics • In mice an allele A for agouti (gray-brown, grizzled fur) is dominant over the allele a, which determines a non-agouti color. Suppose each parent has the genotype Aa and 4 offspring are produced. What is the probability that exactly 3 of these have agouti fur?

Motivational Example

• A single offspring has genotypes:

A a

A AA Aa {AA , Aa,aA,aa}

a aA aa Motivational Example

• Agouti genotype is dominant • Event that offspring is agouti:

{AA, Aa ,aA} • Therefore, for any one birth:

P(agouti genotype) 3/ 4

P(not agouti genotype) 1/ 4 Motivational Example

• Let G represent the event of an agouti offspring and N represent the event of a non-agouti • Exactly three agouti offspring may occur in four different ways (in order of birth):

NGGG, GNGG, GGNG, GGGN

Motivational Example

• Consecutive events (birth of a mouse) are independent and using multiplication rule:

P(N G G G) P(N) P(G) P(G) P(G) 1 3 3 3 27

4 4 4 4 256

P(G N G G) P(G ) P(N) P(G) P(G) 3 1 3 3 27

4 4 4 4 256

Motivational Example

P(G G N G) P(G) P(G) P(N) P(G)

3 3 1 3 27

4 4 4 4 256

P(G G G N) P(G) P(G) P(G) P(N)

3 3 3 1 27

4 4 4 4 256 Motivational Example

• P(exactly 3 offspring has agouti fur)

P( first birth N OR second birth N OR third birth N OR fourth birth N) 1 3 3 3 3 1 3 3 3 3 1 3 3 3 3 1

4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4

3 3 1 108 4 0 .422 4 4 256

Binomial Experiment

Agouti fur example may be considered a binomial experiment

Binomial Experiment

Four Requirements:

1) Each trial of the experiment has only two possible outcomes (success or failure)

2) Fixed number of trials

3) Experimental outcomes are independent of each other

4) Probability of observing a success remains the same from trial to trial Binomial Experiment

Agouti fur example may be considered a binomial experiment

1) Each trial of the experiment has only two possible outcomes (success=agouti fur or failure=non-agouti fur)

2) Fixed number of trials (4 births)

3) Experimental outcomes are independent of each other

4) Probability of observing a success remains the same from trial to trial (¾) Binomial Probability Distribution

When a binomial experiment is performed, the of all of possible of successful outcomes of the experiment together with their associated probabilities makes a binomial probability distribution. Binomial Probability Distribution Formula

For a binomial experiment, the probability of observing exactly X successes in n trials where the probability of success for any one trial is p is

P(X ) C p X (1 p)n X n X where n! C n X X! n X !

Binomial Probability Distribution Formula

Let q=1-p

P(X ) C p X qn X n X

Rationale for the Binomial Probability Formula

n ! P(x) = • px • qn-x (n – x )!x!

The number of outcomes with exactly x successes among n trials Binomial Probability Formula

n ! P(x) = • px • qn-x (n – x )!x!

Number of The probability of x outcomes with exactly successes among n x successes among n trials for any one trials particular order Agouti Fur Genotype Example

X event of a birth with agouti fur

4! 3 3 1 1 P(X ) (4 3)!3! 4 4 3 3 1 1 4 4 4 27 1 4 64 4 108 0.422 256 Binomial Probability Distribution Formula: Calculator

2ND VARS A:binompdf(4, .75, 3)

n, p, x Enter gives the result 0.421875

Binomial Distribution Tables

 n is the number of trials  X is the number of successes  p is the probability of observing a success

See Example 6.16 on page 278 for more information

FIGURE 6.7 Excerpt from the binomial tables. Example

Page 284 Example

ANSWER X number of heads P(X 5) 5 20 5 20C5 (0.5) (0.5) 15,504 (0.5)5 (0.5)15 0.0148 Binomial Mean, Variance, and Standard Deviation

 Mean (or expected value): μ = n · p

2  Variance: np(1 p)

2 Use q 1 p, then npq

 Standard deviation:

np(1 p) npq Example 20 coin tosses The expected number of heads: np (20)(0.50) 10

Variance and standard deviation: 2 npq (20)(0.50)(0.50) 5.0

5 2.24 Example

Page 284 Is this a ?

Four Requirements:

1) Each trial of the experiment has only two possible outcomes (makes the basket or does not make the basket)

2) Fixed number of trials (50)

3) Experimental outcomes are independent of each other

4) Probability of observing a success remains the same from trial to trial (assumed to be 58.4%=0.584) Example

ANSWER (a) X number of baskets

P(X 25) 25 25 50 C25 (0.584) (0.416) 0.0549

Example

ANSWER (b) The expected value of X

np (50)(0.584) 29.2

The most likely number of baskets is 29

Example

ANSWER (c) In a random sample of 50 of O’Neal’s shots he is expected to make 29.2 of them.

Example Page 285

Is this a Binomial Distribution?

Four Requirements:

1) Each trial of the experiment has only two possible outcomes (contracted AIDS through injected drug use or did not)

2) Fixed number of trials (120)

3) Experimental outcomes are independent of each other

4) Probability of observing a success remains the same from trial to trial (assumed to be 11%=0.11) Example

ANSWER (a) X=number of white males who contracted AIDS through injected drug use

P(X 10)

10 110 120C10 (0.11) (0.89) 0.0816

Example

ANSWER (b) At most 3 men is the same as less than or equal to 3 men:

P(X 3) P(X 0) P(X 1) P(X 2) P(X 3)

Why do probabilities add?

Example Use TI-83+ calculator 2ND VARS to get A:binompdf(n, p, X)

P(X 0) P(X 1) P(X 2) P(X 3)

= binompdf(120, .11, 0) + binompdf(120, .11, 1)

+ binompdf(120, .11, 2) + binompdf(120, .11,3)

5.535172385 E 4 0.000554

Example

ANSWER (c) Most likely number of white males is the expected value of X np (120)(0.11) 13.2

Example

ANSWER (d) In a random sample of 120 white males with AIDS, it is expected that approximately 13 of them will have contracted AIDS by injected drug use

Example

Page 286

Example

ANSWER (a)

2 npq (120)(0.11)(0.89) 11.748

11.748 3.43

RECALL: and z Scores

values are not unusual if 2 z -score 2

Otherwise, they are moderately unusual or an (see page 131)

Z score Formulas

Sample Population x x x z z s Example

Z-score for 20 white males who contracted AIDS through injected drug use: 20 13.2 z 1.98 3.43

It would not be unusual to find 20 white males who contracted AIDS through injected drug use in a random sample of 120 white males with AIDS.

Summary

 The most important discrete distribution is the binomial distribution, where there are two possible outcomes, each with probability of success p, and n independent trials.

 The probability of observing a particular number of successes can be calculated using the binomial probability distribution formula. Summary

 Binomial probabilities can also be found using the binomial tables or using technology.

 There are formulas for finding the mean, variance, and standard deviation of a binomial random variable. 6.3 Continuous Random Variables and the Normal Probability Distribution

Objectives: By the end of this section, I will be able to…

1) Identify a continuous probability distribution.

2) Explain the properties of the normal probability distribution. FIGURE 6.15-

(a) Relatively small sample (n = 100) with large class widths (0.5 lb).

(b) Large sample (n = 200) with smaller class widths (0.2 lb). Figure 6.15 continued

(c) Very large sample (n = 400) with very small class widths (0.1 lb).

(d) Eventually, theoretical of entire population becomes smooth curve with class widths arbitrarily small. Continuous Probability Distributions

 A graph that indicates on the horizontal axis the of values that the continuous random variable X can take

 Density curve is drawn above the horizontal axis

 Must follow the Requirements for the Probability Distribution of a Continuous Random Variable Requirements for Probability Distribution of a Continuous Random Variable

1) The total area under the density curve must equal 1 (this is the for Continuous Random Variables).

2) The vertical height of the density curve can never be negative. That is, the density curve never goes below the horizontal axis. Probability for a Continuous Random Variable

 Probability for Continuous Distributions is represented by area under the curve above an interval. The Normal Probability Distribution

 Most important probability distribution in the world

 Population is said to be normally distributed, the data values follow a normal probability distribution

 population mean is μ

 population standard deviation is σ

 μ and σ are of the normal distribution

FIGURE 6.19

 The normal distribution is symmetric about its mean μ (bell-shaped). Properties of the Normal Density Curve (Normal Curve)

1) It is symmetric about the mean μ.

2) The highest point occurs at X = μ, because symmetry implies that the mean equals the , which equals the of the distribution.

3) It has inflection points at μ-σ and μ+σ.

4) The total area under the curve equals 1. Properties of the Normal Density Curve (Normal Curve) continued

5) Symmetry also implies that the area under the curve to the left of μ and the area under the curve to the right of μ are both equal to 0.5 (Figure 6.19).

6) The normal distribution is defined for values of X extending indefinitely in both the positive and negative directions. As X moves farther from the mean, the density curve approaches but never quite touches the horizontal axis. The Empirical Rule

For data sets having a distribution that is approximately bell shaped, the following properties apply:

 About 68% of all values fall within 1 standard deviation of the mean.  About 95% of all values fall within 2 standard deviations of the mean.  About 99.7% of all values fall within 3 standard deviations of the mean. FIGURE 6.23 The Empirical Rule. Drawing a Graph to Solve Normal Probability Problems

1. Draw a “generic” bell-shaped curve, with a horizontal number line under it that is labeled as the random variable X.

 Insert the mean μ in the center of the number line.

Steps in Drawing a Graph to Help You Solve Normal Probability Problems

2) Mark on the number line the value of X indicated in the problem. Shade in the desired area under the normal curve. This part will depend on what values of X the problem is asking about. 3) Proceed to find the desired area or probability using the empirical rule. Example

Page 295

Example

ANSWER

Example

Page 296

Example

ANSWER

Example

Page 296

Example

ANSWER

Summary

 Continuous random variables assume infinitely many possible values, with no between the values.

 Probability for continuous random variables consists of area above an interval on the number line and under the distribution curve. Summary

 The normal distribution is the most important continuous probability distribution.

 It is symmetric about its mean μ and has standard deviation σ.

 One should always sketch a picture of a normal probability problem to help solve it.