<<

5.12 The Bivariate 313 512 The Bivariate Normal Distribution

The first multivariate continuous distribution for which we have a name is a generalization of the normal distribution to two coordinates. There is more structure to the bivanate normal distribution than just a pair of normal marginal distributions.

Definition of the Bivarlate Normal Distribution

Suppose that Z and Z are independent random variables, each of which has a standard normal distribution. Then the joint p.d.f. 2) of 1Z and Z is specified for all values of and z by the equation

g(z,z,) = — exp[——(z + z)]. (5.12.1) 1 2 L 2 For constants 1t. /12, a. a,, and p such that —00 0 (i = 1, 2). and —1< p < 1, we shall now define two new random variables 1X and X, as follows: 1X 1=aZ 1-i—i X, = a [pZi + (1 — p 2z + it,. (5.12.2) We shall derive the joint p.d.f. 1f(x X2) of 1X and X,. The transformation from 1Z . and 1, to 1X and 2X is a linear transformation; and it will be found that the determinant of the matrix of coefficients of 1Z and 2Z has the value z\ = (1 Therefore, as discussed — )p2 a1212 in Section 3.9, the Jacobian J of the inverse transformation. from 1X and K, to 1Z and 2Z is J = —i = -aa2(l—p-) . (5.12.3) Since J > 0. the value of IJI is equal to1the value of J itself. If the relations (5.12.2) are solved for Zi and 2Z in terms of X and .2X then the joint p.d.f. 1f(x x-,) can be obtained by replacing and z in Eq. (5.12.1) by their expressions in terms. 1ofx and 52. and then multiplying by JI. It can be shown that the result is, for — 1

When thejoint p.d.f. of two random variables 1X and 2X is of the form in Eq. (5.12.4). it is said that 1X and X-,have a biiari ate normaldistribution. The and the of the bivariate normal distribution specified by Eq. (5.12.4) are easily derived from the definitions in Eq. (5.12.2). Because 1Z and 2Z are independent and each has 0 and

314

Example

5.12.1

and

marginal Independence

it

marginal

distributions

and

normal

X 2 in

Marginaiflistributions.

X-,

Marginal

correlation

25%.

correspond

from

the

to

can

cerned

eral

Anthropometry

normal

of

bivariate

the

ple, joint

normal

ing

Also,

the

disthbution

the

Furthermore,

Chapter

the

as

have

distinguish

the

Z.

first

he

It

heights

physical

standard

in study

for

correlation

the

50%.

following

distribution

distributions.

has

individuals

seen

it

with

Eq.

joint

p.d.f

a

distribution

distribution.

distribution

many

species

bivariate

follows

normal

5

been

of

of

1.

to

from

(5.12.2).

and

of

and

and

whether

in

Special

of

the

it

the

features

and

for

a

normal

the

both

populations

the

it follows

fitted

75%

result

X

convenient

the

of

Eq.

properties

Chaetocnema

can

fitted

which

from

of

distribution.

Conditional

different

in

correlation,

normal

of

first

X 1

and

of

weights

X 1

In

Flea

(5.12.4)

the

E(Xi)=j.L:

where arises

Distributions

some

be

of

bivariate

certain

distributions.

and

of

has

X

particular.

distribution

this

tarsus

that

the

and

shown

the

population

the

a

Beetles.

We

distribution,

is

X:

been

combination

directly

variety

of

representation species.

Z 1

marginal

the

p.d.f.

E(X 1 )

for

a X

of

that

linear

is

versus

shall

this

normal

For

and

normal

by

the joint

simply

heikertingeri.

established:

us

are

If

thejointp.d.f.

is

using

since Distributions

distribution,

of other

continue

individuals

Z

X 1 and

Lubischew

to =

is

and

on combinations

It

specified

Figure

also

the

distribution

p.d.f.

p(X 1 ,

0.64.

species

i 1 .

distribution

should

introduce

are

and

distribution.

two

and p.

naturally

Eq.

both

of

second

of

populations,

normal

In

E(X 2 )

independent

Var(X 1 )=a

X-,

and

related

easily

the

of

their

5.8

(5.12.2)

X 2 )

summary.

to

X 1

be

of

are

by

X 2

The

assume

in

fitted

(1962)

it

from

joint shows

j(x.

joint

=

and

emphasized,

flea

the

will

Eq. =

distributions.

of

uncorrelated,

the in

obtained

of

Hence,

with

tests

p.

p,

The

many

that

two

bivariate

Corollary

bivariate in

independent

beetle.

X,

(5.12.4),

population

be

x’)

p.d.f.

the

if

reports

a

that

random

the

mean

will

Var(X 1 )

also

convenient

Cov(X

X 1

scatterplot

are

physical

factors

joint

X

practical

fori=l,2.

the

first

is

and

be

linear

The

includes

and

/1,

specified

however,

the

normal

random

then

normal

approximately

then

distribution

5.6.1

Thus.

variables

tarsus

X 2 .

were

=

into

will

and

investigation

X)

X

characteristics

measurements

random

combinations

have

.

to problems.

of

p

the

variance

that

be

are

three

=

chosen

=0. for

for

represent

distribution

distribution. measurements

variables

and

by

that

a

Plo’2.

approximately

product0

indePefldd1

with

a

bivariate

the

i

could

Eq.

In

variables

Var(Xj

=

ellipses

of

the

this

to

mar

a a

standard

(5.12.41

1,2,

For

the

was

Therefo,

bivariate

X 1

X 1

contain

bivariate

be

case

of

of

such

scores

exam

of

arid nornj

the

and as

used

The

con

that

I

sev

hav

31

4

of

the

as a I IC ClVcflIOC IMUIIIICII L.)U,IIIUULIUH

1))) 5)) 2i))) .: ii J)) ! ui ii t Lu Junit

Figure 5.8 Sit rplut tied htte CLita iL 5O dud 5’ hodridie nuruta) e)Iipe Lu E\urnple 5. ). I.

Tao random variables 1.V and X that hake a baariate normal distribution are independent if and only if they are uncorrelated.

\Ve have already seen in Section 4.6 that tso random sariables X and X v ith an arbitrary joint distribution can he unconelated ithout being independent.

(‘onditional Distributions. The conditional distribution of X-. izien that ‘i = can also he derived from the representation in Eq. 15.12.2). If 1X = .. . then 1Z = )• Therefore, the conditional I — distribution of ,V eien that .V = i is the same as the conditional distribution of 2)t (I 2J7 + /L + (i (5.12.5) at I Because Z has a standard normal distribution and is independent of .1V it follows from 5. 12.5 that the conditional distribution of .V cisen that X = .r is a normal distribution. for which the mean is . _/tt), E(Xx = p. + pa (5.12.6) )1 ( and the variance is (1 — p2)a. The conditional distribution of 1X given that X = x cannot he deri’ed so easily from Eq. 5. 12.2) because of the different ways in s hich Z and Z enter Eq. t 5. 12.2). f1oseer. it is seen from Eq. (5.12.4) that the Joint p.d.f. 1fLy v is symmetric in the I t o ariables — p I o and (v /L ) nH . Therefore, it. foIlos s ihat the conditional distribution of 1X gi en that X = r can he found from the conditional distribution of \ uRen that .V = Ithis distribLttion has Just been deri’.ed) simply by interchanging

Example

316

Example

5.12.3

5,12,2

a

tional

mean

distribution

distribution

Determining

from

that

height.

the

regardless

linear

M.S.E.

(1 distribution

person’s

E(X 2 )

not

weight

the

We by

random

from

Predicting

marginal

variance

is

negative.

p

should Since

of

person’s

and

it

If

distribution

x 1

.

Chapter distribution

a

Eq.

known.

> (1

smallest

know

p 2 )c

is

a

Some

the

We

population

and

Furthermore,

and

a

0, bivariate

— the

is

easier

certain

must

(5.12.4)

the

height

person’s

the

variables

reduced

p 2 )a. have

be

of

p2;

of

variance

with

variance

t.

that

distribution of

However,

of

weight

particular of

a

the

5

variance

another

noted. a

M.S.E.

be

Marginal and to

the interchanging

of,

that

Person’s

now

Special

X 2

the

of

mean population.

known

of

is

normal

predict

predicted

and and

in

height

X 1

conditional

from

the

given

x 1 .

conditional

and

of marginal I

is have

r 2 .

shown

which

If

is

since

random

given

given

its

p that that

the

of

the

this

M.S.E.

then

(I

p

features

Distributions

We

height the

of

H

and

distribution

a

Weight.

Distribution.

the

a value that

— is

same

variance

the

person’s

X 2 .

can the linear

that height

bivariate

sariance

shall

with

to

0, that not

the

p 2 )a.

that

and

variance

conditional

distribution

p

X 1

variance

EtX 1 Ix 2 )

of

(I

then

person’s

distribution be

each

distribution.

known, does

best

for

X 2

let

of

X

and

the

determine — this

of function

and

attained

Let

of

a

p 2 )H. X 2

the

=

weight

x 1 ; EX 2 x 1 )

the

is

prediction

smallest marginal

normal

JLs. is

not tall

prediction

x

the

a 2 .

2 must

X 1 weight

Y

denote constant

(I a

person.

and conditional

is

then

weight

Suppose =

given

person.

and depend

distribution

of

conditional

denote

a —

and

is p

from

if

normal

of

the

the

p 2 )H

X

Hence. the

positive.

distribution

interchanging

the

M.S.E.

+

are the is

be

distribution

that

that X-

is

it

marginal

is X 2 M.S.E.

is best

a 1

It

a

her

a

a

the

on

that follows

a

person’s

weight

highly

the the

normal

linear follows

is

for

short

decreases normal

X must

distribution,

normal

distribution

when

the

height height

prediction

(:

smaller

a

that

distribution

in

= mean

If

variance

every

random

of

p

Example given

person,

correlated,

of

be function distribution

that

for

can

distribution,

is

this and

the that

<0.

height

distribution,

distribution.

of

when the

a 1

predicted.

P2)

E(X 2 jx 1 )

a number

which

as

the

than

a

normal

value be

each

prediction

and

value

the

person. H.

variable

the of

person

for

of

or

pJ

difficulty

attained

the

of

X 1

of

5.12.2

her Joint

the a.

If

a

slope increases,

conditional X 2

which

the

of

X 2

x,

x 1 .

person

the

of

person

it

is

distribution

weight

of

Thus,

for

variance

We

selected

Suppose

X 1

is given

the

distiihutiofl

and

p.d.f.

X given

known

I

Furthermore,

given

of

the

is

is

known if

which has

the

is

of

shall

conditional

U

the the the

of her

is

known.

the

conditional

predicting

is

it

is a that

that mean

selected

medium

value

normal distribution

the

condi when at

follows variance

function

that

height H

specified

compare

conditional

the

p 2 )a.

that

with

random

X 1

X 1

mean

of

(5.127)

of

iS

mean these

the

the

= her

4

=

this

the

is

is

If is 5.1.2 The Bivariate Normal Distribution 317

.Vand Y must he a hi ariate nirmal distribution see Exercise 14>.Hence, the marginal

distribution of 1 is also a normal disinbution. [Themean and the ariance of 1 niust he determined.

ihe mean at } is EiY EIEiYXiJ EX —jr.

Eui themmore. h E\ercmse II of Section 4.7,

ar( Y> E[Var( Y X ii Var[ Ei Y Xi Er) —Var>X>

k Hence, he distribution of ) is a normal distribution s oh mean i and variance r- o .4

Linear Combinations Suppose again that two random variables X and X hake a hivariate normal distribution, tar shich the p.d.f. is specified by Eq. 5.12.4). Now consider the Y = a X + a X + /, v here 0. a. and h are arbitrary given constants. Both 1X and X can he represented. as in Eq. (5.12.2). as linear combinations of independent and normally distributed random variables Z and Z. Since Y is a of X and X. it follows that V can also be represented as a linear combination of 1Z and Z. Therefore. by Corollary 56.1. the distribution of V will also be a normal distribution. Thus, the following important property has been established.

If two random variables 1X and ,V have a b/variate normal distribution, then will distribution. each linear combination V = a1X -i-aX + h have a normal

The mean and variance of V are as follows: E>Y> = a1E(X + a2E(X + b ) +a2M2 +1 ‘ili ) and Van Y = 02 Var) )1K a )5VariX a12a 1Cov>X )2K = c1rT aa. ± 2aapon. , (5.12.8) Example 5.12.4 Heights of Husbands and Wives. Suppose that a maiTied couple is selected at random from a certain population of married couples. and that the joint distribution of the height of the s ife and the height of her husband is a bivariate normal distribution. Suppose that the heights at the wives have a mean of 66.8 inches and a standard of 2 inches. the heights of the husbands have a mean of 70 inches and a of 2 inches. and the correlation heteen these to heights is 0.68. We shall determine the probability that the wife will he taller than her husband.

EXERCISES

2.

1.

318

population.

.4

given

Suppose

inches.

of

husbands

Consider

is

the

85.

on

to

sife

and

a

of

test

that

and

again

student

given

the Suppose

the

B

two

wives

conditional

is

the

standard

that

different

90,

chosen

be

treatment

stant

the

X

tion

If

Summary

joint

Therefore,

in

also

tion.

Hence,

and

the

for

normal

then

and

a

Chapter

and

found

Exaniple

conditional

random

The

aX

in

that

If

height

distribution

which

deviation

the

at

It

the

distribution

Y

tests

we

we

s’.

random

can

random

+

the

in are

the

distribution,

variance

of

standard

(Similarly,

5

must

let

bY

5.12.4.

of

the

the

A

the

vector

the

be

standard mean

normal.

Special

the

and

X

is +

mean

probability

book

found

bivariate

from

mean

denote

detennine

of

variable

10:

c

husband

of

score

deviation

B

Find

is

has (X,

heights

the

the

are

Var(X

for

by

a

being Also,

Distributions

deviation

from

is

it

certain

a Y)

the

on

Pr(X—

height

follows

the

mean

D.

to

the

normal

normal

Z

is

has

test

095

be

of

that

the

a

F.

is

the

the

height

conditional

72 =

linear

Morrison EX

a

(X Y)

conditional

table

value

the

bivariate

of

that

disthbution

Y

distribution. =

X

of

3.

>0)=Pr(Z

function wife

4

the

Var(X)

Y given

tests

the

the

Exercise

Consider

90?

of probability

0.8.

Y)

distribution

+

16:

+

distribution Y

distribution

(1990).

=

4

will

Pr(X

probability

normal

=

wife,

3.2)/(

the

If

is

will 0.0227.

distribution

at

66.8—

of the

and

+

1.6.

2(0.68)(2)(2)

scores

be

the

In

2.

he

again

Var(Y)

and

y

student’s

1.6)

taller

particular.

that

higher-dimensional

If

>2)=

and

distribution,

greater

end

Y

and

a

70

let

on

>

that

of

of

will

her

student

the

the

of

the

than

0).

the

=

X

Y

Y

—2

of

the

score

than

two

this

score

conditional

1—ci(2)

given

denote

have —3.2

correlation

two

Since

X

her

Cov(X,

sum

the

Y

is

tests

2(X)?

given

book

on

tests

then

will

on

chosen

2.56.

a

X

husband

marginal

of

standard

test

the

X

test

=

A

be

have

her

every

that

Y

and

x.)

and

Y)

height

B

of

variance

generalizations

A

a

at

=

scores

will

normal

the

A a

is

random.

y

Y

B

is

bivariate

linear

distributions

normal

80,

more

is

be

described

have

two

0.0227,

of

normal

on

what

higher

being

her

distributjon

scores

combina

thorough

the

what

a

distribu

normal

husband

bivayjate is

tWO

than

con

in

with

the

can

IS

15

of

4

) I