5.12 The Bivariate Normal Distribution 313 512 The Bivariate Normal Distribution
The first multivariate continuous distribution for which we have a name is a generalization of the normal distribution to two coordinates. There is more structure to the bivanate normal distribution than just a pair of normal marginal distributions.
Definition of the Bivarlate Normal Distribution
Suppose that Z and Z are independent random variables, each of which has a standard normal distribution. Then the joint p.d.f. 2) of 1Z and Z is specified for all values of and z by the equation
g(z,z,) = — exp[——(z + z)]. (5.12.1) 1 2 L 2 For constants 1t. /12, a. a,, and p such that —00 0 (i = 1, 2). and —1< p < 1, we shall now define two new random variables 1X and X, as follows: 1X 1=aZ 1-i—i X, = a [pZi + (1 — p 2z + it,. (5.12.2) We shall derive the joint p.d.f. 1f(x X2) of 1X and X,. The transformation from 1Z . and 1, to 1X and 2X is a linear transformation; and it will be found that the determinant of the matrix of coefficients of 1Z and 2Z has the value z\ = (1 Therefore, as discussed — )p2 a1212 in Section 3.9, the Jacobian J of the inverse transformation. from 1X and K, to 1Z and 2Z is J = —i = -aa2(l—p-) . (5.12.3) Since J > 0. the value of IJI is equal to1the value of J itself. If the relations (5.12.2) are solved for Zi and 2Z in terms of X and .2X then the joint p.d.f. 1f(x x-,) can be obtained by replacing and z in Eq. (5.12.1) by their expressions in terms. 1ofx and 52. and then multiplying by JI. It can be shown that the result is, for — 1 When thejoint p.d.f. of two random variables 1X and 2X is of the form in Eq. (5.12.4). it is said that 1X and X-,have a biiari ate normaldistribution. The means and the variances of the bivariate normal distribution specified by Eq. (5.12.4) are easily derived from the definitions in Eq. (5.12.2). Because 1Z and 2Z are independent and each has mean 0 and 314 Example 5.12.1 and marginal Independence it marginal distributions and normal X 2 in Marginaiflistributions. X-, Marginal correlation 25%. correspond from the to can cerned eral Anthropometry normal of bivariate the ple, joint normal ing Also, the disthbution the Furthermore, Chapter the as have distinguish the Z. first he It heights physical standard in study for correlation the 50%. following distribution distributions. has individuals seen it with Eq. joint p.d.f a distribution distribution. distribution many species bivariate follows normal 5 been of of 1. to from (5.12.2). and of and and whether in Special of the it the features and for a normal the both populations the it follows fitted 75% result X convenient the of Eq. properties Chaetocnema can fitted which from of distribution. Conditional different in correlation, normal of first X 1 and of weights X 1 In Flea (5.12.4) the E(Xi)=j.L: where arises Distributions some be of bivariate certain distributions. and of has X particular. distribution this tarsus that the and shown the population the a Beetles. We distribution, is X: been combination directly variety of representation species. Z 1 marginal the p.d.f. probability E(X 1 ) for a X of that linear is versus shall this normal For and normal by the joint simply heikertingeri. established: us are If thejointp.d.f. is using since Distributions distribution, of other continue individuals Z X 1 and Lubischew to = is and on combinations It specified Figure also the distribution p.d.f. p(X 1 , 0.64. species i 1 . distribution should introduce are and distribution. two and p. naturally Eq. both of second of populations, normal In E(X 2 ) independent Var(X 1 )=a X-, and related easily the of their 5.8 (5.12.2) X 2 ) summary. to X 1 be of are by X 2 The assume in fitted (1962) it from joint shows j(x. joint = and emphasized, flea the will Eq. = distributions. of uncorrelated, the in obtained of Hence, with tests p. p, The many that two bivariate Corollary bivariate in independent beetle. X, (5.12.4), population be x’) p.d.f. the if reports a that random the mean will Var(X 1 ) also convenient Cov(X X 1 scatterplot are physical factors joint X practical fori=l,2. the first is and be linear The includes and /1, specified however, the normal random then normal approximately then distribution 5.6.1 Thus. variables tarsus X 2 . were = into will and investigation X) X characteristics measurements random combinations have r. to problems. of p the variance that be are three = chosen =0. for for represent distribution distribution. measurements variables and by that a Plo’2. approximately product0 indePefldd1 with a bivariate the i could Eq. In variables Var(Xj = ellipses of the this to mar a a standard (5.12.41 1,2, For the was Therefo, bivariate X 1 X 1 contain bivariate be case of of such scores exam of arid nornj the and as used The con that I sev hav 31 4 of the as a I IC ClVcflIOC IMUIIIICII L.)U,IIIUULIUH 1))) 5)) 2i))) .: ii J)) ! ui ii t Lu Junit Figure 5.8 Sit rplut tied htte CLita iL 5O dud 5’ hodridie nuruta) e)Iipe Lu E\urnple 5. ). I. Tao random variables 1.V and X that hake a baariate normal distribution are independent if and only if they are uncorrelated. \Ve have already seen in Section 4.6 that tso random sariables X and X v ith an arbitrary joint distribution can he unconelated ithout being independent. (‘onditional Distributions. The conditional distribution of X-. izien that ‘i = can also he derived from the representation in Eq. 15.12.2). If 1X = .. . then 1Z = )• Therefore, the conditional I — distribution of ,V eien that .V = i is the same as the conditional distribution of 2)t (I 2J7 + /L + (i (5.12.5) at I Because Z has a standard normal distribution and is independent of .1V it follows from 5. 12.5 that the conditional distribution of .V cisen that X = .r is a normal distribution. for which the mean is . _/tt), E(Xx = p. + pa (5.12.6) )1 ( and the variance is (1 — p2)a. The conditional distribution of 1X given that X = x cannot he deri’ed so easily from Eq. 5. 12.2) because of the different ways in s hich Z and Z enter Eq. t 5. 12.2). f1oseer. it is seen from Eq. (5.12.4) that the Joint p.d.f. 1fLy v is symmetric in the I t o ariables — p I o and (v /L ) nH . Therefore, it. foIlos s ihat the conditional distribution of 1X gi en that X = r can he found from the conditional distribution of \ uRen that .V = Ithis distribLttion has Just been deri’.ed) simply by interchanging Example 316 Example 5.12.3 5,12,2 a tional mean distribution distribution Determining from that height. the regardless linear M.S.E. (1 distribution person’s E(X 2 ) not weight the We by random from Predicting marginal variance is negative. p should Since of person’s and it If distribution x 1 . Chapter distribution a Eq. known. > (1 smallest know p 2 )c is a Some the We population and Furthermore, and a 0, bivariate — the is easier certain must (5.12.4) the height person’s the variables reduced p 2 )a. have be of p2; of variance with variance t. that distribution of However, of weight particular of a the 5 variance another noted. a M.S.E. be Marginal and to the interchanging of, that Person’s now Special X 2 the of mean population. known of is normal predict predicted and and in height X 1 conditional from the given x 1 . conditional and of marginal I is have r 2 . shown which If is since random given given its p that that the of the this M.S.E. then (I p features Distributions We height the of H and distribution a Weight. Distribution. the a value that — is same variance the person’s X 2 . can the linear that height bivariate sariance shall with to 0, that not the p 2 )a. that and variance conditional distribution p X 1 variance EtX 1 Ix 2 ) of (I then person’s distribution be each distribution. known, does best for X 2 let of X and the determine — this of function and attained Let of a p 2 )H. X 2 the = weight x 1 ; EX 2 x 1 ) the is prediction smallest marginal normal JLs. is not tall prediction x the a 2 . 2 must X 1 weight Y denote constant (I a person. and conditional is then weight Suppose = given person. and depend univariate distribution of conditional denote a — and is p from if normal of the the p 2 )H X Hence. the positive. distribution interchanging the M.S.E. + are the is be distribution that that X- is it marginal is X 2 M.S.E. is best a 1 It a her a a the on that follows a person’s weight highly the the normal linear follows is for short decreases normal X must distribution, normal distribution when the height height prediction (: smaller a that distribution in = mean If variance every random of p Example given person, correlated, of be function distribution that for can distribution, is this and the that <0. height distribution, distribution. of when the a 1 predicted. P2) E(X 2 jx 1 ) a number which as the than a normal value be each prediction and value the person. H. variable the of person for of or pJ difficulty attained the of X 1 of 5.12.2 her Joint the a. If a slope increases, conditional X 2 which the of X 2 x, x 1 . person the of person it is distribution weight of Thus, for variance We selected Suppose X 1 is given the distiihutiofl and p.d.f. X given known I Furthermore, given of the is is known if which has the is of shall conditional U the the the of her is known. the conditional predicting is it is a that that mean — selected medium value normal distribution the condi when at follows variance function that height H specified compare conditional the p 2 )a. that with random X 1 X 1 mean of (5.127) of iS mean these the the = her 4 = this the is is If is 5.1.2 The Bivariate Normal Distribution 317 .Vand Y must he a hi ariate nirmal distribution see Exercise 14>.Hence, the marginal distribution of 1 is also a normal disinbution. [Themean and the ariance of 1 niust he determined. ihe mean at } is EiY EIEiYXiJ EX —jr. Eui themmore. h E\ercmse II of Section 4.7, ar( Y> E[Var( Y X ii Var[ Ei Y Xi Er) —Var>X> k Hence, he distribution of ) is a normal distribution s oh mean i and variance r- o .4 Linear Combinations Suppose again that two random variables X and X hake a hivariate normal distribution, tar shich the p.d.f. is specified by Eq. 5.12.4). Now consider the random variable Y = a X + a X + /, v here 0. a. and h are arbitrary given constants. Both 1X and X can he represented. as in Eq. (5.12.2). as linear combinations of independent and normally distributed random variables Z and Z. Since Y is a linear combination of X and X. it follows that V can also be represented as a linear combination of 1Z and Z. Therefore. by Corollary 56.1. the distribution of V will also be a normal distribution. Thus, the following important property has been established. If two random variables 1X and ,V have a b/variate normal distribution, then will distribution. each linear combination V = a1X -i-aX + h have a normal The mean and variance of V are as follows: E>Y> = a1E(X + a2E(X + b ) +a2M2 +1 ‘ili ) and Van Y = 02 Var) )1K a )5VariX a12a 1Cov>X )2K = c1rT aa. ± 2aapon. , (5.12.8) Example 5.12.4 Heights of Husbands and Wives. Suppose that a maiTied couple is selected at random from a certain population of married couples. and that the joint distribution of the height of the s ife and the height of her husband is a bivariate normal distribution. Suppose that the heights at the wives have a mean of 66.8 inches and a standard deviation of 2 inches. the heights of the husbands have a mean of 70 inches and a standard deviation of 2 inches. and the correlation heteen these to heights is 0.68. We shall determine the probability that the wife will he taller than her husband. EXERCISES 2. 1. 318 population. .4 given Suppose inches. of husbands Consider is the 85. on to sife and a of test that and again student given the Suppose the B two wives conditional is the standard that different 90, chosen be treatment stant the X tion If Summary joint Therefore, in also tion. Hence, and the for normal then and a Chapter and found Exaniple conditional random The aX in that If height distribution which deviation the at It the distribution Y tests we we s’. random can random + the in are the distribution, variance of standard (Similarly, 5 must let bY 5.12.4. of the the A the vector the be standard mean normal. Special the and X is + mean probability book found bivariate from mean denote detennine of variable 10: c husband of score deviation B Find is has (X, heights the the are Var(X for by a being Also, Distributions deviation from is it certain a Y) the on Pr(X— height follows the mean D. to the normal normal Z is has test 095 be of that the a F. is the the height conditional 72 = — linear Morrison EX a (X Y) conditional table value the bivariate of that disthbution Y distribution. = — X of 3. >0)=Pr(Z function wife — 4 the Var(X) Y given — tests the the Exercise Consider 90? of probability 0.8. Y) distribution + 16: + distribution Y distribution (1990). = 4 will Pr(X probability normal = wife, 3.2)/( the If is will 0.0227. — distribution at 66.8— of the and + 1.6. 2(0.68)(2)(2) scores be the In 2. he again — Var(Y) and y student’s 1.6) taller particular. that higher-dimensional If >2)= and distribution, greater end Y and a 70 let on > that of of will her student the the of the than 0). the = X Y Y —2 of the score than two this score conditional 1—ci(2) given denote have —3.2 — correlation two Since X her Cov(X, sum the Y is tests 2(X)? given book on tests then will on chosen 2.56. a X husband marginal of standard test the X test = A be have her every that Y and x.) and Y) height B of variance generalizations A a at = scores will normal the A a is random. y Y B is bivariate linear distributions normal 80, more is be described have two 0.0227, of normal on what higher being her distributjon scores combina thorough the what a distribu normal husband bivayjate is tWO than con in with the can IS 15 of 4 ) I