Conditional Expectation Statistics 251

Conditional Expectation Statistics 251

Lecture 21: Conditional Expectation Statistics 251 Yi Sun and Zhongjian Wang Department of Statistics The University of Chicago Lecture Outline Formula for computation Compute Expectation by Conditioning Conditional Expectation is the best preditor Where are we? Formula for computation Compute Expectation by Conditioning Conditional Expectation is the best preditor Base on this, we know pX jY {· j Y = yg is a distribution function. Now how to define E[X j Y = y]? The conditional expectation of X given that Y = y; for all values of y such that pY (y) > 0; by X E[X j Y = y] = xPfX = x j Y = yg x X = xpX jY (x j y) x Conditional Expectation for Discrete r.v. Recall that if X and Y are jointly discrete random variables, then the conditional probability mass function of X ; given that Y = y; is defined, for all y such that PfY = yg > 0; by p(x; y) pX jY (x j y) = PfX = x j Y = yg = pY (y) The conditional expectation of X given that Y = y; for all values of y such that pY (y) > 0; by X E[X j Y = y] = xPfX = x j Y = yg x X = xpX jY (x j y) x Conditional Expectation for Discrete r.v. Recall that if X and Y are jointly discrete random variables, then the conditional probability mass function of X ; given that Y = y; is defined, for all y such that PfY = yg > 0; by p(x; y) pX jY (x j y) = PfX = x j Y = yg = pY (y) Base on this, we know pX jY {· j Y = yg is a distribution function. Now how to define E[X j Y = y]? Conditional Expectation for Discrete r.v. Recall that if X and Y are jointly discrete random variables, then the conditional probability mass function of X ; given that Y = y; is defined, for all y such that PfY = yg > 0; by p(x; y) pX jY (x j y) = PfX = x j Y = yg = pY (y) Base on this, we know pX jY {· j Y = yg is a distribution function. Now how to define E[X j Y = y]? The conditional expectation of X given that Y = y; for all values of y such that pY (y) > 0; by X E[X j Y = y] = xPfX = x j Y = yg x X = xpX jY (x j y) x Again, we know pX jY {· j Y = yg is a distribution function. Now how to define E[X j Y = y]? The conditional expectation of X ; given that Y = y; by Z 1 E[X j Y = y] = xfX jY (x j y)dx −∞ provided that fY (y) > 0 Conditional Expectation for Continuous r.v. Similarly, let us recall that if X and Y are jointly continuous with a joint probability density function f (x; y); then the conditional probability density of X ; given that Y = y; is defined, for all values of y such that fY (y) > 0; by f (x; y) fX jY (x j y) = fY (y) The conditional expectation of X ; given that Y = y; by Z 1 E[X j Y = y] = xfX jY (x j y)dx −∞ provided that fY (y) > 0 Conditional Expectation for Continuous r.v. Similarly, let us recall that if X and Y are jointly continuous with a joint probability density function f (x; y); then the conditional probability density of X ; given that Y = y; is defined, for all values of y such that fY (y) > 0; by f (x; y) fX jY (x j y) = fY (y) Again, we know pX jY {· j Y = yg is a distribution function. Now how to define E[X j Y = y]? Conditional Expectation for Continuous r.v. Similarly, let us recall that if X and Y are jointly continuous with a joint probability density function f (x; y); then the conditional probability density of X ; given that Y = y; is defined, for all values of y such that fY (y) > 0; by f (x; y) fX jY (x j y) = fY (y) Again, we know pX jY {· j Y = yg is a distribution function. Now how to define E[X j Y = y]? The conditional expectation of X ; given that Y = y; by Z 1 E[X j Y = y] = xfX jY (x j y)dx −∞ provided that fY (y) > 0 Example Suppose that the joint density of X and Y is given by e−x=y e−y f (x; y) = 0 < x < 1; 0 < y < 1 y Compute E[X j Y = y]. Where are we? Formula for computation Compute Expectation by Conditioning Conditional Expectation is the best preditor If Y is a discrete random variable, X E[X ] = E[X j Y = y]PfY = yg y whereas if Y is continuous random variable, Z 1 E[X ] = E[X j Y = y]fY (y)dy −∞ Formula E[X ] = E[E[X j Y ]] What does it mean? whereas if Y is continuous random variable, Z 1 E[X ] = E[X j Y = y]fY (y)dy −∞ Formula E[X ] = E[E[X j Y ]] What does it mean? If Y is a discrete random variable, X E[X ] = E[X j Y = y]PfY = yg y Formula E[X ] = E[E[X j Y ]] What does it mean? If Y is a discrete random variable, X E[X ] = E[X j Y = y]PfY = yg y whereas if Y is continuous random variable, Z 1 E[X ] = E[X j Y = y]fY (y)dy −∞ Example: Miner Travel A miner is trapped in a mine containing 3 doors. I The first door leads to a tunnel that will take him to safety after 3 hours of travel. I The second door leads to a tunnel that will return him to the mine after 5 hours of travel. I The third door leads to a tunnel that will return him to the mine after 7 hours. If we assume that the miner is at all times equally likely to choose any one of the doors, what is the expected length of time until he reaches safety? Example Suppose that X and Y are independent continuous random variables having densities fX and fY ; respectively. Compute PfX < Y g Proof can be found on Ross Book and is left as homework. Conditional Variance Var(X ) = E[Var(X j Y )] + Var(E[X j Y ]) Conditional Variance Var(X ) = E[Var(X j Y )] + Var(E[X j Y ]) Proof can be found on Ross Book and is left as homework. Where are we? Formula for computation Compute Expectation by Conditioning Conditional Expectation is the best preditor I We would like to show g(X ) = E[Y j X ], is the best possible predictor. How to determine a predictor is good (or best)? One possible criterion for closeness is to choose g so as to minimize E (Y − g(X ))2 : Derivation I Assume a random variable X is observed. I We need to predict the value of a second random variable Y . I Let g(X ) denote the predictor for Y . How to determine a predictor is good (or best)? One possible criterion for closeness is to choose g so as to minimize E (Y − g(X ))2 : Derivation I Assume a random variable X is observed. I We need to predict the value of a second random variable Y . I Let g(X ) denote the predictor for Y . I We would like to show g(X ) = E[Y j X ], is the best possible predictor. One possible criterion for closeness is to choose g so as to minimize E (Y − g(X ))2 : Derivation I Assume a random variable X is observed. I We need to predict the value of a second random variable Y . I Let g(X ) denote the predictor for Y . I We would like to show g(X ) = E[Y j X ], is the best possible predictor. How to determine a predictor is good (or best)? Derivation I Assume a random variable X is observed. I We need to predict the value of a second random variable Y . I Let g(X ) denote the predictor for Y . I We would like to show g(X ) = E[Y j X ], is the best possible predictor. How to determine a predictor is good (or best)? One possible criterion for closeness is to choose g so as to minimize E (Y − g(X ))2 : E (Y − g(X ))2 j X =E (Y −E[Y j X ] + E[Y j X ] − g(X ))2 j X =E (Y − E[Y j X ])2 j X + E (E[Y j X ] − g(X ))2 j X +2 E[(Y − E[Y j X ])(E[Y j X ] − g(X )) j X ] However, given X ; E[Y j X ] − g(X ); being a function of X ; can be treated as a constant. Thus, E[(Y − E[Y j X ])(E[Y j X ] − g(X )) j X ] = (E[Y j X ] − g(X ))E[Y − E[Y j X ] j X ] = (E[Y j X ] − g(X ))(E[Y j X ] − E[Y j X ]) = 0 Proof In detail, we would like to prove, E (Y − g(X ))2 ≥ E (Y − E[Y j X ])2 : However, given X ; E[Y j X ] − g(X ); being a function of X ; can be treated as a constant. Thus, E[(Y − E[Y j X ])(E[Y j X ] − g(X )) j X ] = (E[Y j X ] − g(X ))E[Y − E[Y j X ] j X ] = (E[Y j X ] − g(X ))(E[Y j X ] − E[Y j X ]) = 0 Proof In detail, we would like to prove, E (Y − g(X ))2 ≥ E (Y − E[Y j X ])2 : E (Y − g(X ))2 j X =E (Y −E[Y j X ] + E[Y j X ] − g(X ))2 j X =E (Y − E[Y j X ])2 j X + E (E[Y j X ] − g(X ))2 j X +2 E[(Y − E[Y j X ])(E[Y j X ] − g(X )) j X ] Proof In detail, we would like to prove, E (Y − g(X ))2 ≥ E (Y − E[Y j X ])2 : E (Y − g(X ))2 j X =E (Y −E[Y j X ] + E[Y j X ] − g(X ))2 j X =E (Y − E[Y j X ])2 j X + E (E[Y j X ] − g(X ))2 j X +2 E[(Y − E[Y j X ])(E[Y j X ] − g(X )) j X ] However, given X ; E[Y j X ] − g(X ); being a function of X ; can be treated as a constant.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    26 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us