The tower property of conditional expectation (also called the law of iterated expectations) is akin to the law of total probability that we met in Section 6.3. Another name is law of total expectation. A special case is
E[X]=∑i:P(Ai)>0E[X∣Ai]P(Ai),
where A1,…,An are events giving a partition of the sample space Ω. Compare the above to
E[X]=E[E[X∣Y]]=∑yE[X∣Y=y]P(Y=y).
Sometimes we might write E[E[X∣Y]] as EY[E[X∣Y]], just to remind ourselves that the outer expectation is being applied to the variable Y. Of course it is implicit in all this that E[X] exists.
Remember that P(A|B) is only defined for P(B)>0. If you want to see something that illustrates this in a cute way, then read this nice description of Borel's paradox. The paradox occurs because of a mistaken step in conditioning on an event of probability 0, which can be reached as a limit of a sequence of events in different ways.
E[X]=∑i:P(Ai)>0E[X∣Ai]P(Ai),
where A1,…,An are events giving a partition of the sample space Ω. Compare the above to
E[X]=E[E[X∣Y]]=∑yE[X∣Y=y]P(Y=y).
Sometimes we might write E[E[X∣Y]] as EY[E[X∣Y]], just to remind ourselves that the outer expectation is being applied to the variable Y. Of course it is implicit in all this that E[X] exists.
Remember that P(A|B) is only defined for P(B)>0. If you want to see something that illustrates this in a cute way, then read this nice description of Borel's paradox. The paradox occurs because of a mistaken step in conditioning on an event of probability 0, which can be reached as a limit of a sequence of events in different ways.
Oliver Prior · 580 weeks ago
Richard Weber 35p · 580 weeks ago
var(S_N)=E[N]var(X) plus (EX)^2 var(N)
This is Examples Sheet 3, #8. Note that with the above formula the variance is dimensionally correct: imagine that X_i are measured in metres, and N is dimensionless. The answer should have units of meters^2.
I have corrected the notes.
p.s. For some odd reason I can't write a plus sign in a comment. I wonder why that is?