Theorem 6.1 ()
Consider a measure space and two measures , such that is -finite (Definition 31.4) and (Definition 31.1). Then there exists a measurable function such that:
Definition 6.1 ()
Given a probability space , consider as a sub -field of , i.e. . Let’s consider a random variable with finite expectation . We define a conditional expectation for given , any random variable such that:
- has finite expectation, i.e. .
- is -measurable.
- , , namely if and are restricted to , then their expectation coincides.
A -field can be used to describe our state of information. It means that, we already know if the event A has occurred or not. Therefore, when we insert in the events that we know were already occurred, we are saying that the random variable is -measurable, i.e. the value of is not stochastic once we know the information contained in . Moreover, the random variable represent a prediction of the random variable , given the information contained in the sub -field .
Definition 6.2 ()
Consider any -measurable random variable. Then can be interpreted as a predictor of another random variable under the information contained in the -field . However, when we substitute with its prediction, namely , we make an error given by the difference . In the special case in which , we can take as error function the mean squared error, i.e. We say that the conditional expectation is the best predictor in the sense that: Hence, is the best predictor that minimize the mean squared error over the class composed by -measurable functions with finite second moment, formally
Properties of conditional expectation
Here we state some useful properties of conditional expectation:
- Linearity: , for all constants .
- Positive: .
- Measurability: If is -measurable, then .
- Constant: . In general, if is -measurable then , i.e. is not stochastic.
- Independence: If is independent from the -field , then .
- Chain rule: consider two two sub -fields of such that , then we can write: Remember that, when using this property it is mandatory to take the conditional expectation before with respect to the greatest -field, i.e. the one that contains more information (in this case ), and then with respect to the smallest one (in this case ).
Conditional probability
Definition 6.3 ()
Given a probability space , consider as a sub -field of , i.e. . Then the general definition of the conditional probability of an event given is: Instead, the elementary definition do not consider the conditioning with respect to a -field, but instead with respect to a single event . In practice, take an event such that , then the conditional probability of given is defined as:
The elementary (Equation 6.2) and the general (Equation 6.1) definitions are equivalent, in fact consider a sub -field which provides only the information concerning whenever is in or not. A -field of this kind will have the form . Then, consider a -measurable function, , such that: It remains to find and in the following expression: Note that, the joint probability of and can be obtained as: Hence, we obtain: Equivalently for it is possible to prove that:
Finally it is possible to write the conditional probability in the general definition as a linear combination of conditional probabilities defined accordingly to the elementary one, i.e.
Example 6.1 Let’s continue from the example Example 2.1, let’s say that we observe , then we ask ourselves, what is the probability that in the next extraction ? The chances that with 52 cards we obtain is approximately (see Example 3.1). Then, given the fact that the extracted card originates we have that the probability, conditional to the fact that in the first extraction we had a card , that in the next extraction we have is . Let’s now investigate the chances that in the next extraction given that in the previous was . The unconditional probability is , the conditional probability will be .
Example 6.2 Let’s consider two random variables and taking values in . The marginal probabilities and . Let’s consider the matrix of joint events and probabilities, i.e. Then, by definition the conditional probabilities are defined as: and Considering instead: and Then, it is possible to express the marginal probability of as: And similarly for