1 Basic set theory

Reference: Chapter 1. Resnick (2005).

In probability, an event is interpreted as a collection of possible outcomes of a random experiment.

Definition 1.1 ( $Random experiment$ )
A random experiment is any repeatable procedure that results in one out of a well-defined set of possible outcomes.

The set of possible outcomes is called the sample space and is denoted by $Ω$. An element of $Ω$ is denoted by $ω$ and is called an outcome.
A set of zero or more outcomes $ω$ is an event $A$ , where we denote with $F$ a class of subsets $A \subset Ω$ .
A map (function) that goes from the space of the events $F$ to the space of probabilities (real numbers in $[0, 1]$ ) is the probability law and is denoted with $P$ .

Together, sample space, event space and probability law characterize a random experiment.
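As a minimal sketch of these three ingredients, consider a single roll of a six-sided die. The uniform law used below (all outcomes equally likely) and the names `omega` and `prob` are assumptions made for illustration; they are not part of the definitions above.

```python
from fractions import Fraction

# Sample space of one die roll (illustrative assumption).
omega = frozenset({1, 2, 3, 4, 5, 6})

def prob(event):
    """Uniform probability law: P(A) = |A| / |Omega| (an assumed law)."""
    event = frozenset(event)
    if not event <= omega:
        raise ValueError("an event must be a subset of the sample space")
    return Fraction(len(event), len(omega))

even = {2, 4, 6}      # the event "the outcome is even"
print(prob(even))     # 1/2
print(prob(omega))    # 1
print(prob(set()))    # 0
```

Here the event space is implicitly the class of all subsets of `omega`, which is a reasonable choice only because the sample space is finite.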

There are several definitions related to sets and their operations.

Definition 1.2 ( $Complementation$ )
The complement of a set $A$ is denoted by $A^{c}$ and represents the set of elements that do not belong to $A$, i.e. $\begin{matrix} (1.1) & A^{c} = \{ω \in Ω : ω \notin A\} . \end{matrix}$

Definition 1.3 ( $Containment$ )
A set $A$ is said to be contained in a set $B$ if every element of $A$ is also an element of $B$. Formally, $\begin{matrix} (1.2) & A \subset B ⟺ (ω \in A ⟹ ω \in B) \ \ \forall ω \in Ω . \end{matrix}$

Definition 1.4 ( $Equality$ )
Given two sets, $A$ is equal to $B$, written $A = B$, if and only if every element of $A$ is an element of $B$ and every element of $B$ is an element of $A$. Formally, $\begin{matrix} (1.3) & A = B ⟺ A \subset B \text{ and } B \subset A . \end{matrix}$
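The three definitions above translate directly into element-wise checks. The following is a small sketch on a hypothetical finite sample space; the helper names `complement`, `is_contained` and `is_equal` are illustrative, and Python's built-in `<=` and `==` on sets would give the same answers.

```python
omega = frozenset(range(1, 7))   # hypothetical sample space {1, ..., 6}
A = frozenset({2, 4})
B = frozenset({2, 4, 6})

def complement(S, universe=omega):
    """A^c: the outcomes of the universe that do not belong to S."""
    return universe - S

def is_contained(S, T):
    """S subset of T: every element of S is also an element of T."""
    return all(w in T for w in S)

def is_equal(S, T):
    """S = T: containment in both directions."""
    return is_contained(S, T) and is_contained(T, S)

print(sorted(complement(A)))   # [1, 3, 5, 6]
print(is_contained(A, B))      # True
print(is_equal(A, B))          # False
```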

1.1 Set operations

Let’s now state some elementary operations between sets.

Definition 1.5 ( $Union$ )
The union of two sets, written $A \cup B$, is the set of outcomes $ω$ that belong to $A$, to $B$, or to both, i.e. $\begin{matrix} (1.4) & A \cup B = \{ω \in Ω : ω \in A \text{ or } ω \in B\} . \end{matrix}$

As a consequence of the definition of the union, the following relations hold: $\begin{aligned} A \cup A & = A, & A \cup Ω & = Ω, \\ A \cup \emptyset & = A, & A ⊔ A^{c} & = Ω . \end{aligned}$

Definition 1.6 ( $Intersection$ )
The intersection of $A$ and $B$ is written $A \cap B$ and is the set of elements that belong to both $A$ and $B$, i.e. $A \cap B = \{ω \in Ω : ω \in A \text{ and } ω \in B\} .$

As a consequence of the definition of the intersection, the following relations hold: $\begin{aligned} A \cap A & = A, & A \cap Ω & = A, \\ A \cap \emptyset & = \emptyset, & A \cap A^{c} & = \emptyset . \end{aligned}$ Moreover, let’s state the distributive laws of the union and the intersection, i.e. $\begin{matrix} (1.5) & \begin{aligned} \text{Intersection:} \quad & (A \cup B) \cap C = (A \cap C) \cup (B \cap C) \\ \text{Union:} \quad & (A \cap B) \cup C = (A \cup C) \cap (B \cup C) \end{aligned} \end{matrix}$ and De Morgan’s laws, i.e. $\begin{matrix} (1.6) & \begin{aligned} \text{Intersection:} \quad & (A \cap B)^{c} = A^{c} \cup B^{c} \\ \text{Union:} \quad & (A \cup B)^{c} = A^{c} \cap B^{c} \end{aligned} \end{matrix}$
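The distributive laws (Equation 1.5) and De Morgan’s laws (Equation 1.6) can be checked by brute force on a small sample space. The sketch below enumerates every subset of a hypothetical three-element $Ω$; it is a sanity check on one example, not a proof, and the helper names are illustrative.

```python
from itertools import combinations

omega = frozenset({1, 2, 3})   # small hypothetical sample space

def subsets(universe):
    """All subsets of a finite set, as frozensets."""
    items = sorted(universe)
    return [frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)]

def comp(S):
    """Complement relative to omega."""
    return omega - S

all_sets = subsets(omega)

# Equation (1.5): both distributive laws, for every triple (A, B, C).
distributive_ok = all(
    (A | B) & C == (A & C) | (B & C) and (A & B) | C == (A | C) & (B | C)
    for A in all_sets for B in all_sets for C in all_sets
)

# Equation (1.6): both De Morgan laws, for every pair (A, B).
de_morgan_ok = all(
    comp(A & B) == comp(A) | comp(B) and comp(A | B) == comp(A) & comp(B)
    for A in all_sets for B in all_sets
)

print(distributive_ok, de_morgan_ok)   # True True
```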

Definition 1.7 ( $Difference$ )
The difference between two sets $A$ and $B$, written $A - B$ (or also $A \setminus B$), is the set of elements of $A$ that do not belong to $B$. Formally, $\begin{matrix} (1.7) & A - B = A \cap B^{c} = \{ω \in Ω : ω \in A \text{ and } ω \notin B\} . \end{matrix}$

Disjoint representation of a set

Given two sets $A$ and $B$, each one can be written as a union of disjoint sets. In fact, their union can be decomposed into the union of three disjoint sets, i.e. $\begin{matrix} (1.8) & A \cup B = (A \cap B) ⊔ (A \cap B^{c}) ⊔ (A^{c} \cap B), \end{matrix}$ and therefore, for example, the set $A$ can be written as $\begin{matrix} (1.9) & A = (A \cap B) ⊔ (A - B) = (A \cap B) ⊔ (A \cap B^{c}) . \end{matrix}$
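A quick numerical check of the decompositions (1.8) and (1.9) on two hypothetical sets. The sets `A`, `B` and the sample space `omega` are arbitrary choices made for illustration.

```python
A = frozenset({1, 2, 3, 4})
B = frozenset({3, 4, 5})
omega = A | B | {6}            # hypothetical sample space containing both sets

# The three pieces of Equation (1.8).
parts = [A & B, A & (omega - B), (omega - A) & B]

pairwise_disjoint = all(
    parts[i].isdisjoint(parts[j])
    for i in range(len(parts)) for j in range(i + 1, len(parts))
)
union_recovered = frozenset().union(*parts) == A | B

# Equation (1.9): A is the disjoint union of A & B and A - B.
a_recovered = (A & B) | (A - B) == A and (A & B).isdisjoint(A - B)

print([sorted(p) for p in parts])   # [[3, 4], [1, 2], [5]]
print(pairwise_disjoint, union_recovered, a_recovered)   # True True True
```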

Definition 1.8 ( $Symmetric difference$ )
The symmetric difference between two sets $A$ and $B$ is written $A Δ B$ and is the union of the elements of $A$ that do not belong to $B$ and of the elements of $B$ that do not belong to $A$, i.e. $\begin{aligned} A Δ B & = (A - B) ⊔ (B - A) \\ & = (A \cap B^{c}) ⊔ (A^{c} \cap B) \\ & = \{ω : ω \in A, ω \notin B\} ⊔ \{ω : ω \in B, ω \notin A\} . \end{aligned}$

Proposition 1.1 Given two sets $A, B$, the symmetric difference can be written as $A Δ B = (A \cup B) \cap (A^{c} \cup B^{c}) .$

Proof: Proposition 1.1

Proof. Let’s denote $C = A^{c} \cap B$, then apply the distributive law of the union (Equation 1.5) repeatedly and develop the computations, i.e. $\begin{aligned} A Δ B & = (A \cap B^{c}) ⊔ (A^{c} \cap B) \\ & = (A \cap B^{c}) ⊔ C \\ & = (A \cup C) \cap (B^{c} \cup C) \\ & = [A \cup (A^{c} \cap B)] \cap [B^{c} \cup (A^{c} \cap B)] \\ & = [(A \cup A^{c}) \cap (A \cup B)] \cap [(B^{c} \cup A^{c}) \cap (B^{c} \cup B)] \\ & = [Ω \cap (A \cup B)] \cap [(A^{c} \cup B^{c}) \cap Ω] \\ & = (A \cup B) \cap (A^{c} \cup B^{c}) . \end{aligned}$
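The identity of Proposition 1.1 can be illustrated on a concrete pair of sets. The sketch below compares the definition, the expression from the proposition, and Python's built-in symmetric-difference operator `^`; the sets and the sample space are hypothetical.

```python
omega = frozenset(range(8))    # hypothetical sample space {0, ..., 7}
A = frozenset({0, 1, 2, 3})
B = frozenset({2, 3, 4, 5})

def comp(S):
    """Complement relative to omega."""
    return omega - S

by_definition = (A - B) | (B - A)                 # Definition 1.8
by_proposition = (A | B) & (comp(A) | comp(B))    # Proposition 1.1
built_in = A ^ B                                  # Python's operator

print(sorted(by_definition))                          # [0, 1, 4, 5]
print(by_definition == by_proposition == built_in)    # True
```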

1.2 Indicator function

Definition 1.9 ( $Indicator function$ )
An indicator function is a function that associates each $ω \in Ω$ with either 0 or 1, according to whether $ω$ belongs to a given set $A \subset Ω$. It is a tool that allows one to transfer a computation from the domain of sets to the domain of real numbers, namely $\{0, 1\}$. Formally, $1_{A} : Ω \to \{0, 1\}$, i.e. $1_{A} (ω) = \begin{cases} 1 & ω \in A \\ 0 & ω \in A^{c} \end{cases} .$
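As an illustration of how indicators turn set computations into arithmetic, the sketch below builds $1_{A}$ as a Python function and checks two standard identities that are not stated in the text above: $1_{A \cap B} = 1_{A} \cdot 1_{B}$ and $1_{A^{c}} = 1 - 1_{A}$. The sets and the helper name `indicator` are hypothetical.

```python
omega = frozenset(range(6))    # hypothetical sample space {0, ..., 5}
A = frozenset({0, 1, 2})
B = frozenset({2, 3})

def indicator(S):
    """Return the function w -> 1_S(w)."""
    S = frozenset(S)
    return lambda w: 1 if w in S else 0

one_A, one_B = indicator(A), indicator(B)

# Intersection becomes a product, complementation becomes "one minus".
product_rule = all(indicator(A & B)(w) == one_A(w) * one_B(w) for w in omega)
complement_rule = all(indicator(omega - A)(w) == 1 - one_A(w) for w in omega)

print([one_A(w) for w in sorted(omega)])   # [1, 1, 1, 0, 0, 0]
print(product_rule, complement_rule)       # True True
```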

Indicator function in $R$

Remark 1.1. If $x \in R$, then the analogue of an indicator function is the Heaviside function at a point $a \in R$, i.e. $\begin{matrix} (1.10) & H_{a} (x) = H (x - a) = \begin{cases} 0 & x < a \\ 1 & x \geq a \end{cases} . \end{matrix}$ The first derivative of the Heaviside function with respect to $x$, understood in the sense of distributions, is the Dirac delta, i.e. $\begin{matrix} (1.11) & \frac{d}{d x} H_{a} (x) = δ_{a} (x) = δ (x - a) . \end{matrix}$ The Dirac delta is not an ordinary function: informally, it vanishes for every $x \neq a$ and has unit integral, $\int_{- \infty}^{\infty} δ (x - a) d x = 1$. A fundamental property of the Dirac delta is that, for a general function $f$ that is continuous at $a$, $\begin{matrix} (1.12) & \int_{- \infty}^{\infty} f (y) δ (y - a) d y = f (a) . \end{matrix}$
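Property (1.12) can be illustrated numerically by replacing the Dirac delta with a narrow rectangular pulse of unit area, built as a difference quotient of the Heaviside function. This is only a rough sketch: the pulse width, the midpoint rule, and the function names `pulse` and `sift` are assumptions made for illustration, and the result is an approximation of $f (a)$, not an exact value.

```python
import math

def heaviside(x, a=0.0):
    """H_a(x): 0 for x < a, 1 for x >= a (Equation 1.10)."""
    return 1 if x >= a else 0

def pulse(y, a, width):
    """Difference quotient of the Heaviside step: a rectangle of
    height 1/width on [a, a + width), hence of unit area."""
    return (heaviside(y, a) - heaviside(y, a + width)) / width

def sift(f, a, width=1e-3, steps=3000):
    """Midpoint-rule approximation of the integral of f(y) * pulse(y)."""
    lo, hi = a - width, a + 2 * width
    dy = (hi - lo) / steps
    total = 0.0
    for i in range(steps):
        y = lo + (i + 0.5) * dy
        total += f(y) * pulse(y, a, width) * dy
    return total

approx = sift(math.cos, 0.5)
print(abs(approx - math.cos(0.5)) < 1e-3)   # True
```

Shrinking `width` brings the approximation closer to $f (a)$, which is the informal content of Equation (1.12).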

Proposition 1.2 The containment between two sets can be equivalently written in terms of indicator functions: $A \subset B ⟺ 1_{A} (ω) \leq 1_{B} (ω), \forall ω \in Ω .$

Proof: Proposition 1.2

Proof. Let’s start by assuming $A \subset B$ and let’s distinguish two main cases.

Assuming $ω \in A$ implies that $ω \in B$, and therefore one has the equality $1 = 1_{A} \leq 1_{B} = 1$.
Assuming $ω \in A^{c}$, either $ω \in B$ or $ω \in B^{c}$. In both cases one has $1_{A} \leq 1_{B}$, in fact:
- Considering $ω \in B$ implies that $0 = 1_{A} < 1_{B} = 1$ .
- Considering $ω \in B^{c}$ implies that $0 = 1_{A} \leq 1_{B} = 0$ .

Hence, assuming $A \subset B$ implies that $1_{A} (ω) \leq 1_{B} (ω)$ for all $ω \in Ω$. Now let’s assume the converse, $1_{A} \leq 1_{B}$, and let’s again distinguish two main cases:

Assuming $ω \in A$, i.e. $1_{A} = 1$, the inequality $1_{A} \leq 1_{B}$ holds and, since the indicator function is bounded by 1 by definition, it is possible to write $1 = 1_{A} \leq 1_{B} \leq 1$. Therefore, one obtains $1_{B} = 1$ and so $ω \in B$.
Assuming $ω \in A^{c}$ , i.e. $1_{A} = 0$ , the inequality $1_{A} \leq 1_{B}$ holds and it is possible to write $0 = 1_{A} \leq 1_{B} \leq 1$ . Hence, when $ω \in A^{c}$ , there are two possible cases, i.e.
- $1_{B} = 1$ , but this implies that $ω \in B$ .
- $1_{B} = 0$ , but this implies that $ω \in B^{c}$ .

In summary, $ω \in A$ always implies $ω \in B$, whereas $ω \in A^{c}$ places no restriction on whether $ω$ belongs to $B$. Hence, it is possible to conclude that $A \subset B$.
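Proposition 1.2 can also be checked exhaustively on a small sample space. The sketch below compares set containment with the pointwise inequality between indicators for every pair of subsets of a hypothetical four-element $Ω$; it complements the proof rather than replacing it.

```python
from itertools import combinations

omega = frozenset({1, 2, 3, 4})   # hypothetical sample space

def subsets(universe):
    """All subsets of a finite set, as frozensets."""
    items = sorted(universe)
    return [frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)]

def indicator_leq(A, B):
    """True when 1_A(w) <= 1_B(w) for every w in omega."""
    return all(int(w in A) <= int(w in B) for w in omega)

all_sets = subsets(omega)

# A <= B is Python's subset test; it should agree with the indicator test.
equivalence_holds = all(
    (A <= B) == indicator_leq(A, B)
    for A in all_sets for B in all_sets
)
print(equivalence_holds)   # True
```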

1.3 Limits of sets

Let’s define the infimum ($\inf$) and the supremum ($\sup$) of a sequence of sets $\{A_{n}\}_{n \geq 1}$ as $\inf_{k \geq n} A_{k} = ⋂_{k = n}^{\infty} A_{k}, \qquad \sup_{k \geq n} A_{k} = ⋃_{k = n}^{\infty} A_{k} .$ Informally, the infimum is the largest set contained in every $A_{k}$ with $k \geq n$, while the supremum is the smallest set that contains every $A_{k}$ with $k \geq n$.

Then, the $\liminf$ is defined as $\liminf_{n \to \infty} A_{n} = \sup_{n \geq 1} \inf_{k \geq n} A_{k} = ⋃_{n = 1}^{\infty} ⋂_{k = n}^{\infty} A_{k} .$ The liminf (limit inferior of sets) is the set of all elements that belong to $A_{n}$ for all but finitely many $n$. By definition, it is the biggest (union) among all the smallest (intersection) sets. In other words, $x \in \liminf A_{n}$ if and only if there exists some index $N$ such that $x \in A_{n}$ for all $n \geq N$.

Instead, the $\limsup$ is defined as $\limsup_{n \to \infty} A_{n} = \inf_{n \geq 1} \sup_{k \geq n} A_{k} = ⋂_{n = 1}^{\infty} ⋃_{k = n}^{\infty} A_{k} .$ The limsup (limit superior of sets) is the set of all elements that belong to $A_{n}$ for infinitely many $n$. By definition, it is the smallest (intersection) among all the biggest (union) sets. In other words, $x \in \limsup A_{n}$ if and only if $x \in A_{n}$ for infinitely many $n$.

Remark 1.2. By De Morgan’s laws (Equation 1.6), extended to countable unions and intersections: $(\limsup_{n \to \infty} A_{n})^{c} = {(⋂_{n = 1}^{\infty} ⋃_{k = n}^{\infty} A_{k})}^{c} = ⋃_{n = 1}^{\infty} ⋂_{k = n}^{\infty} A_{k}^{c} = \liminf_{n \to \infty} A_{n}^{c},$ and similarly $(\liminf_{n \to \infty} A_{n})^{c} = {(⋃_{n = 1}^{\infty} ⋂_{k = n}^{\infty} A_{k})}^{c} = ⋂_{n = 1}^{\infty} ⋃_{k = n}^{\infty} A_{k}^{c} = \limsup_{n \to \infty} A_{n}^{c} .$
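Infinite unions and intersections cannot be computed directly, but for a sequence that is eventually periodic the tail intersections and tail unions stop changing after the burn-in, so one period is enough. The sketch below relies on that assumption: the sequence `A(n)`, the constants `BURN_IN` and `PERIOD`, and the sample space are all hypothetical choices made for illustration.

```python
omega = frozenset({0, 1, 2})

def A(n):
    """Hypothetical sequence: 0 belongs to A_n only for n <= 3,
    1 belongs to A_n for even n, and 2 belongs to every A_n."""
    s = {2}
    if n <= 3:
        s.add(0)
    if n % 2 == 0:
        s.add(1)
    return frozenset(s)

# Assumed known: A(n + PERIOD) == A(n) for every n >= BURN_IN.
BURN_IN, PERIOD = 4, 2
one_period = [A(n) for n in range(BURN_IN, BURN_IN + PERIOD)]

lim_inf = frozenset.intersection(*one_period)   # in all but finitely many A_n
lim_sup = frozenset.union(*one_period)          # in infinitely many A_n

print(sorted(lim_inf))   # [2]
print(sorted(lim_sup))   # [1, 2]

# Remark 1.2 on this example: (limsup A_n)^c equals liminf of A_n^c.
lim_inf_of_complements = frozenset.intersection(*[omega - s for s in one_period])
print(omega - lim_sup == lim_inf_of_complements)   # True
```

The element 0 appears in finitely many sets only, so it belongs to neither limit; the element 1 appears infinitely often but not eventually always, so it belongs to the limsup only.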

1.4 Monotone Sequences

Let’s define a sequence of sets $\{A_{n}\}_{n \geq 1}$ as monotone non-decreasing if $A_{1} \subset A_{2} \subset \dots$, and as monotone non-increasing if $A_{1} \supset A_{2} \supset \dots$. A monotone non-decreasing sequence is denoted by $A_{n} ↑$, and a non-increasing one by $A_{n} ↓$. The limit of a monotone sequence always exists.

Definition 1.10 ( $Limit of Monotone Sequences$ )
Let’s consider a monotone sequence of sets ${A_{n}}$ . Then, if

$A_{n} ↑$ is a sequence of non-decreasing sets, i.e. $A_{1} \subset A_{2} \subset \dots$, then the limit exists, i.e. $A_{n} ↑ ⟹ \lim_{n \to \infty} A_{n} = ⋃_{n = 1}^{\infty} A_{n} .$
$A_{n} ↓$ is a sequence of non-increasing sets, i.e. $A_{1} \supset A_{2} \supset \dots$, then the limit exists, i.e. $A_{n} ↓ ⟹ \lim_{n \to \infty} A_{n} = ⋂_{n = 1}^{\infty} A_{n} .$
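As an example of a non-decreasing sequence, take the hypothetical intervals $A_{n} = [0, 1 - 1/n]$, whose union is $[0, 1)$. A point belongs to the limit exactly when it belongs to some $A_{n}$. The sketch below searches for the first such index with exact rational arithmetic; the cap `max_n` is an assumption, so a `None` result only means that no index was found up to that cap.

```python
from fractions import Fraction

def in_A(x, n):
    """Membership in the hypothetical set A_n = [0, 1 - 1/n]."""
    return 0 <= x <= 1 - Fraction(1, n)

def first_index(x, max_n=10_000):
    """Smallest n <= max_n with x in A_n, or None if none is found."""
    for n in range(1, max_n + 1):
        if in_A(x, n):
            return n
    return None

print(first_index(Fraction(0)))            # 1
print(first_index(Fraction(999, 1000)))    # 1000
print(first_index(Fraction(1)))            # None
```

Because the sequence is non-decreasing, a point that enters some $A_{n}$ stays in all later sets, which is why the union describes the limit. The point 1 is in no $A_{n}$, consistent with the limit being $[0, 1)$.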

1.5 Fields and $σ$ -fields

Definition 1.11 ( $Field$ )
Let’s define a field $\mathcal{A}$ as a non-empty class of subsets of $Ω$ closed under complementation, finite union and finite intersection. The minimal requirements for a class of subsets to be a field are:

$Ω \in \mathcal{A}$, i.e. the sample space is in $\mathcal{A}$.
$A \in \mathcal{A} ⟹ A^{c} \in \mathcal{A}$, i.e. if a set $A$ is in $\mathcal{A}$, then also its complement is in $\mathcal{A}$.
$A, B \in \mathcal{A} ⟹ A \cup B \in \mathcal{A}$, i.e. if two sets $A$ and $B$ are in $\mathcal{A}$, then also their union is in $\mathcal{A}$. Closure under intersection then follows from De Morgan’s laws (Equation 1.6).

Definition 1.12 ( $σ -field$ )
Let’s define a $σ$-field $\mathcal{B}$ as a non-empty class of subsets of $Ω$ closed under complementation, countable union and countable intersection. The minimal requirements for a class of subsets to be a $σ$-field are:

$Ω \in \mathcal{B}$, i.e. the sample space is in $\mathcal{B}$.
$B \in \mathcal{B} ⟹ B^{c} \in \mathcal{B}$, i.e. if a set $B$ is in $\mathcal{B}$, then also its complement is in $\mathcal{B}$.
$B_{n} \in \mathcal{B}, n \geq 1 ⟹ ⋃_{n \geq 1} B_{n} \in \mathcal{B}$, i.e. if every set of a countable sequence $B_{n}$ is in $\mathcal{B}$, then also the countable union is in $\mathcal{B}$. Closure under countable intersection then follows from De Morgan’s laws (Equation 1.6).

Field vs $σ$-field

The main difference between a field and a $σ$-field is in the third property of the definitions. A field is closed under finite union, namely the union of a finite sequence of events $A_{n}$ indexed by $n \in \{1, 2, \dots, N\}$ (property 3 of Definition 1.11). On the other hand, a $σ$-field is closed under countable union, namely the union of an infinite sequence of events $A_{n}$ indexed by $n \in \{1, 2, \dots\}$ (property 3 of Definition 1.12).
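On a finite sample space there are only finitely many distinct subsets, so any countable union reduces to a finite one and every field is automatically a $σ$-field. Under that assumption, the three requirements of Definition 1.11 can be checked mechanically. The function name `is_field` and the example classes below are hypothetical.

```python
def is_field(cls, omega):
    """Check the three requirements of a field on a finite sample space."""
    omega = frozenset(omega)
    cls = {frozenset(s) for s in cls}
    contains_omega = omega in cls
    closed_complement = all(omega - s in cls for s in cls)
    closed_union = all(s | t in cls for s in cls for t in cls)
    return contains_omega and closed_complement and closed_union

omega = {1, 2, 3, 4}
good = [set(), {1, 2}, {3, 4}, omega]
bad = [set(), {1}, omega]            # the complement {2, 3, 4} is missing

print(is_field(good, omega))   # True
print(is_field(bad, omega))    # False
```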

Exercise 1.1 Let the sample space be $Ω^{'} = \{- 1, 0, 1\}$. Generate a $σ$-algebra according to Definition 1.12.

Solution: Exercise 1.1

Solution 1.1. Let the sample space be $Ω^{'} = \{- 1, 0, 1\}$. A natural choice of $σ$-algebra on a finite set is the power set: $P (Ω^{'}) = \{\emptyset, \{- 1\}, \{0\}, \{1\}, \{- 1, 0\}, \{- 1, 1\}, \{0, 1\}, Ω^{'}\} .$
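The power set above can be enumerated programmatically, and smaller $σ$-algebras can be obtained by closing a few generating sets under complement and union. The sketch below assumes a finite sample space, where this closure loop terminates; the function names `power_set` and `generate` are illustrative.

```python
from itertools import combinations

omega = frozenset({-1, 0, 1})

def power_set(universe):
    """All subsets of a finite set, as a set of frozensets."""
    items = sorted(universe)
    return {frozenset(c)
            for r in range(len(items) + 1)
            for c in combinations(items, r)}

def generate(generators, universe):
    """Smallest class containing the generators and the universe that is
    closed under complement and pairwise union (finite universe assumed)."""
    universe = frozenset(universe)
    cls = {universe} | {frozenset(g) for g in generators}
    while True:
        new = {universe - s for s in cls} | {s | t for s in cls for t in cls}
        if new <= cls:          # nothing left to add: the class is closed
            return cls
        cls |= new

print(len(power_set(omega)))                             # 8
print(sorted(sorted(s) for s in generate([{0}], omega)))
# [[], [-1, 0, 1], [-1, 1], [0]]
print(generate([{-1}, {0}], omega) == power_set(omega))  # True
```

The class generated by $\{0\}$ alone has four sets, which shows that the power set is the largest but not the only $σ$-algebra on $Ω^{'}$.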