Difference between revisions of "Probability review for warm up"

Revision as of 17:36, 4 February 2022

Probability Review

In this section, let's go through a quick review about probability theory. The best way to refresh ourselves is to read a few notes and go straight to problem exercises. From your EEE 137 we can summarize probability as a mathematical term which we use to investigate properties of mathematical models of chance phenomena. It is also a generalized notion of weights whereby we weigh events to see how likely they are to occur. In most cases probability is dependent on the relative frequency of events, while some look at fractions based on sets. Let's review some important properties and definitions then proceed immediately to practice exercises.

Basic Properties of Probability

Suppose we have events $A$ and $B$ which are subsets of a sample space $S$ (i.e., $A,B\subset S$ ). Let $P(A)$ be the probability that event $A$ happens, and $P(B)$ be the probability that event $B$ happens. Then some of the basic properties follow:

$0\leq P(A)\leq 1$ and $0\leq P(B)\leq 1$
$P(S)=1$ and $P(\varnothing )=0$ . The $\varnothing$ is the null or empty set.
$P(A\cap B)=P(A)+P(B)$ if and only if $A$ and $B$ are disjoint.
$P(A-B)=P(A)-P(B)$ if $B\subseteq A$ and vice versa.
$P({\bar {A}})=1-P(A)$ and $P({\bar {B}})=1-P(B)$
$P(A)\leq P(B)$ whenever $A\subseteq B$ and vice versa.
$P(A\cup B)=P(A)+P(B)-P(A\cap B)$ . This can be extended to $N$ sets or variables. This one is left as an exercise for you.
Let's say $C=\{E_{1},E_{2},E_{3}...,E_{n}\}$ is a partition of $S$ then $\sum _{j=1}^{n}P(E_{j})=1$ .

Principle of Symmetry

Let $S$ be a finite sample space with outcomes or events $\{E_{1},E_{2},E_{3},...,E_{n}\}$ which all $E_{i}$ are physically identical or objects having the same properties and characteristics. In this case we have:

$P(E_{1})=P(E_{2})=P(E_{3})=...=P(E_{n})$ .

Subjective Probabilities

These are often expressed in terms of odds. For example, suppose a betting site is offering odds of $x$ to $y$ on Team Secret beating Team TSM. This means out of the total $x+y$ equally valued coins, the better is willing to bet $x$ of them that Team Secret beats Team TSM. So if the outcome $s$ is the event that Team Secret beats Team TSM then we have:

$P(s)={\frac {x}{x+y}}$

Relative Frequency

Suppose we are monitoring a particular outcome $x$ and we observe that $x$ occurs in $r$ out of $n$ experiments (or repetitions). We define the relative frequency of $x$ based on $n$ experiments as:

$P(x)={\frac {r}{n}}$

Conditional Probability

Figure 1: Simple Venn diagram showing the fractional parts for computing

P(A|B)

In a nutshell, conditional probability is to gain in information about an event that leads to a change in its probability. Suppose an event $A$ happens with probability $P(A)$ . However, when event $B$ happens, this influences the probability of $A$ such that we have $P(A|B)$ as the conditional probability of $A$ happening given $B$ occurred. Mathematically we know this as:

$P(A|B)={\frac {P(A\cap B)}{P(B)}}$

Figure 1 shows a Venn diagram that can visualize this. The $P(A|B)$ is the relative probability of event $A$ happening with respect to event $B$ happening. Therefore, it justifies that we first need to get $P(A\cap B)$ then dividing this by $P(B)$ . It's just a matter of shifting the scope of event $A$ happening with respect to the entire space $S$ to the scope of event $A$ happening with respect to the space of event $B$ . We can rearrange the equation to get:

$P(A\cap B)=P(A|B)P(B)$

We can extend this concept to three or more sets. For example, if $A$ , $B$ , and $C$ are some events in a sample space, then we can compute the intersection of all events as:

$P(A\cap B\cap C)=P(C|A\cap B)P(B|A)P(A)$

Figure 2: Simple partition example. The entire space

S

is cut into different

E_{i}

parts. The event

A

is a subset of the space and it constitutes different fractions of

E_{i}

components.

Here's another interesting formula: Suppose we have a partition set $Y=\{E_{1},E_{2},E_{3},...E_{n}\}$ of some sample space $S$ with each $P(E_{i})\neq 0$ . In other words, we just cut the sample space $S$ into several partitions. Let event $A\ \epsilon \ Y$ . In other words, event $A$ is just a part of the partition $Y$ . We have:

$P(A)=\sum _{i=1}^{n}P(A|E_{i})P(E_{i})$

Figure 2 visualizes this formula. The entire space is $S$ and we cut it into several $E_{i}$ components. Suppose event $A$ is also part of the entire space $S$ , then it's simply the sum of all $P(E_{i}\cap A)=P(A|E_{i})P(E_{i})$ components. You might wonder why the summation is from $i=1$ up to $i=n$ when we could just get the probabilities where it only matters? You are correct to think that we only need to get those that have a contribution to $P(A)$ but the equation is generalized to include all partitions. For example, $P(E_{6}\cap A)=0$ because event $E_{6}$ and event $A$ can never happen. So it's okay to generalize the formula to include all events even when their intersections don't happen at all.

@@ Line 35: / Line 35: @@
 === Conditional Probability ===
+[[File:A given b.PNG|thumb|right|400px| Figure 1: Simple Venn diagram showing the fractional parts for computing <math> P(A|B) </math>]]
 In a nutshell, ''conditional probability'' is to gain in information about an event that leads to a change in its probability. Suppose an event <math> A </math> happens with probability <math> P(A) </math>. However, when event <math> B </math> happens, this influences the probability of <math> A </math> such that we have <math> P(A|B) </math> as the conditional probability of <math> A </math> happening given <math> B </math> occurred. Mathematically we know this as:
@@ Line 40: / Line 42: @@
 <math> P(A|B) = \frac{P(A \cap B)}{P(B)} </math>
-There are some interesting formulas to take note of. For example, if <math> A </math>, <math> B </math>, and <math> C </math> are some events in a sample space, then we can compute the intersection of all events as:
+Figure 1 shows a Venn diagram that can visualize this. The <math> P(A|B) </math> is the relative probability of event <math> A </math> happening with respect to event <math> B </math> happening. Therefore, it justifies that we first need to get <math> P(A \cap B) </math> then dividing this by <math> P(B) </math>. It's just a matter of shifting the scope of event <math> A </math> happening with respect to the entire space <math> S </math> to the scope of event <math> A </math> happening with respect to the space of event <math> B </math>. We can rearrange the equation to get:
+<math> P(A \cap B) = P(A|B)P(B) </math>
+We can extend this concept to three or more sets. For example, if <math> A </math>, <math> B </math>, and <math> C </math> are some events in a sample space, then we can compute the intersection of all events as:
 <math> P(A \cap B \cap C) = P(C| A \cap B)P(B | A) P(A) </math>
-[[File:Partition set.PNG|thumb|right|400px|Figure 1: A sample space S cut into several E partitions. Blue is event A, red is event B, green is event C, gray is event D, and orange is event E. ]]
+[[File:Simple partition.PNG|400px|thumb|right|Figure 2: Simple partition example. The entire space <math> S </math> is cut into different <math> E_i </math> parts. The event <math> A </math> is a subset of the space and it constitutes different fractions of <math> E_i </math> components.]]
-The pattern carries over and you could generalize this. Here's another interesting formula: Suppose we have a partition set <math> Y = \{E_1, E_2, E_3, ... E_n \} </math> of some sample space <math> S </math> with each <math> P(E_i) \neq 0 </math>. In other words, we just cut the sample space <math> S </math> into several partitions. Let event <math> A \ \epsilon \ Y </math>. In other words, event <math> A </math> is just a part of the partition <math> Y </math>. We have:
-<math> P(A) = \sum_{i=1}^n P_{E_i}(A)P(E_i) </math>
+Here's another interesting formula: Suppose we have a partition set <math> Y = \{E_1, E_2, E_3, ... E_n \} </math> of some sample space <math> S </math> with each <math> P(E_i) \neq 0 </math>. In other words, we just cut the sample space <math> S </math> into several partitions. Let event <math> A \ \epsilon \ Y </math>. In other words, event <math> A </math> is just a part of the partition <math> Y </math>. We have:
-Figure 1 visualizes this formula. The entire space is <math> S </math> and we cut it into several <math> E_i </math> components. The larger color-coded chunks are different events. Suppose the blue chunk is event <math> A </math>.
+<math> P(A) = \sum_{i=1}^n P(A|E_i)P(E_i) </math>
+Figure 2 visualizes this formula. The entire space is <math> S </math> and we cut it into several <math> E_i </math> components. Suppose event <math> A </math> is also part of the entire space <math> S </math>, then it's simply the sum of all <math> P(E_i \cap A) = P(A|E_i)P(E_i) </math> components. You might wonder why the summation is from <math> i = 1 </math> up to <math> i=n </math> when we could just get the probabilities where it only matters? You are correct to think that we only need to get those that have a contribution to <math> P(A) </math> but the equation is generalized to include all partitions. For example, <math> P(E_6 \cap A) = 0 </math> because event <math> E_6 </math> and event <math> A </math> can never happen. So it's okay to generalize the formula to include all events even when their intersections don't happen at all.
 == Random Variables ==
 == Exercises ==

Difference between revisions of "Probability review for warm up"

Revision as of 17:36, 4 February 2022

Contents

Probability Review

Basic Properties of Probability

Principle of Symmetry

Subjective Probabilities

Relative Frequency

Conditional Probability

Random Variables

Exercises

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools