conditional probability and Bayes' rule

conditional probability

probability questions are asked relative to some universe of possibilities, the sample space $Ω$ , the universe—i like cosmos. the probability of an event $A$ , $P (A)$ , is thus a fraction of the universe. in $P (A ∣ B)$ , read as $the probability of A given B$ , $B$ becomes the universe relative to which the part of $A$ that lives inside of $B$ gets compared, thus conditional probability is defined

P (A ∣ B) = \frac{P ( A \cap B )}{P ( B )}

or, in other words

select sum(case when condition_a = 'A' then 1 else 0 end) / count(*) 
from omega
where condition_b = 'B'

disjoint and independent events

two disjoint events have no intersection $(A \cap B) = \emptyset$ , and hence, from the formula above $P (A ∣ B) = 0$ . if $B$ happens, $A$ doesn’t.

whereas $A$ and $B$ are independent when shrinking the universe to $B$ doesn’t change the probability of $A$ , i.e. $P (A ∣ B) = P (A)$ .

it’s worth noting that two disjoint events are maximally dependent! knowing that $A$ happened tells everything about $B$ , when shrinking the universe to $B$ , $A$ vanishes completely.

bayes’ rule

from

P (A ∣ B) = \frac{P ( A \cap B )}{P ( B )}

also the probability of $B$ given $A$ is

P (B ∣ A) = \frac{P ( A \cap B )}{P ( A )}

hence the intersection of $A$ and $B$ can be expressed by both

P (A \cap B) = P (A ∣ B) \cdot P (B)

and

P (A \cap B) = P (B ∣ A) \cdot P (A)

that is to say that knowing $P (A ∣ B)$ makes it possible to know $P (B ∣ A)$

P (B ∣ A) = \frac{P ( A ∣ B ) \cdot P ( B )}{P ( A )}

practical bayes

two events $D$ , a disease and a positive test $Positive$ . $D$ prevalence is $1 in 10000$ , test accuracy is $80 in 100$ , and false positive rate $5 in 100$ .

lay out the events probabilities

$P (D) = 0.0001$
$P (Positive ∣ D) = 0.8$
$P (Positive ∣\neg D) = 0.05$
$P (\neg Positive ∣\neg D) = 0.95$

remembering that, from rearranging the conditional probability formula

P (A \cap B) = P (B ∣ A) \cdot P (A)

and

P (A \cap B) = P (A ∣ B) \cdot P (B)

what is the probability $P (D ∣ Positive)$ of actually having the disease upon a positive test?

what is the universe here? the set of all tested subjects.

think of the tested subject, they have a chance of $1 in 10000$ of having the disease and if they do, they have a $0.8$ chance of testing positive: $0.8 \cdot 0.0001 = 0.00008$ : this is

P (Positive ∣ D) \cdot P (D)

at the same time, they have $9999 in 10000$ chances of not having the disease and if they don’t, they have $0.05$ chances of testing nonetheless positive: $0.05 \cdot 0.9999 = 0.049995$ and this is

P (Positive ∣\neg D) \cdot P (\neg D)

but what are these? from the conditional probability formula, they are intersections! the intersections of the probability of testing positive with the probabilities of both having $P (D)$

P (Positive \cap D) = P (Positive ∣ D) \cdot P (D)

and not having $P (\neg D)$

P (Positive \cap \neg D) = P (Positive ∣\neg D) \cdot P (\neg D)

the disease.

what is the interesting question? it’s what is the probability of having the disease upon having tested positive, and the chance of a positive outcome, whether having the disease or not, is

P (Positive \cap D) + P (Positive \cap \neg D)

representing the microcosm of all positive tests, hence

P (D ∣ Positive) = \frac{P ( Positive ∣ D ) \cdot P ( D )}{P ( Positive ∣ D ) \cdot P ( D ) + P ( Positive ∣\neg D ) \cdot P ( \neg D )}

or, intuitively

P (D ∣ Positive) = \frac{P ( Positive \cap D )}{P ( Positive \cap D ) + P ( Positive \cap \neg D )}

the tested subject, for the record, has a $\approx 0.16%$ probability of having the disease after a positive test.

tirellic.io

Explorer

conditional probability and Bayes' rule

conditional probability

disjoint and independent events

bayes’ rule

practical bayes

Graph View

Table of Contents

Backlinks