Statistical Methods

Table of Contents

Probability intro

sample space: set of all possible outcomes

event: collection of outcomes (capital letters)

probability measure: value between 0 and 1

Determining probability

  1. Estimate with relative frequency: $\begin{aligned} P(A) &= \frac{\text{number of occurrences of A}}{\text{number of times procedure was repeated}} \\ &= \frac{\text{successes}}{\text{total number of tries}} \end{aligned}$

  2. Theoretical approach: make a probability model

  3. Subjective approach: estimate P(A) based on intuition/experience

Finding P(A) for discrete case:

  1. Find sample space Ω

  2. Determine probabilities P(ω) for all ω ∈ Ω

    • if all equally likely, then P(ω) = 1/N where N is number of outcomes in Ω
  3. Determine which outcomes are in A

  4. Compute P(A) by

$P(A) = \sum_{\omega :; \omega \in A} P(\omega)$

Probability rules:

“At least one”: P(at least one) = 1 - P(none)

Addition rule (A and B): $P(A \cup B) = P(A) + P(B) - P(A \cap B)$

Complement (not A):

Conditional probability (B given A):

Disjoint events (mutually exclusive):

Independent events:

Bayes theorem

Forget that complicated-ass formula. You literally never need to use it. For example, given these values:

You need to calculate $P(A|X)$. Use conditional probability and do some rewriting:

$\begin{aligned} P(A|X) &= \frac{P(A \cap X)}{P(X)}\\ P(X) &= P(A \cap X) + P(\bar{A} \cap X)\\ \therefore P(A|X) &= \frac{P(A \cap X)}{P(A \cap X) + P(\bar{A} \cap X)} \\ P(A \cap X) &= P(X \cap A) \\ &= P(X|A) \times P(A) \\ \therefore P(A|X) &= \frac{P(X|A) \times P(A)}{P(X|A) \times P(A) + P(X|\bar{A}) \times P(\bar{A})} \\ &= \frac{0.9 \times 0.01}{0.9 \times 0.01 + 0.08 \times 0.99} \\ &= 0.1020408163 \approx 0.1 \end{aligned}$