Descriptive statistics
Mean (population)
Average of all population values.
Mean (sample)
Sample average.
Variance (population)
Spread squared, divides by N.
Variance (sample)
Bessel's correction: divide by .
Standard deviation
Square root of variance — same units as data.
Range
Simplest spread measure.
Probability rules
Addition rule
Probability of A or B (inclusion-exclusion).
Multiplication rule
Probability of A and B; reduces to product when independent.
Conditional probability
Probability of B given A occurred.
Bayes' theorem
Reverse conditional probabilities — diagnostic tests, machine learning.
Independence
Holds iff and are independent.
Counting
Permutations
Order matters: arrange from .
Combinations
Order doesn't matter: choose from .
Discrete distributions
Binomial PMF
successes in independent trials with success prob .
Binomial mean
Expected number of successes.
Binomial variance
Spread of the binomial.
Poisson PMF
Rare-event count with mean rate .
Normal distribution
Bell curve, mean , std .
Z-score
Standardise to compare across distributions.
Standard normal
After z-score transformation.
68-95-99.7 rule
For — only valid for normal data.
Inferential statistics
Standard error of mean
Standard deviation of as estimator.
Confidence interval (mean, known $\sigma$)
for 95% CI.
t-statistic (one sample)
Test mean = when unknown.
Chi-square statistic
Goodness-of-fit / independence test for categorical data.
Linear regression
Slope
Best-fit slope (least squares).
Intercept
Forces line through .
Pearson correlation
Strength + direction of linear relation, .
Coefficient of determination
Fraction of variance in explained by .