Hypothesis testing is a framework for using sample data to decide between two competing claims about a population:
- Null hypothesis : the default / "nothing interesting" claim (e.g. coin is fair, drug has no effect).
- Alternative : what we suspect / want to demonstrate.
Procedure:
- State and .
- Choose a significance level (commonly 0.05) — the probability of false rejection (Type I error).
- Compute a test statistic from the data (z-score, t-statistic, chi-square, F-ratio).
- Compute the p-value — probability under of seeing data at least as extreme.
- Decide: if , reject ; otherwise fail to reject.
Two error types:
- Type I: rejecting a true (probability ).
- Type II: failing to reject a false (probability ); is the power.
Common confusion: "fail to reject" ≠ "accept ". Absence of evidence is not evidence of absence — small sample sizes can hide real effects.
This framework underlies clinical trials, A/B tests, quality control, and most published "statistical significance" claims.