The chi-square ($\chi^2$) test is a standard tool for testing hypotheses about categorical count data. The test statistic is:

$\chi^2 = \sum_i \frac{(O_i - E_i)^2}{E_i}$

where $O_i$ are the observed counts and $E_i$ are the counts expected under the null hypothesis $H_0$.
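
As a minimal sketch of the formula above, the statistic can be computed directly from two lists of counts. The function name `chi_square_statistic` and the die-roll counts are hypothetical and chosen only for illustration.

```python
def chi_square_statistic(observed, expected):
    """Return the sum over categories of (O_i - E_i)^2 / E_i."""
    if len(observed) != len(expected):
        raise ValueError("observed and expected must have the same length")
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Hypothetical data: 60 rolls of a die.
# Under H0 (fair die) each of the 6 faces is expected 60 / 6 = 10 times.
observed = [8, 12, 9, 11, 10, 10]
expected = [10] * 6

stat = chi_square_statistic(observed, expected)
print(f"chi-square = {stat:.3f}")  # chi-square = 1.000
```

Here the squared deviations are 4, 4, 1, 1, 0, 0, each divided by 10, which gives a statistic of 1.0.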

Three common variants:

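Goodness-of-fit: does the observed distribution match a theoretical one? (Is a die fair?) $df = k - 1$ for $k$ categories.
Independence: are two categorical variables independent? (Is gender independent of voting preference?) $df = (r-1)(c-1)$ for an $r \times c$ contingency table.
Variance test: does the variance of a normally distributed population equal a specified value $\sigma_0^2$? The statistic $(n-1)s^2/\sigma_0^2$ has $df = n - 1$. This variant is less common and applies to continuous rather than categorical data.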
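
The independence variant can be sketched as follows: the expected count in each cell is (row total × column total) / grand total, and the statistic and degrees of freedom follow from the formulas above. The function name `independence_test` and the 2 × 2 table are hypothetical, and this sketch applies no continuity correction.

```python
def independence_test(table):
    """Chi-square test of independence for an r x c table of counts.

    Returns (statistic, degrees_of_freedom, expected_counts).
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)

    # Expected count under independence: row total * column total / n.
    expected = [[r * c / n for c in col_totals] for r in row_totals]

    stat = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
    )
    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    return stat, df, expected


# Hypothetical 2 x 2 table: rows are two groups, columns are two preferences.
table = [[30, 20],
         [20, 30]]

stat, df, expected = independence_test(table)
print(f"chi-square = {stat:.3f}, df = {df}")  # chi-square = 4.000, df = 1

# The 5% critical value of the chi-square distribution with df = 1 is about 3.841.
CRITICAL_VALUE_DF1_ALPHA_05 = 3.841
reject_h0 = stat > CRITICAL_VALUE_DF1_ALPHA_05
```

With every expected count equal to 25, each cell contributes 25 / 25 = 1, so the statistic is 4.0, which exceeds 3.841 and would lead to rejecting independence at the 5% level.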

Assumption: the chi-square approximation requires sufficiently large expected counts (a common rule of thumb is $E_i \geq 5$ in every cell). When expected counts are small, use an exact test such as Fisher's exact test instead.
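
A rough sketch of how this check might look for a 2 × 2 table: compute the smallest expected count and, if it falls below 5, fall back to a two-sided Fisher's exact test built from the hypergeometric distribution. The helper names `min_expected_count` and `fisher_exact_2x2` and the example table are hypothetical. The two-sided p-value sums the probabilities of all tables that are no more likely than the observed one, which is one common convention.

```python
from math import comb


def min_expected_count(table):
    """Smallest expected cell count under independence."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    return min(r * c / n for r in row_totals for c in col_totals)


def fisher_exact_2x2(a, b, c, d):
    """Two-sided Fisher's exact p-value for the table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    n = row1 + row2

    def prob(x):
        # Hypergeometric probability that the top-left cell equals x,
        # with all row and column totals held fixed.
        return comb(row1, x) * comb(row2, col1 - x) / comb(n, col1)

    lowest, highest = max(0, col1 - row2), min(row1, col1)
    p_observed = prob(a)
    # Small relative tolerance guards against floating-point ties.
    return sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-9)
    )


# Hypothetical small sample: every expected count is 2, well below 5.
table = [[3, 1],
         [1, 3]]

smallest = min_expected_count(table)
use_exact_test = smallest < 5
p_value = fisher_exact_2x2(3, 1, 1, 3)
print(f"min expected = {smallest}, Fisher p = {p_value:.4f}")
# min expected = 2.0, Fisher p = 0.4857
```

For this table the possible top-left values 0 through 4 have probabilities 1/70, 16/70, 36/70, 16/70, and 1/70, so the two-sided p-value is 34/70.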

The chi-square distribution with $k$ degrees of freedom is the distribution of a sum of $k$ squared independent standard normal variables. Its upper-tail quantiles supply the critical values for the test.
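
The link between squared standard normals and critical values can be illustrated with a small simulation. This is only a sketch: the sample size, the seed, and the choice of $k = 5$ are arbitrary assumptions, and the results are Monte Carlo estimates rather than exact values.

```python
import random

random.seed(0)  # arbitrary seed, only for reproducibility

k = 5            # degrees of freedom (number of squared normals summed)
n_draws = 20000  # number of simulated chi-square values

# Each draw is a sum of k squared independent standard normal variables.
draws = sorted(
    sum(random.gauss(0.0, 1.0) ** 2 for _ in range(k))
    for _ in range(n_draws)
)

sample_mean = sum(draws) / n_draws
# Empirical 95th percentile: an estimate of the 5% critical value.
q95 = draws[int(0.95 * n_draws)]

print(f"mean = {sample_mean:.2f} (theory: {k})")
print(f"95th percentile = {q95:.2f} (tabulated critical value: 11.07)")
```

The simulated mean should land close to $k$, the mean of a chi-square distribution with $k$ degrees of freedom, and the empirical 95th percentile should land close to the tabulated critical value of about 11.07 for $df = 5$.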
