Correlation measures the strength and direction of the linear relationship between two variables and . The Pearson correlation coefficient:
Interpretation:
- : perfect positive linear relationship.
- : perfect negative linear relationship.
- : no linear relationship (but possibly a non-linear one!).
- : strong; : moderate; : weak.
Crucial caveats:
- Correlation is not causation. Ice cream sales correlate with drowning deaths — both driven by hot weather.
- Sensitive to outliers. A single extreme point can flip .
- Linear only. A perfect quadratic relationship has around symmetric data.
For ranked / non-linear monotonic relationships, use Spearman's . For categorical association, use chi-square or Cramér's V.