Hypothesis Test Calculator

Perform z-tests, t-tests, and two-sample tests with step-by-step solutions and p-values

拖拽或点击上传图片或 PDF

∑Math Input

One-sample t-test: n=25, x_bar=52, sample sd=8, mu_0=50, alpha=0.05

One-sample z-test: n=100, x_bar=48, sigma=10, mu_0=50, alpha=0.05

Two-sample t-test: n1=30, x1=75, s1=6, n2=35, x2=71, s2=7

Test whether proportion p=0.48 differs from p_0=0.5, n=200, alpha=0.05

What is Hypothesis Testing?

Hypothesis testing is a formal statistical procedure for deciding whether sample data provide sufficient evidence to reject a claim about a population parameter.

The Two Hypotheses

Null hypothesis $H_0$ : the default claim — assumes no effect, no difference, or a specific parameter value (e.g., $\mu = 50$ ).
Alternative hypothesis $H_a$ (or $H_1$ ): the claim you want to support — can be two-sided ( $\neq$ ), left-tailed ( $<$ ), or right-tailed ( $>$ ).

The Logic

Assume $H_0$ is true. Compute how extreme the sample result is if $H_0$ were true — this probability is the p-value. A very small p-value means the data would be highly unlikely under $H_0$ , so we reject $H_0$ in favor of $H_a$ .

Significance Level $\alpha$

$\alpha$ is the threshold for rejection. The most common choices are $\alpha = 0.05$ (5%) and $\alpha = 0.01$ (1%). If $p\text{-value} < \alpha$ , you reject $H_0$ .

Type I and Type II Errors

Decision	$H_0$ is true	$H_0$ is false
Reject $H_0$	Type I error (false positive), prob. $= \alpha$	Correct (power $= 1 - \beta$ )
Fail to reject $H_0$	Correct (prob. $= 1 - \alpha$ )	Type II error (false negative), prob. $= \beta$

Common Hypothesis Tests

One-Sample Z-Test (known $\sigma$ )

Tests whether the population mean equals a specified value when the population standard deviation $\sigma$ is known:

$z = \frac{\bar{x} - \mu_0}{\sigma / \sqrt{n}}$

Compare $z$ to a standard normal critical value $z^*$ (e.g., $\pm 1.96$ for two-sided $\alpha = 0.05$ ).

One-Sample T-Test (unknown $\sigma$ )

The most common test in practice — uses the sample standard deviation $s$ :

$t = \frac{\bar{x} - \mu_0}{s / \sqrt{n}}, \quad df = n - 1$

Compare $t$ to a t-distribution critical value. For large $n$ , the t-distribution approaches the standard normal.

Two-Sample T-Test

Tests whether two independent population means are equal:

$t = \frac{\bar{x}_1 - \bar{x}_2}{\sqrt{\dfrac{s_1^2}{n_1} + \dfrac{s_2^2}{n_2}}}$

Degrees of freedom are estimated with the Welch–Satterthwaite approximation when $\sigma_1 \neq \sigma_2$ is not assumed.

Z-Test for a Proportion

Tests whether a population proportion equals a specified value $p_0$ :

$z = \frac{\hat{p} - p_0}{\sqrt{p_0(1 - p_0)/n}}$

Valid when $np_0 \geq 10$ and $n(1 - p_0) \geq 10$ .

Critical Values Quick Reference

Test type	$\alpha = 0.05$	$\alpha = 0.01$
Two-sided z	$\pm 1.96$	$\pm 2.576$
Right-tailed z	$1.645$	$2.326$
Left-tailed z	$-1.645$	$-2.326$

Step-by-Step Hypothesis Testing Procedure

Follow these five steps for any hypothesis test:

State the hypotheses: Write $H_0$ and $H_a$ in terms of the population parameter. Identify the tail direction (two-sided, left, or right).
Choose the test and check conditions: Pick the appropriate test statistic (z or t). Verify sample size conditions (normality, independence).
Compute the test statistic: Plug in the sample values.
Find the p-value: Using the test statistic and the sampling distribution, compute the probability of observing a result at least as extreme as yours under $H_0$ . For two-sided tests, double the one-tail area.
State the conclusion: If $p\text{-value} < \alpha$ , reject $H_0$ — the data provide statistically significant evidence for $H_a$ . Otherwise, fail to reject $H_0$ (this is NOT the same as accepting $H_0$ ).

示例题目

Step 1: Hypotheses:

H_0: \mu = 50

H_a: \mu > 50

(right-tailed),

\alpha = 0.05

Step 2: Test statistic:

t = \dfrac{52 - 50}{8/\sqrt{25}} = \dfrac{2}{1.6} = 1.25

df = 24

Step 3: Critical value:

t^*_{24} = 1.711

(one-tailed,

\alpha = 0.05

)

Step 4: p-value:

P(T_{24} > 1.25) \approx 0.112

Step 5: Decision:

0.112 > 0.05

— fail to reject

H_0

Answer: Fail to reject

H_0

. Insufficient evidence to conclude

\mu > 50

at the 5% significance level (

p \approx 0.112

Step 1: Conditions:

np_0 = 200(0.5) = 100 \geq 10

✓

Step 2: Test statistic:

z = \dfrac{0.48 - 0.5}{\sqrt{0.5 \cdot 0.5 / 200}} = \dfrac{-0.02}{0.0354} \approx -0.566

Step 3: Two-sided p-value:

p = 2 \times P(Z < -0.566) \approx 2(0.2858) = 0.572

Step 4: Critical value:

\pm 1.96

;

|{-0.566}| < 1.96

Step 5: Decision:

0.572 > 0.05

— fail to reject

H_0

Answer: Fail to reject

H_0

. The data do not provide significant evidence that the true proportion differs from 0.5 (

p \approx 0.572

Step 1: Hypotheses:

H_0: \mu_1 = \mu_2

H_a: \mu_1 \neq \mu_2

(two-sided)

Step 2: Standard error:

\sqrt{6^2/30 + 7^2/35} = \sqrt{1.2 + 1.4} = \sqrt{2.6} \approx 1.612

Step 3: Test statistic:

t = (75 - 71)/1.612 \approx 2.481

Step 4: Degrees of freedom (Welch):

df \approx 62

Step 5: p-value (two-sided):

p \approx 2 \times P(T_{62} > 2.481) \approx 0.016

Step 6: Decision:

0.016 < 0.05

— reject

H_0

Answer: Reject

H_0

. There is significant evidence that the two group means differ (

p \approx 0.016

常见问题

A result is statistically significant when the p-value is below the chosen significance level α. It means the observed result would be unlikely to occur by chance alone if the null hypothesis were true — it does NOT measure the practical importance or size of the effect.

A two-tailed test checks for differences in either direction (H_a: μ ≠ μ_0) and splits α across both tails. A one-tailed test is directional (H_a: μ > μ_0 or H_a: μ < μ_0) and puts all of α in one tail. Use a one-tailed test only when you have a strong a priori reason to expect a particular direction.

The p-value is the probability of observing a test statistic at least as extreme as the one computed, assuming H_0 is true. A small p-value means the observed data are inconsistent with H_0. It is NOT the probability that H_0 is true.

Use a z-test when the population standard deviation σ is known. Use a t-test (far more common) when σ is unknown and you estimate it with the sample standard deviation s. For large samples (n ≥ 30), the distinction matters less because the t-distribution closely approximates the normal.

Hypothesis Test Calculator

Perform z-tests, t-tests, and two-sample tests with step-by-step solutions and p-values

What is Hypothesis Testing?

The Two Hypotheses

The Logic

Significance Level $\alpha$

Type I and Type II Errors

Common Hypothesis Tests

One-Sample Z-Test (known $\sigma$ )

One-Sample T-Test (unknown $\sigma$ )

Two-Sample T-Test

Z-Test for a Proportion

Critical Values Quick Reference

Step-by-Step Hypothesis Testing Procedure

示例题目

常见问题

What does 'statistically significant' mean?

What is the difference between a one-tailed and two-tailed test?

What is the p-value exactly?

When do I use a z-test vs a t-test?

相关学习指南

免费试用 AI-Math

Hypothesis Test Calculator

Perform z-tests, t-tests, and two-sample tests with step-by-step solutions and p-values

What is Hypothesis Testing?

The Two Hypotheses

The Logic

Significance Level α\alphaα

Type I and Type II Errors

Common Hypothesis Tests

One-Sample Z-Test (known σ\sigmaσ)

One-Sample T-Test (unknown σ\sigmaσ)

Two-Sample T-Test

Z-Test for a Proportion

Critical Values Quick Reference

Step-by-Step Hypothesis Testing Procedure

示例题目

Problem: One−samplet−test:asampleofOne-sample t-test: a sample of One−samplet−test:asampleofn=25givesgivesgives\bar{x}=52,, ,s=8.Test. Test .TestH_0: \mu=50vsvsvsH_a: \mu > 50atatat\alpha = 0.05...

Problem: Z−testforproportion:ofZ-test for proportion: of Z−testforproportion:ofn=200voters,voters,voters,\hat{p}=0.48favorapolicy.Testfavor a policy. Testfavorapolicy.TestH_0: p=0.5vsvsvsH_a: p \neq 0.5atatat\alpha = 0.05...

Problem: Two−samplet−test:group1(Two-sample t-test: group 1 (Two−samplet−test:group1(n_1=30,, ,\bar{x}_1=75,, ,s_1=6)vsgroup2() vs group 2 ()vsgroup2(n_2=35,, ,\bar{x}_2=71,, ,s_2=7).Test). Test ).TestH_0: \mu_1=\mu_2atatat\alpha=0.05...

常见问题

What does 'statistically significant' mean?

What does 'statistically significant' mean?

What is the difference between a one-tailed and two-tailed test?

What is the difference between a one-tailed and two-tailed test?

What is the p-value exactly?

What is the p-value exactly?

When do I use a z-test vs a t-test?

When do I use a z-test vs a t-test?

相关学习指南

免费试用 AI-Math

Significance Level $\alpha$

One-Sample Z-Test (known $\sigma$ )

One-Sample T-Test (unknown $\sigma$ )