Null Hypothesis Significance Testing

Review: setting up and running a significance test

Notes.

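A test can be specified by a rejection region instead of a significance level.
The null hypothesis can be thought of as the 'cautious hypothesis': the lower the significance level, the more evidence is needed to reject it.
Key point of confusion:
Wrong: significance level = 0.05 means the test makes a mistake 5% of the time.
Right: significance level = 0.05 means the probability of rejecting H0 when H0 is true is 5%.
!! power of the test = probability of rejecting H0 when HA is true
-> probability of failing to reject H0 when HA is true = 1 - power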
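
The distinction between significance level and power can be checked numerically. The following is a minimal sketch, assuming a one-sided z-test of H0: mu = mu0 against HA: mu > mu0 with known sigma. The function name `z_test_error_rates` and the values mu0 = 0, mu_alt = 1, sigma = 2, n = 25 are illustrative choices, not taken from these notes.

```python
from math import sqrt
from statistics import NormalDist


def z_test_error_rates(mu0, mu_alt, sigma, n, alpha=0.05):
    """Return (P(reject H0 | H0 true), power) for a one-sided z-test.

    Assumes H0: mu = mu0, HA: mu > mu0, and one specific alternative mu_alt.
    """
    se = sigma / sqrt(n)  # standard deviation of the sample mean
    # Rejection region: reject H0 when the sample mean exceeds this cutoff.
    cutoff = NormalDist(mu0, se).inv_cdf(1 - alpha)
    type1 = 1 - NormalDist(mu0, se).cdf(cutoff)     # P(reject | H0 true)
    power = 1 - NormalDist(mu_alt, se).cdf(cutoff)  # P(reject | HA true)
    return type1, power


# Hypothetical numbers, chosen only for illustration.
type1, power = z_test_error_rates(mu0=0.0, mu_alt=1.0, sigma=2.0, n=25)
print(f"P(reject H0 | H0 true) = {type1:.3f}")
print(f"power = {power:.3f}, P(fail to reject | HA true) = {1 - power:.3f}")
```

The first number equals alpha by construction, while the power depends on which alternative is true. This is why the significance level alone does not give the overall probability of a mistake.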

Errors

Z-test

Review

Example)

Suppose the data follow

$\text{data} \sim N(\mu, 4^2)$

with an assumed null hypothesis H0 on $\mu$. If the collected data are 1, 2, 3, 6, -1, should we reject H0 at a significance level of 0.05?

Answer)

Since p < 0.05, we reject H0.
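
The computation behind this example can be sketched as follows. The hypotheses are not spelled out in these notes, so the sketch assumes H0: mu = 0 against the one-sided HA: mu > 0; the function name `one_sided_z_test` is likewise illustrative. Read literally, N(mu, 4^2) means sigma = 4, which gives z of about 1.23 and a one-sided p of about 0.11. The conclusion p < 0.05 is reproduced when the variance is taken to be 4 (sigma = 2), which gives z of about 2.46 and p of about 0.007. The code evaluates both readings.

```python
from math import sqrt
from statistics import NormalDist, mean


def one_sided_z_test(data, mu0, sigma):
    """Return (z, p) for H0: mu = mu0 vs HA: mu > mu0 with known sigma."""
    n = len(data)
    z = (mean(data) - mu0) / (sigma / sqrt(n))
    p = 1 - NormalDist().cdf(z)  # upper-tail probability P(Z > z)
    return z, p


data = [1, 2, 3, 6, -1]
# ASSUMPTION: mu0 = 0 and a one-sided alternative; not stated in the notes.
for sigma in (4.0, 2.0):
    z, p = one_sided_z_test(data, mu0=0.0, sigma=sigma)
    decision = "reject H0" if p < 0.05 else "do not reject H0"
    print(f"sigma = {sigma}: z = {z:.3f}, p = {p:.4f} -> {decision}")
```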

The Student t distribution

One sample t-test

The z-test assumes the variance is known, but in practice it is usually unknown and has to be estimated from the data.
- In that case, use a one-sample t-test.

Example)

Suppose the variance in the previous example is unknown. The collected data are again 1, 2, 3, 6, -1, with an assumed null hypothesis H0 on $\mu$.

At a significance level of 0.05, should we reject H0?
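
A minimal sketch of the one-sample t-test for this data is given below. It assumes H0: mu = 0 against the one-sided HA: mu > 0, which these notes do not state explicitly. The helper names `t_pdf`, `t_upper_tail` and `one_sample_t_test` are illustrative. The tail probability is obtained by numerically integrating the t density with Simpson's rule, so only the standard library is needed.

```python
from math import gamma, pi, sqrt
from statistics import mean, stdev


def t_pdf(x, df):
    """Density of the Student t distribution with df degrees of freedom."""
    c = gamma((df + 1) / 2) / (sqrt(df * pi) * gamma(df / 2))
    return c * (1 + x * x / df) ** (-(df + 1) / 2)


def t_upper_tail(t, df, steps=2000):
    """P(T > t) for T ~ t(df): Simpson's rule on [0, |t|] plus symmetry."""
    a = abs(t)
    h = a / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3  # integral of the pdf from 0 to |t|
    tail = 0.5 - area     # P(T > |t|)
    return tail if t >= 0 else 1 - tail


def one_sample_t_test(data, mu0):
    """Return (t, df, one-sided p) for H0: mu = mu0 vs HA: mu > mu0."""
    n = len(data)
    s = stdev(data)  # sample standard deviation (n - 1 in the denominator)
    t = (mean(data) - mu0) / (s / sqrt(n))
    return t, n - 1, t_upper_tail(t, n - 1)


# ASSUMPTION: mu0 = 0 and a one-sided alternative; not stated in the notes.
t, df, p = one_sample_t_test([1, 2, 3, 6, -1], mu0=0.0)
print(f"t = {t:.3f}, df = {df}, p = {p:.4f}")
```

Under these assumptions t is about 1.90 with 4 degrees of freedom and p is about 0.065, so H0 would not be rejected at the 0.05 level. Estimating the variance from only five points widens the null distribution compared with the z-test.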

Two-sample t-test with equal variance

Data: two datasets, each drawn from a normal distribution
The means and variances are all unknown, but the two variances are assumed to be equal
Null Hypothesis:
$\mu _1 = \mu _2$
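Test statistic:
$t = \dfrac{\bar{x} - \bar{y}}{s_P}$, where $s_P^2 = \dfrac{(n-1)s_x^2 + (m-1)s_y^2}{n+m-2}\left(\dfrac{1}{n} + \dfrac{1}{m}\right)$ and $n$, $m$ are the two sample sizes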
Null distribution:
$f(t \mid H_0)$ is the pdf of $T \sim t(n + m - 2)$.
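
As a rough illustration, the standard pooled-variance form of this statistic can be computed as below. The function name `pooled_t_statistic` and the two samples are hypothetical; the notes give no data for this test. The sketch returns the statistic together with the n + m - 2 degrees of freedom of the null distribution.

```python
from math import sqrt
from statistics import mean, variance


def pooled_t_statistic(x, y):
    """Return (t, df) for the equal-variance two-sample t-test of mu_1 = mu_2."""
    n, m = len(x), len(y)
    # Pooled estimate of the common variance from both samples.
    pooled_var = ((n - 1) * variance(x) + (m - 1) * variance(y)) / (n + m - 2)
    # Estimated standard deviation of the difference in sample means.
    s_p = sqrt(pooled_var * (1 / n + 1 / m))
    return (mean(x) - mean(y)) / s_p, n + m - 2


# Hypothetical samples, for illustration only.
x = [5.1, 4.9, 6.2, 5.8, 5.5]
y = [4.2, 4.8, 5.0, 4.4]
t, df = pooled_t_statistic(x, y)
print(f"t = {t:.3f}, df = {df}")
```

The p-value would then come from the tail of the t(n + m - 2) distribution, one-sided or two-sided depending on the alternative hypothesis.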