What is the scenario we are going to be focusing on here?
Why can’t we assess whether our sample and population means are different from the UK average?
due to sampling errors
What are the questions we could ask of this data?
• Does the sample mean look very different from the UK population average?
- Hypothesis: The sample came from a population with higher/ different population mean
- Same as asking ‘does the sample mean look inconsistent with a sample of the appropriate size from this population?’
• Do the older vs. younger sample means look different from each other?
What is our null hypothesis significance testing example we are going to be using?
What happens in step one of NHST and apply this to our example?
What happens in step 2 of NHST and apply this to our example?
What happens is step 3 of NHST?
What happens in step 4 of NHST?
What happens in step 5 of NHST?
In step 3 if our sample mean is not particularly high what would you say about inconsistency?
Not very inconsistent: p is not small here – i.e. it seems entirely plausible to get a sample mean here if null is true so don’t reject the null
In step 3 if our sample mean is uncharacteristically high what would you say about inconsistency?
Very inconsistent: p is rather small here – i.e. it seems too implausible to get a sample mean here if null is true so reject the null
When do you fail to reject the null?
- suggest not inconsistent with H0
When do you reject the null?
What value is a? (how small should p be before we decide to reject H0)
What is the conditional probability associated with you sample statistic assuming that the null hypothesis is true sometimes called?
The p-value
is p the probability that the null hypothesis is true?
No. We can never know this. We are just looking for evidence of inconsistency
if p<0.05 does that mean that H1 (our research hypothesis) is correct?
- by chance (i.e. random sampling error) you could get a sample mean that was extreme (leading to small p)
What is the z-test?
A null hypothesis significant test in which in step 2 you find out the z-score for your data and hence the conditional probability of obtaining that value. If p < 0.05 then we can reject the null hypothesis in favour of the research hypothesis as our data is inconsistent with the null hypothesis - it is not likely to get this sample mean from the parent population.
When do you use a z-test?
Why don’t we usually use a z-test?
Because we don’t usually have the parent population parameters
What is a one tailed hypothesis?
What is a two tailed hypothesis?
How does does step 3 of z two tailed z-test work?
What is the key difference when working out the conditional probability for a two tailed z-test?
You need to double the probability to get it on both sides.