Null hypothesis significance testing Flashcards by LIVIA BUXTON

Inferential statistics

Attempting to see if we can:
- Infer an alternative explanation that the difference observed is NOT due to chance e.g. due to IV
-Or if it is a result of random variance caused by sampling error, e.g. the IV has no effect

How well did you know this?

Not at all

Perfectly

Null hypothesis symbol

Hₒ

How well did you know this?

Not at all

Perfectly

Null hypothesis meaning

In the population we’re sampling, there is no relationship between the variables being tested (IV effect on DV)

How well did you know this?

Not at all

Perfectly

What does it mean to assume null hypothesis is true

We say…
Under Hₒ there will be no statistically significant differences between groups (experimental condition)

How well did you know this?

Not at all

Perfectly

When can we reject the null hypothesis?

If we were to assume this is true…
But turns out the probability it is due to chance/ sampling error is actually very low
Then we can reject it

How well did you know this?

Not at all

Perfectly

Symbol for the alternative hypothesis

H₁

How well did you know this?

Not at all

Perfectly

What is the alternative hypothesis?

In the population sampled, there is a significant difference between experimental conditions scores, i.e. the IV had an effect on the DV

How well did you know this?

Not at all

Perfectly

If the null hypothesis is rejected, is the alternative hypothesis therefore true?

Maybe but maybe not
Because the tests tell us the probability of the data being obtained due to chance, assuming the null hypothesis is true
Does not tell us the probability of either hypothesis being true as a result of what data we obtained

How well did you know this?

Not at all

Perfectly

How can we argue for the alternative hypothesis being true?

Conditions are identical and controlled in every way - all confounding variables controlled for

How well did you know this?

Not at all

Perfectly

Why might we not be able to conclude the alternative hypothesis is true?

Assumption we have controlled everything is wrong
Experimenter bias, small errors e.g. we are not aware of
there may be other plausaible mechanisms and explanations for the results

How well did you know this?

Not at all

Perfectly

NHST

Null hypothesis significance testing

How well did you know this?

Not at all

Perfectly

What are NHSTs?

Allow us to test for statistically significant differences between groups = how probable this observation is if its purely due to chance
And rule out sampling error

How well did you know this?

Not at all

Perfectly

What do NHSTs use?

Z scores and p values

How well did you know this?

Not at all

Perfectly

Using z scores recap

Transform scores into z scores = how many SDs this score is away from the mean
Score taken away from mean
Divide by the standard deviation
Use a table to obtain probability of obtaining a data score with this z value/ the percentage of scores below this, above this etc

How well did you know this?

Not at all

Perfectly

p value

A probability value of obtaining any given score
This is the value looked up in tables of z value that shows probability of a score being in smaller/ larger portion/ mean to z portion

How well did you know this?

Not at all

Perfectly

How to use z values to determine significance

Based on standardised distribution…
z value of +-1.96 will be 5% of the population in total
If, given the sample size, we can find the number of Ps we will expect to have this z score (5% of the population)
If we were to randomly select participants from this sample

How well did you know this?

Not at all

Perfectly

How to use z values to determine p value

Use same principle as determining outliers:
Think of 2 conditions as having their own standardised distributions that overlap

How well did you know this?

Not at all

Perfectly

alpha sign

Study These Flashcards

Threshold of significance

NHST on paired distributions

Study These Flashcards

How many scores would we expect above the threshold (alpha) if null hypothesis is true
E.g. what percentage of scores would need to be present in the overlap of both distributions for each condition
(If null hypothesis is true then the distributions should be very similar and have a lot of overlap)

p value for null hypothesis significance test

Study These Flashcards

We obtain the probability that, if assuming the H0 is true, we expect to observe results as extremely different as this a percentage of times equal to the p value
E.g if p = 0.001 then 0.1% of the time

If the null hypothesis is true, then group differences that are extreme are…

Study These Flashcards

unlikely but not impossible

Equation to show means are identical aka nul hypothesis is true

Study These Flashcards

μ₁ - μ₂ = 0
aka the means of both sample are the same

Sampling distrubtion of a null hypothesis (plotting means of study that had been done again and again)

Study These Flashcards

For each condition, the means make up its own singular curve:
But they are identical because all the means are the same
And as we take more samples, converges on the population mean closer and closer

What happens if we assume the null hypothesis is true but we obtain large difference in means?

Study These Flashcards

If we obtain a difference in means that is quite large, when plotting a sampling mean assuming the null hypothesis is true,
It is still possible to have a large difference in means even if the results are due to chance by sampling error alone

Central limit theorem undepins NHST

Allows us to imagine all possible outcomes and compare this to the data we collected in single study

Logic of NHST

Obtain p value = assuming the null hypothesis is true, we expect the results at least as extreme as this p% times aka due to chance is this many times If this is very small = improbable that this is sampling error Based on our subjective judgement of how small this value is, we can

alpha

Threshold for statistical significance set

Most common a valye

.05 = p

If p value is less than .05 that was set as alpha

Then we reject the null hypothesis: threshold the likelihood of results being due to chance was exceded negatively (lower than alpha)

p value translation to z value

On a z curve, our decided p value will transform to a z value which will be a certain number of SDs away from mean and also, given the area between the z values on wither side, will be a percentage This percentage = chance that we reject the null hypothesis even though it's true

NHST process

State hypothesis + null hypothesis Set alpha level Apply NHST (statistical model) Decide whether to reject null hypothesis or not

Stating hypothesis types

Directional Non-directional

Directional hypothesis

Predicting the direction you expect group differences to occur such as smaller or bigger between group scores

Non-directional hypothesis

Predict there will be significant group differences but will not state the direction this will occur in

Types of tests

One tailed Two tailed

One tailed hypothesis test

Two tailed hypothesis test

Reject the null hypothesis id sample results fall in EITHER tail of the sampling distribution e.g., extreme high or lower end Alpha value set at will be split between 2 tails e.g. .025

Tails

The extreme ends of the bell curve results may fall under

One-tailed hypothesis test

Reject the null hypothesis if a sample falls in the PREDICTED TAIL e.g. extreme end of the sampling distribution Alpha value set at will NOT be split between 2 tails e.g. stays .05 and is less stringent

Two tailed test is used for what?

Non-directional hypothesis: if we observe statistically extreme difference in either direction, then we reject H0

One tailed test is used for what?

Directional hypothesis

How to decide what hypothesis to use>

Directional hypothesis is chosen if we have strong empirical or theoretical basis to it from previous literature

Pre-registration method

Decide what to do before the study is done: Hypothesis direction test will use Alpha level And justification for why based on pre-reading

Null hypothesis significance testing Flashcards

(43 cards)