What is the Chi-Square test used for?
To test for a difference between observed and expected frequencies in categorical (nominal) data.
When is Chi-Square appropriate?
Nominal data
Independent groups design
Test of difference
Strengths and weaknesses of Chi-Square?
Simple to use with categorical data
Widely applicable
− Requires expected frequencies
− Less powerful than parametric tests
What is the Mann-Whitney U test used for?
To test for a difference between two independent groups using ordinal data.
When is it appropriate?
Independent groups design
Ordinal data or non-normal interval data
strengths and weaknesses of the mann-whitney u test?
Good alternative to independent t-test
Works with small samples
− Less sensitive than parametric tests
− Uses ranks → loss of detail
What is the Sign Test used for?
To test for a difference in repeated measures designs using nominal data (direction of change).
When is it appropriate?
Repeated measures design
Nominal data (e.g. + / − changes)
Strengths and weaknesses of the sign text?
Very simple to calculate
Minimal assumptions
− Very low sensitivity
− Ignores magnitude of change
What is Spearman’s rank used for?
To test for a correlation between two co-variables.
When is it appropriate?
Correlational design
Ordinal data (or non-parametric interval data)
Strengths and weaknesses of the spearman’s rank test?
Identifies strength and direction of relationship
Works with ranked data
− Cannot show causation
− Affected by outliers
What is the Wilcoxon test used for?
To test for a difference in repeated measures or matched pairs designs using ordinal data.
When is it appropriate?
Repeated measures or matched pairs
Ordinal data
Strengths and weaknesses of the Wilcoxon test?
More sensitive than Sign Test
Considers magnitude of differences
− More complex to calculate
− Still less powerful than parametric tests
What is a probability value (p-value)?
The probability that the results occurred by chance.
How are p-values interpreted?
p ≤ 0.05 → significant (reject null hypothesis)
p > 0.05 → not significant (accept null hypothesis)
What is a significance level?
The threshold used to decide if results are statistically significant (usually 0.05).
Why is 0.05 commonly used?
It means there is only a 5% probability results are due to chance.
What is an observed value?
The test statistic calculated from the data
What is a critical value?
The value taken from a statistical table to compare against the observed value.
How do you decide whether results are significant?
If observed value ≥ critical value → significant → reject null
If observed value < critical value → not significant → accept null
What do the symbols mean in inferential statistics?
= equal to
< less than
> greater than
≤ less than or equal to
≥ greater than or equal to
How do you choose the correct statistical test?