When are statistical tests used?
A statistical test is used to determine whether a difference or association/correlation found in a particular investigation is statistically significant (i.e. whether the result could have occurred by chance or there is a real effect).
What are the three criteria when choosing a statistical test?
What are the tests of difference?
(Related) Repeated
Nominal: Sign test
Ordinal: Wilcoxon
Interval: Related t-test
(Unrelated) Independent
Nominal: Chi-Squared
Ordinal: Mann-Whitney U
Interval: Unrelated t-test
What are the tests of correlation?
Nominal: Chi-Squared
Ordinal: Spearman’s rank/rho
Interval: Pearson’s product-moment/r
What are parametric tests?
Parametric tests are those that make assumptions about the parameters of the population distribution from which the sample is drawn.
They include the related t-test, the unrelated t-test, and Pearson’s product-moment/r.
What is nominal data?
What is ordinal data?
What is interval data?
What is significance?
The difference/association between two sets of data is greater than what would occur by chance - coincidence or fluke. To find out if the difference/association is significant, we need to use a statistical test.
What happens if the statistical test is not significant?
If the statistical test is not significant, the null hypothesis is accepted. The null hypothesis states there is ‘no difference’ or ‘no correlation’ between the conditions. The statistical test determines which hypothesis (null or alternative) is ‘true’ and thus which we accept and reject.
What is probability?
Probability (p) is a numerical measure of the likelihood that certain events will occur, where 0 is statistical impossibility and 1 is a statistical certainty.
There are no statistical certainties in psychology but there is a significance level - the point at which the null hypothesis is accepted or rejected.
What is the usual level of significance?
The accepted level of probability in psychology is 0.05 (or 5%). This is the level at which the researcher decides to accept the research hypothesis or not.
If the research hypothesis is accepted, there is less than 5% probability that the results occurred by chance.
This is a compromise between too lenient (10%) or too stringent (1%).
What is the calculated and critical values?
The calculated value is compared with a critical value to decide whether the result is significant or not. The critical values for a particular test are given in a table of critical values based on probabilities.
How do you find the critical value?
To find the critical value, you need to know:
What does one-tailed mean?
When there’s a directional hypothesis.
What does two-tailed mean?
When there’s a non-directional hypothesis.
What is a type one error?
A type one error refers to a situation in which we assume that our findings show something when they don’t.
Therefore we should have kept the null hypothesis rather than rejecting it.
This is an optimistic error or false positive as a significant difference or correlation is found when one does not exist.
What is a type two error?
A type two error refers to a situation in which we may miss something that is actually happening.
Therefore we should have rejected the null hypothesis rather than accepting it.
This is a pessimistic error or false negative.
What is wrong with type one error?
too lenient
What is wrong with type two error?
too strict
What is the standard p-value?
Setting the p-value correctly is a balancing act, as we want to avoid both type one and two errors but they can never be entirely avoided. The “standard” used by the social sciences is P < (equal to or less than) 0.05.
What are the conditions of use for the sign test?
Used to analyse the difference in scores between related items, e.g. the same participant is tested twice. Can be used with nominal data (or better).
What is the calculation for the sign test?
What is the critical value for the sign test?
If S is equal to or less than critical value, then S is significant and the experimental hypothesis is retained.