What is parametric data?
Why is it important to identify parametric data
Steps to identify if you need to use parametric test on data
Has an outlier?
Can outlier be removed?
Data is skewed or kurtotic?
Can the data be transformed?
Testing for outliers
What to do when you have an outlier
Shapiro-Wilk test: Testing skew and kurtosis
Parametric tests
Pearson correlation
T-test - 2 groups
(between groups or within groups)
ANOVA - >2 groups
(IRM, we’ll work with between
groups)
Non-parametric tests
Spearman correlation
Wilcoxon test
(2 groups/conditions)
Kruskall-Wallis test
(3 or more groups)
Parametric v non-parametric tests
Degrees of freedom
The independent samples t-test
In psychology, this tends to correspond to two different groups of participants, where each group corresponds to a different condition in your study. For each person in the study you measure some outcome variable of interest, and the research question that you’re asking is whether or not the two groups have the same population mean.
Homoscedastic
Even distribution/variance across correlation line
Homogenous or equal variance.
Heteroscedastic
Uneven distribution/variance across correlation line
Heterogenous or unequal variance.
Independence (in context of parametric tests)
Pearson correlation
r = 0 no relationship
r = 1 perfect positive relationship
r = -1 perfect negative relationship
Correlations are also measures
of effect size!
Pearson correlation Assumptions
T-tests
p value less than 0.05 is statistically significant
T value:
This is the result of the t-test formula, which compares the means of two groups (or a sample mean to a population mean) relative to the variability in the data (standard error).
A higher absolute t-value indicates a larger difference between groups (or means) relative to the variance, which suggests a stronger deviation from the null hypothesis.
Assumptions of t-test
Between group (independent) t-tests
If the two means are from different people.
Within group (dependent or paired) t-tests
if the two means are from the same people at different times.
Spearman’s rank order correlation
Treat data as an ordinal scale and rank each variable in order. e.g. That is, student 1 did the least work out of anyone (2 hours) so they get the lowest rank (rank = 1). Student 4 was the next laziest, putting in only 6 hours of work over the whole semester, so they get the next lowest rank (rank = 2).
Cohen’s d
effect size that tells us how large the difference between groups of data. The strength of the effect.
A d of 0.5 indicates that the two group means differ by 0.5 standard deviations.
A d of 1 indicates that the group means differ by 1 standard deviation.
A d of 2 indicates that the group means differ by 2 standard deviations.
A value of 0.2 represents a small effect size.
A value of 0.5 represents a medium effect size.
A value of 0.8 represents a large effect size.
Error in statistical models…
Is important to measure, report and interpret
Data = x + y, where x and y are…
Model and error