When we want to compare more than 2 variables, looking at the differences between conditions between the variables.
ANOVA test
What is ANOVA?
ANOVA is similar to a t-test, the job of an ANOVA is to determine whether the variation of sample means among groups is greater than expected by chance.
Why can we not use a t-test for more than two variables?
The more comparisons we make, the greater the possibility we’ll pick up a difference purely by chance and associate it with some biological effect when in reality, there is none.
What are the 3 primary assumptions of ANOVA?
One way ANOVA
One independent variable
One p value
two way ANOVA
Two independent variables
Three p values
Factorial design
Why can we not answer these questions with repeated t-tests?
When do we use a one way ANOVA?
ANOVA in R
Degrees of freedowm (df)
F value
P value
The variable defining the groups must be identified as a factor (or a categorical variable). Otherwise it will be treated as a continuous variable.
The relevant functions are the ‘aov’ and ‘summary’ functions as illustrated below (Note some numbers may differ slightly from those on the preceding slides due to rounding).
It needs to be set as a factor or else the variable will just be treated as a character.
What is an F-test?
Basically the critical value equivalent of the ANOVA.
As with the other hypothesis tests we studied earlier, we need to calculate the test statistic and determine the rejection region
MS denotes the mean squared error.
The rejection region is determined using an F distribution with degrees of freedom equal to the degrees of freedom between groups and the degrees of freedom within groups.