Lecture 16; ANOVA: Flashcards

(25 cards)

1
Q

What is a controlled (experimental) study?

A

Often discrete (categorial) variation in the predictor variable, with no manipulation or assignment of treatments by the researcher.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a uncontrolled (observational) study?

A

Often continuous variation in the predictor variable, with no manipulation or assignment of treatments by the researcher.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the 4 classes of statistical design?

A

1) Continuous Dependent + Continuous Independent = Regression
2) Continuous Dependent + Categorical Independent = t-tests and ANOVA
3) Categorical Dependent + Continuous Independent = Logistic Regression
4) Categorical Dependent + Categorical Independent = Tabular

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

A study once suggested that shining light on the back of the knees could reset the human circadian clock and potentially prevent jet lag, sparking excitement among scientists, entrepreneurs, and the public.

  • What was wrong with this study as pointed out by Wright and Czeisler (2002)?
A

Subjects may have been exposed to light reaching their eyes indirectly, even though the light was aimed at the back of the knees.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What were the 3 treatments to test “the knees who say night”?

A

Created 3 treatments; 22 people to one of the 3 treatments:
1) Control: light device present without producing light
2) Goggles that blocked all light from eyes and light shone at the knees
3) Light in the eyes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is ANOVA as demonstrated by the knee study?

A

A statistical method used to compare the means of three or more independent groups to determine if at least one group mean is statistically different from the others.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Can samples from different statistical populations have similar means? and why is this important?

A

YES: This is important because statistical populations can have identical means but differ in their variances (and other aspects).
- This indicates that samples could be drawn from distinct populations, yet their means may not differ. This aligns with the null hypothesis (H0), which assumes no difference in means between groups.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are generalized forms of H0 and HA preferred by the professor?

A
  • H0: Differences in group means are solely due to sampling variation from statistical populations that share the same mean.
  • HA: Differences in group means are not solely due to sampling variation, indicating the populations may differ in their means.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is sampling error due to?

A

Remember: Sampling error is due to sampling variation, i.e., samples that come from the statistical populations sharing the same mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What variables are involved in an ANOVA?

A

An ANOVA always involves one continuous response variable (e.g., shift in circadian rhythm) and one categorical predictor variable.
–> The categorical variable (predictor) is divided into groups which are often referred to as treatments or factor levels → dead or alive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When studying only a single factor, what type of ANOVA is used?

A

One-way ANOVA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does ANOVA analyze? How is this achieved by the F-statistic?

A
  • We are analyzing the variances of the mean!!
  • The F-statistic achieves this by evaluating the ratio of two variance components.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How does HETEROSCEDASTICITY (unequal variances) influence the F-ratio in the F-statistic?

A

HETEROSCEDASTICITY reduces the F-ratio ability to differentiate among differences in means among groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the formula for the ratio of variances to calculate the F-statistic?

A

F = Variance among group means (due to “treatment”)/Variance within groups (referred to as error or residual variation, it represents the variation not explained by the differences in means among groups)

F = Group mean square/Error mean square

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does a lower F-statistic mean?

A

Heteroscedastic = increased variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What 2 things does the F-statistic detect?

A

1) Variation among means of different groups → numerator
2) Variance within groups → denominator

17
Q

We require a test statistic that can detect variations in means across multiple groups: what achieves this and how?

A

The F-statistic achieves this by evaluating the ratio of two variance components.

18
Q

What is the role of degrees of freedom for estimating variance in an unbiased manner?

A
  • Computing the sum of squares requires subtracting the sample mean, which introduces bias because the mean is estimated from the same data.
  • To correct for this, the sum of squares is divided by the appropriate degrees of freedom for the numerator and denominator sum of squares, producing an unbiased estimate of variability
19
Q

What does the numerator and denominator degrees of freedom for F-statistic depend on?

A

The numerator degrees of freedom depends on the number of groups (g1) and the denominator degrees of freedom depends on the total number of observations (N-g)

20
Q

TRUE or FALSE: ANOVA is directional

A

FALSE: non-directional

21
Q

What does it mean when we say ANOVA is non-directional?

A
  • ANOVA is non-directional, but it focuses on large F values because large values indicate that group means differ more than expected by chance. Small F values, in contrast, suggest that the means are very similar.
  • Thus, although ANOVA tests a non-directional hypothesis, evidence against the null hypothesis appears only in the upper tail of the F-distribution → values on the right tail.
22
Q

How is the p-value estimated from the F-distribution seen in ANOVA?

A

The p-value is estimated as the number of F-values in the null distribution equal or greater than the observed F-value (i.e., one tailed-test).

23
Q

What are the 3 assumptions of ANOVA?

A

1) Observations are independent random samples from their respective populations.
2) The response variable (e.g., shift in circadian rhythm) is approximately normally distributed within each treatment group (we will revisit this assumption later).
3) Variances are equal across groups (homoscedasticity). Unequal variances can distort F-values and lead to unreliable inference about differences among means (discussed in a later lecture).

24
Q

What does Levene’s test do?

A

Statistical inferential method used to assess the equality (homogeneity) of variances across two or more population samples.

25
What is the null hypothesis for ANOVA?
NULL: all group means are equal --> if this is rejected the standard ANOVA should not be used