What is non-significance in statistics?
What is insignificance in statistics?
What are the 4 assumptions of a two-sample comparison of means?
1) Each of the two samples is a random sample from their population
2) The variable (e.g., horn length) is normally distributed for each population.
3) The standard deviation (and variance) of the variable is the same in both populations.
4) The theoretical sampling distribution of the differences between sample means assuming H0 as true follows a t-distribution only if the samples are drawn from populations with equal variances.
If the null hypothesis is true and variances are different, what is the effect on alpha?
When the assumption of equal variances is violated (heteroscedasticity), the actual Type I error rate of tests that assume equal variances (e.g., the classical Student’s two-sample t-test) can deviate from the nominal α. This deviation can go in either direction: the test can become liberal (inflated Type I error) or conservative (deflated Type I error).
Define heteroscedasticity
Unequal variances
Define homoscedasticity
equal variances
What is the intuition underlying a two-sample test of variances?
How does the F-distribution differ from the T-distribution?
What do we do because the F-distribution is asymmetric?
We typically place the larger variance in the numerator so that the F-statistic is ≥ 1 and easier to interpret.
–> Using the reverse ratio leads to a different numerical value, but it corresponds to the opposite tail of the same distribution, resulting in an equivalent test and conclusion.
How do you calculate F for the f-distribution?
F = Pop variance(1)/Pop variance(2)
–> pop variance is pop sd^2
For distribution, how is the exact p-value calculated?
Multiplying the p-value by 2 gives the exact p-value provided that the largest variance is in the numerator.
What are the 2 assumptions for the two-sample comparison of variances?
1) Both samples are independently drawn at random from their respective statistical populations (live and dead).
2) The variable (e.g., horn length) is normally distributed in each statistical population (live and dead).
When the variances of two samples are unequal, a different version of the t-test should be used to compare their means. What is it?
Welch’s t-test
Why isn’t heteroscedasticity an issue for paired t-tests?
Because it operates on a single sample of differences between paired observations rather than on two separate samples.
When is Welch’s t-test used?
If variances differ
For the following example: Research question: Does the presence of brook trout affect the survivorship of salmon?
What is the H0 and HA?
1) H0: The variance of the proportion of chinook salmon surviving is the same in streams with and without brook trout (i.e., 𝜎12 = 𝜎22).
2) HA: The variance of the proportion of chinook salmon surviving differs in streams with and without brook trout (i.e., 𝜎12 ≠ 𝜎22).
What are the 2 differences between standard t-test and welch’s t-test?
1) Standard t-test for comparing two-sample means use a common variance estimator (i.e., pooled variance). Welch’s test does not assume equal population variances, so each sample provides its own estimate of variability.
2) Overall uncertainty depends on how reliable each variance estimate is. When sample sizes are small or variances are very different, this uncertainty increases, effectively reducing the degrees of freedom in comparison to the standard two-sample t-test
Describe the Welch’s t-Test degrees of freedom
Why non-whole numbers for degrees of freedom for Welch’s t-test and what is the effect of this?
TRUE or FALSE: When the null hypothesis for means is true (equal 𝜇) but the variances differ, the risk of false positives exceeds the pre-established alpha level (in general).
TRUE
With smaller degrees of freedom, the p-value for Welch’s t-test tends to be larger than that of the standard t-test. How is this remedied?
Why is there such emphasis on ensuring that the Type I error rate (α) is properly controlled?
For example, Welch’s t-test maintains the correct Type I error rate when variances differ, and procedures addressing multiple testing or p-hacking aim to prevent an inflated risk of false discoveries.
When you reject H0, you are making a positive claim. If that claim is wrong (Type I error), what can it create?
If that claim is wrong (Type I error), it can: create false knowledge, propagate through future studies, and lead to poor decisions or wasted resources.
–> In contrast, a Type II error is more conservative: you simply fail to make a claim. While this can also be costly (e.g., missing a real species), it does not introduce false conclusions into the scientific record.
What are the 3 potential impacts of a Type I error?
1) Wastes time and resources: Pursuing a non-existent effect.
2) Can cause harm: Approving an ineffective drug or treatment.
3) Loss of credibility: Damages trust in scientific findings.