What are descriptive stats
What do descriptive statistics ntot tell us
us whether the difference between these groups can be inferred beyond our sample to the population
what do inferential statistics generate and what can they help with
p-value = help us understanding if there is a difference in our general population.
what is a p-value/ inferential statistic dependant on
what type of data level is used
What is an inferential statistic
what are the two types of inferential statistics
frequentist ( focused on in this lecture)
bayes
what is a null hypothesis
what is an alternative hypothesis
(BASIC UNDERSTANDING)
We define a null hypothesis. This means that there is no difference between the groups we are looking at.
An alternative hypothesis states that there is a difference in the results
what do we compare the null hypothesis
with the alternative hypothesis to see a contrast
What is the purpose of null hypothesis testing?
It estimates the probability of obtaining a result /pvalue (or one more extreme) by chance, assuming the null hypothesis of no difference or association is true.
If the result is unlikely or extreme under this assumption, we reject the null and conclude there is evidence of a real difference or association.
for p-values what value is known as satistically significant
0.05/ 5%
What is the alpha level
the leve whcih we accept a result to be significant (0.05 )
what is the true definition of a p-value in frequent statistics
Probability the result we found (or one more extreme) occurred by chance assuming the null hypothesis is true.
what is the misinterpretation about p-values
p-values tell us the likelihood that our (alternative) hypothesis is real / true. - It can’t be. The p-value is specific to an experiment and Null hypothesis.
if p i slower than 5% what does this indicate
we should reject the null. As it is suprising enough to be real
What does the normal distribution tell us in hypothesis testing?
Normal Distribution:
A bell curve that shows the range of possible outcomes.
Used to determine how likely your sample result is if the null hypothesis is true.
P-value:
The probability of obtaining a result as extreme (or more extreme) than your sample result, if the null hypothesis is true.
Low p-value (e.g., < 0.05) → Reject the null hypothesis (your result is unlikely under the null).
High p-value → Fail to reject the null hypothesis (your result is not unusual).
Null hypothesis (H₀): The average height is 160 cm.
Sample result: Average height = 165 cm.
P-value: 0.02 → Reject the null hypothesis (165 cm is unlikely if the true average is 160 cm).
What percentages of observations fall within standard deviations of the mean in a normal distribution?
68% within ±1 SD
~95% within ±2 SDs
In a normal distribution, the ±1.96 standard deviations (SDs) from the mean cover 95% of the data, leaving only 5% outside of this range—split between the two tails of the distribution.
So in hypothesis testing, we often use 1.96 to determine the boundaries for a 95% confidence interval or the critical region for rejecting the null hypothesis at a 5% significance level (α = 0.05). This means that
what are the region of rejections
They are the extreme ends (tails) of a distribution where results are unlikely if the null is true.
If a test statistic falls in this region (e.g., p < 0.05, beyond ±1.96 SDs), we reject the null hypothesis.
What if a sample has an extreme (<5%) probability under the null hypothesis?
It’s very unlikely to happen by chance, so it might come from a different population — suggesting a real difference or effect.
How do effect sizes (Cohen’s d) relate to p-values?
what is the definition of cohen d.
When Cohen’s d = 0 → no real effect → p-values are usually not significant (p > .05).
When Cohen’s d = 1–2 → large real effect → p-values are usually significant (p < .05).
Example: Height difference between men and women (d = 1.72) shows a strong, real difference that’s almost always statistically significant.
Cohen’s d is a measure of effect size that tells you how large the difference is between two groups (or conditions), relative to the variability within those groups.
what is a 1-tailed test
used for directional hypothesis . The 5 percent significance levels is concentrated on one tail
what is a 2-tailed test
non-directional hypothesis
5 percent significance levels are split across both tails. We only use 2-tailed tests are used
what is the 5 percent significance zone
part of our results that are too extreme/ random
What are Cohen’s d effect size bins
Cohen provided guidelines to help interpret the magnitude of the effect size, or what we often call effect size bins.
0.2 = small
0.5 = medium
0.8 = large
How do effect size and p-value differ?
Effect size (d): how big the difference is.
p-value: how surprising the result is (depends on sample size).
✅ Big effects can be non-significant, and small effects can be significant.