What will most studies do before data collection begins?
attempt to determine size of the sample needed to achieve certain degree of accuracy in estimation
Two reasons why estimating minumum sample size commonly done with population proportions?
Why is estimating minimum sample size less commonly done with population means?
Three things that must set to calculate the desired sample size
why must we select a
value of the population proportion – π (most conservative is 0.5)
Why is the most conservative guess of pi 0.5?
this value produces the largest standard error (most spread/variance); you’ll know you got the right sample size
What is you guessed that pi is 0.5, but it turns out that it is was actually 0.8?
Either way, it would’ve said you needed a smaller sample size, less observations than you originally thought, your result will be even more precise
What equation do you use to answer the question “for a given level of precision, how many observations do I need to ensure that election night +/- value that I want?”
n = π( 1− π)
( z/B)^2
in the equation n = π( 1− π)
( z/B)^2, what does B denote?
the maximum error around the proporMon
in the equation n = π( 1− π)
( z/B)^2, what does n denote?
denotes the sample size ensuring that, with fixed probability,
the error of esMmaMon of π by the sample proporMon is no
greater than B
in the equation n = π( 1− π)
( z/B)^2, what does zw denote?
the corresponding z-score for a confidence interval
with a confidence coefficient equal to the fixed probability
In the equation n = π( 1− π)
( z/B)^2, can you throw in a value for pi?
NO - that is the true population proportion (you don’t know that, and you can’t estimate it by throwing in sample value)
Two solutions to not knowing pi in n the equation n = π( 1− π)
( z/B)^2
What equation do you use to calculate the necessary sample size for estimating means?
n = σ^2(z/B)^2
What equation do you use to calculate the necessary sample size for estimating proportions?
n = π( 1− π)( z/B)^2,
What must you specify when calculating sample size for estimating means? n = σ^2(z/B)^2
2 things
Desired confidence coefficient
population standard deviation
What must you specify when calculating sample size for estimating means? n = σ^2(z/B)^2 , how might you specify the standard deviation?
*use standard deviation from past research, or related population data if possible
when dealing with calculating sample size for estimating means: the __ the spread of the population
distribution, as measured by the __, the __ the
sample size needed to achieve a certain accuracy
greater
standard deviation
larger
Two types of estimators?
Biased
Unbiased
Unbiased estimator
the average sample statistic (from an indefinitely large number of samples) equals the population parameter in the long run/ (average of sampling distributions’ statistics will equal parameter)
T/F: When you have an unbiased estimator, for some samples a statistic may underestimate the
parameter of interest and for others it may overesMmate
the parameter
True - but in the long run the estimates will “average” themselves out
Biased estimator/statistic
In the long run, the statistic consistently over or underestimates the parameter it is estimating (the average of all possible statistics is not equal to the parameter)
Why is range a biased estimator?
when find all possible sample ranges and take the average, doesn’t equal population range, since almost of all sample ranges might be like 4 or 7, only combos including outlier of 10 million have the true range captured
Why is standard deviation a biased estimator?
end to underestimate, since most samples don’t contain the max and min value; the true population parameter will always be a bit larger, also using a different equation