Given n = 250 and ý = 27, what can be say according to the LLN?
sample size of 250 means its closer than if we had 200 or 20
Given n = 250 and ý = 27, what can be say according to the CLT?
Sampling distribution of the sample mean should be approximately normal in shape (functional form), mode=median=mean of the sampling distribution is in the middle and equal to the population mean; standard deviation of sampling distribution is sigma/squared n
Why is CLT cool?
As n gets larger, the sapling distribution approaches normality, even if the pdf of X in the population is not normal
Point estimate
a single number, calculated from a set of data, that is the best single guess for the population parameter
Point estimator
a sample statistic that predicts the
value of that parameter; can think of this as the
equation used to produce the point estimate
Problems with a point estimate?
Maybe high probability of being highly wrong
What problem do you still have even with CLT and LLN?
Though we know the outcome of increasing sample size, is it clear how good of a guest our sample statistics are relative to the population parameter?
Interval estimate
consists of a range of numbers around the point estimate, within which the parameter is believed to fall
What is another word for the interval estimate?
the confidence interval
What does interval estimate allow us to do?
Gauge accuracy of a point estimate using probability
Why are we able to gauge the accuracy of a point estimate using probability?
We know the probability of the population parameter falling in a given interval
What is confidence interval based on? (2)
A point estimator and the spread of the sampling distribution of that estimator
What assumption must be met for you to use confidence intervals?
that the sampling distribution is approximately normal
How do you construct a confidence interval?
Why is knowing “If we know the parameters of a population, specifically the mean and standard deviation, then we can predict the chance that a given sample of size n will have a sample mean within a certain distance of the population” useful?
We can reverse it to find confidence interval
What is the implication of a 95% confidence?
5% chance that the sample mean does not fall within the interval you get a mean that range does not include the true population parameter
Is it true that there’s a 95% chance that the mean will fall within the interval?
No, it either does or it does not. Rather, 95% of possible means will contain the population mean
How do we reverse the logic?
given a sample of size n and a sample standard deviation and mean, we predict the chance that the unknown population mean is within a certain distance of the sample mean
Once a sample mean is calculated, if the sample
mean does fall within the interval
µ−1.96σÝ and µ+1.96σŶ, then…
the interval from Ý−1.96σ Ý and Ý +1.96σ Ý
contains μ
What is the equation for confidence interval?
ci= Ý ± z ˆσý
For confidence interval equation - how do we get z?
Usually not given z. instead start with desired confidence interval, then select appropriate z-score.
Confidence coefficient
The probability that the interval estimate contains the parameter. Typical confidence coeffieincets are .99 and .95
What drives the z-score that you use for confidence interval calculations?
the confidence coefficient
If you decide on confidence coefficient of .95, what z-score would you use?
1.96 (0.025 of the distribution falls outside this z-score)