What is one primary goal of statistics?
One primary goal of statistics is to estimate (infer) an unknown quantity (parameter) of a population based on sample data.
What does estimation involve?
Estimation involves inferring a population parameter (e.g., mean, standard deviation, median) from a sample.
What is variation within a sample summarized by?
The variation within a sample is summarized by the standard deviation
What does variation within a sample tell us?
The variation within a sample, summarized by the standard deviation, informs us about the expected variability of sample means around the true population mean—that is, how much our sample average might deviate simply because of random sampling.
What is Statistical “superpower”?
By measuring variation within a sample, we can estimate how uncertain our estimate is—that is, how much the sample means would fluctuate if we repeated the study.
What is a population parameter vs a sample estimate and provide an example?
Why is an an estimate (derived from a sample) rarely, if ever, exactly the same as the population parameter being estimated—especially in large populations?
because sampling is influenced by chance.
At its core, statistics asks…
At its core, statistics asks: given uncertainty arising from random sampling, how reliable is an estimate, and how much confidence should we place in decisions based on it? Put differently, how close is a sample-based estimate expected to be to the true population value?
Is the goal of statistics to quantify or eliminate uncertainty?
The goal is not to eliminate uncertainty (one can’t), but to quantify it, so that we can make decisions with a known degree of certainty.
What are 3 examples of properties of estimators?
1) Mean
2) Variance
3) Sd
What is a sampling distribution and what does it describe?
How do sampling distributions and frequency distributions differ?
Sampling distributions describe probabilities of possible estimates, not frequencies of observed data values
What is µ?
µ = population mean (we say “mu”, Greek alphabet).
What is σ?
σ = population standard deviation (we say “sigma”)
What is σ^2?
σ2 = population variance (we say “sigma squared”)
What is X̄ or Ȳ?
X̄ = sample mean (we say “X bar”, LaBn or Roman alphabet).
–> While µ always represents the population mean of a variable (e.g., X), the symbol for the sample mean depends on the variable being measured. For example, the sample mean of X is written as X̄, and the sample mean of Y is written as Ȳ. The key point is that the sample mean is always indicated by a bar over
What is s?
s = sample standard deviation
What is s^2?
sample variance
What is the first property of sampling distributions?
Property 1: The mean of all sample means is always equal to the population mean
When the mean of all possible sample means = the population parameter, the estimate is said to be…
the sample mean is unbiased because, under random sampling, the sample means do not systematically tend to be either larger or smaller than the true population mean
When sampling is biased, what does this mean about the differences in sample estimate and population value?
When sampling is biased, differences between the sample estimate and the population value are no longer due solely to random variation—they reflect a systematic distortion of the sampling process
How does sample size affect the precision of sample estimates under random sampling?
Does random sampling eliminate sampling error?
Random sampling does not eliminate sampling error, but it prevents systematic bias and allows sampling error to be measured
What is the 2nd property of sampling distributions?
Property #2: Under random sampling, increasing sample size reduces sampling variability, so sample means cluster more tightly around the true population mean, resulting in greater precision.