Lecture 8; Sampling Distributions: Flashcards

(29 cards)

1
Q

What is one primary goal of statistics?

A

One primary goal of statistics is to estimate (infer) an unknown quantity (parameter) of a population based on sample data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does estimation involve?

A

Estimation involves inferring a population parameter (e.g., mean, standard deviation, median) from a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is variation within a sample summarized by?

A

The variation within a sample is summarized by the standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does variation within a sample tell us?

A

The variation within a sample, summarized by the standard deviation, informs us about the expected variability of sample means around the true population mean—that is, how much our sample average might deviate simply because of random sampling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Statistical “superpower”?

A

By measuring variation within a sample, we can estimate how uncertain our estimate is—that is, how much the sample means would fluctuate if we repeated the study.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a population parameter vs a sample estimate and provide an example?

A
  • A parameter describes a quantity in a statistical population, while an estimate (or statistic) is a similar quantity derived from a sample.
  • For example, the mean of a population is a parameter, whereas the mean of a sample is an estimate (or statistic) of the population mean.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why is an an estimate (derived from a sample) rarely, if ever, exactly the same as the population parameter being estimated—especially in large populations?

A

because sampling is influenced by chance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

At its core, statistics asks…

A

At its core, statistics asks: given uncertainty arising from random sampling, how reliable is an estimate, and how much confidence should we place in decisions based on it? Put differently, how close is a sample-based estimate expected to be to the true population value?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Is the goal of statistics to quantify or eliminate uncertainty?

A

The goal is not to eliminate uncertainty (one can’t), but to quantify it, so that we can make decisions with a known degree of certainty.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are 3 examples of properties of estimators?

A

1) Mean
2) Variance
3) Sd

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a sampling distribution and what does it describe?

A
  • A sampling distribution is the probability distribution of an estimator generated by random sampling from a population.
  • It describes what values the estimate would take if we were to repeatedly sample from the same population under identical conditions.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do sampling distributions and frequency distributions differ?

A

Sampling distributions describe probabilities of possible estimates, not frequencies of observed data values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is µ?

A

µ = population mean (we say “mu”, Greek alphabet).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is σ?

A

σ = population standard deviation (we say “sigma”)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is σ^2?

A

σ2 = population variance (we say “sigma squared”)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is X̄ or Ȳ?

A

X̄ = sample mean (we say “X bar”, LaBn or Roman alphabet).
–> While µ always represents the population mean of a variable (e.g., X), the symbol for the sample mean depends on the variable being measured. For example, the sample mean of X is written as X̄, and the sample mean of Y is written as Ȳ. The key point is that the sample mean is always indicated by a bar over

17
Q

What is s?

A

s = sample standard deviation

18
Q

What is s^2?

A

sample variance

19
Q

What is the first property of sampling distributions?

A

Property 1: The mean of all sample means is always equal to the population mean

20
Q

When the mean of all possible sample means = the population parameter, the estimate is said to be…

A

the sample mean is unbiased because, under random sampling, the sample means do not systematically tend to be either larger or smaller than the true population mean

21
Q

When sampling is biased, what does this mean about the differences in sample estimate and population value?

A

When sampling is biased, differences between the sample estimate and the population value are no longer due solely to random variation—they reflect a systematic distortion of the sampling process

22
Q

How does sample size affect the precision of sample estimates under random sampling?

A
  • As sample size increases, sampling variability decreases, so sample means cluster more tightly around the true population mean, yielding more precise estimates.
23
Q

Does random sampling eliminate sampling error?

A

Random sampling does not eliminate sampling error, but it prevents systematic bias and allows sampling error to be measured

24
Q

What is the 2nd property of sampling distributions?

A

Property #2: Under random sampling, increasing sample size reduces sampling variability, so sample means cluster more tightly around the true population mean, resulting in greater precision.

25
What are sampling distributions best represented by when the estimator is continuous?
Sampling distributions are best represented by probability density functions (PDFs) when the estimator is continuous, as is the case for sample means except in trivial or extremely small populations
26
What does probability density give you?
Probability density does not give the probability of a single exact value, but rather how densely probability is concentrated around that value. Probabilities are obtained by integrating the PDF over an interval.
27
How is probability density affected by sample means (sample means near or far from the pop mean)?
- Sample means near the true population mean occur often → they have high probability density. - Sample means far from the population mean occur rarely → they have low probability density
28
How do you plot the probability distribution of the sample mean?
If you plot all possible sample means and how frequently they occur, you get a smooth curve. That curve is the probability distribution of the sample mean
29
What do the height of the curve and area under the curve of a probability distribution of the sample mean (curve) tell you?
- The height of the curve at a value tells you the probability density (how common nearby values are). - The area under the curve over an interval gives you an actual probability