Chapter 2: Describing Quantitative Distributions With Numbers Flashcards

(42 cards)

1
Q

What is a parameter?

A

A number that describes the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a statistic?

A

A number that can be computed from the sample data without making use of any unknown parameters; a number that describes a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

True or False?: Statistics are often used to describe parameters.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the median?

A

The midpoint of a distribution; the number such that half the observations are smaller and the other half are larger.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How do you find the median?

A
  1. Arrange all observations from smallest to largest.
  2. Find the median by calculating (n+1)/2, then count up from the smallest observation this many times.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a percentile?

A

The value such that some percent (p) of observations in a distribution lie below it is called the pth percentile.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is another term to describe the 50th percentile?

A

Median.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a resistant measure?

A

A measure of any aspect of a distribution that is relatively unaffected by changes in the numerical value of a small proportion of the total number of observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Is mean a resistant measure?

A

No.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Is median a resistant measure?

A

Yes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Why is median usually reported rather than the mean for skewed distributions?

A

Because the mean is sensitive to extremities or outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the range?

A

The distance between a distribution’s largest and smallest values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the interquartile range (IQR) and how is it calculated?

A

The distance between the first and third quartiles; Q3-Q1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the first quartile (Q1)?

A

The value larger than 25% of the distribution’s values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the second quartile (Q2)?

A

The value larger than 50% of the distribution’s values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the third quartile (Q3)?

A

The value larger than 75% of the distribution’s values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How do you calculate Q1?

A

Take the median of the observations that are positioned to the left of the overall median (do not include the value of the overall median).

18
Q

How do you calculate Q2?

A

Take the median of the observations that are positioned to the right of the overall median (do not include the value of the overall median).

19
Q

What does the five number summary include?

A

The distribution’s minimum, Q1, Q2, Q3, and maximum.

20
Q

What is the most common numerical description of a distribution?

A

Mean and standard deviation.

21
Q

What is variance (s^2)?

A

The measure of spread about the mean of a distribution; standard deviation squared.

22
Q

What does variance tell you?

A

How much the data varies from the mean.

23
Q

What are the degrees of freedom?

24
Q

If mean is not used as a measure of center, should standard deviation be used as a measure of spread?

A

No. Standard deviation measures spread about the mean and should only be used when the mean is also being used/reported.

25
When does s=0?
When there is no spread.
26
When is s<0?
Never. s is always greater than or equal to 0.
27
True or False?: Standard deviation is a resistant measure of spread.
False. Standard deviation is not a resistant measure.
28
What type of graph displays the five number summary?
A boxplot.
29
True or False?: A boxplot shows less detail than a histogram, but more detail than a dotplot.
False. A boxplot shows less detail than both a dotplot and a histogram.
30
In a box plot, when is Q1 and Q3 equally distant from the median?
When the distribution is symmetric.
31
In a box plot, when is Q1 further from the median than Q3?
When the distribution is skewed to the left.
32
In a box plot, when is Q3 further from the median than Q1?
When the distribution is skewed to the right.
33
How does the IQR rule define suspected outliers?
Per the IQR rule, a value is determined to be a suspected outlier if it falls more than 1.5IQR above Q3 or more than 1.5IQR below Q1.
34
True or false?: If a value is determined a suspected outlier by the IQR rule, it should always be considered an outlier.
False. A value considered a suspected outlier by the IQR rule should not be considered an outlier if it still follows the pattern of the distribution.
35
What are the three possible reasons for outliers in data?
Human error in experimentation or data collection, human error in recording information, and the unexplainable.
36
What are the steps one should take to organize a statistical problem?
1. State - state what the practical question is in context of the real world setting. 2. Plan - Plan what specific statistical operations are necessary for the problem. 3. Solve - analyze the data with graphs and computations suitable for the problem. 4. Conclude - give a practical conclusion in the setting of the real world problem.
37
What numerical summaries are suitable for a symmetric distribution?
Mean and standard deviation.
38
What numerical summaries are suitable for a non-symmetric distribution?
The five number summary.
39
True or false?: True parameters can never be found.
True.
40
Why is a boxplot advantageous?
A boxplot displays the information gained from a bar graph but in more detail.
41
How is measurement of spread displayed on a bar graph?
Through an error bar.
42
What does an error bar usually display, specifically?
Standard deviation.