Chapter 2 Flashcards

(24 cards)

1
Q

Grouped vs Ungrouped Frequency Distributions

A

Ungrouped: Only one value of the variable is recorded (eg. each value is recorded individually)
Grouped: Uses intervals. (eg. Instead of each value being shown, values are grouped into classes)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Examples of Ungrouped charts

A

Bar graph, pie chart

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Examples of Grouped charts

A

Histogram, pareto chart

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How do we ensure there are no gaps in a histogram?

A

We subtract from the lower limit and add to the upper limit to find a middle point

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a range within a frequency distribution called?

A

A class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a commonly used formula to find the # of classes?

A

1+ 3.3Log(n)
(where n is the number of data points)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How to determine the width of each bar in a histogram?

A

(Largest value - smallest value) / #Classes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How to find the lower limits of a histogram?

A

Use the smallest value or one slightly smaller than it
Add width until desired number of values is reached

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How to find the upper limits of a histogram?

A

Subtract 1 (or another appropriate number) from the minimums until desired number of values is reached

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the relative frequency?
(hint: relative to the rest of the data)

A

The frequency divided by n, the percentage of the data that falls inside the given class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the cumulative frequency?

A

The sum of each frequency one after another. Represents the amount of data that exists before the following class(es)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the relative cumulative frequency?

A

The cumulative frequency divided by n. Represents the percentage of data that exists before the following class(es)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are examples of central tendencies? (4)

A

Mean
Median
Mode
Midrange

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Which central tendencies are more affected by outliers? Why?

A

Mean and midrange tend to be more influenced by outliers since they deal with the extremes of the data.
Median and mode aren’t usually affected by outliers since they deal with the order and frequency of the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is an Outlier?

A

Data that does not fit in with the rest. Abnormally different data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the two main measures of position? (in regards to data in a data set)

A

Percentile
Quartile

17
Q

How to find the percentile VALUE of a data set?

A
  • Arrange the data
  • i = ((n+1)p)/100 (where n is sample size and p is %)
  • If i is an integer, count out
  • If i is not an integer, must take average of the two closest values
18
Q

How to find a given quartile of a data set?

A

Quartiles = P25, p50, p75
Follow same steps to find percentile.

19
Q

Do we always indicate the upper / lower limits on a box and whisker plot?

A

No. If no outliers are present, the upper/ lower limits become the smallest/ largest values.

20
Q

What is the IQR value of a box plot?

21
Q

How to find the upper / lower limits of a box plot?

A

Lower: Q1 - (IQR x 1.5)
Upper: Q3 + (IQR x 1.5)

22
Q

How to find weighted mean?

A

Sum of data times their frequencies divided by the sum of the frequencies

23
Q

What are the properties of a normal distribution? (3)

A

1) Symmetric about the mean
2) Mean median and mode all lie at the same position, all are equal
3) The curve’s size and shape are determined by the mean and standard deviation

24
Q

What is the empirical rule?

A

On BOTH sides of a distribution:
- 68% of data falls into one SD of the middle
- 95% Falls within two SD of the middle
- 99.7% falls within three SD of the middle