Module 2 Flashcards

(44 cards)

1
Q

Three measures of center for quantitative variables

A

Mean, Median, Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Mean

A

Average

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Finding average / sample mean

A

(y1 + y2 + y3) divided by total number of observations –> n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Median

A

Middle

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Finding median with odd number of observations

A

(n+1) divided by 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Finding median with even number observations

A

Take two middle observations:

(observation 1 + observation 2) divided by 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Outlier

A

Observation much smaller or greater than main body of observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Is the median affected by the outlier?

A

No

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Mode

A

Highest frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Three quartiles and corresponding percentiles

A

Q1 –> 25%
Q2 –> 50%
Q3 –> 75%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Five number summary

A

min, Q1, Q2, Q3, max

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Determining five number summary with odd number of observations

A

Find median –> Q2
Don’t include median when calculating Q1 and Q3

Q1 = (add two middle numbers) divided by two

Q3 = (add two middle numbers) divided by two

Find min and max

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Determining five number summary with even number of observations

A

Find median –> Q2
Include median when calculating Q1 and Q3

Q1 = (add two middle numbers) divided by two

Q3 = (add two middle numbers) divided by two

Find min and max

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Frequency

A

Number of observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Calculating relative frequency / proportion

A

Frequency divided by frequency total

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Marginal distribution

A

Totals of each frequency distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Response variable (y)

A

Main variable
Explained by or depends on explanatory variable (x)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

To get joint distribution from contingency table

A

Divide options by overall total then multiply by 100 to get percent

19
Q

When to use joint distribution

A

When sentence says
“and”

20
Q

When to use conditional distribution

A

When sentence mentions particular group

21
Q

Three measures of spread for quantitative variables

A

Range, IQR, standard deviation (SD)

22
Q

IQR

A

Difference between Q3 and Q1

Q3 - Q1

23
Q

How to determine outliers

A

Upper fence = Q3 + 1.5 x IQR

Lower fence = Q1 - 1.5 x IQR

Outlier if higher than upper fence and lower than lower fence

24
Q

Range

A

Largest - smallest

25
Population mean symbol and calculation
μ (mu) Sum of values divided by total population size (n)
26
Sample mean symbol and calculation
ȳ (y bar) Sum of values divided by total sample size (n)
27
Population standard deviation symbol and calculation
σ (sigma) Find population variance Square root population variance
28
Sample standard deviation symbol and calculation
s Find sample variance Square root sample variance
29
Population variance symbol and calculation
σ² 1. Find μ 2. Subtract μ from each data point 3. Square answers from step 2 4. Add all answers from step 3 5. Divide step 4 answer by n
30
Sample variance symbol and calculation
s² 1. Find ȳ 2. Subtract ȳ from each data point 3. Square answers from step 2 4. Add all answers from step 3 5. Divide step 4 answer by n-1
31
Does standard deviation have same unit as original data?
Yes
32
When is standard deviation zero?
No variance in data (all data is the same value)
33
When to use density and how to calculate it
Use when intervals have different lengths Relative frequency divided by length of interval / BINS
34
What does ( and ] mean for BINS?
( --> include that number and any number greater ] --> go up to that number but don't include the number itself
35
How to describe histogram
Shape --> number of humps / mods --> symmetric or skewed --> outliers Center --> mean, median, mode Spread --> range, SD, IQR
36
What type of distribution is a graph with no mods, one mod, two mods, and more than two mods?
Uniform --> no mods Uni-modal --> one mode Bimodal --> two mods Multimodal --> more than two mods
37
Describe what a graph looks like when symmetric and skewed left and right
Symmetric --> same shape reflected on both sides of center Left skewed --> highest mod on right --> tail on left Right skewed --> highest mod on left --> tail on right
38
What to use when measuring center and spread of histogram when histogram is skewed
Median --> center SD --> spread
39
What to use when measuring center and spread of histogram when histogram is symmetric
Mean --> center IQR --> spread
40
Order mean, mode, median least to greatest on right skewed histogram
Mode < Median < Mean
41
Order mean, mode, median least to greatest on left skewed histogram
Mean < Median < Mode
42
Box plot when symmetric
Q2 in the middle of Q3 and Q1 Max and min whiskers are same length
43
Box plot when right skewed
Q2 closer to Q1 Max whisker longer than min whisker
44
Box plot when left skewed
Q2 closer to Q3 Min whisker longer than max whisker