Unit 1 Flashcards

(32 cards)

1
Q

Categorical variable

A

Assigns labels that place individuals into a group or category

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Quantitative variable

A

Takes on numerical values where it makes sense to take an average

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Distribution

A

Tells us what values a variable takes on and how often they occur

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Frequency table

A

Shows number of individuals that have each attribute

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Relative frequency table

A

Shows proportion or percentage of individuals that have each attribute

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Marginal relative frequency

A

Total number of individuals in one category divided by table total

SEE page 8

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Joint Relative frequency

A

Individuals that belong to 2 categories divided by table total

SEE page 8

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Conditional relative frequency

A

Individuals in one category divided by total of another category

SEE page 8

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Conditional distribution

A

All the conditional relative frequencies listed together for a variable

SEE page 8

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Symmetric data plot

A

One peak (unimodal) and points are the the same on either side (symmetrical)

SEE page 14

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Skewed to the right data plot

A

One peak, data points are mostly on the left (the lower end)

SEE page 14

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Skewed to the left data plot

A

One peak, data points are mostly to the right (high side)

SEE page 14

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Unimodal plot

A

Data plot with one peak

SEE page 14

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Bimodal plot

A

Data plot with two peaks

SEE page 14

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Approximately Symmetric

A

Can have multiple peaks, but is roughly a symmetric plot

SEE page 14

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How to describe the distribution of a quantitative variable

A

SOCS!!!

Shape
Outliers
Center
Spread

***AP exam tip: include CONTEXT

17
Q

Histogram Def

A

A histogram shows each interval of values as a bar. The heights represent how many individuals are in each interval

NOT a bar graph
For QUANTITATIVE data

18
Q

Histogram Rules

A

Choose equal width for each bar
Draw and label each axis
Bars touch each other and y-axis
Scale the axes appropriately

19
Q

Mean

A

Average of a data set

Symbol: x̄

20
Q

Statistic

A

A value that describes a SAMPLE of data

21
Q

Parameter

A

A value that describes an entire POPULATION

22
Q

Resistant

A

A statistical measure that isn’t sensitive to outliers

23
Q

Median

A

Midpoint of the data set value where 1/2 data lies below and 1/2 data lies above it

Median IS resistant

24
Q

Comparing Median & Mean

A

If mean = median (symmetric)
If mean > median (skewed right)
If mean < median (skewed left)

25
Range
Maximum — Minimum Range is a single value
26
Standard deviation
Measures the average distance the data values are from the mean Symbol: Sx SEE page 24
27
Standard deviation properties
Always ≥ 0 Larger Sx indicates larger variation NOT resistant to outliers Only use when mean is the chosen measure of center
28
Sample variance
Standard deviation squared
29
Quartiles
First quartile (Q1): Median of the first 1/2 of data Third Quartile (Q3): Median of the 2nd half of data Interquartile range (IQR): Range of the middle 50% of the distribution Formula: Q3 — Q1 (IS resistant)
30
Outlier Rules
Low outlier < Q1 — (1.5 x IQR) High outlier > Q3 + (1.5 x IQR)
31
Five number summary
Minimum Q1 Median Q3 Maximum
32
Boxplots
Visual representation of the 5 number summary Drawn above one horizontal axis Box starts at Q1 and ends at Q3 Median is marked inside the box Whiskers start and ends at points low and high that are NOT outliers Outliers grouped at separate points