Categorical variable
Assigns labels that place individuals into a group or category
Quantitative variable
Takes on numerical values where it makes sense to take an average
Distribution
Tells us what values a variable takes on and how often they occur
Frequency table
Shows number of individuals that have each attribute
Relative frequency table
Shows proportion or percentage of individuals that have each attribute
Marginal relative frequency
Total number of individuals in one category divided by table total
SEE page 8
Joint Relative frequency
Individuals that belong to 2 categories divided by table total
SEE page 8
Conditional relative frequency
Individuals in one category divided by total of another category
SEE page 8
Conditional distribution
All the conditional relative frequencies listed together for a variable
SEE page 8
Symmetric data plot
One peak (unimodal) and points are the the same on either side (symmetrical)
SEE page 14
Skewed to the right data plot
One peak, data points are mostly on the left (the lower end)
SEE page 14
Skewed to the left data plot
One peak, data points are mostly to the right (high side)
SEE page 14
Unimodal plot
Data plot with one peak
SEE page 14
Bimodal plot
Data plot with two peaks
SEE page 14
Approximately Symmetric
Can have multiple peaks, but is roughly a symmetric plot
SEE page 14
How to describe the distribution of a quantitative variable
SOCS!!!
Shape
Outliers
Center
Spread
***AP exam tip: include CONTEXT
Histogram Def
A histogram shows each interval of values as a bar. The heights represent how many individuals are in each interval
NOT a bar graph
For QUANTITATIVE data
Histogram Rules
Choose equal width for each bar
Draw and label each axis
Bars touch each other and y-axis
Scale the axes appropriately
Mean
Average of a data set
Symbol: x̄
Statistic
A value that describes a SAMPLE of data
Parameter
A value that describes an entire POPULATION
Resistant
A statistical measure that isn’t sensitive to outliers
Median
Midpoint of the data set value where 1/2 data lies below and 1/2 data lies above it
Median IS resistant
Comparing Median & Mean
If mean = median (symmetric)
If mean > median (skewed right)
If mean < median (skewed left)