label
= a specific variable used in some data sets to distinguish the different cases
variable
= a characteristic of a case
- different cases can have different values of variables
categorical variable
= places a case into one of several groups/ categories
categorical nominal
= equal categories (e.g. gender (male/ female))
categorical ordinal
= ordered categories (e.g. education level (low/ average/ high)
–> òrdering´
quantitative variable
= takes numerical values for which arithmetic operations such as adding and averaging make sense
quantitative interval
= meaningful numbers (e.g. IQ 50-150)
quantitative ratio
= an interval variable within an absolute zero point (e.g. age)
case
= an object that is described by the data
exam a distribution
= look for overall pattern like the shape, the spread, the center
–> if distribution is not symmetric, it’s tighter right skewed (positive) or left skewed (negative
5-number summary
asked for:
minimum, maximum; median; Q1, Q3
–> used to make a boxplot
measures of centre:
2. standard deviation
1. = in order to compare different distributions with each other & have a linked component z = x-x̄/Sx
2. inferential statistics