Descriptive statistics
Methods of summarizing the data in an informative way
Inferential statistics
Methods to draw conclusions (or to make inferences, test hypotheses) • Mean difference test • Chi-square test • Analysis of variance (ANOVA) • Regression analysis • Logit analysis
Four types of scales
Nominal scale
allows classifying data into groups/categories
e.g. gender
Ordinal scale
rank orders in a meaningful way
e.g. education level
Interval scale
Meaningful differences between values, but no natural zero point –> zero means something (0 degrees)
Ratio scale
Meaningful differences and ratios between values due to a natural zero point –> zero is actually nothing (0 dollar is no money)
Choosing between inferential statistics:
IV=nominal/ordinal DV=nominal/ordinal
Chi-square test
Choosing between inferential statistics:
IV=nominal/ordinal DV=interval/ratio
T-test, Anova
Choosing between inferential statistics:
IV=interval/ratio DV=nominal/ordinal
logit analysis
Choosing between inferential statistics:
IV=interval/ratio DV=interval/ratio
regression analysis
When to perform T-Test vs Anova
T-Test –> compare two means (two levels of IV)
Anova –> compare more than two levels
Rating scales
TREATED AS INTERVAL/RATIO so that you can use regression
What is a population?
Entire group of people, firms, events, or things of interest for which you would like to make inferences
What is a sample?
A subset of the population of interest
What is a subject?
Single member
What is low representativeness?
= properties of the population are over- or underrepresented in the sample
= high sampling error
The sampling process
e.g. students TISEM, dutch organ donors
“Physical” representation of the target population
- where you can reach out to e.g. Donorregister
coverage error
sampling frame ≠ population
• Under-coverage: true population members are excluded
• Miss-coverage: non-population members are included
solutions to coverage error
* If large, redefine the population in terms of the sampling frame
probability vs non-probability sampling
Probability sampling
Each element of the population has a known chance
of being selected as a subject
–>Results generalizable to population
BUT more time and resource intensive