When do we need stats in the research process?
After collecting data - need to summarize & communicate findings
2 types of stats:
How should we most efficiently present research:
Want to convey maximum information using minimum space
Purpose of descriptive statistics?
Summarizes mass of data points
- Understanding and interpretation
- Visual displays, appropriate calculations
In experiments can calculate within each
- condition/group
- Mean, standard deviation…
In correlation designs
- For each variable, calculate mean, standard deviations, etc
- For every pair of variables, calculate a correlation coefficient (also a descriptive statistic)
3 Types of Descriptive Statistics
Scales of Measures
Nominal
Group or categorization
- No order or direction
- Summarized by proportion/percentages or the mode
Ordinal
Ranked order (1st, 2nd, 3rd..)
- Uneven spaces between “scores”
Interval
Numerical scales in which intervals have the same interpretation throughout but no true zero (e.g. temp in celsius - 0 deg still indicates a temperature)
Ratio
An interval scale with a true zero reference point (e.g. 0 pounds)
- Summarized with the mean or median and standard deviation
Measures of central tendency
3 measures of central tendency:
Mean, Median, Mode
Mean
= arithmetic average
Downsides of Mean:
□ Affected by outliers (i.e., extreme scores)
Upsides of mean:
□ With increasing sample size, each extreme score has less effect on the mean.
□ Maximizes use of all of our data.
□ Has mathematical properties that enable us to
use it in statistical analysis.
Outliers - With increasing sample size, the mean is
Less affected by outliers
□ Main idea here - check for outliers if you only have a small sample, but try to get a large sample
Median
= score that divides group in half
How to find median:
Put scores in order. Count number of scores.
If odd #: identify the middlemost score.
If even#: identify two middle scores, take average of them.
When is median useful?
Whenever it’s most descriptively informative to report the value for which equal numbers of people score higher
and lower (e.g. income)
Mode
= most frequently occurring score
When is mode useful?
Whenever it’s most descriptively informative to
report the most frequently occurring score (e.g.: employee salary distribution)
Measures of Spread:
Variability:
The spread in a distribution of scores
AMOUNT of spread is often measured by Standard Deviation
How much each score deviates from the mean
Possible Measures of Variability
□ Range (max – min)
□ Variance
□ Standard Deviation