What are some ways to compare distributions?
How should we treat outliers?
With attention and care.
What can re-expression of data do and what are some ways to do it?
When comparing the distributions of several groups using histograms or stem-and-leaf displays, consider their…?
When comparing groups with boxplots, compare the…?
-Compare the shapes. Do the boxes look symmetric or skewed? Are there differences between groups?
-Compare the medians. Which group has the higher centre? Is there any pattern to the medians?
-Compare the IQRs. Which group is more spread out? Is there any pattern to how the IQRs change?
-Using the IQRs as a background measure of variation, do the medians seem to be different, or do they just vary in the way that you’d expect from the overall variation?
-Check for possible outliers. Identify them if you can and discuss why they might be unusual. If an outlier turns out to be an error, correct it.
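The boxplot checklist above can be turned into numbers. The following is a minimal sketch, not a definitive method: the function name `boxplot_summary`, the two groups, and their values are made up for illustration, and the 1.5 × IQR fences used to flag possible outliers are the usual boxplot convention rather than something stated on this card. Quartiles are computed with Python's `statistics.quantiles` using the "inclusive" method; other quartile conventions can give slightly different values.

```python
import statistics


def boxplot_summary(values):
    """Return the median, quartiles, IQR and possible outliers of one group."""
    # quantiles() sorts the data itself; n=4 yields Q1, median, Q3.
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    # Assumed convention: points beyond 1.5 IQRs from the box are flagged.
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    outliers = [v for v in values if v < low_fence or v > high_fence]
    return {"median": median, "q1": q1, "q3": q3, "iqr": iqr, "outliers": outliers}


# Hypothetical data for two groups, invented for this example only.
groups = {
    "A": [1, 2, 3, 4, 5],
    "B": [2, 4, 6, 8, 50],
}

for name, values in groups.items():
    s = boxplot_summary(values)
    print(name, "median:", s["median"], "IQR:", s["iqr"], "possible outliers:", s["outliers"])
```

Reading the output side by side mirrors the checklist: compare the medians, compare the IQRs, judge the gap between medians against the IQRs, and then look into any flagged values before deciding whether they are errors.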
Define ‘Timeplot’.
A timeplot (often called a time series plot) displays data that change over time. Often, successive values are connected with lines to show trends more clearly. Sometimes a smooth curve is added to the plot to help show long-term patterns and trends.
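One simple way to get the smooth curve mentioned above is a centred moving average. This is only a sketch under stated assumptions: the card does not say which smoother is meant, the function name `moving_average` and the default window of 3 are hypothetical choices, and the plotting step itself is left out so the example needs nothing beyond plain Python.

```python
def moving_average(values, window=3):
    """Smooth a series by averaging each point with its neighbours.

    The window must be a positive odd number so it can be centred on a point.
    Near the ends, where a full window is not available, the average is taken
    over the points that do exist.
    """
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    half = window // 2
    smoothed = []
    for i in range(len(values)):
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        smoothed.append(sum(values[lo:hi]) / (hi - lo))
    return smoothed


# Hypothetical successive measurements, invented for this example only.
series = [1, 2, 3, 4, 5]
print(moving_average(series, window=3))
```

Plotting the original values connected by lines, with the smoothed values drawn on top, gives a timeplot with a trend curve. A wider window gives a smoother curve but hides more short-term change.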