If data is coded using the formula
y = (x - a) / b,
What is the mean of the code given by?
ȳ = (x̄ - a) / b
If data is coded using the formula
y = (x - a) / b,
What is the standard deviation of the code given by?
σy = σx / b
In coding and data, what is the mean effected by and the standard deviation affected by?
The mean is affected by ALL operations, the standard deviation is ONLY affected by × or ÷
What is an outlier?
An outlier is an extreme value that lies outside the overall pattern of data
What is cleaning the data?
The process of removing anomalies from the data.
What should you do before drawing a box plot?
Remove any anomalies/ outliers
On a histogram, where does the frequency density go?
It goes on the vertical axis
How do you calculate frequency density?
Frequency density =
frequency / class width
In a histogram what are the following proportional to?
When making comparisons between two sets of data, what must you comment on?
And which types of data just pair together?
PAIRS:
- Compare mean with standard deviation
- Compare median with IQR