What are the three main ways of collecting data?
What is a sample?
A subset of the target population
What are the benefits of Random sampling ?
Describe precision
-Implies that the value of the statistic is similar in all samples
Describe Bias
-Implies that the sample statistic differs systematically
Describe the process of systematic random sampling with a random start
The process of splitting a sample into groups or subsets and then sampling is called…
Stratified Random sampling
Name the 8 types of non-sampling errors
What is the term for taking a group in an experiment and splitting them up (By age for example)?
Blocking of experimental units
What are the two types of observational studies?
- Retrospective (for past events)
Why might you use a observational study?
If it is impossible, unethical or impractical to conduct an experiment
What is a confounding variable?
A factor not accounted for that introduces a difference in outcomes
What are the three types of plot sampling?
What are the descriptions of response and explanatory variables ?
- Explanatory–>variable that helps us explain or predict the response variable
What are the two types of quantitative data variables?
- Discrete (distinct values)
What are the two types of qualitative data variables?
- Nominal (distinct by name only like green or October)
What is the interquartile range ?
The difference between the 75th quantile and the 25th quantile
Describe Histograms
- Number of points in each bin on the Y axis
Describe the shape of a histogram for the following cases:
What are some of the possible ways to visually display data?
What are the lines that extend out of a box plot?
Whiskers and they extend to 1.5 times the interquartile range
Describe the Normal distribution
- Defined by two parameters, mean and variance
Describe what confidence intervals are used to represent
A confidence interval is used to represent a range of values we are either 95 or 99% confident that the true value for the mean or statistical value lies