What is discrete data
non decimal/ non fractional number
for example number of children in a classroom (you can’t have decimal amount of children)
What is continuous data?
includes decimals
E.G height, weight
What is a measure of central tendancy?
where the center of our data falls
What does it mean when data is skewed?
What is a positive skew and what is a negative skew? Draw them both
Data can be “skewed”, meaning it tends to have a long tail on one side or the other:
When do you use mean as a measure of central tendency? When do you not use the mean as a measure of central tendency?
when your data distribution is continuous and symmetrical when it is quantitative and uses all pieces of its data
Not:
When do you use the median as a measure of central tendency
When do you use the mode as a measure of central tendency?
used for nomial data (data that can be labelled or classified into mutually exclusive categories within a variable.
These categories cannot be ordered in a meaningful way.
For example, for the nominal variable of preferred mode of transportation, you may have the categories of car, bus, train, tram or bicycle)
What are the advantages and disadvantages of box plots?
Pros:
Cons:
When is the regression line a valid model?
when the data shows linear correlation
stronger correlation = higher accuracy
When trying to estimate a DEPENDENT variable (y coord)
What is a census? Name the pros and cons
When each member of a population is used
Pros:
Cons:
What does a sampling frame mean?
the source material or device from which a sample is drawn
What are the three METHODS of random sampling?
Give definitions
What is the equation for the number sampled in strata?
number in strata/ number in population x overall sample size
Why is random sampling useful?
it removes bias
What are the pros and cons of simple random sampling?
What are the pros and cons of systematic sampling
What are the pros and cons of stratified sampling?
What are the two types of NON-random sampling?
What are the definitions?
What are the pros and cons of Quota sampling?
What are pros and cons of opportunity sampling?
What are the types of data/variables
(data is interchangeable with variables)
What is meant by population?
a set of data that you can take a sample of
the whole set of items that are of interest
When do you increase the lower and upper bound by 0.5
when the classes do not overlap
e.g. 10
What is the equation for variance and standard deviation for raw data