data manipulation
a branch of mathematics that involves the collection, analysis and interpretation of data
what are the 2 types of statistics
descriptive and inferential
where to start in data analysis
data preparation
coding
checking
data cleaning
preparation of variables/categories
what does checking and cleaning data look like?
you check if all the information is there and you decide whether something might be inferred or excluded
what is preparations of variables/categories
after the cleaning and
what is descriptive statistics
includes descriptions and summaries of the data
includes tables, graphs and describes basic patterns
what are examples of descriptive statistics
frequency tables
graphs/plots
numerical summaries
what is inferential statistics
comparing groups and discovering relationships
what do you do with missing data?
you can use the mean
use the midpoint
mean of similar respondents
predicted value form regression analysis
other sources of information
what is a univariate analysis?
analysis of one variable at a time
what is a multivariate analysis?
Examines the relationship between three or more variables
- Spuriousness exists if two variables are correlated but only through a third
variable
what is the mode?
the score that shows up the most
what is the mean?
the average of EVERY number
it is the most affected my outliers
what is the median?
the perfectly middle of all the scores
what are measures of dispersion
the amount of variations
- range
- standard deviation
explain the range
showed influence of outliers
explain standard deviation
influenced by outliers. is the average distance bewttwn values and means
what is bivariate analysis
Determines whether there is a relationship between two variables
what is Pearson’s correlation
normally used with continuous data (interval/ratio data)
used to see the strentgh and direction of relationship bewteen 2 continous variables
what does a positive skewed distribution look like
aka right skewed, the right is missing
what does a negative skewed distribution look like
aka a left skew, the left is missing
type 2 error
failing to reject false null hypothesis