Describe the 8 Steps and Approach to Data Analysis.
What is coding?
Coding = process of translating information gathered from questionnaires or other sources into something that can be analyzed.
assigning a value to the information given (value may be given a label)
Coding can make data more consistent: – Example: Question = Gender
Answers = Male or Female, M or F, 1 or 2?
State the 3 steps of data cleaning
What do we screen for?
–Duplicate records
– Invalid and out-of-range codes
– Missing data
– Outliers (unlikely values)
– Lack of variability
– Unlikely patterns, including
reverse polarity
– Skip pattern checks
– Logic checks
How do we resolve issues during data cleaning?
What do we look for in terms of diagnosis?
Go back to the original data
source, if possible
– Error– Missing data– True extreme– Cannot determine
What is ana analysis plan based on?
Analysis Plan is based on what question(s) you need to answer, what information you want to communicate, and what data you have