Variable types
Formats to store data
Exploratory Data Analysis
Summary Statistics
Matplotlib
Seaborn
Scatter plots
Histograms
Density Plots
Line Charts
Bar Chart
-discrete variable (countable, finit) & categorical data
- e.g. countries & body_mass_index
Boxplot
Heatmaps
Cleaning & preprocessing data - Common issues
Data Cleaning
Removing values vs imputing
Depends on situation
Removing values
Imputing
Problem & Methods for scale of data
Creating new variables
sometimes good to combine features into one feature by creating a ration of 2
e.g. Combining weight & height to BMI
Normalization
Standardization