How many missing or incorrect values are there for engine size?
1 missing or incorrect value (recorded as 0)
Year registered
All cars were registered between 3rd and 9th June 2002 or between 6th and 12th June 2016
How many missing or incorrect values are there for mass?
92 missing or incorrect values (recorded as 0)
How many missing or incorrect values are there for CO2?
2 missing or incorrect values (recorded as 0)
How many missing or incorrect values are there for CO?
13 missing values (blank)
How many missing or incorrect values are there for NOx?
74 missing values (blank)
How many missing or incorrect values are there for parts (particulate emissions)?
3105 missing values (blank)
How many missing or incorrect values are there for HC (hydrocarbon emissions)?
1422 missing values (blank)
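Counts like the ones above can be checked with a short script. The sketch below is only illustrative: the column names and the three data rows are made up for the example and are not taken from the real large data set. It assumes a value counts as missing or incorrect when the cell is blank or recorded as 0.

```python
import csv
import io

# Hypothetical extract: column names and values are illustrative only,
# not copied from the real large data set.
SAMPLE_CSV = """EngineSize,Mass,CO2,CO,NOX
1598,1350,139,0.298,0.021
0,1200,120,,0.030
1998,0,0,0.150,
"""


def count_missing(rows, column):
    """Count entries in `column` that are blank or recorded as 0."""
    missing = 0
    for row in rows:
        value = row[column].strip()
        if value == "" or float(value) == 0:
            missing += 1
    return missing


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
counts = {column: count_missing(rows, column) for column in rows[0]}
print(counts)
```

With the real spreadsheet exported as a CSV file, the same function could be applied to each column in turn, provided the column names are adjusted to match the file.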
How can we avoid using missing/incorrect data?
Clean the data before sampling
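As a rough sketch of what "clean before sampling" could look like in practice, the example below removes any record with a blank or zero value in the fields being studied, then takes a simple random sample from what is left. The records, field names and the helper names `is_clean` and `clean_then_sample` are all hypothetical.

```python
import random

# Hypothetical records: the field names and values are made up for illustration.
cars = [
    {"id": 1, "engine_size": 1598, "mass": 1350, "co2": 139},
    {"id": 2, "engine_size": 0, "mass": 1200, "co2": 120},
    {"id": 3, "engine_size": 1998, "mass": 0, "co2": 0},
    {"id": 4, "engine_size": 1242, "mass": 1100, "co2": 115},
    {"id": 5, "engine_size": 1995, "mass": None, "co2": 129},
    {"id": 6, "engine_size": 999, "mass": 1050, "co2": 99},
]


def is_clean(car, fields=("engine_size", "mass", "co2")):
    """A record is clean if none of the chosen fields is blank, None or 0."""
    return all(car[field] not in (None, "", 0) for field in fields)


def clean_then_sample(records, n, seed=None):
    """Discard unclean records first, then take a simple random sample of size n."""
    cleaned = [record for record in records if is_clean(record)]
    rng = random.Random(seed)
    return rng.sample(cleaned, n)


sample = clean_then_sample(cars, 2, seed=1)
print([car["id"] for car in sample])
```

Cleaning first means that every sampled record is usable, so the sample size does not shrink after the sample has been chosen.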
Summary (4 points)
Makes of cars in the large data set
Regions included in the large data set
Propulsion types in the large data set
Keeper title IDs
Units used in the large data set
What does the mass include in the large data set?
Which propulsion types are there only one of in the whole data set?