What is data manipulation?
Cleaning, transforming, organising data for analysis.
Why manipulate data?
Make raw data suitable for analytics tasks.
Give examples of manipulation processes.
Filter, create variables, summarise, impute, reorder observations.
List out example of Narrowing observations?
Select recent year products or subset by condition.
List out example of Variable creation
Compute BMI from weight and height
List out example of summarising
Calculate counts, means, or group summaries.
List out example of Imputation
Add or update values to fill missing data.
What is the tbl class (tidyverse)?
A tidy data frame variant with improved behavior and printing
How do tibbles print?
Show first 10 rows; truncate columns for legibility.
Name behaviours tibbles have vs data.frames.
What does cbind() do?
Column-bind objects ( matr -> matrices ) and joining data frames -> df ) ; rows count must match.
What does rbind() do?.
Row-bind objects ( matr -> matrices ) and joining data frames -> df ); columns must match names and count
Which join merges carrier names into flights?
left_join(flights, airlines, by=\"carrier\")
What does left_join keep?
All left table rows; matched right table values added.
What must match for rbind()?
Same columns and names across data frames.
What must match for cbind()?
Same number of rows across objects.