What is one-hot encoding?
Representing each category of a categorical variable as a binary vector with a single 1 in the position for that category and 0 elsewhere.
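A minimal sketch of the idea in plain Python. The helper name `one_hot_encode` and the sample colors are illustrative assumptions; sorting the categories is just one way to get a stable column order.

```python
def one_hot_encode(values):
    """Map each value to a binary vector containing a single 1."""
    # Sort the distinct categories so the column order is deterministic.
    categories = sorted(set(values))
    index = {cat: i for i, cat in enumerate(categories)}
    vectors = []
    for value in values:
        row = [0] * len(categories)
        row[index[value]] = 1
        vectors.append(row)
    return categories, vectors


categories, vectors = one_hot_encode(["red", "green", "blue", "green"])
print(categories)  # ['blue', 'green', 'red']
print(vectors)     # [[0, 0, 1], [0, 1, 0], [1, 0, 0], [0, 1, 0]]
```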
What is normalization?
Rescaling each feature to a fixed range, usually [0, 1], most commonly with min-max scaling: x' = (x - min) / (max - min).
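A short sketch of min-max scaling to [0, 1]. The function name `min_max_scale` and the choice to map a constant column to 0.0 are assumptions made for illustration.

```python
def min_max_scale(values):
    """Rescale values linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant column has no spread; returning zeros is one possible convention.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


scaled = min_max_scale([10, 20, 30, 50])
print(scaled)  # [0.0, 0.25, 0.5, 1.0]
```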
What is standardization?
Transforming each feature to have mean 0 and variance 1 by subtracting its mean and dividing by its standard deviation (the z-score).
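A possible sketch of z-score standardization using only the standard library. The name `standardize` is hypothetical, and the population standard deviation is used here as an assumption; some tools divide by the sample standard deviation instead.

```python
import statistics


def standardize(values):
    """Return z-scores: (value - mean) / standard deviation."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)  # population standard deviation
    if std == 0:
        # A constant column cannot be scaled to unit variance.
        return [0.0 for _ in values]
    return [(v - mean) / std for v in values]


# This sample has mean 5 and population standard deviation 2.
z = standardize([2, 4, 4, 4, 5, 5, 7, 9])
print(z)  # [-1.5, -0.5, -0.5, -0.5, 0.0, 0.0, 1.0, 2.0]
```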
How do you handle missing data?
Options: imputation (e.g. filling with the mean or median), removal of the affected rows or columns, or using algorithms that handle missing values natively.
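The first two options can be sketched as follows, assuming missing entries are represented as `None`. The helper names `impute_mean` and `drop_missing` are illustrative, and `impute_mean` assumes the column has at least one observed value.

```python
import statistics


def impute_mean(column):
    """Replace None entries with the mean of the observed values."""
    observed = [v for v in column if v is not None]
    fill = statistics.fmean(observed)
    return [fill if v is None else v for v in column]


def drop_missing(rows):
    """Keep only the rows that contain no None entries."""
    return [row for row in rows if None not in row]


ages = [20, None, 40, 30]
imputed = impute_mean(ages)
print(imputed)  # [20, 30.0, 40, 30]

rows = [(1, 2), (None, 3), (4, 5)]
complete = drop_missing(rows)
print(complete)  # [(1, 2), (4, 5)]
```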
What is PCA?
Principal Component Analysis: a dimensionality reduction method that projects data onto orthogonal components ordered by the amount of variance they capture.
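To make the definition concrete, here is a sketch restricted to two-dimensional data, where the eigen-decomposition of the covariance matrix has a closed form. The function name `pca_2d` is hypothetical; real workloads would normally use a linear algebra library and handle any number of dimensions.

```python
import math


def pca_2d(points):
    """Return (first_component, second_component, variances) for 2-D points."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    # Entries of the sample covariance matrix (divide by n - 1).
    sxx = sum((x - mean_x) ** 2 for x, _ in points) / (n - 1)
    syy = sum((y - mean_y) ** 2 for _, y in points) / (n - 1)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points) / (n - 1)
    # Angle of the direction of maximum variance for a symmetric 2x2 matrix.
    theta = 0.5 * math.atan2(2 * sxy, sxx - syy)
    first = (math.cos(theta), math.sin(theta))
    second = (-math.sin(theta), math.cos(theta))  # orthogonal to the first
    # Eigenvalues = variance captured along each component.
    half_trace = (sxx + syy) / 2
    spread = math.hypot((sxx - syy) / 2, sxy)
    return first, second, (half_trace + spread, half_trace - spread)


# Points on the line y = x: all variance lies along the diagonal.
first, second, variances = pca_2d([(1, 1), (2, 2), (3, 3)])
print(first, second, variances)
```

For this input the first component points along the diagonal and the second component captures essentially no variance, which is why dropping it loses almost nothing.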
What is feature selection?
Choosing a subset of relevant features to improve model performance, reduce overfitting, and lower training cost.
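One simple filter-style approach, sketched under the assumption that relevance is scored by absolute Pearson correlation with the target. This is only one of many possible criteria, and the names `pearson` and `select_top_k` as well as the toy data are illustrative.

```python
import math


def pearson(xs, ys):
    """Pearson correlation; returns 0.0 if either input is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    if sx == 0 or sy == 0:
        return 0.0
    return cov / (sx * sy)


def select_top_k(features, target, k):
    """Return the names of the k features most correlated with the target."""
    scores = {name: abs(pearson(col, target)) for name, col in features.items()}
    return sorted(scores, key=scores.get, reverse=True)[:k]


features = {
    "size": [1, 2, 3, 4, 5],
    "noise": [3, 1, 4, 1, 5],
    "constant": [7, 7, 7, 7, 7],
}
target = [2, 4, 6, 8, 10]
selected = select_top_k(features, target, k=2)
print(selected)  # ['size', 'noise']
```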
What is multicollinearity?
When independent variables are highly correlated with one another, making coefficient estimates unstable and hard to interpret.
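A rough way to spot the problem is to flag pairs of predictors whose absolute correlation exceeds a cutoff. The sketch below assumes a cutoff of 0.9, which is an arbitrary illustrative value, and uses hypothetical names and data. Pairwise checks can miss collinearity that involves three or more variables.

```python
import itertools
import math


def pearson(xs, ys):
    """Pearson correlation; returns 0.0 if either input is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    if sx == 0 or sy == 0:
        return 0.0
    return cov / (sx * sy)


def correlated_pairs(features, threshold=0.9):
    """Return (name_a, name_b, r) for pairs with |r| above the threshold."""
    flagged = []
    for a, b in itertools.combinations(features, 2):
        r = pearson(features[a], features[b])
        if abs(r) > threshold:
            flagged.append((a, b, r))
    return flagged


features = {
    "height_cm": [150, 160, 170, 180, 190],
    "height_m": [1.5, 1.6, 1.7, 1.8, 1.9],  # same information, different unit
    "weight_kg": [70, 55, 80, 60, 75],
}
flagged = correlated_pairs(features)
print([(a, b) for a, b, _ in flagged])  # [('height_cm', 'height_m')]
```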
What is target encoding?
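Encoding each category by replacing it with a statistic of the target variable for that category, such as the mean, computed on training data only to avoid target leakage.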
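A minimal sketch of mean target encoding. The function names, the toy data, and the choice to fall back to the global training mean for unseen categories are all assumptions; practical implementations often add smoothing or cross-fitting as well.

```python
from collections import defaultdict


def fit_target_means(categories, targets):
    """Compute the mean target value for each category in the training data."""
    sums = defaultdict(float)
    counts = defaultdict(int)
    for cat, y in zip(categories, targets):
        sums[cat] += y
        counts[cat] += 1
    return {cat: sums[cat] / counts[cat] for cat in sums}


def target_encode(categories, means, default):
    """Replace each category with its learned mean, or default if unseen."""
    return [means.get(cat, default) for cat in categories]


train_city = ["A", "B", "A", "C", "B", "A"]
train_y = [1, 0, 1, 0, 1, 0]
means = fit_target_means(train_city, train_y)  # A: 2/3, B: 1/2, C: 0
global_mean = sum(train_y) / len(train_y)      # 0.5

# "D" never appeared in training, so it receives the global mean.
encoded = target_encode(["A", "C", "D"], means, default=global_mean)
print(encoded)
```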
What is binning?
Grouping continuous variables into discrete intervals.
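A small sketch of binning with explicit cut points, using `bisect` from the standard library. The function name `bin_values`, the age cut points, and the labels are illustrative assumptions. Each interval includes its lower edge and excludes its upper edge.

```python
import bisect


def bin_values(values, edges, labels):
    """Assign each value the label of the interval it falls into.

    edges must be sorted, and labels must have len(edges) + 1 entries.
    """
    return [labels[bisect.bisect_right(edges, v)] for v in values]


ages = [5, 17, 18, 42, 65, 80]
binned = bin_values(ages, edges=[18, 65], labels=["child", "adult", "senior"])
print(binned)  # ['child', 'child', 'adult', 'adult', 'senior', 'senior']
```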
What is SMOTE?
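Synthetic Minority Oversampling Technique: balances class distribution by creating synthetic minority-class samples through interpolation between a minority sample and one of its nearest minority-class neighbors.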
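The interpolation step can be sketched as below. This is a simplified illustration, not a reference implementation: the function name `smote`, the default `k=2`, the fixed seed, and the toy points are assumptions, and at least two minority samples are required.

```python
import math
import random


def smote(minority, n_synthetic, k=2, seed=0):
    """Create synthetic points between minority samples and their neighbors."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_synthetic):
        base = rng.choice(minority)
        # The k nearest minority neighbors of base, excluding base itself.
        others = [p for p in minority if p is not base]
        neighbors = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbor = rng.choice(neighbors)
        # Pick a random point on the segment from base to the neighbor.
        gap = rng.random()
        synthetic.append(
            tuple(b + gap * (q - b) for b, q in zip(base, neighbor))
        )
    return synthetic


minority = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.0), (3.0, 3.0)]
new_points = smote(minority, n_synthetic=3)
print(new_points)
```

Because every synthetic point is a convex combination of two existing minority points, it always stays inside the region those points already span.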