Feature Engineering Flashcards

(11 cards)

1
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is one-hot encoding?

A

Representing categorical variables as binary vectors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is normalization?

A

Scaling data to a range, usually [0,1].

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is standardization?

A

Transforming data to have mean 0 and variance 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How to handle missing data?

A

Options: imputation, removal, or using algorithms that handle missing values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is PCA?

A

Dimensionality reduction method finding orthogonal components maximizing variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is feature selection?

A

Choosing a subset of relevant features to improve model performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is multicollinearity?

A

When independent variables are highly correlated, affecting model stability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is target encoding?

A

Encoding categories by replacing them with target variable statistics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is binning?

A

Grouping continuous variables into discrete intervals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is SMOTE?

A

Synthetic Minority Oversampling Technique for balancing class distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly