Models Flashcards

(11 cards)

1
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is linear regression?

A

A model that predicts continuous output as a linear combination of inputs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is logistic regression?

A

A classification algorithm predicting probabilities using the logistic function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How does a decision tree split nodes?

A

By choosing splits that maximize information gain or minimize Gini impurity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is random forest?

A

An ensemble of decision trees trained on bootstrapped samples with feature randomness.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is gradient boosting?

A

Sequentially builds models that correct errors of prior models using gradient descent.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Compare XGBoost and LightGBM.

A

LightGBM uses histogram-based algorithm and is faster on large datasets; XGBoost more mature with regularization options.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is SVM?

A

A classifier that finds a hyperplane maximizing margin between classes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is kNN?

A

Instance-based method classifying samples by the majority label of k nearest neighbors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Explain DBSCAN clustering.

A

Groups together points close to each other with many neighbors, marking outliers as noise.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is overfitting in decision trees?

A

When trees become too deep and memorize training data patterns instead of generalizing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly