What are parametric models?
Models with a fixed number of parameters
What are non-parametric models?
Models for which the number of parameters is not fixed, it can increase when more data arrives
What are examples of non-parametric models?
KNN, decision trees, random forests, kernel methods, Gaussian processes
What are the three components of a supervised learning model
Model type, cost function, optimizer
What is supervised learning?
A type of ML where the model is trained on labeled data to predict outcomes.
What is unsupervised learning?
ML where the model finds hidden patterns in unlabeled data.
Explain bias-variance trade-off.
Bias: error from overly simplistic model. Variance: error from sensitivity to training data. Trade-off is balancing both for best generalization.
What is L1 regularization?
Adds the sum of absolute values of weights to the loss function; encourages sparsity.
What is L2 regularization?
Adds the sum of squared weights to the loss function; discourages large weights.
What is cross-validation?
A technique for assessing model performance by training/testing on different subsets of the data.
When would you use precision over recall?
When false positives are more costly than false negatives.
When would you use recall over precision?
When false negatives are more costly than false positives.
What is ROC-AUC?
Area under the ROC curve; measures model’s ability to distinguish between classes.
What is overfitting?
When a model learns noise in training data and performs poorly on unseen data.
What is the sigmoid function?
s(z) = 1 / (1 + e^(-z)), e.g. s(0) = 1/(1+1) =0.5
- it maps a real valued number into the interval (0,1)
- useful for turning logits (raw prediction values) into probabilities in binary classification models