What are the three important approaches for classification problems?
SVM: what is the high-level intuition behind this?
SVM what is the Affine function for our hyperplane?
SVM: What is our support vector and the notion of the “best hyperplane”?
SVM: How do you actually form the support vector?
SVM: What do we do when our features are generally not linearly separable?
SVM: What is our dual optimisation problem when we account for soft margins?
What are some non-linearly separable problems?
What are Kernels and the Kernel trick?
What is the RBF Kernel?
How does SVM look on a graph with a linear kernel vs an RBF kernel?
What is the Summary of SVM?
What is the Logistic Regression? What is the output of the model?
What does it estimate?
What is the log-loss function and what are we trying to do with it?
Optimisation 1?
What is a convex and non-convex loss function?
What is the difference between a local minimum/global minimum, and a unique minimum?
What is the optimisation problem in our linear regression?
What 3 pieces of terminology is used interchangeably within optimisation problems?
Do they have any nuanced differences?
Generally, what does constrained optimisation look like?
What is the ordinal encoding of categorical features?
What is one-hot and dummy encoding of categorical data?
What are performance measures?
What are the common ones for classifications?
What are the common ones for regression?
What is a confusion matrix?
Imagine we are testing for something bad e.g. if someone has an illness)
How do we calculate accuracy as a performance measure?
What is it importance and it limitations?
What is precision and recall as performance measures?
What is F1 or, more generally, F as a performance measure?
What is the Receiver Operating Characteristic Curve?
What is the Area Under the Curve (AUC)?