L8: Classification Problems Flashcards

(26 cards)

1
Q

What are the three important approaches for classification problems?

A
2
Q

SVM: what is the high-level intuition behind this?

A
  • Imagine each column of our design matrix X as a feature axis, so each data point sits in a feature space with two characteristics we want to separate the data on.
  • Ideally, the data would be linearly separable, with each class sitting on a different side of a hyperplane –> we are trying to find how to split this space so as to categorise our data correctly.
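The intuition above can be sketched numerically. A minimal example, assuming scikit-learn is available, with two made-up Gaussian clusters standing in for linearly separable classes:

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)
# Two well-separated clusters, one row of X per data point, one column per feature
X = np.vstack([rng.normal(-2, 0.5, (20, 2)), rng.normal(2, 0.5, (20, 2))])
y = np.array([0] * 20 + [1] * 20)

clf = SVC(kernel="linear").fit(X, y)
print(clf.coef_, clf.intercept_)  # w and b of the hyperplane w.x + b = 0
print(clf.score(X, y))            # perfect accuracy on separable data
```

The fitted `coef_` and `intercept_` are exactly the hyperplane parameters the cards below discuss.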
3
Q

SVM what is the Affine function for our hyperplane?

A
4
Q

SVM: What is our support vector and the notion of the “best hyperplane”?

A
5
Q

SVM: How do you actually form the support vector?

A
6
Q

SVM: What do we do when our features are generally not linearly separable?

A
7
Q

SVM: What is our dual optimisation problem when we account for soft margins?

A
8
Q

What are some non-linearly separable problems?

A
9
Q

What are Kernels and the Kernel trick?

A
10
Q

What is the RBF Kernel?

How does SVM look on a graph with a linear kernel vs an RBF kernel?

A
11
Q

What is the Summary of SVM?

A
12
Q

What is Logistic Regression? What is the output of the model?

What does it estimate?

What is the log-loss function and what are we trying to do with it?

A
13
Q

Optimisation 1?

What is a convex and non-convex loss function?

What is the difference between a local minimum/global minimum, and a unique minimum?

A
14
Q

What is the optimisation problem in our linear regression?

A
15
Q

What 3 pieces of terminology are used interchangeably within optimisation problems?

Do they have any nuanced differences?

A
16
Q

Generally, what does constrained optimisation look like?

17
Q

What is the ordinal encoding of categorical features?

18
Q

What is one-hot and dummy encoding of categorical data?

19
Q

What are performance measures?

What are the common ones for classifications?

What are the common ones for regression?

20
Q

What is a confusion matrix?

A

Imagine we are testing for something bad (e.g. whether someone has an illness):

  • False positive = false alarm –> you say something is bad when it's actually fine (wastes resources and trust)
  • False negative = missed detection –> you say something is fine when it's actually bad (causes real-world harm and risk)
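The four cells can be read straight off a confusion matrix. A minimal sketch, assuming scikit-learn and made-up labels (1 = ill, 0 = healthy):

```python
from sklearn.metrics import confusion_matrix

y_true = [1, 1, 1, 0, 0, 0, 0, 1]  # actual condition
y_pred = [1, 0, 1, 0, 1, 0, 0, 1]  # test result

# Rows = actual class, columns = predicted class:
# [[TN, FP],   FP = false alarm (healthy flagged as ill)
#  [FN, TP]]   FN = missed detection (ill flagged as healthy)
tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
print(tn, fp, fn, tp)
```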
21
Q

How do we calculate accuracy as a performance measure?

What is its importance and what are its limitations?

22
Q

What is precision and recall as performance measures?

23
Q

What is F1 or, more generally, F as a performance measure?

24
Q

What is the Receiver Operating Characteristic Curve?

What is the Area Under the Curve (AUC)?

25
Q

What is the summary of all the performance metrics?

26
Q

Examples of when we would want different F_β measures?

A

Example 1: Medical cancer screening (a false negative is worse!), so β > 1
  • True positive –> patient has cancer and the test detects it
  • False positive –> patient doesn't have cancer but tests positive (worrisome but not life-threatening)
  • False negative –> patient has cancer but tests negative –> thinks they are fine but could die

Example 2: Email spam filter (false positives are worse), so β < 1
  • True positive –> detects spam, deletes the email, and it was spam
  • False positive –> detects spam and deletes it, but it was an important personal email –> a missed legal document, which is bad!
  • False negative –> lets a spam email in, which is annoying but not as bad as a false positive

Example 3: Credit card fraud detection –> β = 1, as the dataset will be unbalanced towards normal people using their credit cards day to day
  • True positive –> detects someone is committing credit card fraud and they are
  • False positive –> shuts down normal people's accounts –> angry customers and loss of business
  • False negative –> direct financial loss and fraud growth
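The effect of β can be seen directly in scikit-learn's `fbeta_score` (a sketch with made-up predictions where precision is higher than recall): β > 1 weights recall more (cancer screening), β < 1 weights precision more (spam filtering).

```python
from sklearn.metrics import fbeta_score

y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 1, 0, 0, 0]  # precision = 2/3, recall = 1/2

f_half = fbeta_score(y_true, y_pred, beta=0.5)  # leans toward precision
f1     = fbeta_score(y_true, y_pred, beta=1.0)  # balanced harmonic mean
f2     = fbeta_score(y_true, y_pred, beta=2.0)  # leans toward recall
print(f_half, f1, f2)
```

Since precision exceeds recall here, the score falls as β rises: F_0.5 > F_1 > F_2.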