Machine Learning Flashcards by Oscar Heijsteeg

What is machine learning?

using data to teach algorithms to predict outcomes they have never seen before. Steps

How well did you know this?

Not at all

Perfectly

What are the three steps in machine learning?

Give computer data and outcomes
Figures out patterns by itself
Uses these patterns to make predictions on new data

How well did you know this?

Not at all

Perfectly

Give 5 differences between statistics and machine learning?

Goal
Questions
Evaluation
Approach
Style

How well did you know this?

Not at all

Perfectly

How does the goal of statistics differ from machine learning goal?

understanding relationship VS making accurate predictions

How well did you know this?

Not at all

Perfectly

How does the central question in statistics differ from that in machine learning?

How does X relate to Y? VS Given X, what is Y?

How well did you know this?

Not at all

Perfectly

How does the evaluation of statistics differ from that of machine learning?

Coefficients, p-values VS Test-set accuracy, error rate

How well did you know this?

Not at all

Perfectly

How does the style of statistics differ from that of machine learning?

Transparent but rigid VS Flexible but often opaque

How well did you know this?

Not at all

Perfectly

How does the approach of statistics differ from that of machine learning?

Model assumptions VS Algorithms that learn patterns

How well did you know this?

Not at all

Perfectly

What are 4 important concepts in machine learning?

Feature = independent variable
 Any variable used to make predictions
Target = dependent variable
 Outcome to predict
 Used for classification, where outcome is a category
Training: estimate a model
Loss function: objective to minimize. Concepts are the same, just used more broadly

How well did you know this?

Not at all

Perfectly

What are the two types of machine learning?

Supervised: we have labels (outcomes). Model learns to predict them
Unsupervised: no labels. The model finds hidden structure in the data

How well did you know this?

Not at all

Perfectly

Give 4 ways in which you move from inference to prediction?

Does X cause Y?  Can we forecast Y?
Interpreting coefficients -> Minimizing prediction error
Worry about unobserved variables, reverse causality  Use whatever patterns work
Use fixed effects to control for unobservables  Use flexible algorithms that capture non-linearities and interactions

How well did you know this?

Not at all

Perfectly

What is a regression tree?

flowchart predicting a continuous outcome by splitting data into groups by asking a series of yes/no questions and splits data step by step. Each endpoint than gives a prediction, which is the average outcome for the observations that end up there.

How well did you know this?

Not at all

Perfectly

What are three characteristics of regression treess?

Easy to understand and visualize
Fundamentally algorithmic: computer searches for best splits rather than estimating coefficients
Showcase common strengths and problems of ML algorithms: flexibility, overfitting and cross-validation.

How well did you know this?

Not at all

Perfectly

What are the two elements of a regression tree?

Node: asks yes/no question about a variable that split data into two groups
Leaf: endpoint where tree makes a prediction (mean of observations that land there)

How well did you know this?

Not at all

Perfectly

What are the steps in which a regression tree works?

Consider every variable and every threshold at each node
Pick split that minimizes variance within resulting groups
Same as minimizing within-group variation as in Panel regression
Continue until stopping rule is met

How well did you know this?

Not at all

Perfectly

Give three key advantags of a regression tree?

Study These Flashcards

Handles non-linear relationships automatically
No need to specify interactions between variables
Intuitive and easy to explain

How do you measure prediction quality of algorithms?

Study These Flashcards

RMSE = SQRT(1/n * Sum of (Actual value – predicted value)^2

Lower RMSE means better predictions.

What is meant with model complexity?

Study These Flashcards

basically refers to the fact that more leaves in a regression tree leads to catching more intricate patterns.

What s the difference between a training set and a test set?

Study These Flashcards

Training set: set of data that is used to estimate (train) the model
Test set: hiding during estimation, used only to evaluate performance

What is meant with overfitting?

Study These Flashcards

model learns training data too well including its noise and peculiarities.

What is the sweet spot in ML algorithm development?

Study These Flashcards

number of leaves of the regression tree where the RMSE of test data is lowest

How does classification using logistic regression work?

Study These Flashcards

Classification is done using Logistic regression. Changes in classification opposed to testing regression trees’ RMSE:
1. Prediction: each leaf predicts a class instead of a number
2. Splitting criterion: we want each split to make groups as pure as possible instead of minimizing prediction error. Purity is measured by entropy

What is entropy?

Study These Flashcards

Measure of how mixed a group is

What are the different gradations in entropy and what do they mean?

Study These Flashcards

Low entropy: Mostly of one class
Medium entropy: some mixing
High entropy: even mix
Zero entropy: perfectly pure as in all of one class
 Tree picks the split that reduces the entropy the most

What are the 4 steps in a classification algorithm?

- Each split is a yes/no question - At each step, tree tries every word and picks one that reduce entropy the most - Keeps splitting until improvement falls below a threshold (complexity parameter) - Pattern recognition quickly outperforms humans.

What is a confusion matrix?

compares what model predicted vs what actually happened

What are the four possible outcomes of the confusion matrix?

1. True negative: Predicted A, Actual A 2. False negative: Predicted A, Actual B 3. False positive: Predicted B, Actual A 4. True positive: Predicted B, Actual B

What are three key metrics of the confusion matrix?

1. Accuracy 2. True positive rate 3. False positive rate

What is accuracy in a confusion matrix and how do you calculate it?

(TP+ TN) / (TN + FN + FP+ TP)  Share of all correct predictions

What is the true positive rate and how do you measure it?

TP / (TP + FN)  How many true positives did we catch

What is the false positive rate?

FP / (FP + TN)  How many observations did we falsely flag?

Give 4 wayis in which machine learning is used as a measurement tool?

1. Sentiment analysis: Optimism vs Pessimism 2. Innovation measurement: does patent describe breakthrough 3. Readability: how complex is this disclosure? 4. AI writing detection: purely human or AI generated?

Machine Learning Flashcards

(32 cards)