Regression Flashcards

Question 1

Q

Regression vs classification

Answer

A

Regression: when we predict quantitative outputs
Classification: when we predict categorical (qualitative) outputs

Question 2

Q

Regression

Answer

A

Regression determines the relationship between the
dependent variable 𝑌 and a set of independent variables 𝑋

Question 3

Q

Dependent and independent variables

Answer

A

𝑋 are the independent variables. 𝑌 is the dependent variable: it changes as a consequence of the other variables.

Question 4

Q

How do we fit a linear model?

Answer

A

We fit it by minimizing the difference between the actual and predicted 𝑌 values.

Question 5

Q

Multiple Linear Regression

Answer

A

Linear regression with multiple input variables.
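
As a reminder of what the model looks like, a common way to write it is shown here. The symbols (p inputs, coefficients β, error term ε) are conventional notation assumed for illustration, not taken from the card.

```latex
Y = \beta_0 + \beta_1 X_1 + \beta_2 X_2 + \dots + \beta_p X_p + \varepsilon
```

With p = 1 this reduces to simple linear regression.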

Question 6

Q

Ordinary Least Squares

Answer

A

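A method used to fit the regression line: it chooses the coefficients that minimize the sum of squared differences between the actual and predicted 𝑌 values.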
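
A minimal sketch of a least-squares fit, assuming NumPy is available. The function name `fit_ols` and the toy data are hypothetical, chosen only for illustration.

```python
import numpy as np

def fit_ols(X, y):
    """Return (intercept, coefficients) minimizing the sum of squared residuals."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    if X.ndim == 1:
        X = X.reshape(-1, 1)  # treat a 1-D input as a single feature
    # Prepend a column of ones so the first coefficient is the intercept.
    A = np.column_stack([np.ones(len(X)), X])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    return beta[0], beta[1:]

# Hypothetical data lying exactly on the line y = 1 + 2x.
x = np.array([0.0, 1.0, 2.0, 3.0])
y = 1.0 + 2.0 * x
intercept, coefs = fit_ols(x, y)
print(intercept, coefs)
```

Because the toy data has no noise, the fit should recover an intercept close to 1 and a slope close to 2.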

Question 7

Q

How to measure the fit of the model

Answer

A

The R² measure (coefficient of determination).
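
A short sketch of how R² is commonly computed, as one minus the ratio of the residual sum of squares to the total sum of squares. The function name `r_squared` is hypothetical and NumPy is assumed.

```python
import numpy as np

def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    ss_res = np.sum((y_true - y_pred) ** 2)         # unexplained variation
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)  # total variation around the mean
    return 1.0 - ss_res / ss_tot

print(r_squared([1, 2, 3], [1, 2, 3]))  # perfect predictions
print(r_squared([1, 2, 3], [2, 2, 2]))  # always predicting the mean
```

A perfect fit gives 1, and a model that only predicts the mean of 𝑌 gives 0.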

Question 8

Q

Gradient Descent

Answer

A

It’s an optimization algorithm that finds the linear regression coefficients
iteratively
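
The iterative idea can be sketched for a single input variable as follows. This is an illustration under stated assumptions, not a reference implementation: the loss is taken to be the mean squared error, and the learning rate, iteration count, and toy data are hypothetical choices.

```python
import numpy as np

def gradient_descent(x, y, lr=0.05, n_iters=5000):
    """Estimate intercept b0 and slope b1 by repeatedly stepping against the MSE gradient."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    b0, b1 = 0.0, 0.0  # start from zero coefficients
    n = len(x)
    for _ in range(n_iters):
        error = (b0 + b1 * x) - y              # prediction minus actual value
        grad_b0 = 2.0 / n * error.sum()        # d(MSE)/d(b0)
        grad_b1 = 2.0 / n * (error * x).sum()  # d(MSE)/d(b1)
        b0 -= lr * grad_b0
        b1 -= lr * grad_b1
    return b0, b1

# Hypothetical data lying exactly on the line y = 1 + 2x.
x = np.array([0.0, 1.0, 2.0, 3.0])
y = 1.0 + 2.0 * x
b0, b1 = gradient_descent(x, y)
print(b0, b1)
```

If the learning rate is too large the updates diverge, and if it is too small convergence is slow, so in practice it usually has to be tuned.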

Question 9

Q

Problems with fitting the data

Answer

A

Over-fitting: the model fits the training data too well
and fails to generalize to new data.
Under-fitting: the model can neither fit the
training data nor generalize to new data.
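
One way to see both problems is to fit polynomials of different degrees to the same noisy data. The sketch below assumes NumPy. The data, the noise level, and the degrees 0, 1, and 9 are hypothetical choices for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed so the run is repeatable

# Hypothetical data: a straight line plus noise, split into train and test sets.
x_train = np.linspace(0.0, 1.0, 10)
y_train = 2.0 * x_train + rng.normal(scale=0.2, size=x_train.size)
x_test = np.linspace(0.05, 0.95, 10)
y_test = 2.0 * x_test + rng.normal(scale=0.2, size=x_test.size)

def fit_and_score(degree):
    """Fit a polynomial of the given degree; return (training MSE, test MSE)."""
    coefs = np.polyfit(x_train, y_train, degree)
    train_mse = float(np.mean((y_train - np.polyval(coefs, x_train)) ** 2))
    test_mse = float(np.mean((y_test - np.polyval(coefs, x_test)) ** 2))
    return train_mse, test_mse

for degree in (0, 1, 9):
    print(degree, fit_and_score(degree))
```

The degree-0 model (a constant) cannot follow even the training data, which is under-fitting. The degree-9 model passes through all ten training points, so its training error is essentially zero, yet its test error is typically worse than that of the straight line, which is over-fitting.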

Question 10

Q

Problems with OLS

Answer

A

Low prediction performance (such as over-fitting) and interpretation (it is hard to get the bigger picture). The solution is to regularize the coefficient estimates (shrinkage methods), which can be done with ridge and lasso regression.

Question 11

Q

Ridge regression

Answer

A

Shrinks the regression coefficients by imposing a
penalty on their size
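
A minimal sketch of the shrinkage effect, assuming NumPy and the usual closed-form ridge solution with an unpenalized intercept. The function name `fit_ridge`, the penalty weight `alpha`, and the toy data are hypothetical.

```python
import numpy as np

def fit_ridge(X, y, alpha=1.0):
    """Ridge fit: least squares plus alpha times the sum of squared coefficients."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    # Center the data so the intercept is not penalized.
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean
    n_features = X.shape[1]
    # Solve (Xc'Xc + alpha * I) beta = Xc'yc.
    coefs = np.linalg.solve(Xc.T @ Xc + alpha * np.eye(n_features), Xc.T @ yc)
    intercept = y_mean - x_mean @ coefs
    return intercept, coefs

# Hypothetical data lying exactly on the line y = 1 + 2x.
X = np.array([[0.0], [1.0], [2.0], [3.0]])
y = 1.0 + 2.0 * X[:, 0]
print(fit_ridge(X, y, alpha=0.0))  # no penalty: ordinary least squares
print(fit_ridge(X, y, alpha=5.0))  # with a penalty the slope is pulled toward zero
```

With `alpha=0` the result matches ordinary least squares. Larger values of `alpha` shrink the slope further toward zero.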

Question 12

Q

Lasso Regression

Answer

A

Lasso regression is a shrinkage method like ridge. The only difference
is that, instead of the squares of the coefficients, the penalty uses
their magnitudes (absolute values).
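
For comparison, the two penalized objectives are commonly written as follows. The notation (n observations, p inputs, coefficients β, penalty weight λ) is assumed here for illustration and is not taken from the card.

```latex
\text{Ridge:}\quad \min_{\beta}\ \sum_{i=1}^{n}\Big(y_i - \beta_0 - \sum_{j=1}^{p}\beta_j x_{ij}\Big)^2 + \lambda \sum_{j=1}^{p} \beta_j^2

\text{Lasso:}\quad \min_{\beta}\ \sum_{i=1}^{n}\Big(y_i - \beta_0 - \sum_{j=1}^{p}\beta_j x_{ij}\Big)^2 + \lambda \sum_{j=1}^{p} |\beta_j|
```

The first term is the same least-squares loss in both cases. Only the penalty term differs.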

Question 13

Q

Assumptions in linear regression: what are they?

Answer

A

“Assumptions” are statements that we take to be true about our data and the model in order for the mathematical properties of linear regression to hold.

Question 14

Q

What are the assumptions?

Answer

A

The observations are independent (random sampling)
The relationship of 𝑌 with 𝑋 and the error term is linear
𝑌 is normally distributed at each value of 𝑋
The error term is normally distributed with mean zero and constant variance
The 𝑋 variables are independent of each other (multiple linear regression only)

Question 15

Q

Evaluation of a linear regression model

Answer

A

The performance of the model must be reported as an error for the predictions. Some methods are MSE, RMSE, MAE, MAPE
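
The four error measures can be sketched in a few lines, assuming NumPy. The function name `regression_errors` and the sample values are hypothetical.

```python
import numpy as np

def regression_errors(y_true, y_pred):
    """Return MSE, RMSE, MAE and MAPE (in percent) for a set of predictions."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    err = y_true - y_pred
    mse = np.mean(err ** 2)
    rmse = np.sqrt(mse)                # same units as the target
    mae = np.mean(np.abs(err))         # same units as the target
    # MAPE divides by the actual values, so it is undefined if any of them is zero.
    mape = 100.0 * np.mean(np.abs(err / y_true))
    return {"MSE": mse, "RMSE": rmse, "MAE": mae, "MAPE": mape}

# Hypothetical actual and predicted values.
print(regression_errors([100, 200, 400], [110, 180, 400]))
```

MSE and RMSE penalize large errors more heavily than MAE because the errors are squared before averaging.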

Question 16

Q

MSE

Answer

A

MSE (mean squared error) is calculated as the mean of the squared differences
between predicted and expected target values in a dataset.

Question 17

Q

RMSE

Answer

A

RMSE (root mean squared error) is calculated as the square root of the MSE, which means that
the units of the error are the same as the units of the target value that is
being predicted.
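
Written out, with n observations, actual values y_i, and predictions ŷ_i (notation assumed for illustration), the standard definitions are:

```latex
\mathrm{MSE} = \frac{1}{n}\sum_{i=1}^{n}\left(y_i - \hat{y}_i\right)^2,
\qquad
\mathrm{RMSE} = \sqrt{\mathrm{MSE}}
```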

Question 18

Q

MAE

Answer

A

MAE is calculated as the average of the absolute error values, and like
RMSE, the units of the error score match the units of the target value
that is being predicted

Question 19

Q

MAPE

Answer

A

MAPE is the percentage equivalent of MAE.
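
Side by side, with n observations, actual values y_i, and predictions ŷ_i (notation assumed for illustration), the standard definitions are:

```latex
\mathrm{MAE} = \frac{1}{n}\sum_{i=1}^{n}\left|y_i - \hat{y}_i\right|,
\qquad
\mathrm{MAPE} = \frac{100\%}{n}\sum_{i=1}^{n}\left|\frac{y_i - \hat{y}_i}{y_i}\right|
```

Because each error is divided by the actual value, MAPE is undefined when any y_i is zero.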

(19 cards)