What is the main difference between linear and logistic regression?
Linear regression predicts a continuous value, while logistic regression predicts a probability between 0 and 1 for binary classification.
What are the two parameters in a linear regression model?
Slope (β₁) and intercept (β₀).
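Both parameters can be recovered directly from the data with the OLS closed-form formulas; a minimal Python sketch with made-up points:

```python
# Fit a simple linear regression y = b0 + b1*x by hand (illustrative data).
xs = [1, 2, 3, 4, 5]
ys = [2.1, 4.0, 6.2, 7.9, 10.1]

x_bar = sum(xs) / len(xs)
y_bar = sum(ys) / len(ys)

# Slope: covariance of x and y divided by the variance of x.
b1 = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / sum(
    (x - x_bar) ** 2 for x in xs
)
# Intercept: forces the fitted line through the point (x_bar, y_bar).
b0 = y_bar - b1 * x_bar

print(round(b1, 3), round(b0, 3))  # → 1.99 0.09
```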
What is the dependent variable in a regression model?
It is the output that the model is trying to predict.
What are the assumptions of linear regression?
Linearity, independence of errors (observations), normality of errors, and homoscedasticity (constant error variance).
What is multicollinearity?
It’s when input features are highly correlated with each other, which can distort model interpretation.
Why do we square the errors in Ordinary Least Squares (OLS)?
Squaring makes all errors positive so they do not cancel out, penalizes large deviations more heavily, and yields a smooth, differentiable loss that is easy to optimize.
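The effect of squaring is easy to see on a toy set of errors: one large error dominates the squared loss far more than the absolute loss.

```python
# Compare how squared vs. absolute loss weight the same errors (toy numbers).
errors = [1.0, -1.0, 3.0]

sse = sum(e ** 2 for e in errors)   # squared errors: 1 + 1 + 9 = 11
sae = sum(abs(e) for e in errors)   # absolute errors: 1 + 1 + 3 = 5

# The single large error (3.0) contributes 9/11 of the squared loss
# but only 3/5 of the absolute loss.
print(sse, sae)  # → 11.0 5.0
```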
What is robust regression?
An alternative to OLS that is less sensitive to outliers, for example by minimizing absolute errors (least absolute deviations) instead of squared errors.
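Besides least absolute deviations, the Huber loss is another common robust choice that blends the two behaviors. A minimal sketch (the delta threshold of 1.0 is an illustrative choice, not a universal default):

```python
def huber(e, delta=1.0):
    # Quadratic for small errors (like OLS), linear for large ones
    # (like absolute loss), so outliers are penalized less harshly.
    if abs(e) <= delta:
        return 0.5 * e ** 2
    return delta * (abs(e) - 0.5 * delta)

# A large error contributes far less than under squared loss (0.5 * 4**2 = 8.0).
print(huber(0.5), huber(4.0))  # → 0.125 3.5
```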
What is a scatterplot useful for?
Visualizing the relationship between two variables, checking whether it looks linear, and spotting outliers.
What is R-squared?
The proportion of variance in the dependent variable that the model explains; values closer to 1 indicate a better fit.
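R-squared follows directly from the residual and total sums of squares; a sketch on hypothetical actual vs. predicted values:

```python
# Compute R-squared by hand (illustrative numbers).
actual = [2.0, 4.0, 6.0, 8.0]
predicted = [2.1, 3.9, 6.2, 7.8]

mean_y = sum(actual) / len(actual)
ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))  # residual sum of squares
ss_tot = sum((a - mean_y) ** 2 for a in actual)                # total sum of squares
r2 = 1 - ss_res / ss_tot

print(round(r2, 3))  # → 0.995
```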
Why do we scale features before fitting a model?
To put features on comparable scales so that no variable dominates simply because of its units; this matters especially for regularized models and gradient-based optimization.
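One common scaling method is standardization (z-scores); a hand-rolled sketch on made-up values:

```python
# Standardize a feature to mean 0 and standard deviation 1 (z-scores).
values = [10.0, 20.0, 30.0]

mean = sum(values) / len(values)
std = (sum((v - mean) ** 2 for v in values) / len(values)) ** 0.5  # population std
scaled = [(v - mean) / std for v in values]

print([round(v, 3) for v in scaled])  # → [-1.225, 0.0, 1.225]
```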
How does logistic regression output predictions?
It uses a sigmoid function to output probabilities between 0 and 1.
What is the role of feature engineering?
Transforming or creating features to improve model performance or fit.
What is the difference between homoscedasticity and heteroscedasticity?
Homoscedasticity means the variance of errors is constant across all levels of input; heteroscedasticity means the variance changes.
What is a residual in regression?
The difference between the actual value and the predicted value by the model.
What is the sigmoid function used in logistic regression?
A mathematical function, σ(z) = 1 / (1 + e⁻ᶻ), that maps any real value into the range between 0 and 1.
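The sigmoid is a one-liner in Python; note the symmetry sigmoid(z) + sigmoid(−z) = 1 and that sigmoid(0) is exactly 0.5:

```python
import math

def sigmoid(z):
    # Maps any real number into the open interval (0, 1).
    return 1.0 / (1.0 + math.exp(-z))

print(sigmoid(0), round(sigmoid(4), 3), round(sigmoid(-4), 3))  # → 0.5 0.982 0.018
```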
When should you not use linear regression?
When the relationship between variables is non-linear or when the assumptions of linear regression are violated.
Why is normality of errors important in linear regression?
It ensures reliable inference like confidence intervals and p-values.
How can linear regression be used in FP&A?
To forecast costs or revenue based on inputs like volume, location, and seasonality.
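A cost-forecasting example of this idea, fitting cost against volume with the OLS closed form; the monthly figures are made up for illustration:

```python
# Hypothetical: fit cost = b0 + b1 * volume on made-up monthly data, then forecast.
volumes = [100.0, 200.0, 300.0, 400.0]
costs = [1050.0, 1980.0, 3020.0, 3950.0]  # illustrative historical costs

v_bar = sum(volumes) / len(volumes)
c_bar = sum(costs) / len(costs)
b1 = sum((v - v_bar) * (c - c_bar) for v, c in zip(volumes, costs)) / sum(
    (v - v_bar) ** 2 for v in volumes
)
b0 = c_bar - b1 * v_bar

forecast = b0 + b1 * 500.0  # projected cost at a planned volume of 500 units
print(round(b1, 2), round(b0, 2), round(forecast, 2))  # → 9.74 65.0 4935.0
```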
How can logistic regression be used in FP&A?
To predict binary outcomes such as whether an order will be late or whether a customer will reorder.
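A sketch of the late-order case: a linear score passed through the sigmoid gives the probability of lateness. The coefficients and the `upstream_delay_days` feature here are hypothetical, not fitted values:

```python
import math

# Hypothetical coefficients for "will this order be late?" (illustration only).
b0, b1 = -3.0, 0.8  # intercept; weight on upstream delay in days

def p_late(upstream_delay_days):
    # Linear score passed through the sigmoid yields a probability in (0, 1).
    z = b0 + b1 * upstream_delay_days
    return 1.0 / (1.0 + math.exp(-z))

print(round(p_late(0), 3), round(p_late(5), 3))  # → 0.047 0.731
```

A threshold (commonly 0.5) then turns the probability into a late/on-time classification.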