Regression Flashcards

(19 cards)

1
Q

What is the main difference between linear and logistic regression?

A

Linear regression predicts a continuous value, while logistic regression predicts a probability between 0 and 1 for binary classification.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two parameters in a linear regression model?

A

Slope (β₁) and intercept (β₀).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the dependent variable in a regression model?

A

It is the output that the model is trying to predict.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the assumptions of linear regression?

A

Linearity, independence of features, normality of errors, and homoscedasticity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is multicollinearity?

A

It’s when input features are highly correlated with each other, which can distort model interpretation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why do we square the errors in Ordinary Least Squares (OLS)?

A

To make them positive and easier to optimize, and to make large differences more noticeable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is robust regression?

A

An alternative to OLS that uses absolute values instead of squares, making it less sensitive to outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a scatterplot useful for?

A

Detecting linearity and spotting outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is R-squared?

A

A measure of how well the model fits the data; closer to 1 means better fit.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why do we scale features before fitting a model?

A

To ensure all features contribute equally and avoid dominance by large-scale variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How does logistic regression output predictions?

A

It uses a sigmoid function to output probabilities between 0 and 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the role of feature engineering?

A

Transforming or creating features to improve model performance or fit.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the difference between homoscedasticity and heteroscedasticity?

A

Homoscedasticity means the variance of errors is constant across all levels of input; heteroscedasticity means the variance changes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a residual in regression?

A

The difference between the actual value and the predicted value by the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the sigmoid function used in logistic regression?

A

A mathematical function that maps any real value into a range between 0 and 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

When should you not use linear regression?

A

When the relationship between variables is non-linear or when the assumptions of linear regression are violated.

17
Q

Why is normality of errors important in linear regression?

A

It ensures reliable inference like confidence intervals and p-values.

18
Q

How can linear regression be used in FP&A?

A

To forecast costs or revenue based on inputs like volume, location, and seasonality.

19
Q

How can logistic regression be used in FP&A?

A

To predict binary outcomes such as whether an order will be late or whether a customer will reorder.