5a. Linear Regression Flashcards

(16 cards)

1
Q

How does linear regression help decision makers?

A
  • Understand relationship between variables
  • Predict the value of variable (Y) based on a set of impacting factors (X1, X2, X3)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does linear regression do?

A

It analyses the linear relationship between two or more variables (attempts to fit a straight line through the points on a chart between DV and IVs).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What measurement level is used in linear regression? (dependent variable)

A

Interval or ratio scales

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the difference between a simple (bivariate) and a multiple linear regression?

A

Simple analyses the linear relationship between one DV and one IV
Multiple analyses the linear relationship between one DV and multiple IVs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the basic function of linear regression?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does a conceptual chart of a multiple linear regression look like? (online banking case)

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the first step to carrying out a linear regression (in SPSS)? (online banking case)

A

Select customer profitability as DV and other variables as IVs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does this mean?

A

R-squared value: All five variables explain 5.7% of the variation in customer profitability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the difference between adjusted R square vs normal R square?

A

R squared refers to how much of the total variance in the DV can be explained by the IV(s). We tend to inflate the R squared with the more variables we have, so the adjusted R square corrects for that and is a more conservative measure - USE ADJUSTED R SQUARE!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What can you tell from this?

A

That the model is statistically significant, you can reject the null hypothesis that no relationship exists between the variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the difference between unstandardised and standardised beta?

A

Unstandardised β: change in Y (raw units) for a 1-unit increase in X - good for interpretability.

Standardised β: change in Y (SD units) for a 1 SD increase in X (good for comparing predictors).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What can you tell about the significance of the coefficients?

A

Statistical significance of all regression coefficients except “district”.
(so all other variables do make statistically significant unique contribution to the DV).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Knowing “district” is insignificant, what should we do?

A

The fewer variables you have, the better is your predictive model. Given district code is insignificant, you can remove it and run again! Doing so does not change the R squared (but still better model).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When removing “district” from the linear regression, what should you check?

A

That the R squared value did not change (it is possible it worsens, in that case.. don’t remove it!)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What can you tell from the estimated equation? (about the profitability)

A

When a customer uses the online (as compared to offline) channel, the profitability will increase by 18.240

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What can you tell from the estimated equation? (about age)

A

When a customer’s age increases by one unit (?), the profitability will increase by 18.279.

HOWEVER, watch OUT!!!
Always look at how variables are coded, the age increasing by 1 unit is referring to going up an age group, its not by year!