Logistic Regression Flashcards

Formulas and slippery slopes (26 cards)

1
Q

Define logistic regression.

A

A statistical method for predicting binary classes using a logistic function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

True or false: Logistic regression can only handle binary outcomes.

A

FALSE

Logistic regression can be extended to handle multiple classes using multinomial logistic regression.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the logit function?

A

The natural logarithm of the odds of the probability of an event occurring.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Fill in the blank: The output of logistic regression is a _______ value.

A

probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does the sigmoid function do in logistic regression?

A

It maps any real-valued number into the range between 0 and 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define odds.

A

The ratio of the probability of an event occurring to the probability of it not occurring.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the purpose of the cost function in logistic regression?

A

To measure how well the model’s predictions match the actual outcomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

True or false: Logistic regression assumes a linear relationship between the independent and dependent variables.

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is maximum likelihood estimation?

A

A method for estimating the parameters of a statistical model that maximizes the likelihood function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Fill in the blank: In logistic regression, the coefficients represent the change in the log odds of the outcome for a _______.

A

one-unit change in the predictor

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define multicollinearity.

A

A situation where two or more predictors are highly correlated, affecting the model’s estimates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the confusion matrix?

A

A table used to evaluate the performance of a classification model by comparing predicted and actual values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

True or false: Logistic regression can provide probabilities for class membership.

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does the area under the ROC curve (AUC) represent?

A

The ability of a model to distinguish between classes; higher values indicate better performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Fill in the blank: The threshold in logistic regression determines the cutoff for classifying an instance as _______.

A

positive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the interpretation of the odds ratio in logistic regression?

A

It indicates how much the odds of the outcome change with a one-unit increase in the predictor.

17
Q

Define regularization.

A

A technique used to prevent overfitting by adding a penalty to the loss function.

18
Q

What is L1 regularization also known as?

A

Lasso regression

19
Q

True or false: Logistic regression can handle non-linear relationships without transformation.

A

FALSE

Non-linear relationships often require transformations or polynomial terms.

20
Q

What is the link function in logistic regression?

A

A function that connects the linear predictor to the mean of the distribution function.

21
Q

Fill in the blank: In logistic regression, the dependent variable is _______.

22
Q

What is the main assumption of logistic regression regarding the errors?

A

The errors are assumed to be independent and follow a binomial distribution.

23
Q

Define pseudo R-squared.

A

A measure that provides an indication of the goodness of fit for logistic regression models.

24
Q

What is the Hosmer-Lemeshow test used for?

A

To assess the goodness of fit of a logistic regression model.

25
True or false: Logistic regression can be used for time-to-event data.
FALSE
26
What is the **purpose of the Wald test** in logistic regression?
To test the significance of individual coefficients in the model.