Research: Module 7 - Regression Flashcards

(46 cards)

1
Q

is a regression a type of correlation?

A

NO

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

regression

A

a statistical technique that predicts an outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what type of data can regressions be run on?

A

the predictors can be interval/ratio, ordinal, or nominal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

predictors vs. outcome variables

A

predictor: the variable you use to predict or explain changes in another variable (IV)

outcome: the variable you are trying to predict or explain (DV)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

linear regression

A

one predictor (any level of data) and one I/R outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

multiple linear

A

multiple predictors (any level of data) and one I/R outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

logistic regression

A

**one or more predictors (any level of data) and one categorical outcome, 2 levels only

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

multinomial logistic regression

A

one or more predictors (any level of data) and one categorical outcome, multiple levels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

ordinal logistic regression

A

one or more predictors (any level of data) and one ordinal outcome, multiple levels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what type of regression is the following: Does GRE score significantly predict PT school GPA?

A

linear

one predictor (any level of data) and one I/R outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what type of regression is the following: Does age predict days to clearance following a concussion in adolescent athletes?

A

linear

one predictor (any level of data) and one I/R outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what type of regression is the following: Does disease category predict distance on the 6MWT?

A

linear

one predictor (any level of data) and one I/R outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

how would you convert this relationship hypothesis into a prediction hypothesis: There will be a significant relationship between time 1and time 2 at r >+.80 or r<-.80?

A

can time 1 ROM predict time 2 ROM?

what ROM value would be expected at time 2?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

linear regression equation

A

y=mx+b

m = regression coefficient
b = constant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

how would you calculate what time 2 would be?

A

for y=mx + b, you are solving for y

plug in ROM at time 1 for x

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what regression would you run for the following: Can sitting balance scores significantly predict
standing balance scores in patients following a CVA
in acute care?

A

linear regression

one predictor (any level of data) and one I/R outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

what are the 5 assumptions for regression?

A

1: data must be linear, not curvilinear (scatterplot)

2: data must have normality

3: homoscedasticty of variance

4: data must be free of influential outliers

5: data must be independent of each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

what do you look at the assess if the data has normality?

A

histograms
skewness
kurtosis
boxplots
Shapiro-wilk test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

what is a major concern when meeting the assumptions for a regression?

20
Q

how do you tell if a data set is normally distributed when observing a box plot?

A

The box is roughly symmetric around the median (the line inside the box).

The lengths of the whiskers on both sides are approximately equal.

few or no outliers

21
Q

how do you tell if a data set is normally distributed when observing a histogram?

A

Bell-Shaped Curve:
The bars form a peak in the middle and taper off symmetrically on both sides.

Symmetry Around the Mean:
The left and right sides of the peak are roughly mirror images.

Single Peak (Unimodal):
There is one clear central peak, not multiple peaks.

22
Q

how do you determine if a data set is normally distributed based on kurtosis and skewness?

A

the values should be inbetween -2 and +2

23
Q

Shapiro-Wilk test

A

statistical test used to assess whether a dataset is normally distributed.

we DO NOT want there to be a significant difference

p-value > 0.05 → Fail to reject H₀ → Data appears normal

p-value ≤ 0.05 → Reject H₀
→ Data is not normally distributed

24
Q

Shapiro Wilk test calculates a _____ statistic

A

W

small W value → data not normally distributed

25
Levene's test
a statistical test used to assess whether two or more groups have equal variances (homogeneity of variance) want this test to NOT be significant (p>0.05) used in ANOVAs, t-tests
26
homoscedasticty of variance
in relationship designs, the variance (the spread) of the outcome variable should be approximately the **same at all levels** of the predictor variable used in regression analysis
27
homogeneity vs. homoscedasticty of variance
**Homogeneity**: Equality of variances across groups (comparing two or more groups.) -ex: Comparing knee ROM between two therapy groups: both have similar variability **Homoscedasticity**: Variance stays consistent across all values of an independent variable (continuous predictor in regression.) -ex: Predicting knee ROM from weeks of rehab: ROM variance is steady across weeks.
28
heteroscadasticity
the spread of the data around the best fit line is NOT even
29
Cook's distance
used in regression analysis to identify influential data points, outliers that have a large impact on the fitted regression model. >1: outlier is potentially influential → eliminate participant
30
Durbin-watson test
way to check for independence of observations...checks for possible correlations between the participants which would violate out assumption ranges: 0-4 (2 is perfect - no correlation)
31
adjusted R-squared
a modified version of R-squared that adjusts for the number of IVs in the model it penalizes the addition of unnecessary variables
32
what regression would you run for the following: Does UG GPA and GRE score significantly predict PT school GPA?
multiple linear regression **multiple predictors** (any level of data) and **one I/R outcome**
33
what regression would you run for the following: Does age and sport predict days to clearance following a concussion in adolescent athletes?
multiple linear regression **multiple predictors** (any level of data) and **one I/R outcome**
34
what regression would you run for the following: Does disease category and quadriceps strength predict distance on the 6MWT?
multiple linear regression **multiple predictors** (any level of data) and **one I/R outcome**
35
multicollinearity
occurs in multiple regression when two or more predictor (independent) variables are highly correlated with each other. This means they provide redundant information DO NOT want this
36
what is considered a high correlation amongst predictors?
>0.9
37
what 2 variables are used to test multicollinearity?
variance inflation factor - should be <10 tolerance - should be >0.1
38
what regression would you run for the following scenario: We are interested in predicting CI scores from the final clinical internship....this outcome score ranges from 0 (poor)-100 (excellent). We are interested in two predictors....entered into a model together - The number of classes that the student received a grade of “A” – max is 35. - The average ratings from their past three CIs at the end of each clinical affiliation. (0=poor – 100=excellent)
multiple linear regression **multiple predictors** (any level of data) and **one I/R outcome**
39
what would the regression equation look if you have multiple predictor variables?
y = mx + mx + b x = predictor variables
40
what type of regression should you run for the following?: Does age and type of cardiac procedure (value repair vs bypass) predict discharge location to home vs. other?
logistic regression one or more predictors (any level of data) and **one categorical outcome**, 2 levels only
41
what type of regression should you run for the following?: Does TUG score predict fallers and non- fallers?
logistic regression one or more predictors (any level of data) and **one categorical outcome**, 2 levels only
42
what type of regression should you run for the following?: Does a score on the VOM (Vestibular Ocular Motor) Screening test predict return to sport in the next 30 days? (yes/no)
logistic regression one or more predictors (any level of data) and **one categorical outcome**, 2 levels only
43
a logistic regression uses a _____________ instead of a straight line
logistic curve
44
what type of regression should you run for the following?: predictors: lateral trunk sway during gait, family history, cervical protraction/retraction during gait, birth history outcome: spastic CP, hereditary spastic paraplegia
logistic regression one or more predictors (any level of data) and **one categorical outcome**, 2 levels only
45
what type of regression should you run for the following?: predictors: lateral trunk sway during gait, family history, cervical protraction/retraction during gait, birth history outcome: spastic CP, hereditary spastic paraplegia, athetoid CP
multinominal logistic regression one or more predictors (any level of data) and **one categorical outcome**, multiple levels
46
what type of regression should you run for the following?: predictors: age, FIM, balance score, MMSE score, gait velocity outcome: live independently at home, assisted living, requires 24 hour care
ordinal logistic regression one or more predictors (any level of data) and **one ordinal outcome**, multiple levels