Section 7: basic analysis for determining associations with numeric outcomes II: Scatterplots & correlation analysis Flashcards

(34 cards)

1
Q

What are the statistical tools used for 2 numeric variables?

A
  1. Scatterplot
  2. Pearson correlation
  3. Spearman rank correlation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a scatterplot?

A
  • A graphical summary showing how one numeric variable changes with another.
  • X-axis: independent variable (exposure)
  • Y-axis: dependent variable (outcome)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

When is a scatterplot usually drawn?

A

Before working out a linear correlation coefficient or fitting a regression line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does the resulting pattern of a scatterplot indicate?

A

type & strength of relationship between 2 variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the line of best fit?

A
  • Also called a “Trend Line“
  • A straight line that best represents the data on a scatter plot
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What can we interpret from scatterplots?

A

Info about relationship

  • Direction → positive, negative, or none
  • Form → linear, nonlinear, clusters
  • Strength → weak or strong
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When can we use pearson correlation analysis?

A

When relationship between 2 quantitative variables is visually linear, can quantify relationship using pearsons

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the assumptions for pearsons correlation?

A
  1. Relationship appears linear (linearity)
  2. Both variables are normally distributed (normality)
  3. No outliers
  4. Variables have same variance (homoscedasticity)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How can we test normality (if variables are normally distributed)?

A

Shapiro-wilk or shapiro francia tests

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How do we test homoscedasticity of variance?

A

2 sample variance comparison test (sdtest) in Stata

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does the correlation coefficient show?

A

It shows strength & direction of relationship between two numeric variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When do we use Spearman correlation?

A

When relationship is monotonic (not necessarily linear) or data are not normally distributed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When do we use Pearson correlation?

A

When relationship between two numeric variables is linear & data are normally distributed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the range of the correlation coefficient (r)?

A

r ranges from –1 to +1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does the value of r show?

A
  • Strength of the relationship
  • Closer r is to –1 or +1, the stronger the association.
16
Q

What does the sign of r indicate?

A

The direction of the relationship:
* Positive (+): both variables increase together
* Negative (–): one increases while the other decreases

17
Q

What does the p-value in correlation mean?

A
  • It shows the statistical significance
  • whether the correlation is likely due to chance or represents a real association.
18
Q

How do we interpret the strength of r?

A

r Range | Interpretation

0.00–0.20 → Negligible
0.20–0.40 → Weak
0.40–0.60 → Moderate
0.60–0.80 → Strong
0.80–1.00 → Very strong

19
Q

When do we use Spearman Rank Correlation Analysis?

A

When relationship between 2 quantitative variables is monotonic, but the linearity, normality, or equal variance assumptions are not met.

20
Q

What type of relationship does Spearman correlation measure?

A

A monotonic relationship — where variables move in same or opposite direction, but not necessarily at a constant rate (not linear).

21
Q

What is the monotonic relationship when it is monotoniocally increasing?

A

as x variable increases, y variable never decreases

22
Q

What is the monotonic relationship when it is monotoniocally decreasing?

A

as x variable increases, y variable never increases

23
Q

What is the monotonic relationship when it is not monotonic?

A

as x variable increases, y variable sometimes decreases & sometimes increases

24
What is Spearman’s Rank Correlation Analysis?
A non-parametric test that measures strength & direction of **monotonic** relationship between two numeric (or ordinal) variables.
25
When is spearman rank correlation done?
* Between two numeric variables when at least one of the two is not normally distributed * Also possible when one of the two variables is ordinal categorical
26
What is a monotonic relationship?
Relationship where as one variable increases, other consistently increases or decreases, but not necessarily in a straight line.
27
When is Spearman’s correlation used instead of Pearson’s?
When data are **not normally distributed**, relationship is **nonlinear but monotonic**, or the assumptions of l**inearity & equal variance** aren't met.
28
What is the symbol for Spearman’s Rank Correlation Coefficient?
ρ (rho)
29
What does Spearman Rank Correlation Coefficient indicate?
direction & strength of monotonic relationship
30
31
What does a positive Spearman’s ρ indicate?
* As one variable increases, the other also increases — their ranks move in the same direction. 📈 Example: As hours of study increase, exam scores increase. 👉 Perfect positive = ρ = +1
32
What does a negative Spearman’s ρ indicate?
* As one variable increases, the other decreases — their ranks move in **opposite** directions. 📉 Example: As physical activity increases, body weight decreases. 👉 Perfect negative = ρ = –1
33
What does Spearman’s ρ = 0 mean?
* No monotonic relationship * changes in one variable do not consistently relate to changes in the other.