7 Correlation and Regression Flashcards

(33 cards)

1
Q

What is the definition of BIVARIATE DATA?

A

Data that consists of pairs of values for two random variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does correlation measure?

A

The relationship between two variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Pearson’s Product Moment Correlation Coefficient?

A

A standardised measure of correlation for linear relationships between two variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the least squares regression line used for?

A

To make predictions using the equation of the regression line.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the independent variable?

A

The variable that is being changed or controlled by the data collector.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the dependent variable?

A

The variable that is being recorded.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

True or False: Correlation implies causation.

A

False.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Fill in the blank: The _______ is the graphical representation of the regression line.

A

line of best fit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the purpose of using residuals in scatter graphs?

A

To identify outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the term GRADIENT refer to in the context of regression?

A

The slope of the regression line.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Spearman’s Rank Correlation Coefficient?

A

A non-parametric measure of correlation based on the ranks of data.
It is the PMCC of the ranked data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is INTERPOLATION?

A

Estimating values within the range of a set of data points.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is EXTRAPOLATION?

A

Estimating values outside the range of a set of data points or out of the context of the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the PRODUCT MOMENT CORRELATION COEFFICIENT (PMCC)?

A

A standardised measure of correlation specifically for linear relationships.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does it mean if two variables have CAUSAL CORRELATION?

A

A change in one variable directly affects the other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Why is a scatter graph important?

A

To check if the relationship between two variables is linear.

17
Q

What does the term RANK refer to in statistics?

A

The position of a data point in an ordered list.

18
Q

What is the purpose of calculating the Spearman’s Rank Correlation Coefficient?

A

To determine the strength of the relationship between the ranks of two variables.

19
Q

True or False: The PMCC is affected by the units of measurement.

A

False. PMCC is independent of units.

20
Q

What is the purpose of identifying independent and dependent variables in an experiment?

A

To understand the relationship and effects between variables.

21
Q

Fill in the blank: The _______ is used to describe the strength of correlation.

A

correlation coefficient

22
Q

Fill in the blank: The Spearman’s rank correlation coefficient is more suitable than the product moment correlation coefficient for _______.

A

Data which does not follow a bivariate normal distribution

23
Q

What is the equation of the least squares regression line?

A

y = a + bx, where a is the y-intercept and b is the gradient

24
Q

True or False: A prediction is reliable if it is an extrapolation.

25
What does a positive residual indicate?
The actual value is above the predicted value from the regression line.
26
What is interpolation in the context of predictions?
Predicting values within the range of data used to calculate the regression line.
27
What is extrapolation?
Predicting values outside the range of data used to calculate the regression line.
28
How is the value of a interpretted for the line y=a+bx?
When 'x' is 0-'units', 'y' is 'a'-'units'
29
How is the value of b interpretted in the line y=a+bx?
As 'x' increases by 1-'unit', the 'y' increases/decreases by 'b'-'units'
30
What is the definition of a residual?
The difference between the actual value and the predicted value from the regression line.
31
What is the significance of a high PMCC value?
It indicates a strong linear relationship between the variables.
32
What does the graph of the data look like if the SRCC is 1
The data points would form a STAIRCASE
33
What is a TIED RANK?
When ranking the data, if two pieces of data are exactly the same value, this results in a TIED RANK. Allocate each value which needs a tied rank with the mean of the ranks to be allocated.