What does the correlation coefficient (r) summarize?
The strength and direction of the linear relationship between two variables
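How do you interpret r, i.e. what do its values mean?
r ranges from -1 to 1. The sign gives the direction (positive = y tends to increase as x increases, negative = y tends to decrease as x increases), and values closer to -1 or 1 mean a stronger linear relationship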
What does r=0 mean?
There is no linear relationship between the variables
What does a strong/weak relationship look like visually?
- Strong = Points clustered heavily around the line of best fit
- Weak = Points scattered widely around the line of best fit
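One way to see the difference is to simulate it. The sketch below uses made-up data (the seed, sample size, and noise levels are arbitrary assumptions, not values from the slides) to draw one tightly clustered and one widely scattered cloud of points around the same line.

```r
set.seed(1)                               # arbitrary seed so the sketch is reproducible
x <- 1:50

# Same underlying line, different amounts of noise (made-up values)
y_strong <- 2 * x + rnorm(50, sd = 2)     # little noise: points hug the line
y_weak   <- 2 * x + rnorm(50, sd = 60)    # lots of noise: points spread far from the line

r_strong <- cor(x, y_strong)
r_weak   <- cor(x, y_weak)

# Side-by-side scatterplots, each with its line of best fit
par(mfrow = c(1, 2))
plot(x, y_strong, main = "Strong")
abline(lm(y_strong ~ x))
plot(x, y_weak, main = "Weak")
abline(lm(y_weak ~ x))
```

In the "Strong" panel r_strong should come out very close to 1, while r_weak in the "Weak" panel should be noticeably smaller.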
Calculate the correlation coefficient using the data on slide 694 and the equation found here (you don’t need to memorize it)
Answers on slide
What function in R calculates the correlation coefficient?
cor(x, y)
How do you set up data in R?
x = c(data)
y = c(data)
Note: you can assign with = or the backwards arrow (<-)
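Putting the setup and the function together might look like the sketch below. The numbers are hypothetical, made up for illustration; they are not the slide data.

```r
# Hypothetical data (not from the slides)
x <- c(1, 2, 3, 4, 5)    # assignment with the backwards arrow
y = c(2, 4, 5, 4, 5)     # assignment with =, which also works here

# Correlation coefficient between x and y
r <- cor(x, y)
print(r)
```

For these made-up values r is about 0.775, a fairly strong positive linear relationship.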
What is S_xy?
The sample covariance between x and y
What can the correlation coefficient ‘r’ be rewritten as?
r = S_xy / (S_x * S_y)
Note: S_x and S_y are the sample standard deviations of the x and y variables
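The rewritten form can be checked numerically. This is a minimal sketch with hypothetical data (made up for illustration), using the base R functions cov() and sd() for the sample covariance and sample standard deviations.

```r
# Hypothetical data (not from the slides)
x <- c(1, 2, 3, 4, 5)
y <- c(2, 4, 5, 4, 5)

# Sample covariance by hand: sum of cross-deviations divided by n - 1
s_xy_manual <- sum((x - mean(x)) * (y - mean(y))) / (length(x) - 1)

s_xy <- cov(x, y)    # built-in sample covariance
s_x  <- sd(x)        # sample standard deviation of x
s_y  <- sd(y)        # sample standard deviation of y

# Covariance scaled by both standard deviations
r_manual <- s_xy / (s_x * s_y)

print(all.equal(s_xy_manual, s_xy))     # hand calculation matches cov()
print(all.equal(r_manual, cor(x, y)))   # rewritten form matches cor()
```

Dividing by S_x and S_y removes the units of x and y, which is why r always lands between -1 and 1 while the covariance does not.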
Can a correlation coefficient be used for prediction? Why or why not?
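No, because it's not a model: it summarizes the strength and direction of the linear relationship but gives no equation for predicting y from x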
What is meant by the statement that the correlation coefficient is symmetric in variables?
Correlation between x and y is the same as correlation between y and x
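Both points (symmetry, and correlation not being a model) can be illustrated together. The sketch below uses hypothetical data and brings in lm() as the contrasting tool; the choice of lm() and the new x value of 6 are assumptions made for illustration.

```r
# Hypothetical data (not from the slides)
x <- c(1, 2, 3, 4, 5)
y <- c(2, 4, 5, 4, 5)

# Symmetry: swapping the variables does not change the correlation
symmetric <- isTRUE(all.equal(cor(x, y), cor(y, x)))

# A regression model is NOT symmetric: the slope depends on which
# variable is treated as the response
slope_y_on_x <- unname(coef(lm(y ~ x))[2])
slope_x_on_y <- unname(coef(lm(x ~ y))[2])

# Only the model can produce a prediction for a new x value
fit  <- lm(y ~ x)
pred <- unname(predict(fit, newdata = data.frame(x = 6)))

print(symmetric)
print(c(slope_y_on_x, slope_x_on_y))
print(pred)
```

The two slopes differ even though the correlation is the same in both directions, and only the fitted model returns a predicted y.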
What is R^2?
The coefficient of determination: the proportion of the variation in y that is explained by the regression model
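How do you interpret R^2, i.e. what do the numbers mean?
The closer R^2 is to 1, the more of the variation in y the model explains; the closer to 0, the less
(It can only be between 0 and 1. There is no such thing as a negative R squared value, because squaring by nature removes negative signs)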
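For a simple linear regression with one x variable, R^2 is the square of the correlation coefficient, which is why it cannot be negative. A minimal sketch with hypothetical data, assuming the model is fit with lm():

```r
# Hypothetical data (not from the slides)
x <- c(1, 2, 3, 4, 5)
y <- c(2, 4, 5, 4, 5)

fit <- lm(y ~ x)                       # simple linear regression of y on x
r_squared <- summary(fit)$r.squared    # R^2 reported by the model summary
r <- cor(x, y)

print(r_squared)
print(all.equal(r_squared, r^2))       # R^2 equals r squared here
```

For these made-up values R^2 is 0.6, so the model explains 60% of the variation in y.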
What does the total sum of squares describe in contrast to R^2?
TSS measures the total variation in y around its mean; R^2 is the proportion of that total variation explained by the regression model
What is the residual sum of squares (RSS)?
The sum of the squared residuals: the variation in y left unexplained by the regression model
What is ESS equal to? What does ESS stand for?
ESS = TSS - RSS
ESS = explained sum of squares: the amount of variation explained by the regression model
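The three sums of squares can be computed directly from a fitted model. This sketch uses hypothetical data and the standard identity R^2 = ESS / TSS; the variable names tss, rss, and ess are chosen here for illustration.

```r
# Hypothetical data (not from the slides)
x <- c(1, 2, 3, 4, 5)
y <- c(2, 4, 5, 4, 5)

fit <- lm(y ~ x)

tss <- sum((y - mean(y))^2)     # total sum of squares: variation of y around its mean
rss <- sum(residuals(fit)^2)    # residual sum of squares: variation left unexplained
ess <- tss - rss                # explained sum of squares

r_squared <- ess / tss          # proportion of total variation explained

print(c(TSS = tss, RSS = rss, ESS = ess))
print(all.equal(r_squared, summary(fit)$r.squared))
```

For these made-up values TSS = 6, RSS = 2.4, and ESS = 3.6, giving R^2 = 3.6 / 6 = 0.6.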
Complete the sentence: correlation does not equal…
causation
Find R squared for the stress data on slide 702
Answers on slide