correlation and regression are both
procedures used for examining the relationship between two variables. X and Y
similarities and differences between correlation and regression
correlation= strength of relationship of 2 variables regression= predictability of relationship between 2 variables
characteristics of regression situations
important to know in order to to distinguish a correlation from a regression rather than confuse the two
-the researcher is primarily interested in predicting Y from a knowledge of X
(which makes since if you think about the fact that we have a pre-existing/selected knowledge of X already. and Y is free to vary because it is what we are interested in predicting)
characteristics of correlation situations (how correlation differs in comparison to regression)
(and possibly in predicting either variable from a knowledge of the other)
similarities of correlation and regration
both are procedures for assessing the relationship between two variables
why is it important to distinguish between situations that use correlation procedures and regression procedures?
because the assumptions underlying the use of the two procedures differ.
4 ways correlation and regression differ
a bivariate frequency distribution (scatterplot) is a…
representation of the joint frequency of two variables
pearson product-moment correlation coefficient
-a measure of the linear relationship between 2 quantitative variables
(this is the numerical index of correlation)
possibly range for the value of a correlation coefficient
can range from -1 to +1
kind of relationship when coefficient has a value of +1 (r = +1)
positive relationship
kind of relationship when coefficient has a value of -1 (r= -1)
negative or inverse relationship
characteristics if there is no linear association between the variables (r = 0)
-data points fall in a circle
intermediate degrees of association
coefficients less than 0
-1 < r < 0
coefficients greater than 0
0 < r < 1
-data points tend to form an ellipse
two things a coefficient tells you
cross product (numerator of formula)
the product of the scores of two variables that are expressed as deviations from their respective means
data falls in quadrants 1 and 3
positive relationship exists between variables
-data is above or below the mean for both variables
data falls in quadrants 2 and 4
negative or inverse relationship exists between variable
-above the mean on one variable but below the mean on the other
the size of the absolute value of the cross product indicates what?
the strength of the association
covariance
mean of cross product sum
how to obtain a measure of strength of association that is independent of the number of pairs of scores.
compute the mean of the cross product sum (covariance)
and divide the covariance by the standard deviations of x and y.
-the resulting measure of strength of association is independent of the size of the dispersions of x and y variables.