explain the difference between t and z test
Have the value
you’re testing
against (population
mean) but NO
population SD
z-test
have population
mean & SD
Correlations are between what kinds of variables
not looking at group differences, but looking at the associations between variables.
* If two variables are correlated it means that they co-vary.
* Does not imply causation
response variable
dependent
explanatory variable
independent
A researcher would like to know if a mother’s height
can explain how tall her child will be. Which is the
response variable?
a. child’s height
b. mother’s height
c. father’s height
a. child’s height
what do we use correlations for
Two variables that correlate means that as one variable changes, so
does the other. They co-vary.
The Scatter Plot
Shows the relationship between two quantitative
variables measured on the same individuals.
The scatter plot is a visual representation of data, plotting two data distributions in one figure (i.e., two values or scores for each individual)
what does a scatter plot line mean
After plotting two variables on a scatterplot, we describe the relationship by
examining the form, direction, and strength of the association. We look for an
overall pattern …
negative
zero
how do we interpret scatterplots? explain negative and positive association
Correlation Values
How do you get the correlation coefficient?
The Strength of a Correlation
The sign of the relationship between two variables has nothing to do
with its strength.
* Rule of thumb to determine the strength of a correlation (Visual
Statistics, 2009):
* 0 to .3 are considered “weak” correlations
* .3 to .7 are considered “moderate” correlations
* .7 and above are considered “high” correlation
Correlation: properties of r
The Coefficient of Determination
why is correlation useful
Establishing reliability and validity
* Test-retest reliability
* If you just run a t-test between the two occasions, you may end up finding a statistical
difference even on a reliable exam. There is something called “testing effect” where
people may end up doing better the second time they do it.
* However, even if they do end up doing better the second time around, if the exam is
reliable, the two occasions should have strong correlation.
* How different constructs are related to each other
The Pearson Correlation
The most commonly-used correlation value is the Pearson Correlation
(formally the Pearson Product Moment Correlation), with the following characteristics (think assumption checks):
* The correlation is bivariate—there are just two variables involved.
* Both variables are measured on at least an interval scale (i.e., continuous data).
* The variables have a linear relationship.
* No significant outliers.
* The sampling distribution to which the data belong is normally distributed.
* (Usually satisfied when your sample size is large)
Bivariate correlations
refer to the relationship between two variables.
* There can be correlations between:
* nominal variables (think dichotomous variables),
* ordinal variables,
* interval/ratio variables,
* and variables of different data scales.
The Point-Biserial Correlation