Learning objectives
Overview:
- Measurement scales and latent variables
- Reliability
- Validity
- Guidelines for developing a scale
- Using SPSS to develop a scale
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _
Learning objectives:
1. Understand and be able to explain… measurement scales and latent variables.
2. Understand and be able to explain… the difference between the reliability and validity of a scale.
3. Understand and be able to explain… different ways to assess the reliability and validity of a scale.
4. Understand and be able to explain… the different steps to follow to develop a reliable and valid scale.
5. Be able to… conduct, report and interpret reliability and validity analyses.
The issue – how to measure a psychological variable using a questionnaire?
Simply putting together questions to measure a construct for which there is no pre-existing scale leaves the nagging doubt that the items may not be reliable or validly measuring what we aim to investigate.
We can’t directly measure psychological phenomena such as attitudes, so we used questionnaire items.
Scales and latent variables
Y ——–> X1
Reliability
A reliable scale is a scale in which variation in scale scores can be attributed to a latent variable that exerts a causal influence over all the items
i.e. if people respond similarly to all the items, the scale is reliably measuring the effect of the latent variable, all the items are being causally influenced in a similar way by the latent variable
Measures of reliability
List them (3)
Measures of reliability
If the scale is reliable, the two halves of the scale should correlate strongly.
Avoids fatigue effects, can see if this is the case easily
Measures of reliability
Coefficient Alpha (α) indicates the proportion of the variance in the scale scores that is attributable to the true score (score that would be obtained given no error) - how much variance is a result of the latent variable alone.
Coefficient alpha is based on correlations between each scale item score and the total score.
It provides a more accurate measure of internal reliability than a split-half correlation
Scores - goes from 0->1, closer to 1 the better. DeVellis recommends that an α of .60 or below is unacceptable, .80-.90 is seen as very good. Above .90 means the scale can likely be shortened.
Diadvantages of internal consistency:
Measures of reliability
Scores on the test at one point in time are correlated with scores on the test at another point in time.
If the scale has high test-retest reliability, a strong correlation will be found between the two administrations of scale.
***However, low test-retest reliability may reflect temporal instability in the underlying latent variable (e.g. exam stress 4 and 2 weeks before an exam).
Validity
A valid scale is a scale which measures the latent variable of interest (its measuring what we want it to)
You might have a reliable scale but not a valid one i.e. it is accurately measuring a construct but not the one we think it is
Measures of validity
List them (3)
Measures of validity
To what extent does the content of the scale items reflect the latent variable of interest?
e. g. conduct interviews with members of the target population to generate items for the scale
e. g. ask experts to rate the extent to which each item reflects the proposed latent variable of interest
…Do the items look like they’re measuring the right construct?
Increasing face validity –> interviews with target pop or ask experts to rate the items
Does the scale relate to other constructs in line with theoretical expectations?
e. g. administer the scale to a sample of participants along with another related measure. A strong correlation would indicate construct validity.
e. g. a measure of personal strengths would be expected to correlate with a measure of self-esteem.
We expect the scale to correlate reasonably well with some related constructs e.g. depression with anxiety.
Does the scale have an association with some other external criterion?
e. g. correlate scores on the scale with an external criterion/measure.
e. g. a measure of medication beliefs would be expected to correlate with an objective measure of medication adherence.
Association with some other gold standard? Behaviour/ other scale?
Guidelines for Scale Development (DeVellis, 1991)
Guidelines for Scale Development (DeVellis, 1991)
Skewness and Kurtosis
Skewness is where the hump of the bell curve is off to either side.
Kurtosis is where the hump of the bell curve is either super thin or really flat and wide/ non-existent
Both represent abnormal distribution
Investigating Skewness and Kurtosis
Both skewness and kurtosis can be represented by Z scores (standardised scores)
this is where the skewness/ kurtosis value is divided by the standard error of either measure e.g. Z score for kurtosis would be kurtosis/ S.E of kurtosis
If z > 3.08, p < .001,
which indicates highly significant levels of skewness/kurtosis.
we want skewness and kurtosis z scores to be BELOW 3.08 and to sig to a level of p
Example - mindfulness scale, 6 items
a) Distribution - Skewness and Kurtosis:
b) Reliability - Crohbach’s alpha
c) Validity - Construct validity
- does the measure correlate with with a related theoretical construct? if yes its valid…
Validity - Predictive validity
SPSS: Recoding variables
Recode things into different variable name - so you don’t forget you’ve recoded it e.g. item1 => item1r
So,
SPSS: Evaluating items
SPSS: Coefficient Alpha (Reliability Analysis)
SPSS: Combining items into scale
or, go to functions and select mean (good if you have missing data)