What is reliability?
Use an example
Reliability is a measure of consistency
Example: Does an individual get the same result on an IQ test when they take it again
If a particular measurement is repeated and the same result is obtained, then that measurement is reliable.
Reliable = Consistency
What are the ways to assess reliability?
Test-retest
Inter-observer
Note: For both methods, if a significant positive correlation is found between the two sets of results, they can be said to be reliable.
What is test-retest?
Use an example
Test the same person twice or repeat the study using the same procedure, design and measurements.
E.g., The same test or questionnaire is given to the same person two or more different occasions. If the test or questionnaire is reliable the results should be the same (or very similar) each time it is administered.
What is inter-observer?
Use an example
Compares observations from different (two or more) observers.
E.g., Two or more observers compare their data by conducting a pilot study of the observation to check that observers are applying behavioural categories in the same way. Observers should watch the same event, sequence of events, but record their data independently.
How to improve reliability of questionnaires and interviews?
Use structured interviews
Use more closed questions
Make sure questions are clear and unambiguous
Remove questions that are causing inconsistency
How to improve reliability of lab experiments?
Control extraneous variables
Use standardised procedures
How to improve reliability of observations?
Operationalise behavioural categories
Train observers on the behavioural categories
Do a pilot study to trial the behavioural categories
What is validity?
Provide example
Validity is the extent to which a test measures what it claims to measure.
Data can be reliable but not valid.
e.g., an IQ test may produce the same result every time when the same people are tested but not measure what is it is designed to.
What are the types of validity?
Internal
External
Ecological
Temporal
Population
What is internal validity?
Controls within a study e.g., demand characteristics
What is external validity?
The extent to which results can be generalised to other settings, population or eras.
What is ecological validity?
refers to whether findings can be generalised from one setting to another, most particularly generalised to everyday life. This may not relate to the setting (e.g., a lab) but more to the realism of the participant’s task.
What is temporal validity?
the extent to which the findings and conclusions of study are valid when we consider the differences and progressions that come with time. E.g., Asch’s study may lack temporal validity now because it was conducted during a conformist era in US history.
What is population validity?
generalising results to the target population.
What are the ways of assessing validity?
Face validity
Concurrent validity
What is face validity?
how valid your results seem based on what they look like. This is the least scientific method of validity, as it is not quantified using statistical methods.
What is concurrent validity?
Concurrent validity refers to the extent to which the results and conclusions concur with other studies and evidence. If a significant positive correlation is found between the two sets of results, the measure can be said to be valid.
How to improve validity for questionnaires?
Make them anonymous
Incorporate lie scales (questions that no one would honestly answer yes to, e.g. I always tell the truth)
How to improve validity for lab experiments?
Control extraneous variables
Use standardised procedures
Use single- and double-blind procedures
How to improve validity for observations?
Operationalise behavioural categories
Use covert observation
Summary of reliability and validity
Reliability and validity are both about how well a method measures something:
Reliability refers to the consistency of a measure (whether the results can be reproduced under the same conditions).
Validity refers to the accuracy of a measure (whether the results really do represent what they are supposed to measure).