What is reliability?
How consistent something is
What is internal reliability?
Whether a test is consistent within itself a.k.a. are all questions of equal difficulty
A test is said to have high internal reliability if all questions are of equal difficulty
How do you assess internal reliability?
The split-half method
This involves comparing a person’s scores from the first and second half of the test
If the test is high in internal reliability the scores should be similar
How can internal reliability be improved?
By conducting pilot studies to see if the questions are equally hard or easy- questions can be rephrased if some are harder than others
What is external reliability?
Whether results are consistently found when the study is repeated
It’s important for studies to be replicable as external reliability can only be checked if the study is able to be carried out again
How do you assess external reliability?
The test-retest method
This involves doing the study again to see if similar results are obtained
the results can be correlated and a positive correlation is indicative of high external reliability
How do you improve external reliability?
Factors in the experimental setting e.g. EVs could affect participant performance and so control is the main factor that will improve external reliability
Ensuring all participants have the same standardised instructions and investigator effects are eliminated
What is inter-observer/rater reliability?
The consistence between different researchers- if both researchers are collecting the same data
Most relevant to observational studies and content analysis as such studies often have more than one person recording data
How do you assess inter-observer/rater reliability?
Researchers will observe the same situation and use the same method to record the data
Results are compared through the correlation method- whereby a positive correlation will be found if there is high reliability between the researchers
How do you improve inter-observer/rater reliability?
Behavioural categories can be tightened/operationalised further so that they are not ambiguous and observers could receive training in observational techniques
Explain how you would assess inter-rater reliability?