The two components of measurement
measurement = true score + error
(we want as close to true score as possible)
Ways to Reduce Error
Many participants
Many measurements
High frequency or many occasions
Reliability
refers to the consistency/repeatability of the results of a measurement
Types of Reliability
Observers: Inter-Observer reliability
Observations: Internal (Split-half) reliability
Occasions: Test-retest reliability
Inter-Observer Reliability
→ the degree to which observers agree upon an observation or judgement
Example of Poor Inter-Observer Reliability
Rating attractiveness
Internal/Split-Half Reliability
→ the degree to which all of the specific items or observations in a multiple item measure behave the same way
How:
Example of Measuring Split-Half Reliability
Intelligence (split into three domains)
- Verbal intelligence, perceptual reasoning, working memory
Test-Retest Reliability
→ the reliability of a measure to produce the same results at different points in time or occasions
Visual Search Task Example (Test-Retest Reliability)
We need the measurement to remain constant over time
However, Practice effects undermine test-retest reliability
Brain Training Example (Test-Retest Reliability)
Things to improve brain, slow down cognitive decline
There is the question of whether it works
practice effects
improvement on scores in a tasks does not correlate to greater improvement on all tasks
replication
reliability of results across experiments
Validity
→ refers to how well a measure or construct actually measures or represents what it claims to
→ relates to accuracy
Types of Validity
Measurement Validity
how well an operationalised variable corresponds to what it is supposed to measure
Construct Validity
→ how well do your operationalised variables (independent and/or dependent) represent the abstract variables of interest
Example: hunger in rats
- must consider the weight of the amount of food consumed, speed running towards food etc
Content Validity
→ degree to which the items or tasks on a multi-faceted measure accurately measure what its suppose to measure (the target domain)
Example: extroversion measure with 40 different items
Difference between Content Validity and Internal Reliability
Content validity demonstrates that all of the items ACCURATELY measure the construct whereas reliability relates to whether items CONSISTENTLY measure the construct
Criterion Validity
→ measures how well scores on one measure predicts the outcome for another measure
concurrent: compares scores on two current measures to determine whether they are consistent (similarity ensures validity)
predictive: predict the outcome of a current behaviour from a separate behaviour (e.g if you see more of something, will you purchase it in the future)
Internal Validity
→ focussed on whether the research design and evidence allows us to demonstrate a clear causal relationship
Requirements for Causality
External Validity
→ how well a causal relationship can be generalised/replicated across different people, settings, measurements etc
Population Validity
how well your experimental findings can be replicated in a wider population