Reliability, Validity, Utility Flashcards

(48 cards)

1
Q

TRUE OR FALSE?
Psychological assessments are only useful if the tests we use are consistent, accurate, and practical.

A

TRUE.

2
Q

THREE MAJOR CONCEPTS
Consistency

A

Reliability

3
Q

THREE MAJOR CONCEPTS
Accuracy

A

Validity

4
Q

THREE MAJOR CONCEPTS
Practical usefulness

A

Utility

5
Q

Consistency of measurement; the degree to which test scores are stable, dependable, and free from random error.

A

Reliability

6
Q

Key Idea: If I measure the same thing again, will I get the same result?

A

Reliability

7
Q

TYPES OF RELIABILITY
Same test given at two different times; should produce similar scores.

Example: Taking an IQ test in January and again in February.

A

Test-Retest Reliability

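As a numeric companion to this card: test-retest reliability is typically reported as the Pearson correlation between the two administrations. A minimal sketch in Python; the scores below are invented for illustration, not taken from the card.

```python
# Test-retest reliability: correlate scores from two sittings of the
# same test. An r close to 1.0 means scores are stable over time.
def pearson_r(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)

january = [98, 105, 110, 121, 133]    # hypothetical first administration
february = [100, 104, 112, 119, 130]  # same examinees one month later

print(round(pearson_r(january, february), 2))  # near 1.0: highly reliable
```

In practice you would likely use a library routine such as `scipy.stats.pearsonr`; the hand-rolled version just makes the computation visible.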
8
Q

TYPES OF RELIABILITY
Agreement between different scorers/observers.

Example: Two clinicians rating the same patient’s behavior.

A

Inter-Rater Reliability

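Inter-rater agreement is often summarized with Cohen's kappa, which corrects raw percent agreement for the agreement expected by chance alone. (Kappa is not named on the card; it is one common statistic for this.) The two clinicians' ratings below are made up:

```python
# Cohen's kappa: chance-corrected agreement between two raters.
from collections import Counter

def cohens_kappa(rater_a, rater_b):
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    labels = set(rater_a) | set(rater_b)
    expected = sum(freq_a[l] * freq_b[l] for l in labels) / n ** 2
    return (observed - expected) / (1 - expected)

clinician_1 = ["anxious", "calm", "anxious", "calm", "anxious", "calm"]
clinician_2 = ["anxious", "calm", "anxious", "anxious", "anxious", "calm"]

print(round(cohens_kappa(clinician_1, clinician_2), 3))  # 0.667: well beyond chance
```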
9
Q

TYPES OF RELIABILITY
Two different but equivalent versions of a test; should give similar results.

Example: Version A and Version B of an exam.

A

Parallel-Forms Reliability

10
Q

TYPES OF RELIABILITY
Consistency of items within the same test; measured by Cronbach’s alpha.

Example: All items on a depression scale should relate to depression.

A

Internal Consistency

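Cronbach's alpha, mentioned on this card, can be computed straight from its definition: alpha = k/(k−1) · (1 − sum of item variances / variance of total scores). A sketch with hypothetical 1–5 responses on a four-item scale (rows = respondents, columns = items):

```python
# Cronbach's alpha: do items on the same scale vary together?
def variance(xs):
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

def cronbach_alpha(rows):
    k = len(rows[0])                 # number of items
    items = list(zip(*rows))         # column-wise item scores
    totals = [sum(r) for r in rows]  # each respondent's total score
    item_var = sum(variance(col) for col in items)
    return k / (k - 1) * (1 - item_var / variance(totals))

responses = [  # hypothetical 4-item depression-scale answers
    [4, 5, 4, 4],
    [2, 2, 3, 2],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
    [3, 3, 4, 3],
]

print(round(cronbach_alpha(responses), 3))  # 0.959; above ~0.7 is conventionally acceptable
```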
11
Q

Accuracy of measurement; the degree to which a test measures what it is supposed to measure.

A

Validity

12
Q

Key Idea: Am I measuring the right thing?

A

Validity

13
Q

TYPES OF VALIDITY
Does the test cover all relevant aspects of the concept?

Example: A math test that only asks about addition has poor content validity.

A

Content Validity

14
Q

TYPES OF VALIDITY
How well test scores correspond to an external standard (the criterion), either measured at the same time or in the future.

A

Criterion-Related Validity

15
Q

TYPES OF VALIDITY | CRITERION-RELATED VALIDITY
Measured at the same time (e.g., depression test vs. clinical diagnosis).

A

Concurrent Validity

16
Q

TYPES OF VALIDITY | CRITERION-RELATED VALIDITY
Predicts future performance (e.g., SAT predicting college GPA).

A

Predictive Validity

17
Q

TYPES OF VALIDITY
Does the test really measure the theoretical construct it claims to measure?

Example: Does an anxiety scale truly capture anxiety and not just stress or shyness?

A

Construct Validity

18
Q

TYPES OF VALIDITY | CONSTRUCT VALIDITY
Test correlates with similar measures.

A

Convergent Validity

19
Q

TYPES OF VALIDITY | CONSTRUCT VALIDITY
Test does not correlate with unrelated measures.

A

Discriminant Validity

20
Q

Practical value of the test; practicality. The usefulness of a test in a real-world setting, considering both benefits and costs.

A

Utility

21
Q

Key Idea: Is this test worth using?

A

Utility

22
Q

FACTORS AFFECTING UTILITY
A test that isn’t consistent or accurate won’t be useful.

A

Reliability & Validity

23
Q

FACTORS AFFECTING UTILITY
Time, money, effort vs. value gained.

A

Costs vs. Benefits

24
Q

FACTORS AFFECTING UTILITY
Accessible and fair across groups.

A

Fairness

25
Q

FACTORS AFFECTING UTILITY
Easy to administer, score, and interpret.

A

Practicality

26
Q

FACTORS AFFECTING UTILITY
Identify whether this statement relates to Reliability/Validity, Costs vs. Benefits, Fairness, or Practicality:
An IQ test that gives widely different scores to the same student each time won't be useful.

A

Reliability

27
Q

FACTORS AFFECTING UTILITY
Identify whether this statement relates to Reliability/Validity, Costs vs. Benefits, Fairness, or Practicality:
A "creativity test" that actually measures vocabulary is not useful, even if it's consistent.

A

Validity

28
Q

FACTORS AFFECTING UTILITY
Identify whether this statement relates to Reliability/Validity, Costs vs. Benefits, Fairness, or Practicality:
A company considers a complex personality test for hiring. It costs P5,000 per applicant but only slightly improves hiring decisions. The cost outweighs the benefit, so the test has low utility.

A

Costs vs. Benefits

29
Q

FACTORS AFFECTING UTILITY
Identify whether this statement relates to Reliability/Validity, Costs vs. Benefits, Fairness, or Practicality:
A short, free online survey predicts job performance almost as well as the expensive one.

A

Costs vs. Benefits

30
Q

FACTORS AFFECTING UTILITY
Identify whether this statement relates to Reliability/Validity, Costs vs. Benefits, Fairness, or Practicality:
A math placement test written only in English may disadvantage students who are proficient in math but not fluent in English.

A

Fairness

31
Q

FACTORS AFFECTING UTILITY
Identify whether this statement relates to Reliability/Validity, Costs vs. Benefits, Fairness, or Practicality:
A well-designed nonverbal reasoning test avoids language bias, making it fairer and more useful.

A

Fairness

32
Q

FACTORS AFFECTING UTILITY
Identify whether this statement relates to Reliability/Validity, Costs vs. Benefits, Fairness, or Practicality:
A test requiring expensive equipment, three hours of administration, and advanced statistical scoring is impractical for most schools.

A

Practicality

33
Q

FACTORS AFFECTING UTILITY
Identify whether this statement relates to Reliability/Validity, Costs vs. Benefits, Fairness, or Practicality:
A 30-minute test with easy-to-score answer sheets is more practical and therefore more useful.

A

Practicality

34
Q

TRUE OR FALSE?
A test must be reliable to be useful, but reliability alone is not enough. A test must also be valid to ensure it measures what it claims. Finally, a test must have utility to be practical in real-world settings. Together, these three concepts determine whether a psychological test is worth using in practice.

A

TRUE.

35
Q

OTHER KEY CONCEPTS IN PSYCHOLOGICAL ASSESSMENT
The process of giving a test the same way every time to ensure fairness; includes norms (what's "average" for a group).

Example: IQ tests are standardized on large populations to establish what an "average" score is.

A

Standardization

36
Q

OTHER KEY CONCEPTS IN PSYCHOLOGICAL ASSESSMENT
Reference points used to interpret a score.

A

Norms

37
Q

TYPES OF NORMS
Compare a score with those of same-age peers.

A

Age Norms

38
Q

TYPES OF NORMS
Compare a score with those of students in the same grade.

A

Grade Norms

39
Q

TYPES OF NORMS
Tell the percentage of people who scored lower.

A

Percentile Ranks
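A percentile rank is simple to compute. One common convention, used below, counts only scores strictly below the given score; other conventions add half of any ties. The norm-group scores are made up:

```python
# Percentile rank: % of the norm group scoring below a given score.
def percentile_rank(score, norm_scores):
    below = sum(s < score for s in norm_scores)
    return 100 * below / len(norm_scores)

norm_group = [85, 90, 95, 100, 100, 105, 110, 115, 120, 130]

print(percentile_rank(110, norm_group))  # 60.0 -> scored above 6 of 10 peers
```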
40
Q

OTHER KEY CONCEPTS IN PSYCHOLOGICAL ASSESSMENT
No test is perfect; there is always some error. Errors may come from the test itself, the environment, or the test-taker (mood, fatigue).

Observed Score = True Score + Error

A

Errors in Measurement

41
Q

ERRORS IN MEASUREMENT
The score you actually get on a test (e.g., 85/100).

A

Observed Score

42
Q

ERRORS IN MEASUREMENT
Your real level of the trait being measured (e.g., your actual intelligence, ability, or depression level).

A

True Score

43
Q

ERRORS IN MEASUREMENT
Everything else that affects your score but is not part of the true ability (e.g., guessing, bad instructions, being tired, noise in the testing room).

A

Error
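The formula Observed Score = True Score + Error can be illustrated with a tiny simulation: if error is random with mean zero, repeated observed scores scatter around the true score, and their average converges to it. All numbers below are illustrative:

```python
# Classical test theory sketch: observed = true + random error.
import random

random.seed(0)
true_score = 85
# 10,000 hypothetical re-administrations, each with random error (sd = 3)
observed = [true_score + random.gauss(0, 3) for _ in range(10_000)]

mean_observed = sum(observed) / len(observed)
print(round(mean_observed, 1))  # close to 85: error averages out
```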
44
Q

OTHER KEY CONCEPTS IN PSYCHOLOGICAL ASSESSMENT
Examining test questions to see whether they are good measures; includes the difficulty index (easy vs. hard items) and the discrimination index (does the item distinguish between high and low scorers?).

A

Item Analysis

45
Q

ITEM ANALYSIS
Indicates whether an item is easy or hard.

A

Difficulty Index

46
Q

ITEM ANALYSIS
Indicates whether an item distinguishes between high and low scorers.

A

Discrimination Index
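Both indices from these cards can be sketched in a few lines: the difficulty index is the proportion answering the item correctly, and a simple discrimination index (one of several in use) is the difference in that proportion between the top- and bottom-scoring examinees on the whole test. The data are invented:

```python
# Item analysis for one question (1 = correct, 0 = wrong).
def difficulty(item_scores):
    return sum(item_scores) / len(item_scores)

def discrimination(item_scores, total_scores, group_n=3):
    # Rank examinees by total test score, then compare the item's
    # difficulty in the top group vs. the bottom group.
    ranked = sorted(zip(total_scores, item_scores), reverse=True)
    top = [item for _, item in ranked[:group_n]]
    bottom = [item for _, item in ranked[-group_n:]]
    return difficulty(top) - difficulty(bottom)

item = [1, 1, 1, 0, 1, 0, 0, 0, 1, 0]              # answers to one question
totals = [95, 88, 80, 75, 70, 60, 55, 50, 45, 40]  # total test scores

print(difficulty(item))             # 0.5 -> moderately hard
print(discrimination(item, totals)) # positive -> separates high from low scorers
```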
47
Q

OTHER KEY CONCEPTS IN PSYCHOLOGICAL ASSESSMENT
A test must not be unfair to groups based on culture, language, gender, or socioeconomic status.

Example: A math test with word problems written in complex English may disadvantage non-native speakers.

A

Fairness and Bias

48
Q

OTHER KEY CONCEPTS IN PSYCHOLOGICAL ASSESSMENT
Informed consent, confidentiality, appropriate use, test security. Tests are powerful tools; misuse can harm people.

A

Ethical Issues in Testing