What is validity?
A judgment or estimate of how well a test measures what it’s supposed to measure in a particular context
What is the relationship between validity and reliability?
Reliability is required but not sufficient for validity
What is validation? Who plays a role in it?
The process of gathering and evaluating evidence about validity
This can be done by both test developers and test users
What is local validation?
When test users aim to determine the validity of a test within their own local settings or conditions, using their own group of test takers
What are the 3 main categories of validity (from easiest to hardest to establish)?
Content → Criterion-Related → Construct
What is Content Validity?
How well a test samples behaviors that are representative of the broader set of behaviors that it’s designed to measure
In other words, it measures how well test items/topics adequately represent the content that should be included based on the operational definition being used
What is Face Validity?
A form of content validity, it is a judgment concerning how relevant the test items appear to be on the face of it
This is the simplest form of validity to establish, but some tests are intentionally designed to have low levels of it
What is a Test Blueprint?
Part of the process of creating content validity, it is a plan regarding the types of information covered by the items, the number of items tapping into each area of coverage, and the organization of the items in the test
How do we typically establish content validity?
Expert panels: obtain expert ratings on the degree of item importance and scrutinize what is missing from the measure
Focus Groups: having the general population react to the measure
What is Criterion-Related Validity?
Evaluates the relationship between scores obtained on one test and scores obtained on other tests or measures
What is a criterion? What does it need in order to be adequate?
A standard against which a test or test score is evaluated
Must be…
1- relevant to the matter at hand
2- valid for the purpose for which it's being used
3- uncontaminated, as in it cannot be a part of the predictor
What ways can we establish criterion-related validity (in order from easiest to hardest to establish)?
1 - Concurrent validity
2 - Predictive validity
3 - Incremental validity
What is Concurrent Validity?
The degree to which a test score is related to some criterion measure obtained at the SAME time
What is Predictive Validity?
The degree to which a test score predicts some criterion measure (or outcome) obtained at a FUTURE time
What is a Base Rate? How does it influence predictive validity?
The extent to which a phenomenon exists in the population
The less frequent it is, the more difficult it would be to show predictive validity
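A minimal sketch of why this is so, using hypothetical numbers: hold a test's sensitivity (true-positive rate) and specificity (true-negative rate) fixed, and watch how the proportion of correct positive predictions collapses when the base rate drops.

```python
# Sketch (hypothetical numbers, not from the text): with sensitivity and
# specificity held constant, a rare phenomenon makes positive predictions
# far less trustworthy than a common one.

def positive_predictive_value(base_rate, sensitivity=0.90, specificity=0.90):
    """Proportion of positive predictions that are true positives."""
    true_pos = base_rate * sensitivity
    false_pos = (1 - base_rate) * (1 - specificity)
    return true_pos / (true_pos + false_pos)

# Common phenomenon (base rate 50%): most positive calls are correct.
common = positive_predictive_value(base_rate=0.50)   # 0.90
# Rare phenomenon (base rate 1%): the same test is now wrong on most positive calls.
rare = positive_predictive_value(base_rate=0.01)     # ~0.083

print(round(common, 3), round(rare, 3))
```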
What is the Hit Rate when establishing predictive validity? What are the two kinds?
The ability of the measure to accurately predict results
Two possibilities…
1- true positive
2- true negative
What is the Miss Rate when establishing predictive validity? What are the two kinds?
Failure to identify something accurately
Two possibilities…
1- False positive or Type I error: saying that something will happen and then it does not
2- False negative or Type II error: saying that something will not happen and then it does
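The four outcomes above can be tallied directly from predictions versus actual outcomes. A sketch with made-up data (1 = phenomenon present, 0 = absent):

```python
# Sketch (made-up data): counting hits (true positives/negatives) and
# misses (false positives/negatives) for a predictor against outcomes.

predicted = [1, 1, 0, 0, 1, 0, 1, 0]
actual    = [1, 0, 0, 0, 1, 1, 1, 0]

true_pos  = sum(p == 1 and a == 1 for p, a in zip(predicted, actual))  # hit
true_neg  = sum(p == 0 and a == 0 for p, a in zip(predicted, actual))  # hit
false_pos = sum(p == 1 and a == 0 for p, a in zip(predicted, actual))  # Type I miss
false_neg = sum(p == 0 and a == 1 for p, a in zip(predicted, actual))  # Type II miss

hit_rate  = (true_pos + true_neg) / len(actual)
miss_rate = (false_pos + false_neg) / len(actual)

print(hit_rate, miss_rate)
```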
What is the Validity Coefficient? What is it affected by?
A correlation coefficient between test scores and scores on the criterion measure
Affected by restriction or inflation of range
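A sketch with made-up scores: the validity coefficient is just the Pearson r between test and criterion scores, and restricting the sample's range (e.g., keeping only the top half of test takers) typically shrinks it.

```python
from math import sqrt

# Sketch (made-up scores): Pearson r as a validity coefficient, and the
# effect of restriction of range on it.

def pearson_r(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return cov / sqrt(sum((a - mx) ** 2 for a in x)
                      * sum((b - my) ** 2 for b in y))

test_scores = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
criterion   = [2, 1, 4, 3, 6, 5, 8, 7, 10, 9]

r_full = pearson_r(test_scores, criterion)                # full range
r_restricted = pearson_r(test_scores[5:], criterion[5:])  # top half only

print(round(r_full, 2), round(r_restricted, 2))
```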
What is Incremental Validity?
The degree to which an additional predictor explains something about the criterion measures that is not explained by predictors already being used
Essentially saying "this test adds to the prediction of the criterion beyond the tests already in use"
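A sketch with assumed correlations (hypothetical values, not from the text): with two predictors, the criterion variance they jointly explain can be computed from the pairwise correlations, and incremental validity is the gain over the first predictor alone.

```python
# Sketch (hypothetical correlations): incremental validity of a new test
# (predictor 2) over a test already in use (predictor 1), via the standard
# two-predictor multiple-correlation formula.

def r_squared_two_predictors(r1, r2, r12):
    """R^2 of the criterion on both predictors.
    r1, r2: each predictor's validity coefficient;
    r12: the correlation between the two predictors."""
    return (r1**2 + r2**2 - 2 * r1 * r2 * r12) / (1 - r12**2)

r1, r2, r12 = 0.50, 0.40, 0.30          # assumed values for illustration
r2_both = r_squared_two_predictors(r1, r2, r12)
incremental = r2_both - r1**2           # variance explained beyond predictor 1

print(round(r2_both, 3), round(incremental, 3))
```

Note that if the new predictor were highly correlated with the old one (r12 near 1), the incremental gain would approach zero even if its own validity coefficient were respectable.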
What is Construct Validity? How do we acquire evidence for it?
The ability of a test to measure a theorized construct. Essentially, does the measure map onto the THEORY the way we would expect it to (as in, do high scorers and low scorers behave as theorized?)
Establishing content and criterion-related validity will also provide evidence for construct validity, but construct validity requires additional evidence beyond those
What are the different forms of evidence for construct validity? (7 things)
1- evidence of homogeneity
2- evidence of changes
3- evidence of pretest/posttest changes
4- evidence from distinct groups
5- convergent evidence
6- discriminant evidence
7- factor analysis
What is evidence of homogeneity?
How uniform a test is in measuring a single construct (established using evidence from internal reliability)
Ex: if I believe that my construct is narrow, then my internal consistency should be high
What is evidence of changes?
Whether the construct changes over time in the way it's expected to, established using evidence from test-retest reliability
What is evidence of posttest or retest changes?
Test scores change as a result of some kind of experience or intervention between pretest and posttest, established using evidence from dynamic assessment