EPPP Test Construction Wrong Flashcards

(18 cards)

1
Q

Positive hit rate

A

percent of positives on the predictor who were actually successful on the criterion

true positives / total number positives

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

To ascertain if the test you have developed is valid as a screening test for determining whether a person has an anxiety or affective disorder, you would be most interested in evaluating the test’s

A

Concurrent validity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

To increase false negatives:

A

raise the predictor cutoff and lower the criterion cutoff

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Standard error of estimate

A

estimate the range within which an examinee’s true criterion score is likely to fall given their predicted score on the criterion.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is the mean and SD of a t-score

A

mean of 50, SD of 10

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

item difficulty index

A

ranges from 0 (no one answers the item correctly) to 1.0 (everyone in a sample answers the item correctly)

.x = x% answered it right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

correction for attenuation formula

A

used to estimate the predictor’s validity coefficient if the predictor and/or criterion were perfectly reliable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

kappa statistic

A

estimates inter-rater reliability removing the effects of chance agreement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

distribution of percentile ranks

A

EVENLY distributed

distribution “curve” looks like a rectangle

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Assuming no constraints in terms of time, money, or other resources, what is the best way to demonstrate that a test has adequate reliability?

A

equivalent (alternate) forms

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Cronbach’s alpha

A

used to determine internal consistency reliability

Calculates average reliability with all possible test splits

underestimates reliability, but less so than regular split-half

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Criterion contamination

A

when a rater’s knowledge of a person’s predictor performance biases how he/she rates the person on the criterion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

KR-20

A

used to determine internal consistency reliability when items are scored dichotomously

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

To maximize the inter-rater reliability of a behavioral observation scale, you should make sure that coding categories:

A

are mutually exclusive and well-operationally-defined

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

standard error of estimate

A

standard deviation of the criterion score times the square root of one minus the validity coefficient squared

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what factors affect the size of the reliability coefficient?

A

test length (longer test, larger coeff), range of scores (larger range, larger coeff), guessing (higher probability of guessing correct, smaller coeff)

17
Q

After reviewing the data collected on a new selection test during the course of a criterion-related validity study on new hires, a psychologist decides to lower the selection test cutoff score. Apparently the psychologist is hoping to do which of the following?

A

Decrease the number of false negatives

18
Q

When a test user uses a correction for guessing formula that involves subtracting points from each examinee’s scores, the resulting distribution of scores will have a ____________________ than the original (non-corrected) distribution.

A

lower mean and larger standard deviation