Test Construction Flashcards

(103 cards)

1
Q

What does adequate reliability in a test indicate?

A

Test scores can be expected to be consistent

Adequate reliability does not imply that the test measures what it was designed to measure.

2
Q

What is validity traditionally defined as?

A

The degree to which a test accurately measures what it was designed to measure

3
Q

What are the three types of validity?

A
  • Content Validity
  • Construct Validity
  • Criterion-Related Validity
4
Q

How do the AERA, APA, & NCME (2014) Standards define validity?

A

The degree to which evidence and theory support the interpretation of test scores for proposed uses of tests

5
Q

What are the five sources of validity evidence identified?

A
  • Evidence based on test content
  • The response process
  • The internal structure of the test
  • Relationships with other variables
  • The consequences of testing
6
Q

What is content validity?

A

Evidence that a test measures one or more content or behavior domains

7
Q

How is content validity established?

A

By clearly defining the domain to be assessed and including items that are a representative sample of that domain

8
Q

What is face validity?

A

The extent to which test items ‘look valid’ to examinees

9
Q

True or False: Face validity is an actual type of validity.

A

False

Face validity is important but not a formal type of validity.

10
Q

What does construct validity refer to?

A

Evidence that a test measures a hypothetical trait inferred from behavior

11
Q

What are convergent and divergent validity?

A
  • Convergent Validity: High correlations with scores on related constructs
  • Divergent Validity: Low correlations with scores on unrelated constructs
12
Q

What is the multitrait-multimethod matrix?

A

A table of correlation coefficients providing information about a test’s reliability and validity

13
Q

What is the monotrait-monomethod coefficient?

A

A reliability coefficient for the test being validated

14
Q

What does a large monotrait-heteromethod coefficient indicate?

A

Evidence of the test’s convergent validity

15
Q

What does a small heterotrait-monomethod coefficient indicate?

A

Evidence of the test’s divergent validity

16
Q

What is factor analysis used for?

A

Assessing a test’s convergent and divergent validity

17
Q

What are the four basic steps in factor analysis?

A
  • Administer the test and measures of related traits
  • Correlate scores and list in a correlation matrix
  • Derive the initial factor matrix
  • Rotate the factor matrix and interpret
18
Q

What is a factor loading?

A

A correlation coefficient indicating the correlation between each test and each identified factor

19
Q

How is communality calculated?

A

By squaring and adding the factor loadings when factors are orthogonal

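The calculation above can be sketched in Python; the loadings below are hypothetical values for a single test on two orthogonal factors:

```python
# Communality = proportion of a test's variance explained by the identified factors.
# For orthogonal (uncorrelated) factors, it is the sum of the squared factor loadings.
loadings = [0.60, 0.50]  # hypothetical loadings on two orthogonal factors

communality = sum(loading ** 2 for loading in loadings)
print(round(communality, 2))  # 0.61 (= 0.36 + 0.25)
```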
20
Q

What does a high correlation with the factor indicate in factor analysis?

A

Evidence of convergent validity

21
Q

What does a low correlation with the factor indicate in factor analysis?

A

Evidence of divergent validity

22
Q

What is the purpose of rotating the factor matrix?

A

To produce data that is easier to interpret

23
Q

What is the significance of naming factors in factor analysis?

A

It aids in interpreting the relationships between tests and underlying constructs

24
Q

What is criterion-related validity?

A

The degree to which scores on a test predict or estimate scores on another measure.

It is crucial for tests used in contexts like hiring decisions.

25
What are the two types of criterion-related validity?
  • Concurrent Validity
  • Predictive Validity
26
What is concurrent validity?
Evaluated by obtaining scores on the predictor and the criterion at about the same time. Important for estimating current status.
27
What is predictive validity?
Evaluated by obtaining scores on the predictor before obtaining scores on the criterion. Important for estimating future status.
28
What is the range of the criterion-related validity coefficient?
-1 to +1
29
What does a higher criterion-related validity coefficient indicate?
More accurate predictor scores for predicting criterion scores.
30
How can the amount of variability in one measure explained by another be determined?
By squaring the criterion-related validity coefficient.
31
What is shrinkage in the context of cross-validation?
The phenomenon where the correlation coefficient for a new sample is likely to be smaller than the original coefficient.
32
What is the standard error of estimate?
A measure of prediction error in criterion-related validity studies.
33
How is a 95% confidence interval constructed?
By adding and subtracting two standard errors of estimate from the predicted criterion score.
34
What is the formula for calculating the standard error of estimate?
Standard deviation of the criterion measure times the square root of (1 - validity coefficient squared).
35
What happens to the standard error of estimate when the validity coefficient is +1 or -1?
The standard error is 0.
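The formula from the previous card can be sketched in Python; the standard deviation and validity coefficient below are hypothetical values. Note that a validity coefficient of ±1 yields a standard error of 0, as the card states.

```python
import math

def standard_error_of_estimate(sd_criterion, validity_coef):
    """SEE = SD of the criterion times sqrt(1 - validity coefficient squared)."""
    return sd_criterion * math.sqrt(1 - validity_coef ** 2)

# Hypothetical values: criterion SD of 10, validity coefficient of .60.
print(round(standard_error_of_estimate(10, 0.60), 2))  # 8.0
# Perfect validity means no prediction error:
print(standard_error_of_estimate(10, 1.0))             # 0.0
```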
36
What does correction for attenuation address?
The impact of measurement error on the magnitude of the criterion-related validity coefficient.
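A minimal sketch of the standard correction-for-attenuation formula, which estimates the validity coefficient that would be obtained if the predictor and criterion were perfectly reliable; the input values below are hypothetical.

```python
import math

def correct_for_attenuation(r_xy, rel_x, rel_y):
    """Corrected r = observed r divided by sqrt(predictor reliability * criterion reliability)."""
    return r_xy / math.sqrt(rel_x * rel_y)

# Hypothetical values: observed validity .42, predictor reliability .81,
# criterion reliability .64.
print(round(correct_for_attenuation(0.42, 0.81, 0.64), 3))  # 0.583
```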
37
What is incremental validity?
The increase in prediction accuracy by adding a new predictor to existing methods.
38
What method can be used to estimate incremental validity?
Using the Taylor-Russell tables.
39
What are true positives?
Recently hired employees who obtained high scores on both the predictor and criterion.
40
What are false positives?
Recently hired employees who obtained high scores on the predictor but low scores on the criterion.
41
What are true negatives?
Recently hired employees who obtained low scores on both the predictor and criterion.
42
What are false negatives?
Recently hired employees who obtained low scores on the predictor but high scores on the criterion.
43
How is the base rate calculated?
By dividing the number of employees with high criterion scores by the total number of employees.
44
What is diagnostic efficiency?
The ability of a test to correctly distinguish between people who do and do not have a disorder.
45
What does sensitivity measure?
The proportion of people who have the disorder and are correctly identified as such by the test.
46
What does specificity measure?
The proportion of people who do not have the disorder and are correctly identified as such by the test.
47
What is the hit rate?
The proportion of people correctly categorized by the test.
48
What is the positive predictive value?
The probability that a person who tests positive actually has the disorder.
49
What is the negative predictive value?
The probability that a person who tests negative does not have the disorder.
50
What affects the positive and negative predictive values?
The prevalence of the disorder in each setting.
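The diagnostic-efficiency indices from cards 45 through 49 can be computed from a 2x2 classification table; this sketch uses hypothetical counts.

```python
def diagnostic_efficiency(tp, fp, tn, fn):
    """Standard diagnostic indices from a 2x2 table
    (tp/fp/tn/fn = true/false positives and negatives)."""
    return {
        "sensitivity": tp / (tp + fn),  # proportion of disordered cases the test detects
        "specificity": tn / (tn + fp),  # proportion of non-disordered cases the test clears
        "hit rate": (tp + tn) / (tp + fp + tn + fn),
        "PPV": tp / (tp + fp),          # P(disorder | positive result)
        "NPV": tn / (tn + fn),          # P(no disorder | negative result)
    }

# Hypothetical counts: 40 true positives, 10 false positives,
# 80 true negatives, 20 false negatives.
for name, value in diagnostic_efficiency(40, 10, 80, 20).items():
    print(name, round(value, 2))
```

Changing the counts to reflect a lower prevalence (fewer tp and fn relative to fp and tn) lowers the PPV and raises the NPV, which is the point made in card 50.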
51
How does reliability affect validity?
A predictor's reliability places a ceiling on its validity.
52
What relationship exists between a predictor's validity coefficient and its reliability index?
The validity coefficient can be no greater than its reliability index.
53
What is the reliability index?
The square root of the predictor's reliability coefficient.
54
What are norm-referenced scores?
Scores that indicate how well an examinee did on the test compared to individuals in the standardization sample. They are designed to make distinctions among individuals or groups in terms of the ability or trait assessed by the test.
55
What is the primary objective of using norm-referenced scores?
To make distinctions among individuals or groups in terms of the ability or trait assessed by a test (Urbina, 2014, p. 212).
56
What are percentile ranks?
Indicate the percentage of examinees in the reference group who scored at or below the score obtained by an examinee. For example, if a raw score of 75 has a percentile rank of 82, then 82% of examinees scored 75 or lower.
57
How is the conversion of raw scores to percentile ranks described?
As a nonlinear transformation. This is because the percentile rank distribution is always rectangular (flat), regardless of the shape of the raw score distribution.
58
What are standard scores?
Scores indicating how well an examinee did on a test in terms of standard deviations from the mean score of the reference group. Standard scores include z-scores, T-scores, IQ scores, and stanines.
59
What transformation is used to convert raw scores to standard scores?
Linear transformation. The distribution of standard scores has the same shape as the raw score distribution.
60
What is a z-score?
A standard score with a mean of 0 and standard deviation of 1.0. It expresses an examinee's score in terms of standard deviations from the mean.
61
How is a z-score calculated?
z = (X – M)/SD, where X is the raw score, M is the mean, and SD is the standard deviation.
62
What does a T-score of 40 indicate?
That the raw score is one standard deviation below the mean. T-scores have a mean of 50 and standard deviation of 10.
63
What is the mean and standard deviation for full-scale IQ scores on the SB-5 and Wechsler tests?
Mean of 100 and standard deviation of 15. An IQ score of 85 indicates a score one standard deviation below the mean.
64
What are stanines?
Standard scores with a mean of 5 and standard deviation of 2, ranging from 1 to 9. Each stanine represents one-half of a standard deviation, except for the extremes.
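The standard-score conversions from cards 60 through 64 can be sketched as simple linear transformations of the z-score; the raw score mean and SD below are hypothetical:

```python
def to_z(raw, mean, sd):
    """z-score: raw score expressed in SD units from the reference-group mean."""
    return (raw - mean) / sd

def z_to_t(z):
    return 50 + 10 * z    # T-scores: mean 50, SD 10

def z_to_iq(z):
    return 100 + 15 * z   # SB-5/Wechsler full-scale IQ: mean 100, SD 15

def z_to_stanine(z):
    # Stanines: mean 5, SD 2, truncated to the 1-9 range at the extremes.
    return min(9, max(1, round(5 + 2 * z)))

# Hypothetical raw score one SD below the mean (reference-group mean 60, SD 12):
z = to_z(48, 60, 12)
print(z, z_to_t(z), z_to_iq(z), z_to_stanine(z))  # -1.0 40.0 85.0 3
```

A T-score of 40 and an IQ of 85 both correspond to z = -1, matching cards 62 and 63.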
65
What is the primary objective of using criterion-referenced scores?
To evaluate a person’s or group’s degree of competence or mastery in terms of a preestablished standard (Urbina, 2014, p. 121).
66
What do percentage scores indicate?
The percentage of test items answered correctly. For instance, answering 75 of 150 items correctly results in a percentage score of 50%.
67
What are expectancy tables used for?
To provide information on an examinee’s expected score on another measure based on their obtained test score. Expectancy tables predict future criterion scores from predictor scores.
68
What is a cutoff score?
A predetermined score that distinguishes between mastery and non-mastery. For example, a cutoff of 80% correct identifies examinees who have achieved mastery.
69
How does ranking work in selection decisions?
Candidates are ranked from highest to lowest based on their test scores and are selected from the top down until the desired number is chosen.
70
What is banding in the context of test scores?
Grouping test scores into bands determined by the test’s standard error of measurement. Scores within each band are considered equivalent.
71
Why do advocates support banding?
It helps reduce adverse impact on members of minority groups who tend to receive lower test scores. Candidates within a band are selected on the basis of experience or skills rather than test scores alone.
72
What is Classical Test Theory (CTT)?
A theory of measurement used for developing and evaluating tests, also known as true score test theory.
73
What is the formula for obtained test scores in Classical Test Theory?
X = T + E, where X is the obtained score, T is the true score, and E is the measurement error.
74
What does true score variability represent in CTT?
Actual differences among examinees regarding what the test is measuring.
75
What is measurement error in the context of CTT?
Random factors that affect test performance in unpredictable ways.
76
Define test reliability.
The extent to which a test provides consistent information.
77
What does a reliability coefficient indicate?
The proportion of variability in obtained test scores that is due to true score variability.
78
What is the range of reliability coefficients?
0 to 1.0.
79
What reliability coefficient is considered minimally acceptable for many tests?
.70 or higher.
80
List the four main methods for assessing a test's reliability.
  • Test-Retest Reliability
  • Alternate Forms Reliability
  • Internal Consistency Reliability
  • Inter-Rater Reliability
81
What does test-retest reliability measure?
The consistency of scores over time.
82
What is alternate forms reliability?
The consistency of scores over different forms of the test.
83
What is internal consistency reliability?
The consistency of scores over different test items.
84
What is coefficient alpha, also known as Cronbach's alpha?
A method for evaluating internal consistency reliability.
85
What does split-half reliability involve?
Splitting the test in half and correlating the scores on the two halves.
86
What is inter-rater reliability?
The consistency of scores or ratings assigned by different raters.
87
What is Cohen's kappa coefficient?
An inter-rater reliability coefficient corrected for chance agreement.
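Kappa's correction for chance agreement can be sketched from a square agreement table; the counts below are hypothetical ratings of 50 cases by two raters.

```python
def cohens_kappa(table):
    """Kappa from a square agreement table: table[i][j] = number of cases
    rater A placed in category i and rater B placed in category j."""
    n = sum(sum(row) for row in table)
    k = len(table)
    observed = sum(table[i][i] for i in range(k)) / n          # raw agreement rate
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(k)) for j in range(k)]
    # Agreement expected by chance, from the raters' marginal category rates:
    expected = sum(row_totals[i] * col_totals[i] for i in range(k)) / n ** 2
    return (observed - expected) / (1 - expected)

# Hypothetical two-category ratings by two raters:
print(round(cohens_kappa([[20, 5], [10, 15]]), 2))  # 0.4
```

Here raw agreement is .70, but half of it is expected by chance, so kappa is only .40.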
88
What is consensual observer drift?
When raters communicate while assigning ratings, leading to increased consistency but decreased accuracy.
89
What factor affects the reliability coefficient related to content?
Content Homogeneity.
90
How does the range of scores affect reliability coefficients?
Reliability coefficients are larger when test scores are unrestricted in range.
91
What effect does guessing have on reliability coefficients?
The easier it is to guess correctly (e.g., on true/false items), the lower the reliability coefficient.
92
What is the difference between reliability index and reliability coefficient?
The reliability coefficient is the proportion of observed score variance due to true score variance, while the reliability index is the theoretical correlation between observed and true test scores (the square root of the reliability coefficient).
93
How is item difficulty for dichotomously scored items calculated?
p = Number of correct answers / Total number of examinees.
94
What is an acceptable item discrimination index (D) value?
A D value of .30 or higher.
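The item statistics from the last two cards can be sketched as follows; the counts and group proportions are hypothetical.

```python
def item_difficulty(n_correct, n_examinees):
    """p = proportion of examinees answering the item correctly
    (0 = hardest, 1 = easiest)."""
    return n_correct / n_examinees

def discrimination_index(p_upper, p_lower):
    """D = item difficulty in the upper-scoring group minus
    item difficulty in the lower-scoring group."""
    return p_upper - p_lower

print(item_difficulty(30, 60))                       # 0.5
print(round(discrimination_index(0.80, 0.40), 2))    # 0.4 (>= .30 is acceptable)
```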
95
What does the standard error of measurement indicate?
An estimate of measurement error used to construct a confidence interval indicating the range within which an examinee's true score is likely to fall, given their obtained score.
96
How do you calculate the standard error of measurement?
Multiply the test's standard deviation by the square root of 1 minus the reliability coefficient.
97
What are the confidence intervals for test scores based on the standard error of measurement?
68%: ±1 SEM, 95%: ±2 SEM, 99%: ±3 SEM.
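The SEM formula and the confidence intervals built from it can be sketched in Python; the SD and reliability values below are hypothetical.

```python
import math

def sem(sd, reliability):
    """Standard error of measurement: SD times sqrt(1 - reliability coefficient)."""
    return sd * math.sqrt(1 - reliability)

def confidence_interval(obtained, sd, reliability, n_sem=2):
    """Approximate CI around an obtained score
    (±1 SEM ~ 68%, ±2 SEM ~ 95%, ±3 SEM ~ 99%)."""
    error = n_sem * sem(sd, reliability)
    return (obtained - error, obtained + error)

# Hypothetical test with SD 15 and reliability .91:
print(round(sem(15, 0.91), 2))  # 4.5
low, high = confidence_interval(100, 15, 0.91)  # 95% interval
print(round(low), round(high))  # 91 109
```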
98
What is Item Response Theory (IRT)?
An alternative to CTT that focuses on individual test items rather than total test scores.
99
What does IRT allow for in test development?
Determining the probability of a specific examinee correctly answering any test item.
100
What are the three item parameters depicted in an item characteristic curve (ICC)?
  • Difficulty parameter
  • Discrimination parameter
  • Probability of guessing correctly
101
What does the difficulty parameter in IRT indicate?
The level of the trait required for a 50% probability of answering the item correctly.
102
How is the discrimination parameter indicated in IRT?
By the slope of the ICC.
103
What does the point where the ICC crosses the y-axis represent?
The probability of guessing correctly.
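The three parameters can be illustrated with the three-parameter logistic (3PL) model, one common form of the ICC (the cards do not name a specific model, so this is an assumption, and the parameter values below are hypothetical). Note that with a nonzero guessing parameter c, the probability at theta = b is halfway between c and 1; it equals exactly 50% only when c = 0.

```python
import math

def icc_3pl(theta, a, b, c):
    """Three-parameter logistic ICC: probability of a correct response
    at trait level theta. a = discrimination (slope of the ICC),
    b = difficulty, c = guessing (lower asymptote)."""
    return c + (1 - c) / (1 + math.exp(-a * (theta - b)))

# At theta equal to the difficulty parameter b, the probability is
# halfway between the guessing level c and 1:
print(round(icc_3pl(theta=1.0, a=1.2, b=1.0, c=0.2), 2))   # 0.6
# At very low trait levels the probability approaches c:
print(round(icc_3pl(theta=-6.0, a=1.2, b=1.0, c=0.2), 2))  # 0.2
```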