Item difficulty
P = (number of examinees passing the item) / (total number of examinees)
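As a quick sketch, the difficulty index p can be computed directly from dichotomous item responses (the data below are invented for illustration):

```python
# Item difficulty: proportion of examinees who passed the item.
# responses: 1 = pass, 0 = fail (illustrative data)
responses = [1, 1, 0, 1, 0, 1, 1, 0, 1, 1]

p = sum(responses) / len(responses)
print(p)  # 0.7 -> a moderately easy item
```

Higher p means an easier item; values near .50 tend to maximize discrimination.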
Item discrimination
refers to the extent to which an item differentiates between examinees
who obtain low or high scores on the test or an external criterion
*Symbolized by the letter “D”
*Ranges from -1 to +1
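One common way to compute D is to compare the proportion passing the item in the highest- and lowest-scoring groups (often the top and bottom 27%; this sketch uses halves, and the data are invented):

```python
# Discrimination index: D = p_upper - p_lower, where the groups are the
# examinees with the highest and lowest total test scores.
# Each tuple: (total test score, item response: 1 = pass, 0 = fail)
examinees = [(95, 1), (90, 1), (88, 1), (85, 0),
             (60, 0), (55, 1), (50, 0), (45, 0)]

examinees.sort(key=lambda e: e[0], reverse=True)
n_group = len(examinees) // 2          # top and bottom halves here
upper = examinees[:n_group]
lower = examinees[-n_group:]

p_upper = sum(item for _, item in upper) / n_group
p_lower = sum(item for _, item in lower) / n_group
D = p_upper - p_lower
print(D)  # positive D: high scorers pass the item more often than low scorers
```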
Internal consistency reliability
*Indicates the degree of consistency across different test items
* is useful for estimating the reliability of tests that measure characteristics that fluctuate over time or are susceptible to memory or practice effects
Cronbach’s coefficient alpha
Mean of all possible split-half correlation coefficients
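The card describes alpha conceptually (mean of all possible split-half coefficients); computationally it is usually obtained from item and total-score variances. A minimal sketch with an invented item-score matrix:

```python
# Cronbach's alpha via the variance formula:
# alpha = k/(k-1) * (1 - sum(item variances) / variance(total scores))
# Illustrative matrix: rows = examinees, columns = items
scores = [
    [3, 4, 3, 5],
    [2, 2, 3, 2],
    [4, 5, 4, 5],
    [3, 3, 2, 3],
    [5, 4, 5, 4],
]

def variance(xs):
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)  # sample variance

k = len(scores[0])
item_vars = [variance([row[i] for row in scores]) for i in range(k)]
total_var = variance([sum(row) for row in scores])

alpha = (k / (k - 1)) * (1 - sum(item_vars) / total_var)
print(round(alpha, 3))
```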
Inter-rater reliability
Used for measures that are subjectively scored (e.g., essays and projective tests)
* uses kappa statistic
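Cohen's kappa corrects the raters' raw agreement rate for agreement expected by chance. A sketch with invented ratings from two raters:

```python
# Cohen's kappa for two raters assigning categorical scores:
# kappa = (p_o - p_e) / (1 - p_e), where p_o is observed agreement and
# p_e is chance agreement from each rater's marginal proportions.
rater_a = ["pass", "pass", "fail", "pass", "fail", "fail", "pass", "pass"]
rater_b = ["pass", "fail", "fail", "pass", "fail", "pass", "pass", "pass"]

n = len(rater_a)
categories = set(rater_a) | set(rater_b)

p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n
p_e = sum((rater_a.count(c) / n) * (rater_b.count(c) / n) for c in categories)

kappa = (p_o - p_e) / (1 - p_e)
print(round(kappa, 3))  # 0 = chance-level agreement, 1 = perfect agreement
```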
Test-retest reliability
*administering the same test to the same examinees on two occasions and correlating the two sets of scores
*appropriate for determining reliability of tests designed to measure attributes that are relatively stable over time and that are not affected by repeated measurement.
Factors that affect the reliability coefficient
*test length (longer tests are generally more reliable)
*range of scores (greater heterogeneity of examinees yields a higher coefficient)
*guessing (more guessing lowers reliability)
standard error of measurement (SEM)
index of the amount of error that can be expected in obtained scores due to the unreliability of the test.
Confidence interval
helps a test user estimate the range within which an examinee’s true score is likely to fall given his or her obtained score. This range is calculated using the standard error of measurement
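The two cards above combine as follows: SEM = SD × √(1 − r), and a confidence interval is built around the obtained score using SEM. A sketch with illustrative values (SD and reliability are invented):

```python
import math

# SEM = SD * sqrt(1 - r), where SD is the standard deviation of test scores
# and r is the test's reliability coefficient (illustrative values)
sd = 15.0     # e.g., a deviation-IQ-style scale
r = 0.91      # reliability coefficient

sem = sd * math.sqrt(1 - r)

# 95% confidence interval around an obtained score: score +/- 1.96 * SEM
obtained = 110
lower = obtained - 1.96 * sem
upper = obtained + 1.96 * sem
print(round(sem, 2), round(lower, 1), round(upper, 1))
```

The higher the reliability, the smaller the SEM and the narrower the interval.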
Classical test theory
Observed-score variability is a combination of true-score variance and random measurement error (X = T + E)
Kuder-Richardson formula 20 (KR-20)
Does same as Cronbach’s coefficient alpha, but is used as a substitute when test items are scored dichotomously
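For dichotomous items, KR-20 replaces the item variances in the alpha formula with p·q for each item. A sketch with an invented pass/fail matrix:

```python
# KR-20 for dichotomously scored items:
# KR20 = k/(k-1) * (1 - sum(p_i * q_i) / var(total scores))
# Illustrative matrix: rows = examinees, columns = items (1 = pass, 0 = fail)
scores = [
    [1, 1, 1, 0],
    [1, 0, 1, 1],
    [0, 0, 1, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
]

def variance(xs):
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)  # sample variance

k = len(scores[0])
pq = 0.0
for i in range(k):
    p = sum(row[i] for row in scores) / len(scores)  # item difficulty
    pq += p * (1 - p)                                # p * q for this item

total_var = variance([sum(row) for row in scores])
kr20 = (k / (k - 1)) * (1 - pq / total_var)
print(round(kr20, 3))
```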
Validity
Accuracy in terms of the extent to which the test measures what it was designed to measure
* 3 C’s : content, construct, and criterion-related
Content validity
*assesses how well a test samples a particular content area
* Built into the test
* is of concern when test is designed to measure a content or behavior domain
Construct validity
*assesses the extent to which a test measures the theoretical construct it is intended to measure
Two subtypes:
*convergent validity
*discriminant validity
Criterion validity (rxy)
*assesses how well a test score can be used to predict or estimate criterion outcome
*scores range from -1.0 to +1.0
*Square rxy to get the coefficient of determination: the proportion of variance in the criterion accounted for by the test
Two subtypes:
*concurrent validity
*predictive validity
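The validity coefficient rxy is simply the Pearson correlation between test scores and criterion outcomes, and rxy² is the shared variance. A sketch with invented predictor and criterion data:

```python
import math

# Criterion-related validity coefficient r_xy: Pearson correlation between
# test scores (predictor) and criterion outcomes (illustrative data).
# Squaring r_xy gives the proportion of criterion variance explained.
test_scores = [70, 75, 80, 85, 90, 95]
criterion   = [2.1, 2.4, 2.9, 3.0, 3.4, 3.6]  # e.g., performance ratings

n = len(test_scores)
mx = sum(test_scores) / n
my = sum(criterion) / n

cov = sum((x - mx) * (y - my) for x, y in zip(test_scores, criterion))
sx = math.sqrt(sum((x - mx) ** 2 for x in test_scores))
sy = math.sqrt(sum((y - my) ** 2 for y in criterion))

r_xy = cov / (sx * sy)
print(round(r_xy, 3), round(r_xy ** 2, 3))
```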
Taylor-Russell tables
Related to hiring decisions
*complete set of tables that provide a measure of incremental validity
*indicate how much better an organization's hiring decisions would be if it added a predictor test, given the test's validity coefficient, the base rate, and the selection ratio
Incremental validity
Amount of improvement in success rate that results from using a predictor test
Convergent validity
*High correlations with other measures of the same or related constructs
*Want high convergent validity
Standard scores
Indicate an examinee’s relative standing in a comparison group
Z score distribution
*Mean of 0, standard deviation of 1
T score
*Mean of 50, standard deviation of 10 (T = 10z + 50)
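Converting a raw score to a z score and then to a T score takes one line each (the raw score, mean, and SD below are invented):

```python
# Standard scores: z = (X - mean) / SD puts scores on a mean-0, SD-1 scale;
# T = 10z + 50 puts them on a mean-50, SD-10 scale (illustrative values)
raw = 65
mean = 50
sd = 10

z = (raw - mean) / sd
t = 10 * z + 50
print(z, t)  # 1.5 65.0
```

A score 1.5 SDs above the mean is z = 1.5, which corresponds to T = 65.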