Test Construction By Flashcards

(22 cards)

1
Q

Item difficulty

A

p = (number of examinees passing the item) / (total number of examinees)
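
A minimal sketch of the calculation above, assuming each examinee's response to the item is coded 1 (pass) or 0 (fail). The function name and the sample data are hypothetical, for illustration only.

```python
def item_difficulty(responses):
    """Return p, the proportion of examinees who passed the item.

    `responses` holds one value per examinee: 1 = passed, 0 = failed.
    """
    if not responses:
        raise ValueError("need at least one examinee")
    return sum(responses) / len(responses)


# Hypothetical data: 10 examinees, 7 of whom passed the item.
responses = [1, 1, 0, 1, 1, 0, 1, 1, 0, 1]
print(item_difficulty(responses))
```

A higher p means more examinees passed, so the item is easier.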

2
Q

Item discrimination

A

Refers to the extent to which an item differentiates between examinees who obtain low and high scores on the test or on an external criterion
* Symbolized by the letter “D”
* Ranges from -1 to +1
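
One common way to compute D (an assumption here, since the card does not give a formula) is to subtract the proportion of low scorers who passed the item from the proportion of high scorers who passed it. The group data below are hypothetical.

```python
def item_discrimination(upper_group, lower_group):
    """Return D = p(upper) - p(lower) for one item.

    Each argument is a list of 1/0 item responses for examinees in the
    high-scoring and low-scoring groups, respectively.
    """
    p_upper = sum(upper_group) / len(upper_group)
    p_lower = sum(lower_group) / len(lower_group)
    return p_upper - p_lower


# Hypothetical data: 4 of 5 high scorers and 1 of 5 low scorers passed.
upper = [1, 1, 1, 1, 0]
lower = [1, 0, 0, 0, 0]
print(round(item_discrimination(upper, lower), 2))
```

D is +1 when everyone in the upper group and no one in the lower group passes, and -1 in the reverse case.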

3
Q

Internal consistency reliability

A

* Indicates the degree of consistency across different test items
* Useful for estimating the reliability of tests that measure characteristics that fluctuate over time or are susceptible to memory or practice effects

4
Q

Cronbach’s coefficient alpha

A

Mean of all possible split-half correlation coefficients
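
In practice alpha is usually computed from item and total-score variances rather than by averaging split halves. The sketch below uses that variance-based formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), which is not stated on the card. The data are hypothetical.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Return coefficient alpha for a score matrix.

    `scores` is a list of rows, one per examinee; each row lists that
    examinee's score on every item.
    """
    k = len(scores[0])                      # number of items
    items = list(zip(*scores))              # one tuple of scores per item
    item_var_sum = sum(variance(item) for item in items)
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - item_var_sum / total_var)


# Hypothetical data: 5 examinees, 3 items scored 1 to 5.
scores = [[3, 4, 3], [2, 2, 3], [5, 4, 5], [1, 2, 1], [4, 5, 4]]
print(round(cronbach_alpha(scores), 3))
```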

5
Q

Inter-rater reliability

A

Used for measures that are subjectively scored (e.g., essays and projective tests)
* Commonly estimated with the kappa statistic
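
A minimal sketch of the kappa statistic for two raters, assuming Cohen's kappa (the card does not say which variant): kappa = (observed agreement - chance agreement) / (1 - chance agreement). The ratings are hypothetical.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Return Cohen's kappa for two raters' category assignments.

    Undefined (division by zero) if chance agreement equals 1, i.e. both
    raters put every case in the same single category.
    """
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    # Chance agreement: product of the raters' marginal proportions,
    # summed over categories.
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    return (observed - expected) / (1 - expected)


# Hypothetical ratings of 8 essays as "pass" or "fail".
rater_a = ["pass", "pass", "fail", "pass", "fail", "fail", "pass", "pass"]
rater_b = ["pass", "fail", "fail", "pass", "fail", "pass", "pass", "pass"]
print(round(cohens_kappa(rater_a, rater_b), 3))
```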

6
Q

Test-retest reliability

A

* Involves administering the same test to the same examinees on two occasions and correlating the two sets of scores

* Appropriate for determining the reliability of tests designed to measure attributes that are relatively stable over time and that are not affected by repeated measurement
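
The correlation step can be sketched as a plain Pearson correlation between the two administrations. The function name and the scores below are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Return the Pearson correlation between two equal-length score lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cross = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cross / sqrt(ss_x * ss_y)


# Hypothetical scores for the same 6 examinees on two occasions.
time_1 = [12, 15, 9, 20, 17, 11]
time_2 = [13, 14, 10, 19, 18, 10]
print(round(pearson_r(time_1, time_2), 3))
```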

7
Q

Factors that affect the reliability coefficient

A
  • Test length: longer tests tend to be more reliable
  • Range of scores: a wide, unrestricted range of scores (a heterogeneous group of examinees) yields higher reliability
  • Guessing: the harder it is to guess the right answer, the higher the reliability
  • Homogeneity of content: more similar item content means greater reliability
8
Q

standard error of measurement (SEM)

A

An index of the amount of error that can be expected in obtained scores due to the unreliability of the test.
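
The card gives no formula; the standard one is SEM = SD * sqrt(1 - reliability), where SD is the standard deviation of test scores. A minimal sketch with hypothetical values:

```python
from math import sqrt


def standard_error_of_measurement(sd, reliability):
    """Return SEM = SD * sqrt(1 - reliability coefficient)."""
    if not 0 <= reliability <= 1:
        raise ValueError("reliability must be between 0 and 1")
    return sd * sqrt(1 - reliability)


# Hypothetical test: SD of 15 and a reliability coefficient of .91.
print(round(standard_error_of_measurement(15, 0.91), 2))
```

A perfectly reliable test (coefficient of 1.0) has an SEM of 0; as reliability drops, the SEM approaches the test's SD.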

9
Q

Confidence interval

A

Helps a test user estimate the range within which an examinee’s true score is likely to fall, given his or her obtained score. This range is calculated using the standard error of measurement.
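
A minimal sketch of the interval, assuming the usual form obtained score ± z * SEM (about ±1 SEM for 68%, ±1.96 SEM for 95%). The function name and the values are hypothetical.

```python
def confidence_interval(obtained_score, sem, z=1.96):
    """Return (lower, upper) bounds: obtained score plus/minus z * SEM."""
    margin = z * sem
    return (obtained_score - margin, obtained_score + margin)


# Hypothetical examinee: obtained score of 100 on a test with SEM = 4.5.
lower, upper = confidence_interval(100, 4.5)
print(round(lower, 2), round(upper, 2))
```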

10
Q

standard error of measurement (SEM)

A

an index of the amount of error that can be expected in obtained scores due to the unreliability of the test.

11
Q

Classical test theory.

A

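An obtained score is a combination of true score and random measurement error (X = T + E), so variability in obtained scores reflects both true score variability and error variability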

12
Q

Kuder-Richardson formula 20 (KR-20)

A

Serves the same purpose as Cronbach’s coefficient alpha, but is used when test items are scored dichotomously (e.g., right or wrong)
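
A minimal sketch using the standard KR-20 formula, which is not stated on the card: KR-20 = k/(k-1) * (1 - sum of p*q / variance of total scores), where p is each item's pass proportion and q = 1 - p. Population variance is used for the totals so that it matches p*q. The data are hypothetical.

```python
from statistics import pvariance


def kr20(scores):
    """Return KR-20 for a matrix of dichotomous (1/0) item scores.

    `scores` is a list of rows, one per examinee.
    """
    k = len(scores[0])                      # number of items
    n = len(scores)                         # number of examinees
    pq_sum = 0.0
    for item in zip(*scores):
        p = sum(item) / n                   # proportion passing the item
        pq_sum += p * (1 - p)
    total_var = pvariance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - pq_sum / total_var)


# Hypothetical data: 4 examinees, 3 right/wrong items.
scores = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
print(kr20(scores))
```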

13
Q

Validity

A

Accuracy: the extent to which the test measures what it was designed to measure
* 3 C’s: content, construct, and criterion-related

14
Q

Content validity

A

* Assesses how well a test samples a particular content area
* Built into the test
* Of concern when a test is designed to measure a content or behavior domain

15
Q

Construct validity

A
  • Assesses how well a test measures a hypothetical construct or trait
  • Relevant for tests designed to measure a hypothetical trait or construct
  • Intelligence, mechanical aptitude, self-esteem, and neuroticism are all constructs.

Two subtypes:
* Convergent validity
* Discriminant validity

16
Q

Criterion validity (rxy)

A

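* Assesses how well a test score can be used to predict or estimate a criterion outcome
* The validity coefficient ranges from -1.0 to +1.0
* Square the coefficient to get the proportion of variance in the criterion that is accounted for by the predictor

Two subtypes:
* Concurrent validity
* Predictive validity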
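
The squaring step can be sketched as follows. The function name and the coefficient are hypothetical; the result is the proportion of criterion variance shared with the predictor.

```python
def shared_variance(validity_coefficient):
    """Return the squared validity coefficient (proportion of shared variance)."""
    if not -1.0 <= validity_coefficient <= 1.0:
        raise ValueError("a correlation must lie between -1.0 and +1.0")
    return validity_coefficient ** 2


# Hypothetical validity coefficient of .50.
print(shared_variance(0.5))
```

Note that the sign is lost when squaring: coefficients of -.50 and +.50 both account for 25% of the variance.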

17
Q

Taylor-Russell Tables

A

Related to hiring decisions
* A complete set of tables that provide a measure of incremental validity
* Tell us how much better different companies would be doing in their hiring decisions if they added a predictor test

  • Base rate: the rate of successfully hiring employees without using a predictor test. It can be expressed in any form
  • A predictor test is most useful when the company has a moderate base rate (this optimizes incremental validity)
  • Selection ratio: every company has one; it is the number of openings divided by the number of people who apply
  • A low selection ratio means higher incremental validity
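
The selection ratio described above can be sketched as a simple division. The function name and the figures are hypothetical, and no actual Taylor-Russell table values are reproduced here.

```python
def selection_ratio(openings, applicants):
    """Return the number of openings divided by the number of applicants."""
    if applicants <= 0:
        raise ValueError("need at least one applicant")
    return openings / applicants


# Hypothetical company: 5 openings, 50 applicants.
print(selection_ratio(5, 50))
```

With few openings relative to applicants the ratio is low, so the company can be more selective.
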
18
Q

Incremental validity

A

Amount of improvement in success rate that results from using a predictor test
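
One way to express this improvement, assumed here because the card gives no formula, is the success rate with the predictor test minus the base rate. The function name and the rates are hypothetical.

```python
def incremental_validity(success_rate_with_test, base_rate):
    """Return the gain in success rate over hiring without the predictor."""
    return success_rate_with_test - base_rate


# Hypothetical rates: 50% of hires succeed without the test, 75% with it.
print(incremental_validity(0.75, 0.50))
```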

19
Q

Convergent Validity

A

High convergent validity is desirable: the test should correlate highly with other measures of the same construct

20
Q

Standard scores

A

Indicate an examinee’s relative standing in a comparison group

21
Q

Z score distribution

A
  • mean of 0
  • SD of 1
22
Q

T score

A
  • mean of 50
  • SD of 10
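
A minimal sketch of converting a raw score to these standard scores, using z = (raw - mean) / SD and T = 50 + 10z. The function names and the raw-score distribution are hypothetical.

```python
def z_score(raw, mean, sd):
    """Return the z score: distance from the mean in SD units."""
    return (raw - mean) / sd


def t_score(z):
    """Convert a z score to a T score (mean 50, SD 10)."""
    return 50 + 10 * z


# Hypothetical raw score of 115 in a distribution with mean 100, SD 15.
z = z_score(115, 100, 15)
print(z, t_score(z))
```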