Positive hit rate
percent of positives on the predictor who were actually successful on the criterion
true positives / total number positives
To ascertain if the test you have developed is valid as a screening test for determining whether a person has an anxiety or affective disorder, you would be most interested in evaluating the test’s
Concurrent validity
To increase false negatives:
raise the predictor cutoff and lower the criterion cutoff
Standard error of estimate
estimate the range within which an examinee’s true criterion score is likely to fall given their predicted score on the criterion.
what is the mean and SD of a t-score
mean of 50, SD of 10
item difficulty index
ranges from 0 (no one answers the item correctly) to 1.0 (everyone in a sample answers the item correctly)
.x = x% answered it right
correction for attenuation formula
used to estimate the predictor’s validity coefficient if the predictor and/or criterion were perfectly reliable
kappa statistic
estimates inter-rater reliability removing the effects of chance agreement
distribution of percentile ranks
EVENLY distributed
distribution “curve” looks like a rectangle
Assuming no constraints in terms of time, money, or other resources, what is the best way to demonstrate that a test has adequate reliability?
equivalent (alternate) forms
Cronbach’s alpha
used to determine internal consistency reliability
Calculates average reliability with all possible test splits
underestimates reliability, but less so than regular split-half
Criterion contamination
when a rater’s knowledge of a person’s predictor performance biases how he/she rates the person on the criterion
KR-20
used to determine internal consistency reliability when items are scored dichotomously
To maximize the inter-rater reliability of a behavioral observation scale, you should make sure that coding categories:
are mutually exclusive and well-operationally-defined
standard error of estimate
standard deviation of the criterion score times the square root of one minus the validity coefficient squared
what factors affect the size of the reliability coefficient?
test length (longer test, larger coeff), range of scores (larger range, larger coeff), guessing (higher probability of guessing correct, smaller coeff)
After reviewing the data collected on a new selection test during the course of a criterion-related validity study on new hires, a psychologist decides to lower the selection test cutoff score. Apparently the psychologist is hoping to do which of the following?
Decrease the number of false negatives
When a test user uses a correction for guessing formula that involves subtracting points from each examinee’s scores, the resulting distribution of scores will have a ____________________ than the original (non-corrected) distribution.
lower mean and larger standard deviation