Reliability Flashcards

(60 cards)

1
Q

refers to the consistency of scores obtained by the same person when re-examined with the same test on diffeerent ocasssions or with different sets of equivalent items or under other variable examinig condition

A

Reliability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

true or false
measurement error is common in all fields of science

A

true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

true or false
test that are relatively free ofmeasuremen error are considered reliable while test that contains relatively large measurement of error are considered unreliable

A

true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

identification
the difference between observed score and the true score results

A

measurement error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

identification
the standard deviation of the distribution of errors for each repeated application of the same test on an individual

A

standard error of measurement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

true or false
although it is impossible to eliminate all measurement error, test developers do strive to maximize psychometric nuisance through careful attention to the sources of measurement error

A

false. minimize

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

t/f
the greater the numberof items the higher the reliability

A

true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

identification
a problem in the use of limited number of items to represnt a larger and more complicated construct

A

domain sampling model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

3 sources of measurement error

A

item selection
test administration
test scoring

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

identify

one source of measurement error is the instrument itself

A

item selection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

identify what sources of measurement error and
true/false

general environmental conditions , momentary fluctuation of anxiety, motivation, attention and the examiner can contribute and be the source of measurement error in the process

A

true
test administration

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

identification
with the help of acomputer the item difficulty is calibrated to the mental ability of the test taker

A

item response theory

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

t/f
correlation coefficient (r) expresses the degree and magnitude of a linear relationship between two sets of scores obtained from the same person

A

true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

forms of reliability

A

test-retest
parallel forms
inter-rater reliability
split half reliability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

it is established by comparing the scores obtained from two successive measurements of the same individuals and calculating a correlation between the twosets of scores

A

test-retest reliability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

other term for test-retest reliability that measure the error associated with administering a test at two different times

A

time sampling reliability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

corresponds to the random fluctuations of performance from one test session to the other

A

error variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

you took an iq test today and you will take it again after exactly a year, if your scores are almost the same then the measure has a good ________? what kind of reliability is the example

A

test-retest reliability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

limitations of test-retest reliability

A

carryover effect
practice effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

occurs when the first testing session influences the results of the second session and this can affect the test-retest reliability of a psychological measure

A

carryover effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

a type of carryover effect wherein the scores on the second test administration are higher than they were on first

A

practice effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

t/f sometimes a poor test-retest correlation do not mean that the test is unreliable. it might mean that the variable under the study has changed

A

true

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

It is established when at least two different versions of the test yield almost
the same scores.

A

parallel form reliability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

other term for parallel form reliability that knowns to compares two equivalent forms of a test that measure the same attribute to
make sure that the items indeed assess a specific characteristic.

A

item sampling or alternate forms reliability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
The Purdue Non-Language Test (PNLT) has Forms A and B and both yield slightly identical scores of the test taker is an example of what reliability
parallel forms reliability
26
Tests should contain the same number of items and the items should be expresses in the same form and should cover the same type of content is an example of what reliability
parallel forms
27
true or false The error of variance in parallel forms represents fluctuations in performance from one set of items to another, but not fluctuations over time.
true
28
true or false The range and level of difficulty of the items in parallel forma of reliability should be equal.
true
29
It is the degree of agreement between two observers who simultaneously record measurements of the behaviors.
inter- rater reliability
30
Two psychologists observe the aggressive behavior of elementary school children. If their individual records of the construct are almost the same, then the measure has a good ______reliability.
inter rater reliability
31
Two parents evaluated the ADHD symptoms of their child. If they both yield identical ratings, then the measure has good __________ reliability
inter-rater
32
what statistic inter-rater reliability use in order to assess the level of agreement among several raters using nominal scales.
kappa statistics
33
It is obtained by splitting the items on a questionnaire or test in half, computing a separate score for each half, and then calculating the degree of consistency between the two scores for a group of participants.
split-half reliability
34
This model of reliability measures the internal consistency of the test which is the degree to which each test item measures the same construct. It is simply the intercorrelations among the items.
split-half
35
t/f The test can be divided according to the odd and even numbers of the items (odd-even system).
true
36
what are the formula used to measure the internal consistency of a test in split half reliability
spearman-brown, kuder-richardson and cronbach’s alpha
37
A statistics which allows a test developer to estimate what correlations between the two halves would have been if each half had been the length of the whole test and have equal variances.
Spearman- Brown Formula
38
Spearman Brown Formula
rSB= 2r hh/ 1+ rhh
39
A statistics which allows the test developer to confirm that a test has substantial reliability in case the two halves of a test have unequal variances.
Cronbach’s coefficient alpha
40
The statistics used for calculating the reliability of a test in which the items are dichotomous or scored as 0 or 1.
Kuder-Richardson 20 (KR20) Formula
41
t/f For tests that has two forms, use parallel forms reliability.
true
42
true /false For tests that are designed to be administered to an individual more than once, use split-half reliability
false. test-retest
43
t/f For tests with continuum Likert scale, use Cronbach’s coefficient alpha.
true
44
t/f For tests which involve dichotomous items or forced choice items, use spearman-brown formula
False. KR20
45
t/f For tests with items carefully ordered according to difficulty, use inter-rater reliability
false. split-half
46
t/f For tests which involve some degree of subjective scoring, use test-retest reliability
false. inter-rater reliability
47
is a statistic that quantifies reliability, ranging from 0 (not at all reliable) to 1 (perfectly reliable).
reliability coeffecient
48
usually refers to a mistake of some sort that could have been prevented had a person been more conscientious, more skilled, or better informed.
error
49
refers to the inherent uncertainty associated with any measurement, even after care has been taken to minimize preventable mistakes
measurement error
50
consists of unpredictable fluctuations and inconsistencies of other variables in the measurement process
random error
51
refers to the proportion of the total variance attributed to true variance.
reliability
52
When the interval between testing is greater than six months, the estimate of test-retest reliability is often referred to as the ________
coefficient of stability
53
The degree of the relationship between various forms of a test can be evaluated by means of an alternate-forms or parallel-forms coefficient of reliability, which is often termed the
coefficient of equivalence
54
refers to an estimate of the extent to which item sampling and other errors have affected test scores on versions of the same test when, for each form of the test, the means and variances of observed test scores are equal.
parallel forms reliability
55
are simply different versions of a test that have been constructed so as to be parallel. Although they do not meet the requirements for the legitimate designation “parallel,” they are typically designed to be equivalent with respect to variables such as content and level of difficulty.
alternate forms
56
refers to an estimate of the extent to which these different forms of the same test have been affected by item sampling error, or other error.
alternate forms reliability
57
refers to the degree of correlation among all the items on a scale.
inter-item consistency
58
seek to estimate the extent to which specific sources of variation under defined conditions are contributing to the test score.
domain sampling theory
59
tool used to estimate or infer the extent to which an observed score deviates from a true score.
standard error of measurement
60
t/f standard error of a score and denoted by the symbol σmeas,
true