Test Construction Flashcards by c s

Items with moderate difficulty levels are typically retained in classical test theory because

an item difficulty index (p) of .5 = moderate difficulty
increases test score variability
helps ensure scores are normally distributed
provides maximum discrimination between examinees
maximizes the test’s reliability
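
One way to see why p = .5 is favored: for a dichotomously scored item, the item variance is p(1 - p), which is largest at p = .5. The sketch below assumes 0/1 item scores; the function names and the data are illustrative and not from the deck.

```python
def item_difficulty(responses):
    """Item difficulty index p: proportion of examinees answering correctly (0/1 scores)."""
    return sum(responses) / len(responses)


def item_variance(p):
    """Variance of a dichotomously scored item with difficulty p."""
    return p * (1 - p)


# Made-up responses from 8 examinees: 4 correct, 4 incorrect.
responses = [1, 0, 1, 1, 0, 0, 1, 0]
p = item_difficulty(responses)

# Compare the variance at p with the variance of an easier and a harder item.
for candidate in (0.2, p, 0.9):
    print(candidate, item_variance(candidate))
```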

Item discrimination index measures

the extent to which a test item discriminates between examinees who obtain high versus low scores on the entire test

ranges from -1 to 1
.35 or above is acceptable
items with moderate difficulty are most likely to discriminate between high and low scorers
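
A minimal sketch of one common way to compute the index: D is the proportion correct in the high-scoring group minus the proportion correct in the low-scoring group. How the two groups are formed is an assumption here (they are simply passed in), and the function name is illustrative.

```python
def discrimination_index(upper_group, lower_group):
    """D = p(upper) - p(lower) for one item; each group is a list of 0/1 item scores."""
    p_upper = sum(upper_group) / len(upper_group)
    p_lower = sum(lower_group) / len(lower_group)
    return p_upper - p_lower


# Made-up item scores for examinees with high vs. low total test scores.
upper = [1, 1, 1, 0]
lower = [1, 0, 0, 0]
d = discrimination_index(upper, lower)
print(d)
```

A negative D would mean that low scorers answered the item correctly more often than high scorers.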

Benefits of Item Response Theory

item characteristics (parameters) are sample invariant (same across different samples)
possible to equate scores from different sets of items/tests
easier to develop computer adaptive tests

Which theory of test construction uses an item characteristic curve?

item response theory

item characteristic curves provide information on ___

difficulty, discrimination, probability of guessing correctly

According to an item characteristic curve, an item’s ability to discriminate between high and low achievers is represented by the ____

slope of the curve

- the steeper the slope, the greater the discrimination

According to an item characteristic curve, the probability of guessing correctly is indicated by _____

the point at which the ICC intercepts the vertical axis

According to an item characteristic curve, an item’s difficulty level is indicated by ____

the ability level at which 50% of examinees in the tryout sample provided a correct response
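
The three ICC properties on these cards (slope, vertical-axis intercept, difficulty) can be illustrated with a three-parameter logistic curve. The deck does not name a specific model, so the 3PL form and the parameter names a (discrimination), b (difficulty), and c (guessing) are assumptions for this sketch.

```python
import math


def icc_3pl(theta, a, b, c=0.0):
    """Probability of a correct response at ability theta under an assumed 3PL model.

    a: discrimination (slope), b: difficulty (location), c: guessing (lower asymptote).
    """
    return c + (1 - c) / (1 + math.exp(-a * (theta - b)))


# With no guessing (c = 0), the probability at theta == b is 0.5.
print(icc_3pl(0.0, a=1.0, b=0.0))

# At very low ability the curve approaches the guessing parameter c.
print(icc_3pl(-50.0, a=1.0, b=0.0, c=0.2))

# A larger a gives a bigger change in probability across the same ability range.
for a in (1.0, 2.0):
    print(a, icc_3pl(0.5, a, 0.0) - icc_3pl(-0.5, a, 0.0))
```

Note that when c is greater than 0 in this model, the probability at theta = b is (1 + c) / 2 rather than exactly .5.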

According to classical test theory, an examinee’s obtained test score (X) is composed of ____ and ____

their true score (T) and an error component (E)

A reliability coefficient of .84 indicates

that 84% of variability in scores is due to true score differences among examinees, while the remaining 16% is due to measurement error.
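
This interpretation follows from treating reliability as the ratio of true score variance to observed score variance. The sketch below assumes true scores and errors are uncorrelated, so observed variance is the sum of the two; the function name and the variance values are illustrative.

```python
def reliability(true_variance, error_variance):
    """Reliability as true score variance divided by observed score variance.

    Assumes observed variance = true variance + error variance (uncorrelated T and E).
    """
    observed_variance = true_variance + error_variance
    return true_variance / observed_variance


# Made-up variances chosen to match the card: 84 parts true score, 16 parts error.
r = reliability(true_variance=84.0, error_variance=16.0)
print(r)
```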

Kuder-Richardson Formula 20 (KR-20)

a variation of coefficient alpha for when test items are scored dichotomously (right/wrong)
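
A minimal sketch of the KR-20 calculation, assuming a complete matrix of 0/1 item scores and using the population variance of total scores (some texts use the sample variance instead). The function name and data are illustrative.

```python
def kr20(scores):
    """KR-20 for a list of examinee rows, each a list of 0/1 item scores."""
    n = len(scores)        # number of examinees
    k = len(scores[0])     # number of items

    # Population variance of total test scores.
    totals = [sum(row) for row in scores]
    mean_total = sum(totals) / n
    var_total = sum((t - mean_total) ** 2 for t in totals) / n

    # Sum of item variances p * q, where q = 1 - p.
    sum_pq = 0.0
    for j in range(k):
        p = sum(row[j] for row in scores) / n
        sum_pq += p * (1 - p)

    return (k / (k - 1)) * (1 - sum_pq / var_total)


# Made-up data: 4 examinees, 3 items.
data = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]
print(kr20(data))
```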

internal consistency reliability is not appropriate for

speeded tests

the reliability coefficient is maximized when the range of scores is

unrestricted

standard error of measurement

used to construct a confidence interval around a measured (obtained) score.
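
As a sketch, the standard error of measurement is commonly computed as the test's standard deviation times the square root of one minus its reliability coefficient. The function names, the example values, and the default z of 1.96 (for an approximate 95% interval) are assumptions for illustration.

```python
import math


def standard_error_of_measurement(sd, reliability):
    """SEM = SD of test scores * sqrt(1 - reliability coefficient)."""
    return sd * math.sqrt(1 - reliability)


def score_confidence_interval(obtained_score, sem, z=1.96):
    """Confidence interval around an obtained score: score +/- z * SEM."""
    return (obtained_score - z * sem, obtained_score + z * sem)


# Made-up example: SD = 15, reliability = .84, obtained score = 100.
sem = standard_error_of_measurement(15.0, 0.84)
low, high = score_confidence_interval(100.0, sem)
print(sem, low, high)
```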

Content Validity

the test will be used to obtain information about an examinee’s familiarity with a particular content or behavior domain. Content validity is determined by subject-matter experts.

Construct validity

the test will be used to determine the extent to which an examinee possesses a particular hypothetical trait

Criterion-related Validity

the test will be used to estimate or predict an examinee’s standing or performance on an external criterion

Face validity

whether or not a test looks like it measures what it is intended to measure

convergent and discriminant validity are used to assess ___ validity

construct

a squared factor loading provides a measure of

shared variability

when factors are orthogonal, a test’s communality can be calculated by ___

squaring and adding the test’s factor loadings
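
A minimal sketch of that calculation, assuming orthogonal factors; the loadings are made-up values and the function name is illustrative.

```python
def communality(loadings):
    """Sum of squared factor loadings for one test (valid when factors are orthogonal)."""
    return sum(loading ** 2 for loading in loadings)


# Made-up loadings of one test on two orthogonal factors.
# Each squared loading is the proportion of the test's variance shared with that factor.
h2 = communality([0.6, 0.4])
print(h2)
```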

Two types of criterion-related validity

concurrent and predictive

standard error of the estimate

is used to construct a confidence interval around a predicted (estimated) criterion score.
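
As a sketch, the standard error of the estimate is commonly computed as the criterion's standard deviation times the square root of one minus the squared validity coefficient. The function names, example values, and the default z of 1.96 are assumptions for illustration.

```python
import math


def standard_error_of_estimate(sd_criterion, validity_coefficient):
    """SEE = SD of criterion scores * sqrt(1 - r_xy ** 2)."""
    return sd_criterion * math.sqrt(1 - validity_coefficient ** 2)


def predicted_score_interval(predicted_score, see, z=1.96):
    """Confidence interval around a predicted criterion score: prediction +/- z * SEE."""
    return (predicted_score - z * see, predicted_score + z * see)


# Made-up example: criterion SD = 10, validity coefficient = .60, predicted score = 50.
see = standard_error_of_estimate(10.0, 0.6)
low, high = predicted_score_interval(50.0, see)
print(see, low, high)
```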

Base rate

(true positives + false negatives) / total number of people

Sensitivity

percent of people in the validation sample who have the disorder and were accurately identified by the predictor as having the disorder. true positives / (true positives + false negatives)

specificity

percent of people in the validation sample who do not have the disorder and were accurately identified by the predictor as not having the disorder. true negatives / (true negatives + false positives)
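
The base rate, sensitivity, and specificity definitions on these cards can be sketched from the four cells of a classification table. The counts are made up and the function names are illustrative.

```python
def base_rate(tp, fn, tn, fp):
    """Proportion of the sample that actually has the disorder."""
    return (tp + fn) / (tp + fn + tn + fp)


def sensitivity(tp, fn):
    """Proportion of people with the disorder whom the predictor correctly identified."""
    return tp / (tp + fn)


def specificity(tn, fp):
    """Proportion of people without the disorder whom the predictor correctly identified."""
    return tn / (tn + fp)


# Made-up validation sample of 200 people.
tp, fn, tn, fp = 40, 10, 120, 30
print(base_rate(tp, fn, tn, fp), sensitivity(tp, fn), specificity(tn, fp))
```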

(26 cards)