Test bias most often results from: What causes it?
A. A test being normed on only one demographic group
Lack of representative norming populations can lead to biased results.
An aptitude test is to an achievement test as: What is the relationship?
B. Potential is to what has been learned
Aptitude measures potential; achievement measures current knowledge or skill mastery.
In a projective test, the client is presented with: What type of stimuli?
C. Neutral, ambiguous stimuli
Projective measures use ambiguous stimuli to encourage projection of inner thoughts and feelings.
Group IQ tests like the Otis-Lennon are popular in schools because they:
B. Are quicker to administer
Group tests are efficient, though not necessarily more accurate.
In a predictive validity study, the correlation between high school GPA and first-year college GPA is found to be .48. Which of the following is the most accurate interpretation?
B. 23% of the variance in college GPA is accounted for by high school GPA
.48^2 = .2304 → ~23% shared variance.
A researcher reports that two measures of counselor burnout share 81% of their variance. This finding would most likely result from which correlation coefficient?
B. .90
√.81 = .90.
An English exam has a mean score of 72, a standard deviation of 9, and a standard error of measurement of 3. If a student scores 78, what range of scores would be expected on a retest about 95% of the time?
A. 72 - 84
Solution:
For 95% of the time, we use ±2 SEM: 78±(2×3)=78±6=(72,84)78 \pm (2 \times 3) = 78 \pm 6 = (72, 84)78±(2×3)=78±6=(72,84)
Two different intelligence tests are given to the same group of students. The correlation between the two tests is .70. This means:
B. 49% of the variance is shared between the two tests
Shared variance = .70^2 = .49 → 49% of variance in common.
Ethan and Chloe took a reasoning test (M = 100, SD = 20). Ethan’s z = -0.5, Chloe’s T = 60. On the test:
B. Ethan scored 90, Chloe scored 120
Ethan: 100 - (0.5 × 20) = 90
Chloe: T = 60 → z = +1.0 → 100 + 20 = 120
A physics test has a mean score of 60 and a standard deviation of 10. The standard error of measurement is 3. If a student scores 70, what range of scores would be expected on a retest about two-thirds of the time?
B. 67 - 73
Solution:
Since the question asks for about two-thirds of the time (~68%), we use ±1 SEM: 70±3=(67,73)70 \pm 3 = (67, 73)70±3=(67,73)
Liam and Ava took the same cognitive reasoning test. The test has a mean of 100 and a standard deviation of 15. Liam’s score corresponds to a z-score of +1.0, while Ava’s score corresponds to a T-score of 40. On the test:
A. Liam scored 115 and Ava scored 85.
Liam: 100 + (1.0 × 15) = 115
Ava: T = 40 → z = -1.0 → 100 + (-1 × 15) = 85
A counselor avoids relying solely on self-report questionnaires because: Why?
A. Clients often give inaccurate answers
Self-reports can be influenced by bias or inaccurate self-perception.
Ben and Zoe took an aptitude test (M = 50, SD = 10). Ben’s z = +1.5, Zoe’s T = 40. On the test:
Correct Answer: A. Ben scored 65, Zoe scored 40.
Ben: 50 + (1.5 × 10) = 65
Zoe: T = 40 → z = -1.0 → 50 - 10 = 40
The first widely used intelligence test was created to identify French schoolchildren needing academic help. Who developed it?
D. Binet and Simon
Alfred Binet and Théodore Simon developed the first practical IQ test in 1905.
A counselor is working with a nonprofit organization to select an assessment tool for adults seeking vocational rehabilitation. The agency needs a test that measures the clients’ maximum performance without being influenced by time constraints. Which type of test should the counselor recommend?
B. Power test
Power tests allow unlimited time and assess depth of ability or knowledge, unlike speed tests that measure how quickly tasks are completed.
A best practice when interpreting test scores is to: What is a best practice?
B. Avoid generalizing from a single score
Multiple data points provide a more accurate picture than a single score.
A counselor wants to assess the internal consistency of a true/false inventory without splitting it in half. Which statistic should she use?
C. Kuder-Richardson coefficient
The Kuder-Richardson formulas measure internal consistency for dichotomous items without splitting the test.
Anna and David took a standardized verbal test. The mean was 85, SD = 5. Anna’s z-score was 1.0, David’s T-score was 40. On the test:
A. Anna scored 90 and David scored 80.
Anna: 85 + (1 × 5) = 90
David: T = 40 → z = -1.0 → 85 - 5 = 80
A counselor compares scores from two alternate forms of an aptitude assessment and finds a correlation coefficient of .82. Which of the following best represents the proportion of shared variance between the two forms?
C. 67.24%
.82^2 = .6724 → 67.24% shared variance.
In Binet’s method, a task for 9-year-olds would be one that:
C. 50% of 9-year-olds answer correctly
Age-level tasks are those that 50% of children of that age can perform.
A student wants to access her educational records. Under FERPA, she has the right to do all of the following EXCEPT:
D. All of the above
FERPA gives students access to their own records and the right to request corrections.
A new counseling skills test has a reliability coefficient of .90. This means:
A. 90% of score variance is true variance.
Reliability coefficients reflect the proportion of true variance; .90 means 90% is accurate measurement.
The mean IQ score is 100. What is the standard deviation for the Wechsler and Stanford-Binet?
B. 15; 16
Wechsler’s SD is 15; Stanford-Binet’s is 16.
A client scores 110 on an IQ test with a standard deviation of 15. The SEM is 3. Which interpretation reflects both the reliability and practical meaning of this score?
B. The client’s true IQ likely falls between 107 and 113 about 68% of the time.
The SEM indicates that the true score likely falls within ±1 SEM about 68% of the time. Saying the score is exact (A) ignores measurement error; labeling as gifted (C) is inaccurate given the score; (D) is not necessary for interpretation.