Assessment & Testing Flashcards

Learn the principles of assessment and testing, including test construction, psychometrics, DSM-based diagnosis of mental disorders, and the ethical use of evaluation tools in counseling. (192 cards)

1
Q

Test bias most often results from: What causes it?

  • A. A test being normed on only one demographic group
  • B. Using projective measures
  • C. Scoring by inexperienced raters
  • D. Using too many multiple-choice items
A

A. A test being normed on only one demographic group

Lack of representative norming populations can lead to biased results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

An aptitude test is to an achievement test as: What is the relationship?

  • A. What has been learned is to potential
  • B. Potential is to what has been learned
  • C. Career interest is to career development
  • D. Current performance is to potential performance
A

B. Potential is to what has been learned

Aptitude measures potential; achievement measures current knowledge or skill mastery.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

In a projective test, the client is presented with: What type of stimuli?

  • A. Highly emotional stimuli
  • B. Reward-based prompts
  • C. Neutral, ambiguous stimuli
  • D. Only verbal instructions
A

C. Neutral, ambiguous stimuli

Projective measures use ambiguous stimuli to encourage projection of inner thoughts and feelings.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Group IQ tests like the Otis-Lennon are popular in schools because they:

  • A. Are more accurate than individual IQ tests
  • B. Are quicker to administer
  • C. Always have higher reliability
  • D. Replace individual testing entirely
A

B. Are quicker to administer

Group tests are efficient, though not necessarily more accurate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

In a predictive validity study, the correlation between high school GPA and first-year college GPA is found to be .48. Which of the following is the most accurate interpretation?

  • A. 48% of the variance in college GPA is accounted for by high school GPA.
  • B. 23% of the variance in college GPA is accounted for by high school GPA.
  • C. 77% of the variance in college GPA is explained by high school GPA.
  • D. The two measures are not significantly related.
A

B. 23% of the variance in college GPA is accounted for by high school GPA

.48^2 = .2304 → ~23% shared variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

A researcher reports that two measures of counselor burnout share 81% of their variance. This finding would most likely result from which correlation coefficient?

  • A. .81
  • B. .90
  • C. .95
  • D. .99
A

B. .90

√.81 = .90.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

An English exam has a mean score of 72, a standard deviation of 9, and a standard error of measurement of 3. If a student scores 78, what range of scores would be expected on a retest about 95% of the time?

  • A. 72 - 84
  • B. 75 - 81
  • C. 66 - 90
  • D. 69 - 87
A

A. 72 - 84

Solution:

For 95% of the time, we use ±2 SEM: 78±(2×3)=78±6=(72,84)78 \pm (2 \times 3) = 78 \pm 6 = (72, 84)78±(2×3)=78±6=(72,84)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Two different intelligence tests are given to the same group of students. The correlation between the two tests is .70. This means:

  • A. The tests measure exactly the same abilities.
  • B. 49% of the variance is shared between the two tests.
  • C. The reliability of the tests is 70%.
  • D. The tests are not related at all.
A

B. 49% of the variance is shared between the two tests

Shared variance = .70^2 = .49 → 49% of variance in common.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Ethan and Chloe took a reasoning test (M = 100, SD = 20). Ethan’s z = -0.5, Chloe’s T = 60. On the test:

  • A. Ethan scored 110, Chloe scored 120.
  • B. Ethan scored 90, Chloe scored 120.
  • C. Ethan scored 95, Chloe scored 120.
  • D. Ethan scored 90, Chloe scored 130.
A

B. Ethan scored 90, Chloe scored 120

Ethan: 100 - (0.5 × 20) = 90
Chloe: T = 60 → z = +1.0 → 100 + 20 = 120

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

A physics test has a mean score of 60 and a standard deviation of 10. The standard error of measurement is 3. If a student scores 70, what range of scores would be expected on a retest about two-thirds of the time?

  • A. 64 - 76
  • B. 67 - 73
  • C. 54 - 86
  • D. 57 - 83
A

B. 67 - 73

Solution:

Since the question asks for about two-thirds of the time (~68%), we use ±1 SEM: 70±3=(67,73)70 \pm 3 = (67, 73)70±3=(67,73)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Liam and Ava took the same cognitive reasoning test. The test has a mean of 100 and a standard deviation of 15. Liam’s score corresponds to a z-score of +1.0, while Ava’s score corresponds to a T-score of 40. On the test:

  • A. Liam scored 115 and Ava scored 85.
  • B. Liam scored 110 and Ava scored 90.
  • C. Liam scored 115 and Ava scored 90.
  • D. There is not enough data to make the calculation.
A

A. Liam scored 115 and Ava scored 85.

Liam: 100 + (1.0 × 15) = 115
Ava: T = 40 → z = -1.0 → 100 + (-1 × 15) = 85

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

A counselor avoids relying solely on self-report questionnaires because: Why?

  • A. Clients often give inaccurate answers
  • B. Ethics prohibit them
  • C. Clients need high IQ to complete them
  • D. They require excessive reading
A

A. Clients often give inaccurate answers

Self-reports can be influenced by bias or inaccurate self-perception.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Ben and Zoe took an aptitude test (M = 50, SD = 10). Ben’s z = +1.5, Zoe’s T = 40. On the test:

  • A. Ben scored 65, Zoe scored 40.
  • B. Ben scored 65, Zoe scored 45.
  • C. Ben scored 60, Zoe scored 45.
  • D. Ben scored 65, Zoe scored 55.
A

Correct Answer: A. Ben scored 65, Zoe scored 40.

Ben: 50 + (1.5 × 10) = 65
Zoe: T = 40 → z = -1.0 → 50 - 10 = 40

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

The first widely used intelligence test was created to identify French schoolchildren needing academic help. Who developed it?

  • A. Wechsler
  • B. Guilford
  • C. Galton
  • D. Binet and Simon
A

D. Binet and Simon

Alfred Binet and Théodore Simon developed the first practical IQ test in 1905.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

A counselor is working with a nonprofit organization to select an assessment tool for adults seeking vocational rehabilitation. The agency needs a test that measures the clients’ maximum performance without being influenced by time constraints. Which type of test should the counselor recommend?

  • A. Speed test
  • B. Power test
  • C. Ipsative test
  • D. Projective test
A

B. Power test

Power tests allow unlimited time and assess depth of ability or knowledge, unlike speed tests that measure how quickly tasks are completed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

A best practice when interpreting test scores is to: What is a best practice?

  • A. Always test personally
  • B. Avoid generalizing from a single score
  • C. Avoid culture-fair tests
  • D. Avoid scoring personally
A

B. Avoid generalizing from a single score

Multiple data points provide a more accurate picture than a single score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

A counselor wants to assess the internal consistency of a true/false inventory without splitting it in half. Which statistic should she use?

  • A. Split-half method
  • B. Test-retest reliability
  • C. Kuder-Richardson coefficient
  • D. Cross-validation
A

C. Kuder-Richardson coefficient

The Kuder-Richardson formulas measure internal consistency for dichotomous items without splitting the test.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Anna and David took a standardized verbal test. The mean was 85, SD = 5. Anna’s z-score was 1.0, David’s T-score was 40. On the test:

  • A. Anna scored 90 and David scored 80.
  • B. Anna scored 90 and David scored 85.
  • C. Anna scored 95 and David scored 80.
  • D. Anna scored 90 and David scored 75.
A

A. Anna scored 90 and David scored 80.

Anna: 85 + (1 × 5) = 90
David: T = 40 → z = -1.0 → 85 - 5 = 80

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

A counselor compares scores from two alternate forms of an aptitude assessment and finds a correlation coefficient of .82. Which of the following best represents the proportion of shared variance between the two forms?

  • A. 67%
  • B. 82%
  • C. 67.24%
  • D. 82.24%
A

C. 67.24%

.82^2 = .6724 → 67.24% shared variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

In Binet’s method, a task for 9-year-olds would be one that:

  • A. Only 10-year-olds could answer
  • B. Only 8-year-olds could answer
  • C. 50% of 9-year-olds answer correctly
  • D. 75% of 9-year-olds answer correctly
A

C. 50% of 9-year-olds answer correctly

Age-level tasks are those that 50% of children of that age can perform.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

A student wants to access her educational records. Under FERPA, she has the right to do all of the following EXCEPT:

  • A. Access her own records
  • B. View her child’s preschool IQ test
  • C. Request corrections to her file
  • D. All of the above
A

D. All of the above

FERPA gives students access to their own records and the right to request corrections.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

A new counseling skills test has a reliability coefficient of .90. This means:

  • A. 90% of score variance is true variance.
  • B. 90% of examinees passed.
  • C. It is valid for all uses.
  • D. It is unreliable.
A

A. 90% of score variance is true variance.

Reliability coefficients reflect the proportion of true variance; .90 means 90% is accurate measurement.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

The mean IQ score is 100. What is the standard deviation for the Wechsler and Stanford-Binet?

  • A. 100; 100
  • B. 15; 16
  • C. 20; 10
  • D. 1; 1
A

B. 15; 16

Wechsler’s SD is 15; Stanford-Binet’s is 16.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

A client scores 110 on an IQ test with a standard deviation of 15. The SEM is 3. Which interpretation reflects both the reliability and practical meaning of this score?

  • A. The client’s IQ is exactly 110 with no error.
  • B. The client’s true IQ likely falls between 107 and 113 about 68% of the time.
  • C. The client is in the gifted range.
  • D. The score cannot be interpreted without comparing to an achievement test.
A

B. The client’s true IQ likely falls between 107 and 113 about 68% of the time.

The SEM indicates that the true score likely falls within ±1 SEM about 68% of the time. Saying the score is exact (A) ignores measurement error; labeling as gifted (C) is inaccurate given the score; (D) is not necessary for interpretation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
An IQ test has an SEM of 3, and Tom scores 106. About 68% of the time, his score will fall between: **What range?** * A. 100 and 103 * B. 100 and 106 * C. 103 and 109 * D. Higher than someone with 139
C. 103 and 109 ## Footnote 68% confidence interval = score ±1 SEM → 106 ± 3 = 103–109.
26
A counselor is designing a readiness test for children entering kindergarten. **She decides to create items that increase in difficulty as the test progresses, so that later questions challenge higher-level skills. This design reflects which test format?** * A. Spiral test * B. Cyclical test * C. Ipsative measure * D. Projective measure
A. Spiral test ## Footnote Spiral tests arrange items in order of increasing difficulty, which is appropriate for gauging ability across a range of skill levels.
27
A school psychologist administers a reading comprehension exam where students can take as long as they need to finish. **Which type of test is this?** * A. Speed test * B. Power test * C. Recognition test * D. Forced-choice test
B. Power test ## Footnote Power tests measure ability without time constraints, focusing on item difficulty rather than speed.
28
Olivia and Lucas took an intelligence test (M = 110, SD = 10). Olivia’s z = -1.0, Lucas’s T = 70. **On the test:** * A. Olivia scored 100, Lucas scored 120 * B. Olivia scored 100, Lucas scored 130 * C. Olivia scored 105, Lucas scored 120 * D. Olivia scored 95, Lucas scored 120.
B. Olivia scored 100, Lucas scored 130 ## Footnote Corrected Answer: B
29
A history exam has a mean score of 85 and a standard deviation of 12. The standard error of measurement is 4. If a student scores 90, **what range of scores would be expected on a retest about 95% of the time?** * A. 82 - 98 * B. 86 - 94 * C. 78 - 102 * D. 74 – 106
A. 82–98 ## Footnote Solution: Since the question asks for about 95% of the time, we use ±2 SEM:
30
A client takes two different career inventories, both measuring similar constructs, but with different item sets. **The results are nearly identical. This provides the strongest evidence for which psychometric property?** * A. Face validity * B. Equivalent forms reliability * C. Predictive validity * D. Internal consistency
B. Equivalent forms reliability ## Footnote Equivalent (or alternate) forms reliability involves administering different versions of the same test to the same group and correlating the results. Similar results suggest strong reliability.
31
A counselor is reviewing two licensure exams: the NCE and a projective personality test. She notices the NCE uses multiple-choice questions with one correct answer per item. **Which classification best fits the NCE?** * A. Free-response format * B. Forced-choice format * C. Projective format * D. Ipsative format
B. Forced-choice format ## Footnote Forced-choice tests require the test-taker to choose from limited, fixed options. The NCE is multiple-choice, making it a forced-choice instrument, unlike projective or ipsative measures.
32
The 16PF personality inventory reflects the work of: **Who?** * A. Raymond Cattell * B. Carl Jung * C. James Cattell * D. Otto Rank
A. Raymond Cattell ## Footnote Raymond Cattell developed the 16 Personality Factor Questionnaire.
33
A counselor administers a test that consistently produces the same scores for clients, but the scores do not reflect the actual trait it is supposed to measure. **This test is:** * A. Reliable but not valid * B. Valid but not reliable * C. Neither reliable nor valid * D. Both reliable and valid
A. Reliable but not valid ## Footnote Reliability refers to consistency, while validity refers to accuracy. A test can be consistent but still measure the wrong construct.
34
A counselor explains to a student intern that “psychometric” refers to: **What?** * A. Medical tests of brain activity * B. Any form of mental testing or measurement * C. Cognitive skills unrelated to testing * D. A type of counseling interview
B. Any form of mental testing or measurement ## Footnote Psychometrics is the science of measuring mental functions and abilities through testing.
35
An older adult scores lower on an IQ test compared to younger adults. A researcher attributes this primarily to differences in educational and cultural experiences between generations. **This illustrates:** * A. Measurement error in test reliability * B. Cohort effects related to educational/cultural differences * C. The direct impact of aging on cognition * D. Problems with convergent validity in IQ testing
B. Cohort effects related to educational/cultural differences ## Footnote Cross-sectional studies can reflect generational (cohort) differences, not actual cognitive decline.
36
A counselor explains to a new trainee: “Reliability is about consistency. **Validity is about…”** * A. Truth * B. Inconsistency * C. Replicability * D. Accuracy
D. Accuracy ## Footnote Validity is the degree to which an instrument measures what it claims to measure—its accuracy.
37
Sophia and Jake took an exam (M = 70, SD = 10). Sophia’s z = 0, Jake’s T = 30. **On the test:** * A. Sophia scored 70, Jake scored 50. * B. Sophia scored 75, Jake scored 55. * C. Sophia scored 70, Jake scored 60. * D. Sophia scored 80, Jake scored 50.
A. Sophia scored 70, Jake scored 50. ## Footnote Sophia: z = 0 → score = 70 Jake: T = 30 → z = -2.0 → 70 - 20 = 50
38
To admit only the top 25% of applicants, set item difficulty at: **What level?** * A. 0.0 * B. .5 * C. 1.0 * D. .25
D. .25 ## Footnote Difficulty index of .25 means only 25% are expected to answer correctly.
39
Keeping a journal of irrational thoughts for therapy is best classified as: **What is it classified as?** * A. Unethical * B. Standardized testing * C. Informal assessment * D. Aptitude measure
C. Informal assessment ## Footnote Journaling is a nonstandardized, informal tool for self-reflection and counselor insight.
40
A counselor is selecting an assessment to identify emotional regulation skills in adolescents. **The tool must provide a structured way to estimate this attribute. Which process is she engaging in?** * A. Observational analysis * B. Appraisal * C. Informal feedback * D. Standard error calculation
B. Appraisal ## Footnote Appraisal is the systematic process of assessing or estimating attributes, which fits the counselor’s purpose.
41
A counselor chooses a test with excellent reliability but low validity for measuring career interests. **What is the most likely outcome?** * A. The test will measure career interests accurately every time. * B. The test will produce consistent results that do not measure true career interests. * C. The test will be useful for predicting job satisfaction. * D. The test will have strong face validity.
B. The test will produce consistent results that do not measure true career interests. ## Footnote Reliability means consistency, not accuracy. A reliable but invalid test will consistently measure the wrong construct, making it ineffective for assessing career interests.
42
If the correlation between two tests measuring problem-solving ability is 0.90, **what percentage of variance is explained by the correlation?** * A. 81% * B. 70% * C. 10% * D. 45%
A. 81% ## Footnote The percentage of variance explained is the square of the correlation coefficient (r = 0.90). Squaring 0.90 gives 0.81, or 81% of the variance is explained by the correlation. The remaining variance is 100% - 81% = 19%.
43
A contradictory trend in testing is: **What is a contradictory trend?** * A. More reliance on tests while others advocate less * B. Assigning all testing to social workers * C. Assigning all testing to psychiatrists * D. Banning computer scoring
A. More reliance on tests while others advocate less ## Footnote There is debate between expanding and reducing reliance on standardized testing.
44
A counselor is reviewing a test designed to measure coping resilience. She wants to ensure it measures only the intended construct and not unrelated traits. **Which type of validity is she evaluating?** * A. Face validity * B. Construct validity * C. Content validity * D. Predictive validity
B. Construct validity ## Footnote Construct validity assesses whether a test truly measures the theoretical trait it claims to measure, such as resilience.
45
Jim’s score is 56, mean = 64, SEM = 4. **If he retakes the test twice more, his scores will likely fall within which range 68% of the time?** * A. 60–68 * B. 61–67 * C. 52–60 * D. 57–63
C. 52–60 ## Footnote 56 ± 4 SEM = 52–60 for a 68% confidence interval.
46
A career test has a criterion-related validity coefficient of .80. **What percentage of variance in job performance is explained by test scores?** * A. 36% * B. 64% * C. 80% * D. 20%
B. 64% ## Footnote Explained variance = r² = (.80)² = .64 = 64%.
47
A job applicant completes an interest inventory but answers in ways they think will be viewed favorably. **What is this called?** * A. Standard error * B. Social desirability bias * C. Cultural bias * D. Acquiescence
B. Social desirability bias ## Footnote Social desirability bias occurs when respondents give socially acceptable rather than truthful answers.
48
If two forms of a math test have 81% of variance in common, **what is the correlation?** * A. .81 * B. .09 * C. .90 or –.90 * D. .91
C. .90 or –.90 ## Footnote r = √.81 = .90 (sign depends on direction of relationship).
49
A test has a reliability coefficient of 1.00. **This means:** * A. It is highly valid. * B. It has no measurement error. * C. It is culturally fair. * D. It has low predictive value.
B. It has no measurement error. ## Footnote A coefficient of 1.00 means perfect consistency and zero measurement error.
50
A client’s IQ score is two standard deviations above the mean on the WAIS-IV. **What does this indicate?** * A. The client scored above 97% of the population. * B. The client scored above 84% of the population. * C. The client scored above 68% of the population. * D. The client’s score is average.
A. The client scored above 97% of the population. ## Footnote Two standard deviations above the mean (130 on WAIS-IV) is roughly the 98th percentile, meaning about 97–98% scored lower.
51
A depression inventory looks like it measures sadness and loss of interest. However, deeper analysis shows it doesn’t predict actual depression diagnoses. **This demonstrates:** * A. Low face validity * B. Low predictive validity * C. Low construct validity * D. Low reliability
B. Low predictive validity ## Footnote Predictive validity concerns whether the test can forecast relevant outcomes, such as a clinical diagnosis.
52
A researcher believes intelligence is normally distributed like height and largely determined by genetics. **Which historical figure’s view does this reflect?** * A. Spearman * B. Guilford * C. Williamson * D. Galton
D. Galton ## Footnote Sir Francis Galton held that intelligence was hereditary and normally distributed, similar to physical traits.
53
An industrial-organizational psychologist gives a sales aptitude test to all employees in a call center. Chester has a Z-score of +1.0, Timothy scores 90, Beverly scores a T-score of 70, and Kim scores 70 (raw). The mean is 100, SD = 10, normal distribution. **Who performed worst and who performed best?** * A. Beverly; Timothy * B. Kim; Chester * C. Timothy; Beverly * D. Kim; Beverly
D. Kim; Beverly ## Footnote Beverly’s T-score of 70 is 2 SDs above the mean (best). Kim’s raw score of 70 is 3 SDs below the mean (worst).
54
A counseling researcher notes a correlation of –.50 between stress levels and life satisfaction. **The proportion of variance in common is:** * A. 50% * B. 25% * C. –50% * D. –25%
B. 25% ## Footnote The sign doesn’t matter for shared variance; (−.50)^2 = .25 → 25%.
55
What percentage of variance is **not** explained by the correlation between the two tests measuring problem-solving ability if the correlation is 0.70? * A. 30% * B. 49% * C. 51% * D. 70%
C. 51% ## Footnote The percentage of variance explained by the correlation is the square of the correlation coefficient (r = 0.70). Squaring 0.70 gives 0.49, which means 49% of the variance is explained. The remaining variance, or the percentage not explained, is 100% - 49% = 51%.
56
A researcher finds a correlation of .95 between two measures of self-esteem. **This indicates:** * A. 95% of the variance is shared. * B. 90% of the variance is shared. * C. 81% of the variance is shared. * D. 90.25% of the variance is shared.
D. 90.25% of the variance is shared ## Footnote .95^2 = .9025 → 90.25%.
57
If the correlation between two problem-solving ability tests is 0.65, **what percentage of the variance is unexplained?** * A. 65% * B. 58% * C. 35% * D. 60%
B. 58% ## Footnote The percentage of variance explained is the square of the correlation coefficient (r = 0.65). Squaring 0.65 gives 0.4225, or 42.25% of the variance is explained by the correlation. The remaining variance is 100% - 42.25% = 57.75% (rounding to 58%).
58
A counselor finds a correlation of –.78 between test anxiety and exam performance. **Which conclusion is most consistent with this finding?** * A. Higher anxiety is associated with lower performance. * B. Higher anxiety is associated with higher performance. * C. The relationship is too weak to interpret. * D. Anxiety has no measurable effect on performance.
A. Higher anxiety is associated with lower performance ## Footnote Strong negative correlation means as anxiety increases, performance decreases.
59
A new counselor assumes that a test appearing to measure social skills is automatically valid for that purpose. **Which concept is she misunderstanding?** * A. Construct validity * B. Predictive validity * C. Face validity * D. Content validity
C. Face validity ## Footnote Face validity refers only to how a test appears, not whether it actually measures what it claims to measure.
60
A counseling agency introduces a job performance test that correlates highly with actual job success six months later. **This demonstrates:** * A. Predictive validity * B. Concurrent validity * C. Face validity * D. Incremental validity
A. Predictive validity ## Footnote Predictive validity evaluates how well a test forecasts future performance.
61
The correlation between two measures is unknown, but you know they share 25% of their variance. **What is the absolute value of r?** * A. .50 * B. .25 * C. .05 * D. .75
A. .50 ## Footnote r = √.25 = .50.
62
A WAIS-IV is administered to a large random sample of U.S. adults. **What percentage is expected to score between 85 and 115?** * A. 68% * B. 50% * C. 34% * D. 95%
A. 68% ## Footnote ±1 SD in a normal distribution includes about 68% of scores.
63
According to the Buckley Amendment (PL 93-380), a 19-year-old college student can: **What can they do?** * A. View her own records * B. View her child’s preschool IQ test * C. Request corrections to her file * D. All of the above
D. All of the above ## Footnote FERPA gives students access to their own records and the right to request corrections.
64
A counselor is researching predictors of IQ test performance. She discovers that certain demographic factors are correlated with IQ scores. **Which of the following is least likely to show a significant correlation?** * A. Race * B. Attitude toward IQ testing * C. Birth order * D. Socioeconomic status
B. Attitude toward IQ testing ## Footnote While socioeconomic status and race often show measurable correlations with IQ scores, attitudes toward testing have little predictive relationship.
65
A high school student takes a career interest inventory twice, six months apart. **The counselor notes that the results are consistent, even though the student’s experiences have changed. What does this demonstrate?** * A. The test’s predictive validity * B. The test’s test-retest reliability * C. The test’s internal consistency * D. The test’s face validity
B. The test’s test-retest reliability ## Footnote Consistent results over time with the same instrument indicate strong test-retest reliability, even when external factors may have changed.
66
A counselor **defines validity to a peer as:** * A. Standard error of estimate for a person’s score * B. Degree of constancy in a score * C. Measurement error accounted for in correlation * D. Degree to which scores can be generalized for the same inference across tests
D. Degree to which scores can be generalized for the same inference across tests ## Footnote Validity concerns whether a score supports the same interpretation across measures.
67
A researcher finds that two personality scales have 64% shared variance. **The correlation coefficient must be:** * A. .80 or –.80 * B. .64 * C. .60 or –.60 * D. .32
A. .80 or –.80 ## Footnote r = √.64 = .80; correlation could be positive or negative.
68
Why did the **Wechsler IQ tests gain popularity over the Binet for adults?** * A. Group administration * B. Better for older populations * C. Focused on women’s strengths * D. More projective in nature
B. Better for older populations ## Footnote The Wechsler scales were better suited for adults than the Binet, which was more child-focused.
69
In an assessment study, the correlation between two administrations of the same test, taken two weeks apart, is .42. **Which conclusion is most appropriate?** * A. The test has acceptable test-retest reliability. * B. The test’s stability over time is questionable. * C. The two administrations measure different constructs. * D. The test is highly reliable.
B. The test’s stability over time is questionable ## Footnote .42 is low for reliability; generally > .80 is considered acceptable.
70
A limitation of many interest inventories is that they: **What is a limitation?** * A. Contain too few questions * B. Focus heavily on professional jobs and overlook blue-collar roles * C. Require advanced reading skills * D. Always produce invalid results for minorities
B. Focus heavily on professional jobs and overlook blue-collar roles ## Footnote Many inventories emphasize professional careers, reducing relevance for those pursuing skilled trades.
71
A counselor compares percentile ranks and standard scores. **Which feature do they have in common?** * A. Both are normally distributed when transformed, regardless of the raw score distribution. * B. Both use a fixed external standard for interpretation. * C. Both compare the examinee’s score to scores of others in the norm group. * D. Both express scores in standard deviation units from the mean.
C. Both compare the examinee’s score to scores of others in the norm group. ## Footnote Both percentile ranks and standard scores reference the norm group’s performance, although they express results differently.
72
A researcher correlates two alternate forms of an achievement test and finds r = .80. **What percentage of score variance is due to true score variance?** * A. 64% * B. 80% * C. 20% * D. 36%
A. 64% ## Footnote True variance = r² = (.80)² = .64 or 64%.
73
Logan and Lily took a test (M = 100, SD = 25). Logan’s z = -1.0, Lily’s T = 60. **On the test:** * A. Logan scored 75, Lily scored 125. * B. Logan scored 80, Lily scored 115. * C. Logan scored 75, Lily scored 115. * D. Logan scored 90, Lily scored 125.
Correct Answer: A. Logan scored 75, Lily scored 125. ## Footnote Logan: 100 - 25 = 75 Lily: T = 60 → z = +1.0 → 100 + 25 = 125
74
Clients should be told that: **What should they be told?** * A. Validity is more important than reliability * B. Projective tests favor psychodynamic theory * C. Face validity is not essential * D. A test is only one source of data
D. A test is only one source of data ## Footnote Ethical practice requires explaining that test results are not infallible and should be used alongside other data.
75
An HR manager administers a sales aptitude test (mean = 100, SD = 10) to employees. Chester has a Z = +1.0, Timothy scores 90, Beverly has a T = 70, and Kim scores 70 raw. **Who scored the lowest, and who scored the highest?** * A. Beverly; Timothy * B. Kim; Chester * C. Timothy; Beverly * D. Kim; Beverly
D. Kim; Beverly ## Footnote Beverly’s T-score of 70 = +2 SD (highest); Kim’s raw 70 is 3 SD below mean (lowest).
76
During a licensing exam, candidates are required to compose case analyses that the graders will interpret and score differently based on judgment. **This format is best described as:** * A. Objective * B. Subjective * C. Forced-choice * D. Recognition
B. Subjective ## Footnote Subjective tests require scoring based on the evaluator’s interpretation, unlike objective tests with predetermined correct answers.
77
Which correlation coefficient below indicates the **weakest relationship between two measures?** * A. –.45 * B. .22 * C. .58 * D. –.71
B. .22 ## Footnote Lowest absolute value indicates weakest relationship.
78
The NCE is classified as which type of test? **What type?** * A. Intelligence test * B. Aptitude test * C. Personality test * D. Achievement test
D. Achievement test ## Footnote The NCE measures mastery of counseling knowledge, fitting the achievement test category.
79
Counselors may use psychological tests to secure which type of diagnosis for insurance reimbursement? **Which type?** * A. CPT * B. DSM or ICD * C. Percentile * D. SEM
B. DSM or ICD ## Footnote DSM or ICD diagnoses are required for most third-party payments.
80
Group IQ testing in the U.S. **began with:** * A. 1905 Binet-Simon test * B. Binet’s work in France * C. Army Alpha and Beta in WWI * D. Freud’s psychoanalytic movement
C. Army Alpha and Beta in WWI ## Footnote The Army Alpha/Beta tests were the first large-scale group IQ measures.
81
A counselor suspects that a standardized test is culturally biased because it contains vocabulary specific to middle-class American households. **Which of the following steps is the most ethical?** * A. Continue using the test until a better one is found. * B. Administer the test only to clients from similar backgrounds. * C. Use alternative assessments that are culturally fair and normed on diverse populations. * D. Give the test but explain the possible cultural bias afterward.
C. Use alternative assessments that are culturally fair and normed on diverse populations. ## Footnote Ethical practice requires selecting culturally fair instruments when available, especially if bias could affect validity.
82
Carlos and Mia took a logic assessment. The mean was 120 and the standard deviation was 20. Carlos had a z-score of -1.0, and Mia had a T-score of 50. **On the test:** * A. Carlos scored 100 and Mia scored 120. * B. Carlos scored 100 and Mia scored 110. * C. Carlos scored 120 and Mia scored 130. * D. Carlos scored 110 and Mia scored 100.
A. Carlos scored 100 and Mia scored 120. ## Footnote Carlos: 120 + (-1.0 × 20) = 100 Mia: T = 50 → z = 0 → 120 + 0 = 120
83
A counselor is preparing students for a statewide academic skills exam. The test begins with easier items in reading comprehension, progresses to more difficult items, then starts over with easier items in mathematics, and again increases difficulty. This pattern repeats for science and social studies. **Which test format best describes this exam?** * A. Spiral test * B. Cyclical test * C. Power test * D. Ipsative test
B. Cyclical test ## Footnote A cyclical test contains multiple sections, each section organized like a spiral (from easy to difficult), but difficulty resets at the start of each new section.
84
Two measures of self-efficacy demonstrate a shared variance of 36%. **Which correlation coefficient below is most consistent with this finding?** * A. .36 * B. .60 * C. –.36 * D. –.60
B. .60 ## Footnote √.36 = .60; could be positive or negative, but with absolute value .60.
85
A counselor searching for a new personality measure should review: **What should they review?** * A. Professional journals * B. Buros Mental Measurements Yearbook * C. Test manuals from publishers * D. All of the above
D. All of the above ## Footnote All three sources provide valuable data on test validity, reliability, and use.
86
A new counseling skills test produces scores that match almost exactly with established measures of the same skills. **This reflects:** * A. Concurrent validity * B. Construct validity * C. Predictive validity * D. Face validity
A. Concurrent validity ## Footnote Concurrent validity means the new test correlates strongly with other validated tests given at the same time.
87
An interest inventory is least valid for: **Who?** * A. First-year college students in undecided majors * B. Third-year college students with declared majors * C. Eighth-grade student with IQ of 136 * D. Mid-career adults exploring new roles
C. Eighth-grade student with IQ of 136 ## Footnote Interest inventories are less accurate for very young adolescents due to limited exposure to occupations.
88
A parent reviews her child’s assessment report and sees an “82nd percentile” score. **She asks the counselor what this means. Which is the most accurate explanation?** * A. The child scored better than 82% of students in the norm group. * B. The child got 82% of the questions correct. * C. The child met 82% of the learning objectives. * D. The child scored within the top 18% of the standard score range.
A. The child scored better than 82% of students in the norm group. ## Footnote Percentile rank refers to the proportion of the norm group scoring lower—not percentage correct or objectives met.
89
Which are examples of aptitude tests? **Which ones?** * A. O*NET Ability Profiler and MCAT * B. MMPI-2 and CPI * C. Strong Interest Inventory and LSAT * D. GRE and Holland’s SDS
A. O*NET Ability Profiler and MCAT ## Footnote Both measure potential performance or abilities relevant to future training or careers.
90
A counselor wants to estimate the gap between a predicted job performance score and the employee’s actual score. **She should use:** * A. Probability estimate formula * B. Correction for attenuation * C. Standard error of the estimate * D. Standard error of measurement
C. Standard error of the estimate ## Footnote The standard error of the estimate reflects accuracy in predicting criterion scores from predictor scores.
91
Henry and Olivia took the same aptitude test. The test has a mean of 90 and a standard deviation of 10. Henry’s score corresponds to a z-score of 0.5, and Olivia’s corresponds to a T-score of 35. **On the test:** * A. Henry scored 95 and Olivia scored 85. * B. Henry scored 95 and Olivia scored 75. * C. Henry scored 100 and Olivia scored 70. * D. Henry scored 85 and Olivia scored 80.
B. Henry scored 95 and Olivia scored 75. ## Footnote Henry: 90 + (0.5 × 10) = 95 Olivia: T = 35 → z = -1.5 → 90 + (-1.5 × 10) = 75
92
A career counselor wants to choose a test that gives consistent results regardless of when it is administered. **She is most concerned with:** * A. Reliability * B. Face validity * C. Incremental validity * D. Content validity
A. Reliability ## Footnote Reliability ensures consistent measurement over time, conditions, and populations.
93
An exam prep company designs a counseling theory test with three sections: human development, counseling skills, and ethics. Each section starts with recall questions, then moves to application and synthesis items. Students who struggle in one section still find the start of the next section approachable. **Which test structure does this illustrate?** * A. Spiral test * B. Cyclical test * C. Adaptive test * D. Speed test
B. Cyclical test ## Footnote The hallmark of a cyclical test is repeating cycles of increasing difficulty in each new section. Spiral tests only have one continuous progression without resetting difficulty.
94
A school psychologist is assessing a 5-year-old for possible gifted placement. She wants to verify the validity evidence of a widely used intelligence test before administering it. **Which resource will give her the most comprehensive, critical reviews of the test?** * A. Standards for Educational and Psychological Testing * B. Tests in Print * C. Mental Measurements Yearbook * D. A combination of B and C
C. Mental Measurements Yearbook ## Footnote The Mental Measurements Yearbook provides professional reviews and validity data, which is essential before using a standardized test for high-stakes decisions.
95
Who is generally most qualified to administer the Rorschach? **Who?** * A. NCC counselor * B. Clinical psychologist * C. Psychiatrist * D. LCSW
B. Clinical psychologist ## Footnote The Rorschach requires specialized training, typically held by clinical psychologists.
96
A study finds that the correlation between two tests measuring problem-solving ability is 0.80. **What percentage of variance is not explained by the correlation?** * A. 16% * B. 64% * C. 36% * D. 80%
C. 36% ## Footnote The percentage of variance explained is the square of the correlation coefficient (r = 0.80). Squaring 0.80 gives 0.64, or 64% of the variance being explained. The remaining variance (unexplained) is 100% - 64% = 36%.
97
A hiring manager uses a skills test that closely mirrors real job duties. New hires who scored well perform strongly on the job. **This indicates the test has:** * A. High face validity * B. High content validity * C. High predictive validity * D. Low construct validity
C. High predictive validity ## Footnote The ability to predict actual future job performance reflects predictive (criterion-related) validity.
98
A counselor tells a client that the SAT predicts their likelihood of succeeding in their first year of college. **This is an example of:** * A. Concurrent validity * B. Predictive validity * C. Content validity * D. Face validity
B. Predictive validity ## Footnote Predictive validity is the test’s ability to forecast future outcomes, such as academic performance.
99
Two counselors independently score the same client’s essay test and assign nearly identical scores. **This demonstrates:** * A. Alternate forms reliability * B. Inter-rater reliability * C. Test-retest reliability * D. Content validity
B. Inter-rater reliability ## Footnote Inter-rater reliability is agreement between different evaluators scoring the same performance.
100
A counselor is evaluating a new standardized achievement test for use in a diverse high school. The test has strong predictive validity for college GPA but shows lower performance scores among students from non-English-speaking backgrounds. **Which action best addresses both validity and fairness concerns?** * A. Continue using the test because it has predictive validity. * B. Modify the test format and language to reduce cultural loading before further use. * C. Switch to an ipsative measure to avoid cultural bias. * D. Eliminate all standardized tests from the admissions process.
B. Modify the test format and language to reduce cultural loading before further use. ## Footnote While predictive validity is important, fairness is also crucial. Cultural bias can lower scores for non-English speakers, so revising the test to minimize cultural loading supports both validity and ethical standards. Ipsative measures (C) are not comparable across individuals, and eliminating all tests (D) is not a balanced solution.
101
The distinction between aptitude and achievement is blurred in tests like the: **Which test?** * A. Stanford-Binet * B. WAIS-IV * C. GRE * D. Bender-Gestalt
C. GRE ## Footnote The GRE assesses both learned knowledge and predicts graduate school performance.
102
A word association test is classified as: **What type of test?** * A. Neuropsychological * B. Motoric * C. Achievement * D. Projective
D. Projective ## Footnote Word association is a projective technique revealing underlying thought patterns.
103
Using the original IQ formula, a child with a mental age of 12 and a chronological age of 10 **would have an IQ of:** * A. 120 * B. 80 * C. 100 * D. 112
A. 120 ## Footnote IQ = (MA ÷ CA) × 100 → (12 ÷ 10) × 100 = 120.
104
Two different versions of an aptitude test are given to the same group, and the results are nearly identical. **Which property does this support?** * A. Alternate forms reliability * B. Face validity * C. Standard error of measurement * D. Discriminant validity
A. Alternate forms reliability ## Footnote Alternate forms reliability assesses consistency between different but equivalent versions of the same test.
105
When reporting the mean of scores, **which variability measure is best to include?** * A. Range * B. Standard deviation * C. Semi-interquartile range * D. Correlation
B. Standard deviation ## Footnote Standard deviation shows how scores spread around the mean.
106
While designing an entrance exam for an advanced counseling program, faculty set most items to a difficulty level where only 25% of applicants are expected to answer correctly. **What is the intended outcome?** * A. Ensure the test is fair for all applicants * B. Select only the highest-performing candidates * C. Eliminate cultural bias * D. Increase test reliability
B. Select only the highest-performing candidates ## Footnote A difficulty level of .25 means only 25% are expected to answer correctly, which is appropriate when the goal is to admit only top performers.
107
A counselor uses an instrument that measures an abstract trait like ego strength. **Which type of validity is most relevant?** * A. Construct validity * B. Criterion validity * C. Face validity * D. Content validity
A. Construct validity ## Footnote Construct validity refers to whether a test accurately measures a theoretical trait or concept.
108
A counselor needs to measure both verbal and performance abilities in a 22-year-old male client. **The most suitable test is:** * A. WAIS-IV * B. WISC-IV * C. WPPSI-IV * D. Stanford-Binet for preschool
A. WAIS-IV ## Footnote The WAIS-IV is appropriate for adults and includes verbal and performance components.
109
Grace and Elijah took a reading test (M = 75, SD = 15). Grace’s z = +0.5, Elijah’s T = 45. **On the test:** * A. Grace scored 82.5, Elijah scored 67.5 * B. Grace scored 80, Elijah scored 60 * C. Grace scored 82.5, Elijah scored 75 * D. Grace scored 85, Elijah scored 70
A. Grace scored 82.5, Elijah scored 67.5 ## Footnote Grace: 75 + (0.5 × 15) = 82.5 Elijah: T = 45 → z = -0.5 → 75 - 7.5 = 67.5
110
Lewis Terman is known for: **What is he known for?** * A. Creating the Wechsler scales * B. Developing the original Binet test * C. Creating the Rorschach * D. Americanizing the Binet test
D. Americanizing the Binet test ## Footnote Terman adapted the Binet-Simon test into the Stanford-Binet for U.S. use.
111
A counselor uses test-retest reliability and finds a correlation of .70 between the first and second administrations. **What is the shared variance?** * A. 70% * B. 49% * C. 30% * D. 90%
B. 49% ## Footnote Shared variance is the square of the correlation: (.70)² = .49, or 49%.
112
A researcher finds that a stress inventory has a reliability coefficient of .70. **How should this be interpreted?** * A. 70% of the score variance is accurate measurement. * B. The test is invalid. * C. Only 70% of examinees scored accurately. * D. Reliability cannot be calculated from this figure.
A. 70% of the score variance is accurate measurement. ## Footnote A coefficient of .70 means 70% true variance and 30% measurement error.
113
Infant IQ tests are generally: **How reliable are they?** * A. More reliable than adult tests * B. Less reliable than adult tests * C. Not related to experience * D. Never used
B. Less reliable than adult tests ## Footnote Developmental changes make early IQ measures less stable over time.
114
While reviewing test results, a counselor notes that 80% of clients answered one particular item correctly. **Which index does this percentage represent?** * A. Difficulty index * B. Reliability coefficient * C. Standard error of measurement * D. Item discrimination
A. Difficulty index ## Footnote The difficulty index measures the percentage of individuals who answer correctly; higher percentages mean easier items.
115
The MMPI-2 is best described as: **What type of test?** * A. IQ test * B. Neurological screening * C. Projective test * D. Standardized personality inventory
D. Standardized personality inventory ## Footnote The MMPI-2 is a standardized measure of personality and psychopathology.
116
**Lewis Terman** is known for: * A. Creating the Wechsler scales * B. Developing the original Binet test * C. Creating the Rorschach * D. Americanizing the Binet test
D. Americanizing the Binet test ## Footnote Terman adapted the Binet-Simon test into the Stanford-Binet for U.S. use.
117
A counselor notes a test is very consistent in scores but does not measure what it claims. **Which is true?** * A. Reliable but not valid * B. Valid but not reliable * C. Both valid and reliable * D. Neither valid nor reliable
A. Reliable but not valid ## Footnote Reliability means consistency; validity means accuracy. A test can be consistent but still inaccurate.
118
One advantage of interest inventories is that they: **What is an advantage?** * A. Are always graded by the test-taker * B. Require no reading * C. Are generally reliable and non-threatening * D. Have universal predictive validity
C. Are generally reliable and non-threatening ## Footnote They tend to have good reliability and are easy for clients to complete without stress.
119
Most counselors agree that public education efforts should focus on: **What should they focus on?** * A. More preschool IQ testing * B. More teacher-administered personality tests * C. Educating the public on testing * D. Promoting the mystique of testing
C. Educating the public on testing ## Footnote Increasing public understanding of testing can reduce misuse and misinterpretation.
120
A counselor compares scores from an IQ test to the number of students who answered each item correctly. **She finds one question that only 15% of students answered correctly. Which index is she examining?** * A. Reliability index * B. Difficulty index * C. Discrimination index * D. Recognition index
B. Difficulty index ## Footnote The difficulty index measures the percentage of individuals who answer an item correctly. A lower percentage means a more difficult item.
121
Mary scores 110 on a 150-item test (mean = 100, SD = 10, reliability = .84). **What is the 68% confidence interval for her true score?** * A. 106–114 * B. 90–130 * C. 92–108 * D. 100–120
A. 106–114 ## Footnote SEM = SD × √(1 – r) = 10 × √(.16) = 4; CI = 110 ± 4.
122
A chemistry test has a mean score of 50, a standard deviation of 6, and a standard error of measurement of 2. If a student scores 55, **what range of scores would be expected on a retest about two-thirds of the time?** * A. 51 - 59 * B. 53 - 57 * C. 47 - 63 * D. 49 - 61
B. 53 - 57 ## Footnote Solution: For two-thirds of the time (~68%), we use ±1 SEM: 55±2=(53,57)55 \pm 2 = (53, 57)55±2=(53,57)
123
Standard scores are measured on **which type of scale?** * A. Interval * B. Ratio * C. Nominal * D. Ordinal
A. Interval ## Footnote Standard scores have equal intervals and an arbitrary zero, fitting interval scale definition.
124
A counselor shortens a reliable test because clients find it too long. **What is the most likely impact on reliability?** * A. Increase slightly * B. Remain unchanged * C. Increase by 10 points or more * D. Decrease
D. Decrease ## Footnote Shortening a test generally decreases reliability because fewer items reduce measurement stability.
125
On a criterion-referenced reading test, a student scores 83. **Which is the most accurate interpretation?** * A. Better than 83% of test-takers * B. Correctly answered 83 questions * C. Not enough information to determine * D. Mastered 83% of the content objectives
D. Mastered 83% of the content objectives ## Footnote Criterion-referenced scores reflect mastery of specified content, not comparison to others.
126
Isabella and Mason took the same cognitive test (M = 60, SD = 6). Isabella’s z = +2.0, Mason’s T = 50. **On the test:** * A. Isabella scored 66, Mason scored 60. * B. Isabella scored 72, Mason scored 60. * C. Isabella scored 72, Mason scored 66. * D. Isabella scored 70, Mason scored 60.
B. Isabella scored 72, Mason scored 60. ## Footnote Isabella: 60 + (2 × 6) = 72 Mason: T = 50 → z = 0 → score = 60
127
The standard error of measurement tells you: **What does it tell you?** * A. How accurate a score is likely to be * B. The test’s predictive power * C. How much a score correlates with norms * D. The population best suited for the test
A. How accurate a score is likely to be ## Footnote SEM estimates the range in which the true score is likely to fall due to measurement error.
128
A vocational rehabilitation counselor wants to know which personal variables are not significantly correlated with mental ability test scores. **Based on research, which factor would be least likely to have a significant relationship?** * A. Race * B. Attitude toward IQ testing * C. Birth order * D. Socioeconomic status
B. Attitude toward IQ testing ## Footnote Research consistently shows strong correlations between mental ability scores and socioeconomic status, moderate ones with race, and negligible with attitudes toward testing.
129
A counselor uses both discriminant and convergent validation studies when developing a new assessment. **These methods establish:** * A. Criterion-related validity * B. Content validity * C. Differential validity * D. Construct validity
D. Construct validity ## Footnote Construct validity includes convergent and discriminant evidence to confirm a test measures its intended construct.
130
An informal method of appraisal would be: **What is an example?** * A. IQ testing * B. Standardized personality test * C. GRE scores * D. Behavioral checklist
D. Behavioral checklist ## Footnote Checklists are informal, nonstandardized appraisal tools.
131
A counselor explains to a student that an aptitude test measures potential performance, **while an achievement test measures:** * A. Career interest * B. Current performance * C. Potential performance * D. Career development
B. Current performance ## Footnote Aptitude is predictive; achievement reflects what has already been learned or mastered.
132
A test is developed to measure intelligence. **Which term correctly describes the score expressed as a ratio of mental age to chronological age times 100?** * A. Intelligence quotient * B. Standard score * C. Z-score * D. Percentile rank
A. Intelligence quotient ## Footnote IQ = (Mental Age ÷ Chronological Age) × 100 in the original formula.
133
The Myers-Briggs Type Indicator is based on the theory of: **Whose theory?** * A. Raymond Cattell * B. Carl Jung * C. B.F. Skinner * D. Alfred Adler
B. Carl Jung ## Footnote The MBTI is rooted in Jung’s personality typology.
134
A counselor concerned about possible neurological impairment might choose: **Which test?** * A. Bender-Gestalt II * B. MMPI-2 * C. TAT * D. Strong Interest Inventory
A. Bender-Gestalt II ## Footnote The Bender-Gestalt II assesses visual-motor integration, often used in neuropsychological screening.
135
To reduce response set bias, a counselor considers multiple strategies. **Which is least effective?** * A. Forced-choice format * B. Having fewer negative than positive statements * C. Ensuring anonymity * D. Randomizing item order
B. Having fewer negative than positive statements ## Footnote Imbalanced item wording can worsen bias rather than reduce it.
136
During a school district’s review of testing procedures, a counselor recommends replacing a speed test with a power test for evaluating reading comprehension. **Which rationale best supports this change?** * A. Power tests allow for unlimited time, measuring the depth of comprehension rather than speed. * B. Speed tests are more culturally fair. * C. Power tests eliminate the need for item difficulty analysis. * D. Speed tests are only used in athletic performance assessment.
A. Power tests allow for unlimited time, measuring the depth of comprehension rather than speed. ## Footnote Power tests focus on the complexity of items and depth of understanding rather than quick responses, making them more appropriate for evaluating comprehension.
137
In a study that examines problem-solving ability using two tests with a correlation of 0.70, **which of the following describes the total variance in test scores?** * A. 49% is explained by the correlation, and 51% is unexplained. * B. 70% is explained by the correlation, and 30% is unexplained. * C. 51% is explained by the correlation, and 49% is unexplained. * D. 30% is explained by the correlation, and 70% is unexplained.
A. 49% is explained by the correlation, and 51% is unexplained ## Footnote The percentage of variance explained is the square of the correlation (0.70), which equals 49%. Therefore, the remaining 51% of the variance is not explained by the correlation.
138
An HR manager uses a test to hire engineers. **What is the minimum acceptable reliability coefficient for high-stakes decisions?** * A. .20 * B. .55 * C. .80 * D. .95
C. .80 ## Footnote For employment decisions, a reliability of .80 or higher is generally acceptable.
139
The **original purpose of the Binet-Simon test was to:** * A. Assess high school graduates * B. Screen U.S. Army recruits * C. Identify children with intellectual disabilities * D. Measure genius in adults
C. Identify children with intellectual disabilities ## Footnote It was designed to distinguish children with learning difficulties from those without.
140
A researcher finds that the correlation between two tests measuring problem-solving ability is 0.70. **What is the proportion of variance that is not shared between the tests?** * A. 49% * B. 51% * C. 70% * D. 30%
B. 51% ## Footnote The shared variance between the two tests is the square of the correlation coefficient (r = 0.70), which is 0.49 or 49%. The remaining 51% is the unexplained variance.
141
The Black-White IQ controversy was sparked by a 1969 article by: **Who?** * A. Ertl * B. Cattell * C. Jensen * D. Williams
C. Jensen ## Footnote Arthur Jensen questioned causes of group differences in IQ scores.
142
If a study finds a correlation of 0.70 between two tests measuring problem-solving ability, **which of the following is true regarding the unexplained variance?** * A. The tests explain 70% of the variance in problem-solving ability. * B. The tests explain 49% of the variance in problem-solving ability. * C. The tests explain 51% of the variance in problem-solving ability. * D. The tests do not explain any of the variance in problem-solving ability.
B. The tests explain 49% of the variance in problem-solving ability ## Footnote The percentage of variance explained is the square of the correlation coefficient (r = 0.70). Squaring 0.70 results in 0.49, or 49% of the variance being explained. Therefore, the remaining variance (unexplained) is 51%.
143
A counselor designs a creativity measure that evaluates both the ability to produce one correct solution and the ability to generate multiple unique ideas. **This approach reflects the work of:** * A. Cattell * B. Thurstone * C. Guilford * D. Wechsler
C. Guilford ## Footnote J.P. Guilford emphasized convergent (one answer) and divergent (many answers) thinking as key components of intelligence.
144
Which test is most appropriate for **assessing a 22-year-old male’s intelligence?** * A. WPPSI-IV * B. WAIS-IV * C. WISC-IV * D. Any online IQ test
B. WAIS-IV ## Footnote The WAIS-IV is the adult Wechsler test, appropriate for ages 16 and up.
145
A study finds that the correlation between two tests measuring problem-solving ability is 0.40. **What percentage of the variance is not explained by the correlation?** * A. 84% * B. 36% * C. 60% * D. 16%
A. 84% ## Footnote The percentage of variance explained is the square of the correlation coefficient (r = 0.40). Squaring 0.40 gives 0.16, or 16% of the variance being explained. The remaining variance (unexplained) is 100% - 16% = 84%.
146
A counselor interprets a client’s MMPI-2 results by examining the overall pattern of scores rather than focusing on one scale. **This process is called:** * A. Profile analysis * B. Scatter analysis * C. Assigning a DSM-5 diagnosis directly * D. Deterioration indexing
A. Profile analysis ## Footnote Profile analysis interprets the relative elevations of different MMPI scales to assess patterns of functioning.
147
Ella and James took a reading comprehension exam. The exam has a mean of 75 and a standard deviation of 5. Ella’s score corresponds to a z-score of +1.5, and James’s score corresponds to a T-score of 45. **On the test:** * A. Ella scored 82.5 and James scored 72.5. * B. Ella scored 80 and James scored 70. * C. Ella scored 82.5 and James scored 70. * D. There is not enough data to determine the scores.
A. Ella scored 82.5 and James scored 72.5. ## Footnote Ella: 75 + (1.5 × 5) = 82.5 James: T = 45 → z = -0.5 → 75 + (-0.5 × 5) = 72.5 Oops! James = 72.5, not 70. Correct answer: A
148
The best test for **assessing a kindergartner’s intelligence is:** * A. WPPSI-IV * B. WAIS-IV * C. WISC-IV * D. Myers-Briggs
A. WPPSI-IV ## Footnote The WPPSI-IV is for children ages 2 years 6 months to 7 years 7 months.
149
The Thematic Apperception Test uses: **What does it use?** * A. Inkblots * B. Pictures * C. Verbal analogies * D. Sentence completions
B. Pictures ## Footnote The TAT presents ambiguous pictures for storytelling to reveal underlying motives.
150
A career counselor administers two interest inventories to the same group of clients. **The instruments yield a correlation coefficient of .63. This suggests that:** * A. Approximately 40% of the variance in scores is shared between the instruments. * B. The inventories measure unrelated constructs. * C. The instruments are not appropriate for concurrent validity analysis. * D. Reliability is likely above .90.
A. Approximately 40% of the variance in scores is shared between the instruments ## Footnote .63^2 = .3969 → ~40% shared variance.
151
Seventeen counselors use the same new projective test with one client and arrive at nearly identical conclusions. **What does this indicate?** * A. Validity * B. Reliability * C. Cultural fairness * D. Predictive power
B. Reliability ## Footnote Agreement among raters reflects inter-rater reliability.
152
A counselor is selecting a standardized IQ test for a 6-year-old. **Which is the most appropriate choice?** * A. WAIS-IV * B. WISC-IV * C. WPPSI-IV * D. Stanford-Binet for adults
C. WPPSI-IV ## Footnote The WPPSI-IV is designed for children aged 2 years, 6 months to 7 years, 7 months.
153
Which type of counselor is most likely to prefer projective testing methods? **Who?** * A. Rogerian counselor * B. Strict behaviorist * C. Transactional analyst * D. Psychodynamic clinician
D. Psychodynamic clinician ## Footnote Projective tests are aligned with psychodynamic approaches that explore unconscious processes.
154
The best **intelligence test for a sixth-grade student is:** * A. WPPSI-IV * B. WAIS-IV * C. WISC-IV * D. Stanford-Binet for adults
C. WISC-IV ## Footnote The WISC-IV is intended for ages 6–16.
155
A new counselor says, “If a test has low reliability, it will also have low validity.” **Is this correct?** * A. Yes, because unreliable tests cannot be valid. * B. No, because reliability ensures validity. * C. No, because validity does not require reliability. * D. None of the above.
A. Yes, because unreliable tests cannot be valid. ## Footnote Reliability is a prerequisite for validity.
156
If a pair of tests shares 36% of variance, **which of the following could be the correlation coefficient?** * A. .36 * B. .60 * C. –.60 * D. Both B and C
D. Both B and C ## Footnote r = √.36 = .60; correlation could be positive or negative.
157
If the correlation between two alternate forms of an exam is .80, **the proportion of variance they share is:** * A. 64% * B. 80% * C. 8% * D. 32%
A. 64% ## Footnote Shared variance = r2r^2r2 → .802=.64.80^2 = .64.802=.64 → 64%.
158
Which type of standard score most closely **follows the normal curve?** * A. T-score * B. Deciles * C. Z-score * D. Percentiles
C. Z-score ## Footnote Z-scores are directly tied to the standard normal distribution.
159
Two tests share 49% of their variance. **What is the correlation between them?** * A. .49 * B. .70 * C. .50 * D. .35
B. .70 ## Footnote Shared variance = r^2 → r = √.49 = .70.
160
A school district plans to introduce a standardized test to predict student success in college. **Which type of validity should be prioritized?** * A. Predictive validity * B. Concurrent validity * C. Content validity * D. Face validity
A. Predictive validity ## Footnote Predictive validity measures how well a test forecasts future performance, such as college success.
161
The current **Stanford-Binet IQ test is classified as:** * A. Nonstandardized * B. Standardized * C. Projective * D. Both B and C
B. Standardized ## Footnote It is a standardized measure with consistent administration and scoring procedures.
162
In a normally distributed IQ test (mean = 100, SD = 15), **which T-score corresponds to a raw score of 130?** * A. 70 * B. 80 * C. 30 * D. 40
A. 70 ## Footnote A raw 130 is 2 SD above mean; T-score formula (10z + 50) = 70.
163
In a culture-fair test: **What is true?** * A. Items are familiar to all examinees regardless of culture * B. Test is not standardized * C. Culture-free items cannot be used * D. African Americans always score higher
A. Items are familiar to all examinees regardless of culture ## Footnote Culture-fair tests minimize language and cultural content to reduce bias.
164
A client takes the same cognitive ability test twice, six months apart, and receives similar scores. **Which psychometric property does this demonstrate?** * A. Predictive validity * B. Test-retest reliability * C. Internal consistency * D. Content validity
B. Test-retest reliability ## Footnote Test-retest reliability measures the stability of scores over time.
165
A counselor reviews the NCE and notes that each question has one correct answer and several distractors. **What test format is this?** * A. Recognition-based forced choice * B. Ipsative measure * C. Projective measure * D. Free-response
A. Recognition-based forced choice ## Footnote Multiple-choice items require recognition and are classified as forced-choice questions.
166
A counselor begins analyzing a large set of test scores. **Which is the most accepted starting point?** * A. Central tendency * B. Frequency distribution * C. Variability * D. Correlation
B. Frequency distribution ## Footnote Frequency distribution is a common first step before summarizing data.
167
Two equivalent forms of a leadership assessment are given to a group in the same week, producing similar results. **This is:** * A. Test-retest reliability * B. Alternate forms reliability * C. Split-half reliability * D. Inter-rater reliability
B. Alternate forms reliability ## Footnote Alternate forms reliability compares scores from two equivalent versions of the same test.
168
A counselor reminds a trainee that **the four major functions of assessment include:** * A. Prediction, monitoring, evaluation, discrimination * B. Prediction, standardization, evaluation, classification * C. Prediction, classification, monitoring, screening * D. Amelioration, determination, investigation, deterioration
A. Prediction, monitoring, evaluation, discrimination ## Footnote Standard texts identify prediction, monitoring, evaluation, and discrimination as primary functions.
169
A graduate counseling class is taking a midterm that requires analyzing case studies and writing short essays. **The professor notes that grading will depend on the evaluator’s judgment. Which term best describes this test format?** * A. Objective format * B. Subjective format * C. Forced-choice format * D. Recognition format
B. Subjective format ## Footnote Essays are graded using judgment and interpretation, making them subjective. Objective formats have predetermined correct answers (e.g., multiple-choice).
170
A counselor gives the same assessment to the same group twice over two weeks to check score stability. **This is testing:** * A. Test-retest reliability * B. Alternate forms reliability * C. Split-half reliability * D. Inter-rater reliability
A. Test-retest reliability ## Footnote Test-retest reliability checks consistency across repeated administrations.
171
A school psychologist is preparing to assess a 5-year-old for possible gifted education placement. She wants to confirm whether a commercially available IQ test has strong validity evidence for that age group. **Which resource would provide the most comprehensive critical review of the test?** * A. Standards for Educational and Psychological Testing * B. Tests in Print * C. Mental Measurements Yearbook * D. Both B and C
C. Mental Measurements Yearbook ## Footnote The Mental Measurements Yearbook contains professional reviews and technical validity data, making it the best resource for evaluating an instrument before use.
172
Which of the following correlations indicates the **strongest relationship?** * A. –.55 * B. .67 * C. –.81 * D. .79
C. –.81 ## Footnote Strength is based on absolute value; –.81 is strongest.
173
A counselor unsure whether a test fits their client population should first: **What should they do?** * A. Ask a graduate program * B. Contact the test’s author * C. Read the test manual * D. Administer it randomly to several clients
C. Read the test manual ## Footnote The test manual specifies intended populations and administration guidelines.
174
A client’s WAIS-IV shows significantly higher Verbal IQ than Performance IQ. **Which interpretation is most likely?** * A. Antisocial tendencies * B. Educational deficits * C. Neurological impairment * D. All of the above
C. Neurological impairment ## Footnote A large discrepancy can indicate neurological issues, though context is needed.
175
A counselor divides a standardized exam into even-numbered and odd-numbered questions, scoring each half separately. She then compares results. **This is testing:** * A. Split-half reliability * B. Alternate forms reliability * C. Inter-rater reliability * D. Predictive validity
A. Split-half reliability ## Footnote Split-half reliability assesses internal consistency by correlating two halves of the same test.
176
A new instrument measures a client’s “sense of purpose.” **Which validity type is critical to confirm before using it in treatment planning?** * A. Predictive validity * B. Construct validity * C. Concurrent validity * D. Face validity
B. Construct validity ## Footnote Construct validity ensures an abstract psychological concept is being accurately measured.
177
Two different depression inventories are given to a client in the same session, and the results are highly correlated. **This is evidence of:** * A. Concurrent validity * B. Predictive validity * C. Face validity * D. Test-retest reliability
A. Concurrent validity ## Footnote Concurrent validity is shown when scores on different tests measuring the same construct correlate closely when administered at the same time.
178
A parent asks the counselor, “My child’s test report says she’s at the 82nd percentile. **What does that actually mean?”** * A. “She performed better than 82% of students in the norm group.” * B. “She answered 82% of the questions correctly.” * C. “She met 82% of the learning objectives.” * D. “She scored in the top 18% of the standard score range.”
A. “She performed better than 82% of students in the norm group.” ## Footnote Percentile rank reflects the percentage of norm group members who scored lower, not the percentage correct.
179
A counselor evaluates a test that accurately measures empathy. **What must be true about its reliability?** * A. It may or may not be reliable. * B. It must be reliable. * C. It is always unreliable. * D. Reliability is irrelevant to validity.
B. It must be reliable. ## Footnote Validity requires reliability; an accurate measure must also produce consistent results.
180
If the correlation between two forms of a vocational aptitude test is .60, **what does this tell us?** * A. They share 36% of their variance. * B. They share 60% of their variance. * C. They share 40% of their variance. * D. They share 30% of their variance.
A. They share 36% of their variance ## Footnote .60^2 = .36 → 36% shared variance.
181
One major current trend in testing is: **What is a trend?** * A. More reliance on computer-assisted testing and scoring * B. Increased paper-and-pencil use * C. More standardized tests for children * D. Elimination of all computerized methods
A. More reliance on computer-assisted testing and scoring ## Footnote Technology has expanded computerized administration and scoring.
182
A researcher finds a correlation of 0.50 between two problem-solving ability tests. **What percentage of the variance is explained by this correlation?** * A. 50% * B. 25% * C. 75% * D. 70%
B. 25% ## Footnote The percentage of variance explained is the square of the correlation coefficient (r = 0.50). Squaring 0.50 gives 0.25, or 25% of the variance is explained by the correlation. The remaining variance is 100% - 25% = 75%.
183
A psychometrician notes that a test with strict time limits consistently yields more consistent results than a comparable untimed version. **This suggests speeded tests tend to be:** * A. Less reliable * B. Have no effect on reliability * C. More reliable * D. None of the above
C. More reliable ## Footnote Speeded tests can increase reliability because they reduce random error due to guessing or pacing differences.
184
Sophie and Noah took the same standardized test. The test has a mean of 80 and a standard deviation of 10. Sophie’s score corresponds to a z-score of 0, and Noah’s score corresponds to a T-score of 60. **On the test:** * A. Sophie scored 80 and Noah scored 90. * B. Sophie scored 70 and Noah scored 85. * C. Sophie scored 80 and Noah scored 100. * D. Sophie scored 80 and Noah scored 85.
A. Sophie scored 80 and Noah scored 90. ## Footnote Sophie: z = 0 → 80 + (0 × 10) = 80 Noah: T = 60 → z = +1.0 → 80 + (1 × 10) = 90 Oops! Noah should be 90. Correct answer: A
185
A client completes a personality inventory and is asked to compare different statements about themselves to select the one most applicable. **This type of measure is:** * A. Normative * B. Ipsative * C. Spiral * D. Power
B. Ipsative ## Footnote Ipsative measures require test-takers to compare their own responses rather than being compared to a norm group.
186
A licensing exam for counselors has four domains: ethics, assessment, career, and group work. In each domain, questions start basic and progress to complex. Candidates report that after finishing one domain, the next section begins again with easier questions. **What is the advantage of this cyclical structure?** * A. It eliminates the need for standardization. * B. It allows each content area to be assessed across a range of difficulty levels independently. * C. It ensures all questions are equally difficult. * D. It increases test speed and reduces completion time.
B. It allows each content area to be assessed across a range of difficulty levels independently. ## Footnote Cyclical design lets each content area be tested in depth without the earlier domains affecting the difficulty progression of later ones.
187
A counselor is designing a school-wide mental health screening. She needs an instrument that can estimate attributes like coping skills and stress tolerance for all students. **Which selection best meets the definition of appraisal?** * A. Administering an IQ test to identify gifted students. * B. Conducting interviews with at-risk students only. * C. Using structured questionnaires to estimate psychological attributes for the entire population. * D. Observing students in the cafeteria during lunch.
C. Using structured questionnaires to estimate psychological attributes for the entire population. ## Footnote Appraisal refers to systematically assessing or estimating attributes, often across a group. While observation (D) and interviews (B) can be part of the process, a structured questionnaire for all students fits the formal appraisal definition.
188
A counselor is choosing between two standardized math achievement tests for middle school students. Both are valid, but one has a higher reliability coefficient. **Which is the best choice and why?** * A. The less reliable test, because validity is more important. * B. The more reliable test, because it will provide more consistent results. * C. Either test, since validity automatically ensures reliability. * D. Neither test, because reliability is irrelevant for achievement measures.
B. The more reliable test, because it will provide more consistent results. ## Footnote Between two equally valid tests, the more reliable instrument is preferable because it will yield more consistent scores.
189
A client’s IQ score is reported as 104 with a standard error of measurement (SEM) of 3. **Which interpretation is most accurate?** * A. The client’s IQ is exactly 104. * B. The true score is likely between 101 and 107 about 68% of the time. * C. The IQ score cannot be interpreted without achievement data. * D. The IQ is in the gifted range.
B. The true score is likely between 101 and 107 about 68% of the time. ## Footnote The SEM indicates the likely range for the true score within a certain confidence interval (±1 SEM for 68% confidence).
190
A correlation coefficient of .00 between two variables **indicates:** * A. A complete absence of any relationship between the variables. * B. No linear relationship between the variables. * C. That the variables are orthogonal but share variance in other ways. * D. That the two measures are inversely related.
B. No linear relationship between the variables ## Footnote A zero correlation means no linear relationship; other non-linear relationships may still exist.
191
A counselor needs to choose between the WAIS-IV, MMPI-2, TAT, and a digital scale to ensure maximum reliability. **Which option will have the highest reliability coefficient?** * A. WAIS-IV * B. MMPI-2 * C. TAT * D. Digital scale
D. Digital scale ## Footnote Mechanical devices like a precise digital scale have near-perfect reliability compared to psychological tests.
192
A counselor is reviewing the relationship between reliability and validity for a new career interest inventory. The test has low reliability. **Which conclusion is accurate?** * A. Low reliability means low validity. * B. Low reliability means high validity. * C. Reliability and validity are unrelated. * D. High validity can occur without high reliability.
A. Low reliability means low validity. ## Footnote Reliability is a prerequisite for validity; an unstable measure cannot be valid.