Question 1

The objectivity of a standardized test refers to its&#10;A) degree of relationship to a criterion variable.&#10;B) freedom from tester bias.&#10;C) accuracy of prediction.&#10;D) provision of normative data.

Accepted Answer

The objectivity of a standardized test refers to its freedom from tester bias, meaning the test's results are not influenced by the opinions or prejudices of the person administering the test. This ensures that the test measures what it is supposed to measure in a consistent manner across different administrations.

Question 2

In contrast to the use of performance tests in research, the use of self-report measures generally&#10;A) is directed at measurement of personality characteristics.&#10;B) is meant to demonstrate each respondent's level of a characteristic that is typical for her.&#10;C) is subject to distortion due to response sets.&#10;D) all of the above.

Accepted Answer

Self-report measures are often used to assess personality characteristics, aim to demonstrate a respondent's typical level of a characteristic, and are subject to distortion due to response sets such as social desirability or acquiescence bias.

Question 3

The tendency to present oneself in a favorable light when responding to a self-report measure is known as a(n) _________response set.&#10;A) social desirability&#10;B) acquiescence&#10;C) deviance&#10;D) carelessness

Accepted Answer

Social desirability response set refers to the tendency of respondents to answer questions in a manner that will be viewed favorably by others, often leading to over-reporting of "good" behavior or under-reporting of "bad" behavior.

Question 4

To standardize a test, the developer must establish&#10;A) agreement between researchers and practitioners about test content.&#10;B) the test's reliability.&#10;C) procedures that ensure consistency in how the test is administered and scored.&#10;D) all of the above.

Accepted Answer

Standardizing a test primarily involves establishing procedures that ensure consistency in administration and scoring, which is option C.

Question 5

In test development, differential item functioning refers to the tendency for&#10;A) some test items to relate more closely with the total test score than other items.&#10;B) individual scorer bias to affect how test items are scored.&#10;C) individuals of equal ability but from different subgroups not to have the same probability of earning a given score.&#10;D) individuals to score differently on each administration of the test.

Accepted Answer

Differential item functioning occurs when individuals of equal ability but from different subgroups (e.g., gender, ethnicity) do not have the same probability of answering a test item correctly, indicating a potential bias in the item.

Question 6

Match each example of evidence about a test below with the type of evidence of test validity that it represents.

-The test was found to be a good measure of students' ability to learn independently.

A) Evidence from relationship to other variables
B) Evidence from consequences of testing
C) Evidence from response processes
D) Evidence from test content

Accepted Answer

This example demonstrates evidence from test content, as it directly relates to the topics covered in the textbook, indicating an attempt to align the test's content with the curriculum or material that is supposed to be assessed.

Question 7

Match each example of evidence about a test below with the type of evidence of test validity that it represents.

-Students' rank order of performance on the test was found to correspond well to the students' rank order of performance on an established longer test of the same ability.

A) Evidence from relationship to other variables
B) Evidence from consequences of testing
C) Evidence from response processes
D) Evidence from test content

Accepted Answer

This example illustrates evidence from consequences of testing because it focuses on the outcomes or implications of using the test, such as differential referral rates based on gender.

Question 8

Match each example of evidence about a test below with the type of evidence of test validity that it represents.

-Students' rank order of performance on the test was found to correspond well to the students' rank order of performance on an established longer test of the same ability.

A) Evidence from relationship to other variables
B) Evidence from consequences of testing
C) Evidence from response processes
D) Evidence from test content

Accepted Answer

This example demonstrates evidence from relationship to other variables because it compares the test results to another established measure of the same ability, showing a correlation between the two sets of scores.

Question 9

Match each example of evidence about a test below with the type of evidence of test validity that it represents.

-The test was found to lead to a higher rate of referral for boys than for girls to a treatment program for attention deficit disorder.

A) Evidence from relationship to other variables
B) Evidence from consequences of testing
C) Evidence from response processes
D) Evidence from test content

Accepted Answer

The answer of Match each example of evidence about a...

Question 10

Test validity is&#10;A) not a unitary concept.&#10;B) an intrinsic property of the test.&#10;C) an intrinsic property of the test scores.&#10;D) associated with inferences that are made from test scores.

Accepted Answer

The answer of Test validity is&#10;A) not a unitary concept.&#10;B)...

Question 11

Recent research on the effects of standards-based instruction has shown that test data collected in school settings generally show&#10;A) high evidence of content-related validity.&#10;B) low evidence of content-related validity.&#10;C) high alignment between curriculum content, teachers' instruction, and assessment by standardized tests.&#10;D) teachers' ability to judge evidence of a test's content-related validity by brief inspection of the items.

Accepted Answer

The answer of Recent research on the effects of standards-based...

Question 12

If a test is used to make decisions that are found to be harmful to certain groups, this represents evidence of the test's&#10;A) lack of validity.&#10;B) misuse.&#10;C) low objectivity.&#10;D) lack of reliability.

Accepted Answer

The answer of If a test is used to make...

Question 13

Match each example of test reliability below with the type of reliability coefficient that is used to calculate it.

-The test is administered and then readministered after a time delay, and scores from the first administration are correlated with those from the second administration.

A) Coefficient of equivalence
B) Coefficient alpha
C) Coefficient of internal consistency
D) Coefficient of stability

Accepted Answer

The answer of Match each example of test reliability below...

Question 14

Match each example of test reliability below with the type of reliability coefficient that is used to calculate it.

-After the test is administered, it is split into two halves and the scores on one half are correlated with those on the other half.

A) Coefficient of equivalence
B) Coefficient alpha
C) Coefficient of internal consistency
D) Coefficient of stability

Accepted Answer

The answer of Match each example of test reliability below...

Question 15

Match each example of test reliability below with the type of reliability coefficient that is used to calculate it.

-The test is administered and then readministered after a time delay, and scores from the first administration are correlated with those from the second administration.

A) Coefficient of equivalence
B) Coefficient alpha
C) Coefficient of internal consistency
D) Coefficient of stability

Accepted Answer

The answer of Match each example of test reliability below...

Question 16

Match each example of test reliability below with the type of reliability coefficient that is used to calculate it.

-Two parallel forms of the test are administered, and scores on one form are correlated with those on the other form.

A) Coefficient of equivalence
B) Coefficient alpha
C) Coefficient of internal consistency
D) Coefficient of stability

Accepted Answer

The answer of Match each example of test reliability below...

Question 17

A test is reliable to the extent that it&#10;A) yields scores that are interpreted the same way by different raters.&#10;B) measures a known construct.&#10;C) contains nonrandom measurement error.&#10;D) minimizes measurement error.

Accepted Answer

The answer of A test is reliable to the extent...

Question 18

The various sources of measurement error in a test can be estimated by&#10;A) classical test theory.&#10;B) generalizability theory.&#10;C) the standard error of measurement.&#10;D) coefficient alpha

Accepted Answer

The answer of The various sources of measurement error in...

Question 19

Calculation of the standard error of measurement allows researchers to determine&#10;A) the combined measurement error due to all sources of error that were investigated.&#10;B) the probable range within which individuals' true scores fall.&#10;C) the likelihood that that test was too easy or too difficult for most testees.&#10;D) the reliability of each test item.

Accepted Answer

The answer of Calculation of the standard error of measurement...

Question 20

Item response theory allows researchers to develop tests having all the following advantages over traditional tests except that&#10;A) the test can be customized to students at different ability levels.&#10;B) it is possible to construct many parallel tests of equivalent difficulty.&#10;C) measurement error can be reduced by administering items geared to each individual's ability level.&#10;D) each individual can take a test containing just a few items at an appropriate difficulty level.

Accepted Answer

The answer of Item response theory allows researchers to develop...

Question 21

When the researcher has the choice of using a standardized test or an achievement test constructed by the teachers who are being studied, the researcher&#10;A) should use the teacher-constructed test, because such tests generally are freer of measurement error.&#10;B) should use the teacher-constructed test, because such tests generally have better concurrent validity.&#10;C) should use the standardized test, because such tests generally are better written.&#10;D) can use either test, because both types of test have similar strengths and weaknesses.

Accepted Answer

The answer of When the researcher has the choice of...

Question 22

The interpretation of an individual's test score by comparing it to the scores earned by other individuals is referred to as&#10;A) norm-referenced measurement.&#10;B) individual-referenced measurement.&#10;C) domain-referenced measurement.&#10;D) objectives-referenced measurement.

Accepted Answer

The answer of The interpretation of an individual's test score...

Question 23

A prespecified standard of performance is most important in&#10;A) norm-referenced measurement.&#10;B) individual-referenced measurement.&#10;C) criterion-referenced measurement.&#10;D) objectives-referenced measurement.

Accepted Answer

The answer of A prespecified standard of performance is most...

Question 24

In computer-adaptive testing, the test-taker&#10;A) responds only to items that are at a moderate level of difficulty.&#10;B) is allowed as much time as he wishes to respond to each item.&#10;C) responds to items that are matched to his ability level.&#10;D) is allowed to choose the items to which he will respond.

Accepted Answer

The answer of In computer-adaptive testing, the test-taker&#10;A) responds only...

Question 25

The process by which items are placed on a scale in order of relative difficulty in computer-adaptive testing is based on&#10;A) item response theory.&#10;B) generalizability theory.&#10;C) classical test theory.&#10;D) grounded theory.

Accepted Answer

The answer of The process by which items are placed...

Question 26

It is usually better to administer an individual test than a group test when&#10;A) it is necessary to ensure standard conditions of administration.&#10;B) a high degree of objectivity in scoring is desired.&#10;C) it is necessary to obtain data on all research participants within a short period of time.&#10;D) the researcher is interested in the process by which the overall score is obtained.

Accepted Answer

The answer of It is usually better to administer an...

Question 27

Match each of the following types of test with the appropriate definition.

-Achievement test batteries

A) Estimate general intellectual level by sampling performance on various tasks.
B) Aim at predicting performance on future tasks.
C) Provide scores indicating individuals' strengths and weaknesses in a given area of the curriculum.
D) Measure knowledge or mastery of a variety of content areas.

Accepted Answer

The answer of Match each of the following types of...

Question 28

Match each of the following types of test with the appropriate definition.

-Achievement test batteries

A) Estimate general intellectual level by sampling performance on various tasks.
B) Aim at predicting performance on future tasks.
C) Provide scores indicating individuals' strengths and weaknesses in a given area of the curriculum.
D) Measure knowledge or mastery of a variety of content areas.

Accepted Answer

The answer of Match each of the following types of...

Question 29

Match each of the following types of test with the appropriate definition.

-Achievement test batteries

A) Estimate general intellectual level by sampling performance on various tasks.
B) Aim at predicting performance on future tasks.
C) Provide scores indicating individuals' strengths and weaknesses in a given area of the curriculum.
D) Measure knowledge or mastery of a variety of content areas.

Accepted Answer

The answer of Match each of the following types of...

Question 30

Match each of the following types of test with the appropriate definition.

-Achievement test batteries

A) Estimate general intellectual level by sampling performance on various tasks.
B) Aim at predicting performance on future tasks.
C) Provide scores indicating individuals' strengths and weaknesses in a given area of the curriculum.
D) Measure knowledge or mastery of a variety of content areas.

Accepted Answer

The answer of Match each of the following types of...

Question 31

Which of the following is not an appropriate criterion for judging the validity of inferences drawn from a performance assessment?&#10;A) Whether all students had an equal opportunity to acquire the expertise measured by the test&#10;B) The extent to which the test scores correlate with the test-takers' scores on a well-established standardized test&#10;C) The extent to which the test's content represents the content domain covered during instruction&#10;D) The cost of administering the test

Accepted Answer

The answer of Which of the following is not an...

Question 32

The hermeneutic approach to test reliability&#10;A) requires test scorers to reach a consensus about the score or rating that each test-taker will receive.&#10;B) requires administration of parallel forms of a test.&#10;C) involves identifying experts with a consistent perspective to score the tests.&#10;D) involves the calculation of a standard error of measurement.

Accepted Answer

The answer of The hermeneutic approach to test reliability&#10;A) requires...

Question 33

To determine an individual's unique structuring of reality, the researcher is advised to administer a&#10;A) variety of attitude scales.&#10;B) personality inventory.&#10;C) combination of attitude scales and a personality inventory.&#10;D) projective test.

Accepted Answer

The answer of To determine an individual's unique structuring of...

Question 34

The Mental Measurement Yearbooks are particularly helpful for&#10;A) determining the latest trends in assessment methodology.&#10;B) identifying procedures that can be used to establish a test's validity and reliability.&#10;C) obtaining a list of tests that are available for measuring a particular variable.&#10;D) obtaining a statistical summary of the test scores of students with different demographic characteristics.

Accepted Answer

The answer of The Mental Measurement Yearbooks are particularly helpful...

Question 35

The best source of information about tests that are not commercially available is&#10;A) Test Reviews Online.&#10;B) Test Collection.&#10;C) Tests in Microfiche.&#10;D) Standards for Educational and Psychological Testing.

Accepted Answer

The answer of The best source of information about tests...

Question 36

Contacting the test developer directly is particularly useful when you wish to obtain&#10;A) tables of norms for the test.&#10;B) recent information about the test that has not yet been published.&#10;C) reliability and validity information about the test.&#10;D) information about how to administer and score the test.

Accepted Answer

The answer of Contacting the test developer directly is particularly...

Question 37

In test development, the purpose of item analysis is to determine&#10;A) the difficulty level of each test item.&#10;B) the reliability of each test item.&#10;C) the validity of each test item.&#10;D) all of the above.

Accepted Answer

The answer of In test development, the purpose of item...

Question 38

Put the following steps of test development in the order that they usually occur.
a. Developing a prototype
b. Defining the constructs to be measured
c. Reviewing related tests that have been developed
d. Collecting data on the test's validity and reliability

Accepted Answer

The answer of Put the following steps of test development...

Question 39

Item analysis is a set of procedures used to&#10;A) develop items for a parallel form of a standardized test.&#10;B) revise the wording of test items to eliminate bias toward certain groups.&#10;C) assess the validity, reliability, and difficulty of each item on a test.&#10;D) all of the above.

Accepted Answer

The answer of Item analysis is a set of procedures...

Question 40

Which of the following actions by the researcher is least likely to be helpful if protestors question the administration of a certain test in a research study?&#10;A) Defending the test by reviewing with the protestors the merits of each item on the test&#10;B) Demonstrating how the test as a whole is valid and reliable&#10;C) Demonstrating that the test follows the guidelines in the Standards for Educational and Psychological Testing&#10;D) Offering to withdraw individuals from the study if they do not wish to take the test

Accepted Answer

The answer of Which of the following actions by the...

Question 41

Match each testing procedure below with the outcome for which it is intended.

-Have the teacher tell the students (the research participants) that the test is important and that they should try to do their best.

A) You want to ensure that the research participants depict themselves in a typical, honest manner.
B) You want to obtain the research participants' maximal performance.
C) You want to enhance the research participants' cooperation.

Accepted Answer

The answer of Match each testing procedure below with the...

Question 42

Match each testing procedure below with the outcome for which it is intended.

-Have the teacher tell the students (the research participants) that the test is important and that they should try to do their best.

A) You want to ensure that the research participants depict themselves in a typical, honest manner.
B) You want to obtain the research participants' maximal performance.
C) You want to enhance the research participants' cooperation.

Accepted Answer

The answer of Match each testing procedure below with the...

Question 43

Match each testing procedure below with the outcome for which it is intended.

-Emphasize the official nature of the testing session.

A) You want to ensure that the research participants depict themselves in a typical, honest manner.
B) You want to obtain the research participants' maximal performance.
C) You want to enhance the research participants' cooperation.

Accepted Answer

The answer of Match each testing procedure below with the...

Question 44

Name four types of performance tests that are commonly used in educational research.

Accepted Answer

The answer of Name four types of performance tests that...

Question 45

Name five types of personality measures that are commonly used in educational research.

Accepted Answer

The answer of Name five types of personality measures that...

Question 46

Describe three criteria for judging the quality of a test.

Accepted Answer

The answer of Describe three criteria for judging the quality...

Question 47

A researcher uses a theory of academic self-esteem to identify five behavior patterns that are indicative of students' level of academic self-esteem, such as the frequency of positive statements students make about themselves. The researcher next observes 50 students for one month and records the frequency of these behavior patterns for each student. Finally, the researcher develops a paper-and-pencil test of academic self esteem and correlates the students' scores with their level of academic self-esteem based on the observations. What type of evidence of test validity has the researcher established?

Accepted Answer

The answer of A researcher uses a theory of academic...

Question 48

A researcher has developed a new test designed to measure musical aptitude. She selects a random sample of 200 college freshmen and administers the test to them. One month later she again administers the test to the same students and computes a correlation coefficient between the two sets of scores. What type of reliability does this coefficient indicate?

Accepted Answer

The answer of A researcher has developed a new test...

Question 49

An investigator administers a reading achievement test that has a mean of 220, a standard deviation of 20, and a standard error of measurement of 5. Mary obtains a score of 235. Using the standard error of measurement (sm), what estimate can we make of her true score?

Accepted Answer

The answer of An investigator administers a reading achievement test...

Question 50

What is one advantage of using item response theory to develop an achievement test?

Accepted Answer

The answer of What is one advantage of using item...

Question 51

Describe four problems to which tests developed within the classical test theory framework are susceptible.

Accepted Answer

The answer of Describe four problems to which tests developed...

Question 52

Describe three assumptions on which the item-response-theory approach to test construction is based.

Accepted Answer

The answer of Describe three assumptions on which the item-response-theory...

Question 53

The superintendent tells a researcher who is conducting a study in her district that he should use achievement tests constructed by the teachers to measure student learning rather than a standardized achievement test. How can the researcher defend the decision to use a standardized test?

Accepted Answer

The answer of The superintendent tells a researcher who is...

Question 54

Name three factors that can distort the scores on a standardized test.

Accepted Answer

The answer of Name three factors that can distort the...

Question 55

Describe three limitations of standardized achievement tests.

Accepted Answer

The answer of Describe three limitations of standardized achievement tests....

Question 56

A student earns a score of 85 on a 100-item achievement test. How would this score be interpreted if the test was criterion-referenced?

Accepted Answer

The answer of A student earns a score of 85...

Question 57

State two advantages of administering a test in computer format.

Accepted Answer

The answer of State two advantages of administering a test...

Question 58

Describe one situation in which an individually administered test would be preferable to a group- administered test.

Accepted Answer

The answer of Describe one situation in which an individually...

Question 59

Name one drawback of administering a standardized test individually as opposed to administering it to an entire group at once.

Accepted Answer

The answer of Name one drawback of administering a standardized...

Question 60

Give one reason why the test ceiling is an important factor to consider in selecting an achievement test to use in a research study.

Accepted Answer

The answer of Give one reason why the test ceiling...

Question 61

Some school districts require candidates for a teaching position to teach an actual lesson, which is rated for quality. The district also may require candidates to submit a collection of curriculum materials and lesson plans that they have prepared in their previous teaching positions.
a. What is this approach to measurement called?
b. What is the technical name for the collection of completed work?

Accepted Answer

The answer of Some school districts require candidates for a...

Question 62

Describe four criteria recommended for judging the validity of performance assessments.

Accepted Answer

The answer of Describe four criteria recommended for judging the...

Question 63

Describe at least one advantage and one disadvantage of self-report personality measures.

Accepted Answer

The answer of Describe at least one advantage and one...

Question 64

How do the Mental Measurement Yearbooks contribute to the use of tests in research?

Accepted Answer

The answer of How do the Mental Measurement Yearbooks contribute...

Deck 7: Collecting Research Data With Tests and Self-Report Measures