A criterion-referenced test is a style of test which uses

test Test(s), testing, or TEST may refer to: * Test (assessment), an educational assessment intended to measure the respondents' knowledge or other abilities Arts and entertainment * ''Test'' (2013 film), an American film * ''Test'' (2014 film), ...

scores to generate a statement about the behavior that can be expected of a person with that score. Most tests and quizzes that are written by school teachers can be considered criterion-referenced tests. In this case, the objective is simply to see whether the student has learned the material. Criterion-referenced assessment can be contrasted with

norm-referenced assessment A norm-referenced test (NRT) is a type of test, assessment, or evaluation which yields an estimate of the position of the tested individual in a predefined population, with respect to the trait being measured. Assigning scores on such tests may be ...

and ipsative assessment. Criterion-referenced testing was a major focus of

psychometric Psychometrics is a field of study within psychology concerned with the theory and technique of measurement. Psychometrics generally refers to specialized fields within psychology and education devoted to testing, measurement, assessment, and ...

research in the 1970s.

Definition of ''criterion''

A common misunderstanding regarding the term is the meaning of ''criterion''. Many, if not most, criterion-referenced tests involve a cutscore, where the examinee passes if their score exceeds the cutscore and fails if it does not (often called a mastery test). The ''criterion'' is not the cutscore; the criterion is the domain of subject matter that the test is designed to assess. For example, the criterion may be "Students should be able to correctly add two single-digit numbers," and the cutscore may be that students should correctly answer a minimum of 80% of the questions to pass. The criterion-referenced interpretation of a test score identifies the relationship to the subject matter. In the case of a mastery test, this does mean identifying whether the examinee has "mastered" a specified level of the subject matter by comparing their score to the cutscore. However, not all criterion-referenced tests have a cutscore, and the score can simply refer to a person's standing on the subject domain. The ACT is an example of this; there is no cutscore, it simply is an assessment of the student's knowledge of high-school level subject matter. Because of this common misunderstanding, criterion-referenced tests have also been called standards-based assessments by some education agencies, as students are assessed with regards to standards that define what they "should" know, as defined by the state.

Comparison of criterion-referenced, domain-referenced and norm-referenced tests

Both terms ''criterion-referenced'' and ''norm-referenced'' were originally coined by

Robert Glaser Robert Glaser (January 18, 1921 – February 4, 2012) was an American educational psychologist, who has made significant contributions to theories of learning and instruction. The key areas of his research focused on the nature of aptitudes and in ...

. Unlike a criterion-reference test, a norm-referenced test indicates whether the test-taker did better or worse than other people who took the test. For example, if the criterion is "Students should be able to correctly add two single-digit numbers," then reasonable test questions might look like "

2+3 = ?

" or "

9+5 = ?

" A criterion-referenced test would report the student's performance strictly according to whether the individual student correctly answered these questions. A norm-referenced test would report primarily whether this student correctly answered more questions compared to other students in the group. Even when testing similar topics, a test which is designed to accurately assess mastery may use different questions than one which is intended to show relative

ranking A ranking is a relationship between a set of items such that, for any two items, the first is either "ranked higher than", "ranked lower than" or "ranked equal to" the second. In mathematics, this is known as a weak order or total preorder of ...

. This is because some questions are better at reflecting actual achievement of students, and some test questions are better at differentiating between the best students and the worst students. (Many questions will do both.) A criterion-referenced test will use questions which were correctly answered by students who know the specific material. A norm-referenced test will use questions which were correctly answered by the "best" students and not correctly answered by the "worst" students (e.g. Cambridge University's pre-entry 'S' paper). Some tests can provide useful information about both actual achievement and relative ranking. The ACT provides both a ranking, and indication of what level is considered necessary to likely success in college. Some argue that the term "criterion-referenced test" is a misnomer, since it can refer to the ''interpretation'' of the score as well as the test itself. In the previous example, the same score on the ACT can be interpreted in a norm-referenced or criterion-referenced manner. ''Domain-referenced'' test is similar to criterion-referenced test, it is an assessment that covers a specific area of study such that a score will reveal how much of this area has been mastered. Thus, if an individual got 90% of the items correct in a ''domain-referenced'' or ''criterion-referenced'' test, this would be a high score indicative of his or her deep knowledge and understanding of the content covered in the test. These kinds of tests are contrasted with ''norm-referenced'' tests, in which scores indicate how well a test taker performed on the items relative to others who took the test.

Relationship to high-stakes testing

Many high-profile criterion-referenced tests are also

high-stakes test A high-stakes test is a test with important consequences for the test taker. Passing has important benefits, such as a high school diploma, a scholarship, or a license to practice a profession. Failing has important disadvantages, such as being ...

s, where the results of the test have important implications for the individual examinee. Examples of this include

high school graduation examination An exit examination is a test that students must pass to receive a diploma and graduate from school. Such examinations have been used in a variety of countries; this article focuses on their use within the United States. These are usually criteri ...

s and licensure testing where the test must be passed to work in a profession, such as to become a physician or attorney. However, being a high-stakes test is not specifically a feature of a criterion-referenced test. It is instead a feature of how an educational or government agency chooses to use the results of the test. It is moreover an individual type of test.

Examples

Driving test A driving test (also known as a driving exam, driver's test, or road test) is a procedure designed to test a person's ability to drive a motor vehicle. It exists in various forms worldwide, and is often a requirement to obtain a driver's lic ...

s are criterion-referenced tests, because their goal is to see whether the test taker is skilled enough to be granted a driver's license, not to see whether one test taker is more skilled than another test taker. * Citizenship tests are usually criterion-referenced tests, because their goal is to see whether the test taker is sufficiently familiar with the new country's history and government, not to see whether one test taker is more knowledgeable than another test taker.

References

{{reflist Educational psychology Psychometrics Standardized tests Education reform

Definition of ''criterion''

Comparison of criterion-referenced, domain-referenced and norm-referenced tests

Relationship to high-stakes testing

Examples

See also

References