Psychological testing is the administration of psychological tests.
Psychological tests are administered by trained evaluators.
A person's responses are evaluated according to carefully prescribed guidelines. Scores are thought to reflect individual or group differences in the construct the test purports to measure.
The science behind psychological testing is psychometrics.
Psychological tests
According to Anastasi and Urbina, psychological tests involve observations made on a "carefully chosen ''sample'' [emphasis authors] of an individual's behavior."
A psychological test is often designed to measure unobserved constructs, also known as latent variables. Psychological tests can include a series of tasks or problems that the respondent has to solve. They can also include questionnaires and interviews, which are likewise designed to measure unobserved constructs. Questionnaire- and interview-based scales typically differ from psychoeducational tests, which ask for a respondent's maximum performance. Questionnaire- and interview-based scales, by contrast, ask for the respondent's typical behavior. Symptom and attitude tests are more often called scales. A useful psychological test or scale must be both valid (i.e., there is evidence to support the idea that the test or scale measures what it is purported to measure and "how well it does so") and reliable (i.e., it is internally consistent or gives consistent results over time, across raters, etc.).
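Internal consistency is often summarized with a single coefficient. The sketch below computes Cronbach's alpha, one common choice that the text above does not itself name; the function name and the data layout (one list of item scores per respondent) are assumptions made for illustration.

```python
from statistics import pvariance


def cronbach_alpha(item_scores):
    """Cronbach's alpha for a respondents-by-items table of scores.

    item_scores is a list of respondents, each a list of k item scores
    (hypothetical layout chosen for this sketch).
    """
    k = len(item_scores[0])
    # Transpose so that each tuple holds every respondent's score on one item.
    columns = list(zip(*item_scores))
    sum_item_variances = sum(pvariance(col) for col in columns)
    total_variance = pvariance([sum(row) for row in item_scores])
    return (k / (k - 1)) * (1 - sum_item_variances / total_variance)


# Three respondents who answer all three items identically within themselves.
perfectly_consistent = [[1, 1, 1], [2, 2, 2], [3, 3, 3]]
alpha = cronbach_alpha(perfectly_consistent)
```

Values near 1 indicate that the items rise and fall together across respondents. The function assumes at least two items and some variation in total scores, since it would otherwise divide by zero.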
It is important that people who are equal on the measured construct (e.g., mathematics ability, depression) have an approximately equal probability of answering a test item correctly or acknowledging the presence of a symptom. An example of an item on a mathematics test that might be used in the United Kingdom but not the United States could be the following: "In a football match two players get a red card; how many players are left on the pitch?" This item requires knowledge of football (soccer) to be answered correctly, not just mathematical ability. Thus, group membership can influence the chance of correctly answering items, as encapsulated in the concept of differential item functioning. Tests are often constructed for a specific population, and the nature of that population should be taken into account when administering tests outside that population. If a test is invariant within one population (e.g., schoolchildren in the United Kingdom), it does not automatically follow that the test functions in much the same way in another population (e.g., schoolchildren in the United States).
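One way to check an item such as the football question is to compare two groups only after matching respondents on overall ability. The sketch below uses the Mantel-Haenszel common odds ratio, a standard screening statistic for differential item functioning that the text above does not itself name; the function name, the tuple layout, and the counts are hypothetical.

```python
def mantel_haenszel_odds_ratio(strata):
    """Common odds ratio for one item across ability-matched strata.

    Each stratum is a tuple of counts (hypothetical layout):
    (reference_correct, reference_incorrect, focal_correct, focal_incorrect),
    where respondents in a stratum have similar total test scores.
    """
    numerator = 0.0
    denominator = 0.0
    for ref_right, ref_wrong, focal_right, focal_wrong in strata:
        n = ref_right + ref_wrong + focal_right + focal_wrong
        if n == 0:
            continue  # an empty stratum carries no information
        numerator += ref_right * focal_wrong / n
        denominator += ref_wrong * focal_right / n
    return numerator / denominator


# Made-up counts: within each ability band, the reference group answers
# the item correctly more often than the focal group.
biased_item = [(30, 10, 20, 20), (30, 10, 20, 20)]
fair_item = [(20, 20, 20, 20)]
biased_ratio = mantel_haenszel_odds_ratio(biased_item)
fair_ratio = mantel_haenszel_odds_ratio(fair_item)
```

A ratio near 1 suggests that matched respondents from both groups have similar odds of answering correctly. Ratios far from 1 flag the item for review. The function assumes that at least one stratum contains both a reference-group error and a focal-group success, since the denominator would otherwise be zero.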
Psychological assessment is similar to psychological testing but usually involves a more comprehensive assessment of the individual. Psychological assessment is a process that involves integrating information from multiple sources, such as tests of normal and abnormal personality, tests of ability or intelligence, tests of interests or attitudes, as well as information from personal interviews. Collateral information is also collected about personal, occupational, or medical history, such as from records or from interviews with parents, spouses, teachers, or previous therapists or physicians. A ''psychological test'' is one of the sources of data used within the process of assessment; usually more than one test is used. Many psychologists do some level of assessment when providing services to clients or patients, and may use, for example, simple checklists to assess some traits or symptoms, but psychological assessment is a more complex, detailed, in-depth process. Typical purposes of psychological assessment are to provide a diagnosis; to assess a particular area of functioning or disability, often for school settings; to help select a type of treatment or to assess treatment outcomes; to help courts decide forensic psychological issues such as child custody or competency to stand trial; or to help assess job applicants or employees and provide career development counseling or training.
History
The first large-scale tests may have been examinations that were part of the imperial examination system in China. The test, an early form of psychological testing, assessed candidates based on their proficiency in topics such as civil law and fiscal policies.
Other early tests of intelligence were made for entertainment rather than analysis.
Modern mental testing began in France in the 19th century. It contributed to separating mental retardation from mental illness and reducing the neglect, torture, and ridicule heaped on both groups.
Englishman Francis Galton coined the terms psychometrics and eugenics and developed a method for measuring intelligence based on nonverbal sensory-motor tests. It was initially popular but was abandoned after the discovery that it had no relationship to outcomes such as college grades.
French psychologist Alfred Binet, together with psychologists Victor Henri and Théodore Simon, after about 15 years of development, published the Binet-Simon test in 1905, which focused on verbal abilities. It was intended to identify mental retardation in school children.
The origins of personality testing date back to the 18th and 19th centuries, when personality was assessed through phrenology, the measurement of the human skull, and physiognomy, which assessed personality based on a person's outer appearances.
These early pseudoscientific techniques were eventually replaced with more empirical methods in the 20th century. One of the earliest modern personality tests was the Woodworth Personal Data Sheet, a self-report inventory developed for World War I and used for the psychiatric screening of new draftees.
Principles
Proper psychological testing is conducted after rigorous research and development, in contrast to quick web-based or magazine questionnaires that say "Find out your Personality Color" or "What's your Inner Age?" Proper psychological testing consists of the following:
:* ''Standardization'' - All procedures and steps must be conducted with consistency and under the same environment to achieve the same testing performance from those being tested.
:* ''Objectivity'' - Scoring such that subjective judgments and biases are minimized, with results for each test taker obtained in the same way.
:* ''Test Norms'' - The average test score within a large group of people where the performance of one individual can be compared to the results of others by establishing a point of comparison or frame of reference.
:* ''Reliability'' - Obtaining consistent results across repeated testing.
:* ''Validity'' - The type of test being administered must measure what it is intended to measure.
Sample of behavior
The term ''sample of behavior'' refers to an individual's performance on tasks that have usually been prescribed beforehand. The samples of behavior that make up a paper-and-pencil test, the most common type of psychological test, are a series of test items. Performance on these items produces a test score. A score on a well-constructed test is believed to reflect a psychological construct such as achievement in a school subject like mathematics, cognitive ability, aptitude, emotional functioning, personality, etc. Differences in test scores are thought to reflect individual differences in the construct the test is purported to measure.
Types
There are several broad categories of psychological tests:
Achievement tests
Achievement tests are tests that assess an individual's knowledge in a subject domain. Academic achievement tests are designed to be administered by a trained evaluator to an individual or a group of people. During achievement tests, a series of test items are presented to the person being evaluated. A score on a test is believed to reflect achievement in a school subject.
Many achievement tests are norm-referenced. The person's responses are scored according to standardized protocols and the results can be compared to the responses of a norming group after the test is completed.
Some achievement tests are criterion-referenced, the purpose of which is to find out whether the test-taker has mastered a predetermined body of knowledge rather than to compare the test-taker to everyone else who is taking the test.
The Kaufman Test of Educational Achievement is an example of an individually administered achievement test for students.
Aptitude tests
Psychological tests have been designed to measure specific abilities, such as clerical, perceptual, numerical, or spatial aptitude. Sometimes these tests must be specially designed for a particular job, but there are also tests available that measure general clerical and mechanical aptitudes, or even general learning ability. An example of an occupational aptitude test is the Minnesota Clerical Test, which measures the perceptual speed and accuracy required to perform various clerical duties. A widely used aptitude test in business is the Wonderlic Test. There are aptitudes that are believed to be related to specific occupations and are used for career guidance as well as selection and recruitment.
Evidence suggests that aptitude tests like IQ tests are sensitive to past learning and cannot avoid measuring past achievement, although they were once thought to measure untutored ability. The SAT, which used to be called the Scholastic Aptitude Test, had its name changed because performance on the test is sensitive to training.
Attitude scales
An attitude scale assesses an individual's disposition regarding an event (e.g., a Supreme Court decision), person (e.g., a governor), concept (e.g., wearing face masks during a pandemic), organization (e.g., the Boy Scouts), or object (e.g., nuclear weapons) on a unidimensional favorable-unfavorable attitude continuum. Attitude scales are used in marketing to determine individuals' preferences for brands. Historically, social psychologists have developed attitude scales to assess individuals' attitudes toward the United Nations and race relations. Typically, Likert scales are used in attitude research. The Thurstone scale was used before the Likert scale was developed, and the Likert scale has since largely supplanted it.
Biographical Information Blank
The Biographical Information Blank, or BIB, is a paper-and-pencil form that includes items that ask about detailed personal and work history. It is used to aid in the hiring of employees by matching the backgrounds of individuals to requirements of the job.
Clinical tests
The purpose of clinical tests is to assess the presence of symptoms of psychopathology.
Examples of clinical assessments include the Minnesota Multiphasic Personality Inventory, Millon Clinical Multiaxial Inventory-IV, Child Behavior Checklist, Symptom Checklist 90, and the Beck Depression Inventory.
Clinical tests like the MMPI are also norm-referenced: on a symptom subscale such as the Depression scale, 50 is the middlemost score and 60 is a score that places the individual one standard deviation above the mean for that scale.
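The 50 and 60 anchors described above match a linear T-score, which rescales a raw score so that the norming group has a mean of 50 and a standard deviation of 10. The sketch below shows that general conversion only; it is not the published scoring procedure of any particular instrument, and the function name and norm values are hypothetical.

```python
def t_score(raw_score, norm_mean, norm_sd):
    """Convert a raw score to a linear T-score (mean 50, SD 10).

    norm_mean and norm_sd describe the norming group; the values used
    below are invented for illustration.
    """
    z = (raw_score - norm_mean) / norm_sd  # distance from the mean in SD units
    return 50 + 10 * z


# Hypothetical norming group: mean raw score 20, standard deviation 5.
at_the_mean = t_score(20, norm_mean=20, norm_sd=5)
one_sd_above = t_score(25, norm_mean=20, norm_sd=5)
```

With these made-up norms, a raw score equal to the group mean maps to 50 and a raw score one standard deviation higher maps to 60.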
Criterion-referenced
A criterion-referenced test is an achievement test in a specific knowledge domain.
An individual's performance on the test is compared to a criterion. Test-takers are not compared to each other. A passing score, i.e., the criterion performance, is established by the teacher or an educational institution. Criterion-referenced tests are part and parcel of mastery-based education.
Direct observation
Psychological assessment can involve the observation of people as they complete activities. This type of assessment is usually conducted with families in a laboratory or at home. Sometimes the observation can involve children in a classroom or the schoolyard. The purpose may be clinical, such as to establish a pre-intervention baseline of a child's hyperactive or aggressive classroom behaviors or to observe the nature of parent-child interaction in order to understand a relational disorder.
Time sampling methods are also part of direct observational research. The reliability of observers in direct observational research can be evaluated using Cohen's kappa.
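Cohen's kappa compares the agreement two observers actually reach with the agreement expected by chance, given how often each observer uses each category. The sketch below assumes two observers who coded the same time-sampled intervals with categorical labels; the function name, labels, and data are hypothetical.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters who coded the same observations."""
    n = len(rater_a)
    # Proportion of observations on which the two raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance from each rater's category frequencies.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)
    return (observed - expected) / (1 - expected)


# Made-up codes for four observation intervals.
observer_1 = ["on_task", "off_task", "on_task", "off_task"]
observer_2 = ["on_task", "off_task", "on_task", "off_task"]
observer_3 = ["on_task", "on_task", "off_task", "off_task"]
full_agreement = cohens_kappa(observer_1, observer_2)
chance_agreement = cohens_kappa(observer_3, observer_1)
```

A kappa of 1 indicates perfect agreement and a kappa of 0 indicates agreement no better than chance. The function is undefined when chance agreement is itself 1, for example when both observers use a single category throughout.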
The Parent-Child Interaction Assessment-II (PCIA) is an example of a direct observation procedure that is used with school-age children and parents. The parents and children are video recorded playing at a make-believe zoo. The Parent-Child Early Relational Assessment is used to study parents and young children and involves a feeding and a puzzle task. The MacArthur Story Stem Battery (MSSB) is used to elicit narratives from children. The Dyadic Parent-Child Interaction Coding System-II tracks the extent to which children follow the commands of parents and ''vice versa'' and is well suited to the study of children with oppositional defiant disorder and their parents.
Interest inventories
Psychological tests include interest inventories. These tests are used primarily for career counseling. Interest inventories include items that ask about the preferred activities and interests of people seeking career counseling. The rationale is that if the individual's activities and interests are similar to the modal pattern for people who are successful in a given occupation, then the chances are high that the individual would find satisfaction in that occupation. A widely used interest test is the Strong Interest Inventory, which is used in career assessment, career counseling, and educational guidance.
Neuropsychological tests
Neuropsychological tests are designed to be an objective and standardized measure of a sample of behavior.
Norm-referenced tests
Items on norm-referenced tests have been tried out on a norming group, and scores on the test can be classified as high, medium, or low, with gradations in between.
These tests allow for the study of individual differences. Scores on norm-referenced achievement tests are associated with percentile ranks vis-à-vis other individuals who are the test-taker's age or grade.
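A percentile rank states what percentage of the norming group scored below a given score. The sketch below uses one common convention, counting half of the tied scores as below; test publishers may use other conventions, and the function name and norm data are hypothetical.

```python
def percentile_rank(score, norm_scores):
    """Percentile rank of a score within a norming group.

    Counts every norm score below the given score plus half of the ties
    (the mid-rank convention, assumed here for illustration).
    """
    below = sum(s < score for s in norm_scores)
    tied = sum(s == score for s in norm_scores)
    return 100 * (below + 0.5 * tied) / len(norm_scores)


# Invented scores for a small same-grade norming group.
grade_norms = [50, 60, 70, 80, 90]
rank_of_70 = percentile_rank(70, grade_norms)
```

With these invented norms, a score of 70 sits exactly in the middle of the group and receives a percentile rank of 50.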
Personality tests
Personality tests assess constructs that are thought to be the constituents of personality. Examples of personality constructs include traits in the Big Five, such as introversion-extroversion and conscientiousness. Personality constructs are thought to be dimensional. Personality measures are used in research and in the selection of employees. They include self-report and observer-report scales. Examples of norm-referenced personality tests include the NEO-PI, the 16PF, the OPQ, and the FFPI-C.
The IPIP scales assess the same personality traits that the NEO and the other scales assess, but IPIP scales and items are available free of charge.
Projective tests
Projective testing originated in the first half of the 1900s.
Examples of projective tests are story-telling, drawings, or sentence-completion tasks.
Public safety employment tests
Vocations within the public safety field (i.e., fire service, law enforcement, corrections, emergency medical services) often require Industrial and Organizational Psychology tests for initial employment and advancement throughout the ranks. The National Firefighter Selection Inventory (NFSI), the National Criminal Justice Officer Selection Inventory (NCJOSI), and the Integrity Inventory are prominent examples of these tests.
Sources of psychological tests
Thousands of psychological tests have been developed. Some were produced by commercial testing companies that charge for their use. Others have been developed by researchers, and can be found in the academic research literature. Tests to assess specific psychological constructs can be found by conducting a database search. Some databases are open access, such as Google Scholar. Others are proprietary and are available through library access at universities, such as APA PsycInfo.
There are online archives available that contain tests on various topics.
:*APA PsycTests. Requires subscription.
:*Mental Measurements Yearbook, a non-profit that provides independent reviews of thousands of distinct psychological tests.
:*Assessment Psychology Online has links to dozens of tests for clinical assessment.
:*International Personality Item Pool (IPIP) contains items to assess more than 100 personality traits, including those of the Five Factor Model.
:*Organization of Work: Measurement Tools for Research and Practice. NIOSH site devoted to Occupational Health and Safety
:*Industrial and Organizational Psychology Assessments
:*Mental Health Assessment Archive
Test security
Many psychological and psychoeducational tests are not available to the public. Test publishers put restrictions on who has access to the test. Psychology licensing boards also restrict access to the tests used in licensing psychologists. Test publishers hold that both copyright and professional ethics require them to protect the tests. Publishers sell tests only to people who have proved their educational and professional qualifications. Purchasers are legally bound not to give test answers or the tests themselves to members of the public unless permitted by the publisher.
The International Test Commission (ITC), an international association of national psychological societies and test publishers, publishes the ''International Guidelines for Test Use'', which prescribes measures to take to "protect the integrity" of the tests by not publicly describing test techniques and by not "coaching individuals" so that they "might unfairly influence their test performance." (International Test Commission, 2000, ''International Guidelines for Test Use'')
External links
* http://www.psychtesting.org.uk/ British Psychological Society Psychological Testing Centre
* Guidelines of the International Test Commission
* International Personality Item Pool, an alternative and free source of items available for research on personality
* Mental Measurements Yearbook