Level of measurement or scale of measure is a classification that describes the nature of information within the values assigned to
variables.
Psychologist
Stanley Smith Stevens developed the best-known classification with four levels, or scales, of measurement:
nominal
Nominal may refer to:
Linguistics and grammar
* Nominal (linguistics), one of the parts of speech
* Nominal, the adjectival form of "noun", as in "nominal agreement" (= "noun agreement")
* Nominal sentence, a sentence without a finite verb
* Nou ...
,
ordinal,
interval, and
ratio
In mathematics, a ratio shows how many times one number contains another. For example, if there are eight oranges and six lemons in a bowl of fruit, then the ratio of oranges to lemons is eight to six (that is, 8:6, which is equivalent to the ...
.
This framework of distinguishing levels of measurement originated in psychology and is widely criticized by scholars in other disciplines. Other classifications include those by Mosteller and
Tukey
John Wilder Tukey (; June 16, 1915 – July 26, 2000) was an American mathematician and statistician, best known for the development of the Cooley–Tukey FFT algorithm, fast Fourier Transform (FFT) algorithm and box plot. The Tukey's range test ...
,
and by Chrisman.
Stevens's typology
Overview
Stevens proposed his typology in a 1946 ''
Science
Science is a systematic endeavor that builds and organizes knowledge in the form of testable explanations and predictions about the universe.
Science may be as old as the human species, and some of the earliest archeological evidence for ...
'' article titled "On the theory of scales of measurement".
In that article, Stevens claimed that all
measurement
Measurement is the quantification of attributes of an object or event, which can be used to compare with other objects or events.
In other words, measurement is a process of determining how large or small a physical quantity is as compared ...
in science was conducted using four different types of scales that he called "nominal", "ordinal", "interval", and "ratio", unifying both "
qualitative" (which are described by his "nominal" type) and "
quantitative
Quantitative may refer to:
* Quantitative research, scientific investigation of quantitative properties
* Quantitative analysis (disambiguation)
* Quantitative verse, a metrical system in poetry
* Statistics, also known as quantitative analysis ...
" (to a different degree, all the rest of his scales). The concept of scale types later received the mathematical rigour that it lacked at its inception with the work of mathematical psychologists Theodore Alper (1985, 1987), Louis Narens (1981a, b), and
R. Duncan Luce (1986, 1987, 2001). As Luce (1997, p. 395) wrote:
Comparison
Nominal level
The nominal type differentiates between items or subjects based only on their names or (meta-)categories and other qualitative classifications they belong to; thus
dichotomous
A dichotomy is a partition of a whole (or a set) into two parts (subsets). In other words, this couple of parts must be
* jointly exhaustive: everything must belong to one part or the other, and
* mutually exclusive: nothing can belong simult ...
data involves the
construction of classifications as well as the classification of items. Discovery of an exception to a classification can be viewed as progress. Numbers may be used to represent the variables but the numbers do not have numerical value or relationship: for example, a
globally unique identifier.
Examples of these classifications include gender, nationality, ethnicity, language, genre, style, biological species, and form. In a university one could also use hall of affiliation as an example. Other concrete examples are
* in
grammar
In linguistics, the grammar of a natural language is its set of structure, structural constraints on speakers' or writers' composition of clause (linguistics), clauses, phrases, and words. The term can also refer to the study of such constraint ...
, the
parts of speech
In grammar, a part of speech or part-of-speech (abbreviated as POS or PoS, also known as word class or grammatical category) is a category of words (or, more generally, of lexical items) that have similar grammatical properties. Words that are ass ...
: noun, verb, preposition, article, pronoun, etc.
* in politics,
power projection: hard power, soft power, etc.
* in biology, the
taxonomic rank
In biological classification, taxonomic rank is the relative level of a group of organisms (a taxon) in an ancestral or hereditary hierarchy. A common system consists of species, genus, family (biology), family, order (biology), order, class (b ...
s below domains: Archaea, Bacteria, and Eukarya
* in
software engineering
Software engineering is a systematic engineering approach to software development.
A software engineer is a person who applies the principles of software engineering to design, develop, maintain, test, and evaluate computer software. The term '' ...
, type of
faults: specification faults, design faults, and code faults
Nominal scales were often called qualitative scales, and measurements made on qualitative scales were called qualitative data. However, the rise of qualitative research has made this usage confusing. If numbers are assigned as labels in nominal measurement, they have no specific numerical value or meaning. No form of arithmetic computation (+, −, ×, etc.) may be performed on nominal measures. The nominal level is the lowest measurement level used from a statistical point of view.
Mathematical operations
Equality
Equality may refer to:
Society
* Political equality, in which all members of a society are of equal standing
** Consociationalism, in which an ethnically, religiously, or linguistically divided state functions by cooperation of each group's elit ...
and other operations that can be defined in terms of equality, such as
inequality
Inequality may refer to:
Economics
* Attention inequality, unequal distribution of attention across users, groups of people, issues in etc. in attention economy
* Economic inequality, difference in economic well-being between population groups
* ...
and
set membership, are the only
non-trivial In mathematics, the adjective trivial is often used to refer to a claim or a case which can be readily obtained from context, or an object which possesses a simple structure (e.g., groups, topological spaces). The noun triviality usually refers to a ...
operation
Operation or Operations may refer to:
Arts, entertainment and media
* ''Operation'' (game), a battery-operated board game that challenges dexterity
* Operation (music), a term used in musical set theory
* ''Operations'' (magazine), Multi-Ma ...
s that generically apply to objects of the nominal type.
Central tendency
The
mode
Mode ( la, modus meaning "manner, tune, measure, due measure, rhythm, melody") may refer to:
Arts and entertainment
* '' MO''D''E (magazine)'', a defunct U.S. women's fashion magazine
* ''Mode'' magazine, a fictional fashion magazine which is ...
, i.e. the ''most common'' item, is allowed as the measure of
central tendency
In statistics, a central tendency (or measure of central tendency) is a central or typical value for a probability distribution.Weisberg H.F (1992) ''Central Tendency and Variability'', Sage University Paper Series on Quantitative Applications ...
for the nominal type. On the other hand, the
median
In statistics and probability theory, the median is the value separating the higher half from the lower half of a data sample, a population, or a probability distribution. For a data set, it may be thought of as "the middle" value. The basic fe ...
, i.e. the ''middle-ranked'' item, makes no sense for the nominal type of data since ranking is meaningless for the nominal type.
Ordinal scaling
The ordinal type allows for
rank order
A ranking is a relationship between a set of items such that, for any two items, the first is either "ranked higher than", "ranked lower than" or "ranked equal to" the second.
In mathematics, this is known as a weak order or total preorder of ...
(1st, 2nd, 3rd, etc.) by which data can be sorted but still does not allow for a relative ''degree of difference'' between them. Examples include, on one hand, dichotomous data with dichotomous (or dichotomized) values such as 'sick' vs. 'healthy' when measuring health, 'guilty' vs. 'not-guilty' when making judgments in courts, 'wrong/false' vs. 'right/true' when measuring
truth value
In logic and mathematics, a truth value, sometimes called a logical value, is a value indicating the relation of a proposition to truth, which in classical logic has only two possible values (''true'' or '' false'').
Computing
In some progr ...
, and, on the other hand, non-dichotomous data consisting of a spectrum of values, such as 'completely agree', 'mostly agree', 'mostly disagree', 'completely disagree' when measuring
opinion
An opinion is a judgment, viewpoint, or statement that is not conclusive, rather than facts, which are true statements.
Definition
A given opinion may deal with subjective matters in which there is no conclusive finding, or it may deal with f ...
.
The ordinal scale places events in order, but there is no attempt to make the intervals of the scale equal in terms of some rule. Rank orders represent ordinal scales and are frequently used in research relating to qualitative phenomena. A student's rank in his graduation class involves the use of an ordinal scale. One has to be very careful in making a statement about scores based on ordinal scales. For instance, if Devi's position in his class is 10 and Ganga's position is 40, it cannot be said that Devi's position is four times as good as that of Ganga. The statement would make no sense at all.
Ordinal scales only permit the ranking of items from highest to lowest. Ordinal measures have no absolute values, and the real differences between adjacent ranks may not be equal. All that can be said is that one person is higher or lower on the scale than another, but more precise comparisons cannot be made. Thus, the use of an ordinal scale implies a statement of 'greater than' or 'less than' (an equality statement is also acceptable) without our being able to state how much greater or less. The real difference between ranks 1 and 2, for instance, may be more or less than the difference between ranks 5 and 6. Since the numbers of this scale have only a rank meaning, the appropriate measure of central tendency is the median. A percentile or quartile measure is used for measuring dispersion. Correlations are restricted to various rank order methods. Measures of statistical significance are restricted to the non-parametric methods (R. M. Kothari, 2004).
Central tendency
The
median
In statistics and probability theory, the median is the value separating the higher half from the lower half of a data sample, a population, or a probability distribution. For a data set, it may be thought of as "the middle" value. The basic fe ...
, i.e. ''middle-ranked'', item is allowed as the measure of
central tendency
In statistics, a central tendency (or measure of central tendency) is a central or typical value for a probability distribution.Weisberg H.F (1992) ''Central Tendency and Variability'', Sage University Paper Series on Quantitative Applications ...
; however, the mean (or average) as the measure of
central tendency
In statistics, a central tendency (or measure of central tendency) is a central or typical value for a probability distribution.Weisberg H.F (1992) ''Central Tendency and Variability'', Sage University Paper Series on Quantitative Applications ...
is not allowed. The
mode
Mode ( la, modus meaning "manner, tune, measure, due measure, rhythm, melody") may refer to:
Arts and entertainment
* '' MO''D''E (magazine)'', a defunct U.S. women's fashion magazine
* ''Mode'' magazine, a fictional fashion magazine which is ...
is allowed.
In 1946, Stevens observed that psychological measurement, such as measurement of opinions, usually operates on ordinal scales; thus means and standard deviations have no
validity
Validity or Valid may refer to:
Science/mathematics/statistics:
* Validity (logic), a property of a logical argument
* Scientific:
** Internal validity, the validity of causal inferences within scientific studies, usually based on experiments
** ...
, but they can be used to get ideas for how to improve
operationalization
In research design, especially in psychology, social sciences, life sciences and physics, operationalization or operationalisation is a process of defining the measurement of a phenomenon which is not directly measurable, though its existence is in ...
of variables used in
questionnaire
A questionnaire is a research instrument that consists of a set of questions (or other types of prompts) for the purpose of gathering information from respondents through survey or statistical study. A research questionnaire is typically a mix of ...
s. Most
psychological data collected by
psychometric
Psychometrics is a field of study within psychology concerned with the theory and technique of measurement. Psychometrics generally refers to specialized fields within psychology and education devoted to testing, measurement, assessment, and ...
instruments and tests, measuring
cognitive
Cognition refers to "the mental action or process of acquiring knowledge and understanding through thought, experience, and the senses". It encompasses all aspects of intellectual functions and processes such as: perception, attention, thought, ...
and other abilities, are ordinal, although some theoreticians have argued they can be treated as interval or ratio scales. However, there is little
prima facie evidence to suggest that such attributes are anything more than ordinal (Cliff, 1996; Cliff & Keats, 2003; Michell, 2008). In particular, IQ scores reflect an ordinal scale, in which all scores are meaningful for comparison only. There is no absolute zero, and a 10-point difference may carry different meanings at different points of the scale.
Interval scale
The interval type allows for the ''degree of difference'' between items, but not the ratio between them. Examples include ''
temperature scale
Scale of temperature is a methodology of calibrating the physical quantity temperature in metrology. Empirical scales measure temperature in relation to convenient and stable parameters, such as the freezing and boiling point of water. Absolute ...
s'' with the
Celsius scale
The degree Celsius is the unit of temperature on the Celsius scale (originally known as the centigrade scale outside Sweden), one of two temperature scales used in the International System of Units (SI), the other being the Kelvin scale. The d ...
, which has two defined points (the freezing and boiling point of water at specific conditions) and then separated into 100 intervals, ''date'' when measured from an arbitrary epoch (such as AD), ''location'' in Cartesian coordinates, and ''direction'' measured in degrees from true or magnetic north. Ratios are not meaningful since 20 °C cannot be said to be "twice as hot" as 10 °C (unlike temperature in
Kelvin
The kelvin, symbol K, is the primary unit of temperature in the International System of Units (SI), used alongside its prefixed forms and the degree Celsius. It is named after the Belfast-born and University of Glasgow-based engineer and phys ...
s), nor can multiplication/division be carried out between any two dates directly. However, ''ratios of differences'' can be expressed; for example, one difference can be twice another. Interval type variables are sometimes also called "scaled variables", but the formal mathematical term is an
affine space
In mathematics, an affine space is a geometric structure that generalizes some of the properties of Euclidean spaces in such a way that these are independent of the concepts of distance and measure of angles, keeping only the properties relate ...
(in this case an
affine line
In mathematics, an affine space is a geometric structure that generalizes some of the properties of Euclidean spaces in such a way that these are independent of the concepts of distance and measure of angles, keeping only the properties relat ...
).
Central tendency and statistical dispersion
The
mode
Mode ( la, modus meaning "manner, tune, measure, due measure, rhythm, melody") may refer to:
Arts and entertainment
* '' MO''D''E (magazine)'', a defunct U.S. women's fashion magazine
* ''Mode'' magazine, a fictional fashion magazine which is ...
,
median
In statistics and probability theory, the median is the value separating the higher half from the lower half of a data sample, a population, or a probability distribution. For a data set, it may be thought of as "the middle" value. The basic fe ...
, and
arithmetic mean
In mathematics and statistics, the arithmetic mean ( ) or arithmetic average, or just the ''mean'' or the ''average'' (when the context is clear), is the sum of a collection of numbers divided by the count of numbers in the collection. The colle ...
are allowed to measure central tendency of interval variables, while measures of statistical dispersion include
range
Range may refer to:
Geography
* Range (geographic), a chain of hills or mountains; a somewhat linear, complex mountainous or hilly area (cordillera, sierra)
** Mountain range, a group of mountains bordered by lowlands
* Range, a term used to i ...
and
standard deviation. Since one can only divide by ''differences'', one cannot define measures that require some ratios, such as the
coefficient of variation. More subtly, while one can define
moments about the
origin
Origin(s) or The Origin may refer to:
Arts, entertainment, and media
Comics and manga
* ''Origin'' (comics), a Wolverine comic book mini-series published by Marvel Comics in 2002
* ''The Origin'' (Buffy comic), a 1999 ''Buffy the Vampire Sl ...
, only central moments are meaningful, since the choice of origin is arbitrary. One can define
standardized moment
In probability theory and statistics, a standardized moment of a probability distribution is a moment (often a higher degree central moment) that is normalized, typically by a power of the standard deviation, rendering the moment scale invariant ...
s, since ratios of differences are meaningful, but one cannot define the coefficient of variation, since the mean is a moment about the origin, unlike the standard deviation, which is (the square root of) a central moment.
Ratio scale
:''See also'':
The ratio type takes its name from the fact that measurement is the estimation of the ratio between a magnitude of a continuous quantity and a
unit of measurement
A unit of measurement is a definite magnitude (mathematics), magnitude of a quantity, defined and adopted by convention or by law, that is used as a standard for measurement of the same kind of quantity. Any other quantity of that kind can ...
of the same kind (Michell, 1997, 1999). Most measurement in the physical sciences and engineering is done on ratio scales. Examples include
mass
Mass is an intrinsic property of a body. It was traditionally believed to be related to the quantity of matter in a physical body, until the discovery of the atom and particle physics. It was found that different atoms and different elementar ...
,
length
Length is a measure of distance. In the International System of Quantities, length is a quantity with dimension distance. In most systems of measurement a base unit for length is chosen, from which all other units are derived. In the Interna ...
,
duration
Duration may refer to:
* The amount of time elapsed between two events
* Duration (music) – an amount of time or a particular time interval, often cited as one of the fundamental aspects of music
* Duration (philosophy) – a theory of time and ...
,
plane angle
In Euclidean geometry, an angle is the figure formed by two rays, called the '' sides'' of the angle, sharing a common endpoint, called the ''vertex'' of the angle.
Angles formed by two rays lie in the plane that contains the rays. Angles a ...
,
energy
In physics, energy (from Ancient Greek: ἐνέργεια, ''enérgeia'', “activity”) is the quantitative property that is transferred to a body or to a physical system, recognizable in the performance of work and in the form of heat a ...
and
electric charge
Electric charge is the physical property of matter that causes charged matter to experience a force when placed in an electromagnetic field. Electric charge can be ''positive'' or ''negative'' (commonly carried by protons and electrons respe ...
. In contrast to interval scales, ratios can be compared using
division
Division or divider may refer to:
Mathematics
*Division (mathematics), the inverse of multiplication
*Division algorithm, a method for computing the result of mathematical division
Military
*Division (military), a formation typically consisting ...
. Very informally, many ratio scales can be described as specifying "how much" of something (i.e. an amount or magnitude). Ratio scale is often used to express an
order of magnitude
An order of magnitude is an approximation of the logarithm of a value relative to some contextually understood reference value, usually 10, interpreted as the base of the logarithm and the representative of values of magnitude one. Logarithmic dis ...
such as for temperature in
Orders of magnitude (temperature)
List of orders of magnitude for temperature
Detailed list for 100 K to 1000 K
Most ordinary human activity takes place at temperatures of this order of magnitude. Circumstances where water naturally occurs in liquid form are shown in light gr ...
.
Central tendency and statistical dispersion
The
geometric mean and the
harmonic mean are allowed to measure the central tendency, in addition to the mode, median, and arithmetic mean. The
studentized range and the
coefficient of variation are allowed to measure statistical dispersion. All statistical measures are allowed because all necessary mathematical operations are defined for the ratio scale.
Debate on Stevens's typology
While Stevens's typology is widely adopted, it is still being challenged by other theoreticians, particularly in the cases of the nominal and ordinal types (Michell, 1986).
Some however have argued that the degree of discord can be overstated. Hand says, "Basic psychology texts often begin with Stevens's framework and the ideas are ubiquitous. Indeed, the essential soundness of his hierarchy has been established for representational measurement by mathematicians, determining the invariance properties of mappings from empirical systems to real number continua. Certainly the ideas have been revised, extended, and elaborated, but the remarkable thing is his insight given the relatively limited formal apparatus available to him and how many decades have passed since he coined them."
Duncan (1986) objected to the use of the word ''measurement'' in relation to the nominal type, but Stevens (1975) said of his own definition of measurement that "the assignment can be any consistent rule. The only rule not allowed would be random assignment, for randomness amounts in effect to a nonrule".
The use of the mean as a measure of the central tendency for the ordinal type is still debatable among those who accept Stevens's typology. Many behavioural scientists use the mean for ordinal data, anyway. This is often justified on the basis that the ordinal type in behavioural science is in fact somewhere between the true ordinal and interval types; although the interval difference between two ordinal ranks is not constant, it is often of the same order of magnitude.
For example, applications of measurement models in educational contexts often indicate that total scores have a fairly linear relationship with measurements across the range of an assessment. Thus, some argue that so long as the unknown interval difference between ordinal scale ranks is not too variable, interval scale statistics such as means can meaningfully be used on ordinal scale variables. Statistical analysis software such as
SPSS
SPSS Statistics is a statistical software suite developed by IBM for data management, advanced analytics, multivariate analysis, business intelligence, and criminal investigation. Long produced by SPSS Inc., it was acquired by IBM in 2009. C ...
requires the user to select the appropriate measurement class for each variable. This ensures that subsequent user errors cannot inadvertently perform meaningless analyses (for example correlation analysis with a variable on a nominal level).
L. L. Thurstone made progress toward developing a justification for obtaining the interval type, based on the
law of comparative judgment The law of comparative judgment was conceived by L. L. Thurstone. In modern-day terminology, it is more aptly described as a model that is used to obtain measurements from any process of pairwise comparison. Examples of such processes are the compa ...
. A common application of the law is the
analytic hierarchy process
In the theory of decision making, the analytic hierarchy process (AHP), also analytical hierarchy process, is a structured technique for organizing and analyzing complex decisions, based on mathematics and psychology. It was developed by Thomas ...
. Further progress was made by
Georg Rasch
Georg William Rasch () (21 September 1901 – 19 October 1980) was a Danish mathematician, statistician, and psychometrician, most famous for the development of a class of measurement models known as Rasch models. He studied with R.A. Fisher and a ...
(1960), who developed the probabilistic
Rasch model
The Rasch model, named after Georg Rasch, is a psychometric model for analyzing categorical data, such as answers to questions on a reading assessment or questionnaire responses, as a function of the trade-off between the respondent's abilities, ...
that provides a theoretical basis and justification for obtaining interval-level measurements from counts of observations such as total scores on assessments.
Other proposed typologies
Typologies aside from Stevens's typology have been proposed. For instance,
Mosteller and
Tukey
John Wilder Tukey (; June 16, 1915 – July 26, 2000) was an American mathematician and statistician, best known for the development of the Cooley–Tukey FFT algorithm, fast Fourier Transform (FFT) algorithm and box plot. The Tukey's range test ...
(1977), Nelder (1990) described continuous counts, continuous ratios, count ratios, and categorical modes of data. See also Chrisman (1998), van den Berg (1991).
Mosteller and Tukey's typology (1977)
Mosteller and Tukey
noted that the four levels are not exhaustive and proposed:
# Names
# Grades (ordered labels like beginner, intermediate, advanced)
# Ranks (orders with 1 being the smallest or largest, 2 the next smallest or largest, and so on)
# Counted fractions (bound by 0 and 1)
# Counts (non-negative integers)
# Amounts (non-negative real numbers)
# Balances (any real number)
For example, percentages (a variation on fractions in the Mosteller–Tukey framework) do not fit well into Stevens's framework: No transformation is fully admissible.
Chrisman's typology (1998)
Nicholas R. Chrisman
introduced an expanded list of levels of measurement to account for various measurements that do not necessarily fit with the traditional notions of levels of measurement. Measurements bound to a range and repeating (like degrees in a circle, clock time, etc.), graded membership categories, and other types of measurement do not fit to Stevens's original work, leading to the introduction of six new levels of measurement, for a total of ten:
# Nominal
# Gradation of membership
# Ordinal
# Interval
# Log-interval
# Extensive ratio
# Cyclical ratio
# Derived ratio
# Counts
# Absolute
While some claim that the extended levels of measurement are rarely used outside of academic geography,
graded membership is central to
fuzzy set theory
In mathematics, fuzzy sets (a.k.a. uncertain sets) are sets whose elements have degrees of membership. Fuzzy sets were introduced independently by Lotfi A. Zadeh in 1965 as an extension of the classical notion of set.
At the same time, defined ...
, while absolute measurements include probabilities and the plausibility and ignorance in
Dempster–Shafer theory
The theory of belief functions, also referred to as evidence theory or Dempster–Shafer theory (DST), is a general framework for reasoning with uncertainty, with understood connections to other frameworks such as probability, possibility and i ...
. Cyclical ratio measurements include angles and times. Counts appear to be ratio measurements, but the scale is not arbitrary and fractional counts are commonly meaningless. Log-interval measurements are commonly displayed in stock market graphics. All these types of measurements are commonly used outside academic geography, and do not fit well to Stevens' original work.
Scale types and Stevens's "operational theory of measurement"
The theory of scale types is the intellectual handmaiden to Stevens's "operational theory of measurement", which was to become definitive within psychology and the
behavioral sciences, despite Michell's characterization as its being quite at odds with measurement in the natural sciences (Michell, 1999). Essentially, the operational theory of measurement was a reaction to the conclusions of a committee established in 1932 by the
British Association for the Advancement of Science to investigate the possibility of genuine scientific measurement in the psychological and behavioral sciences. This committee, which became known as the ''Ferguson committee'', published a Final Report (Ferguson, et al., 1940, p. 245) in which Stevens's
sone
The sone () is a unit of loudness, the subjective perception of sound pressure. The study of perceived loudness is included in the topic of psychoacoustics and employs methods of psychophysics. Doubling the perceived loudness doubles the son ...
scale (Stevens & Davis, 1938) was an object of criticism:
That is, if Stevens's ''
sone
The sone () is a unit of loudness, the subjective perception of sound pressure. The study of perceived loudness is included in the topic of psychoacoustics and employs methods of psychophysics. Doubling the perceived loudness doubles the son ...
'' scale genuinely measured the intensity of auditory sensations, then evidence for such sensations as being quantitative attributes needed to be produced. The evidence needed was the presence of ''additive structure'' – a concept comprehensively treated by the German mathematician
Otto Hölder
Ludwig Otto Hölder (December 22, 1859 – August 29, 1937) was a German mathematician born in Stuttgart.
Early life and education
Hölder was the youngest of three sons of professor Otto Hölder (1811–1890), and a grandson of professor Chris ...
(Hölder, 1901). Given that the physicist and measurement theorist
Norman Robert Campbell
Norman Robert Campbell (1880–1949) was an English physicist and philosopher of science.
Early life
Norman Robert Campbell was born in 1880. He was the son of William Middleton Campbell, Governor of the Bank of England, and his wife Edith Ag ...
dominated the Ferguson committee's deliberations, the committee concluded that measurement in the social sciences was impossible due to the lack of
concatenation
In formal language theory and computer programming, string concatenation is the operation of joining character strings end-to-end. For example, the concatenation of "snow" and "ball" is "snowball". In certain formalisations of concatenat ...
operations. This conclusion was later rendered false by the discovery of the
theory of conjoint measurement
The theory of conjoint measurement (also known as conjoint measurement or additive conjoint measurement) is a general, formal theory of continuous quantity. It was independently discovered by the French economist Gérard Debreu (1960) and by the Am ...
by Debreu (1960) and independently by Luce & Tukey (1964). However, Stevens's reaction was not to conduct experiments to test for the presence of additive structure in sensations, but instead to render the conclusions of the Ferguson committee null and void by proposing a new theory of measurement:
Stevens was greatly influenced by the ideas of another Harvard academic, the
Nobel laureate physicist
Percy Bridgman
Percy Williams Bridgman (April 21, 1882 – August 20, 1961) was an American physicist who received the 1946 Nobel Prize in Physics for his work on the physics of high pressures. He also wrote extensively on the scientific method and on other as ...
(1927), whose doctrine of ''operationism'' Stevens used to define measurement. In Stevens's definition, for example, it is the use of a tape measure that defines length (the object of measurement) as being measurable (and so by implication quantitative). Critics of operationism object that it confuses the relations between two objects or events for properties of one of those of objects or events (Hardcastle, 1995; Michell, 1999; Moyer, 1981a,b; Rogers, 1989).
The Canadian measurement theorist
William Rozeboom (1966) was an early and trenchant critic of Stevens's theory of scale types.
Same variable may be different scale type depending on context
Another issue is that the same variable may be a different scale type depending on how it is measured and on the goals of the analysis. For example, hair color is usually thought of as a nominal variable, since it has no apparent ordering.
However, it is possible to order colors (including hair colors) in various ways, including by hue; this is known as
colorimetry
Colorimetry is "the science and technology used to quantify and describe physically the human color perception".
It is similar to spectrophotometry, but is distinguished by its interest in reducing spectra to the physical correlates of color ...
. Hue is an interval level variable.
See also
*
Cohen's kappa
Cohen's kappa coefficient (''κ'', lowercase Greek kappa) is a statistic that is used to measure inter-rater reliability (and also intra-rater reliability) for qualitative (categorical) items. It is generally thought to be a more robust measure th ...
*
Coherence (units of measurement)
A coherent system of units is a system of units of measurement used to express physical quantities that are defined in such a way that the equations relating the numerical values expressed in the units of the system have exactly the same form, inc ...
*
Hume's principle
Hume's principle or HP says that the number of ''F''s is equal to the number of ''G''s if and only if there is a one-to-one correspondence (a bijection) between the ''F''s and the ''G''s. HP can be stated formally in systems of second-order logic. ...
*
Inter-rater reliability
In statistics, inter-rater reliability (also called by various similar names, such as inter-rater agreement, inter-rater concordance, inter-observer reliability, inter-coder reliability, and so on) is the degree of agreement among independent obse ...
*
Logarithmic scale
A logarithmic scale (or log scale) is a way of displaying numerical data over a very wide range of values in a compact way—typically the largest numbers in the data are hundreds or even thousands of times larger than the smallest numbers. Such a ...
*
Ramsey–Lewis method
The Ramsey–Lewis method is a method for defining terms found in theory, theoretical frameworks (such as in scientific theory, scientific theories), credited to Frank P. Ramsey and David Lewis (philosopher), David Lewis. By using this method, a se ...
*
Set theory
Set theory is the branch of mathematical logic that studies sets, which can be informally described as collections of objects. Although objects of any kind can be collected into a set, set theory, as a branch of mathematics, is mostly conce ...
*
Statistical data type
In statistics, groups of individual data points may be classified as belonging to any of various statistical data types, e.g. categorical ("red", "blue", "green"), real number (1.68, -5, 1.7e+6), odd number (1,3,5) etc. The data type is a fundam ...
*
Transition (linguistics)
A transition or linking word is a word or phrase that shows the relationship between paragraphs or sections of a text or speech. Transitions provide greater cohesion by making it more explicit or signaling how ideas relate to one another. Transiti ...
References
Further reading
*
*
* Briand, L. & El Emam, K. & Morasca, S. (1995). On the Application of Measurement Theory in Software Engineering. ''Empirical Software Engineering'', ''1'', 61–88.
n line
N, or n, is the fourteenth Letter (alphabet), letter in the Latin alphabet, used in the English alphabet, modern English alphabet, the alphabets of other western European languages and others worldwide. Its name in English is English alphabet# ...
https://web.archive.org/web/20070926232755/http://www2.umassd.edu/swpi/ISERN/isern-95-04.pdf
*
Cliff, N. (1996). ''Ordinal Methods for Behavioral Data Analysis''. Mahwah, NJ: Lawrence Erlbaum.
*
Cliff, N. & Keats, J. A. (2003). ''Ordinal Measurement in the Behavioral Sciences''. Mahwah, NJ: Erlbaum.
*
:See also reprints in:
:* ''Readings in Statistics'', Ch. 3, (Haber, A., Runyon, R. P., and Badia, P.) Reading, Mass: Addison–Wesley, 1970
:*
*
* Lord, F. M., & Novick, M. R. (1968). ''Statistical theories of mental test scores''. Reading, MA: Addison–Wesley.
*
*
*
* Luce, R. D. (2000). ''Utility of uncertain gains and losses: measurement theoretic and experimental approaches''. Mahwah, N.J.: Lawrence Erlbaum.
*
*
*
*
* Michell, J. (1999). ''Measurement in Psychology – A critical history of a methodological concept''. Cambridge: Cambridge University Press.
*
*
*
* Rasch, G. (1960). ''Probabilistic models for some intelligence and attainment tests''. Copenhagen: Danish Institute for Educational Research.
*
*
* Stevens, S. S. (1951). Mathematics, measurement and psychophysics. In S. S. Stevens (Ed.), ''Handbook of experimental psychology '' (pp. 1–49). New York: Wiley.
* Stevens, S. S. (1975). ''Psychophysics''. New York: Wiley.
*
{{wikiversity
Scientific method
Statistical data types
Measurement
Cognitive science