Categorization is the ability and activity of recognizing shared features or similarities between the elements of the experience of the world (such as objects, events, or ideas), organizing and classifying experience by associating them to a more abstract group (that is, a category, class, or type), on the basis of their traits, features, similarities or other criteria that are universal to the group. Categorization is considered one of the most fundamental cognitive abilities, and as such it is studied particularly by psychology and cognitive linguistics.

Categorization is sometimes considered synonymous with classification (cf., Classification synonyms). Categorization and classification allow humans to organize things, objects, and ideas that exist around them and simplify their understanding of the world. Categorization is something that humans and other organisms ''do'': "doing the right thing with the right ''kind'' of thing." The activity of categorizing things can be nonverbal or verbal. For humans, both concrete objects and abstract ideas are recognized, differentiated, and understood through categorization. Objects are usually categorized for some adaptive or pragmatic purposes. Categorization is grounded in the features that distinguish the category's members from nonmembers. Categorization is important in learning, prediction, inference, decision making, language, and many forms of organisms' interaction with their environments.

Overview of categorization

Categories are distinct collections of concrete or abstract instances (category members) that are considered equivalent by the cognitive system. Using category knowledge requires one to access mental representations that define the core features of category members (cognitive psychologists refer to these category-specific mental representations as concepts).

To categorization theorists, the categorization of objects is often considered using taxonomies with three hierarchical levels of abstraction. For example, a plant could be identified at a high level of abstraction by simply labeling it a flower, a medium level of abstraction by specifying that the flower is a rose, or a low level of abstraction by further specifying this particular rose as a dog rose. Categories in a taxonomy are related to one another via class inclusion, with the highest level of abstraction being the most inclusive and the lowest level of abstraction being the least inclusive. The three levels of abstraction are as follows:

* Superordinate level, Genus (e.g., Flower) - The highest and most inclusive level of abstraction. Exhibits the highest degree of generality and the lowest degree of within-category similarity.
* Basic level, Species (e.g., Rose) - The middle level of abstraction. Rosch and colleagues (1976) suggest the basic level to be the most cognitively efficient. Basic level categories exhibit high within-category ''similarities'' and high between-category ''dissimilarities''. Furthermore, the basic level is the most inclusive level at which category exemplars share a generalized identifiable shape. Adults most often use basic level object names, and children learn basic object names first.
* Subordinate level (e.g., Dog Rose) - The lowest level of abstraction. Exhibits the highest degree of specificity and within-category similarity.
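The class-inclusion relation between the three levels can be illustrated with a small sketch. The parent map and the function names `ancestors` and `includes` are hypothetical, chosen only for this illustration, and the toy taxonomy contains just the flower example from the text.

```python
# Hypothetical three-level taxonomy built from the flower example in the text.
PARENT = {
    "dog rose": "rose",    # subordinate -> basic
    "rose": "flower",      # basic -> superordinate
    "flower": None,        # superordinate: top of this toy taxonomy
}

def ancestors(category):
    """Return every more inclusive category above `category`, nearest first."""
    chain = []
    parent = PARENT[category]
    while parent is not None:
        chain.append(parent)
        parent = PARENT[parent]
    return chain

def includes(broader, narrower):
    """True if `broader` contains `narrower` by class inclusion."""
    return broader == narrower or broader in ancestors(narrower)

print(ancestors("dog rose"))           # the chain of more inclusive categories
print(includes("flower", "dog rose"))  # inclusion runs downward only
```

Inclusion is asymmetric in this sketch: every dog rose is a flower, but `includes("dog rose", "flower")` is false.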

Theories of categorization

Classical view

The classical theory of categorization is a term used in cognitive linguistics to denote the approach to categorization that appears in Plato and Aristotle and that has been highly influential and dominant in Western culture, particularly in philosophy, linguistics and psychology. Aristotle's categorical method of analysis was transmitted to the scholastic medieval university through Porphyry's Isagoge. The classical view of categories can be summarized into three assumptions: a category can be described as a list of necessary and sufficient features that its members must have; categories are discrete in that they have clearly defined boundaries (either an element belongs to one or not, with no possibilities in between); and all the members of a category have the same status (there are no members of the category which belong more than others). In the classical view, categories need to be clearly defined, mutually exclusive and collectively exhaustive; this way, any entity in the given classification universe belongs unequivocally to one, and only one, of the proposed categories.

The classical view of categories first appeared in the context of Western philosophy in the work of Plato, who, in his Statesman dialogue, introduces the approach of grouping objects based on their similar properties. This approach was further explored and systematized by Aristotle in his Categories treatise, where he analyzes the differences between classes and objects. Aristotle also applied the classical categorization scheme intensively in his approach to the classification of living beings (which uses the technique of applying successive narrowing questions such as "Is it an animal or vegetable?", "How many feet does it have?", "Does it have fur or feathers?", "Can it fly?"...), thereby establishing the basis for natural taxonomy.

Examples of the use of the classical view of categories can be found in the western philosophical works of Descartes, Blaise Pascal, Spinoza and John Locke, and in the 20th century in Bertrand Russell, G.E. Moore, and the logical positivists. It has been a cornerstone of analytic philosophy and its conceptual analysis, with more recent formulations proposed in the 1990s by Frank Cameron Jackson and Christopher Peacocke. At the beginning of the 20th century, the question of categories was introduced into the empirical social sciences by Durkheim and Mauss, whose pioneering work has been revisited in contemporary scholarship.

The classical model of categorization has been used at least since the 1960s by linguists of the structural semantics paradigm, notably by Jerrold Katz and Jerry Fodor in 1963, who in turn influenced its adoption by psychologists like Allan M. Collins and M. Ross Quillian. Modern versions of classical categorization theory study how the brain learns and represents categories by detecting the features that distinguish members from nonmembers.
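The three classical assumptions can be made concrete with a minimal sketch, assuming that entities and categories are both described by sets of features. The feature names, the `DEFINITIONS` table, and the functions `is_member` and `classify` are hypothetical and serve only as an illustration.

```python
# Hypothetical defining feature lists; the feature names are illustrative only.
DEFINITIONS = {
    "triangle": {"closed figure", "three straight sides"},
    "square": {"closed figure", "four equal straight sides", "four right angles"},
}

def is_member(entity_features, category):
    """Classical membership is all-or-none: the entity must have every
    defining feature (necessary), and having them all is enough (sufficient)."""
    return DEFINITIONS[category] <= set(entity_features)

def classify(entity_features):
    """Return the categories whose definitions the entity satisfies."""
    return [c for c in DEFINITIONS if is_member(entity_features, c)]

shape = {"closed figure", "three straight sides", "red"}
print(classify(shape))  # extra features such as "red" do not affect membership
```

Membership here is a plain boolean, so there is no notion of one triangle being a better triangle than another, which is the assumption that the similarity-based theories relax.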

Prototype theory

The pioneering research by psychologist Eleanor Rosch and colleagues since 1973 introduced prototype theory, according to which categorization can also be viewed as the process of grouping things based on prototypes. This approach has been highly influential, particularly for cognitive linguistics. It was in part based on previous insights, in particular the formulation of a category model based on family resemblance by Wittgenstein (1953), and on Roger Brown's ''How shall a thing be called?'' (1958). Prototype theory was subsequently adopted by cognitive linguists like George Lakoff.

Prototype theory is an example of a similarity-based approach to categorization, in which a stored category representation is used to assess the similarity of candidate category members. Under prototype theory, this stored representation consists of a summary representation of the category's members. This prototype stimulus can take various forms. It might be a central tendency that represents the category's average member, a modal stimulus representing either the most frequent instance or a stimulus composed of the most common category features, or, lastly, the "ideal" category member, or a caricature that emphasizes the distinct features of the category. An important consideration of this prototype representation is that it does not necessarily reflect the existence of an actual instance of the category in the world. Furthermore, prototypes are highly sensitive to context. For example, while one's prototype for the category of beverages may be soda or seltzer, the context of brunch might lead one to select a mimosa as a prototypical beverage. Prototype theory claims that members of a given category share a family resemblance, and categories are defined by sets of typical features (as opposed to all members possessing necessary and sufficient features).
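As a minimal sketch of the "central tendency" form of a prototype, the example below averages each feature dimension over a category's known members and assigns a new stimulus to the category with the nearest prototype. The use of numeric feature vectors, Euclidean distance, and the names `prototype` and `classify_by_prototype` are assumptions made for illustration, not part of the theory as stated above.

```python
import math

def prototype(members):
    """Average each feature dimension across a category's known members."""
    n = len(members)
    return tuple(sum(m[i] for m in members) / n for i in range(len(members[0])))

def classify_by_prototype(stimulus, categories):
    """Pick the category whose prototype is nearest (Euclidean distance)."""
    protos = {name: prototype(ms) for name, ms in categories.items()}
    return min(protos, key=lambda name: math.dist(stimulus, protos[name]))

# Hypothetical two-dimensional stimuli for two categories.
categories = {
    "A": [(1.0, 1.0), (2.0, 1.0), (1.0, 2.0)],
    "B": [(6.0, 5.0), (5.0, 6.0), (6.0, 6.0)],
}

print(prototype(categories["A"]))                    # not itself a member of A
print(classify_by_prototype((2.0, 2.0), categories))
```

The averaged prototype of category A does not coincide with any stored member, which mirrors the point that a prototype need not correspond to an actual instance.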

Exemplar theory

Another instance of the similarity-based approach to categorization, the exemplar theory likewise compares the similarity of candidate category members to stored memory representations. Under the exemplar theory, all known instances of a category are stored in memory as exemplars. When evaluating an unfamiliar entity's category membership, exemplars from potentially relevant categories are retrieved from memory, and the entity's similarity to those exemplars is summed to formulate a categorization decision. Medin and Schaffer's (1978) context model combines similarity multiplicatively rather than additively: an entity's similarity to a stored exemplar is the product of its similarities on the individual feature dimensions, so a mismatch on any dimension sharply lowers that exemplar's contribution. Much like a nearest neighbor approach, this effectively biases categorization decisions towards the exemplars most similar to the entity to be categorized.
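The following is a rough sketch of an exemplar computation in the spirit of the context model. It assumes binary features and a multiplicative similarity rule, in which each mismatching dimension multiplies similarity by a single parameter between 0 and 1. The parameter name `mismatch`, its value, and the stored exemplars are hypothetical, and the original model allows a separate parameter per dimension.

```python
def similarity(stimulus, exemplar, mismatch=0.3):
    """Multiply per-dimension similarities: 1 for a match, `mismatch` otherwise."""
    s = 1.0
    for a, b in zip(stimulus, exemplar):
        s *= 1.0 if a == b else mismatch
    return s

def category_evidence(stimulus, memory, mismatch=0.3):
    """Sum the stimulus's similarity to every stored exemplar of each category."""
    return {cat: sum(similarity(stimulus, e, mismatch) for e in exemplars)
            for cat, exemplars in memory.items()}

def choice_probabilities(stimulus, memory, mismatch=0.3):
    """Convert summed evidence into relative response probabilities."""
    evidence = category_evidence(stimulus, memory, mismatch)
    total = sum(evidence.values())
    return {cat: e / total for cat, e in evidence.items()}

# Hypothetical stored exemplars with four binary features each.
memory = {
    "A": [(1, 1, 1, 0), (1, 0, 1, 0)],
    "B": [(0, 0, 0, 1), (0, 1, 0, 0)],
}

print(choice_probabilities((1, 1, 1, 0), memory))
```

Because similarities multiply across dimensions, an exemplar that mismatches on several features contributes almost nothing, so the decision is dominated by the closest stored exemplars.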

Conceptual clustering

Conceptual clustering is a machine learning paradigm for unsupervised classification that was defined by Ryszard S. Michalski in 1980. It is a modern variation of the classical approach of categorization, and derives from attempts to explain how knowledge is represented. In this approach, classes (clusters or entities) are generated by first formulating their conceptual descriptions and then classifying the entities according to the descriptions. Conceptual clustering developed mainly during the 1980s. It is distinguished from ordinary data clustering by generating a concept description for each generated category.

Conceptual clustering is closely related to fuzzy set theory, in which objects may belong to one or more groups, in varying degrees of fitness. A cognitive approach accepts that natural categories are graded (they tend to be fuzzy at their boundaries) and inconsistent in the status of their constituent members. The idea of necessary and sufficient conditions is almost never met in categories of naturally occurring things.

Category learning

''While an exhaustive discussion of category learning is beyond the scope of this article, a brief overview of category learning and its associated theories is useful in understanding formal models of categorization.''

If categorization research investigates how categories are maintained and used, the field of category learning seeks to understand how categories are acquired in the first place. To accomplish this, researchers often employ novel categories of arbitrary objects (e.g., dot matrices) to ensure that participants are entirely unfamiliar with the stimuli. Category learning researchers have generally focused on two distinct forms of category learning. Classification learning tasks participants with predicting category labels for a stimulus based on its provided features. Classification learning is centered around learning between-category information and the diagnostic features of categories [Higgins, E., & Ross, B. (2011). Comparisons in category learning: How best to compare for what. In Proceedings of the Annual Meeting of the Cognitive Science Society (Vol. 33, No. 33)]. In contrast, inference learning tasks participants with inferring the presence or value of a category feature based on a provided category label and/or the presence of other category features. Inference learning is centered on learning within-category information and the category's prototypical features.

Category learning tasks can generally be divided into two categories: supervised and unsupervised learning.

Supervised learning tasks provide learners with category labels. Learners then use information extracted from labeled example categories to classify stimuli into the appropriate category, which may involve the abstraction of a rule or concept relating observed object features to category labels.

Unsupervised learning tasks do not provide learners with category labels. Learners must therefore recognize inherent structures in a data set and group stimuli together by similarity into classes. Unsupervised learning is thus a process of generating a classification structure.

Tasks used to study category learning take various forms:

* Rule-based tasks present categories that participants can learn through explicit reasoning processes. In these kinds of tasks, classification of stimuli is accomplished via the use of an acquired rule (e.g., if the stimulus is large on dimension x, respond A).
* Information-integration tasks require learners to synthesize perceptual information from multiple stimulus dimensions prior to making categorization decisions. Unlike rule-based tasks, information-integration tasks do not afford rules that are easily articulable. Reading an X-ray and trying to determine if a tumor is present can be thought of as a real-world instantiation of an information-integration task.
* Prototype distortion tasks require learners to generate a prototype for a category. Candidate exemplars for the category are then produced by randomly manipulating the features of the prototype, which learners must classify as either belonging to the category or not.
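To make two of these task types more tangible, here is a hedged sketch of how stimuli and responses might be represented. The criterion value, the uniform noise model, and the function names `rule_based_response` and `distort` are assumptions for illustration; actual experiments use a variety of stimulus spaces and distortion procedures.

```python
import random

def rule_based_response(stimulus, dimension=0, criterion=0.5):
    """Rule-based task: respond 'A' if the stimulus is large on one dimension."""
    return "A" if stimulus[dimension] > criterion else "B"

def distort(prototype, noise, rng):
    """Prototype distortion task: randomly jitter each feature of the prototype."""
    return tuple(x + rng.uniform(-noise, noise) for x in prototype)

rng = random.Random(0)            # seeded so the sketch is reproducible
prototype = (0.2, 0.8, 0.5)       # hypothetical category prototype
exemplars = [distort(prototype, 0.1, rng) for _ in range(5)]

print(rule_based_response((0.9, 0.1)))
print(len(exemplars), "distorted exemplars generated")
```

An information-integration task has no comparably short rule to write down, which is the sense in which its category structure is not easily articulable.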

Category learning theories

Category learning researchers have proposed various theories for how humans learn categories. Prevailing theories of category learning include the prototype theory, the exemplar theory, and the decision bound theory. The prototype theory suggests that to learn a category, one must learn the category's prototype. Subsequent categorization of novel stimuli is then accomplished by selecting the category with the most similar prototype. The exemplar theory suggests that to learn a category, one must learn about the exemplars that belong to that category. Subsequent categorization of a novel stimulus is then accomplished by computing its similarity to the known exemplars of potentially relevant categories and selecting the category that contains the most similar exemplars. Decision bound theory suggests that to learn a category, one must either learn the regions of a stimulus space associated with particular responses or the boundaries (the decision bounds) that divide these response regions. Categorization of a novel stimulus is then accomplished by determining which response region it is contained within.
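Prototype and exemplar computations are easy to picture, so the sketch below illustrates only the decision bound idea, under the simplifying assumption of a single linear bound in a two-dimensional stimulus space. The weights, the bias, and the function name `linear_bound_response` are hypothetical, and bounds in the literature need not be linear.

```python
def linear_bound_response(stimulus, weights, bias):
    """Respond 'A' on one side of the bound w.x + b = 0 and 'B' on the other."""
    h = sum(w * x for w, x in zip(weights, stimulus)) + bias
    return "A" if h > 0 else "B"

# Hypothetical bound x + y = 10 dividing the space into two response regions.
weights, bias = (1.0, 1.0), -10.0

print(linear_bound_response((7.0, 6.0), weights, bias))
print(linear_bound_response((2.0, 3.0), weights, bias))
```

Only the location of the stimulus relative to the bound matters here; no category members or prototypes are stored.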

Formal models of categorization

Computational models of categorization have been developed to test theories about how humans represent and use category information. To accomplish this, categorization models can be fit to experimental data to see how well the predictions afforded by the model line up with human performance. Based on the model's success at explaining the data, theorists are able to draw conclusions about the accuracy of their theories and their theory's relevance to human category representations.

To effectively capture how humans represent and use category information, categorization models generally operate under variations of the same three basic assumptions. First, the model must make some kind of assumption about the internal representation of the stimulus (e.g., representing the perception of a stimulus as a point in a multi-dimensional space). Second, the model must make an assumption about the specific information that needs to be accessed in order to formulate a response (e.g., exemplar models require the collection of all available exemplars for each category). Third, the model must make an assumption about how a response is selected given the available information.

Though all categorization models make these three assumptions, they distinguish themselves by the ways in which they represent and transform an input into a response representation. The internal knowledge structures of various categorization models reflect the specific representation(s) they use to perform these transformations. Typical representations employed by models include exemplars, prototypes, and rules.

* Exemplar models store all distinct instances of stimuli with their corresponding category labels in memory. Categorization of subsequent stimuli is determined by the stimulus' collective similarity to all known exemplars.
* Prototype models store a summary representation of all instances in a category. Categorization of subsequent stimuli is determined by selecting the category whose prototype is most similar to the stimulus.
* Rule-based models define categories by storing summary lists of the necessary and sufficient features required for category membership. Boundary models can be considered as atypical rule models, as they do not define categories based on their content. Rather, boundary models define the edges (boundaries) between categories, which subsequently serve as determinants for how a stimulus gets categorized.

Examples of categorization models

Prototype models

Weighted Features Prototype Model

An early instantiation of the prototype model was produced by Reed in the early 1970s. Reed (1972) conducted a series of experiments to compare the performance of 18 models on explaining data from a categorization task that required participants to sort faces into one of two categories. Results suggested that the prevailing model was the weighted features prototype model, which belonged to the family of average distance models. Unlike traditional average distance models, however, this model differentially weighted the most distinguishing features of the two categories. Given this model's performance, Reed (1972) concluded that the strategy participants used during the face categorization task was to construct prototype representations for each of the two categories of faces and categorize test patterns into the category associated with the most similar prototype. Furthermore, results suggested that similarity was determined by each category's most discriminating features.
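The effect of feature weighting can be shown with a small sketch. It is not Reed's (1972) formulation: the city-block distance, the prototype values, the weights, and the function names are all assumptions chosen to show how emphasizing a discriminating feature can change which prototype is nearest.

```python
def weighted_distance(stimulus, prototype, weights):
    """City-block distance with a separate weight on each feature dimension."""
    return sum(w * abs(s - p) for s, p, w in zip(stimulus, prototype, weights))

def classify_weighted(stimulus, prototypes, weights):
    """Assign the stimulus to the category with the nearest weighted prototype."""
    return min(prototypes,
               key=lambda c: weighted_distance(stimulus, prototypes[c], weights))

# Hypothetical prototypes: the categories differ mainly on the first feature.
prototypes = {"category 1": (2.0, 5.0), "category 2": (8.0, 5.5)}
stimulus = (4.9, 6.0)

print(classify_weighted(stimulus, prototypes, (1.0, 1.0)))  # equal weights
print(classify_weighted(stimulus, prototypes, (0.9, 0.1)))  # first feature emphasized
```

With equal weights the weakly discriminating second feature tips the decision, whereas weighting the first feature more heavily assigns the same stimulus to the other category.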

Exemplar models

Generalized Context Model

Medin and Schaffer's (1978) context model was expanded upon by Nosofsky (1986), resulting in the Generalized Context Model (GCM). The GCM is an exemplar model that stores exemplars of stimuli as exhaustive combinations of the features associated with each exemplar. By storing these combinations, the model establishes contexts for the features of each exemplar, which are defined by all other features with which that feature co-occurs. The GCM computes the similarity of an exemplar and a stimulus in two steps. First, the GCM computes the psychological distance between the exemplar and the stimulus. This is accomplished by summing the absolute values of the dimensional differences between the exemplar and the stimulus. For example, suppose an exemplar has a value of 18 on dimension X and the stimulus has a value of 42 on dimension X; the resulting dimensional difference would be 24. Second, once psychological distance has been evaluated, an exponential decay function determines the similarity of the exemplar and the stimulus, where a distance of 0 results in a similarity of 1 (which decreases exponentially as distance increases). Categorical responses are then generated by evaluating the similarity of the stimulus to each category's exemplars, where each exemplar provides a "vote" to its respective category that varies in strength based on the exemplar's similarity to the stimulus and the strength of the exemplar's association with the category. This effectively assigns each category a selection probability that is determined by the proportion of votes it receives, which can then be fit to data.
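The two-step similarity computation and the voting rule can be sketched as follows. This is a simplified illustration rather than a full GCM: the per-dimension `attention` weights and the sensitivity parameter `c` are named here as assumptions, every exemplar is treated as equally strongly associated with its category, and the stored exemplars are hypothetical.

```python
import math

def distance(stimulus, exemplar, attention):
    """Step 1: weighted sum of absolute dimensional differences."""
    return sum(w * abs(s - e) for s, e, w in zip(stimulus, exemplar, attention))

def similarity(stimulus, exemplar, attention, c=1.0):
    """Step 2: exponential decay, so a distance of 0 gives a similarity of 1."""
    return math.exp(-c * distance(stimulus, exemplar, attention))

def choice_probabilities(stimulus, exemplars, attention, c=1.0):
    """Each exemplar votes for its category in proportion to its similarity."""
    votes = {}
    for features, category in exemplars:
        votes[category] = votes.get(category, 0.0) + similarity(
            stimulus, features, attention, c)
    total = sum(votes.values())
    return {category: v / total for category, v in votes.items()}

# Hypothetical exemplars: (feature values, category label).
exemplars = [((18, 3), "A"), ((20, 4), "A"), ((40, 9), "B")]

print(distance((42,), (18,), (1.0,)))   # the one-dimensional example from the text
print(choice_probabilities((19, 3), exemplars, attention=(1.0, 1.0), c=0.1))
```

The returned proportions play the role of the selection probabilities that would be compared against participants' response frequencies when fitting the model.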

Rule-based models

RULEX (Rule-Plus-Exception) Model

While simple logical rules are ineffective at learning poorly defined category structures, some proponents of the rule-based theory of categorization suggest that an imperfect rule can be used to learn such category structures if exceptions to that rule are also stored and considered. To formalize this proposal, Nosofsky and colleagues (1994) designed the RULEX model. The RULEX model attempts to form a decision tree composed of sequential tests of an object's attribute values. Categorization of the object is then determined by the outcome of these sequential tests. The RULEX model searches for rules in the following ways:

* Exact - Search for a rule that uses a single attribute to discriminate between classes without error.
* Imperfect - Search for a rule that uses a single attribute to discriminate between classes with few errors.
* Conjunctive - Search for a rule that uses multiple attributes to discriminate between classes with few errors.
* Exception - Search for exceptions to the rule.

The method that RULEX uses to perform these searches is as follows: First, RULEX attempts an exact search. If successful, then RULEX will continuously apply that rule until misclassification occurs. If the exact search fails to identify a rule, either an imperfect or conjunctive search will begin. A sufficient, though imperfect, rule acquired during one of these search phases will become permanently implemented and the RULEX model will then begin to search for exceptions. If no rule is acquired, then the model will attempt the search it did not perform in the previous phase. If successful, RULEX will permanently implement the rule and then begin an exception search. If none of the previous search methods are successful, RULEX will default to only searching for exceptions, despite lacking an associated rule, which equates to acquiring a random rule.
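The rule-plus-exception idea can be sketched in a heavily simplified, deterministic form. The code below is not the RULEX model of Nosofsky and colleagues: it covers only the single-attribute (exact and imperfect) searches and the exception search, it omits the conjunctive search and any trial-by-trial processing, and the error tolerance `max_errors`, the function names, and the training items are hypothetical.

```python
from collections import Counter, defaultdict

def single_attribute_rule(items, attr):
    """Map each value of one attribute to its most common class label."""
    by_value = defaultdict(Counter)
    for features, label in items:
        by_value[features[attr]][label] += 1
    return {value: counts.most_common(1)[0][0] for value, counts in by_value.items()}

def errors(items, attr, rule):
    """Training items that the single-attribute rule misclassifies."""
    return [(f, l) for f, l in items if rule[f[attr]] != l]

def learn(items, n_attrs, max_errors=1):
    """Pick the best single-attribute rule, then store its exceptions.
    Zero errors corresponds to an exact rule, a few errors to an imperfect one."""
    candidates = []
    for attr in range(n_attrs):
        rule = single_attribute_rule(items, attr)
        candidates.append((len(errors(items, attr, rule)), attr, rule))
    n_err, attr, rule = min(candidates, key=lambda c: (c[0], c[1]))
    if n_err > max_errors:
        return None  # a conjunctive search would be attempted here
    exceptions = {f: l for f, l in errors(items, attr, rule)}
    return attr, rule, exceptions

def classify(model, features):
    """Stored exceptions take priority over the rule."""
    attr, rule, exceptions = model
    return exceptions.get(features, rule[features[attr]])

# Hypothetical training items: (binary feature tuple, class label).
items = [((1, 0, 0), "A"), ((1, 1, 0), "A"), ((1, 0, 1), "A"),
         ((0, 1, 1), "B"), ((0, 0, 1), "B"), ((0, 1, 0), "A")]

model = learn(items, n_attrs=3)
print(model)
```

In this toy data set no single attribute separates the classes perfectly, so the sketch settles on an imperfect rule over the first attribute and memorizes the one item that violates it.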

Hybrid models

SUSTAIN (Supervised and Unsupervised Stratified Adaptive Incremental Network)

It is often the case that learned category representations vary depending on the learner's goals, as well as how categories are used during learning. Thus, some categorization researchers suggest that a proper model of categorization needs to be able to account for the variability present in the learner's goals, tasks, and strategies. This proposal was realized by Love and colleagues (2004) through the creation of SUSTAIN, a flexible clustering model capable of accommodating both simple and complex categorization problems through incremental adaptation to the specifics of problems.

In practice, the SUSTAIN model first converts a stimulus' perceptual information into features that are organized along a set of dimensions. The representational space that encompasses these dimensions is then distorted (e.g., stretched or shrunk) to reflect the importance of each feature based on inputs from an attentional mechanism. A set of clusters (specific instances grouped by similarity) associated with distinct categories then compete to respond to the stimulus, and the stimulus is assigned to the cluster that lies closest to it in the representational space. The unknown stimulus dimension value (e.g., category label) is then predicted by the winning cluster, which, in turn, informs the categorization decision.

The flexibility of the SUSTAIN model is realized through its ability to employ both supervised and unsupervised learning at the cluster level. If SUSTAIN incorrectly predicts a stimulus as belonging to a particular cluster, corrective feedback (i.e., supervised learning) signals SUSTAIN to recruit an additional cluster that represents the misclassified stimulus. Subsequent exposures to the stimulus (or a similar alternative) are therefore assigned to the correct cluster. SUSTAIN will also employ unsupervised learning to recruit an additional cluster if the similarity between the stimulus and the closest cluster does not exceed a threshold, as the model recognizes the weak predictive utility that would result from such a cluster assignment.

SUSTAIN also exhibits flexibility in how it solves both simple and complex categorization problems. Initially, the internal representation of SUSTAIN contains only a single cluster, thus biasing the model towards simple solutions. As problems become increasingly complex (e.g., requiring solutions consisting of multiple stimulus dimensions), additional clusters are incrementally recruited so SUSTAIN can handle the rise in complexity.
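The cluster-recruitment logic can be illustrated with a heavily simplified Python sketch. It is not the published SUSTAIN model: attention weights are fixed rather than learned, clusters are plain centroids, and the class name `SustainSketch`, the `threshold` and `rate` parameters, and the example stimuli are assumptions made for illustration.

```python
import math

class SustainSketch:
    def __init__(self, attention, threshold=0.5, rate=0.1):
        self.attention = attention    # per-dimension weights (fixed here)
        self.threshold = threshold    # assumed unsupervised recruitment threshold
        self.rate = rate              # assumed step size for moving a cluster
        self.clusters = []            # each: {"center": [...], "label": ...}

    def _similarity(self, center, stimulus):
        # Attention stretches or shrinks each dimension before comparison.
        d = sum(w * abs(c - s)
                for w, c, s in zip(self.attention, center, stimulus))
        return math.exp(-d)

    def _winner(self, stimulus):
        # Clusters compete; the most similar one responds.
        return max(self.clusters,
                   key=lambda k: self._similarity(k["center"], stimulus))

    def _recruit(self, stimulus, label):
        self.clusters.append({"center": list(stimulus), "label": label})

    def predict(self, stimulus):
        return self._winner(stimulus)["label"]

    def learn(self, stimulus, label=None):
        if not self.clusters:
            self._recruit(stimulus, label)   # start with a single cluster
            return
        winner = self._winner(stimulus)
        if label is not None:
            # Supervised: recruit when the winning cluster predicts wrongly.
            recruit = winner["label"] != label
        else:
            # Unsupervised: recruit when even the best match is too weak.
            recruit = self._similarity(winner["center"], stimulus) < self.threshold
        if recruit:
            self._recruit(stimulus, label)
        else:
            # Otherwise move the winning cluster toward the stimulus.
            winner["center"] = [c + self.rate * (s - c)
                                for c, s in zip(winner["center"], stimulus)]

model = SustainSketch(attention=[1.0, 1.0])
model.learn((0.0, 0.0), "A")    # first cluster
model.learn((0.1, 0.0), "A")    # correct prediction: cluster is adjusted
model.learn((1.0, 1.0), "B")    # wrong prediction: a second cluster is recruited
n_supervised = len(model.clusters)
prediction = model.predict((0.9, 1.0))
model.learn((5.0, 5.0))         # unlabeled and far from every cluster
n_total = len(model.clusters)
```

The sketch starts from one cluster and adds more only when a prediction fails or a stimulus is too dissimilar from everything seen so far, which mirrors the bias toward simple solutions described above.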

Social categorization

Social categorization consists of putting human beings into groups in order to identify them based on different criteria. Categorization is a process studied by scholars in cognitive science but can also be studied as a social activity. Social categorization is different from the categorization of other things because it implies that people create categories for themselves and others as human beings. Groups can be created based on ethnicity, country of origin, religion, sexual identity, social privileges, economic privileges, etc. Various ways to sort people exist according to one's schemas. People belong to various social groups because of their ethnicity, religion, or age. Social categories based on age, race, and gender are used by people when they encounter a new person. Because some of these categories refer to physical traits, they are often used automatically when people don't know each other. These categories are not objective and depend on how people see the world around them. They allow people to identify themselves with similar people, and to identify people who are different. They are useful in forming one's identity in relation to the people around them. People can build their own identity by identifying with a group or by rejecting another group. Social categorization is similar to other types of categorization since it aims at simplifying the understanding of people. However, creating social categories implies that people will position themselves in relation to other groups. A hierarchy in group relations can appear as a result of social categorization. Scholars argue that the categorization process starts at a young age when children start to learn about the world and the people around them. Children learn to recognize people according to categories based on similarities and differences. Social categories made by adults also shape children's understanding of the world. Children learn about social groups by hearing generalities about these groups from their parents. They can then develop prejudices about people as a result of these generalities. Another aspect of social categorization is mentioned by Stephen Reicher and Nick Hopkins and is related to political domination. They argue that political leaders use social categories to influence political debates.

Negative aspects

The activity of sorting people according to subjective or objective criteria can be seen as a negative process because of its tendency to lead to violence by one group against another. Indeed, similarities gather people who share common traits, but differences between groups can lead to tensions and then to the use of violence between those groups. The creation of social groups by people is responsible for a hierarchization of relations between groups. These hierarchical relations contribute to the promotion of stereotypes about people and groups, sometimes based on subjective criteria. Social categories can encourage people to associate stereotypes with groups of people. Associating stereotypes with a group, and with people who belong to this group, can lead to forms of discrimination towards people of this group. The perception of a group and the stereotypes associated with it have an impact on social relations and activities. Some social categories have more weight than others in society. For instance, in history and still today, the category of "race" is one of the first categories used to sort people. However, only a few categories of race are commonly used, such as "Black", "White", "Asian", etc. This reduces the multitude of ethnicities to a few categories based mostly on people's skin color. The process of sorting people creates a vision of the other as 'different', leading to the dehumanization of people. Scholars talk about intergroup relations with the concept of social identity theory developed by H. Tajfel. Indeed, in history, many examples of social categorization have led to forms of domination or violence by a dominant group against a dominated group. Periods of colonisation are examples of times when people from a group chose to dominate and control other people belonging to other groups because they considered them inferior. Racism, discrimination and violence are consequences of social categorization and can occur because of it. When people see others as different, they tend to develop hierarchical relations with other groups.

Miscategorization

There cannot be categorization without the possibility of miscategorization. To do "the right thing with the right kind of thing", there has to be both a right and a wrong thing to do. Not only does a category of which "everything" is a member lead logically to the Russell paradox ("is it or is it not a member of itself?"), but without the possibility of error, there is no way to detect or define what distinguishes category members from nonmembers. An example of the absence of nonmembers is the problem of the poverty of the stimulus in language learning by the child: children learning the language do not hear or make errors in the rules of Universal Grammar (UG). Hence they never get corrected for errors in UG. Yet children's speech obeys the rules of UG, and speakers can immediately detect that something is wrong if a linguist generates (deliberately) an utterance that violates UG. Hence speakers can categorize what is UG-compliant and UG-noncompliant. Linguists have concluded from this that the rules of UG must be somehow encoded innately in the human brain.

Ordinary categories, however, such as "dogs," have abundant examples of nonmembers (cats, for example). So it is possible to learn, by trial and error, with error-correction, to detect and define what distinguishes dogs from non-dogs, and hence to correctly categorize them. This kind of learning, called reinforcement learning in the behavioral literature and supervised learning in the computational literature, is fundamentally dependent on the possibility of error, and error-correction. Miscategorization (examples of nonmembers of the category) must always exist, not only to make the category learnable, but for the category to exist and be definable at all.

References

External links

To Cognize is to Categorize: Cognition is Categorization

Wikipedia Categories Visualizer

Interdisciplinary Introduction to Categorization: Interview with Dvora Yanov (political sciences), Amie Thomasson (philosophy) and Thomas Serre (artificial intelligence)