Adaptive resonance theory
   HOME

TheInfoList



OR:

Adaptive resonance theory (ART) is a theory developed by Stephen Grossberg and Gail Carpenter on aspects of how the brain processes information. It describes a number of neural network models which use supervised and
unsupervised learning Unsupervised learning is a type of algorithm that learns patterns from untagged data. The hope is that through mimicry, which is an important mode of learning in people, the machine is forced to build a concise representation of its world and t ...
methods, and address problems such as
pattern recognition Pattern recognition is the automated recognition of patterns and regularities in data. It has applications in statistical data analysis, signal processing, image analysis, information retrieval, bioinformatics, data compression, computer graphics ...
and prediction. The primary intuition behind the ART model is that object identification and recognition generally occur as a result of the interaction of 'top-down' observer expectations with 'bottom-up'
sensory information A sense is a biological system used by an organism for sensation, the process of gathering information about the world through the detection of stimuli. (For example, in the human body, the brain which is part of the central nervous system re ...
. The model postulates that 'top-down' expectations take the form of a memory template or
prototype A prototype is an early sample, model, or release of a product built to test a concept or process. It is a term used in a variety of contexts, including semantics, design, electronics, and software programming. A prototype is generally used to ...
that is then compared with the actual features of an object as detected by the senses. This comparison gives rise to a measure of category belongingness. As long as this difference between sensation and expectation does not exceed a set threshold called the 'vigilance parameter', the sensed object will be considered a member of the expected class. The system thus offers a solution to the 'plasticity/stability' problem, i.e. the problem of acquiring new knowledge without disrupting existing knowledge that is also called
incremental learning In computer science, incremental learning is a method of machine learning in which input data is continuously used to extend the existing model's knowledge i.e. to further train the model. It represents a dynamic technique of supervised learnin ...
.


Learning model

The basic ART system is an
unsupervised learning Unsupervised learning is a type of algorithm that learns patterns from untagged data. The hope is that through mimicry, which is an important mode of learning in people, the machine is forced to build a concise representation of its world and t ...
model. It typically consists of a ''comparison field'' and a ''recognition field'' composed of
neuron A neuron, neurone, or nerve cell is an electrically excitable cell that communicates with other cells via specialized connections called synapses. The neuron is the main component of nervous tissue in all animals except sponges and placozoa ...
s, a ''vigilance parameter'' (threshold of recognition), and a ''reset module''. * The comparison field takes an ''input vector'' (a one-dimensional array of values) and transfers it to its best match in the recognition field. ** Its best match is the single neuron whose set of weights (weight vector) most closely matches the input vector. * Each recognition field neuron outputs a negative signal (proportional to that neuron's quality of match to the input vector) to each of the other recognition field neurons and thus inhibits their output. ** In this way the recognition field exhibits
lateral inhibition In neurobiology, lateral inhibition is the capacity of an excited neuron to reduce the activity of its neighbors. Lateral inhibition disables the spreading of action potentials from excited neurons to neighboring neurons in the lateral direction ...
, allowing each neuron in it to represent a category to which input vectors are classified. * After the input vector is classified, the reset module compares the strength of the recognition match to the vigilance parameter. ** If the vigilance parameter is overcome (i.e. the input vector is within the normal range seen on previous input vectors), then training commences: *** The weights of the winning recognition neuron are adjusted towards the features of the input vector ** Otherwise, if the match level is below the vigilance parameter (i.e. the input vector's match is outside the normal expected range for that neuron) the winning recognition neuron is inhibited and a search procedure is carried out. *** In this search procedure, recognition neurons are disabled one by one by the reset function until the vigilance parameter is overcome by a recognition match. **** In particular, at each cycle of the search procedure the most active recognition neuron is selected and then switched off, if its activation is below the vigilance parameter **** (note that it thus releases the remaining recognition neurons from its inhibition). ** If no committed recognition neuron's match overcomes the vigilance parameter, then an uncommitted neuron is committed and its weights are adjusted towards matching the input vector. * The vigilance parameter has considerable influence on the system: higher vigilance produces highly detailed memories (many, fine-grained categories), while lower vigilance results in more general memories (fewer, more-general categories).


Training

There are two basic methods of training ART-based neural networks: slow and fast. In the slow learning method, the degree of training of the recognition neuron's weights towards the input vector is calculated to continuous values with
differential equation In mathematics, a differential equation is an equation that relates one or more unknown functions and their derivatives. In applications, the functions generally represent physical quantities, the derivatives represent their rates of change, ...
s and is thus dependent on the length of time the input vector is presented. With fast learning,
algebraic equation In mathematics, an algebraic equation or polynomial equation is an equation of the form :P = 0 where ''P'' is a polynomial with coefficients in some field, often the field of the rational numbers. For many authors, the term ''algebraic equation'' ...
s are used to calculate degree of weight adjustments to be made, and binary values are used. While fast learning is effective and efficient for a variety of tasks, the slow learning method is more biologically plausible and can be used with continuous-time networks (i.e. when the input vector can vary continuously).


Types

ART 1Carpenter, G.A. & Grossberg, S. (2003)
Adaptive Resonance Theory
, In Michael A. Arbib (Ed.), The Handbook of Brain Theory and Neural Networks, Second Edition (pp. 87-90). Cambridge, MA: MIT Press
Grossberg, S. (1987)
Competitive learning: From interactive activation to adaptive resonance
,
Cognitive Science (journal) ''Cognitive Science'' is a multidisciplinary peer-reviewed academic journal published by John Wiley & Sons on behalf of the Cognitive Science Society The Cognitive Science Society is a professional society for the interdisciplinary field of cog ...
, 11, 23-63
is the simplest variety of ART networks, accepting only binary inputs. ART 2Carpenter, G.A. & Grossberg, S. (1987)
ART 2: Self-organization of stable category recognition codes for analog input patterns
,
Applied Optics ''Applied Optics'' is a peer-reviewed scientific journal published by The Optical Society three times a month. It was established in 1962 with John N. Howard as founding editor-in-chief. The journal covers all aspects of optics, photonics, imagin ...
, 26(23), 4919-4930
extends network capabilities to support continuous inputs. ART 2-ACarpenter, G.A., Grossberg, S., & Rosen, D.B. (1991a)
ART 2-A: An adaptive resonance algorithm for rapid category learning and recognition
, ''
Neural Networks A neural network is a network or circuit of biological neurons, or, in a modern sense, an artificial neural network, composed of artificial neurons or nodes. Thus, a neural network is either a biological neural network, made up of biological ...
'', 4, 493-504
is a streamlined form of ART-2 with a drastically accelerated runtime, and with qualitative results being only rarely inferior to the full ART-2 implementation. ART 3Carpenter, G.A. & Grossberg, S. (1990)
ART 3: Hierarchical search using chemical transmitters in self-organizing pattern recognition architectures
, ''
Neural Networks A neural network is a network or circuit of biological neurons, or, in a modern sense, an artificial neural network, composed of artificial neurons or nodes. Thus, a neural network is either a biological neural network, made up of biological ...
'', 3, 129-152
builds on ART-2 by simulating rudimentary
neurotransmitter A neurotransmitter is a signaling molecule secreted by a neuron to affect another cell across a synapse. The cell receiving the signal, any main body part or target cell, may be another neuron, but could also be a gland or muscle cell. Neu ...
regulation of synaptic activity by incorporating simulated sodium (Na+) and calcium (Ca2+) ion concentrations into the system's equations, which results in a more physiologically realistic means of partially inhibiting categories that trigger mismatch resets. ARTMAPCarpenter, G.A., Grossberg, S., & Reynolds, J.H. (1991)
ARTMAP: Supervised real-time learning and classification of nonstationary data by a self-organizing neural network
, ''
Neural Networks A neural network is a network or circuit of biological neurons, or, in a modern sense, an artificial neural network, composed of artificial neurons or nodes. Thus, a neural network is either a biological neural network, made up of biological ...
'', 4, 565-588
also known as Predictive ART, combines two slightly modified ART-1 or ART-2 units into a supervised learning structure where the first unit takes the input data and the second unit takes the correct output data, then used to make the minimum possible adjustment of the vigilance parameter in the first unit in order to make the correct classification. Fuzzy ARTCarpenter, G.A., Grossberg, S., & Rosen, D.B. (1991b)
Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system
, ''
Neural Networks A neural network is a network or circuit of biological neurons, or, in a modern sense, an artificial neural network, composed of artificial neurons or nodes. Thus, a neural network is either a biological neural network, made up of biological ...
'', 4, 759-771
implements fuzzy logic into ART's pattern recognition, thus enhancing generalizability. An optional (and very useful) feature of fuzzy ART is complement coding, a means of incorporating the absence of features into pattern classifications, which goes a long way towards preventing inefficient and unnecessary category proliferation. The applied similarity measures are based on the L1 norm. Fuzzy ART is known to be very sensitive to noise. Fuzzy ARTMAPCarpenter, G.A., Grossberg, S., Markuzon, N., Reynolds, J.H., & Rosen, D.B. (1992)
Fuzzy ARTMAP: A neural network architecture for incremental supervised learning of analog multidimensional maps
, IEEE Transactions on Neural Networks, 3, 698-713
is merely ARTMAP using fuzzy ART units, resulting in a corresponding increase in efficacy. Simplified Fuzzy ARTMAP (SFAM) constitutes a strongly simplified variant of fuzzy ARTMAP dedicated to
classification Classification is a process related to categorization, the process in which ideas and objects are recognized, differentiated and understood. Classification is the grouping of related facts into classes. It may also refer to: Business, organizat ...
tasks. Gaussian ARTJames R. Williamson. (1996)
Gaussian ARTMAP: A Neural Network for Fast Incremental Learning of Noisy Multidimensional Maps
Neural Networks, 9(5):881-897
and Gaussian ARTMAP use Gaussian activation functions and computations based on probability theory. Therefore, they have some similarity with Gaussian mixture models. In comparison to fuzzy ART and fuzzy ARTMAP, they are less sensitive to noise. But the stability of learnt representations is reduced which may lead to category proliferation in open-ended learning tasks. Fusion ART and related networks extend ART and ARTMAP to multiple pattern channels. They support several learning paradigms, including unsupervised learning, supervised learning and reinforcement learning. TopoART combines fuzzy ART with topology learning networks such as the growing neural gas. Furthermore, it adds a noise reduction mechanism. There are several derived neural networks which extend TopoART to further learning paradigms. Hypersphere ARTGeorgios C. Anagnostopoulos and Michael Georgiopoulos. (2000)
Hypersphere ART and ARTMAP for Unsupervised and Supervised Incremental Learning
In: Proceedings of the International Joint Conference on Neural Networks (IJCNN), vol. 6, 59-64
and Hypersphere ARTMAP are closely related to fuzzy ART and fuzzy ARTMAP, respectively. But as they use a different type of category representation (namely hyperspheres), they do not require their input to be normalised to the interval
, 1 The comma is a punctuation mark that appears in several variants in different languages. It has the same shape as an apostrophe or single closing quotation mark () in many typefaces, but it differs from them in being placed on the baseline o ...
They apply similarity measures based on the
L2 norm In mathematics, a norm is a function from a real or complex vector space to the non-negative real numbers that behaves in certain ways like the distance from the origin: it commutes with scaling, obeys a form of the triangle inequality, and is z ...
. LAPART The Laterally Primed Adaptive Resonance Theory (LAPART) neural networks couple two Fuzzy ART algorithms to create a mechanism for making predictions based on learned associations. The coupling of the two Fuzzy ARTs has a unique stability that allows the system to converge rapidly towards a clear solution. Additionally, it can perform logical inference and supervised learning similar to fuzzy ARTMAP.


Criticism

It has been noted that results of Fuzzy ART and ART 1 (i.e., the learnt categories) depend critically upon the order in which the training data are processed. The effect can be reduced to some extent by using a slower learning rate, but is present regardless of the size of the input data set. Hence Fuzzy ART and ART 1 estimates do not possess the statistical property of
consistency In classical deductive logic, a consistent theory is one that does not lead to a logical contradiction. The lack of contradiction can be defined in either semantic or syntactic terms. The semantic definition states that a theory is consistent ...
.Sarle, Warren S. (1995)
Why Statisticians Should Not FART
This problem can be considered as a side effect of the respective mechanisms ensuring stable learning in both networks. More advanced ART networks such as TopoART and Hypersphere TopoART that summarise categories to clusters may solve this problem as the shapes of the clusters do not depend on the order of creation of the associated categories. (cf. Fig. 3(g, h) and Fig. 4 of Marko Tscherepanow. (2012
Incremental On-line Clustering with a Topology-Learning Hierarchical ART Neural Network Using Hyperspherical Categories
In: Poster and Industry Proceedings of the Industrial Conference on Data Mining (ICDM), 22–34
)


References

Wasserman, Philip D. (1989), Neural computing: theory and practice, New York: Van Nostrand Reinhold, {{ISBN, 0-442-20743-3


External links

* Stephen Grossberg'
website

ART's implementation for unsupervised learning (ART 1, ART 2A, ART 2A-C and ART distance)

Summary of the ART algorithm

LibTopoART
— TopoART implementations for supervised and unsupervised learning (TopoART, TopoART-AM, TopoART-C, TopoART-R, Episodic TopoART, Hypersphere TopoART, and Hypersphere TopoART-C) Neuropsychology Artificial neural networks