predictive coding
   HOME

TheInfoList



OR:

In
neuroscience Neuroscience is the scientific study of the nervous system (the brain, spinal cord, and peripheral nervous system), its functions and disorders. It is a multidisciplinary science that combines physiology, anatomy, molecular biology, development ...
, predictive coding (also known as predictive processing) is a theory of
brain A brain is an organ that serves as the center of the nervous system in all vertebrate and most invertebrate animals. It is located in the head, usually close to the sensory organs for senses such as vision. It is the most complex organ in a v ...
function which postulates that the brain is constantly generating and updating a "
mental model A mental model is an explanation of someone's thought process about how something works in the real world. It is a representation of the surrounding world, the relationships between its various parts and a person's intuitive perception about thei ...
" of the environment. According to the theory, such a mental model is used to predict input signals from the
senses A sense is a biological system used by an organism for sensation, the process of gathering information about the world through the detection of stimuli. (For example, in the human body, the brain which is part of the central nervous system rec ...
that are then compared with the actual input signals from those senses. With the rising popularity of
representation learning In machine learning, feature learning or representation learning is a set of techniques that allows a system to automatically discover the representations needed for feature detection or classification from raw data. This replaces manual feature e ...
, the theory is being actively pursued and applied in
machine learning Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence. Machine ...
and related fields. The phrase 'predictive coding' is also used in several other disciplines such as signal-processing technologies and law in loosely-related or unrelated senses.


Origins

Theoretical ancestors to predictive coding date back as early as 1860 with
Helmholtz Hermann Ludwig Ferdinand von Helmholtz (31 August 1821 – 8 September 1894) was a German physicist and physician who made significant contributions in several scientific fields, particularly hydrodynamic stability. The Helmholtz Association, ...
's concept of
unconscious inference Unconscious inference (German: unbewusster Schluss), also referred to as unconscious conclusion, is a term of perceptual psychology coined in 1867 by the German physicist and polymath Hermann von Helmholtz to describe an involuntary, pre-rational an ...
. Unconscious inference refers to the idea that the human brain fills in visual information to make sense of a scene. For example, if something is relatively smaller than another object in the visual field, the brain uses that information as a likely cue of depth, such that the perceiver ultimately (and involuntarily) experiences depth. The understanding of perception as the interaction between sensory stimuli (bottom-up) and conceptual knowledge (top-down) continued to be established by
Jerome Bruner Jerome Seymour Bruner (October 1, 1915 – June 5, 2016) was an American psychologist who made significant contributions to human cognitive psychology and cognitive learning theory in educational psychology. Bruner was a senior research fellow ...
who, starting in the 1940s, studied the ways in which needs, motivations and expectations influence perception, research that came to be known as 'New Look' psychology. In 1981,
McClelland McClelland is a surname. Notable people with the surname include: * Alyssa McClelland, Australian actress * Charles A. McClelland (1917–2006), American political systems analyst * Charles P. McClelland (1854–1944), New York politician, and US ...
and Rumelhart in their seminal paper examined the interaction between processing features (lines and contours) which form letters, which in turn form words. While the features suggest the presence of a word, they found that when letters were situated in the context of a word, people were able to identify them faster than when they were situated in a non-word without semantic context. McClelland and Rumelhart's parallel processing model describes perception as the meeting of top-down (conceptual) and bottom-up (sensory) elements. In the late 1990s, the idea of top-down and bottom-up processing was translated into a computational model of vision by Rao and Ballard. Their paper demonstrated that there could be a generative model of a scene (top-down processing), which would receive feedback via error signals (how much the visual input varied from the prediction), which would subsequently lead to updating the prediction. The computational model was able to replicate well-established receptive field effects, as well as less understood extra-classical receptive field effects such as
end-stopping An end-stopped line is a feature in poetry in which the syntactic unit (phrase, clause, or sentence) corresponds in length to the line. Its opposite is enjambment, where the sentence runs on into the next line. According to A. C. Bradley, "a line ...
. Today, the fields of computer science and cognitive science incorporate these same concepts to create the multilayer generative models that underlie machine learning and neural nets.


General framework

Most of the research literature in the field has been about
sensory perception Perception () is the organization, identification, and interpretation of sensory information in order to represent and understand the presented information or environment. All perception involves signals that go through the nervous system ...
, particularly vision, which is more easily conceptualized. However, the predictive coding framework could also be applied to different neural systems. Taking the sensory system as an example, the brain solves the seemingly intractable problem of modelling distal causes of sensory input through a version of
Bayesian inference Bayesian inference is a method of statistical inference in which Bayes' theorem is used to update the probability for a hypothesis as more evidence or information becomes available. Bayesian inference is an important technique in statistics, a ...
. It does this by modelling predictions of lower-level sensory inputs via backward connections from relatively higher levels in a cortical hierarchy. Constrained by the statistical regularities of the outside world (and certain evolutionarily prepared predictions), the brain encodes top-down generative models at various temporal and spatial scales in order to predict and effectively suppress sensory inputs rising up from lower levels. A comparison between predictions (priors) and sensory input (likelihood) yields a difference measure (e.g. prediction error, free energy, or surprise) which, if it is sufficiently large beyond the levels of expected statistical noise, will cause the generative model to update so that it better predicts sensory input in the future. In general, it can be more easily stated that it minimizes the amount of surprise (the measure of difference). This is also the reason for what is nowadays called
confirmation bias Confirmation bias is the tendency to search for, interpret, favor, and recall information in a way that confirms or supports one's prior beliefs or values. People display this bias when they select information that supports their views, ignoring ...
or what might historically be
prejudice Prejudice can be an affective feeling towards a person based on their perceived group membership. The word is often used to refer to a preconceived (usually unfavourable) evaluation or classification of another person based on that person's per ...
(although the latter has more negative connotations) since it better fits one's individual experience accumulated so far and supports consistency. Therefore, this turns out to be rather a disadvantage in today's world. If, instead, the model accurately predicts driving sensory signals, activity at higher levels cancels out activity at lower levels, and the posterior probability of the model is increased. Thus, predictive coding inverts the conventional view of perception as a mostly bottom-up process, suggesting that it is largely constrained by prior predictions, where signals from the external world only shape perception to the extent that they are propagated up the cortical hierarchy in the form of prediction error. In predictive coding, errors are neither good nor bad, but simply signal the difference between the expected and actual input. The exception is in reward processing, where a better than expected reward produces a positive prediction error and a disappointing result produces a negative prediction error.


Precision weighting

Expectations about the precision (or inverse variance) of incoming sensory input are crucial for effectively minimizing prediction error in that the expected precision of a given prediction error can inform confidence in that error, which influences the extent to which the error is weighted in updating predictions. Given that the world we live in is loaded with
statistical noise In statistics, the fraction of variance unexplained (FVU) in the context of a regression task is the fraction of variance of the regressand (dependent variable) ''Y'' which cannot be explained, i.e., which is not correctly predicted, by the e ...
, precision expectations must be represented as part of the brain's generative models, and they should be able to flexibly adapt to changing contexts. For instance, the expected precision of visual prediction errors likely varies between dawn and dusk, such that greater conditional confidence is assigned to errors in broad daylight than errors in prediction at nightfall. It has recently been proposed that such weighting of prediction errors in proportion to their estimated precision is, in essence, attention, and that the process of devoting attention may be neurobiologically accomplished by ascending reticular activating systems (ARAS) optimizing the “gain” of prediction error units.


Active inference

The same principle of prediction error minimization has been used to provide an account of behavior in which motor actions are not commands but descending proprioceptive predictions. In this scheme of
active inference The free energy principle is a mathematical principle in biophysics and cognitive science that provides a formal account of the representational capacities of physical systems: that is, why things that exist look as if they track properties of the ...
, classical reflex arcs are coordinated so as to selectively sample sensory input in ways that better fulfill predictions, thereby minimizing proprioceptive prediction errors. Indeed, Adams et al. (2013) review evidence suggesting that this view of hierarchical predictive coding in the motor system provides a principled and neurally plausible framework for explaining the agranular organization of the motor cortex. This view suggests that “perceptual and motor systems should not be regarded as separate but instead as a single active inference machine that tries to predict its sensory input in all domains: visual, auditory, somatosensory, interoceptive and, in the case of the motor system, proprioceptive."


Neural theory in predictive coding

Evaluating the empirical evidence that suggests a neurologically plausible basis for predictive coding is a broad and varied task. For one thing, and according to the model, predictive coding occurs at every iterative step in the perceptual and cognitive processes; accordingly, manifestations of predictive coding in the brain include genetics, specific cytoarchitecture of cells, systemic networks of neurons, and whole brain analyses. Due to this range of specificity, different methods of investigating the neural mechanisms of predictive coding have been applied, where available; more generally, however, and at least as it relates to humans, there are significant methodological limitations to investigating the potential evidence and much of the work is based on computational modeling of microcircuits in the brain. Notwithstanding, there has been substantial (theoretical) work that has been applied to understanding predictive coding mechanisms in the brain. This section will focus on specific evidence as it relates to the predictive coding phenomenon, rather than analogues, such as homeostasis (which are, nonetheless, integral to our overall understanding of Bayesian inference but already supported heavily; see Clark for a review). Much of the early work that applied a predictive coding framework to neural mechanisms came from sensory neurons, particularly in the visual cortex. More generally, however, what seems to be required by the theory are (at least) two types of neurons (at every level of the perceptual hierarchy): one set of neurons that encode incoming sensory input, so called feed-forward projections; one set of neurons that send down predictions, so called feed-backward projections. It is important to note that these neurons must also carry properties of error detection; which class of neurons has these properties is still up for debate. These sort of neurons have found support in superficial and non-superficial pyramidal neurons. At a more whole-brain level, there is evidence that different cortical layers (aka laminae) may facilitate the integration of feedforward and feed-backward projections across hierarchies. These cortical layers, divided into granular, agranular, and dysgranular, which house the subpopulations of neurons mentioned above, are divided into 6 main layers. The cytoarchitecture within these layers are the same, but they differ across layers. For example, layer 4 of the granular cortex contain granule cells which are excitatory and distribute thalamocortical inputs to the rest of the cortex. According to one model:
“...prediction neurons... in deep layers of agranular cortex drive active inference by sending sensory predictions via projections ...to supragranular layers of dysgranular and granular sensory cortices. Prediction-error neurons ….in the supragranular layers of granular cortex compute the difference between the predicted and received sensory signal, and send prediction-error signals via projections...back to the deep layers of agranular cortical regions. Precision cells … tune the gain on predictions and prediction error dynamically, thereby giving these signals reduced (or, in some cases, greater) weight depending on the relative confidence in the descending predictions or the reliability of incoming sensory signals.”
The theory that the unit of prediction is the
cortical column A cortical column is a group of neurons forming a cylindrical structure through the cerebral cortex of the brain perpendicular to the cortical surface. The structure was first identified by Mountcastle in 1957. He later identified cortical minicolu ...
is based on the remarkable correspondence between the microcircuitry of the cortical column and the connectivity implied by predictive coding.


Applying predictive coding


Perception

The empirical evidence for predictive coding is most robust for perceptual processing. As early as 1999, Rao and Ballard proposed a hierarchical
visual processing Visual processing is a term that is used to refer to the brain's ability to use and interpret visual information from the world around us. The process of converting light energy into a meaningful image is a complex process that is facilitated by ...
model in which higher-order visual cortical area sends down predictions and the feedforward connections carry the residual errors between the predictions and the actual lower-level activities. According to this model, each level in the hierarchical model network (except the lowest level, which represents the image) attempts to predict the responses at the next lower level via feedback connections, and the error signal is used to correct the estimate of the input signal at each level concurrently. Emberson et al. established the top-down modulation in infants using a cross-modal audiovisual omission paradigm, determining that even infant brains have expectation about future sensory input that is carried downstream from visual cortices and are capable of expectation-based feedback. Functional near-infrared spectroscopy (fNIRS) data showed that infant
occipital cortex The occipital lobe is one of the four major lobes of the cerebral cortex in the brain of mammals. The name derives from its position at the back of the head, from the Latin ''ob'', "behind", and ''caput'', "head". The occipital lobe is the vi ...
responded to unexpected visual omission (with no visual information input) but not to expected visual omission. These results establish that in a hierarchically organized perception system, higher-order neurons send down predictions to lower-order neurons, which in turn sends back up the prediction error signal.


Interoception

There have been several competing models for the role of predictive coding in interoception. In 2013, Anil Seth proposed that our subjective feeling states, otherwise known as emotions, are generated by predictive models that are actively built out of causal interoceptive appraisals. In relation to how we attribute internal states of others to causes, Sasha Ondobaka, James Kilner, and
Karl Friston Karl John Friston FRS FMedSci FRSB (born 12 July 1959) is a British neuroscientist and theoretician at University College London. He is an authority on brain imaging and theoretical neuroscience, especially the use of physics-inspired stati ...
(2015) proposed that the
free energy principle The free energy principle is a mathematical principle in biophysics and cognitive science that provides a formal account of the representational capacities of physical systems: that is, why things that exist look as if they track properties of the ...
requires the brain to produce a continuous series of predictions with the goal of reducing the amount of prediction error that manifests as “free energy”. These errors are then used to model anticipatory information about what the state of the outside world will be and attributions of causes of that world state, including understanding of causes of others’ behavior. This is especially necessary because, to create these attributions, our multimodal sensory systems need interoceptive predictions to organize themselves. Therefore, Ondobaka posits that predictive coding is key to understanding other people's internal states. In 2015,
Lisa Feldman Barrett Lisa Feldman Barrett is a distinguished professor of psychology at Northeastern University, where she focuses on affective science. She is a director of the Interdisciplinary Affective Science Laboratory. Along with James Russell, she is the fou ...
and W. Kyle Simmons (2015) proposed the Embodied Predictive Interoception Coding model, a framework that unifies Bayesian active inference principles with a physiological framework of corticocortical connections. Using this model, they posited that agranular visceromotor cortices are responsible for generating predictions about interoception, thus, defining the experience of interoception. In 2017, contrary to the inductive notion that emotion categories are biologically distinct, Barrett (2017) proposed the theory of constructed emotion, which is the account that a biological emotion category is constructed based on a conceptual category—the accumulation of instances sharing a goal. In a predictive coding model, Barrett hypothesizes that, in interoception, our brains regulate our bodies by activating "embodied simulations" (full-bodied representations of sensory experience) to anticipate what our brains predict that the external world will throw at us sensorially and how we will respond to it with action. These simulations are either preserved if, based on our brain's predictions, they prepare us well for what actually subsequently occurs in the external world, or they, and our predictions, are adjusted to compensate for their error in comparison to what actually occurs in the external world and how well-prepared we were for it. Then, in a trial-error-adjust process, our bodies find similarities in goals among certain successful anticipatory simulations and group them together under conceptual categories. Every time a new experience arises, our brains use this past trial-error-adjust history to match the new experience to one of the categories of accumulated corrected simulations that is shares the most similarity with. Then, they apply the corrected simulation of that category to the new experience in the hopes of preparing our bodies for the rest of the experience. If it does not, the prediction, the simulation, and perhaps the boundaries of the conceptual category are revised in the hopes of higher accuracy next time, and the process continues. Barrett hypothesizes that, when prediction error for a certain category of simulations for x-like experiences is minimized, what results is a correction-informed simulation that the body will reenact for every x-like experience, resulting in a correction-informed full-bodied representation of sensory experience—an emotion. In this sense, Barrett proposes that we construct our emotions because the conceptual category framework our brains use to compare new experiences, and to pick the appropriate predictive sensory simulation to activate, is built on the go.


Challenges

As a mechanistic theory, predictive coding has not been mapped out physiologically on the neuronal level. One of the biggest challenges to the theory has been the imprecision of exactly how prediction error minimization works. In some studies, the increase in
BOLD In typography, emphasis is the strengthening of words in a text with a font in a different style from the rest of the text, to highlight them. It is the equivalent of prosody stress in speech. Methods and use The most common methods in W ...
signal has been interpreted as error signal while in others it indicates changes in the input representation. A crucial question that needs to be addressed is what exactly constitutes error signal and how it is computed at each level of information processing. Another challenge that has been posed is predictive coding's computational tractability. According to Kwisthout and van Rooij, the subcomputation in each level of the predictive coding framework potentially hides a computationally intractable problem, which amounts to “intractable hurdles” that computational modelers have yet to overcome. Ransom and Fazelpour (2015) indicate "Three Problems for the Predictive Coding Theory of Attention".Ransom M. & Fazelpour S (2015). Three Problems for the Predictive Coding Theory of Attention. http://mindsonline.philosophyofbrains.com/2015/session4/three-problems-for-the-predictive-coding-theory-of-attention/ Future research could focus on clarifying the neurophysiological mechanism and computational model of predictive coding.


See also

*
Blue Brain Project The Blue Brain Project is a Swiss brain research initiative that aims to create a digital reconstruction of the mouse brain. The project was founded in May 2005 by the Brain and Mind Institute of ''École Polytechnique Fédérale de Lausanne'' (EP ...
*
Cognitive biology Cognitive biology is an emerging science that regards natural cognition as a biological function. It is based on the theoretical assumption that every organism—whether a single cell or multicellular—is continually engaged in systematic acts of ...
*
Cognitive linguistics Cognitive linguistics is an interdisciplinary branch of linguistics, combining knowledge and research from cognitive science, cognitive psychology, neuropsychology and linguistics. Models and theoretical accounts of cognitive linguistics are con ...
*
Cognitive neuropsychology Cognitive neuropsychology is a branch of cognitive psychology that aims to understand how the structure and function of the brain relates to specific psychological processes. Cognitive psychology is the science that looks at how mental processes ...
*
Cognitive neuroscience Cognitive neuroscience is the scientific field that is concerned with the study of the biological processes and aspects that underlie cognition, with a specific focus on the neural connections in the brain which are involved in mental proces ...
* Cognitive science *
Conceptual blending In cognitive linguistics, conceptual blending, also called conceptual integration or view application, is a theory of cognition developed by Gilles Fauconnier and Mark Turner. According to this theory, elements and vital relations from diverse sce ...
*
Conceptual metaphor In cognitive linguistics, conceptual metaphor, or cognitive metaphor, refers to the understanding of one idea, or conceptual domain, in terms of another. An example of this is the understanding of quantity in terms of directionality (e.g. "the pr ...
*
Cortical column A cortical column is a group of neurons forming a cylindrical structure through the cerebral cortex of the brain perpendicular to the cortical surface. The structure was first identified by Mountcastle in 1957. He later identified cortical minicolu ...
* Embodied bilingual language *
Embodied cognitive science Embodied cognitive science is an interdisciplinary field of research, the aim of which is to explain the mechanisms underlying intelligent behavior. It comprises three main methodologies: the modeling of psychological and biological systems in a ...
*
Embodied Embedded Cognition Embodied embedded cognition (EEC) is a philosophical theoretical position in cognitive science, closely related to situated cognition, embodied cognition, embodied cognitive science and dynamical systems theory. The theory states that intellige ...
*
Embodied music cognition Embodied music cognition is a direction within systematic musicology interested in studying the role of the human body in relation to all musical activities. It considers the human body as the natural mediator between mind (focused on musical in ...
*
Enactivism Enactivism is a position in cognitive science that argues that cognition arises through a dynamic interaction between an acting organism and its environment. It claims that the environment of an organism is brought about, or enacted, by the active ...
* Extended cognition *
Extended mind thesis In philosophy of mind, the extended mind thesis (EMT) says that the mind does not exclusively reside in the brain or even the body, but extends into the physical world. The EMT proposes that some objects in the external environment can be part o ...
*
Externalism Externalism is a group of positions in the philosophy of mind which argues that the conscious mind is not only the result of what is going on inside the nervous system (or the brain), but also what ''occurs'' or ''exists'' outside the subject. It i ...
*
Heuristic A heuristic (; ), or heuristic technique, is any approach to problem solving or self-discovery that employs a practical method that is not guaranteed to be optimal, perfect, or rational, but is nevertheless sufficient for reaching an immediate, ...
*
Image schema An image schema (both ''schemas'' and ''schemata'' are used as plural forms) is a recurring structure within our cognitive processes which establishes patterns of understanding and reasoning. As an understudy to embodied cognition, image schemas ar ...
*
Moravec's paradox Moravec's paradox is the observation by artificial intelligence and robotics researchers that, contrary to traditional assumptions, reasoning requires very little computation, but sensorimotor and perception skills require enormous computational ...
*
Neuroconstructivism Neuroconstructivism is a theory that states that phylogenetic developmental processes such as gene–gene interaction, gene–environment interaction and, crucially, ontogeny all play a vital role in how the brain progressively sculpts itself and ...
*
Neuropsychology Neuropsychology is a branch of psychology concerned with how a person's cognition and behavior are related to the brain and the rest of the nervous system. Professionals in this branch of psychology often focus on how injuries or illnesses of t ...
*
Neurophenomenology Neurophenomenology refers to a scientific research program aimed to address the hard problem of consciousness in a pragmatic way. It combines neuroscience with phenomenology in order to study experience, mind, and consciousness with an emphasis on ...
*
Philosophy of mind Philosophy of mind is a branch of philosophy that studies the ontology and nature of the mind and its relationship with the body. The mind–body problem is a paradigmatic issue in philosophy of mind, although a number of other issues are addre ...
* Plant cognition *
Practopoiesis An adaptive system is a set of interacting or interdependent entities, real or abstract, forming an integrated whole that together are able to respond to environmental changes or changes in the interacting parts, in a way analogous to either conti ...
*
Situated cognition Situated cognition is a theory that posits that knowing is inseparable from doing by arguing that all knowledge is situated in activity bound to social, cultural and physical contexts. Under this assumption, which requires an epistemological shift ...
*
Where Mathematics Comes From ''Where Mathematics Comes From: How the Embodied Mind Brings Mathematics into Being'' (hereinafter ''WMCF'') is a book by George Lakoff, a cognitive linguistics, cognitive linguist, and Rafael E. Núñez, a psychologist. Published in 2000, ''WMCF' ...
{{div col end


References

Cognitive modeling Neuropsychology Perception