Visual perception is the ability to interpret the surrounding environment through photopic vision (daytime vision), color vision, scotopic vision (night vision), and mesopic vision (twilight vision), using light in the visible spectrum reflected by objects in the environment. This is different from visual acuity, which refers to how clearly a person sees (for example "20/20 vision"). A person can have problems with visual perceptual processing even if they have 20/20 vision.
The resulting perception is also known as vision, sight, or eyesight (adjectives ''visual'', ''optical'', and ''ocular'', respectively). The various physiological components involved in vision are referred to collectively as the visual system, and are the focus of much research in linguistics, psychology, cognitive science, neuroscience, and molecular biology, collectively referred to as vision science.
Visual system
In humans and a number of other mammals, light enters the eye through the cornea and is focused by the lens onto the retina, a light-sensitive membrane at the back of the eye. The retina serves as a transducer for the conversion of light into neuronal signals. This transduction is achieved by specialized photoreceptive cells of the retina, also known as the rods and cones, which detect the photons of light and respond by producing neural impulses. These signals are transmitted by the optic nerve from the retina upstream to central ganglia in the brain. The lateral geniculate nucleus transmits the information to the visual cortex. Signals from the retina also travel directly to the superior colliculus.
The lateral geniculate nucleus sends signals to the primary visual cortex, also called striate cortex. Extrastriate cortex, also called visual association cortex, is a set of cortical structures that receive information from striate cortex, as well as from each other. Recent descriptions of visual association cortex describe a division into two functional pathways, a ventral and a dorsal pathway. This conjecture is known as the two streams hypothesis.
The human visual system is generally believed to be sensitive to visible light in the range of wavelengths between 370 and 730 nanometers (0.00000037 to 0.00000073 meters) of the electromagnetic spectrum. However, some research suggests that humans can perceive light in wavelengths down to 340 nanometers (UV-A), especially the young. Under optimal conditions these limits of human perception can extend from 310 nm (UV) to 1100 nm (NIR).
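As a rough illustration of the ranges quoted above, the following sketch converts a wavelength in nanometers to meters and to frequency, and labels it against the two ranges given in the text. The function names and the band labels are hypothetical, chosen for this example only; the boundaries are simply the figures stated above, not a standard.

```python
SPEED_OF_LIGHT_M_PER_S = 299_792_458  # exact, by definition of the metre

# Limits taken from the text above, in nanometers.
TYPICAL_RANGE_NM = (370, 730)
EXTENDED_RANGE_NM = (310, 1100)


def nm_to_m(wavelength_nm):
    """Convert a wavelength from nanometers to meters."""
    return wavelength_nm * 1e-9


def frequency_thz(wavelength_nm):
    """Frequency in terahertz for a vacuum wavelength in nanometers."""
    return SPEED_OF_LIGHT_M_PER_S / nm_to_m(wavelength_nm) / 1e12


def classify(wavelength_nm):
    """Label a wavelength against the two ranges quoted in the text."""
    low, high = TYPICAL_RANGE_NM
    if low <= wavelength_nm <= high:
        return "typical"
    low, high = EXTENDED_RANGE_NM
    if low <= wavelength_nm <= high:
        return "optimal conditions only"
    return "outside reported limits"


if __name__ == "__main__":
    for nm in (340, 550, 1000, 1200):
        print(nm, "nm:", round(frequency_thz(nm)), "THz,", classify(nm))
```

The conversion uses only the relation frequency = speed of light / wavelength; shorter wavelengths therefore correspond to higher frequencies.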
Study
The major problem in visual perception is that what people see is not simply a translation of retinal stimuli (i.e., the image on the retina). Thus people interested in perception have long struggled to explain what visual processing does to create what is actually seen.
Early studies
There were two major ancient Greek schools, each providing a primitive explanation of how vision works.
The first was the "emission theory" of vision, which maintained that vision occurs when rays emanate from the eyes and are intercepted by visual objects. If an object was seen directly it was by 'means of rays' coming out of the eyes and again falling on the object. A refracted image was, however, seen by 'means of rays' as well, which came out of the eyes, traversed through the air, and after refraction, fell on the visible object which was sighted as the result of the movement of the rays from the eye. This theory was championed by scholars who were followers of Euclid's ''Optics'' and Ptolemy's ''Optics''.
The second school advocated the so-called 'intromission' approach, which sees vision as coming from something entering the eyes representative of the object. With its main propagator Aristotle (''De Sensu'') and his followers, this theory seems to have some contact with modern theories of what vision really is, but it remained only a speculation lacking any experimental foundation. (In eighteenth-century England, Isaac Newton, John Locke, and others carried the intromission theory of vision forward by insisting that vision involved a process in which rays, composed of actual corporeal matter, emanated from seen objects and entered the seer's mind/sensorium through the eye's aperture.)
Both schools of thought relied upon the principle that "like is only known by like", and thus upon the notion that the eye was composed of some "internal fire" that interacted with the "external fire" of visible light and made vision possible. Plato makes this assertion in his dialogue ''Timaeus'' (45b and 46b), as does Empedocles (as reported by Aristotle in his ''De Sensu'', DK frag. B17).
Alhazen (965–1040) carried out many investigations and experiments on visual perception, extended the work of Ptolemy on binocular vision, and commented on the anatomical works of Galen. He was the first person to explain that vision occurs when light bounces off an object and is then directed to one's eyes.
Leonardo da Vinci (1452–1519) is believed to be the first to recognize the special optical qualities of the eye. He wrote "The function of the human eye ... was described by a large number of authors in a certain way. But I found it to be completely different." His main experimental finding was that there is only a distinct and clear vision at the line of sight, the optical line that ends at the fovea. Although he did not use these words literally, he is in effect the father of the modern distinction between foveal and peripheral vision.
Isaac Newton (1642–1726/27) was the first to discover through experimentation, by isolating individual colors of the spectrum of light passing through a prism, that the visually perceived color of objects appeared due to the character of light the objects reflected, and that these divided colors could not be changed into any other color, which was contrary to scientific expectation of the day.
Unconscious inference
Hermann von Helmholtz is often credited with the first modern study of visual perception. Helmholtz examined the human eye and concluded that it was incapable of producing a high-quality image. Insufficient information seemed to make vision impossible. He, therefore, concluded that vision could only be the result of some form of "unconscious inference", coining that term in 1867. He proposed the brain was making assumptions and conclusions from incomplete data, based on previous experiences.
Inference requires prior experience of the world.
Examples of well-known assumptions, based on visual experience, are:
* light comes from above
* objects are normally not viewed from below
* faces are seen (and recognized) upright
* closer objects can block the view of more distant objects, but not vice versa
* figures (i.e., foreground objects) tend to have convex borders
The study of visual illusions (cases when the inference process goes wrong) has yielded much insight into what sort of assumptions the visual system makes.
Another type of unconscious inference hypothesis (based on probabilities) has recently been revived in so-called Bayesian studies of visual perception. Proponents of this approach consider that the visual system performs some form of Bayesian inference to derive a perception from sensory data. However, it is not clear how proponents of this view derive, in principle, the relevant probabilities required by the Bayesian equation. Models based on this idea have been used to describe various visual perceptual functions, such as the perception of motion, the perception of depth, and figure-ground perception. The "wholly empirical theory of perception" is a related and newer approach that rationalizes visual perception without explicitly invoking Bayesian formalisms.
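To make the idea concrete, here is a minimal sketch of Bayes' rule applied to a toy perceptual decision: whether a shaded patch that is brighter on top is a bump (convex) or a dent (concave). It borrows the "light comes from above" assumption listed earlier. All probabilities are invented for illustration, and the hypothesis names and the `posterior` helper are assumptions of this example, not part of any published model.

```python
def posterior(prior, likelihood):
    """Return P(hypothesis | data) given P(hypothesis) and P(data | hypothesis)."""
    unnormalized = {h: prior[h] * likelihood[h] for h in prior}
    evidence = sum(unnormalized.values())  # P(data), the normalizing constant
    return {h: p / evidence for h, p in unnormalized.items()}


# Hypothetical belief that the scene is lit from above rather than below.
P_LIGHT_ABOVE = 0.8

# Before looking, both shapes are assumed equally likely (hypothetical).
prior = {"convex": 0.5, "concave": 0.5}

# A convex patch is bright on top when lit from above;
# a concave patch is bright on top when lit from below.
likelihood_bright_top = {
    "convex": P_LIGHT_ABOVE,
    "concave": 1.0 - P_LIGHT_ABOVE,
}

belief = posterior(prior, likelihood_bright_top)
print(belief)
```

With these made-up numbers the convex interpretation ends up favored. The sketch also shows where the objection raised above bites: the result depends entirely on values such as `P_LIGHT_ABOVE`, which have to come from somewhere.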
Gestalt theory
Gestalt psychologists working primarily in the 1930s and 1940s raised many of the research questions that are studied by vision scientists today.
The Gestalt Laws of Organization have guided the study of how people perceive visual components as organized patterns or wholes, instead of many different parts. "Gestalt" is a German word that partially translates to "configuration or pattern" along with "whole or emergent structure". According to this theory, there are eight main factors that determine how the visual system automatically groups elements into patterns: Proximity, Similarity, Closure, Symmetry, Common Fate (i.e. common motion), Continuity as well as Good Gestalt (pattern that is regular, simple, and orderly) and Past Experience.
Analysis of eye movement
During the 1960s, technical development permitted the continuous registration of eye movement during reading, in picture viewing, and later, in visual problem solving, and when headset-cameras became available, also during driving.
The picture to the right shows what may happen during the first two seconds of visual inspection. While the background is out of focus, representing the peripheral vision, the first eye movement goes to the boots of the man (just because they are very near the starting fixation and have a reasonable contrast). Eye movements serve the function of attentional selection, i.e., to select a fraction of all visual inputs for deeper processing by the brain.
The following fixations jump from face to face. They might even permit comparisons between faces.
It may be concluded that the icon ''face'' is a very attractive search icon within the peripheral field of vision. The foveal vision adds detailed information to the peripheral ''first impression''.
It can also be noted that there are different types of eye movements: fixational eye movements (microsaccades, ocular drift, and tremor), vergence movements, saccadic movements and pursuit movements. ''Fixations'' are comparably static points where the eye rests. However, the eye is never completely still, and gaze position will drift. These drifts are in turn corrected by microsaccades, very small fixational eye movements. ''Vergence movements'' involve the cooperation of both eyes to allow for an image to fall on the same area of both retinas. This results in a single focused image. ''Saccadic movements'' are the type of eye movement that jumps from one position to another and is used to rapidly scan a particular scene or image. Lastly, ''pursuit movement'' is smooth eye movement and is used to follow objects in motion.
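In eye-tracking analysis, the distinction between fixations and saccades is often operationalized by thresholding gaze velocity. The sketch below is one such simplification and is not a method described in this article: the function name, the 30 degrees-per-second threshold, and the sample data are all assumptions chosen for illustration, and it handles only a one-dimensional gaze angle.

```python
def label_samples(angles_deg, sample_rate_hz, threshold_deg_per_s=30.0):
    """Label each interval between consecutive gaze samples.

    angles_deg: gaze direction in degrees, one value per sample.
    Returns a list of len(angles_deg) - 1 labels.
    """
    dt = 1.0 / sample_rate_hz  # seconds between samples
    labels = []
    for previous, current in zip(angles_deg, angles_deg[1:]):
        speed = abs(current - previous) / dt  # degrees per second
        labels.append("saccade" if speed > threshold_deg_per_s else "fixation")
    return labels


# Hypothetical 100 Hz recording: slow drift, one 5-degree jump, slow drift again.
gaze = [0.00, 0.01, 0.02, 5.02, 5.03, 5.04]
print(label_samples(gaze, sample_rate_hz=100))
```

A single threshold like this cannot separate the other movement types listed above (microsaccades, vergence, pursuit); it only illustrates that fixations are slow and saccades are fast.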
Face and object recognition
There is considerable evidence that face and object recognition are accomplished by distinct systems. For example, prosopagnosic patients show deficits in face, but not object processing, while object agnosic patients (most notably, patient C.K.) show deficits in object processing with spared face processing. Behaviorally, it has been shown that faces, but not objects, are subject to inversion effects, leading to the claim that faces are "special". Further, face and object processing recruit distinct neural systems. Notably, some have argued that the apparent specialization of the human brain for face processing does not reflect true domain specificity, but rather a more general process of expert-level discrimination within a given class of stimulus, though this latter claim is the subject of substantial debate. Using fMRI and electrophysiology, Doris Tsao and colleagues described brain regions and a mechanism for face recognition in macaque monkeys.
The inferotemporal cortex has a key role in the recognition and differentiation of objects. A study by MIT shows that subset regions of the IT cortex are in charge of different objects. When neural activity in many small areas of the cortex is selectively shut off, the animal becomes alternately unable to distinguish between certain particular pairs of objects. This shows that the IT cortex is divided into regions that respond to different and particular visual features. In a similar way, certain patches and regions of the cortex are more involved in face recognition than in other object recognition.
Some studies tend to show that, rather than the uniform global image, particular features and regions of interest of an object are the key elements when the brain needs to recognise that object in an image. As a result, human vision is vulnerable to small, specific changes to the image, such as disrupting the edges of the object, modifying texture, or any small change in a crucial region of the image.
Studies of people whose sight has been restored after a long blindness reveal that they cannot necessarily recognize objects and faces (as opposed to color, motion, and simple geometric shapes). Some hypothesize that being blind during childhood prevents some part of the visual system necessary for these higher-level tasks from developing properly. The general belief that a critical period lasts until age 5 or 6 was challenged by a 2007 study that found that older patients could improve these abilities with years of exposure.
Cognitive and computational approaches
In the 1970s, David Marr developed a multi-level theory of vision, which analyzed the process of vision at different levels of abstraction. In order to focus on the understanding of specific problems in vision, he identified three levels of analysis: the ''computational'', ''algorithmic'' and ''implementational'' levels. Many vision scientists, including Tomaso Poggio, have embraced these levels of analysis and employed them to further characterize vision from a computational perspective.
The ''computational level'' addresses, at a high level of abstraction, the problems that the visual system must overcome. The ''algorithmic level'' attempts to identify the strategy that may be used to solve these problems. Finally, the ''implementational level'' attempts to explain how solutions to these problems are realized in neural circuitry.
Marr suggested that it is possible to investigate vision at any of these levels independently. Marr described vision as proceeding from a two-dimensional visual array (on the retina) to a three-dimensional description of the world as output. His stages of vision include:
* A ''2D'' or ''primal sketch'' of the scene, based on feature extraction of fundamental components of the scene, including edges, regions, etc. Note the similarity in concept to a pencil sketch drawn quickly by an artist as an impression.
* A ''2½D sketch'' of the scene, where textures are acknowledged, etc. Note the similarity in concept to the stage in drawing where an artist highlights or shades areas of a scene, to provide depth.
* A ''3D model'', where the scene is visualized in a continuous, 3-dimensional map.
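As a loose illustration of the kind of feature extraction behind a primal sketch, the following example marks edges in a tiny grayscale image by thresholding the brightness difference between horizontally adjacent pixels. This is a generic finite-difference edge detector, not Marr's own algorithm; the function name, the threshold, and the image values are hypothetical.

```python
def horizontal_edges(image, threshold=50):
    """Mark positions where brightness changes sharply between neighbours.

    image: list of rows, each a list of grayscale values (0-255).
    Returns rows of 0/1 flags, one flag per adjacent pixel pair.
    """
    edges = []
    for row in image:
        flags = [
            1 if abs(right - left) > threshold else 0
            for left, right in zip(row, row[1:])
        ]
        edges.append(flags)
    return edges


# Hypothetical 3x5 image: dark region on the left, bright region on the right.
image = [
    [10, 12, 11, 200, 205],
    [9, 11, 10, 198, 202],
    [10, 10, 12, 201, 204],
]
for flags in horizontal_edges(image):
    print(flags)
```

Each output row flags the single boundary between the dark and bright regions, which is the sort of edge token a primal sketch would carry forward to later stages.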
Marr's 2½D sketch assumes that a depth map is constructed, and that this map is the basis of 3D shape perception. However, both stereoscopic and pictorial perception, as well as monocular viewing, make clear that the perception of 3D shape precedes, and does not rely on, the perception of the depth of points. It is not clear how a preliminary depth map could, in principle, be constructed, nor how this would address the question of figure-ground organization, or grouping. The role of perceptual organizing constraints, overlooked by Marr, in the production of 3D shape percepts from binocularly viewed 3D objects has been demonstrated empirically for the case of 3D wire objects. For a more detailed discussion, see Pizlo (2008).
A more recent, alternative framework proposes that vision is composed instead of the following three stages: encoding, selection, and decoding. Encoding is to sample and represent visual inputs (e.g., to represent visual inputs as neural activities in the retina). Selection, or attentional selection, is to select a tiny fraction of input information for further processing, e.g., by shifting gaze to an object or visual location to better process the visual signals at that location. Decoding is to infer or recognize the selected input signals, e.g., to recognize the object at the center of gaze as somebody's face. In this framework, attentional selection starts at the primary visual cortex along the visual pathway, and the attentional constraints impose a dichotomy between the central and peripheral visual fields for visual recognition or decoding.
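The three stages can be pictured as a small pipeline. The following is a minimal sketch, assuming a made-up salience rule (contrast from the mean intensity) and made-up templates; the function names `encode`, `select`, and `decode` simply mirror the stage names and do not correspond to any published model.

```python
# Toy sketch of the encoding -> selection -> decoding framework.
# The salience rule and the templates are hypothetical.

def encode(image):
    """Encoding: represent each location by an activity value
    (here, absolute contrast from the mean intensity)."""
    flat = [v for row in image for v in row]
    mean = sum(flat) / len(flat)
    return [[abs(v - mean) for v in row] for row in image]

def select(activity):
    """Selection: keep a tiny fraction of the input, here the single
    most active location (analogous to shifting gaze to it)."""
    locations = [
        (r, c)
        for r in range(len(activity))
        for c in range(len(activity[0]))
    ]
    return max(locations, key=lambda rc: activity[rc[0]][rc[1]])

def decode(image, location, templates):
    """Decoding: label the selected input by its nearest template value."""
    r, c = location
    value = image[r][c]
    return min(templates, key=lambda name: abs(templates[name] - value))

image = [
    [1, 1, 1],
    [1, 8, 1],
    [1, 1, 1],
]
templates = {"background": 1, "face": 9}
location = select(encode(image))
label = decode(image, location, templates)
```

Only the selected location is passed to `decode`, which reflects the framework's point that recognition operates on a small, attended part of the input.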
Transduction
Transduction is the process through which energy from environmental stimuli is converted to neural activity. The retina contains three different cell layers: photoreceptor layer, bipolar cell layer and ganglion cell layer. The photoreceptor layer where transduction occurs is farthest from the lens. It contains photoreceptors with different sensitivities called rods and cones. The cones are responsible for color perception and are of three distinct types labelled red, green and blue. Rods are responsible for the perception of objects in low light. Photoreceptors contain within them a special chemical called a photopigment, which is embedded in the membrane of the lamellae; a single human rod contains approximately 10 million of them. The photopigment molecules consist of two parts: an opsin (a protein) and retinal (a lipid). There are three specific photopigments (each with its own wavelength sensitivity) that respond across the spectrum of visible light. When the appropriate wavelengths (those that the specific photopigment is sensitive to) hit the photoreceptor, the photopigment splits into two, which sends a signal to the bipolar cell layer, which in turn sends a signal to the ganglion cells, the axons of which form the optic nerve and transmit the information to the brain. If a particular cone type is missing or abnormal, due to a genetic anomaly, a color vision deficiency, sometimes called color blindness, will occur.
Opponent process
Transduction involves chemical messages sent from the photoreceptors to the bipolar cells to the ganglion cells. Several photoreceptors may send their information to one ganglion cell. There are two types of ganglion cells: red/green and yellow/blue. These neurons constantly fire, even when not stimulated. The brain interprets different colors (and with a lot of information, an image) when the rate of firing of these neurons alters. Red light stimulates the red cone, which in turn stimulates the red/green ganglion cell. Likewise, green light stimulates the green cone, which stimulates the green/red ganglion cell, and blue light stimulates the blue cone, which stimulates the blue/yellow ganglion cell. The firing rate of a ganglion cell increases when it is signaled by one cone and decreases (is inhibited) when it is signaled by the other cone. The first color in the name of the ganglion cell is the color that excites it and the second is the color that inhibits it. For example, a red cone would excite the red/green ganglion cell and the green cone would inhibit it. This is an opponent process. If the rate of firing of a red/green ganglion cell is increased, the brain would know that the light was red; if the rate is decreased, the brain would know that the light was green.
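The excite/inhibit rule described above can be written as a small numerical sketch. The baseline rate and gain below are made-up values in arbitrary units, not physiological measurements, and the function names are hypothetical.

```python
# Toy model of an opponent ganglion cell, following the text's convention:
# the first color in the cell's name excites it, the second inhibits it.
# BASELINE and GAIN are made-up numbers, not physiological measurements.

BASELINE = 10.0  # spontaneous firing rate (arbitrary units)
GAIN = 5.0       # change in rate per unit of cone signal

def firing_rate(excitatory_cone, inhibitory_cone):
    """Return the firing rate given two cone signals (each 0.0 to 1.0)."""
    rate = BASELINE + GAIN * (excitatory_cone - inhibitory_cone)
    return max(rate, 0.0)  # a firing rate cannot be negative

def interpret_red_green(rate):
    """Read out color from a red/green cell by comparing with baseline."""
    if rate > BASELINE:
        return "red"
    if rate < BASELINE:
        return "green"
    return "neither"

red_light = firing_rate(excitatory_cone=1.0, inhibitory_cone=0.0)
green_light = firing_rate(excitatory_cone=0.0, inhibitory_cone=1.0)
no_light = firing_rate(excitatory_cone=0.0, inhibitory_cone=0.0)
```

Because the cell fires at a nonzero baseline when unstimulated, a single cell can signal two colors: a rate above baseline reads as red and a rate below baseline reads as green.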
Artificial visual perception
Theories and observations of visual perception have been the main source of inspiration for computer vision (also called machine vision, or computational vision). Special hardware structures and software algorithms provide machines with the capability to interpret the images coming from a camera or a sensor. For instance, the 2022 Toyota 86 uses the Subaru EyeSight system for driver-assist technology.
See also
* Color vision
* Computer vision
* Depth perception
* Entoptic phenomenon
* Gestalt psychology
* Lateral masking
* Looming
* Naked eye
* Machine vision
* Motion perception
* Multisensory integration
* Interpretation (philosophy)
* Spatial frequency
* Visual illusion
* Visual processing
* Visual system
* Sensations
Vision deficiencies or disorders
* Achromatopsia
* Akinetopsia
* Apperceptive agnosia
* Associative visual agnosia
* Color blindness
* Hallucinogen persisting perception disorder
* Illusory palinopsia
* Prosopagnosia
* Refractive error
* Recovery from blindness
* Scotopic sensitivity syndrome
* Visual agnosia
* Visual snow
Related disciplines
* Cognitive psychology
* Cognitive science
* Neuroscience
* Ophthalmology
* Optometry
* Psychophysics
References
Further reading
* Quotations are from the English translation produced by the Optical Society of America (1924–25): ''Treatise on Physiological Optics''.
External links
* The Organization of the Retina and Visual System
* Effect of Detail on Visual Perception by Jon McLoone, the Wolfram Demonstrations Project
* The Joy of Visual Perception: Resource on the eye's perception abilities.
* VisionScience. Resource for Research in Human and Animal Vision: A collection of resources in vision science and perception
* Visibility in Social Theory and Social Research: An inquiry into the cognitive and social meanings of visibility
* Vision, Scholarpedia: Expert articles about Vision