HOME

TheInfoList



OR:

Psychoacoustics is the branch of
psychophysics Psychophysics is the field of psychology which quantitatively investigates the relationship between physical stimulus (physiology), stimuli and the sensation (psychology), sensations and perceptions they produce. Psychophysics has been described ...
involving the scientific study of the
perception Perception () is the organization, identification, and interpretation of sensory information in order to represent and understand the presented information or environment. All perception involves signals that go through the nervous syste ...
of
sound In physics, sound is a vibration that propagates as an acoustic wave through a transmission medium such as a gas, liquid or solid. In human physiology and psychology, sound is the ''reception'' of such waves and their ''perception'' by the br ...
by the human
auditory system The auditory system is the sensory system for the sense of hearing. It includes both the ear, sensory organs (the ears) and the auditory parts of the sensory system. System overview The outer ear funnels sound vibrations to the eardrum, incre ...
. It is the branch of science studying the
psychological Psychology is the scientific study of mind and behavior. Its subject matter includes the behavior of humans and nonhumans, both consciousness, conscious and Unconscious mind, unconscious phenomena, and mental processes such as thoughts, feel ...
responses associated with sound including
noise Noise is sound, chiefly unwanted, unintentional, or harmful sound considered unpleasant, loud, or disruptive to mental or hearing faculties. From a physics standpoint, there is no distinction between noise and desired sound, as both are vibrat ...
,
speech Speech is the use of the human voice as a medium for language. Spoken language combines vowel and consonant sounds to form units of meaning like words, which belong to a language's lexicon. There are many different intentional speech acts, suc ...
, and
music Music is the arrangement of sound to create some combination of Musical form, form, harmony, melody, rhythm, or otherwise Musical expression, expressive content. Music is generally agreed to be a cultural universal that is present in all hum ...
. Psychoacoustics is an interdisciplinary field including psychology,
acoustics Acoustics is a branch of physics that deals with the study of mechanical waves in gases, liquids, and solids including topics such as vibration, sound, ultrasound and infrasound. A scientist who works in the field of acoustics is an acoustician ...
, electronic engineering, physics, biology, physiology, and computer science.


Background

Hearing is not a purely mechanical phenomenon of
wave propagation In physics, mathematics, engineering, and related fields, a wave is a propagating dynamic disturbance (change from equilibrium) of one or more quantities. '' Periodic waves'' oscillate repeatedly about an equilibrium (resting) value at some f ...
, but is also a sensory and perceptual event. When a person hears something, that something arrives at the ear as a mechanical sound wave traveling through the air, but within the ear it is transformed into neural
action potentials An action potential (also known as a nerve impulse or "spike" when in a neuron) is a series of quick changes in voltage across a cell membrane. An action potential occurs when the membrane potential of a specific cell rapidly rises and falls. ...
. These nerve pulses then travel to the brain where they are perceived. Hence, in many problems in acoustics, such as for audio processing, it is advantageous to take into account not just the mechanics of the environment, but also the fact that both the ear and the brain are involved in a person's listening experience. The
inner ear The inner ear (internal ear, auris interna) is the innermost part of the vertebrate ear. In vertebrates, the inner ear is mainly responsible for sound detection and balance. In mammals, it consists of the bony labyrinth, a hollow cavity in the ...
, for example, does significant
signal processing Signal processing is an electrical engineering subfield that focuses on analyzing, modifying and synthesizing ''signals'', such as audio signal processing, sound, image processing, images, Scalar potential, potential fields, Seismic tomograph ...
in converting sound
waveform In electronics, acoustics, and related fields, the waveform of a signal is the shape of its Graph of a function, graph as a function of time, independent of its time and Magnitude (mathematics), magnitude Scale (ratio), scales and of any dis ...
s into neural stimuli, this processing renders certain differences between waveforms imperceptible.
Data compression In information theory, data compression, source coding, or bit-rate reduction is the process of encoding information using fewer bits than the original representation. Any particular compression is either lossy or lossless. Lossless compressi ...
techniques, such as MP3, make use of this fact. In addition, the ear has a nonlinear response to sounds of different intensity levels; this nonlinear response is called
loudness In acoustics, loudness is the subjectivity, subjective perception of sound pressure. More formally, it is defined as the "attribute of auditory sensation in terms of which sounds can be ordered on a scale extending from quiet to loud". The relat ...
.
Telephone network A telephone network is a telecommunications network that connects telephones, which allows telephone calls between two or more parties, as well as newer features such as fax and internet. The idea was revolutionized in the 1920s, as more and more ...
s and audio
noise reduction Noise reduction is the process of removing noise from a signal. Noise reduction techniques exist for audio and images. Noise reduction algorithms may distort the signal to some degree. Noise rejection is the ability of a circuit to isolate an u ...
systems make use of this fact by nonlinearly compressing data samples before transmission and then expanding them for playback. Another effect of the ear's nonlinear response is that sounds that are close in frequency produce phantom beat notes, or
intermodulation Intermodulation (IM) or intermodulation distortion (IMD) is the amplitude modulation of Signal (electrical engineering), signals containing two or more different frequencies, caused by non-linear, nonlinearities or time variance in a system. ...
distortion products.


Limits of perception

The human ear can nominally hear sounds in the range . The upper limit tends to decrease with age; most adults are unable to hear above . Under ideal laboratory conditions, the lowest frequency that has been identified as a musical tone is 12 Hz. Tones between 4 and 16 Hz can be perceived via the body's sense of touch. Human perception of audio signal time separation has been measured to be less than . This does not mean that frequencies above are audible, but that time discrimination is not directly coupled with frequency range. Frequency resolution of the ear is about 3.6 Hz within the octave of That is, changes in pitch larger than 3.6 Hz can be perceived in a clinical setting. However, even smaller pitch differences can be perceived through other means. For example, the interference of two pitches can often be heard as a repetitive variation in the volume of the tone. This amplitude modulation occurs with a frequency equal to the difference in frequencies of the two tones and is known as beating. The
semitone A semitone, also called a minor second, half step, or a half tone, is the smallest musical interval commonly used in Western tonal music, and it is considered the most dissonant when sounded harmonically. It is defined as the interval between ...
scale used in Western musical notation is not a linear frequency scale but logarithmic. Other scales have been derived directly from experiments on human hearing perception, such as the
mel scale The mel scale (after the word ''melody'') is a perceptual scale of pitches judged by listeners to be equal in distance from one another. The reference point between this scale and normal frequency measurement is defined by assigning a percept ...
and
Bark scale The Bark scale is a psychoacoustical scale proposed by Eberhard Zwicker in 1961. It is named after Heinrich Barkhausen, who proposed the first subjective measurements of loudness.Zwicker, E. (1961),Subdivision of the audible frequency range i ...
(these are used in studying perception, but not usually in musical composition), and these are approximately logarithmic in frequency at the high-frequency end, but nearly linear at the low-frequency end. The intensity range of audible sounds is enormous. Human eardrums are sensitive to variations in sound pressure and can detect pressure changes from as small as a few micropascals (μPa) to greater than . For this reason,
sound pressure level Sound pressure or acoustic pressure is the local pressure deviation from the ambient (average or equilibrium) atmospheric pressure, caused by a sound wave. In air, sound pressure can be measured using a microphone, and in water with a hydrophone ...
is also measured logarithmically, with all pressures referenced to (or ). The lower limit of audibility is therefore defined as , but the upper limit is not as clearly defined. The upper limit is more a question of the potential to cause
noise-induced hearing loss Noise-induced hearing loss (NIHL) is a Hearing loss, hearing impairment resulting from exposure to loud sound. People may have a loss of perception of a narrow range of Frequency, frequencies or impaired perception of sound including hyperacusi ...
. A more rigorous exploration of the lower limits of audibility determines that the minimum threshold at which a sound can be heard is frequency dependent. By measuring this minimum intensity for test tones of various frequencies, a frequency-dependent absolute threshold of hearing (ATH) curve may be derived. Typically, the ear shows a peak of sensitivity (i.e., its lowest ATH) between , though the threshold changes with age, with older ears showing decreased sensitivity above 2 kHz. The ATH is the lowest of the
equal-loudness contour An equal-loudness contour is a measure of sound pressure level, over the frequency spectrum, for which a listener perceives a constant loudness when presented with pure steady tones. The unit of measurement for loudness levels is the phon an ...
s. Equal-loudness contours indicate the sound pressure level (dB SPL), over the range of audible frequencies, that are perceived as being of equal loudness. Equal-loudness contours were first measured by Fletcher and Munson at
Bell Labs Nokia Bell Labs, commonly referred to as ''Bell Labs'', is an American industrial research and development company owned by Finnish technology company Nokia. With headquarters located in Murray Hill, New Jersey, Murray Hill, New Jersey, the compa ...
in 1933 using pure tones reproduced via headphones, and the data they collected are called
Fletcher–Munson curves An equal-loudness contour is a measure of sound pressure level, over the frequency spectrum, for which a listener perceives a constant loudness when presented with pure steady tones. The unit of measurement for loudness levels is the phon an ...
. Because subjective loudness was difficult to measure, the Fletcher–Munson curves were averaged over many subjects. Robinson and Dadson refined the process in 1956 to obtain a new set of equal-loudness curves for a frontal sound source measured in an
anechoic chamber An anechoic chamber (''an-echoic'' meaning "non-reflective" or "without echoes") is a room designed to stop reflection (physics), reflections or Echo (phenomenon), echoes of either sound or electromagnetic waves. They are also often isolate ...
. The Robinson-Dadson curves were standardized as in 1986. In 2003, was revised using data collected from 12 international studies.


Sound localization

Sound localization Sound localization is a listener's ability to identify the location or origin of a detected sound in direction and distance. The sound localization mechanisms of the mammalian auditory system have been extensively studied. The auditory system u ...
is the process of determining the location of a sound source. The brain utilizes subtle differences in loudness, tone and timing between the two ears to allow us to localize sound sources.Thompson, Daniel M. Understanding Audio: Getting the Most out of Your Project or Professional Recording Studio. Boston, MA: Berklee, 2005. Print. Localization can be described in terms of three-dimensional position: the
azimuth An azimuth (; from ) is the horizontal angle from a cardinal direction, most commonly north, in a local or observer-centric spherical coordinate system. Mathematically, the relative position vector from an observer ( origin) to a point ...
or horizontal angle, the
zenith The zenith (, ) is the imaginary point on the celestial sphere directly "above" a particular location. "Above" means in the vertical direction (Vertical and horizontal, plumb line) opposite to the gravity direction at that location (nadir). The z ...
or vertical angle, and the distance (for static sounds) or velocity (for moving sounds).Roads, Curtis. The Computer Music Tutorial. Cambridge, MA: MIT, 2007. Print. Humans, as most four-legged animals, are adept at detecting direction in the horizontal, but less so in the vertical directions due to the ears being placed symmetrically. Some species of owls have their ears placed asymmetrically and can detect sound in all three planes, an adaptation to hunt small mammals in the dark.


Masking effects

Suppose a listener can hear a given acoustical signal under silent conditions. When a signal is playing while another sound is being played, the signal has to be stronger for the listener to hear it. The interfering signal is known as the ''masker'' and the impeded listening, ''masking''. The masker does not need to have the frequency components of the original signal for masking to happen. A masked signal can be heard even though it is weaker than the masker. Masking happens when a signal and a masker are played together—for instance, when one person whispers while another person shouts—and the listener doesn't hear the weaker signal as it has been masked by the louder masker. Masking can also happen to a signal before a masker starts or after a masker stops. For example, a sudden loud clap sound can make sounds inaudible immediately preceding or following. The effect of
backward masking The concept of backward masking originated in psychoacoustics, referring to temporal masking of quiet sounds that occur moments before a louder sound. In cognitive psychology, visual backward masking involves presenting one visual Stimulus (physio ...
is weaker than forward masking. The masking effect has been widely studied in psychoacoustical research and are exploited in lossy audio encoding, such as MP3.


Missing fundamental

When presented with a harmonic series of frequencies in the relationship 2''f'', 3''f'', 4''f'', 5''f'', etc. (where ''f'' is a specific frequency), humans tend to perceive that the pitch is ''f''. An audible example can be found on YouTube.


Software

The psychoacoustic model provides for high quality lossy signal compression by describing which parts of a given digital audio signal can be removed (or aggressively compressed) safely—that is, without significant losses in the (consciously) perceived quality of the sound. It can explain how a sharp clap of the hands might seem painfully loud in a quiet library but is hardly noticeable after a car backfires on a busy, urban street. This provides great benefit to the overall compression ratio, and psychoacoustic analysis routinely leads to compressed music files that are one-tenth to one-twelfth the size of high-quality masters, but with discernibly less proportional quality loss. Such compression is a feature of nearly all modern lossy audio compression formats. Some of these formats include
Dolby Digital Dolby Digital, originally synonymous with Dolby AC-3 (see below), is the name for a family of audio compression technologies developed by Dolby Laboratories. Called Dolby Stereo Digital until 1995, it is lossy compression (except for Dolby Tr ...
(AC-3), MP3, Opus,
Ogg Vorbis Vorbis is a free and open-source software project headed by the Xiph.Org Foundation. The project produces an audio coding format and software reference encoder/decoder (codec) for lossy audio compression, libvorbis. Vorbis is most common ...
, AAC, WMA,
MPEG-1 Layer II MP2 (formally MPEG-1 Audio Layer II or MPEG-2 Audio Layer II, sometimes incorrectly called Musicam) is a lossy data compression, lossy audio compression format. It is standardised as one of the three audio codecs of MPEG-1 alongside MPEG-1 Audio ...
(used for
digital audio broadcasting Digital Audio Broadcasting (DAB) is a digital radio international standard, standard for broadcasting digital audio radio services in many countries around the world, defined, supported, marketed and promoted by the WorldDAB organisation. T ...
in several countries), and ATRAC, the compression used in
MiniDisc MiniDisc (MD) is an erasable magneto-optical disc-based data storage format offering a capacity of 60, 74, or 80 minutes of digitized audio. Sony announced the MiniDisc in September 1992 and released it in November of that year for sale i ...
and some
Walkman is a brand of Personal stereo, portable audio players manufactured by Sony since 1979. It was originally introduced as a portable Compact Cassette, cassette player and later expanded to include a range of portable audio products. Since 2011, ...
models. Psychoacoustics is based heavily on
human anatomy Human anatomy (gr. ἀνατομία, "dissection", from ἀνά, "up", and τέμνειν, "cut") is primarily the scientific study of the morphology of the human body. Anatomy is subdivided into gross anatomy and microscopic anatomy. Gross ...
, especially the ear's limitations in perceiving sound as outlined previously. To summarize, these limitations are: * High-frequency limit * Absolute threshold of hearing * Temporal masking (forward masking, backward masking) * Simultaneous masking (also known as spectral masking) A compression algorithm can assign a lower priority to sounds outside the range of human hearing. By carefully shifting bits away from the unimportant components and toward the important ones, the algorithm ensures that the sounds a listener is most likely to perceive are most accurately represented.


Music

Psychoacoustics includes topics and studies that are relevant to
music psychology The psychology of music, or music psychology, is a branch of psychology, cognitive science, neuroscience, and/or musicology. It aims to explain and understand musical behaviour and experience, including the processes through which music is pe ...
and
music therapy Music therapy, an allied health profession, "is the clinical and evidence-based use of music interventions to accomplish individualized goals within a therapeutic relationship by a credentialed professional who has completed an approved music t ...
. Theorists such as Benjamin Boretz consider some of the results of psychoacoustics to be meaningful only in a musical context. Irv Teibel's '' Environments series'' LPs (1969–79) are an early example of commercially available sounds released expressly for enhancing psychological abilities.


Applied psychoacoustics

Psychoacoustics has long enjoyed a symbiotic relationship with
computer science Computer science is the study of computation, information, and automation. Computer science spans Theoretical computer science, theoretical disciplines (such as algorithms, theory of computation, and information theory) to Applied science, ...
. Internet pioneers J. C. R. Licklider and Bob Taylor both completed graduate-level work in psychoacoustics, while
BBN Technologies Raytheon BBN (originally Bolt, Beranek and Newman, Inc.) is an American research and development company based in Cambridge, Massachusetts. In 1966, the Franklin Institute awarded the firm the Frank P. Brown Medal, in 1999 BBN received the ...
originally specialized in consulting on acoustics issues before it began building the first
packet-switched network In telecommunications, packet switching is a method of grouping data into short messages in fixed format, i.e. '' packets,'' that are transmitted over a digital network. Packets consist of a header and a payload. Data in the header is used b ...
. Licklider wrote a paper entitled "A duplex theory of pitch perception". Psychoacoustics is applied within many fields of software development, where developers map proven and experimental mathematical patterns in digital signal processing. Many audio compression codecs such as MP3 and Opus use a psychoacoustic model to increase compression ratios. The success of conventional audio systems for the reproduction of music in theatres and homes can be attributed to psychoacoustics and psychoacoustic considerations gave rise to novel audio systems, such as psychoacoustic sound field synthesis. Furthermore, scientists have experimented with limited success in creating new acoustic weapons, which emit frequencies that may impair, harm, or kill. Psychoacoustics are also leveraged in sonification to make multiple independent data dimensions audible and easily interpretable. This enables auditory guidance without the need for spatial audio and in sonification
computer games A video game or computer game is an electronic game that involves interaction with a user interface or input device (such as a joystick, game controller, controller, computer keyboard, keyboard, or motion sensing device) to generate visual fe ...
and other applications, such as drone flying and image-guided surgery. It is also applied today within music, where musicians and artists continue to create new auditory experiences by masking unwanted frequencies of instruments, causing other frequencies to be enhanced. Yet another application is in the design of small or lower-quality loudspeakers, which can use the phenomenon of
missing fundamental The pitch being perceived with the first harmonic being absent in the waveform is called the missing fundamental phenomenon. It is established in psychoacoustics that the auditory system, with its natural tendency to distinguish a tone from anoth ...
s to give the effect of bass notes at lower frequencies than the loudspeakers are physically able to produce (see references). Automobile manufacturers engineer their engines and even doors to have a certain sound.


See also


Related fields

*
Cognitive neuroscience of music The neuroscience of music is the scientific study of brain-based mechanisms involved in the cognitive processes underlying music. These behaviours include music listening, Performance, performing, Musical composition, composing, reading, writing, a ...
*
Music psychology The psychology of music, or music psychology, is a branch of psychology, cognitive science, neuroscience, and/or musicology. It aims to explain and understand musical behaviour and experience, including the processes through which music is pe ...


Psychoacoustic topics

*
A-weighting A-weighting is a form of frequency weighting and the most commonly used of a family of curves defined in the International standard IEC 61672:2003 and various national standards relating to the measurement of sound pressure level. A-weighti ...
, a commonly used perceptual loudness
transfer function In engineering, a transfer function (also known as system function or network function) of a system, sub-system, or component is a function (mathematics), mathematical function that mathematical model, models the system's output for each possible ...
*
ABX test An ABX test is a method of comparing two choices of sensory stimuli to identify detectable differences between them. A subject is presented with two known samples (sample , the first reference, and sample , the second reference) followed by one un ...
*
Audiology Audiology (from Latin 'to hear'; and from Ancient Greek, Greek branch of learning , ''wikt:-logia, -logia'') is a branch of science that studies hearing, balance, and related disorders. Audiologists treat those with hearing loss and proactivel ...
*
Auditory illusion Auditory illusions are Illusion, illusions of real sound or outside stimulus. These false perceptions are the equivalent of an optical illusion: the listener hears either sounds which are not present in the Stimulus (physiology), stimulus, or sound ...
* Auditory scene analysis incl. 3D-sound perception, localization *
Binaural beats In acoustics, a beat is an Interference (wave propagation), interference pattern between two sounds of slightly different frequency, frequencies, ''perceived'' as a periodic variation in amplitude (music), volume whose rate is the Difference (math ...
*
Blind signal separation Source separation, blind signal separation (BSS) or blind source separation, is the separation of a set of source signal processing, signals from a set of mixed signals, without the aid of information (or with very little information) about the s ...
* Combination tone (also Tartini tone) * Deutsch's Scale illusion * Equivalent rectangular bandwidth (ERB) * Franssen effect * Glissando illusion * Hypersonic effect * Language processing * Levitin effect *
Misophonia Misophonia (or selective sound sensitivity syndrome) is a disorder of decreased Distress tolerance, tolerance to specific sounds or their associated Stimulus (psychology), stimuli, or cues. These cues, known as "triggers", are experienced as Dis ...
*
Musical tuning In music, there are two common meanings for tuning: * #Tuning practice, Tuning practice, the act of tuning an instrument or voice. * #Tuning systems, Tuning systems, the various systems of Pitch (music), pitches used to tune an instrument, and ...
* Noise health effects * Octave illusion *
Pitch (music) Pitch is a perception, perceptual property that allows sounds to be ordered on a frequency-related scale (music), scale, or more commonly, pitch is the quality that makes it possible to judge sounds as "higher" and "lower" in the sense associate ...
*
Precedence effect The precedence effect or law of the first wavefront is a binaural psychoacoustical effect concerning sound reflection and the perception of echoes. When two versions of the same sound presented are separated by a sufficiently short time delay ...
*
Psycholinguistics Psycholinguistics or psychology of language is the study of the interrelation between linguistic factors and psychological aspects. The discipline is mainly concerned with the mechanisms by which language is processed and represented in the mind ...
* Rate-distortion theory *
Sound localization Sound localization is a listener's ability to identify the location or origin of a detected sound in direction and distance. The sound localization mechanisms of the mammalian auditory system have been extensively studied. The auditory system u ...
* Sound of fingernails scraping chalkboard * Sound masking *
Speech perception Speech perception is the process by which the sounds of language are heard, interpreted, and understood. The study of speech perception is closely linked to the fields of phonology and phonetics in linguistics and cognitive psychology and percept ...
*
Speech recognition Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also ...
*
Timbre In music, timbre (), also known as tone color or tone quality (from psychoacoustics), is the perceived sound of a musical note, sound or tone. Timbre distinguishes sounds according to their source, such as choir voices and musical instrument ...
* Tritone paradox


References


Notes


Sources

*E. Larsen and R.M. Aarts (2004)
Audio Bandwidth extension. Application of Psychoacoustics, Signal Processing and Loudspeaker Design.
J. Wiley. * *


External links


The Musical Ear—Perception of Sound
*—Simulation of Free-field Hearing by Head Phones
GPSYCHO—An Open-source Psycho-Acoustic and Noise-Shaping Model for ISO-Based MP3 Encoders.Definition of: perceptual audio codingThe MP3 as Standard Object
{{Authority control Cognitive musicology Music psychology Acoustics