Binaural recording is a method of recording sound that uses two microphones, arranged with the intent to create a 3-D stereo sound sensation for the listener of actually being in the room with the performers or instruments. This effect is often created using a technique known as dummy head recording, wherein a mannequin head is fitted with a microphone in each ear. Binaural recording is intended for replay using headphones and will not translate properly over stereo speakers. This idea of a three-dimensional or "internal" form of sound has also led to useful advances in other technologies, such as stethoscopes that create "in-head" acoustics and IMAX movies that create a three-dimensional acoustic experience.
The term "binaural" has frequently been confused as a synonym for the word "stereo", due in part to systematic misuse in the mid-1950s by the recording industry, as a marketing buzzword. Conventional stereo recordings do not factor in natural ear spacing or "head shadow" of the head and ears, since these things happen naturally as a person listens, generating interaural time differences (ITDs) and interaural level differences (ILDs) specific to their listening position. Because loudspeaker crosstalk with conventional stereo interferes with binaural reproduction (i.e., because the sound from each channel's speaker is heard by both ears rather than only by the ear on the corresponding side, as would be the case with headphones), either headphones are required, or crosstalk cancellation of signals intended for loudspeakers, such as Ambiophonics, is required. For listening using conventional speaker-stereo, or MP3 players, a pinna-less dummy head may be preferable for quasi-binaural recordings, such as the sphere microphone or Ambiophone. As a general rule, for true binaural results, an audio recording and reproduction system chain, from the microphone to the listener's brain, should contain one and only one set of pinnae (preferably the listener's own), and one head-shadow.
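To give a sense of the size of the interaural time differences mentioned above, the following minimal sketch uses Woodworth's spherical-head approximation, ITD = (r / c)(θ + sin θ), for a distant source. The head radius of 8.75 cm, the speed of sound of 343 m/s, and the function name `woodworth_itd` are assumptions chosen for illustration; real heads are not spheres, so the figures are rough.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value for air at roughly 20 °C
HEAD_RADIUS = 0.0875    # m, assumed average head radius (illustrative)


def woodworth_itd(azimuth_rad, head_radius=HEAD_RADIUS, c=SPEED_OF_SOUND):
    """Approximate ITD in seconds for a far-field source.

    azimuth_rad is the source angle from straight ahead, in radians,
    and the approximation is intended for 0 <= azimuth_rad <= pi/2.
    """
    return (head_radius / c) * (azimuth_rad + math.sin(azimuth_rad))


if __name__ == "__main__":
    for degrees in (0, 30, 60, 90):
        itd_us = woodworth_itd(math.radians(degrees)) * 1e6
        print(f"{degrees:3d} degrees -> about {itd_us:.0f} microseconds")
```

Under these assumptions a source directly to one side produces a delay of well under a millisecond, which is the scale of cue that a dummy head preserves and that loudspeaker crosstalk disturbs.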
History
The history of binaural recording goes back to 1881. The first binaural unit, the théâtrophone, was invented by Clément Ader. It consisted of an array of carbon telephone microphones installed along the front edge of the Opera Garnier. The signal was sent to subscribers through the telephone system, and required that they wear a special headset, which had a tiny speaker for each ear.
In 1978, Lou Reed released the first commercially produced binaural pop record, ''Street Hassle'', a combination of live and studio recordings.
Binaural stayed in the background due to the expensive, specialized equipment required for quality recordings, and the requirement of headphones for proper reproduction. Particularly in pre-Walkman days, most consumers considered headphones an inconvenience, and were only interested in recordings that could be listened to on a home stereo system or in automobiles. Lastly, the kinds of material that suit binaural recording do not typically have a high market value. Studio recordings would benefit little from a binaural setup, beyond natural cross-feed, as the spatial quality of the studio would not be very dynamic or interesting. Recordings that are of interest are live orchestral performances, and ambient "environmental" recordings of city sounds, nature, and other such subject matter.
The modern era has seen a resurgence of interest in binaural, partially due to the widespread availability of headphones, cheaper methods of recording and the general increased commercial interest in 360° audio technology.
The online ASMR community is another movement that has widely employed binaural recordings.
The rise of Dolby Atmos and other 360° audio film technology in commercial entertainment has been accompanied by a rise in the use of binaural simulation. The purpose is to adapt the full 360° soundtrack for headphones and earphones. Users can ostensibly watch 360° films and listen to music with the immersive surround sound experience remaining intact despite using just the two headset speakers. Notably, any full 360° multi-channel soundtrack is automatically converted to simulated binaural audio when listened to with headphones.
In 2020, British film-maker Nicholas Cooley released the binaural short film ''Rear Mirror'', which was featured on a video-on-demand platform (Amazon Prime Video).
In 2021, British singer-songwriter Anna Aarons released the single ''A Perfect Day'' in binaural format.
Recording techniques
With a simple recording method, two microphones are placed 18 cm (7") apart facing away from each other. This method will not create a real binaural recording. The distance and placement roughly approximate the position of an average human's ear canals, but that is not all that is needed. More elaborate techniques exist in pre-packaged forms. A typical binaural recording unit has two high-fidelity microphones mounted in a dummy head, inset in ear-shaped molds to fully capture all of the audio frequency adjustments (known as head-related transfer functions (HRTFs) in the psychoacoustic research community) that happen naturally as sound wraps around the human head and is "shaped" by the form of the outer and inner ear.
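The effect of an HRTF pair can also be imitated in software: filtering a mono signal with a left-ear and a right-ear impulse response (the time-domain form of an HRTF) yields a two-channel signal carrying time and level differences. The sketch below shows this with plain convolution. The impulse responses `HRIR_LEFT` and `HRIR_RIGHT` are made-up toy values, not measured data, and the function names are hypothetical.

```python
def convolve(signal, impulse_response):
    """Direct-form linear convolution of two non-empty lists of floats."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


def binauralize(mono, hrir_left, hrir_right):
    """Filter one mono signal with a left-ear and a right-ear impulse response."""
    return convolve(mono, hrir_left), convolve(mono, hrir_right)


# Toy impulse responses (assumed, not measured) for a source on the
# listener's left: the right ear hears the sound two samples later and
# at half the level, a crude stand-in for ITD and ILD.
HRIR_LEFT = [1.0, 0.0, 0.0]
HRIR_RIGHT = [0.0, 0.0, 0.5]

left, right = binauralize([1.0, 0.5], HRIR_LEFT, HRIR_RIGHT)

if __name__ == "__main__":
    print("left :", left)
    print("right:", right)
```

Measured impulse responses are much longer and also encode the spectral shaping of the pinna, which this toy pair leaves out entirely.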
Re-recording techniques
The technique of binaural re-recording is simple but has not been well established. It follows the same principle as Worldizing, a technique used by film sound designers in which sound is played over a loudspeaker in a real-world location and then re-recorded, so that the recording takes on the characteristics of the real-world environment.
Using space to manipulate a sound and then re-recording it has been done through the use of echo chambers in recording studios for many years. In 1959, an echo chamber was famously used by Irving Townsend during the post-production process of Miles Davis's 1959 album ''Kind of Blue''. "The effect of the echo chamber on ''Kind of Blue'' is just a bit of sweetening. At 30th Street, a line was run from the mixing console down into a low-ceilinged, concrete basement room, about 12 by 15 feet in size, anywhere we set up a speaker and a good omnidirectional microphone."
In binaural re-recording, a binaural microphone is used to record content being played over a multi-channel speaker set-up. The binaural head, or microphone, is therefore theoretically making a recording of how humans will hear multi-channel content. The soundtrack to a film, for example, will be recorded by the binaural microphone with all the environmental cues of the given location, as well as reverberations, including those commonly created by the human torso (assuming a HATS model is used). This method, like certain binaural recordings made with a Neumann KU 100.
Using an MRI scanner, Brüel & Kjær and DTU collected the geometries of a large population of human ears. The full ear canal geometry, including the bony part adjoining the eardrum, was captured, and this data was post-processed to determine the average human ear canal geometry. Based on this, the High-frequency Head and Torso Simulator (HATS) Type 5128 creates a very realistic reproduction of the ear's acoustic properties, covering the full audible frequency range (up to 20 kHz).
Playback
There are some complications with the playback of binaural recordings through headphones. The sound that is picked up by a microphone placed in or at the entrance of the ear canal has a frequency spectrum that is very different from the one that would be picked up by a free-standing microphone. The diffuse-field head-related transfer function (HRTF), that is, the frequency response at the eardrum averaged for sounds coming from all possible directions, is highly irregular, with peaks and dips exceeding 10 dB. Frequencies from around 2 kHz to 5 kHz in particular are strongly amplified as compared to free-field presentation.
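The averaging over directions described above can be sketched numerically. A common convention, assumed here, is to average on a power basis rather than averaging the decibel values directly. The function name `power_average_db` and the sample levels are hypothetical, and the sketch treats the directions as equally weighted, which is only reasonable if they are spread evenly around the listener.

```python
import math


def power_average_db(levels_db):
    """Average per-direction levels (in dB) at one frequency on a power basis."""
    if not levels_db:
        raise ValueError("at least one direction is required")
    mean_power = sum(10 ** (level / 10) for level in levels_db) / len(levels_db)
    return 10 * math.log10(mean_power)


if __name__ == "__main__":
    # Illustrative (not measured) eardrum levels at a single frequency
    # for four source directions, relative to the free field.
    example_levels = [12.0, 8.0, 3.0, -2.0]
    print(f"diffuse-field level: {power_average_db(example_levels):.1f} dB")
```

Repeating this for each frequency gives a single curve that a headphone equalization target can be compared against.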
Known issues
Timbral issues
In January 2012, BBC R&D worked together with BBC Radio 4 to produce a binaural production of ''Private Peaceful'', the book by Michael Morpurgo.
The 88-minute dramatization featured a reproduction of a 5.1 speaker system, and had four variations. At the start of each variation, the listener would hear a series of test signals, allowing them to choose the version that gave the best spatial experience. By doing this, BBC R&D accepted that the success of binaural reproduction varies between listeners, and therefore provided different mixes based on different sets of HRTF data. The release of ''Private Peaceful'' had an accompanying survey which all listeners were asked to complete. It asked how successful the binaural reproduction was for the listener and which version (1 to 4) the listener thought was most successful.
During an interview with Chris Pike from BBC R&D in September 2012, Pike stated that "you may get good spatial impression but timbral coloration is often an issue".
The issue of timbral coloration is mentioned in much spatial enhancement research. It is sometimes attributed to the misuse of HRTF data or to an insufficient amount of it when reproducing binaural audio, or to the fact that the end user simply does not respond well to the collected HRTF data. Francis Rumsey states in the 2011 article "Whose head is it anyway?" that "badly implemented HRTFs can give rise to poor timbral quality, poor externalisation, and a host of other unwanted results".
Getting the HRTF data correct is key to making the final product a success, and making the HRTF data as extensive as possible may leave less room for errors such as timbral issues. The HRTFs used for ''Private Peaceful'' were designed by measuring impulse responses in a reverberant room in order to capture a sense of space, but the result is not very external and there are obvious timbral issues, as pointed out by Pike.
Juha Merimaa of Sennheiser Research Laboratories in California discusses using HRTF filters and EQ to reduce timbral issues in his paper "Modification of HRTF Filters to Reduce Timbral Effects in Binaural Synthesis, Part 2: Individual HRTFs" (2010). His research found that modifying the HRTF filters to reduce timbral issues did not affect the spatial localisation previously achieved with the data when tested on a panel of listeners. This shows that there are ways of reducing the timbral effects of HRTF processing, but they involve further EQ manipulation of the audio. If this route is explored further, researchers will have to accept that the audio is being heavily manipulated to achieve a greater sense of spatial awareness, and that this manipulation causes irreversible changes to the audio, something content creators may not be happy with. Consideration will have to be given to how much manipulation is appropriate and to what extent, if any, it affects the end user's experience. It is also important to consider the room in which the binaural room impulse response (BRIR) and HRTF data were collected, as different rooms will influence the end results.
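One generic way to picture an equalization that changes timbre without disturbing interaural level differences is to apply the same per-frequency gain to both ears. The sketch below is an illustration under that assumption only and is not a reproduction of the method in the paper cited above. The function name `flatten_rms_sum` and the sample magnitudes are hypothetical.

```python
import math


def flatten_rms_sum(left_mag, right_mag):
    """Scale both ears' magnitudes so their RMS is 1 in every frequency bin.

    The same gain is applied to left and right in a given bin, so the
    left/right ratio (the level difference) in that bin is unchanged.
    """
    out_left, out_right = [], []
    for l, r in zip(left_mag, right_mag):
        rms = math.sqrt((l * l + r * r) / 2)
        if rms == 0.0:
            # Nothing to normalize in a silent bin.
            out_left.append(0.0)
            out_right.append(0.0)
            continue
        out_left.append(l / rms)
        out_right.append(r / rms)
    return out_left, out_right


if __name__ == "__main__":
    # Illustrative linear magnitudes for two frequency bins.
    eq_left, eq_right = flatten_rms_sum([2.0, 0.5], [1.0, 0.5])
    print(eq_left, eq_right)
```

Because the gain is shared between the ears, the overall spectrum is flattened while the per-bin level difference that carries directional information is kept, which is the kind of trade-off the paragraph above describes.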
When recording a series of HRTF data, only a limited number of measurements can be taken for distribution, and end users will have to find the best results for themselves. The best HRTF data for any individual is the data collected from their own pinnae, which content creators for mobile applications do not currently do. Because of this, timbral issues may be unavoidable when using non-personal HRTF data, or when distributing audio that has already been affected by spatial manipulation. It may be that the most feasible route to improving spatial awareness in audio is to explore the possibilities of head tracking or other methods of collecting HRTF data at the user end.
Timbral issues related to headphones
The headphones used by consumers will inevitably make an impact on the end results. An issue surrounding headphone use is the wide range in quality of consumer level headphones. Many mp3 players and tablets are traditionally supplied with low budget earphones and these can cause problems for spatially enhanced audio.
Ideal listening conditions will most likely be experienced with headphones designed and calibrated to give as flat a frequency response as possible, in order to reduce colouration of the audio the user is listening to. In most circumstances this has not seemed enough of a problem for end users to invest in headphones that let them hear audio exactly as the creator of the content intended; they instead continue to use bundled headphones, or in some cases invest in headphones endorsed and branded by certain artists. As previously discussed, timbral effects are present when using BRIR and HRTF data to create spatially improved audio, the techniques used by Chris Pike and BBC R&D.
The results exhibited timbral issues, and therefore this method may not yet be a successful way of creating spatially enhanced audio for headphones, but timbral issues also arise from headphone choice. "Are timbral issues brought about by the use of BRIR and HRTF data any worse than the difference between some cheap headphones that you get with an mp3 player versus some nice Sennheisers?"
Commonly used binaural microphones
Brüel & Kjær Head and Torso Simulators (HATS)
Designed for in-situ electroacoustic tests on, for example, telephone handsets, headsets, audio conference devices, microphones, headphones, hearing aids and hearing protectors.
Neumann KU 100
The Neumann KU 100 is a dummy head microphone used to record in binaural stereo. ''"It resembles the human head and has two microphone capsules built into the ears"''.
The Neumann is a commonly used binaural microphone and is used by BBC R&D teams.
G.R.A.S. Head & Torso Simulator KEMAR (HATS)
KEMAR was initially invented in collaboration with the audiological industry for use in hearing aid development, and is still the de facto standard for this industry. Since then, the use of KEMAR has spread into many other industries, such as telecommunications, hearing protection testing and automotive development. KEMAR is designed, using large-scale statistical research, to be as close to average human measurements as possible. The KEMAR model is also the only microphone on this list to feature a torso model. Torso reflections have been seen to be a considerable contributor to creating a successful binaural recording.
3Dio range
The 3Dio range of binaural microphones features two silicone ear (pinna) moulds separated by a distance close to the average distance between human ears. The microphones placed inside the ears range from the Primo EM172 in the Free Space and Free Space XLR models to DPA 4060s in the Pro II model. The 3Dio range is considerably cheaper than, for example, the Neumann KU 100 and is therefore used more at a consumer to prosumer level. The main difference between the 3Dio models and the KEMAR or KU 100 is the absence of a head model. The 3Dio relies entirely on the use of pinna moulds to achieve a binaural effect from the stereo recording.
Sound.Codes Kaan
Kaan is a DIY binaural microphone for sound artists. It is a 3D-printed model of an averaged human ear canal, intended to approximate the average resonant frequency of the human ear canal. Its form factor and weight make it easy to sample environments that would otherwise be harder to record with other microphones, ADCs and recorders.
Primo EM172 capsules are placed at the eardrum position, 235 mm apart, which is taken as the average earlobe-to-earlobe distance. The sigmoid form of the canal in Kaan compensates to a large extent for the missing head.
Sound Professionals SP-TFB-2
An in-ear wearable stereo microphone used like earphones, placed inside the human pinna. This microphone uses the user's pinna to create the binaural effect.
ZiBionic
The ZiBionic One is a binaural microphone for ASMR recording. The specific shapes and sizes of a binaural recording device ''"affect the behaviour - such as absorption, transmission, reflection, interference - of acoustic waves".'' Like 3Dio, ZiBionic has no head model, but its body is shaped to provide a head shadow such that sources used in ASMR recording techniques (close-range sound sources, for example whispering) are picked up more effectively by the two capsules inside the ear-shaped microphones.
Hooke Verse
The Hooke Verse is a relatively new binaural device: an in-ear wearable set of microphones that connects to recording devices over Bluetooth with lossless recording. The codec developed for it allows the user to capture audio along with video. Additionally, the device uses microphone windscreens to cut down on wind noise, a common problem with wearable devices and smartphones. ''"Smartphone manufacturers face a double problem with wind noise. Not only is turbulence present in the airflow at large, but the rectangular shape of a smartphone produces eddies around itself."''
See also
* Binaural beats
* Binaural fusion
* Dynamic Binaural recording
* Franssen effect
* Precedence effect
External links
* The Verge article: Surrounded by sound: how 3D audio hacks your brain
* Examples of 3D binaural recordings