HOME

TheInfoList



OR:

A vocoder (, a portmanteau of ''voice'' and ''encoder'') is a category of speech coding that analyzes and
synthesizes As a topic of chemistry, chemical synthesis (or combination) is the artificial execution of chemical reactions to obtain one or several products. This occurs by physical and chemical manipulations usually involving one or more reactions. In mode ...
the human voice signal for audio data compression,
multiplexing In telecommunications and computer networking, multiplexing (sometimes contracted to muxing) is a method by which multiple analog or digital signals are combined into one signal over a shared medium. The aim is to share a scarce resource - a ...
, voice encryption or voice transformation. The vocoder was invented in 1938 by Homer Dudley at Bell Labs as a means of synthesizing human speech. This work was developed into the channel vocoder which was used as a voice codec for telecommunications for speech coding to conserve bandwidth in transmission. By encrypting the control signals, voice transmission can be secured against interception. Its primary use in this fashion is for secure radio communication. The advantage of this method of encryption is that none of the original signal is sent, only envelopes of the bandpass filters. The receiving unit needs to be set up in the same filter configuration to re-synthesize a version of the original signal spectrum. The vocoder has also been used extensively as an
electronic musical instrument An electronic musical instrument or electrophone is a musical instrument that produces sound using electronic circuitry. Such an instrument sounds by outputting an electrical, electronic or digital audio signal that ultimately is plugged into ...
. The decoder portion of the vocoder, called a voder, can be used independently for speech synthesis.


Theory

The
human voice The human voice consists of sound made by a human being using the vocal tract, including talking, singing, laughing, crying, screaming, shouting, humming or yelling. The human voice frequency is specifically a part of human sound production ...
consists of sounds generated by the opening and closing of the
glottis The glottis is the opening between the vocal folds (the rima glottidis). The glottis is crucial in producing vowels and voiced consonants. Etymology From Ancient Greek ''γλωττίς'' (glōttís), derived from ''γλῶττα'' (glôtta), va ...
by the vocal cords, which produces a periodic waveform with many
harmonic A harmonic is a wave with a frequency that is a positive integer multiple of the ''fundamental frequency'', the frequency of the original periodic signal, such as a sinusoidal wave. The original signal is also called the ''1st harmonic'', the ...
s. This basic sound is then filtered by the nose and throat (a complicated resonant piping system) to produce differences in harmonic content ( formants) in a controlled way, creating the wide variety of sounds used in speech. There is another set of sounds, known as the unvoiced and
plosive In phonetics, a plosive, also known as an occlusive or simply a stop, is a pulmonic consonant in which the vocal tract is blocked so that all airflow ceases. The occlusion may be made with the tongue tip or blade (, ), tongue body (, ), lips ...
sounds, which are created or modified by the mouth in different fashions. The vocoder examines speech by measuring how its spectral characteristics change over time. This results in a series of signals representing these frequencies at any particular time as the user speaks. In simple terms, the signal is split into a number of frequency bands (the larger this number, the more accurate the analysis) and the level of signal present at each frequency band gives the instantaneous representation of the spectral energy content. To recreate speech, the vocoder simply reverses the process, processing a broadband noise source by passing it through a stage that filters the frequency content based on the originally recorded series of numbers. Specifically, in the encoder, the input is passed through a multiband filter, then the output of each band is measured using an envelope follower, and the signals from the envelope followers are transmitted to the decoder. The decoder applies these as control signals to corresponding amplifiers of the output filter channels. Information about the instantaneous frequency of the original voice signal (as distinct from its spectral characteristic) is discarded; it was not important to preserve this for the vocoder's original use as an encryption aid. It is this dehumanizing aspect of the vocoding process that has made it useful in creating special voice effects in popular music and audio entertainment. Instead of a point-by-point recreation of the waveform, the vocoder process sends only the parameters of the vocal model over the communication link. Since the parameters change slowly compared to the original speech waveform, the bandwidth required to transmit speech can be reduced. This allows more speech channels to utilize a given communication channel, such as a radio channel or a submarine cable. Analog vocoders typically analyze an incoming signal by splitting the signal into multiple tuned frequency bands or ranges. To reconstruct the signal, a carrier signal is sent through a series of these tuned bandpass filters. In the example of a typical robot voice the carrier is noise or a
sawtooth waveform The sawtooth wave (or saw wave) is a kind of non-sinusoidal waveform. It is so named based on its resemblance to the teeth of a plain-toothed saw with a zero rake angle. A single sawtooth, or an intermittently triggered sawtooth, is called a ...
. There are usually between 8 and 20 bands. The amplitude of the modulator for each of the individual analysis bands generates a voltage that is used to control amplifiers for each of the corresponding carrier bands. The result is that frequency components of the modulating signal are mapped onto the carrier signal as discrete amplitude changes in each of the frequency bands. Often there is an unvoiced band or sibilance channel. This is for frequencies that are outside the analysis bands for typical speech but are still important in speech. Examples are words that start with the letters ''s'', ''f'', ''ch'' or any other sibilant sound. Using this band produces recognizable speech, although somewhat mechanical sounding. Vocoders often include a second system for generating unvoiced sounds, using a
noise generator A noise generator is a circuit that produces electrical noise (i.e., a random signal). Noise generators are used to test signals for measuring noise figure, frequency response, and other parameters. Noise generators are also used for generating ...
instead of the fundamental frequency. This is mixed with the carrier output to increase clarity. In the channel vocoder algorithm, among the two components of an analytic signal, considering only the amplitude component and simply ignoring the phase component tends to result in an unclear voice; on methods for rectifying this, see phase vocoder.


History

The development of a vocoder was started in 1928 by Bell Labs engineer Homer Dudley, who was granted patents for it on March 21, 1939, (filed October 30, 1935) and Nov 16, 1937. To demonstrate the
speech synthesis Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal languag ...
ability of its decoder section, the voder (voice operating demonstrator) was introduced to the public at the AT&T building at the 1939–1940 New York World's Fair. The voder consisted of an
electronic oscillator An electronic oscillator is an electronic circuit that produces a periodic, oscillation, oscillating electronic signal, often a sine wave or a square wave or a triangle wave. Oscillation, Oscillators convert direct current (DC) from a power supp ...
a sound source of pitched tone and
noise generator A noise generator is a circuit that produces electrical noise (i.e., a random signal). Noise generators are used to test signals for measuring noise figure, frequency response, and other parameters. Noise generators are also used for generating ...
for
hiss Hiss or Hissing may refer to: * Hiss (electromagnetic), a wave generated in the plasma of the Earth's ionosphere or magnetosphere * Hiss (surname) * ''Hissing'' (manhwa), a Korean manhwa series by Kang EunYoung * Noise (electronics) or electroni ...
, a 10-band resonator filters with variable-gain amplifiers as a vocal tract, and the manual controllers including a set of pressure-sensitive keys for filter control, and a foot pedal for pitch control of tone. Based on
See
schematic diagram of the Voder synthesizer
The filters controlled by keys convert the tone and the hiss into vowels, consonants, and inflections. This was a complex machine to operate, but a skilled operator could produce recognizable speech. A demonstration of the voder (not the vocoder). Dudley's vocoder was used in the SIGSALY system, which was built by Bell Labs engineers in 1943. SIGSALY was used for encrypted voice communications during World War II. The KO-6 voice coder was released in 1949 in limited quantities; it was a close approximation to the SIGSALY at 1200 bit/s. In 1953, KY-9 THESEUS 1650 bit/s voice coder used solid-state logic to reduce the weight to from SIGSALY's , and in 1961 the HY-2 voice coder, a 16-channel 2400 bit/s system, weighed and was the last implementation of a channel vocoder in a secure speech system. Later work in this field has since used digital speech coding. The most widely used speech coding technique is linear predictive coding (LPC). Another speech coding technique, adaptive differential pulse-code modulation (ADPCM), was developed by P. Cummiskey, Nikil S. Jayant and James L. Flanagan at Bell Labs in 1973.


Applications

* Terminal equipment for systems based on digital mobile radio (DMR). * Digital voice scrambling and encryption * Cochlear implants: noise and tone vocoding is used to simulate the effects of cochlear implants. * Musical and other artistic effects


Modern implementations

Even with the need to record several frequencies, and additional unvoiced sounds, the compression of vocoder systems is impressive. Standard speech-recording systems capture frequencies from about 500 Hz to 3,400 Hz, where most of the frequencies used in speech lie, typically using a sampling rate of 8 kHz (slightly greater than the Nyquist rate). The sampling resolution is typically 12 or more bits per sample resolution (16 is standard), for a final data rate in the range of 96–128 kbit/s, but a good vocoder can provide a reasonably good simulation of voice with as little as 2.4 kbit/s of data. ''Toll quality'' voice coders, such as ITU G.729, are used in many telephone networks. G.729 in particular has a final data rate of 8 kbit/s with superb voice quality. G.723 achieves slightly worse quality at data rates of 5.3 kbit/s and 6.4 kbit/s. Many voice vocoder systems use lower data rates, but below 5 kbit/s voice quality begins to drop rapidly. Several vocoder systems are used in NSA encryption systems: * LPC-10, FIPS Pub 137, 2400 bit/s, which uses linear predictive coding * Code-excited linear prediction (CELP), 2400 and 4800 bit/s, Federal Standard 1016, used in STU-III * Continuously variable slope delta modulation (CVSD), 16 kbit/s, used in wide band encryptors such as the KY-57. * Mixed-excitation linear prediction (MELP), MIL STD 3005, 2400 bit/s, used in the Future Narrowband Digital Terminal FNBDT, NSA's 21st century secure telephone. *
Adaptive Differential Pulse Code Modulation Adaptive differential pulse-code modulation (ADPCM) is a variant of differential pulse-code modulation (DPCM) that varies the size of the quantization step, to allow further reduction of the required data bandwidth for a given signal-to-noise ratio ...
( ADPCM), former ITU-T G.721, 32 kbit/s used in STE secure telephone (ADPCM is not a proper vocoder but rather a waveform codec. ITU has gathered G.721 along with some other ADPCM codecs into G.726.) Vocoders are also currently used in developing
psychophysics Psychophysics quantitatively investigates the relationship between physical stimuli and the sensations and perceptions they produce. Psychophysics has been described as "the scientific study of the relation between stimulus and sensation" or, m ...
, linguistics, computational neuroscience and cochlear implant research. Modern vocoders that are used in communication equipment and in voice storage devices today are based on the following algorithms: * Algebraic code-excited linear prediction (ACELP 4.7 kbit/s – 24 kbit/s) * Mixed-excitation linear prediction (MELPe 2400, 1200 and 600 bit/s) *
Multi-band excitation In telecommunications, a multi-band device (including (2) dual-band, (3) tri-band, (4) quad-band and (5) penta-band devices) is a communication device (especially a mobile phone) that supports multiple radio frequency bands. All devices which ha ...
(AMBE 2000 bit/s – 9600 bit/s) * Sinusoidal-Pulsed Representation (SPR 600 bit/s – 4800 bit/s) * Robust Advanced Low-complexity Waveform Interpolation (RALCWI 2050bit/s, 2400bit/s and 2750bit/s) * Tri-Wave Excited Linear Prediction (TWELP 600 bit/s – 9600 bit/s) * Noise Robust Vocoder (NRV 300 bit/s and 800 bit/s)


Linear prediction-based

Since the late 1970s, most non-musical vocoders have been implemented using linear prediction, whereby the target signal's spectral envelope (formant) is estimated by an all-pole IIR filter. In linear prediction coding, the all-pole filter replaces the bandpass filter bank of its predecessor and is used at the encoder to ''whiten'' the signal (i.e., flatten the spectrum) and again at the decoder to re-apply the spectral shape of the target speech signal. One advantage of this type of filtering is that the location of the linear predictor's spectral peaks is entirely determined by the target signal, and can be as precise as allowed by the time period to be filtered. This is in contrast with vocoders realized using fixed-width filter banks, where spectral peaks can generally only be determined to be within the scope of a given frequency band. LP filtering also has disadvantages in that signals with a large number of constituent frequencies may exceed the number of frequencies that can be represented by the linear prediction filter. This restriction is the primary reason that LP coding is almost always used in tandem with other methods in high-compression voice coders.


Waveform-interpolative

Waveform-interpolative (WI) vocoder was developed in AT&T Bell Laboratories around 1995 by W.B. Kleijn, and subsequently, a low- complexity version was developed by AT&T for the DoD secure vocoder competition. Notable enhancements to the WI coder were made at the University of California, Santa Barbara. AT&T holds the core patents related to WI, and other institutes hold additional patents.


Artistic effects


Uses in music

For musical applications, a source of musical sounds is used as the carrier, instead of extracting the fundamental frequency. For instance, one could use the sound of a
synthesizer A synthesizer (also spelled synthesiser) is an electronic musical instrument that generates audio signals. Synthesizers typically create sounds by generating waveforms through methods including subtractive synthesis, additive synthesis and ...
as the input to the filter bank, a technique that became popular in the 1970s.


History

Werner Meyer-Eppler, a German scientist with a special interest in electronic voice synthesis, published a thesis in 1948 on electronic music and
speech synthesis Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal languag ...
from the viewpoint of sound synthesis. Later he was instrumental in the founding of the Studio for Electronic Music of
WDR WDR may refer to: * Waddell & Reed (stock ticker: WDR), an American asset management and financial planning company * Walt Disney Records, an American record label of the Disney Music Group * WDR neuron, a type of neuron involved in pain signall ...
in Cologne, in 1951. One of the first attempts to use a vocoder in creating music was the "Siemens Synthesizer" at the Siemens Studio for Electronic Music, developed between 1956 and 1959.
(See also excerpt of pp
157

160
from th
3rd edition in 2008
())

  Details of the Siemens Electronic Music Studio, exhibited at the Deutsches Museum.
In 1968, Robert Moog developed one of the first solid-state musical vocoders for the electronic music studio of the University at Buffalo. In 1968, Bruce Haack built a prototype vocoder, named ''Farad'' after Michael Faraday. It was first featured on "The Electronic Record For Children" released in 1969 and then on his rock album '' The Electric Lucifer'' released in 1970.
  A sample of earlier Vocoder.
In 1970, Wendy Carlos and Robert Moog built another musical vocoder, a ten-band device inspired by the vocoder designs of Homer Dudley. It was originally called a spectrum encoder-decoder and later referred to simply as a vocoder. The carrier signal came from a Moog
modular synthesizer Modular synthesizers are synthesizers composed of separate modules for different functions. The modules can be connected together by the user to create a patch. The outputs from the modules may include audio signals, analog control voltages, o ...
, and the modulator from a microphone input. The output of the ten-band vocoder was fairly intelligible but relied on specially articulated
speech Speech is a human vocal communication using language. Each language uses Phonetics, phonetic combinations of vowel and consonant sounds that form the sound of its words (that is, all English words sound different from all French words, even if ...
. Some vocoders use a high-pass filter to let some sibilance through from the microphone; this ruins the device for its original speech-coding application, but it makes the talking synthesizer effect much more intelligible. In 1972, Isao Tomita's first electronic music album ''Electric Samurai: Switched on Rock'' was an early attempt at applying
speech synthesis Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal languag ...
technique in electronic rock and pop music. The album featured electronic renditions of contemporary rock and
pop Pop or POP may refer to: Arts, entertainment, and media Music * Pop music, a musical genre Artists * POP, a Japanese idol group now known as Gang Parade * Pop!, a UK pop group * Pop! featuring Angie Hart, an Australian band Albums * ''Pop'' (G ...
songs, while utilizing synthesized voices in place of human voices. In 1974, he utilized synthesized voices in his popular
classical music Classical music generally refers to the art music of the Western world, considered to be distinct from Western folk music or popular music traditions. It is sometimes distinguished as Western classical music, as the term "classical music" also ...
album '' Snowflakes are Dancing'', which became a worldwide success and helped to popularize electronic music. In 1973, the british band Emerson, Lake and Palmer used a vocoder on their album '' Brain Salad Surgery'', for the song " Karn Evil 9: 3rd Impression". The 1975 song " The Raven" from the album '' Tales of Mystery and Imagination'' by The Alan Parsons Project features Alan Parsons performing vocals through an EMI vocoder. According to the album's liner notes, "The Raven" was the first rock song to feature a digital vocoder.
Pink Floyd Pink Floyd are an English rock band formed in London in 1965. Gaining an early following as one of the first British psychedelic music, psychedelic groups, they were distinguished by their extended compositions, sonic experimentation, philo ...
also used a vocoder on three of their albums, first on their 1977 '' Animals'' for the songs "Sheep" and "Pigs (Three Different Ones)", then on '' A Momentary Lapse of Reason'' on "A New Machine Part 1" and "A New Machine Part 2" (1987), and finally on 1994's '' The Division Bell'', on "Keep Talking". The
Electric Light Orchestra The Electric Light Orchestra (ELO) are an English rock band formed in Birmingham in 1970 by songwriters and multi-instrumentalists Jeff Lynne and Roy Wood with drummer Bev Bevan. Their music is characterised by a fusion of pop, classical a ...
was among the first to use the vocoder in a commercial context, with their 1977 album ''
Out of the Blue Out of the Blue may refer to: Film and television Film * ''Out of the Blue'' (1931 film), a British musical by Gene Gerrard * ''Out of the Blue'' (1947 film), an American comedy directed by Leigh Jason *'' Out of the Blue: Live at Wembley'', a ...
''. The band extensively uses it on the album, including on the hits "
Sweet Talkin' Woman "Sweet Talkin' Woman" is a 1978 single by Electric Light Orchestra (ELO) from the album '' Out of the Blue'' (1977). Its original title was "Dead End Street", but it was changed during recording. Some words that survived from that version can be ...
" and "
Mr. Blue Sky "Mr. Blue Sky" is a song by the Electric Light Orchestra (ELO), featured on the band's seventh studio album '' Out of the Blue'' (1977). Written and produced by frontman Jeff Lynne, the song forms the fourth and final track of the "Concerto fo ...
". On following albums, the band made sporadic use of it, notably on their hits " The Diary of Horace Wimp" and " Confusion" from their 1979 album '' Discovery'', the tracks "Prologue", "Yours Truly, 2095", and "Epilogue" on their 1981 album '' Time'', and "
Calling America "Calling America" is a song by the rock music group Electric Light Orchestra (ELO) released as a single from their 1986 album '' Balance of Power''. The single reached number 28 in the United Kingdom, making it their 26th and final Top 40 hit ...
" from their 1986 album '' Balance of Power''. In the late 1970s, French duo Space Art used a vocoder during the recording of their second album, ''Trip in the Centre Head''.
Phil Collins Philip David Charles Collins (born 30 January 1951) is an English singer, musician, songwriter, record producer and actor. He was the drummer and lead singer of the rock band Genesis and also has a career as a solo performer. Between 1982 and ...
used a vocoder to provide a vocal effect for his 1981 international hit single " In the Air Tonight". Vocoders have appeared on pop recordings from time to time, most often simply as a
special effect Special effects (often abbreviated as SFX, F/X or simply FX) are illusions or visual tricks used in the theatre, film, television, video game, amusement park and simulator industries to simulate the imagined events in a story or virtual wor ...
rather than a featured aspect of the work. However, many experimental electronic artists of the
new-age music New-age is a genre of music intended to create artistic inspiration, relaxation technique, relaxation, and optimism. It is used by listeners for yoga, massage, meditation, and reading as a method of stress management to bring about a state of ecs ...
genre often utilize vocoder in a more comprehensive manner in specific works, such as Jean-Michel Jarre (on '' Zoolook'', 1984) and Mike Oldfield (on ''
QE2 ''Queen Elizabeth 2'' (''QE2'') is a retired British ocean liner converted into a floating hotel. Originally built for the Cunard Line, the ship, named as the second ship named ''Queen Elizabeth'', was operated by Cunard as both a transatlantic ...
'', 1980 and '' Five Miles Out'', 1982). Vocoder module and use by M. Oldfield can be clearly seen on his ''Live At Montreux 1981'' DVD (Track "Sheba"). There are also some artists who have made vocoders an essential part of their music, overall or during an extended phase. Examples include the German
synthpop Synth-pop (short for synthesizer pop; also called techno-pop; ) is a subgenre of new wave music that first became prominent in the late 1970s and features the synthesizer as the dominant musical instrument. It was prefigured in the 1960s a ...
group
Kraftwerk Kraftwerk (, "power station") is a German band formed in Düsseldorf in 1970 by Ralf Hütter and Florian Schneider. Widely considered innovators and pioneers of electronic music, Kraftwerk were among the first successful acts to popularize the ...
, the Japanese new wave group Polysics,
Stevie Wonder Stevland Hardaway Morris ( Judkins; May 13, 1950), known professionally as Stevie Wonder, is an American singer-songwriter, who is credited as a pioneer and influence by musicians across a range of genres that include rhythm and blues, Pop musi ...
("Send One Your Love", "A Seed's a Star") and jazz/fusion keyboardist
Herbie Hancock Herbert Jeffrey Hancock (born April 12, 1940) is an American jazz pianist, keyboardist, bandleader, and composer. Hancock started his career with trumpeter Donald Byrd's group. He shortly thereafter joined the Miles Davis Quintet, where he help ...
during his late 1970s period. In 1982 Neil Young used a Sennheiser Vocoder VSM201 on six of the nine tracks on '' Trans''. The chorus and bridge of Michael Jackson's " P.Y.T. (Pretty Young Thing)". features a vocoder ("Pretty young thing/You make me sing"), courtesy of session musician Michael Boddicker. Coldplay have used a vocoder in some of their songs. For example, in " Major Minus" and " Hurts Like Heaven", both from the album '' Mylo Xyloto'' (2011), Chris Martin's vocals are mostly vocoder-processed. "
Midnight Midnight is the transition time from one day to the next – the moment when the date changes, on the local official clock time for any particular jurisdiction. By clock time, midnight is the opposite of noon, differing from it by 12 hours. ...
", from '' Ghost Stories'' (2014), also features Martin singing through a vocoder. The hidden track "X Marks the Spot" from '' A Head Full of Dreams'' was also recorded through a vocoder. Noisecore band
Atari Teenage Riot Atari Teenage Riot (ATR) is a German band formed in Berlin in 1992. Highly political, they fuse left-wing, anarchist and anti-fascist views with punk vocals and a techno sound called digital hardcore, which is a term band member Alec Empire use ...
have used vocoders in variety of their songs and live performances such as '' Live at the Brixton Academy'' (2002) alongside other digital audio technology both old and new. The Red Hot Chili Peppers song " By the Way" uses a vocoder effect on
Anthony Kiedis Anthony Kiedis ( ; born November 1, 1962) is an American singer and songwriter. He is a founding member and lead vocalist of the rock band Red Hot Chili Peppers. Kiedis and his fellow band members were inducted into the Rock and Roll Hall of Fa ...
' vocals. Among the most consistent users of the vocoder in emulating the human voice are
Daft Punk Daft Punk were a French electronic music duo formed in 1993 in Paris by Thomas Bangalter and Guy-Manuel de Homem-Christo. Widely regarded as one of the most influential acts in dance music history, they achieved popularity in the late 1990s as p ...
, who have used this instrument from their first album '' Homework'' (1997) to their latest work '' Random Access Memories'' (2013) and consider the convergence of technological and human voice "the identity of their musical project". For instance, the lyrics of " Around the World" (1997) are integrally vocoder-processed, " Get Lucky" (2013) features a mix of natural and processed human voices, and " Instant Crush" (2013) features
Julian Casablancas Julian Fernando Casablancas (born August 23, 1978) is an American singer, musician and songwriter, best known as the lead vocalist and primary songwriter of Rock music, rock band The Strokes, with whom he has released six studio albums since the ...
singing into a vocoder. Producer Zedd, American country singer
Maren Morris Maren Larae Morris (born April 10, 1990) is an American singer-songwriter. While rooted in the Country music, country genre, her music also blends elements of Pop music, pop, R&B music, R&B, and Hip-Hop music, hip-hop. Born and raised in Arlingt ...
and American musical duo Grey made a song titled " The Middle" which featured a vocoder and reached the top ten of the charts in 2018.


Voice effects in other arts

Robot voices became a recurring element in popular music during the 20th century. Apart from vocoders, several other methods of producing variations on this effect include: the Sonovox, Talk box, and Auto-Tune,
  A sample of Auto-Tune effect (a.k.a. ''
T-Pain effect Auto-Tune (or autotune) is an audio processor introduced in 1996 by American company Antares Audio Technologies. Auto-Tune uses a proprietary device to measure and alter pitch in vocal and instrumental music recording and performances. Auto-Tu ...
'').
linear prediction vocoders,
speech synthesis Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal languag ...
,
  A sample of earlier computer-based
speech synthesis Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal languag ...
and song synthesis, by John Larry Kelly, Jr. and Louis Gerstman at Bell Labs, using
IBM 704 The IBM 704 is a large digital mainframe computer introduced by IBM in 1954. It was the first mass-produced computer with hardware for floating-point arithmetic. The IBM 704 ''Manual of operation'' states: The type 704 Electronic Data-Pro ...
computer. The demo song " Daisy Bell", musical accompanied by Max Mathews, impressed
Arthur C. Clarke Sir Arthur Charles Clarke (16 December 191719 March 2008) was an English science-fiction writer, science writer, futurist, inventor, undersea explorer, and television series host. He co-wrote the screenplay for the 1968 film '' 2001: A Spac ...
and later he used it in the climactic scene of the screenplay for his novel '' 2001: A Space Odyssey''.

  A sample of
speech synthesis Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal languag ...
.
ring modulation and comb filter. Vocoders are used in television production, filmmaking and games, usually for robots or talking computers. The robot voices of the Cylons in ''
Battlestar Galactica ''Battlestar Galactica'' is an American science fiction media franchise created by Glen A. Larson. The franchise began with the Battlestar Galactica (1978 TV series), original television series in 1978, and was followed by a short-run sequel se ...
'' were created with an EMS Vocoder 2000. The 1980 version of the ''
Doctor Who ''Doctor Who'' is a British science fiction television series broadcast by the BBC since 1963. The series depicts the adventures of a Time Lord called the Doctor, an extraterrestrial being who appears to be human. The Doctor explores the u ...
'' theme, as arranged and recorded by
Peter Howell Peter Howell may refer to: *Peter Howell (musician) (born c. 1948), musician and composer *Peter Howell (actor) (1919–2015), British actor *Peter Howell (historian) (born 1941), British academic and historian *Peter Howell (psychologist) Pete ...
, has a section of the main melody generated by a Roland SVC-350 vocoder. A similar
Roland VP-330 The Roland VP-330 is a paraphonic ten band vocoder and string machine manufactured by Roland Corporation from 1979 to 1980. While there are several string machines and vocoders, a single device combining the two is rare, despite the advantage ...
vocoder was used to create the voice of Soundwave, a character from the Transformers series.


See also

* Audio timescale-pitch modification * Auto-Tune * Homer Dudley *
List of vocoders __NOTOC__ Analogue vocoder models *Analog-Lab X-32 2-band *Behringer VC340 Analog Vocoder * Bode Model 7702 6-band *Doepfer Modular Vocoder subsystem A-129 *Dynacord SRV66 *Elektronika (Электроника) EM 26 * EMS **EMS Vocoder 100 ...
* Phase vocoder * Silent speech interface * Talk box * Werner Meyer-Eppler


References

;Multimedia references


External links

*
Description, photographs, and diagram for the vocoder at 120years.net
* Description of a modern Vocoder.




Object of Interest: The Vocoder The New Yorker Magazine mini documentary
{{Authority control Audio effects Electronic musical instruments Music hardware Lossy compression algorithms Speech codecs Robotics