HOME

TheInfoList



OR:

The Bell Telephone Laboratory's Voder (from ''Voice Operating Demonstrator'') was the first attempt to electronically synthesize human speech by breaking it down into its acoustic components. It was invented by
Homer Dudley Homer W. Dudley (14 November 1896– 18 September 1980) was a pioneering electronic and acoustic engineer who created the first electronic voice synthesizer for Bell Labs in the 1930s and led the development of a method of sending secure voice tra ...
in 1937–1938 and developed on his earlier work on the
vocoder A vocoder (, a portmanteau of ''voice'' and ''encoder'') is a category of speech coding that analyzes and synthesizes the human voice signal for audio data compression, multiplexing, voice encryption or voice transformation. The vocoder was i ...
. The quality of the speech was limited; however, it demonstrated the synthesis of the human voice, which became one component of the vocoder used in voice communications for security and to save bandwidth.Ben Gold, Nelson Morgan, Dan Ellis, ''Speech and Audio Signal Processing: Processing and Perception of Speech and Music''. John Wiley & Sons, 2011; , pages 9‒13 The Voder synthesized human speech by imitating the effects of the human
vocal tract The vocal tract is the cavity in human bodies and in animals where the sound produced at the sound source (larynx in mammals; syrinx in birds) is filtered. In birds it consists of the trachea, the syrinx, the oral cavity, the upper part of the es ...
. The operator could select one of two basic sounds by using a wrist bar. A buzz tone generated by a
relaxation oscillator In electronics a relaxation oscillator is a nonlinear electronic oscillator circuit that produces a nonsinusoidal repetitive output signal, such as a triangle wave or square wave. on Peter Millet'Tubebookswebsite The circuit consists of a feedba ...
produced the voiced vowels and nasal sounds, with the pitch controlled by a foot pedal. A hissing noise produced by a white noise tube created the sibilants (voiceless fricative sounds). These initial sounds were passed through a bank of 10 band-pass filters that were selected by keys; their outputs were combined, amplified and fed to a loudspeaker. The filters were controlled by a set of keys and a foot pedal to convert the hisses and tones into vowels, consonants, and inflections. Additional special keys were provided to make the
plosive In phonetics, a plosive, also known as an occlusive or simply a stop, is a pulmonic consonant in which the vocal tract is blocked so that all airflow ceases. The occlusion may be made with the tongue tip or blade (, ), tongue body (, ), lip ...
sounds such as "p" or "d", and the affrictive sounds of the "j" in "jaw" and the "ch" in "cheese". This was a complex machine to operate. After months of practice, a trained operator could produce recognizable speech. Performances on the Voder were featured at the
1939 New York World's Fair The 1939–40 New York World's Fair was a world's fair held at Flushing Meadows–Corona Park in Queens, New York, United States. It was the second-most expensive American world's fair of all time, exceeded only by St. Louis's Louisiana Purcha ...
and in
San Francisco San Francisco (; Spanish for " Saint Francis"), officially the City and County of San Francisco, is the commercial, financial, and cultural center of Northern California. The city proper is the fourth most populous in California and 17th ...
. Twenty operators were trained by Helen Harper, particularly noted for her skill with the machine. The machine said the words "Good afternoon, radio audience." The Voder was developed from research into compression schemes for transmission of voice on copper wires and for voice encryption. In 1948,
Werner Meyer-Eppler Werner Meyer-Eppler (30 April 1913 – 8 July 1960), was a Belgian-born German physicist, experimental acoustician, phoneticist and information theorist. Meyer-Eppler was born in Antwerp. He studied mathematics, physics, and chemistry, ...
recognized the capability of the Voder machine to generate electronic music, as described in Dudley's patent. Whereas the vocoder analyzes speech, transforms it into electronically transmitted information, and recreates it, the voder generates synthesized speech by means of a console with fifteen touch-sensitive keys and a pedal. It basically consists of the "second half" of the vocoder, but with manual filter controls, and requires a highly trained operator.


See also

*
Formant In speech science and phonetics, a formant is the broad spectral maximum that results from an acoustic resonance of the human vocal tract. In acoustics, a formant is usually defined as a broad peak, or local maximum, in the spectrum. For harmoni ...


References

;Bibliography * * *


External links


"The Voder"
– has photos and block diagrams *{{cite web , title = The Voder - Homer Dudley (Bell Labs) 1939 , url = https://www.youtube.com/watch?v=5hyI_dM5cGo#t=3m00s , format = audio + slide , work = YouTube
— Voder sang the "
Auld Lang Syne "Auld Lang Syne" (: note "s" rather than "z") is a popular song, particularly in the English-speaking world. Traditionally, it is sung to bid farewell to the old year at the stroke of midnight on New Year's Eve. By extension, it is also often ...
" a
3'00"
(20 sec). Speech synthesis