Sinewave synthesis, or sine wave speech, is a technique for
synthesizing speech by replacing the
formants
In speech science and phonetics, a formant is the broad spectral maximum that results from an acoustic resonance of the human vocal tract. In acoustics, a formant is usually defined as a broad peak, or local maximum, in the spectrum. For harmon ...
(main bands of energy) with pure tone whistles. The first sinewave synthesis program (''SWS'') for the automatic creation of stimuli for perceptual experiments was developed by
Philip Rubin at
Haskins Laboratories
Haskins Laboratories, Inc. is an independent 501(c) non-profit corporation, founded in 1935 and located in New Haven, Connecticut, since 1970. Haskins has formal affiliation agreements with both Yale University and the University of Connecticut; ...
in the 1970s. This program was subsequently used by
Robert Remez,
Philip Rubin, David Pisoni, and other colleagues to show that listeners can
perceive
Perception () is the organization, identification, and interpretation of sensory information in order to represent and understand the presented information or environment. All perception involves signals that go through the nervous syste ...
continuous speech without traditional speech cues, i.e., pitch, stress, and intonation. This work paved the way for a view of
speech
Speech is the use of the human voice as a medium for language. Spoken language combines vowel and consonant sounds to form units of meaning like words, which belong to a language's lexicon. There are many different intentional speech acts, suc ...
as a dynamic pattern of trajectories through articulatory-acoustic space.
Bibliography
* Rubin, P.E. Sinewave synthesis. Internal memorandum, Haskins Laboratories, New Haven, CT, 198
* Remez, R.E., Rubin, P.E., Pisoni, D.B., & Carrell, T.D. Speech perception without traditional speech cues. ''Science'', 1981, 212, 947-950.
* Best, C.T., Morrongiello, B. & Robson, R. Perceptual equivalence of acoustic cues in speech and nonspeech perception. ''Perception & Psychophysics'', 1981, 29, 191-211.
* Remez, R.E., Rubin, P.E., Berns, S.M., Pardo, J.S. & Lang, J.M. On the perceptual organization of speech. ''Psychological Review'', 1994, 101, 129-156.
* Remez, R. E., Fellowes, J. M., & Rubin, P.E. Talker identification based on phonetic information. ''Journal of Experimental Psychology: Human Perception and Performance'', 1997, 23, 651-666.
References
{{reflist
External links
Haskins LaboratoriesRobert Remez*
ttps://github.com/johnhw/sinewave_speech A Python tool to convert WAV files to sinewave speech using linear predictive codingSinSyn - Sinusoidal SynthesizerA browser-based tool for creating individual sounds with sinewave synthesis
Speech synthesis
Applications of artificial intelligence
Computational linguistics