Source–filter Model
   HOME
*



picture info

Source–filter Model
The source–filter model represents speech as a combination of a sound source, such as the vocal cords, and a linear acoustic filter, the vocal tract. While only an approximation, the model is widely used in a number of applications such as speech synthesis and speech analysis because of its relative simplicity. It is also related to linear prediction. The development of the model is due, in large part, to the early work of Gunnar Fant, although others, notably Ken Stevens, have also contributed substantially to the models underlying acoustic analysis of speech and speech synthesis. Fant built off the work of Tsutomu Chiba and Masato Kajiyama, who first showed the relationship between a vowel's acoustic properties and the shape of the vocal tract. An important assumption that is often made in the use of the source–filter model is the independence of source and filter. In such cases, the model should more accurately be referred to as the "independent source–filter model". H ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Vocal Cords
In humans, vocal cords, also known as vocal folds or voice reeds, are folds of throat tissues that are key in creating sounds through vocalization. The size of vocal cords affects the pitch of voice. Open when breathing and vibrating for speech or singing, the folds are controlled via the recurrent laryngeal nerve, recurrent laryngeal branch of the vagus nerve. They are composed of twin infoldings of mucous membrane stretched horizontally, from back to front, across the larynx. They vibration, vibrate, modulating the flow of air being expelled from the lungs during phonation. The 'true vocal cords' are distinguished from the 'false vocal folds', known as vestibular folds or ''ventricular folds'', which sit slightly superior to the more delicate true folds. These have a minimal role in normal phonation, but can produce deep sonorous tones, screams and growls. The length of the vocal fold at birth is approximately six to eight millimeters and grows to its adult length of eight to ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




Fricative Consonant
A fricative is a consonant produced by forcing air through a narrow channel made by placing two articulators close together. These may be the lower lip against the upper teeth, in the case of ; the back of the tongue against the soft palate in the case of German (the final consonant of ''Bach''); or the side of the tongue against the molars, in the case of Welsh (appearing twice in the name ''Llanelli''). This turbulent airflow is called frication. A particular subset of fricatives are the sibilants. When forming a sibilant, one still is forcing air through a narrow channel, but in addition, the tongue is curled lengthwise to direct the air over the edge of the teeth. English , , , and are examples of sibilants. The usage of two other terms is less standardized: "Spirant" is an older term for fricatives used by some American and European phoneticians and phonologists. "Strident" could mean just "sibilant", but some authors include also labiodental and uvular fricatives in ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Inverse Filter
Signal processing is an electrical engineering subfield that focuses on analysing, modifying, and synthesizing signals such as sound, images, and scientific measurements. For example, with a filter ''g'', an inverse filter ''h'' is one such that the sequence of applying ''g'' then ''h'' to a signal results in the original signal. Software or electronic inverse filters are often used to compensate for the effect of unwanted environmental filtering of signals. In speech science In all proposed models for the production of human speech, an important variable is the waveform of the airflow, or volume velocity, at the glottis. The glottal volume velocity waveform provides the link between movements of the vocal folds and the acoustical results of such movements, in that the glottis acts approximately as a source of volume velocity. That is, the impedance of the glottis is usually much higher than that of the vocal tract, and so glottal airflow is controlled mostly (but not entirely) by ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Acoustic Attenuation
Acoustic attenuation is a measure of the energy loss of sound propagation in media. Most media have viscosity and are therefore not ideal media. When sound propagates in such media, there is always thermal consumption of energy caused by viscosity. This effect can be quantified through the Stokes's law of sound attenuation. Sound attenuation may also be a result of heat conductivity in the media as has been shown by G. Kirchhoff in 1868. The Stokes-Kirchhoff attenuation formula takes into account both viscosity and thermal conductivity effects. For heterogeneous media, besides media viscosity, acoustic scattering is another main reason for removal of acoustic energy. Acoustic attenuation in a lossy medium plays an important role in many scientific researches and engineering fields, such as medical ultrasonography, vibration and noise reduction. Power-law frequency-dependent acoustic attenuation Many experimental and field measurements show that the acoustic attenuation coeffi ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Amplitude
The amplitude of a periodic variable is a measure of its change in a single period (such as time or spatial period). The amplitude of a non-periodic signal is its magnitude compared with a reference value. There are various definitions of amplitude (see below), which are all functions of the magnitude of the differences between the variable's extreme values. In older texts, the phase of a periodic function is sometimes called the amplitude. Definitions Peak amplitude & semi-amplitude For symmetric periodic waves, like sine waves, square waves or triangle waves ''peak amplitude'' and ''semi amplitude'' are the same. Peak amplitude In audio system measurements, telecommunications and others where the measurand is a signal that swings above and below a reference value but is not sinusoidal, peak amplitude is often used. If the reference is zero, this is the maximum absolute value of the signal; if the reference is a mean value (DC component), the peak amplitude is the maximu ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Harmonic
A harmonic is a wave with a frequency that is a positive integer multiple of the ''fundamental frequency'', the frequency of the original periodic signal, such as a sinusoidal wave. The original signal is also called the ''1st harmonic'', the other harmonics are known as ''higher harmonics''. As all harmonics are periodic at the fundamental frequency, the sum of harmonics is also periodic at that frequency. The set of harmonics forms a '' harmonic series''. The term is employed in various disciplines, including music, physics, acoustics, electronic power transmission, radio technology, and other fields. For example, if the fundamental frequency is 50  Hz, a common AC power supply frequency, the frequencies of the first three higher harmonics are 100 Hz (2nd harmonic), 150 Hz (3rd harmonic), 200 Hz (4th harmonic) and any addition of waves with these frequencies is periodic at 50 Hz. In music, harmonics are used on string instruments and wind instrum ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Articulation (phonetics)
The field of articulatory phonetics is a subfield of phonetics that studies articulation and ways that humans produce speech. Articulatory phoneticians explain how humans produce speech sounds via the interaction of different physiological structures. Generally, articulatory phonetics is concerned with the transformation of aerodynamic energy into acoustic energy. Aerodynamic energy refers to the airflow through the vocal tract. Its potential form is air pressure; its kinetic form is the actual dynamic airflow. Acoustic energy is variation in the air pressure that can be represented as sound waves, which are then perceived by the human auditory system as sound. Respiratory sounds can be produced by expelling air from the lungs. However, to vary the sound quality in a way useful for speaking, two speech organs normally move towards each other to contact each other to create an obstruction that shapes the air in a particular fashion. The point of maximum obstruction is called the ' ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Phonation
The term phonation has slightly different meanings depending on the subfield of phonetics. Among some phoneticians, ''phonation'' is the process by which the vocal folds produce certain sounds through quasi-periodic vibration. This is the definition used among those who study laryngeal anatomy and physiology and speech production in general. Phoneticians in other subfields, such as linguistic phonetics, call this process '' voicing'', and use the term ''phonation'' to refer to any oscillatory state of any part of the larynx that modifies the airstream, of which voicing is just one example. Voiceless and supra-glottal phonations are included under this definition. Voicing The phonatory process, or voicing, occurs when air is expelled from the lungs through the glottis, creating a pressure drop across the larynx. When this drop becomes sufficiently large, the vocal folds start to oscillate. The minimum pressure drop required to achieve phonation is called the phonation threshold ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Vocal Cords
In humans, vocal cords, also known as vocal folds or voice reeds, are folds of throat tissues that are key in creating sounds through vocalization. The size of vocal cords affects the pitch of voice. Open when breathing and vibrating for speech or singing, the folds are controlled via the recurrent laryngeal nerve, recurrent laryngeal branch of the vagus nerve. They are composed of twin infoldings of mucous membrane stretched horizontally, from back to front, across the larynx. They vibration, vibrate, modulating the flow of air being expelled from the lungs during phonation. The 'true vocal cords' are distinguished from the 'false vocal folds', known as vestibular folds or ''ventricular folds'', which sit slightly superior to the more delicate true folds. These have a minimal role in normal phonation, but can produce deep sonorous tones, screams and growls. The length of the vocal fold at birth is approximately six to eight millimeters and grows to its adult length of eight to ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Pharynx
The pharynx (plural: pharynges) is the part of the throat behind the mouth and nasal cavity, and above the oesophagus and trachea (the tubes going down to the stomach and the lungs). It is found in vertebrates and invertebrates, though its structure varies across species. The pharynx carries food and air to the esophagus and larynx respectively. The flap of cartilage called the epiglottis stops food from entering the larynx. In humans, the pharynx is part of the digestive system and the conducting zone of the respiratory system. (The conducting zone—which also includes the nostrils of the nose, the larynx, trachea, bronchi, and bronchioles—filters, warms and moistens air and conducts it into the lungs). The human pharynx is conventionally divided into three sections: the nasopharynx, oropharynx, and laryngopharynx. It is also important in vocalization. In humans, two sets of pharyngeal muscles form the pharynx and determine the shape of its lumen. They are arranged as an ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Dirac Comb
In mathematics, a Dirac comb (also known as shah function, impulse train or sampling function) is a periodic function with the formula \operatorname_(t) \ := \sum_^ \delta(t - k T) for some given period T. Here ''t'' is a real variable and the sum extends over all integers ''k.'' The Dirac delta function \delta and the Dirac comb are tempered distributions. The graph of the function resembles a comb (with the \deltas as the comb's ''teeth''), hence its name and the use of the comb-like Cyrillic letter sha (Ш) to denote the function. The symbol \operatorname\,\,(t), where the period is omitted, represents a Dirac comb of unit period. This implies \operatorname_(t) \ = \frac\operatorname\ \!\!\!\left(\frac\right). Because the Dirac comb function is periodic, it can be represented as a Fourier series based on the Dirichlet kernel: \operatorname_(t) = \frac\sum_^ e^. The Dirac comb function allows one to represent both continuous and discrete phenomena, such as sampling and al ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]