Mel Scale
   HOME

TheInfoList



OR:

The mel scale (after the word ''
melody A melody (from Greek language, Greek μελῳδία, ''melōidía'', "singing, chanting"), also tune, voice or line, is a Linearity#Music, linear succession of musical tones that the listener perceives as a single entity. In its most liter ...
'') is a perceptual scale of pitches judged by listeners to be equal in distance from one another. The reference point between this scale and normal
frequency Frequency is the number of occurrences of a repeating event per unit of time. It is also occasionally referred to as ''temporal frequency'' for clarity, and is distinct from ''angular frequency''. Frequency is measured in hertz (Hz) which is eq ...
measurement is defined by assigning a perceptual pitch of 1000 mels to a 1000 Hz tone, 40 dB above the listener's threshold. Above about 500 Hz, increasingly large intervals are judged by listeners to produce equal pitch increments.


Formula

A formula to convert ''f'' hertz into ''m'' mels is: m = 2595 \log_\left(1 + \frac\right)


History and other formulas

The formula from O'Shaughnessy's book can be expressed with different logarithmic bases: m = 2595 \log_\left(1 + \frac\right) = 1127 \ln\left(1 + \frac\right) The corresponding inverse expressions are: f = 700\left(10^\frac - 1\right) = 700\left(e^\frac - 1\right) There were published curves and tables on psychophysical pitch scales since Steinberg's 1937 curves based on
just-noticeable difference In the branch of experimental psychology focused on sense, sensation, and perception, which is called psychophysics, a just-noticeable difference or JND is the amount something must be changed in order for a difference to be noticeable, detectabl ...
s of pitch. More curves soon followed in Fletcher and Munson's 1937 and Fletcher's 1938 and Stevens' 1937 and Stevens and Volkmann's 1940 papers using a variety of experimental methods and analysis approaches. In 1949 Koenig published an approximation based on separate linear and logarithmic segments, with a break at 1000 Hz.
Gunnar Fant Carl Gunnar Michael Fant (October 8, 1919 – June 6, 2009) was a leading researcher in speech science in general and speech synthesis in particular who spent most of his career as a professor at the Swedish Royal Institute of Technology (KTH) in ...
proposed the current popular linear/logarithmic formula in 1949, but with the 1000 Hz corner frequency. An alternate expression of the formula, not depending on choice of logarithm base, is noted in Fant (1968): m = \frac \log\left(1 + \frac\right) \ In 1976, Makhoul and Cosell published the now-popular version with the 700 Hz corner frequency. As Ganchev et al. have observed, "The formulae
ith 700 The Ith () is a ridge in Germany's Central Uplands which is up to 439 m high. It lies about 40 km southwest of Hanover and, at 22 kilometres, is the longest line of crags in North Germany. Geography Location The Ith is immediatel ...
when compared to ant's with 1000 provide a closer approximation of the Mel scale for frequencies below 1000 Hz, at the price of higher inaccuracy for frequencies higher than 1000 Hz." Above 7 kHz, however, the situation is reversed, and the 700 Hz version again fits better. Data by which some of these formulas are motivated are tabulated in Beranek (1949), as measured from the curves of Stevens and Volkmann: A formula with a break frequency of 625 Hz is given by Lindsay & Norman (1977); the formula does not appear in their 1972 first edition: m = 2410 \log_(0.0016 f + 1) For direct comparison with other formulae, this is equivalent to: m = 2410 \log_\left(1 + \frac\right) Most mel-scale formulas give exactly 1000 mels at 1000 Hz. The break frequency (e.g. 700 Hz, 1000 Hz, or 625 Hz) is the only free parameter in the usual form of the formula. Some non-mel auditory-frequency-scale formulas use the same form but with much lower break frequency, not necessarily mapping to 1000 at 1000 Hz; for example the ERB-rate scale of Glasberg & Moore (1990) uses a break point of 228.8 Hz, and the cochlear frequency–place map of Greenwood (1990) uses 165.3 Hz. Other functional forms for the mel scale have been explored by Umesh et al.; they point out that the traditional formulas with a logarithmic region and a linear region do not fit the data from Stevens and Volkmann's curves as well as some other forms, based on the following data table of measurements that they made from those curves:


Criticism

Stevens' student, Donald D. Greenwood, who had worked on the mel scale experiments in 1956, considers the scale biased by experimental flaws. In 2009 he posted to a mailing list,


See also

*
Bark scale The Bark scale is a psychoacoustical scale proposed by Eberhard Zwicker in 1961. It is named after Heinrich Barkhausen who proposed the first subjective measurements of loudness.Zwicker, E. (1961),Subdivision of the audible frequency range into ...
* Mel-frequency cepstrum *
Fletcher–Munson curves An equal-loudness contour is a measure of sound pressure level, over the frequency spectrum, for which a listener perceives a constant loudness when presented with pure steady tones. The unit of measurement for loudness levels is the phon and i ...


References


External links

*
Hz–mel, mel–Hz conversion
(uses the O'Shaughnessy equation) *

{{Acoustics Scales Psychoacoustics