MPEG-1 Audio Layer II or MPEG-2 Audio Layer II (MP2, sometimes incorrectly called Musicam or MUSICAM) is a lossy audio compression format defined by ISO/IEC 11172-3 alongside MPEG-1 Audio Layer I and MPEG-1 Audio Layer III (MP3). While MP3 is much more popular for PC and Internet applications, MP2 remains a dominant standard for audio broadcasting.
History of development from MP2 to MP3
MUSICAM
MPEG-1 Audio Layer 2 encoding was derived from the MUSICAM (''Masking pattern adapted Universal Subband Integrated Coding And Multiplexing'') audio codec, developed by Centre commun d'études de télévision et télécommunications (CCETT), Philips, and the Institut für Rundfunktechnik (IRT) in 1989 as part of the EUREKA 147 pan-European inter-governmental research and development initiative for the development of a system for the broadcasting of audio and data to fixed, portable or mobile receivers (established in 1987).
It began as the Digital Audio Broadcast (DAB) project managed by Egon Meier-Engelen of the Deutsche Forschungs- und Versuchsanstalt für Luft- und Raumfahrt (later renamed Deutsches Zentrum für Luft- und Raumfahrt, the German Aerospace Center) in Germany. The European Community financed this project, commonly known as EU-147, from 1987 to 1994 as a part of the EUREKA research program.
The Eureka 147 System comprised three main elements: MUSICAM Audio Coding (''Masking pattern adapted Universal Subband Integrated Coding And Multiplexing''), Transmission Coding & Multiplexing, and COFDM Modulation.
MUSICAM was one of the few codecs able to achieve high audio quality at bit rates in the range of 64 to 192 kbit/s per monophonic channel. It was designed to meet the technical requirements of most applications in broadcasting, telecommunication and recording on digital storage media: low delay, low complexity, error robustness, short access units, etc.
As a predecessor of the MP3 format and technology, the perceptual codec MUSICAM is based on an integer-arithmetic 32-subband transform driven by a psychoacoustic model. It was primarily designed for Digital Audio Broadcasting and digital TV, and was disclosed by CCETT (France) and IRT (Germany) in Atlanta during an IEEE-ICASSP conference. This codec, incorporated into a broadcasting system using COFDM modulation, was demonstrated on air and in the field together with Radio Canada and CRC Canada during the NAB show (Las Vegas) in 1991. The implementation of the audio part of this broadcasting system was based on a two-chip encoder (one chip for the subband transform, one for the psychoacoustic model designed by the team of G. Stoll (IRT Germany), later known as Psychoacoustic model I in the ISO MPEG audio standard) and a real-time decoder using one Motorola 56001 DSP chip running integer-arithmetic software designed by Y.F. Dehery's team (CCETT, France). The simplicity of the corresponding decoder, together with the high audio quality of this codec, which for the first time used a 48 kHz sampling frequency and a 20 bits/sample input format (the highest available sampling standard in 1991, compatible with the AES/EBU professional digital input studio standard), were the main reasons for later adopting the characteristics of MUSICAM as the basic features of an advanced digital music compression codec such as MP3.
The audio coding algorithm used by the Eureka 147 Digital Audio Broadcasting (DAB) system was subject to the standardization process within the ISO/Moving Picture Experts Group (MPEG) in 1989–94.
MUSICAM audio coding was used as a basis for some coding schemes of MPEG-1 and MPEG-2 Audio.
Most key features of MPEG-1 Audio were directly inherited from MUSICAM, including the filter bank, time-domain processing, audio frame sizes, etc. However, improvements were made, and the actual MUSICAM algorithm was not used in the final MPEG-1 Audio Layer II standard.
Since the finalisation of MPEG-1 Audio and MPEG-2 Audio (in 1992 and 1994), the original MUSICAM algorithm has no longer been used.
The name MUSICAM is often mistakenly used when MPEG-1 Audio Layer II is meant. This can lead to some confusion, because the name MUSICAM is trademarked by different companies in different regions of the world.
(Musicam is the name used for MP2 in some specifications for Astra Digital Radio as well as in the BBC's DAB documents.)
The Eureka Project 147 resulted in the publication in 1995 of the European Standard ETS 300 401 for DAB, which now has worldwide acceptance. The DAB standard uses MPEG-1 Audio Layer II (ISO/IEC 11172-3) for the 48 kHz sampling frequency and MPEG-2 Audio Layer II (ISO/IEC 13818-3) for the 24 kHz sampling frequency.
MPEG Audio
In the late 1980s, ISO's Moving Picture Experts Group (MPEG) started an effort to standardize digital audio and video encoding, expected to have a wide range of applications in digital radio and TV broadcasting (later DAB, DMB, DVB), and use on CD-ROM (later Video CD). The MUSICAM audio coding was one of 14 proposals for the MPEG-1 Audio standard that were submitted to ISO in 1989.
The MPEG-1 Audio standard was based on the existing MUSICAM and ASPEC audio formats.
The MPEG-1 Audio standard included the three audio "layers" (encoding techniques) now known as Layer I (MP1), Layer II (MP2) and Layer III (MP3). All algorithms for MPEG-1 Audio Layer I, II and III were approved in 1991 as the committee draft of ISO-11172 and finalized in 1992 as part of MPEG-1, the first standard suite by MPEG, which resulted in the international standard ISO/IEC 11172-3 (a.k.a. ''MPEG-1 Audio'' or ''MPEG-1 Part 3''), published in 1993.
Further work on MPEG audio was finalized in 1994 as part of the second suite of MPEG standards, MPEG-2, more formally known as international standard ISO/IEC 13818-3 (a.k.a. ''MPEG-2 Part 3'' or backward compatible ''MPEG-2 Audio'' or ''MPEG-2 Audio BC''), originally published in 1995.
MPEG-2 Part 3 (ISO/IEC 13818-3) defined additional bit rates and sample rates for MPEG-1 Audio Layer I, II and III. The new sampling rates are exactly half of those originally defined for MPEG-1 Audio. MPEG-2 Part 3 also enhanced MPEG-1's audio by allowing the coding of audio programs with more than two channels, up to 5.1 multichannel.
The Layer III (MP3) component uses a lossy compression algorithm that was designed to greatly reduce the amount of data required to represent an audio recording while still sounding like a decent reproduction of the original uncompressed audio to most listeners.
Emmy Award in Engineering
CCETT (France), IRT (Germany) and Philips (The Netherlands) won an Emmy Award in Engineering 2000 for development of a digital audio two-channel compression system known as Musicam or MPEG Audio Layer II.
Technical specifications
MPEG-1 Audio Layer II is defined in ISO/IEC 11172-3 (MPEG-1 Part 3)
* Sampling rates: 32, 44.1 and 48 kHz
* Bit rates: 32, 48, 56, 64, 80, 96, 112, 128, 160, 192, 224, 256, 320 and 384 kbit/s
An extension has been provided in MPEG-2 Audio Layer II and is defined in ISO/IEC 13818-3 (MPEG-2 Part 3)
* Additional sampling rates: 16, 22.05 and 24 kHz
* Additional bit rates: 8, 16, 24, 40 and 144 kbit/s
* Multichannel support - up to 5 full range audio channels and an LFE-channel (Low Frequency Enhancement channel)
The format is based on successive digital frames of 1152 sampling intervals with four possible formats:
* mono format
* stereo format
* intensity encoded joint stereo format (stereo irrelevance)
* dual channel (uncorrelated) format
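The frame length of 1152 sampling intervals fixes both the duration of a frame and, for a given bit rate, its size in bytes. The following is a minimal sketch of that arithmetic, not an excerpt from any encoder. The function names are hypothetical, and the optional padding byte is an assumption taken from the general MPEG audio frame format rather than from the text above.

```python
# Frame arithmetic for Layer II, based on the 1152-sample frame described above.
# Function names are illustrative only.

SAMPLES_PER_FRAME = 1152


def frame_duration_ms(sample_rate_hz: int) -> float:
    """Playback time covered by one frame, in milliseconds."""
    return 1000.0 * SAMPLES_PER_FRAME / sample_rate_hz


def frame_size_bytes(bitrate_kbps: int, sample_rate_hz: int, padding: int = 0) -> int:
    """Size of one frame in bytes at a constant bit rate.

    1152 samples / 8 bits per byte = 144, so the size is
    floor(144 * bitrate / sample_rate), plus an optional padding byte
    (assumed here; it lets rates such as 44.1 kHz average out exactly).
    """
    return (SAMPLES_PER_FRAME // 8) * bitrate_kbps * 1000 // sample_rate_hz + padding


if __name__ == "__main__":
    print(frame_duration_ms(48000))        # 24.0
    print(frame_size_bytes(192, 48000))    # 576
    print(frame_size_bytes(128, 44100))    # 417 (418 with padding=1)
```

At the halved MPEG-2 sampling rates the same 1152-sample frame simply covers twice the time, for example 48 ms at 24 kHz.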
Variable bit rate
MPEG audio may have variable bit rate (VBR), but it is not widely supported. Layer II can use a method called bit rate switching. Each frame may be created with a different bit rate.
According to ISO/IEC 11172-3:1993, Section 2.4.2.3: To provide the smallest possible delay and complexity, the (MPEG audio) decoder is not required to support a continuously variable bit rate when in layer I or II.
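Bit rate switching can be observed by reading the bit rate field of each frame header in turn. The sketch below assumes the usual 32-bit MPEG audio frame header layout (11-bit sync, version, layer, protection bit, 4-bit bit rate index, 2-bit sampling rate index, padding bit), which the text above does not spell out, and handles only MPEG-1 Layer II without free-format frames. The function names are hypothetical.

```python
# Reads per-frame bit rates from an MPEG-1 Layer II stream.
# Assumes the standard 32-bit MPEG audio header layout; names are illustrative.

# Bit rates in kbit/s indexed by the 4-bit header field.
# Index 0 is "free format" and index 15 is invalid; both are rejected here.
MPEG1_L2_BITRATES = (None, 32, 48, 56, 64, 80, 96, 112,
                     128, 160, 192, 224, 256, 320, 384, None)
# Sampling rates in Hz indexed by the 2-bit header field (index 3 is reserved).
MPEG1_SAMPLE_RATES = (44100, 48000, 32000, None)


def parse_layer2_header(header: bytes):
    """Return (bitrate_kbps, sample_rate_hz, frame_size_bytes) for one header."""
    if len(header) < 4:
        raise ValueError("need at least 4 header bytes")
    word = int.from_bytes(header[:4], "big")
    if (word >> 21) & 0x7FF != 0x7FF:
        raise ValueError("sync word not found")
    version = (word >> 19) & 0x3   # 3 means MPEG-1
    layer = (word >> 17) & 0x3     # 2 means Layer II
    if version != 3 or layer != 2:
        raise ValueError("not an MPEG-1 Layer II frame")
    bitrate = MPEG1_L2_BITRATES[(word >> 12) & 0xF]
    sample_rate = MPEG1_SAMPLE_RATES[(word >> 10) & 0x3]
    padding = (word >> 9) & 0x1
    if bitrate is None or sample_rate is None:
        raise ValueError("unsupported bit rate or sampling rate index")
    size = 144 * bitrate * 1000 // sample_rate + padding
    return bitrate, sample_rate, size


def frame_bitrates(stream: bytes):
    """List the bit rate of every frame, stepping from header to header."""
    rates = []
    pos = 0
    while pos + 4 <= len(stream):
        bitrate, _, size = parse_layer2_header(stream[pos:pos + 4])
        rates.append(bitrate)
        pos += size
    return rates
```

A stream whose frames report more than one distinct value in `frame_bitrates` is using bit rate switching; a constant bit rate stream reports the same value for every frame.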
How the MP2 format works
* MP2 is a sub-band audio encoder, which means that compression takes place in the time domain with a low-delay filter bank producing 32 frequency domain components. By comparison, MP3 is a transform audio encoder with hybrid filter bank, which means that compression takes place in the frequency domain after a hybrid (double) transformation from the time domain.
* MPEG Audio Layer II is the core algorithm of the MP3 standards. All psychoacoustical characteristics and frame format structures of the MP3 format are derived from the basic MP2 algorithm and format.
* The MP2 encoder may exploit inter-channel redundancies using optional "joint stereo" intensity encoding.
* Like MP3, MP2 is a perceptual coding format, which means that it removes information that the human auditory system will not be able to easily perceive. To choose which information to remove, the audio signal is analyzed according to a psychoacoustic model, which takes into account the parameters of the human auditory system. Research into psychoacoustics has shown that if there is a strong signal on a certain frequency, then weaker signals at frequencies close to the strong signal's frequency cannot be perceived by the human auditory system. This is called frequency masking. Perceptual audio codecs take advantage of this frequency masking by ignoring information at frequencies that are deemed to be imperceptible, thus allowing more data to be allocated to the reproduction of perceptible frequencies.
* MP2 splits the input audio signal into 32 sub-bands, and if the audio in a sub-band is deemed to be imperceptible then that sub-band is not transmitted. MP3, on the other hand, transforms the input audio signal to the frequency domain in 576 frequency components. Therefore, MP3 has a higher frequency resolution than MP2, which allows the psychoacoustic model to be applied more selectively than for MP2. So MP3 has greater scope to reduce the bit rate.
* The use of an additional entropy coding tool and higher frequency accuracy (due to the larger number of frequency sub-bands used by MP3) explain why MP3 does not need as high a bit rate as MP2 to get an acceptable audio quality. Conversely, MP2 behaves better than MP3 in the time domain, due to its lower frequency resolution. This implies less codec time delay (which can make editing audio simpler) as well as "ruggedness" and resistance to errors that may occur during the digital recording process or during transmission.
* The MP2 sub-band filter bank also provides an inherent "transient concealment" feature, due to the specific temporal masking effect of its mother filter. This unique characteristic of the MPEG-1 Audio family implies a very good sound quality on audio signals with rapid energy changes, such as percussive sounds. Because both the MP2 and MP3 formats use the same basic sub-band filter bank, both benefit from this characteristic.
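The difference in frequency resolution between 32 sub-bands and 576 frequency components can be made concrete with a little arithmetic. The sketch below assumes the components are spaced uniformly between 0 Hz and half the sampling rate; the function name is hypothetical.

```python
# Compares the frequency resolution of MP2 (32 sub-bands) with MP3
# (576 frequency components), assuming uniform spacing up to half the
# sampling rate. Names are illustrative only.

MP2_SUBBANDS = 32
MP3_COMPONENTS = 576
SAMPLES_PER_FRAME = 1152


def band_width_hz(sample_rate_hz: int, n_components: int) -> float:
    """Width of one frequency component in Hz."""
    return (sample_rate_hz / 2) / n_components


# Each sub-band carries 1152 / 32 = 36 samples per frame.
samples_per_subband = SAMPLES_PER_FRAME // MP2_SUBBANDS

if __name__ == "__main__":
    print(band_width_hz(48000, MP2_SUBBANDS))    # 750.0
    print(band_width_hz(48000, MP3_COMPONENTS))  # roughly 41.7
    print(samples_per_subband)                   # 36
```

At 48 kHz each MP2 sub-band is therefore 18 times wider than an MP3 frequency component, which illustrates why the psychoacoustic model can be applied more selectively in MP3, while MP2 keeps the finer time resolution.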
Applications of MP2
Part of the DAB digital radio and DVB digital television standards.
Layer II is commonly used within the broadcast industry for distributing live audio over satellite, ISDN and IP network connections as well as for storage of audio in digital playout systems. An example is NPR's PRSS Content Depot programming distribution system. The Content Depot distributes MPEG-1 L2 audio in a Broadcast Wave File wrapper. MPEG2 with RIFF headers (used in .wav) is specified in the RIFF/WAV standards. As a result, Windows Media Player will directly play Content Depot files; however, less capable .wav players often do not. As the encoding and decoding process would have been a significant drain on CPU resources in the first generations of broadcast playout systems, professional broadcast playout systems typically implement the codec in hardware, such as by delegating the task of encoding and decoding to a compatible soundcard rather than the system CPU.
All DVD-Video players in PAL countries contain stereo MP2 decoders, making MP2 a possible competitor to Dolby Digital in these markets. DVD-Video players in NTSC countries are not required to decode MP2 audio, although most do. While some DVD recorders store audio in MP2 and many consumer-authored DVDs use the format, commercial DVDs with MP2 soundtracks are rare.
MPEG-1 Audio Layer II is the standard audio format used in the Video CD and Super Video CD formats (VCD and SVCD also support variable bit rate and MPEG Multichannel as added by MPEG-2).
MPEG-1 Audio Layer II is the standard audio format used in the MHP standard for set-top boxes.
MPEG-1 Audio Layer II is the audio format used in HDV camcorders.
MP2 files are compatible with some portable audio players.
Naming and extensions
The term MP2 and filename extension .mp2 usually refer to MPEG-1 Audio Layer II data, but can also refer to MPEG-2 Audio Layer II, a mostly backward compatible extension which adds support for multichannel audio, variable bit rate encoding, and additional sampling rates, defined in ISO/IEC 13818-3. The abbreviation MP2 is also sometimes erroneously applied to MPEG-2 video or MPEG-2 AAC audio.
Licensing
Sisvel S.p.A., a Luxembourg-based company, administers licenses for patents applying to MPEG Audio.
See also
* MPEG-1
** MPEG-1 Audio Layer I
** MPEG-1 Audio Layer III
* MPEG-2
* MP4 (container format)
* Elementary stream
* Musepack, originally MP2-based, with numerous improvements
References
Genesis of the MP3 Audio Coding Standard by Hans Georg Musmann in IEEE Transactions on Consumer Electronics, Vol. 52, Nr. 3, pp. 1043–1049, August 2006
* MUSICAM Source Coding by Yves-François Dehery, AES 10th International Conference: Kensington, London, England, (7-9 Sept 1991), pp 71–79.
External links
TooLAME – An MP2 encoder
TwoLAME – A fork of the tooLAME code
– The document defining MIME type for MPEG-1 Audio Layer II
A MPEG Audio Layer II decoder in 4k – Source code for small open source decoder.
Official MPEG web site
Patent Status of MPEG-1, H.261 and MPEG-2 – Some information about patents