An audio coding format (sometimes called an audio compression format) is a content representation format for the storage or transmission of digital audio, such as in digital television, digital radio, and audio and video files. Examples of audio coding formats include MP3, AAC, Vorbis, FLAC, and Opus. A specific software or hardware implementation capable of audio compression and decompression to and from a specific audio coding format is called an audio codec; an example is LAME, one of several codecs that implement encoding and decoding of audio in the MP3 audio coding format in software.
Some audio coding formats are documented by a detailed technical specification document known as an audio coding specification. Some such specifications are written and approved by standardization organizations as technical standards, and are thus known as audio coding standards. The term "standard" is also sometimes used for de facto standards as well as formal standards.
Audio content encoded in a particular audio coding format is normally encapsulated within a container format. As such, the user normally doesn't have a raw AAC file, but instead has a .m4a audio file, which is an MPEG-4 Part 14 container holding AAC-encoded audio. The container also holds metadata such as title and other tags, and perhaps an index for fast seeking. A notable exception is MP3 files, which are raw audio coding without a container format. De facto standards for adding metadata tags such as title and artist to MP3s, such as ID3, are hacks which work by attaching the tags to the MP3 file (ID3v1 appends them at the end, ID3v2 places them at the start), and then relying on the MP3 player to recognize the chunk as malformed audio coding and therefore skip it. In video files with audio, the encoded audio content is bundled with video (in a video coding format) inside a multimedia container format.
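The way ID3 tags sit around raw MP3 data can be illustrated with a minimal Python sketch. It assumes the commonly documented layouts: an ID3v2 tag begins the file with the bytes "ID3" followed by a 10-byte header whose last four bytes hold a "synchsafe" size (7 useful bits per byte), and an ID3v1 tag is a fixed 128-byte block at the end of the file beginning with "TAG". The function names are hypothetical, and optional features such as the ID3v2 footer are ignored.

```python
def synchsafe_to_int(raw: bytes) -> int:
    """Decode an ID3v2 synchsafe integer: only the low 7 bits of each byte count."""
    value = 0
    for byte in raw:
        value = (value << 7) | (byte & 0x7F)
    return value


def locate_id3_tags(data: bytes) -> dict:
    """Report which ID3 tags surround the raw MP3 audio in `data` (illustrative only)."""
    result = {"id3v2_size": 0, "has_id3v1": False}
    # ID3v2: b"ID3", 2 version bytes, 1 flags byte, 4 synchsafe size bytes.
    # The stored size excludes the 10-byte header itself.
    if len(data) >= 10 and data[:3] == b"ID3":
        result["id3v2_size"] = 10 + synchsafe_to_int(data[6:10])
    # ID3v1: the last 128 bytes of the file, starting with b"TAG".
    if len(data) >= 128 and data[-128:-125] == b"TAG":
        result["has_id3v1"] = True
    return result


# A fabricated byte string shaped like "ID3v2 tag + audio + ID3v1 tag".
fake_mp3 = (
    b"ID3" + bytes([3, 0, 0]) + bytes([0, 0, 2, 1]) + b"\x00" * 257  # ID3v2
    + b"\xff\xfb" + b"\x00" * 100                                    # stand-in audio
    + b"TAG" + b"\x00" * 125                                         # ID3v1
)
tags = locate_id3_tags(fake_mp3)
```

A player that does not understand these tags sees neither block as a valid MP3 frame and skips ahead until it finds one, which is why the approach works without a container.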
An audio coding format does not dictate all algorithms used by a codec implementing the format. An important part of how lossy audio compression works is removing data in ways humans can't hear, according to a psychoacoustic model; the implementer of an encoder has some freedom in choosing which data to remove (according to their psychoacoustic model).
Lossless, lossy, and uncompressed audio coding formats
A lossless audio coding format reduces the total data needed to represent a sound but can be decoded to its original, uncompressed form. A lossy audio coding format additionally reduces the bit resolution of the sound on top of compression, which results in far less data at the cost of irretrievably lost information.
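The difference between the two can be seen in a small Python sketch. It is only an illustration under loud assumptions: general-purpose DEFLATE (via `zlib`) stands in for a real lossless audio coder, and simply discarding the low 8 bits of each sample stands in for a lossy coder, whereas real lossy formats decide what to discard using a psychoacoustic model.

```python
import math
import struct
import zlib

# One second of a 440 Hz tone as 16-bit signed PCM samples at 8 kHz.
samples = [round(12000 * math.sin(2 * math.pi * 440 * n / 8000)) for n in range(8000)]
pcm = struct.pack(f"<{len(samples)}h", *samples)

# Lossless: fewer bytes, and decoding restores the input bit for bit.
packed = zlib.compress(pcm)
restored = zlib.decompress(packed)
lossless_ok = restored == pcm

# Crude lossy step: keep only the top 8 bits of every 16-bit sample.
coarse = [s >> 8 for s in samples]
approx = [c << 8 for c in coarse]

# The discarded low bits cannot be recovered from `coarse`.
max_error = max(abs(a - b) for a, b in zip(samples, approx))
```

Here `lossless_ok` is true because the round trip is exact, while `max_error` is non-zero: the lossy version needs half as many bits per sample, but the original values are gone.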
Consumer audio is most often compressed using lossy audio codecs, as the smaller size is far more convenient for distribution. The most widely used audio coding formats are MP3 and Advanced Audio Coding (AAC), both of which are lossy formats based on the modified discrete cosine transform (MDCT) and perceptual coding algorithms.
Lossless audio coding formats such as FLAC and Apple Lossless are sometimes available, though at the cost of larger files.
Uncompressed audio formats, such as pulse-code modulation (PCM, commonly stored in .wav files), are also sometimes used. PCM was the standard format for Compact Disc Digital Audio (CDDA), before lossy compression eventually became the standard after the introduction of MP3.
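The size gap between uncompressed and lossy audio follows from simple arithmetic, sketched below in Python. The CD parameters (44.1 kHz sampling, 16 bits per sample, two channels) are the well-known CDDA values; the 128 kbit/s lossy bit rate and the three-minute duration are assumptions chosen purely for illustration, and the function names are hypothetical.

```python
def pcm_bit_rate(sample_rate_hz: int, bits_per_sample: int, channels: int) -> int:
    """Bit rate of uncompressed PCM in bits per second."""
    return sample_rate_hz * bits_per_sample * channels


def size_bytes(bit_rate: int, duration_s: int) -> int:
    """Payload size in bytes for a stream of constant bit rate."""
    return bit_rate * duration_s // 8


cd_bit_rate = pcm_bit_rate(44_100, 16, 2)      # 1_411_200 bit/s
cd_track = size_bytes(cd_bit_rate, 180)        # three minutes of CD-quality PCM
lossy_track = size_bytes(128_000, 180)         # same duration at an assumed 128 kbit/s
ratio = cd_track / lossy_track
```

Under these assumptions the PCM track occupies roughly eleven times as many bytes as the lossy one, which is the convenience argument for lossy distribution in concrete numbers.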
History
In 1950, Bell Labs filed the patent on differential pulse-code modulation (DPCM). Adaptive DPCM (ADPCM) was introduced by P. Cummiskey, Nikil S. Jayant and James L. Flanagan at Bell Labs in 1973.
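The idea behind DPCM can be sketched in a few lines of Python. This is a minimal first-order illustration, not the scheme from the patent: it assumes the predictor is simply the previously reconstructed sample and that differences are quantized with a fixed step size (ADPCM would adapt that step over time). The function names and the `step` parameter are hypothetical.

```python
def dpcm_encode(samples, step=4):
    """Encode each sample as the quantized difference from the predicted value."""
    codes = []
    prediction = 0
    for sample in samples:
        code = round((sample - prediction) / step)
        codes.append(code)
        # Track what the decoder will reconstruct, so quantization error does not accumulate.
        prediction += code * step
    return codes


def dpcm_decode(codes, step=4):
    """Rebuild the signal by accumulating the dequantized differences."""
    output = []
    value = 0
    for code in codes:
        value += code * step
        output.append(value)
    return output


signal = [0, 3, 9, 14, 12, 7, 1, -6]
decoded = dpcm_decode(dpcm_encode(signal))
worst = max(abs(a - b) for a, b in zip(signal, decoded))
```

Because neighbouring audio samples tend to be similar, the differences are small numbers that need fewer bits than the samples themselves; with `step=1` the round trip is exact, while larger steps trade accuracy for smaller codes.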
Perceptual coding was first used for speech coding compression, with linear predictive coding (LPC). Initial concepts for LPC date back to the work of Fumitada Itakura (Nagoya University) and Shuzo Saito (Nippon Telegraph and Telephone) in 1966. During the 1970s, Bishnu S. Atal and Manfred R. Schroeder at Bell Labs developed a form of LPC called adaptive predictive coding (APC), a perceptual coding algorithm that exploited the masking properties of the human ear, followed in the early 1980s by the code-excited linear prediction (CELP) algorithm, which achieved a significant compression ratio for its time. Perceptual coding is used by modern audio compression formats such as MP3 and AAC.
The discrete cosine transform (DCT), developed by Nasir Ahmed, T. Natarajan and K. R. Rao in 1974, provided the basis for the modified discrete cosine transform (MDCT). The MDCT was proposed by J. P. Princen, A. W. Johnson and A. B. Bradley in 1987, following earlier work by Princen and Bradley in 1986. The MDCT is used by modern audio compression formats such as Dolby Digital, MP3, and Advanced Audio Coding (AAC).
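What makes the MDCT attractive for audio is that blocks overlapping by half still produce only N coefficients per 2N input samples, and the aliasing introduced by each block cancels when neighbouring blocks are overlap-added. The following Python sketch illustrates this under simplifying assumptions: it uses the textbook direct-sum definition with a 1/N inverse scaling and no window function, whereas practical codecs apply windows and fast algorithms. The function names are hypothetical.

```python
import math


def mdct(block):
    """Map 2N time samples to N coefficients (direct O(N^2) evaluation)."""
    n_half = len(block) // 2
    return [
        sum(
            block[n] * math.cos(math.pi / n_half * (n + 0.5 + n_half / 2) * (k + 0.5))
            for n in range(2 * n_half)
        )
        for k in range(n_half)
    ]


def imdct(coeffs):
    """Map N coefficients back to 2N time samples that still contain aliasing."""
    n_half = len(coeffs)
    return [
        sum(
            coeffs[k] * math.cos(math.pi / n_half * (n + 0.5 + n_half / 2) * (k + 0.5))
            for k in range(n_half)
        ) / n_half
        for n in range(2 * n_half)
    ]


def overlap_add_roundtrip(signal, n_half):
    """Transform 50%-overlapped blocks and overlap-add the inverses.

    Assumes len(signal) is a multiple of n_half.
    """
    padded = [0.0] * n_half + list(signal) + [0.0] * n_half
    out = [0.0] * len(padded)
    for start in range(0, len(padded) - 2 * n_half + 1, n_half):
        block = padded[start:start + 2 * n_half]
        for i, value in enumerate(imdct(mdct(block))):
            out[start + i] += value  # aliasing from adjacent blocks cancels here
    return out[n_half:n_half + len(signal)]


signal = [1.0, -2.0, 3.5, 0.25, -1.5, 4.0, 2.0, -0.75]
rebuilt = overlap_add_roundtrip(signal, 4)
error = max(abs(a - b) for a, b in zip(signal, rebuilt))
```

In this sketch `error` is at the level of floating-point rounding. A lossy codec gets its savings by quantizing the coefficients between `mdct` and `imdct`, which is where information is actually discarded.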
List of lossy formats
General
Speech
Further information: Speech coding
* Linear predictive coding (LPC)
** Adaptive predictive coding (APC)
** Code-excited linear prediction (CELP)
** Algebraic code-excited linear prediction (ACELP)
** Relaxed code-excited linear prediction (RCELP)
** Low-delay CELP (LD-CELP)
** Adaptive Multi-Rate (used in GSM and 3GPP)
** Codec 2 (noted for its lack of patent restrictions)
** Speex (noted for its lack of patent restrictions)
* Modified discrete cosine transform (MDCT)
** AAC-LD
** Constrained Energy Lapped Transform (CELT)
** Opus (mostly for real-time applications)
List of lossless formats
* Apple Lossless (ALAC – Apple Lossless Audio Codec)
* Adaptive Transform Acoustic Coding (ATRAC), in its ATRAC Advanced Lossless variant
* Audio Lossless Coding (also known as MPEG-4 ALS)
* Direct Stream Transfer (DST)
* Dolby TrueHD
* DTS-HD Master Audio
* Free Lossless Audio Codec (FLAC)
* Lossless discrete cosine transform (LDCT)
* Meridian Lossless Packing (MLP)
* Monkey's Audio (Monkey's Audio APE)
* MPEG-4 SLS (also known as HD-AAC)
* OptimFROG
* Original Sound Quality (OSQ)
* RealPlayer (RealAudio Lossless)
* Shorten (SHN)
* TTA (True Audio Lossless)
* WavPack (WavPack lossless)
* WMA Lossless (Windows Media Lossless)
See also
* Comparison of audio coding formats
* Data compression § Audio
* Audio file format
* List of audio compression formats
References