
The Adaptive Multi-Rate (AMR, AMR-NB or GSM-AMR) audio codec is an audio compression format optimized for speech coding. The AMR speech codec is a multi-rate narrowband speech codec that encodes narrowband (200–3400 Hz) signals at variable bit rates ranging from 4.75 to 12.2 kbit/s, with toll-quality speech starting at 7.4 kbit/s. AMR was adopted as the standard speech codec by 3GPP in October 1999 and is now widely used in GSM and UMTS. It uses link adaptation to select one of eight different bit rates based on link conditions.

AMR is also a file format for storing spoken audio using the AMR codec. Many modern mobile telephone handsets can store short audio recordings in the AMR format, and both free and proprietary programs exist (see Software support) to convert between this and other formats, although AMR is a speech format and is unlikely to give ideal results for other audio. The common filename extension is .amr. Another storage format for AMR suits applications with more advanced demands on the storage format, such as random access or synchronization with video: the 3GPP-specified 3GP container format, which is based on the ISO base media file format.
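As an illustration of the .amr storage format, the sketch below checks the magic string at the start of a file. It assumes the single-channel headers defined in the RTP payload and file storage format specification listed under External links (`#!AMR\n` for AMR, `#!AMR-WB\n` for AMR-WB). The function name `sniff_amr` is hypothetical and chosen only for this example.

```python
from typing import Optional

# Magic strings for single-channel files, as defined in the AMR file
# storage format; multi-channel variants are not handled in this sketch.
AMR_NB_MAGIC = b"#!AMR\n"
AMR_WB_MAGIC = b"#!AMR-WB\n"


def sniff_amr(data: bytes) -> Optional[str]:
    """Return 'amr-nb', 'amr-wb', or None based on the leading bytes."""
    if data.startswith(AMR_NB_MAGIC):
        return "amr-nb"
    if data.startswith(AMR_WB_MAGIC):
        return "amr-wb"
    return None


if __name__ == "__main__":
    print(sniff_amr(b"#!AMR\n" + b"\x00" * 8))
    print(sniff_amr(b"RIFF...."))
```

A 3GP container would not match either magic string, since it follows the ISO base media file format instead.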


Usage

The frames contain 160 samples and are 20 milliseconds long. AMR uses various techniques, such as ACELP, DTX, VAD and CNG. Using AMR requires optimized link adaptation that selects the best codec mode to meet the local radio channel and capacity requirements. If the radio conditions are bad, source coding is reduced and channel coding is increased. This improves the quality and robustness of the network connection while sacrificing some voice clarity. In the particular case of AMR, this improvement is around S/N = 4–6 dB for usable communication. Link adaptation also allows the network operator to prioritize capacity or quality per base station. There are 14 modes of the AMR codec in total: eight are available in a full-rate (FR) channel and six in a half-rate (HR) channel.
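The trade-off between source coding and channel coding can be sketched as follows. The eight full-rate modes and the 22.8 kbit/s gross rate of a GSM full-rate traffic channel are real figures; the carrier-to-interference thresholds and the function names are hypothetical, since actual switching thresholds are set by the network and are not given here.

```python
import bisect

# Source bit rates of the eight AMR modes on a full-rate channel (kbit/s).
AMR_FR_MODES_KBPS = [4.75, 5.15, 5.90, 6.70, 7.40, 7.95, 10.2, 12.2]

# Gross bit rate of a GSM full-rate traffic channel (kbit/s).
FR_GROSS_KBPS = 22.8

# HYPOTHETICAL switching thresholds in dB: seven boundaries separate
# eight modes. Real networks configure their own values.
HYPOTHETICAL_THRESHOLDS_DB = [4.0, 5.5, 7.0, 8.5, 10.0, 11.5, 13.0]


def select_mode(link_quality_db: float) -> float:
    """Pick a source rate: worse links get a lower-rate mode."""
    index = bisect.bisect_right(HYPOTHETICAL_THRESHOLDS_DB, link_quality_db)
    return AMR_FR_MODES_KBPS[index]


def channel_coding_kbps(source_kbps: float) -> float:
    """Bits per second left for error protection on a full-rate channel."""
    return round(FR_GROSS_KBPS - source_kbps, 2)


if __name__ == "__main__":
    for quality in (3.0, 9.0, 20.0):
        mode = select_mode(quality)
        print(quality, mode, channel_coding_kbps(mode))
```

On a poor link the sketch selects 4.75 kbit/s, leaving most of the channel for error protection; on a good link it selects 12.2 kbit/s, which leaves less redundancy but gives clearer speech.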


Features

* Sampling frequency 8 kHz/13-bit (160 samples for 20 ms frames), filtered to 200–3400 Hz.
* The AMR codec uses eight source codecs with bit rates of 12.2, 10.2, 7.95, 7.40, 6.70, 5.90, 5.15 and 4.75 kbit/s.
* Generates frame lengths of 95, 103, 118, 134, 148, 159, 204, or 244 bits for AMR FR bit rates of 4.75, 5.15, 5.90, 6.70, 7.40, 7.95, 10.2, or 12.2 kbit/s, respectively. AMR HR frame lengths are different.
* AMR utilizes discontinuous transmission (DTX), with voice activity detection (VAD) and comfort noise generation (CNG), to reduce bandwidth usage during silence periods.
* Algorithmic delay is 20 ms per frame. The 12.2 kbit/s mode has no algorithmic look-ahead delay; the other modes have a look-ahead delay of 5 ms. The 12.2 kbit/s mode nevertheless adds a 5 ms "dummy" look-ahead delay to allow seamless frame-wise mode switching with the other rates.
* AMR is a hybrid speech coder, and as such transmits both speech parameters and a waveform signal.
** Linear predictive coding (LPC) is used to synthesize the speech from a residual waveform. The LPC parameters are encoded as line spectral pairs (LSP).
** The residual waveform is coded using algebraic code-excited linear prediction (ACELP).
* The complexity of the algorithm is rated at 5, using a relative scale where G.711 is 1 and G.729a is 15.
* PSQM testing under ideal conditions yields mean opinion scores of 4.14 for AMR (12.2 kbit/s), compared to 4.45 for G.711 (μ-law).
* PSQM testing under network stress yields mean opinion scores of 3.79 for AMR (12.2 kbit/s), compared to 4.13 for G.711 (μ-law).
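The sample count and the full-rate frame lengths listed above follow directly from the 8 kHz sampling rate, the 20 ms frame duration and each mode's bit rate. The short sketch below reproduces that arithmetic; the function names are chosen for this example only.

```python
SAMPLE_RATE_HZ = 8000
FRAME_DURATION_MS = 20

# AMR full-rate source bit rates in kbit/s.
AMR_FR_MODES_KBPS = [4.75, 5.15, 5.90, 6.70, 7.40, 7.95, 10.2, 12.2]


def samples_per_frame() -> int:
    """Samples in one frame: 8000 samples/s over 20 ms."""
    return SAMPLE_RATE_HZ * FRAME_DURATION_MS // 1000


def bits_per_frame(rate_kbps: float) -> int:
    """Speech bits in one frame: kbit/s multiplied by ms gives bits."""
    return round(rate_kbps * FRAME_DURATION_MS)


if __name__ == "__main__":
    print("samples per frame:", samples_per_frame())
    for rate in AMR_FR_MODES_KBPS:
        print(f"{rate:5.2f} kbit/s -> {bits_per_frame(rate)} bits")
```

These counts cover only the speech bits of each frame; any header or padding added by a storage or transport format is outside this calculation.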


Licensing and patent issues

AMR codecs incorporate several patents of Nokia, Ericsson, NTT and VoiceAge, the last one being the License Administrator for the AMR patent pools. VoiceAge also accepts submission of patents for determination of their possible essentiality to these standards. However, it is very difficult to determine whether any patents actually exist for the claimed inventions related to the AMR/AMR-WB codecs, since the patents related to AMR/AMR-WB technology are hard to find. As a result, they are apparently hidden from other researchers and from the general public, who could perhaps identify prior art in the inventions claimed by the patent holders of the AMR/AMR-WB codecs.

The initial fee for professional content creation tools and "real-time channel" products is US$6,500. The minimum annual royalty is $10,000, which, in the first year, excludes the initial fee. Per-channel license fees fall from $0.99 to $0.50 with volume, up to a maximum of $2 million annually. In the category of personal computer products, e.g., media players, the AMR decoder is licensed for free. The license fee for a sold encoder falls from $0.40 to $0.30 with volume, up to a maximum of $300,000 annually. The minimum annual royalty is not applied to licensed products that fall under the category of personal computer products and use only the free decoder.

More information:
* VoiceAge licensing information, including pricing to license the AMR codecs
* AMR Codecs as Shared Libraries: legal notices for usage of amrnb and amrwb libraries based on the reference implementation


Software support

* 3GPP TS 26.073: AMR speech codec (C source code), reference implementation
* Audacity (beta version 1.3) via the FFmpeg integration libraries (both input and output format; retrieved on 2010-02-28)
* FFmpeg with OpenCORE AMR libraries (FFmpeg General Documentation, AMR external library; retrieved on 2009-07-08)
* Android (Android AMR codecs; retrieved on 2009-07-08). Used for voice recorder.
* AMR Codecs as Shared Libraries: amrnb and amrwb libraries development site. These libraries are based on the reference implementation and were created to prevent embedding of possibly patented source code into many open source projects.
* Open source software to convert the .amr format: RetroCode and Amr2Wav (both are in an early developmental stage)
* AMR Player is freeware that plays AMR audio files and can convert AMR from/to MP3/WAV audio format.
* Nokia's conversion tool can create both .amr and .awb files. It works in Windows 7 as well if the setup is run in XP compatibility mode.
* MPlayer (SMPlayer, KMPlayer; KMPlayer Internal Audio Decoder Preferences, retrieved 2014-10-22)
* Parole Media Player 0.8.1 (in Ubuntu 16.04)
* QuickTime Player and multimedia framework
* RealPlayer version 11 and later
* VLC media player version 1.1.0 and later (input format only, not output format)
* ffdshow
* Apple iPhone (can play back AMR files)
* iOS & macOS (iMessage)
* BlackBerry smartphones (used for voice recorder file format, while BlackBerry 10 cannot play AMR format)
* K-Lite Codec Pack
* Media Player Classic Home Cinema, around 1.7.1
* foobar2000 with the component foo_input_amr
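As an example of the FFmpeg route listed above, the sketch below builds an FFmpeg command line that encodes a recording to AMR with the OpenCORE encoder. It assumes an FFmpeg build compiled with `libopencore_amrnb` support; the helper name `amr_encode_command` and the file names are hypothetical. The command resamples to 8 kHz mono, which matches the narrowband input that AMR expects.

```python
import shutil
import subprocess
from typing import List


def amr_encode_command(src: str, dst: str, bitrate: str = "12.2k") -> List[str]:
    """Build an ffmpeg argument list for AMR narrowband encoding."""
    return [
        "ffmpeg",
        "-i", src,                     # input file in any format ffmpeg can read
        "-ar", "8000",                 # resample to 8 kHz
        "-ac", "1",                    # downmix to mono
        "-c:a", "libopencore_amrnb",   # OpenCORE AMR-NB encoder
        "-b:a", bitrate,               # one of the eight AMR bit rates
        dst,
    ]


if __name__ == "__main__":
    command = amr_encode_command("input.wav", "output.amr")
    print(" ".join(command))
    # Only run the command when ffmpeg is actually installed.
    if shutil.which("ffmpeg"):
        subprocess.run(command, check=False)
```

Building the argument list separately from running it makes the command easy to inspect or log before any file is written.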


See also

* Adaptive Multi-Rate Wideband (AMR-WB)
* Extended Adaptive Multi-Rate – Wideband (AMR-WB+)
* Half Rate
* Full Rate
* Enhanced Full Rate (EFR)
* Sampling rate
* IS-641
* 3GP
* Comparison of audio coding formats
* RTP audio video profile


External links


* 3GPP TS 26.090: Mandatory Speech Codec speech processing functions; Adaptive Multi-Rate (AMR) speech codec; Transcoding functions
* http://www.3gpp.org/ftp/Specs/html-info/26-series.htm 3GPP codecs specifications; 3G and beyond / GSM, 26 series
* RFC 4867: RTP Payload Format and File Storage Format for the Adaptive Multi-Rate (AMR) and Adaptive Multi-Rate Wideband (AMR-WB) Audio Codecs
* RFC 4281: The Codecs Parameter for "Bucket" Media Types