HOME

TheInfoList



OR:

Spectral band replication (SBR) is a technology to enhance audio or
speech codec Speech coding is an application of data compression to digital audio signals containing speech. Speech coding uses speech-specific parameter estimation using audio signal processing techniques to model the speech signal, combined with generic d ...
s, especially at low
bit rate In telecommunications and computing, bit rate (bitrate or as a variable ''R'') is the number of bits that are conveyed or processed per unit of time. The bit rate is expressed in the unit bit per second (symbol: bit/s), often in conjunction ...
s and is based on harmonic redundancy in the frequency domain. It can be combined with any audio compression codec: the codec itself transmits the lower and midfrequencies of the spectrum, while SBR replicates higher frequency content by transposing up harmonics from the lower and midfrequencies at the decoder. Some guidance information for reconstruction of the high-frequency spectral envelope is transmitted as side information. When needed, it also reconstructs or adaptively mixes in noise-like information in selected frequency bands in order to faithfully replicate signals that originally contained no or fewer tonal components. The SBR idea is based on the principle that the psychoacoustic part of the human brain tends to analyse higher frequencies with less accuracy; thus harmonic phenomena associated with the spectral band replication process needs only be accurate in a perceptual sense and not technically or mathematically exact.


History and use

A Swedish company Coding Technologies (acquired by
Dolby Dolby Laboratories, Inc. (Dolby Labs or simply Dolby) is a British-American technology corporation specializing in audio noise reduction, audio encoding/compression, spatial audio, and high-dynamic-range television (HDR) imaging. Dolby li ...
in 2007) developed and pioneered the use of SBR in its MPEG-2
AAC AAC may refer to: Aviation * Advanced Aircraft, a company from Carlsbad, California * Airborne aircraft carrier, a type of aircraft * Alaskan Air Command, a radar network * American Aeronautical Corporation, a company from Port Washington, New ...
-derived codec called aacPlus, which first appeared in 2001. This codec was submitted to MPEG and formed the basis of MPEG-4 High-Efficiency AAC (HE-AAC), standardized in 2003. Lars Liljeryd, Kristofer Kjörling, and Martin Dietz received the
IEEE Masaru Ibuka Consumer Electronics Award The IEEE Masaru Ibuka Consumer Electronics Award is a Institute of Electrical and Electronics Engineers#Technical field awards, Technical Field Award of the IEEE given for outstanding contributions to Consumer electronics, consumer electronic ...
in 2013 for their work developing and marketing HE-AAC. Coding Technologies' SBR method has also been used with WMA 10 Professional to create WMA 10 Pro LBR, and with
MP3 MP3 (formally MPEG-1 Audio Layer III or MPEG-2 Audio Layer III) is a coding format for digital audio developed largely by the Fraunhofer Society in Germany under the lead of Karlheinz Brandenburg. It was designed to greatly reduce the amount ...
to create mp3PRO. HE-AAC which uses SBR is used in broadcast systems like
DAB+ Digital Audio Broadcasting (DAB) is a digital radio international standard, standard for broadcasting digital audio radio services in many countries around the world, defined, supported, marketed and promoted by the WorldDAB organisation. T ...
,
Digital Radio Mondiale Digital Radio Mondiale (DRM; ''mondiale'' being Italian and French for "worldwide") is a set of digital audio broadcasting technologies designed to work over the bands currently used for analogue radio broadcasting including AM broadcasting—p ...
(including xHE-AAC),
HD Radio HD Radio (HDR) is a trademark for in-band on-channel (IBOC) digital radio broadcast technology. HD radio generally simulcast, simulcasts an existing analog radio station in digital format with less noise and with additional text information. HD R ...
, and
XM Satellite Radio XM Satellite Radio Holdings Inc. (XM) was one of the three satellite radio ( SDARS) and online radio services in the United States and Canada, operated by Sirius XM Holdings. It provided pay-for-service radio, analogous to subscription cable ...
. If the player is not capable of using the side information that has been transmitted alongside the "normal" compressed audio data, it may still be able to play the "baseband" data (e.g. sampled at 22.05 kHz instead of 44.1 kHz) as usual, resulting in a dull (since the high frequencies are missing), but otherwise mostly acceptable sound. This is, for example, the case if an mp3PRO file is played back with MP3 software incapable of utilizing the SBR information. Opus's
CELT The Celts ( , see Names of the Celts#Pronunciation, pronunciation for different usages) or Celtic peoples ( ) were a collection of Indo-European languages, Indo-European peoples. "The Celts, an ancient Indo-European people, reached the apoge ...
part performs ''spectral folding'' on the MDCT bin level, making it a far less advanced but lower-delay technique compared to SBR. Dolby Digital Plus (E-AC3) performs ''Spectral Extension'' (SPX). SPX reduces high-frequency components to metadata and is similar to E-AC3 multichannel coupling calculation.
Dolby AC-4 Dolby AC-4 is an audio compression technology developed by Dolby Laboratories. Dolby AC-4 bitstreams can contain audio channels and/or audio objects. Dolby AC-4 has been adopted by the DVB project and standardized by the ETSI. History Its develop ...
expands the technique to Advanced Spectral Extension (A-SPX), with the option of interleaving with regular, non-extended data in time or frequency domain. As a result, SPX can be selective disabled for difficult portions.


Methods

Encoding of SBR produces a downsampled (usually 2:1) audio signal and guidance information. In an early publication, the guiding data is described as being produced by
quadrature mirror filter In digital signal processing, a quadrature mirror filter is a filter whose magnitude response is the mirror image around \pi/2 of that of another filter. Together these filters, first introduced by Croisier et al., are known as the quadrature mirror ...
(QMF) analysis and an envelope estimator. Decoding of SBR requires transposing harmonics, a case of audio time stretching and pitch scaling. * A traditional approach starts with small intervals of
discrete fourier transform In mathematics, the discrete Fourier transform (DFT) converts a finite sequence of equally-spaced Sampling (signal processing), samples of a function (mathematics), function into a same-length sequence of equally-spaced samples of the discre ...
(DFT), phase adjustments, IDFT, and ends with overlap-add. This method is sensitive to transient signals which can cause echos, requiring some padding (50% in USAC) in the DFT. * A newer approach is the QMF. One single filter bank can perform a whole time-stretch and pitch-scale operation for lower computational complexity.


See also

*
MPEG-4 Part 3 MPEG-4 Part 3 or MPEG-4 Audio (formally ISO/ IEC 14496-3) is the third part of the ISO/ IEC MPEG-4 international standard developed by Moving Picture Experts Group. It specifies audio coding methods. The first version of ISO/IEC 14496-3 was publis ...
*
Psychoacoustics Psychoacoustics is the branch of psychophysics involving the scientific study of the perception of sound by the human auditory system. It is the branch of science studying the psychological responses associated with sound including noise, speech, ...
* Spectral bands


External links

* Coding Technologies page describing SBR, as it appeared in 2007 at the Dolby acquisition


References

Audio codecs {{Sound-tech-stub