HOME

TheInfoList



OR:

In
audio engineering Audio most commonly refers to sound, as it is transmitted in signal form. It may also refer to: Sound *Audio signal, an electrical representation of sound * Audio frequency, a frequency in the audio spectrum *Digital audio, representation of soun ...
, joint encoding is the joining of several channels of similar information during
encoding In communications and Data processing, information processing, code is a system of rules to convert information—such as a letter (alphabet), letter, word, sound, image, or gesture—into another form, sometimes data compression, shortened or ...
in order to obtain higher quality, a smaller file size, or both.


Joint stereo

The term joint stereo has become prominent as the
Internet The Internet (or internet) is the Global network, global system of interconnected computer networks that uses the Internet protocol suite (TCP/IP) to communicate between networks and devices. It is a internetworking, network of networks ...
has allowed for the transfer of relatively low
bit rate In telecommunications and computing, bit rate (bitrate or as a variable ''R'') is the number of bits that are conveyed or processed per unit of time. The bit rate is expressed in the unit bit per second (symbol: bit/s), often in conjunction ...
, acceptable-quality audio with modest Internet access speeds. Joint stereo refers to any number of encoding techniques used for this purpose. Two forms are described here, both of which are implemented in various ways with different
codec A codec is a computer hardware or software component that encodes or decodes a data stream or signal. ''Codec'' is a portmanteau of coder/decoder. In electronic communications, an endec is a device that acts as both an encoder and a decoder o ...
s, such as
MP3 MP3 (formally MPEG-1 Audio Layer III or MPEG-2 Audio Layer III) is a coding format for digital audio developed largely by the Fraunhofer Society in Germany under the lead of Karlheinz Brandenburg. It was designed to greatly reduce the amount ...
,
AAC AAC may refer to: Aviation * Advanced Aircraft, a company from Carlsbad, California * Airborne aircraft carrier, a type of aircraft * Alaskan Air Command, a radar network * American Aeronautical Corporation, a company from Port Washington, New ...
and
Ogg Vorbis Vorbis is a free and open-source software project headed by the Xiph.Org Foundation. The project produces an audio coding format and software reference encoder/decoder (codec) for lossy audio compression, libvorbis. Vorbis is most common ...
.


Intensity stereo coding

This form of joint stereo uses a technique known as joint frequency encoding, which functions on the principle of
sound localization Sound localization is a listener's ability to identify the location or origin of a detected sound in direction and distance. The sound localization mechanisms of the mammalian auditory system have been extensively studied. The auditory system u ...
. Human hearing is predominantly less acute at perceiving the direction of certain audio frequencies. By exploiting this characteristic, intensity stereo coding can reduce the data rate of an audio stream with little or no perceived change in apparent quality. More specifically, the dominance of ''inter-aural time differences (ITD)'' for sound localization by humans is only present for lower frequencies. That leaves ''inter-aural amplitude differences (IAD)'' as the dominant location indicator for higher frequencies (the cutoff being ~2 kHz). The idea of ''intensity stereo coding'' is to merge the lower spectrum into just one channel (thus reducing overall differences between channels) and to transmit a little side information about how to pan certain frequency regions to recover the ''IAD'' cues. ITD is not lost completely in this scheme, however: the shape of the ear makes it such that the ITD can be recovered from IAD if the sound comes from free space, e.g. played through loudspeakers. This type of coding does not perfectly reconstruct the original audio because of the loss of information which results in the simplification of the stereo image and can produce perceptible
compression artifacts A compression artifact (or artefact) is a noticeable distortion of media (including Image, images, Sound recording, audio, and video) caused by the application of lossy compression. Lossy data compression involves discarding some of the medi ...
. However, for very low bit rates this type of coding usually yields a gain in perceived quality of the audio. It is supported by many audio compression formats (including
MP3 MP3 (formally MPEG-1 Audio Layer III or MPEG-2 Audio Layer III) is a coding format for digital audio developed largely by the Fraunhofer Society in Germany under the lead of Karlheinz Brandenburg. It was designed to greatly reduce the amount ...
,
AAC AAC may refer to: Aviation * Advanced Aircraft, a company from Carlsbad, California * Airborne aircraft carrier, a type of aircraft * Alaskan Air Command, a radar network * American Aeronautical Corporation, a company from Port Washington, New ...
,
Vorbis Vorbis is a free and open-source software project headed by the Xiph.Org Foundation. The project produces an audio coding format and software reference encoder/decoder ( codec) for lossy audio compression, libvorbis. Vorbis is most comm ...
and
Opus Opus (: opera Opera is a form of History of theatre#European theatre, Western theatre in which music is a fundamental component and dramatic roles are taken by Singing, singers. Such a "work" (the literal translation of the Italian word "opera ...
) but not always by every encoder.


M/S stereo coding

M/S stereo coding transforms the left and right channels into a mid channel and a side channel. The mid channel is the sum of the left and right channels, or M = L+R. The side channel is the difference of the left and right channels, or S = L-R. Unlike intensity stereo coding, M/S coding is a special case of
transform coding Transform coding is a type of data compression for "natural" data like audio signals or photographic images. The transformation is typically lossless (perfectly reversible) on its own but is used to enable better (more targeted) quantization, whi ...
, and retains the audio perfectly without introducing artifacts. Lossless codecs such as
FLAC FLAC (; Free Lossless Audio Codec) is an audio coding format for lossless compression of digital audio, developed by the Xiph.Org Foundation, and is also the name of the free software project producing the FLAC tools, the reference software ...
or
Monkey's Audio Monkey's Audio is an algorithm and file format for lossless audio data compression. Lossless data compression does not discard data during the process of encoding, unlike lossy compression methods such as Advanced Audio Coding, MP3, Vorbis ...
use M/S stereo coding because of this characteristic. To reconstruct the original signal, the channels are either added L = \frac or subtracted R = \frac. This form of coding is also sometimes known as matrix stereo and is used in many different forms of audio processing and recording equipment. It is not limited to digital systems and can even be created with passive audio
transformer In electrical engineering, a transformer is a passive component that transfers electrical energy from one electrical circuit to another circuit, or multiple Electrical network, circuits. A varying current in any coil of the transformer produces ...
s or analog
amplifier An amplifier, electronic amplifier or (informally) amp is an electronic device that can increase the magnitude of a signal (a time-varying voltage or current). It is a two-port electronic circuit that uses electric power from a power su ...
s. One example of the use of M/S stereo is in FM stereo broadcasting, where L+R modulates the
carrier wave In telecommunications, a carrier wave, carrier signal, or just carrier, is a periodic waveform (usually sinusoidal) that conveys information through a process called ''modulation''. One or more of the wave's properties, such as amplitude or freq ...
and L-R modulates a
subcarrier A subcarrier is a sideband of a radio frequency carrier wave, which is modulated to send additional information. Examples include the provision of colour in a black and white television system or the provision of stereo in a monophonic radio bro ...
. This enables backwards compatibility with mono equipment, which will only require the mid channel. Another example of M/S stereo is the stereophonic microgroove record. Lateral motions of a stylus represent the sum of two channels and the vertical motion represents the difference between the channels; two perpendicular coils mechanically decode the channels. M/S is also a common technique for production of stereo recordings. See . M/S encoding does not strictly require that the left and right channels use the same weight. In Opus CELT, M/S encoding is combined with an angle parameter, so that different weights can be used to maximize de-correlation. A similar form of joining multiple channels is seen in the
ambisonics Ambisonics is a ''full-sphere'' surround sound format: in addition to the horizontal plane, it covers sound sources above and below the listener. Unlike some other multichannel surround formats, its transmission channels do not carry speaker ...
implementation of Opus 1.3. A matrix may be used to mix the spherical harmonic channels together, reducing redundancy.


Parametric stereo

Parametric stereo is similar to intensity stereo, except that parameters beyond the intensity difference is used. In the MPEG-4 (HE-AAC) version, the intensity difference and time delay difference are used, allowing all bands to be used without hurting localization. HE-AAC also adds "correlation" information, which replicates ambience by synthesizing some difference between channels. Binaural cue coding (BCC) is the HE-AAC PS technique extended for many input channels, all downmixing to one. The very same ILD, ITD, and IC parameters were used. MPEG Surround is similar to BCC, but allows downmixing to multiple channels, and does not seem to use ITD.


Joint frequency encoding

Joint frequency encoding is an
encoding In communications and Data processing, information processing, code is a system of rules to convert information—such as a letter (alphabet), letter, word, sound, image, or gesture—into another form, sometimes data compression, shortened or ...
technique used in
audio data compression In information theory, data compression, source coding, or bit-rate reduction is the process of encoding information using fewer bits than the original representation. Any particular compression is either lossy or lossless. Lossless compression ...
to reduce the data rate. The idea is to merge a given frequency range of multiple sound channels together so that the resulting encoding will preserve the sound information of that range not as a bundle of separate channels but as one homogeneous data stream. This will destroy the original channel separation permanently, as the information cannot be accurately reconstructed, but will greatly lessen the amount of required storage space. Only some forms of joint stereo use the joint frequency encoding technique, such as intensity stereo coding.


Implementations

When used within the MP3 compression process, joint stereo normally employs multiple techniques, and can switch between them for each MPEG frame. Typically, a modern encoder's joint stereo mode uses M/S stereo for some frames and L/R stereo for others, whichever method yields the best result. Encoders use different algorithms to determine when to switch and how much space to allocate to each channel; quality can suffer if the switching is too frequent or if the side channel doesn't get enough bits. With some encoding software, it is possible to force the use of M/S stereo for all frames, mimicking the joint stereo mode of some early encoders like
Xing Xing may refer to: * an abbreviation for crossing such as Pedestrian crossing, Pedestrian Xing or Wildlife crossing, Wildlife Xing, primarily used in North America * Chinese surname (姓, ''xing'') * Xing (surname) (邢), a Chinese surname * Xing ...
. Within the
LAME LAME is a software encoder that converts digital audio into the MP3 audio coding format. LAME is a free software project that was first released in 1998 and has incorporated many improvements since then, including an improved psychoacoustic ...
encoder, this is known as forced joint stereo. As with MP3,
Ogg Vorbis Vorbis is a free and open-source software project headed by the Xiph.Org Foundation. The project produces an audio coding format and software reference encoder/decoder (codec) for lossy audio compression, libvorbis. Vorbis is most common ...
stereo files can employ either L/R stereo or joint stereo. When using joint stereo, both M/S stereo and intensity stereo methods may be used. As opposed to MP3 where M/S stereo (when used) is applied before quantization, an Ogg Vorbis encoder applies M/S stereo to samples in the frequency domain after quantization, making application of M/S stereo a lossless step. After this step, any frequency area can be converted to intensity stereo by removing the corresponding part of the M/S signal's side channel. Ogg Vorbis' floor function will take care of the required left-right panning.
Opus Opus (: opera Opera is a form of History of theatre#European theatre, Western theatre in which music is a fundamental component and dramatic roles are taken by Singing, singers. Such a "work" (the literal translation of the Italian word "opera ...
similarly has support for all three options in the CELT layer; the SILK layer is M/S-only., §§ 4.2.1, 4.3


Notes


References

{{reflist


External links

*Jürgen Herre, Fraunhofer IIS
''From Joint Stereo to Spatial Audio Coding - Recent Progress and Standardization''.
October 2004, Paper 157, DAFx'04 7th International Conference of Digital Audio Effects. Audio engineering