HOME

TheInfoList



OR:

In
digital audio Digital audio is a representation of sound recorded in, or converted into, digital form. In digital audio, the sound wave of the audio signal is typically encoded as numerical samples in a continuous sequence. For example, in CD audio, sa ...
, 44,100  Hz (alternately represented as 44.1 kHz) is a common sampling frequency. Analog audio is often recorded by sampling it 44,100 times per second, and then these samples are used to
reconstruct Reconstruction may refer to: Politics, history, and sociology *Reconstruction (law), the transfer of a company's (or several companies') business to a new company *'' Perestroika'' (Russian for "reconstruction"), a late 20th century Soviet Unio ...
the audio signal when playing it back. The 44.1 kHz audio sampling rate is widely used due to the compact disc (CD) format, dating back to its use by Sony from 1979.


History

The 44.1 kHz sampling rate originated in the late 1970s with PCM adaptors, which recorded digital audio on video cassettes,Specifically U-matic cassettes notably the
Sony PCM-1600 A PCM adaptor is a device that encodes digital audio as video for recording on a videocassette recorder. The adapter also has the ability to decode a video signal back to digital audio for playback. This digital audio system was used for mas ...
introduced in 1979 and carried forward in subsequent models in this series. This then became the basis for Compact Disc Digital Audio (CD-DA), defined in the Red Book standard in 1980. Its use has continued as an option in 1990s standards such as the DVD, and in 2000s, standards such as HDMI. This sampling frequency is commonly used for MP3 and other consumer audio file formats which were originally created from material ripped from compact discs.


Origin

The selection of the sample rate was based primarily on the need to reproduce the audible frequency range of 20–20,000 Hz (20 kHz). The Nyquist–Shannon sampling theorem states that a sampling rate of more than twice the maximum frequency of the signal to be recorded is needed, resulting in a required rate of at least 40 kHz. The exact sampling rate of 44.1 kHz was inherited from PCM adaptors which was the most affordable way to transfer data from the recording studio to the CD manufacturer at the time the CD specification was being developed. The rate was chosen following debate between manufacturers, notably Sony and Philips, and its implementation by Sony, yielding a de facto standard. The actual choice of rate was the point of some debate, with other alternatives including 44.1 / 1.001 ≈ 44.056 kHz (corresponding to the NTSC color field rate of 60 / 1.001 = 59.94 Hz) or approximately 44 kHz, proposed by Philips. Ultimately Sony prevailed on both sample rate (44.1 kHz) and bit depth (16 bits per sample, rather than 14 bits per sample). The technical reasoning behind the rate being chosen is associated with characteristics of human hearing and early digital audio recording systems as described below.


Human hearing and signal processing

The Nyquist–Shannon sampling theorem says the sampling frequency must be greater than twice the maximum frequency one wishes to reproduce. Since human hearing range is roughly 20 Hz to 20,000 Hz, the sampling rate had to be greater than 40 kHz. In addition, signals must be low-pass filtered before sampling to avoid aliasing. While an ideal low-pass filter would perfectly pass frequencies below 20 kHz (without attenuating them) and perfectly cut off frequencies above 20 kHz, such an ideal filter is theoretically and practically impossible to implement as it is noncausal, so in practice a transition band is necessary, where frequencies are partly attenuated. The wider this transition band is, the easier and more economical it is to make an anti-aliasing filter. The 44.1 kHz sampling frequency allows for a 2.05 kHz transition band.


Recording on video equipment

Early digital audio was recorded to existing analog video cassette tapes, as VCRs were the only available transports with sufficient capacity to store meaningful lengths of digital audio. To enable reuse with minimal modification of the video equipment, these ran at the same speed as video, and used much of the same circuitry. 44.1 kHz was deemed the highest usable rate compatible with both PAL and NTSC video and requiring encoding no more than 3 samples per video line per audio channel. The sample rate is composed as follows: NTSC has 490 active lines per frame, out of 525 lines total; PAL has 588 active lines per frame, out of 625 lines total.


Related rates

44,100 is the product of the squares of the first four prime numbers (2^2 \cdot 3^2 \cdot 5^2 \cdot 7^2) and hence has many useful
integer factors In number theory, integer factorization is the decomposition of a composite number into a product of smaller integers. If these factors are further restricted to prime numbers, the process is called prime factorization. When the numbers are suf ...
. Various halvings and doublings of 44.1 kHz are used – the lower rates 11.025 kHz and 22.05 kHz are found in
WAV Waveform Audio File Format (WAVE, or WAV due to its filename extension; pronounced "wave") is an audio file format standard, developed by IBM and Microsoft, for storing an audio bitstream on PCs. It is the main format used on Microsoft Win ...
files, and are suitable for low-bandwidth applications, while the higher rates of 88.2 kHz and 176.4 kHz are used in mastering and in DVD-Audio – the higher rates are useful both for the usual reason of providing additional resolution (hence less sensitive to distortions introduced by editing), and also making the low-pass filtering easier, since a much larger transition band (between human-audible at 20 kHz and the sampling rate) is possible. The 88.2 kHz and 176.4 kHz rates are primarily used when the ultimate target is a CD.


Other rates

Several other sampling rates were also used in early digital audio. A 50 kHz sample rate, used by
Soundstream Soundstream Inc. was the first United States audiophile digital audio recording company, providing commercial services for recording and computer-based editing.Robert Easton, ''Soundstream, the first Digital Studio'', Recording Engineer/Producer, ...
in the 1970s, following a 37 kHz prototype. In the early 1980s, a 32 kHz sampling rate was used in broadcast (esp. in UK and Japan), because this is sufficient for FM stereo broadcasts, which have 15 kHz bandwidth. Some digital audio was provided for domestic use in two incompatible EIAJ formats, corresponding to 525/59.94 (44,056 Hz sampling) and 625/50 (44.1 kHz sampling). The Digital Audio Tape (DAT) format was released in 1987 with 48 kHz sampling. This sample rate has become the standard rate for professional audio. Until recently, sample rate conversion between 44,100 kHz and 48,000 kHz was complicated by the high ratio number between the rates of these as the lowest common denominator of 44,100 and 48,000 is 147:160, but with modern technology this conversion is accomplished quickly and efficiently. Early consumer DAT machines did not support 44.1 kHz and this difference made it difficult to make direct digital copies of 44.1 kHz CDs using 48 kHz DAT equipment.


Status

Due to the popularity of CDs, a great deal of 44.1 kHz equipment exists, as does a great deal of audio recorded in 44.1 kHz (or multiples thereof). However, some more recent standards use 48 kHz in addition to or instead of 44.1 kHz. In video, 48 kHz is now the standard, but for audio targeted at CDs, 44.1 kHz (and multiples) are still used. The HDMI TV standard (2003) allows both 44.1 kHz and 48 kHz (and multiples thereof). This provides compatibility with DVD players playing CD, VCD and SVCD content. The
DVD-Video DVD-Video is a consumer video format used to store digital video on DVD discs. DVD-Video was the dominant consumer home video format in Asia, North America, Europe, and Australia in the 2000s until it was supplanted by the high-definition Blu-r ...
and
Blu-ray Disc The Blu-ray Disc (BD), often known simply as Blu-ray, is a Digital media, digital optical disc data storage format. It was invented and developed in 2005 and released on June 20, 2006 worldwide. It is designed to supersede the DVD format, and c ...
standards use multiples of 48 kHz only. Most PC sound cards contain a digital-to-analog converter capable of operating natively at either 44.1 kHz or 48 kHz. Some older processors include only 44.1 kHz output, and some cheaper newer processors only include 48 kHz output, requiring the PC to perform digital sample rate conversion to output other sample rates. Similarly, cards have limitations on the sample rates they support for recording.


See also

*
Crystal oscillator frequencies Crystal oscillators can be manufactured for oscillation over a wide range of frequencies, from a few kilohertz up to several hundred megahertz. Many applications call for a crystal oscillator frequency conveniently related to some other desired f ...


Notes


References

* ''The Art of Digital Audio,'' John Watkinson, 2nd edition ** Watkinson, section 1.14: "The PCM adaptor", pp. 22–24 ** Watkinson, section 4.5: "Choice of sampling rate", pp. 207–209 ** Watkinson, section 9.2: "PCM adaptors", pp. 499–502 * * {{refend Digital audio