In
digital audio
Digital audio is a representation of sound recorded in, or converted into, digital signal (signal processing), digital form. In digital audio, the sound wave of the audio signal is typically encoded as numerical sampling (signal processing), ...
, 44,100 Hz (alternately represented as 44.1 kHz) is a common
sampling frequency
In signal processing, sampling is the reduction of a continuous-time signal to a discrete-time signal. A common example is the conversion of a sound wave to a sequence of "samples".
A sample is a value of the signal at a point in time and/or s ...
. Analog audio is often recorded by sampling it 44,100 times per second, and then these samples are used to
reconstruct
Reconstruction may refer to:
Politics, history, and sociology
*Reconstruction (law), the transfer of a company's (or several companies') business to a new company
*''Perestroika'' (Russian for "reconstruction"), a late 20th century Soviet Union ...
the audio signal when playing it back.
The audio sampling rate is widely used due to the
compact disc
The compact disc (CD) is a Digital media, digital optical disc data storage format co-developed by Philips and Sony to store and play digital audio recordings. It employs the Compact Disc Digital Audio (CD-DA) standard and was capable of hol ...
(CD) format, dating back to its use by
Sony
is a Japanese multinational conglomerate (company), conglomerate headquartered at Sony City in Minato, Tokyo, Japan. The Sony Group encompasses various businesses, including Sony Corporation (electronics), Sony Semiconductor Solutions (i ...
from 1979.
History

The 44.1 kHz sampling rate originated in the late 1970s with
PCM adaptors, which recorded digital audio on
video cassettes,
[Specifically ]U-matic
U-matic, also known as -inch Type E Helical Scan or SMPTE E, is an analog recording videocassette format developed by Sony. First shown as a prototype in October 1969 and introduced commercially in September 1971, it was among the earliest vi ...
cassettes notably the
Sony PCM-1600 introduced in 1979 and carried forward in subsequent models in this series. This then became the basis for
Compact Disc Digital Audio
Compact Disc Digital Audio (CDDA or CD-DA), also known as Digital Audio Compact Disc or simply as Audio CD, is the standard format for audio compact discs. The standard is defined in the '' Red Book'' technical specifications, which is why t ...
(CD-DA), defined in the
Red Book standard in 1980.
Its use has continued as an option in 1990s standards such as the
DVD
The DVD (common abbreviation for digital video disc or digital versatile disc) is a digital optical disc data storage format. It was invented and developed in 1995 and first released on November 1, 1996, in Japan. The medium can store any ki ...
, and in 2000s, standards such as
HDMI
High-Definition Multimedia Interface (HDMI) is a proprietary digital interface used to transmit high-quality video and audio signals between devices. It is commonly used to connect devices such as televisions, computer monitors, projectors, gam ...
. This sampling frequency is commonly used for
MP3
MP3 (formally MPEG-1 Audio Layer III or MPEG-2 Audio Layer III) is a coding format for digital audio developed largely by the Fraunhofer Society in Germany under the lead of Karlheinz Brandenburg. It was designed to greatly reduce the amount ...
and other consumer
audio file format
An audio file format is a file format for storing digital audio data on a computer system. The bit layout of the audio data (excluding metadata) is called the audio coding format and can be uncompressed, or audio compression (data), compressed t ...
s which were originally created from material
ripped from compact discs.
Origin
The selection of the sample rate was based primarily on the need to reproduce the audible frequency range of 20–20,000 Hz (20 kHz). The
Nyquist–Shannon sampling theorem
The Nyquist–Shannon sampling theorem is an essential principle for digital signal processing linking the frequency range of a signal and the sample rate required to avoid a type of distortion called aliasing. The theorem states that the sample r ...
states that a sampling rate of more than twice the maximum frequency of the signal to be recorded is needed, resulting in a required rate of greater than 40 kHz. The exact sampling rate of 44.1 kHz was inherited from PCM adaptors which was the most affordable way to transfer data from the recording studio to the CD manufacturer at the time the CD specification was being developed.
The rate was chosen following debate between manufacturers, notably
Sony
is a Japanese multinational conglomerate (company), conglomerate headquartered at Sony City in Minato, Tokyo, Japan. The Sony Group encompasses various businesses, including Sony Corporation (electronics), Sony Semiconductor Solutions (i ...
and
Philips
Koninklijke Philips N.V. (), simply branded Philips, is a Dutch multinational health technology company that was founded in Eindhoven in 1891. Since 1997, its world headquarters have been situated in Amsterdam, though the Benelux headquarter ...
, and its implementation by Sony, yielding a
de facto standard. The actual choice of rate was the point of some debate, with other alternatives including 44.1 / 1.001 ≈ 44.056 kHz (corresponding to the NTSC color field rate of 60 / 1.001 = 59.94 Hz) or approximately 44 kHz, proposed by Philips. Ultimately Sony prevailed on both sample rate (44.1 kHz) and bit depth (16 bits per sample, rather than 14 bits per sample). The technical reasoning behind the rate being chosen is associated with characteristics of human hearing and early digital audio recording systems as described below.
Human hearing and signal processing
The Nyquist–Shannon sampling theorem says the sampling frequency must be greater than twice the maximum frequency one wishes to reproduce. To capture the
human hearing range of roughly 20 Hz to 20,000 Hz, the sampling rate had to be greater than 40 kHz.
But to avoid
aliasing
In signal processing and related disciplines, aliasing is a phenomenon that a reconstructed signal from samples of the original signal contains low frequency components that are not present in the original one. This is caused when, in the ori ...
when sampling, signals must first be
bandlimited to within half the sampling frequency, which can be achieved with
low-pass filtering. While an ideal low-pass filter (a
sinc filter
In signal processing, a sinc filter can refer to either a sinc-in-time filter whose impulse response is a sinc function and whose frequency response is rectangular, or to a sinc-in-frequency filter whose impulse response is rectangular and who ...
) can perfectly pass frequencies below 20 kHz (without attenuating them) and perfectly cut frequencies above 20 kHz, this ideal filter is theoretically and practically impossible to implement as it is
noncausal, so in practice a
transition band is necessary, where frequencies are partly attenuated. The wider this transition band is, the easier and more economical it is to make an
anti-aliasing filter. The 44.1 kHz sampling frequency allows for a 2.05 kHz transition band.
Recording on video equipment
Early digital audio was recorded to existing analog video cassette tapes, as VCRs were the only available
transports with sufficient capacity to store meaningful lengths of digital audio. To enable reuse with minimal modification of the video equipment, these ran at the same speed as video, and used much of the same circuitry. 44.1 kHz was deemed the highest usable rate compatible with both
PAL
Phase Alternating Line (PAL) is a color encoding system for analog television. It was one of three major analogue colour television standards, the others being NTSC and SECAM. In most countries it was broadcast at 625 lines, 50 fields (25 ...
and
NTSC
NTSC (from National Television System Committee) is the first American standard for analog television, published and adopted in 1941. In 1961, it was assigned the designation System M. It is also known as EIA standard 170.
In 1953, a second ...
video and requiring encoding no more than 3 samples per video line per audio channel.
The sample rate is composed as follows:
NTSC has 490 active lines per frame, out of 525 lines total; PAL has 588 active lines per frame, out of 625 lines total.
Related rates
44,100 is the product of the squares of the first four prime numbers (
) and hence has many useful
integer factors.
Various halvings and doublings of 44.1 kHz are used – the lower rates 11.025 kHz and 22.05 kHz are found in
WAV files, and are suitable for low-bandwidth applications, while the higher rates of 88.2 kHz and 176.4 kHz are used in mastering and in
DVD-Audio
DVD-Audio (commonly abbreviated as DVD-A) is a digital format for delivering high-fidelity audio content on a DVD. DVD-Audio uses most of the storage on the disc for high-quality audio and is not intended to be a video delivery format.
The ...
– the higher rates are useful both for the usual reason of providing additional resolution (hence less sensitive to distortions introduced by editing), and also making the low-pass filtering easier, since a much larger transition band (between human-audible at 20 kHz and half the sampling rate) is possible. The 88.2 kHz and 176.4 kHz rates are primarily used when the ultimate target is a CD.
Other rates
Several other sampling rates were also used in early digital audio. A 50 kHz sample rate, used by
Soundstream
Soundstream Inc. was the first United States audiophile digital audio recording company, providing commercial services for recording and computer-based editing.Robert Easton, ''Soundstream, the first Digital Studio'', Recording Engineer/Producer, ...
in the 1970s, following a 37 kHz prototype. In the early 1980s, a 32 kHz sampling rate was used in broadcast (esp. in UK and Japan), because this is sufficient for
FM stereo
FM broadcasting is a method of radio broadcasting that uses frequency modulation (FM) of the radio broadcast carrier wave. Invented in 1933 by American engineer Edwin Armstrong, wide-band FM is used worldwide to transmit high-fidelity sound ...
broadcasts, which have 15 kHz bandwidth. Some digital audio was provided for domestic use in two incompatible
EIAJ formats, corresponding to 525/59.94 (44,056 Hz sampling) and 625/50 (44.1 kHz sampling).
The
Digital Audio Tape
Digital Audio Tape (DAT or R-DAT) is a signal recording and playback medium developed by Sony and introduced in 1987. In appearance it is similar to a Compact Cassette, using 3.81 mm / 0.15" (commonly referred to as 4 mm) magnetic t ...
(DAT) format was released in 1987 with 48 kHz sampling. This sample rate has become the standard rate for
professional audio
Professional audio, abbreviated as pro audio, refers to both an activity and a category of high-quality, studio-grade audio equipment. Typically it encompasses sound recording, sound reinforcement system setup and audio mixing, and studio mus ...
.
Until recently,
sample rate conversion between 44,100 Hz and
48,000 Hz was complicated by the high ratio number between the rates of these as the
lowest common denominator
In mathematics, the lowest common denominator or least common denominator (abbreviated LCD) is the lowest common multiple of the denominators of a set of fractions. It simplifies adding, subtracting, and comparing fractions.
Description
The l ...
of 44,100 and 48,000 is 147:160, but with modern technology this conversion is accomplished quickly and efficiently. Early consumer DAT machines did not support 44.1 kHz and this difference made it difficult to make direct digital copies of 44.1 kHz CDs using 48 kHz DAT equipment.
Status
Due to the popularity of CDs, a great deal of 44.1 kHz equipment exists, as does a great deal of audio recorded in 44.1 kHz (or multiples thereof). However, some more recent standards use 48 kHz in addition to or instead of 44.1 kHz.
In video, 48 kHz is now the standard, but for audio targeted at CDs, 44.1 kHz (and multiples) are still used.
The HDMI TV standard (2003) allows both 44.1 kHz and 48 kHz (and multiples thereof). This provides compatibility with DVD players playing CD,
VCD
Video CD (abbreviated as VCD, and also known as Compact Disc Digital Video), (not to be confused with CD Video which is a type of LaserDisc, Laserdisc) is a home video format and the first format for distributing films on standard optical di ...
and
SVCD content. The
DVD-Video
DVD-Video is a consumer video format used to store digital video on DVDs. DVD-Video was the dominant consumer home video format in most of the world in the 2000s. As of 2024, it competes with the high-definition Blu-ray Disc, while both rece ...
and
Blu-ray Disc
Blu-ray (Blu-ray Disc or BD) is a Digital media, digital optical disc data storage format designed to supersede the DVD format. It was invented and developed in 2005 and released worldwide on June 20, 2006, capable of storing several hours of ...
standards use multiples of 48 kHz only.
Most PC
sound card
A sound card (also known as an audio card) is an internal expansion card that provides input and output of audio signals to and from a computer under the control of computer programs. The term ''sound card'' is also applied to external audio ...
s contain a
digital-to-analog converter
In electronics, a digital-to-analog converter (DAC, D/A, D2A, or D-to-A) is a system that converts a digital signal into an analog signal. An analog-to-digital converter (ADC) performs the reverse function.
DACs are commonly used in musi ...
capable of operating natively at either 44.1 kHz or 48 kHz. Some older processors include only 44.1 kHz output, and some cheaper newer processors only include 48 kHz output, requiring the PC to perform digital sample rate conversion to output other sample rates. Similarly, cards have limitations on the sample rates they support for recording.
See also
*
Crystal oscillator frequencies
Notes
References
* ''The Art of Digital Audio,'' John Watkinson, 2nd edition
** Watkinson, section 1.14: "The PCM adaptor", pp. 22–24
** Watkinson, section 4.5: "Choice of sampling rate", pp. 207–209
** Watkinson, section 9.2: "PCM adaptors", pp. 499–502
*
*
{{refend
Digital audio