HOME

TheInfoList



OR:

Digital audio is a representation of
sound In physics, sound is a vibration that propagates as an acoustic wave, through a transmission medium such as a gas, liquid or solid. In human physiology and psychology, sound is the ''reception'' of such waves and their ''perception'' by the ...
recorded in, or converted into, digital form. In digital audio, the
sound wave In physics, sound is a vibration that propagates as an acoustic wave, through a transmission medium such as a gas, liquid or solid. In human physiology and psychology, sound is the ''reception'' of such waves and their ''perception'' by the ...
of the
audio signal An audio signal is a representation of sound, typically using either a changing level of electrical voltage for analog signals, or a series of binary numbers for digital signals. Audio signals have frequencies in the audio frequency range of r ...
is typically encoded as numerical samples in a continuous sequence. For example, in CD audio, samples are taken 44,100 times per second, each with 16-bit sample depth. Digital audio is also the name for the entire technology of
sound recording and reproduction Sound recording and reproduction is the electrical, mechanical, electronic, or digital inscription and re-creation of sound waves, such as spoken voice, singing, instrumental music, or sound effects. The two main classes of sound recording te ...
using audio signals that have been encoded in digital form. Following significant advances in digital audio technology during the 1970s and 1980s, it gradually replaced analog audio technology in many areas of
audio engineering Audio most commonly refers to sound, as it is transmitted in signal form. It may also refer to: Sound * Audio signal, an electrical representation of sound *Audio frequency, a frequency in the audio spectrum * Digital audio, representation of sou ...
,
record production A record producer is a recording project's creative and technical leader, commanding studio time and coaching artists, and in popular genres typically creates the song's very sound and structure.Virgil Moorefield"Introduction" ''The Producer as ...
and
telecommunications Telecommunication is the transmission of information by various types of technologies over wire, radio, optical, or other electromagnetic systems. It has its origin in the desire of humans for communication over a distance greater than that fe ...
in the 1990s and 2000s In a digital audio system, an analog electrical signal representing the sound is converted with an
analog-to-digital converter In electronics, an analog-to-digital converter (ADC, A/D, or A-to-D) is a system that converts an analog signal, such as a sound picked up by a microphone or light entering a digital camera, into a digital signal. An ADC may also provide ...
(ADC) into a digital signal, typically using
pulse-code modulation Pulse-code modulation (PCM) is a method used to digitally represent sampled analog signals. It is the standard form of digital audio in computers, compact discs, digital telephony and other digital audio applications. In a PCM stream, the ...
(PCM). This digital signal can then be recorded, edited, modified, and copied using
computer A computer is a machine that can be programmed to Execution (computing), carry out sequences of arithmetic or logical operations (computation) automatically. Modern digital electronic computers can perform generic sets of operations known as C ...
s, audio playback machines, and other digital tools. For playback, a
digital-to-analog converter In electronics, a digital-to-analog converter (DAC, D/A, D2A, or D-to-A) is a system that converts a digital signal into an analog signal. An analog-to-digital converter (ADC) performs the reverse function. There are several DAC architec ...
(DAC) performs the reverse process, converting a digital signal back into an analog signal, which is then sent through an
audio power amplifier An audio power amplifier (or power amp) is an electronic amplifier that amplifies low-power electronic audio signals, such as the signal from a radio receiver or an electric guitar pickup (music technology), pickup, to a level that is high e ...
and ultimately to a
loudspeaker A loudspeaker (commonly referred to as a speaker or speaker driver) is an electroacoustic transducer that converts an electrical audio signal into a corresponding sound. A ''speaker system'', also often simply referred to as a "speaker" or " ...
. Digital audio systems may include
compression Compression may refer to: Physical science *Compression (physics), size reduction due to forces *Compression member, a structural element such as a column *Compressibility, susceptibility to compression * Gas compression *Compression ratio, of a ...
, storage,
processing Processing is a free graphical library and integrated development environment (IDE) built for the electronic arts, new media art, and visual design communities with the purpose of teaching non-programmers the fundamentals of computer programming ...
, and
transmission Transmission may refer to: Medicine, science and technology * Power transmission ** Electric power transmission ** Propulsion transmission, technology allowing controlled application of power *** Automatic transmission *** Manual transmission *** ...
components. Conversion to a digital format allows convenient manipulation, storage, transmission, and retrieval of an audio signal. Unlike analog audio, in which making copies of a recording results in
generation loss Generation loss is the loss of quality between subsequent copies or transcodes of data. Anything that reduces the quality of the representation when copying, and would cause further reduction in quality on making a copy of the copy, can be consid ...
and degradation of signal quality, digital audio allows an infinite number of copies to be made without any degradation of signal quality.


Overview

Digital audio technologies are used in the recording, manipulation, mass-production, and distribution of sound, including recordings of
song A song is a musical composition intended to be performed by the human voice. This is often done at distinct and fixed pitches (melodies) using patterns of sound and silence. Songs contain various forms, such as those including the repetitio ...
s, instrumental pieces,
podcast A podcast is a program made available in digital format for download over the Internet. For example, an episodic series of digital audio or video files that a user can download to a personal device to listen to at a time of their choosing ...
s, sound effects, and other sounds. Modern online music distribution depends on digital recording and
data compression In information theory, data compression, source coding, or bit-rate reduction is the process of encoding information using fewer bits than the original representation. Any particular compression is either lossy or lossless. Lossless compression ...
. The availability of music as data files, rather than as physical objects, has significantly reduced the costs of distribution as well as made it easier to share copies. Before digital audio, the music industry distributed and sold music by selling physical copies in the form of
records A record, recording or records may refer to: An item or collection of data Computing * Record (computer science), a data structure ** Record, or row (database), a set of fields in a database related to one entity ** Boot sector or boot record, ...
and
cassette tape The Compact Cassette or Musicassette (MC), also commonly called the tape cassette, cassette tape, audio cassette, or simply tape or cassette, is an analog magnetic tape recording format for audio recording and playback. Invented by Lou Ottens ...
s. With digital-audio and online distribution systems such as
iTunes iTunes () is a software program that acts as a media player, media library, mobile device management utility, and the client app for the iTunes Store. Developed by Apple Inc., it is used to purchase, play, download, and organize digital mul ...
, companies sell digital sound files to consumers, which the consumer receives over the Internet. Popular streaming services such as
Apple Music Apple Music is a music, audio and video streaming service developed by Apple Inc. Users select music to stream to their device on-demand, or they can listen to existing playlists. The service also includes the Internet radio stations Apple M ...
,
Spotify Spotify (; ) is a proprietary Swedish audio streaming and media services provider founded on 23 April 2006 by Daniel Ek and Martin Lorentzon. It is one of the largest music streaming service providers, with over 456 million monthly active us ...
, or
Youtube YouTube is a global online video platform, online video sharing and social media, social media platform headquartered in San Bruno, California. It was launched on February 14, 2005, by Steve Chen, Chad Hurley, and Jawed Karim. It is owned by ...
, offer temporary access to the digital file, and are now the most common form of music consumption An analog audio system converts physical waveforms of sound into electrical representations of those waveforms by use of a
transducer A transducer is a device that converts energy from one form to another. Usually a transducer converts a signal in one form of energy to a signal in another. Transducers are often employed at the boundaries of automation, measurement, and contr ...
, such as a
microphone A microphone, colloquially called a mic or mike (), is a transducer that converts sound into an electrical signal. Microphones are used in many applications such as telephones, hearing aids, public address systems for concert halls and public ...
. The sounds are then stored on an analog medium such as
magnetic tape Magnetic tape is a medium for magnetic storage made of a thin, magnetizable coating on a long, narrow strip of plastic film. It was developed in Germany in 1928, based on the earlier magnetic wire recording from Denmark. Devices that use magne ...
, or transmitted through an analog medium such as a
telephone line A telephone line or telephone circuit (or just line or circuit industrywide) is a single-user circuit on a telephone communication system. It is designed to reproduce speech of a quality that is understandable. It is the physical wire or ot ...
or
radio Radio is the technology of signaling and communicating using radio waves. Radio waves are electromagnetic waves of frequency between 30 hertz (Hz) and 300 gigahertz (GHz). They are generated by an electronic device called a transmit ...
. The process is reversed for reproduction: the electrical audio signal is amplified and then converted back into physical waveforms via a
loudspeaker A loudspeaker (commonly referred to as a speaker or speaker driver) is an electroacoustic transducer that converts an electrical audio signal into a corresponding sound. A ''speaker system'', also often simply referred to as a "speaker" or " ...
. Analog audio retains its fundamental wave-like characteristics throughout its storage, transformation, duplication, and amplification. Analog audio signals are susceptible to noise and distortion, due to the innate characteristics of electronic circuits and associated devices. Disturbances in a digital system do not result in error unless they are so large as to result in a symbol being misinterpreted as another symbol or disturb the sequence of symbols. It is therefore generally possible to have an entirely error-free digital audio system in which no noise or distortion is introduced between conversion to digital format and conversion back to analog. A digital audio signal may be encoded for correction of any errors that might occur in the storage or transmission of the signal. This technique, known as
channel coding In computing, telecommunication, information theory, and coding theory, an error correction code, sometimes error correcting code, (ECC) is used for controlling errors in data over unreliable or noisy communication channels. The central idea is ...
, is essential for broadcast or recorded digital systems to maintain bit accuracy.
Eight-to-fourteen modulation Eight-to-fourteen modulation (EFM) is a data encoding technique – formally, a ''line code'' – used by compact discs (CD), laserdiscs (LD) and pre- Hi-MD MiniDiscs. EFMPlus is a related code, used in DVDs and Super Audio CDs (SACDs). EFM and ...
is the channel code used for the audio
compact disc The compact disc (CD) is a Digital media, digital optical disc data storage format that was co-developed by Philips and Sony to store and play digital audio recordings. In August 1982, the first compact disc was manufactured. It was then rele ...
(CD).


Conversion process

If an audio signal is analog, a digital audio system starts with an ADC that converts an analog signal to a digital signal.Some audio signals such as those created by
digital synthesis A digital synthesizer is a synthesizer that uses digital signal processing (DSP) techniques to make musical sounds. This in contrast to older analog synthesizers, which produce music using analog electronics, and samplers, which play back digital ...
originate entirely in the digital domain, in which case analog to digital conversion does not take place.
The ADC runs at a specified
sampling rate In signal processing, sampling is the reduction of a continuous-time signal to a discrete-time signal. A common example is the conversion of a sound wave to a sequence of "samples". A sample is a value of the signal at a point in time and/or spac ...
and converts at a known bit resolution. CD audio, for example, has a sampling rate of 44.1 
kHz The hertz (symbol: Hz) is the unit of frequency in the International System of Units (SI), equivalent to one event (or cycle) per second. The hertz is an SI derived unit whose expression in terms of SI base units is s−1, meaning that on ...
(44,100 samples per second), and has 16-bit
resolution Resolution(s) may refer to: Common meanings * Resolution (debate), the statement which is debated in policy debate * Resolution (law), a written motion adopted by a deliberative body * New Year's resolution, a commitment that an individual mak ...
for each
stereo Stereophonic sound, or more commonly stereo, is a method of sound reproduction that recreates a multi-directional, 3-dimensional audible perspective. This is usually achieved by using two independent audio channels through a configuration ...
channel. Analog signals that have not already been
bandlimited Bandlimiting is the limiting of a signal's frequency domain representation or spectral density to zero above a certain finite frequency. A band-limited signal is one whose Fourier transform or spectral density has bounded support. A bandli ...
must be passed through an
anti-aliasing filter An anti-aliasing filter (AAF) is a filter used before a signal sampler to restrict the bandwidth of a signal to satisfy the Nyquist–Shannon sampling theorem over the band of interest. Since the theorem states that unambiguous reconstruction ...
before conversion, to prevent the aliasing distortion that is caused by audio signals with frequencies higher than the
Nyquist frequency In signal processing, the Nyquist frequency (or folding frequency), named after Harry Nyquist, is a characteristic of a sampler, which converts a continuous function or signal into a discrete sequence. In units of cycles per second ( Hz), it ...
(half the sampling rate). A digital audio signal may be stored or transmitted. Digital audio can be stored on a CD, a
digital audio player A portable media player (PMP) (also including the related digital audio player (DAP)) is a portable consumer electronics device capable of storing and playing digital media such as audio, images, and video files. The data is typically stored o ...
, a
hard drive A hard disk drive (HDD), hard disk, hard drive, or fixed disk is an electro-mechanical data storage device that stores and retrieves digital data using magnetic storage with one or more rigid rapidly rotating platters coated with magnet ...
, a
USB flash drive A USB flash drive (also called a thumb drive) is a data storage device that includes flash memory with an integrated USB interface. It is typically removable, rewritable and much smaller than an optical disc. Most weigh less than . Since firs ...
, or any other digital
data storage device Data storage is the recording (storing) of information (data) in a storage medium. Handwriting, phonographic recording, magnetic tape, and optical discs are all examples of storage media. Biological molecules such as RNA and DNA are conside ...
. The digital signal may be altered through
digital signal processing Digital signal processing (DSP) is the use of digital processing, such as by computers or more specialized digital signal processors, to perform a wide variety of signal processing operations. The digital signals processed in this manner are ...
, where it may be
filter Filter, filtering or filters may refer to: Science and technology Computing * Filter (higher-order function), in functional programming * Filter (software), a computer program to process a data stream * Filter (video), a software component tha ...
ed or have
effect Effect may refer to: * A result or change of something ** List of effects ** Cause and effect, an idiom describing causality Pharmacy and pharmacology * Drug effect, a change resulting from the administration of a drug ** Therapeutic effect, a ...
s applied.
Sample-rate conversion Sample-rate conversion, sampling-frequency conversion or resampling is the process of changing the sampling rate or sampling frequency of a discrete signal to obtain a new discrete representation of the underlying continuous signal. Application ar ...
including
upsampling In digital signal processing, upsampling, expansion, and interpolation are terms associated with the process of resampling in a multi-rate digital signal processing system. ''Upsampling'' can be synonymous with ''expansion'', or it can describe an ...
and
downsampling In digital signal processing, downsampling, compression, and decimation are terms associated with the process of ''resampling'' in a multi-rate digital signal processing system. Both ''downsampling'' and ''decimation'' can be synonymous with ''comp ...
may be used to change signals that have been encoded with a different sampling rate to a common sampling rate prior to processing. Audio data compression techniques, such as
MP3 MP3 (formally MPEG-1 Audio Layer III or MPEG-2 Audio Layer III) is a coding format for digital audio developed largely by the Fraunhofer Society in Germany, with support from other digital scientists in the United States and elsewhere. Origin ...
,
Advanced Audio Coding Advanced Audio Coding (AAC) is an audio coding standard for lossy digital audio compression. Designed to be the successor of the MP3 format, AAC generally achieves higher sound quality than MP3 encoders at the same bit rate. AAC has been stan ...
,
Ogg Vorbis Vorbis is a free and open-source software project headed by the Xiph.Org Foundation. The project produces an audio coding format and software reference encoder/decoder (codec) for lossy audio compression. Vorbis is most commonly used in conjun ...
, or
FLAC FLAC (; Free Lossless Audio Codec) is an audio coding format for lossless compression of digital audio, developed by the Xiph.Org Foundation, and is also the name of the free software project producing the FLAC tools, the reference software p ...
, are commonly employed to reduce the file size. Digital audio can be carried over
digital audio interface Audio connectors and video connectors are electrical or optical connectors for carrying audio or video signals. Audio interfaces or video interfaces define physical parameters and interpretation of signals. For digital audio and digital video, ...
s such as
AES3 AES3 is a standard for the exchange of digital audio signals between professional audio devices. An AES3 signal can carry two channels of pulse-code-modulated digital audio over several transmission media including balanced lines, unbalanced l ...
or
MADI Madi may refer to: Places * Madi, Chitwan, a municipality in Chitwan District in Nepal * Madi Municipality, Sankhuwasabha, a municipality in Sankhuwasabha District in Nepal * Madi Rural Municipality, Rolpa, a rural municipality in Rolpa Distr ...
. Digital audio can be carried over a network using
audio over Ethernet In audio and broadcast engineering, Audio over Ethernet (sometimes AoE—not to be confused with ATA over Ethernet) is the use of an Ethernet-based network to distribute real-time digital audio. AoE replaces bulky snake cables or audio-specif ...
,
audio over IP Audio over IP (AoIP) is the distribution of digital audio across an IP network such as the Internet. It is used increasingly to provide high-quality audio feeds over long distances. The application is also known as audio contribution over IP (ACI ...
or other
streaming media Streaming media is multimedia that is delivered and consumed in a continuous manner from a source, with little or no intermediate storage in network elements. ''Streaming'' refers to the delivery method of content, rather than the content it ...
standards and systems. For playback, digital audio must be converted back to an analog signal with a DAC. According to the
Nyquist–Shannon sampling theorem The Nyquist–Shannon sampling theorem is a theorem in the field of signal processing which serves as a fundamental bridge between continuous-time signals and discrete-time signals. It establishes a sufficient condition for a sample rate that pe ...
, with some practical and theoretical restrictions, a band-limited version of the original analog signal can be accurately reconstructed from the digital signal. During conversion, audio data can be embedded with a
digital watermark A digital watermark is a kind of marker covertly embedded in a noise-tolerant signal such as audio, video or image data. It is typically used to identify ownership of the copyright of such signal. "Watermarking" is the process of hiding digital inf ...
to prevent piracy and unauthorized use. Watermarking is done using a direct-sequence spread-spectrum (DSSS) method. The audio information is then modulated by a pseudo-noise (PN) sequence, then shaped within the frequency domain and put back in the original signal. The strength of the embedding determines the strength of the watermark on the audio data.


History


Coding

Pulse-code modulation Pulse-code modulation (PCM) is a method used to digitally represent sampled analog signals. It is the standard form of digital audio in computers, compact discs, digital telephony and other digital audio applications. In a PCM stream, the ...
(PCM) was invented by British scientist
Alec Reeves Alec Harley Reeves (10 March 1902 – 13 October 1971) was a British scientist best known for his invention of pulse-code modulation (PCM). He was awarded 82 patents. Early life Alec Reeves was born in Redhill, Surrey in 1902 and was educated a ...
in 1937. In 1950,
C. Chapin Cutler Cassius Chapin Cutler (December 16, 1914 – December 1, 2002) was an American electrical engineer at Bell Labs. His notable achievements include the invention of the corrugated waveguide and differential pulse-code modulation (DPCM). Biogr ...
of
Bell Labs Nokia Bell Labs, originally named Bell Telephone Laboratories (1925–1984), then AT&T Bell Laboratories (1984–1996) and Bell Labs Innovations (1996–2007), is an American industrial research and scientific development company owned by mult ...
filed the patent on
differential pulse-code modulation Differential pulse-code modulation (DPCM) is a signal encoder that uses the baseline of pulse-code modulation (PCM) but adds some functionalities based on the prediction of the samples of the signal. The input can be an analog signal or a digital ...
(DPCM), a
data compression In information theory, data compression, source coding, or bit-rate reduction is the process of encoding information using fewer bits than the original representation. Any particular compression is either lossy or lossless. Lossless compression ...
algorithm.
Adaptive DPCM Adaptive differential pulse-code modulation (ADPCM) is a variant of differential pulse-code modulation (DPCM) that varies the size of the quantization step, to allow further reduction of the required data bandwidth for a given signal-to-noise ratio ...
(ADPCM) was introduced by P. Cummiskey, Nikil S. Jayant and
James L. Flanagan James Loton Flanagan (August 26, 1925 – August 25, 2015) was an American electrical engineer. He was Rutgers University's vice president for research until 2004. He was also director of Rutgers' Center for Advanced Information Processing and t ...
at Bell Labs in 1973.
Perceptual coding Psychoacoustics is the branch of psychophysics involving the scientific study of sound perception and audiology—how humans perceive various sounds. More specifically, it is the branch of science studying the psychological responses associated wit ...
was first used for
speech coding Speech coding is an application of data compression of digital audio signals containing speech. Speech coding uses speech-specific parameter estimation using audio signal processing techniques to model the speech signal, combined with generic da ...
compression, with
linear predictive coding Linear predictive coding (LPC) is a method used mostly in audio signal processing and speech processing for representing the spectral envelope of a digital signal of speech in compressed form, using the information of a linear predictive model. ...
(LPC). Initial concepts for LPC date back to the work of
Fumitada Itakura is a Japanese scientist. He did pioneering work in statistical signal processing, and its application to speech analysis, synthesis and coding, including the development of the linear predictive coding (LPC) and line spectral pairs (LSP) methods. ...
(
Nagoya University , abbreviated to or NU, is a Japanese national research university located in Chikusa-ku, Nagoya. It was the seventh Imperial University in Japan, one of the first five Designated National University and selected as a Top Type university of T ...
) and Shuzo Saito (
Nippon Telegraph and Telephone , commonly known as NTT, is a Japanese telecommunications company headquartered in Tokyo, Japan. Ranked 55th in Fortune Global 500, ''Fortune'' Global 500, NTT is the fourth largest telecommunications company in the world in terms of revenue, as w ...
) in 1966. During the 1970s, Bishnu S. Atal and Manfred R. Schroeder at Bell Labs developed a form of LPC called adaptive predictive coding (APC), a perceptual coding algorithm that exploited the masking properties of the human ear, followed in the early 1980s with the
code-excited linear prediction Code-excited linear prediction (CELP) is a linear predictive speech coding algorithm originally proposed by Manfred R. Schroeder and Bishnu S. Atal in 1985. At the time, it provided significantly better quality than existing low bit-rate algori ...
(CELP) algorithm. Discrete cosine transform (DCT) coding, a
lossy compression In information technology, lossy compression or irreversible compression is the class of data compression methods that uses inexact approximations and partial data discarding to represent the content. These techniques are used to reduce data size ...
method first proposed by Nasir Ahmed in 1972, provided the basis for the
modified discrete cosine transform The modified discrete cosine transform (MDCT) is a transform based on the type-IV discrete cosine transform (DCT-IV), with the additional property of being lapped transform, lapped: it is designed to be performed on consecutive blocks of a larger ...
(MDCT), which was developed by J. P. Princen, A. W. Johnson and A. B. Bradley in 1987.J. P. Princen, A. W. Johnson und A. B. Bradley: ''Subband/transform coding using filter bank designs based on time domain aliasing cancellation'', IEEE Proc. Intl. Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2161–2164, 1987. The MDCT is the basis for most
audio coding standards An audio coding format (or sometimes audio compression format) is a content representation format for storage or transmission of digital audio (such as in digital television, digital radio and in audio and video files). Examples of audio coding f ...
, such as
Dolby Digital Dolby Digital, originally synonymous with Dolby AC-3, is the name for what has now become a family of audio compression technologies developed by Dolby Laboratories. Formerly named Dolby Stereo Digital until 1995, the audio compression is lossy ...
(AC-3),
MP3 MP3 (formally MPEG-1 Audio Layer III or MPEG-2 Audio Layer III) is a coding format for digital audio developed largely by the Fraunhofer Society in Germany, with support from other digital scientists in the United States and elsewhere. Origin ...
(
MPEG The Moving Picture Experts Group (MPEG) is an alliance of working groups established jointly by International Organization for Standardization, ISO and International Electrotechnical Commission, IEC that sets standards for media coding, includ ...
Layer III),
Advanced Audio Coding Advanced Audio Coding (AAC) is an audio coding standard for lossy digital audio compression. Designed to be the successor of the MP3 format, AAC generally achieves higher sound quality than MP3 encoders at the same bit rate. AAC has been stan ...
(AAC),
Windows Media Audio Windows Media Audio (WMA) is a series of audio codecs and their corresponding audio coding formats developed by Microsoft. It is a proprietary technology that forms part of the Windows Media framework. WMA consists of four distinct codecs. The or ...
(WMA), and
Vorbis Vorbis is a free and open-source software project headed by the Xiph.Org Foundation. The project produces an audio coding format and software reference encoder/decoder (codec) for lossy audio compression. Vorbis is most commonly used in conjun ...
(
Ogg Ogg is a free, open container format maintained by the Xiph.Org Foundation. The authors of the Ogg format state that it is unrestricted by software patents and is designed to provide for efficient streaming and manipulation of high-quality d ...
).


Recording

PCM was used in
telecommunications Telecommunication is the transmission of information by various types of technologies over wire, radio, optical, or other electromagnetic systems. It has its origin in the desire of humans for communication over a distance greater than that fe ...
applications long before its first use in commercial broadcast and recording. Commercial digital recording was pioneered in Japan by
NHK , also known as NHK, is a Japanese public broadcaster. NHK, which has always been known by this romanized initialism in Japanese, is a statutory corporation funded by viewers' payments of a television license fee. NHK operates two terrestr ...
and
Nippon Columbia , often pronounced ''Korombia'', operating internationally as , is a Japanese record label founded in 1910 as Nipponophone Co., Ltd. It affiliated itself with the Columbia Graphophone Company of the United Kingdom and adopted the standard UK C ...
and their
Denon is a Japanese electronics company started in 1910 by Frederick Whitney Horn, an American entrepreneur. Denon produced the first cylinder audio media in Japan and players to play them. Decades later, Denon was involved in the early stages of de ...
brand, in the 1960s. The first commercial digital recordings were released in 1971. The
BBC #REDIRECT BBC #REDIRECT BBC Here i going to introduce about the best teacher of my life b BALAJI sir. He is the precious gift that I got befor 2yrs . How has helped and thought all the concept and made my success in the 10th board exam. ...
...
also began to experiment with digital audio in the 1960s. By the early 1970s, it had developed a 2-channel recorder, and in 1972 it deployed a digital audio transmission system that linked their broadcast center to their remote transmitters. The first 16-bit PCM recording in the
United States The United States of America (U.S.A. or USA), commonly known as the United States (U.S. or US) or America, is a country primarily located in North America. It consists of 50 states, a federal district, five major unincorporated territorie ...
was made by
Thomas Stockham Thomas Greenway Stockham (December 22, 1933 – January 6, 2004) was an American scientist who developed one of the first practical digital audio recording systems, and pioneered techniques for digital audio recording and processing. He also l ...
at the
Santa Fe Opera Santa Fe Opera (SFO) is an American opera company, located north of Santa Fe, New Mexico. After creating the ''Opera Association of New Mexico'' in 1956, its founding director, John Crosby (conductor), John Crosby, oversaw the building of the fir ...
in 1976, on a
Soundstream Soundstream Inc. was the first United States audiophile digital audio recording company, providing commercial services for recording and computer-based editing.Robert Easton, ''Soundstream, the first Digital Studio'', Recording Engineer/Producer, ...
recorder. An improved version of the Soundstream system was used to produce several classical recordings by
Telarc Telarc International Corporation is an American audiophile independent record label founded in 1977 by two classically trained musicians and former teachers, Jack Renner and Robert Woods. Based in Cleveland, Ohio, the label has had a long associ ...
in 1978. The 3M digital multitrack recorder in development at the time was based on BBC technology. The first all-digital album recorded on this machine was
Ry Cooder Ryland Peter "Ry" Cooder (born March 15, 1947) is an American musician, songwriter, film score composer, record producer, and writer. He is a multi-instrumentalist but is best known for his slide guitar work, his interest in traditional music, an ...
's '' Bop till You Drop'' in 1979. British record label
Decca Decca may refer to: Music * Decca Records or Decca Music Group, a record label * Decca Gold, a classical music record label owned by Universal Music Group * Decca Broadway, a musical theater record label * Decca Studios, a recording facility in W ...
began development of its own 2-track digital audio recorders in 1978 and released the first European digital recording in 1979. Popular professional digital multitrack recorders produced by Sony/Studer (
DASH The dash is a punctuation mark consisting of a long horizontal line. It is similar in appearance to the hyphen but is longer and sometimes higher from the baseline. The most common versions are the endash , generally longer than the hyphen b ...
) and Mitsubishi (
ProDigi Mitsubishi's ProDigi was a professional audio, reel-to-reel, digital audio tape format with a stationary head position, similar to Sony's Digital Audio Stationary Head, which competed against ProDigi when the format was available in the mid- ...
) in the early 1980s helped to bring about digital recording's acceptance by the major record companies. Machines for these formats had their own transports built-in as well, using
reel-to-reel Reel-to-reel audio tape recording, also called open-reel recording, is magnetic tape audio recording in which the recording tape is spooled between reels. To prepare for use, the ''supply reel'' (or ''feed reel'') containing the tape is plac ...
tape in either 1/4", 1/2", or 1" widths, with the audio data being recorded to the tape using a multi-track stationary tape head.
PCM adaptor A PCM adaptor is a device that encodes digital audio as video for recording on a videocassette recorder. The adapter also has the ability to decode a video signal back to digital audio for playback. This digital audio system was used for mast ...
s allowed for stereo digital audio recording on a conventional NTCS or PAL
video tape recorder A video tape recorder (VTR) is a tape recorder designed to record and playback video and audio material from magnetic tape. The early VTRs were open-reel devices that record on individual reels of 2-inch-wide (5.08 cm) tape. They were use ...
. The 1982 introduction of the CD popularized digital audio with consumers. ADAT became available in the early 1990s, which allowed eight-track 44.1 or 48 kHz recording on S-VHS cassettes, and DTRS performed a similar function with Hi8 tapes. Formats like ProDigi and DASH were referred to as SDAT (Stationary-head Digital Audio Tape) formats, as opposed to formats like the PCM adaptor-based systems and DAT, which were referred to as RDAT (Rotating-head Digital Audio Tape) formats, due to their helical-scan process of recording. Like the DAT cassette, ProDigi and DASH machines also accommodated the obligatory 44.1 kHz sampling rate, but also 48 kHz on all machines, and eventually a 96 kHz sampling rate. They overcame the problems that made typical analog recorders unable to meet the bandwidth (frequency range) demands of digital recording by a combination of higher tape speeds, narrower head gaps used in combination with metal-formulation tapes, and the spreading of data across multiple parallel tracks. Unlike analog systems, modern
digital audio workstation A digital audio workstation (DAW) is an electronic device or application software used for Sound recording and reproduction, recording, editing and producing audio files. DAWs come in a wide variety of configurations from a single software pro ...
s and
audio interface An audio interface is a piece of computer hardware that allows the input and output of audio signals to and from a host computer or recording device. Audio interfaces are closely related to computer sound cards, but whereas sound cards are opti ...
s allow as many channels in as many different sampling rates as the computer can effectively run at a single time.
Avid Audio Avid Audio (formerly Digidesign) is an American digital audio technology company. It was founded in 1984 by Peter Gotcher and Evan Brooks. The company began as a project to raise money for the founders' band, selling EPROM chips for drum machi ...
and
Steinberg Steinberg Media Technologies GmbH (trading as Steinberg) is a German musical software and hardware company based in Hamburg. It develops music writing, recording, arranging, and editing software, most notably Cubase, Nuendo, and Dorico. It als ...
released the first digital audio workstation software programs in 1989. Digital audio workstations make multitrack recording and mixing much easier for large projects which would otherwise be difficult with analog equipment.


Telephony

The rapid development and wide adoption of PCM
digital telephony Telephony ( ) is the field of technology involving the development, application, and deployment of telecommunication services for the purpose of electronic transmission of voice, fax, or data, between distant parties. The history of telephony is i ...
was enabled by
metal–oxide–semiconductor The metal–oxide–semiconductor field-effect transistor (MOSFET, MOS-FET, or MOS FET) is a type of field-effect transistor (FET), most commonly fabricated by the controlled oxidation of silicon. It has an insulated gate, the voltage of which d ...
(MOS)
switched capacitor A switched capacitor (SC) is an electronic circuit that implements a function by moving charges into and out of capacitors when electronic switches are opened and closed. Usually, non-overlapping clock signals are used to control the switches, so ...
(SC) circuit technology, developed in the early 1970s. This led to the development of PCM codec-filter chips in the late 1970s. The
silicon-gate In Semiconductor device fabrication, semiconductor electronics fabrication technology, a self-aligned gate is a transistor manufacturing approach whereby the gate (transistor), gate electrode of a MOSFET (metal–oxide–semiconductor field-effec ...
CMOS Complementary metal–oxide–semiconductor (CMOS, pronounced "sea-moss", ) is a type of metal–oxide–semiconductor field-effect transistor (MOSFET) fabrication process that uses complementary and symmetrical pairs of p-type and n-type MOSFE ...
(complementary MOS) PCM codec-filter chip, developed by
David A. Hodges David Albert Hodges (1937–2022) was an American electrical engineer, digital telephony pioneer, and professor of electrical engineering at the University of California, Berkeley. Hodges was elected a member of the National Academy of Engineering ...
and W.C. Black in 1980, has since been the industry standard for digital telephony. By the 1990s,
telecommunication network A telecommunications network is a group of nodes interconnected by telecommunications links that are used to exchange messages between the nodes. The links may use a variety of technologies based on the methodologies of circuit switching, message ...
s such as the
public switched telephone network The public switched telephone network (PSTN) provides Communications infrastructure, infrastructure and services for public Telecommunications, telecommunication. The PSTN is the aggregate of the world's circuit-switched telephone networks that ...
(PSTN) had been largely digitized with
VLSI Very large-scale integration (VLSI) is the process of creating an integrated circuit (IC) by combining millions or billions of MOS transistors onto a single chip. VLSI began in the 1970s when MOS integrated circuit (Metal Oxide Semiconductor) c ...
(very
large-scale integration An integrated circuit or monolithic integrated circuit (also referred to as an IC, a chip, or a microchip) is a set of electronic circuits on one small flat piece (or "chip") of semiconductor material, usually silicon. Large numbers of tiny ...
) CMOS PCM codec-filters, widely used in electronic switching systems for
telephone exchanges telephone exchange, telephone switch, or central office is a telecommunications system used in the public switched telephone network (PSTN) or in large enterprises. It interconnects telephone subscriber lines or virtual circuits of digital syste ...
, user-end
modems A modulator-demodulator or modem is a computer hardware device that converts data from a digital format into a format suitable for an analog transmission medium such as telephone or radio. A modem transmits data by Modulation#Digital modulati ...
and a range of
digital transmission Data transmission and data reception or, more broadly, data communication or digital communications is the transfer and reception of data in the form of a digital bitstream or a digitized analog signal transmitted over a point-to-point or ...
applications such as the
integrated services digital network Integrated Services Digital Network (ISDN) is a set of communication standards for simultaneous digital transmission of voice, video, data, and other network services over the digitalised circuits of the public switched telephone network. W ...
(ISDN),
cordless telephones A cordless telephone or portable telephone has a portable telephone handset that connects by radio to a base station connected to the public telephone network. The operational range is limited, usually to the same building or within some short ...
and
cell phones A mobile phone, cellular phone, cell phone, cellphone, handphone, hand phone or pocket phone, sometimes shortened to simply mobile, cell, or just phone, is a portable telephone that can make and receive calls over a radio frequency link whil ...
.


Technologies

Digital audio is used in
broadcasting Broadcasting is the distribution (business), distribution of sound, audio or video content to a dispersed audience via any electronic medium (communication), mass communications medium, but typically one using the electromagnetic spectrum (radio ...
of audio. Standard technologies include
Digital audio broadcasting Digital radio is the use of digital technology to transmit or receive across the radio spectrum. Digital transmission by radio waves includes digital broadcasting, and especially digital audio radio services. Types In digital broadcasting syst ...
(DAB),
Digital Radio Mondiale Digital Radio Mondiale (DRM; ''mondiale'' being Italian and French for "worldwide") is a set of digital audio broadcasting technologies designed to work over the bands currently used for analogue radio broadcasting including AM broadcasting—pa ...
(DRM),
HD Radio HD Radio (HDR) is a trademark for an in-band on-channel (IBOC) digital radio broadcast technology. It generally simulcasts an existing analog radio station in digital format with less noise and with additional text information. HD Radio is used ...
and
In-band on-channel In-band on-channel (IBOC) is a hybrid method of transmitting digital radio and analog radio broadcast signals simultaneously on the same frequency. The name refers to the new digital signals being broadcast in the same AM or FM band (in-band), a ...
(IBOC). Digital audio in recording applications is stored on audio-specific technologies including CD, Digital Audio Tape (DAT),
Digital Compact Cassette The Digital Compact Cassette (DCC) is a magnetic tape sound recording format introduced by Philips and Matsushita Electric in late and marketed as the successor to the standard analog Compact Cassette. It was also a direct competitor to Sony ...
(DCC) and
MiniDisc MiniDisc (MD) is an erasable magneto-optical disc-based data storage format offering a capacity of 60, 74, and later, 80 minutes of digitized audio. Sony announced the MiniDisc in September 1992 and released it in November of that year fo ...
. Digital audio may be stored in a standard
audio file format An audio file format is a file format for storing digital audio data on a computer system. The bit layout of the audio data (excluding metadata) is called the audio coding format and can be uncompressed, or compressed to reduce the file size, o ...
s and stored on a
Hard disk recorder A hard disk recorder (HDR) is a system that uses a high-capacity hard disk to record digital audio or digital video. Hard disk recording systems represent an alternative to reel-to-reel audio tape recording and video tape recorders, and provid ...
,
Blu-ray The Blu-ray Disc (BD), often known simply as Blu-ray, is a digital optical disc data storage format. It was invented and developed in 2005 and released on June 20, 2006 worldwide. It is designed to supersede the DVD format, and capable of sto ...
or
DVD-Audio DVD-Audio (commonly abbreviated as DVD-A) is a digital format for delivering high-fidelity audio content on a DVD. DVD-Audio uses most of the storage on the disc for high-quality audio and is not intended to be a video delivery format. The sta ...
. Files may be played back on smartphones, computers or
MP3 player A portable media player (PMP) (also including the related digital audio player (DAP)) is a portable consumer electronics device capable of storing and playing digital media such as audio, images, and video files. The data is typically stored o ...
. Digital audio resolution is measured in sample depth. Most digital audio formats use a sample depth of either 16-bit, 24-bit, and 32-bit.


Interfaces

For
personal computer A personal computer (PC) is a multi-purpose microcomputer whose size, capabilities, and price make it feasible for individual use. Personal computers are intended to be operated directly by an end user, rather than by a computer expert or tec ...
s,
USB Universal Serial Bus (USB) is an industry standard that establishes specifications for cables, connectors and protocols for connection, communication and power supply (interfacing) between computers, peripherals and other computers. A broad ...
and
IEEE 1394 IEEE 1394 is an interface standard for a serial bus for high-speed communications and isochronous real-time data transfer. It was developed in the late 1980s and early 1990s by Apple in cooperation with a number of companies, primarily Sony an ...
have provisions to deliver real-time digital audio. USB interfaces have become increasingly popular among independent
audio engineers An audio engineer (also known as a sound engineer or recording engineer) helps to produce a recording or a live performance, balancing and adjusting sound sources using equalization, dynamics processing and audio effects, mixing, reproduction, ...
and producers due to their small size and ease of use. In professional architectural or installation applications, many
audio over Ethernet In audio and broadcast engineering, Audio over Ethernet (sometimes AoE—not to be confused with ATA over Ethernet) is the use of an Ethernet-based network to distribute real-time digital audio. AoE replaces bulky snake cables or audio-specif ...
protocols and interfaces exist. In
broadcasting Broadcasting is the distribution (business), distribution of sound, audio or video content to a dispersed audience via any electronic medium (communication), mass communications medium, but typically one using the electromagnetic spectrum (radio ...
, a more general
audio over IP Audio over IP (AoIP) is the distribution of digital audio across an IP network such as the Internet. It is used increasingly to provide high-quality audio feeds over long distances. The application is also known as audio contribution over IP (ACI ...
network technology is favored. In telephony
voice over IP Voice over Internet Protocol (VoIP), also called IP telephony, is a method and group of technologies for the delivery of speech, voice communications and multimedia sessions over Internet Protocol (IP) networks, such as the Internet. The terms In ...
is used as a network interface for digital audio for voice communications. Several interfaces are engineered to carry digital video and audio together, including
HDMI High-Definition Multimedia Interface (HDMI) is a proprietary audio/video interface for transmitting uncompressed video data and compressed or uncompressed digital audio data from an HDMI-compliant source device, such as a display controller, ...
and
DisplayPort DisplayPort (DP) is a digital display interface developed by a consortium of PC and chip manufacturers and standardized by the Video Electronics Standards Association (VESA). It is primarily used to connect a video source to a display device su ...
. Some interfaces offer
MIDI MIDI (; Musical Instrument Digital Interface) is a technical standard that describes a communications protocol, digital interface, and electrical connectors that connect a wide variety of electronic musical instruments, computers, and re ...
support as well as XLR and TRS analog ports. Digital-audio-specific interfaces include: *
A2DP In order to use Bluetooth, a device must be compatible with the subset of Bluetooth ''profiles'' (often called services or functions) necessary to use the desired services. A Bluetooth profile is a specification regarding an aspect of Bluetooth-b ...
via
Bluetooth Bluetooth is a short-range wireless technology standard that is used for exchanging data between fixed and mobile devices over short distances and building personal area networks (PANs). In the most widely used mode, transmission power is limi ...
*
AC'97 AC'97 (''Audio Codec '97;'' also MC'97 for ''Modem Codec '97'') is an Sound reproduction, audio codec standard developed by Intel Architecture Labs in 1997. The standard was used in motherboards, modems, and sound cards. The specification covers ...
(Audio Codec 1997) interface between
integrated circuit An integrated circuit or monolithic integrated circuit (also referred to as an IC, a chip, or a microchip) is a set of electronic circuits on one small flat piece (or "chip") of semiconductor material, usually silicon. Large numbers of tiny ...
s on PC motherboards *
ADAT Lightpipe The ADAT Lightpipe, officially the ADAT Optical Interface, is a standard for the transfer of digital audio between equipment. It was originally developed by Alesis but has since become widely accepted, with many third party hardware manufacturers ...
interface *
AES3 AES3 is a standard for the exchange of digital audio signals between professional audio devices. An AES3 signal can carry two channels of pulse-code-modulated digital audio over several transmission media including balanced lines, unbalanced l ...
interface with
XLR connector The XLR connector is a type of electrical connector primarily used in professional audio, video, and stage lighting equipment. XLR connectors are cylindical in design, and have three to seven connector pins, and are often employed for analog b ...
s, common in professional audio equipment *
AES47 AES47 is a standard which describes a method for transporting AES3 professional digital audio streams over Asynchronous Transfer Mode (ATM) networks. The Audio Engineering Society (AES) published AES47 in 2002. The method described by AES47 is al ...
- professional AES3-style digital audio over
Asynchronous Transfer Mode Asynchronous Transfer Mode (ATM) is a telecommunications standard defined by American National Standards Institute (ANSI) and ITU-T (formerly CCITT) for digital transmission of multiple types of traffic. ATM was developed to meet the needs of ...
networks *
Intel High Definition Audio Intel High Definition Audio (IHDA) (also called HD Audio or development codename Azalia) is a specification for the audio sub-system of personal computers. It was released by Intel in 2004 as the successor to their AC'97 PC audio standard. Featur ...
- modern replacement for AC'97 *
I²S I²S (Inter-IC Sound, pronounced "eye-squared-ess"), is an electrical serial bus interface standard used for connecting digital audio devices together. It is used to communicate PCM audio data between integrated circuits in an electronic device ...
(Inter-IC sound) interface between
integrated circuit An integrated circuit or monolithic integrated circuit (also referred to as an IC, a chip, or a microchip) is a set of electronic circuits on one small flat piece (or "chip") of semiconductor material, usually silicon. Large numbers of tiny ...
s in consumer electronics *
MADI Madi may refer to: Places * Madi, Chitwan, a municipality in Chitwan District in Nepal * Madi Municipality, Sankhuwasabha, a municipality in Sankhuwasabha District in Nepal * Madi Rural Municipality, Rolpa, a rural municipality in Rolpa Distr ...
(Multichannel Audio Digital Interface) *
MIDI MIDI (; Musical Instrument Digital Interface) is a technical standard that describes a communications protocol, digital interface, and electrical connectors that connect a wide variety of electronic musical instruments, computers, and re ...
- low-bandwidth interconnect for carrying instrument data; cannot carry sound but can carry digital sample data in non-real time *
S/PDIF S/PDIF (Sony/Philips Digital Interface) is a type of digital audio interface used in consumer audio equipment to output audio over relatively short distances. The signal is transmitted over either a coaxial cable (using RCA or BNC connectors) ...
- either over
coaxial cable Coaxial cable, or coax (pronounced ) is a type of electrical cable consisting of an inner conductor surrounded by a concentric conducting shield, with the two separated by a dielectric ( insulating material); many coaxial cables also have a p ...
or
TOSLINK TOSLINK (from ''Toshiba Link'') is a standardized optical fiber connector system. Also known generically as optical audio, its most common use is in consumer audio equipment (via a "digital optical" socket), where it carries a digital audio st ...
, common in consumer audio equipment and derived from AES3 * TDIF, TASCAM proprietary format with
D-sub The D-subminiature or D-sub is a common type of electrical connector. They are named for their characteristic D-shaped metal shield. When they were introduced, D-subs were among the smallest connectors used on computer systems. Description, no ...
cable


See also

* Digital audio editor *
Digital synthesizer A digital synthesizer is a synthesizer that uses digital signal processing (DSP) techniques to make musical sounds. This in contrast to older analog synthesizers, which produce music using analog electronics, and samplers, which play back digit ...
*
Frequency modulation synthesis Frequency modulation synthesis (or FM synthesis) is a form of sound synthesis whereby the frequency of a waveform is changed by modulating its frequency with a modulator. The frequency of an oscillator is altered "in accordance with the amplitude ...
*
Sound chip A sound chip is an integrated circuit (chip) designed to produce audio signals through digital, analog or mixed-mode electronics. Sound chips are typically fabricated on metal–oxide–semiconductor (MOS) mixed-signal chips that process ...
*
Sound Card A sound card (also known as an audio card) is an internal expansion card that provides input and output of audio signals to and from a computer under the control of computer programs. The term ''sound card'' is also applied to external audio i ...
*
Audio Interface An audio interface is a piece of computer hardware that allows the input and output of audio signals to and from a host computer or recording device. Audio interfaces are closely related to computer sound cards, but whereas sound cards are opti ...
* Quantization * Sampling *
Multitrack recording Multitrack recording (MTR), also known as multitracking or tracking, is a method of sound recording developed in 1955 that allows for the separate recording of multiple sound sources or of sound sources recorded at different times to create a ...
*
Digital audio workstation A digital audio workstation (DAW) is an electronic device or application software used for Sound recording and reproduction, recording, editing and producing audio files. DAWs come in a wide variety of configurations from a single software pro ...


Notes


References


Further reading

*Borwick, John, ed., 1994: ''Sound Recording Practice'' (Oxford: Oxford University Press) *Bosi, Marina, and Goldberg, Richard E., 2003: ''Introduction to Digital Audio Coding and Standards'' (Springer) *Ifeachor, Emmanuel C., and Jervis, Barrie W., 2002: ''Digital Signal Processing: A Practical Approach'' (Harlow, England: Pearson Education Limited) *Rabiner, Lawrence R., and Gold, Bernard, 1975: ''Theory and Application of Digital Signal Processing'' (Englewood Cliffs, New Jersey: Prentice-Hall, Inc.) *Watkinson, John, 1994: ''The Art of Digital Audio'' (Oxford: Focal Press)


External links

* * * {{Music technology