Audio data
   HOME

TheInfoList



OR:

Digital audio is a representation of
sound In physics, sound is a vibration that propagates as an acoustic wave, through a transmission medium such as a gas, liquid or solid. In human physiology and psychology, sound is the ''reception'' of such waves and their ''perception'' by ...
recorded in, or converted into, digital form. In digital audio, the
sound wave In physics, sound is a vibration that propagates as an acoustic wave, through a transmission medium such as a gas, liquid or solid. In human physiology and psychology, sound is the ''reception'' of such waves and their ''perception'' by the ...
of the
audio signal An audio signal is a representation of sound, typically using either a changing level of electrical voltage for analog signals, or a series of binary numbers for digital signals. Audio signals have frequencies in the audio frequency range of ro ...
is typically encoded as numerical samples in a continuous sequence. For example, in
CD audio Compact Disc Digital Audio (CDDA or CD-DA), also known as Digital Audio Compact Disc or simply as Audio CD, is the standard format for audio compact discs. The standard is defined in the ''Red Book'', one of a series of Rainbow Books (named f ...
, samples are taken 44,100 times per second, each with 16-bit sample depth. Digital audio is also the name for the entire technology of
sound recording and reproduction Sound recording and reproduction is the electrical, mechanical, electronic, or digital inscription and re-creation of sound waves, such as spoken voice, singing, instrumental music, or sound effects. The two main classes of sound recording ...
using audio signals that have been encoded in digital form. Following significant advances in digital audio technology during the 1970s and 1980s, it gradually replaced analog audio technology in many areas of
audio engineering Audio most commonly refers to sound, as it is transmitted in signal form. It may also refer to: Sound *Audio signal, an electrical representation of sound *Audio frequency, a frequency in the audio spectrum * Digital audio, representation of sou ...
,
record production A record producer is a recording project's creative and technical leader, commanding studio time and coaching artists, and in popular genres typically creates the song's very sound and structure.Virgil Moorefield"Introduction" ''The Producer as ...
and
telecommunications Telecommunication is the transmission of information by various types of technologies over wire, radio, optical, or other electromagnetic systems. It has its origin in the desire of humans for communication over a distance greater than that ...
in the 1990s and 2000s In a digital audio system, an analog electrical signal representing the sound is converted with an
analog-to-digital converter In electronics, an analog-to-digital converter (ADC, A/D, or A-to-D) is a system that converts an analog signal, such as a sound picked up by a microphone or light entering a digital camera, into a digital signal. An ADC may also provide ...
(ADC) into a digital signal, typically using
pulse-code modulation Pulse-code modulation (PCM) is a method used to digitally represent sampled analog signals. It is the standard form of digital audio in computers, compact discs, digital telephony and other digital audio applications. In a PCM stream, the ...
(PCM). This digital signal can then be recorded, edited, modified, and copied using
computer A computer is a machine that can be programmed to carry out sequences of arithmetic or logical operations ( computation) automatically. Modern digital electronic computers can perform generic sets of operations known as programs. These prog ...
s, audio playback machines, and other digital tools. For playback, a
digital-to-analog converter In electronics, a digital-to-analog converter (DAC, D/A, D2A, or D-to-A) is a system that converts a digital signal into an analog signal. An analog-to-digital converter (ADC) performs the reverse function. There are several DAC archit ...
(DAC) performs the reverse process, converting a digital signal back into an analog signal, which is then sent through an
audio power amplifier An audio power amplifier (or power amp) is an electronic amplifier that amplifies low-power electronic audio signals, such as the signal from a radio receiver or an electric guitar pickup, to a level that is high enough for driving loudsp ...
and ultimately to a
loudspeaker A loudspeaker (commonly referred to as a speaker or speaker driver) is an electroacoustic transducer that converts an electrical audio signal into a corresponding sound. A ''speaker system'', also often simply referred to as a "speaker" or ...
. Digital audio systems may include compression, storage,
processing Processing is a free graphical library and integrated development environment (IDE) built for the electronic arts, new media art, and visual design communities with the purpose of teaching non-programmers the fundamentals of computer programming ...
, and
transmission Transmission may refer to: Medicine, science and technology * Power transmission ** Electric power transmission ** Propulsion transmission, technology allowing controlled application of power *** Automatic transmission *** Manual transmission ** ...
components. Conversion to a digital format allows convenient manipulation, storage, transmission, and retrieval of an audio signal. Unlike analog audio, in which making copies of a recording results in
generation loss Generation loss is the loss of quality between subsequent copies or transcodes of data. Anything that reduces the quality of the representation when copying, and would cause further reduction in quality on making a copy of the copy, can be consid ...
and degradation of signal quality, digital audio allows an infinite number of copies to be made without any degradation of signal quality.


Overview

Digital audio technologies are used in the recording, manipulation, mass-production, and distribution of sound, including recordings of
song A song is a musical composition intended to be performed by the human voice. This is often done at distinct and fixed pitches (melodies) using patterns of sound and silence. Songs contain various forms, such as those including the repetiti ...
s, instrumental pieces,
podcast A podcast is a program made available in digital format for download over the Internet. For example, an episodic series of digital audio or video files that a user can download to a personal device to listen to at a time of their choosin ...
s, sound effects, and other sounds. Modern online music distribution depends on digital recording and
data compression In information theory, data compression, source coding, or bit-rate reduction is the process of encoding information using fewer bits than the original representation. Any particular compression is either lossy or lossless. Lossless compressio ...
. The availability of music as data files, rather than as physical objects, has significantly reduced the costs of distribution as well as made it easier to share copies. Before digital audio, the music industry distributed and sold music by selling physical copies in the form of
records A record, recording or records may refer to: An item or collection of data Computing * Record (computer science), a data structure ** Record, or row (database), a set of fields in a database related to one entity ** Boot sector or boot record, r ...
and
cassette tape The Compact Cassette or Musicassette (MC), also commonly called the tape cassette, cassette tape, audio cassette, or simply tape or cassette, is an analog magnetic tape recording format for audio recording and playback. Invented by Lou Ott ...
s. With digital-audio and online distribution systems such as
iTunes iTunes () is a software program that acts as a media player, media library, mobile device management utility, and the client app for the iTunes Store. Developed by Apple Inc., it is used to purchase, play, download, and organize digital mu ...
, companies sell digital sound files to consumers, which the consumer receives over the Internet. Popular streaming services such as
Apple Music Apple Music is a music, audio and video streaming service developed by Apple Inc. Users select music to stream to their device on-demand, or they can listen to existing playlists. The service also includes the Internet radio stations Appl ...
,
Spotify Spotify (; ) is a proprietary Swedish audio streaming and media services provider founded on 23 April 2006 by Daniel Ek and Martin Lorentzon. It is one of the largest music streaming service providers, with over 456 million monthly active use ...
, or
Youtube YouTube is a global online video sharing and social media platform headquartered in San Bruno, California. It was launched on February 14, 2005, by Steve Chen, Chad Hurley, and Jawed Karim. It is owned by Google, and is the second mo ...
, offer temporary access to the digital file, and are now the most common form of music consumption An analog audio system converts physical waveforms of sound into electrical representations of those waveforms by use of a
transducer A transducer is a device that converts energy from one form to another. Usually a transducer converts a signal in one form of energy to a signal in another. Transducers are often employed at the boundaries of automation, measurement, and con ...
, such as a
microphone A microphone, colloquially called a mic or mike (), is a transducer that converts sound into an electrical signal. Microphones are used in many applications such as telephones, hearing aids, public address systems for concert halls and publ ...
. The sounds are then stored on an analog medium such as
magnetic tape Magnetic tape is a medium for magnetic storage made of a thin, magnetizable coating on a long, narrow strip of plastic film. It was developed in Germany in 1928, based on the earlier magnetic wire recording from Denmark. Devices that use magnet ...
, or transmitted through an analog medium such as a
telephone line A telephone line or telephone circuit (or just line or circuit industrywide) is a single-user circuit on a telephone communication system. It is designed to reproduce speech of a quality that is understandable. It is the physical wire or ot ...
or
radio Radio is the technology of signaling and communicating using radio waves. Radio waves are electromagnetic waves of frequency between 30  hertz (Hz) and 300  gigahertz (GHz). They are generated by an electronic device called a tr ...
. The process is reversed for reproduction: the electrical audio signal is amplified and then converted back into physical waveforms via a
loudspeaker A loudspeaker (commonly referred to as a speaker or speaker driver) is an electroacoustic transducer that converts an electrical audio signal into a corresponding sound. A ''speaker system'', also often simply referred to as a "speaker" or ...
. Analog audio retains its fundamental wave-like characteristics throughout its storage, transformation, duplication, and amplification. Analog audio signals are susceptible to noise and distortion, due to the innate characteristics of electronic circuits and associated devices. Disturbances in a digital system do not result in error unless they are so large as to result in a symbol being misinterpreted as another symbol or disturb the sequence of symbols. It is therefore generally possible to have an entirely error-free digital audio system in which no noise or distortion is introduced between conversion to digital format and conversion back to analog. A digital audio signal may be encoded for correction of any errors that might occur in the storage or transmission of the signal. This technique, known as
channel coding In computing, telecommunication, information theory, and coding theory, an error correction code, sometimes error correcting code, (ECC) is used for controlling errors in data over unreliable or noisy communication channels. The central idea ...
, is essential for broadcast or recorded digital systems to maintain bit accuracy.
Eight-to-fourteen modulation Eight-to-fourteen modulation (EFM) is a data encoding technique – formally, a ''line code'' – used by compact discs (CD), laserdiscs (LD) and pre-Hi-MD MiniDiscs. EFMPlus is a related code, used in DVDs and Super Audio CDs (SACDs). EFM and E ...
is the channel code used for the audio
compact disc The compact disc (CD) is a digital optical disc data storage format that was co-developed by Philips and Sony to store and play digital audio recordings. In August 1982, the first compact disc was manufactured. It was then released in O ...
(CD).


Conversion process

If an audio signal is analog, a digital audio system starts with an ADC that converts an analog signal to a digital signal.Some audio signals such as those created by
digital synthesis A digital synthesizer is a synthesizer that uses digital signal processing (DSP) techniques to make musical sounds. This in contrast to older analog synthesizers, which produce music using analog electronics, and samplers, which play back digit ...
originate entirely in the digital domain, in which case analog to digital conversion does not take place.
The ADC runs at a specified
sampling rate In signal processing, sampling is the reduction of a continuous-time signal In mathematical dynamics, discrete time and continuous time are two alternative frameworks within which variables that evolve over time are modeled. Discrete time ...
and converts at a known bit resolution.
CD audio Compact Disc Digital Audio (CDDA or CD-DA), also known as Digital Audio Compact Disc or simply as Audio CD, is the standard format for audio compact discs. The standard is defined in the ''Red Book'', one of a series of Rainbow Books (named f ...
, for example, has a sampling rate of 44.1  kHz (44,100 samples per second), and has 16-bit resolution for each
stereo Stereophonic sound, or more commonly stereo, is a method of sound reproduction that recreates a multi-directional, 3-dimensional audible perspective. This is usually achieved by using two independent audio channels through a configuration ...
channel. Analog signals that have not already been
bandlimited Bandlimiting is the limiting of a signal's frequency domain representation or spectral density to zero above a certain finite frequency. A band-limited signal is one whose Fourier transform or spectral density has bounded support. A bandli ...
must be passed through an
anti-aliasing filter An anti-aliasing filter (AAF) is a filter used before a signal sampler to restrict the bandwidth of a signal to satisfy the Nyquist–Shannon sampling theorem over the band of interest. Since the theorem states that unambiguous reconstruct ...
before conversion, to prevent the aliasing distortion that is caused by audio signals with frequencies higher than the
Nyquist frequency In signal processing, the Nyquist frequency (or folding frequency), named after Harry Nyquist, is a characteristic of a sampler, which converts a continuous function or signal into a discrete sequence. In units of cycles per second ( Hz), it ...
(half the sampling rate). A digital audio signal may be stored or transmitted. Digital audio can be stored on a CD, a
digital audio player A portable media player (PMP) (also including the related digital audio player (DAP)) is a portable consumer electronics device capable of storing and playing digital media such as audio, images, and video files. The data is typically stored o ...
, a
hard drive A hard disk drive (HDD), hard disk, hard drive, or fixed disk is an electro-mechanical data storage device that stores and retrieves digital data using magnetic storage with one or more rigid rapidly rotating platters coated with mag ...
, a
USB flash drive Universal Serial Bus (USB) is an industry standard that establishes specifications for cables, connectors and protocols for connection, communication and power supply ( interfacing) between computers, peripherals and other computers. A bro ...
, or any other digital
data storage device Data storage is the recording (storing) of information (data) in a storage medium. Handwriting, phonographic recording, magnetic tape, and optical discs are all examples of storage media. Biological molecules such as RNA and DNA are consi ...
. The digital signal may be altered through
digital signal processing Digital signal processing (DSP) is the use of digital processing, such as by computers or more specialized digital signal processors, to perform a wide variety of signal processing operations. The digital signals processed in this manner are ...
, where it may be
filter Filter, filtering or filters may refer to: Science and technology Computing * Filter (higher-order function), in functional programming * Filter (software), a computer program to process a data stream * Filter (video), a software component tha ...
ed or have
effect Effect may refer to: * A result or change of something ** List of effects ** Cause and effect, an idiom describing causality Pharmacy and pharmacology * Drug effect, a change resulting from the administration of a drug ** Therapeutic effect, a ...
s applied.
Sample-rate conversion Sample-rate conversion, sampling-frequency conversion or resampling is the process of changing the sampling rate or sampling frequency of a discrete signal to obtain a new discrete representation of the underlying continuous signal. Application a ...
including
upsampling In digital signal processing, upsampling, expansion, and interpolation are terms associated with the process of resampling in a multi-rate digital signal processing system. ''Upsampling'' can be synonymous with ''expansion'', or it can describe a ...
and
downsampling In digital signal processing, downsampling, compression, and decimation are terms associated with the process of ''resampling'' in a multi-rate digital signal processing system. Both ''downsampling'' and ''decimation'' can be synonymous with ''com ...
may be used to change signals that have been encoded with a different sampling rate to a common sampling rate prior to processing. Audio data compression techniques, such as
MP3 MP3 (formally MPEG-1 Audio Layer III or MPEG-2 Audio Layer III) is a coding format for digital audio developed largely by the Fraunhofer Society in Germany, with support from other digital scientists in the United States and elsewhere. Origin ...
,
Advanced Audio Coding Advanced Audio Coding (AAC) is an audio coding standard for lossy digital audio compression. Designed to be the successor of the MP3 format, AAC generally achieves higher sound quality than MP3 encoders at the same bit rate. AAC has been stan ...
,
Ogg Vorbis Vorbis is a free and open-source software project headed by the Xiph.Org Foundation. The project produces an audio coding format and software reference encoder/decoder (codec) for lossy audio compression. Vorbis is most commonly used in conjun ...
, or
FLAC FLAC (; Free Lossless Audio Codec) is an audio coding format for lossless compression of digital audio, developed by the Xiph.Org Foundation, and is also the name of the free software project producing the FLAC tools, the reference softwa ...
, are commonly employed to reduce the file size. Digital audio can be carried over
digital audio interface Audio connectors and video connectors are electrical or optical connectors for carrying audio or video signals. Audio interfaces or video interfaces define physical parameters and interpretation of signals. For digital audio and digital video, ...
s such as
AES3 AES3 is a standard for the exchange of digital audio signals between professional audio devices. An AES3 signal can carry two channels of pulse-code-modulated digital audio over several transmission media including balanced lines, unbalanced ...
or
MADI Madi may refer to: Places * Madi, Chitwan, a municipality in Chitwan District in Nepal * Madi Municipality, Sankhuwasabha, a municipality in Sankhuwasabha District in Nepal * Madi Rural Municipality, Rolpa, a rural municipality in Rolpa Distric ...
. Digital audio can be carried over a network using
audio over Ethernet In audio and broadcast engineering, Audio over Ethernet (sometimes AoE—not to be confused with ATA over Ethernet) is the use of an Ethernet-based network to distribute real-time digital audio. AoE replaces bulky snake cables or audio-specific ...
,
audio over IP Audio over IP (AoIP) is the distribution of digital audio across an IP network such as the Internet. It is used increasingly to provide high-quality audio feeds over long distances. The application is also known as audio contribution over IP (ACI ...
or other
streaming media Streaming media is multimedia that is delivered and consumed in a continuous manner from a source, with little or no intermediate storage in network elements. ''Streaming'' refers to the delivery method of content, rather than the content i ...
standards and systems. For playback, digital audio must be converted back to an analog signal with a DAC. According to the
Nyquist–Shannon sampling theorem The Nyquist–Shannon sampling theorem is a theorem in the field of signal processing which serves as a fundamental bridge between continuous-time signals and discrete-time signals. It establishes a sufficient condition for a sample rate that per ...
, with some practical and theoretical restrictions, a band-limited version of the original analog signal can be accurately reconstructed from the digital signal. During conversion, audio data can be embedded with a digital watermark to prevent piracy and unauthorized use. Watermarking is done using a direct-sequence spread-spectrum (DSSS) method. The audio information is then modulated by a pseudo-noise (PN) sequence, then shaped within the frequency domain and put back in the original signal. The strength of the embedding determines the strength of the watermark on the audio data.


History


Coding

Pulse-code modulation Pulse-code modulation (PCM) is a method used to digitally represent sampled analog signals. It is the standard form of digital audio in computers, compact discs, digital telephony and other digital audio applications. In a PCM stream, the ...
(PCM) was invented by British scientist Alec Reeves in 1937. In 1950, C. Chapin Cutler of
Bell Labs Nokia Bell Labs, originally named Bell Telephone Laboratories (1925–1984), then AT&T Bell Laboratories (1984–1996) and Bell Labs Innovations (1996–2007), is an American industrial research and scientific development company owned by mul ...
filed the patent on
differential pulse-code modulation Differential pulse-code modulation (DPCM) is a signal encoder that uses the baseline of pulse-code modulation (PCM) but adds some functionalities based on the prediction of the samples of the signal. The input can be an analog signal or a digital ...
(DPCM), a
data compression In information theory, data compression, source coding, or bit-rate reduction is the process of encoding information using fewer bits than the original representation. Any particular compression is either lossy or lossless. Lossless compressio ...
algorithm.
Adaptive DPCM Adaptive differential pulse-code modulation (ADPCM) is a variant of differential pulse-code modulation (DPCM) that varies the size of the quantization step, to allow further reduction of the required data bandwidth for a given signal-to-noise ratio ...
(ADPCM) was introduced by P. Cummiskey, Nikil S. Jayant and
James L. Flanagan James Loton Flanagan (August 26, 1925 – August 25, 2015) was an American electrical engineer. He was Rutgers University's vice president for research until 2004. He was also director of Rutgers' Center for Advanced Information Processing and t ...
at Bell Labs in 1973.
Perceptual coding Psychoacoustics is the branch of psychophysics involving the scientific study of sound perception and audiology—how humans perceive various sounds. More specifically, it is the branch of science studying the psychological responses associated wit ...
was first used for
speech coding Speech coding is an application of data compression of digital audio signals containing speech. Speech coding uses speech-specific parameter estimation using audio signal processing techniques to model the speech signal, combined with generic d ...
compression, with
linear predictive coding Linear predictive coding (LPC) is a method used mostly in audio signal processing and speech processing for representing the spectral envelope of a digital signal of speech in compressed form, using the information of a linear predictive mod ...
(LPC). Initial concepts for LPC date back to the work of
Fumitada Itakura is a Japanese scientist. He did pioneering work in statistical signal processing, and its application to speech analysis, synthesis and coding, including the development of the linear predictive coding (LPC) and line spectral pairs (LSP) method ...
(
Nagoya University , abbreviated to or NU, is a Japanese national research university located in Chikusa-ku, Nagoya. It was the seventh Imperial University in Japan, one of the first five Designated National University and selected as a Top Type university of ...
) and Shuzo Saito (
Nippon Telegraph and Telephone , commonly known as NTT, is a Japanese telecommunications company headquartered in Tokyo, Japan. Ranked 55th in ''Fortune'' Global 500, NTT is the fourth largest telecommunications company in the world in terms of revenue, as well as the third la ...
) in 1966. During the 1970s,
Bishnu S. Atal Bishnu S. Atal (born 1933) is an Indian physicist and engineer. He is a noted researcher in acoustics, and is best known for developments in speech coding. He advanced linear predictive coding (LPC) during the late 1960s to 1970s, and developed ...
and Manfred R. Schroeder at Bell Labs developed a form of LPC called
adaptive predictive coding Adaptive predictive coding (APC) is a narrowband analog-to-digital conversion that uses a one-level or multilevel sampling system in which the value of the signal at each sampling instant is predicted according to a linear function of the past valu ...
(APC), a perceptual coding algorithm that exploited the masking properties of the human ear, followed in the early 1980s with the
code-excited linear prediction Code-excited linear prediction (CELP) is a linear predictive speech coding algorithm originally proposed by Manfred R. Schroeder and Bishnu S. Atal in 1985. At the time, it provided significantly better quality than existing low bit-rate algori ...
(CELP) algorithm. Discrete cosine transform (DCT) coding, a
lossy compression In information technology, lossy compression or irreversible compression is the class of data compression methods that uses inexact approximations and partial data discarding to represent the content. These techniques are used to reduce data si ...
method first proposed by Nasir Ahmed in 1972, provided the basis for the
modified discrete cosine transform The modified discrete cosine transform (MDCT) is a transform based on the type-IV discrete cosine transform (DCT-IV), with the additional property of being lapped: it is designed to be performed on consecutive blocks of a larger dataset, where ...
(MDCT), which was developed by J. P. Princen, A. W. Johnson and A. B. Bradley in 1987.J. P. Princen, A. W. Johnson und A. B. Bradley: ''Subband/transform coding using filter bank designs based on time domain aliasing cancellation'', IEEE Proc. Intl. Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2161–2164, 1987. The MDCT is the basis for most audio coding standards, such as
Dolby Digital Dolby Digital, originally synonymous with Dolby AC-3, is the name for what has now become a family of audio compression technologies developed by Dolby Laboratories. Formerly named Dolby Stereo Digital until 1995, the audio compression is loss ...
(AC-3),
MP3 MP3 (formally MPEG-1 Audio Layer III or MPEG-2 Audio Layer III) is a coding format for digital audio developed largely by the Fraunhofer Society in Germany, with support from other digital scientists in the United States and elsewhere. Origin ...
(
MPEG The Moving Picture Experts Group (MPEG) is an alliance of working groups established jointly by ISO and IEC that sets standards for media coding, including compression coding of audio, video, graphics, and genomic data; and transmission and f ...
Layer III),
Advanced Audio Coding Advanced Audio Coding (AAC) is an audio coding standard for lossy digital audio compression. Designed to be the successor of the MP3 format, AAC generally achieves higher sound quality than MP3 encoders at the same bit rate. AAC has been stan ...
(AAC),
Windows Media Audio Windows Media Audio (WMA) is a series of audio codecs and their corresponding audio coding formats developed by Microsoft. It is a proprietary technology that forms part of the Windows Media framework. WMA consists of four distinct codecs. The ...
(WMA), and
Vorbis Vorbis is a free and open-source software project headed by the Xiph.Org Foundation. The project produces an audio coding format and software reference encoder/decoder ( codec) for lossy audio compression. Vorbis is most commonly used in con ...
( Ogg).


Recording

PCM was used in
telecommunications Telecommunication is the transmission of information by various types of technologies over wire, radio, optical, or other electromagnetic systems. It has its origin in the desire of humans for communication over a distance greater than that ...
applications long before its first use in commercial broadcast and recording. Commercial digital recording was pioneered in Japan by
NHK , also known as NHK, is a Japanese public broadcaster. NHK, which has always been known by this romanized initialism in Japanese, is a statutory corporation funded by viewers' payments of a television license fee. NHK operates two terrestr ...
and
Nippon Columbia , often pronounced ''Korombia'', operating internationally as , is a Japanese record label founded in 1910 as Nipponophone Co., Ltd. It affiliated itself with the Columbia Graphophone Company of the United Kingdom and adopted the standard UK ...
and their
Denon is a Japanese electronics company started in 1910 by Frederick Whitney Horn, an American entrepreneur. Denon produced the first cylinder audio media in Japan and players to play them. Decades later, Denon was involved in the early stages of de ...
brand, in the 1960s. The first commercial digital recordings were released in 1971. The BBC also began to experiment with digital audio in the 1960s. By the early 1970s, it had developed a 2-channel recorder, and in 1972 it deployed a digital audio transmission system that linked their broadcast center to their remote transmitters. The first 16-bit PCM recording in the
United States The United States of America (U.S.A. or USA), commonly known as the United States (U.S. or US) or America, is a country Continental United States, primarily located in North America. It consists of 50 U.S. state, states, a Washington, D.C., ...
was made by
Thomas Stockham Thomas Greenway Stockham (December 22, 1933 – January 6, 2004) was an American scientist who developed one of the first practical digital audio recording systems, and pioneered techniques for digital audio recording and processing. He also l ...
at the
Santa Fe Opera Santa Fe Opera (SFO) is an American opera company, located north of Santa Fe, New Mexico. After creating the ''Opera Association of New Mexico'' in 1956, its founding director, John Crosby, oversaw the building of the first opera house on a newl ...
in 1976, on a
Soundstream Soundstream Inc. was the first United States audiophile digital audio recording company, providing commercial services for recording and computer-based editing.Robert Easton, ''Soundstream, the first Digital Studio'', Recording Engineer/Producer, ...
recorder. An improved version of the Soundstream system was used to produce several classical recordings by
Telarc Telarc International Corporation is an American audiophile independent record label founded in 1977 by two classically trained musicians and former teachers, Jack Renner (recording engineer), Jack Renner and Robert Woods (producer), Robert Woods. ...
in 1978. The 3M digital
multitrack recorder Multitrack recording (MTR), also known as multitracking or tracking, is a method of sound recording developed in 1955 that allows for the separate recording of multiple sound sources or of sound sources recorded at different times to create a ...
in development at the time was based on BBC technology. The first all-digital album recorded on this machine was
Ry Cooder Ryland Peter "Ry" Cooder (born March 15, 1947) is an American musician, songwriter, film score composer, record producer, and writer. He is a multi-instrumentalist but is best known for his slide guitar work, his interest in traditional music, a ...
's '' Bop till You Drop'' in 1979. British record label
Decca Decca may refer to: Music * Decca Records or Decca Music Group, a record label * Decca Gold, a classical music record label owned by Universal Music Group * Decca Broadway, a musical theater record label * Decca Studios, a recording facility in We ...
began development of its own 2-track digital audio recorders in 1978 and released the first European digital recording in 1979. Popular professional digital multitrack recorders produced by Sony/Studer (
DASH The dash is a punctuation mark consisting of a long horizontal line. It is similar in appearance to the hyphen but is longer and sometimes higher from the baseline. The most common versions are the endash , generally longer than the hyphen ...
) and Mitsubishi ( ProDigi) in the early 1980s helped to bring about digital recording's acceptance by the major record companies. Machines for these formats had their own transports built-in as well, using
reel-to-reel Reel-to-reel audio tape recording, also called open-reel recording, is magnetic tape audio recording in which the recording tape is spooled between reels. To prepare for use, the ''supply reel'' (or ''feed reel'') containing the tape is pla ...
tape in either 1/4", 1/2", or 1" widths, with the audio data being recorded to the tape using a multi-track stationary tape head.
PCM adaptor A PCM adaptor is a device that encodes digital audio as video for recording on a videocassette recorder. The adapter also has the ability to decode a video signal back to digital audio for playback. This digital audio system was used for mast ...
s allowed for stereo digital audio recording on a conventional NTCS or PAL
video tape recorder A video tape recorder (VTR) is a tape recorder designed to record and playback video and audio material from magnetic tape. The early VTRs were open-reel devices that record on individual reels of 2-inch-wide (5.08 cm) tape. They were u ...
. The 1982 introduction of the CD popularized digital audio with consumers.
ADAT Alesis Digital Audio Tape (ADAT) is a magnetic tape format used for the recording of eight digital audio tracks onto the same S-VHS tape used by consumer VCRs. Although it is a tape-based format, the term ''ADAT'' now refers to its succ ...
became available in the early 1990s, which allowed eight-track 44.1 or 48 kHz recording on S-VHS cassettes, and DTRS performed a similar function with Hi8 tapes. Formats like ProDigi and DASH were referred to as SDAT (Stationary-head Digital Audio Tape) formats, as opposed to formats like the PCM adaptor-based systems and DAT, which were referred to as RDAT (Rotating-head Digital Audio Tape) formats, due to their helical-scan process of recording. Like the DAT cassette, ProDigi and DASH machines also accommodated the obligatory 44.1 kHz sampling rate, but also 48 kHz on all machines, and eventually a 96 kHz sampling rate. They overcame the problems that made typical analog recorders unable to meet the bandwidth (frequency range) demands of digital recording by a combination of higher tape speeds, narrower head gaps used in combination with metal-formulation tapes, and the spreading of data across multiple parallel tracks. Unlike analog systems, modern
digital audio workstation A digital audio workstation (DAW) is an electronic device or application software used for recording, editing and producing audio files. DAWs come in a wide variety of configurations from a single software program on a laptop, to an integr ...
s and
audio interface An audio interface is a piece of computer hardware that allows the input and output of audio signals to and from a host computer or recording device. Audio interfaces are closely related to computer sound cards, but whereas sound cards are opti ...
s allow as many channels in as many different sampling rates as the computer can effectively run at a single time. Avid Audio and
Steinberg Steinberg Media Technologies GmbH (trading as Steinberg) is a German musical software and hardware company based in Hamburg. It develops music writing, recording, arranging, and editing software, most notably Cubase, Nuendo, and Dorico. It als ...
released the first digital audio workstation software programs in 1989. Digital audio workstations make multitrack recording and mixing much easier for large projects which would otherwise be difficult with analog equipment.


Telephony

The rapid development and wide adoption of PCM
digital telephony Telephony ( ) is the field of technology involving the development, application, and deployment of telecommunication services for the purpose of electronic transmission of voice, fax, or data, between distant parties. The history of telephony is i ...
was enabled by
metal–oxide–semiconductor The metal–oxide–semiconductor field-effect transistor (MOSFET, MOS-FET, or MOS FET) is a type of field-effect transistor (FET), most commonly fabricated by the controlled oxidation of silicon. It has an insulated gate, the voltage of which d ...
(MOS)
switched capacitor A switched capacitor (SC) is an electronic circuit that implements a function by moving charges into and out of capacitors when electronic switches are opened and closed. Usually, non-overlapping clock signals are used to control the switches, s ...
(SC) circuit technology, developed in the early 1970s. This led to the development of PCM codec-filter chips in the late 1970s. The silicon-gate
CMOS Complementary metal–oxide–semiconductor (CMOS, pronounced "sea-moss", ) is a type of metal–oxide–semiconductor field-effect transistor (MOSFET) fabrication process that uses complementary and symmetrical pairs of p-type and n-type MOSF ...
(complementary MOS) PCM codec-filter chip, developed by David A. Hodges and W.C. Black in 1980, has since been the industry standard for digital telephony. By the 1990s,
telecommunication network A telecommunications network is a group of nodes interconnected by telecommunications links that are used to exchange messages between the nodes. The links may use a variety of technologies based on the methodologies of circuit switching, mess ...
s such as the
public switched telephone network The public switched telephone network (PSTN) provides infrastructure and services for public telecommunication. The PSTN is the aggregate of the world's circuit-switched telephone networks that are operated by national, regional, or local telep ...
(PSTN) had been largely
digitized DigitizationTech Target. (2011, April). Definition: digitization. ''WhatIs.com''. Retrieved December 15, 2021, from https://whatis.techtarget.com/definition/digitization is the process of converting information into a Digital data, digital (i ...
with VLSI (very
large-scale integration An integrated circuit or monolithic integrated circuit (also referred to as an IC, a chip, or a microchip) is a set of electronic circuits on one small flat piece (or "chip") of semiconductor material, usually silicon. Large numbers of tiny ...
) CMOS PCM codec-filters, widely used in electronic switching systems for
telephone exchanges telephone exchange, telephone switch, or central office is a telecommunications system used in the public switched telephone network (PSTN) or in large enterprises. It interconnects telephone subscriber lines or virtual circuits of digital syste ...
, user-end
modems A modulator-demodulator or modem is a computer hardware device that converts data from a digital format into a format suitable for an analog transmission medium such as telephone or radio. A modem transmits data by modulating one or more carr ...
and a range of
digital transmission Data transmission and data reception or, more broadly, data communication or digital communications is the transfer and reception of data in the form of a digital bitstream or a digitized analog signal transmitted over a point-to-point or ...
applications such as the
integrated services digital network Integrated Services Digital Network (ISDN) is a set of communication standards for simultaneous digital transmission of voice, video, data, and other network services over the digitalised circuits of the public switched telephone network. Work ...
(ISDN),
cordless telephones A cordless telephone or portable telephone has a portable telephone handset that connects by radio to a base station connected to the public telephone network. The operational range is limited, usually to the same building or within some short ...
and
cell phones A mobile phone, cellular phone, cell phone, cellphone, handphone, hand phone or pocket phone, sometimes shortened to simply mobile, cell, or just phone, is a portable telephone that can make and receive calls over a radio frequency link whi ...
.


Technologies

Digital audio is used in
broadcasting Broadcasting is the distribution of audio or video content to a dispersed audience via any electronic mass communications medium, but typically one using the electromagnetic spectrum (radio waves), in a one-to-many model. Broadcasting began wi ...
of audio. Standard technologies include
Digital audio broadcasting Digital radio is the use of digital technology to transmit or receive across the radio spectrum. Digital transmission by radio waves includes digital broadcasting, and especially digital audio radio services. Types In digital broadcasting ...
(DAB),
Digital Radio Mondiale Digital Radio Mondiale (DRM; ''mondiale'' being Italian and French for "worldwide") is a set of digital audio broadcasting technologies designed to work over the bands currently used for analogue radio broadcasting including AM broadcasting—pa ...
(DRM),
HD Radio HD Radio (HDR) is a trademark for an in-band on-channel (IBOC) digital radio broadcast technology. It generally simulcasts an existing analog radio station in digital format with less noise and with additional text information. HD Radio is used ...
and
In-band on-channel In-band on-channel (IBOC) is a hybrid method of transmitting digital radio and analog radio broadcast signals simultaneously on the same frequency. The name refers to the new digital signals being broadcast in the same AM or FM band (in-band) ...
(IBOC). Digital audio in recording applications is stored on audio-specific technologies including CD, Digital Audio Tape (DAT),
Digital Compact Cassette The Digital Compact Cassette (DCC) is a magnetic tape sound recording format introduced by Philips and Matsushita Electric in late and marketed as the successor to the standard analog Compact Cassette. It was also a direct competitor to Sony ...
(DCC) and
MiniDisc MiniDisc (MD) is an erasable magneto-optical disc-based data storage format offering a capacity of 60, 74, and later, 80 minutes of digitized audio. Sony announced the MiniDisc in September 1992 and released it in November of that year fo ...
. Digital audio may be stored in a standard
audio file format An audio file format is a file format for storing digital audio data on a computer system. The bit layout of the audio data (excluding metadata) is called the audio coding format and can be uncompressed, or compressed to reduce the file size, ofte ...
s and stored on a
Hard disk recorder A hard disk recorder (HDR) is a system that uses a high-capacity hard disk to record digital audio or digital video. Hard disk recording systems represent an alternative to reel-to-reel audio tape recording and video tape recorders, and provide ...
,
Blu-ray The Blu-ray Disc (BD), often known simply as Blu-ray, is a digital optical disc data storage format. It was invented and developed in 2005 and released on June 20, 2006 worldwide. It is designed to supersede the DVD format, and capable of st ...
or
DVD-Audio DVD-Audio (commonly abbreviated as DVD-A) is a digital format for delivering high-fidelity audio content on a DVD. DVD-Audio uses most of the storage on the disc for high-quality audio and is not intended to be a video delivery format. The st ...
. Files may be played back on smartphones, computers or
MP3 player A portable media player (PMP) (also including the related digital audio player (DAP)) is a portable consumer electronics device capable of storing and playing digital media such as audio, images, and video files. The data is typically stored ...
. Digital audio resolution is measured in sample depth. Most digital audio formats use a sample depth of either 16-bit, 24-bit, and 32-bit.


Interfaces

For
personal computer A personal computer (PC) is a multi-purpose microcomputer whose size, capabilities, and price make it feasible for individual use. Personal computers are intended to be operated directly by an end user, rather than by a computer expert or te ...
s, USB and
IEEE 1394 IEEE 1394 is an interface standard for a serial bus for high-speed communications and isochronous real-time data transfer. It was developed in the late 1980s and early 1990s by Apple in cooperation with a number of companies, primarily Sony an ...
have provisions to deliver real-time digital audio. USB interfaces have become increasingly popular among independent audio engineers and producers due to their small size and ease of use. In professional architectural or installation applications, many
audio over Ethernet In audio and broadcast engineering, Audio over Ethernet (sometimes AoE—not to be confused with ATA over Ethernet) is the use of an Ethernet-based network to distribute real-time digital audio. AoE replaces bulky snake cables or audio-specific ...
protocols and interfaces exist. In
broadcasting Broadcasting is the distribution of audio or video content to a dispersed audience via any electronic mass communications medium, but typically one using the electromagnetic spectrum (radio waves), in a one-to-many model. Broadcasting began wi ...
, a more general
audio over IP Audio over IP (AoIP) is the distribution of digital audio across an IP network such as the Internet. It is used increasingly to provide high-quality audio feeds over long distances. The application is also known as audio contribution over IP (ACI ...
network technology is favored. In telephony
voice over IP Voice over Internet Protocol (VoIP), also called IP telephony, is a method and group of technologies for the delivery of voice communications and multimedia sessions over Internet Protocol (IP) networks, such as the Internet. The terms Internet t ...
is used as a network interface for digital audio for voice communications. Several interfaces are engineered to carry digital video and audio together, including
HDMI High-Definition Multimedia Interface (HDMI) is a proprietary audio/video interface for transmitting uncompressed video data and compressed or uncompressed digital audio data from an HDMI-compliant source device, such as a display controlle ...
and
DisplayPort DisplayPort (DP) is a digital display interface developed by a consortium of PC and chip manufacturers and standardized by the Video Electronics Standards Association (VESA). It is primarily used to connect a video source to a display device su ...
. Some interfaces offer
MIDI MIDI (; Musical Instrument Digital Interface) is a technical standard that describes a communications protocol, digital interface, and electrical connectors that connect a wide variety of electronic musical instruments, computers, and ...
support as well as XLR and TRS analog ports. Digital-audio-specific interfaces include: *
A2DP In order to use Bluetooth, a device must be compatible with the subset of Bluetooth ''profiles'' (often called services or functions) necessary to use the desired services. A Bluetooth profile is a specification regarding an aspect of Bluetooth-b ...
via
Bluetooth Bluetooth is a short-range wireless technology standard that is used for exchanging data between fixed and mobile devices over short distances and building personal area networks (PANs). In the most widely used mode, transmission power is limi ...
*
AC'97 AC'97 (''Audio Codec '97;'' also MC'97 for ''Modem Codec '97'') is an audio codec standard developed by Intel Architecture Labs in 1997. The standard was used in motherboards, modems, and sound cards. The specification covers two types of compo ...
(Audio Codec 1997) interface between
integrated circuit An integrated circuit or monolithic integrated circuit (also referred to as an IC, a chip, or a microchip) is a set of electronic circuits on one small flat piece (or "chip") of semiconductor material, usually silicon. Large numbers of tiny ...
s on PC motherboards *
ADAT Lightpipe The ADAT Lightpipe, officially the ADAT Optical Interface, is a standard for the transfer of digital audio between equipment. It was originally developed by Alesis but has since become widely accepted, with many third party hardware manufacturers ...
interface *
AES3 AES3 is a standard for the exchange of digital audio signals between professional audio devices. An AES3 signal can carry two channels of pulse-code-modulated digital audio over several transmission media including balanced lines, unbalanced ...
interface with
XLR connector The XLR connector is a type of electrical connector primarily used in professional audio, video, and stage lighting equipment. XLR connectors are cylindical in design, and have three to seven connector pins, and are often employed for analog b ...
s, common in professional audio equipment * AES47 - professional AES3-style digital audio over
Asynchronous Transfer Mode Asynchronous Transfer Mode (ATM) is a telecommunications standard defined by American National Standards Institute (ANSI) and ITU-T (formerly CCITT) for digital transmission of multiple types of traffic. ATM was developed to meet the needs of ...
networks *
Intel High Definition Audio Intel High Definition Audio (IHDA) (also called HD Audio or development codename Azalia) is a specification for the audio sub-system of personal computers. It was released by Intel in 2004 as the successor to their AC'97 PC audio standard. Featur ...
- modern replacement for AC'97 *
I²S I²S (Inter-IC Sound, pronounced "eye-squared-ess"), is an electrical serial bus interface standard used for connecting digital audio devices together. It is used to communicate PCM audio data between integrated circuits in an electronic device ...
(Inter-IC sound) interface between
integrated circuit An integrated circuit or monolithic integrated circuit (also referred to as an IC, a chip, or a microchip) is a set of electronic circuits on one small flat piece (or "chip") of semiconductor material, usually silicon. Large numbers of tiny ...
s in consumer electronics *
MADI Madi may refer to: Places * Madi, Chitwan, a municipality in Chitwan District in Nepal * Madi Municipality, Sankhuwasabha, a municipality in Sankhuwasabha District in Nepal * Madi Rural Municipality, Rolpa, a rural municipality in Rolpa Distric ...
(Multichannel Audio Digital Interface) *
MIDI MIDI (; Musical Instrument Digital Interface) is a technical standard that describes a communications protocol, digital interface, and electrical connectors that connect a wide variety of electronic musical instruments, computers, and ...
- low-bandwidth interconnect for carrying instrument data; cannot carry sound but can carry digital sample data in non-real time *
S/PDIF S/PDIF (Sony/Philips Digital Interface) is a type of digital audio interface used in consumer audio equipment to output audio over relatively short distances. The signal is transmitted over either a coaxial cable (using RCA or BNC connectors ...
- either over
coaxial cable Coaxial cable, or coax (pronounced ) is a type of electrical cable consisting of an inner conductor surrounded by a concentric conducting shield, with the two separated by a dielectric ( insulating material); many coaxial cables also have a p ...
or TOSLINK, common in consumer audio equipment and derived from AES3 * TDIF, TASCAM proprietary format with
D-sub The D-subminiature or D-sub is a common type of electrical connector. They are named for their characteristic D-shaped metal shield. When they were introduced, D-subs were among the smallest connectors used on computer systems. Description, no ...
cable


See also

* Digital audio editor *
Digital synthesizer A digital synthesizer is a synthesizer that uses digital signal processing (DSP) techniques to make musical sounds. This in contrast to older analog synthesizers, which produce music using analog electronics, and samplers, which play back digit ...
*
Frequency modulation synthesis Frequency modulation synthesis (or FM synthesis) is a form of sound synthesis whereby the frequency of a waveform is changed by modulating its frequency with a modulator. The frequency of an oscillator is altered "in accordance with the amplitu ...
*
Sound chip A sound chip is an integrated circuit (chip) designed to produce audio signals through digital, analog or mixed-mode electronics. Sound chips are typically fabricated on metal–oxide–semiconductor (MOS) mixed-signal chips that process ...
*
Sound Card A sound card (also known as an audio card) is an internal expansion card that provides input and output of audio signals to and from a computer under the control of computer programs. The term ''sound card'' is also applied to external audio ...
*
Audio Interface An audio interface is a piece of computer hardware that allows the input and output of audio signals to and from a host computer or recording device. Audio interfaces are closely related to computer sound cards, but whereas sound cards are opti ...
* Quantization * Sampling *
Multitrack recording Multitrack recording (MTR), also known as multitracking or tracking, is a method of sound recording developed in 1955 that allows for the separate recording of multiple sound sources or of sound sources recorded at different times to create a ...
*
Digital audio workstation A digital audio workstation (DAW) is an electronic device or application software used for recording, editing and producing audio files. DAWs come in a wide variety of configurations from a single software program on a laptop, to an integr ...


Notes


References


Further reading

*Borwick, John, ed., 1994: ''Sound Recording Practice'' (Oxford: Oxford University Press) *Bosi, Marina, and Goldberg, Richard E., 2003: ''Introduction to Digital Audio Coding and Standards'' (Springer) *Ifeachor, Emmanuel C., and Jervis, Barrie W., 2002: ''Digital Signal Processing: A Practical Approach'' (Harlow, England: Pearson Education Limited) *Rabiner, Lawrence R., and Gold, Bernard, 1975: ''Theory and Application of Digital Signal Processing'' (Englewood Cliffs, New Jersey: Prentice-Hall, Inc.) *Watkinson, John, 1994: ''The Art of Digital Audio'' (Oxford: Focal Press)


External links

* * * {{Music technology