HOME

TheInfoList



OR:

Digital audio is a representation of sound recorded in, or converted into, digital form. In digital audio, the
sound wave In physics, sound is a vibration that propagates as an acoustic wave through a transmission medium such as a gas, liquid or solid. In human physiology and psychology, sound is the ''reception'' of such waves and their ''perception'' by the ...
of the
audio signal An audio signal is a representation of sound, typically using either a changing level of electrical voltage for analog signals or a series of binary numbers for Digital signal (signal processing), digital signals. Audio signals have frequencies i ...
is typically encoded as numerical samples in a continuous sequence. For example, in CD audio, samples are taken 44,100 times per second, each with 16-bit resolution. Digital audio is also the name for the entire technology of
sound recording and reproduction Sound recording and reproduction is the electrical, Mechanical system, mechanical, electronic, or digital inscription and re-creation of sound waves, such as spoken voice, singing, instrumental music, or sound effects. The two main classes of ...
using audio signals that have been encoded in digital form. Following significant advances in digital audio technology during the 1970s and 1980s, it gradually replaced analog audio technology in many areas of
audio engineering Audio most commonly refers to sound, as it is transmitted in signal form. It may also refer to: Sound *Audio signal, an electrical representation of sound * Audio frequency, a frequency in the audio spectrum *Digital audio, representation of soun ...
, record production and
telecommunications Telecommunication, often used in its plural form or abbreviated as telecom, is the transmission of information over a distance using electronic means, typically through cables, radio waves, or other communication technologies. These means of ...
in the 1990s and 2000s. In a digital audio system, an analog electrical signal representing the sound is converted with an
analog-to-digital converter In electronics, an analog-to-digital converter (ADC, A/D, or A-to-D) is a system that converts an analog signal, such as a sound picked up by a microphone or light entering a digital camera, into a Digital signal (signal processing), digi ...
(ADC) into a digital signal, typically using pulse-code modulation (PCM). This digital signal can then be recorded, edited, modified, and copied using
computer A computer is a machine that can be Computer programming, programmed to automatically Execution (computing), carry out sequences of arithmetic or logical operations (''computation''). Modern digital electronic computers can perform generic set ...
s, audio playback machines, and other digital tools. For playback, a digital-to-analog converter (DAC) performs the reverse process, converting a digital signal back into an analog signal, which is then sent through an audio power amplifier and ultimately to a loudspeaker. Digital audio systems may include compression, storage, processing, and transmission components. Conversion to a digital format allows convenient manipulation, storage, transmission, and retrieval of an audio signal. Unlike analog audio, in which making copies of a recording results in generation loss and degradation of signal quality, digital audio allows an infinite number of copies to be made without any degradation of signal quality.


Overview

Digital audio technologies are used in the recording, manipulation, mass-production, and distribution of sound, including recordings of
song A song is a musical composition performed by the human voice. The voice often carries the melody (a series of distinct and fixed pitches) using patterns of sound and silence. Songs have a structure, such as the common ABA form, and are usu ...
s, instrumental pieces,
podcast A podcast is a Radio program, program made available in digital format for download over the Internet. Typically, a podcast is an Episode, episodic series of digital audio Computer file, files that users can download to a personal device or str ...
s, sound effects, and other sounds. Modern online music distribution depends on digital recording and
data compression In information theory, data compression, source coding, or bit-rate reduction is the process of encoding information using fewer bits than the original representation. Any particular compression is either lossy or lossless. Lossless compressi ...
. The availability of music as data files, rather than as physical objects, has significantly reduced the costs of distribution as well as making it easier to share copies. Before digital audio, the music industry distributed and sold music by selling physical copies in the form of records and
cassette tape The Compact Cassette, also commonly called a cassette tape, audio cassette, or simply tape or cassette, is an analog audio, analog magnetic tape recording format for Sound recording and reproduction, audio recording and playback. Invented by L ...
s. With digital audio and online distribution systems such as
iTunes iTunes is a media player, media library, and mobile device management (MDM) utility developed by Apple. It is used to purchase, play, download and organize digital multimedia on personal computers running the macOS and Windows operating s ...
, companies sell digital sound files to consumers, which the consumer receives over the Internet. Popular streaming services such as
Apple Music Apple Music is an audio and video streaming service developed by Apple Inc. Users can select music to stream to their device on-demand, or listen to existing playlists. The service also includes the sister internet radio stations Apple Musi ...
,
Spotify Spotify (; ) is a List of companies of Sweden, Swedish Music streaming service, audio streaming and media service provider founded on 23 April 2006 by Daniel Ek and Martin Lorentzon. , it is one of the largest providers of music streaming services ...
, or
YouTube YouTube is an American social media and online video sharing platform owned by Google. YouTube was founded on February 14, 2005, by Steve Chen, Chad Hurley, and Jawed Karim who were three former employees of PayPal. Headquartered in ...
, offer temporary access to the digital file, and are now the most common form of music consumption. An analog audio system converts physical waveforms of sound into electrical representations of those waveforms by use of a
transducer A transducer is a device that Energy transformation, converts energy from one form to another. Usually a transducer converts a signal in one form of energy to a signal in another. Transducers are often employed at the boundaries of automation, M ...
, such as a
microphone A microphone, colloquially called a mic (), or mike, is a transducer that converts sound into an electrical signal. Microphones are used in many applications such as telephones, hearing aids, public address systems for concert halls and publi ...
. The sounds are then stored on an analog medium such as
magnetic tape Magnetic tape is a medium for magnetic storage made of a thin, magnetizable coating on a long, narrow strip of plastic film. It was developed in Germany in 1928, based on the earlier magnetic wire recording from Denmark. Devices that use magnetic ...
, or transmitted through an analog medium such as a telephone line or
radio Radio is the technology of communicating using radio waves. Radio waves are electromagnetic waves of frequency between 3  hertz (Hz) and 300  gigahertz (GHz). They are generated by an electronic device called a transmitter connec ...
. The process is reversed for reproduction: the electrical audio signal is amplified and then converted back into physical waveforms via a loudspeaker. Analog audio retains its fundamental wave-like characteristics throughout its storage, transformation, duplication, and amplification. Analog audio signals are susceptible to noise and distortion, due to the innate characteristics of electronic circuits and associated devices. Disturbances in a digital system do not result in error unless they are so large as to result in a symbol being misinterpreted as another symbol or disturbing the sequence of symbols. It is, therefore, generally possible to have an entirely error-free digital audio system in which no noise or distortion is introduced between conversion to digital format and conversion back to analog. A digital audio signal may be encoded for correction of any errors that might occur in the storage or transmission of the signal. This technique, known as
channel coding In computing, telecommunication, information theory, and coding theory, forward error correction (FEC) or channel coding is a technique used for error control, controlling errors in data transmission over unreliable or noisy communication channel ...
, is essential for broadcast or recorded digital systems to maintain bit accuracy. Eight-to-fourteen modulation is the channel code used for the audio
compact disc The compact disc (CD) is a Digital media, digital optical disc data storage format co-developed by Philips and Sony to store and play digital audio recordings. It employs the Compact Disc Digital Audio (CD-DA) standard and was capable of hol ...
(CD).


Conversion process

If an audio signal is analog, a digital audio system starts with an ADC that converts an analog signal to a digital signal. The ADC runs at a specified sampling rate and converts at a known bit resolution. CD audio, for example, has a sampling rate of 44.1  kHz (44,100 samples per second), and has 16-bit resolution for each stereo channel. Analog signals that have not already been bandlimited must be passed through an anti-aliasing filter before conversion, to prevent the aliasing distortion that is caused by audio signals with frequencies higher than the Nyquist frequency (half the sampling rate). A digital audio signal may be stored or transmitted. Digital audio can be stored on a CD, a digital audio player, a
hard drive A hard disk drive (HDD), hard disk, hard drive, or fixed disk is an electro-mechanical data storage device that stores and retrieves digital data using magnetic storage with one or more rigid rapidly rotating hard disk drive platter, pla ...
, a
USB flash drive A flash drive (also thumb drive, memory stick, and pen drive/pendrive) is a data storage device that includes flash memory with an integrated USB interface. A typical USB drive is removable, rewritable, and smaller than an optical disc, and u ...
, or any other digital data storage device. The digital signal may be altered through
digital signal processing Digital signal processing (DSP) is the use of digital processing, such as by computers or more specialized digital signal processors, to perform a wide variety of signal processing operations. The digital signals processed in this manner are a ...
, where it may be filtered or have effects applied. Sample-rate conversion including upsampling and downsampling may be used to change signals that have been encoded with a different sampling rate to a common sampling rate prior to processing. Audio data compression techniques, such as MP3,
Advanced Audio Coding Advanced Audio Coding (AAC) is an audio coding standard for lossy digital audio compression. It was developed by Dolby, AT&T, Fraunhofer and Sony, originally as part of the MPEG-2 specification but later improved under MPEG-4.ISO (2006ISO/ ...
(AAC), Opus, Ogg Vorbis, or FLAC, are commonly employed to reduce the file size. Digital audio can be carried over digital audio interfaces such as AES3 or MADI. Digital audio can be carried over a network using
audio over Ethernet In audio engineering, audio and broadcast engineering, broadcast engineering, Audio networking is the use of a Computer networking, network to distribute real-time digital audio. Audio Networking replaces bulky snake cables or audio-specific insta ...
, audio over IP or other
streaming media Streaming media refers to multimedia delivered through a Computer network, network for playback using a Media player (disambiguation), media player. Media is transferred in a ''stream'' of Network packet, packets from a Server (computing), ...
standards and systems. For playback, digital audio must be converted back to an analog signal with a DAC. According to the Nyquist–Shannon sampling theorem, with some practical and theoretical restrictions, a band-limited version of the original analog signal can be accurately reconstructed from the digital signal. During conversion, audio data can be embedded with a digital watermark to prevent piracy and unauthorized use. Watermarking is done using a direct-sequence spread-spectrum (DSSS) method. The audio information is then modulated by a pseudo-noise (PN) sequence, then shaped within the frequency domain and put back in the original signal. The strength of the embedding determines the strength of the watermark on the audio data.


History


Coding

Pulse-code modulation (PCM) was invented by British scientist Alec Reeves in 1937. In 1950, C. Chapin Cutler of
Bell Labs Nokia Bell Labs, commonly referred to as ''Bell Labs'', is an American industrial research and development company owned by Finnish technology company Nokia. With headquarters located in Murray Hill, New Jersey, Murray Hill, New Jersey, the compa ...
filed the patent on differential pulse-code modulation (DPCM), a
data compression In information theory, data compression, source coding, or bit-rate reduction is the process of encoding information using fewer bits than the original representation. Any particular compression is either lossy or lossless. Lossless compressi ...
algorithm. Adaptive DPCM (ADPCM) was introduced by P. Cummiskey, Nikil S. Jayant and James L. Flanagan at Bell Labs in 1973. Perceptual coding was first used for
speech coding Speech coding is an application of data compression to digital audio signals containing speech. Speech coding uses speech-specific parameter estimation using audio signal processing techniques to model the speech signal, combined with generic da ...
compression, with linear predictive coding (LPC). Initial concepts for LPC date back to the work of Fumitada Itakura ( Nagoya University) and Shuzo Saito ( Nippon Telegraph and Telephone) in 1966. During the 1970s, Bishnu S. Atal and Manfred R. Schroeder at Bell Labs developed a form of LPC called adaptive predictive coding (APC), a perceptual coding algorithm that exploited the masking properties of the human ear, followed in the early 1980s with the code-excited linear prediction (CELP) algorithm. Discrete cosine transform (DCT) coding, a lossy compression method first proposed by Nasir Ahmed in 1972, provided the basis for the modified discrete cosine transform (MDCT), which was developed by J. P. Princen, A. W. Johnson and A. B. Bradley in 1987.J. P. Princen, A. W. Johnson und A. B. Bradley: ''Subband/transform coding using filter bank designs based on time domain aliasing cancellation'', IEEE Proc. Intl. Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2161–2164, 1987. The MDCT is the basis for most audio coding standards, such as Dolby Digital (AC-3), MP3 (
MPEG The Moving Picture Experts Group (MPEG) is an alliance of working groups established jointly by International Organization for Standardization, ISO and International Electrotechnical Commission, IEC that sets standards for media coding, includ ...
Layer III), AAC, Windows Media Audio (WMA), Opus and Vorbis ( Ogg).


Recording

PCM was used in
telecommunications Telecommunication, often used in its plural form or abbreviated as telecom, is the transmission of information over a distance using electronic means, typically through cables, radio waves, or other communication technologies. These means of ...
applications long before its first use in commercial broadcast and recording. Commercial digital recording was pioneered in Japan by
NHK , also known by its Romanization of Japanese, romanized initialism NHK, is a Japanese public broadcasting, public broadcaster. It is a statutory corporation funded by viewers' payments of a television licence, television license fee. NHK ope ...
and Nippon Columbia and their Denon brand, in the 1960s. The first commercial digital recordings were released in 1971. The
BBC The British Broadcasting Corporation (BBC) is a British public service broadcaster headquartered at Broadcasting House in London, England. Originally established in 1922 as the British Broadcasting Company, it evolved into its current sta ...
also began to experiment with digital audio in the 1960s. By the early 1970s, it had developed a 2-channel recorder, and in 1972 it deployed a digital audio transmission system that linked their broadcast center to their remote transmitters. The first 16-bit PCM recording in the
United States The United States of America (USA), also known as the United States (U.S.) or America, is a country primarily located in North America. It is a federal republic of 50 U.S. state, states and a federal capital district, Washington, D.C. The 48 ...
was made by Thomas Stockham at the Santa Fe Opera in 1976, on a Soundstream recorder. An improved version of the Soundstream system was used to produce several classical recordings by Telarc in 1978. The 3M digital multitrack recorder in development at the time was based on BBC technology. The first all-digital album recorded on this machine was Ry Cooder's '' Bop till You Drop'' in 1979. British record label Decca began development of its own 2-track digital audio recorders in 1978 and released the first European digital recording in 1979. Popular professional digital multitrack recorders produced by Sony/Studer (
DASH The dash is a punctuation mark consisting of a long horizontal line. It is similar in appearance to the hyphen but is longer and sometimes higher from the baseline. The most common versions are the endash , generally longer than the hyphen ...
) and Mitsubishi ( ProDigi) in the early 1980s helped to bring about digital recording's acceptance by the major record companies. Machines for these formats had their own transports built-in as well, using reel-to-reel tape in either 1/4", 1/2", or 1" widths, with the audio data being recorded to the tape using a multi-track stationary tape head. PCM adaptors allowed for stereo digital audio recording on a conventional NTSC or PAL
video tape recorder A video tape recorder (VTR) is a tape recorder designed to record and playback video and audio signal, audio material from magnetic tape. The early VTRs were open-reel devices that record on individual reels of 2-inch-wide (5.08 cm) tape. ...
. The 1982 introduction of the CD by Philips and
Sony is a Japanese multinational conglomerate (company), conglomerate headquartered at Sony City in Minato, Tokyo, Japan. The Sony Group encompasses various businesses, including Sony Corporation (electronics), Sony Semiconductor Solutions (i ...
popularized digital audio with consumers. ADAT became available in the early 1990s, which allowed eight-track 44.1 or 48 kHz recording on S-VHS cassettes, and DTRS performed a similar function with Hi8 tapes. Formats like ProDigi and DASH were referred to as SDAT (stationary-head digital audio tape) formats, as opposed to formats like the PCM adaptor-based systems and Digital Audio Tape (DAT), which were referred to as RDAT (rotating-head digital audio tape) formats, due to their helical-scan process of recording. Like the DAT cassette, ProDigi and DASH machines also accommodated the obligatory 44.1 kHz sampling rate, but also 48 kHz on all machines, and eventually a 96 kHz sampling rate. They overcame the problems that made typical analog recorders unable to meet the bandwidth (frequency range) demands of digital recording by a combination of higher tape speeds, narrower head gaps used in combination with metal-formulation tapes, and the spreading of data across multiple parallel tracks. Unlike analog systems, modern
digital audio workstation A digital audio workstation (DAW ) is an electronic device or application software used for Sound recording and reproduction, recording, editing and producing audio files. DAWs come in a wide variety of configurations from a single software pr ...
s and audio interfaces allow as many channels in as many different sampling rates as the computer can effectively run at a single time. Avid Audio and Steinberg released the first digital audio workstation software programs in 1989. Digital audio workstations make multitrack recording and mixing much easier for large projects which would otherwise be difficult with analog equipment.


Telephony

The rapid development and wide adoption of PCM digital telephony was enabled by metal–oxide–semiconductor (MOS) switched capacitor (SC) circuit technology, developed in the early 1970s. This led to the development of PCM codec-filter chips in the late 1970s. The silicon-gate CMOS (complementary MOS) PCM codec-filter chip, developed by David A. Hodges and W.C. Black in 1980, has since been the industry standard for digital telephony. By the 1990s, telecommunication networks such as the
public switched telephone network The public switched telephone network (PSTN) is the aggregate of the world's telephone networks that are operated by national, regional, or local telephony operators. It provides infrastructure and services for public telephony. The PSTN consists o ...
(PSTN) had been largely digitized with VLSI (very
large-scale integration An integrated circuit (IC), also known as a microchip or simply chip, is a set of electronic circuits, consisting of various electronic components (such as transistors, resistors, and capacitors) and their interconnections. These components a ...
) CMOS PCM codec-filters, widely used in electronic switching systems for telephone exchanges, user-end modems and a range of digital transmission applications such as the integrated services digital network (ISDN), cordless telephones and
cell phones A mobile phone or cell phone is a portable telephone that allows users to make and receive Telephone call, calls over a radio frequency link while moving within a designated telephone service area, unlike fixed-location phones (landline phone ...
.


Technologies

Digital audio is used in
broadcasting Broadcasting is the data distribution, distribution of sound, audio audiovisual content to dispersed audiences via a electronic medium (communication), mass communications medium, typically one using the electromagnetic spectrum (radio waves), ...
of audio. Standard technologies include
Digital audio broadcasting Digital Audio Broadcasting (DAB) is a digital radio international standard, standard for broadcasting digital audio radio services in many countries around the world, defined, supported, marketed and promoted by the WorldDAB organisation. T ...
(DAB), Digital Radio Mondiale (DRM), HD Radio and In-band on-channel (IBOC). Digital audio in recording applications is stored on audio-specific technologies including CD, DAT, Digital Compact Cassette (DCC) and MiniDisc. Digital audio may be stored in a standard
audio file format An audio file format is a file format for storing digital audio data on a computer system. The bit layout of the audio data (excluding metadata) is called the audio coding format and can be uncompressed, or audio compression (data), compressed t ...
s and stored on a Hard disk recorder,
Blu-ray Blu-ray (Blu-ray Disc or BD) is a digital optical disc data storage format designed to supersede the DVD format. It was invented and developed in 2005 and released worldwide on June 20, 2006, capable of storing several hours of high-defin ...
or DVD-Audio. Files may be played back on smartphones, computers or MP3 player. Digital audio resolution is measured in audio bit depth. Most digital audio formats use either 16-bit, 24-bit, and 32-bit resolution.


See also

*
Digital audio editor Audio editing software is any software or computer program which allows editing and generating audio data. Audio editing software can be implemented completely or partly as a library, as a computer application, as a web application, or as a ...
* Digital synthesizer *
Frequency modulation synthesis Frequency modulation synthesis (or FM synthesis) is a form of Synthesizer#Sound synthesis, sound synthesis whereby the frequency of a waveform is changed by Frequency modulation, modulating its frequency with a modulator. The instantaneous frequen ...
* Sound chip * Sound card * Audio Interface * Quantization * Sampling *
Multitrack recording Multitrack recording (MTR), also known as multitracking, is a method of sound recording developed in 1955 that allows for the separate recording of multiple sound sources or of sound sources recorded at different times to create a cohesive who ...
*
Digital audio workstation A digital audio workstation (DAW ) is an electronic device or application software used for Sound recording and reproduction, recording, editing and producing audio files. DAWs come in a wide variety of configurations from a single software pr ...


Notes


References


Further reading

*Borwick, John, ed., 1994: ''Sound Recording Practice'' (Oxford: Oxford University Press) *Bosi, Marina, and Goldberg, Richard E., 2003: ''Introduction to Digital Audio Coding and Standards'' (Springer) *Ifeachor, Emmanuel C., and Jervis, Barrie W., 2002: ''Digital Signal Processing: A Practical Approach'' (Harlow, England: Pearson Education Limited) *Rabiner, Lawrence R., and Gold, Bernard, 1975: ''Theory and Application of Digital Signal Processing'' (Englewood Cliffs, New Jersey: Prentice-Hall, Inc.) *Watkinson, John, 1994: ''The Art of Digital Audio'' (Oxford: Focal Press)


External links

* * * {{Music technology