HOME

TheInfoList



OR:

Waveform Audio File Format (WAVE, or WAV due to its
filename extension A filename extension, file name extension or file extension is a suffix to the name of a computer file (for example, .txt, .mp3, .exe) that indicates a characteristic of the file contents or its intended use. A filename extension is typically d ...
; pronounced or ) is an
audio file format An audio file format is a file format for storing digital audio data on a computer system. The bit layout of the audio data (excluding metadata) is called the audio coding format and can be uncompressed, or audio compression (data), compressed t ...
standard for storing an audio
bitstream A bitstream (or bit stream), also known as binary sequence, is a sequence of bits. A bytestream is a sequence of bytes. Typically, each byte is an 8-bit quantity, and so the term octet stream is sometimes used interchangeably. An octet may ...
on
personal computer A personal computer, commonly referred to as PC or computer, is a computer designed for individual use. It is typically used for tasks such as Word processor, word processing, web browser, internet browsing, email, multimedia playback, and PC ...
s. The format was developed and published for the first time in 1991 by
IBM International Business Machines Corporation (using the trademark IBM), nicknamed Big Blue, is an American Multinational corporation, multinational technology company headquartered in Armonk, New York, and present in over 175 countries. It is ...
and
Microsoft Microsoft Corporation is an American multinational corporation and technology company, technology conglomerate headquartered in Redmond, Washington. Founded in 1975, the company became influential in the History of personal computers#The ear ...
. It is the main format used on
Microsoft Windows Windows is a Product lining, product line of Proprietary software, proprietary graphical user interface, graphical operating systems developed and marketed by Microsoft. It is grouped into families and subfamilies that cater to particular sec ...
systems for
uncompressed audio Pulse-code modulation (PCM) is a method used to digitally represent analog signals. It is the standard form of digital audio in computers, compact discs, digital telephony and other digital audio applications. In a PCM stream, the amplitude o ...
. The usual bitstream encoding is the
linear pulse-code modulation Pulse-code modulation (PCM) is a method used to digitally represent analog signals. It is the standard form of digital audio in computers, compact discs, digital telephony and other digital audio applications. In a PCM stream, the amplitude o ...
(LPCM) format. WAV is an application of the
Resource Interchange File Format Resource Interchange File Format (RIFF) is a generic file container format (digital), container format for storing data in tagged Chunk (information), chunks. It is primarily used for audio and video, though it can be used for arbitrary data. T ...
(RIFF) bitstream format method for storing data in ''chunks'', and thus is similar to the 8SVX and the
Audio Interchange File Format Audio Interchange File Format (AIFF) is an audio file format standard used for storing sound data for personal computers and other electronic audio devices. The format was developed by Apple Inc. in 1988 based on Electronic Arts' Interchange F ...
(AIFF) format used on
Amiga Amiga is a family of personal computers produced by Commodore International, Commodore from 1985 until the company's bankruptcy in 1994, with production by others afterward. The original model is one of a number of mid-1980s computers with 16-b ...
and
Macintosh Mac is a brand of personal computers designed and marketed by Apple Inc., Apple since 1984. The name is short for Macintosh (its official name until 1999), a reference to the McIntosh (apple), McIntosh apple. The current product lineup inclu ...
computers, respectively.


Description

The WAV file is an instance of a
Resource Interchange File Format Resource Interchange File Format (RIFF) is a generic file container format (digital), container format for storing data in tagged Chunk (information), chunks. It is primarily used for audio and video, though it can be used for arbitrary data. T ...
(RIFF) defined by
IBM International Business Machines Corporation (using the trademark IBM), nicknamed Big Blue, is an American Multinational corporation, multinational technology company headquartered in Armonk, New York, and present in over 175 countries. It is ...
and
Microsoft Microsoft Corporation is an American multinational corporation and technology company, technology conglomerate headquartered in Redmond, Washington. Founded in 1975, the company became influential in the History of personal computers#The ear ...
. The RIFF format acts as a ''wrapper'' for various
audio coding format An audio coding format (or sometimes audio compression format) is a content representation format for storage or transmission of digital audio (such as in digital television, digital radio and in audio and video files). Examples of audio coding f ...
s. Though a WAV file can contain compressed audio, the most common WAV audio format is uncompressed audio in the
linear pulse-code modulation Pulse-code modulation (PCM) is a method used to digitally represent analog signals. It is the standard form of digital audio in computers, compact discs, digital telephony and other digital audio applications. In a PCM stream, the amplitude o ...
(LPCM) format. LPCM is also the standard audio coding format for
audio CD Compact Disc Digital Audio (CDDA or CD-DA), also known as Digital Audio Compact Disc or simply as Audio CD, is the standard format for audio compact discs. The standard is defined in the '' Red Book'' technical specifications, which is why t ...
s, which store two-channel LPCM audio
sampled Sample or samples may refer to: * Sample (graphics), an intersection of a color channel and a pixel * Sample (material), a specimen or small quantity of something * Sample (signal), a digital discrete sample of a continuous analog signal * Sample ...
at 44.1 kHz with 16 bits per sample. Since LPCM is uncompressed and retains all of the samples of an audio track, professional users or audio experts may use the WAV format with LPCM audio for maximum audio quality. WAV files can also be edited and manipulated with relative ease using software. On Microsoft Windows, the WAV format supports compressed audio using the
Audio Compression Manager This article describes audio APIs and components in Microsoft Windows which are now obsolete or deprecated. Multimedia Extensions (MME) The MME API or the Windows Multimedia API (also known as ''WinMM'') was the first universal and standardized W ...
(ACM). Any ACM
codec A codec is a computer hardware or software component that encodes or decodes a data stream or signal. ''Codec'' is a portmanteau of coder/decoder. In electronic communications, an endec is a device that acts as both an encoder and a decoder o ...
can be used to compress a WAV file. The
user interface In the industrial design field of human–computer interaction, a user interface (UI) is the space where interactions between humans and machines occur. The goal of this interaction is to allow effective operation and control of the machine fro ...
(UI) for ACM may be accessed through various programs that use it, including Sound Recorder in some versions of Windows. Beginning with
Windows 2000 Windows 2000 is a major release of the Windows NT operating system developed by Microsoft, targeting the server and business markets. It is the direct successor to Windows NT 4.0, and was Software release life cycle#Release to manufacturing (RT ...
, a WAVE_FORMAT_EXTENSIBLE header was defined which specifies multiple audio channel data along with speaker positions, eliminates ambiguity regarding sample types and container sizes in the standard WAV format and supports defining custom extensions to the format.


File specifications


RIFF

A RIFF file is a tagged file format. It has a specific container format (a ''chunk'') with a header that includes a four-character tag (
FourCC A FourCC ("four-character code") is a sequence of four bytes (typically ASCII) used to uniquely identify data formats. It originated from the OSType or ResType metadata system used in classic Mac OS and was adopted for the Amiga/Electronic Arts ...
) and the size (number of bytes) of the chunk. The tag specifies how the data within the chunk should be interpreted, and there are several standard FourCC tags. Tags consisting of all capital letters are reserved tags. The outermost chunk of a RIFF file has a RIFF tag; the first four bytes of chunk data are an additional FourCC tag that specify the form type and are followed by a sequence of subchunks. In the case of a WAV file, the additional tag is WAVE. The remainder of the RIFF data is a sequence of chunks describing the audio information. The advantage of a tagged file format is that the format can be extended later while maintaining
backward compatibility In telecommunications and computing, backward compatibility (or backwards compatibility) is a property of an operating system, software, real-world product, or technology that allows for interoperability with an older legacy system, or with Input ...
. The rule for a RIFF (or WAV) reader is that it should ignore any tagged chunk that it does not recognize. The reader will not be able to use the new information, but the reader should not be confused. The specification for RIFF files includes the definition of an INFO chunk. The chunk may include information such as the title of the work, the author, the creation date, and copyright information. Although the INFO chunk was defined for RIFF in version 1.0, the chunk was not referenced in the formal specification of a WAV file. Many readers had trouble processing this. Consequently, the safest thing to do from an interchange standpoint was to omit the INFO chunk and other extensions and send a lowest-common-denominator file. There are other INFO chunk placement problems. RIFF files were expected to be used in international environments, so there is CSET chunk to specify the country code, language, dialect, and
code page In computing, a code page is a character encoding and as such it is a specific association of a set of printable character (computing), characters and control characters with unique numbers. Typically each number represents the binary value in a s ...
for the strings in a RIFF file. For example, specifying an appropriate CSET chunk should allow the strings in an INFO chunk (and other chunks throughout the RIFF file) to be interpreted as Cyrillic or Japanese characters. RIFF also defines a JUNK chunk whose contents are uninteresting. The chunk allows a chunk to be deleted by just changing its FourCC. The chunk could also be used to reserve some space for future edits so the file could be modified without being resized. A later definition of RIFF introduced a similar PAD chunk.


RIFF WAVE

The top-level definition of a WAV file is:
 → RIFF('WAVE'
                               // Format of the file
                    fact-ck>        // Fact chunk
                    cue-ck>         // Cue points
                    playlist-ck>    // Playlist
                    assoc-data-list>// Associated data list
                    )       // Wave data
The top-level RIFF form uses a WAVE tag. It is followed by a mandatory chunk that describes the format of the sample data that follows. This chunk includes information such as the sample encoding, number of bits per channel, the number of channels, and the sample rate. The WAV specification includes some optional features. The optional chunk reports the number of samples for some compressed coding schemes. The chunk identifies some significant sample numbers in the wave file. The chunk allows the samples to be played out of order or repeated rather than just from beginning to end. The associated data list () allows labels and notes to be attached to cue points; text annotation may be given for a group of samples (e.g., caption information). Finally, the mandatory chunk contains the actual samples in the format previously specified. Note that the WAV file definition does not show where an INFO chunk should be placed. It is also silent about the placement of a CSET chunk (which specifies the character set used). The RIFF specification attempts to be a formal specification, but its formalism lacks the precision seen in other tagged formats. For example, the RIFF specification does not clearly distinguish between a set of subchunks and an ordered sequence of subchunks. The RIFF form chunk suggests it should be a sequence container. Sequencing information is specified in the RIFF form of a WAV file consistent with the formalism: "However, must always occur before , and both of these chunks are mandatory in a WAVE file." The specification suggests a LIST chunk is also a sequence: "A LIST chunk contains a list, or ordered sequence, of subchunks." However, the specification does not give a formal specification of the INFO chunk; an example INFO LIST chunk ignores the chunk sequence implied in the INFO description. The LIST chunk definition for does use the LIST chunk as a sequence container with good formal semantics. The WAV specification supports, and most WAV files use, a single contiguous array of audio samples. The specification also supports discrete blocks of samples and silence that are played in order. The specification for the sample data contains apparent errors:
The  contains the waveform data. It is defined as follows:
      → data(  )
    → LIST( 'wavl' ... ) // Silence
   → slnt(  ) // Count of silent samples
Apparently (undefined) and (defined but not referenced) should be identical. Even with this resolved, the productions then allow a to contain a
recursive Recursion occurs when the definition of a concept or process depends on a simpler or previous version of itself. Recursion is used in a variety of disciplines ranging from linguistics to logic. The most common application of recursion is in m ...
(which implies data interpretation problems). To avoid the recursion, the specification can be interpreted as:
    → data(  ... )
    → LIST( 'wavl' ... ) // Silence
   → slnt(  ) // Count of silent samples
WAV files can contain embedded IFF ''lists'', which can contain several ''sub-chunks''.


WAV file header

This is an example of a WAV file header (44 bytes). Data is stored in little-endian byte order. aster RIFF chunk FileTypeBlocID (4 bytes) : Identifier « RIFF » (0x52, 0x49, 0x46, 0x46) FileSize (4 bytes) : Overall file size minus 8 bytes FileFormatID (4 bytes) : Format = « WAVE » (0x57, 0x41, 0x56, 0x45) hunk describing the data format FormatBlocID (4 bytes) : Identifier « fmt␣ » (0x66, 0x6D, 0x74, 0x20) BlocSize (4 bytes) : Chunk size minus 8 bytes, which is 16 bytes here (0x10) AudioFormat (2 bytes) : Audio format (1: PCM integer, 3: IEEE 754 float) NbrChannels (2 bytes) : Number of channels Frequency (4 bytes) : Sample rate (in hertz) BytePerSec (4 bytes) : Number of bytes to read per second (Frequency * BytePerBloc). BytePerBloc (2 bytes) : Number of bytes per block (NbrChannels * BitsPerSample / 8). BitsPerSample (2 bytes) : Number of bits per sample hunk containing the sampled data DataBlocID (4 bytes) : Identifier « data » (0x64, 0x61, 0x74, 0x61) DataSize (4 bytes) : SampledData size SampledData


Metadata

As a derivative of RIFF, WAV files can be tagged with
metadata Metadata (or metainformation) is "data that provides information about other data", but not the content of the data itself, such as the text of a message or the image itself. There are many distinct types of metadata, including: * Descriptive ...
in the INFO chunk. In addition, WAV files can embed any kind of metadata, including but not limited to
Extensible Metadata Platform The Extensible Metadata Platform (XMP) is an ISO standard, originally created by Adobe Systems Inc., for the creation, processing and interchange of standardized and custom metadata for digital documents and data sets. XMP standardizes a data ...
(XMP) data or ID3 tags in extra chunks. The RIFF specification requires that applications ignore chunks they do not recognize and applications may not necessarily use this extra information.


Popularity

Uncompressed WAV files are large, so
file sharing File sharing is the practice of distributing or providing access to digital media, such as computer programs, multimedia (audio, images and video), documents or electronic books. Common methods of storage, transmission and dispersion include ...
of WAV files over the
Internet The Internet (or internet) is the Global network, global system of interconnected computer networks that uses the Internet protocol suite (TCP/IP) to communicate between networks and devices. It is a internetworking, network of networks ...
is uncommon except among video, music and audio professionals. The high resolution of the format makes it suitable for retaining first generation archived files of high quality, for use on a system where disk space and network bandwidth are not constraints.


Use by broadcasters

In spite of their large size, uncompressed WAV files are used by most radio broadcasters, especially those that have adopted a tapeless system. *
BBC Radio BBC Radio is an operational business division and service of the British Broadcasting Corporation (BBC) which has operated in the United Kingdom under the terms of a royal charter since 1927. The service provides national radio stations cove ...
in the UK requires LPCM 48 kHz 16-bit WAV audio as standard for all content made for broadcast on its stations. * The UK Commercial radio company
Global Radio Global Media & Entertainment Limited, trading as Global, is a British media company formed in 2007. It is the owner of the largest commercial radio company in Europe having expanded through a number of historical acquisitions, including Chrysal ...
uses 44.1 kHz 16-bit two-channel WAV files throughout their broadcast chain. * The ABC "D-Cart" system, which was developed by the Australian broadcaster, uses 48 kHz 16-bit two-channel WAV files. * The Digital Radio Mondiale consortium uses WAV files as an informal standard for transmitter simulation and receiver testing.


Limitations

The WAV format is limited to files that are less than , because of its use of a
32-bit In computer architecture, 32-bit computing refers to computer systems with a processor, memory, and other major system components that operate on data in a maximum of 32- bit units. Compared to smaller bit widths, 32-bit computers can perform la ...
unsigned integer In computer science, an integer is a datum of integral data type, a data type that represents some range of mathematical integers. Integral data types may be of different sizes and may or may not be allowed to contain negative values. Integers are ...
to record the file size in the header. Although this is equivalent to about 6.8 hours of CD-quality audio at 44.1 kHz, 16-bit
stereo Stereophonic sound, commonly shortened to stereo, is a method of sound reproduction that recreates a multi-directional, 3-dimensional audible perspective. This is usually achieved by using two independent audio channels through a configurat ...
, it is sometimes necessary to exceed this limit, especially when greater
sampling rate In signal processing, sampling is the reduction of a continuous-time signal to a discrete-time signal. A common example is the conversion of a sound wave to a sequence of "samples". A sample is a value of the signal at a point in time and/or s ...
s, bit resolutions or channel count are required. The W64 format was therefore created for use in Sound Forge. Its
64-bit In computer architecture, 64-bit integers, memory addresses, or other data units are those that are 64 bits wide. Also, 64-bit central processing units (CPU) and arithmetic logic units (ALU) are those that are based on processor registers, a ...
file size field in the header allows for much longer recording times. The
RF64 RF64 is a BWF-compatible multichannel audio file format enabling file sizes to exceed {{val, 4, ul=GiB. It has been specified by the European Broadcasting Union. The file format is designed to meet the requirements for multichannel sound in broa ...
format specified by the
European Broadcasting Union The European Broadcasting Union (EBU; , UER) is an alliance of Public broadcasting, public service media organisations in countries within the European Broadcasting Area (EBA) or who are member states of the Council of Europe, members of the ...
has also been created to solve this problem.


Non-audio data

Since the sampling rate of a WAV file can vary from to , and the number of channels can be as high as 65535, WAV files have also been used for non-audio data. LTspice, for instance, can store multiple circuit trace
waveform In electronics, acoustics, and related fields, the waveform of a signal is the shape of its Graph of a function, graph as a function of time, independent of its time and Magnitude (mathematics), magnitude Scale (ratio), scales and of any dis ...
s in separate channels, at any appropriate sampling rate, with the full-scale range representing ±1  V or A rather than a sound pressure.


Audio compact discs

Audio
compact disc The compact disc (CD) is a Digital media, digital optical disc data storage format co-developed by Philips and Sony to store and play digital audio recordings. It employs the Compact Disc Digital Audio (CD-DA) standard and was capable of hol ...
s (CDs) do not use the WAV file format, using instead Red Book audio. The commonality is that audio CDs are encoded as uncompressed 16-bit
44.1 kHz In digital audio, 44,100 Hz (alternately represented as 44.1 kHz) is a common sampling frequency. Analog audio is often recorded by sampling it 44,100 times per second, and then these samples are used to Signal reconstruction, reconstru ...
stereo LPCM, which is one of the formats supported by WAV.


Comparison of coding schemes

Audio in WAV files can be encoded in a variety of audio coding formats, such as
GSM The Global System for Mobile Communications (GSM) is a family of standards to describe the protocols for second-generation (2G) digital cellular networks, as used by mobile devices such as mobile phones and Mobile broadband modem, mobile broadba ...
or
MP3 MP3 (formally MPEG-1 Audio Layer III or MPEG-2 Audio Layer III) is a coding format for digital audio developed largely by the Fraunhofer Society in Germany under the lead of Karlheinz Brandenburg. It was designed to greatly reduce the amount ...
, to reduce the file size. All WAV files, even those that use
MP3 MP3 (formally MPEG-1 Audio Layer III or MPEG-2 Audio Layer III) is a coding format for digital audio developed largely by the Fraunhofer Society in Germany under the lead of Karlheinz Brandenburg. It was designed to greatly reduce the amount ...
compression, use the .wav extension. This is a reference to compare the
monophonic Monaural sound or monophonic sound (often shortened to mono) is sound intended to be heard as if it were emanating from one position. This contrasts with stereophonic sound or ''stereo'', which uses two separate audio channels to reproduce sou ...
(not
stereophonic Stereophonic sound, commonly shortened to stereo, is a method of sound reproduction that recreates a multi-directional, 3-dimensional audible perspective. This is usually achieved by using two independent audio channels through a configurat ...
) audio quality and compression bitrates of audio coding formats available for WAV files including LPCM,
ADPCM Adaptive differential pulse-code modulation (ADPCM) is a variant of differential pulse-code modulation (DPCM) that varies the size of the quantization step, to allow further reduction of the required data bandwidth for a given signal-to-noise rati ...
, Microsoft GSM 06.10, CELP, SBC,
Truespeech Truespeech is a proprietary audio codec produced by the DSP Group. It is designed for encoding voice data at low bitrates (8.5 kbps for 8 kHz samples), and to be embedded into DSP chips. Truespeech had been integrated into Windows Media P ...
and
MPEG The Moving Picture Experts Group (MPEG) is an alliance of working groups established jointly by International Organization for Standardization, ISO and International Electrotechnical Commission, IEC that sets standards for media coding, includ ...
Layer-3. These are the default ACM codecs that come with Windows.


See also

*
Windows Media Audio Windows Media Audio (WMA) is a series of audio codecs and their corresponding audio coding formats developed by Microsoft. It is a proprietary technology that forms part of the Windows Media framework. Audio encoded in WMA is stored in a digi ...


References


External links


WAVE file format specifications
- from McGill University, (Last update: 2011-01-03)
Extensible Wave-Format Descriptors
from Microsoft (Updated October 26, 2017)

- University of Bath

(1999)
WAV & BWF Metadata Guide

Exif tags
see, for example, page 128 {{DEFAULTSORT:Wav Audio file formats Digital container formats Computer-related introductions in 1991 Microsoft Windows multimedia technology