In
numerical analysis
Numerical analysis is the study of algorithms that use numerical approximation (as opposed to symbolic manipulations) for the problems of mathematical analysis (as distinguished from discrete mathematics). It is the study of numerical methods th ...
and
functional analysis
Functional analysis is a branch of mathematical analysis, the core of which is formed by the study of vector spaces endowed with some kind of limit-related structure (e.g. inner product, norm, topology, etc.) and the linear functions defined ...
, a discrete wavelet transform (DWT) is any
wavelet transform
In mathematics, a wavelet series is a representation of a square-integrable ( real- or complex-valued) function by a certain orthonormal series generated by a wavelet. This article provides a formal, mathematical definition of an orthonormal ...
for which the
wavelet
A wavelet is a wave-like oscillation with an amplitude that begins at zero, increases or decreases, and then returns to zero one or more times. Wavelets are termed a "brief oscillation". A taxonomy of wavelets has been established, based on the num ...
s are discretely sampled. As with other wavelet transforms, a key advantage it has over
Fourier transform
A Fourier transform (FT) is a mathematical transform that decomposes functions into frequency components, which are represented by the output of the transform as a function of frequency. Most commonly functions of time or space are transformed, ...
s is temporal resolution: it captures both frequency ''and'' location information (location in time).
Examples
Haar wavelets
The first DWT was invented by Hungarian mathematician
Alfréd Haar
Alfréd Haar ( hu, Haar Alfréd; 11 October 1885, Budapest – 16 March 1933, Szeged) was a Hungarian mathematician. In 1904 he began to study at the University of Göttingen. His doctorate was supervised by David Hilbert. The Haar ...
. For an input represented by a list of
numbers, the
Haar wavelet
In mathematics, the Haar wavelet is a sequence of rescaled "square-shaped" functions which together form a wavelet family or basis. Wavelet analysis is similar to Fourier analysis in that it allows a target function over an interval to be repres ...
transform may be considered to pair up input values, storing the difference and passing the sum. This process is repeated recursively, pairing up the sums to prove the next scale, which leads to
differences and a final sum.
Daubechies wavelets
The most commonly used set of discrete wavelet transforms was formulated by the Belgian mathematician
Ingrid Daubechies
Baroness Ingrid Daubechies ( ; ; born 17 August 1954) is a Belgian physicist and mathematician. She is best known for her work with wavelets in image compression.
Daubechies is recognized for her study of the mathematical methods that enhance i ...
in 1988. This formulation is based on the use of
recurrence relation
In mathematics, a recurrence relation is an equation according to which the nth term of a sequence of numbers is equal to some combination of the previous terms. Often, only k previous terms of the sequence appear in the equation, for a paramete ...
s to generate progressively finer discrete samplings of an implicit mother wavelet function; each resolution is twice that of the previous scale. In her seminal paper, Daubechies derives a family of
wavelets
A wavelet is a wave-like oscillation with an amplitude that begins at zero, increases or decreases, and then returns to zero one or more times. Wavelets are termed a "brief oscillation". A taxonomy of wavelets has been established, based on the num ...
, the first of which is the Haar wavelet. Interest in this field has exploded since then, and many variations of Daubechies' original wavelets were developed.
The dual-tree complex wavelet transform (DWT)
The dual-tree complex wavelet transform (
WT) is a relatively recent enhancement to the discrete wavelet transform (DWT), with important additional properties: It is nearly shift invariant and directionally selective in two and higher dimensions. It achieves this with a redundancy factor of only
, substantially lower than the undecimated DWT. The multidimensional (M-D) dual-tree
WT is nonseparable but is based on a computationally efficient, separable filter bank (FB).
Others
Other forms of discrete wavelet transform include the Le Gall–Tabatabai (LGT) 5/3 wavelet developed by Didier Le Gall and Ali J. Tabatabai in 1988 (used in
JPEG 2000
JPEG 2000 (JP2) is an image compression standard and coding system. It was developed from 1997 to 2000 by a Joint Photographic Experts Group committee chaired by Touradj Ebrahimi (later the JPEG president), with the intention of superseding th ...
or
JPEG XS ), the
Binomial QMF developed by
Ali Naci Akansu in 1990, the
set partitioning in hierarchical trees Set partitioning in hierarchical trees (SPIHT) is an image
An image is a visual representation of something. It can be two-dimensional, three-dimensional, or somehow otherwise feed into the visual system to convey information. An image can b ...
(SPIHT) algorithm developed by Amir Said with William A. Pearlman in 1996,
the
non- or undecimated wavelet transform (where downsampling is omitted), and the
Newland transform (where an
orthonormal
In linear algebra, two vectors in an inner product space are orthonormal if they are orthogonal (or perpendicular along a line) unit vectors. A set of vectors form an orthonormal set if all vectors in the set are mutually orthogonal and all of un ...
basis of wavelets is formed from appropriately constructed
top-hat filters in
frequency space).
Wavelet packet transforms are also related to the discrete wavelet transform.
Complex wavelet transform is another form.
Properties
The Haar DWT illustrates the desirable properties of wavelets in general. First, it can be performed in
operations; second, it captures not only a notion of the frequency content of the input, by examining it at different scales, but also temporal content, i.e. the times at which these frequencies occur. Combined, these two properties make the
Fast wavelet transform (FWT) an alternative to the conventional
fast Fourier transform
A fast Fourier transform (FFT) is an algorithm that computes the discrete Fourier transform (DFT) of a sequence, or its inverse (IDFT). Fourier analysis converts a signal from its original domain (often time or space) to a representation in t ...
(FFT).
Time issues
Due to the rate-change operators in the filter bank, the discrete WT is not time-invariant but actually very sensitive to the alignment of the signal in time. To address the time-varying problem of wavelet transforms, Mallat and Zhong proposed a new algorithm for wavelet representation of a signal, which is invariant to time shifts. According to this algorithm, which is called a TI-DWT, only the scale parameter is sampled along the dyadic sequence 2^j (j∈Z) and the wavelet transform is calculated for each point in time.
Applications
The discrete wavelet transform has a huge number of applications in science, engineering, mathematics and computer science. Most notably, it is used for
signal coding, to represent a discrete signal in a more redundant form, often as a preconditioning for
data compression
In information theory, data compression, source coding, or bit-rate reduction is the process of encoding information using fewer bits than the original representation. Any particular compression is either lossy or lossless. Lossless compressi ...
. Practical applications can also be found in signal processing of accelerations for gait analysis, image processing, in digital communications and many others.
It is shown that discrete wavelet transform (discrete in scale and shift, and continuous in time) is successfully implemented as analog filter bank in biomedical signal processing for design of low-power pacemakers and also in ultra-wideband (UWB) wireless communications.
Example in image processing

Wavelets are often used to denoise two dimensional signals, such as images. The following example provides three steps to remove unwanted white Gaussian noise from the noisy image shown.
Matlab
MATLAB (an abbreviation of "MATrix LABoratory") is a proprietary multi-paradigm programming language and numeric computing environment developed by MathWorks. MATLAB allows matrix manipulations, plotting of functions and data, implementa ...
was used to import and filter the image.
The first step is to choose a wavelet type, and a level N of decomposition. In this case
biorthogonal 3.5 wavelets were chosen with a level N of 10. Biorthogonal wavelets are commonly used in image processing to detect and filter white Gaussian noise, due to their high contrast of neighboring pixel intensity values. Using these wavelets a
wavelet transform
In mathematics, a wavelet series is a representation of a square-integrable ( real- or complex-valued) function by a certain orthonormal series generated by a wavelet. This article provides a formal, mathematical definition of an orthonormal ...
ation is performed on the two dimensional image.
Following the decomposition of the image file, the next step is to determine threshold values for each level from 1 to N. Birgé-Massart strategy is a fairly common method for selecting these thresholds. Using this process individual thresholds are made for N = 10 levels. Applying these thresholds are the majority of the actual filtering of the signal.
The final step is to reconstruct the image from the modified levels. This is accomplished using an inverse wavelet transform. The resulting image, with white Gaussian noise removed is shown below the original image. When filtering any form of data it is important to quantify the
signal-to-noise-ratio of the result. In this case, the SNR of the noisy image in comparison to the original was 30.4958%, and the SNR of the denoised image is 32.5525%. The resulting improvement of the wavelet filtering is a SNR gain of 2.0567%.
It is important to note that choosing other wavelets, levels, and thresholding strategies can result in different types of filtering. In this example, white Gaussian noise was chosen to be removed. Although, with different thresholding, it could just as easily have been amplified.
To illustrate the differences and similarities between the discrete wavelet transform with the
discrete Fourier transform
In mathematics, the discrete Fourier transform (DFT) converts a finite sequence of equally-spaced Sampling (signal processing), samples of a function (mathematics), function into a same-length sequence of equally-spaced samples of the discre ...
, consider the DWT and DFT of the following sequence: (1,0,0,0), a
unit impulse.
The DFT has orthogonal basis (
DFT matrix):
:
while the DWT with Haar wavelets for length 4 data has orthogonal basis in the rows of:
:
(To simplify notation, whole numbers are used, so the bases are
orthogonal
In mathematics, orthogonality is the generalization of the geometric notion of '' perpendicularity''.
By extension, orthogonality is also used to refer to the separation of specific features of a system. The term also has specialized meanings in ...
but not
orthonormal
In linear algebra, two vectors in an inner product space are orthonormal if they are orthogonal (or perpendicular along a line) unit vectors. A set of vectors form an orthonormal set if all vectors in the set are mutually orthogonal and all of un ...
.)
Preliminary observations include:
* Sinusoidal waves differ only in their frequency. The first does not complete any cycles, the second completes one full cycle, the third completes two cycles, and the fourth completes three cycles (which is equivalent to completing one cycle in the opposite direction). Differences in phase can be represented by multiplying a given basis vector by a complex constant.
* Wavelets, by contrast, have both frequency and location. As before, the first completes zero cycles, and the second completes one cycle. However, the third and fourth both have the same frequency, twice that of the first. Rather than differing in frequency, they differ in ''location'' — the third is nonzero over the first two elements, and the fourth is nonzero over the second two elements.
:
The DWT demonstrates the localization: the (1,1,1,1) term gives the average signal value, the (1,1,–1,–1) places the signal in the left side of the domain, and the
(1,–1,0,0) places it at the left side of the left side, and truncating at any stage yields a downsampled version of the signal:
:

The DFT, by contrast, expresses the sequence by the interference of waves of various frequencies – thus truncating the series yields a
low-pass filter
A low-pass filter is a filter that passes signals with a frequency lower than a selected cutoff frequency and attenuates signals with frequencies higher than the cutoff frequency. The exact frequency response of the filter depends on the filt ...
ed version of the series:
:
Notably, the middle approximation (2-term) differs. From the frequency domain perspective, this is a better approximation, but from the time domain perspective it has drawbacks – it exhibits
undershoot – one of the values is negative, though the original series is non-negative everywhere – and
ringing, where the right side is non-zero, unlike in the wavelet transform. On the other hand, the Fourier approximation correctly shows a peak, and all points are within
of their correct value, though all points have error. The wavelet approximation, by contrast, places a peak on the left half, but has no peak at the first point, and while it is exactly correct for half the values (reflecting location), it has an error of
for the other values.
This illustrates the kinds of trade-offs between these transforms, and how in some respects the DWT provides preferable behavior, particularly for the modeling of transients.
Definition
One level of the transform
The DWT of a signal
is calculated by passing it through a series of filters. First the samples are passed through a
low-pass filter
A low-pass filter is a filter that passes signals with a frequency lower than a selected cutoff frequency and attenuates signals with frequencies higher than the cutoff frequency. The exact frequency response of the filter depends on the filt ...
with
impulse response
In signal processing and control theory, the impulse response, or impulse response function (IRF), of a dynamic system is its output when presented with a brief input signal, called an impulse (). More generally, an impulse response is the reac ...
resulting in a
convolution
In mathematics (in particular, functional analysis), convolution is a mathematical operation on two functions ( and ) that produces a third function (f*g) that expresses how the shape of one is modified by the other. The term ''convolution' ...
of the two:
:
The signal is also decomposed simultaneously using a
high-pass filter
A high-pass filter (HPF) is an electronic filter that passes signals with a frequency higher than a certain cutoff frequency and attenuates signals with frequencies lower than the cutoff frequency. The amount of attenuation for each frequenc ...
. The outputs give the detail coefficients (from the high-pass filter) and approximation coefficients (from the low-pass). It is important that the two filters are related to each other and they are known as a
quadrature mirror filter In digital signal processing, a quadrature mirror filter is a filter whose magnitude response is the mirror image around \pi/2 of that of another filter. Together these filters, first introduced by Croisier et al., are known as the quadrature mirror ...
.
However, since half the frequencies of the signal have now been removed, half the samples can be discarded according to Nyquist's rule. The filter output of the low-pass filter
in the diagram above is then
subsampled by 2 and further processed by passing it again through a new low-pass filter
and a high- pass filter
with half the cut-off frequency of the previous one, i.e.:
:
:
This decomposition has halved the time resolution since only half of each filter output characterises the signal. However, each output has half the frequency band of the input, so the frequency resolution has been doubled.
With the
subsampling operator
:
the above summation can be written more concisely.
:
:
However computing a complete convolution
with subsequent downsampling would waste computation time.
The
Lifting scheme is an optimization where these two computations are interleaved.
Cascading and filter banks
This decomposition is repeated to further increase the frequency resolution and the approximation coefficients decomposed with high- and low-pass filters and then down-sampled. This is represented as a binary tree with nodes representing a sub-space with a different time-frequency localisation. The tree is known as a
filter bank.
At each level in the above diagram the signal is decomposed into low and high frequencies. Due to the decomposition process the input signal must be a multiple of
where
is the number of levels.
For example a signal with 32 samples, frequency range 0 to
and 3 levels of decomposition, 4 output scales are produced:
Relationship to the mother wavelet
The filterbank implementation of wavelets can be interpreted as computing the wavelet coefficients of a
discrete set of child wavelets for a given mother wavelet
. In the case of the discrete wavelet transform, the mother wavelet is shifted and scaled by powers of two
where
is the scale parameter and
is the shift parameter, both which are integers.
Recall that the wavelet coefficient
of a signal
is the projection of
onto a wavelet, and let
be a signal of length
. In the case of a child wavelet in the discrete family above,
Now fix
at a particular scale, so that
is a function of
only. In light of the above equation,
can be viewed as a
convolution
In mathematics (in particular, functional analysis), convolution is a mathematical operation on two functions ( and ) that produces a third function (f*g) that expresses how the shape of one is modified by the other. The term ''convolution' ...
of
with a dilated, reflected, and normalized version of the mother wavelet,
, sampled at the points
. But this is precisely what the detail coefficients give at level
of the discrete wavelet transform. Therefore, for an appropriate choice of