Bilinear time–frequency distributions, or quadratic time–frequency distributions, arise in a sub-field of signal analysis and

signal processing Signal processing is an electrical engineering subfield that focuses on analyzing, modifying and synthesizing ''signals'', such as audio signal processing, sound, image processing, images, Scalar potential, potential fields, Seismic tomograph ...

called time–frequency signal processing, and, in the

statistical analysis Statistical inference is the process of using data analysis to infer properties of an underlying probability distribution.Upton, G., Cook, I. (2008) ''Oxford Dictionary of Statistics'', OUP. . Inferential statistical analysis infers properties of ...

time series In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. ...

data. Such methods are used where one needs to deal with a situation where the frequency composition of a signal may be changing over time; this sub-field used to be called time–frequency signal analysis, and is now more often called time–frequency signal processing due to the progress in using these methods to a wide range of signal-processing problems.

Background

Methods for analysing time series, in both signal analysis and

time series analysis In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. ...

, have been developed as essentially separate methodologies applicable to, and based in, either the

time Time is the continuous progression of existence that occurs in an apparently irreversible process, irreversible succession from the past, through the present, and into the future. It is a component quantity of various measurements used to sequ ...

or the

frequency domain In mathematics, physics, electronics, control systems engineering, and statistics, the frequency domain refers to the analysis of mathematical functions or signals with respect to frequency (and possibly phase), rather than time, as in time ser ...

. A mixed approach is required in

time–frequency analysis In signal processing, time–frequency analysis comprises those techniques that study a signal in both the time and frequency domains simultaneously, using various time–frequency representations. Rather than viewing a 1-dimensional signal (a fun ...

techniques which are especially effective in analyzing non-stationary signals, whose frequency distribution and magnitude vary with time. Examples of these are acoustic signals. Classes of "quadratic time-frequency distributions" (or bilinear time–frequency distributions") are used for time–frequency signal analysis. This class is similar in formulation to Cohen's class distribution function that was used in 1966 in the context of quantum mechanics. This distribution function is mathematically similar to a generalized

time–frequency representation A time–frequency representation (TFR) is a view of a signal (taken to be a function of time) represented over both time and frequency. Time–frequency analysis means analysis into the time–frequency domain provided by a TFR. This is achieved ...

which utilizes bilinear transformations. Compared with other

techniques, such as

short-time Fourier transform The short-time Fourier transform (STFT) is a Fourier-related transform used to determine the sinusoidal frequency and phase content of local sections of a signal as it changes over time. In practice, the procedure for computing STFTs is to divide ...

(STFT), the bilinear-transformation (or quadratic time–frequency distributions) may not have higher clarity for most practical signals, but it provides an alternative framework to investigate new definitions and new methods. While it does suffer from an inherent cross-term contamination when analyzing multi-component signals, by using a carefully chosen

window function In signal processing and statistics, a window function (also known as an apodization function or tapering function) is a mathematical function that is zero-valued outside of some chosen interval. Typically, window functions are symmetric around ...

(s), the interference can be significantly mitigated, at the expense of resolution. All these bilinear distributions are inter-convertible to each other, cf. transformation between distributions in time–frequency analysis.

Wigner–Ville distribution

The Wigner–Ville distribution is a quadratic form that measures a local time-frequency energy given by: :

P_V f(u,\xi )=\int_^\infty f \left (u+\tfrac \right) f^* \left(u-\tfrac \right) e^ \, d\tau

The Wigner–Ville distribution remains real as it is the fourier transform of ''f''(''u'' + ''τ''/2)·''f''*(''u'' − ''τ''/2), which has Hermitian symmetry in ''τ''. It can also be written as a frequency integration by applying the Parseval formula: :

P_V f(u,\xi )=\frac\int_^\infty \hat \left(\xi +\tfrac \right) \hat^* \left(\xi -\tfrac \right) e^ \, d\gamma

:Proposition 1. for any ''f'' in L²(R) ::

\int_^\infty P_V f(u,\xi) \, du= ,  \hat(\xi) , ^2

\int_^\infty P_V f(u,\xi) \, d\xi =2\pi , f(u), ^2

:Moyal Theorem. For ''f'' and ''g'' in L²(R), ::

2\pi \left,  \int_^\infty f(t)g^*(t)\,dt \^2=\iint P_Vg(u,\xi )\,du\,d\xi

:Proposition 2 (time-frequency support). If ''f'' has a compact support, then for all ''ξ'' the support of

P_V f(u,\xi )

along ''u'' is equal to the support of ''f''. Similarly, if

\hat

has a compact support, then for all ''u'' the support of

P_Vf(u,\xi )

along ''ξ'' is equal to the support of

\hat

. :Proposition 3 (instantaneous frequency). If

f_a(t)=a(t)e^

then ::

\phi'(u)= \frac

Interference

Let

f= f_1 + f_2

be a composite signal. We can then write, :

P_Vf=P_Vf_1+P_Vf_2+P_V \left_1,f_2 \right P_V \left_2,f_1 \right /math>
where 
: P_V,g u,\xi )=\int_^ h\left (u+\tfrac \right)g^* \left (u-\tfrac \right) e^d\tau is the cross Wigner–Ville distribution of two signals. The interference term
: I_1,f_2 P_V

_1, f_2 Onekama ( ) is a village in Manistee County in the U.S. state of Michigan. The population was 399 at the 2020 census. The village is located on the northeast shore of Portage Lake and is surrounded by Onekama Township. The town's name is deriv ...

P_V _2, f_1/math> is a real function that creates non-zero values at unexpected locations (close to the origin) in the

(u,\xi )

plane. Interference terms present in a real signal can be avoided by computing the analytic part

f_a(t)

Positivity and smoothing kernel

The interference terms are oscillatory since the marginal integrals vanish and can be partially removed by smoothing

P_V f

with a kernel ''θ'' :

P_\theta f(u,\xi )=\int_^ \theta (u,u',\xi ,\xi') \, du' \, d\xi '

The time-frequency resolution of this distribution depends on the spread of kernel ''θ'' in the neighborhood of

(u,\xi )

. Since the interferences take negative values, one can guarantee that all interferences are removed by imposing that :

P_\theta f(u,\xi )\ge 0, \qquad \forall (u,\xi )\in

The spectrogram and scalogram are examples of positive time-frequency energy distributions. Let a linear transform

Tf(\gamma )=\left\langle f,\phi_ \right\rangle

be defined over a family of time-frequency atoms

\left\_

. For any

(u,\xi )

there exists a unique atom

\phi_

centered in time-frequency at

(u,\xi )

. The resulting time-frequency energy density is :

P_T f(u,\xi ) = \left ,  \left \langle f, \phi_ \right\rangle  \right , ^2

From the Moyal formula, :

P_T f(u,\xi )=\frac \int_^\infty \int_^\infty P_V f(u', \xi') P_V \phi_ (u',\xi') \, du' \, d\xi '

which is the time frequency averaging of a Wigner–Ville distribution. The smoothing kernel thus can be written as :

\theta (u,u',\xi ,\xi')=\fracP_V \phi_(u',\xi')

The loss of time-frequency resolution depends on the spread of the distribution

P_V \phi_ (u',\xi')

in the neighborhood of

(u,\xi )

Example 1

A spectrogram computed with windowed fourier atoms, :

\phi_(t)=g(t-u) e^

\theta (u, u', \xi, \xi')=\frac P_V \phi_ (u', \xi')=\frac P_V g(u'-u, \xi'-\xi )

For a spectrogram, the Wigner–Ville averaging is therefore a 2-dimensional convolution with

P_V g

. If g is a Gaussian window,

P_V g

is a 2-dimensional Gaussian. This proves that averaging

P_V f

with a sufficiently wide Gaussian defines positive energy density. The general class of time-frequency distributions obtained by convolving

P_V f

with an arbitrary kernel ''θ'' is called a Cohen's class, discussed below. Wigner Theorem. There is no positive quadratic energy distribution ''Pf'' that satisfies the following time and frequency marginal integrals: :

\int_^\infty Pf(u,\xi ) \, d\xi =2\pi , f(u), ^2

\int_^\infty Pf(u,\xi ) \, du= , \hat(\xi), ^2

Mathematical definition

The definition of Cohen's class of bilinear (or quadratic) time–frequency distributions is as follows: :

C_x(t, f)=\int_^\infty \int_^\infty A_x(\eta,\tau)\Phi(\eta,\tau)\exp (j2\pi(\eta t-\tau f))\, d\eta \, d\tau,

where

A_x(\eta,\tau)

is the ambiguity function (AF), which will be discussed later; and

\Phi (\eta,\tau)

is Cohen's kernel function, which is often a low-pass function, and normally serves to mask out the interference. In the original Wigner representation,

\Phi \equiv 1

. An equivalent definition relies on a convolution of the

Wigner distribution function The Wigner distribution function (WDF) is used in signal processing as a transform in time-frequency analysis. The WDF was first proposed in physics to account for quantum corrections to classical statistical mechanics in 1932 by Eugene Wigner, ...

(WD) instead of the AF : :

C_x(t, f)=\int_^\infty \int_^\infty W_x(\theta,\nu) \Pi(t - \theta, f - \nu)\, d\theta \, d\nu =_x \ast \Pi (t,f)

where the kernel function

\Pi (t,f)

is defined in the time-frequency domain instead of the ambiguity one. In the original Wigner representation,

\Pi = \delta_

. The relationship between the two kernels is the same as the one between the WD and the AF, namely two successive Fourier transforms (cf. diagram). :

\Phi = \mathcal_t \mathcal^_f \Pi

i.e. :

\Phi(\eta, \tau) = \int_^\infty \int_^\infty \Pi(t,f) \exp (-j2\pi(t \eta-f \tau))\, dt \, df,

or equivalently :

\Pi(t,f) = \int_^\infty \int_^\infty \Phi(\eta,\tau) \exp (j2\pi(\eta t-\tau f))\, d\eta \, d\tau.

Ambiguity function

The class of bilinear (or quadratic) time–frequency distributions can be most easily understood in terms of the ambiguity function, an explanation of which follows. Consider the well known

power spectral density In signal processing, the power spectrum S_(f) of a continuous time signal x(t) describes the distribution of power into frequency components f composing that signal. According to Fourier analysis, any physical signal can be decomposed into ...

P_x(f)

and the signal auto-correlation function

R_x (\tau)

in the case of a stationary process. The relationship between these functions is as follows: :

P_x(f)= \int_^\infty R_x(\tau)e^ \, d\tau,

R_x(\tau) = \int_^\infty x \left (t+ \tfrac \right )x^* \left (t- \tfrac \right ) \, dt.

For a non-stationary signal

x(t)

, these relations can be generalized using a time-dependent power spectral density or equivalently the famous

x(t)

as follows: :

W_x(t, f)= \int_^\infty R_x(t, \tau)e^\, d\tau,

R_x (t ,\tau) =  x \left (t+ \tfrac \right )x^* \left(t- \tfrac \right).

If the

Fourier transform In mathematics, the Fourier transform (FT) is an integral transform that takes a function as input then outputs another function that describes the extent to which various frequencies are present in the original function. The output of the tr ...

of the auto-correlation function is taken with respect to ''t'' instead of ''τ'', we get the ambiguity function as follows: :

A_x(\eta,\tau)=\int_^\infty x \left (t+ \tfrac \right )x^* \left(t- \tfrac \right)e^\, dt.

The relationship between the Wigner distribution function, the auto-correlation function and the ambiguity function can then be illustrated by the following figure. By comparing the definition of bilinear (or quadratic) time–frequency distributions with that of the Wigner distribution function, it is easily found that the latter is a special case of the former with

\Phi(\eta,\tau) = 1

. Alternatively, bilinear (or quadratic) time–frequency distributions can be regarded as a masked version of the Wigner distribution function if a kernel function

\Phi(\eta,\tau) \neq 1

is chosen. A properly chosen kernel function can significantly reduce the undesirable cross-term of the Wigner distribution function. What is the benefit of the additional kernel function? The following figure shows the distribution of the auto-term and the cross-term of a multi-component signal in both the ambiguity and the Wigner distribution function. For multi-component signals in general, the distribution of its auto-term and cross-term within its Wigner distribution function is generally not predictable, and hence the cross-term cannot be removed easily. However, as shown in the figure, for the ambiguity function, the auto-term of the multi-component signal will inherently tend to close the origin in the ''ητ''-plane, and the cross-term will tend to be away from the origin. With this property, the cross-term in can be filtered out effortlessly if a proper low-pass kernel function is applied in ''ητ''-domain. The following is an example that demonstrates how the cross-term is filtered out.

Kernel properties

The Fourier transform of

\theta (u,\xi )

is :

\hat(\tau ,\gamma )=\int_^ \int_^ \theta (u,\xi ) e^ \, du \, d\xi

The following proposition gives necessary and sufficient conditions to ensure that

P_\theta

satisfies marginal energy properties like those of the Wigner–Ville distribution. :Proposition: The marginal energy properties ::

\int_^\infty P_\theta f(u,\xi ) \, d\xi =2\pi ,  f(u) , ^2,

\int_^\infty P_\theta f(u,\xi ) \, du= , \hat(\xi ), ^2

:are satisfied for all

f\in L^2(\mathbf)

if and only if ::

\forall (\tau ,\gamma )\in \mathbf^2: \qquad \hat(\tau ,0)=\hat(0,\gamma )=1

Some time-frequency distributions

Wigner distribution function

Aforementioned, the Wigner distribution function is a member of the class of quadratic time-frequency distributions (QTFDs) with the kernel function

\Phi (\eta,\tau) = 1

. The definition of Wigner distribution is as follows: :

W_x(t, f)= \int_^\infty x \left (t+ \tfrac \right ) x^*\left(t- \tfrac \right)e ^\, d\tau.

Modified Wigner distribution functions

Affine invariance

We can design time-frequency energy distributions that satisfy the scaling property :

\fracf\left( \tfrac \right) \longleftrightarrow P_V f\left( \tfrac,s\xi \right)

as does the Wigner–Ville distribution. If :

g(t)=\fracf\left( \tfrac \right)

then :

P_\theta g(u,\xi)= P_\theta f\left( \tfrac,s\xi  \right).

This is equivalent to imposing that :

\forall s\in \mathbf^+: \qquad \theta \left( su,\tfrac \right)=\theta (u,\xi) ,

and hence :

\theta (u,\xi )=\theta (u\xi ,1)=\beta (u\xi )

The Rihaczek and Choi–Williams distributions are examples of affine invariant Cohen's class distributions.

Choi–Williams distribution function

The kernel of Choi–Williams distribution is defined as follows: :

\Phi (\eta,\tau ) = \exp (-\alpha(\eta \tau)^2),

where ''α'' is an adjustable parameter.

Rihaczek distribution function

The kernel of Rihaczek distribution is defined as follows: :

\Phi (\eta,\tau) = \exp \left(-i 2\pi \frac \right),

With this particular kernel a simple calculation proves that :

C_x (t,f) = x(t) \hat^*(f) e^

Cone-shape distribution function

The kernel of cone-shape distribution function is defined as follows: :

\Phi (\eta,\tau) = \frac\exp \left(-2\pi \alpha \tau^2  \right),

where ''α'' is an adjustable parameter. See Transformation between distributions in time-frequency analysis. More such QTFDs and a full list can be found in, e.g., Cohen's text cited.

Spectrum of non-stationary processes

A time-varying spectrum for non-stationary processes is defined from the expected Wigner–Ville distribution. Locally stationary processes appear in many physical systems where random fluctuations are produced by a mechanism that changes slowly in time. Such processes can be approximated locally by a stationary process. Let

X(t)

be a real valued zero-mean process with covariance :

R(t,s)=E (t)X(s) /math>

The covariance operator ''K'' is defined for any deterministic signal f\in L^2(\mathbf) by

: Kf(t)=\int_^\infty R( t,s)f(s) \, ds For locally stationary processes, the eigenvectors of ''K'' are well approximated by the Wigner–Ville spectrum.

Wigner–Ville spectrum

The properties of the covariance

R(t,s)

are studied as a function of

\tau =t-s

and

u=\frac

: :

R(t,s)=R\left( u+\tfrac,u-\tfrac \right)=C( u,\tau)

The process is ''wide-sense stationary'' if the covariance depends only on

\tau =t-s

: :

Kf(t)=\int_^\infty C(t-s) f(s)\,ds=C*f(t)

The eigenvectors are the complex exponentials

e^

and the corresponding eigenvalues are given by the power spectrum :

P_X (\omega)=\int_^\infty C(\tau) e^ \, d\tau

For non-stationary processes, Martin and Flandrin have introduced a ''time-varying spectrum'' :

e^ \, d\tau

To avoid convergence issues we suppose that ''X'' has compact support so that

C(u,\tau)

has compact support in

\tau

. From above we can write :

P_X ( u,\xi)=E P_V X ( u,\xi) /math>

which proves that the time varying spectrum is the expected value of the Wigner–Ville transform of the process ''X''. Here, the Wigner–Ville stochastic integral is interpreted as a mean-square integral:''a wavelet tour of signal processing'', Stephane Mallat : P_V ( u,\xi)=\int_^\infty \left\ e^ \, d\tau

References

* L. Cohen, Time-Frequency Analysis, Prentice-Hall, New York, 1995. * B. Boashash, editor, "Time-Frequency Signal Analysis and Processing – A Comprehensive Reference", Elsevier Science, Oxford, 2003. * L. Cohen, "Time-Frequency Distributions—A Review," Proceedings of the IEEE, vol. 77, no. 7, pp. 941–981, 1989. * S. Qian and D. Chen, Joint Time-Frequency Analysis: Methods and Applications, Chap. 5, Prentice Hall, N.J., 1996. * H. Choi and W. J. Williams, "Improved time-frequency representation of multicomponent signals using exponential kernels," IEEE. Trans. Acoustics, Speech, Signal Processing, vol. 37, no. 6, pp. 862–871, June 1989. * Y. Zhao, L. E. Atlas, and R. J. Marks, "The use of cone-shape kernels for generalized time-frequency representations of nonstationary signals," IEEE Trans. Acoustics, Speech, Signal Processing, vol. 38, no. 7, pp. 1084–1091, July 1990. * B. Boashash, "Heuristic Formulation of Time-Frequency Distributions", Chapter 2, pp. 29–58, in B. Boashash, editor, Time-Frequency Signal Analysis and Processing: A Comprehensive Reference, Elsevier Science, Oxford, 2003. * B. Boashash, "Theory of Quadratic TFDs", Chapter 3, pp. 59–82, in B. Boashash, editor, Time-Frequency Signal Analysis & Processing: A Comprehensive Reference, Elsevier, Oxford, 2003. {{DEFAULTSORT:Bilinear time-frequency distribution Signal processing Fourier analysis Digital signal processing Time–frequency analysis