Projection filters are a set of
algorithms
In mathematics and computer science, an algorithm () is a finite sequence of mathematically rigorous instructions, typically used to solve a class of specific problems or to perform a computation. Algorithms are used as specifications for per ...
based on
stochastic analysis
Stochastic calculus is a branch of mathematics that operates on stochastic processes. It allows a consistent theory of integration to be defined for integrals of stochastic processes with respect to stochastic processes. This field was created an ...
and
information geometry
Information geometry is an interdisciplinary field that applies the techniques of differential geometry to study probability theory and statistics. It studies statistical manifolds, which are Riemannian manifolds whose points correspond to proba ...
, or the differential geometric approach to statistics, used to find approximate solutions for
filtering problems for nonlinear state-space systems.
The filtering problem consists of estimating the unobserved signal of a random dynamical system from partial noisy observations of the signal. The objective is computing the probability distribution of the signal conditional on the history of the noise-perturbed observations. This distribution allows for calculations of all statistics of the signal given the history of observations. If this distribution has a density, the density satisfies
specific stochastic partial differential equations (SPDEs) called Kushner-Stratonovich equation, or Zakai equation.
It is known that the nonlinear filter density evolves in an infinite dimensional function space.
One can choose a finite dimensional family of probability densities, for example
Gaussian densities, Gaussian
mixtures, or
exponential families, on which the infinite-dimensional filter density can be approximated. The basic idea of the projection filter is to use a geometric structure in the chosen spaces of densities to project the infinite dimensional SPDE of the optimal filter onto the chosen finite dimensional family, obtaining a finite dimensional
stochastic differential equation
A stochastic differential equation (SDE) is a differential equation in which one or more of the terms is a stochastic process, resulting in a solution which is also a stochastic process. SDEs have many applications throughout pure mathematics an ...
(SDE) for the parameter of the density in the finite dimensional family that approximates the full filter evolution.
To do this, the chosen finite dimensional family is equipped with a
manifold
In mathematics, a manifold is a topological space that locally resembles Euclidean space near each point. More precisely, an n-dimensional manifold, or ''n-manifold'' for short, is a topological space with the property that each point has a N ...
structure as in
information geometry
Information geometry is an interdisciplinary field that applies the techniques of differential geometry to study probability theory and statistics. It studies statistical manifolds, which are Riemannian manifolds whose points correspond to proba ...
.
The projection filter was tested against the optimal filter for the cubic sensor problem. The projection filter could track effectively bimodal densities of the optimal filter that would have been difficult to approximate with standard algorithms like the
extended Kalman filter
In estimation theory, the extended Kalman filter (EKF) is the nonlinear version of the Kalman filter which linearizes about an estimate of the current mean and covariance. In the case of well defined transition models, the EKF has been considered t ...
.
Projection filters are ideal for in-line estimation, as they are quick to implement and run efficiently in time, providing a finite dimensional SDE for the parameter that can be implemented efficiently.
Projection filters are also flexible, as they allow fine tuning the precision of the approximation by choosing richer approximating families, and some exponential families make the correction step in the projection filtering algorithm exact.
Some formulations coincide with heuristic based assumed density filters
or with
Galerkin methods.
Projection filters can also approximate the full infinite-dimensional filter in an optimal way, beyond the optimal approximation of the SPDE coefficients alone, according to precise criteria such as mean square minimization.
Projection filters have been studied by the
Swedish Defense Research Agency and have also been successfully applied to a variety of fields including
navigation
Navigation is a field of study that focuses on the process of monitoring and controlling the motion, movement of a craft or vehicle from one place to another.Bowditch, 2003:799. The field of navigation includes four general categories: land navig ...
,
ocean dynamics
Ocean dynamics define and describe the flow of water within the oceans. Ocean temperature and motion fields can be separated into three distinct layers: mixed (surface) layer, upper ocean (above the thermocline), and deep ocean.
Ocean dynamics ...
,
quantum optics
Quantum optics is a branch of atomic, molecular, and optical physics and quantum chemistry that studies the behavior of photons (individual quanta of light). It includes the study of the particle-like properties of photons and their interaction ...
and
quantum system
Quantum mechanics is the fundamental physical Scientific theory, theory that describes the behavior of matter and of light; its unusual characteristics typically occur at and below the scale of atoms. Reprinted, Addison-Wesley, 1989, It is ...
s, estimation of
fiber
Fiber (spelled fibre in British English; from ) is a natural or artificial substance that is significantly longer than it is wide. Fibers are often used in the manufacture of other materials. The strongest engineering materials often inco ...
diameters, estimation of chaotic
time series
In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. ...
,
change point detection and other areas.
[Armstrong, J., Brigo, D., and Hanzon, B. (2023). Optimal projection filters with information geometry. Info. Geo. (2023). https://doi.org/10.1007/s41884-023-00108-x]
History and development
The term "projection filter" was first coined in 1987 by Bernard Hanzon,
[Bernard Hanzon (1987). A differential-geometric approach to approximate nonlinear filtering. In: C.T.J. Dodson, Editor, Geometrization of Statistical Theory, pages 219–223. ULMD Publications, University of Lancaster] and the related theory and numerical examples were fully developed, expanded and made rigorous during the
Ph.D. work of
Damiano Brigo, in collaboration with Bernard Hanzon and Francois LeGland.
[Brigo, D. (1996). Filtering by projection on the manifold of exponential densities. PhD dissertation, Free University of Amsterdam]
These works dealt with the projection filters in
Hellinger distance
In probability and statistics, the Hellinger distance (closely related to, although different from, the Bhattacharyya distance) is used to quantify the similarity between two probability distributions. It is a type of ''f''-divergence. The Hell ...
and
Fisher information metric
In information geometry, the Fisher information metric is a particular Riemannian metric which can be defined on a smooth statistical manifold, ''i.e.'', a smooth manifold whose points are probability distributions. It can be used to calculate the ...
, that were used to project the optimal filter infinite-dimensional SPDE on a chosen exponential family. The exponential family can be chosen so as to make the prediction step of the filtering algorithm exact.
A different type of projection filters, based on an alternative projection metric, the direct
metric, was introduced in Armstrong and Brigo (2016).
With this metric, the projection filters on families of
mixture distribution
In probability and statistics, a mixture distribution is the probability distribution of a random variable that is derived from a collection of other random variables as follows: first, a random variable is selected by chance from the collection a ...
s coincide with filters based on
Galerkin methods. Later on, Armstrong, Brigo and Rossi Ferrucci (2021)
derive optimal projection filters that satisfy specific optimality criteria in approximating the infinite dimensional optimal filter. Indeed, the Stratonovich-based projection filters optimized the approximations of the SPDE separate coefficients on the chosen manifold but not the SPDE solution as a whole. This has been dealt with by introducing the optimal projection filters. The innovation here is to work directly with Ito calculus, instead of resorting to the Stratonovich calculus version of the filter equation. This is based on research on the geometry of Ito Stochastic differential equations on manifolds based on the
jet bundle, the so-called 2-jet interpretation of Ito stochastic differential equations on manifolds.
[John Armstrong and Damiano Brigo (2018). Intrinsic stochastic differential equations as jets.
Proceedings of the Royal Society A - Mathematical physical and engineering sciences, 474(2210), 28 pages. doi: 10.1098/rspa.2017.0559.]
Projection filters derivation
Here the derivation of the different projection filters is sketched.
Stratonovich-based projection filters
This is a derivation of both the initial filter in Hellinger/Fisher metric sketched by Hanzon
and fully developed by Brigo, Hanzon and LeGland,
and the later projection filter in direct L2 metric by Armstrong and Brigo (2016).
It is assumed that the unobserved random signal
is modelled by the Ito
stochastic differential equation
A stochastic differential equation (SDE) is a differential equation in which one or more of the terms is a stochastic process, resulting in a solution which is also a stochastic process. SDEs have many applications throughout pure mathematics an ...
:
:
where ''f'' and
are
valued and
is a
Brownian motion
Brownian motion is the random motion of particles suspended in a medium (a liquid or a gas). The traditional mathematical formulation of Brownian motion is that of the Wiener process, which is often called Brownian motion, even in mathematical ...
. Validity of all regularity conditions necessary for the results to hold will be assumed, with details given in the references. The associated noisy observation process
is modelled by
:
where
is
valued and
is a Brownian motion independent of
. As hinted above, the full filter is the conditional
distribution of
given a prior for
and the history of
up to time
. If this distribution has a density described informally as
:
where
is the sigma-field generated by the history of noisy observations
up to time
, under suitable technical conditions the density
satisfies the Kushner—Stratonovich SPDE:
: