In the theory of
stochastic processes
In probability theory and related fields, a stochastic () or random process is a mathematical object usually defined as a family of random variables. Stochastic processes are widely used as mathematical models of systems and phenomena that appe ...
, filtering describes the problem of determining the
state
State may refer to:
Arts, entertainment, and media Literature
* ''State Magazine'', a monthly magazine published by the U.S. Department of State
* ''The State'' (newspaper), a daily newspaper in Columbia, South Carolina, United States
* ''Our S ...
of a system from an incomplete and potentially
noisy set of observations. While originally motivated by problems in engineering, filtering found applications in many fields from signal processing to finance.
The problem of optimal non-linear filtering (even for the non-stationary case) was solved by
Ruslan L. Stratonovich (1959, 1960), see also
Harold J. Kushner
Harold Joseph Kushner is an American applied mathematician and a Professor Emeritus of Applied Mathematics at Brown University. He is known for his work on the theory of stochastic stability (based on the concept of supermartingales as Lyapunov f ...
's work and
Moshe Zakai
Moshe Zakai (December 22, 1926 – November 27, 2015) was a Distinguished Professor at the Technion, Israel in electrical engineering, member of the Israel Academy of Sciences and Humanities and Rothschild Prize winner.
Biography
Moshe Zakai w ...
's, who introduced a simplified dynamics for the unnormalized conditional law of the filter known as
Zakai equation Zakai is a surname. Notable people with the surname include:
*Johanan ben Zakai
:''See Yohanan for more rabbis by this name''.
Yohanan ben Zakkai ( he, יוֹחָנָן בֶּן זַכַּאי, ''Yōḥānān ben Zakkaʾy''; 1st century CE), s ...
. The solution, however, is infinite-dimensional in the general case. Certain approximations and special cases are well understood: for example, the linear filters are optimal for Gaussian random variables, and are known as the
Wiener filter
In signal processing, the Wiener filter is a filter used to produce an estimate of a desired or target random process by linear time-invariant ( LTI) filtering of an observed noisy process, assuming known stationary signal and noise spectra, and ...
and the
Kalman-Bucy filter. More generally, as the solution is infinite dimensional, it requires finite dimensional approximations to be implemented in a computer with finite memory. A finite dimensional approximated
nonlinear filter
In signal processing, a nonlinear (or non-linear) filter is a filter whose output is not a linear function of its input. That is, if the filter outputs signals ''R'' and ''S'' for two input signals ''r'' and ''s'' separately, but does not always o ...
may be more based on heuristics, such as the
extended Kalman filter
In estimation theory, the extended Kalman filter (EKF) is the nonlinear version of the Kalman filter which linearizes about an estimate of the current mean and covariance. In the case of well defined transition models, the EKF has been considered t ...
or the assumed density filters, or more methodologically oriented such as for example the Projection Filters, some sub-families of which are shown to coincide with the Assumed Density Filters.
Damiano Brigo Damiano Brigo (born Venice, Italy 1966) is an applied mathematician and Chair in Mathematical Finance at Imperial College London. He is known for research in filtering theory and mathematical finance.
Main results
Brigo started his work with the ...
, Bernard Hanzon and François Le Gland, Approximate Nonlinear Filtering by Projection on Exponential Manifolds of Densities, Bernoulli, Vol. 5, N. 3 (1999), pp. 495--534
In general, if the
separation principle In control theory, a separation principle, more formally known as a principle of separation of estimation and control, states that under some assumptions the problem of designing an optimal feedback controller for a stochastic system can be solved b ...
applies, then filtering also arises as part of the solution of an
optimal control
Optimal control theory is a branch of mathematical optimization that deals with finding a control for a dynamical system over a period of time such that an objective function is optimized. It has numerous applications in science, engineering and ...
problem. For example, the
Kalman filter
For statistics and control theory, Kalman filtering, also known as linear quadratic estimation (LQE), is an algorithm that uses a series of measurements observed over time, including statistical noise and other inaccuracies, and produces estimat ...
is the estimation part of the optimal control solution to the
linear-quadratic-Gaussian control problem.
The mathematical formalism
Consider a
probability space
In probability theory, a probability space or a probability triple (\Omega, \mathcal, P) is a mathematical construct that provides a formal model of a random process or "experiment". For example, one can define a probability space which models t ...
(Ω, Σ, P) and suppose that the (random) state ''Y''
''t'' in ''n''-
dimension
In physics and mathematics, the dimension of a Space (mathematics), mathematical space (or object) is informally defined as the minimum number of coordinates needed to specify any Point (geometry), point within it. Thus, a Line (geometry), lin ...
al
Euclidean space
Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, that is, in Euclid's Elements, Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics ther ...
R
''n'' of a system of interest at time ''t'' is a
random variable
A random variable (also called random quantity, aleatory variable, or stochastic variable) is a mathematical formalization of a quantity or object which depends on random events. It is a mapping or a function from possible outcomes (e.g., the po ...
''Y''
''t'' : Ω → R
''n'' given by the solution to an
Itō stochastic differential equation
A stochastic differential equation (SDE) is a differential equation in which one or more of the terms is a stochastic process, resulting in a solution which is also a stochastic process. SDEs are used to model various phenomena such as stock pr ...
of the form
:
where ''B'' denotes standard ''p''-dimensional
Brownian motion
Brownian motion, or pedesis (from grc, πήδησις "leaping"), is the random motion of particles suspended in a medium (a liquid or a gas).
This pattern of motion typically consists of random fluctuations in a particle's position insi ...
, ''b'' : [0, +∞) × R
''n'' → R
''n'' is the drift field, and ''σ'' : [0, +∞) × R
''n'' → R
''n''×''p'' is the diffusion field. It is assumed that observations ''H''
''t'' in R
''m'' (note that ''m'' and ''n'' may, in general, be unequal) are taken for each time ''t'' according to
:
Adopting the Itō interpretation of the stochastic differential and setting
:
this gives the following stochastic integral representation for the observations ''Z''
''t'':
:
where ''W'' denotes standard ''r''-dimensional
Brownian motion
Brownian motion, or pedesis (from grc, πήδησις "leaping"), is the random motion of particles suspended in a medium (a liquid or a gas).
This pattern of motion typically consists of random fluctuations in a particle's position insi ...
, independent of ''B'' and the initial condition ''Y''
0, and ''c'' : [0, +∞) × R
''n'' → R
''n'' and ''γ'' : [0, +∞) × R
''n'' → R
''n''×''r'' satisfy
:
for all ''t'' and ''x'' and some constant ''C''.
The filtering problem is the following: given observations ''Z''
''s'' for 0 ≤ ''s'' ≤ ''t'', what is the best estimate ''Ŷ''
''t'' of the true state ''Y''
''t'' of the system based on those observations?
By "based on those observations" it is meant that ''Ŷ''
''t'' is measurable function, measurable with respect to the sigma algebra, ''σ''-algebra ''G''
''t'' generated by the observations ''Z''
''s'', 0 ≤ ''s'' ≤ ''t''. Denote by ''K'' = ''K''(''Z'', ''t'') the collection of all R
''n''-valued random variables ''Y'' that are square-integrable and ''G''
''t''-measurable:
:
By "best estimate", it is meant that ''Ŷ''
''t'' minimizes the mean-square distance between ''Y''
''t'' and all candidates in ''K'':
:
Basic result: orthogonal projection
The space ''K''(''Z'', ''t'') of candidates is a
Hilbert space
In mathematics, Hilbert spaces (named after David Hilbert) allow generalizing the methods of linear algebra and calculus from (finite-dimensional) Euclidean vector spaces to spaces that may be infinite-dimensional. Hilbert spaces arise natural ...
, and the general theory of Hilbert spaces implies that the solution ''Ŷ''
''t'' of the minimization problem (M) is given by
:
where ''P''
''K''(''Z'',''t'') denotes the
orthogonal projection
In linear algebra and functional analysis, a projection is a linear transformation P from a vector space to itself (an endomorphism) such that P\circ P=P. That is, whenever P is applied twice to any vector, it gives the same result as if it wer ...
of ''L''
2(Ω, Σ, P; R
''n'') onto the
linear subspace
In mathematics, and more specifically in linear algebra, a linear subspace, also known as a vector subspaceThe term ''linear subspace'' is sometimes used for referring to flats and affine subspaces. In the case of vector spaces over the reals, li ...
''K''(''Z'', ''t'') = ''L''
2(Ω, ''G''
''t'', P; R
''n''). Furthermore, it is a general fact about
conditional expectation
In probability theory, the conditional expectation, conditional expected value, or conditional mean of a random variable is its expected value – the value it would take “on average” over an arbitrarily large number of occurrences – give ...
s that if ''F'' is any sub-''σ''-algebra of Σ then the orthogonal projection
:
is exactly the conditional expectation operator E
''F'' i.e.,
:
Hence,
:
This elementary result is the basis for the general Fujisaki-Kallianpur-Kunita equation of filtering theory.
See also
* The
smoothing problem
The smoothing problem (not to be confused with smoothing in statistics, image processing and other contexts) is the problem of estimating an unknown probability density function recursively over time using incremental incoming measurements. It i ...
, closely related to the filtering problem
*
Filter (signal processing)
In signal processing, a filter is a device or process that removes some unwanted components or features from a signal. Filtering is a class of signal processing, the defining feature of filters being the complete or partial suppression of some aspe ...
*
Kalman filter
For statistics and control theory, Kalman filtering, also known as linear quadratic estimation (LQE), is an algorithm that uses a series of measurements observed over time, including statistical noise and other inaccuracies, and produces estimat ...
, a well-known filtering algorithm related both to the filtering problem and the smoothing problem
*
Smoothing
In statistics and image processing, to smooth a data set is to create an approximating function that attempts to capture important patterns in the data, while leaving out noise or other fine-scale structures/rapid phenomena. In smoothing, the dat ...
References
Further reading
*
* {{cite book
, last = Øksendal
, first = Bernt K.
, authorlink = Bernt Øksendal
, title = Stochastic Differential Equations: An Introduction with Applications
, edition = Sixth
, publisher=Springer
, location = Berlin
, year = 2003
, isbn = 3-540-04758-1
(See Section 6.1)
Control theory
Signal estimation
Stochastic differential equations