In
statistical mechanics
In physics, statistical mechanics is a mathematical framework that applies statistical methods and probability theory to large assemblies of microscopic entities. Sometimes called statistical physics or statistical thermodynamics, its applicati ...
and
information theory
Information theory is the mathematical study of the quantification (science), quantification, Data storage, storage, and telecommunications, communication of information. The field was established and formalized by Claude Shannon in the 1940s, ...
, the Fokker–Planck equation is a
partial differential equation
In mathematics, a partial differential equation (PDE) is an equation which involves a multivariable function and one or more of its partial derivatives.
The function is often thought of as an "unknown" that solves the equation, similar to ho ...
that describes the
time evolution
Time evolution is the change of state brought about by the passage of time, applicable to systems with internal state (also called ''stateful systems''). In this formulation, ''time'' is not required to be a continuous parameter, but may be discr ...
of the
probability density function
In probability theory, a probability density function (PDF), density function, or density of an absolutely continuous random variable, is a Function (mathematics), function whose value at any given sample (or point) in the sample space (the s ...
of the velocity of a particle under the influence of
drag forces and random forces, as in
Brownian motion
Brownian motion is the random motion of particles suspended in a medium (a liquid or a gas). The traditional mathematical formulation of Brownian motion is that of the Wiener process, which is often called Brownian motion, even in mathematical ...
. The equation can be generalized to other observables as well. The Fokker–Planck equation has multiple applications in information theory, graph theory, data science, finance, economics etc.
It is named after
Adriaan Fokker
Adriaan Daniël Fokker (; 17 August 1887 – 24 September 1972) was a Dutch physicist. He worked in the fields of special relativity and statistical mechanics. He was the inventor of the Fokker organ, a 31 equal temperament, 31-tone equal-temp ...
and
Max Planck
Max Karl Ernst Ludwig Planck (; ; 23 April 1858 – 4 October 1947) was a German Theoretical physics, theoretical physicist whose discovery of energy quantum, quanta won him the Nobel Prize in Physics in 1918.
Planck made many substantial con ...
, who described it in 1914 and 1917. It is also known as the Kolmogorov forward equation, after
Andrey Kolmogorov
Andrey Nikolaevich Kolmogorov ( rus, Андре́й Никола́евич Колмого́ров, p=ɐnˈdrʲej nʲɪkɐˈlajɪvʲɪtɕ kəlmɐˈɡorəf, a=Ru-Andrey Nikolaevich Kolmogorov.ogg, 25 April 1903 – 20 October 1987) was a Soviet ...
, who independently discovered it in 1931. When applied to particle position distributions, it is better known as the Smoluchowski equation (after
Marian Smoluchowski), and in this context it is equivalent to the
convection–diffusion equation
The convection–diffusion equation is a parabolic partial differential equation that combines the diffusion equation, diffusion and convection (advection equation, advection) equations. It describes physical phenomena where particles, energy, or o ...
. When applied to particle position and momentum distributions, it is known as the
Klein–Kramers equation In physics and mathematics, the Oskar Klein, Klein–Hans Kramers, Kramers equation or sometimes referred as Kramers–Subrahmanyan_Chandrasekhar, Chandrasekhar equation is a partial differential equation that describes the probability density funct ...
. The case with zero
diffusion
Diffusion is the net movement of anything (for example, atoms, ions, molecules, energy) generally from a region of higher concentration to a region of lower concentration. Diffusion is driven by a gradient in Gibbs free energy or chemical p ...
is the
continuity equation
A continuity equation or transport equation is an equation that describes the transport of some quantity. It is particularly simple and powerful when applied to a conserved quantity, but it can be generalized to apply to any extensive quantity ...
. The Fokker–Planck equation is obtained from the
master equation
In physics, chemistry, and related fields, master equations are used to describe the time evolution of a system that can be modeled as being in a probabilistic combination of states at any given time, and the switching between states is determi ...
through
Kramers–Moyal expansion.
The first consistent microscopic derivation of the Fokker–Planck equation in the single scheme of
classical and
quantum mechanics
Quantum mechanics is the fundamental physical Scientific theory, theory that describes the behavior of matter and of light; its unusual characteristics typically occur at and below the scale of atoms. Reprinted, Addison-Wesley, 1989, It is ...
was performed by
Nikolay Bogoliubov and
Nikolay Krylov.
One dimension
In one spatial dimension ''x'', for an
Itô process driven by the standard
Wiener process
In mathematics, the Wiener process (or Brownian motion, due to its historical connection with Brownian motion, the physical process of the same name) is a real-valued continuous-time stochastic process discovered by Norbert Wiener. It is one o ...
and described by the
stochastic differential equation
A stochastic differential equation (SDE) is a differential equation in which one or more of the terms is a stochastic process, resulting in a solution which is also a stochastic process. SDEs have many applications throughout pure mathematics an ...
(SDE)
with
drift and
diffusion
Diffusion is the net movement of anything (for example, atoms, ions, molecules, energy) generally from a region of higher concentration to a region of lower concentration. Diffusion is driven by a gradient in Gibbs free energy or chemical p ...
coefficient
, the Fokker–Planck equation for the probability density
of the random variable
is
In the following, use
.
Define the
infinitesimal generator (the following can be found in Ref.
):
The ''transition probability''
, the probability of going from
to
, is introduced here; the expectation can be written as
Now we replace in the definition of
, multiply by
and integrate over
. The limit is taken on
Note now that
which is the Chapman–Kolmogorov theorem. Changing the dummy variable
to
, one gets
which is a time derivative. Finally we arrive to
From here, the Kolmogorov backward equation can be deduced. If we instead use the adjoint operator of
,
, defined such that
then we arrive to the Kolmogorov forward equation, or Fokker–Planck equation, which, simplifying the notation
, in its differential form reads
Remains the issue of defining explicitly
. This can be done taking the expectation from the integral form of the
Itô's lemma
In mathematics
Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. ...
:
The part that depends on
vanished because of the martingale property.
Then, for a particle subject to an Itô equation, using
it can be easily calculated, using integration by parts, that
which bring us to the Fokker–Planck equation:
While the Fokker–Planck equation is used with problems where the initial distribution is known, if the problem is to know the distribution at previous times, the
Feynman–Kac formula
The Feynman–Kac formula, named after Richard Feynman and Mark Kac, establishes a link between parabolic partial differential equations and stochastic processes. In 1947, when Kac and Feynman were both faculty members at Cornell University, Kac ...
can be used, which is a consequence of the Kolmogorov backward equation.
The stochastic process defined above in the Itô sense can be rewritten within the
Stratonovich convention as a Stratonovich SDE:
It includes an added noise-induced drift term due to diffusion gradient effects if the noise is state-dependent. This convention is more often used in physical applications. Indeed, it is well known that any solution to the Stratonovich SDE is a solution to the Itô SDE.
The zero-drift equation with constant diffusion can be considered as a model of classical
Brownian motion
Brownian motion is the random motion of particles suspended in a medium (a liquid or a gas). The traditional mathematical formulation of Brownian motion is that of the Wiener process, which is often called Brownian motion, even in mathematical ...
:
This model has discrete spectrum of solutions if the condition of fixed boundaries is added for
:
It has been shown
that in this case an analytical spectrum of solutions allows deriving a local uncertainty relation for the coordinate-velocity phase volume:
Here
is a minimal value of a corresponding diffusion spectrum
, while
and
represent the uncertainty of coordinate–velocity definition.
Higher dimensions
More generally, if
where
and
are -dimensional
vectors,
is an
matrix and
is an ''M''-dimensional standard
Wiener process
In mathematics, the Wiener process (or Brownian motion, due to its historical connection with Brownian motion, the physical process of the same name) is a real-valued continuous-time stochastic process discovered by Norbert Wiener. It is one o ...
, the probability density
for
satisfies the Fokker–Planck equationwith drift vector
and diffusion
tensor
In mathematics, a tensor is an algebraic object that describes a multilinear relationship between sets of algebraic objects associated with a vector space. Tensors may map between different objects such as vectors, scalars, and even other ...
, i.e.
If instead of an Itô SDE, a
Stratonovich SDE is considered,
the Fokker–Planck equation will read:
[
]
Generalization
In general, the Fokker–Planck equations are a special case to the general Kolmogorov forward equation
where the linear operator is the Hermitian adjoint
In mathematics, specifically in operator theory, each linear operator A on an inner product space defines a Hermitian adjoint (or adjoint) operator A^* on that space according to the rule
:\langle Ax,y \rangle = \langle x,A^*y \rangle,
where \l ...
to the infinitesimal generator for the Markov process
In probability theory and statistics, a Markov chain or Markov process is a stochastic process describing a sequence of possible events in which the probability of each event depends only on the state attained in the previous event. Informally, ...
.
Examples
Wiener process
A standard scalar Wiener process
In mathematics, the Wiener process (or Brownian motion, due to its historical connection with Brownian motion, the physical process of the same name) is a real-valued continuous-time stochastic process discovered by Norbert Wiener. It is one o ...
is generated by the stochastic differential equation
A stochastic differential equation (SDE) is a differential equation in which one or more of the terms is a stochastic process, resulting in a solution which is also a stochastic process. SDEs have many applications throughout pure mathematics an ...
Here the drift term is zero and the diffusion coefficient is 1/2. Thus the corresponding Fokker–Planck equation is
which is the simplest form of a diffusion equation
The diffusion equation is a parabolic partial differential equation. In physics, it describes the macroscopic behavior of many micro-particles in Brownian motion, resulting from the random movements and collisions of the particles (see Fick's l ...
. If the initial condition is , the solution is
Boltzmann distribution at the thermodynamic equilibrium
The overdamped Langevin equationgives . The Boltzmann distribution is an equilibrium distribution, and assuming grows sufficiently rapidly (that is, the potential well is deep enough to confine the particle), the Boltzmann distribution is the unique equilibrium.
Ornstein–Uhlenbeck process
The Ornstein–Uhlenbeck process
In mathematics, the Ornstein–Uhlenbeck process is a stochastic process with applications in financial mathematics and the physical sciences. Its original application in physics was as a model for the velocity of a massive Brownian particle ...
is a process defined as
with . Physically, this equation can be motivated as follows: a particle of mass with velocity moving in a medium, e.g., a fluid, will experience a friction force which resists motion whose magnitude can be approximated as being proportional to particle's velocity with . Other particles in the medium will randomly kick the particle as they collide with it and this effect can be approximated by a white noise term; . Newton's second law is written as
Taking for simplicity and changing the notation as leads to the familiar form .
The corresponding Fokker–Planck equation is
The stationary solution () is
Plasma physics
In plasma physics, the distribution function for a particle species , , takes the place of the probability density function
In probability theory, a probability density function (PDF), density function, or density of an absolutely continuous random variable, is a Function (mathematics), function whose value at any given sample (or point) in the sample space (the s ...
. The corresponding Boltzmann equation is given by
where the third term includes the particle acceleration due to the Lorentz force
In electromagnetism, the Lorentz force is the force exerted on a charged particle by electric and magnetic fields. It determines how charged particles move in electromagnetic environments and underlies many physical phenomena, from the operation ...
and the Fokker–Planck term at the right-hand side represents the effects of particle collisions. The quantities and are the average change in velocity a particle of type experiences due to collisions with all other particle species in unit time. Expressions for these quantities are given elsewhere. If collisions are ignored, the Boltzmann equation reduces to the Vlasov equation
In plasma physics, the Vlasov equation is a differential equation describing time evolution of the distribution function of collisionless plasma consisting of charged particles with long-range interaction, such as the Coulomb interaction. The e ...
.
Smoluchowski diffusion equation
Consider an overdamped Brownian particle under external force :where the term is negligible (the meaning of "overdamped"). Thus, it is just . The Fokker–Planck equation for this particle is the Smoluchowski diffusion equation:
Where is the diffusion constant and . The importance of this equation is it allows for both the inclusion of the effect of temperature on the system of particles and a spatially dependent diffusion constant.
Starting with the Langevin Equation
In physics, a Langevin equation (named after Paul Langevin) is a stochastic differential equation describing how a system evolves when subjected to a combination of deterministic and fluctuating ("random") forces. The dependent variables in a Lange ...
of a Brownian particle in external field , where is the friction term, is a fluctuating force on the particle, and is the amplitude of the fluctuation.
At equilibrium the frictional force is much greater than the inertial force, . Therefore, the Langevin equation becomes,
Which generates the following Fokker–Planck equation,
Rearranging the Fokker–Planck equation,
Where . Note, the diffusion coefficient may not necessarily be spatially independent if or are spatially dependent.
Next, the total number of particles in any particular volume is given by,
Therefore, the flux of particles can be determined by taking the time derivative of the number of particles in a given volume, plugging in the Fokker–Planck equation, and then applying Gauss's Theorem
In vector calculus, the divergence theorem, also known as Gauss's theorem or Ostrogradsky's theorem, reprinted in is a theorem relating the ''flux'' of a vector field through a closed surface to the ''divergence'' of the field in the volume en ...
.
In equilibrium, it is assumed that the flux goes to zero. Therefore, Boltzmann statistics can be applied for the probability of a particles location at equilibrium, where is a conservative force and the probability of a particle being in a state is given as .
This relation is a realization of the fluctuation–dissipation theorem
The fluctuation–dissipation theorem (FDT) or fluctuation–dissipation relation (FDR) is a powerful tool in statistical physics for predicting the behavior of systems that obey detailed balance. Given that a system obeys detailed balance, the the ...
. Now applying to and using the Fluctuation-dissipation theorem,
Rearranging,
Therefore, the Fokker–Planck equation becomes the Smoluchowski equation,
for an arbitrary force .
Computational considerations
Brownian motion follows the Langevin equation
In physics, a Langevin equation (named after Paul Langevin) is a stochastic differential equation describing how a system evolves when subjected to a combination of deterministic and fluctuating ("random") forces. The dependent variables in a Lange ...
, which can be solved for many different stochastic forcings with results being averaged (canonical ensemble in molecular dynamics
Molecular dynamics (MD) is a computer simulation method for analyzing the Motion (physics), physical movements of atoms and molecules. The atoms and molecules are allowed to interact for a fixed period of time, giving a view of the dynamics ( ...
). However, instead of this computationally intensive approach, one can use the Fokker–Planck equation and consider the probability of the particle having a velocity in the interval when it starts its motion with at time 0.
1-D linear potential example
Brownian dynamics in one dimension is simple.
Theory
Starting with a linear potential of the form the corresponding Smoluchowski equation becomes,
Where the diffusion constant, , is constant over space and time. The boundary conditions are such that the probability vanishes at with an initial condition of the ensemble of particles starting in the same place, .
Defining and and applying the coordinate transformation,
With the Smoluchowki equation becomes,
Which is the free diffusion equation with solution,
And after transforming back to the original coordinates,
Simulation
The simulation on the right was completed using a Brownian dynamics simulation. Starting with a Langevin equation for the system,
where is the friction term, is a fluctuating force on the particle, and is the amplitude of the fluctuation. At equilibrium the frictional force is much greater than the inertial force, . Therefore, the Langevin equation becomes,
For the Brownian dynamic simulation the fluctuation force is assumed to be Gaussian with the amplitude being dependent of the temperature of the system . Rewriting the Langevin equation,
where is the Einstein relation. The integration of this equation was done using the Euler–Maruyama method
In Itô calculus, the Euler–Maruyama method (also simply called the Euler method) is a method for the approximate numerical analysis, numerical solution of a stochastic differential equation (SDE). It is an extension of the Euler method for ord ...
to numerically approximate the path of this Brownian particle.
Solution
Being a partial differential equation
In mathematics, a partial differential equation (PDE) is an equation which involves a multivariable function and one or more of its partial derivatives.
The function is often thought of as an "unknown" that solves the equation, similar to ho ...
, the Fokker–Planck equation can be solved analytically only in special cases. A formal analogy of the Fokker–Planck equation with the Schrödinger equation
The Schrödinger equation is a partial differential equation that governs the wave function of a non-relativistic quantum-mechanical system. Its discovery was a significant landmark in the development of quantum mechanics. It is named after E ...
allows the use of advanced operator techniques known from quantum mechanics for its solution in a number of cases. Furthermore, in the case of overdamped dynamics when the Fokker–Planck equation contains second partial derivatives with respect to all spatial variables, the equation can be written in the form of a master equation
In physics, chemistry, and related fields, master equations are used to describe the time evolution of a system that can be modeled as being in a probabilistic combination of states at any given time, and the switching between states is determi ...
that can easily be solved numerically.
In many applications, one is only interested in the steady-state probability distribution , which can be found from .
The computation of mean first passage time
In statistics, first-hitting-time models are simplified models that estimate the amount of time that passes before some random or stochastic process crosses a barrier, boundary or reaches a specified state, termed the first hitting time, or the ...
s and splitting probabilities can be reduced to the solution of an ordinary differential equation which is intimately related to the Fokker–Planck equation.
Particular cases with known solution and inversion
In mathematical finance
Mathematical finance, also known as quantitative finance and financial mathematics, is a field of applied mathematics, concerned with mathematical modeling in the financial field.
In general, there exist two separate branches of finance that req ...
for volatility smile
Volatility smiles are implied volatility patterns that arise in pricing financial options. It is a parameter (implied volatility) that is needed to be modified for the Black–Scholes formula to fit market prices. In particular for a given ex ...
modeling of options via local volatility
A local volatility model, in mathematical finance and financial engineering, is an option pricing model that treats Volatility (finance), volatility as a function of both the current asset level S_t and of time t . As such, it is a generalisati ...
, one has the problem of deriving a diffusion coefficient consistent with a probability density obtained from market option quotes. The problem is therefore an inversion of the Fokker–Planck equation: Given the density f(x,t) of the option underlying ''X'' deduced from the option market, one aims at finding the local volatility consistent with ''f''. This is an inverse problem
An inverse problem in science is the process of calculating from a set of observations the causal factors that produced them: for example, calculating an image in X-ray computed tomography, sound source reconstruction, source reconstruction in ac ...
that has been solved in general by Dupire (1994, 1997) with a non-parametric solution. Brigo and Mercurio (2002, 2003) propose a solution in parametric form via a particular local volatility consistent with a solution of the Fokker–Planck equation given by a mixture model
In statistics, a mixture model is a probabilistic model for representing the presence of subpopulations within an overall population, without requiring that an observed data set should identify the sub-population to which an individual observati ...
. More information is available also in Fengler (2008), Gatheral (2008), and Musiela and Rutkowski (2008).
Fokker–Planck equation and path integral
Every Fokker–Planck equation is equivalent to a path integral. The path integral formulation is an excellent starting point for the application of field theory methods. This is used, for instance, in critical dynamics.
A derivation of the path integral is possible in a similar way as in quantum mechanics. The derivation for a Fokker–Planck equation with one variable is as follows. Start by inserting a delta function
In mathematical analysis, the Dirac delta function (or distribution), also known as the unit impulse, is a generalized function on the real numbers, whose value is zero everywhere except at zero, and whose integral over the entire real lin ...
and then integrating by parts:
The -derivatives here only act on the -function, not on . Integrate over a time interval ,
Insert the Fourier integral
for the -function,
This equation expresses as functional of . Iterating times and performing the limit gives a path integral with action
Action may refer to:
* Action (philosophy), something which is done by a person
* Action principles the heart of fundamental physics
* Action (narrative), a literary mode
* Action fiction, a type of genre fiction
* Action game, a genre of video gam ...
The variables conjugate to are called "response variables".
Although formally equivalent, different problems may be solved more easily in the Fokker–Planck equation or the path integral formulation. The equilibrium distribution for instance may be obtained more directly from the Fokker–Planck equation.
See also
* Bogoliubov–Born–Green–Kirkwood–Yvon hierarchy of equations
* Boltzmann equation
The Boltzmann equation or Boltzmann transport equation (BTE) describes the statistical behaviour of a thermodynamic system not in a state of equilibrium; it was devised by Ludwig Boltzmann in 1872.Encyclopaedia of Physics (2nd Edition), R. G ...
* Convection–diffusion equation
The convection–diffusion equation is a parabolic partial differential equation that combines the diffusion equation, diffusion and convection (advection equation, advection) equations. It describes physical phenomena where particles, energy, or o ...
* Klein–Kramers equation In physics and mathematics, the Oskar Klein, Klein–Hans Kramers, Kramers equation or sometimes referred as Kramers–Subrahmanyan_Chandrasekhar, Chandrasekhar equation is a partial differential equation that describes the probability density funct ...
* Kolmogorov backward equation
In probability theory, Kolmogorov equations characterize continuous-time Markov processes. In particular, they describe how the probability of a continuous-time Markov process in a certain state changes over time. There are four distinct equati ...
* Kolmogorov equation
* Langevin equation
In physics, a Langevin equation (named after Paul Langevin) is a stochastic differential equation describing how a system evolves when subjected to a combination of deterministic and fluctuating ("random") forces. The dependent variables in a Lange ...
* Master equation
In physics, chemistry, and related fields, master equations are used to describe the time evolution of a system that can be modeled as being in a probabilistic combination of states at any given time, and the switching between states is determi ...
* Mean-field game theory
Mean-field game theory is the study of strategic decision making by small interacting agents in very large populations. It lies at the intersection of game theory with stochastic analysis and control theory. The use of the term "mean field" is ins ...
* Ornstein–Uhlenbeck process
In mathematics, the Ornstein–Uhlenbeck process is a stochastic process with applications in financial mathematics and the physical sciences. Its original application in physics was as a model for the velocity of a massive Brownian particle ...
* Vlasov equation
In plasma physics, the Vlasov equation is a differential equation describing time evolution of the distribution function of collisionless plasma consisting of charged particles with long-range interaction, such as the Coulomb interaction. The e ...
Notes and references
Further reading
*
*
*
*
{{DEFAULTSORT:Fokker-Planck equation
Stochastic processes
Equations
Parabolic partial differential equations
Max Planck
Stochastic calculus
Mathematical finance
Transport phenomena