In
numerical linear algebra, the alternating-direction implicit (ADI) method is an iterative method used to solve
Sylvester matrix equations. It is a popular method for solving the large matrix equations that arise in
systems theory
Systems theory is the interdisciplinary study of systems, i.e. cohesive groups of interrelated, interdependent components that can be natural or human-made. Every system has causal boundaries, is influenced by its context, defined by its structu ...
and
control,
and can be formulated to construct solutions in a memory-efficient, factored form.
It is also used to numerically solve
parabolic and
elliptic partial differential equations, and is a classic method used for modeling
heat conduction and solving the
diffusion equation
The diffusion equation is a parabolic partial differential equation. In physics, it describes the macroscopic behavior of many micro-particles in Brownian motion, resulting from the random movements and collisions of the particles (see Fick's law ...
in two or more dimensions.
[.] It is an example of an
operator splitting method.
ADI for matrix equations
The method
The ADI method is a two step iteration process that alternately updates the column and row spaces of an approximate solution to
. One ADI iteration consists of the following steps:
1. Solve for , where
2. Solve for , where .
The numbers
are called shift parameters, and convergence depends strongly on the choice of these parameters.
To perform
iterations of ADI, an initial guess
is required, as well as
shift parameters,
.
When to use ADI
If
and
, then
can be solved directly in
using the
Bartels-Stewart method. It is therefore only beneficial to use ADI when matrix-vector multiplication and linear solves involving
and
can be applied cheaply.
The equation
has a unique solution if and only if
, where
is the
spectrum
A spectrum (plural ''spectra'' or ''spectrums'') is a condition that is not limited to a specific set of values but can vary, without gaps, across a continuum. The word was first used scientifically in optics to describe the rainbow of color ...
of
.
However, the ADI method performs especially well when
and
are well-separated, and
and
are
normal matrices. These assumptions are met, for example, by the Lyapunov equation
when
is
positive definite. Under these assumptions, near-optimal shift parameters are known for several choices of
and
.
Additionally, a priori error bounds can be computed, thereby eliminating the need to monitor the residual error in implementation.
The ADI method can still be applied when the above assumptions are not met. The use of suboptimal shift parameters may adversely affect convergence,
and convergence is also affected by the non-normality of
or
(sometimes advantageously).
Krylov subspace methods, such as the Rational Krylov Subspace Method, are observed to typically converge more rapidly than ADI in this setting,
and this has led to the development of hybrid ADI-projection methods.
Shift-parameter selection and the ADI error equation
The problem of finding good shift parameters is nontrivial. This problem can be understood by examining the ADI error equation. After
iterations, the error is given by
Choosing
results in the following bound on the relative error:
where
is the
operator norm. The ideal set of shift parameters
defines a
rational function
In mathematics, a rational function is any function that can be defined by a rational fraction, which is an algebraic fraction such that both the numerator and the denominator are polynomials. The coefficients of the polynomials need not be ...
that minimizes the quantity
. If
and
are
normal matrices and have
eigendecompositions and
, then
.
Near-optimal shift parameters
Near-optimal shift parameters are known in certain cases, such as when