Numerical linear algebra, sometimes called applied linear algebra, is the study of how matrix operations can be used to create
computer algorithms
In mathematics and computer science, an algorithm () is a finite sequence of rigorous instructions, typically used to solve a class of specific problems or to perform a computation. Algorithms are used as specifications for performing c ...
which
efficiently and accurately provide approximate answers to questions in
continuous mathematics. It is a subfield of
numerical analysis
Numerical analysis is the study of algorithms that use numerical approximation (as opposed to symbolic manipulations) for the problems of mathematical analysis (as distinguished from discrete mathematics). It is the study of numerical methods th ...
, and a type of
linear algebra
Linear algebra is the branch of mathematics concerning linear equations such as:
:a_1x_1+\cdots +a_nx_n=b,
linear maps such as:
:(x_1, \ldots, x_n) \mapsto a_1x_1+\cdots +a_nx_n,
and their representations in vector spaces and through matric ...
.
Computers use
floating-point arithmetic
In computing, floating-point arithmetic (FP) is arithmetic that represents real numbers approximately, using an integer with a fixed precision, called the significand, scaled by an integer exponent of a fixed base. For example, 12.345 can b ...
and cannot exactly represent
irrational
Irrationality is cognition, thinking, talking, or acting without inclusion of rationality. It is more specifically described as an action or opinion given through inadequate use of reason, or through emotional distress or cognitive deficiency. ...
data, so when a computer algorithm is applied to a matrix of data, it can sometimes
increase the difference between a number stored in the computer and the true number that it is an approximation of. Numerical linear algebra uses properties of vectors and matrices to develop computer algorithms that minimize the error introduced by the computer, and is also concerned with ensuring that the algorithm is as efficient as possible.
Numerical linear algebra aims to solve problems of continuous
mathematics using finite precision computers, so its applications to the
natural
Nature, in the broadest sense, is the physical world or universe. "Nature" can refer to the phenomena of the physical world, and also to life in general. The study of nature is a large, if not the only, part of science. Although humans are ...
and
social sciences
Social science is one of the branches of science, devoted to the study of society, societies and the Social relation, relationships among individuals within those societies. The term was formerly used to refer to the field of sociology, the o ...
are as vast as the applications of continuous mathematics. It is often a fundamental part of
engineering
Engineering is the use of scientific method, scientific principles to design and build machines, structures, and other items, including bridges, tunnels, roads, vehicles, and buildings. The discipline of engineering encompasses a broad rang ...
and
computational science
Computational science, also known as scientific computing or scientific computation (SC), is a field in mathematics that uses advanced computing capabilities to understand and solve complex problems. It is an area of science that spans many disc ...
problems, such as
image
An image is a visual representation of something. It can be two-dimensional, three-dimensional, or somehow otherwise feed into the visual system to convey information. An image can be an artifact, such as a photograph or other two-dimensio ...
and
signal processing
Signal processing is an electrical engineering subfield that focuses on analyzing, modifying and synthesizing '' signals'', such as sound, images, and scientific measurements. Signal processing techniques are used to optimize transmissions, ...
,
telecommunication
Telecommunication is the transmission of information by various types of technologies over wire, radio, optical, or other electromagnetic systems. It has its origin in the desire of humans for communication over a distance greater than tha ...
,
computational finance,
materials science simulations,
structural biology
Structural biology is a field that is many centuries old which, and as defined by the Journal of Structural Biology, deals with structural analysis of living material (formed, composed of, and/or maintained and refined by living cells) at every le ...
,
data mining,
bioinformatics
Bioinformatics () is an interdisciplinary field that develops methods and software tools for understanding biological data, in particular when the data sets are large and complex. As an interdisciplinary field of science, bioinformatics combin ...
, and
fluid dynamics
In physics and engineering, fluid dynamics is a subdiscipline of fluid mechanics that describes the flow of fluids—liquids and gases. It has several subdisciplines, including '' aerodynamics'' (the study of air and other gases in motion) ...
. Matrix methods are particularly used in
finite difference methods
In numerical analysis, finite-difference methods (FDM) are a class of numerical techniques for solving differential equations by approximating Derivative, derivatives with Finite difference approximation, finite differences. Both the spatial dom ...
,
finite element methods, and the modeling of
differential equations
In mathematics, a differential equation is an equation that relates one or more unknown functions and their derivatives. In applications, the functions generally represent physical quantities, the derivatives represent their rates of change, a ...
. Noting the broad applications of numerical linear algebra,
Lloyd N. Trefethen and David Bau, III argue that it is "as fundamental to the mathematical sciences as calculus and differential equations",
even though it is a comparatively small field.
Because many properties of matrices and vectors also apply to functions and operators, numerical linear algebra can also be viewed as a type of
functional analysis
Functional analysis is a branch of mathematical analysis, the core of which is formed by the study of vector spaces endowed with some kind of limit-related structure (e.g. inner product, norm, topology, etc.) and the linear functions defined ...
which has a particular emphasis on practical algorithms.
[
Common problems in numerical linear algebra include obtaining matrix decompositions like the ]singular value decomposition
In linear algebra, the singular value decomposition (SVD) is a factorization of a real or complex matrix. It generalizes the eigendecomposition of a square normal matrix with an orthonormal eigenbasis to any \ m \times n\ matrix. It is r ...
, the QR factorization, the LU factorization, or the eigendecomposition, which can then be used to answer common linear algebraic problems like solving linear systems of equations, locating eigenvalues, or least squares optimisation. Numerical linear algebra's central concern with developing algorithms that do not introduce errors when applied to real data on a finite precision computer is often achieved by iterative methods rather than direct ones.
History
Numerical linear algebra was developed by computer pioneers like John von Neumann
John von Neumann (; hu, Neumann János Lajos, ; December 28, 1903 – February 8, 1957) was a Hungarian-American mathematician, physicist, computer scientist, engineer and polymath. He was regarded as having perhaps the widest cove ...
, Alan Turing
Alan Mathison Turing (; 23 June 1912 – 7 June 1954) was an English mathematician, computer scientist, logician, cryptanalyst, philosopher, and theoretical biologist. Turing was highly influential in the development of theoretical c ...
, James H. Wilkinson
James Hardy Wilkinson FRS (27 September 1919 – 5 October 1986) was a prominent figure in the field of numerical analysis, a field at the boundary of applied mathematics and computer science particularly useful to physics and engineering.
Edu ...
, Alston Scott Householder, George Forsythe, and Heinz Rutishauser
Heinz Rutishauser (30 January 1918 – 10 November 1970) was a Swiss mathematician and a pioneer of modern numerical mathematics and computer science.
Life
Rutishauser's father died when he was 13 years old and his mother died three years la ...
, in order to apply the earliest computers to problems in continuous mathematics, such as ballistics problems and the solutions to systems of partial differential equation
In mathematics, a partial differential equation (PDE) is an equation which imposes relations between the various partial derivatives of a multivariable function.
The function is often thought of as an "unknown" to be solved for, similarly to ...
s.[ The first serious attempt to minimize computer error in the application of algorithms to real data is John von Neumann and Herman Goldstine's work in 1947. The field has grown as technology has increasingly enabled researchers to solve complex problems on extremely large high-precision matrices, and some numerical algorithms have grown in prominence as technologies like parallel computing have made them practical approaches to scientific problems.][
]
Matrix decompositions
Partitioned matrices
For many problems in applied linear algebra, it is useful to adopt the perspective of a matrix as being a concatenation of column vectors. For example, when solving the linear system , rather than understanding ''x'' as the product of with ''b'', it is helpful to think of ''x'' as the vector of coefficients
In mathematics, a coefficient is a multiplicative factor in some term of a polynomial, a series, or an expression; it is usually a number, but may be any expression (including variables such as , and ). When the coefficients are themselves ...
in the linear expansion of ''b'' in the basis formed by the columns of ''A''.[ Thinking of matrices as a concatenation of columns is also a practical approach for the purposes of matrix algorithms. This is because matrix algorithms frequently contain two nested loops: one over the columns of a matrix ''A'', and another over the rows of ''A''. For example, for matrices and vectors and , we could use the column partitioning perspective to compute ''Ax'' + ''y'' as
for q = 1:n
for p = 1:m
y(p) = A(p,q)*x(q) + y(p)
end
end
]
Singular value decomposition
The singular value decomposition of a matrix is where ''U'' and ''V'' are unitary, and is diagonal
In geometry, a diagonal is a line segment joining two vertices of a polygon or polyhedron, when those vertices are not on the same edge. Informally, any sloping line is called diagonal. The word ''diagonal'' derives from the ancient Gree ...
. The diagonal entries of are called the singular values of ''A''. Because singular values are the square roots of the eigenvalues
In linear algebra, an eigenvector () or characteristic vector of a linear transformation is a nonzero vector that changes at most by a scalar factor when that linear transformation is applied to it. The corresponding eigenvalue, often denoted b ...
of , there is a tight connection between the singular value decomposition and eigenvalue decompositions. This means that most methods for computing the singular value decomposition are similar to eigenvalue methods;[ perhaps the most common method involves Householder procedures.][
]
QR factorization
The QR factorization of a matrix is a matrix and a matrix so that ''A = QR'', where ''Q'' is orthogonal
In mathematics, orthogonality is the generalization of the geometric notion of '' perpendicularity''.
By extension, orthogonality is also used to refer to the separation of specific features of a system. The term also has specialized meanings in ...
and ''R'' is upper triangular. The two main algorithms for computing QR factorizations are the Gram–Schmidt process and the Householder transformation. The QR factorization is often used to solve linear least-squares
Linear least squares (LLS) is the least squares approximation of linear functions to data.
It is a set of formulations for solving statistical problems involved in linear regression, including variants for ordinary (unweighted), weighted, and g ...
problems, and eigenvalue problems (by way of the iterative QR algorithm).
LU factorization
An LU factorization of a matrix ''A'' consists of a lower triangular matrix ''L'' and an upper triangular matrix ''U'' so that ''A = LU''. The matrix ''U'' is found by an upper triangularization procedure which involves left-multiplying ''A'' by a series of matrices to form the product , so that equivalently .[
]
Eigenvalue decomposition
The eigenvalue decomposition of a matrix is , where the columns of ''X'' are the eigenvectors of ''A'', and is a diagonal matrix the diagonal entries of which are the corresponding eigenvalues of ''A''.[ There is no direct method for finding the eigenvalue decomposition of an arbitrary matrix. Because it is not possible to write a program that finds the exact roots of an arbitrary polynomial in finite time, any general eigenvalue solver must necessarily be iterative.][
]
Algorithms
Gaussian elimination
From the numerical linear algebra perspective, Gaussian elimination is a procedure for factoring a matrix ''A'' into its ''LU'' factorization, which Gaussian elimination accomplishes by left-multiplying ''A'' by a succession of matrices until ''U'' is upper triangular and ''L'' is lower triangular, where .[ Naive programs for Gaussian elimination are notoriously highly unstable, and produce huge errors when applied to matrices with many significant digits.][ The simplest solution is to introduce pivoting, which produces a modified Gaussian elimination algorithm that is stable.][
]
Solutions of linear systems
Numerical linear algebra characteristically approaches matrices as a concatenation of columns vectors. In order to solve the linear system , the traditional algebraic approach is to understand ''x'' as the product of with ''b''. Numerical linear algebra instead interprets ''x'' as the vector of coefficients of the linear expansion of ''b'' in the basis formed by the columns of ''A''.[
Many different decompositions can be used to solve the linear problem, depending on the characteristics of the matrix ''A'' and the vectors ''x'' and ''b'', which may make one factorization much easier to obtain than others. If ''A'' = ''QR'' is a QR factorization of ''A'', then equivalently . This is easy to compute as a matrix factorization.][ If is an eigendecomposition ''A'', and we seek to find ''b'' so that ''b'' = ''Ax'', with and , then we have .][ This is closely related to the solution to the linear system using the singular value decomposition, because singular values of a matrix are the square roots of its eigenvalues. And if ''A'' = ''LU'' is an ''LU'' factorization of ''A'', then ''Ax'' = ''b'' can be solved using the triangular matrices ''Ly'' = ''b'' and ''Ux'' = ''y''.][
]
Least squares optimisation
Matrix decompositions suggest a number of ways to solve the linear system ''r'' = ''b'' − ''Ax'' where we seek to minimize ''r'', as in the regression problem. The QR algorithm solves this problem by first defining ''y'' ''= Ax'', and then computing the reduced QR factorization of ''A'' and rearranging to obtain . This upper triangular system can then be solved for ''b''. The SVD also suggests an algorithm for obtaining linear least squares. By computing the reduced SVD decomposition and then computing the vector , we reduce the least squares problem to a simple diagonal system.[ The fact that least squares solutions can be produced by the QR and SVD factorizations means that, in addition to the classical ]normal equations
In statistics, ordinary least squares (OLS) is a type of linear least squares method for choosing the unknown parameters in a linear regression model (with fixed level-one effects of a linear function of a set of explanatory variables) by the ...
method for solving least squares problems, these problems can also be solved by methods that include the Gram-Schmidt algorithm and Householder methods.
Conditioning and stability
Allow that a problem is a function , where ''X'' is a normed vector space of data and ''Y'' is a normed vector space of solutions. For some data point , the problem is said to be ill-conditioned if a small perturbation in ''x'' produces a large change in the value of ''f''(''x''). We can quantify this by defining a condition number
In numerical analysis, the condition number of a function measures how much the output value of the function can change for a small change in the input argument. This is used to measure how sensitive a function is to changes or errors in the inpu ...
which represents how well-conditioned a problem is, defined as
Instability is the tendency of computer algorithms, which depend on floating-point arithmetic
In computing, floating-point arithmetic (FP) is arithmetic that represents real numbers approximately, using an integer with a fixed precision, called the significand, scaled by an integer exponent of a fixed base. For example, 12.345 can b ...
, to produce results that differ dramatically from the exact mathematical solution to a problem. When a matrix contains real data with many significant digits
Significant figures (also known as the significant digits, ''precision'' or ''resolution'') of a number in positional notation are digits in the number that are reliable and necessary to indicate the quantity of something.
If a number expres ...
, many algorithms for solving problems like linear systems of equation or least squares optimisation may produce highly inaccurate results. Creating stable algorithms for ill-conditioned problems is a central concern in numerical linear algebra. One example is that the stability of householder triangularization makes it a particularly robust solution method for linear systems, whereas the instability of the normal equations method for solving least squares problems is a reason to favour matrix decomposition methods like using the singular value decomposition. Some matrix decomposition methods may be unstable, but have straightforward modifications that make them stable; one example is the unstable Gram–Schmidt, which can easily be changed to produce the stable modified Gram–Schmidt.[ Another classical problem in numerical linear algebra is the finding that Gaussian elimination is unstable, but becomes stable with the introduction of pivoting.
]
Iterative methods
There are two reasons that iterative algorithms are an important part of numerical linear algebra. First, many important numerical problems have no direct solution; in order to find the eigenvalues and eigenvectors of an arbitrary matrix, we can only adopt an iterative approach. Second, noniterative algorithms for an arbitrary matrix require time, which is a surprisingly high floor given that matrices contain only numbers. Iterative approaches can take advantage of several features of some matrices to reduce this time. For example, when a matrix is sparse
Sparse is a computer software tool designed to find possible coding faults in the Linux kernel. Unlike other such tools, this static analysis tool was initially designed to only flag constructs that were likely to be of interest to kernel de ...
, an iterative algorithm can skip many of the steps that a direct approach would necessarily follow, even if they are redundant steps given a highly structured matrix.
The core of many iterative methods in numerical linear algebra is the projection of a matrix onto a lower dimensional Krylov subspace, which allows features of a high-dimensional matrix to be approximated by iteratively computing the equivalent features of similar matrices starting in a low dimension space and moving to successively higher dimensions. When ''A'' is symmetric and we wish to solve the linear problem ''Ax'' = ''b'', the classical iterative approach is the conjugate gradient method
In mathematics, the conjugate gradient method is an algorithm for the numerical solution of particular systems of linear equations, namely those whose matrix is positive-definite. The conjugate gradient method is often implemented as an iter ...
. If ''A'' is not symmetric, then examples of iterative solutions to the linear problem are the generalized minimal residual method In mathematics, the generalized minimal residual method (GMRES) is an iterative method for the numerical solution of an indefinite nonsymmetric system of linear equations. The method approximates the solution by the vector in a Krylov subspace wit ...
and CGN. If ''A'' is symmetric, then to solve the eigenvalue and eigenvector problem we can use the Lanczos algorithm, and if ''A'' is non-symmetric, then we can use Arnoldi iteration.
Software
Several programming languages use numerical linear algebra optimisation techniques and are designed to implement numerical linear algebra algorithms. These languages include MATLAB
MATLAB (an abbreviation of "MATrix LABoratory") is a proprietary multi-paradigm programming language and numeric computing environment developed by MathWorks. MATLAB allows matrix manipulations, plotting of functions and data, implementa ...
, Analytica, Maple
''Acer'' () is a genus of trees and shrubs commonly known as maples. The genus is placed in the family Sapindaceae.Stevens, P. F. (2001 onwards). Angiosperm Phylogeny Website. Version 9, June 2008 nd more or less continuously updated since ht ...
, and Mathematica
Wolfram Mathematica is a software system with built-in libraries for several areas of technical computing that allow machine learning, statistics, symbolic computation, data manipulation, network analysis, time series analysis, NLP, optimi ...
. Other programming languages which are not explicitly designed for numerical linear algebra have libraries that provide numerical linear algebra routines and optimisation; C and Fortran have packages like Basic Linear Algebra Subprograms and LAPACK
LAPACK ("Linear Algebra Package") is a standard software library for numerical linear algebra. It provides routines for solving systems of linear equations and linear least squares, eigenvalue problems, and singular value decomposition. It al ...
, python has the library NumPy, and Perl
Perl is a family of two High-level programming language, high-level, General-purpose programming language, general-purpose, Interpreter (computing), interpreted, dynamic programming languages. "Perl" refers to Perl 5, but from 2000 to 2019 it ...
has the Perl Data Language. Many numerical linear algebra commands in R rely on these more fundamental libraries like LAPACK
LAPACK ("Linear Algebra Package") is a standard software library for numerical linear algebra. It provides routines for solving systems of linear equations and linear least squares, eigenvalue problems, and singular value decomposition. It al ...
. More libraries can be found on the List of numerical libraries
This is a list of numerical libraries, which are libraries used in software development for performing numerical calculations. It is not a complete listing but is instead a list of numerical libraries with articles on Wikipedia, with few exceptio ...
.
References
Further reading
*
External links
Freely available software for numerical algebra on the web
composed by Jack Dongarra and Hatem Ltaief, University of Tennessee
* (Research group in the United Kingdom)
* (Activity group about numerical linear algebra in the Society for Industrial and Applied Mathematics
Society for Industrial and Applied Mathematics (SIAM) is a professional society dedicated to applied mathematics, computational science, and data science through research, publications, and community. SIAM is the world's largest scientific soci ...
)
The GAMM Activity Group on Applied and Numerical Linear Algebra
{{Industrial and applied mathematics
Computational fields of study