In
mathematics, the kernel of a
linear map
In mathematics, and more specifically in linear algebra, a linear map (also called a linear mapping, linear transformation, vector space homomorphism, or in some contexts linear function) is a mapping V \to W between two vector spaces that pr ...
, also known as the null space or nullspace, is the
linear subspace of the
domain
Domain may refer to:
Mathematics
*Domain of a function, the set of input values for which the (total) function is defined
** Domain of definition of a partial function
**Natural domain of a partial function
**Domain of holomorphy of a function
*Do ...
of the map which is mapped to the zero vector. That is, given a linear map between two
vector space
In mathematics and physics, a vector space (also called a linear space) is a set whose elements, often called '' vectors'', may be added together and multiplied ("scaled") by numbers called '' scalars''. Scalars are often real numbers, but ...
s and , the kernel of is the vector space of all elements of such that , where denotes the
zero vector in ,
or more symbolically:
:
Properties
The kernel of is a
linear subspace of the domain .
[Linear algebra, as discussed in this article, is a very well established mathematical discipline for which there are many sources. Almost all of the material in this article can be found in , , and Strang's lectures.]
In the linear map
two elements of have the same
image
An image is a visual representation of something. It can be two-dimensional, three-dimensional, or somehow otherwise feed into the visual system to convey information. An image can be an artifact, such as a photograph or other two-dimensio ...
in if and only if their difference lies in the kernel of , that is,
From this, it follows that the image of is
isomorphic to the
quotient
In arithmetic, a quotient (from lat, quotiens 'how many times', pronounced ) is a quantity produced by the division of two numbers. The quotient has widespread use throughout mathematics, and is commonly referred to as the integer part of a ...
of by the kernel:
In the case where is
finite-dimensional, this implies the
rank–nullity theorem
The rank–nullity theorem is a theorem in linear algebra, which asserts that the dimension of the domain of a linear map is the sum of its rank (the dimension of its image) and its ''nullity'' (the dimension of its kernel). p. 70, §2.1, Th ...
:
where the term refers the dimension of the image of ,
while ' refers to the dimension of the kernel of ,
That is,
so that the rank–nullity theorem can be restated as
When is an
inner product space
In mathematics, an inner product space (or, rarely, a Hausdorff pre-Hilbert space) is a real vector space or a complex vector space with an operation called an inner product. The inner product of two vectors in the space is a scalar, often ...
, the quotient
can be identified with the
orthogonal complement in of
This is the generalization to linear operators of the
row space, or coimage, of a matrix.
Application to modules
The notion of kernel also makes sense for
homomorphism
In algebra, a homomorphism is a structure-preserving map between two algebraic structures of the same type (such as two groups, two rings, or two vector spaces). The word ''homomorphism'' comes from the Ancient Greek language: () meaning "sa ...
s of
modules, which are generalizations of vector spaces where the scalars are elements of a
ring, rather than a
field. The domain of the mapping is a module, with the kernel constituting a
submodule. Here, the concepts of rank and nullity do not necessarily apply.
In functional analysis
If ''V'' and ''W'' are
topological vector space
In mathematics, a topological vector space (also called a linear topological space and commonly abbreviated TVS or t.v.s.) is one of the basic structures investigated in functional analysis.
A topological vector space is a vector space that is al ...
s such that ''W'' is finite-dimensional, then a linear operator ''L'': ''V'' → ''W'' is
continuous if and only if the kernel of ''L'' is a
closed subspace of ''V''.
Representation as matrix multiplication
Consider a linear map represented as a ''m'' × ''n'' matrix ''A'' with coefficients in a
field ''K'' (typically
or
), that is operating on column vectors x with ''n'' components over ''K''.
The kernel of this linear map is the set of solutions to the equation , where 0 is understood as the
zero vector. The
dimension
In physics and mathematics, the dimension of a mathematical space (or object) is informally defined as the minimum number of coordinates needed to specify any point within it. Thus, a line has a dimension of one (1D) because only one coor ...
of the kernel of ''A'' is called the nullity of ''A''. In
set-builder notation
In set theory and its applications to logic, mathematics, and computer science, set-builder notation is a mathematical notation for describing a set by enumerating its elements, or stating the properties that its members must satisfy.
Definin ...
,
:
The matrix equation is equivalent to a homogeneous
system of linear equations:
:
Thus the kernel of ''A'' is the same as the solution set to the above homogeneous equations.
Subspace properties
The kernel of a matrix ''A'' over a field ''K'' is a
linear subspace of K
''n''. That is, the kernel of ''A'', the set Null(''A''), has the following three properties:
# Null(''A'') always contains the
zero vector, since .
# If and , then . This follows from the distributivity of matrix multiplication over addition.
# If and ''c'' is a
scalar , then , since .
The row space of a matrix
The product ''A''x can be written in terms of the
dot product
In mathematics, the dot product or scalar productThe term ''scalar product'' means literally "product with a scalar as a result". It is also used sometimes for other symmetric bilinear forms, for example in a pseudo-Euclidean space. is an alg ...
of vectors as follows:
:
Here, a
1, ... , a
''m'' denote the rows of the matrix ''A''. It follows that x is in the kernel of ''A'', if and only if x is
orthogonal
In mathematics, orthogonality is the generalization of the geometric notion of '' perpendicularity''.
By extension, orthogonality is also used to refer to the separation of specific features of a system. The term also has specialized meanings in ...
(or perpendicular) to each of the row vectors of ''A'' (since orthogonality is defined as having a dot product of 0).
The
row space, or coimage, of a matrix ''A'' is the
span
Span may refer to:
Science, technology and engineering
* Span (unit), the width of a human hand
* Span (engineering), a section between two intermediate supports
* Wingspan, the distance between the wingtips of a bird or aircraft
* Sorbitan es ...
of the row vectors of ''A''. By the above reasoning, the kernel of ''A'' is the
orthogonal complement to the row space. That is, a vector x lies in the kernel of ''A'', if and only if it is perpendicular to every vector in the row space of ''A''.
The dimension of the row space of ''A'' is called the
rank of ''A'', and the dimension of the kernel of ''A'' is called the nullity of ''A''. These quantities are related by the
rank–nullity theorem
The rank–nullity theorem is a theorem in linear algebra, which asserts that the dimension of the domain of a linear map is the sum of its rank (the dimension of its image) and its ''nullity'' (the dimension of its kernel). p. 70, §2.1, Th ...
:
Left null space
The left null space, or
cokernel, of a matrix ''A'' consists of all column vectors x such that x
T''A'' = 0
T, where T denotes the
transpose
In linear algebra, the transpose of a matrix is an operator which flips a matrix over its diagonal;
that is, it switches the row and column indices of the matrix by producing another matrix, often denoted by (among other notations).
The tr ...
of a matrix. The left null space of ''A'' is the same as the kernel of ''A''
T. The left null space of ''A'' is the orthogonal complement to the
column space
In linear algebra, the column space (also called the range or image) of a matrix ''A'' is the span (set of all possible linear combinations) of its column vectors. The column space of a matrix is the image or range of the corresponding mat ...
of ''A'', and is dual to the
cokernel of the associated linear transformation. The kernel, the row space, the column space, and the left null space of ''A'' are the four fundamental subspaces associated to the matrix ''A''.
Nonhomogeneous systems of linear equations
The kernel also plays a role in the solution to a nonhomogeneous system of linear equations:
:
If u and v are two possible solutions to the above equation, then
:
Thus, the difference of any two solutions to the equation ''A''x = b lies in the kernel of ''A''.
It follows that any solution to the equation ''A''x = b can be expressed as the sum of a fixed solution v and an arbitrary element of the kernel. That is, the solution set to the equation ''A''x = b is
:
Geometrically, this says that the solution set to ''A''x = b is the
translation
Translation is the communication of the Meaning (linguistic), meaning of a #Source and target languages, source-language text by means of an Dynamic and formal equivalence, equivalent #Source and target languages, target-language text. The ...
of the kernel of ''A'' by the vector v. See also
Fredholm alternative and
flat (geometry).
Illustration
The following is a simple illustration of the computation of the kernel of a matrix (see , below for methods better suited to more complex calculations). The illustration also touches on the row space and its relation to the kernel.
Consider the matrix
:
The kernel of this matrix consists of all vectors for which
:
which can be expressed as a homogeneous
system of linear equations involving ''x'', ''y'', and ''z'':
:
The same linear equations can also be written in matrix form as:
:
Through
Gauss–Jordan elimination, the matrix can be reduced to:
:
Rewriting the matrix in equation form yields:
:
The elements of the kernel can be further expressed in parametric form, as follows:
:
Since ''c'' is a
free variable ranging over all real numbers, this can be expressed equally well as:
:
The kernel of ''A'' is precisely the solution set to these equations (in this case, a
line
Line most often refers to:
* Line (geometry), object with zero thickness and curvature that stretches to infinity
* Telephone line, a single-user circuit on a telephone communication system
Line, lines, The Line, or LINE may also refer to:
Art ...
through the origin in R
3). Here, since the vector (−1,−26,16)
T constitutes a
basis of the kernel of ''A''. The nullity of ''A'' is 1.
The following dot products are zero:
:
which illustrates that vectors in the kernel of ''A'' are orthogonal to each of the row vectors of ''A''.
These two (linearly independent) row vectors span the row space of ''A''—a plane orthogonal to the vector (−1,−26,16)
T.
With the rank 2 of ''A'', the nullity 1 of ''A'', and the dimension 3 of ''A'', we have an illustration of the rank-nullity theorem.
Examples
*If , then the kernel of ''L'' is the solution set to a homogeneous
system of linear equations. As in the above illustration, if ''L'' is the operator:
then the kernel of ''L'' is the set of solutions to the equations
*Let ''C''
,1denote the
vector space
In mathematics and physics, a vector space (also called a linear space) is a set whose elements, often called '' vectors'', may be added together and multiplied ("scaled") by numbers called '' scalars''. Scalars are often real numbers, but ...
of all continuous real-valued functions on the interval
,1 and define by the rule
Then the kernel of ''L'' consists of all functions for which .
*Let ''C''
∞(R) be the vector space of all infinitely differentiable functions , and let be the
differentiation operator:
Then the kernel of ''D'' consists of all functions in whose derivatives are zero, i.e. the set of all
constant function
In mathematics, a constant function is a function whose (output) value is the same for every input value. For example, the function is a constant function because the value of is 4 regardless of the input value (see image).
Basic properti ...
s.
*Let be the
direct product
In mathematics, one can often define a direct product of objects already known, giving a new one. This generalizes the Cartesian product of the underlying sets, together with a suitably defined structure on the product set. More abstractly, one t ...
of infinitely many copies of , and let be the
shift operator
In mathematics, and in particular functional analysis, the shift operator also known as translation operator is an operator that takes a function
to its translation . In time series analysis, the shift operator is called the lag operator.
Shift ...
Then the kernel of ''s'' is the one-dimensional subspace consisting of all vectors .
*If is an
inner product space
In mathematics, an inner product space (or, rarely, a Hausdorff pre-Hilbert space) is a real vector space or a complex vector space with an operation called an inner product. The inner product of two vectors in the space is a scalar, often ...
and is a subspace, the kernel of the
orthogonal projection
In linear algebra and functional analysis, a projection is a linear transformation P from a vector space to itself (an endomorphism) such that P\circ P=P. That is, whenever P is applied twice to any vector, it gives the same result as if i ...
is the
orthogonal complement to in .
Computation by Gaussian elimination
A
basis of the kernel of a matrix may be computed by
Gaussian elimination
In mathematics, Gaussian elimination, also known as row reduction, is an algorithm for solving systems of linear equations. It consists of a sequence of operations performed on the corresponding matrix of coefficients. This method can also be used ...
.
For this purpose, given an ''m'' × ''n'' matrix ''A'', we construct first the row
augmented matrix where is the ''n'' × ''n''
identity matrix
In linear algebra, the identity matrix of size n is the n\times n square matrix with ones on the main diagonal and zeros elsewhere.
Terminology and notation
The identity matrix is often denoted by I_n, or simply by I if the size is immaterial ...
.
Computing its
column echelon form by Gaussian elimination (or any other suitable method), we get a matrix
A basis of the kernel of ''A'' consists in the non-zero columns of ''C'' such that the corresponding column of ''B'' is a
zero column.
In fact, the computation may be stopped as soon as the upper matrix is in column echelon form: the remainder of the computation consists in changing the basis of the vector space generated by the columns whose upper part is zero.
For example, suppose that
:
Then
:
Putting the upper part in column echelon form by column operations on the whole matrix gives
:
The last three columns of ''B'' are zero columns. Therefore, the three last vectors of ''C'',
:
are a basis of the kernel of ''A''.
Proof that the method computes the kernel: Since column operations correspond to post-multiplication by invertible matrices, the fact that
reduces to
means that there exists an invertible matrix
such that
with
in column echelon form. Thus
and
A column vector
belongs to the kernel of
(that is
) if and only
where
As
is in column echelon form,
if and only if the nonzero entries of
correspond to the zero columns of
By multiplying by
, one may deduce that this is the case if and only if
is a linear combination of the corresponding columns of
Numerical computation
The problem of computing the kernel on a computer depends on the nature of the coefficients.
Exact coefficients
If the coefficients of the matrix are exactly given numbers, the
column echelon form of the matrix may be computed by
Bareiss algorithm more efficiently than with Gaussian elimination. It is even more efficient to use
modular arithmetic
In mathematics, modular arithmetic is a system of arithmetic for integers, where numbers "wrap around" when reaching a certain value, called the modulus. The modern approach to modular arithmetic was developed by Carl Friedrich Gauss in his bo ...
and
Chinese remainder theorem
In mathematics, the Chinese remainder theorem states that if one knows the remainders of the Euclidean division of an integer ''n'' by several integers, then one can determine uniquely the remainder of the division of ''n'' by the product of the ...
, which reduces the problem to several similar ones over
finite field
In mathematics, a finite field or Galois field (so-named in honor of Évariste Galois) is a field that contains a finite number of elements. As with any field, a finite field is a set on which the operations of multiplication, addition, subt ...
s (this avoids the overhead induced by the non-linearity of the
computational complexity
In computer science, the computational complexity or simply complexity of an algorithm is the amount of resources required to run it. Particular focus is given to computation time (generally measured by the number of needed elementary operations) ...
of integer multiplication).
For coefficients in a finite field, Gaussian elimination works well, but for the large matrices that occur in
cryptography
Cryptography, or cryptology (from grc, , translit=kryptós "hidden, secret"; and ''graphein'', "to write", or ''-logia'', "study", respectively), is the practice and study of techniques for secure communication in the presence of adve ...
and
Gröbner basis
In mathematics, and more specifically in computer algebra, computational algebraic geometry, and computational commutative algebra, a Gröbner basis is a particular kind of generating set of an ideal in a polynomial ring over a field . A Grö ...
computation, better algorithms are known, which have roughly the same
computational complexity
In computer science, the computational complexity or simply complexity of an algorithm is the amount of resources required to run it. Particular focus is given to computation time (generally measured by the number of needed elementary operations) ...
, but are faster and behave better with modern
computer hardware.
Floating point computation
For matrices whose entries are
floating-point numbers, the problem of computing the kernel makes sense only for matrices such that the number of rows is equal to their rank: because of the
rounding error
A roundoff error, also called rounding error, is the difference between the result produced by a given algorithm using exact arithmetic and the result produced by the same algorithm using finite-precision, rounded arithmetic. Rounding errors are ...
s, a floating-point matrix has almost always a
full rank
In linear algebra, the rank of a matrix is the dimension of the vector space generated (or spanned) by its columns. p. 48, § 1.16 This corresponds to the maximal number of linearly independent columns of . This, in turn, is identical to the dime ...
, even when it is an approximation of a matrix of a much smaller rank. Even for a full-rank matrix, it is possible to compute its kernel only if it is
well conditioned, i.e. it has a low
condition number
In numerical analysis, the condition number of a function measures how much the output value of the function can change for a small change in the input argument. This is used to measure how sensitive a function is to changes or errors in the inpu ...
.
Even for a well conditioned full rank matrix, Gaussian elimination does not behave correctly: it introduces rounding errors that are too large for getting a significant result. As the computation of the kernel of a matrix is a special instance of solving a homogeneous system of linear equations, the kernel may be computed by any of the various algorithms designed to solve homogeneous systems. A state of the art software for this purpose is the
Lapack library.
See also
*
Kernel (algebra)
*
Zero set
In mathematics, a zero (also sometimes called a root) of a real-, complex-, or generally vector-valued function f, is a member x of the domain of f such that f(x) ''vanishes'' at x; that is, the function f attains the value of 0 at x, or ...
*
System of linear equations
*
Row and column spaces
In linear algebra, the column space (also called the range or image) of a matrix ''A'' is the span (set of all possible linear combinations) of its column vectors. The column space of a matrix is the image or range of the corresponding matri ...
*
Row reduction
*
Four fundamental subspaces
*
Vector space
In mathematics and physics, a vector space (also called a linear space) is a set whose elements, often called '' vectors'', may be added together and multiplied ("scaled") by numbers called '' scalars''. Scalars are often real numbers, but ...
*
Linear subspace
*
Linear operator
*
Function space
*
Fredholm alternative
Notes and references
Bibliography
*
*
*
*
*
*
*
*
External links
*
*
Khan Academy
Khan Academy is an American non-profit educational organization created in 2008 by Sal Khan. Its goal is creating a set of online tools that help educate students. The organization produces short lessons in the form of videos. Its website also i ...
Introduction to the Null Space of a Matrix
{{DEFAULTSORT:Kernel (linear algebra)
Linear algebra
Functional analysis
Matrices
Numerical linear algebra