In mathematics, the kernel of a linear map, also known as the null space or nullspace, is the part of the domain which is mapped to the zero vector of the co-domain; the kernel is always a linear subspace of the domain. That is, given a linear map L : V → W between two vector spaces V and W, the kernel of L is the vector space of all elements v of V such that L(v) = 0, where 0 denotes the zero vector in W, or more symbolically:

\ker(L) = \left\{ \mathbf{v} \in V \mid L(\mathbf{v}) = \mathbf{0} \right\} = L^{-1}(\mathbf{0}).

Properties

The kernel of L is a linear subspace of the domain V. (Linear algebra, as discussed in this article, is a very well established mathematical discipline for which there are many sources; almost all of the material in this article can be found in standard textbooks and in Strang's lectures.) In the linear map

L : V \to W,

two elements of V have the same image in W if and only if their difference lies in the kernel of L, that is,

L\left(\mathbf{v}_1\right) = L\left(\mathbf{v}_2\right) \quad \text{ if and only if } \quad L\left(\mathbf{v}_1-\mathbf{v}_2\right) = \mathbf{0}.

From this, it follows by the first isomorphism theorem that the image of L is isomorphic to the quotient of V by the kernel:

\operatorname{im}(L) \cong V / \ker(L).

In the case where V is finite-dimensional, this implies the rank–nullity theorem:

\dim(\ker L) + \dim(\operatorname{im} L) = \dim(V),

where the term rank refers to the dimension of the image of L,

\dim(\operatorname{im} L),

while nullity refers to the dimension of the kernel of L,

\dim(\ker L).

That is,

\operatorname{Rank}(L) = \dim(\operatorname{im} L) \qquad \text{ and } \qquad \operatorname{Nullity}(L) = \dim(\ker L),

so that the rank–nullity theorem can be restated as

\operatorname{Rank}(L) + \operatorname{Nullity}(L) = \dim \left(\operatorname{domain} L\right).
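As a quick check of the theorem, consider a projection of three-dimensional real space onto a coordinate plane. The map below is chosen purely for illustration and does not come from the sources cited in this article.

```latex
\begin{align}
L &: \mathbb{R}^3 \to \mathbb{R}^3, \qquad L(x, y, z) = (x, y, 0), \\
\ker(L) &= \{ (0, 0, z) \mid z \in \mathbb{R} \}, \qquad \operatorname{im}(L) = \{ (x, y, 0) \mid x, y \in \mathbb{R} \}, \\
\operatorname{Rank}(L) + \operatorname{Nullity}(L) &= 2 + 1 = 3 = \dim(\mathbb{R}^3).
\end{align}
```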

When V is an inner product space, the quotient V / ker(L) can be identified with the orthogonal complement in V of ker(L). This is the generalization to linear operators of the row space, or coimage, of a matrix.

Generalization to modules

The notion of kernel also makes sense for homomorphisms of modules, which are generalizations of vector spaces where the scalars are elements of a ring, rather than a field. The domain of the mapping is a module, with the kernel constituting a submodule. Here, the concepts of rank and nullity do not necessarily apply.
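A small example may help here. It is chosen for illustration only and is not taken from the sources: regard the integers and the integers modulo 2 as modules over the ring of integers, and take the reduction map f.

```latex
f : \mathbb{Z} \to \mathbb{Z}/2\mathbb{Z}, \qquad f(n) = n \bmod 2, \qquad \ker(f) = 2\mathbb{Z}.
```

The kernel consists of the even integers, which form a submodule of the domain.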

In functional analysis

If V and W are topological vector spaces such that W is finite-dimensional, then a linear operator L : V → W is continuous if and only if the kernel of L is a closed subspace of V.

Representation as matrix multiplication

Consider a linear map represented as an m × n matrix A with coefficients in a field K (typically \mathbb{R} or \mathbb{C}), that is operating on column vectors x with n components over K. The kernel of this linear map is the set of solutions to the equation Ax = 0, where 0 is understood as the zero vector. The dimension of the kernel of A is called the nullity of A. In set-builder notation,

\operatorname{N}(A) = \operatorname{Null}(A) = \operatorname{ker}(A) = \left\{ \mathbf{x}\in K^n \mid A\mathbf{x} = \mathbf{0} \right\}.

The matrix equation is equivalent to a homogeneous system of linear equations:

A\mathbf{x}=\mathbf{0} \;\;\Leftrightarrow\;\;
\begin{alignat}{7}
a_{11} x_1 &&\; + \;&& a_{12} x_2 &&\; + \;\cdots\; + \;&& a_{1n} x_n &&\; = \;&&& 0      \\
a_{21} x_1 &&\; + \;&& a_{22} x_2 &&\; + \;\cdots\; + \;&& a_{2n} x_n &&\; = \;&&& 0      \\
           &&       &&            &&                    &&            &&\vdots\ \;&&&    \\
a_{m1} x_1 &&\; + \;&& a_{m2} x_2 &&\; + \;\cdots\; + \;&& a_{mn} x_n &&\; = \;&&& 0\text{.}      \\
\end{alignat}

Thus the kernel of A is the same as the solution set to the above homogeneous equations.

Subspace properties

The kernel of an m × n matrix A over a field K is a linear subspace of K^n. That is, the kernel of A, the set Null(A), has the following three properties:

1. Null(A) always contains the zero vector, since A0 = 0.
2. If x ∈ Null(A) and y ∈ Null(A), then x + y ∈ Null(A). This follows from the distributivity of matrix multiplication over addition.
3. If x ∈ Null(A) and c is a scalar c ∈ K, then cx ∈ Null(A), since A(cx) = c(Ax) = c0 = 0.
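The three properties can be spot-checked numerically on a single matrix. The sketch below is only an illustration: the helper `matvec`, the matrix, the kernel vectors, and the scalar are all assumptions made for the example, and checking a few vectors is of course not a proof.

```python
# Spot-check the three subspace properties on one concrete matrix.
# The helper name, the matrix and the sample vectors are illustrative choices.

def matvec(A, x):
    """Return the matrix-vector product A x as a list."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]

A = [[2, 3, 5],
     [-4, 2, 3]]

zero = [0, 0, 0]
x = [-1, -26, 16]    # a vector with A x = 0
y = [2, 52, -32]     # y = -2 x, also in the kernel
c = 7                # an arbitrary scalar

x_plus_y = [xi + yi for xi, yi in zip(x, y)]
c_times_x = [c * xi for xi in x]

print(matvec(A, zero))       # property 1: the zero vector lies in the kernel
print(matvec(A, x_plus_y))   # property 2: closed under addition
print(matvec(A, c_times_x))  # property 3: closed under scalar multiplication
```

Each line prints `[0, 0]`, the zero vector of the codomain.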

The row space of a matrix

The product Ax can be written in terms of the dot product of vectors as follows:

A\mathbf{x} = \begin{bmatrix} \mathbf{a}_1 \cdot \mathbf{x} \\ \mathbf{a}_2 \cdot \mathbf{x} \\ \vdots \\ \mathbf{a}_m \cdot \mathbf{x} \end{bmatrix}.

Here, a_1, ..., a_m denote the rows of the matrix A. It follows that x is in the kernel of A if and only if x is orthogonal (or perpendicular) to each of the row vectors of A (since orthogonality is defined as having a dot product of 0).

The row space, or coimage, of a matrix A is the span of the row vectors of A. By the above reasoning, the kernel of A is the orthogonal complement to the row space. That is, a vector x lies in the kernel of A if and only if it is perpendicular to every vector in the row space of A.

The dimension of the row space of A is called the rank of A, and the dimension of the kernel of A is called the nullity of A. These quantities are related by the rank–nullity theorem

\operatorname{rank}(A) + \operatorname{nullity}(A) = n.

Left null space

The left null space, or cokernel, of a matrix A consists of all column vectors x such that x^T A = 0^T, where T denotes the transpose of a matrix. The left null space of A is the same as the kernel of A^T. The left null space of A is the orthogonal complement to the column space of A, and is dual to the cokernel of the associated linear transformation. The kernel, the row space, the column space, and the left null space of A are the four fundamental subspaces associated with the matrix A.
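The relations between the left null space, the kernel of the transpose, and the column space can be checked on a small example. This is a minimal sketch under stated assumptions: the 3 × 2 matrix, the two candidate vectors, and the helper names `vecmat`, `matvec` and `transpose` are hypothetical and chosen only for illustration.

```python
# Verify, on one example, that vectors y with y^T A = 0 are exactly
# vectors in the kernel of A^T, and that they are orthogonal to the columns of A.
# Matrix, vectors and helper names are illustrative assumptions.

def matvec(A, x):
    """Return A x as a list."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]

def vecmat(y, A):
    """Return the row vector y^T A as a list."""
    return [sum(y[i] * A[i][j] for i in range(len(A))) for j in range(len(A[0]))]

def transpose(A):
    """Return the transpose of A as a list of rows."""
    return [list(col) for col in zip(*A)]

A = [[1, 2],
     [2, 4],
     [3, 6]]          # rank 1: the second column is twice the first

left_null_vectors = [[-2, 1, 0], [-3, 0, 1]]

for y in left_null_vectors:
    print(vecmat(y, A), matvec(transpose(A), y))   # both are zero vectors

# Orthogonality to the column space: dot product with each column of A.
columns = transpose(A)
dots = [[sum(yi * ci for yi, ci in zip(y, col)) for col in columns]
        for y in left_null_vectors]
print(dots)
```

Here the column space is one-dimensional inside a three-dimensional space, so the left null space has dimension two, which matches the two independent vectors used above.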

Nonhomogeneous systems of linear equations

The kernel also plays a role in the solution to a nonhomogeneous system of linear equations:

A\mathbf{x} = \mathbf{b}\quad \text{or} \quad \begin{alignat}{7}
a_{11} x_1 &&\; + \;&& a_{12} x_2 &&\; + \;\cdots\; + \;&& a_{1n} x_n &&\; = \;&&& b_1      \\
a_{21} x_1 &&\; + \;&& a_{22} x_2 &&\; + \;\cdots\; + \;&& a_{2n} x_n &&\; = \;&&& b_2      \\
           &&       &&            &&                    &&            &&\vdots\ \;&&&       \\
a_{m1} x_1 &&\; + \;&& a_{m2} x_2 &&\; + \;\cdots\; + \;&& a_{mn} x_n &&\; = \;&&& b_m      \\
\end{alignat}

If u and v are two possible solutions to the above equation, then

A(\mathbf{u} - \mathbf{v}) = A\mathbf{u} - A\mathbf{v} = \mathbf{b} - \mathbf{b} = \mathbf{0}

Thus, the difference of any two solutions to the equation Ax = b lies in the kernel of A. It follows that any solution to the equation Ax = b can be expressed as the sum of a fixed solution v and an arbitrary element of the kernel. That is, the solution set to the equation Ax = b is

\left\{ \mathbf{v}+\mathbf{x} \mid A \mathbf{v}=\mathbf{b} \land \mathbf{x}\in\operatorname{Null}(A) \right\}.

Geometrically, this says that the solution set to Ax = b is the translation of the kernel of A by the vector v. See also Fredholm alternative and flat (geometry).
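The description of the solution set as a translated kernel can be tried out directly. In this sketch the matrix, the particular solution `v`, the kernel vector `k`, and the helper `matvec` are assumptions made for the example; the right-hand side `b` is simply computed from `v`.

```python
# Every vector v + c*k, with v a fixed solution of A x = b and k in the kernel,
# solves the same system. All concrete values here are illustrative.

def matvec(A, x):
    """Return A x as a list."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]

A = [[2, 3, 5],
     [-4, 2, 3]]

v = [1, 1, 1]            # a particular solution, chosen first
b = matvec(A, v)         # the right-hand side it solves: [10, 1]
k = [-1, -26, 16]        # a kernel vector: A k = 0

solutions = []
for c in (-2, 0, 3):
    x = [vi + c * ki for vi, ki in zip(v, k)]
    solutions.append(x)
    print(x, matvec(A, x))   # the second list is always b

# Conversely, the difference of two solutions lies in the kernel.
diff = [p - q for p, q in zip(solutions[0], solutions[2])]
print(matvec(A, diff))
```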

Illustration

The following is a simple illustration of the computation of the kernel of a matrix (see Computation by Gaussian elimination, below, for methods better suited to more complex calculations). The illustration also touches on the row space and its relation to the kernel. Consider the matrix

A = \begin{bmatrix} 2 & 3 & 5 \\ -4 & 2 & 3 \end{bmatrix}.

The kernel of this matrix consists of all vectors (x, y, z) in R^3 for which

\begin{bmatrix} 2 & 3 & 5 \\ -4 & 2 & 3 \end{bmatrix} \begin{bmatrix} x \\ y \\ z \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \end{bmatrix},

which can be expressed as a homogeneous system of linear equations involving x, y, and z:

\begin{align}
 2x + 3y + 5z &= 0, \\
-4x + 2y + 3z &= 0.
\end{align}

The same linear equations can also be written in matrix form as:

\left[\begin{array}{ccc|c}
    2 & 3 & 5 & 0 \\
    -4 & 2 & 3 & 0
  \end{array}\right]

Through Gauss–Jordan elimination, the matrix can be reduced to:

\left[\begin{array}{ccc|c}
    1 & 0 & 1/16 & 0 \\
    0 & 1 & 13/8 & 0
  \end{array}\right]

Rewriting the matrix in equation form yields:

\begin{align}
x &= -\frac{1}{16}z \\
y &= -\frac{13}{8}z.
\end{align}

The elements of the kernel can be further expressed in parametric vector form, as follows:

\begin{bmatrix} x \\ y \\ z\end{bmatrix} = c \begin{bmatrix} -1/16 \\ -13/8 \\ 1\end{bmatrix}\quad (\text{where }c \in \mathbb{R})

Since c is a free variable ranging over all real numbers, this can be expressed equally well as:

\begin{bmatrix} x \\ y \\ z \end{bmatrix}
 = c \begin{bmatrix} -1 \\ -26 \\ 16 \end{bmatrix}.

The kernel of A is precisely the solution set to these equations (in this case, a line through the origin in R^3). Here, the vector (-1, -26, 16)^T constitutes a basis of the kernel of A. The nullity of A is therefore 1, as the kernel is spanned by a single vector. The following dot products are zero:

\begin{bmatrix} 2 & 3 & 5 \end{bmatrix}
 \begin{bmatrix} -1 \\ -26 \\ 16 \end{bmatrix}
= 0
\quad\mathrm{and}\quad
 \begin{bmatrix} -4 & 2 & 3 \end{bmatrix}
 \begin{bmatrix} -1 \\ -26 \\ 16 \end{bmatrix}
= 0 ,

which illustrates that vectors in the kernel of A are orthogonal to each of the row vectors of A. These two (linearly independent) row vectors span the row space of A, a plane orthogonal to the vector (-1, -26, 16)^T. With the rank 2 of A, the nullity 1 of A, and the number of columns 3 of A, we have an illustration of the rank–nullity theorem.
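The hand computation above can be reproduced in exact rational arithmetic. The following is a minimal sketch, not a reference implementation: the function name `null_space` is hypothetical, and it relies only on `fractions.Fraction` from the Python standard library so that no rounding occurs.

```python
from fractions import Fraction

def null_space(A):
    """Return a kernel basis of A (a list of rows) via exact Gauss-Jordan elimination."""
    rows = [[Fraction(x) for x in row] for row in A]
    m, n = len(rows), len(rows[0])
    pivot_cols = []
    r = 0                                   # index of the next pivot row
    for c in range(n):
        if r == m:
            break
        # Look for a row at or below r with a nonzero entry in column c.
        p = next((i for i in range(r, m) if rows[i][c] != 0), None)
        if p is None:
            continue                        # column c is a free column
        rows[r], rows[p] = rows[p], rows[r]
        pivot = rows[r][c]
        rows[r] = [x / pivot for x in rows[r]]
        for i in range(m):
            if i != r and rows[i][c] != 0:
                f = rows[i][c]
                rows[i] = [a - f * b for a, b in zip(rows[i], rows[r])]
        pivot_cols.append(c)
        r += 1
    # One basis vector per free column: set that variable to 1, solve for the pivots.
    basis = []
    for fc in (c for c in range(n) if c not in pivot_cols):
        v = [Fraction(0)] * n
        v[fc] = Fraction(1)
        for i, pc in enumerate(pivot_cols):
            v[pc] = -rows[i][fc]
        basis.append(v)
    return basis

A = [[2, 3, 5],
     [-4, 2, 3]]
basis = null_space(A)
print([str(x) for x in basis[0]])   # ['-1/16', '-13/8', '1']

rank = len(A[0]) - len(basis)       # number of pivot columns
print(rank, len(basis))             # 2 1
```

Multiplying the basis vector by 16 gives (-1, -26, 16), the vector used in the text.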

Examples

* If L : R^m → R^n, then the kernel of L is the solution set to a homogeneous system of linear equations. As in the above illustration, if L is the operator:

L(x_1,  x_2, x_3) = (2 x_1 + 3 x_2 + 5 x_3,\; - 4 x_1 + 2 x_2 + 3 x_3)

then the kernel of L is the set of solutions to the equations

\begin{alignat}{7}
    2x_1 &\;+\;& 3x_2 &\;+\;& 5x_3 &\;=\;& 0 \\
   -4x_1 &\;+\;& 2x_2 &\;+\;& 3x_3 &\;=\;& 0
\end{alignat}

* Let C[0,1] denote the vector space of all continuous real-valued functions on the interval [0,1], and define L : C[0,1] → R by the rule

L(f) = f(0.3).

Then the kernel of L consists of all functions f in C[0,1] for which f(0.3) = 0.

* Let C^∞(R) be the vector space of all infinitely differentiable functions R → R, and let D : C^∞(R) → C^∞(R) be the differentiation operator:

D(f) = \frac{df}{dx}.

Then the kernel of D consists of all functions in C^∞(R) whose derivatives are zero, i.e. the set of all constant functions.

* Let R^∞ be the direct product of infinitely many copies of R, and let s : R^∞ → R^∞ be the shift operator

s(x_1, x_2, x_3, x_4, \ldots) = (x_2, x_3, x_4, \ldots).

Then the kernel of s is the one-dimensional subspace consisting of all vectors (x_1, 0, 0, 0, ...).

* If V is an inner product space and W is a subspace, the kernel of the orthogonal projection V → W is the orthogonal complement to W in V.

Computation by Gaussian elimination

A basis of the kernel of a matrix may be computed by Gaussian elimination. For this purpose, given an m × n matrix A, we first construct the row augmented matrix

\begin{bmatrix} A \\ \hline I \end{bmatrix},

where I is the n × n identity matrix. Computing its column echelon form by Gaussian elimination (or any other suitable method), we get a matrix

\begin{bmatrix} B \\ \hline C \end{bmatrix}.

A basis of the kernel of A consists of the non-zero columns of C such that the corresponding column of B is a zero column. In fact, the computation may be stopped as soon as the upper matrix is in column echelon form: the remainder of the computation consists in changing the basis of the vector space generated by the columns whose upper part is zero. For example, suppose that

A = \begin{bmatrix}
1 & 0 & -3 & 0 &  2 & -8 \\
0 & 1 &  5 & 0 & -1 & 4 \\
0 & 0 &  0 & 1 & 7 & -9 \\
0 & 0 & 0 & 0 & 0 & 0
\end{bmatrix}.

Then

\begin{bmatrix} A \\ \hline I \end{bmatrix} =
\left[\begin{array}{cccccc}
1 & 0 & -3 & 0 &  2 & -8 \\
0 & 1 &  5 & 0 & -1 & 4 \\
0 & 0 &  0 & 1 & 7 & -9 \\
0 & 0 & 0 & 0 & 0 & 0 \\
\hline
1 & 0 & 0 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 0 & 1
\end{array}\right].

Putting the upper part in column echelon form by column operations on the whole matrix gives

\begin{bmatrix} B \\ \hline C \end{bmatrix} =
\left[\begin{array}{cccccc}
1 & 0 &  0 & 0 &  0 & 0 \\
0 & 1 &  0 & 0 &  0 & 0 \\
0 & 0 &  1 & 0 &  0 & 0 \\
0 & 0 &  0 & 0 &  0 & 0 \\
\hline
1 & 0 &  0 & 3 & -2 & 8 \\
0 & 1 &  0 & -5 & 1 & -4 \\
0 & 0 &  0 & 1 & 0 & 0 \\
0 & 0 &  1 & 0 & -7 & 9 \\
0 & 0 &  0 & 0 & 1 & 0 \\
0 & 0 &  0 & 0 & 0 & 1
\end{array}\right].

The last three columns of B are zero columns. Therefore, the three last vectors of C,

\left[\!\! \begin{array}{r} 3 \\ -5 \\ 1 \\ 0 \\ 0 \\ 0 \end{array} \right],\;
\left[\!\! \begin{array}{r} -2 \\ 1 \\ 0 \\ -7 \\ 1 \\ 0 \end{array} \right],\;
\left[\!\! \begin{array}{r} 8 \\ -4 \\ 0 \\ 9 \\ 0 \\ 1 \end{array} \right]

are a basis of the kernel of A.

Proof that the method computes the kernel: Since column operations correspond to post-multiplication by invertible matrices, the fact that

\begin{bmatrix} A \\ \hline I \end{bmatrix}

reduces to

\begin{bmatrix} B \\ \hline C \end{bmatrix}

means that there exists an invertible matrix P such that

\begin{bmatrix} A \\ \hline I \end{bmatrix} P = \begin{bmatrix} B \\ \hline C \end{bmatrix},

with B in column echelon form. Thus AP = B, IP = C, and AC = B. A column vector v belongs to the kernel of A (that is, Av = 0) if and only if Bw = 0, where w = P^{-1}v = C^{-1}v. As B is in column echelon form, Bw = 0 if and only if the nonzero entries of w correspond to the zero columns of B. By multiplying by C, one may deduce that this is the case if and only if v = Cw is a linear combination of the corresponding columns of C.
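The procedure described in this section can be sketched in a few lines of exact arithmetic. The code below is an illustration under stated assumptions rather than a definitive implementation: the function name `kernel_by_column_reduction` is hypothetical, the stacked matrix is stored column by column for convenience, and it reduces all the way to a reduced column echelon form even though, as noted above, the computation could stop earlier.

```python
from fractions import Fraction

def kernel_by_column_reduction(A):
    """Kernel basis of A from the column echelon form of the stacked matrix [A; I]."""
    m, n = len(A), len(A[0])
    # Column j of [A; I]: the m entries of A's column j, then column j of the identity.
    cols = [[Fraction(A[i][j]) for i in range(m)] +
            [Fraction(1 if i == j else 0) for i in range(n)]
            for j in range(n)]
    pivot = 0                                # position of the next pivot column
    for r in range(m):
        if pivot == n:
            break
        # Find a column at or after `pivot` with a nonzero entry in row r of the upper block.
        j = next((k for k in range(pivot, n) if cols[k][r] != 0), None)
        if j is None:
            continue
        cols[pivot], cols[j] = cols[j], cols[pivot]
        p = cols[pivot][r]
        cols[pivot] = [x / p for x in cols[pivot]]
        # Column operations act on the whole stacked column (upper and lower part).
        for k in range(n):
            if k != pivot and cols[k][r] != 0:
                f = cols[k][r]
                cols[k] = [a - f * b for a, b in zip(cols[k], cols[pivot])]
        pivot += 1
    # Columns whose upper part (in B) is zero: their lower part (in C) is a kernel vector.
    return [col[m:] for col in cols if all(x == 0 for x in col[:m])]

A = [[1, 0, -3, 0,  2, -8],
     [0, 1,  5, 0, -1,  4],
     [0, 0,  0, 1,  7, -9],
     [0, 0,  0, 0,  0,  0]]

basis = kernel_by_column_reduction(A)
for v in basis:
    print([int(x) for x in v])
```

For the example matrix of this section, the sketch returns the same three vectors as the worked computation above.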

Numerical computation

The problem of computing the kernel on a computer depends on the nature of the coefficients.

Exact coefficients

If the coefficients of the matrix are exactly given numbers, the column echelon form of the matrix may be computed with the Bareiss algorithm more efficiently than with Gaussian elimination. It is even more efficient to use modular arithmetic and the Chinese remainder theorem, which reduces the problem to several similar ones over finite fields (this avoids the overhead induced by the non-linearity of the computational complexity of integer multiplication).

For coefficients in a finite field, Gaussian elimination works well, but for the large matrices that occur in cryptography and Gröbner basis computation, better algorithms are known, which have roughly the same computational complexity, but are faster and behave better with modern computer hardware.
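To make the finite-field case concrete, here is a minimal sketch of Gaussian elimination over the integers modulo a prime p. It covers only the single-prime step: combining results for several primes with the Chinese remainder theorem is not shown. The function name `kernel_mod_p`, the matrix and the prime 7 are assumptions chosen for the example, and the three-argument `pow` with exponent -1 requires Python 3.8 or later.

```python
def kernel_mod_p(A, p):
    """Kernel basis of an integer matrix A over the field of integers mod p (p prime)."""
    rows = [[x % p for x in row] for row in A]
    m, n = len(rows), len(rows[0])
    pivot_cols = []
    r = 0                                     # index of the next pivot row
    for c in range(n):
        if r == m:
            break
        i0 = next((i for i in range(r, m) if rows[i][c] != 0), None)
        if i0 is None:
            continue                          # free column
        rows[r], rows[i0] = rows[i0], rows[r]
        inv = pow(rows[r][c], -1, p)          # modular inverse of the pivot
        rows[r] = [(x * inv) % p for x in rows[r]]
        for i in range(m):
            if i != r and rows[i][c] != 0:
                f = rows[i][c]
                rows[i] = [(a - f * b) % p for a, b in zip(rows[i], rows[r])]
        pivot_cols.append(c)
        r += 1
    # One basis vector per free column, with entries reduced mod p.
    basis = []
    for fc in (c for c in range(n) if c not in pivot_cols):
        v = [0] * n
        v[fc] = 1
        for i, pc in enumerate(pivot_cols):
            v[pc] = (-rows[i][fc]) % p
        basis.append(v)
    return basis

A = [[2, 3, 5],
     [-4, 2, 3]]
basis = kernel_mod_p(A, 7)
print(basis)   # [[3, 1, 1]]
```

The result agrees with the rational computation in the Illustration section: reducing (-1, -26, 16) modulo 7 gives (6, 2, 2), which is twice (3, 1, 1).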

Floating point computation

For matrices whose entries are floating-point numbers, the problem of computing the kernel makes sense only for matrices such that the number of rows is equal to their rank: because of rounding errors, a floating-point matrix almost always has full rank, even when it is an approximation of a matrix of much smaller rank. Even for a full-rank matrix, it is possible to compute its kernel only if it is well conditioned, i.e. it has a low condition number.

Even for a well-conditioned full-rank matrix, Gaussian elimination does not behave correctly: it introduces rounding errors that are too large to give a meaningful result. As the computation of the kernel of a matrix is a special instance of solving a homogeneous system of linear equations, the kernel may be computed with any of the various algorithms designed to solve homogeneous systems. State-of-the-art software for this purpose is the LAPACK library.
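The effect of small perturbations on the computed rank can be seen on a tiny example. The sketch below is only a demonstration of the phenomenon, not a recommended algorithm: the function name `rank_by_elimination`, the tolerance parameter `tol`, and the perturbed matrix are all assumptions introduced here, and the naive elimination it uses is exactly the kind of method the text warns against for serious numerical work.

```python
def rank_by_elimination(A, tol):
    """Count pivots found by Gaussian elimination with partial pivoting.

    Entries with absolute value <= tol are treated as zero (an ad hoc rule
    used only for this demonstration).
    """
    rows = [[float(x) for x in row] for row in A]
    m, n = len(rows), len(rows[0])
    r = 0
    for c in range(n):
        if r == m:
            break
        # Partial pivoting: pick the largest entry in column c at or below row r.
        i0 = max(range(r, m), key=lambda i: abs(rows[i][c]))
        if abs(rows[i0][c]) <= tol:
            continue
        rows[r], rows[i0] = rows[i0], rows[r]
        for i in range(r + 1, m):
            f = rows[i][c] / rows[r][c]
            rows[i] = [a - f * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

# A rank-1 matrix whose last entry has been perturbed by about 1e-12.
A = [[1.0, 2.0],
     [2.0, 4.0 + 1e-12]]

print(rank_by_elimination(A, tol=0.0))    # exact zero test: reports full rank
print(rank_by_elimination(A, tol=1e-9))   # with a tolerance: reports rank 1
```

With an exact zero test the perturbed matrix is reported as having rank 2, so its computed kernel is trivial, while a tolerance recovers the rank of the nearby unperturbed matrix. Which tolerance is appropriate depends on the problem and is not settled by this example.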
