linear algebra Linear algebra is the branch of mathematics concerning linear equations such as :a_1x_1+\cdots +a_nx_n=b, linear maps such as :(x_1, \ldots, x_n) \mapsto a_1x_1+\cdots +a_nx_n, and their representations in vector spaces and through matrix (mathemat ...

, Cramer's rule is an explicit formula for the solution of a

system of linear equations In mathematics, a system of linear equations (or linear system) is a collection of two or more linear equations involving the same variable (math), variables. For example, : \begin 3x+2y-z=1\\ 2x-2y+4z=-2\\ -x+\fracy-z=0 \end is a system of th ...

with as many equations as unknowns, valid whenever the system has a unique solution. It expresses the solution in terms of the

determinant In mathematics, the determinant is a Scalar (mathematics), scalar-valued function (mathematics), function of the entries of a square matrix. The determinant of a matrix is commonly denoted , , or . Its value characterizes some properties of the ...

s of the (square)

coefficient matrix In linear algebra, a coefficient matrix is a matrix consisting of the coefficients of the variables in a set of linear equations. The matrix is used in solving systems of linear equations. Coefficient matrix In general, a system with linear ...

and of matrices obtained from it by replacing one column by the column vector of right-sides of the equations. It is named after Gabriel Cramer, who published the rule for an arbitrary number of unknowns in 1750, although Colin Maclaurin also published special cases of the rule in 1748, and possibly knew of it as early as 1729. Cramer's rule, implemented in a naive way, is computationally inefficient for systems of more than two or three equations. In the case of equations in unknowns, it requires computation of determinants, while

Gaussian elimination In mathematics, Gaussian elimination, also known as row reduction, is an algorithm for solving systems of linear equations. It consists of a sequence of row-wise operations performed on the corresponding matrix of coefficients. This method can a ...

produces the result with the same (up to a constant factor independent of ) computational complexity as the computation of a single determinant. Moreover, Bareiss algorithm is a simple modification of Gaussian elimination that produces in a single computation a matrix whose nonzero entries are the determinants involved in Cramer's rule.

General case

Consider a system of linear equations for unknowns, represented in matrix multiplication form as follows: :

A\mathbf = \mathbf

where the matrix has a nonzero determinant, and the vector

\mathbf = (x_1, \ldots, x_n)^\mathsf

is the column vector of the variables. Then the theorem states that in this case the system has a unique solution, whose individual values for the unknowns are given by: :

x_i = \frac \qquad i = 1, \ldots, n

where

A_i

is the matrix formed by replacing the -th column of by the column vector . A more general version of Cramer's rule considers the matrix equation :

AX = B

where the matrix has a nonzero determinant, and , are matrices. Given sequences

1 \leq i_1 < i_2 < \cdots < i_k \leq n

and

1 \leq j_1 < j_2 < \cdots < j_k \leq m

, let

X_

be the submatrix of with rows in

I := (i_1, \ldots, i_k )

and columns in

J := (j_1, \ldots, j_k )

. Let

A_(I,J)

be the matrix formed by replacing the

i_s

column of by the

j_s

column of , for all

s = 1,\ldots, k

. Then :

\det X_ = \frac.

In the case

k = 1

, this reduces to the normal Cramer's rule. The rule holds for systems of equations with coefficients and unknowns in any field, not just in the

real number In mathematics, a real number is a number that can be used to measure a continuous one- dimensional quantity such as a duration or temperature. Here, ''continuous'' means that pairs of values can have arbitrarily small differences. Every re ...

Proof

The proof for Cramer's rule uses the following properties of the determinants: linearity with respect to any given column and the fact that the determinant is zero whenever two columns are equal, which is implied by the property that the sign of the determinant flips if you switch two columns. Fix the index of a column, and consider that the entries of the other columns have fixed values. This makes the determinant a function of the entries of the th column. Linearity with respect to this column means that this function has the form :

D_j(a_, \ldots, a_)= C_a_+\cdots, C_a_,

where the

C_

are coefficients that depend on the entries of that are not in column . So, one has :

\det(A)=D_j(a_, \ldots, a_)=C_a_+\cdots, C_a_

( Laplace expansion provides a formula for computing the

C_

but their expression is not important here.) If the function

D_j

is applied to any ''other'' column of , then the result is the determinant of the matrix obtained from by replacing column by a copy of column , so the resulting determinant is 0 (the case of two equal columns). Now consider a system of linear equations in unknowns

x_1, \ldots,x_n

, whose coefficient matrix is , with det(''A'') assumed to be nonzero: :

\begin
a_x_1+a_x_2+\cdots+a_x_n&=&b_1\\
a_x_1+a_x_2+\cdots+a_x_n&=&b_2\\
&\vdots&\\
a_x_1+a_x_2+\cdots+a_x_n&=&b_n.
\end

If one combines these equations by taking times the first equation, plus times the second, and so forth until times the last, then for every the resulting coefficient of becomes :

D_j(a_,\ldots,a_).

So, all coefficients become zero, except the coefficient of

x_j

that becomes

\det(A).

Similarly, the constant coefficient becomes

D_j(b_1,\ldots,b_n),

and the resulting equation is thus :

\det(A)x_j=D_j(b_1,\ldots, b_n),

which gives the value of

x_j

as :

x_j=\frac1D_j(b_1,\ldots, b_n).

As, by construction, the numerator is the determinant of the matrix obtained from by replacing column by , we get the expression of Cramer's rule as a necessary condition for a solution. It remains to prove that these values for the unknowns form a solution. Let be the matrix that has the coefficients of

D_j

as th row, for

j=1,\ldots,n

(this is the adjugate matrix for ). Expressed in matrix terms, we have thus to prove that :

\mathbf x = \frac1M\mathbf b

is a solution; that is, that :

A\left(\frac1M\right)\mathbf b=\mathbf b.

For that, it suffices to prove that :

A\,\left(\frac1M\right)=I_n,

where

I_n

is the

identity matrix In linear algebra, the identity matrix of size n is the n\times n square matrix with ones on the main diagonal and zeros elsewhere. It has unique properties, for example when the identity matrix represents a geometric transformation, the obje ...

. The above properties of the functions

D_j

show that one has , and therefore, :

\left(\frac1M\right)\,A=I_n.

This completes the proof, since a left inverse of a square matrix is also a right-inverse (see Invertible matrix theorem). For other proofs, see below.

Finding inverse matrix

Let be an matrix with entries in a field . Then :

A\,\operatorname(A) = \operatorname(A)\,A=\det(A) I

where denotes the adjugate matrix, is the determinant, and is the

. If is nonzero, then the inverse matrix of is :

A^ = \frac \operatorname(A).

This gives a formula for the inverse of , provided . In fact, this formula works whenever is a

commutative ring In mathematics, a commutative ring is a Ring (mathematics), ring in which the multiplication operation is commutative. The study of commutative rings is called commutative algebra. Complementarily, noncommutative algebra is the study of ring prope ...

, provided that is a unit. If is not a unit, then is not invertible over the ring (it may be invertible over a larger ring in which some non-unit elements of may be invertible).

Applications

Explicit formulas for small systems

Consider the linear system :

\left\{\begin{matrix}
a_1x + b_1y&= {\color{red}c_1}\\
a_2x + b_2y&= {\color{red}c_2}
\end{matrix}\right.

which in matrix format is :

\begin{bmatrix} a_1 & b_1 \\ a_2 & b_2 \end{bmatrix}\begin{bmatrix} x \\ y \end{bmatrix}=\begin{bmatrix} {\color{red}c_1} \\ {\color{red}c_2} \end{bmatrix}.

Assume is nonzero. Then, with the help of

s, and can be found with Cramer's rule as :

\begin{align}
x &= \frac{\begin{vmatrix} {\color{red}{c_1 & b_1 \\ {\color{red}{c_2 & b_2 \end{vmatrix{\begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix = { {\color{red}c_1}b_2 - b_1{\color{red}c_2} \over a_1b_2 - b_1a_2}, \quad
y = \frac{\begin{vmatrix} a_1 & {\color{red}{c_1 \\ a_2 & {\color{red}{c_2 \end{vmatrix{\begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix  = { a_1{\color{red}c_2} - {\color{red}c_1}a_2 \over a_1b_2 - b_1a_2}
\end{align}.

The rules for matrices are similar. Given :

\left\{\begin{matrix}
a_1x + b_1y + c_1z&= {\color{red}d_1}\\
a_2x + b_2y + c_2z&= {\color{red}d_2}\\
a_3x + b_3y + c_3z&= {\color{red}d_3}
\end{matrix}\right.

which in matrix format is :

\begin{bmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{bmatrix}\begin{bmatrix} x \\ y \\ z \end{bmatrix}=\begin{bmatrix} {\color{red}d_1} \\ {\color{red}d_2} \\ {\color{red}d_3} \end{bmatrix}.

Then the values of and can be found as follows: :

x = \frac{\begin{vmatrix} {\color{red}d_1} & b_1 & c_1 \\ {\color{red}d_2} & b_2 & c_2 \\ {\color{red}d_3} & b_3 & c_3 \end{vmatrix} } { \begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix, \quad
y = \frac {\begin{vmatrix} a_1 & {\color{red}d_1} & c_1 \\ a_2 & {\color{red}d_2} & c_2 \\ a_3 & {\color{red}d_3} & c_3 \end{vmatrix {\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix, \text{ and }
z = \frac { \begin{vmatrix} a_1 & b_1 & {\color{red}d_1} \\ a_2 & b_2 & {\color{red}d_2} \\ a_3 & b_3 & {\color{red}d_3} \end{vmatrix {\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix} }.

Differential geometry

Ricci calculus

Cramer's rule is used in the Ricci calculus in various calculations involving the Christoffel symbols of the first and second kind. In particular, Cramer's rule can be used to prove that the divergence operator on a

Riemannian manifold In differential geometry, a Riemannian manifold is a geometric space on which many geometric notions such as distance, angles, length, volume, and curvature are defined. Euclidean space, the N-sphere, n-sphere, hyperbolic space, and smooth surf ...

is invariant with respect to change of coordinates. We give a direct proof, suppressing the role of the Christoffel symbols. Let

(M,g)

be a Riemannian manifold equipped with local coordinates

(x^1, x^2, \dots, x^n)

. Let

A=A^i \frac{\partial}{\partial x^i}

be a vector field. We use the summation convention throughout. :Theorem. :''The ''divergence'' of

A

,'' ::

\operatorname{div} A = \frac{1}{\sqrt{\det g \frac{\partial}{\partial x^i} \left( A^i \sqrt{\det g} \right),

:''is invariant under change of coordinates.'' Let

(x^1,x^2,\ldots,x^n)\mapsto (\bar x^1,\ldots,\bar x^n)

be a coordinate transformation with non-singular Jacobian. Then the classical transformation laws imply that

A=\bar A^{k}\frac{\partial}{\partial\bar x^{k

where

\bar A^{k}=\frac{\partial \bar x^{k{\partial x^{jA^{j}

. Similarly, if

g=g_{mk}\,dx^{m}\otimes dx^{k}=\bar{g}_{ij}\,d\bar x^{i}\otimes d\bar x^{j}

, then

\bar{g}_{ij}=\,\frac{\partial x^{m{\partial\bar x^{i\frac{\partial x^{k{\partial \bar x^{jg_{mk}

. Writing this transformation law in terms of matrices yields

\bar g=\left(\frac{\partial x}{\partial\bar{x\right)^{\text{Tg\left(\frac{\partial x}{\partial\bar{x\right)

, which implies

\det\bar g=\left(\det\left(\frac{\partial x}{\partial\bar{x\right)\right)^{2}\det g

. Now one computes :

\begin{align}
\operatorname{div} A &=\frac{1}{\sqrt{\det g\frac{\partial}{\partial x^{i\left( A^{i}\sqrt{\det g}\right)\\
	&=\det\left(\frac{\partial x}{\partial\bar{x\right)\frac{1}{\sqrt{\det\bar g\frac{\partial \bar x^k}{\partial x^{i\frac{\partial}{\partial\bar x^{k\left(\frac{\partial x^{i{\partial \bar x^{\ell\bar{A}^{\ell}\det\!\left(\frac{\partial x}{\partial\bar{x\right)^{\!\!-1}\!\sqrt{\det\bar g}\right).
\end{align}

In order to show that this equals

\frac{1}{\sqrt{\det\bar g\frac{\partial}{\partial\bar x^{k\left(\bar A^{k}\sqrt{\det\bar{g\right)

, it is necessary and sufficient to show that :

\frac{\partial\bar x^{k{\partial x^{i\frac{\partial}{\partial\bar x^{k\left(\frac{\partial x^{i{\partial \bar x^{\ell\det\!\left(\frac{\partial x}{\partial\bar{x\right)^{\!\!\!-1}\right)=0\qquad\text{for all } \ell,

which is equivalent to :

\frac{\partial}{\partial \bar x^{\ell\det\left(\frac{\partial x}{\partial\bar{x\right)
=\det\left(\frac{\partial x}{\partial\bar{x\right)\frac{\partial\bar x^{k{\partial x^{i\frac{\partial^{2}x^{i{\partial\bar x^{k}\partial\bar x^{\ell.

Carrying out the differentiation on the left-hand side, we get: :

\begin{align}
	\frac{\partial}{\partial\bar x^{\ell\det\left(\frac{\partial x}{\partial\bar{x\right)
	&=(-1)^{i+j}\frac{\partial^{2}x^{i{\partial\bar x^{\ell}\partial\bar x^{j\det M(i, j)\\
	&=\frac{\partial^{2}x^{i{\partial\bar x^{\ell}\partial\bar x^{j\det\left(\frac{\partial x}{\partial\bar{x\right)\frac{(-1)^{i+j{\det\left(\frac{\partial x}{\partial\bar{x\right)}\det M(i, j)=(\ast),
	\end{align}

where

M(i, j)

denotes the matrix obtained from

\left(\frac{\partial x}{\partial\bar{x\right)

by deleting the

i

th row and

j

th column. But Cramer's Rule says that :

\frac{(-1)^{i+j{\det\left(\frac{\partial x}{\partial\bar{x\right)}\det M(i, j)

is the

(j,i)

th entry of the matrix

\left(\frac{\partial \bar{x{\partial x}\right)

. Thus :

(\ast)=\det\left(\frac{\partial x}{\partial\bar{x\right)\frac{\partial^{2}x^{i{\partial\bar x^{\ell}\partial\bar x^{j\frac{\partial\bar x^{j{\partial x^{i,

completing the proof.

Computing derivatives implicitly

Consider the two equations

F(x, y, u, v) = 0

and

G(x, y, u, v) = 0

. When ''u'' and ''v'' are independent variables, we can define

x = X(u, v)

and

y = Y(u, v).

An equation for

\dfrac{\partial x}{\partial u}

can be found by applying Cramer's rule. First, calculate the first derivatives of ''F'', ''G'', ''x'', and ''y'': :

dy &= \frac{\partial Y}{\partial u} du + \frac{\partial Y}{\partial v} dv. \end{align}

Substituting ''dx'', ''dy'' into ''dF'' and ''dG'', we have: :

dG &= \left(\frac{\partial G}{\partial x} \frac{\partial x}{\partial u} +\frac{\partial G}{\partial y} \frac{\partial y}{\partial u} +\frac{\partial G}{\partial u} \right) du + \left(\frac{\partial G}{\partial x} \frac{\partial x}{\partial v} +\frac{\partial G}{\partial y} \frac{\partial y}{\partial v} +\frac{\partial G}{\partial v} \right) dv = 0. \end{align}

Since ''u'', ''v'' are both independent, the coefficients of ''du'', ''dv'' must be zero. So we can write out equations for the coefficients: :

\frac{\partial G}{\partial x} \frac{\partial x}{\partial v} +\frac{\partial G}{\partial y} \frac{\partial y}{\partial v} & = -\frac{\partial G}{\partial v}. \end{align}

Now, by Cramer's rule, we see that: :

\frac{\partial x}{\partial u} = \frac{\begin{vmatrix} -\frac{\partial F}{\partial u} & \frac{\partial F}{\partial y} \\ -\frac{\partial G}{\partial u} & \frac{\partial G}{\partial y}\end{vmatrix{\begin{vmatrix}\frac{\partial F}{\partial x} & \frac{\partial F}{\partial y} \\ \frac{\partial G}{\partial x} & \frac{\partial G}{\partial y}\end{vmatrix.

This is now a formula in terms of two Jacobians: :

\frac{\partial x}{\partial u} = -\frac{\left(\frac{\partial (F, G)}{\partial (u, y)}\right)}{\left(\frac{\partial (F, G)}{\partial(x, y)}\right)}.

Similar formulas can be derived for

\frac{\partial x}{\partial v}, \frac{\partial y}{\partial u}, \frac{\partial y}{\partial v}.

Integer programming

Cramer's rule can be used to prove that an integer programming problem whose constraint matrix is totally unimodular and whose right-hand side is integer, has integer basic solutions. This makes the integer program substantially easier to solve.

Ordinary differential equations

Cramer's rule is used to derive the general solution to an inhomogeneous linear differential equation by the method of variation of parameters.

Example

Consider the linear system :

\begin{matrix}
12x + 3y&= 15\\
2x - 3y&= 13
\end{matrix}

Applying Cramer's Rule gives :

\begin{align}
x &= \frac{\begin{vmatrix} 15 & 3 \\ {13} & -3 \end{vmatrix{\begin{vmatrix} 12 & 3 \\ 2 & -3 \end{vmatrix = { -84 \over -42} = {\color{red}2}, \quad
y = \frac{\begin{vmatrix} 12 & 15 \\ 2 & {13} \end{vmatrix{\begin{vmatrix} 12 & 3 \\ 2 & -3 \end{vmatrix  = -{ 126 \over 42} = {\color{red}-3}
\end{align}.

These values can be verified by substituting back into the original equations:

12x + 3y = (12 \times {\color{red}2}) + (3 \times ({\color{red}-3})) = 24 - 9 = 15

and

2x - 3y = (2 \times {\color{red}2}) - (3 \times ({\color{red}-3})) = 4 - (-9) = 13,

as required.

Geometric interpretation

Cramer's rule has a geometric interpretation that can be considered also a proof or simply giving insight about its geometric nature. These geometric arguments work in general and not only in the case of two equations with two unknowns presented here. Given the system of equations :

\begin{matrix}a_{11}x_1+a_{12}x_2&=b_1\\a_{21}x_1+a_{22}x_2&=b_2\end{matrix}

it can be considered as an equation between vectors :

x_1\binom{a_{11{a_{21+x_2\binom{a_{12{a_{22=\binom{b_1}{b_2}.

The area of the parallelogram determined by

\binom{a_{11{a_{21

and

\binom{a_{12{a_{22

is given by the determinant of the system of equations: :

\begin{vmatrix}a_{11}&a_{12}\\a_{21}&a_{22}\end{vmatrix}.

In general, when there are more variables and equations, the determinant of vectors of length will give the ''volume'' of the ''

parallelepiped In geometry, a parallelepiped is a three-dimensional figure formed by six parallelograms (the term ''rhomboid'' is also sometimes used with this meaning). By analogy, it relates to a parallelogram just as a cube relates to a square. Three equiva ...

'' determined by those vectors in the -th dimensional

Euclidean space Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, in Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics there are ''Euclidean spaces ...

. Therefore, the area of the parallelogram determined by

x_1\binom{a_{11{a_{21

and

\binom{a_{12{a_{22

has to be

x_1

times the area of the first one since one of the sides has been multiplied by this factor. Now, this last parallelogram, by Cavalieri's principle, has the same area as the parallelogram determined by

\binom{b_1}{b_2}=x_1\binom{a_{11{a_{21+x_2\binom{a_{12{a_{22

and

\binom{a_{12{a_{22.

Equating the areas of this last and the second parallelogram gives the equation :

\begin{vmatrix}b_1&a_{12}\\b_2&a_{22}\end{vmatrix} = \begin{vmatrix}a_{11}x_1&a_{12}\\a_{21}x_1&a_{22}\end{vmatrix} =x_1 \begin{vmatrix}a_{11}&a_{12}\\a_{21}&a_{22}\end{vmatrix}

from which Cramer's rule follows.

Other proofs

A proof by abstract linear algebra

This is a restatement of the proof above in abstract language. Consider the map

\mathbf{x}=(x_1,\ldots, x_n) \mapsto  \frac{1}{\det A} \left(\det (A_1),\ldots, \det(A_n)\right),

where

A_i

is the matrix

A

with

\mathbf{x}

substituted in the

i

th column, as in Cramer's rule. Because of linearity of determinant in every column, this map is linear. Observe that it sends the

i

th column of

A

to the

i

th basis vector

\mathbf{e}_i=(0,\ldots, 1, \ldots, 0)

(with 1 in the

i

th place), because determinant of a matrix with a repeated column is 0. So we have a linear map which agrees with the inverse of

A

on the column space; hence it agrees with

A^{-1}

on the span of the column space. Since

A

is invertible, the column vectors span all of

\mathbb{R}^n

, so our map really is the inverse of

A

. Cramer's rule follows.

A short proof

A short proof of Cramer's rule can be given by noticing that

x_1

is the determinant of the matrix :

X_1=\begin{bmatrix}
x_1 & 0 & 0 & \cdots & 0\\
x_2 & 1 & 0 & \cdots & 0\\
x_3 & 0 & 1 & \cdots & 0\\
\vdots & \vdots & \vdots & \ddots &\vdots \\
x_n & 0 & 0 & \cdots & 1
\end{bmatrix}

On the other hand, assuming that our original matrix is invertible, this matrix

X_1

has columns

A^{-1}\mathbf{b}, A^{-1}\mathbf{v}_2, \ldots, A^{-1}\mathbf{v}_n

, where

\mathbf{v}_n

is the ''n''-th column of the matrix . Recall that the matrix

A_1

has columns

\mathbf{b}, \mathbf{v}_2, \ldots, \mathbf{v}_n

, and therefore

X_1=A^{-1}A_1

. Hence, by using that the determinant of the product of two matrices is the product of the determinants, we have :

x_1= \det (X_1) = \det (A^{-1}) \det (A_1)= \frac{\det (A_1)}{\det (A)}.

The proof for other

x_j

is similar.

Using Geometric Algebra

Inconsistent and indeterminate cases

A system of equations is said to be inconsistent when there are no solutions and it is called indeterminate when there is more than one solution. For linear equations, an indeterminate system will have infinitely many solutions (if it is over an infinite field), since the solutions can be expressed in terms of one or more parameters that can take arbitrary values. Cramer's rule applies to the case where the coefficient determinant is nonzero. In the 2×2 case, if the coefficient determinant is zero, then the system is inconsistent if the numerator determinants are nonzero, or indeterminate if the numerator determinants are zero. For 3×3 or higher systems, the only thing one can say when the coefficient determinant equals zero is that if any of the numerator determinants are nonzero, then the system must be inconsistent. However, having all determinants zero does not imply that the system is indeterminate. A simple example where all determinants vanish (equal zero) but the system is still inconsistent is the 3×3 system ''x''+''y''+''z''=1, ''x''+''y''+''z''=2, ''x''+''y''+''z''=3.

References

External links

Proof of Cramer's Rule

WebApp descriptively solving systems of linear equations with Cramer's Rule

Online Calculator of System of linear equations

{{DEFAULTSORT:Cramer's Rule Theorems in linear algebra Determinants 1750 in science

General case

Proof

Finding inverse matrix

Applications

Explicit formulas for small systems

Differential geometry

Ricci calculus

Computing derivatives implicitly

Integer programming

Ordinary differential equations

Example

Geometric interpretation

Other proofs

A proof by abstract linear algebra

A short proof

Using Geometric Algebra

Inconsistent and indeterminate cases

See also

References

External links