linear algebra Linear algebra is the branch of mathematics concerning linear equations such as :a_1x_1+\cdots +a_nx_n=b, linear maps such as :(x_1, \ldots, x_n) \mapsto a_1x_1+\cdots +a_nx_n, and their representations in vector spaces and through matrix (mathemat ...

, two ''n''-by-''n''

matrices Matrix (: matrices or matrixes) or MATRIX may refer to: Science and mathematics * Matrix (mathematics), a rectangular array of numbers, symbols or expressions * Matrix (logic), part of a formula in prenex normal form * Matrix (biology), the ...

and are called similar if there exists an

invertible In mathematics, the concept of an inverse element generalises the concepts of opposite () and reciprocal () of numbers. Given an operation denoted here , and an identity element denoted , if , one says that is a left inverse of , and that ...

''n''-by-''n'' matrix such that

B = P^ A P .

Similar matrices represent the same

linear map In mathematics, and more specifically in linear algebra, a linear map (also called a linear mapping, linear transformation, vector space homomorphism, or in some contexts linear function) is a mapping V \to W between two vector spaces that p ...

under two possibly different bases, with being the change-of-basis matrix. A transformation is called a similarity transformation or conjugation of the matrix . In the

general linear group In mathematics, the general linear group of degree n is the set of n\times n invertible matrices, together with the operation of ordinary matrix multiplication. This forms a group, because the product of two invertible matrices is again inve ...

, similarity is therefore the same as conjugacy, and similar matrices are also called conjugate; however, in a given

subgroup In group theory, a branch of mathematics, a subset of a group G is a subgroup of G if the members of that subset form a group with respect to the group operation in G. Formally, given a group (mathematics), group under a binary operation ...

of the general linear group, the notion of conjugacy may be more restrictive than similarity, since it requires that be chosen to lie in .

Motivating example

When defining a linear transformation, it can be the case that a change of basis can result in a simpler form of the same transformation. For example, the matrix representing a rotation in when the

axis of rotation Rotation or rotational/rotary motion is the circular movement of an object around a central line, known as an ''axis of rotation''. A plane figure can rotate in either a clockwise or counterclockwise sense around a perpendicular axis intersect ...

is not aligned with the coordinate axis can be complicated to compute. If the axis of rotation were aligned with the positive -axis, then it would simply be

S = \begin
  \cos\theta & -\sin\theta & 0 \\
  \sin\theta &  \cos\theta & 0 \\
           0 &           0 & 1
\end,

where

\theta

is the angle of rotation. In the new coordinate system, the transformation would be written as

y' = Sx',

where and are respectively the original and transformed vectors in a new basis containing a vector parallel to the axis of rotation. In the original basis, the transform would be written as

y = Tx,

where vectors and and the unknown transform matrix are in the original basis. To write in terms of the simpler matrix, we use the change-of-basis matrix that transforms and as

x' = Px

and

y' = Py

&\Rightarrow & y &= \left(P^ S P\right) x = T x \end

Thus, the matrix in the original basis,

T

, is given by

T = P^SP

. The transform in the original basis is found to be the product of three easy-to-derive matrices. In effect, the similarity transform operates in three steps: change to a new basis (), perform the simple transformation (), and change back to the old basis ().

Properties

Similarity is an

equivalence relation In mathematics, an equivalence relation is a binary relation that is reflexive, symmetric, and transitive. The equipollence relation between line segments in geometry is a common example of an equivalence relation. A simpler example is equ ...

on the space of square matrices. Because matrices are similar if and only if they represent the same linear operator with respect to (possibly) different bases, similar matrices share all properties of their shared underlying operator: * Rank *

Characteristic polynomial In linear algebra, the characteristic polynomial of a square matrix is a polynomial which is invariant under matrix similarity and has the eigenvalues as roots. It has the determinant and the trace of the matrix among its coefficients. The ...

, and attributes that can be derived from it: **

Determinant In mathematics, the determinant is a Scalar (mathematics), scalar-valued function (mathematics), function of the entries of a square matrix. The determinant of a matrix is commonly denoted , , or . Its value characterizes some properties of the ...

** Trace **

Eigenvalues In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by a ...

, and their algebraic multiplicities * Geometric multiplicities of eigenvalues (but not the eigenspaces, which are transformed according to the base change matrix ''P'' used). * Minimal polynomial * Frobenius normal form *

Jordan normal form \begin \lambda_1 1\hphantom\hphantom\\ \hphantom\lambda_1 1\hphantom\\ \hphantom\lambda_1\hphantom\\ \hphantom\lambda_2 1\hphantom\hphantom\\ \hphantom\hphantom\lambda_2\hphantom\\ \hphantom\lambda_3\hphantom\\ \hphantom\ddots\hphantom\\ ...

, up to a permutation of the Jordan blocks * Index of nilpotence * Elementary divisors, which form a complete set of invariants for similarity of matrices over a

principal ideal domain In mathematics, a principal ideal domain, or PID, is an integral domain (that is, a non-zero commutative ring without nonzero zero divisors) in which every ideal is principal (that is, is formed by the multiples of a single element). Some author ...

Because of this, for a given matrix ''A'', one is interested in finding a simple "normal form" ''B'' which is similar to ''A''—the study of ''A'' then reduces to the study of the simpler matrix ''B''. For example, ''A'' is called diagonalizable if it is similar to a

diagonal matrix In linear algebra, a diagonal matrix is a matrix in which the entries outside the main diagonal are all zero; the term usually refers to square matrices. Elements of the main diagonal can either be zero or nonzero. An example of a 2×2 diagon ...

. Not all matrices are diagonalizable, but at least over the

complex number In mathematics, a complex number is an element of a number system that extends the real numbers with a specific element denoted , called the imaginary unit and satisfying the equation i^= -1; every complex number can be expressed in the for ...

s (or any

algebraically closed field In mathematics, a field is algebraically closed if every non-constant polynomial in (the univariate polynomial ring with coefficients in ) has a root in . In other words, a field is algebraically closed if the fundamental theorem of algebra ...

), every matrix is similar to a matrix in Jordan form. Neither of these forms is unique (diagonal entries or Jordan blocks may be permuted) so they are not really

normal forms Database normalization is the process of structuring a relational database in accordance with a series of so-called '' normal forms'' in order to reduce data redundancy and improve data integrity. It was first proposed by British computer sci ...

; moreover their determination depends on being able to factor the minimal or characteristic polynomial of ''A'' (equivalently to find its eigenvalues). The rational canonical form does not have these drawbacks: it exists over any field, is truly unique, and it can be computed using only arithmetic operations in the field; ''A'' and ''B'' are similar if and only if they have the same rational canonical form. The rational canonical form is determined by the elementary divisors of ''A''; these can be immediately read off from a matrix in Jordan form, but they can also be determined directly for any matrix by computing the

Smith normal form In mathematics, the Smith normal form (sometimes abbreviated SNF) is a normal form that can be defined for any matrix (not necessarily square) with entries in a principal ideal domain (PID). The Smith normal form of a matrix is diagonal, and can ...

, over the ring of polynomials, of the matrix (with polynomial entries) (the same one whose determinant defines the characteristic polynomial). Note that this Smith normal form is not a normal form of ''A'' itself; moreover it is not similar to either, but obtained from the latter by left and right multiplications by different invertible matrices (with polynomial entries). Similarity of matrices does not depend on the base field: if ''L'' is a field containing ''K'' as a subfield, and ''A'' and ''B'' are two matrices over ''K'', then ''A'' and ''B'' are similar as matrices over ''K''

if and only if In logic and related fields such as mathematics and philosophy, "if and only if" (often shortened as "iff") is paraphrased by the biconditional, a logical connective between statements. The biconditional is true in two cases, where either bo ...

they are similar as matrices over ''L''. This is so because the rational canonical form over ''K'' is also the rational canonical form over ''L''. This means that one may use Jordan forms that only exist over a larger field to determine whether the given matrices are similar. In the definition of similarity, if the matrix ''P'' can be chosen to be a

permutation matrix In mathematics, particularly in matrix theory, a permutation matrix is a square binary matrix that has exactly one entry of 1 in each row and each column with all other entries 0. An permutation matrix can represent a permutation of elements. ...

then ''A'' and ''B'' are permutation-similar; if ''P'' can be chosen to be a

unitary matrix In linear algebra, an invertible complex square matrix is unitary if its matrix inverse equals its conjugate transpose , that is, if U^* U = UU^* = I, where is the identity matrix. In physics, especially in quantum mechanics, the conjugate ...

then ''A'' and ''B'' are unitarily equivalent. The

spectral theorem In linear algebra and functional analysis, a spectral theorem is a result about when a linear operator or matrix can be diagonalized (that is, represented as a diagonal matrix in some basis). This is extremely useful because computations involvin ...

says that every

normal matrix In mathematics, a complex square matrix is normal if it commutes with its conjugate transpose : :A \text \iff A^*A = AA^* . The concept of normal matrices can be extended to normal operators on infinite-dimensional normed spaces and to nor ...

is unitarily equivalent to some diagonal matrix. Specht's theorem states that two matrices are unitarily equivalent if and only if they satisfy certain trace equalities.

References

Citations

General references

* (Similarity is discussed many places, starting at page 44.) {{refend Matrices (mathematics) Equivalence (mathematics)

Motivating example

Properties

See also

References

Citations

General references