In
linear algebra
Linear algebra is the branch of mathematics concerning linear equations such as
:a_1x_1+\cdots +a_nx_n=b,
linear maps such as
:(x_1, \ldots, x_n) \mapsto a_1x_1+\cdots +a_nx_n,
and their representations in vector spaces and through matrix (mathemat ...
, two ''n''-by-''n''
matrices
Matrix (: matrices or matrixes) or MATRIX may refer to:
Science and mathematics
* Matrix (mathematics), a rectangular array of numbers, symbols or expressions
* Matrix (logic), part of a formula in prenex normal form
* Matrix (biology), the ...
and are called similar if there exists an
invertible
In mathematics, the concept of an inverse element generalises the concepts of opposite () and reciprocal () of numbers.
Given an operation denoted here , and an identity element denoted , if , one says that is a left inverse of , and that ...
''n''-by-''n'' matrix such that
Similar matrices represent the same
linear map
In mathematics, and more specifically in linear algebra, a linear map (also called a linear mapping, linear transformation, vector space homomorphism, or in some contexts linear function) is a mapping V \to W between two vector spaces that p ...
under two possibly different
bases, with being the
change-of-basis matrix.
A transformation is called a similarity transformation or conjugation of the matrix . In the
general linear group
In mathematics, the general linear group of degree n is the set of n\times n invertible matrices, together with the operation of ordinary matrix multiplication. This forms a group, because the product of two invertible matrices is again inve ...
, similarity is therefore the same as
conjugacy, and similar matrices are also called conjugate; however, in a given
subgroup
In group theory, a branch of mathematics, a subset of a group G is a subgroup of G if the members of that subset form a group with respect to the group operation in G.
Formally, given a group (mathematics), group under a binary operation  ...
of the general linear group, the notion of conjugacy may be more restrictive than similarity, since it requires that be chosen to lie in .
Motivating example
When defining a linear transformation, it can be the case that a change of basis can result in a simpler form of the same transformation. For example, the matrix representing a rotation in when the
axis of rotation
Rotation or rotational/rotary motion is the circular movement of an object around a central line, known as an ''axis of rotation''. A plane figure can rotate in either a clockwise or counterclockwise sense around a perpendicular axis intersect ...
is not aligned with the coordinate axis can be complicated to compute. If the axis of rotation were aligned with the positive -axis, then it would simply be
where
is the angle of rotation. In the new coordinate system, the transformation would be written as
where and are respectively the original and transformed vectors in a new basis containing a vector parallel to the axis of rotation. In the original basis, the transform would be written as
where vectors and and the unknown transform matrix are in the original basis. To write in terms of the simpler matrix, we use the change-of-basis matrix that transforms and as
and
:
Thus, the matrix in the original basis,
, is given by
. The transform in the original basis is found to be the product of three easy-to-derive matrices. In effect, the similarity transform operates in three steps: change to a new basis (), perform the simple transformation (), and change back to the old basis ().
Properties
Similarity is an
equivalence relation
In mathematics, an equivalence relation is a binary relation that is reflexive, symmetric, and transitive. The equipollence relation between line segments in geometry is a common example of an equivalence relation. A simpler example is equ ...
on the space of square matrices.
Because matrices are similar if and only if they represent the same linear operator with respect to (possibly) different bases, similar matrices share all properties of their shared underlying operator:
*
Rank
*
Characteristic polynomial
In linear algebra, the characteristic polynomial of a square matrix is a polynomial which is invariant under matrix similarity and has the eigenvalues as roots. It has the determinant and the trace of the matrix among its coefficients. The ...
, and attributes that can be derived from it:
**
Determinant
In mathematics, the determinant is a Scalar (mathematics), scalar-valued function (mathematics), function of the entries of a square matrix. The determinant of a matrix is commonly denoted , , or . Its value characterizes some properties of the ...
**
Trace
**
Eigenvalues
In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by a ...
, and their
algebraic multiplicities
*
Geometric multiplicities of eigenvalues (but not the eigenspaces, which are transformed according to the base change matrix ''P'' used).
*
Minimal polynomial
*
Frobenius normal form
*
Jordan normal form
\begin
\lambda_1 1\hphantom\hphantom\\
\hphantom\lambda_1 1\hphantom\\
\hphantom\lambda_1\hphantom\\
\hphantom\lambda_2 1\hphantom\hphantom\\
\hphantom\hphantom\lambda_2\hphantom\\
\hphantom\lambda_3\hphantom\\
\hphantom\ddots\hphantom\\
...
, up to a permutation of the Jordan blocks
*
Index of nilpotence
*
Elementary divisors, which form a complete set of invariants for similarity of matrices over a
principal ideal domain
In mathematics, a principal ideal domain, or PID, is an integral domain (that is, a non-zero commutative ring without nonzero zero divisors) in which every ideal is principal (that is, is formed by the multiples of a single element). Some author ...
Because of this, for a given matrix ''A'', one is interested in finding a simple "normal form" ''B'' which is similar to ''A''—the study of ''A'' then reduces to the study of the simpler matrix ''B''. For example, ''A'' is called
diagonalizable if it is similar to a
diagonal matrix
In linear algebra, a diagonal matrix is a matrix in which the entries outside the main diagonal are all zero; the term usually refers to square matrices. Elements of the main diagonal can either be zero or nonzero. An example of a 2×2 diagon ...
. Not all matrices are diagonalizable, but at least over the
complex number
In mathematics, a complex number is an element of a number system that extends the real numbers with a specific element denoted , called the imaginary unit and satisfying the equation i^= -1; every complex number can be expressed in the for ...
s (or any
algebraically closed field
In mathematics, a field is algebraically closed if every non-constant polynomial in (the univariate polynomial ring with coefficients in ) has a root in . In other words, a field is algebraically closed if the fundamental theorem of algebra ...
), every matrix is similar to a matrix in
Jordan form. Neither of these forms is unique (diagonal entries or Jordan blocks may be permuted) so they are not really
normal forms
Database normalization is the process of structuring a relational database in accordance with a series of so-called '' normal forms'' in order to reduce data redundancy and improve data integrity. It was first proposed by British computer sci ...
; moreover their determination depends on being able to factor the minimal or characteristic polynomial of ''A'' (equivalently to find its eigenvalues). The
rational canonical form does not have these drawbacks: it exists over any field, is truly unique, and it can be computed using only arithmetic operations in the field; ''A'' and ''B'' are similar if and only if they have the same rational canonical form. The rational canonical form is determined by the elementary divisors of ''A''; these can be immediately read off from a matrix in Jordan form, but they can also be determined directly for any matrix by computing the
Smith normal form
In mathematics, the Smith normal form (sometimes abbreviated SNF) is a normal form that can be defined for any matrix (not necessarily square) with entries in a principal ideal domain (PID). The Smith normal form of a matrix is diagonal, and can ...
, over the ring of polynomials, of the matrix (with polynomial entries) (the same one whose determinant defines the characteristic polynomial). Note that this Smith normal form is not a normal form of ''A'' itself; moreover it is not similar to either, but obtained from the latter by left and right multiplications by different invertible matrices (with polynomial entries).
Similarity of matrices does not depend on the base field: if ''L'' is a field containing ''K'' as a
subfield, and ''A'' and ''B'' are two matrices over ''K'', then ''A'' and ''B'' are similar as matrices over ''K''
if and only if
In logic and related fields such as mathematics and philosophy, "if and only if" (often shortened as "iff") is paraphrased by the biconditional, a logical connective between statements. The biconditional is true in two cases, where either bo ...
they are similar as matrices over ''L''. This is so because the rational canonical form over ''K'' is also the rational canonical form over ''L''. This means that one may use Jordan forms that only exist over a larger field to determine whether the given matrices are similar.
In the definition of similarity, if the matrix ''P'' can be chosen to be a
permutation matrix
In mathematics, particularly in matrix theory, a permutation matrix is a square binary matrix that has exactly one entry of 1 in each row and each column with all other entries 0. An permutation matrix can represent a permutation of elements. ...
then ''A'' and ''B'' are permutation-similar; if ''P'' can be chosen to be a
unitary matrix
In linear algebra, an invertible complex square matrix is unitary if its matrix inverse equals its conjugate transpose , that is, if
U^* U = UU^* = I,
where is the identity matrix.
In physics, especially in quantum mechanics, the conjugate ...
then ''A'' and ''B'' are unitarily equivalent. The
spectral theorem
In linear algebra and functional analysis, a spectral theorem is a result about when a linear operator or matrix can be diagonalized (that is, represented as a diagonal matrix in some basis). This is extremely useful because computations involvin ...
says that every
normal matrix
In mathematics, a complex square matrix is normal if it commutes with its conjugate transpose :
:A \text \iff A^*A = AA^* .
The concept of normal matrices can be extended to normal operators on infinite-dimensional normed spaces and to nor ...
is unitarily equivalent to some diagonal matrix.
Specht's theorem states that two matrices are unitarily equivalent if and only if they satisfy certain trace equalities.
See also
*
Canonical forms
*
Matrix congruence
*
Matrix equivalence In linear algebra, two rectangular ''m''-by-''n'' matrices ''A'' and ''B'' are called equivalent if
:B = Q^ A P
for some invertible ''n''-by-''n'' matrix ''P'' and some invertible ''m''-by-''m'' matrix ''Q''. Equivalent matrices represent the sam ...
*
Jacobi rotation
References
Citations
General references
* (Similarity is discussed many places, starting at page 44.)
{{refend
Matrices (mathematics)
Equivalence (mathematics)