mathematics Mathematics is an area of knowledge that includes the topics of numbers, formulas and related structures, shapes and the spaces in which they are contained, and quantities and their changes. These topics are represented in modern mathematics ...

, a symmetric matrix

M

with real entries is positive-definite if the real number

z^\textsfMz

is positive for every nonzero real

column vector In linear algebra, a column vector with m elements is an m \times 1 matrix consisting of a single column of m entries, for example, \boldsymbol = \begin x_1 \\ x_2 \\ \vdots \\ x_m \end. Similarly, a row vector is a 1 \times n matrix for some n, ...

z,

where

z^\textsf

is the transpose of More generally, a Hermitian matrix (that is, a complex matrix equal to its conjugate transpose) is positive-definite if the real number

z^* Mz

is positive for every nonzero complex column vector

z,

where

z^*

denotes the conjugate transpose of

z.

Positive semi-definite matrices are defined similarly, except that the scalars

z^\textsfMz

and

z^* Mz

are required to be positive ''or zero'' (that is, nonnegative). Negative-definite and negative semi-definite matrices are defined analogously. A matrix that is not positive semi-definite and not negative semi-definite is sometimes called indefinite. A matrix is thus positive-definite if and only if it is the matrix of a positive-definite quadratic form or Hermitian form. In other words, a matrix is positive-definite if and only if it defines an inner product. Positive-definite and positive-semidefinite matrices can be characterized in many ways, which may explain the importance of the concept in various parts of mathematics. A matrix is positive-definite if and only if it satisfies any of the following equivalent conditions. * is congruent with a

diagonal matrix In linear algebra, a diagonal matrix is a matrix in which the entries outside the main diagonal are all zero; the term usually refers to square matrices. Elements of the main diagonal can either be zero or nonzero. An example of a 2×2 diagonal ...

with positive real entries. * is symmetric or Hermitian, and all its

eigenvalue In linear algebra, an eigenvector () or characteristic vector of a linear transformation is a nonzero vector that changes at most by a scalar factor when that linear transformation is applied to it. The corresponding eigenvalue, often denot ...

s are real and positive. * is symmetric or Hermitian, and all its leading principal minors are positive. * There exists an invertible matrix

B

with conjugate transpose

B^*

such that

M=B^*B.

A matrix is positive semi-definite if it satisfies similar equivalent conditions where "positive" is replaced by "nonnegative", "invertible matrix" is replaced by "matrix", and the word "leading" is removed. Positive-definite and positive-semidefinite real matrices are at the basis of convex optimization, since, given a

function of several real variables In mathematical analysis and its applications, a function of several real variables or real multivariate function is a function with more than one argument, with all arguments being real variables. This concept extends the idea of a functi ...

that is twice differentiable, then if its Hessian matrix (matrix of its second partial derivatives) is positive-definite at a point , then the function is convex near , and, conversely, if the function is convex near , then the Hessian matrix is positive-semidefinite at . Some authors use more general definitions of definiteness, including some non-symmetric real matrices, or non-Hermitian complex ones.

Definitions

In the following definitions,

\mathbf^\textsf

is the transpose of

\mathbf x

\mathbf^*

is the conjugate transpose of

\mathbf x

and

\mathbf

denotes the ''n''-dimensional zero-vector.

Definitions for real matrices

n \times n

symmetric real matrix

M

is said to be positive-definite if

\mathbf^\textsf M\mathbf > 0

for all non-zero

\mathbf

\R^n

. Formally, An

n \times n

symmetric real matrix

M

is said to be positive-semidefinite or non-negative-definite if

\mathbf^\textsf M\mathbf \geq 0

for all

\mathbf

\R^n

. Formally, An

n \times n

symmetric real matrix

M

is said to be negative-definite if

\mathbf^\textsf M\mathbf < 0

for all non-zero

\mathbf

\R^n

. Formally, An

n \times n

symmetric real matrix

M

is said to be negative-semidefinite or non-positive-definite if

x^\textsf Mx \leq 0

for all

x

\R^n

. Formally, An

n \times n

symmetric real matrix which is neither positive semidefinite nor negative semidefinite is called indefinite.

Definitions for complex matrices

The following definitions all involve the term

\mathbf^* M\mathbf

. Notice that this is always a real number for any Hermitian square matrix

M

. An

n \times n

Hermitian complex matrix

M

is said to be positive-definite if

\mathbf^* M\mathbf > 0

for all non-zero

\mathbf

\Complex^n

. Formally, An

n \times n

Hermitian complex matrix

M

is said to be positive semi-definite or non-negative-definite if

x^* Mx \geq 0

for all

x

\Complex^n

. Formally, An

n \times n

Hermitian complex matrix

M

is said to be negative-definite if

\mathbf^* M\mathbf < 0

for all non-zero

\mathbf

\Complex^n

. Formally, An

n \times n

Hermitian complex matrix

M

is said to be negative semi-definite or non-positive-definite if

\mathbf^* M\mathbf \leq 0

for all

\mathbf

\Complex^n

. Formally, An

n \times n

Hermitian complex matrix which is neither positive semidefinite nor negative semidefinite is called indefinite.

Consistency between real and complex definitions

Since every real matrix is also a complex matrix, the definitions of "definiteness" for the two classes must agree. For complex matrices, the most common definition says that "

M

is positive-definite if and only if

\mathbf^* M\mathbf

is real and positive for all non-zero ''complex'' column vectors

\mathbf z

". This condition implies that

M

is Hermitian (i.e. its transpose is equal to its conjugate). To see this, consider the matrices

A = \frac \left(M + M^*\right)

and

B = \frac \left(M - M^*\right)

, so that

M = A + iB

and

\mathbf^* M\mathbf = \mathbf^* A\mathbf + i\mathbf^* B\mathbf

. The matrices

A

and

B

are Hermitian, therefore

\mathbf^* A\mathbf

and

\mathbf^* B\mathbf

are individually real. If

\mathbf^* M\mathbf

is real, then

\mathbf^* B\mathbf

must be zero for all

\mathbf z

. Then

B

is the zero matrix and

M = A

, proving that

M

is Hermitian. By this definition, a positive-definite ''real'' matrix

M

is Hermitian, hence symmetric; and

\mathbf^\textsf M\mathbf

is positive for all non-zero ''real'' column vectors

\mathbf

. However the last condition alone is not sufficient for

M

to be positive-definite. For example, if

M = \begin 1 & 1 \\ -1 & 1 \end,

then for any real vector

\mathbf

with entries

a

and

b

we have

\mathbf^\textsf M\mathbf = \left(a + b\right)a + \left(-a + b\right)b = a^2 + b^2

, which is always positive if

\mathbf z

is not zero. However, if

\mathbf z

is the complex vector with entries

1

and

i

, one gets

\mathbf^* M \mathbf = \begin1 & -i\end M \begin1 \\ i\end = \begin1 + i & 1 - i\end \begin1 \\ i\end = 2+2i

which is not real. Therefore,

M

is not positive-definite. On the other hand, for a ''symmetric'' real matrix

M

, the condition "

\mathbf^\textsf M\mathbf > 0

for all nonzero real vectors

\mathbf z

" ''does'' imply that

M

is positive-definite in the complex sense.

Notation

If a Hermitian matrix

M

is positive semi-definite, one sometimes writes

M \succeq 0

and if

M

is positive-definite one writes

M \succ 0

. To denote that

M

is negative semi-definite one writes

M \preceq 0

and to denote that

M

is negative-definite one writes

M \prec 0

. The notion comes from functional analysis where positive semidefinite matrices define positive operators. If two matrices

A

and

B

satisfy

B - A \succeq 0

, we can define a non-strict partial order

B \succeq A

that is reflexive, antisymmetric, and transitive; It is not a total order, however, as

B - A

in general may be indefinite. A common alternative notation is

M \geq 0

M > 0

M \leq 0

and

M < 0

for positive semi-definite and positive-definite, negative semi-definite and negative-definite matrices, respectively. This may be confusing, as sometimes nonnegative matrices (respectively, nonpositive matrices) are also denoted in this way.

Examples

Eigenvalues

Let

M

be an

n \times n

Hermitian matrix (this includes real symmetric matrices). All eigenvalues of

M

are real, and their sign characterize its definiteness: *

M

is positive definite if and only if all of its eigenvalues are positive. *

M

is positive semi-definite if and only if all of its eigenvalues are non-negative. *

M

is negative definite if and only if all of its eigenvalues are negative *

M

is negative semi-definite if and only if all of its eigenvalues are non-positive. *

M

is indefinite if and only if it has both positive and negative eigenvalues. Let

PD P^

be an eigendecomposition of

M

, where

P

is a unitary complex matrix whose columns comprise an orthonormal basis of eigenvectors of

M

, and

D

is a ''real''

whose main diagonal contains the corresponding

s. The matrix

M

may be regarded as a diagonal matrix

D

that has been re-expressed in coordinates of the (eigenvectors) basis

P

. Put differently, applying

M

to some vector , giving , is the same as changing the basis to the eigenvector coordinate system using , giving , applying the stretching transformation to the result, giving , and then changing the basis back using , giving . With this in mind, the one-to-one change of variable

\mathbf = P\mathbf

shows that

\mathbf^* M\mathbf

is real and positive for any complex vector

\mathbf z

if and only if

\mathbf^* D\mathbf

is real and positive for any

y

; in other words, if

D

is positive definite. For a diagonal matrix, this is true only if each element of the main diagonal—that is, every eigenvalue of

M

—is positive. Since the spectral theorem guarantees all eigenvalues of a Hermitian matrix to be real, the positivity of eigenvalues can be checked using Descartes' rule of alternating signs when the characteristic polynomial of a real, symmetric matrix

M

is available.

Decomposition

Let

M

be an

n \times n

Hermitian matrix.

M

is positive semidefinite if and only if it can be decomposed as a product

M = B^* B

of a matrix

B

with its conjugate transpose. When

M

is real,

B

can be real as well and the decomposition can be written as

M = B^\textsf B.

M

is positive definite if and only if such a decomposition exists with

B

invertible. More generally,

M

is positive semidefinite with rank

k

if and only if a decomposition exists with a

k \times n

matrix

B

of full row rank (i.e. of rank

k

). Moreover, for any decomposition

M = B^* B

\operatorname(M) = \operatorname(B)

. The columns

b_1,\dots,b_n

B

can be seen as vectors in the complex or real vector space

\mathbb^k

, respectively. Then the entries of

M

are inner products (that is dot products, in the real case) of these vectors

M_ = \langle b_i, b_j\rangle.

In other words, a Hermitian matrix

M

is positive semidefinite if and only if it is the Gram matrix of some vectors

b_1,\dots,b_n

. It is positive definite if and only if it is the Gram matrix of some linearly independent vectors. In general, the rank of the Gram matrix of vectors

b_1,\dots,b_n

equals the dimension of the space spanned by these vectors.

Uniqueness up to unitary transformations

The decomposition is not unique: if

M = B^* B

for some

k \times n

matrix

B

and if

Q

is any unitary

k \times k

matrix (meaning

Q^* Q = Q Q^* = I

), then

M = B^* B = B^* Q^* Q B =A^* A

for

A=Q B

. However, this is the only way in which two decompositions can differ: the decomposition is unique up to unitary transformations. More formally, if

A

is a

k \times n

matrix and

B

is a

\ell \times n

matrix such that

A^* A = B^* B

, then there is a

\ell \times k

matrix

Q

with orthonormal columns (meaning

Q^* Q = I_

) such that

B = Q A

. When

\ell = k

this means

Q

is unitary. This statement has an intuitive geometric interpretation in the real case: let the columns of

A

and

B

be the vectors

a_1,\dots,a_n

and

b_1,\dots,b_n

\mathbb^k

. A real unitary matrix is an orthogonal matrix, which describes a

rigid transformation In mathematics, a rigid transformation (also called Euclidean transformation or Euclidean isometry) is a geometric transformation of a Euclidean space that preserves the Euclidean distance between every pair of points. The rigid transformations ...

(an isometry of Euclidean space

\mathbb^k

) preserving the 0 point (i.e.

rotations Rotation, or spin, is the circular movement of an object around a '' central axis''. A two-dimensional rotating object has only one possible central axis and can rotate in either a clockwise or counterclockwise direction. A three-dimensional ...

and reflections, without translations). Therefore, the dot products

a_i \cdot a_j

and

b_i \cdot b_j

are equal if and only if some rigid transformation of

\mathbb^k

transforms the vectors

a_1,\dots,a_n

b_1,\dots,b_n

(and 0 to 0).

Square root

A matrix

M

is positive semidefinite if and only if there is a positive semidefinite matrix

B

(in particular

B

is Hermitian, so

B^* = B

) satisfying

M = B B

. This matrix

B

is unique, is called the ''non-negative square root'' of

M

, and is denoted with

B = M^\frac

. When

M

is positive definite, so is

M^\frac

, hence it is also called the ''positive square root'' of

M

. The non-negative square root should not be confused with other decompositions

M = B^*B

. Some authors use the name ''square root'' and

M^\frac

for any such decomposition, or specifically for the Cholesky decomposition, or any decomposition of the form

M = B B

; other only use it for the non-negative square root. If

M > N > 0

then

M^\frac > N^\frac > 0

Cholesky decomposition

A positive semidefinite matrix

M

can be written as

M = LL^*

, where

L

is lower triangular with non-negative diagonal (equivalently

M = B^*B

where

B=L^*

is upper triangular); this is the Cholesky decomposition. If

M

is positive definite, then the diagonal of

L

is positive and the Cholesky decomposition is unique. Conversely if

L

is lower triangular with nonnegative diagonal then

L

is positive semidefinite. The Cholesky decomposition is especially useful for efficient numerical calculations. A closely related decomposition is the LDL decomposition,

M = L D L^*

, where

D

is diagonal and

L

is lower unitriangular.

Other characterizations

Let

M

be an

n \times n

real symmetric matrix, and let

B_1(M) := \

be the "unit ball" defined by

M

. Then we have the following *

B_1(vv^\mathsf)

is a solid slab sandwiched between

\pm \

. *

M\succeq 0

if and only if

B_1(M)

is an ellipsoid, or an ellipsoidal cylinder. *

M\succ 0

if and only if

B_1(M)

is bounded, that is, it is an ellipsoid. * If

N\succ 0

, then

M \succeq N

if and only if

B_1(M) \subseteq B_1(N)

;

M \succ N

if and only if

B_1(M) \subseteq \operatorname(B_1(N))

. * If

N\succ 0

, then

M \succeq \frac

for all

v \neq 0

if and only if

B_1(M) \subset \bigcap_ B_1(vv^\mathsf)

. So, since the polar dual of an ellipsoid is also an ellipsoid with the same principal axes, with inverse lengths, we have

B_1(N^) = \bigcap_ B_1(vv^\mathsf) = \bigcap_\

That is, if

N

is positive-definite, then

M \succeq \frac

for all

v\neq 0

if and only if

M \succeq N^

Let

M

be an

n \times n

Hermitian matrix. The following properties are equivalent to

M

being positive definite: ; The associated sesquilinear form is an inner product: The sesquilinear form defined by

M

is the function

\langle \cdot, \cdot\rangle

from

\Complex^n \times \Complex^n

\Complex^n

such that

\langle x, y \rangle := y^*M x

for all

x

and

y

\Complex^n

, where

y^*

is the conjugate transpose of

y

. For any complex matrix

M

, this form is linear in

x

and semilinear in

y

. Therefore, the form is an inner product on

\Complex^n

if and only if

\langle z, z \rangle

is real and positive for all nonzero

z

; that is if and only if

M

is positive definite. (In fact, every inner product on

\Complex^n

arises in this fashion from a Hermitian positive definite matrix.) ; Its leading principal minors are all positive: The ''k''th leading principal minor of a matrix

M

is the determinant of its upper-left

k \times k

sub-matrix. It turns out that a matrix is positive definite if and only if all these determinants are positive. This condition is known as Sylvester's criterion, and provides an efficient test of positive definiteness of a symmetric real matrix. Namely, the matrix is reduced to an upper triangular matrix by using

elementary row operations In mathematics, an elementary matrix is a matrix which differs from the identity matrix by one single elementary row operation. The elementary matrices generate the general linear group GL''n''(F) when F is a field. Left multiplication (pre-multipl ...

, as in the first part of the Gaussian elimination method, taking care to preserve the sign of its determinant during pivoting process. Since the ''k''th leading principal minor of a triangular matrix is the product of its diagonal elements up to row

k

, Sylvester's criterion is equivalent to checking whether its diagonal elements are all positive. This condition can be checked each time a new row

k

of the triangular matrix is obtained. A positive semidefinite matrix is positive definite if and only if it is invertible. A matrix

M

is negative (semi)definite if and only if

-M

is positive (semi)definite.

Quadratic forms

The (purely)

quadratic form In mathematics, a quadratic form is a polynomial with terms all of degree two ("form" is another name for a homogeneous polynomial). For example, :4x^2 + 2xy - 3y^2 is a quadratic form in the variables and . The coefficients usually belong to a ...

associated with a real

n \times n

matrix

M

is the function

Q : \mathbb^n \to \mathbb

such that

Q(x) = x^\textsf Mx

for all

x

M

can be assumed symmetric by replacing it with

\tfrac \left(M + M^\textsf\right)

. A symmetric matrix

M

is positive definite if and only if its quadratic form is a strictly convex function. More generally, any

quadratic function In mathematics, a quadratic polynomial is a polynomial of degree two in one or more variables. A quadratic function is the polynomial function defined by a quadratic polynomial. Before 20th century, the distinction was unclear between a polynomial ...

from

\mathbb^n

\mathbb

can be written as

x^\textsf Mx + x^\textsf b + c

where

M

is a symmetric

n \times n

matrix,

b

is a real

n

-vector, and

c

a real constant. In the

n=1

case, this is a parabola, and just like in the

n=1

case, we have Theorem: This quadratic function is strictly convex, and hence has a unique finite global minimum, if and only if

M

is positive definite. Proof: If

M

is positive definite, then the function is strictly convex. Its gradient is zero at the unique point of

M^b

, which must be the global minimum since the function is strictly convex. If

M

is not positive definite, then there exists some vector

v

such that

v^T M v \leq 0

, so the function

f(t) := (vt)^T M (vt) + b^t (vt) + c

is a line or a downward parabola, thus not strictly convex and not having a global minimum. For this reason, positive definite matrices play an important role in optimization problems.

Simultaneous diagonalization

One symmetric matrix and another matrix that is both symmetric and positive definite can be simultaneously diagonalized. This is so although simultaneous diagonalization is not necessarily performed with a similarity transformation. This result does not extend to the case of three or more matrices. In this section we write for the real case. Extension to the complex case is immediate. Let

M

be a symmetric and

N

a symmetric and positive definite matrix. Write the generalized eigenvalue equation as

\left(M - \lambda N\right)\mathbf = 0

where we impose that

x

be normalized, i.e.

\mathbf^\textsf N\mathbf = 1

. Now we use Cholesky decomposition to write the inverse of

N

Q^\textsf Q

. Multiplying by

Q

and letting

\mathbf = Q^\textsf \mathbf

, we get

Q\left(M - \lambda N\right)Q^\textsf \mathbf = 0

, which can be rewritten as

\left(QMQ^\textsf\right)\mathbf = \lambda \mathbf

where

\mathbf^\textsf \mathbf = 1

. Manipulation now yields

MX = NX\Lambda

where

X

is a matrix having as columns the generalized eigenvectors and

\Lambda

is a diagonal matrix of the generalized eigenvalues. Now premultiplication with

X^\textsf

gives the final result:

X^\textsf MX = \Lambda

and

X^\textsf NX = I

, but note that this is no longer an orthogonal diagonalization with respect to the inner product where

\mathbf^\textsf \mathbf = 1

. In fact, we diagonalized

M

with respect to the inner product induced by

N

. Note that this result does not contradict what is said on simultaneous diagonalization in the article Diagonalizable matrix, which refers to simultaneous diagonalization by a similarity transformation. Our result here is more akin to a simultaneous diagonalization of two quadratic forms, and is useful for optimization of one form under conditions on the other.

Properties

Induced partial ordering

For arbitrary square matrices

M

N

we write

M \ge N

M - N \ge 0

i.e.,

M - N

is positive semi-definite. This defines a partial ordering on the set of all square matrices. One can similarly define a strict partial ordering

M > N

. The ordering is called the Loewner order.

Inverse of positive definite matrix

Every positive definite matrix is invertible and its inverse is also positive definite. If

M \geq N > 0

then

N^ \geq M^ > 0

. Moreover, by the min-max theorem, the ''k''th largest eigenvalue of

M

is greater than the ''k''th largest eigenvalue of

N

Scaling

M

is positive definite and

r > 0

is a real number, then

r M

is positive definite., p. 430, Observation 7.1.3

Addition

* If

M

and

N

are positive-definite, then the sum

M + N

is also positive-definite. * If

M

and

N

are positive-semidefinite, then the sum

M + N

is also positive-semidefinite. * If

M

is positive-definite and

N

is positive-semidefinite, then the sum

M + N

is also positive-definite.

Multiplication

* If

M

and

N

are positive definite, then the products

MNM

and

NMN

are also positive definite. If

MN = NM

, then

MN

is also positive definite. * If

M

is positive semidefinite, then

A^* MA

is positive semidefinite for any (possibly rectangular) matrix

A

. If

M

is positive definite and

A

has full column rank, then

A^* M A

is positive definite.

Trace

The diagonal entries

m_

of a positive-semidefinite matrix are real and non-negative. As a consequence the trace,

\operatorname(M)\ge0

. Furthermore, since every principal sub-matrix (in particular, 2-by-2) is positive semidefinite,

\left, m_\ \leq \sqrt \quad \forall i, j

and thus, when

n \ge 1

\max_ \left, m_\ \leq \max_i m_

n \times n

Hermitian matrix

M

is positive definite if it satisfies the following trace inequalities:

\operatorname(M) > 0  \quad \mathrm \quad \frac > n-1.

Another important result is that for any

M

and

N

positive-semidefinite matrices,

\operatorname(MN) \ge 0

Hadamard product

M, N \geq 0

, although

MN

is not necessary positive semidefinite, the Hadamard product is,

M \circ N \geq 0

(this result is often called the Schur product theorem). Regarding the Hadamard product of two positive semidefinite matrices

M = (m_) \geq 0

N \geq 0

, there are two notable inequalities: * Oppenheim's inequality:

\det(M \circ N) \geq \det (N) \prod\nolimits_i m_.

\det(M \circ N) \geq \det(M) \det(N)

., Corollary 3.6, p. 227

Kronecker product

M, N \geq 0

, although

MN

is not necessary positive semidefinite, the Kronecker product

M \otimes N \geq 0

Frobenius product

M, N \geq 0

, although

MN

is not necessary positive semidefinite, the Frobenius inner product

M : N \geq 0

(Lancaster–Tismenetsky, ''The Theory of Matrices'', p. 218).

Convexity

The set of positive semidefinite symmetric matrices is convex. That is, if

M

and

N

are positive semidefinite, then for any

\alpha

between 0 and 1,

\alpha M + \left(1 - \alpha\right) N

is also positive semidefinite. For any vector

\mathbf x

\mathbf^\textsf \left(\alpha M + \left(1 - \alpha\right)N\right)\mathbf = \alpha \mathbf^\textsf M\mathbf + (1 - \alpha) \mathbf^\textsf N\mathbf \geq 0.

This property guarantees that semidefinite programming problems converge to a globally optimal solution.

Relation with cosine

The positive-definiteness of a matrix

A

expresses that the angle

\theta

between any vector

\mathbf x

and its image

A\mathbf

is always

-\pi / 2 < \theta < +\pi / 2

\cos\theta = \frac=\frac , \theta=\theta(\mathbf,A\mathbf)=\widehat= \text \mathbf \text A\mathbf

Further properties

# If

M

is a symmetric Toeplitz matrix, i.e. the entries

m_

are given as a function of their absolute index differences:

m_ = h(, i-j, )

, and the ''strict'' inequality

\sum_ \left, h(j)\ < h(0)

holds, then

M

is ''strictly'' positive definite. # Let

M > 0

and

N

Hermitian. If

MN + NM \ge 0

(resp.,

MN + NM > 0

) then

N \ge 0

(resp.,

N > 0

). # If

M > 0

is real, then there is a

\delta > 0

such that

M>\delta I

, where

I

is the

identity matrix In linear algebra, the identity matrix of size n is the n\times n square matrix with ones on the main diagonal and zeros elsewhere. Terminology and notation The identity matrix is often denoted by I_n, or simply by I if the size is immaterial o ...

. # If

M_k

denotes the leading

k \times k

minor,

\det\left(M_k\right)/\det\left(M_\right)

is the ''k''th pivot during

LU decomposition In numerical analysis and linear algebra, lower–upper (LU) decomposition or factorization factors a matrix as the product of a lower triangular matrix and an upper triangular matrix (see matrix decomposition). The product sometimes includes a ...

. # A matrix is negative definite if its ''k-''th order leading principal minor is negative when

k

is odd, and positive when

k

is even. A Hermitian matrix is positive semidefinite if and only if all of its principal minors are nonnegative. It is however not enough to consider the leading principal minors only, as is checked on the diagonal matrix with entries 0 and −1.

Block matrices and submatrices

A positive

2n \times 2n

matrix may also be defined by blocks:

M =  \begin A & B \\ C & D \end

where each block is

n \times n

. By applying the positivity condition, it immediately follows that

A

and

D

are hermitian, and

C = B^*

. We have that

\mathbf^* M\mathbf \ge 0

for all complex

\mathbf z

, and in particular for

\textsf

. Then

\begin \mathbf^* & 0 \end \begin A & B \\ B^* & D \end \begin \mathbf \\ 0 \end = \mathbf^* A\mathbf \ge 0.

A similar argument can be applied to

D

, and thus we conclude that both

A

and

D

must be positive definite. The argument can be extended to show that any

principal submatrix In mathematics, a matrix (plural matrices) is a rectangular array or table of numbers, symbols, or expressions, arranged in rows and columns, which is used to represent a mathematical object or a property of such an object. For example, \ ...

M

is itself positive definite. Converse results can be proved with stronger conditions on the blocks, for instance using the

Schur complement In linear algebra and the theory of matrices, the Schur complement of a block matrix is defined as follows. Suppose ''p'', ''q'' are nonnegative integers, and suppose ''A'', ''B'', ''C'', ''D'' are respectively ''p'' × ''p'', ''p'' × ''q'', ''q'' ...

Local extrema

A general

f(\mathbf)

n

real variables

x_1, \ldots, x_n

can always be written as

\mathbf^\textsf M \mathbf

where

\mathbf

is the column vector with those variables, and

M

is a symmetric real matrix. Therefore, the matrix being positive definite means that

f

has a unique minimum (zero) when

\mathbf

is zero, and is strictly positive for any other

\mathbf

. More generally, a twice-differentiable real function

f

n

real variables has local minimum at arguments

x_1, \ldots, x_n

if its gradient is zero and its Hessian (the matrix of all second derivatives) is positive semi-definite at that point. Similar statements can be made for negative definite and semi-definite matrices.

Covariance

statistics Statistics (from German: '' Statistik'', "description of a state, a country") is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a scientific, indust ...

, the covariance matrix of a multivariate probability distribution is always positive semi-definite; and it is positive definite unless one variable is an exact linear function of the others. Conversely, every positive semi-definite matrix is the covariance matrix of some multivariate distribution.

Extension for non-Hermitian square matrices

The definition of positive definite can be generalized by designating any complex matrix

M

(e.g. real non-symmetric) as positive definite if

\Re\left(\mathbf^* M\mathbf\right) > 0

for all non-zero complex vectors

\mathbf z

, where

\Re(c)

denotes the real part of a complex number

c

.Weisstein, Eric W.
Positive Definite Matrix.
' From ''MathWorld--A Wolfram Web Resource''. Accessed on 2012-07-26 Only the Hermitian part

\frac\left(M + M^*\right)

determines whether the matrix is positive definite, and is assessed in the narrower sense above. Similarly, if

\mathbf x

and

M

are real, we have

\mathbf^\textsf M \mathbf > 0

for all real nonzero vectors

\mathbf x

if and only if the symmetric part

\frac\left(M + M^\textsf\right)

is positive definite in the narrower sense. It is immediately clear that

\mathbf^\textsf M \mathbf = \sum_ x_i M_ x_j

is insensitive to transposition of ''M''. Consequently, a non-symmetric real matrix with only positive eigenvalues does not need to be positive definite. For example, the matrix

M = \left begin 4 & 9 \\ 1 & 4 \end\right /math> has positive eigenvalues yet is not positive definite; in particular a negative value of \mathbf^\textsf M\mathbf is obtained with the choice \mathbf = \left begin -1 \\ 1 \end\right (which is the eigenvector associated with the negative eigenvalue of the symmetric part of 

In summary, the distinguishing feature between the real and complex case is that, a

bounded Boundedness or bounded may refer to: Economics * Bounded rationality, the idea that human rationality in decision-making is bounded by the available information, the cognitive limitations, and the time available to make the decision * Bounded e ...

positive operator on a complex Hilbert space is necessarily Hermitian, or self adjoint. The general claim can be argued using the

polarization identity In linear algebra, a branch of mathematics, the polarization identity is any one of a family of formulas that express the inner product of two vectors in terms of the norm of a normed vector space. If a norm arises from an inner product then t ...

. That is no longer true in the real case.

Applications

Heat conductivity matrix

Fourier's law of heat conduction, giving heat flux

\mathbf q

in terms of the temperature gradient

\mathbf g = \nabla T

is written for anisotropic media as

\mathbf = -K\mathbf

, in which

K

is the symmetric

thermal conductivity The thermal conductivity of a material is a measure of its ability to conduct heat. It is commonly denoted by k, \lambda, or \kappa. Heat transfer occurs at a lower rate in materials of low thermal conductivity than in materials of high thermal ...

matrix. The negative is inserted in Fourier's law to reflect the expectation that heat will always flow from hot to cold. In other words, since the temperature gradient

\mathbf g

always points from cold to hot, the heat flux

\mathbf q

is expected to have a negative inner product with

\mathbf g

so that

\mathbf^\textsf\mathbf < 0

. Substituting Fourier's law then gives this expectation as

\mathbf^\textsfK\mathbf > 0

, implying that the conductivity matrix should be positive definite.

Notes

References

* * *

External links

*
Wolfram MathWorld: Positive Definite Matrix
{{DEFAULTSORT:Positive-Definite Matrix Matrices de:Definitheit#Definitheit von Matrizen

Definitions

Definitions for real matrices

Definitions for complex matrices

Consistency between real and complex definitions

Notation

Examples

Eigenvalues

Decomposition

Uniqueness up to unitary transformations

Square root

Cholesky decomposition

Other characterizations

Quadratic forms

Simultaneous diagonalization

Properties

Induced partial ordering

Inverse of positive definite matrix

Scaling

Addition

Multiplication

Trace

Hadamard product

Kronecker product

Frobenius product

Convexity

Relation with cosine

Further properties

Block matrices and submatrices

Local extrema

Covariance

Extension for non-Hermitian square matrices

Applications

Heat conductivity matrix

See also

Notes

References

External links