The Cauchy–Schwarz inequality (also called Cauchy–Bunyakovsky–Schwarz inequality) is an upper bound on the

absolute value In mathematics, the absolute value or modulus of a real number x, is the non-negative value without regard to its sign. Namely, , x, =x if x is a positive number, and , x, =-x if x is negative (in which case negating x makes -x positive), ...

of the inner product between two vectors in an inner product space in terms of the product of the vector norms. It is considered one of the most important and widely used inequalities in mathematics. Inner products of vectors can describe finite sums (via finite-dimensional vector spaces), infinite series (via vectors in sequence spaces), and

integral In mathematics, an integral is the continuous analog of a Summation, sum, which is used to calculate area, areas, volume, volumes, and their generalizations. Integration, the process of computing an integral, is one of the two fundamental oper ...

s (via vectors in Hilbert spaces). The inequality for sums was published by . The corresponding inequality for integrals was published by and . Schwarz gave the modern proof of the integral version.

Statement of the inequality

The Cauchy–Schwarz inequality states that for all vectors

\mathbf

and

\mathbf

of an inner product space where

\langle \cdot, \cdot \rangle

is the inner product. Examples of inner products include the real and complex

dot product In mathematics, the dot product or scalar productThe term ''scalar product'' means literally "product with a Scalar (mathematics), scalar as a result". It is also used for other symmetric bilinear forms, for example in a pseudo-Euclidean space. N ...

; see the examples in inner product. Every inner product gives rise to a Euclidean

\ell_2

norm, called the or , where the norm of a vector

\mathbf

is denoted and defined by

\, \mathbf\,  := \sqrt,

where

\langle \mathbf, \mathbf \rangle

is always a non-negative real number (even if the inner product is complex-valued). By taking the square root of both sides of the above inequality, the Cauchy–Schwarz inequality can be written in its more familiar form in terms of the norm: Moreover, the two sides are equal if and only if

\mathbf

and

\mathbf

are linearly dependent.

Special cases

Sedrakyan's lemma – positive real numbers

Sedrakyan's inequality, also known as Bergström's inequality, Engel's form, Titu's lemma (or the T2 lemma), states that for real numbers

u_1, u_2, \dots, u_n

and positive real numbers

v_1, v_2, \dots, v_n

\frac \leq \frac + \frac + \cdots + \frac,

or, using summation notation,

\biggl(\sum_^n u_i\biggr)^2 \bigg/ \sum_^n v_i \,\leq\, \sum_^n \frac.

It is a direct consequence of the Cauchy–Schwarz inequality, obtained by using the

\R^n

upon substituting

u_i' = \frac

and

v_i' =

. This form is especially helpful when the inequality involves fractions where the numerator is a perfect square.

- The plane

Cauchy-Schwarz inequation in Euclidean plane

The real vector space

\R^2

denotes the 2-dimensional plane. It is also the 2-dimensional

Euclidean space Euclidean space is the fundamental space of geometry, intended to represent physical space. Originally, in Euclid's ''Elements'', it was the three-dimensional space of Euclidean geometry, but in modern mathematics there are ''Euclidean spaces ...

where the inner product is the

. If

\mathbf = (u_1, u_2)

and

\mathbf = (v_1, v_2)

then the Cauchy–Schwarz inequality becomes:

\langle \mathbf, \mathbf \rangle^2 = \bigl(\, \mathbf\,  \, \mathbf\,  \cos \theta\bigr)^2 \leq \, \mathbf\, ^2 \, \mathbf\, ^2,

where

\theta

is the

angle In Euclidean geometry, an angle can refer to a number of concepts relating to the intersection of two straight Line (geometry), lines at a Point (geometry), point. Formally, an angle is a figure lying in a Euclidean plane, plane formed by two R ...

between

\mathbf

and

\mathbf

. The form above is perhaps the easiest in which to understand the inequality, since the square of the cosine can be at most 1, which occurs when the vectors are in the same or opposite directions. It can also be restated in terms of the vector coordinates

u_1

u_2

v_1

, and

v_2

\left(u_1 v_1 + u_2 v_2\right)^2 \leq \left(u_1^2 + u_2^2\right) \left(v_1^2 + v_2^2\right),

where equality holds if and only if the vector

\left(u_1, u_2\right)

is in the same or opposite direction as the vector

\left(v_1, v_2\right)

, or if one of them is the zero vector.

: ''n''-dimensional Euclidean space

\R^n

with the standard inner product, which is the

, the Cauchy–Schwarz inequality becomes:

\biggl(\sum_^n u_i v_i\biggr)^2 \leq \biggl(\sum_^n u_i^2\biggr) \biggl(\sum_^n v_i^2\biggr).

The Cauchy–Schwarz inequality can be proved using only elementary algebra in this case by observing that the difference of the right and the left hand side is

\tfrac \sum_^n\sum_^n (u_i v_j - u_j v_i)^2 \ge 0

or by considering the following quadratic polynomial in

x

(u_1 x + v_1)^2 + \cdots + (u_n x + v_n)^2 = \biggl(\sum_i u_i^2\biggr) x^2 + 2 \biggl(\sum_i u_i v_i\biggr) x + \sum_i v_i^2.

Since the latter polynomial is nonnegative, it has at most one real root, hence its discriminant is less than or equal to zero. That is,

\biggl(\sum_i u_i v_i\biggr)^2 - \biggl(\sum_i \biggr)  \biggl(\sum_i \biggr) \leq 0.

: ''n''-dimensional complex space

\mathbf, \mathbf \in \Complex^n

with

\mathbf = (u_1, \ldots, u_n)

and

\mathbf = (v_1, \ldots, v_n)

(where

u_1, \ldots, u_n \in \Complex

and

v_1, \ldots, v_n \in \Complex

) and if the inner product on the vector space

\Complex^n

is the canonical complex inner product (defined by

\langle \mathbf, \mathbf \rangle := u_1 \overline + \cdots + u_ \overline,

where the bar notation is used for complex conjugation), then the inequality may be restated more explicitly as follows:

\bigl, \langle \mathbf, \mathbf \rangle\bigr, ^2 
= \Biggl, \sum_^n u_k\bar_k\Biggr, ^2 
\leq \langle \mathbf, \mathbf \rangle \langle \mathbf, \mathbf \rangle 
= \biggl(\sum_^n u_k \bar_k\biggr) \biggl(\sum_^n v_k \bar_k\biggr) 
= \sum_^n , u_j, ^2 \sum_^n , v_k, ^2.

That is,

\bigl, u_1 \bar_1 + \cdots + u_n \bar_n\bigr, ^2 \leq \bigl(, u_1, ^2 + \cdots + , u_n, ^2\bigr) \bigl(, v_1, ^2 + \cdots + , v_n, ^2\bigr).

For the inner product space of square-integrable complex-valued functions, the following inequality holds.

\left, \int_ f(x) \overline\,dx\^2  \leq  \int_ \bigl, f(x)\bigr, ^2\,dx \int_ \bigl, g(x)\bigr, ^2 \,dx.

The Hölder inequality is a generalization of this.

Applications

Analysis

In any inner product space, the triangle inequality is a consequence of the Cauchy–Schwarz inequality, as is now shown:

\begin
\, \mathbf + \mathbf\, ^2 
&= \langle \mathbf + \mathbf, \mathbf + \mathbf \rangle && \\
&= \, \mathbf\, ^2 + \langle \mathbf, \mathbf \rangle + \langle \mathbf, \mathbf \rangle + \, \mathbf\, ^2 ~ && ~ \text \langle \mathbf, \mathbf \rangle = \overline \\
&= \, \mathbf\, ^2 + 2 \operatorname \langle \mathbf, \mathbf \rangle + \, \mathbf\, ^2 && \\
&\leq \, \mathbf\, ^2 + 2, \langle \mathbf, \mathbf \rangle,  + \, \mathbf\, ^2 && \\
&\leq \, \mathbf\, ^2 + 2\, \mathbf\, \, \mathbf\,  + \, \mathbf\, ^2 ~ && ~ \text\\
&=\bigl(\, \mathbf\,  + \, \mathbf\, \bigr)^2. && 
\end

Taking square roots gives the triangle inequality:

\, \mathbf + \mathbf\,  \leq \, \mathbf\,  + \, \mathbf\, .

The Cauchy–Schwarz inequality is used to prove that the inner product is a continuous function with respect to the

topology Topology (from the Greek language, Greek words , and ) is the branch of mathematics concerned with the properties of a Mathematical object, geometric object that are preserved under Continuous function, continuous Deformation theory, deformat ...

induced by the inner product itself.

Geometry

The Cauchy–Schwarz inequality allows one to extend the notion of "angle between two vectors" to any real inner-product space by defining:

\cos\theta_ = \frac.

The Cauchy–Schwarz inequality proves that this definition is sensible, by showing that the right-hand side lies in the interval and justifies the notion that (real) Hilbert spaces are simply generalizations of the

. It can also be used to define an angle in complex inner-product spaces, by taking the absolute value or the real part of the right-hand side, as is done when extracting a metric from quantum fidelity.

Probability theory

Let

X

and

Y

random variable A random variable (also called random quantity, aleatory variable, or stochastic variable) is a Mathematics, mathematical formalization of a quantity or object which depends on randomness, random events. The term 'random variable' in its mathema ...

s. Then the covariance inequality is given by:

\operatorname(X) \geq \frac.

After defining an inner product on the set of random variables using the expectation of their product,

\langle X, Y \rangle := \operatorname(X Y),

the Cauchy–Schwarz inequality becomes

\bigl, \operatorname(XY)\bigr, ^2 \leq \operatorname(X^2) \operatorname(Y^2).

To prove the covariance inequality using the Cauchy–Schwarz inequality, let

\mu = \operatorname(X)

and

\nu = \operatorname(Y),

then

\begin
\bigl, \operatorname(X, Y)\bigr, ^2 
&= \bigl, \operatorname((X - \mu)(Y - \nu))\bigr, ^2 \\
&= \bigl, \langle X - \mu, Y - \nu \rangle \bigr, ^2\\
&\leq \langle X - \mu, X - \mu \rangle \langle Y - \nu, Y - \nu \rangle \\
& = \operatorname\left((X - \mu)^2\right) \operatorname\left((Y - \nu)^2\right) \\
& = \operatorname(X) \operatorname(Y),
\end

where

\operatorname

denotes

variance In probability theory and statistics, variance is the expected value of the squared deviation from the mean of a random variable. The standard deviation (SD) is obtained as the square root of the variance. Variance is a measure of dispersion ...

and

\operatorname

denotes covariance.

Proofs

There are many different proofs of the Cauchy–Schwarz inequality other than those given below. When consulting other sources, there are often two sources of confusion. First, some authors define to be linear in the second argument rather than the first. Second, some proofs are only valid when the field is

\mathbb R

and not

\mathbb C.

This section gives two proofs of the following theorem: In both of the proofs given below, the proof in the trivial case where at least one of the vectors is zero (or equivalently, in the case where

\, \mathbf\, \, \mathbf\, = 0

) is the same. It is presented immediately below only once to reduce repetition. It also includes the easy part of the proof of the Equality Characterization given above; that is, it proves that if

\mathbf

and

\mathbf

are linearly dependent then

\bigl, \langle \mathbf, \mathbf \rangle\bigr,  = \, \mathbf\,  \, \mathbf\, .

By definition,

\mathbf

and

\mathbf

are linearly dependent if and only if one is a scalar multiple of the other. If

\mathbf = c \mathbf

where

c

is some scalar then

, \langle \mathbf, \mathbf \rangle,  
= , \langle c \mathbf, \mathbf \rangle,  
= , c \langle \mathbf, \mathbf \rangle,  
= , c, \, \mathbf\,  \, \mathbf\, 
=\, c \mathbf\,  \, \mathbf\, 
=\, \mathbf\,  \, \mathbf\,

which shows that equality holds in the . The case where

\mathbf = c \mathbf

for some scalar

c

follows from the previous case:

, \langle \mathbf, \mathbf \rangle,  
= , \langle \mathbf, \mathbf \rangle, 
=\, \mathbf\,  \, \mathbf\, .

In particular, if at least one of

\mathbf

and

\mathbf

is the zero vector then

\mathbf

and

\mathbf

are necessarily linearly dependent (for example, if

\mathbf = \mathbf

then

\mathbf = c \mathbf

where

c = 0

), so the above computation shows that the Cauchy–Schwarz inequality holds in this case. Consequently, the Cauchy–Schwarz inequality only needs to be proven only for non-zero vectors and also only the non-trivial direction of the Equality Characterization must be shown.

Proof via the Pythagorean theorem

The special case of

\mathbf = \mathbf

was proven above so it is henceforth assumed that

\mathbf \neq \mathbf.

Let

\mathbf := \mathbf - \frac   \mathbf.

It follows from the linearity of the inner product in its first argument that:

\langle \mathbf, \mathbf \rangle 
= \left\langle \mathbf - \frac  \mathbf, \mathbf \right\rangle 
= \langle \mathbf, \mathbf \rangle - \frac  \langle \mathbf, \mathbf \rangle 
= 0.

Therefore,

\mathbf

is a vector orthogonal to the vector

\mathbf

(Indeed,

\mathbf

is the projection of

\mathbf

onto the plane orthogonal to

\mathbf.

) We can thus apply the Pythagorean theorem to

\mathbf= \frac  \mathbf + \mathbf

which gives

\, \mathbf\, ^2 
= \left, \frac\^2 \, \mathbf\, ^2 + \, \mathbf\, ^2 
= \frac \,\, \mathbf\, ^2 + \, \mathbf\, ^2 
= \frac + \, \mathbf\, ^2 \geq \frac.

The Cauchy–Schwarz inequality follows by multiplying by

\, \mathbf\, ^2

and then taking the square root. Moreover, if the relation

\geq

in the above expression is actually an equality, then

\, \mathbf\, ^2 = 0

and hence

\mathbf = \mathbf;

the definition of

\mathbf

then establishes a relation of linear dependence between

\mathbf

and

\mathbf.

The converse was proved at the beginning of this section, so the proof is complete.

\blacksquare

Proof by analyzing a quadratic

Consider an arbitrary pair of vectors

\mathbf, \mathbf

. Define the function

p : \R \to \R

defined by

p(t) = \langle t\alpha\mathbf + \mathbf, t\alpha\mathbf + \mathbf\rangle

, where

\alpha

is a complex number satisfying

, \alpha,  = 1

and

\alpha\langle\mathbf, \mathbf\rangle = , \langle\mathbf, \mathbf\rangle,

. Such an

\alpha

exists since if

\langle\mathbf, \mathbf\rangle = 0

then

\alpha

can be taken to be 1. Since the inner product is positive-definite,

p(t)

only takes non-negative real values. On the other hand,

p(t)

can be expanded using the bilinearity of the inner product:

\begin
p(t)
&= \langle t\alpha\mathbf, t\alpha\mathbf\rangle + \langle t\alpha\mathbf, \mathbf\rangle + \langle\mathbf, t\alpha\mathbf\rangle + \langle\mathbf, \mathbf\rangle \\
&= t\alpha t\overline\langle\mathbf, \mathbf\rangle + t\alpha\langle\mathbf, \mathbf\rangle + t\overline\langle \mathbf, \mathbf\rangle + \langle\mathbf, \mathbf\rangle \\
&= \lVert \mathbf \rVert^2 t^2 + 2, \langle\mathbf, \mathbf\rangle, t + \lVert \mathbf \rVert^2
\end

Thus,

p

is a polynomial of degree

2

(unless

\mathbf = 0,

which is a case that was checked earlier). Since the sign of

p

does not change, the discriminant of this polynomial must be non-positive:

\Delta = 4 \bigl(\,, \langle \mathbf, \mathbf \rangle, ^2 - \Vert \mathbf \Vert^2 \Vert \mathbf \Vert^2\bigr) \leq 0.

The conclusion follows. For the equality case, notice that

\Delta = 0

happens if and only if

p(t) = \bigl(t\Vert \mathbf \Vert + \Vert \mathbf \Vert\bigr)^2.

t_0 = -\Vert \mathbf \Vert / \Vert \mathbf \Vert,

then

p(t_0) = \langle t_0\alpha\mathbf + \mathbf,t_0\alpha\mathbf + \mathbf\rangle = 0,

and hence

\mathbf = -t_0\alpha\mathbf.

Generalizations

Various generalizations of the Cauchy–Schwarz inequality exist.

Hölder's inequality In mathematical analysis, Hölder's inequality, named after Otto Hölder, is a fundamental inequality (mathematics), inequality between Lebesgue integration, integrals and an indispensable tool for the study of Lp space, spaces. The numbers an ...

generalizes it to

L^p

norms. More generally, it can be interpreted as a special case of the definition of the norm of a linear operator on a

Banach space In mathematics, more specifically in functional analysis, a Banach space (, ) is a complete normed vector space. Thus, a Banach space is a vector space with a metric that allows the computation of vector length and distance between vectors and ...

(Namely, when the space is a Hilbert space). Further generalizations are in the context of operator theory, e.g. for operator-convex functions and operator algebras, where the domain and/or range are replaced by a C*-algebra or W*-algebra. An inner product can be used to define a positive linear functional. For example, given a Hilbert space

L^2(m), m

being a finite measure, the standard inner product gives rise to a positive functional

\varphi

\varphi (g) = \langle g, 1 \rangle.

Conversely, every positive linear functional

\varphi

L^2(m)

can be used to define an inner product

\langle f, g \rangle _\varphi := \varphi\left(g^* f\right),

where

g^*

is the pointwise complex conjugate of

g.

In this language, the Cauchy–Schwarz inequality becomes

\bigl, \varphi(g^* f)\bigr, ^2 \leq \varphi\left(f^* f\right) \varphi\left(g^* g\right),

which extends verbatim to positive functionals on C*-algebras: The next two theorems are further examples in operator algebra. This extends the fact

\varphi\left(a^*a\right) \cdot 1 \geq \varphi(a)^* \varphi(a) = , \varphi(a), ^2,

when

\varphi

is a linear functional. The case when

a

is self-adjoint, that is,

a = a^*,

is sometimes known as Kadison's inequality. Another generalization is a refinement obtained by interpolating between both sides of the Cauchy–Schwarz inequality: This theorem can be deduced from

. There are also non-commutative versions for operators and tensor products of matrices. Several matrix versions of the Cauchy–Schwarz inequality and Kantorovich inequality are applied to linear regression models.

Notes

Citations

References

* * * * * * * . * * . * * *

External links

Earliest Uses: The entry on the Cauchy–Schwarz inequality has some historical information.

Tutorial and Interactive program. {{DEFAULTSORT:Cauchy-Schwarz inequality Augustin-Louis Cauchy Inequalities (mathematics) Linear algebra Operator theory Articles containing proofs Mathematical analysis Probabilistic inequalities

Statement of the inequality

Special cases

Sedrakyan's lemma – positive real numbers

- The plane

: ''n''-dimensional Euclidean space

: ''n''-dimensional complex space

Applications

Analysis

Geometry

Probability theory

Proofs

Proof via the Pythagorean theorem

Proof by analyzing a quadratic

Generalizations

See also

Notes

Citations

References

External links