In
mathematics
Mathematics is an area of knowledge that includes the topics of numbers, formulas and related structures, shapes and the spaces in which they are contained, and quantities and their changes. These topics are represented in modern mathematics ...
, especially the usage of
linear algebra in
Mathematical physics, Einstein notation (also known as the Einstein summation convention or Einstein summation notation) is a notational convention that implies
summation
In mathematics, summation is the addition of a sequence of any kind of numbers, called ''addends'' or ''summands''; the result is their ''sum'' or ''total''. Beside numbers, other types of values can be summed as well: functions, vectors, mat ...
over a set of indexed terms in a formula, thus achieving brevity. As part of mathematics it is a notational subset of
Ricci calculus
In mathematics, Ricci calculus constitutes the rules of index notation and manipulation for tensors and tensor fields on a differentiable manifold, with or without a metric tensor or connection. It is also the modern name for what used to be cal ...
; however, it is often used in physics applications that do not distinguish between
tangent and
cotangent spaces. It was introduced to physics by
Albert Einstein in 1916.
Introduction
Statement of convention
According to this convention, when an index variable appears twice in a single
term and is not otherwise defined (see
Free and bound variables), it implies summation of that term over all the values of the index. So where the indices can range over the
set ,
:
is simplified by the convention to:
:
The upper indices are not
exponents but are indices of coordinates,
coefficient
In mathematics, a coefficient is a multiplicative factor in some term of a polynomial, a series, or an expression; it is usually a number, but may be any expression (including variables such as , and ). When the coefficients are themselves var ...
s or
basis vectors. That is, in this context should be understood as the second component of rather than the square of (this can occasionally lead to ambiguity). The upper index position in is because, typically, an index occurs once in an upper (superscript) and once in a lower (subscript) position in a term (see ' below). Typically, would be equivalent to the traditional .
In
general relativity, a common convention is that
* the
Greek alphabet is used for space and time components, where indices take on values 0, 1, 2, or 3 (frequently used letters are ),
* the
Latin alphabet is used for spatial components only, where indices take on values 1, 2, or 3 (frequently used letters are ),
In general, indices can range over any
indexing set
In mathematics, an index set is a set whose members label (or index) members of another set. For instance, if the elements of a set may be ''indexed'' or ''labeled'' by means of the elements of a set , then is an index set. The indexing consists ...
, including an
infinite set. This should not be confused with a typographically similar convention used to distinguish between
tensor index notation and the closely related but distinct basis-independent
abstract index notation
Abstract index notation (also referred to as slot-naming index notation) is a mathematical notation for tensors and spinors that uses indices to indicate their types, rather than their components in a particular basis. The indices are mere placeho ...
.
An index that is summed over is a ''summation index'', in this case "". It is also called a
dummy index since any symbol can replace "" without changing the meaning of the expression (provided that it does not collide with other index symbols in the same term).
An index that is not summed over is a
''free index'' and should appear only once per term. If such an index does appear, it usually also appears in every other term in an equation. An example of a free index is the "" in the equation
, which is equivalent to the equation
.
Application
Einstein notation can be applied in slightly different ways. Typically, each index occurs once in an upper (superscript) and once in a lower (subscript) position in a term; however, the convention can be applied more generally to any repeated indices within a term.
When dealing with
covariant and contravariant vectors, where the position of an index also indicates the type of vector, the first case usually applies; a covariant vector can only be contracted with a contravariant vector, corresponding to summation of the products of coefficients. On the other hand, when there is a fixed coordinate basis (or when not considering coordinate vectors), one may choose to use only subscripts; see ' below.
Vector representations
Superscripts and subscripts versus only subscripts
In terms of
covariance and contravariance of vectors
In physics, especially in multilinear algebra and tensor analysis, covariance and contravariance describe how the quantitative description of certain geometric or physical entities changes with a change of basis. In modern mathematical notation ...
,
* upper indices represent components of
contravariant vectors (
vectors),
* lower indices represent components of
covariant vectors (
covector
In mathematics, a linear form (also known as a linear functional, a one-form, or a covector) is a linear map from a vector space to its field of scalars (often, the real numbers or the complex numbers).
If is a vector space over a field , the s ...
s).
They transform contravariantly or covariantly, respectively, with respect to
change of basis.
In recognition of this fact, the following notation uses the same symbol both for a vector or covector and its ''components'', as in:
:
where is the vector and are its components (not the th covector ), is the covector and are its components. The basis vector elements
are each column vectors, and the covector basis elements
are each row covectors. (See also
#Abstract description;
duality
Duality may refer to:
Mathematics
* Duality (mathematics), a mathematical concept
** Dual (category theory), a formalization of mathematical duality
** Duality (optimization)
** Duality (order theory), a concept regarding binary relations
** Dual ...
, below and the
examples)
In the presence of a
non-degenerate form (an
isomorphism , for instance a
Riemannian metric or
Minkowski metric), one can
raise and lower indices.
A basis gives such a form (via the
dual basis
In linear algebra, given a vector space ''V'' with a basis ''B'' of vectors indexed by an index set ''I'' (the cardinality of ''I'' is the dimension of ''V''), the dual set of ''B'' is a set ''B''∗ of vectors in the dual space ''V''∗ with th ...
), hence when working on with a
Euclidean metric and a fixed
orthonormal basis, one has the option to work with only subscripts.
However, if one changes coordinates, the way that coefficients change depends on the variance of the object, and one cannot ignore the distinction; see
Covariance and contravariance of vectors
In physics, especially in multilinear algebra and tensor analysis, covariance and contravariance describe how the quantitative description of certain geometric or physical entities changes with a change of basis. In modern mathematical notation ...
.
Mnemonics
In the above example, vectors are represented as
matrices (column vectors), while covectors are represented as matrices (row covectors).
When using the column vector convention:
* "Upper indices go up to down; lower indices go left to right."
* "Covariant tensors are row vectors that have indices that are below (co-row-below)."
* Covectors are row vectors:
Hence the lower index indicates which ''column'' you are in.
* Contravariant vectors are column vectors:
Hence the upper index indicates which ''row'' you are in.
Abstract description
The virtue of Einstein notation is that it represents the
invariant
Invariant and invariance may refer to:
Computer science
* Invariant (computer science), an expression whose value doesn't change during program execution
** Loop invariant, a property of a program loop that is true before (and after) each iteratio ...
quantities with a simple notation.
In physics, a
scalar is invariant under transformations of basis. In particular, a
Lorentz scalar
In a relativistic theory of physics, a Lorentz scalar is an expression, formed from items of the theory, which evaluates to a scalar, invariant under any Lorentz transformation. A Lorentz scalar may be generated from e.g., the scalar product of ve ...
is invariant under a
Lorentz transformation. The individual terms in the sum are not. When the basis is changed, the ''components'' of a vector change by a
linear transformation described by a matrix. This led Einstein to propose the convention that repeated indices imply the summation is to be done.
As for covectors, they change by the
inverse matrix. This is designed to guarantee that the linear function associated with the covector, the sum above, is the same no matter what the basis is.
The value of the Einstein convention is that it applies to other
vector spaces built from using the
tensor product and
duality
Duality may refer to:
Mathematics
* Duality (mathematics), a mathematical concept
** Dual (category theory), a formalization of mathematical duality
** Duality (optimization)
** Duality (order theory), a concept regarding binary relations
** Dual ...
. For example, , the tensor product of with itself, has a basis consisting of tensors of the form . Any tensor in can be written as:
:
, the dual of , has a basis , , …, which obeys the rule
:
where is the
Kronecker delta. As
:
the row/column coordinates on a matrix correspond to the upper/lower indices on the tensor product.
Common operations in this notation
In Einstein notation, the usual element reference
for the
th row and
th column of matrix
becomes
. We can then write the following operations in Einstein notation as follows.
Inner product (hence also
vector dot product
In mathematics, the dot product or scalar productThe term ''scalar product'' means literally "product with a scalar as a result". It is also used sometimes for other symmetric bilinear forms, for example in a pseudo-Euclidean space. is an algeb ...
)
Using an
orthogonal basis, the inner product is the sum of corresponding components multiplied together:
:
This can also be calculated by multiplying the covector on the vector.
Vector cross product
Again using an orthogonal basis (in 3 dimensions) the cross product intrinsically involves summations over permutations of components:
:
where
:
is the
Levi-Civita symbol, and is the generalized
Kronecker delta. Based on this definition of , there is no difference between and but the position of indices.
Matrix-vector multiplication
The product of a matrix with a column vector is :
:
equivalent to
:
This is a special case of matrix multiplication.
Matrix multiplication
The
matrix product of two matrices and is:
:
equivalent to
:
Trace
For a
square matrix
In mathematics, a square matrix is a matrix with the same number of rows and columns. An ''n''-by-''n'' matrix is known as a square matrix of order Any two square matrices of the same order can be added and multiplied.
Square matrices are often ...
, the trace is the sum of the diagonal elements, hence the sum over a common index .
Outer product
In linear algebra, the outer product of two coordinate vector
In linear algebra, a coordinate vector is a representation of a vector as an ordered list of numbers (a tuple) that describes the vector in terms of a particular ordered basis. An ea ...
The outer product of the column vector by the row vector yields an matrix :
:
Since and represent two ''different'' indices, there is no summation and the indices are not eliminated by the multiplication.
Raising and lowering indices
Given a
tensor, one can raise an index or lower an index by contracting the tensor with the
metric tensor
In the mathematical field of differential geometry, a metric tensor (or simply metric) is an additional structure on a manifold (such as a surface) that allows defining distances and angles, just as the inner product on a Euclidean space allows ...
, . For example, take the tensor , one can raise an index:
:
Or one can lower an index:
:
See also
*
Tensor
*
Abstract index notation
Abstract index notation (also referred to as slot-naming index notation) is a mathematical notation for tensors and spinors that uses indices to indicate their types, rather than their components in a particular basis. The indices are mere placeho ...
*
Bra–ket notation
In quantum mechanics, bra–ket notation, or Dirac notation, is used ubiquitously to denote quantum states. The notation uses angle brackets, and , and a vertical bar , to construct "bras" and "kets".
A ket is of the form , v \rangle. Mathema ...
*
Penrose graphical notation
*
Levi-Civita symbol
*
DeWitt notation
Physics often deals with classical models where the dynamical variables are a collection of functions
''α'' over a d-dimensional space/spacetime manifold ''M'' where ''α'' is the "flavor" index. This involves functionals over the ''φs, function ...
Notes
#This applies only for numerical indices. The situation is the opposite for
abstract indices. Then, vectors themselves carry upper abstract indices and covectors carry lower abstract indices, as per the example in the
introduction of this article. Elements of a basis of vectors may carry a lower ''numerical'' index and an upper ''abstract'' index.
References
Bibliography
* .
External links
*
*
{{tensors
Mathematical notation
Multilinear algebra
Tensors
Riemannian geometry
Mathematical physics
Albert Einstein