In vector calculus, the gradient of a scalar-valued differentiable function ''f'' of several variables is the vector field (or vector-valued function) ∇''f'' whose value at a point ''p'' gives the direction and the rate of fastest increase. If the gradient of a function is non-zero at a point ''p'', the direction of the gradient is the direction in which the function increases most quickly from ''p'', and the magnitude of the gradient is the rate of increase in that direction, the greatest absolute directional derivative. Further, a point where the gradient is the zero vector is known as a stationary point. The gradient thus plays a fundamental role in optimization theory, where it is used to maximize a function by gradient ascent. In coordinate-free terms, the gradient of a function <math>f(\mathbf{r})</math> may be defined by:
:<math>df = \nabla f \cdot d\mathbf{r}</math>
where ''df'' is the total infinitesimal change in ''f'' for an infinitesimal displacement <math>d\mathbf{r}</math>, and is seen to be maximal when <math>d\mathbf{r}</math> is in the direction of the gradient ∇''f''. The nabla symbol ∇, written as an upside-down triangle and pronounced "del", denotes the vector differential operator.
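Gradient ascent can be illustrated with a short sketch. The function ''f''(''x'', ''y'') = −(''x'' − 1)<sup>2</sup> − (''y'' + 2)<sup>2</sup>, the step size and the iteration count below are assumptions chosen for illustration only; the sketch repeatedly moves a small step in the direction of the gradient and should approach the maximizer (1, −2).

```python
def grad_f(x, y):
    """Gradient of the assumed example f(x, y) = -(x - 1)**2 - (y + 2)**2."""
    return (-2.0 * (x - 1.0), -2.0 * (y + 2.0))


def gradient_ascent(start, step=0.1, iterations=200):
    """Repeatedly take a small step in the direction of the gradient."""
    x, y = start
    for _ in range(iterations):
        gx, gy = grad_f(x, y)
        x += step * gx
        y += step * gy
    return x, y


# Starting far from the maximum, the iterates move towards (1, -2).
peak = gradient_ascent((5.0, 5.0))
```

A fixed step size is the simplest choice; practical optimizers usually adapt it.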
When a coordinate system is used in which the basis vectors are not functions of position, the gradient is given by the vector whose components are the partial derivatives of ''f'' at ''p''. That is, for <math>f \colon \mathbb{R}^n \to \mathbb{R}</math>, its gradient <math>\nabla f \colon \mathbb{R}^n \to \mathbb{R}^n</math> is defined at the point <math>p = (x_1, \ldots, x_n)</math> in ''n''-dimensional space as the vector
:<math>\nabla f(p) = \begin{bmatrix} \dfrac{\partial f}{\partial x_1}(p) \\ \vdots \\ \dfrac{\partial f}{\partial x_n}(p) \end{bmatrix}.</math>
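The component-wise definition suggests a simple numerical check. The following sketch approximates each partial derivative by a central difference; the helper name <code>numerical_gradient</code>, the example function and the step <code>h</code> are illustrative assumptions, not part of any particular library.

```python
import math


def numerical_gradient(f, point, h=1e-6):
    """Approximate each partial derivative of f at `point` by a central difference."""
    grad = []
    for i in range(len(point)):
        forward = list(point)
        backward = list(point)
        forward[i] += h
        backward[i] -= h
        grad.append((f(forward) - f(backward)) / (2.0 * h))
    return grad


def f(p):
    """Assumed example: f(x, y) = x**2 * y + sin(y)."""
    x, y = p
    return x**2 * y + math.sin(y)


approx = numerical_gradient(f, [1.0, 2.0])
# Partial derivatives computed by hand: (2xy, x**2 + cos(y)) at (1, 2).
exact = [2.0 * 1.0 * 2.0, 1.0**2 + math.cos(2.0)]
```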
The gradient is dual to the total derivative ''df'': the value of the gradient at a point is a tangent vector (a vector at each point), while the value of the derivative at a point is a ''co''tangent vector (a linear functional on vectors). They are related in that the dot product of the gradient of ''f'' at a point ''p'' with another tangent vector '''v''' equals the directional derivative of ''f'' at ''p'' along '''v'''; that is, <math>\nabla f(p) \cdot \mathbf{v} = \frac{\partial f}{\partial \mathbf{v}}(p) = df_p(\mathbf{v})</math>.
The gradient admits multiple generalizations to more general functions on manifolds; see § Generalizations.
Motivation
Consider a room where the temperature is given by a scalar field, ''T'', so at each point (''x'', ''y'', ''z'') the temperature is ''T''(''x'', ''y'', ''z''), independent of time. At each point in the room, the gradient of ''T'' at that point will show the direction in which the temperature rises most quickly, moving away from (''x'', ''y'', ''z''). The magnitude of the gradient will determine how fast the temperature rises in that direction.
Consider a surface whose height above sea level at point (''x'', ''y'') is ''H''(''x'', ''y''). The gradient of ''H'' at a point is a plane vector pointing in the direction of the steepest slope or grade at that point. The steepness of the slope at that point is given by the magnitude of the gradient vector.
The gradient can also be used to measure how a scalar field changes in other directions, rather than just the direction of greatest change, by taking a dot product. Suppose that the steepest slope on a hill is 40%. A road going directly uphill has slope 40%, but a road going around the hill at an angle will have a shallower slope. For example, if the road is at a 60° angle from the uphill direction (when both directions are projected onto the horizontal plane), then the slope along the road will be the dot product between the gradient vector and a unit vector along the road, namely 40% times the cosine of 60°, or 20%.
More generally, if the hill height function ''H'' is differentiable, then the gradient of ''H'' dotted with a unit vector gives the slope of the hill in the direction of the vector, the directional derivative of ''H'' along the unit vector.
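The hill example can be reproduced numerically. In this sketch the uphill direction is assumed to lie along the first horizontal axis, so the gradient is (0.40, 0); the function name <code>slope_along</code> is hypothetical.

```python
import math


def slope_along(gradient, direction):
    """Directional derivative: dot product of the gradient with a unit vector."""
    norm = math.hypot(direction[0], direction[1])
    unit = (direction[0] / norm, direction[1] / norm)
    return gradient[0] * unit[0] + gradient[1] * unit[1]


# Steepest slope of 40%, assumed to point along the first axis.
gradient = (0.40, 0.0)

# A road at 60 degrees from the uphill direction in the horizontal plane.
angle = math.radians(60.0)
road = (math.cos(angle), math.sin(angle))

road_slope = slope_along(gradient, road)  # 0.40 * cos(60 deg), about 0.20
```

A road perpendicular to the uphill direction follows a contour line, and the same function returns a slope of approximately zero for it.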
Notation
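The gradient of a function ''f'' at point ''a'' is usually written as <math>\nabla f(a)</math>. It may also be denoted by any of the following:
* <math>\vec{\nabla} f(a)</math>: to emphasize the vector nature of the result.
* <math>\operatorname{grad} f</math>
* <math>\partial_i f</math> and <math>f_i</math>: Einstein notation.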
Definition
The gradient (or gradient vector field) of a scalar function <math>f(x_1, x_2, x_3, \dots, x_n)</math> is denoted <math>\nabla f</math> or <math>\vec{\nabla} f</math>, where ∇ (nabla) denotes the vector differential operator, del. The notation <math>\operatorname{grad} f</math> is also commonly used to represent the gradient. The gradient of ''f'' is defined as the unique vector field whose dot product with any vector '''v''' at each point ''x'' is the directional derivative of ''f'' along '''v'''. That is,
:<math>\big(\nabla f(x)\big) \cdot \mathbf{v} = D_{\mathbf{v}} f(x),</math>
where the right-hand side is the directional derivative, which can be represented in many ways. Formally, the derivative is ''dual'' to the gradient; see § Relationship with derivative.
When a function also depends on a parameter such as time, the gradient often refers simply to the vector of its spatial derivatives only (see
Spatial gradient).
The magnitude and direction of the gradient vector are
independent
of the particular
coordinate representation.
Cartesian coordinates
In the three-dimensional Cartesian coordinate system with a Euclidean metric, the gradient, if it exists, is given by:
:<math>\nabla f = \frac{\partial f}{\partial x} \mathbf{i} + \frac{\partial f}{\partial y} \mathbf{j} + \frac{\partial f}{\partial z} \mathbf{k},</math>
where '''i''', '''j''', '''k''' are the standard unit vectors in the directions of the ''x'', ''y'' and ''z'' coordinates, respectively. For example, the gradient of the function
:<math>f(x, y, z) = 2x + 3y^2 - \sin(z)</math>
is
:<math>\nabla f = 2\mathbf{i} + 6y\mathbf{j} - \cos(z)\mathbf{k}.</math>
In some applications it is customary to represent the gradient as a
row vector or
column vector
of its components in a rectangular coordinate system; this article follows the convention of the gradient being a column vector, while the derivative is a row vector.
Cylindrical and spherical coordinates
In cylindrical coordinates with a Euclidean metric, the gradient is given by:
:<math>\nabla f(\rho, \varphi, z) = \frac{\partial f}{\partial \rho}\mathbf{e}_\rho + \frac{1}{\rho}\frac{\partial f}{\partial \varphi}\mathbf{e}_\varphi + \frac{\partial f}{\partial z}\mathbf{e}_z,</math>
where ''ρ'' is the axial distance, ''φ'' is the azimuthal or azimuth angle, ''z'' is the axial coordinate, and <math>\mathbf{e}_\rho</math>, <math>\mathbf{e}_\varphi</math> and <math>\mathbf{e}_z</math> are unit vectors pointing along the coordinate directions.
In spherical coordinates, the gradient is given by:
:<math>\nabla f(r, \theta, \varphi) = \frac{\partial f}{\partial r}\mathbf{e}_r + \frac{1}{r}\frac{\partial f}{\partial \theta}\mathbf{e}_\theta + \frac{1}{r \sin\theta}\frac{\partial f}{\partial \varphi}\mathbf{e}_\varphi,</math>
where ''r'' is the radial distance, ''φ'' is the azimuthal angle and ''θ'' is the polar angle, and <math>\mathbf{e}_r</math>, <math>\mathbf{e}_\theta</math> and <math>\mathbf{e}_\varphi</math> are again local unit vectors pointing in the coordinate directions (that is, the normalized covariant basis).
For the gradient in other
orthogonal coordinate systems, see
Orthogonal coordinates (Differential operators in three dimensions).
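As a consistency check, the cylindrical components (the radial derivative, the angular derivative divided by ''ρ'', and the axial derivative) can be compared with the Cartesian gradient of the same function. The function ''f'' = ''xy'' + ''z''<sup>2</sup>, the sample point and the helper names below are assumptions made for this sketch.

```python
import math


def cylindrical_gradient(rho, phi, z):
    """Components along (e_rho, e_phi, e_z) for f = rho**2*cos(phi)*sin(phi) + z**2."""
    df_drho = 2.0 * rho * math.cos(phi) * math.sin(phi)
    df_dphi = rho**2 * math.cos(2.0 * phi)
    df_dz = 2.0 * z
    # The angular derivative is divided by rho to obtain a unit-vector component.
    return (df_drho, df_dphi / rho, df_dz)


def to_cartesian(components, phi):
    """Expand cylindrical unit-vector components in the Cartesian basis."""
    g_rho, g_phi, g_z = components
    gx = g_rho * math.cos(phi) - g_phi * math.sin(phi)
    gy = g_rho * math.sin(phi) + g_phi * math.cos(phi)
    return (gx, gy, g_z)


rho, phi, z = 2.0, 0.7, -1.5
x, y = rho * math.cos(phi), rho * math.sin(phi)

from_cylindrical = to_cartesian(cylindrical_gradient(rho, phi, z), phi)
from_cartesian = (y, x, 2.0 * z)  # gradient of f = x*y + z**2 taken directly
```

Both routes should give the same vector, which illustrates that the gradient does not depend on the coordinate representation.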
General coordinates
We consider general coordinates, which we write as <math>x^1, \ldots, x^i, \ldots, x^n</math>, where ''n'' is the number of dimensions of the domain. Here, the upper index refers to the position in the list of the coordinate or component, so <math>x^2</math> refers to the second component, not the quantity ''x'' squared. The index variable ''i'' refers to an arbitrary element <math>x^i</math>. Using Einstein notation, the gradient can then be written as:
:<math>\nabla f = \frac{\partial f}{\partial x^{i}} g^{ij} \mathbf{e}_j</math>
(note that its dual is <math>\mathrm{d}f = \frac{\partial f}{\partial x^{i}} \mathbf{e}^i</math>), where <math>\mathbf{e}_i = \partial \mathbf{x} / \partial x^i</math> and <math>\mathbf{e}^i = \mathrm{d}x^i</math> refer to the unnormalized local covariant and contravariant bases respectively, <math>g^{ij}</math> is the inverse metric tensor, and the Einstein summation convention implies summation over ''i'' and ''j''.
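If the coordinates are orthogonal we can easily express the gradient (and the differential) in terms of the normalized bases, which we refer to as <math>\hat{\mathbf{e}}_i</math> and <math>\hat{\mathbf{e}}^i</math>, using the scale factors (also known as Lamé coefficients) <math>h_i = \lVert \mathbf{e}_i \rVert = \sqrt{g_{ii}} = 1 / \lVert \mathbf{e}^i \rVert</math>:
:<math>\nabla f = \sum_{i=1}^n \frac{\partial f}{\partial x^{i}} \frac{1}{h_i} \hat{\mathbf{e}}_i</math>
(and <math>\mathrm{d}f = \sum_{i=1}^n \frac{\partial f}{\partial x^{i}} \frac{1}{h_i} \hat{\mathbf{e}}^i</math>), where we cannot use Einstein notation, since it is impossible to avoid the repetition of more than two indices. Despite the use of upper and lower indices, <math>\hat{\mathbf{e}}_i</math>, <math>\hat{\mathbf{e}}^i</math>, and <math>h_i</math> are neither contravariant nor covariant.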
The latter expression evaluates to the expressions given above for cylindrical and spherical coordinates.
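The scale-factor form lends itself to a generic numerical sketch: each partial derivative with respect to a coordinate is divided by the corresponding scale factor. The example below assumes spherical coordinates (''r'', ''θ'', ''φ'') with scale factors (1, ''r'', ''r'' sin ''θ'') and the linear function ''x'' + 2''y'' + 3''z'', whose Cartesian gradient has length √14; the helper names are hypothetical.

```python
import math


def f_spherical(r, theta, phi):
    """Assumed example f = x + 2y + 3z written in spherical coordinates."""
    x = r * math.sin(theta) * math.cos(phi)
    y = r * math.sin(theta) * math.sin(phi)
    z = r * math.cos(theta)
    return x + 2.0 * y + 3.0 * z


def orthogonal_gradient(f, coords, scale_factors, h=1e-6):
    """Components (1/h_i) * df/dx^i along the normalized basis vectors."""
    components = []
    for i in range(len(coords)):
        up = list(coords)
        down = list(coords)
        up[i] += h
        down[i] -= h
        partial = (f(*up) - f(*down)) / (2.0 * h)  # central difference
        components.append(partial / scale_factors[i])
    return components


r, theta, phi = 2.0, 0.9, 0.4
h_spherical = (1.0, r, r * math.sin(theta))

components = orthogonal_gradient(f_spherical, (r, theta, phi), h_spherical)
# The normalized basis is orthonormal, so the length matches the Cartesian one.
norm = math.sqrt(sum(c * c for c in components))
```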
Relationship with derivative
Relationship with total derivative
The gradient is closely related to the total derivative (total differential) ''df'': they are transpose (dual) to each other. Using the convention that vectors in <math>\mathbb{R}^n</math> are represented by column vectors, and that covectors (linear maps <math>\mathbb{R}^n \to \mathbb{R}</math>) are represented by row vectors, the gradient ∇''f'' and the derivative ''df'' are expressed as a column and row vector, respectively, with the same components, but transpose of each other:
:<math>\nabla f(p) = \begin{bmatrix} \dfrac{\partial f}{\partial x_1}(p) \\ \vdots \\ \dfrac{\partial f}{\partial x_n}(p) \end{bmatrix};</math>
:<math>df_p = \begin{bmatrix} \dfrac{\partial f}{\partial x_1}(p) & \cdots & \dfrac{\partial f}{\partial x_n}(p) \end{bmatrix}.</math>
While these both have the same components, they differ in what kind of mathematical object they represent: at each point, the derivative is a cotangent vector, a linear form (covector) which expresses how much the (scalar) output changes for a given infinitesimal change in (vector) input, while at each point, the gradient is a tangent vector, which represents an infinitesimal change in (vector) input. In symbols, the gradient is an element of the tangent space at a point, <math>\nabla f(p) \in T_p \mathbb{R}^n</math>, while the derivative is a map from the tangent space to the real numbers, <math>df_p \colon T_p \mathbb{R}^n \to \mathbb{R}</math>. The tangent spaces at each point of <math>\mathbb{R}^n</math> can be "naturally" identified with the vector space <math>\mathbb{R}^n</math> itself, and similarly the cotangent space at each point can be naturally identified with the dual vector space <math>(\mathbb{R}^n)^*</math> of covectors; thus the value of the gradient at a point can be thought of as a vector in the original <math>\mathbb{R}^n</math>, not just as a tangent vector.
Computationally, given a tangent vector, the vector can be ''multiplied'' by the derivative (as matrices), which is equal to taking the dot product with the gradient:
:<math>(df_p)(v) = \begin{bmatrix} \dfrac{\partial f}{\partial x_1}(p) & \cdots & \dfrac{\partial f}{\partial x_n}(p) \end{bmatrix} \begin{bmatrix} v_1 \\ \vdots \\ v_n \end{bmatrix} = \sum_{i=1}^n \frac{\partial f}{\partial x_i}(p)\, v_i = \nabla f(p) \cdot v.</math>
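A small numerical sketch makes the identity concrete: multiplying the row of partial derivatives by a column vector gives the same number as the rate of change of the function along that vector. The function ''f''(''x'', ''y'') = ''x''<sup>2</sup>''y'', the point (1, 2) and the vector (3, −1) are assumptions chosen for illustration.

```python
def f(x, y):
    """Assumed example function."""
    return x**2 * y


def derivative_row(x, y):
    """Row vector of partial derivatives (df/dx, df/dy), computed by hand."""
    return [2.0 * x * y, x**2]


p = (1.0, 2.0)
v = [3.0, -1.0]

# Row vector times column vector, i.e. the dot product with the gradient.
via_matrix_product = sum(d * c for d, c in zip(derivative_row(*p), v))

# Symmetric difference quotient of f along v for comparison.
t = 1e-6
via_difference_quotient = (
    f(p[0] + t * v[0], p[1] + t * v[1]) - f(p[0] - t * v[0], p[1] - t * v[1])
) / (2.0 * t)
```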
Differential or (exterior) derivative
The best linear approximation to a differentiable function
:<math>f \colon \mathbb{R}^n \to \mathbb{R}</math>
at a point ''x'' in <math>\mathbb{R}^n</math> is a linear map from <math>\mathbb{R}^n</math> to <math>\mathbb{R}</math> which is often denoted by <math>df_x</math> or <math>Df(x)</math> and called the differential or total derivative of ''f'' at ''x''. The function ''df'', which maps ''x'' to <math>df_x</math>, is called the total differential or exterior derivative of ''f'' and is an example of a differential 1-form.
Much as the derivative of a function of a single variable represents the slope of the tangent to the graph of the function, the directional derivative of a function in several variables represents the slope of the tangent hyperplane in the direction of the vector.
The gradient is related to the differential by the formula
:<math>(\nabla f)_x \cdot v = df_x(v)</math>
for any <math>v \in \mathbb{R}^n</math>, where <math>\cdot</math> is the dot product: taking the dot product of a vector with the gradient is the same as taking the directional derivative along the vector.
If <math>\mathbb{R}^n</math> is viewed as the space of (dimension ''n'') column vectors (of real numbers), then one can regard ''df'' as the row vector with components
:<math>\left( \frac{\partial f}{\partial x_1}, \dots, \frac{\partial f}{\partial x_n} \right),</math>
so that <math>df_x(v)</math> is given by matrix multiplication. Assuming the standard Euclidean metric on <math>\mathbb{R}^n</math>, the gradient is then the corresponding column vector, that is,
:<math>(\nabla f)_i = df^\mathsf{T}_i.</math>
Linear approximation to a function
The best linear approximation to a function can be expressed in terms of the gradient, rather than the derivative. The gradient of a function ''f'' from the Euclidean space <math>\mathbb{R}^n</math> to <math>\mathbb{R}</math> at any particular point <math>x_0</math> in <math>\mathbb{R}^n</math> characterizes the best linear approximation to ''f'' at <math>x_0</math>. The approximation is as follows:
:<math>f(x) \approx f(x_0) + (\nabla f)_{x_0} \cdot (x - x_0)</math>
for ''x'' close to <math>x_0</math>, where <math>(\nabla f)_{x_0}</math> is the gradient of ''f'' computed at <math>x_0</math>, and the dot denotes the dot product on <math>\mathbb{R}^n</math>. This equation is equivalent to the first two terms in the multivariable Taylor series expansion of ''f'' at <math>x_0</math>.
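The approximation can be tried on a concrete case. The sketch below assumes the function ''f''(''x'', ''y'') = ''x''<sup>2</sup> + ''xy'' and the base point (1, 2), where the gradient is (4, 1); it compares the first-order estimate with the true value at a nearby point.

```python
def f(x, y):
    """Assumed example function."""
    return x**2 + x * y


def grad_f(x, y):
    """Gradient of the example, computed by hand: (2x + y, x)."""
    return (2.0 * x + y, x)


def linear_approximation(x0, x):
    """f(x0) + grad f(x0) . (x - x0)."""
    g = grad_f(*x0)
    return f(*x0) + g[0] * (x[0] - x0[0]) + g[1] * (x[1] - x0[1])


x0 = (1.0, 2.0)
x = (1.01, 1.98)

estimate = linear_approximation(x0, x)  # 3 + 4*0.01 + 1*(-0.02)
exact = f(*x)
error = abs(exact - estimate)           # second order in the displacement
```

Halving the displacement should reduce the error by roughly a factor of four, as expected of a first-order approximation.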
Relationship with Fréchet derivative
Let ''U'' be an open set in <math>\mathbb{R}^n</math>. If the function <math>f \colon U \to \mathbb{R}</math> is differentiable, then the differential of ''f'' is the Fréchet derivative of ''f''. Thus ∇''f'' is a function from ''U'' to the space <math>\mathbb{R}^n</math> such that
:<math>\lim_{h \to 0} \frac{\left| f(x + h) - f(x) - \nabla f(x) \cdot h \right|}{\lVert h \rVert} = 0,</math>
where · is the dot product.
As a consequence, the usual properties of the derivative hold for the gradient, though the gradient is not a derivative itself, but rather dual to the derivative:
;Linearity
:The gradient is linear in the sense that if ''f'' and ''g'' are two real-valued functions differentiable at the point <math>a \in \mathbb{R}^n</math>, and ''α'' and ''β'' are two constants, then ''αf'' + ''βg'' is differentiable at ''a'', and moreover
::<math>\nabla \left( \alpha f + \beta g \right)(a) = \alpha \nabla f(a) + \beta \nabla g(a).</math>
;Product rule
:If ''f'' and ''g'' are real-valued functions differentiable at a point <math>a \in \mathbb{R}^n</math>, then the product rule asserts that the product ''fg'' is differentiable at ''a'', and
::<math>\nabla (fg)(a) = f(a) \nabla g(a) + g(a) \nabla f(a).</math>
;Chain rule
:Suppose that <math>f \colon A \to \mathbb{R}</math> is a real-valued function defined on a subset ''A'' of <math>\mathbb{R}^n</math>, and that ''f'' is differentiable at a point ''a''. There are two forms of the chain rule applying to the gradient. First, suppose that the function ''g'' is a parametric curve; that is, a function <math>g \colon I \to \mathbb{R}^n</math> maps a subset <math>I \subset \mathbb{R}</math> into <math>\mathbb{R}^n</math>. If ''g'' is differentiable at a point <math>c \in I</math> such that <math>g(c) = a</math>, then
::<math>(f \circ g)'(c) = \nabla f(a) \cdot g'(c),</math>
:where ∘ is the composition operator: <math>(f \circ g)(x) = f(g(x))</math>.
:More generally, if instead <math>I \subset \mathbb{R}^k</math>, then the following holds:
::<math>\nabla (f \circ g)(c) = \big( Dg(c) \big)^\mathsf{T} \big( \nabla f(a) \big),</math>
:where <math>(Dg)^\mathsf{T}</math> denotes the transpose Jacobian matrix.
:For the second form of the chain rule, suppose that <math>h \colon I \to \mathbb{R}</math> is a real-valued function on a subset ''I'' of <math>\mathbb{R}</math>, and that ''h'' is differentiable at the point <math>f(a) \in I</math>. Then
::<math>\nabla (h \circ f)(a) = h'\big( f(a) \big) \nabla f(a).</math>
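The product rule and the second form of the chain rule can be checked on a worked case. The sketch below assumes ''f''(''x'', ''y'') = ''x''<sup>2</sup> + ''y'', ''g''(''x'', ''y'') = ''xy'' and ''h''(''t'') = ''t''<sup>2</sup>, and compares the rule-based gradients with gradients obtained by expanding the expressions and differentiating by hand.

```python
def f(x, y):
    return x**2 + y


def grad_f(x, y):
    return (2.0 * x, 1.0)


def g(x, y):
    return x * y


def grad_g(x, y):
    return (y, x)


def grad_product(x, y):
    """Product rule: grad(fg) = f grad g + g grad f."""
    gf, gg = grad_f(x, y), grad_g(x, y)
    return tuple(f(x, y) * b + g(x, y) * a for a, b in zip(gf, gg))


def grad_product_expanded(x, y):
    """Gradient of fg = x**3*y + x*y**2, differentiated term by term."""
    return (3.0 * x**2 * y + y**2, x**3 + 2.0 * x * y)


def grad_square_of_f(x, y):
    """Chain rule with h(t) = t**2: grad(h o f) = h'(f) grad f = 2 f grad f."""
    return tuple(2.0 * f(x, y) * a for a in grad_f(x, y))


def grad_square_expanded(x, y):
    """Gradient of (x**2 + y)**2, differentiated directly."""
    return (4.0 * x * (x**2 + y), 2.0 * (x**2 + y))


a = (1.0, 2.0)
product_by_rule = grad_product(*a)          # (10.0, 5.0)
product_direct = grad_product_expanded(*a)  # (10.0, 5.0)
chain_by_rule = grad_square_of_f(*a)        # (12.0, 6.0)
chain_direct = grad_square_expanded(*a)     # (12.0, 6.0)
```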
Further properties and applications
Level sets
A level surface, or isosurface, is the set of all points where some function has a given value.
If ''f'' is differentiable, then the dot product <math>(\nabla f)_x \cdot v</math> of the gradient at a point ''x'' with a vector ''v'' gives the directional derivative of ''f'' at ''x'' in the direction ''v''. Along any vector tangent to a level set the function does not change to first order, so this directional derivative vanishes. It follows that in this case the gradient of ''f'' is orthogonal to the level sets of ''f''. For example, a level surface in three-dimensional space is defined by an equation of the form <math>F(x, y, z) = c</math>. The gradient of ''F'' is then normal to the surface.
More generally, any embedded hypersurface in a Riemannian manifold can be cut out by an equation of the form <math>F(P) = 0</math> such that ''dF'' is nowhere zero. The gradient of ''F'' is then normal to the hypersurface.
Similarly, an affine algebraic hypersurface may be defined by an equation <math>F(x_1, \ldots, x_n) = 0</math>, where ''F'' is a polynomial. The gradient of ''F'' is zero at a singular point of the hypersurface (this is the definition of a singular point). At a non-singular point, it is a nonzero normal vector.
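The orthogonality of the gradient to a level surface can be checked on a sphere. The sketch below assumes ''F''(''x'', ''y'', ''z'') = ''x''<sup>2</sup> + ''y''<sup>2</sup> + ''z''<sup>2</sup> with the level value 9 and the point (1, 2, 2); the two tangent vectors were chosen by hand to be perpendicular to the radius at that point.

```python
def F(x, y, z):
    """Assumed example: squared distance from the origin."""
    return x**2 + y**2 + z**2


def grad_F(x, y, z):
    return (2.0 * x, 2.0 * y, 2.0 * z)


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


p = (1.0, 2.0, 2.0)  # lies on the level surface F = 9
tangents = [(2.0, -1.0, 0.0), (0.0, 1.0, -1.0)]  # tangent to the sphere at p

normal = grad_F(*p)
# Each product is the directional derivative of F along a tangent direction.
products = [dot(normal, t) for t in tangents]
```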
Conservative vector fields and the gradient theorem
The gradient of a function is called a gradient field. A (continuous) gradient field is always a conservative vector field: its line integral along any path depends only on the endpoints of the path, and can be evaluated by the gradient theorem (the fundamental theorem of calculus for line integrals). Conversely, a (continuous) conservative vector field is always the gradient of a function.
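Path independence can be observed numerically. The sketch below assumes the potential ''f''(''x'', ''y'') = ''x''<sup>2</sup>''y'' and integrates its gradient along two different paths from (0, 0) to (1, 2) with a simple midpoint rule; both results should be close to ''f''(1, 2) − ''f''(0, 0) = 2. The path functions and the number of segments are illustrative choices.

```python
def grad_f(x, y):
    """Gradient of the assumed potential f(x, y) = x**2 * y."""
    return (2.0 * x * y, x**2)


def line_integral(path, n=10000):
    """Midpoint-rule line integral of grad_f along path(t), t in [0, 1]."""
    total = 0.0
    for k in range(n):
        x0, y0 = path(k / n)
        x1, y1 = path((k + 1) / n)
        gx, gy = grad_f(0.5 * (x0 + x1), 0.5 * (y0 + y1))
        total += gx * (x1 - x0) + gy * (y1 - y0)
    return total


def straight(t):
    """Straight segment from (0, 0) to (1, 2)."""
    return (t, 2.0 * t)


def curved(t):
    """A curved path with the same endpoints."""
    return (t**2, 2.0 * t)


along_straight = line_integral(straight)
along_curved = line_integral(curved)
```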
Generalizations
Jacobian
The Jacobian matrix is the generalization of the gradient for vector-valued functions of several variables and differentiable maps between Euclidean spaces or, more generally, manifolds. A further generalization for a function between Banach spaces is the Fréchet derivative.
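Suppose <math>\mathbf{f} \colon \mathbb{R}^n \to \mathbb{R}^m</math> is a function such that each of its first-order partial derivatives exists on <math>\mathbb{R}^n</math>. Then the Jacobian matrix of '''f''' is defined to be an ''m'' × ''n'' matrix, denoted by <math>\mathbf{J}_\mathbf{f}(\mathbf{x})</math> or simply <math>\mathbf{J}</math>. The (''i'', ''j'')th entry is <math>\mathbf{J}_{ij} = \partial f_i / \partial x_j</math>. Explicitly,
:<math>\mathbf{J} = \begin{bmatrix} \nabla^\mathsf{T} f_1 \\ \vdots \\ \nabla^\mathsf{T} f_m \end{bmatrix} = \begin{bmatrix} \dfrac{\partial f_1}{\partial x_1} & \cdots & \dfrac{\partial f_1}{\partial x_n} \\ \vdots & \ddots & \vdots \\ \dfrac{\partial f_m}{\partial x_1} & \cdots & \dfrac{\partial f_m}{\partial x_n} \end{bmatrix}.</math>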
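In other words, each row of the Jacobian matrix holds the gradient of one component function. The following sketch approximates the matrix by central differences for an assumed map from the plane to the plane; the helper name <code>numerical_jacobian</code> and the example map are hypothetical.

```python
import math


def f(p):
    """Assumed example map: f(x, y) = (x**2 * y, 5x + sin(y))."""
    x, y = p
    return [x**2 * y, 5.0 * x + math.sin(y)]


def numerical_jacobian(f, point, h=1e-6):
    """m-by-n matrix whose (i, j) entry approximates d f_i / d x_j."""
    m, n = len(f(point)), len(point)
    jac = [[0.0] * n for _ in range(m)]
    for j in range(n):
        up = list(point)
        down = list(point)
        up[j] += h
        down[j] -= h
        f_up, f_down = f(up), f(down)
        for i in range(m):
            jac[i][j] = (f_up[i] - f_down[i]) / (2.0 * h)
    return jac


point = [1.0, 2.0]
jacobian = numerical_jacobian(f, point)
# Rows computed by hand: the gradients of the two component functions.
expected = [[2.0 * 1.0 * 2.0, 1.0**2], [5.0, math.cos(2.0)]]
```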
Gradient of a vector field
Since the total derivative of a vector field is a linear mapping from vectors to vectors, it is a tensor quantity.
In rectangular coordinates, the gradient of a vector field <math>\mathbf{f} = (f^1, f^2, f^3)</math> is defined by:
:<math>\nabla \mathbf{f} = g^{jk} \frac{\partial f^i}{\partial x^j} \mathbf{e}_i \otimes \mathbf{e}_k,</math>
(where the Einstein summation notation is used and the tensor product of the vectors <math>\mathbf{e}_i</math> and <math>\mathbf{e}_k</math> is a dyadic tensor of type (2,0)). Overall, this expression equals the transpose of the Jacobian matrix:
:<math>\frac{\partial f^i}{\partial x^j} = \frac{\partial (f^1, f^2, f^3)}{\partial (x^1, x^2, x^3)}.</math>
In curvilinear coordinates, or more generally on a curved manifold, the gradient involves Christoffel symbols:
:<math>\nabla \mathbf{f} = g^{jk} \left( \frac{\partial f^i}{\partial x^j} + {\Gamma^i}_{jl} f^l \right) \mathbf{e}_i \otimes \mathbf{e}_k,</math>
where <math>g^{jk}</math> are the components of the inverse metric tensor and the <math>\mathbf{e}_i</math> are the coordinate basis vectors.
Expressed more invariantly, the gradient of a vector field '''f''' can be defined by the Levi-Civita connection and metric tensor:
:<math>\nabla^a f^b = g^{ac} \nabla_c f^b,</math>
where <math>\nabla_c</math> is the connection.
Riemannian manifolds
For any smooth function ''f'' on a Riemannian manifold (''M'', ''g''), the gradient of ''f'' is the vector field ∇''f'' such that for any vector field ''X'',
:<math>g(\nabla f, X) = \partial_X f,</math>
that is,
:<math>g_x\big( (\nabla f)_x, X_x \big) = (\partial_X f)(x),</math>
where <math>g_x(\cdot, \cdot)</math> denotes the inner product of tangent vectors at ''x'' defined by the metric ''g'' and <math>\partial_X f</math> is the function that takes any point <math>x \in M</math> to the directional derivative of ''f'' in the direction ''X'', evaluated at ''x''. In other words, in a coordinate chart ''φ'' from an open subset of ''M'' to an open subset of <math>\mathbb{R}^n</math>, <math>(\partial_X f)(x)</math> is given by:
:<math>\sum_{j=1}^n X^{j}\big( \varphi(x) \big) \frac{\partial}{\partial x_{j}} (f \circ \varphi^{-1}) \Bigg|_{\varphi(x)},</math>
where <math>X^j</math> denotes the ''j''th component of ''X'' in this coordinate chart.
So, the local form of the gradient takes the form:
:<math>\nabla f = g^{ik} \frac{\partial f}{\partial x^k} \mathbf{e}_i.</math>
Generalizing the case <math>M = \mathbb{R}^n</math>, the gradient of a function is related to its exterior derivative, since
:<math>(\partial_X f)(x) = (df)_x(X_x).</math>
More precisely, the gradient ∇''f'' is the vector field associated to the differential 1-form ''df'' using the musical isomorphism
:<math>\sharp = \sharp^g \colon T^*M \to TM</math>
(called "sharp") defined by the metric ''g''. The relation between the exterior derivative and the gradient of a function on <math>\mathbb{R}^n</math> is a special case of this in which the metric is the flat metric given by the dot product.
See also
* Curl
* Divergence
* Four-gradient
* Hessian matrix
* Skew gradient