mathematics Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...

, a change of variables is a basic technique used to simplify problems in which the original variables are replaced with functions of other variables. The intent is that when expressed in new variables, the problem may become simpler, or equivalent to a better understood problem. Change of variables is an operation that is related to substitution. However these are different operations, as can be seen when considering differentiation (

chain rule In calculus, the chain rule is a formula that expresses the derivative of the Function composition, composition of two differentiable functions and in terms of the derivatives of and . More precisely, if h=f\circ g is the function such that h ...

) or integration (

integration by substitution In calculus, integration by substitution, also known as ''u''-substitution, reverse chain rule or change of variables, is a method for evaluating integrals and antiderivatives. It is the counterpart to the chain rule for differentiation, and c ...

). A very simple example of a useful variable change can be seen in the problem of finding the roots of the sixth-degree polynomial: :

x^6 - 9 x^3 + 8 = 0.

Sixth-degree polynomial equations are generally impossible to solve in terms of radicals (see

Abel–Ruffini theorem In mathematics, the Abel–Ruffini theorem (also known as Abel's impossibility theorem) states that there is no solution in radicals to general polynomial equations of degree five or higher with arbitrary coefficients. Here, ''general'' means t ...

). This particular equation, however, may be written :

(x^3)^2-9(x^3)+8=0

(this is a simple case of a polynomial decomposition). Thus the equation may be simplified by defining a new variable

u = x^3

. Substituting ''x'' by

\sqrt /math> into the polynomial gives

: u^2 - 9 u + 8 = 0 , which is just a

quadratic equation In mathematics, a quadratic equation () is an equation that can be rearranged in standard form as ax^2 + bx + c = 0\,, where the variable (mathematics), variable represents an unknown number, and , , and represent known numbers, where . (If and ...

with the two solutions: :

u = 1 \quad \text \quad u = 8.

The solutions in terms of the original variable are obtained by substituting ''x''³ back in for ''u'', which gives :

x^3 = 1 \quad \text \quad x^3 = 8.

Then, assuming that one is interested only in real solutions, the solutions of the original equation are :

x = (1)^ = 1 \quad \text \quad x = (8)^ = 2.

Simple example

Consider the system of equations :

xy+x+y=71

x^2y+xy^2=880

where

x

and

y

are positive integers with

x>y

. (Source: 1991 AIME) Solving this normally is not very difficult, but it may get a little tedious. However, we can rewrite the second equation as

xy(x+y)=880

. Making the substitutions

s=x+y

and

t=xy

reduces the system to

s+t=71, st=880

. Solving this gives

(s,t)=(16,55)

and

(s,t)=(55,16)

. Back-substituting the first ordered pair gives us

x+y=16, xy=55, x>y

, which gives the solution

(x,y)=(11,5).

Back-substituting the second ordered pair gives us

x+y=55, xy=16, x>y

, which gives no solutions. Hence the solution that solves the system is

(x,y)=(11,5)

Formal introduction

Let

A

B

smooth manifold In mathematics, a differentiable manifold (also differential manifold) is a type of manifold that is locally similar enough to a vector space to allow one to apply calculus. Any manifold can be described by a collection of charts (atlas). One may ...

s and let

\Phi: A \rightarrow B

be a

C^r

diffeomorphism In mathematics, a diffeomorphism is an isomorphism of differentiable manifolds. It is an invertible function that maps one differentiable manifold to another such that both the function and its inverse are continuously differentiable. Definit ...

between them, that is:

\Phi

is a

r

times continuously differentiable,

bijective In mathematics, a bijection, bijective function, or one-to-one correspondence is a function between two sets such that each element of the second set (the codomain) is the image of exactly one element of the first set (the domain). Equival ...

map from

A

B

with

r

times continuously differentiable inverse from

B

A

. Here

r

may be any natural number (or zero),

\infty

( smooth) or

\omega

( analytic). The map

\Phi

is called a ''regular coordinate transformation'' or ''regular variable substitution'', where ''regular'' refers to the

C^r

-ness of

\Phi

. Usually one will write

x = \Phi(y)

to indicate the replacement of the variable

x

by the variable

y

by substituting the value of

\Phi

y

for every occurrence of

x

Other examples

Coordinate transformation

Some systems can be more easily solved when switching to

polar coordinates In mathematics, the polar coordinate system specifies a given point (mathematics), point in a plane (mathematics), plane by using a distance and an angle as its two coordinate system, coordinates. These are *the point's distance from a reference ...

. Consider for example the equation :

U(x, y) := (x^2 + y^2) \sqrt = 0.

This may be a potential energy function for some physical problem. If one does not immediately see a solution, one might try the substitution :

\displaystyle (x, y) = \Phi(r, \theta)

given by

\displaystyle \Phi(r,\theta) = (r \cos(\theta), r \sin(\theta)).

Note that if

\theta

runs outside a

2\pi

-length interval, for example,

, 2\pi /math>, the map \Phi is no longer bijective. Therefore, \Phi should be limited to, for example (0, \infty] \times [0, 2\pi) . Notice how r = 0 is excluded, for \Phi is not bijective in the origin (\theta can take any value, the point will be mapped to (0, 0)). Then, replacing all occurrences of the original variables by the new expression (mathematics), expressions prescribed by \Phi and using the identity \sin^2 x + \cos^2 x = 1, we get

: V(r, \theta) = r^2 \sqrt = r^2 \sqrt = r^2\left, \sin\theta\. Now the solutions can be readily found: \sin(\theta) = 0, so \theta = 0 or \theta = \pi . Applying the inverse of \Phi shows that this is equivalent to y = 0 while x \not= 0 . Indeed, we see that for y = 0 the function vanishes, except for the origin.

Note that, had we allowed r = 0, the origin would also have been a solution, though it is not a solution to the original problem. Here the bijectivity of \Phi is crucial. The function is always positive (for x,y\in\reals), hence the absolute values.

Differentiation

The

is used to simplify complicated differentiation. For example, consider the problem of calculating the derivative :

\frac\sin(x^2).

Let

y = \sin u

with

u = x^2.

Then: :

&= (\cos u) (2x) \\ &= \left (\cos(x^2) \right) (2x) \\ &= 2x\cos(x^2) \end

Integration

Difficult integrals may often be evaluated by changing variables; this is enabled by the substitution rule and is analogous to the use of the chain rule above. Difficult integrals may also be solved by simplifying the integral using a change of variables given by the corresponding

Jacobian matrix and determinant In vector calculus, the Jacobian matrix (, ) of a vector-valued function of several variables is the matrix of all its first-order partial derivatives. If this matrix is square, that is, if the number of variables equals the number of compon ...

. Using the Jacobian determinant and the corresponding change of variable that it gives is the basis of coordinate systems such as polar, cylindrical, and spherical coordinate systems.

Change of variables formula in terms of Lebesgue measure

The following theorem allows us to relate integrals with respect to Lebesgue measure to an equivalent integral with respect to the pullback measure under a parameterization G. The proof is due to approximations of the Jordan content.

Suppose that $\Omega$ is an open subset of $\mathbb^n$ and $G:\Omega \to \mathbb^n$ is a $C^1$ diffeomorphism. * If $f$ is a Lebesgue measurable function on $G(\Omega)$ , then $f \circ G$ is Lebesgue measurable on $\Omega$ . If $f \geq 0$ or $f\in L^1(G(\Omega),m),$ then $\int_ f(x) dx = \int_\Omega f\circ G(x), \textD_xG, dx$ . * If $E\subset \Omega$ and $E$ is Lebesgue measurable, then $G(E)$ is Lebesgue measurable, then $m(G(E)) = \int_E , \textD_xG, dx$ .

As a corollary of this theorem, we may compute the Radon–Nikodym derivatives of both the pullback and pushforward measures of

m

under

T

= Pullback measure and transformation formula

= The pullback measure in terms of a transformation

T

is defined as

T^*\mu:= \mu(T(A))

. The change of variables formula for pullback measures is

\int_g d\mu = \int_\Omega g \circ T  dT^* \mu

. Pushforward measure and transformation formula The pushforward measure in terms of a transformation

T

, is defined as

T_*\mu:= \mu(T^(A))

. The change of variables formula for pushforward measures is

\int_g\circ T d\mu = \int_ g   dT_* \mu

. As a corollary of the change of variables formula for Lebesgue measure, we have that * Radon-Nikodym derivative of the pullback with respect to Lebesgue measure:

\frac(x) = , \textD_xT,

* Radon-Nikodym derivative of the pushforward with respect to Lebesgue measure:

\frac(x) = , \textD_xT^,

From which we may obtain * The change of variables formula for pullback measure:

\int_g dm = \int_\Omega g \circ T  dT^* m=\int_\Omega g \circ T  , \textD_xT, dm(x)

* The change of variables formula for pushforward measure:

\int_g dm = \int_ g \circ T^  dT_* m= \int_ g \circ T^, \textD_xT^, dm(x)

Differential equations

Variable changes for differentiation and integration are taught in elementary

calculus Calculus is the mathematics, mathematical study of continuous change, in the same way that geometry is the study of shape, and algebra is the study of generalizations of arithmetic operations. Originally called infinitesimal calculus or "the ...

and the steps are rarely carried out in full. The very broad use of variable changes is apparent when considering differential equations, where the independent variables may be changed using the

or the dependent variables are changed resulting in some differentiation to be carried out. Exotic changes, such as the mingling of dependent and independent variables in point and contact transformations, can be very complicated but allow much freedom. Very often, a general form for a change is substituted into a problem and parameters picked along the way to best simplify the problem.

Scaling and shifting

Probably the simplest change is the scaling and shifting of variables, that is replacing them with new variables that are "stretched" and "moved" by constant amounts. This is very common in practical applications to get physical parameters out of problems. For an ''n''^th order derivative, the change simply results in :

\frac = \frac \frac

where :

x = \hat x x_\text + x_\text

y = \hat y y_\text + y_\text.

This may be shown readily through the

and linearity of differentiation. This change is very common in practical applications to get physical parameters out of problems, for example, the boundary value problem :

\mu \frac = \frac \quad ; \quad u(0) = u(L) = 0

describes parallel fluid flow between flat solid walls separated by a distance δ; μ is the

viscosity Viscosity is a measure of a fluid's rate-dependent drag (physics), resistance to a change in shape or to movement of its neighboring portions relative to one another. For liquids, it corresponds to the informal concept of ''thickness''; for e ...

and

d p/d x

the

pressure gradient In hydrodynamics and hydrostatics, the pressure gradient (typically of air but more generally of any fluid) is a physical quantity that describes in which direction and at what rate the pressure increases the most rapidly around a particular locat ...

, both constants. By scaling the variables the problem becomes :

\frac = 1 \quad ; \quad \hat u(0) = \hat u(1) = 0

where :

y = \hat y L \qquad \text \qquad u = \hat u \frac \frac.

Scaling is useful for many reasons. It simplifies analysis both by reducing the number of parameters and by simply making the problem neater. Proper scaling may ''normalize'' variables, that is make them have a sensible unitless range such as 0 to 1. Finally, if a problem mandates numeric solution, the fewer the parameters the fewer the number of computations.

Momentum vs. velocity

Consider a system of equations :

m \dot x & = \frac \end

for a given function

H(x, v)

. The mass can be eliminated by the (trivial) substitution

\Phi(p) = 1/m \cdot p

. Clearly this is a bijective map from

\mathbb

\mathbb

. Under the substitution

v = \Phi(p)

the system becomes :

\dot x & = \frac \end

Lagrangian mechanics

Given a force field

\varphi(t, x, v)

, Newton's

equations of motion In physics, equations of motion are equations that describe the behavior of a physical system in terms of its motion as a function of time. More specifically, the equations of motion describe the behavior of a physical system as a set of mathem ...

are :

m \ddot x = \varphi(t, x, v).

Lagrange examined how these equations of motion change under an arbitrary substitution of variables

x = \Psi(t, y)

v = \frac + \frac \cdot w.

He found that the equations :

\frac = \frac \frac

are equivalent to Newton's equations for the function

L = T - V

, where ''T'' is the kinetic, and ''V'' the potential energy. In fact, when the substitution is chosen well (exploiting for example symmetries and constraints of the system) these equations are much easier to solve than Newton's equations in Cartesian coordinates.

References

{{Reflist Elementary algebra Mathematical physics