Taylor's theorem
   HOME

TheInfoList



OR:

In
calculus Calculus is the mathematics, mathematical study of continuous change, in the same way that geometry is the study of shape, and algebra is the study of generalizations of arithmetic operations. Originally called infinitesimal calculus or "the ...
, Taylor's theorem gives an approximation of a k-times
differentiable function In mathematics, a differentiable function of one real variable is a function whose derivative exists at each point in its domain. In other words, the graph of a differentiable function has a non- vertical tangent line at each interior point in ...
around a given point by a
polynomial In mathematics, a polynomial is a Expression (mathematics), mathematical expression consisting of indeterminate (variable), indeterminates (also called variable (mathematics), variables) and coefficients, that involves only the operations of addit ...
of degree k, called the k-th-order Taylor polynomial. For a
smooth function In mathematical analysis, the smoothness of a function is a property measured by the number of continuous derivatives (''differentiability class)'' it has over its domain. A function of class C^k is a function of smoothness at least ; t ...
, the Taylor polynomial is the truncation at the order ''k'' of the
Taylor series In mathematics, the Taylor series or Taylor expansion of a function is an infinite sum of terms that are expressed in terms of the function's derivatives at a single point. For most common functions, the function and the sum of its Taylor ser ...
of the function. The first-order Taylor polynomial is the linear approximation of the function, and the second-order Taylor polynomial is often referred to as the quadratic approximation. There are several versions of Taylor's theorem, some giving explicit estimates of the approximation error of the function by its Taylor polynomial. Taylor's theorem is named after the mathematician Brook Taylor, who stated a version of it in 1715, although an earlier version of the result was already mentioned in
1671 Events January–March * January 1 – The Criminal Ordinance of 1670, the first attempt at a uniform code of criminal procedure in France, goes into effect after having been passed on August 26, 1670. * January 5 – The ...
by James Gregory. Taylor's theorem is taught in introductory-level calculus courses and is one of the central elementary tools in
mathematical analysis Analysis is the branch of mathematics dealing with continuous functions, limit (mathematics), limits, and related theories, such as Derivative, differentiation, Integral, integration, measure (mathematics), measure, infinite sequences, series ( ...
. It gives simple arithmetic formulas to accurately compute values of many
transcendental function In mathematics, a transcendental function is an analytic function that does not satisfy a polynomial equation whose coefficients are functions of the independent variable that can be written using only the basic operations of addition, subtraction ...
s such as the exponential function and
trigonometric function In mathematics, the trigonometric functions (also called circular functions, angle functions or goniometric functions) are real functions which relate an angle of a right-angled triangle to ratios of two side lengths. They are widely used in all ...
s. It is the starting point of the study of
analytic function In mathematics, an analytic function is a function that is locally given by a convergent power series. There exist both real analytic functions and complex analytic functions. Functions of each type are infinitely differentiable, but complex ...
s, and is fundamental in various areas of mathematics, as well as in
numerical analysis Numerical analysis is the study of algorithms that use numerical approximation (as opposed to symbolic computation, symbolic manipulations) for the problems of mathematical analysis (as distinguished from discrete mathematics). It is the study of ...
and
mathematical physics Mathematical physics is the development of mathematics, mathematical methods for application to problems in physics. The ''Journal of Mathematical Physics'' defines the field as "the application of mathematics to problems in physics and the de ...
. Taylor's theorem also generalizes to multivariate and vector valued functions. It provided the mathematical basis for some landmark early computing machines:
Charles Babbage Charles Babbage (; 26 December 1791 – 18 October 1871) was an English polymath. A mathematician, philosopher, inventor and mechanical engineer, Babbage originated the concept of a digital programmable computer. Babbage is considered ...
's
difference engine A difference engine is an automatic mechanical calculator designed to tabulate polynomial functions. It was designed in the 1820s, and was created by Charles Babbage. The name ''difference engine'' is derived from the method of finite differen ...
calculated sines, cosines, logarithms, and other transcendental functions by numerically integrating the first 7 terms of their Taylor series.


Motivation

If a real-valued function f(x) is
differentiable In mathematics, a differentiable function of one real variable is a function whose derivative exists at each point in its domain. In other words, the graph of a differentiable function has a non- vertical tangent line at each interior point in ...
at the point x=a, then it has a linear approximation near this point. This means that there exists a function ''h''1(''x'') such that f(x) = f(a) + f'(a)(x - a) + h_1(x)(x - a), \quad \lim_ h_1(x) = 0. Here P_1(x) = f(a) + f'(a)(x - a) is the linear approximation of f(x) for ''x'' near the point ''a'', whose graph y=P_1(x) is the
tangent line In geometry, the tangent line (or simply tangent) to a plane curve at a given point is, intuitively, the straight line that "just touches" the curve at that point. Leibniz defined it as the line through a pair of infinitely close points o ...
to the graph y=f(x) at . The error in the approximation is: R_1(x) = f(x) - P_1(x) = h_1(x)(x - a). As ''x'' tends to ''a,'' this error goes to zero much faster than (x-a), making f(x)\approx P_1(x) a useful approximation. For a better approximation to f(x), we can fit a
quadratic polynomial In mathematics, a quadratic function of a single variable is a function of the form :f(x)=ax^2+bx+c,\quad a \ne 0, where is its variable, and , , and are coefficients. The expression , especially when treated as an object in itself rather tha ...
instead of a linear function: P_2(x) = f(a) + f'(a)(x - a) + \frac(x - a)^2. Instead of just matching one derivative of f(x) at x=a, this polynomial has the same first and second derivatives, as is evident upon differentiation. Taylor's theorem ensures that the ''quadratic approximation'' is, in a sufficiently small neighborhood of x=a, more accurate than the linear approximation. Specifically, f(x) = P_2(x) + h_2(x)(x - a)^2, \quad \lim_ h_2(x) = 0. Here the error in the approximation is R_2(x) = f(x) - P_2(x) = h_2(x)(x - a)^2, which, given the limiting behavior of h_2, goes to zero faster than (x - a)^2 as ''x'' tends to ''a''. Similarly, we might get still better approximations to ''f'' if we use
polynomial In mathematics, a polynomial is a Expression (mathematics), mathematical expression consisting of indeterminate (variable), indeterminates (also called variable (mathematics), variables) and coefficients, that involves only the operations of addit ...
s of higher degree, since then we can match even more derivatives with ''f'' at the selected base point. In general, the error in approximating a function by a polynomial of degree ''k'' will go to zero much faster than (x-a)^k as ''x'' tends to ''a''. However, there are functions, even infinitely differentiable ones, for which increasing the degree of the approximating polynomial does not increase the accuracy of approximation: we say such a function fails to be analytic at ''x = a'': it is not (locally) determined by its derivatives at this point. Taylor's theorem is of asymptotic nature: it only tells us that the error R_k in an
approximation An approximation is anything that is intentionally similar but not exactly equal to something else. Etymology and usage The word ''approximation'' is derived from Latin ''approximatus'', from ''proximus'' meaning ''very near'' and the prefix ...
by a k-th order Taylor polynomial ''Pk'' tends to zero faster than any nonzero k-th degree
polynomial In mathematics, a polynomial is a Expression (mathematics), mathematical expression consisting of indeterminate (variable), indeterminates (also called variable (mathematics), variables) and coefficients, that involves only the operations of addit ...
as x \to a. It does not tell us how large the error is in any concrete
neighborhood A neighbourhood (Commonwealth English) or neighborhood (American English) is a geographically localized community within a larger town, city, suburb or rural area, sometimes consisting of a single street and the buildings lining it. Neigh ...
of the center of expansion, but for this purpose there are explicit formulas for the remainder term (given below) which are valid under some additional regularity assumptions on ''f''. These enhanced versions of Taylor's theorem typically lead to uniform estimates for the approximation error in a small neighborhood of the center of expansion, but the estimates do not necessarily hold for neighborhoods which are too large, even if the function ''f'' is analytic. In that situation one may have to select several Taylor polynomials with different centers of expansion to have reliable Taylor-approximations of the original function (see animation on the right.) There are several ways we might use the remainder term: # Estimate the error for a polynomial ''Pk''(''x'') of degree ''k'' estimating f(x) on a given interval (''a'' – ''r'', ''a'' + ''r''). (Given the interval and degree, we find the error.) # Find the smallest degree ''k'' for which the polynomial ''Pk''(''x'') approximates f(x) to within a given error tolerance on a given interval (''a'' − ''r'', ''a'' + ''r'') . (Given the interval and error tolerance, we find the degree.) # Find the largest interval (''a'' − ''r'', ''a'' + ''r'') on which ''Pk''(''x'') approximates f(x) to within a given error tolerance. (Given the degree and error tolerance, we find the interval.)


Taylor's theorem in one real variable


Statement of the theorem

The precise statement of the most basic version of Taylor's theorem is as follows: The polynomial appearing in Taylor's theorem is the \boldsymbol-th order Taylor polynomial P_k(x) = f(a) + f'(a)(x-a) + \frac(x-a)^2 + \cdots + \frac(x-a)^k of the function ''f'' at the point ''a''. The Taylor polynomial is the unique "asymptotic best fit" polynomial in the sense that if there exists a function and a k-th order polynomial ''p'' such that f(x) = p(x) + h_k(x)(x-a)^k, \quad \lim_ h_k(x) = 0 , then ''p'' = ''Pk''. Taylor's theorem describes the asymptotic behavior of the remainder term R_k(x) = f(x) - P_k(x), which is the
approximation error The approximation error in a given data value represents the significant discrepancy that arises when an exact, true value is compared against some approximation derived for it. This inherent error in approximation can be quantified and express ...
when approximating ''f'' with its Taylor polynomial. Using the
little-o notation Big ''O'' notation is a mathematical notation that describes the limiting behavior of a function when the argument tends towards a particular value or infinity. Big O is a member of a family of notations invented by German mathematicians Pau ...
, the statement in Taylor's theorem reads as R_k(x) = o(, x-a, ^), \quad x\to a.


Explicit formulas for the remainder

Under stronger regularity assumptions on ''f'' there are several precise formulas for the remainder term ''Rk'' of the Taylor polynomial, the most common ones being the following. These refinements of Taylor's theorem are usually proved using the
mean value theorem In mathematics, the mean value theorem (or Lagrange's mean value theorem) states, roughly, that for a given planar arc (geometry), arc between two endpoints, there is at least one point at which the tangent to the arc is parallel to the secant lin ...
, whence the name. Additionally, notice that this is precisely the
mean value theorem In mathematics, the mean value theorem (or Lagrange's mean value theorem) states, roughly, that for a given planar arc (geometry), arc between two endpoints, there is at least one point at which the tangent to the arc is parallel to the secant lin ...
when k=0. Also other similar expressions can be found. For example, if ''G''(''t'') is continuous on the closed interval and differentiable with a non-vanishing derivative on the open interval between a and x, then R_k(x) = \frac(x-\xi)^k \frac for some number \xi between a and x. This version covers the Lagrange and Cauchy forms of the remainder as special cases, and is proved below using Cauchy's mean value theorem. The Lagrange form is obtained by taking G(t)=(x-t)^ and the Cauchy form is obtained by taking G(t)=t-a. The statement for the integral form of the remainder is more advanced than the previous ones, and requires understanding of Lebesgue integration theory for the full generality. However, it holds also in the sense of
Riemann integral In the branch of mathematics known as real analysis, the Riemann integral, created by Bernhard Riemann, was the first rigorous definition of the integral of a function on an interval. It was presented to the faculty at the University of Gö ...
provided the (''k'' + 1)th derivative of ''f'' is continuous on the closed interval 'a'',''x'' Due to the
absolute continuity In calculus and real analysis, absolute continuity is a smoothness property of functions that is stronger than continuity and uniform continuity. The notion of absolute continuity allows one to obtain generalizations of the relationship between ...
of ''f'' on the
closed interval In mathematics, a real interval is the set of all real numbers lying between two fixed endpoints with no "gaps". Each endpoint is either a real number or positive or negative infinity, indicating the interval extends without a bound. A real in ...
between a and x, its derivative ''f'' exists as an ''L''-function, and the result can be proven by a formal calculation using the
fundamental theorem of calculus The fundamental theorem of calculus is a theorem that links the concept of derivative, differentiating a function (mathematics), function (calculating its slopes, or rate of change at every point on its domain) with the concept of integral, inte ...
and
integration by parts In calculus, and more generally in mathematical analysis, integration by parts or partial integration is a process that finds the integral of a product of functions in terms of the integral of the product of their derivative and antiderivati ...
.


Estimates for the remainder

It is often useful in practice to be able to estimate the remainder term appearing in the Taylor approximation, rather than having an exact formula for it. Suppose that ''f'' is -times continuously differentiable in an interval ''I'' containing ''a''. Suppose that there are real constants ''q'' and ''Q'' such that q\le f^(x)\le Q throughout ''I''. Then the remainder term satisfies the inequality q\frac\le R_k(x)\le Q\frac, if , and a similar estimate if . This is a simple consequence of the Lagrange form of the remainder. In particular, if , f^(x), \le M on an interval with some r > 0 , then , R_k(x), \le M\frac\le M\frac for all The second inequality is called a uniform estimate, because it holds uniformly for all ''x'' on the interval


Example

Suppose that we wish to find the approximate value of the function f(x)=e^x on the interval 1,1/math> while ensuring that the error in the approximation is no more than 10−5. In this example we pretend that we only know the following properties of the exponential function: From these properties it follows that f^(x)=e^x for all k, and in particular, f^(0)=1. Hence the ''k''-th order Taylor polynomial of f at 0 and its remainder term in the Lagrange form are given by P_k(x) = 1+x+\frac+\cdots+\frac, \qquad R_k(x)=\fracx^, where \xi is some number between 0 and ''x''. Since ''e''''x'' is increasing by (), we can simply use e^x \leq 1 for x \in 1,0/math> to estimate the remainder on the subinterval 1,0/math>. To obtain an upper bound for the remainder on ,1/math>, we use the property e^\xi for 0<\xi to estimate e^x = 1 + x + \fracx^2 < 1 + x + \fracx^2, \qquad 0 < x\leq 1 using the second order Taylor expansion. Then we solve for ''ex'' to deduce that e^x \leq \frac = 2\frac \leq 4, \qquad 0 \leq x\leq 1 simply by maximizing the
numerator A fraction (from , "broken") represents a part of a whole or, more generally, any number of equal parts. When spoken in everyday English, a fraction describes how many parts of a certain size there are, for example, one-half, eight-fifths, thre ...
and minimizing the
denominator A fraction (from , "broken") represents a part of a whole or, more generally, any number of equal parts. When spoken in everyday English, a fraction describes how many parts of a certain size there are, for example, one-half, eight-fifths, thre ...
. Combining these estimates for ''ex'' we see that , R_k(x), \leq \frac \leq \frac, \qquad -1\leq x \leq 1, so the required precision is certainly reached, when \frac < 10^ \quad \Longleftrightarrow \quad 4\cdot 10^5 < (k+1)! \quad \Longleftrightarrow \quad k \geq 9. (See
factorial In mathematics, the factorial of a non-negative denoted is the Product (mathematics), product of all positive integers less than or equal The factorial also equals the product of n with the next smaller factorial: \begin n! &= n \times ...
or compute by hand the values 9! =362880 and 10! =3628800.) As a conclusion, Taylor's theorem leads to the approximation e^x = 1+x+\frac + \cdots + \frac + R_9(x), \qquad , R_9(x), < 10^, \qquad -1\leq x \leq 1. For instance, this approximation provides a decimal expression e \approx 2.71828, correct up to five decimal places.


Relationship to analyticity


Taylor expansions of real analytic functions

Let ''I'' ⊂ R be an
open interval In mathematics, a real interval is the set (mathematics), set of all real numbers lying between two fixed endpoints with no "gaps". Each endpoint is either a real number or positive or negative infinity, indicating the interval extends without ...
. By definition, a function ''f'' : ''I'' → R is
real analytic In mathematics, an analytic function is a function (mathematics), function that is locally given by a convergent series, convergent power series. There exist both real analytic functions and complex analytic functions. Functions of each type are ...
if it is locally defined by a convergent
power series In mathematics, a power series (in one variable) is an infinite series of the form \sum_^\infty a_n \left(x - c\right)^n = a_0 + a_1 (x - c) + a_2 (x - c)^2 + \dots where ''a_n'' represents the coefficient of the ''n''th term and ''c'' is a co ...
. This means that for every ''a'' ∈ ''I'' there exists some ''r'' > 0 and a sequence of coefficients ''ck'' ∈ R such that and f(x) = \sum_^\infty c_k(x-a)^k = c_0 + c_1(x-a) + c_2(x-a)^2 + \cdots, \qquad , x-a, In general, the
radius of convergence In mathematics, the radius of convergence of a power series is the radius of the largest Disk (mathematics), disk at the Power series, center of the series in which the series Convergent series, converges. It is either a non-negative real number o ...
of a power series can be computed from the Cauchy–Hadamard formula \frac = \limsup_, c_k, ^\frac. This result is based on comparison with a
geometric series In mathematics, a geometric series is a series (mathematics), series summing the terms of an infinite geometric sequence, in which the ratio of consecutive terms is constant. For example, 1/2 + 1/4 + 1/8 + 1/16 + ⋯, the series \tfrac12 + \tfrac1 ...
, and the same method shows that if the power series based on ''a'' converges for some ''b'' ∈ R, it must converge uniformly on the
closed interval In mathematics, a real interval is the set of all real numbers lying between two fixed endpoints with no "gaps". Each endpoint is either a real number or positive or negative infinity, indicating the interval extends without a bound. A real in ...
-r_b,a+r_b/math>, where r_b=\left\vert b-a \right\vert. Here only the convergence of the power series is considered, and it might well be that extends beyond the domain ''I'' of ''f''. The Taylor polynomials of the real analytic function ''f'' at ''a'' are simply the finite truncations P_k(x) = \sum_^k c_j(x-a)^j, \qquad c_j = \frac of its locally defining power series, and the corresponding remainder terms are locally given by the analytic functions R_k(x) = \sum_^\infty c_j(x-a)^j = (x-a)^k h_k(x), \qquad , x-a, Here the functions \begin & h_k:(a-r,a+r)\to \R \\ ex& h_k(x) = (x-a)\sum_^\infty c_ \left(x - a\right)^j \end are also analytic, since their defining power series have the same radius of convergence as the original series. Assuming that ⊂ ''I'' and ''r'' < ''R'', all these series converge uniformly on . Naturally, in the case of analytic functions one can estimate the remainder term R_k(x) by the tail of the sequence of the derivatives ''f′''(''a'') at the center of the expansion, but using
complex analysis Complex analysis, traditionally known as the theory of functions of a complex variable, is the branch of mathematical analysis that investigates functions of complex numbers. It is helpful in many branches of mathematics, including algebraic ...
also another possibility arises, which is described
below Below may refer to: *Earth *Ground (disambiguation) *Soil *Floor * Bottom (disambiguation) *Less than *Temperatures below freezing *Hell or underworld People with the surname * Ernst von Below (1863–1955), German World War I general * Fred Belo ...
.


Taylor's theorem and convergence of Taylor series

The Taylor series of ''f'' will converge in some interval in which all its derivatives are bounded and do not grow too fast as ''k'' goes to infinity. (However, even if the Taylor series converges, it might not converge to ''f'', as explained below; ''f'' is then said to be non- analytic.) One might think of the Taylor series f(x) \approx \sum_^\infty c_k(x-a)^k = c_0 + c_1(x-a) + c_2(x-a)^2 + \cdots of an infinitely many times differentiable function ''f'' : R → R as its "infinite order Taylor polynomial" at ''a''. Now the estimates for the remainder imply that if, for any ''r'', the derivatives of ''f'' are known to be bounded over (''a'' − ''r'', ''a'' + ''r''), then for any order ''k'' and for any ''r'' > 0 there exists a constant such that for every ''x'' ∈ (''a'' − ''r'',''a'' + ''r''). Sometimes the constants can be chosen in such way that is bounded above, for fixed ''r'' and all ''k''. Then the Taylor series of ''f'' converges uniformly to some analytic function \begin & T_f:(a-r,a+r)\to\R \\ & T_f(x) = \sum_^\infty \frac \left(x-a\right)^k \end (One also gets convergence even if is not bounded above as long as it grows slowly enough.) The limit function is by definition always analytic, but it is not necessarily equal to the original function ''f'', even if ''f'' is infinitely differentiable. In this case, we say ''f'' is a
non-analytic smooth function In mathematics, smooth functions (also called infinitely differentiable functions) and analytic functions are two very important types of functions. One can easily prove that any analytic function of a real argument is smooth. The converse is no ...
, for example a flat function: \begin & f:\R \to \R \\ & f(x) = \begin e^ & x>0 \\ 0 & x \leq 0 . \end \end Using the
chain rule In calculus, the chain rule is a formula that expresses the derivative of the Function composition, composition of two differentiable functions and in terms of the derivatives of and . More precisely, if h=f\circ g is the function such that h ...
repeatedly by
mathematical induction Mathematical induction is a method for mathematical proof, proving that a statement P(n) is true for every natural number n, that is, that the infinitely many cases P(0), P(1), P(2), P(3), \dots  all hold. This is done by first proving a ...
, one shows that for any order ''k'', f^(x) = \begin \frac\cdot e^ & x>0 \\ 0 & x \leq 0 \end for some polynomial ''pk'' of degree 2(''k'' − 1). The function e^ tends to zero faster than any polynomial as x \to 0, so ''f'' is infinitely many times differentiable and for every positive integer ''k''. The above results all hold in this case: * The Taylor series of ''f'' converges uniformly to the zero function ''Tf''(''x'') = 0, which is analytic with all coefficients equal to zero. * The function ''f'' is unequal to this Taylor series, and hence non-analytic. * For any order ''k'' ∈ N and radius ''r'' > 0 there exists ''Mk,r'' > 0 satisfying the remainder bound () above. However, as ''k'' increases for fixed ''r'', the value of ''Mk,r'' grows more quickly than ''rk'', and the error does not go to zero''.''


Taylor's theorem in complex analysis

Taylor's theorem generalizes to functions ''f'' : C → C which are
complex differentiable In mathematics, a holomorphic function is a complex-valued function of one or Function of several complex variables, more complex number, complex variables that is Differentiable function#Differentiability in complex analysis, complex differ ...
in an open subset ''U'' ⊂ C of the
complex plane In mathematics, the complex plane is the plane (geometry), plane formed by the complex numbers, with a Cartesian coordinate system such that the horizontal -axis, called the real axis, is formed by the real numbers, and the vertical -axis, call ...
. However, its usefulness is dwarfed by other general theorems in
complex analysis Complex analysis, traditionally known as the theory of functions of a complex variable, is the branch of mathematical analysis that investigates functions of complex numbers. It is helpful in many branches of mathematics, including algebraic ...
. Namely, stronger versions of related results can be deduced for
complex differentiable In mathematics, a holomorphic function is a complex-valued function of one or Function of several complex variables, more complex number, complex variables that is Differentiable function#Differentiability in complex analysis, complex differ ...
functions ''f'' : ''U'' → C using
Cauchy's integral formula In mathematics, Cauchy's integral formula, named after Augustin-Louis Cauchy, is a central statement in complex analysis. It expresses the fact that a holomorphic function defined on a disk is completely determined by its values on the boundary o ...
as follows. Let ''r'' > 0 such that the
closed disk In geometry, a disk ( also spelled disc) is the region in a plane bounded by a circle. A disk is said to be ''closed'' if it contains the circle that constitutes its boundary, and ''open'' if it does not. For a radius r, an open disk is usua ...
''B''(''z'', ''r'') ∪ ''S''(''z'', ''r'') is contained in ''U''. Then Cauchy's integral formula with a positive parametrization of the circle ''S''(''z'', ''r'') with t \in ,2 \pi/math> gives f(z) = \frac\int_\gamma \frac\,dw, \quad f'(z) = \frac\int_\gamma \frac \, dw, \quad \ldots, \quad f^(z) = \frac\int_\gamma \frac \, dw. Here all the integrands are continuous on the
circle A circle is a shape consisting of all point (geometry), points in a plane (mathematics), plane that are at a given distance from a given point, the Centre (geometry), centre. The distance between any point of the circle and the centre is cal ...
''S''(''z'', ''r''), which justifies differentiation under the integral sign. In particular, if ''f'' is once
complex differentiable In mathematics, a holomorphic function is a complex-valued function of one or Function of several complex variables, more complex number, complex variables that is Differentiable function#Differentiability in complex analysis, complex differ ...
on the open set ''U'', then it is actually infinitely many times
complex differentiable In mathematics, a holomorphic function is a complex-valued function of one or Function of several complex variables, more complex number, complex variables that is Differentiable function#Differentiability in complex analysis, complex differ ...
on ''U''. One also obtains Cauchy's estimate , f^(z), \leq \frac\int_\gamma \frac \, dw = \frac, \quad M_r = \max_, f(w), for any ''z'' ∈ ''U'' and ''r'' > 0 such that ''B''(''z'', ''r'') ∪ ''S''(''c'', ''r'') ⊂ ''U''. The estimate implies that the
complex Complex commonly refers to: * Complexity, the behaviour of a system whose components interact in multiple ways so possible interactions are difficult to describe ** Complex system, a system composed of many components which may interact with each ...
Taylor series In mathematics, the Taylor series or Taylor expansion of a function is an infinite sum of terms that are expressed in terms of the function's derivatives at a single point. For most common functions, the function and the sum of its Taylor ser ...
T_f(z) = \sum_^\infty \frac(z-c)^k of ''f'' converges uniformly on any
open disk In geometry, a disk ( also spelled disc) is the region in a plane bounded by a circle. A disk is said to be ''closed'' if it contains the circle that constitutes its boundary, and ''open'' if it does not. For a radius r, an open disk is usua ...
B(c,r) \subset U with S(c,r) \subset U into some function ''Tf''. Furthermore, using the
contour integral In the mathematical field of complex analysis, contour integration is a method of evaluating certain integrals along paths in the complex plane. Contour integration is closely related to the calculus of residues, a method of complex analysis. ...
formulas for the derivatives ''f''(''c''), \begin T_f(z) &= \sum_^\infty \frac\int_\gamma \frac \, dw \\ &= \frac \int_\gamma \frac \sum_^\infty \left(\frac\right)^k \, dw \\ &= \frac \int_\gamma \frac\left( \frac \right) \, dw \\ &= \frac \int_\gamma \frac \, dw \\ &= f(z), \end so any
complex differentiable In mathematics, a holomorphic function is a complex-valued function of one or Function of several complex variables, more complex number, complex variables that is Differentiable function#Differentiability in complex analysis, complex differ ...
function ''f'' in an open set ''U'' ⊂ C is in fact complex analytic. All that is said for real analytic functions here holds also for complex analytic functions with the open interval ''I'' replaced by an open subset ''U'' ∈ C and ''a''-centered intervals (''a'' − ''r'', ''a'' + ''r'') replaced by ''c''-centered disks ''B''(''c'', ''r''). In particular, the Taylor expansion holds in the form f(z) = P_k(z) + R_k(z), \quad P_k(z) = \sum_^k \frac(z-c)^j, where the remainder term ''Rk'' is complex analytic. Methods of complex analysis provide some powerful results regarding Taylor expansions. For example, using Cauchy's integral formula for any positively oriented
Jordan curve In mathematics, a curve (also called a curved line in older texts) is an object similar to a line, but that does not have to be straight. Intuitively, a curve may be thought of as the trace left by a moving point. This is the definition that ...
\gamma which parametrizes the boundary \partial W \subset U of a region W \subset U, one obtains expressions for the derivatives as above, and modifying slightly the computation for , one arrives at the exact formula R_k(z) = \sum_^\infty \frac \int_\gamma \frac \, dw = \frac \int_\gamma \frac , \qquad z\in W. The important feature here is that the quality of the approximation by a Taylor polynomial on the region W \subset U is dominated by the values of the function ''f'' itself on the boundary \partial W \subset U. Similarly, applying Cauchy's estimates to the series expression for the remainder, one obtains the uniform estimates , R_k(z), \leq \sum_^\infty \frac = \frac \frac \leq \frac, \qquad \frac \leq \beta < 1.


Example

The function \begin & f : \R \to \R \\ & f(x) = \frac \end is
real analytic In mathematics, an analytic function is a function (mathematics), function that is locally given by a convergent series, convergent power series. There exist both real analytic functions and complex analytic functions. Functions of each type are ...
, that is, locally determined by its Taylor series. This function was plotted above to illustrate the fact that some elementary functions cannot be approximated by Taylor polynomials in neighborhoods of the center of expansion which are too large. This kind of behavior is easily understood in the framework of complex analysis. Namely, the function ''f'' extends into a
meromorphic function In the mathematical field of complex analysis, a meromorphic function on an open subset ''D'' of the complex plane is a function that is holomorphic on all of ''D'' ''except'' for a set of isolated points, which are ''poles'' of the function. ...
\begin & f:\Complex \cup \ \to \Complex \cup \ \\ & f(z) = \frac \end on the compactified complex plane. It has simple poles at z=i and z=-i, and it is analytic elsewhere. Now its Taylor series centered at ''z''0 converges on any disc ''B''(''z''0, ''r'') with ''r'' < , ''z'' − ''z''0, , where the same Taylor series converges at ''z'' ∈ C. Therefore, Taylor series of ''f'' centered at 0 converges on ''B''(0, 1) and it does not converge for any ''z'' ∈ C with , ''z'',  > 1 due to the poles at ''i'' and −''i''. For the same reason the Taylor series of ''f'' centered at 1 converges on B(1, \sqrt) and does not converge for any ''z'' ∈ C with \left\vert z-1 \right\vert>\sqrt.


Generalizations of Taylor's theorem


Higher-order differentiability

A function is
differentiable In mathematics, a differentiable function of one real variable is a function whose derivative exists at each point in its domain. In other words, the graph of a differentiable function has a non- vertical tangent line at each interior point in ...
at
if and only if In logic and related fields such as mathematics and philosophy, "if and only if" (often shortened as "iff") is paraphrased by the biconditional, a logical connective between statements. The biconditional is true in two cases, where either bo ...
there exists a
linear functional In mathematics, a linear form (also known as a linear functional, a one-form, or a covector) is a linear mapIn some texts the roles are reversed and vectors are defined as linear maps from covectors to scalars from a vector space to its field of ...
and a function such that f(\boldsymbol) = f(\boldsymbol) + L(\boldsymbol-\boldsymbol) + h(\boldsymbol)\lVert\boldsymbol-\boldsymbol\rVert, \qquad \lim_ h(\boldsymbol)=0. If this is the case, then L = df(\boldsymbol) is the (uniquely defined) differential of at the point . Furthermore, then the
partial derivatives In mathematics, a partial derivative of a function of several variables is its derivative with respect to one of those variables, with the others held constant (as opposed to the total derivative, in which all variables are allowed to vary). Par ...
of exist at and the differential of at is given by df( \boldsymbol )( \boldsymbol ) = \frac(\boldsymbol) v_1 + \cdots + \frac(\boldsymbol) v_n. Introduce the
multi-index notation Multi-index notation is a mathematical notation that simplifies formulas used in multivariable calculus, partial differential equations and the theory of distributions, by generalising the concept of an integer index to an ordered tuple of indices ...
, \alpha, = \alpha_1+\cdots+\alpha_n, \quad \alpha!=\alpha_1!\cdots\alpha_n!, \quad \boldsymbol^\alpha=x_1^\cdots x_n^ for and . If all the k-th order
partial derivatives In mathematics, a partial derivative of a function of several variables is its derivative with respect to one of those variables, with the others held constant (as opposed to the total derivative, in which all variables are allowed to vary). Par ...
of are continuous at , then by Clairaut's theorem, one can change the order of mixed derivatives at , so the short-hand notation D^\alpha f = \frac = \frac for the higher order
partial derivatives In mathematics, a partial derivative of a function of several variables is its derivative with respect to one of those variables, with the others held constant (as opposed to the total derivative, in which all variables are allowed to vary). Par ...
is justified in this situation. The same is true if all the ()-th order partial derivatives of exist in some neighborhood of and are differentiable at . Then we say that is times differentiable at the point .


Taylor's theorem for multivariate functions

Using notations of the preceding section, one has the following theorem. If the function is times
continuously differentiable In mathematics, a differentiable function of one Real number, real variable is a Function (mathematics), function whose derivative exists at each point in its Domain of a function, domain. In other words, the Graph of a function, graph of a differ ...
in a
closed ball In mathematics, a ball is the solid figure bounded by a ''sphere''; it is also called a solid sphere. It may be a closed ball (including the boundary points that constitute the sphere) or an open ball (excluding them). These concepts are defin ...
B = \ for some r > 0, then one can derive an exact formula for the remainder in terms of order
partial derivatives In mathematics, a partial derivative of a function of several variables is its derivative with respect to one of those variables, with the others held constant (as opposed to the total derivative, in which all variables are allowed to vary). Par ...
of ''f'' in this neighborhood. Namely, \begin & f( \boldsymbol ) = \sum_ \frac (\boldsymbol-\boldsymbol)^\alpha + \sum_ R_\beta(\boldsymbol)(\boldsymbol-\boldsymbol)^\beta, \\ & R_\beta( \boldsymbol ) = \frac \int_0^1 (1-t)^D^\beta f \big(\boldsymbol+t( \boldsymbol-\boldsymbol )\big) \, dt. \end In this case, due to the continuity of ()-th order
partial derivative In mathematics, a partial derivative of a function of several variables is its derivative with respect to one of those variables, with the others held constant (as opposed to the total derivative, in which all variables are allowed to vary). P ...
s in the
compact set In mathematics, specifically general topology, compactness is a property that seeks to generalize the notion of a closed and bounded subset of Euclidean space. The idea is that a compact space has no "punctures" or "missing endpoints", i.e., i ...
, one immediately obtains the uniform estimates \left, R_\beta(\boldsymbol)\ \leq \frac \max_ \max_ , D^\alpha f(\boldsymbol), , \qquad \boldsymbol\in B.


Example in two dimensions

For example, the third-order Taylor polynomial of a smooth function f:\mathbb R^2\to\mathbb R is, denoting \boldsymbol-\boldsymbol=\boldsymbol, \begin P_3(\boldsymbol) = f ( \boldsymbol ) + &\frac( \boldsymbol ) v_1 + \frac( \boldsymbol ) v_2 + \frac( \boldsymbol ) \frac + \frac( \boldsymbol ) v_1 v_2 + \frac( \boldsymbol ) \frac \\ & + \frac( \boldsymbol ) \frac + \frac( \boldsymbol ) \frac + \frac( \boldsymbol ) \frac + \frac( \boldsymbol ) \frac \end


Proofs


Proof for Taylor's theorem in one real variable

Let h_k(x) = \begin \frac & x\not=a\\ 0&x=a \end where, as in the statement of Taylor's theorem, P(x) = f(a) + f'(a)(x-a) + \frac(x-a)^2 + \cdots + \frac(x-a)^k. It is sufficient to show that \lim_ h_k(x) =0. The proof here is based on repeated application of
L'Hôpital's rule L'Hôpital's rule (, ), also known as Bernoulli's rule, is a mathematical theorem that allows evaluating limits of indeterminate forms using derivatives. Application (or repeated application) of the rule often converts an indeterminate form ...
. Note that, for each j=0,1,...,k-1, f^(a)=P^(a). Hence each of the first k-1 derivatives of the numerator in h_k(x) vanishes at x=a, and the same is true of the denominator. Also, since the condition that the function f be k times differentiable at a point requires differentiability up to order k-1 in a neighborhood of said point (this is true, because differentiability requires a function to be defined in a whole neighborhood of a point), the numerator and its k-2 derivatives are differentiable in a neighborhood of a. Clearly, the denominator also satisfies said condition, and additionally, doesn't vanish unless x=a, therefore all conditions necessary for L'Hôpital's rule are fulfilled, and its use is justified. So \begin \lim_ \frac &= \lim_ \frac \\ ex&= \cdots \\ ex&= \lim_ \frac\\ ex&= \frac\lim_ \frac\\ ex&=\frac(f^(a) - P^(a)) = 0 \end where the second-to-last equality follows by the definition of the derivative at x=a.


Alternate proof for Taylor's theorem in one real variable

Let f(x) be any real-valued continuous function to be approximated by the Taylor polynomial. Step 1: Let F and G be functions. Set F and G to be \begin F(x) = f(x) - \sum^_ \frac(x-a)^ \end \begin G(x) = (x-a)^ \end Step 2: Properties of F and G: \begin F(a) & = f(a) - f(a) - f'(a)(a - a) - ... - \frac(a-a)^ = 0 \\ G(a) & = (a-a)^n = 0 \end Similarly, \begin F'(a) = f'(a) - f'(a) - \frac(a-a)^ - ... - \frac(a-a)^ = 0 \end \begin G'(a) &= n(a-a)^ = 0\\ &\qquad \vdots\\ G^(a) &= F^(a) = 0 \end Step 3: Use Cauchy Mean Value Theorem Let f_ and g_ be continuous functions on , b/math>. Since a < x < b so we can work with the interval
, x The comma is a punctuation mark that appears in several variants in different languages. Some typefaces render it as a small line, slightly curved or straight, but inclined from the vertical; others give it the appearance of a miniature fille ...
/math>. Let f_ and g_ be differentiable on (a, x). Assume g_'(x) \neq 0 for all x \in (a, b). Then there exists c_ \in (a, x) such that \begin \frac = \frac \end Note: G'(x) \neq 0 in (a, b) and F(a), G(a) = 0 so \begin \frac = \frac = \frac \end for some c_ \in (a, x). This can also be performed for (a, c_): \begin \frac = \frac = \frac \end for some c_ \in (a, c_). This can be continued to c_. This gives a partition in (a, b): a < c_ < c_ < \dots < c_ < x with \frac = \frac = \dots = \frac . Set c = c_: \frac = \frac Step 4: Substitute back \begin \frac = \frac = \frac \end By the Power Rule, repeated derivatives of (x - a)^, G^(c) = n(n-1)...1, so: \frac = \frac = \frac. This leads to: \begin f(x) - \sum^_ \frac(x-a)^ = \frac(x-a)^ \end. By rearranging, we get: \begin f(x) = \sum^_ \frac(x-a)^ + \frac(x-a)^ \end, or because c_ = a eventually: f(x) = \sum^_ \frac(x-a)^.


Derivation for the mean value forms of the remainder

Let ''G'' be any real-valued function, continuous on the closed interval between a and x and differentiable with a non-vanishing derivative on the open interval between a and x, and define F(t) = f(t) + f'(t)(x-t) + \frac(x-t)^2 + \cdots + \frac(x-t)^k. For t \in ,x. Then, by Cauchy's mean value theorem, for some \xi on the open interval between a and x. Note that here the numerator F(x)-F(a)=R_k(x) is exactly the remainder of the Taylor polynomial for y=f(x). Compute \begin F'(t) = & f'(t) + \big(f''(t)(x-t) - f'(t)\big) + \left(\frac(x-t)^2 - \frac(x-t)\right) + \cdots \\ & \cdots + \left( \frac(x-t)^k - \frac(x-t)^\right) = \frac(x-t)^k, \end plug it into () and rearrange terms to find that R_k(x) = \frac(x-\xi)^k \frac. This is the form of the remainder term mentioned after the actual statement of Taylor's theorem with remainder in the mean value form. The Lagrange form of the remainder is found by choosing G(t) = (x-t)^ and the Cauchy form by choosing G(t) = t-a. Remark. Using this method one can also recover the integral form of the remainder by choosing G(t) = \int_a^t \frac (x-s)^k \, ds, but the requirements for ''f'' needed for the use of mean value theorem are too strong, if one aims to prove the claim in the case that ''f'' is only
absolutely continuous In calculus and real analysis, absolute continuity is a smoothness property of functions that is stronger than continuity and uniform continuity. The notion of absolute continuity allows one to obtain generalizations of the relationship betwe ...
. However, if one uses
Riemann integral In the branch of mathematics known as real analysis, the Riemann integral, created by Bernhard Riemann, was the first rigorous definition of the integral of a function on an interval. It was presented to the faculty at the University of Gö ...
instead of
Lebesgue integral In mathematics, the integral of a non-negative Function (mathematics), function of a single variable can be regarded, in the simplest case, as the area between the Graph of a function, graph of that function and the axis. The Lebesgue integral, ...
, the assumptions cannot be weakened.


Derivation for the integral form of the remainder

Due to the
absolute continuity In calculus and real analysis, absolute continuity is a smoothness property of functions that is stronger than continuity and uniform continuity. The notion of absolute continuity allows one to obtain generalizations of the relationship between ...
of f^ on the
closed interval In mathematics, a real interval is the set of all real numbers lying between two fixed endpoints with no "gaps". Each endpoint is either a real number or positive or negative infinity, indicating the interval extends without a bound. A real in ...
between a and x, its derivative f^ exists as an L^1-function, and we can use the
fundamental theorem of calculus The fundamental theorem of calculus is a theorem that links the concept of derivative, differentiating a function (mathematics), function (calculating its slopes, or rate of change at every point on its domain) with the concept of integral, inte ...
and
integration by parts In calculus, and more generally in mathematical analysis, integration by parts or partial integration is a process that finds the integral of a product of functions in terms of the integral of the product of their derivative and antiderivati ...
. This same proof applies for the
Riemann integral In the branch of mathematics known as real analysis, the Riemann integral, created by Bernhard Riemann, was the first rigorous definition of the integral of a function on an interval. It was presented to the faculty at the University of Gö ...
assuming that f^ is continuous on the closed interval and
differentiable In mathematics, a differentiable function of one real variable is a function whose derivative exists at each point in its domain. In other words, the graph of a differentiable function has a non- vertical tangent line at each interior point in ...
on the
open interval In mathematics, a real interval is the set (mathematics), set of all real numbers lying between two fixed endpoints with no "gaps". Each endpoint is either a real number or positive or negative infinity, indicating the interval extends without ...
between a and x, and this leads to the same result as using the mean value theorem. The
fundamental theorem of calculus The fundamental theorem of calculus is a theorem that links the concept of derivative, differentiating a function (mathematics), function (calculating its slopes, or rate of change at every point on its domain) with the concept of integral, inte ...
states that f(x)=f(a)+ \int_a^x \, f'(t) \, dt. Now we can integrate by parts and use the fundamental theorem of calculus again to see that \begin f(x) &= f(a)+\Big(xf'(x)-af'(a)\Big)-\int_a^x tf''(t) \, dt \\ &= f(a) + x\left(f'(a) + \int_a^x f''(t) \,dt \right) -af'(a)-\int_a^x tf''(t) \, dt \\ &= f(a)+(x-a)f'(a)+\int_a^x \, (x-t)f''(t) \, dt, \end which is exactly Taylor's theorem with remainder in the integral form in the case k=1. The general statement is proved using induction. Suppose that Integrating the remainder term by parts we arrive at \begin \int_a^x \frac (x - t)^k \, dt = & - \left \frac (x - t)^ \righta^x + \int_a^x \frac (x - t)^ \, dt \\ = & \ \frac (x - a)^ + \int_a^x \frac (x - t)^ \, dt. \end Substituting this into the formula shows that if it holds for the value k, it must also hold for the value k+1. Therefore, since it holds for k=1, it must hold for every positive integer k.


Derivation for the remainder of multivariate Taylor polynomials

We prove the special case, where f:\mathbb R^n\to\mathbb R has continuous partial derivatives up to the order k+1 in some closed ball B with center \boldsymbol. The strategy of the proof is to apply the one-variable case of Taylor's theorem to the restriction of f to the line segment adjoining \boldsymbol and \boldsymbol. Parametrize the line segment between \boldsymbol and \boldsymbol by \boldsymbol(t)=\boldsymbol+t(\boldsymbol-\boldsymbol) We apply the one-variable version of Taylor's theorem to the function g(t) = f(\boldsymbol(t)): f(\boldsymbol)=g(1)=g(0)+\sum_^k\fracg^(0)\ +\ \int_0^1 \frac g^(t)\, dt. Applying the
chain rule In calculus, the chain rule is a formula that expresses the derivative of the Function composition, composition of two differentiable functions and in terms of the derivatives of and . More precisely, if h=f\circ g is the function such that h ...
for several variables gives \begin g^(t)&=\fracf(\boldsymbol(t))\\ &= \frac f(\boldsymbol+t(\boldsymbol-\boldsymbol))\\ &= \sum_ \left(\begin j\\ \alpha\end \right) (D^\alpha f) (\boldsymbol+t(\boldsymbol-\boldsymbol)) (\boldsymbol-\boldsymbol)^\alpha \end where \tbinom j \alpha is the
multinomial coefficient In mathematics, the multinomial theorem describes how to expand a power of a sum in terms of powers of the terms in that sum. It is the generalization of the binomial theorem from binomials to multinomials. Theorem For any positive integer ...
. Since \tfrac\tbinom j \alpha=\tfrac, we get: f(\boldsymbol)= f(\boldsymbol) + \sum_\frac (D^\alpha f) (\boldsymbol)(\boldsymbol-\boldsymbol)^\alpha+\sum_\frac (\boldsymbol-\boldsymbol)^\alpha \int_0^1 (1-t)^k (D^\alpha f)(\boldsymbol+t(\boldsymbol-\boldsymbol))\,dt.


See also

* * * * * *


Footnotes


References

*. *. *. *. *. *. *. *. *. *.


External links


Taylor Series Approximation to Cosine
at
cut-the-knot Alexander Bogomolny (January 4, 1948 July 7, 2018) was a Soviet Union, Soviet-born Israeli Americans, Israeli-American mathematician. He was Professor Emeritus of Mathematics at the University of Iowa, and formerly research fellow at the Moscow ...

Trigonometric Taylor Expansion
interactive demonstrative applet

a
Holistic Numerical Methods Institute
{{authority control Articles containing proofs Theorems in calculus Theorems in real analysis Approximations