HOME

TheInfoList



OR:

In mathematics (including combinatorics,
linear algebra Linear algebra is the branch of mathematics concerning linear equations such as: :a_1x_1+\cdots +a_nx_n=b, linear maps such as: :(x_1, \ldots, x_n) \mapsto a_1x_1+\cdots +a_nx_n, and their representations in vector spaces and through matrices. ...
, and
dynamical systems In mathematics, a dynamical system is a system in which a function describes the time dependence of a point in an ambient space. Examples include the mathematical models that describe the swinging of a clock pendulum, the flow of water in ...
), a linear recurrence with constant coefficients (also known as a linear recurrence relation or linear difference equation) sets equal to 0 a
polynomial In mathematics, a polynomial is an expression (mathematics), expression consisting of indeterminate (variable), indeterminates (also called variable (mathematics), variables) and coefficients, that involves only the operations of addition, subtrac ...
that is linear in the various iterates of a variable—that is, in the values of the elements of a
sequence In mathematics, a sequence is an enumerated collection of objects in which repetitions are allowed and order matters. Like a set, it contains members (also called ''elements'', or ''terms''). The number of elements (possibly infinite) is calle ...
. The polynomial's linearity means that each of its terms has degree 0 or 1. A linear recurrence denotes the evolution of some variable over time, with the current
time period The categorisation of the past into discrete, quantified named blocks of time is called periodization.Adam Rabinowitz. And kingIt’s about time: historical periodization and Linked Ancient World Data'. Study of the Ancient universe Papers, 2014 ...
or discrete moment in time denoted as , one period earlier denoted as , one period later as , etc. The '' solution'' of such an equation is a function of , and not of any iterate values, giving the value of the iterate at any time. To find the solution it is necessary to know the specific values (known as ''
initial condition In mathematics and particularly in dynamic systems, an initial condition, in some contexts called a seed value, is a value of an evolving variable at some point in time designated as the initial time (typically denoted ''t'' = 0). Fo ...
s'') of of the iterates, and normally these are the iterates that are oldest. The equation or its variable is said to be ''
stable A stable is a building in which livestock, especially horses, are kept. It most commonly means a building that is divided into separate stalls for individual animals and livestock. There are many different types of stables in use today; the ...
'' if from any set of initial conditions the variable's limit as time goes to infinity exists; this limit is called the ''
steady state In systems theory, a system or a process is in a steady state if the variables (called state variables) which define the behavior of the system or the process are unchanging in time. In continuous time, this means that for those properties ''p'' ...
''. Difference equations are used in a variety of contexts, such as in economics to model the evolution through time of variables such as gross domestic product, the
inflation rate In economics, inflation is an increase in the general price level of goods and services in an economy. When the general price level rises, each unit of currency buys fewer goods and services; consequently, inflation corresponds to a reduct ...
, the
exchange rate In finance, an exchange rate is the rate at which one currency will be exchanged for another currency. Currencies are most commonly national currencies, but may be sub-national as in the case of Hong Kong or supra-national as in the case of ...
, etc. They are used in modeling such
time series In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. E ...
because values of these variables are only measured at discrete intervals. In
econometric Econometrics is the application of statistical methods to economic data in order to give empirical content to economic relationships.M. Hashem Pesaran (1987). "Econometrics," '' The New Palgrave: A Dictionary of Economics'', v. 2, p. 8 p. 8� ...
applications, linear difference equations are modeled with stochastic terms in the form of autoregressive (AR) models and in models such as
vector autoregression Vector autoregression (VAR) is a statistical model used to capture the relationship between multiple quantities as they change over time. VAR is a type of stochastic process model. VAR models generalize the single-variable (univariate) autoregres ...
(VAR) and
autoregressive moving average In statistics, econometrics and signal processing, an autoregressive (AR) model is a representation of a type of random process; as such, it is used to describe certain time-varying processes in nature, economics, etc. The autoregressive model spe ...
(ARMA) models that combine AR with other features.


Definitions

A linear recurrence with constant coefficients is an equation of the following form, written in terms of
parameters A parameter (), generally, is any characteristic that can help in defining or classifying a particular system (meaning an event, project, object, situation, etc.). That is, a parameter is an element of a system that is useful, or critical, when ...
and : :y_t = a_1y_ + \cdots + a_ny_ + b, or equivalently as :y_= a_1y_ + \cdots + a_ny_t + b. The positive integer n is called the order of the recurrence and denotes the longest time lag between iterates. The equation is called homogeneous if and nonhomogeneous if . If the equation is homogeneous, the coefficients determine the
characteristic polynomial In linear algebra, the characteristic polynomial of a square matrix is a polynomial which is invariant under matrix similarity and has the eigenvalues as roots. It has the determinant and the trace of the matrix among its coefficients. The char ...
(also "auxiliary polynomial" or "companion polynomial") :p(\lambda)= \lambda^n - a_1\lambda^ - a_2\lambda^-\cdots-a_ whose roots play a crucial role in finding and understanding the sequences satisfying the recurrence.


Conversion to homogeneous form

If , the equation :y_t = a_1y_ + \cdots + a_ny_ + b is said to be ''nonhomogeneous''. To solve this equation it is convenient to convert it to homogeneous form, with no constant term. This is done by first finding the equation's ''steady state value''—a value such that, if successive iterates all had this value, so would all future values. This value is found by setting all values of equal to in the difference equation, and solving, thus obtaining :y^* = \frac assuming the denominator is not 0. If it is zero, the steady state does not exist. Given the steady state, the difference equation can be rewritten in terms of deviations of the iterates from the steady state, as :\left(y_t -y^*\right)= a_1\left(y_-y^*\right) + \cdots + a_n\left(y_-y^*\right) which has no constant term, and which can be written more succinctly as :x_t= a_1x_ + \cdots + a_nx_ where equals . This is the homogeneous form. If there is no steady state, the difference equation :y_t = a_1y_ + \cdots + a_ny_ + b can be combined with its equivalent form :y_= a_1y_+ \cdots + a_ny_+ b to obtain (by solving both for ) :y_t - a_1y_ - \cdots - a_ny_ = y_- a_1y_- \cdots - a_ny_ in which like terms can be combined to give a homogeneous equation of one order higher than the original.


Solution example for small orders

The roots of the characteristic polynomial play a crucial role in finding and understanding the sequences satisfying the recurrence. If there are d distinct roots r_1, r_2, \ldots, r_d, then each solution to the recurrence takes the form :a_n = k_1 r_1^n + k_2 r_2^n + \cdots + k_d r_d^n, where the coefficients k_i are determined in order to fit the initial conditions of the recurrence. When the same roots occur multiple times, the terms in this formula corresponding to the second and later occurrences of the same root are multiplied by increasing powers of n. For instance, if the characteristic polynomial can be factored as (x-r)^3, with the same root r occurring three times, then the solution would take the form :a_n = k_1 r^n + k_2 n r^n + k_3 n^2 r^n.


Order 1

For order 1, the recurrence :a_=r a_ has the solution a_n = r^n with a_0 = 1 and the most general solution is a_n = k r^n with a_0 = k. The characteristic polynomial equated to zero (the characteristic equation) is simply t - r = 0.


Order 2

Solutions to such recurrence relations of higher order are found by systematic means, often using the fact that a_n = r^n is a solution for the recurrence exactly when t = r is a root of the characteristic polynomial. This can be approached directly or using
generating function In mathematics, a generating function is a way of encoding an infinite sequence of numbers () by treating them as the coefficients of a formal power series. This series is called the generating function of the sequence. Unlike an ordinary serie ...
s (
formal power series In mathematics, a formal series is an infinite sum that is considered independently from any notion of convergence, and can be manipulated with the usual algebraic operations on series (addition, subtraction, multiplication, division, partial su ...
) or matrices. Consider, for example, a recurrence relation of the form :a_=Aa_+Ba_. When does it have a solution of the same general form as a_n = r^n? Substituting this guess (
ansatz In physics and mathematics, an ansatz (; , meaning: "initial placement of a tool at a work piece", plural Ansätze ; ) is an educated guess or an additional assumption made to help solve a problem, and which may later be verified to be part of th ...
) in the recurrence relation, we find that :r^=Ar^+Br^ must be true for all n > 1. Dividing through by r^, we get that all these equations reduce to the same thing: :r^2=Ar+B, :r^2-Ar-B=0, which is the characteristic equation of the recurrence relation. Solve for r to obtain the two roots \lambda_1, \lambda_2: these roots are known as the
characteristic root In linear algebra, an eigenvector () or characteristic vector of a linear transformation is a nonzero vector that changes at most by a scalar factor when that linear transformation is applied to it. The corresponding eigenvalue, often denoted ...
s or eigenvalues of the characteristic equation. Different solutions are obtained depending on the nature of the roots: If these roots are distinct, we have the general solution :a_n = C\lambda_1^n+D\lambda_2^n while if they are identical (when A^2 + 4 B = 0), we have :a_n = C\lambda^n+Dn\lambda^n This is the most general solution; the two constants C and D can be chosen based on two given initial conditions a_0 and a_1 to produce a specific solution. In the case of complex eigenvalues (which also gives rise to complex values for the solution parameters C and D), the use of complex numbers can be eliminated by rewriting the solution in trigonometric form. In this case we can write the eigenvalues as \lambda_1, \lambda_2 = \alpha \pm \beta i. Then it can be shown that :a_n = C\lambda_1^n+D\lambda_2^n can be rewritten as :a_n = 2 M^n \left( E \cos(\theta n) + F \sin(\theta n)\right) = 2 G M^n \cos(\theta n - \delta), where :\begin M = \sqrt & \cos (\theta) =\tfrac & \sin( \theta) = \tfrac \\ C,D = E \mp F i & & \\ G = \sqrt & \cos (\delta ) = \tfrac & \sin (\delta )= \tfrac \end Here E and F (or equivalently, G and \delta) are real constants which depend on the initial conditions. Using :\lambda_1+\lambda_2=2 \alpha = A, :\lambda_1 \cdot \lambda_2=\alpha^2+\beta^2=-B, one may simplify the solution given above as :a_n = (-B)^ \left( E \cos(\theta n) + F \sin(\theta n)\right), where a_1 and a_2 are the initial conditions and :\begin E &=\frac \\ F &=-i \frac \\ \theta &=\arccos \left (\frac \right ) \end In this way there is no need to solve for \lambda_1 and \lambda_2. In all cases—real distinct eigenvalues, real duplicated eigenvalues, and complex conjugate eigenvalues—the equation is
stable A stable is a building in which livestock, especially horses, are kept. It most commonly means a building that is divided into separate stalls for individual animals and livestock. There are many different types of stables in use today; the ...
(that is, the variable a converges to a fixed value pecifically, zero if and only if ''both'' eigenvalues are smaller than one in
absolute value In mathematics, the absolute value or modulus of a real number x, is the non-negative value without regard to its sign. Namely, , x, =x if is a positive number, and , x, =-x if x is negative (in which case negating x makes -x positive), an ...
. In this second-order case, this condition on the eigenvalues can be shownPapanicolaou, Vassilis, "On the asymptotic stability of a class of linear difference equations," ''Mathematics Magazine'' 69(1), February 1996, 34–43. to be equivalent to , A, < 1 - B < 2, which is equivalent to , B, < 1 and , A, < 1 - B.


General solution


Characteristic polynomial and roots

Solving the homogeneous equation :x_t= a_1x_ + \cdots + a_nx_ involves first solving its characteristic polynomial :\lambda^n = a_1\lambda^ + \cdots + a_\lambda^2+a_ \lambda + a_n for its characteristic roots . These roots can be solved for algebraically if , but not necessarily otherwise. If the solution is to be used numerically, all the roots of this characteristic equation can be found by
numerical methods Numerical analysis is the study of algorithms that use numerical approximation (as opposed to symbolic manipulations) for the problems of mathematical analysis (as distinguished from discrete mathematics). It is the study of numerical methods th ...
. However, for use in a theoretical context it may be that the only information required about the roots is whether any of them are greater than or equal to 1 in
absolute value In mathematics, the absolute value or modulus of a real number x, is the non-negative value without regard to its sign. Namely, , x, =x if is a positive number, and , x, =-x if x is negative (in which case negating x makes -x positive), an ...
. It may be that all the roots are
real Real may refer to: Currencies * Brazilian real (R$) * Central American Republic real * Mexican real * Portuguese real * Spanish real * Spanish colonial real Music Albums * ''Real'' (L'Arc-en-Ciel album) (2000) * ''Real'' (Bright album) (2010) ...
or instead there may be some that are complex numbers. In the latter case, all the complex roots come in
complex conjugate In mathematics, the complex conjugate of a complex number is the number with an equal real part and an imaginary part equal in magnitude but opposite in sign. That is, (if a and b are real, then) the complex conjugate of a + bi is equal to a - ...
pairs.


Solution with distinct characteristic roots

If all the characteristic roots are distinct, the solution of the homogeneous linear recurrence :x_t= a_1x_ + \cdots + a_nx_ can be written in terms of the characteristic roots as :x_t=c_1\lambda_1^t +\cdots + c_n\lambda_n^t where the coefficients can be found by invoking the initial conditions. Specifically, for each time period for which an iterate value is known, this value and its corresponding value of can be substituted into the solution equation to obtain a linear equation in the as-yet-unknown parameters; such equations, one for each initial condition, can be solved simultaneously for the parameter values. If all characteristic roots are real, then all the coefficient values will also be real; but with non-real complex roots, in general some of these coefficients will also be non-real.


Converting complex solution to trigonometric form

If there are complex roots, they come in conjugate pairs and so do the complex terms in the solution equation. If two of these complex terms are and , the roots can be written as :\lambda_j, \lambda_=\alpha \pm \beta i =M\left(\frac \pm \fraci\right) where is the
imaginary unit The imaginary unit or unit imaginary number () is a solution to the quadratic equation x^2+1=0. Although there is no real number with this property, can be used to extend the real numbers to what are called complex numbers, using addition and ...
and is the modulus of the roots: :M = \sqrt. Then the two complex terms in the solution equation can be written as :\begin c_j\lambda_j^t + c_\lambda_^t & = M^t\left(c_j\left(\frac + \fraci\right)^t + c_\left(\frac - \fraci\right)^t\right) \\ pt& = M^t\left(c_j\left(\cos\theta + i\sin\theta\right)^t + c_\left(\cos \theta - i\sin\theta\right)^t\right) \\ pt& = M^t\bigl(c_j\left(\cos\theta t + i\sin \theta t\right) + c_\left(\cos\theta t - i\sin\theta t\right) \bigr) \end where is the angle whose cosine is and whose sine is ; the last equality here made use of
de Moivre's formula In mathematics, de Moivre's formula (also known as de Moivre's theorem and de Moivre's identity) states that for any real number and integer it holds that :\big(\cos x + i \sin x\big)^n = \cos nx + i \sin nx, where is the imaginary unit (). ...
. Now the process of finding the coefficients and guarantees that they are also complex conjugates, which can be written as . Using this in the last equation gives this expression for the two complex terms in the solution equation: :2M^t\left(\gamma \cos\theta t - \delta \sin\theta t\right) which can also be written as :2\sqrtM^t \cos(\theta t + \psi) where is the angle whose cosine is and whose sine is .


Cyclicity

Depending on the initial conditions, even with all roots real the iterates can experience a transitory tendency to go above and below the steady state value. But true cyclicity involves a permanent tendency to fluctuate, and this occurs if there is at least one pair of complex conjugate characteristic roots. This can be seen in the trigonometric form of their contribution to the solution equation, involving and .


Solution with duplicate characteristic roots

In the second-order case, if the two roots are identical (), they can both be denoted as and a solution may be of the form :x_t = c_1 \lambda^t + c_2 t \lambda^t.


Solution by conversion to matrix form

An alternative solution method involves converting the th order difference equation to a first-order
matrix difference equation A matrix difference equation is a difference equation in which the value of a vector (or sometimes, a matrix) of variables at one point in time is related to its own value at one or more previous points in time, using matrices. The order of the eq ...
. This is accomplished by writing , , , and so on. Then the original single th-order equation :y_t = a_1y_ + a_2y_ + \cdots + a_ny_ + b can be replaced by the following first-order equations: : \begin w_ & = a_1w_ + a_2w_+\cdots + a_nw_ + b \\ w_ & = w_ \\ & \,\,\,\vdots \\ w_ & =w_. \end Defining the vector as :\mathbf_i = \beginw_ \\ w_ \\ \vdots \\ w_ \end this can be put in matrix form as :\mathbf_t = \mathbf\mathbf_+\mathbf Here is an matrix in which the first row contains and all other rows have a single 1 with all other elements being 0, and is a column vector with first element and with the rest of its elements being 0. This matrix equation can be solved using the methods in the article
Matrix difference equation A matrix difference equation is a difference equation in which the value of a vector (or sometimes, a matrix) of variables at one point in time is related to its own value at one or more previous points in time, using matrices. The order of the eq ...
.


Solution using generating functions

The recurrence :y_t = a_1y_ + \cdots + a_ny_ + b, can be solved using the theory of
generating function In mathematics, a generating function is a way of encoding an infinite sequence of numbers () by treating them as the coefficients of a formal power series. This series is called the generating function of the sequence. Unlike an ordinary serie ...
s. First, we write Y(x) = \sum_ y_t x^t. The recurrence is then equivalent to the following generating function equation: :Y(x) = a_1xY(x) + a_2x^2Y(x) + \cdots + a_nx^nY(x) + \frac + p(x) where p(x) is a polynomial of degree at most n-1 correcting the initial terms. From this equation we can solve to get :Y(x) = \left(\frac + p(x)\right) \cdot \frac. In other words, not worrying about the exact coefficients, Y(x) can be expressed as a
rational function In mathematics, a rational function is any function that can be defined by a rational fraction, which is an algebraic fraction such that both the numerator and the denominator are polynomials. The coefficients of the polynomials need not be ...
Y(x) = \frac. The closed form can then be derived via
partial fraction decomposition In algebra, the partial fraction decomposition or partial fraction expansion of a rational fraction (that is, a fraction such that the numerator and the denominator are both polynomials) is an operation that consists of expressing the fraction as ...
. Specifically, if the generating function is written as :\frac=\sum_i \frac then the polynomial p(x) determines the initial set of corrections z(n), the denominator (x - r_i)^m determines the exponential term r_i^n, and the degree m together with the numerator f_i(x) determine the polynomial coefficient k_i(n).


Relation to solution to differential equations

The method for solving linear
differential equations In mathematics, a differential equation is an equation that relates one or more unknown functions and their derivatives. In applications, the functions generally represent physical quantities, the derivatives represent their rates of change, an ...
is similar to the method above—the "intelligent guess" (
ansatz In physics and mathematics, an ansatz (; , meaning: "initial placement of a tool at a work piece", plural Ansätze ; ) is an educated guess or an additional assumption made to help solve a problem, and which may later be verified to be part of th ...
) for linear differential equations with constant coefficients is e^ where \lambda is a complex number that is determined by substituting the guess into the differential equation. This is not a coincidence. Considering the Taylor series of the solution to a linear differential equation: :\sum_^\infin \frac (x-a)^n it can be seen that the coefficients of the series are given by the n-th derivative of f(x) evaluated at the point a. The differential equation provides a linear difference equation relating these coefficients. This equivalence can be used to quickly solve for the recurrence relationship for the coefficients in the power series solution of a linear differential equation. The rule of thumb (for equations in which the polynomial multiplying the first term is non-zero at zero) is that: :y^ \to f +k/math> and more generally :x^m*y^ \to n(n-1)...(n-m+1)f +k-m/math> Example: The recurrence relationship for the Taylor series coefficients of the equation: : (x^2 + 3x -4)y^ -(3x+1)y^ + 2y = 0 is given by : n(n-1)f +1+ 3nf +2-4f +3-3nf +1-f +2 2f = 0 or :-4f +3+2nf +2+ n(n-4)f +1+2f = 0. This example shows how problems generally solved using the power series solution method taught in normal differential equation classes can be solved in a much easier way. Example: The differential equation :ay'' + by' +cy = 0 has solution : y=e^. The conversion of the differential equation to a difference equation of the Taylor coefficients is :af + 2+ bf + 1+ cf = 0. It is easy to see that the n-th derivative of e^ evaluated at 0 is a^n.


Solving with z-transforms

Certain difference equations - in particular, linear constant coefficient difference equations - can be solved using
z-transform In mathematics and signal processing, the Z-transform converts a discrete-time signal, which is a sequence of real or complex numbers, into a complex frequency-domain (z-domain or z-plane) representation. It can be considered as a discrete-tim ...
s. The ''z''-transforms are a class of
integral transform In mathematics, an integral transform maps a function from its original function space into another function space via integration, where some of the properties of the original function might be more easily characterized and manipulated than in ...
s that lead to more convenient algebraic manipulations and more straightforward solutions. There are cases in which obtaining a direct solution would be all but impossible, yet solving the problem via a thoughtfully chosen integral transform is straightforward.


Stability

In the solution equation :x_t = c_1\lambda_1^t +\cdots + c_n\lambda_n^t, a term with real characteristic roots converges to 0 as grows indefinitely large if the absolute value of the characteristic root is less than 1. If the absolute value equals 1, the term will stay constant as grows if the root is +1 but will fluctuate between two values if the root is −1. If the absolute value of the root is greater than 1 the term will become larger and larger over time. A pair of terms with complex conjugate characteristic roots will converge to 0 with dampening fluctuations if the absolute value of the modulus of the roots is less than 1; if the modulus equals 1 then constant amplitude fluctuations in the combined terms will persist; and if the modulus is greater than 1, the combined terms will show fluctuations of ever-increasing magnitude. Thus the evolving variable will converge to 0 if all of the characteristic roots have magnitude less than 1. If the largest root has absolute value 1, neither convergence to 0 nor divergence to infinity will occur. If all roots with magnitude 1 are real and positive, will converge to the sum of their constant terms ; unlike in the stable case, this converged value depends on the initial conditions; different starting points lead to different points in the long run. If any root is −1, its term will contribute permanent fluctuations between two values. If any of the unit-magnitude roots are complex then constant-amplitude fluctuations of will persist. Finally, if any characteristic root has magnitude greater than 1, then will diverge to infinity as time goes to infinity, or will fluctuate between increasingly large positive and negative values. A theorem of
Issai Schur Issai Schur (10 January 1875 – 10 January 1941) was a Russian mathematician who worked in Germany for most of his life. He studied at the University of Berlin. He obtained his doctorate in 1901, became lecturer in 1903 and, after a stay at the ...
states that all roots have magnitude less than 1 (the stable case) if and only if a particular string of
determinant In mathematics, the determinant is a scalar value that is a function of the entries of a square matrix. It characterizes some properties of the matrix and the linear map represented by the matrix. In particular, the determinant is nonzero if an ...
s are all positive. If a non-homogeneous linear difference equation has been converted to homogeneous form which has been analyzed as above, then the stability and cyclicality properties of the original non-homogeneous equation will be the same as those of the derived homogeneous form, with convergence in the stable case being to the steady-state value instead of to 0.


See also

*
Recurrence relation In mathematics, a recurrence relation is an equation according to which the nth term of a sequence of numbers is equal to some combination of the previous terms. Often, only k previous terms of the sequence appear in the equation, for a parameter ...
*
Linear differential equation In mathematics, a linear differential equation is a differential equation that is defined by a linear polynomial in the unknown function and its derivatives, that is an equation of the form :a_0(x)y + a_1(x)y' + a_2(x)y'' \cdots + a_n(x)y^ = b ...
*
Skolem–Mahler–Lech theorem In additive and algebraic number theory, the Skolem–Mahler–Lech theorem states that if a sequence of numbers satisfies a linear difference equation, then with finitely many exceptions the positions at which the sequence is zero form a regular ...
*
Skolem problem In mathematics, the Skolem problem is the problem of determining whether the values of a constant-recursive sequence include the number zero. The problem can be formulated for recurrences over different types of numbers, including integers, ratio ...


References

{{reflist Combinatorics Dynamical systems Linear algebra Recurrence relations