HOME

TheInfoList



OR:

In
mathematics Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...
(including
combinatorics Combinatorics is an area of mathematics primarily concerned with counting, both as a means and as an end to obtaining results, and certain properties of finite structures. It is closely related to many other areas of mathematics and has many ...
,
linear algebra Linear algebra is the branch of mathematics concerning linear equations such as :a_1x_1+\cdots +a_nx_n=b, linear maps such as :(x_1, \ldots, x_n) \mapsto a_1x_1+\cdots +a_nx_n, and their representations in vector spaces and through matrix (mathemat ...
, and
dynamical systems In mathematics, a dynamical system is a system in which a Function (mathematics), function describes the time dependence of a Point (geometry), point in an ambient space, such as in a parametric curve. Examples include the mathematical models ...
), a linear recurrence with constant coefficients (also known as a linear recurrence relation or linear difference equation) sets equal to 0 a
polynomial In mathematics, a polynomial is a Expression (mathematics), mathematical expression consisting of indeterminate (variable), indeterminates (also called variable (mathematics), variables) and coefficients, that involves only the operations of addit ...
that is linear in the various iterates of a variable—that is, in the values of the elements of a
sequence In mathematics, a sequence is an enumerated collection of objects in which repetitions are allowed and order matters. Like a set, it contains members (also called ''elements'', or ''terms''). The number of elements (possibly infinite) is cal ...
. The polynomial's linearity means that each of its terms has degree 0 or 1. A linear recurrence denotes the evolution of some variable over time, with the current
time period The categorization of the past into discrete, quantified named blocks of time is called periodization.Adam Rabinowitz. And king It’s about time: historical periodization and Linked Ancient World Data''. Study of the Ancient universe Papers, 2 ...
or discrete moment in time denoted as , one period earlier denoted as , one period later as , etc. The ''
solution Solution may refer to: * Solution (chemistry), a mixture where one substance is dissolved in another * Solution (equation), in mathematics ** Numerical solution, in numerical analysis, approximate solutions within specified error bounds * Solu ...
'' of such an equation is a function of , and not of any iterate values, giving the value of the iterate at any time. To find the solution it is necessary to know the specific values (known as ''
initial condition In mathematics and particularly in dynamic systems, an initial condition, in some contexts called a seed value, is a value of an evolving variable at some point in time designated as the initial time (typically denoted ''t'' = 0). Fo ...
s'') of of the iterates, and normally these are the iterates that are oldest. The equation or its variable is said to be ''
stable A stable is a building in which working animals are kept, especially horses or oxen. The building is usually divided into stalls, and may include storage for equipment and feed. Styles There are many different types of stables in use tod ...
'' if from any set of initial conditions the variable's limit as time goes to infinity exists; this limit is called the ''
steady state In systems theory, a system or a process is in a steady state if the variables (called state variables) which define the behavior of the system or the process are unchanging in time. In continuous time, this means that for those properties ''p' ...
''. Difference equations are used in a variety of contexts, such as in
economics Economics () is a behavioral science that studies the Production (economics), production, distribution (economics), distribution, and Consumption (economics), consumption of goods and services. Economics focuses on the behaviour and interac ...
to model the evolution through time of variables such as
gross domestic product Gross domestic product (GDP) is a monetary measure of the total market value of all the final goods and services produced and rendered in a specific time period by a country or countries. GDP is often used to measure the economic performanc ...
, the
inflation rate In economics, inflation is an increase in the average price of goods and services in terms of money. This increase is measured using a price index, typically a consumer price index (CPI). When the general price level rises, each unit of curre ...
, the
exchange rate In finance, an exchange rate is the rate at which one currency will be exchanged for another currency. Currencies are most commonly national currencies, but may be sub-national as in the case of Hong Kong or supra-national as in the case of ...
, etc. They are used in modeling such
time series In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. ...
because values of these variables are only measured at discrete intervals. In
econometric Econometrics is an application of statistical methods to economic data in order to give empirical content to economic relationships. M. Hashem Pesaran (1987). "Econometrics", '' The New Palgrave: A Dictionary of Economics'', v. 2, p. 8 p. 8� ...
applications, linear difference equations are modeled with stochastic terms in the form of autoregressive (AR) models and in models such as
vector autoregression Vector autoregression (VAR) is a statistical model used to capture the relationship between multiple quantities as they change over time. VAR is a type of stochastic process model. VAR models generalize the single-variable (univariate) autoregres ...
(VAR) and
autoregressive moving average In the statistical analysis of time series, autoregressive–moving-average (ARMA) models are a way to describe a (weakly) stationary stochastic process using autoregression (AR) and a moving average (MA), each with a polynomial. They are a too ...
(ARMA) models that combine AR with other features.


Definitions

A linear recurrence with constant coefficients is an equation of the following form, written in terms of
parameters A parameter (), generally, is any characteristic that can help in defining or classifying a particular system (meaning an event, project, object, situation, etc.). That is, a parameter is an element of a system that is useful, or critical, when ...
and : y_t = a_1y_ + \cdots + a_ny_ + b, or equivalently as y_= a_1y_ + \cdots + a_ny_t + b. The positive integer n is called the order of the recurrence and denotes the longest time lag between iterates. The equation is called homogeneous if and nonhomogeneous if . If the equation is homogeneous, the coefficients determine the
characteristic polynomial In linear algebra, the characteristic polynomial of a square matrix is a polynomial which is invariant under matrix similarity and has the eigenvalues as roots. It has the determinant and the trace of the matrix among its coefficients. The ...
(also "auxiliary polynomial" or "companion polynomial") p(\lambda)= \lambda^n - a_1\lambda^ - a_2\lambda^-\cdots-a_ whose roots play a crucial role in finding and understanding the sequences satisfying the recurrence.


Conversion to homogeneous form

If , the equation y_t = a_1y_ + \cdots + a_ny_ + b is said to be ''nonhomogeneous''. To solve this equation it is convenient to convert it to homogeneous form, with no constant term. This is done by first finding the equation's ''steady state value''—a value such that, if successive iterates all had this value, so would all future values. This value is found by setting all values of equal to in the difference equation, and solving, thus obtaining y^* = \frac assuming the denominator is not 0. If it is zero, the steady state does not exist. Given the steady state, the difference equation can be rewritten in terms of deviations of the iterates from the steady state, as \left(y_t -y^*\right)= a_1\left(y_-y^*\right) + \cdots + a_n\left(y_-y^*\right) which has no constant term, and which can be written more succinctly as x_t= a_1x_ + \cdots + a_nx_ where equals . This is the homogeneous form. If there is no steady state, the difference equation y_t = a_1y_ + \cdots + a_ny_ + b can be combined with its equivalent form y_= a_1y_+ \cdots + a_ny_+ b to obtain (by solving both for ) y_t - a_1y_ - \cdots - a_ny_ = y_- a_1y_- \cdots - a_ny_ in which like terms can be combined to give a homogeneous equation of one order higher than the original.


Solution example for small orders

The roots of the characteristic polynomial play a crucial role in finding and understanding the sequences satisfying the recurrence. If there are d distinct roots r_1, r_2, \ldots, r_d, then each solution to the recurrence takes the form a_n = k_1 r_1^n + k_2 r_2^n + \cdots + k_d r_d^n, where the coefficients k_i are determined in order to fit the initial conditions of the recurrence. When the same roots occur multiple times, the terms in this formula corresponding to the second and later occurrences of the same root are multiplied by increasing powers of n. For instance, if the characteristic polynomial can be factored as (x-r)^3, with the same root r occurring three times, then the solution would take the form a_n = k_1 r^n + k_2 n r^n + k_3 n^2 r^n.


Order 1

For order 1, the recurrence a_=r a_ has the solution a_n = r^n with a_0 = 1 and the most general solution is a_n = k r^n with a_0 = k. The characteristic polynomial equated to zero (the characteristic equation) is simply t - r = 0.


Order 2

Solutions to such recurrence relations of higher order are found by systematic means, often using the fact that a_n = r^n is a solution for the recurrence exactly when t = r is a root of the characteristic polynomial. This can be approached directly or using
generating function In mathematics, a generating function is a representation of an infinite sequence of numbers as the coefficients of a formal power series. Generating functions are often expressed in closed form (rather than as a series), by some expression invo ...
s (
formal power series In mathematics, a formal series is an infinite sum that is considered independently from any notion of convergence, and can be manipulated with the usual algebraic operations on series (addition, subtraction, multiplication, division, partial su ...
) or matrices. Consider, for example, a recurrence relation of the form a_ = Aa_+Ba_. When does it have a solution of the same general form as a_n = r^n? Substituting this guess (
ansatz In physics and mathematics, an ansatz (; , meaning: "initial placement of a tool at a work piece", plural ansatzes or, from German, ansätze ; ) is an educated guess or an additional assumption made to help solve a problem, and which may later be ...
) in the recurrence relation, we find that r^=Ar^+Br^ must be true for all n > 1. Dividing through by r^, we get that all these equations reduce to the same thing: \begin r^2 &= Ar + B, \\ r^2 - Ar - B &= 0, \end which is the characteristic equation of the recurrence relation. Solve for r to obtain the two roots \lambda_1, \lambda_2: these roots are known as the
characteristic root In linear algebra, an eigenvector ( ) or characteristic vector is a vector that has its direction unchanged (or reversed) by a given linear transformation. More precisely, an eigenvector \mathbf v of a linear transformation T is scaled by a c ...
s or eigenvalues of the characteristic equation. Different solutions are obtained depending on the nature of the roots: If these roots are distinct, we have the general solution a_n = C\lambda_1^n+D\lambda_2^n while if they are identical (when A^2 + 4 B = 0), we have a_n = C\lambda^n + D n\lambda^n This is the most general solution; the two constants C and D can be chosen based on two given initial conditions a_0 and a_1 to produce a specific solution. In the case of complex eigenvalues (which also gives rise to complex values for the solution parameters C and D), the use of complex numbers can be eliminated by rewriting the solution in trigonometric form. In this case we can write the eigenvalues as \lambda_1, \lambda_2 = \alpha \pm \beta i. Then it can be shown that a_n = C\lambda_1^n + D\lambda_2^n can be rewritten as a_n = 2 M^n \left( E \cos(\theta n) + F \sin(\theta n)\right) = 2 G M^n \cos(\theta n - \delta), where \begin M = \sqrt & \cos (\theta) =\tfrac & \sin( \theta) = \tfrac \\ C,D = E \mp F i & & \\ G = \sqrt & \cos (\delta ) = \tfrac & \sin (\delta )= \tfrac \end Here E and F (or equivalently, G and \delta) are real constants which depend on the initial conditions. Using \lambda_1+\lambda_2=2 \alpha = A, \lambda_1 \cdot \lambda_2=\alpha^2+\beta^2=-B, one may simplify the solution given above as a_n = (-B)^ \left( E \cos(\theta n) + F \sin(\theta n)\right), where a_1 and a_2 are the initial conditions and \begin E &= \frac \\ F &=-i \frac \\ \theta &=\arccos \left (\frac \right ) \end In this way there is no need to solve for \lambda_1 and \lambda_2. In all cases—real distinct eigenvalues, real duplicated eigenvalues, and complex conjugate eigenvalues—the equation is
stable A stable is a building in which working animals are kept, especially horses or oxen. The building is usually divided into stalls, and may include storage for equipment and feed. Styles There are many different types of stables in use tod ...
(that is, the variable a converges to a fixed value pecifically, zero if and only if ''both'' eigenvalues are smaller than one in
absolute value In mathematics, the absolute value or modulus of a real number x, is the non-negative value without regard to its sign. Namely, , x, =x if x is a positive number, and , x, =-x if x is negative (in which case negating x makes -x positive), ...
. In this second-order case, this condition on the eigenvalues can be shown to be equivalent to , A, < 1 - B < 2, which is equivalent to , B, < 1 and , A, < 1 - B.


General solution


Characteristic polynomial and roots

Solving the homogeneous equation x_t= a_1x_ + \cdots + a_nx_ involves first solving its characteristic polynomial \lambda^n = a_1\lambda^ + \cdots + a_\lambda^2+a_ \lambda + a_n for its characteristic roots . These roots can be solved for algebraically if , but not necessarily otherwise. If the solution is to be used numerically, all the roots of this characteristic equation can be found by
numerical methods Numerical analysis is the study of algorithms that use numerical approximation (as opposed to symbolic manipulations) for the problems of mathematical analysis (as distinguished from discrete mathematics). It is the study of numerical methods t ...
. However, for use in a theoretical context it may be that the only information required about the roots is whether any of them are greater than or equal to 1 in
absolute value In mathematics, the absolute value or modulus of a real number x, is the non-negative value without regard to its sign. Namely, , x, =x if x is a positive number, and , x, =-x if x is negative (in which case negating x makes -x positive), ...
. It may be that all the roots are real or instead there may be some that are
complex number In mathematics, a complex number is an element of a number system that extends the real numbers with a specific element denoted , called the imaginary unit and satisfying the equation i^= -1; every complex number can be expressed in the for ...
s. In the latter case, all the complex roots come in
complex conjugate In mathematics, the complex conjugate of a complex number is the number with an equal real part and an imaginary part equal in magnitude but opposite in sign. That is, if a and b are real numbers, then the complex conjugate of a + bi is a - ...
pairs.


Solution with distinct characteristic roots

If all the characteristic roots are distinct, the solution of the homogeneous linear recurrence x_t= a_1x_ + \cdots + a_nx_ can be written in terms of the characteristic roots as x_t=c_1\lambda_1^t +\cdots + c_n\lambda_n^t where the coefficients can be found by invoking the initial conditions. Specifically, for each time period for which an iterate value is known, this value and its corresponding value of can be substituted into the solution equation to obtain a linear equation in the as-yet-unknown parameters; such equations, one for each initial condition, can be solved simultaneously for the parameter values. If all characteristic roots are real, then all the coefficient values will also be real; but with non-real complex roots, in general some of these coefficients will also be non-real.


Converting complex solution to trigonometric form

If there are complex roots, they come in conjugate pairs and so do the complex terms in the solution equation. If two of these complex terms are and , the roots can be written as \lambda_j, \lambda_=\alpha \pm \beta i =M\left(\frac \pm \fraci\right) where is the
imaginary unit The imaginary unit or unit imaginary number () is a mathematical constant that is a solution to the quadratic equation Although there is no real number with this property, can be used to extend the real numbers to what are called complex num ...
and is the modulus of the roots: M = \sqrt. Then the two complex terms in the solution equation can be written as \begin c_j\lambda_j^t + c_\lambda_^t & = M^t\left(c_j\left(\frac + \fraci\right)^t + c_\left(\frac - \fraci\right)^t\right) \\ pt& = M^t\left(c_j\left(\cos\theta + i\sin\theta\right)^t + c_\left(\cos \theta - i\sin\theta\right)^t\right) \\ pt& = M^t\bigl(c_j\left(\cos\theta t + i\sin \theta t\right) + c_\left(\cos\theta t - i\sin\theta t\right) \bigr) \end where is the angle whose cosine is and whose sine is ; the last equality here made use of de Moivre's formula. Now the process of finding the coefficients and guarantees that they are also complex conjugates, which can be written as . Using this in the last equation gives this expression for the two complex terms in the solution equation: 2M^t\left(\gamma \cos\theta t - \delta \sin\theta t\right) which can also be written as 2\sqrtM^t \cos(\theta t + \psi) where is the angle whose cosine is and whose sine is .


Cyclicity

Depending on the initial conditions, even with all roots real the iterates can experience a transitory tendency to go above and below the steady state value. But true cyclicity involves a permanent tendency to fluctuate, and this occurs if there is at least one pair of complex conjugate characteristic roots. This can be seen in the trigonometric form of their contribution to the solution equation, involving and .


Solution with duplicate characteristic roots

In the second-order case, if the two roots are identical (), they can both be denoted as and a solution may be of the form x_t = c_1 \lambda^t + c_2 t \lambda^t.


Solution by conversion to matrix form

An alternative solution method involves converting the th order difference equation to a first-order
matrix difference equation A matrix difference equation is a difference equation in which the value of a vector (or sometimes, a matrix) of variables at one point in time is related to its own value at one or more previous points in time, using matrices. The order of the e ...
. This is accomplished by writing , , , and so on. Then the original single th-order equation y_t = a_1y_ + a_2y_ + \cdots + a_ny_ + b can be replaced by the following first-order equations: \begin w_ & = a_1w_ + a_2w_+\cdots + a_nw_ + b \\ w_ & = w_ \\ & \,\,\,\vdots \\ w_ & =w_. \end Defining the vector as \mathbf_i = \beginw_ \\ w_ \\ \vdots \\ w_ \end this can be put in matrix form as \mathbf_t = \mathbf\mathbf_+\mathbf Here is an matrix in which the first row contains and all other rows have a single 1 with all other elements being 0, and is a column vector with first element and with the rest of its elements being 0. This matrix equation can be solved using the methods in the article
Matrix difference equation A matrix difference equation is a difference equation in which the value of a vector (or sometimes, a matrix) of variables at one point in time is related to its own value at one or more previous points in time, using matrices. The order of the e ...
. In the homogeneous case is a para-permanent of a lower triangular matrix


Solution using generating functions

The recurrence y_t = a_1y_ + \cdots + a_ny_ + b, can be solved using the theory of
generating function In mathematics, a generating function is a representation of an infinite sequence of numbers as the coefficients of a formal power series. Generating functions are often expressed in closed form (rather than as a series), by some expression invo ...
s. First, we write Y(x) = \sum_ y_t x^t. The recurrence is then equivalent to the following generating function equation: Y(x) = a_1xY(x) + a_2x^2Y(x) + \cdots + a_nx^nY(x) + \frac + p(x) where p(x) is a polynomial of degree at most n-1 correcting the initial terms. From this equation we can solve to get Y(x) = \left(\frac + p(x)\right) \cdot \frac. In other words, not worrying about the exact coefficients, Y(x) can be expressed as a
rational function In mathematics, a rational function is any function that can be defined by a rational fraction, which is an algebraic fraction such that both the numerator and the denominator are polynomials. The coefficients of the polynomials need not be ...
Y(x) = \frac. The closed form can then be derived via
partial fraction decomposition In algebra, the partial fraction decomposition or partial fraction expansion of a rational fraction (that is, a fraction such that the numerator and the denominator are both polynomials) is an operation that consists of expressing the fraction as ...
. Specifically, if the generating function is written as \frac = \sum_i \frac then the polynomial p(x) determines the initial set of corrections z(n), the denominator (x - r_i)^m determines the exponential term r_i^n, and the degree m together with the numerator f_i(x) determine the polynomial coefficient k_i(n).


Relation to solution to differential equations

The method for solving linear differential equations is similar to the method above—the "intelligent guess" (
ansatz In physics and mathematics, an ansatz (; , meaning: "initial placement of a tool at a work piece", plural ansatzes or, from German, ansätze ; ) is an educated guess or an additional assumption made to help solve a problem, and which may later be ...
) for linear differential equations with constant coefficients is e^ where \lambda is a complex number that is determined by substituting the guess into the differential equation. This is not a coincidence. Considering the
Taylor series In mathematics, the Taylor series or Taylor expansion of a function is an infinite sum of terms that are expressed in terms of the function's derivatives at a single point. For most common functions, the function and the sum of its Taylor ser ...
of the solution to a linear differential equation: \sum_^\infin \frac (x-a)^n it can be seen that the coefficients of the series are given by the n-th derivative of f(x) evaluated at the point a. The differential equation provides a linear difference equation relating these coefficients. This equivalence can be used to quickly solve for the recurrence relationship for the coefficients in the power series solution of a linear differential equation. The rule of thumb (for equations in which the polynomial multiplying the first term is non-zero at zero) is that: y^ \to f +k/math> and more generally x^m*y^ \to n(n-1)...(n-m+1)f +k-m/math> Example: The recurrence relationship for the Taylor series coefficients of the equation: (x^2 + 3x -4)y^ -(3x+1)y^ + 2y = 0 is given by n(n-1)f +1+ 3nf +2-4f +3-3nf +1-f +2 2f = 0 or -4f +3+2nf +2+ n(n-4)f +1+2f = 0. This example shows how problems generally solved using the power series solution method taught in normal differential equation classes can be solved in a much easier way. Example: The differential equation ay'' + by' +cy = 0 has solution y=e^. The conversion of the differential equation to a difference equation of the Taylor coefficients is af + 2+ bf + 1+ cf = 0. It is easy to see that the n-th derivative of e^ evaluated at 0 is a^n.


Solving with z-transforms

Certain difference equations - in particular, linear constant coefficient difference equations - can be solved using
z-transform In mathematics and signal processing, the Z-transform converts a discrete-time signal, which is a sequence of real or complex numbers, into a complex valued frequency-domain (the z-domain or z-plane) representation. It can be considered a dis ...
s. The ''z''-transforms are a class of
integral transform In mathematics, an integral transform is a type of transform that maps a function from its original function space into another function space via integration, where some of the properties of the original function might be more easily charac ...
s that lead to more convenient algebraic manipulations and more straightforward solutions. There are cases in which obtaining a direct solution would be all but impossible, yet solving the problem via a thoughtfully chosen integral transform is straightforward.


Stability

In the solution equation x_t = c_1\lambda_1^t +\cdots + c_n\lambda_n^t, a term with real characteristic roots converges to 0 as grows indefinitely large if the absolute value of the characteristic root is less than 1. If the absolute value equals 1, the term will stay constant as grows if the root is +1 but will fluctuate between two values if the root is −1. If the absolute value of the root is greater than 1 the term will become larger and larger over time. A pair of terms with complex conjugate characteristic roots will converge to 0 with dampening fluctuations if the absolute value of the modulus of the roots is less than 1; if the modulus equals 1 then constant amplitude fluctuations in the combined terms will persist; and if the modulus is greater than 1, the combined terms will show fluctuations of ever-increasing magnitude. Thus the evolving variable will converge to 0 if all of the characteristic roots have magnitude less than 1. If the largest root has absolute value 1, neither convergence to 0 nor divergence to infinity will occur. If all roots with magnitude 1 are real and positive, will converge to the sum of their constant terms ; unlike in the stable case, this converged value depends on the initial conditions; different starting points lead to different points in the long run. If any root is −1, its term will contribute permanent fluctuations between two values. If any of the unit-magnitude roots are complex then constant-amplitude fluctuations of will persist. Finally, if any characteristic root has magnitude greater than 1, then will diverge to infinity as time goes to infinity, or will fluctuate between increasingly large positive and negative values. A theorem of
Issai Schur Issai Schur (10 January 1875 – 10 January 1941) was a Russian mathematician who worked in Germany for most of his life. He studied at the Humboldt University of Berlin, University of Berlin. He obtained his doctorate in 1901, became lecturer i ...
states that all roots have magnitude less than 1 (the stable case) if and only if a particular string of
determinant In mathematics, the determinant is a Scalar (mathematics), scalar-valued function (mathematics), function of the entries of a square matrix. The determinant of a matrix is commonly denoted , , or . Its value characterizes some properties of the ...
s are all positive. If a non-homogeneous linear difference equation has been converted to homogeneous form which has been analyzed as above, then the stability and cyclicality properties of the original non-homogeneous equation will be the same as those of the derived homogeneous form, with convergence in the stable case being to the steady-state value instead of to 0.


See also

*
Recurrence relation In mathematics, a recurrence relation is an equation according to which the nth term of a sequence of numbers is equal to some combination of the previous terms. Often, only k previous terms of the sequence appear in the equation, for a parameter ...
*
Linear differential equation In mathematics, a linear differential equation is a differential equation that is linear equation, linear in the unknown function and its derivatives, so it can be written in the form a_0(x)y + a_1(x)y' + a_2(x)y'' \cdots + a_n(x)y^ = b(x) wher ...
*
Skolem–Mahler–Lech theorem In additive and algebraic number theory, the Skolem–Mahler–Lech theorem states that if a sequence of numbers satisfies a linear difference equation, then with finitely many exceptions the positions at which the sequence is zero form a regularl ...
*
Skolem problem In mathematics, the Skolem problem is the problem of determining whether the values of a constant-recursive sequence include the number zero. The problem can be formulated for recurrences over different types of numbers, including integers, rati ...


References

{{reflist Combinatorics Dynamical systems Linear algebra Recurrence relations