In
time series
In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. Exa ...
data, seasonality is the presence of variations that occur at specific regular intervals less than a year, such as weekly, monthly, or quarterly. Seasonality may be caused by various factors, such as weather, vacation, and holidays and consists of periodic, repetitive, and generally regular and predictable patterns in the levels of a time series.
Seasonal fluctuations in a time series can be contrasted with cyclical patterns. The latter occur when the data exhibits rises and falls that are not of a fixed period. Such non-seasonal fluctuations are usually due to economic conditions and are often related to the "business cycle"; their period usually extends beyond a single year, and the fluctuations are usually of at least two years.
Organisations facing seasonal variations, such as ice-cream vendors, are often interested in knowing their performance relative to the normal seasonal variation. Seasonal variations in the labour market can be attributed to the entrance of school leavers into the job market as they aim to contribute to the workforce upon the completion of their schooling. These regular changes are of less interest to those who study employment data than the variations that occur due to the underlying state of the economy; their focus is on how unemployment in the workforce has changed, despite the impact of the regular seasonal variations.
It is necessary for organisations to identify and measure seasonal variations within their market to help them plan for the future. This can prepare them for the temporary increases or decreases in labour requirements and inventory as demand for their product or service fluctuates over certain periods. This may require training, periodic maintenance, and so forth that can be organized in advance. Apart from these considerations, the organisations need to know if variation they have experienced has been more or less than the expected amount, beyond what the usual seasonal variations account for.
Motivation
There are several main reasons for studying seasonal variation:
:*The description of the seasonal effect provides a better understanding of the impact this component has upon a particular series.
:*After establishing the seasonal pattern, methods can be implemented to eliminate it from the time-series to study the effect of other components such as
cyclical and irregular variations. This elimination of the seasonal effect is referred to as de-seasonalizing or
seasonal adjustment
Seasonal adjustment or deseasonalization is a statistical method for removing the seasonal component of a time series. It is usually done when wanting to analyse the trend, and cyclical deviations from trend, of a time series independently of the ...
of data.
:*To use the past patterns of the seasonal variations to contribute to forecasting and the prediction of the future trends, such as in
climate normal
Climatological normal or climate normal (CN) is a 30-year average of a weather variable for a given time of year.[graphical technique
Statistical graphics, also known as statistical graphical techniques, are graphics used in the field of statistics for data visualization.
Overview
Whereas statistics and data analysis procedures generally yield their output in numeric or ...]
s can be used to detect seasonality:
* A
run sequence plot
A run chart, also known as a run-sequence plot is a graph that displays observed data in a time sequence. Often, the data displayed represent some aspect of the output or performance of a manufacturing or other business process. It is therefore ...
will often show seasonality
* A seasonal plot will show the data from each season overlapped
* A
seasonal subseries plot
Seasonal subseries plots are a graphical tool to visualize and detect seasonality in a time series. Seasonal subseries plots involves the extraction of the seasons from a time series into a subseries. Based on a selected periodicity, it is an alte ...
is a specialized technique for showing seasonality
* Multiple
box plot
In descriptive statistics, a box plot or boxplot is a method for graphically demonstrating the locality, spread and skewness groups of numerical data through their quartiles. In addition to the box on a box plot, there can be lines (which are ca ...
s can be used as an alternative to the seasonal subseries plot to detect seasonality
* An
autocorrelation plot (ACF) and a spectral plot can help identify seasonality.
A really good way to find periodicity, including seasonality, in any regular series of data is to remove any overall trend first and then to inspect time periodicity.
The run sequence plot is a recommended first step for analyzing any time series. Although seasonality can sometimes be indicated by this plot, seasonality is shown more clearly by the seasonal subseries plot or the box plot. The seasonal subseries plot does an excellent job of showing both the seasonal differences (between group patterns) and also the within-group patterns. The box plot shows the seasonal difference (between group patterns) quite well, but it does not show within group patterns. However, for large data sets, the box plot is usually easier to read than the seasonal subseries plot.
The seasonal plot, seasonal subseries plot, and the box plot all assume that the seasonal periods are known. In most cases, the analyst will in fact, know this. For example, for monthly data, the period is 12 since there are 12 months in a year. However, if the period is not known, the autocorrelation plot can help. If there is significant seasonality, the autocorrelation plot should show spikes at lags equal to the period. For example, for monthly data, if there is a seasonality effect, we would expect to see significant peaks at lag 12, 24, 36, and so on (although the intensity may decrease the further out we go).
An autocorrelation plot (ACF) can be used to identify seasonality, as it calculates the difference (residual amount) between a Y value and a lagged value of Y. The result gives some points where the two values are close together ( no seasonality ), but other points where there is a large discrepancy. These points indicate a level of seasonality in the data.
Semiregular cyclic variations might be dealt with by
spectral density estimation
In statistical signal processing, the goal of spectral density estimation (SDE) or simply spectral estimation is to estimate the spectral density (also known as the power spectral density) of a signal from a sequence of time samples of the signa ...
.
Calculation
Seasonal variation is measured in terms of an index, called a seasonal index. It is an average that can be used to compare an actual observation relative to what it would be if there were no seasonal variation. An index value is attached to each period of the time series within a year. This implies that if monthly data are considered there are 12 separate seasonal indices, one for each month. The following methods use seasonal indices to measure seasonal variations of a time-series data.
:*Method of simple averages
:*Ratio to
trend
A fad or trend is any form of collective behavior that develops within a culture, a generation or social group in which a group of people enthusiastically follow an impulse (psychology), impulse for a short period.
Fads are objects or behavior ...
method
:*Ratio-to-
moving-average method
:*Link relatives method
Method of simple averages
The measurement of seasonal variation by using the ratio-to-moving-average method provides an index to measure the degree of the seasonal variation in a time series. The index is based on a mean of 100, with the degree of seasonality measured by variations away from the base. For example, if we observe the hotel rentals in a winter resort, we find that the winter quarter index is 124. The value 124 indicates that 124 percent of the average quarterly rental occur in winter. If the hotel management records 1436 rentals for the whole of last year, then the average quarterly rental would be 359= (1436/4). As the winter-quarter index is 124, we estimate the number of winter rentals as follows:
359*(124/100)=445;
Here, 359 is the average quarterly rental. 124 is the winter-quarter index. 445 the seasonalized winter-quarter rental.
This method is also called the percentage
moving average
In statistics, a moving average (rolling average or running average) is a calculation to analyze data points by creating a series of averages of different subsets of the full data set. It is also called a moving mean (MM) or rolling mean and is ...
method. In this method, the original data values in the time-series are expressed as percentages of moving averages. The steps and the tabulations are given below.
Ratio to trend method
# Find the centered 12 monthly (or 4 quarterly) moving averages of the original data values in the
time-series
In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. Exa ...
.
#:
# Express each original data value of the time-series as a percentage of the corresponding centered
moving average
In statistics, a moving average (rolling average or running average) is a calculation to analyze data points by creating a series of averages of different subsets of the full data set. It is also called a moving mean (MM) or rolling mean and is ...
values obtained in step(1). In other words, in a multiplicative time-series model, we get (Original data values) / (Trend values) × 100 = ( × × × ) / ( × ) × 100 = ( × ) × 100.
This implies that the ratio–to-moving average represents the seasonal and irregular components.
#:
#:
# Arrange these percentages according to months or quarter of given years. Find the averages over all months or quarters of the given years.
#:
# If the sum of these indices is not 1200 (or 400 for quarterly figures), multiply then by a correction factor = 1200 / (sum of monthly indices). Otherwise, the 12 monthly averages will be considered as seasonal indices.
Ratio-to-moving-average method
Let us calculate the seasonal index by the ratio-to-moving-average method from the following data:
Now calculations for 4 quarterly moving averages and ratio-to-moving-averages are shown in the below table.
Now the total of seasonal averages is 398.85. Therefore, the corresponding correction factor would be 400/398.85 = 1.00288. Each seasonal average is multiplied by the correction factor 1.00288 to get the adjusted seasonal indices as shown in the above table.
Link relatives method
1. In an additive time-series model, the seasonal component is estimated as:
: = – ( + + )
where
: : Seasonal values
:
: : Actual data values of the time-series
:
: :
Trend
A fad or trend is any form of collective behavior that develops within a culture, a generation or social group in which a group of people enthusiastically follow an impulse (psychology), impulse for a short period.
Fads are objects or behavior ...
values
:
: : Cyclical values
:
: : Irregular values.
2. In a multiplicative time-series model, the seasonal component is expressed in terms of ratio and percentage as
: ''Seasonal effect''
;
However, in practice the detrending of time-series is done to arrive at
.
This is done by dividing both sides of
by trend values so that
.
3. The deseasonalized time-series data will have only trend ( ), cyclical ( ) and irregular ( ) components and is expressed as:
:*Multiplicative model :
:*
Additive model In statistics, an additive model (AM) is a nonparametric regression method. It was suggested by Jerome H. Friedman and Werner Stuetzle (1981) and is an essential part of the ACE algorithm. The ''AM'' uses a one-dimensional smoother to build a rest ...
: – = ( + + + ) – = + +
Modeling
A completely regular cyclic variation in a time series might be dealt with in
time series analysis
In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. Exa ...
by using a
sinusoidal model
In statistics, signal processing, and time series analysis, a sinusoidal model is used to approximate a sequence ''Yi'' to a sine function:
:Y_i = C + \alpha\sin(\omega T_i + \phi) + E_i
where ''C'' is constant defining a mean level, α is an ...
with one or more
sinusoid
A sine wave, sinusoidal wave, or just sinusoid is a mathematical curve defined in terms of the ''sine'' trigonometric function, of which it is the graph. It is a type of continuous wave and also a smooth periodic function. It occurs often in ma ...
s whose period-lengths may be known or unknown depending on the context. A less completely regular cyclic variation might be dealt with by using a special form of an
ARIMA
Arima, officially The Royal Chartered Borough of Arima is the easternmost and second largest in area of the three boroughs of Trinidad and Tobago. It is geographically adjacent to Sangre Grande and Arouca at the south central foothills of th ...
model which can be structured so as to treat cyclic variations semi-explicitly. Such models represent
cyclostationary process A cyclostationary process is a signal having statistical properties that vary cyclically with time.
A cyclostationary process can be viewed as multiple interleaved stationary processes. For example, the maximum daily temperature in New York City ca ...
es.
Another method of modelling periodic seasonality is the use of pairs of Fourier terms. Similar to using the sinusoidal model, Fourier terms added into regression models utilize sine and cosine terms in order to simulate seasonality. However, the seasonality of such a regression would be represented as the sum of sine or cosine terms, instead of a single sine or cosine term in a sinusoidal model. Every periodic function can be approximated with the inclusion of Fourier terms.
The difference between a sinusoidal model and a regression with Fourier terms can be simplified as below:
Sinusoidal Model:
:
Regression With Fourier Terms:
:
Seasonal adjustment
''Seasonal adjustment'' or ''deseasonalization'' is any method for removing the seasonal component of a
time series
In mathematics, a time series is a series of data points indexed (or listed or graphed) in time order. Most commonly, a time series is a sequence taken at successive equally spaced points in time. Thus it is a sequence of discrete-time data. Exa ...
. The resulting seasonally adjusted data are used, for example, when analyzing or reporting non-seasonal trends over durations rather longer than the seasonal period. An appropriate method for seasonal adjustment is chosen on the basis of a particular view taken of the
decomposition of time series
The decomposition of time series is a statistical task that deconstructs a time series into several components, each representing one of the underlying categories of patterns. There are two principal types of decomposition, which are outlined belo ...
into components designated with names such as "trend", "cyclic", "seasonal" and "irregular", including how these interact with each other. For example, such components might act additively or multiplicatively. Thus, if a seasonal component acts additively, the adjustment method has two stages:
*estimate the seasonal component of variation in the time series, usually in a form that has a zero mean across series;
*subtract the estimated seasonal component from the original time series, leaving the seasonally adjusted series:
.
If it is a multiplicative model, the magnitude of the seasonal fluctuations will vary with the level, which is more likely to occur with economic series.
When taking seasonality into account, the seasonally adjusted multiplicative decomposition can be written as
; whereby the original time series is divided by the estimated seasonal component.
The multiplicative model can be transformed into an additive model by taking the log of the time series;
SA Multiplicative decomposition:
Taking log of the time series of the multiplicative model:
One particular implementation of seasonal adjustment is provided by
X-12-ARIMA
X-13ARIMA-SEATS, successor to X-12-ARIMA and X-11, is a set of statistical methods for seasonal adjustment and other descriptive analysis of time series data that are implemented in the U.S. Census Bureau's software package. These methods are or ...
.
In regression analysis
In
regression analysis
In statistical modeling, regression analysis is a set of statistical processes for estimating the relationships between a dependent variable (often called the 'outcome' or 'response' variable, or a 'label' in machine learning parlance) and one ...
such as
ordinary least squares
In statistics, ordinary least squares (OLS) is a type of linear least squares method for choosing the unknown parameters in a linear regression model (with fixed level-one effects of a linear function of a set of explanatory variables) by the prin ...
, with a seasonally varying
dependent variable
Dependent and independent variables are variables in mathematical modeling, statistical modeling and experimental sciences. Dependent variables receive this name because, in an experiment, their values are studied under the supposition or demand ...
being influenced by one or more
independent variable
Dependent and independent variables are variables in mathematical modeling, statistical modeling and experimental sciences. Dependent variables receive this name because, in an experiment, their values are studied under the supposition or demand ...
s, the seasonality can be accounted for and measured by including ''n''-1
dummy variables, one for each of the seasons except for an arbitrarily chosen reference season, where ''n'' is the number of seasons (e.g., 4 in the case of meteorological seasons, 12 in the case of months, etc.). Each dummy variable is set to 1 if the data point is drawn from the dummy's specified season and 0 otherwise. Then the predicted value of the dependent variable for the reference season is computed from the rest of the regression, while for any other season it is computed using the rest of the regression and by inserting the value 1 for the dummy variable for that season.
Related patterns
It is important to distinguish seasonal patterns from related patterns.
While a seasonal pattern occurs when a time series is affected by the
season
A season is a division of the year based on changes in weather, ecology, and the number of daylight hours in a given region. On Earth, seasons are the result of the axial parallelism of Earth's tilted orbit around the Sun. In temperate and pol ...
or the time of the year, such as annual, semiannual, quarterly, etc.
A cyclic pattern, or simply a cycle, occurs when the data exhibit rises and falls in other periods, i.e., much longer (e.g.,
decadal) or much shorter (e.g.,
week
A week is a unit of time equal to seven days. It is the standard time period used for short cycles of days in most parts of the world. The days are often used to indicate common work days and rest days, as well as days of worship. Weeks are ofte ...
ly) than seasonal.
A
quasiperiodicity
Quasiperiodicity is the property of a system that displays irregular periodicity. Periodic behavior is defined as recurring at regular intervals, such as "every 24 hours". Quasiperiodic behavior is a pattern of recurrence with a component of unpred ...
is a more general, irregular periodicity.
See also
*
Oscillation
Oscillation is the repetitive or periodic variation, typically in time, of some measure about a central value (often a point of equilibrium) or between two or more different states. Familiar examples of oscillation include a swinging pendulum ...
*
Periodic function
A periodic function is a function that repeats its values at regular intervals. For example, the trigonometric functions, which repeat at intervals of 2\pi radians, are periodic functions. Periodic functions are used throughout science to desc ...
*
Periodicity (disambiguation)
Periodicity or periodic may refer to:
Mathematics
* Bott periodicity theorem, addresses Bott periodicity: a modulo-8 recurrence relation in the homotopy groups of classical groups
* Periodic function, a function whose output contains values tha ...
*
Photoperiodism
Photoperiodism is the physiological reaction of organisms to the length of night or a dark period. It occurs in plants and animals. Plant photoperiodism can also be defined as the developmental responses of plants to the relative lengths of light a ...
References
*
*''Complete Business Statistics'' (Chapter 12) by Amir D. Aczel.
*''Business Statistics: Why and When'' (Chapter 15) by Larry E. Richards and Jerry J. Lacava.
*''Business Statistics'' (Chapter 16) by J.K. Sharma.
*''Business Statistics, a decision making approach'' (Chapter 18) by David F. Groebner and Patric W. Shannon.
*''Statistics for Management'' (Chapter 15) by Richard I. Levin and David S. Rubin.
*''Forecasting: practice and principles'' by Rob J. Hyndman and George Athansopoulos
External links
*
Seasonalityat NIST/SEMATECH e-Handbook of Statistical Methods
{{Authority control