HOME

TheInfoList



OR:

Quantile Regression Averaging (QRA) is a forecast combination approach to the computation of
prediction interval In statistical inference, specifically predictive inference, a prediction interval is an estimate of an interval in which a future observation will fall, with a certain probability, given what has already been observed. Prediction intervals are o ...
s. It involves applying
quantile regression Quantile regression is a type of regression analysis used in statistics and econometrics. Whereas the method of least squares estimates the conditional ''mean'' of the response variable across values of the predictor variables, quantile regressi ...
to the point forecasts of a small number of individual forecasting models or experts. It has been introduced in 2014 by Jakub Nowotarski and Rafał Weron and originally used for
probabilistic forecasting Probabilistic forecasting summarizes what is known about, or opinions about, future events. In contrast to single-valued forecasts (such as forecasting that the maximum temperature at a given site on a given day will be 23 degrees Celsius, or that t ...
of electricity prices and loads. Despite its simplicity it has been found to perform extremely well in practice - the top two performing teams in the ''price track'' of the
Global Energy Forecasting Competition The Global Energy Forecasting Competition (GEFCom) is a competition conducted by a team led by Dr. Tao Hong that invites submissions around the world for forecasting energy demand. GEFCom was first held in 2012 on Kaggle, and the second GEFCom was h ...
(GEFCom2014) used variants of QRA.


Introduction

The individual point forecasts are used as
independent variables Dependent and independent variables are variables in mathematical modeling, statistical modeling and experimental sciences. Dependent variables receive this name because, in an experiment, their values are studied under the supposition or deman ...
and the corresponding observed target variable as the
dependent variable Dependent and independent variables are variables in mathematical modeling, statistical modeling and experimental sciences. Dependent variables receive this name because, in an experiment, their values are studied under the supposition or demand ...
in a standard
quantile regression Quantile regression is a type of regression analysis used in statistics and econometrics. Whereas the method of least squares estimates the conditional ''mean'' of the response variable across values of the predictor variables, quantile regressi ...
setting. The Quantile Regression Averaging method yields an interval forecast of the target variable, but does not use the prediction intervals of the individual methods. One of the reasons for using point forecasts (and not interval forecasts) is their availability. For years, forecasters have focused on obtaining accurate point predictions. Computing probabilistic forecasts, on the other hand, is generally a much more complex task and has not been discussed in the literature nor developed by practitioners so extensively. Therefore, QRA may be found particularly attractive from a practical point of view as it allows to leverage existing development of point forecasting.


Computation

The
quantile regression Quantile regression is a type of regression analysis used in statistics and econometrics. Whereas the method of least squares estimates the conditional ''mean'' of the response variable across values of the predictor variables, quantile regressi ...
problem can be written as follows: Q_y(q, X_t) = X_t\beta_q, where Q_y(q , \cdot) is the conditional ''q''-th
quantile In statistics and probability, quantiles are cut points dividing the range of a probability distribution into continuous intervals with equal probabilities, or dividing the observations in a sample in the same way. There is one fewer quantile th ...
of the dependent variable (y_t), X_t= ,\hat_,...,\hat_/math> is a vector of point forecasts of m individual models (i.e. independent variables) and ''βq'' is a vector of parameters (for quantile ''q''). The parameters are estimated by minimizing the loss function for a particular ''q''-th quantile: \min\limits_ \left y_t - X_t\beta_q , + \sum\limits_ (1-q), y_t - X_t\beta_q , \right= \min\limits_ \left \sum\limits_(q - \mathbf_) (y_t - X_t\beta_q ) \right/math> QRA assigns weights to individual forecasting methods and combines them to yield forecasts of chosen quantiles. Although the QRA method is based on quantile regression, not
least squares The method of least squares is a standard approach in regression analysis to approximate the solution of overdetermined systems (sets of equations in which there are more equations than unknowns) by minimizing the sum of the squares of the res ...
, it still suffers from the same problems: the exogenous variables should not be correlated strongly and the number of variables included in the model has to be relatively small in order for the method to be computationally efficient.


Factor Quantile Regression Averaging (FQRA)

The main difficulty associated with applying QRA comes from the fact that only individual models that perform well and (preferably) are distinct should be used. However, there may be many well performing models or many different specifications of each model (with or without exogenous variables, with all or only selected lags, etc.) and it may not be optimal to include all of them in Quantile Regression Averaging. In Factor Quantile Regression Averaging (FQRA), instead of selecting individual models ''a priori'', the relevant information contained in all forecasting models at hand is extracted using
principal component analysis Principal component analysis (PCA) is a popular technique for analyzing large datasets containing a high number of dimensions/features per observation, increasing the interpretability of data while preserving the maximum amount of information, and ...
(PCA). The
prediction interval In statistical inference, specifically predictive inference, a prediction interval is an estimate of an interval in which a future observation will fall, with a certain probability, given what has already been observed. Prediction intervals are o ...
s are then constructed on the basis of the common factors (f_t) obtained from the panel of point forecasts, as independent variables in a quantile regression. More precisely, in the FQRA method X_t= ,\hat_,...,\hat_/math> is a vector of k factors extracted from a panel of point forecasts of m individual models, not a vector of point forecasts of the individual models themselves. A similar principal component-type approach was proposed in the context of obtaining point forecasts from the
Survey of Professional Forecasters The Survey of Professional Forecasters (SPF) is a quarterly survey of macroeconomic forecasts for the economy of the United States issued by the Federal Reserve Bank of Philadelphia. It is the oldest such survey in the United States. The survey i ...
data. Instead of considering a (large) panel of forecasts of the individual models, FQRA concentrates on a small number of common factors, which - by construction - are orthogonal to each other, and hence are contemporaneously uncorrelated. FQRA can be also interpreted as a forecast averaging approach. The factors estimated within PCA are linear combinations of individual vectors of the panel and FQRA can therefore be used to assign weights to the forecasting models directly.


QRA and LAD regression

QRA may be viewed as an extension of combining point forecasts. The well-known
ordinary least squares In statistics, ordinary least squares (OLS) is a type of linear least squares method for choosing the unknown parameters in a linear regression model (with fixed level-one effects of a linear function of a set of explanatory variables) by the prin ...
(OLS) averaging uses linear regression to estimate weights of the point forecasts of individual models. Replacing the quadratic loss function with the absolute loss function leads to quantile regression for the median, or in other words, least absolute deviation (LAD) regression.{{Cite journal, title = An empirical comparison of alternative schemes for combining electricity spot price forecasts, journal = Energy Economics, date = 2014, pages = 395–412, volume = 46, doi = 10.1016/j.eneco.2014.07.014, first = Jakub, last = Nowotarski, first2 = Eran, last2 = Raviv, first3 = Stefan, last3 = Trück, first4 = Rafał, last4 = Weron


See also

*
Consensus forecast Used in a number of sciences, ranging from econometrics to meteorology, consensus forecasts are predictions of the future that are created by combining together several separate forecasts which have often been created using different methodologies ...
'', also known as combining forecasts'', ''forecast averaging'' or ''model averaging'' (in econometrics and statistics) and ''
committee machine A committee machine is a type of artificial neural network using a divide and conquer strategy in which the responses of multiple neural networks (experts) are combined into a single response.HAYKIN, S. Neural Networks - A Comprehensive Foundation. ...
s'', ''
ensemble averaging In machine learning, particularly in the creation of artificial neural networks, ensemble averaging is the process of creating multiple models and combining them to produce a desired output, as opposed to creating just one model. Frequently an ens ...
'' or ''expert aggregation'' (in machine learning) *
Electricity price forecasting Electricity price forecasting (EPF) is a branch of energy forecasting which focuses on predicting the spot and forward prices in wholesale electricity markets. Over the last 15 years electricity price forecasts have become a fundamental input to e ...
*
Energy forecasting Energy forecasting includes forecasting demand ( load) and price of electricity, fossil fuels (natural gas, oil, coal) and renewable energy sources (RES; hydro, wind, solar). Forecasting can be both expected price value and probabilistic forecasti ...
*
Forecasting Forecasting is the process of making predictions based on past and present data. Later these can be compared (resolved) against what happens. For example, a company might estimate their revenue in the next year, then compare it against the actual ...
*
Global Energy Forecasting Competition The Global Energy Forecasting Competition (GEFCom) is a competition conducted by a team led by Dr. Tao Hong that invites submissions around the world for forecasting energy demand. GEFCom was first held in 2012 on Kaggle, and the second GEFCom was h ...
s *
Economic forecasting Economic forecasting is the process of making predictions about the economy. Forecasts can be carried out at a high level of aggregation—for example for GDP, inflation, unemployment or the fiscal deficit—or at a more disaggregated level, for ...
*
Prediction interval In statistical inference, specifically predictive inference, a prediction interval is an estimate of an interval in which a future observation will fall, with a certain probability, given what has already been observed. Prediction intervals are o ...
*
Probabilistic forecasting Probabilistic forecasting summarizes what is known about, or opinions about, future events. In contrast to single-valued forecasts (such as forecasting that the maximum temperature at a given site on a given day will be 23 degrees Celsius, or that t ...
*
Quantile regression Quantile regression is a type of regression analysis used in statistics and econometrics. Whereas the method of least squares estimates the conditional ''mean'' of the response variable across values of the predictor variables, quantile regressi ...


Implementations

* Matlab code for computing interval forecasts using QRA is available from RePEc: https://ideas.repec.org/c/wuu/hscode/m14003.html


References

Economic forecasting Regression analysis Probability assessment