Backtesting is a term used in modeling to refer to testing a
predictive model on historical data. Backtesting is a type of
retrodiction
Retrodiction is the act of making a prediction about the past. It is also known as postdiction (but this should not be confused with the use of the term in criticisms of parapsychological research).
Activity
The activity of retrodiction (or po ...
, and a special type of
cross-validation applied to previous time period(s).
Financial analysis
In a trading strategy, investment strategy, or risk modeling, backtesting seeks to estimate the performance of a strategy or model if it had been employed during a past period. This requires simulating past conditions with sufficient detail, making one limitation of backtesting the need for detailed historical data. A second limitation is the inability to model strategies that would affect historic prices. Finally, backtesting, like other modeling, is limited by potential
overfitting
mathematical modeling, overfitting is "the production of an analysis that corresponds too closely or exactly to a particular set of data, and may therefore fail to fit to additional data or predict future observations reliably". An overfitt ...
. That is, it is often possible to find a strategy that would have worked well in the past, but will not work well in the future. Despite these limitations, backtesting provides information not available when models and strategies are tested on synthetic data.
Backtesting has historically only been performed by large institutions and professional money managers due to the expense of obtaining and using detailed datasets. However, backtrading is increasingly used on a wider basis, and independent web-based backtesting platforms have emerged. Although the technique is widely used, it is prone to weaknesses.
Basel financial regulations require large financial institutions to backtest certain risk models.
For a
Value at Risk
Value at risk (VaR) is a measure of the risk of loss for investments. It estimates how much a set of investments might lose (with a given probability), given normal market conditions, in a set time period such as a day. VaR is typically used by ...
1-day at 99% backtested 250 days in a row, the test is considered green (0-95%), orange (95-99.99%) or red (99.99-100%) depending on the following table:
For a
Value at Risk
Value at risk (VaR) is a measure of the risk of loss for investments. It estimates how much a set of investments might lose (with a given probability), given normal market conditions, in a set time period such as a day. VaR is typically used by ...
10-day at 99% backtested 250 days in a row, the test is considered green (0-95%), orange (95-99.99%) or red (99.99-100%) depending on the following table:
Hindcast

In
oceanography and
meteorology
Meteorology is a branch of the atmospheric sciences (which include atmospheric chemistry and physics) with a major focus on weather forecasting. The study of meteorology dates back millennia, though significant progress in meteorology did no ...
, ''backtesting'' is also known as ''hindcasting'': a hindcast is a way of testing a
mathematical model
A mathematical model is a description of a system using mathematical concepts and language. The process of developing a mathematical model is termed mathematical modeling. Mathematical models are used in the natural sciences (such as physics, ...
; researchers enter known or closely estimated inputs for past events into the model to see how well the output matches the known results.
Hindcasting usually refers to a
numerical-model integration of a historical period where no observations have been
assimilated. This distinguishes a hindcast run from a
reanalysis. Oceanographic observations of
salinity
Salinity () is the saltiness or amount of salt dissolved in a body of water, called saline water (see also soil salinity). It is usually measured in g/L or g/kg (grams of salt per liter/kilogram of water; the latter is dimensionless and equal ...
and
temperature
Temperature is a physical quantity that expresses quantitatively the perceptions of hotness and coldness. Temperature is measured with a thermometer.
Thermometers are calibrated in various temperature scales that historically have relied on ...
as well as observations of
surface-wave parameters such as the
significant wave height
In physical oceanography, the significant wave height (SWH, HTSGW or ''H''s)
is defined traditionally as the mean ''wave height'' (trough to crest) of the highest third of the waves (''H''1/3). Nowadays it is usually defined as four times the ...
are much scarcer than meteorological observations, making hindcasting more common in oceanography than in meteorology. Also, since surface waves represent a forced system where the wind is the only generating force,
wave hindcasting is often considered adequate for generating a reasonable representation of the wave
climate
Climate is the long-term weather pattern in an area, typically averaged over 30 years. More rigorously, it is the mean and variability of meteorological variables over a time spanning from months to millions of years. Some of the meteorologica ...
with little need for a full reanalysis. Hydrologists use hindcasting for model stream flows.
An example of hindcasting would be entering
climate
Climate is the long-term weather pattern in an area, typically averaged over 30 years. More rigorously, it is the mean and variability of meteorological variables over a time spanning from months to millions of years. Some of the meteorologica ...
forcings (events that force change) into a
climate model
Numerical climate models use quantitative methods to simulate the interactions of the important drivers of climate, including atmosphere, oceans, land surface and ice. They are used for a variety of purposes from study of the dynamics of the ...
. If the hindcast showed reasonably-accurate climate response, the model would be considered successful.
The
ECMWF re-analysis The ECMWF reanalysis project is a meteorological reanalysis project carried out by the European Centre for Medium-Range Weather Forecasts (ECMWF).
The first reanalysis product, ERA-15, generated reanalyses for approximately 15 years, from December ...
is an example of a combined atmospheric reanalysis coupled with a wave-model integration where no wave parameters were assimilated, making the wave part a hindcast run.
See also
*
Applied research (customer foresight)
*
Black box model
*
Climate
Climate is the long-term weather pattern in an area, typically averaged over 30 years. More rigorously, it is the mean and variability of meteorological variables over a time spanning from months to millions of years. Some of the meteorologica ...
*
ECMWF re-analysis The ECMWF reanalysis project is a meteorological reanalysis project carried out by the European Centre for Medium-Range Weather Forecasts (ECMWF).
The first reanalysis product, ERA-15, generated reanalyses for approximately 15 years, from December ...
*
Forecasting
Forecasting is the process of making predictions based on past and present data. Later these can be compared (resolved) against what happens. For example, a company might estimate their revenue in the next year, then compare it against the actual ...
*
NCEP re-analysis
*
Program trading
*
Retrodiction
Retrodiction is the act of making a prediction about the past. It is also known as postdiction (but this should not be confused with the use of the term in criticisms of parapsychological research).
Activity
The activity of retrodiction (or po ...
*
Statistical arbitrage
*
Thought Experiment
A thought experiment is a hypothetical situation in which a hypothesis, theory, or principle is laid out for the purpose of thinking through its consequences.
History
The ancient Greek ''deiknymi'' (), or thought experiment, "was the most anci ...
*
Value at risk
Value at risk (VaR) is a measure of the risk of loss for investments. It estimates how much a set of investments might lose (with a given probability), given normal market conditions, in a set time period such as a day. VaR is typically used by ...
References
{{reflist
Tests
Technical analysis
Mathematical modeling
Numerical climate and weather models
Statistical forecasting