Extrapolated Runs
   HOME

TheInfoList



OR:

Extrapolated Runs (XR) is a
baseball statistic Baseball statistics play an important role in evaluating the progress of a player or team. Since the flow of a baseball game has natural breaks to it, and normally players act individually rather than performing in clusters, the sport lends itsel ...
invented by
sabermetrician Sabermetrics, or originally SABRmetrics, is the empirical analysis of baseball, especially baseball statistics that measure in-game activity. Sabermetricians collect and summarize the relevant data from this in-game activity to answer specific ques ...
Jim Furtado Baseball Think Factory, abbreviated as BTF or BBTF, is a sabermetrically-oriented baseball web site that features daily news stories in baseball, with original content contributed by SABR members such as Dan Szymborski. The site was previously br ...
to estimate the number of runs a hitter contributes to his team. XR measures essentially the same thing as
Bill James George William James (born October 5, 1949) is an American baseball writer, historian, and statistician whose work has been widely influential. Since 1977, James has written more than two dozen books devoted to baseball history and statistics. ...
'
Runs Created Runs created (RC) is a baseball statistic invented by Bill James to estimate the number of runs a hitter contributes to their team. Purpose James explains in his book, ''The Bill James Historical Baseball Abstract'', why he believes runs created is ...
, but it is a
linear weights In statistics, linear regression is a linear approach for modelling the relationship between a scalar response and one or more explanatory variables (also known as dependent and independent variables). The case of one explanatory variable is call ...
formula that assigns a run value to each event, rather than a multiplicative formula like James' creation.


Purpose and formulae

According to Furtado, Extrapolated Runs was inspired by Paul Johnson's Estimated Runs Produced (ERP) formula, which was published in James' 1985 ''Baseball Abstract''. Furtado found that Johnson's method, when written a different way, was essentially a linear weights formula (something James apparently did not recognize at the time, given his very public disdain for linear run estimators). ERP was almost as accurate as RC at measuring team runs, it did not succumb to RC's infamous
problems A problem is a difficulty which may be resolved by problem solving. Problem(s) or The Problem may also refer to: People * Problem (rapper), (born 1985) American rapper Books * Problems (Aristotle), ''Problems'' (Aristotle), an Aristotelian (or ps ...
at the individual level, and its values stacked up well when compared to
Pete Palmer Pete Palmer (born January 30, 1938) is an American sports statistician and encyclopedia editor. He is a major contributor to the applied mathematical field referred to as sabermetrics. Along with the Bill James ''Baseball Abstracts'', Palmer's ...
's linear weights formula, even though the two methods were developed in entirely different ways. For these reasons, Furtado believed that linear estimators had more promise than was originally thought, and he set out to develop his own. After much trial and error (some of which involved borrowing concepts and weights from other linear formulas), Furtado eventually found a set of weights that best fit his sample (every
Major League Baseball Major League Baseball (MLB) is a professional baseball organization and the oldest major professional sports league in the world. MLB is composed of 30 total teams, divided equally between the National League (NL) and the American League (AL), ...
season from 1955 to 1997). He unveiled the formula in the ''1999 Big Bad Baseball Annual'': "Extrapolated Runs was developed for use with seasons from 1955 to the present. I came up with three versions of the formula. The three formulas are: * XR – Extrapolated Runs = (.50 × 1B) + (.72 × 2B) + (1.04 × 3B) + (1.44 × HR) + (.34 × (HP+TBB−IBB)) + (.25 × IBB)+ (.18 × SB) + (−.32 × CS) + (−.090 × (AB − H − K)) + (−.098 x K)+ (−.37 × GIDP) + (.37 x SF) + (.04 × SH) * XRR – Extrapolated Runs Reduced = (.50 × 1B) + (.72 × 2B) + (1.04 × 3B) + (1.44 × HR) + (.33 × (HP+TBB)) + (.18 × SB) + (−.32 × CS) + (−.098 × (AB − H)) * XRB – Extrapolated Runs Basic = (.50 × 1B) + (.72 × 2B) + (1.04 × 3B) + (1.44 × HR) + (.34 × (TBB)) + (.18 × SB) + (−.32 × CS) + (−.096 × (AB − H)) "As you can see, calculating XR requires only addition and multiplication. Its simplicity of design is one of its greatest attributes. Unlike a lot of the other methods, you don't need to know team totals, actual runs, league figures or anything else. You just plug the stats into the formula and you are all set. "Another of XR attributes is that the formula is pretty much context neutral. Other than park effects, the only remaining residue of context is due to the inclusion of IBB, GIDP and SF. Although I could have removed them from the full version, I felt that the inclusion of these terms was important since my research showed there was a strong correlation between the IBB, SF and GIDP opportunities that players face. I also felt, like Bill James, that these statistics do tell us something valuable about players. Of course, I knew some people might not agree with me. For them, I created two other versions. "XR also accounts for just about every out. James correctly understands that the more outs an individual player consumes the less valuable his positive contributions are. Since XR will be used as the base of the Extrapolated Win method, I thought it was important to include as many outs as possible in the formula. "Another nice thing about XR is that if you add up all the players' Extrapolated Runs, you'll have the team totals. That's a benefit of using a linear equation."


Pros and cons of extrapolated runs

Along with Palmer's Linear Weights, XR is the most accurate of the linear run estimators, in terms of predicting team runs scored. And unlike James' RC, it doesn't artificially inflate the runs produced by individual players who combine high OBPs and SLGs. It is also much easier to calculate than Base Runs. However, like any linear formula, there is no guarantee that it will work outside of the context in which it was developed (in this case, seasons from 1955 to 1997).


See also

*
Sabermetrics Sabermetrics, or originally SABRmetrics, is the empirical analysis of baseball, especially baseball statistics that measure in-game activity. Sabermetricians collect and summarize the relevant data from this in-game activity to answer specific ques ...
*
Runs Created Runs created (RC) is a baseball statistic invented by Bill James to estimate the number of runs a hitter contributes to their team. Purpose James explains in his book, ''The Bill James Historical Baseball Abstract'', why he believes runs created is ...
* Base Runs


External links


Introducing XR
{{Baseball statistics Baseball statistics