Sunday, July 27, 2014

WAR Baseline: It's All About Playing Time

One of the most confusing concepts for fans trying to learn sabermetrics is the replacement baseline used in the Wins Above Replacement (WAR) processes.  In simple terms, WAR is the wins a player contributed to his team's win total above what you would expect from a replacement level player - a theoretical player who could be acquired for league minimum salary.  An example of a replacement player would be a player in AAA, who is good enough to get some time in the majors, but is not regarded as a top prospect.   

Why is replacement used instead of average or zero?  When building a ball club, comparing players to league average can be problematic.  If a team is forced to replace a player due to an injury, he is not likely to be replaced by an average player or even a slightly below average player.  Average players are actually good players and are not generally available quickly or cheaply.   In most cases, the injured player will be replaced by a player who is substantially below average. 

Comparing players to zero is also not generally a great idea because your replacement is not likely to bat .000 for any length of time.  Your replacement will usually be somewhere between zero and average.  Based on examination of data over several years, analysts determine how good a player typically needs to be to get a decent amount of playing time.  The threshold above which a player must perform in order to get consistent at bats is called replacement level.  Different people use somewhat different replacement levels, but I'll follow the FanGraphs.com definition here.

If you are interested in playing general manager and are concerned about roster construction or how much money a player is worth, the replacement threshold is the way to go.  If you want to do something else such as selecting hall of famers or award winners or you just want to know how many players on your favorite team compare favorable to an average player, you can use an alternative baseline.

If you do decide to shun replacement level for something more intuitive though, you should understand the consequences.  It all comes down to how much credit you want to give for playing time.  Whether you choose Wins Above Average (WAA) or Wins Above Zero (WAO) or WAR can make a substantial difference when there is a lot of variation in playing time among players.

Suppose, Gary Great and Sammy Solid were both second basemen with exactly 600 Plate Appearances (PA).  They were both average base runners and average defenders and played in neutral parks.  The only difference was that Gary was a much better hitter than Sammy.  Gary had a .400 OBP, .540 slugging average and .398 Weighted On-Base Average (wOBA).  Sammy had a .325 OBP, .450 slugging average and .335 wOBA. 

The question is how many wins was Gary worth compared to Sammy? 

We would normally have to do a lot of calculations involving base running, fielding and park effects in order to calculate Wins, but the question is simplified by assuming that the two players were similar in every way except batting.  Based on PA and wOBA, Gary had 40 Batting Runs which means than he contributed an estimated 40 runs above what would be expected from an average player in the same number of plate appearances.  Since 10 runs is worth approximately one win, he was 4 WAA.

Sammy had 10 Batting Runs or 1 WAA.  So, there was a a gap of three wins between the two players.  (Note that we should actually be adding a fraction of a win for playing second base, but they both get the same fraction so we'll ignore it for simplicity.)

What if we use zero as the baseline rather than average?  An average player is worth 68 runs over 600 PA, so Gary was 40 + 68 = 108 runs above zero (also called Runs Created) or 10.8 WA0.  Sammy had 78  Runs Created or 7.8 WA0.  Again, the the two players were separated by three wins.

Finally, a replacement player is worth 20 runs per 600 PA below an average player, so Gary was 40 + 20 Runs Above Replacement or 6 WAR.  Sammy was 30 Runs Above Replacement or 3 WAR. So, one more time there were three wins between the two batters.

Table 1 below clearly shows that there was a very big disparity in the number of wins a each player was credited in WAA, WA0 and WAR, but no difference in the number of wins separating the two players.  This is because they had the same number of plate appearances. 

Table 1: Two Players: Average in Every Way but Hitting
Measure
Gary Great
Sammy Solid
Difference
Plate Appereances
600
600
0
wOBA
.398
.335
.063
Runs Above Average
40
10
30
Wins Above Average
4
1
3
Runs Above Zero
108
78
30
Wins Above Zero
10.8
7.8
3
Runs Above Replacement
60
30
30
Wins Above Replacement
6
3
3

It's another story when players are far apart in their numbers of PA  Suppose Gary had a .398 wOBA in 300 PA while Sammy still had a .335 wOBA in 600 PA.  The results are summarized in Table 2 below.
In that case, Gary had 20 Batting Runs compared to 10 for Sammy.  That comes out to 2 WAA for Gary and 1 WAA for Sammy.  So Gary was one win better by that measure. 

Does this make sense? Is a great hitter who missed half the season worth more wins than an above average hitter in a full season?

Let's see what happens if we change the threshold.  An average player is worth 34 runs in 300 PA, so Gary was 20 + 34 = 54 Runs Above Zero.  Sammy was still 78 Runs Above Zero.  In terms of wins, Gary was 5.4 WA0 and Sammy 7.8 WA0.  In this case, Sammy was 2.4 Wins better than Gary.

Finally, if replacement is the baseline, Gary was 20 + 10 = 30 Runs Above Replacement or 3 WAR while Sammy was 10 + 20 = 30 Runs Above Replacement or 3 WAR.  So, they were considered equal contributors to wins by this metric. 

Table 2: Two Players: Average in Every Way but Hitting, Playing Time
Measure
Gary Great
Sammy Solid
Difference
Plate Appearances
300
600
-300
wOBA
.398
.335
.063
Runs Above Average
20
10
10
Wins Above Average
2
1
1
Runs Above Zero
54
78
-24
Wins Above Zero
5.4
7.8
-2.4
Runs Above Replacement
30
30
0
Wins Above Replacement
3
3
0

The lesson learned is that the baseline you choose can make a large difference in your evaluation of players.  In the first case, Gary was the better player.  In the second instance, Sammy was the better player by a substantial margin.  In the third situation, they were equals. You don't have to use replacement level if you don't want, but it's important to be aware how much the results vary among baselines.

3 comments:

  1. AnonymousJuly 27, 2014

    Yeah baseball sure is a fascinating game, in a lot of ways it's just a chess match battle between different perspectives. And that can be said for all elements of the game from how a player approaches hitting or pitching, or how the coaches go about mentoring and training, or how a GM works through the sea of code to try and figure out the best way to assemble a team.

    But also as far as the HOF evaluation perspective game goes, now it changes once again with the rules being slightly adjusted and a new 10 year limit. Although Alan Trammel is one of 3 players grandfathered in to the original 15 year eligibility period along with Lee Smith and Don Mattingly. It will be interesting to see how their vote totals add up next time with the new wrinkle in the process.

    ReplyDelete
  2. Berdj J. RassamAugust 06, 2014

    It's an interesting stat, but most managers, coaches, and front-office personnel make most of their personnel decisions without relying in any meaningful way on this stat.

    ReplyDelete
  3. Berdj, Thanks for reading. I don't agree that front office people don't use a WAR process in their decisions. Their process might not be the same one that FanGraphs or Baseball-Reference uses, but they surely have a way to estimate the overall value of players. And if some teams don't do that, that is their problem, not mine. I don't write for teams. I write for fans who are trying to get a better understanding of all the new stats out there.

    ReplyDelete

Twitter

Blog Archive

Subscribe

My Sabermetrics Book

My Sabermetrics Book
One of Baseball America's top ten books of 2010

Other Sabermetrics Books

Stat Counter