Does Win Shares overvalue relievers?
November 26, 2003
I’ve read a lot of opinions about Win Shares and pitchers. Win Shares overvalues relievers, undervalues starters, undervalues pitching in general, etc. etc. So I thought it would make sense to start to examine this a little more closely.
So let’s look at two teams that had plenty of good starters and relievers: the Oakland A’s and Los Angeles Dodgers.
Here are the vitals for key members of the Oakland A's staff:
| Player |
Wins |
Losses |
ERA |
G |
Saves |
IP |
Holds |
PCL-1 |
Win Shares |
|
T Hudson |
16 |
7 |
2.70 |
34 |
0 |
240 |
0 |
88.5 |
23.2 |
|
K Foulke |
9 |
1 |
2.08 |
72 |
43 |
87 |
0 |
39.6 |
21.4 |
|
B Zito |
14 |
12 |
3.30 |
35 |
0 |
232 |
0 |
69.2 |
17.7 |
|
M Mulder |
15 |
9 |
3.13 |
26 |
0 |
187 |
0 |
64.0 |
16.9 |
|
C Bradford |
7 |
4 |
3.04 |
72 |
2 |
77 |
23 |
26.4 |
9.0 |
And here's the same data for the top four pitchers from the Dodgers:
| Player |
W |
L |
ERA |
SV |
IP |
HLD |
PCL-1 |
Win Shares |
| E Gagne |
2 |
3 |
1.20 |
55 |
82 |
0 |
40.0 |
25.0 |
|
K Brown |
14 |
9 |
2.39 |
0 |
211 |
0 |
70.4 |
19.7 |
|
H Nomo |
16 |
13 |
3.09 |
0 |
218 |
0 |
57.9 |
16.9 |
|
G Mota |
6 |
3 |
1.97 |
1 |
105 |
13 |
42.6 |
14.0 |
PCL-1 is Win Shares code for Marginal Runs Not Allowed. It's just like Marginal Runs for batters, and it's based on the number of runs allowed below 152% of the league average, times innings pitched, for each pitcher. It's similar to Lee Sinins' "Runs Saved Above Average" or Thorn and Palmer's "Pitching Runs."
Now, with batters, there is a direct relationship between Marginal Runs and Win Shares, as we've mentioned. Things aren't that easy with pitchers. As you can see, something happens between PCL-1 and Win Shares. For instance, Eric Gagne leaps from the bottom of the list in Marginal Runs Not Allowed to the top of pitching Win Shares. Let's discuss.
Win Shares calls for three additional calculations before finalizing pitching Win Shares. They are:
1. Points are added for wins and saves, and subtracted for losses. The specific formula is: (Wins times three minus losses plus saves)/3.
2. Points are also added if a pitcher pitched in "high leverage" situations -- that is, innings in which the game was close and late. Win Shares uses saves and holds to estimate the number of leveraged innings the pitcher pitched.
3. The Marginal Runs Not Allowed for the leveraged innings are based on "component ERA" instead of the recorded ERA. Component ERA is estimated the same way Runs Created are calculated, but for pitchers instead of batters. This step is needed, because relief pitchers' ERAs are often at the mercy of specific situations and subsequent pitchers.
Steps two and three are kind of complicated. so I'm just going to focus on the first step in this article, starting with the results.
For the A's, Keith Foulke received 23 additonal points from step one, thanks to his 43 saves and 9-1 record. Tim Hudson received 14 additional points, and the other Oakland pitchers received less than ten.
On the Dodgers, Gagne received nineteen additional points. He had the second-most saves in history, but lost three games and only won two. Nomo received twelve points and Brown eleven.
Looking around the league, I found the following:
- Halladay received 20 points from this step, the most of any starter, thanks to his 22-7 record.
- Russ Ortiz received 19, while Smoltz got 14.
- Moyer received 19 points; Loaiza: 18
- On the Yankees: Pettite: 18, Rivera 17, Mussina and Clemens: 14
- On the Devil Rays: Lance Carter got 19 points, Zambrano got 9
- Prior got 16 points and Borowski got 12
- Mantei: 13 and Webb: 8
- Billy Wagner received 14 points and Robertson 12 (!)
- Schmidt and Worrell both received 15 points on the Giants
- And Wolf received 13 points to Jose Mesa's 11 points.
How should we judge this? Gagne got as many points as every starter except Halladay. On the other hand, a starter would have to go 23-0 to receive as many points as Foulke did.
Here's another way look at it: Foulke's points increased 58% as a result of this step, while Hudson's increased 16%. Gagne's points increased 48%; Brown's increased 16% and Nomo's increased 20%. By the way, Mota's total increased 12% and Bradford's increased 24%. And for the best record in the majors, Halladay's increased 20%.
So on a proportional basis, this step helped the closers tremendously. Foulke's Win Shares rose from 12 to 15 as a result of this step, while Hudson's declined from 26 to 25. Gagne's rose from 13 to 17, while Brown's and Nomo's declined by one. Here's one final closer example: Jose Mesa's Win Shares climbed from -1 to +1 due to this step.
The irony is that James added this step to give a bit of credit to pitchers with outstanding won/loss records (let's call it "the Joe Morgan factor"). He added the saves step to allow relievers to "tread water while the won-lost records are considered." Unfortunately, it does more than that, because of the proportional impact this step has for relievers vs. starters.
I can't imagine what James went through when he created this step, because the role of the reliever has changed so dramatically throughout baseball history. It must have been incredibly complicated. He may well have come up with the formula that works best over the past century and a half. But it doesn't seem to work well in today's environment.
It's important to remember that relievers get credit for their important role in the second and third steps noted above. So this step seems to result in a bit of "piling on" credit for relievers.
I can think of two ways to resolve this. One is to rework the calculation so that saves get less emphasis (I'm thinking one-ninth the emphasis of wins), and that the impact of wins and losses are somehow mitigated by games started. I don't think Keith Foulke deserves the same credit as a starter for his nine wins.
The other way is to just drop the step altogether. Win, losses, saves and holds are just a reflection of situation, run support and baseball "accounting." So why adjust pitching Win Shares for them at all? This would be the anti-Joe Morgan approach.
What do people think?
First of all, a small correction:
<i>PCL-1 is Win Shares code for Marginal Runs Not Allowed. It’s just like Marginal Runs for batters, and it’s based on the number of runs allowed below 152% of the league average, times innings pitched, for each pitcher. It’s similar to Lee Sinins’ “Runs Saved Above Average” or Thorn and Palmer’s “Pitching Runs."</i>
It’s not at 152%; it’s 152% less an amount derived from team defensive quality. Small quibble, but one that helps good pitchers.
Anyhow, PCL-2 is one of those things I never did get exactly why it was implemented in the way it was. Were you to want this, you’d want Wins Above Team. Why? Adding it overall does not increase the total claim points available to the team. I alluded to this in the above paragraph, but anything that increases the total claim points available hurts the team’s best pitchers (aside from just saying Pedro gets 30 extra claim points just because he’s Pedro, or Rocket, or Koufax, or Lefty).
My personal Pitcher Win Shares reform, untested, goes something like this:
PCL-1—Runs Saved (current PCL-1)
PCL-2—Wins Above Team
PCL-3—Reliever ERC Runs Saved (current PCL-3)
PCL-4—Errors better than league average, divided by 4 (we put errors entirely to the fielders, but pitchers make errors too, and the unearned runs adjustments don’t entirely make up for this)
PCL-5—Strikeouts better than team, divided by 60 (accounts for the shifting threshold which would result were we to extrapolate the pitcher’s own stats to the team rates, then figure the threshold with them).
Making as many of these better or worse than average or team keeps the main two claim point sources, PCL-1 and PCL-3, and the others color.
Posted by on 11/26 at 12:24 PM
Thanks, Charlie. I keep trying to allow HTML code in the comments, but haven’t figured it out yet. You’re right about the PCL-1 calculations. I never know how much detail to cover in these articles.
When you say “Win Above Team”, I take it you mean winning % above team %, extrapolated to number of decisions? Great idea about strikeouts, by the way. I was going in a different direction to address the same issue. And I hadn’t thought about pitcher errors. Should we use specific error counts for the pitcher himself, or were you thinking team errors?
Posted by
studes on 11/26 at 12:54 PM
Pitcher errors. Basically an adjusted fielding percentage. Those unearned runs coming from your own errors aren’t just half yours, but all yours, plus it’s an individual tweaking of something that was divided up on a team rate. What might be right for you may not be right for some ...
And yes, wins better than team winning percentage, extrapolated for decisions. Personally, I’d prefer a net quality starts, but we don’t have those throughout history. What Bill wanted, I think, was some sort of quantification of pitching randomness, ie, three starts of 7 IP - 3 R isn’t as good in terms of team wins as going 9-1, 7-3, 5-5 (though I’m not really sure that’s true).
I have done a bit of work on the strikeout issue, actually, using theoretical teams. I originally went for all three of the pitcher-only stats, but I found that Bill implemented his thresholds (the reduction for team fielding) in a way that he was taking these into account. This isn’t a huge adjustment, BTW. Basically a extra Win Share a season for the best pitchers, maybe a bit less. I’m all in favor of giving the best pitchers more Win Shares, since the system probably undervalues them to a degree.
One other thing I have done, when working on the fielding formulae I e-mailed you, was relax the thresholds for pitching and defense. I’d eliminate the minimum/maximum Win Shares entirely, and set the percentages based on a deviation from the team percentage using team strikeout rate only. Anyhow, this leads to eliminating the rule (almost entirely for bad teams, FWIW) that the team pitcher threshold must be at least 1 run above the actual team runs allowed rate. This lets bad pitchers on bad teams pick up Win Shares they don’t deserve. One could drop it entirely, or set the minimum to be the team runs allowed rate. It helps good pitchers on bad teams.
Posted by on 11/26 at 01:30 PM
Bingo! I’ve been finding the exact same things. I was surprised to find that there wasn’t more impact from assigning pitcher-only events to specific pitchers, but that was the result. Doesn’t mean it shouldn’t be done, however.
Also, I found the exact same thing with the artificial restraints on pitching/fielding breakouts. It really came to light with this year’s Tigers. Their fielders only appeared to be a little worse than average, but those pitchers stunk!
Posted by
studes on 11/26 at 01:49 PM
studes: Doesn’t mean it shouldn’t be done, however.
Actually, I found for other than strikeouts, it shouldn’t. Pitchers who allow fewer home runs and walks give up fewer runs anyways. I had to go through everything about three times before I was convinced that the adjustment as already applied to the pitcher threshold as a team was adequate. For strikeouts it doesn’t apply, since technically a strikeout is only a smidgen better than a groundout or a flyout, but responsibility should go all to the pitcher.
I think the one thing that struck me was that, working through all this stuff, was that Bill should have had a little more confidence. Many things on this side of the equation seem arbitrary, but are right and they work. My own revised fielding formulae gave anywhere from about 18 to about 60 Win Shares to team defense from 1974-1982 NL, and in general they matched up pretty darn good with Bill’s formulae. The exceptions were those where I took something else into account, like (especially) catchers’ throwing.
And, as for dramatic contrast, check out some of those mid-1970s Padres teams. The fielders weren’t good, but could take half of the team’s defensive Win Shares because the pitchers really sucked.
Posted by on 11/26 at 03:54 PM
I dont think there’s any doubt that it overrates relievers. How can Gagne with 82 innings pitched be so much more valuable than Brown with 211? I support the anti-Joe Morgan approach, and so should all serious sabermetricians.
Posted by on 11/29 at 03:34 PM
Situational Credit: The win-save-hold credit always seemed like the most arbitrary part of the win-shares calculation and always has bothered me a little. It gives pitchers ‘situational credit’ which opens up a can of worms. If you give pitchers credit for how they perform in certain game situations, shouldn’t hitters, and fielders get situational credit, too? I guess there aren’t any convenient hitting stats for this (please don’t bring back the GWRBI). It is common sense that each run is not equal in terms of increasing or decreasing your teams probability of winning a particular game, but Win-Shares treats them all equally. That’s okay because we believe it all evens out over 162 games and if it doesn’t it’s just luck anyway. Win-Shares is still the greatest, but it’s the greatest because of its top-down approach. Adding credit for wins, saves and holds seems like a bottom up approach and that’s why it bothers me.
Posted by on 12/06 at 04:44 PM
Thanks for your comments, Rob. Actually, Win Shares does credit hitters for hitting in the clutch. Win Shares uses the most recent version of Runs Created, in which James added credit for two things:
- BA with RISP vs. the batter’s general BA
- Percent of Home runs hit with men on base
That’s one of the reasons you sometimes see funky outcomes with batting win shares. For instance, Manny Ramirez and Carlos Delgado just about tied for total Win Shares, according to baseballreference.com.
However, using the “clutch hitting” version of Runs Created, as Win Shares does, Delgado comes out about 14 runs ahead.
Posted by
studes on 12/06 at 06:46 PM
Right, but that clutch credit is still top-down. It ignores the score of the individual games and looks at the data as a whole. Wins, Holds and Saves are dependent on the score and inning of each particular game, which is what I think of as bottom-up. I don’t think either approach is wrong it just bothers my pedantic nature that it’s mixed.
Posted by on 12/09 at 01:00 PM
Page 1 of 1 pages