Friday, January 27, 2012

Hall of Fame post revisited

A couple of weeks ago, I wrote a post about the Hall of Fame, specifically the candidacies of the prominent first basemen on the current ballot: Jeff Bagwell, Rafael Palmeiro, and Mark McGwire. While I generally felt good about the final result - definitive yes to Bagwell, debatable yes to Palmeiro, debatable no to McGwire - one of the measurement methods I used sort of bugged me, so I wanted to take the chance to re-examine and update my results.

Initially, I measured the number of seasons each of the players had with a WAR over 10, 8, 5 and 3. I used these values because they seemed like good cutoff points. Why, though? I'm not sure, really, and as I re-read my post, they just seemed too... arbitrary. Of course, the whole process is a bit arbitrary. Drawing the line between what makes a Hall of Famer and what doesn't is ENTIRELY the preference of the person making that choice. If you're a "small Hall of Fame" voter who believes that the Hall should only be for the Ruth/Mays/Aaron/Williams types, and you vote that way consistently, that's fine. As long as someone puts thought and effort into their methodology and is consistent with their standards, that's all I can ask for.

However, "consistent" shouldn't mean "unchanging" in this case. As new information becomes available, of course it should be considered. The example I've used a few times this year is Mike Fast's work at Baseball Prospectus on catcher defense. If we discover new information, we should use it. By the same token, if a methodology is developed for rating players and a flaw is found in it, it's ok - in fact, it's necessary - to change it.

So I went back through my study and realized these WAR season cutoffs were just TOO arbitrary. Why was I drawing the lines where I was? No reason - they just seemed like good places to draw lines. But with only one first baseman's season in our study reaching 10+ WAR, is that line really in a useful place? So I decided to put every season of our 25 players back into a spreadsheet, and instead of using WARs of 10, 8, 5 and 3 as dividers, I'd use percentiles - the 99th, 90th, 75th and 50th.

This isn't perfect. It includes seasons at the beginning and end of careers, when players weren't regulars or were on their way out of the game. Still, it is BETTER. "Meliora" - that's what we're shooting for, right, Yellow Jackets? If we can do better, then we should, even if better still isn't perfect.

Once that's done, we're left with 396 seasons, sorted by WAR. The 99th percentile is 8.8, the 90th is 5.9, the 75th is 4.1 and the 50th is 2.0. These are still technically arbitrary cutoffs, of course - they're just drawn where they are for a reason.
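For anyone who would rather script this than use a spreadsheet, here's a minimal sketch of pulling percentile cutoffs out of a list of season WAR values. The nearest-rank definition and the sample numbers are assumptions for illustration only; they are not the 396 seasons from this study, and spreadsheet percentile functions generally interpolate between values, so results can differ slightly.

```python
def percentile_cutoff(values, pct):
    """Nearest-rank percentile: the smallest value with at least pct% of the data at or below it."""
    ordered = sorted(values)
    # Integer ceiling of pct * n / 100, which avoids floating-point rounding surprises.
    rank = max(1, (pct * len(ordered) + 99) // 100)
    return ordered[rank - 1]


# Hypothetical season WAR values (made up for this example).
sample_war = [-0.5, 0.3, 1.1, 1.8, 2.0, 2.6, 3.4, 4.1, 5.2, 5.9, 6.7, 8.8]

cutoffs = {pct: percentile_cutoff(sample_war, pct) for pct in (99, 90, 75, 50)}
print(cutoffs)
```

With only a dozen made-up seasons the cutoffs land higher than the real ones, but the mechanics are the same: sort every season, then read off the value at each percentile.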

Seasons at or above each percentile cutoff, by player:

Player              99th  90th  75th  50th Total
Thomas, Frank          0     6     9    15    30
Bagwell, Jeff          1     5    11    13    30
Thome, Jim             0     5     9    13    27
Palmeiro, Rafael       0     3     8    14    25
Helton, Todd           1     5     7    12    25
McGwire, Mark          0     3     9    11    23
Clark, Will            1     2     6    12    21
Giambi, Jason          1     4     6     8    19
McGriff, Fred          0     2     6    10    18
Olerud, John           0     2     6    10    18
Grace, Mark            0     0     5    12    17
Delgado, Carlos        0     2     4    10    16
Lee, Derrek            0     0     3     7    10
Joyner, Wally          0     0     1     9    10
Vaughn, Mo             0     0     3     6     9
Galarraga, Andres      0     0     2     6     8
Konerko, Paul          0     0     1     5     6
Martinez, Tino         0     0     2     4     6
Clark, Tony            0     0     0     4     4
Segui, David           0     0     0     4     4
Casey, Sean            0     0     0     3     3
Karros, Eric           0     0     0     3     3
Morris, Hal            0     0     0     3     3
Young, Kevin           0     0     1     2     3
Snow, JT               0     0     0     2     2
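The counting behind a row of this table can be sketched as follows. The cutoffs are the ones quoted above; the career WAR list is hypothetical, and treating a season as counting "at or above" every cutoff it clears (so one great season shows up in several columns) is my reading of the numbers, not something the spreadsheet is confirmed to do.

```python
# WAR cutoffs quoted in the post for the 99th, 90th, 75th and 50th percentiles.
CUTOFFS = {"99th": 8.8, "90th": 5.9, "75th": 4.1, "50th": 2.0}


def count_seasons(season_wars, cutoffs=CUTOFFS):
    """Count seasons at or above each cutoff, then add the four counts into a total."""
    counts = {
        label: sum(1 for war in season_wars if war >= cut)
        for label, cut in cutoffs.items()
    }
    # The total is computed from the four counts before the "Total" key is added.
    counts["Total"] = sum(counts.values())
    return counts


# Hypothetical career, one WAR value per season (not a real player).
example_career = [1.2, 3.5, 6.1, 9.0, 7.4, 4.3, 2.2, 0.4]

result = count_seasons(example_career)
print(result)
```

Because the counts stack, a single 9.0 season adds four to the total while a 2.2 season adds one, which is how this measure rewards peak seasons without ignoring solid ones.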

Compare this to the chart in the original post: Frank Thomas, who had no 8+ WAR seasons, has six in the 90th percentile or better, and he now gets credit for them. To me, this is just a more sensible measurement.

This changes the final results so slightly that my conclusions stay the same. Thomas, Bagwell and Thome are still the definite choices, arguments can be made for Palmeiro and Helton, and McGwire and Giambi are, again, too far off:

Player                   H      HR      TB     TOB     WAR 10-peak  3-peak  5 Best Seasons   TOTAL
Thomas, Frank            3       4       3       2       2       2       4       4     1.5    31.5
Bagwell, Jeff            7       7       5       4       1       1       2       1     1.5    32.5
Thome, Jim               8       1       2       3       3       3       6       5       3      43
Palmeiro, Rafael         1       3       1       1       4       6      10       9     4.5    51.5
Helton, Todd             5      11       7       6       6       4       3       3     4.5    67.5
McGwire, Mark           18       2      10      13       5       7     7.5       7       6    90.5
Giambi, Jason           15       8       9       9       9       5       1       2       8      93
McGriff, Fred            2       5       4       5      10      11       9      10     9.5    95.5
Clark, Will             10    15.5      13      11       7       9       5       8       7   106.5
Olerud, John             9      17      14       7       8       8     7.5       6     9.5     110
Delgado, Carlos         12       6       8      10      12      10      11      11      12     128
Grace, Mark              4      21      12       8      11      12      13      12      11     137
Galarraga, Andres        6       9       6      12      15      18      15      14      16     156
Lee, Derrek             14      13      15      16      14      13      14      15    13.5   169.5
Joyner, Wally           11      19      17      14      13      15      18      16    13.5   175.5
Konerko, Paul           13      10      11      15      16      17      17      18    17.5   182.5
Vaughn, Mo              19      14      19      18      17      14      12      13      15     192
Martinez, Tino          16      12      16      17      18      16      16      17    17.5   199.5
Casey, Sean             20      24      21      21      19      19      21      19    22.5   243.5
Clark, Tony             24      18      22      23      20      21      20      20    19.5   247.5
Karros, Eric            17    15.5      18      20      23    22.5      22      21    22.5   250.5
Snow, JT                21      20      20      19      22      20      24      22      25     259
Morris, Hal             23      25      24      24      21      24      25      23    22.5   274.5
Segui, David            22      23      23      22      24    22.5      23      24    19.5     275
Young, Kevin            25      22      25      25      25      25      19      25    22.5   288.5
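The half-point values in the table come from ties: the columns appear to be ranks among the 25 players, with tied players sharing the average of the positions they occupy. Here's a rough sketch of that ranking, checked against the Seasons column using season totals from the percentile table earlier in this post. The second function rests on an inference of mine: the TOTAL values are numerically consistent with adding up the nine ranks and counting the WAR rank four times, but that weighting is read off the numbers, not stated here. Both function names are made up for this example.

```python
def average_ranks(scores):
    """Rank keys by score, highest first; tied keys share the mean of their positions."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    ranks = {}
    start = 0
    while start < len(ordered):
        end = start
        # Extend the group while the next player has the same score.
        while end + 1 < len(ordered) and scores[ordered[end + 1]] == scores[ordered[start]]:
            end += 1
        shared = (start + 1 + end + 1) / 2  # mean of the 1-based positions in the tie
        for name in ordered[start:end + 1]:
            ranks[name] = shared
        start = end + 1
    return ranks


def weighted_total(row, war_weight=4):
    """Sum a player's ranks, counting the WAR rank war_weight times (inferred weighting)."""
    return sum(row.values()) + (war_weight - 1) * row["WAR"]


# Season totals for the top six players, copied from the percentile table in this post.
season_totals = {"Thomas": 30, "Bagwell": 30, "Thome": 27,
                 "Palmeiro": 25, "Helton": 25, "McGwire": 23}
season_ranks = average_ranks(season_totals)

# Frank Thomas's row of ranks, copied from the table above.
thomas = {"H": 3, "HR": 4, "TB": 3, "TOB": 2, "WAR": 2,
          "10-peak": 2, "3-peak": 4, "5 Best": 4, "Seasons": 1.5}
thomas_total = weighted_total(thomas)

print(season_ranks)
print(thomas_total)
```

Ranking only the top six gives the same values as ranking all 25, since everyone left out has a lower season total. Lower is better throughout, so the smallest TOTAL marks the strongest overall case.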

Who knows - I may change this even more as I move on. But for now, I feel like it's an improvement over my previous results.

