Baseball Graphs is dedicated to the better use and communication of baseball statistics. Below, you'll find excerpts from, and links to, some of the best baseball writers on the Internet. Follow the links above to read my own intermittent attempts at wisdom (the Baseball Graphs blog), and the heart of this site, historical graphs of every season dating back to 1900.
There are also two special sections you might want to check out. One is the graphical review of the 2003 season, which informed our work at The Hardball Times. The other is the Batted Balls Library, which includes a unique look at batters and pitchers from 2002 through 2005.
I was on vacation in Massachusetts the last two weeks. Enjoyed it very much, thanks. While browsing books at the Harvard Coop bookstore, I saw The Complete Idiot’s Guide to Statistics and decided to buy a copy. Yes, I browse the mathematics section at bookstores.
I talk about statistics a lot on this blog, but I last took a statistics class over twenty years ago. I’m pretty sure that I’ve forgotten everything I learned over twenty years ago, so I decided to buy the book to make sure I know what I’m talking about here. I actually enjoyed reading the book and I’d recommend it for those who’d like to remember what they’ve forgotten from their old stats class.
And I realized that much of the book, particularly the part called Inferential Statistics, is exactly what baseball analysts are doing when they try to project player performances.
There was recently a five-part Projection Roundtable at the Hardball Times that focused on the current state of the art. I don’t know about you, but much of that discussion was over my head; I haven’t spent a lot of time thinking about projections because I find the current state of baseball so fascinating.
But player projections are the most important task facing ballclubs, so I might start paying a bit more attention to the subject. Along those lines, let me present the following, very simple, Player Projection Framework. I’ll call it the Complete Idiot’s Guide to Player Projections.
Let’s say you want to know how many stars there are in the sky. The problem is that you can’t count them all at once; you can only look at one small portion of the sky at a time, and it would take an eternity to take in the entire sky. So you can never truly know how many stars there really are in the sky.
It’s the same thing with a baseball player. A baseball player has what Tangotiger calls a “true talent” level. When you look at a part of the sky, you’re only counting the stars in a sample of the total sky. With a ballplayer, when you look at a season of 600 plate appearances, you’re only looking at a sample of his true talent level. In both cases, the absolute truth can’t be directly measured.
This is a pretty common thing in statistics. Statisticians are always talking about samples, sample distributions and sampling distribution of the mean. There’s also this really important concept called the Central Limit Theorem that says that the larger the sample size, the more the sample results will follow a normal probability distribution. Which means you can consider the results of a player’s seasons to be normally distributed. See? I did read the book.
Anyway, the basic process, for both baseball and the sky, is to estimate the larger population (true talent level or total stars in the sky) based on the samples you have, and then estimate the likely outcome (and potential range of outcomes) for the next “sample” (or, piece of the sky or season). And that’s the overview of the Complete Idiot’s Guide to Player Projections.
Here are some specific steps:
I’m sure one of those fancy-pants sabermetricians will come along and correct me, but I think this is a pretty good framework for how to project player performances. Some of the keys are how well you correct any bias in the original stats, your regression method, the population to which you regress, whether you do this for components or for overall players and how you estimate ongoing changes to the player’s true talent level. At this stage, a breakthrough in any of those areas (not to mention the injury risk) would pretty much guarantee you a seat at the next Projection Roundtable.