Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Login-facebook
Around SBN: Which Players Will Join The 3,000-Hit Club?

BABIP: The Pokes, The Quails, and The Inexplicable .410 Batting Average through April

Everybody knows a hit is a hit. Hits advance baserunners, they drive in runs, they kick up grass and dirt and miss gloves.

Now, batting average (AVG) is the traditional measure of a baseball player's skill. I personally don't subscribe to this method of evaluation, but for the sake of this article, we're only going to deal with batting average and therefore hits. Not worrying about walks, not worrying about homers, not worrying about strikeouts.

Building off of that last sentence, we should define a key concept: Three True Outcomes. The Three True Outcomes are Strikeout, Walk, and Home Run. The reason these are considered "true" over any other possible outcome is the fact that all 3 are strictly Pitcher-Batter interactions.

If Tim Lincecum punches out Adam Dunn, it had absolutely nothing to do with how Fred Lewis positioned himself.

If Oliver Perez walks Adam Dunn however, does it matter that Luis Castillo wasn't properly aligned for the shift?

Finally, if Adam Dunn takes Ted Lilly yard, will it matter if Aramis Ramirez was perfectly poised to turn a 5-4-3?

Clearly not.

These are the fundamentals that such metrics as FIP are built around, just the things that pitchers can control.

But obviously, not everything in baseball is a K, BB, or HR. This is where BABIP comes into play.

Join me after the jump and we'll discuss why.

Star-divide

BABIP stands for Batting Average on Balls in Play. It only takes hits that aren't home runs, and batted outs that aren't strikeouts. The exact formula is as follows:

Ed893b7b0405147c8498db6096fce17e_medium

via upload.wikimedia.org

Well this is all good and fine, but what does it mean? How is it used?

Well for starters, BABIP tends to normalize to around .300, league-wide every season. It'll certainly vary a bit, but .300 is a good benchmark. Keep this in mind.

Now think about hits. While they all count, we all know that some hits are more valuable than others. A line drive single to CF is probably a more "legit" hit than a squirter that rolls between 2B and 3B and just past 2 diving gloves. Reason why? Well, a line drive single has the height to get past the infield, and drops well in front of the CF, and basically eludes all the gloves in a sense of "yeah nobody's going to get to that". However, that squirter, if you perhaps had a 3B or a SS with better range, that could be a 5-3 or 6-3.

Maybe that 2nd hit is a seeing-eye grounder. Maybe it's a dying quail. Maybe it's just a lucky shot that jussssst gets past the outstretched glove. Maybe it's because the opposing defense sucks. Like I said, it still counts, but how much of that do you credit to the batter and how much do you fault the fielders, and how much is just statistical randomness?

It's the same kind of thing you can apply to a lineout to CF and a 2B8. The balls were hit the same, but one found glove, one found grass. There's an element of randomness that has to be considered on batted balls. Sure, we could break down every swing and every batter and every pitch, but 1. Who has time for that, and 2. The real-life differences in everything can just be kind of swooped up as statistically insignificant.

Now there are a couple of ways to use BABIP. You can use it to explore a pitcher, a batter, and a team, in terms of their performance - and if it's sustainable.

For starters, let's apply it to pitchers. We'll use Matt Herges between 2007 and 2008 for this example. In 2007, Herges had a 2.96 ERA; he struck out 5.55 batters/9, walked 2.77/9, and gave up 0.74 HR9. Very solid. Compare that to his career line of 6.13 K9, 3.41 BB9, and 0.84 HR9. Not too far off at all, right? Now let's look at 2008. K9 6.44, BB9 3.36, HR9 0.70. Again, not very far off of his career numbers, but it came with a 5.04 ERA! You could argue (based on the rate stats) that he pitched very similarly in 2008 as he did in 2007, so what went wrong?

This is where we look at BABIP. 2007, Herges was sporting a nice, proud .219 BABIP. Now, just based on that baseline number above, this is incredibly low. In 2008, his BABIP was .353. This is very high. Now, considering the fact that all the fielding-independent numbers didn't change much, we can probably thank a bit of randomness as to why the ball found gloves or found ground.

Remember how good the Rockies' defense was in 2007. That will help any pitcher look better. Batted balls find gloves and become outs. In 2008 however, a lot of key injuries definitely sunk the defense, and you could very likely thank the lack of defense for part of Herges' inflated ERA.

Similarly, with a batter, you look at their career BABIP to see what they're doing differently. If they're walking a similar amount as usual, hitting for the same power, hitting the same number of line drives, yet still aren't performing, maybe they're just hitting them right at fielders. Or if they're performing amazingly, perhaps they're getting the benefit of the opposite. Some batters are just high BABIP hitters, though, and some are low. It really varies, based on how many balls the batter puts into play.

Our real-player example is Matt Kemp. He's batting .392 right now, with a BABIP of .486. Those tell me that he's incredibly hot right now, but there's a very distinct chance that both of those numbers are coming down to more around his career norms. Interestingly, Kemp sports a .381 career BABIP, which is extremely high. His minor league BABIP was .368, which does support his major league numbers, but his BABIP might see a big drop this season to begin pushing his career total down.

So back to the idea of a .300 benchmark. The statistics - and by this I don't mean the WARPLVORP5 or whatever, I mean the literal statistics - mean, standard deviation, etc, they suggest that 30% of batted balls will become a hit. That includes everything. To be very specific, the NL-league average right now is a bit high, at .299. Last season it was .298, the season before .301.

If a team is significantly above or below that .300 mark, and they're doing most everything else like they normally would, there's a good chance that those hits are just doing the same stuff as above mentioned: finding/missing gloves/holes/no-man's-land.

Much as I'd like to simply credit/fault a team's defense, a lot of hits are hits for no reason other than ...well, no reason! They squeak past fielders and land where they ain't. You could fault the weather, the humidity, the sun, the wind, the ceiling of the Trop', Pigeons, whatever, sometimes hits fall. There's no right or wrong to it, it just is.

The reason I bring this up in today's article is simply to recap a bit of this season's slow start.

The Dodgers have a 19.5% Line Drive %, which is right around league average. They also have a .338 BABIP, a good 40 points above league average, which suggests that they're hitting the ball well, but they're also getting a bit lucky. The Rockies are at .282, but they're also tied for the worst LD% in the NL (17.0%), signifying that not only are they getting unlucky, but they're not making good contact with the ball, to the tune of about 20 points. This isn't to say that not catching breaks is the only thing wrong with the Rockies bats, but it suggests that a combination of lack of good contact and more bad breaks than the norm are sinking the offense.

So to summarize this whole debacle, let's recap:

1. BABIP is the batting average on balls that stay in the park.

2. BABIP normalizes league-wide over the course of a season

3. If a pitcher/batter/Team is looking incredibly good/poor and they're not doing anything more special than they ever have, maybe they're getting the breaks and the gaps.

4. When a manager says something like "He's just hitting it where we ain't" that's BABIP.

5. BABIP is really good to determine if someone's just hot or cold, especially if they're not doing anything differently than they're used to.

6. While not really a "luck factor", you can attribute an abnormal BABIP with a normal performance to statistical deviation, or "bad breaks"

7. Finally, this isn't intended to be a big excuse for why we're losing. We're doing a lot of things wrong, it just seems like missing breaks isn't helping things either.

That's all for this week, RowBots, maybe we'll actually win a few this week!

Comment 28 comments  |  2 recs  | 

Do you like this story?

Comments

Display:

Solid work

I’ have been curious…given that some players are high BABIP or low BABIP players over the course of their career, what is the range associated with those tags long term. For a line-drive hitter, is a .360 BABIP sustainable for a career (in this era)?

"I have no special talents. I am only passionately curious." - Albert Einstein

by Andrew T. Fisher on Apr 22, 2009 7:19 PM MDT reply actions  

Perhaps

Todd Helton’s 25.2 % LD translates into a .340 career BABIP

Interestingly enough, Derek Jeter’s 20.7 % LD translates into a .361 BABIP – Ichiro has a similar set

Also Albert Pujols has a 20.1 % and a .322.

A lot of BABIP hitting is doing stuff like knowing when to lay down smart bunts and playing a lot of small ball. Good bat control and being able to bloop singles are going to aid BABIP – see Ichiro.

That said I don’t know what the hell Matt Kemp is doing.

High BABIP does not necessarily indicate skill. It’s just the same as batting average, it can overrate batters a lot.

Follow me on Twitter! http://www.twitter.com/rockiesmagicnum - But be warned I only actually "tweet" every....well, not often.

by Andrew Martin on Apr 22, 2009 7:39 PM MDT up reply actions  

High BABIP does not necessarily indicate skill. It’s just the same as batting average, it can overrate batters a lot.

Even over the course of a career? I have a difficult time thinking 6000 ABs would not filter out randomness. Surely it would mean something.

That said I don’t know what the hell Matt Kemp is doing.

At least you can admit it. :) I have a hard time pegging him as well.

"I have no special talents. I am only passionately curious." - Albert Einstein

by Andrew T. Fisher on Apr 22, 2009 8:21 PM MDT up reply actions  

Well it depends on your valuation of batters

Guys with very high averages tend to not walk a lot anymore, see Ichiro or Pablo Sandoval (yes, I know, Chipper). I find high averages to be flukes more than consistencies, but it really depends on what

Speaking of Ichiro, like 2 years ago he batted .350+ for his 3rd straight season. There are only 2 “active” (and by saying active, I mean have played within the past 2 years) players who have batted .350 for 2 seasons in their career. Can you name them?

Follow me on Twitter! http://www.twitter.com/rockiesmagicnum - But be warned I only actually "tweet" every....well, not often.

by Andrew Martin on Apr 22, 2009 11:03 PM MDT up reply actions  

Helton and Pujols

Manny just missed it- .351 in 2000, but .349 in 2002.

by kosmo99 on Apr 22, 2009 11:16 PM MDT up reply actions  

My problem

with the theory of BABIP is this: it seems to me to simply be a way to try to account for “luck” (however you want to define it). Which, without going on a tangent, seems to me to be an impossible task based on the very nature of luck, as I perceive it.

On a practical level, does it account for luck? Here’s my second problem (which is identified by the posts above): it treats hard-hit balls and poorly hit balls the same (so long as they don’t clear the fence). This to me is a fundamental error in the system. I mean, smashing a liner to the track is on an entirely different from a check swing roller, and aggregating whether or not one or both fell in for hits doesn’t tell you very much about what happened, yet here they are treated the same. I don’t buy that as a legit principle. And it certainly doesn’t say too much about luck to me.

Really, there are just far too many factors to reduce this kind of stuff to a statistic. Which is why BABIP is an interesting and potentially useful tool for trend/long term analysis, but it really not that useful in anything involving short term strategy or considerations (as in, why are the Rockies playing poorly and how can it be fixed?) Maybe there’s something to the fact that this stuff tends to normalize, but it hasn’t been unlocked just yet.

Whatever, I’m old school trying to get hip about modern stats, but all these inherent flaws in trying to reduce the game to numbers drive me crazy and keep me from really getting on board.

by Teekalong on Apr 22, 2009 8:39 PM MDT reply actions  

One of the beauties of baseball, in my opinion

is the the endless, hopelessly romantic pursuit of statisticians to define what happens on the field. There are new stats generated every year nowadays in this everlasting quest. Are there flaws in BABIP? Sure, but there are flaws and blind areas covered by every statistic out there, especially the old school ones. You’ll find the holes if you look hard enough. Therefore, statistics are very useful tools but can be dangerous when not understood. That’s true for every statistic.

Baseball will never be perfectly described by statistics, nor should it be. But we will always try to get closer to that limit.

"I have no special talents. I am only passionately curious." - Albert Einstein

by Andrew T. Fisher on Apr 22, 2009 8:51 PM MDT up reply actions  

Like saying Walks, K's and HR's are a "true outcome"...

That are dependant on the pitcher and hitter. The park factors into a HR, look at Atkins HR yesterday to see how a small jut in the wall prevented a tater from appearing in Atkins scorecard. Don’t get me started the inlfuence of umpiring on walks and K’s.

Ef you baldo try Hair-Be Dere!!

by Charlie77 on Apr 22, 2009 9:03 PM MDT up reply actions  

Yeah, I knew someone was gonna mention this

And yes, parks effect them hugely. The point is simply that is has nothing to do with the fielders or any player interactions on the field.

And umpires are not considered in this because both teams are umped by the same ump. Yes, it seems one team might get better calls, but you can’t really factor that in, because there’s nothing to really back it up with, you know?

Point is, Tulowitzki has no real bearing on a strikeout. Michael Cuddyer won’t stop Scott Baker from walking a guy. Yes, dingers can be taken away, which is why they are kind of a hole in the 3 true outcomes, but for the most part, it doesn’t matter where a fielder plays, the number of homers that are miraculously snagged are going to be statistically insignificant over the course of a season.

Follow me on Twitter! http://www.twitter.com/rockiesmagicnum - But be warned I only actually "tweet" every....well, not often.

by Andrew Martin on Apr 22, 2009 11:07 PM MDT up reply actions  

Basically, just like in regards to a batted ball,

The umpire is considered part of the field. Both teams play on it, both teams are officiated by them.

Follow me on Twitter! http://www.twitter.com/rockiesmagicnum - But be warned I only actually "tweet" every....well, not often.

by Andrew Martin on Apr 22, 2009 11:12 PM MDT up reply actions  

luck?

While BABIP does average out to .300 across the league, certain players are going to have consistenly higher BABIP than others, due to a variety of factors such as LD rate and speed (fast guys beat out more infield hits).

They key is to look at a player’s current BABIP in comparison to his career BABIP. If he’s a career .330 BABIP guy and has a .335 BABIP, there isn’t much luck in this. On the other hand, if he his a career .280 BABIP guy and has a .320 BABIP, there’s probably a lot of luck in this.

on a tangent, I compiled the LD, FB, GB rates for home/road for the last few years for the Rockies. They have a significantly higher LD rate on home vs. road – a bigger variation than most teams. I’d like to break this down further to see if LD rate increases during a road trip (big Coors hangover in game 1, small hangover in game 9) but haven’t had the time.

http://www.observingcasually.com/coors-hangover

I have the raw data for all teams, if anyone wants it.

by kosmo99 on Apr 22, 2009 8:53 PM MDT reply actions  

Nice article RMN..

I have never really got into BABIP before now. It is an interesting stat to use. You say the .300 mark is the average, can you provide a standard deviation or a 95% confidence interval for a set time period? Thought it would be interesting to look at what a good and a bad BABIP would be.

Ef you baldo try Hair-Be Dere!!

by Charlie77 on Apr 22, 2009 9:21 PM MDT reply actions  

You definitely could, it just might be tricky.

If you go to baseball reference and look at the batting splits for a league for a year, you can see where the league-average BABIP is. You could go team-to-team and calculate one based on that, and we could pull up our ol’ Z-tables and find out where the interval is based on a normal distribution…but it’s late. I might do it tomorrow.

Follow me on Twitter! http://www.twitter.com/rockiesmagicnum - But be warned I only actually "tweet" every....well, not often.

by Andrew Martin on Apr 22, 2009 11:10 PM MDT up reply actions  

God, we're all a bunch of nerds

I love it :) .

"Admirably obsessive." - Uni Watch, March 24th, 2009
NA34 | HK | RMN

by oo_nrb on Apr 22, 2009 11:19 PM MDT up reply actions  

Is it bad that Z-table talk has taken a little sting away from the loss?

"I have no special talents. I am only passionately curious." - Albert Einstein

by Andrew T. Fisher on Apr 22, 2009 11:36 PM MDT up reply actions  

Z-tables are like a warm blanket..

To those of us who have spent a thanksgiving weekend stuck in the library cramming for a stats final.

Ef you baldo try Hair-Be Dere!!

by Charlie77 on Apr 22, 2009 11:38 PM MDT up reply actions  

Probably could copy and paste into Excel..

Then have it perform the calculation, but that’s just me and I’m kinda lazy that way.

Ef you baldo try Hair-Be Dere!!

by Charlie77 on Apr 22, 2009 11:36 PM MDT up reply actions  

we use it exclusively in my supply chain management class

Normsdist

Follow me on Twitter! http://www.twitter.com/rockiesmagicnum - But be warned I only actually "tweet" every....well, not often.

by Andrew Martin on Apr 23, 2009 9:08 AM MDT up reply actions  

Well I didn't even need that

because the 95% interval is found by Mean+-1.96*StDev

which means it’s (.251,.343)

Follow me on Twitter! http://www.twitter.com/rockiesmagicnum - But be warned I only actually "tweet" every....well, not often.

by Andrew Martin on Apr 23, 2009 9:20 AM MDT up reply actions  

Very Nice!!

Ef you baldo try Hair-Be Dere!!

by Charlie77 on Apr 23, 2009 11:03 AM MDT up reply actions  

up to you to analyze it now, slugger

Follow me on Twitter! http://www.twitter.com/rockiesmagicnum - But be warned I only actually "tweet" every....well, not often.

by Andrew Martin on Apr 23, 2009 11:50 AM MDT up reply actions  

From a pitching perspective

Steven Strasburg has a .398 BABiP against him this year. Probably because less than half of the guys he faces put the bat on the ball.

The 2009 Colorado Rockies: Expect the Unexpected

by free7694 on Apr 23, 2009 1:07 AM MDT reply actions  

college

I wonder what the average BABIP is for college. I would assume that it would be higher, simply due to having worse fielders (less rangey) pursuing the balls, allowing more hit to fall in. There’s also the wood/aluminum issue – you’re not going to break a bat on a swing and hit the ball weakly as a result as you might in MLB or the minors.

by kosmo99 on Apr 23, 2009 6:30 AM MDT reply actions  

All I know is

-Their pitching has serious issues.
-Their hitting has been atrocious
Their defense has issues at times.

Bottom line it is last year all over, which begs the question. Does this team really have the talent? Got to be honest with you, take away Tulos second half of 2007 and what sort of career has he had? Ianetta had a solid 2/3 of last year but overall his career has been disappointing. We suspect Helton/Atkins in decline. Aaron Cook has struggled since late last July and no Rock pitcher has ever put together back to back good seasons.

You can create stats to rationalize it as you wish. Bottom line is this team is dead last and plays, executes, and looks really bad.

by PinchHitLancePainter on Apr 23, 2009 6:54 AM MDT reply actions  

Tulo's second half in 08 was pretty strong too

Which leads me to believe he could just be a really slow starter. Which, hey, lots of guys are, even elite guys like Mark Teixeira. As far as Iannetta, I’m not sure you can call one year as a backup, one year as a starter, and two weeks a ‘disappointing’ career, but his start this season certainly has been.

I’m not passing judgment on Cook after three starts. And as far as no Rockies pitcher putting together back to back good seasons, 2006 and 2007 Jeff Francis would like a word with you.

Staying on the sunny side of Blake Street since 1993.
The Shawn Chacon Experience - Life as a Rockies fan, one day at a time: Because we're all still recovering from those nine blown saves.

by Franchise26 on Apr 23, 2009 7:30 AM MDT up reply actions  

And so as not to end this on as harsh a note:

You’re not wrong about the Rockies playing like crap. That’s obvious. But just like sometimes a cigar is just a cigar, sometimes a bad stretch is just a bad stretch. If it’s truly emblematic, I don’t think we’ll know for a few more weeks.

Staying on the sunny side of Blake Street since 1993.
The Shawn Chacon Experience - Life as a Rockies fan, one day at a time: Because we're all still recovering from those nine blown saves.

by Franchise26 on Apr 23, 2009 7:35 AM MDT up reply actions  

Comments For This Post Are Closed


User Tools

Welcome to the SB Nation blog about the Colorado Rockies, established 28 April 2005.

Community Guidelines
RockiesRoster.

FanPosts

Community blog posts and discussion.

Recommended FanPosts

Small
Musical Analysis of Baseball

Recent FanPosts

Paul_by_jerichasmall_small
PRMLB February Thread
Small
New Tulo FB page Please Join!!!
Img_1229_small
Better Things: Starting Pitching in 2012
Rockies1_small
2012 Projected Opening Day Payroll
2009__1_small
Opening Day & Fireworks Tickets
Img_1229_small
Mount Olympus on Lake Otsego
Img_1229_small
PRMLB: The January Thread
Avatar_small
Off Season Picture Time
Happy-face_small
Taking Out The Trash (And How Michael Cuddyer Can Help Us Do It)

+ New FanPost All FanPosts >

Colorado Sports Blogs

Mile High Report (Denver Broncos)
Mile High Hockey (Colorado Avalanche)
Denver Stiffs (Denver Nuggets)
Burgundy Wave (Colorado Rapids)
The Ralphie Report (CU Buffaloes)
SB Nation Denver

Top 30 PuRPs

  1. Drew Pomeranz, LHP - AA/MLB
  2. Nolan Arenado, 3B - A (Adv)
  3. Wilin Rosario, C - AA/MLB
  4. Chad Bettis, RHP - A (Adv)
  5. Tyler Matzek, A (Adv), A
  6. Alex White, AA/MLB
  7. Kyle Parker, OF - A
  8. Tim Wheeler, OF - AA
  9. Josh Rutledge, SS - A (Adv)
  10. Charlie Blackmon, OF - MLB
  11. Rosell Herrera, SS/3B - Rookie
  12. Trevor Story, SS/3B - Rookie
  13. Edwar Cabrera, LHP - A (Adv)
  14. Tyler Anderson, LHP - unassigned
  15. Rafael Ortega, OF - A
  16. Peter Tago, RHP, A
  17. Christian Friedrich, LHP - AA
  18. Joe Gardner, RHP - AA
  19. Corey Dickerson, OF - Low-A
  20. Thomas Field, 2B - AA
  21. Will Swanner, C - Rookie
  22. Kent Matthes, OF - A (Adv)
  23. Albert Campos, RHP - A
  24. Jordan Pacheco, C/UT - AAA/MLB
  25. Cristhian Adames, SS - A
  26. Ben Paulsen, 1B - AA
  27. Josh Slaats, RHP - Low-A
  28. David Kandilas, CF - Rookie
  29. Jayson Aquino, LHP - DSL
  30. Hector Gomez, SS - AA/MLB
HM:  
Edgmer Escalona, RHP - AAA/MLB
Dillon Thomas, OF - Rookie
Sam Mende, IF - Rookie
Mike Zuanich, 1B - AA
Dan Houston, RHP - AA

updated 10/25/2011. 


Managers

Rox_girl_small Rox Girl

35l7yvb_small Andrew Martin

Staff

Jeff_aberle_small Jeff Aberle

Poison-the-well-the-tropic-rot_small Bryan Kilpatrick

Avatar2_small Andrew T. Fisher

Wittgenstein_small Greg Stanwood

Special Assistants to the GM

Rockies_lost_americana_small holly96