Advanced Pitching Metrics for MLB Betting
Which stats actually predict future performance - and which ones just look pretty on stat sheets
ERA is a Lie (Sort Of)
Let me start with something that'll piss off old-school baseball fans: ERA is almost useless for prediction. Not completely useless—but if you're betting based on ERA, you're betting on what already happened, not what's likely to happen next.
Why? Because ERA includes too much noise. A pitcher with a 2.80 ERA might be legitimately elite, or he might be a 3.80 guy who's gotten lucky with batted ball sequencing and defense. The market can't always tell the difference fast enough, which creates opportunities.
Here's what happened in 2024: pitchers with ERAs a full run better than their FIP (Fielding Independent Pitching) regressed an average of 0.85 runs the following month. The data screamed "sell high" but casual bettors kept backing them based on that shiny ERA. That's free money if you know what to look for.
FIP: What a Pitcher Actually Controls
FIP strips out the noise. It looks at strikeouts, walks, hit-by-pitches, and home runs—the outcomes pitchers control directly, without relying on defense or luck. The formula scales to look like ERA, making it easy to compare.
FIP Formula
FIP = ((13×HR) + (3×BB) - (2×K)) / IP + Constant
The constant (typically around 3.10) adjusts FIP to the league-average ERA scale for easy comparison.
How to Use FIP for Betting
FIP vs ERA spread: When a pitcher's ERA is significantly lower than his FIP (0.75+ runs), he's likely overperforming. That means regression is coming. Books often don't adjust fast enough, creating fade opportunities.
Example: In June 2024, pitcher had a 2.65 ERA but a 3.85 FIP. His BABIP (batting average on balls in play) was .245 when league average is .300. The model screamed regression. Two weeks later, his ERA ballooned to 4.20 over his next four starts. Anyone fading him on the runline printed money.
FIP vs ERA spread going the other way: When ERA is higher than FIP, a pitcher is getting unlucky. His process is better than results suggest. That's a buy-low spot, especially if the public is down on him and the line reflects pessimism.
1. Gerrit Cole: 2.78 FIP (2.95 ERA) - sustainable
2. Dylan Cease: 2.92 FIP (3.47 ERA) - unlucky, regression candidate
3. Zack Wheeler: 3.05 FIP (2.89 ERA) - elite, slight overperformance
4. Logan Webb: 3.12 FIP (3.18 ERA) - rock solid consistency
5. Corbin Burnes: 3.22 FIP (2.85 ERA) - riding high BABIP luck
xFIP: Normalizing Home Run Luck
FIP is great, but it has a weakness: it treats all fly balls the same. In reality, some pitchers allow harder contact, some pitch in more favorable parks, and some just get lucky (or unlucky) with how many fly balls leave the yard.
xFIP (expected FIP) adjusts for this by assuming a league-average home run-to-fly-ball rate. If a pitcher's HR/FB rate is 8% but the league average is 12%, xFIP will normalize that to predict future performance more accurately.
When xFIP Matters Most
- Small sample sizes: Early in the season, a pitcher might have a 16% HR/FB rate over 30 innings. That's noise, not signal. xFIP smooths it out.
- Extreme park factors: Pitchers in Coors Field or Fenway see inflated HR rates. xFIP helps identify when that's park-driven vs. actual poor pitch quality.
- Regression candidates: Big gaps between FIP and xFIP signal unsustainable HR rates. That's actionable for betting.
Real Example: Spencer Strider 2023
ERA: 3.86
FIP: 2.85 (unlucky with timing of hits)
xFIP: 2.65 (HR/FB rate was elevated but fly ball quality wasn't poor)
The data said Strider was pitching like a sub-3.00 ERA guy but getting hammered by sequencing luck. Betting on him regressing positively (ERA dropping toward FIP/xFIP) was a high-percentage play. Over his next 8 starts, ERA dropped to 2.90. Consistent profit for anyone who understood the metrics.
SIERA: The Most Predictive Metric
SIERA (Skill-Interactive ERA) is where it gets sophisticated. Unlike FIP, SIERA accounts for the interaction between strikeouts and walks, adjusts for batted ball types, and recognizes that not all strikeouts are created equal.
A pitcher who strikes out 30% of batters but walks 12% isn't the same as one who strikes out 25% and walks 6%. SIERA understands that nuance. It's the single most predictive pitching metric for future ERA, consistently outperforming FIP and xFIP in studies.
Why SIERA is Better
- Accounts for batted ball profiles (ground balls vs. fly balls vs. line drives)
- Recognizes that high-strikeout pitchers can "get away with" slightly higher walk rates
- Better at projecting breakout and breakdown candidates than ERA-based metrics
The problem? SIERA is harder to calculate and less widely available. FanGraphs has it. Baseball-Reference doesn't. Most casual bettors never look at it, which means market inefficiency persists longer.
1. Tarik Skubal: 2.45 SIERA (breakout confirmed by metrics)
2. Zack Wheeler: 2.68 SIERA (elite consistency)
3. Gerrit Cole: 2.75 SIERA (still dealing at 34)
4. Dylan Cease: 2.89 SIERA (undervalued due to ERA volatility)
5. Corbin Burnes: 2.95 SIERA (reliable, not flashy)
When SIERA significantly diverges from ERA, trust SIERA. It's forward-looking. ERA is rearview mirror.
Stuff+: Measuring Pitch Quality Directly
Stuff+ is relatively new and it's a game-changer. Instead of waiting for outcomes (strikeouts, home runs, etc.), Stuff+ evaluates the quality of each individual pitch based on velocity, spin rate, movement, and location. It answers the question: how hard is this pitch to hit, regardless of whether the batter actually hit it?
A Stuff+ of 100 is league average. 110 is 10% better than average. 90 is 10% worse. Elite starters typically sit between 105-115. Guys throwing absolute gas with elite spin can hit 120+.
Why Stuff+ Matters for Betting
Predictive power: Stuff+ correlates with future strikeout rate and ERA better than past strikeout rate. If a pitcher's Stuff+ improves but his results haven't caught up yet, that's a bet-on opportunity.
Injury detection: Stuff+ can drop before results crater. A pitcher whose Stuff+ falls from 108 to 98 over three starts might still have decent ERA due to luck, but the underlying pitch quality decline is a massive red flag. Fade immediately.
Arsenal optimization: When a pitcher tweaks his pitch mix (throws more sliders, less changeups) and his Stuff+ spikes, he's probably unlocked something real. Books take weeks to adjust pricing. That's your window.
Case Study: Pitch Mix Change
A pitcher increased his slider usage from 18% to 32% in May 2024. His Stuff+ jumped from 102 to 112. ERA was still 3.90 because results lag process. Over the next six weeks, ERA fell to 2.75 as outcomes caught up to pitch quality. Anyone tracking Stuff+ caught that breakout early.
Spin Rate: Context is Everything
Spin rate became a mainstream stat after the 2021 sticky stuff crackdown. High spin fastballs get more "rise" and miss bats. High spin curveballs have sharper break. Sounds simple, right?
Not quite. Spin rate without context is useless. A 2,500 RPM four-seam fastball at 98 mph is elite. The same spin at 91 mph is mediocre because the velocity-to-spin ratio (which determines effectiveness) is off.
Spin Rate Red Flags
- Sudden drops: If a pitcher's fastball spin drops 150+ RPM over consecutive starts, something's wrong. Injury, mechanics issue, or illegal substance removal. Either way, fade.
- High spin, low movement: Some pitchers have high raw spin but poor "active spin" (the spin that creates movement). They get hit hard despite impressive RPM numbers. Don't fall for the spin rate hype without checking movement metrics.
1. Spencer Strider: 2,680 RPM (electric)
2. Dylan Cease: 2,510 RPM (plus pitch)
3. Gerrit Cole: 2,475 RPM (still elite at 34)
4. Shohei Ohtani: 2,440 RPM (devastating when healthy)
WHIP: Simple But Useful
WHIP (walks + hits per inning pitched) doesn't tell you why a pitcher is good or bad, but it's a decent shorthand for "how often do batters reach base?" Elite pitchers typically sit under 1.10 WHIP. Replacement-level arms are above 1.35.
The betting angle: WHIP correlates strongly with first-five-inning (F5) results. Pitchers with sub-1.05 WHIP tend to keep games close early, which matters for F5 unders and runlines. When books misprice F5 based on full-game ERA, WHIP can expose the error.
K/9 and BB/9: Control the Strike Zone
Strikeout rate (K/9) and walk rate (BB/9) are foundational. Elite pitchers miss bats (9.5+ K/9) and avoid free passes (sub-2.5 BB/9). When you see extreme ratios, pay attention.
K/BB Ratio
The strikeout-to-walk ratio is a simple but powerful metric. A ratio above 4.0 is elite. Below 2.0 is problematic. Pitchers with K/BB ratios above 5.0 almost never have prolonged slumps—even when ERA spikes temporarily, they bounce back because the underlying skills are sound.
2024 K/BB Leaders
1. Corbin Burnes: 5.85 K/BB (11.2 K/9, 1.9 BB/9)
2. Gerrit Cole: 5.20 K/BB (10.4 K/9, 2.0 BB/9)
3. Logan Webb: 4.75 K/BB (9.5 K/9, 2.0 BB/9)
These are the guys you trust even when results temporarily falter. The process is elite.
CSW%: Strikes That Matter
Called Strikes + Whiffs percentage (CSW%) measures how often a pitcher throws a strike that the batter doesn't put in play. It combines called strikes and swings-and-misses as a percentage of total pitches.
League average CSW% is around 29%. Elite pitchers sit at 32%+. When CSW% diverges significantly from K/9, it's predictive—a pitcher with high CSW% but low strikeouts is likely to see K-rate improve. That's a leading indicator books miss.
Putting It All Together: The Model Approach
No single metric tells the whole story. The sharpest bettors—and the best models—synthesize multiple inputs.
My Workflow for Evaluating Pitchers
- Start with SIERA: Most predictive baseline
- Check FIP vs ERA spread: Identify over/underperformers
- Review Stuff+: Evaluate pitch quality and detect changes
- Analyze K/BB ratio: Confirm underlying skill level
- Track spin rate trends: Spot injuries or mechanical issues early
- Compare vs. opponent lineup metrics: Context matters (covered in future post)
When multiple metrics align (SIERA low, FIP better than ERA, Stuff+ rising, K/BB above 4.0), you've found a high-conviction play. When they conflict (ERA shiny but FIP/SIERA ugly, Stuff+ declining), fade with confidence.
Final Thought: The Market Lags
Books are good, but they're not perfect. ERA still drives too much of the pricing because that's what the public bets on. By the time a pitcher's ERA catches up to his FIP/SIERA, the value is gone. Your edge is identifying the divergence before the market does.
Pitching metrics give you that edge. Use them.
See the Model in Action
Our model integrates all these metrics to find edges the market misses. Verified track record: 48-25-2 (+15.10 units).
View Daily Picks →