wES Rate: Creating an Improved CSW Rate

Author’s Note: Prior to writing this article, I failed to do adequate research and did not recognize that Jesse Roche had covered a very similar topic in his October 2021 article titled Adjusted Called Strikes: Not All Strikes Are Created Equal. While our work isn’t a perfect mirror to one another’s, I want to apologize to Jesse and the folks at Baseball Prospectus for this error, and highly recommend that you check out the excellent work that they did.

As it stands today, CSW (called strikes plus whiffs) rate is one of the most commonly referenced statistics in the online baseball community. Coined by our very own Nick Pollack, and formalized in an article by Alex Fast (with help from Colin Charles) in 2019, it has become arguably the most favored snapshot of pitcher performance.

Many metrics exist in our little corner of the internet to help us gain insight into player performance. Analytics-inclined folks like myself know that most of them range from cumbersome to outright overwhelming to calculate for anyone who isn’t as in tune with that side of the game. CSW’s elegance comes from its simplicity and accessibility. Three numbers, easily accessible through Baseball Savant, are all you need to calculate CSW:

Called Strikes
Whiffs
Total Pitches Thrown
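
As a quick illustration, the calculation from those three numbers can be sketched in a few lines of Python. The function name and the sample counts are made up for the example; they are not real Baseball Savant figures.

```python
def csw_rate(called_strikes: int, whiffs: int, total_pitches: int) -> float:
    """Return CSW rate: (called strikes + whiffs) / total pitches."""
    if total_pitches <= 0:
        raise ValueError("total_pitches must be positive")
    return (called_strikes + whiffs) / total_pitches


# Hypothetical pitcher: 450 called strikes and 350 whiffs on 2,800 pitches.
rate = csw_rate(450, 350, 2800)
print(f"CSW: {rate:.1%}")
```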

While CSW is already an excellent metric, I found myself having wandering thoughts about it over the offseason. The part that I found myself fixated on was the idea that all called strikes and whiffs—which I’ll collectively be dubbing “earned strikes” for the rest of this piece—should not be valued equally.

Before doing any Baseball Savant searches or hard research, my hunch was that if you broke down earned strikes by the count they were thrown in, the CSW rate would be higher in counts with no strikes, and decrease as the batter was closer to striking out.

To put it simply, as the count became more competitive, CSW rates would decrease.

Using two of an amateur stats junkie’s favorite tools—Baseball Savant search and Microsoft Excel—and the 2021 season as a test, I calculated the league-wide CSW rate for last year, along with a breakdown of CSW rate in each count:

CSW by Count – 2021

Count	0 Balls	1 Ball	2 Balls	3 Balls
0 Strikes	38.3%	34.0%	35.5%	57.3%
1 Strike	24.8%	25.7%	26.4%	28.1%
2 Strikes	19.0%	20.2%	20.6%	19.8%
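
For readers who would rather script this than click through Baseball Savant and Excel, here is a rough sketch of the same kind of breakdown. It assumes pitch-level records with `balls`, `strikes`, and `description` fields, and it assumes that `called_strike`, `swinging_strike`, and `swinging_strike_blocked` are the descriptions that count as earned strikes. The field names and that set are assumptions modeled on Statcast exports, and the sample pitches are invented.

```python
from collections import defaultdict

# Assumed set of pitch descriptions that count as an earned strike.
EARNED = {"called_strike", "swinging_strike", "swinging_strike_blocked"}


def csw_by_count(pitches):
    """Map each (balls, strikes) count to its CSW rate."""
    totals = defaultdict(int)
    earned = defaultdict(int)
    for p in pitches:
        count = (p["balls"], p["strikes"])
        totals[count] += 1
        if p["description"] in EARNED:
            earned[count] += 1
    return {count: earned[count] / totals[count] for count in totals}


# Invented pitches, only to show the shape of the input.
sample = [
    {"balls": 0, "strikes": 0, "description": "called_strike"},
    {"balls": 0, "strikes": 0, "description": "ball"},
    {"balls": 0, "strikes": 1, "description": "foul"},
    {"balls": 0, "strikes": 2, "description": "swinging_strike"},
]
for count, rate in sorted(csw_by_count(sample).items()):
    print(count, f"{rate:.1%}")
```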

It’s abundantly clear that earned strikes become more scarce as the batters get closer to striking out.

With this in mind, how do we determine values for a count-dependent earned strike? I was starting to get a bit out of my element, so I enlisted the help of Jeff Nicholas, data analyst extraordinaire here at Pitcher List, and dove in.

Balancing the Scales

Our first stab at assigning values to earned strikes by count was to calculate them using the expected run value. The chasm between each of these values was so wide—particularly once we got to 2-strike counts—that it rapidly became clear we weren’t on the right track. So, that idea was scrapped as quickly as it was adopted. In our second attempt, Jeff suggested calculating the Z-scores for CSW rate in each count, adding those Z-scores to 1, and using that sum as the value for an earned strike for each count. Here are the results:

Earned Strike Values by Count – 2021

Count	Z-Score	Earned Strike Weight (Z-Score + 1)
0-0	-0.22	0.78
0-1	0.09	1.09
0-2	0.22	1.22
1-0	-0.12	0.88
1-1	0.07	1.07
1-2	0.20	1.20
2-0	-0.15	0.85
2-1	0.06	1.06
2-2	0.19	1.19
3-0	-0.66	0.34
3-1	0.02	1.02
3-2	0.21	1.21
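
To make the weighting concrete, here is a minimal sketch of how these values might be applied. The weights come straight from the 2021 table above, with counts written balls-strikes. The formula itself (the sum of weighted earned strikes divided by total pitches) is an assumed reading of the method rather than a confirmed definition, and the pitcher's numbers are invented.

```python
# Earned strike weights keyed by (balls, strikes), from the 2021 table above.
WEIGHTS = {
    (0, 0): 0.78, (0, 1): 1.09, (0, 2): 1.22,
    (1, 0): 0.88, (1, 1): 1.07, (1, 2): 1.20,
    (2, 0): 0.85, (2, 1): 1.06, (2, 2): 1.19,
    (3, 0): 0.34, (3, 1): 1.02, (3, 2): 1.21,
}


def wes_rate(earned_by_count, total_pitches):
    """Assumed formula: weighted earned strikes / total pitches."""
    if total_pitches <= 0:
        raise ValueError("total_pitches must be positive")
    weighted = sum(WEIGHTS[count] * n for count, n in earned_by_count.items())
    return weighted / total_pitches


# Hypothetical pitcher: earned strikes per count, on 2,000 total pitches.
earned = {(0, 0): 300, (0, 1): 150, (1, 2): 100, (3, 2): 50}
print(f"wES: {wes_rate(earned, 2000):.2%}")
print(f"CSW: {sum(earned.values()) / 2000:.2%}")
```

Under this reading, a pitcher who piles up earned strikes in 0-0 and 3-0 counts would see a wES rate below their CSW rate, while one who earns them with two strikes would see it rise.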

This looked much better, as it more closely reflected the value of count-dependent earned strikes relative to the CSW rate across our entire sample. But we had to see if our work bore any fruit. For that, we went back to where it all began: Alex Fast’s original article introducing CSW rate.

More Descriptive than CSW

One of the first high-correlation comparisons Alex found was between CSW rate and strikeout rate. This seemed like the most logical place to push towards, so I asked Jeff to generate scatter plots to determine the strength of correlation of both CSW and wES rates with strikeout rate. We wanted to stick to pitchers with a high volume of work (starters/high-usage relievers), so we set a minimum of 1000 pitches thrown to “qualify” for our test. The strength of correlation between two data sets is found by using a linear regression formula to calculate r². This formula will spit out a number between 0 and 1; the larger the number, the stronger the correlation.
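
For anyone who wants to check this kind of comparison themselves: in a simple one-variable linear regression, r² is the square of the Pearson correlation coefficient. The sketch below uses only the Python standard library. The function name and the sample rates are placeholders, not data from this study.

```python
from math import fsum


def r_squared(xs, ys):
    """r^2 of a simple linear regression of ys on xs (squared Pearson r)."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length samples with at least 2 points")
    mean_x = fsum(xs) / n
    mean_y = fsum(ys) / n
    sxy = fsum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = fsum((x - mean_x) ** 2 for x in xs)
    syy = fsum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("r^2 is undefined when either sample is constant")
    return (sxy * sxy) / (sxx * syy)


# Placeholder values: a rate metric and strikeout rate for four pitchers.
metric = [0.25, 0.28, 0.30, 0.33]
k_rate = [0.20, 0.24, 0.27, 0.31]
print(round(r_squared(metric, k_rate), 3))
```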

Here’s what we came up with:

While it’s intuitive that K% would correlate well with both of these metrics, the fact that wES rate handily outpaces classic CSW means two things:

wES rate is definitively more descriptive than CSW rate; and
Jeff and I are absolutely on the right track.

You may ask why we didn’t look at swinging-strike rate or Whiff rate like Alex and Colin did. The reason is that they already proved with their work that CSW is consistently better than both of those metrics at describing a pitcher’s success. The only thing we needed to prove with wES rate is that it’s better than CSW, and we’ve done that.

But what about a predictive standpoint?

More Predictive than CSW

Alex and Colin also found strong correlations between CSW and SIERA, one of the most prominent ERA estimators. This was encouraging because it put them on a path to determining CSW rate’s predictiveness relative to other commonly used stats at the time. Following the trail that they cut through the jungle for us, Jeff and I pushed forward in the same manner as we did with K%. Here’s how wES rate stacked up against CSW in correlation to SIERA for the 2021 season:

Once again, wES rate emerges with a stronger correlation than its predecessor. I was vibrating with excitement at this point, but our work wasn’t quite done; we still had one more test to run to determine the relative predictiveness of wES rate. I’ll let Alex explain it:

“I figured the best way to determine whether CSW had predictive qualities was to take CSW rates from previous seasons and compare them to the following year’s SIERA to see if there was a change. For example, if a pitcher posted a 30% CSW rate in 2016, and a 38% CSW rate in 2017, we would hope to see their SIERA go down from the ’16 to ’17 season.”

Following the path that Alex and Colin cut for us one last time, I asked Jeff to find every instance of a pitcher throwing at least 1000 pitches in consecutive seasons during the Statcast era (threshold dropped to 500 pitches for the pandemic-shortened 2020 season), measure the difference in SIERA between year one and year two, and see whether those changes correlated more strongly with changes in CSW rate or wES rate. Here’s what we came up with:

Differences in Year 1 to Year 2 SIERA compared to wES% (left) and CSW% (right) from 2015-2021

Bingo! That’s a meaningful jump in predictiveness!
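
A rough sketch of that pairing step, assuming per-season records keyed by pitcher and year. The field names, helper names, and sample rows are hypothetical. The 1000-pitch minimum and the 500-pitch minimum for 2020 come from the description above.

```python
def min_pitches(year):
    """Qualifying threshold: 500 pitches for 2020, 1000 otherwise."""
    return 500 if year == 2020 else 1000


def year_over_year_deltas(seasons, metric):
    """Return (metric change, SIERA change) for each qualifying season pair.

    seasons maps (pitcher, year) to a dict with 'pitches', 'siera', and
    the chosen metric key.
    """
    pairs = []
    for (pitcher, year), cur in seasons.items():
        nxt = seasons.get((pitcher, year + 1))
        if nxt is None:
            continue  # no consecutive season on record
        if cur["pitches"] < min_pitches(year):
            continue
        if nxt["pitches"] < min_pitches(year + 1):
            continue
        pairs.append((nxt[metric] - cur[metric], nxt["siera"] - cur["siera"]))
    return pairs


# Invented seasons for a made-up pitcher "A".
seasons = {
    ("A", 2019): {"pitches": 2500, "siera": 4.10, "wes": 0.27},
    ("A", 2020): {"pitches": 900, "siera": 3.80, "wes": 0.29},
    ("A", 2021): {"pitches": 800, "siera": 3.90, "wes": 0.28},
}
print(year_over_year_deltas(seasons, "wes"))
```

In this toy data only the 2019-to-2020 pair qualifies, since the 2021 season falls short of 1000 pitches. The metric deltas can then be compared with the SIERA deltas using whatever correlation measure you prefer.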

Wrapping Up wES Rate

How quickly does wES rate stabilize in-season?

This is probably one of the more exciting discoveries that we made with wES rate (shout out to Alex Fast for asking about it when I had him review this piece). Jeff ran the numbers using this code from Jonah Pemstein at FanGraphs, and found that CSW stabilizes at ~700 pitches thrown, while wES stabilizes at ~570 pitches (chart below). Another improvement!

Stabilization comparison between wES% and CSW% – Target Alpha of 0.7

Is there going to be an easy way to track this throughout the season?

It’s above my current skill set, but I will eventually be working on a leaderboard for wES that will probably live on Tableau or some such platform. And hey, maybe I can convince Nick and the dev team here at PL to get it on the player pages if people like it enough!

Update 1/10/2023: Here is the full 2022 leaderboard! I set the minimum pitches threshold to 1200 to do two things:

Weed out most of the relievers (might talk about them in a different article altogether); and
Show off all the folks who got meaningful opportunities as starters in 2022.

Enjoy!

2022 wES Rate Leaderboard – Updated 1/10/2023

Rank	Name	Pitches	wES%
1	McClanahan, Shane	2463	33.26%
2	Heaney, Andrew	1230	33.10%
3	Cole, Gerrit	3274	32.85%
4	Nola, Aaron	3039	32.31%
5	Strider, Spencer	2277	32.25%
6	Burnes, Corbin	3274	31.95%
7	Ohtani, Shohei	2629	31.87%
8	Ashby, Aaron	1878	31.67%
9	Rodon, Carlos	2985	31.17%
10	Woodruff, Brandon	2538	30.96%
11	Bieber, Shane	2875	30.90%
12	Scherzer, Max	2167	30.89%
13	Kershaw, Clayton	1842	30.77%
14	Luzardo, Jesus	1644	30.65%
15	Morton, Charlie	2994	30.58%
16	Garrett, Braxton	1436	30.32%
17	Springs, Jeffrey	2246	30.07%
18	Musgrove, Joe	2834	29.99%
19	Lodolo, Nick	1779	29.85%
20	Gausman, Kevin	2783	29.78%
21	Cabrera, Edward	1215	29.63%
22	Singer, Brady	2376	29.58%
23	Darvish, Yu	3082	29.38%
24	Cease, Dylan	3120	29.37%
25	Wright, Kyle	2697	29.23%
26	Peralta, Freddy	1353	29.20%
27	Brubaker, JT	2381	29.15%
28	Greene, Hunter	2200	29.13%
29	Falter, Bailey	1295	29.06%
30	Giolito, Lucas	2753	29.00%
31	Gray, Jon	2062	28.89%
32	Contreras, Roansy	1588	28.87%
33	Hill, Rich	1990	28.85%
34	Snell, Blake	2341	28.81%
35	Garcia, Luis	3434	28.78%
36	Kluber, Corey	2454	28.67%
37	Javier, Cristian	2554	28.59%
38	Sandoval, Patrick	2456	28.52%
39	Severino, Luis	1639	28.50%
40	Cobb, Alex	2506	28.47%
41	Valdez, Framber	3018	28.45%
42	Gonsolin, Tony	2015	28.28%
43	Wood, Alex	2239	28.28%
44	Junis, Jakob	1802	28.28%
45	Cortes, Nestor	2465	28.26%
46	Webb, Logan	3006	28.24%
47	Skubal, Tarik	1914	28.18%
48	Urias, Julio	2622	28.15%
49	Bassitt, Chris	2886	28.13%
50	Castillo, Luis	2662	28.05%
51	Gray, Sonny	1931	28.00%
52	Mahle, Tyler	2101	27.97%
53	Montgomery, Jordan	2808	27.96%
54	Lopez, Pablo	2910	27.94%
55	Eovaldi, Nathan	1739	27.92%
56	Ryan, Joe	2391	27.73%
57	Kikuchi, Yusei	1843	27.72%
58	McKenzie, Triston	2807	27.71%
59	Wheeler, Zack	2358	27.70%
60	Gallen, Zac	2935	27.44%
61	Ray, Robbie	3047	27.44%
62	Lauer, Eric	2764	27.40%
63	Manoah, Alek	3047	27.39%
64	Carrasco, Carlos	2412	27.24%
65	Stroman, Marcus	2181	27.24%
66	Peterson, David	1984	27.22%
67	Alcantara, Sandy	3343	27.19%
68	Gray, Josiah	2692	27.17%
69	Anderson, Tyler	2576	27.17%
70	Hendricks, Kyle	1347	27.12%
71	Manaea, Sean	2482	27.11%
72	Civale, Aaron	1588	27.09%
73	Berrios, Jose	2721	27.01%
74	Espino, Paolo	1873	26.99%
75	Lynn, Lance	2022	26.98%
76	Stripling, Ross	2033	26.98%
77	Verlander, Justin	2607	26.96%
78	Fried, Max	2818	26.95%
79	Smyly, Drew	1814	26.92%
80	Kelly, Merrill	3071	26.92%
81	Crawford, Kutter	1296	26.80%
82	Thompson, Keegan	1878	26.80%
83	Dunning, Dane	2548	26.73%
84	Kopech, Michael	2011	26.52%
85	Detmers, Reid	2261	26.50%
86	Suarez, Jose	1783	26.45%
87	Blackburn, Paul	1762	26.41%
88	Wells, Tyler	1649	26.41%
89	Perez, Martin	2979	26.40%
90	Wainwright, Adam	3133	26.37%
91	Kirby, George	2092	26.33%
92	Pivetta, Nick	3132	26.30%
93	Rogers, Trevor	1940	26.22%
94	Walker, Taijuan	2500	26.15%
95	Taillon, Jameson	2805	26.07%
96	Bundy, Dylan	2180	26.05%
97	Rasmussen, Drew	2239	25.99%
98	Montas, Frankie	2327	25.93%
99	Gore, MacKenzie	1258	25.91%
100	Gibson, Kyle	2804	25.85%
101	Steele, Justin	2032	25.78%
102	Clevinger, Mike	1931	25.72%
103	Urquidy, Jose	2640	25.71%
104	Voth, Austin	1762	25.67%
105	Davies, Zach	2383	25.42%
106	Syndergaard, Noah	2026	25.39%
107	Marquez, German	2839	25.37%
108	Lyles, Jordan	2988	25.34%
109	Keller, Mitch	2661	25.26%
110	Hearn, Taylor	1787	25.18%
111	Gilbert, Logan	3015	25.17%
112	Archer, Chris	1787	25.11%
113	Mikolas, Miles	3152	25.10%
114	Irvin, Cole	2610	25.09%
115	Bradish, Kyle	1993	25.07%
116	Anderson, Ian	2005	25.06%
117	Thompson, Zach	2080	24.98%
118	Kremer, Dean	2041	24.97%
119	Feltner, Ryan	1675	24.86%
120	Corbin, Patrick	2623	24.84%
121	Suarez, Ranger	2564	24.81%
122	Kuhl, Chad	2375	24.80%
123	Quintana, Jose	2724	24.56%
124	Lynch, Daniel	2523	24.47%
125	Otto, Glenn	2229	24.44%
126	Minor, Mike	1750	24.42%
127	Watkins, Spenser	1642	24.35%
128	Lorenzen, Michael	1654	24.33%
129	Gomber, Austin	2059	24.25%
130	Bubic, Kris	2262	24.19%
131	Wacha, Michael	1949	23.99%
132	Plesac, Zach	2153	23.86%
133	Kaprielian, James	2230	23.75%
134	Keuchel, Dallas	1268	23.57%
135	Greinke, Zack	2272	23.51%
136	Freeland, Kyle	2931	23.50%
137	Wilson, Bryse	1825	23.34%
138	Rodriguez, Eduardo	1469	23.34%
139	Odorizzi, Jake	1798	23.15%
140	Fedde, Erick	2420	23.15%
141	White, Mitch	1715	23.03%
142	Sampson, Adrian	1795	23.02%
143	Bumgarner, Madison	2715	23.00%
144	Hutchison, Drew	1844	22.79%
145	Pallante, Andre	1753	22.77%
146	Quantrill, Cal	2920	22.75%
147	Houser, Adrian	1848	22.58%
148	Adon, Joan	1204	22.50%
149	Keller, Brad	2275	22.42%
150	Cueto, Johnny	2372	22.42%
151	Gonzales, Marco	2838	22.39%
152	Alexander, Tyler	1591	22.36%
153	Ashcraft, Graham	1756	22.29%
154	Flexen, Chris	2204	21.99%
155	Heasley, Jonathan	1796	21.94%
156	Sanchez, Anibal	1232	21.66%
157	Hudson, Dakota	2253	21.58%
158	Urena, Jose	1622	21.50%
159	Brieske, Beau	1287	21.39%
160	Oller, Adam	1294	21.27%
161	Senzatela, Antonio	1521	20.81%

Qualifier: Minimum 1200 pitches thrown

Special thanks to Jeff Nicholas

Photos by Icon Sportswire | Adapted by Justin Redler (@reldernitsuj on Twitter)

4 responses to “wES Rate: Creating an Improved CSW Rate”

Yants says:

June 7, 2022 at 3:11 PM

Haven’t fully read the final product but wanted to be the first commenter. Great work, Jor Boar. Excited to dig in.

- Jordan White says:
  
  June 7, 2022 at 3:27 PM
  
  Love you buddy!
  
  - michael says:
    
    June 7, 2022 at 3:51 PM
    
    great article jordan, really superb. 2 questions. adding 1 to z-scores seems to change their relative rates (i think) but i assume it must be necessary – could you give a quick explanation as to why? second, in terms of predictive value does this year’s wES predict next year’s SIERA better than this year’s SIERA; as well, does including both SIERA and wES from this year predict SIERA better than either one of those variables independently?
    
Ron says:

June 7, 2022 at 3:49 PM

Huge. Great work. This has been in the back of my head for 2 years as I watch batters often just take a 2-0 or 3-0 pitch depending on the situation where any schmo is basically guaranteed a 100% CS on that pitch if they can get it in the zone. Meanwhile on 1-2, hitters almost will never let a CS get by.

I love where this is going. Once you have the SQL in the background powering it, the next logical variable is expanding your matrix beyond count and adding factors such as score, outs, and runners on base. All factor into batter decisions to swing or not and could potentially increase your r².


Jordan White
