2058 - Error Ratings and Error Rates

Post by **RonCo** » Sat Apr 06, 2024 2:06 pm

So I was listening to the latest Brewster Babies podcast when I heard the guys raise the Ron Signal – in this case, a question about error rates and how valuable the error rating is. The real answer was, well, I knew but didn’t know. So I went to the league file and pulled some data directly from the BBA as it stands right now (meaning with four games left to play in the season).

Since they tend to get the most chances, I chose shortstops.

To do this really simple test I took every shortstop’s Error Rating, and then calculated the number of total Batted Balls in Zone for them while they played shortstop, and how many errors they were attributed. Then I tabulated the data by error rating. Simple, right?

Here’s the result:

IF Error Rating	E	BIZ	Error Rate
4	1	11	9.09%
5	4	142	2.82%
6	21	416	5.05%
7	36	1018	3.54%
8	154	4113	3.74%
9	150	5274	2.84%
10	190	8163	2.33%
11	74	3454	2.14%
League	630	22591	2.79%

First and foremost, we should immediately kill the “4” and “5” Error data due to sample size. The data for “6” Error should be looked at askew for similar reasons, but at 416 BIP it’s worth at least noting. Even the “7” is not highly represented…and as such represents a selection bias among BBA GMs. If a guy has less than an “8” error, he doesn’t play much SS—which makes this analysis a bit fraught.

If we take this data straight as it is, though, we see that BBA shortstops average out to a 2.79% error rate. If we trust that number, we’d say that the average SS Error Rating is somewhere in the low 9s. Raising to a “10” improves our error rate by about half a percent. Not piddly. Raising to an “11” adds another two-tenths.

The more interesting question, then, is what happens in those lower ranges. How bad is an “8” or a “7” or … and then the more interesting question is how many extra errors does that actually play out to. The last is a little easier to study…at least at the top end, because we can see that our most often played shortstops seem to get 600-700 BIZ. So let’s say 650. A half percent of 650 (the gap between 9 and 10) is 3.25 extra errors. You can do your own math down the line, but if we say three errors are like adding the runs of three additional singles, linear weights would say that value is maybe 1-2 runs.

ISSUES

The more interesting question is down the line, where the “8” and “7” Error Ratings lie, and where the data is a little weird.

I should note there are other issues with this data, too, not the least of which is that there will be variance within the Error Ratings. For example, though, I note that “8 Error” SS appear to create a higher rate of errors than “7” Error Rated SS. The difference is fairly small, though. Does that mean that the average “8” is very close to a “7” and vice versa? Who knows?

But I want to talk about the impact of random variance at these small rates.

For this study, let’s assume the Error Rate we see of “8” is correct. How, then, do “7” Rated players make less errors?

For that, let’s do some mathematical thought experiments. Specifically, let’s say that we think the “real” error rate of guys Rated “7” should be .038. In other words, just a little worse than the guys rated “8.” If we then let 10,000 shortstops with this error rate have 1,018 chances (just like the BBA Shortstops in this band), how often to they make errors at .0354 or less (like BBA shortstops)?

The answer is somewhere around 35% of the time.

That says that there exists a very likely case that the data we see in the “7” zone is an aberration due to sample size. It can go the other way, too. “8” could be a little lower.

So you get to read the data however you want. But one way of looking at the overall span would be to say that the low end error rate is maybe 3.8% and that the high end is 2.1%, meaning that the overall performance span of “normal” BBA shortstops is about 1.7% at the far edges. If we use 650 BIZ as our baseline for playing guys at those edges, then you’d get a variance of around 11 errors.

Is that right?

Dunno.

But Montreal’s Qutuz Mahdi carries an “8” error rating, and he’s made 30 errors on 707 BIZ to date. Phoenix’s Niaz Minhas, with a “9” error rating, has made 22 errors on 635 BIZ. Elvan Masuki ("11"), now with Charlotte, has made 11 errors on 629 BIZ.

If I squint just right, I can see that in these numbers. If anything, maybe that estimate is a little too low. But dealing with randomness at these sample sizes is always a tough thing.

Post by **RonCo** » Sat Apr 06, 2024 2:22 pm

And here's a plot of BBA SS Error Rates vs. their Error Rating, for all BBA SS with more than 300 BIZ. (With a polynomial trend line, for those who care)

Post by **Lane** » Sat Apr 06, 2024 3:16 pm

Looks pretty similar to the study i did way back in the day.

You can see that it's working in some ways but I'm still not a fan of the outliers on the high error rate for the 10 and 11 ratings.

Post by **Lane** » Sat Apr 06, 2024 3:18 pm

http://montybrewster.net/forums/viewtop ... 23&t=25137

Post by **RonCo** » Sat Apr 06, 2024 3:42 pm

That was a good look, too. Something else that strikes me, though, is to wonder if error rates are also separated by position. In other words, does a SS and a 2B with the same error rating make error at different rates?

McQuade's numbers were pretty good over the years, but better at 2B than SS.

BaseClogger · Post by **BaseClogger** » Sat Apr 06, 2024 3:46 pm

Outfielders have even smaller samples, right? That’s where I find even less consistency. And there’s a lot more variation in their ratings. You’ll see guys with a 3 which you don’t see in the infield (aside from 1B).

Does the game score errors more accurately than in real life where there’s human subjectiveness? As in, what about plays where the fielder doesn’t convert because of their hands but it didn’t result in an error.

Post by **RonCo** » Sat Apr 06, 2024 3:47 pm

There's also the Long Beach water theory!

BaseClogger · Post by **BaseClogger** » Sat Apr 06, 2024 3:48 pm

Here’s the guy I had in mind when I was talking on the podcast. He’s only got a 5 error rating but he’s played solid defense in the BBA in albeit somewhat limited sample sizes.

Post by **RonCo** » Sat Apr 06, 2024 3:55 pm

Here are current-day RF in the BBA:

OF Error	E	BIZ	Rate
3	0	73	0.00%
4	4	356	1.12%
5	18	1132	1.59%
6	36	2032	1.77%
7	38	2964	1.28%
8	26	1755	1.48%
9	5	351	1.42%
10	9	1450	0.62%
11	2	209	0.96%
Grand Total	138	10322	1.34%

Post by **RonCo** » Sat Apr 06, 2024 4:00 pm

Note that RF error rates are half those of SS just across the board.

Post by **RonCo** » Sat Apr 06, 2024 4:01 pm

Note the error rate of 11s jump up. But the jump represents exactly one more error over the entire league.

BaseClogger · Post by **BaseClogger** » Sat Apr 06, 2024 4:35 pm

RonCo wrote: ↑
Sat Apr 06, 2024 3:55 pm
Here are current-day RF in the BBA:

OF Error E BIZ Rate

3 0 73 0.00%

4 4 356 1.12%

5 18 1132 1.59%

6 36 2032 1.77%

7 38 2964 1.28%

8 26 1755 1.48%

9 5 351 1.42%

10 9 1450 0.62%

11 2 209 0.96%

Grand Total 138 10322 1.34%

More funny business. I guess it’s a well known thing defense requires huge sample sizes. We probably need a few years worth of data to get the complete picture.

Post by **RonCo** » Sat Apr 06, 2024 5:35 pm

Yep. Even real life folks say that things like ZR and whatnot are not stable with a full season of data. Because. Random.

Post by **RonCo** » Sat Apr 06, 2024 5:57 pm

BaseClogger wrote: ↑
Sat Apr 06, 2024 3:48 pm
Here’s the guy I had in mind when I was talking on the podcast. He’s only got a 5 error rating but he’s played solid defense in the BBA in albeit somewhat limited sample sizes.

There are 66 players who have had at least 100 Plays made (+ Errors) on balls in zone in center field. Here are the worst ten error rates based on:

Error Rate = Errors/(Plays made + Errors)

POS	Name	OF ERR	E	BIZ	BIZ-M	E/Plays
CF	Varnik Tuncay	7	3	113	77	0.038
CF	Jyri Kulmala	10	2	113	81	0.024
CF	Bae Pae	4	7	452	304	0.023
CF	Ed Wolochow	8	3	219	148	0.020
CF	Miguel Montero	5	2	158	110	0.018
CF	Keith Jahnke	6	2	169	117	0.017
LF	John Oliver	8	5	470	305	0.016
CF	Craig Ablett	7	4	368	251	0.016
CF	Wilson Prieto	7	4	405	265	0.015
CF	Ramón Trujillo	5	1	104	71	0.014

So, from an error standpoint alone, Montoya is not playing good defense in CF.

Post by **RonCo** » Sat Apr 06, 2024 6:07 pm

Here's an interesting table, though. Again with the sample size question, and with the idea that player ratings could be weirdly scattered inside rating zones...but this has a weird hump in the middle zones.

CF BIZ > 100

OF Error	E	Plays Made	E/(Plays+E)
4	7	304	0.022508
5	7	690	0.010043
6	2	282	0.007042
7	19	2312	0.008151
8	19	2180	0.00864
9	10	1125	0.008811
10	18	4778	0.003753
11	13	3138	0.004126
	95	14809	0.006374

I note that if I change the criteria to 150 BIZ, the curve smooths quite a bit. So it suggests that 100 BIZ is way to small to see real trends.

BaseClogger · Post by **BaseClogger** » Sat Apr 06, 2024 9:56 pm

RonCo wrote: ↑
Sat Apr 06, 2024 5:57 pm

BaseClogger wrote: ↑
Sat Apr 06, 2024 3:48 pm
Here’s the guy I had in mind when I was talking on the podcast. He’s only got a 5 error rating but he’s played solid defense in the BBA in albeit somewhat limited sample sizes.
There are 66 players who have had at least 100 Plays made (+ Errors) on balls in zone in center field. Here are the worst ten error rates based on:

Error Rate = Errors/(Plays made + Errors)

POS Name OF ERR E BIZ BIZ-M E/Plays

CF Varnik Tuncay 7 3 113 77 0.038

CF Jyri Kulmala 10 2 113 81 0.024

CF Bae Pae 4 7 452 304 0.023

CF Ed Wolochow 8 3 219 148 0.020

CF Miguel Montero 5 2 158 110 0.018

CF Keith Jahnke 6 2 169 117 0.017

LF John Oliver 8 5 470 305 0.016

CF Craig Ablett 7 4 368 251 0.016

CF Wilson Prieto 7 4 405 265 0.015

CF Ramón Trujillo 5 1 104 71 0.014

So, from an error standpoint alone, Montoya is not playing good defense in CF.

Error rate impacts ZR insofar as if you can’t complete the play it hurts the player’s score, right? He’s had good ZRs.

Post by **RonCo** » Sat Apr 06, 2024 11:20 pm

Yes. His ZR was a pretty ok 0.2 in 34 starts (48 games). So kind of better than league average. That's workable, but I admit I struggle to call that "good." He was a + in AAA.

BaseClogger · Post by **BaseClogger** » Sun Apr 07, 2024 6:54 am

RonCo wrote: ↑
Sat Apr 06, 2024 11:20 pm
Yes. His ZR was a pretty ok 0.2 in 34 starts (48 games). So kind of better than league average. That's workable, but I admit I struggle to call that "good." He was a + in AAA.

My experience has been that it’s very, very difficult to get positive ZR in CF. This is in part because I refuse to play a total Judy out there. Therefore a 8-5-9 defender putting up a positive number is quite surprising to me. I’ve seen so many 9-7-6 types put up negative ratings. And arm should have nothing to do with ZR for an outfielder.

Post by **RonCo** » Sun Apr 07, 2024 4:00 pm

Here is ground ball error rates by zone per my script. So it does seem that SS handle ground balls with greater difficulty.

	3L	3	34	4	4M	6M	6	56	5	5L
SHALLOW		0.96%	3.49%	2.12%	1.28%	1.89%	3.87%	3.29%	3.48%
MID	1.78%	1.36%	3.44%	3.19%	2.43%	3.06%	4.91%	4.72%	4.11%	1.89%
DEEP		1.18%	0.77%	0.97%	0.31%	0.37%	2.19%	0.68%	0.93%

Post by **RonCo** » Sun Apr 07, 2024 4:17 pm

Error Rates on FB by OF zones, per my script:

	9L	9	89	8R	8L	78	7	7L
SHALLOW	0.19%	0.18%	0.19%	0.06%	0.00%	0.05%	0.09%	0.39%
MID	0.27%	0.25%	0.33%	0.13%	0.09%	0.39%	0.41%	0.63%
DEEP			0.26%	0.17%	0.28%	0.59%

Error Rates on LD to OF zones, per my script:

	9L	9	89	8R	8L	78	7	7L
SHALLOW	0.00%	0.00%	0.00%	0.00%	0.00%	0.00%	0.16%	0.00%
MID	0.00%	1.27%	0.16%	0.82%	0.00%	0.15%	0.60%	0.00%
DEEP			0.00%	0.52%	0.54%	0.00%

Odd that 8L error rate = 0% whereas 8R error rate is quite high relative.

The Brewster

The Brewster

2058 - Error Ratings and Error Rates

2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Re: 2058 - Error Ratings and Error Rates

Who is online