SITE NEWS: We are moving all of our site and company news into a single blog for Sports-Reference.com. We'll tag all PFR content, so you can quickly and easily find the content you want.
Also, our existing PFR blog rss feed will be redirected to the new site's feed.
Pro-Football-Reference.com ยป Sports Reference
For more from Chase and Jason, check out their work at Football Perspective and The Big Lead.
ESPN attempts to fix the QB Rating formula
Everyone knows that quarterback rating is flawed. Everyone has known this for a long time. But because of the drawbacks to other statistics -- touchdowns, wins, interceptions, yards -- QB rating has persevered as the most mainstream singular statistic for grading quarterbacks. PFR has used Adjusted Net Yards per Attempt; Advanced NFL Stats using Expected Points Added; Football Outsiders uses DYAR. Now, ESPN takes its turn at measuring quarterback play.
The actual formula behind quarterback rating is complicated, but it can be reduced to a simple formula. That's what Bob Carroll, Pete Palmer and John Thorn discovered in their groundbreaking book, The Hidden Game of Football. Essentially, QB rating is equivalent to yards per attempt, but with a 20-yard-bonus for each completion, an 80-yard-bonus for each touchdown, and a 100-yard-penalty for an interception. Such adjustments should seem ridiculous to every reader, which is why everyone finds quarterback rating ridiculous. By way of comparison, PFR's ANY/A formula -- in addition to including relevant data on sacks -- gives no bonus for completions, a 20-yard bonus for touchdowns and a 45-yard penalty for interceptions.
But on Thursday, ESPN released the methodology behind its new QB Rating. And last night, ESPN aired an hour-long segment at 8 PM to discuss the new formula. So how does ESPN's formula look?
There's some good and some bad, which means it has exceeded my expectation. As Jason Lisk said, ESPN will promote it ad nauseum but it should have value. It's not perfect, but it's almost certainly better than the traditional passer rating and possibly the best single statistic out there. Here's my take.
ESPN has essentially limitless resources, which means it can: (1) hire some very good thinkers; (2) hire some very good computer programmers; (3) gain access to thousands of hours of high quality film; (4) hire lots of people to log key data for each play; and (5) discuss ways to tweak the formulas with current and former NFL QBs. On the surface, ESPN would be in perfect position to really knock this one out of the park. They claim to have used these abundant resources to develop this fantastic formula. It improves on QB rating by including data on rushing, sacks, fumbles, drops, and by separating yards gained through the air from yards gained after the catch. The formula also gives more weight to third down- and other high leverage-plays, especially later in close games.
Unfortunately, ESPN is keeping its formula a secret from the public. There are lots of legitimate reasons for doing this, but that decision makes it impossible to fully criticize the ESPN QBR. It's possible that there are serious bugs in the formula, but we have no way of knowing or discovering them. It's hard to get excited about a rating that says "Matt Hasselbeck is better than Sam Bradford because Hasselbeck has a 42.4 rating and Bradford a 41.0." Well, why is Hasselbeck's rating because than Bradford's? Because ESPN says so, of course. That's just not very convincing. For all the flaws in traditional passer rating or any of the formulas I've come up with over the years, you at least know what the flaws are. You can recreate the rankings because you have access to the formula. You can catch errors. You understand why rankings appear the way they do. And you can catch a simple programming mistake that throws off the computer ratings because of human error.
Still, I think the formula will have some utility. ESPN is building on the great work Brian Burke has done on expected points, and that makes sense. The QBR seems lot like what Football Outsiders has done with their method of ranking quarterbacks. I don't think ESPN is just being ESPN here; I think the powers that be have actually spent a bit of time kicking the tires and deciding what works. But too much is left to having faith in ESPN. For example:
The ESPN video tracking has been useful in helping to separate credit in plays like these. We track over-throws, under-throws, dropped passes, defended passes and yards after the catch. The big part was taking this information and analyzing how much of it was related to the QB, the receivers and the blockers. Not surprisingly, pass protection is related mostly to the QB and the offensive line, but yards after the catch is more about what the receiver does. Statistical analysis was able to show this and we divided credit based on those things.
My initial response is: Maybe. I'm certainly not satisfied by ESPN's mandate that YAC belongs mostly to the receiver. For example, what is worth more points in this system: an 8-yard pass to a WR that goes for 15 yards or a screen pass to the WR that goes for 30 yards? Who knows. On a screen pass, we know the QB doesn't receive credit for all the yards gained, but is the QB given credit for zero yards, some percentage of total yards gained, or the amount of yards the average receiver gains on a screen pass? How many yards does a quarterback get for a perfect pass that's dropped? Again, we're in the dark on what ESPN is doing, which makes it difficult to put complete faith in this system.
When I go to a restaurant, I don't need the chef to give me her recipe for me to decide whether or not I like my entree. On the other hand, just because my soda tastes sweet doesn't mean it has sugar. There might be an artificial sweetener in it, instead. If we don't know the formula, how can we understand its flaws? How can we say that it makes sense?
The amount of faith you put in this formula comes down to how much faith you have in ESPN. They're saying all the right things, but the devil is in the details. And we have little to no idea how the details work. I've generally found that when people hide things, they do so for a good reason. I suspect if ESPN released the formula, we would all scratch our heads at some of the ways ESPN calculates its QBR.
Aaron Schatz' biggest complain is the failure to adjust for strength of schedule. My biggest complaint is that this looks to be very retrodictive or explantory statistic but will be used in a predictive manner.
The formula will give a lot of weight to clutch plays; that is helpful to explain what happened. That's why ESPN notes that from '08 to '10, quarterbacks with the higher QBR won 85.7% of games. I'm sure if ESPN threw in "QB kneels at the end of the 4th quarter" in there, and gave it a large enough weight, they could get that number close to 100%. But that just makes it a stat that correlates with winning; it says nothing about it causing winning or how it will help you predict what will happen in the future. While explanatory stats are useful, I like predictive stats a bit more.
No one has ever shown that if QB A plays like an 80 normally, but has played like a 90 late in close games, that QB A is more likely to play like a 90 than an 80 the next time he's playing late in a close game. It hasn't been proven that average quarterbacks who played above average on third downs play above average on future third downs. These high-leverage situations will have a huge impact on the ESPN QBR, but they won't have any value in showing us which quarterbacks are the best. ESPN acknowledged that Matt Ryan looked fantastic in ESPN QBR -- much better than in traditional statistics -- and patted itself on the back for making the QB of the team with the best record in the NFC look like a top-three QB in the league. But guess what: Ryan wasn't that good last year. He was lucky in a lot of ways. And then he played terribly in the Falcons' only playoff game. ESPN's QBR wouldn't have predicted that, because ESPN's QBR was in love with Matty Ice's clutch play.
It's this predictive/explanatory dichotomy that causes Brian Burke to use both Expected Points Added and Win Probability Added to grade QBs. WPA is the explanatory statistic, and Ryan ranked 1st in WPA in 2010. But Ryan ranked 7th in EPA last year, and ranked behind Roethlisberger and Vick in EPA/game. He ranked below league average in Net Yards per Attempt, PFR's preferred predictive stat (whereas ANY/A is more of an explanatory stat). A lower ranking helps to explain why Ryan bombed in the playoffs, and also means I don't expect the Falcons to come close to matching the success they had last year.
I like all the factors that go into the ESPN QBR, as long as we understand that it's explanatory, not predictive. But I'm still disappointed that we have to go on blind faith. On the bright side, ESPN's QBR tastes sweet, but that's as far as we can go. I can't tell you if it's full of natural sugar or something much different.
This entry was posted on Saturday, August 6th, 2011 at 2:37 pm and is filed under General. You can follow any responses to this entry through the RSS 2.0 feed. Both comments and pings are currently closed.

Yeah I finished writing a long one on this today, and had a lot of the same thoughts. Some people are content with one number, but I'd like to see everything that goes into spitting out this magical QBR number. We're never going to see that most likely.
I like that Kurt Warner is #1 in qb rating at the present. Though the new formula may perhaps lower him a bit, it needs to be done so that a player like Daunte Culpepper isn't rated higher than Roger Staubach. Who would you want on your team? Barring ignorance, Roger would be the unanimous choice over Daunte.
Aaron Brooks over Bart Starr? No way. Jim Everett over Johnny Unitas? Even I'm not that biased enough to believe that.
I think there's too much of an obsession with finding one all-encompassing statistic to judge quarterbacks. Especially since there are always variables missing from these "comprehensive" formulas.
As far as this ESPN thing--which is sure to be insufferably shoved down our throats--I'm not sure I can buy a stat that says Matt Ryan was the third best QB last year and Eli Manning was seventh. My sister the Giants fan would laugh hysterically at the latter, having actually watched Eli play(how many times can a girl roll her eyes before they fall out of her head?). That's not to say Eli wasn't top-12, give or take, but top-7? Ahead of Rivers, Roethlisberger and Freeman? Come on.
And Ryan...look, he's good and there's something to say about a guy who doesn't cost his team games. But his passer rating seemed to be more indicative of his position in the league than this QBR. He was near the bottom of the league with 6.5 yards per attempt, thanks mainly to completing only 62.5% of his passes while averaging just 10.4 yards per completion. That jibes with the guy I felt I saw--rarely went downfield, avoided risk, relied on a good running game and defense, took advantage of a soft schedule...just a good player who took nothing off the table but didn't add much to it. When the Falcons got a fight from the Packers in the playoffs and needed Ryan to start making plays, he couldn't do it. He got exposed for what he was while Aaron Rodgers was chucking it and running it all over the field. Matt Ryan might be a franchise QB and it's possible adding Julio Jones will reverse Ryan's incredibly vanilla work the last two years, but last year? I don't know how he could've been better than Rodgers, Vick, Rivers, Roethlisberger, et al.
I think this "comprehensive" stat is no different than others like it--overrates some guys, underrates others, properly rates the rest. The best way to evaluate QBs is still with our eyes.
Two follow-ups:
1) I wonder how they handle Garrard's Hail Mary TD to win the game. Was that the single biggest play of the year by a QB? Surely no other play managed to flip win probabilities like that one did. Or do they take a lot of the value out of that because of the fluky nature of the play. Again, we have no idea what they did, but I'd be curious to hear their reasoning.
2) http://thebiglead.com/index.php/2011/08/06/a-look-at-players-where-espns-qbr-differs-from-passer-rating/ -- another good writeup from Lisk.
Chase, I've always wondered that about the Hail Mary stats for a site like FO as well. They take out Hail Mary INTs, which is fine, but what about the rare instances where the ball is caught for a TD or huge gain? Garrard had the lucky one last year, and I believe Kurt Warner had one before halftime to Fitzgerald a few years ago (maybe against the Saints?). Seems a bit biased to count the ones that are successful and leave out the many that aren't. There's little skill involved with that play other than getting the ball deep against a 3-man rush.
Wait... you mean Kerwin Bell (BellKe00.htm) wasn't the greatest quarterback ever to play the game?
He retired with a perfect 158.3 QB rating, 20.0% TDs, 15.0 YPA, 100% completion with no interceptions...
and, um, 5 career attempts.
A little bit of the methodology explained:
http://espn.go.com/nfl/story/_/id/6833215/explaining-statistics-total-quarterback-rating
Speaking of 'keeping its formula a secret', I don't think FO's DYAR formula is FULLY exposed to others as well. I know they explained how it works on their web site and books, but is it different from what ESPN did for QBR?
Chase,
Thank you for the article. Full disclosure, I am Albert Larcada, one of the 4 creators of QBR at ESPN.
I wanted to address one of your concerns - the David Garrard play.
It is not our top ranked play. In fact, it is no where near the top. Our WP model address these hail marys by looking at the expected the difference of WP on each play, not the actual difference. So, because Garrard's play was not expected to do well, it did not have a high clutch-weight. Plays like 1st and goal from the 1 with one second left down 4 are expected to change the WP a lot, so that would have a huge clutch weight.
The best play in our system is actually the Dennis Dixon rush on 11/29/09 with 6:32 left in the 4th quarter down 4. The best pass play is the Jason Campbell TD pass on 9/14/2008 to take the lead in the 4th quarter. This was a very high air yard pass.
As I'm sure you know, individual plays have a large variance, so I would not use them to rank QB's. Players can fluke their way into a good play, but it's hard to fluke your way into a good game/season. It is interesting nonetheless.
We'd love any other feedback you (or your readers) have.
I honestly think the closest to the best measure is "EPA" at advancedstats.com because it takes numbers put up in garbage time in consideration. It's certainly not perfect, because football is not a number crunching game like baseball but better than a QB rating IMO. Alex Smith puts up average no's according to qb rating and other metrics, but according to EPA he's been one of the worst starting quarterbacks in the league because he's poor in the clutch. I think most of us would agree he is not even an average starting quarterback.
http://wp.advancednflstats.com/playerstats.php?pos=QB
Albert,
Thanks for posting. I imagine scouring the internet to read review of ESPN's QBR is a thankless job. But a couple of thoughts:
I understand how replacing the actual delta with the expected change in WP from play to play will smooth out certain bumps. But is that just used as the clutch modifier? Or is the actual change in WP completely ignored? For example, what about Mark Sanchez' two throws against Houston?
With 24 seconds left, the Jets were down by 4 at the Houston 48. Sanchez threw an absolutely perfect pass -- and mostly air yards -- to Braylon Edwards who got OOB at the 6 with 16 seconds left. That play would seem to be worth more than either of Dixon's or Campbell's big plays. On the next pass, he threw a six yard TD (all air yards) to Holmes, which was also a huge turning point.
What about Roethlisberger's throw against the Packers? Down by 6 with 3 seconds left at the GB 19, Roethlisberger throws a pass into the end zone, caught by Wallace for the GWTD. I would think a play like that would rank very high in the ESPN formula.
Is the clutch rating component separate from the WP change component?
@Albert:
) In other words, to "help smooth out the bumps" of different rating systems.
Thanks for posting. Please pass along the following to ESPN re: your new rating system.
Similar to the "ombudsman" (or whatever your on-staff critique is called), ESPN should assemble a page of FAQ's regarding the QBR--esp. well thought-out questions by intelligent statisticians at competent sites (like this one, Brian Burke, Football Outsiders, et al.) that discuss ESPN's methodology. It would be a good page to put together these questions, and your (and your colleagues') answers to these questions.
For the record, I agree with the tone of the article here. The new QBR has some good, has some bad (maybe retroactively applying opponent adjustments starting mid-season would help, for example), and some "we-don't-know-because-the formula-is secret." (BTW, I understand your proprietary concerns.) Of course, everyone thinks their proprietary stat is the best, and everyone will never be satisfied with any one stat. In fact, I think a better way to rate the QB's might be to combine several good rating systems, similar to the way the BCS combines the computer ranking systems. (Did I just complement the BCS?--maybe I should find a better example.
Again, Mr. Larcada, thanks for your answer above, and I hope that your bosses will allow you guys to follow my suggestion as well.
Just out of curiosity, does anybody know if ESPN's fantasy football commissioner site now offer a QBR stat?
QBR is Joke. Football is not baseball. You can't quantify clutch, if the Wr ran the wrong route, etc. ESPN doesn't know if Reggie Wayne ran the wrong route in the Super Bowl or if Manning made a bad read. And it's a freaking Joke how they won't show the formula. Let me ask you a question, what would you say if the President changed the entire tax code and said you have to pay X amount of Dollars, but were not going to give you the new rules for our tax formula. If you say OK Mr. President than you are an idiot and you will love the new QBR.