SITE NEWS: We are moving all of our site and company news into a single blog for Sports-Reference.com. We'll tag all PFR content, so you can quickly and easily find the content you want.
Also, our existing PFR blog rss feed will be redirected to the new site's feed.
Pro-Football-Reference.com ยป Sports Reference
For more from Chase and Jason, check out their work at Football Perspective and The Big Lead.
Are all 10 point leads the same?
Over at the Advanced NFL Community, the claim was made that not all 10 point leads are the same. Here's the meat of the argument:
Let's say your team is up by 10 with 8 minutes remaining. That's a pretty good lead, right? Well, if it's a 13-3 game, it's a VERY good lead -- if their poor opponents couldn't muster more than a field goal in 52 minutes, they're unlikely to close a 10 point gap now. On the other hand, if it's a 42-32 shoot-out, it's still anybody's ballgame[.]
That sounds reasonable, but whenever I hear arguments like this I always raise an eyebrow and wonder if common perception matches reality. After all, ten points is ten points, and maybe it's equally likely that a team trailing 13-3 will win its game compared to a team trailing in a shootout. A touchdown and a field goal can be scored by almost any team in a short time, and the small sample size of less than one game may not be that telling of the team's offensive prowess.
So how do we measure this?
I checked every NFL game from 1993-2007 and noted the scores at the end of the third quarter and the end of the fourth quarter. There were 299 games in which there was a 10 point lead heading into the fourth quarter. Some games were high scoring, like the 2006 Chargers-Bengals game where San Diego came back to win after trailing 38-28; some games were low scoring, like the 1999 Cowboys-Eagles game where Philadelphia won after trailing 10-0 at the start of the fourth and the, uh, 2000 Cowboys-Eagles game where Philadelphia won after trailing 10-0 at the start of the fourth. So this gives us a nice sample to view how likely or unlikely a given team is to come back from a 10 point deficit.
Teams trailing by 10 had a 47-252 record in those games, an .157 winning percentage. So is a 38-28 lead less safe than a 17-7 lead? There are several ways to measure this. I checked the correlation coefficient of points scored in the game through three quarters by the trailing team and number of wins in the game by the trailing team. If high scoring games are more likely to have those late lead changes, the CC should be closer to 1 (more points, more wins). If low scoring games are likely to produce late drama, the CC should be closer to -1. It turns out the CC was 0.07, which isn't large in either direction. However, a more detailed breakdown gives us a more interesting result:
#gm #win win %
20+ 17 5 0.29
15-17 17 3 0.18
13-14 56 7 0.13
10-12 52 10 0.19
6-9 67 10 0.15
0-3 90 12 0.13
This means that teams that had scored 20 or more points through three quarters but trailed by 10 points won five of their seventeen games. Not exactly known for their offensive prowess, the Ravens have two of those games, with Jim Harbaugh leading a comeback against the Colts and Anthony Wright thwarting the Seahawks. There looks to be at least some small benefit to trailing in a high scoring game -- the teams that scored more than two touchdowns through three quarters won almost a quarter of their games while teams that hadn't scored more than a field goal won only two in every fifteen games.
I also looked at the significance of being at home. For example, is a 20-10 lead by the host tougher to overcome than a 20-10 lead by the visitor? We would expect it to be, although Doug and JKL have shown that home field advantage isn't worth that much by the fourth quarter. Unsurprisingly, a majority of the teams that are up by 10 through three quarters are the host -- 136 of the trailing teams were at home, 162 of the trailing teams were on the road, and 1 game was at a neutral site. The correlation coefficient between being at home and winning the game was 0.19, which signals a weak connection. However, home teams went 32-104 (23.5%) while road teams went 15-147 (9.3%) so being at home greatly improves your odds of winning that game, even if overall you're pretty unlikely to win.
I then ran a regression using home/road as one variable (home = 1, road = 0) and the trailing team's score after three quarters as the other input. Because the output is binary (win/lose), we should use a logit regression and not a linear regression. The logit regression formula that best fit the data was:
1 / (1 + e^(2.53 - 1.22*HOME - .025*3QPTS))
While the home variable was significant (0.0005), the 'points through three quarters by the trailing team' variable was not statistically significant (0.39). If it *was* statistically significant, how meaningful would it be practically? A road team trailing 10-0 would be expected to win just 7.4% of the time; if that team was instead losing 30-20, its odds of winning would jump to 11.6%. A team losing 13-3 at home would win 22.5% of the time while a host losing 27-17 would win 29.2% of the time.
So a trailing team (through three quarters) in a very high scoring game would be slightly more likely to win than a trailing team in a really low scoring game. The points variable was not statistically significant so we can't be confident in this based on the data, although conventional wisdom seems to support this. While you might prefer your team to be trailing 30-20 through three quarters, I'd much rather my team be at home down 13-3 then on the road down 30-20.
But there's another thing we could look at. If we run a logit regression using the same 299 games, and using as variables "points through three quarters" and the "vegas line", we get this:
1 / (1 + e^(1.75 - .016*3QPTS + .155*VEGAS))
where VEGAS is negative for the favorite. We don't need a home field variable here because the point spread would have already taken the location of the game into account. The '3QPTS' is still nowhere near significant while VEGAS is massively significant (.000005).
Here are some sample scenarios -- do these seem right to you?
7-point favorite, trailing 10-0: 34%
7-point dog, trailing 10-0: 5.5%
7-point favorite, trailing 30-20: 41.5%
7-point dog, trailing 30-20: 7.5%
So we know that the point spread of the game is a good indicator of a team's likelihood of winning even after three quarters, and to a lesser extent, home field advantage is also a good indicator as well. The number of points scored through three quarters was not shown to be an indicator of eventual success. That said, if you believed that to be true before reading this, nothing shown here would convince you to think it's better to be trailing in a low scoring game. If there is any real difference in likelihood of winning, it's more likely that a team is better to be trailing in a high scoring game, but the effect is certainly very small.
This entry was posted on Thursday, January 29th, 2009 at 8:13 am and is filed under General. You can follow any responses to this entry through the RSS 2.0 feed. Both comments and pings are currently closed.

This is EXACTLY why I visit here 2 or 3 times a day!!
Outstanding piece - I like when it gets math-heavy!
And yes, the Vegas-probabillities seem about right to me... Maybe the 7 point-favorite 30-20 is a bit high? But nothing earthshaking.
Good stuff. I think the flip side to the argument about high scoring games would be that my team might be more likely to score the 10 points, but it's also more likely to give up some more points in the final 8 minutes.
But why stop at 10-point margins? Why not look at all 2-score margins after 3 quarters, like any games with a lead of between 9 and 16 points?
Isn't this what you'd expect given the recent study on how the first three quarters is less accurate at predicting fourth quarter performance than overall team strength (as represented by the vegas line), though it isn't completely useless. This seems to confirm that scoring pace through three quarters isn't nearly as important as which team is actually better.
I think the analysis should be expanded to look at all competitive 4th quarter games, with either a logit regression using the two variables above and the end of third quarter margin or a linear regression predicting the scoring difference in the fourth quarter using the same three variables. You've essentially done the second analysis for "score tied after 3Q" and the first for "score diff =10 after 3Q". Seems like slices of the same question to me...
This is very useful (to me anyway!) It tells me where I need to work on the real-time WP model. Basically I should focus on adjustments for team strength first, then on home field, and lastly on scoring "pace." It also tells me that if you're an inferior team, you need a lot more than a 10-point lead to begin the 4th quarter before relative team strength stops being a significant factor.
One nitpick is that a logit regression across score differences (leads) is much trickier than this. For example, a 7-point lead is not 1/6th better than a 6-point lead. Late in a game, a TD will beat a 6-point lead, but with a 7-point lead would still give you about a 50/50 WP. But I think what you did is sufficient, at least for qualitative inferences.
Very interesting study. However, is there a significant difference between a 10 point game with 15 minutes left vs. one with 8 minutes remaining?
Chase--one thing that you didn't take into account that may influence some of the percentages you gave--game situation. If one of the teams has the ball in a goal-to-go situation entering the 4th, this is going to DRASTICALLY influence who will win that particular game. Now I understand in your 299-game sample, this situation may not be signficant, as there is likely enough balance on both sides of the equation.
As to the difference of points scored in the game, what a large study can't show is this: If the Ravens D has held the Colts O to 3 points through 3Q, and they are winning 13-3, I'm betting that Ravens fans are still worried about that lead. If they have held the Bills to 3 points through 3Q, then they are feeling more comfortable. Conversely, if your "average" D has played great through 3Q, you're not comfortable because you have seen your team give up points quickly--they've just played better than average so far, and you just KNOW that in the 4th Q they will give up 17 points and 150 yds so that they will maintain their "averageness."
I also maintain that how those points are scored/prevented makes a difference. A couple of weeks ago, I was sure that the Titans would come back against the Ravens--they were driving pretty well against them, and couldn't CONTINUE to make mistakes in the red zone, could they? And yet, the storyline was that the Titans lost three? scoring opportunities, and thus came up on the short end of the stick.
I guess my argument boils down to this: if the trailing team has a good offense, they probably prefer the 13-3 score: the D is playing good, and they "feel" that they have a good chance to get the 10 points because they have had turnovers/dropped passes/not executed/etc. If they have a good D, they prefer a higher score--they may have given up a non-offensive TD, and figure that the D can get 1 or 2 stops so that the offense can tie it up or win the game.
(My Saints could tie it up, but the defense would surrender the lead--see the last game of the season against Carolina).
Brian, he didn't run a logit across score differences. All the games in the sample were 10-point leads.
Ok, I see.
Joseph-I think your points about different mixes of offensive and defensive strength are accounted for by the Vegas spread variables.
Perhaps the next step is to ask whether all 10 point wins are the same. This question came up here in our discussion of the maximum likelihood model a couple years ago.
Would it be possible for you to share the data you used in this analysis?