SITE NEWS: We are moving all of our site and company news into a single blog for Sports-Reference.com. We'll tag all PFR content, so you can quickly and easily find the content you want.
Also, our existing PFR blog rss feed will be redirected to the new site's feed.
Pro-Football-Reference.com » Sports Reference
For more from Chase and Jason, check out their work at Football Perspective and The Big Lead.
Ten thousand 2005s
Prerequisite reading material:
How often does the best team win?
In the previous posts, I simulated ten thousand generic NFL seasons. In some of those seasons the "Seattle Seahawks" were great. In some they were terrible. In some they played a tough schedule, in others an easy one. In this post, I'll simulate ten thousand 2005 NFL seasons. The Seattle Seahawks will be a very good team in each of them, and they will play an easy schedule in each of them.
Mechanically, the procedures are similar, but philosophically there is a world of difference. The generic seasons had teams whose strengths I knew, so I could say things like "the best team" and "Chicago was not very good." I knew who the best team was and I knew how good Chicago was or wasn't. Exactly. Only because I knew those team strengths could I assign the proper probabilities to each game.
But if I want to simulate the 2005 season, I've got a problem: I don't know the team strengths. Neither do you. We have to guess. The guess I'm going to use is the team's rating from the simple rating system. I'm not going to spend time here making a case that that's the best guess or even necessarily a good guess. If you don't think the simple rating system is an adequate representation of team strength, that's fine. No hard feelings. But you'd better stop reading now, because that's the foundation this post rests on.
For those still with me, I'll make one more disclaimer. If I happen to say something like:
Seattle was the 4th-best team in football.
What I actually mean is:
According to the measurement of team strength that we have agreed upon --- which we acknowledge is imperfect in some obvious and some non-obvious ways --- Seattle appears to be the 4th-best team in football.
I am not trying to quash discussion of the merits of the various ways of estimating team strength and I am well aware of the weaknesses of the one I have chosen. But we've got to pick something and go with it, and the prose just seems to flow a bit better if you allow me to use the above shorthand notation. As you know, I can use all the help I can get with making the prose flow.
Now let's get to it. I'll just throw this summary out and then we'll discuss it.
Rating is the team's rating, which is my guess as to its true strength. Avg Wins is the average number of wins each team had over the course of the 10,000 seasons. Div is the number of division titles each team won. WC is the number of times each team got into the playoffs as a wildcard. PO = Div + WC. It is the number of times each team made the playoffs. SB is the number times each team made it to the Super Bowl and Champ is the number of times they won it.
TM Rating AvgWins Div WC PO SB Champ
=====+=========+========+================+===========
ind | 10.8 | 11.2 | 7128 1572 8700 | 2688 1640
sea | 9.1 | 11.1 | 8936 395 9331 | 3461 1780
car | 5.1 | 10.4 | 6304 1818 8122 | 1681 741
den | 10.8 | 10.4 | 4342 2797 7139 | 1825 1092
pit | 7.8 | 10.3 | 5741 1543 7284 | 1469 778
nyg | 7.5 | 10.1 | 5083 2534 7617 | 1785 817
sdg | 9.9 | 9.9 | 3190 2907 6097 | 1343 797
jax | 4.8 | 9.6 | 2727 2951 5678 | 674 321
kan | 7.0 | 9.4 | 2298 2842 5140 | 737 371
cin | 3.8 | 9.3 | 3015 1974 4989 | 516 242
was | 6.0 | 9.2 | 2986 2765 5751 | 989 416
chi | 1.4 | 9.1 | 5653 793 6446 | 721 256
nwe | 3.1 | 8.7 | 5001 476 5477 | 425 194
tam | -1.0 | 8.5 | 1969 2333 4302 | 315 103
dal | 3.2 | 8.3 | 1552 2249 3801 | 409 166
atl | -1.2 | 8.2 | 1652 2122 3774 | 236 73
mia | -0.8 | 8.0 | 3385 481 3866 | 165 52
rav | -1.8 | 7.4 | 773 829 1602 | 61 22
min | -3.5 | 7.3 | 1864 774 2638 | 113 36
gnb | -3.7 | 7.1 | 1616 755 2371 | 93 29
ram | -5.1 | 6.9 | 528 1013 1541 | 59 10
cle | -4.2 | 6.8 | 471 518 989 | 32 9
crd | -5.0 | 6.7 | 481 884 1365 | 46 9
phi | -2.3 | 6.6 | 379 878 1257 | 61 16
rai | -2.8 | 6.3 | 170 427 597 | 22 9
det | -6.7 | 6.3 | 867 417 1284 | 27 6
buf | -5.8 | 6.2 | 889 179 1068 | 20 7
nyj | -6.4 | 6.0 | 725 136 861 | 18 6
oti | -7.6 | 5.8 | 108 256 364 | 4 2
htx | -10.0 | 5.1 | 37 112 149 | 1 0
nor | -11.1 | 4.9 | 75 139 214 | 4 0
sfo | -11.1 | 4.7 | 55 131 186 | 0 0
Indianapolis averaged 11.2 wins per season in the simulation. They won the AFC South 71.2 percent of the time, they made the playoffs 87% of the time, they made it to the Super Bowl about 27% of the time and won it 16.4% of the time.
If you were to translate this into an English sentence, it would not be: at the beginning of the season, we should have estimated that the Colts had a 16.4% chance of winning the Super Bowl. It would be something more like: knowing what we now know in hindsight about how good these teams were in 2005, if we were to play the season again with those strengths remaining the same, the Colts would have a 16.4% chance of winning the Super Bowl. Alright, that's pretty bad English but I hope you get the point.
The probability of winning the Super Bowl depends two things: the team's strength and their schedule (including the playoff schedule). You can see the effect of both in the table. Denver and Indianapolis were essentially equally strong, but the Colts' chances of winning the Super Bowl were significantly higher. And Seattle's were even higher, despite being a weaker team. Carolina had a title chance that was disproportionately high (compared to their true strength) and San Diego's was disproportionally low. We'll revisist them in a moment.
Also note that the spread on average wins --- from Indy's 10.8 to Houston's 4.7 --- is much smaller than the spread on actual wins in the 2005 season. This makes sense. I think it's safe to say that there is almost never an NFL team that is morally a 14-2 team or a 2-14 team. There are, though, probably three or four teams each year --- maybe more --- that are capable of going 14-2 if things break right for them, and there are another few that might slip to 2-14 if things don't. And the result is that we see 14-2 teams and 2-14 with some regularity. This idea might strike some people as controversial, but it's really no different from pointing out that no basketball player truly is a 50-point-per-game player even though certain players do score 50 from time to time.
OK, time to play god. Let's move the Chargers to the NFC South and the Panthers to the AFC West and see what happens.
TM Rating AvgWins Div WC PO SB Champ
=====+=========+========+================+===========
sdg | 9.9 | 11.7 | 8209 1158 9367 | 3344 1790
clt | 10.8 | 11.3 | 7255 1615 8870 | 2881 1610
sea | 9.1 | 11.1 | 8921 398 9319 | 2879 1520
den | 10.8 | 10.6 | 5370 2328 7698 | 2134 1196
pit | 7.8 | 10.4 | 5795 1684 7479 | 1563 787
nyg | 7.5 | 10.1 | 5063 2508 7571 | 1478 727
jax | 4.8 | 9.7 | 2592 3360 5952 | 731 317
kan | 7.0 | 9.6 | 2980 2784 5764 | 902 441
was | 6.0 | 9.3 | 3015 2879 5894 | 827 366
cin | 3.8 | 9.3 | 2979 2222 5201 | 570 256
chi | 1.4 | 9.0 | 5504 754 6258 | 522 184
nwe | 3.1 | 8.7 | 4984 487 5471 | 476 195
car | 5.1 | 8.5 | 1385 2076 3461 | 388 179
tam | -1.0 | 8.3 | 979 2862 3841 | 173 63
dal | 3.2 | 8.3 | 1530 2287 3817 | 306 138
atl | -1.2 | 8.0 | 782 2516 3298 | 147 38
mia | -0.8 | 8.0 | 3298 551 3849 | 180 56
rav | -1.8 | 7.4 | 778 960 1738 | 62 23
min | -3.5 | 7.3 | 1933 661 2594 | 88 19
gnb | -3.7 | 7.0 | 1681 634 2315 | 86 22
ram | -5.1 | 6.9 | 559 1011 1570 | 36 6
cle | -4.2 | 6.8 | 448 593 1041 | 25 6
phi | -2.3 | 6.7 | 392 924 1316 | 49 19
crd | -5.0 | 6.6 | 458 793 1251 | 37 12
rai | -2.8 | 6.5 | 265 524 789 | 33 10
det | -6.7 | 6.3 | 882 369 1251 | 28 6
buf | -5.8 | 6.2 | 935 222 1157 | 32 10
nyj | -6.4 | 6.0 | 783 152 935 | 18 2
oti | -7.6 | 5.9 | 113 315 428 | 4 1
htx | -10.0 | 5.1 | 40 127 167 | 1 1
nor | -11.1 | 4.8 | 30 133 163 | 0 0
sfo | -11.1 | 4.7 | 62 113 175 | 0 0
Interesting.
This entry was posted on Tuesday, June 6th, 2006 at 4:05 am and is filed under Statgeekery. You can follow any responses to this entry through the RSS 2.0 feed. Both comments and pings are currently closed.

Open letter to NFL commissioner:
--------------------------------------
Dear Commissioner,
Please realign the league in the following way: move the Chargers to the NFC South and the Panthers to the AFC West. Thank you in advance for your prompt action on this request.
Sincerely,
Bills Fans
--------------------------------------
Seriously, the inner dynamics of the impact of the schedule and division and conference alignment are pretty amazing. This experiment is producing tons of intriguing data and knowledge that otherwise would not be known. Well, now I guess it's time to check my email!
I don't recall exactly how the TRUE TEAM STRENGTH rating was established, but I recall that the performance of the team on the field made some contribution to this rating. Switching Chargers and Panthers means different division schedules, different conference schedules and different cross-conference schedules. Would the ratings have ended up where they did for these two teams under very different schedules? Likely not.
(h/t Bill M.) Dear Commissioner,
Please realign the league in the following way: eliminate all divisions and conferences. Have each team play 16 different teams during the year, with a range of strengths (based on previous season's record). Skim off the top 16 records (with suitable tie-breakers) and play 'em off. Last one standing is SB champ.
Sincerely,
NFL fan.
The league is too bloody provincial! It's not one league, it's a bunch of little leagues sporting the appearance of a "national" league.
Here it is, Doug, my tinfoil hat, out in the open for all to see.
This is very interesting stuff.
cheers,
wthii
Lets do an ADP analysis. Lets each do a mock draft of all 32 teams and then assign a strength based on ADP. I wonder how many times the Cowboys will go 0-16 under this analysis.
Responding to wthii: Lets just forgoe the season and have a 32 team playoff bracket. Why skim to 16??????
It's good to see that these speculative simulations more or less confirm our intuitions, sometimes quite emphatically.
A minor quibble: could you please use a consistent team naming convention, such as what NFL.com currently uses? In the final table I was looking for Indianapolis and found that "ind" had completely dropped off the chart! But a surprisingly good newcomer took its place: "clt".
I'm a bit confused by the 3 letter abbreviations. Is 'oti' the Titans? Should 'clt' in the second table be 'ind'? Also, I disagree with wthii (when I started writing this I agreed then changed my mind) swapping the Chargers and Panthers and re-running the numbers really does make sense, since the "Rating" is defined as the True (the actual, absolutely correct) Team Strength each team would achieve that "Rating", albeit in a different way (stronger schedule-lose more, weaker schedule-win more), no matter who they played.
"I don’t recall exactly how the TRUE TEAM STRENGTH rating was established"
The rating was a linear regression of the team's margin of victory versus opponents. I.e. it explicitly tried to be schedule neutral by statistically nullifying the schedule.
Now, would it be the same if you switched division alignments? No, it's an imperfect rating. However, it did make a serious effort to take strength of schedule out. This is as accurate a use of it as anything.
wthii, I anticipated that objection. Matt and ccw did a nice job of saying the same thing I would have said in response to it. The rating does make an effort to be schedule-neutral. Whether it actually does so is debateable, but the effort was made (and I don't know how to make a better effort).
ccw and Waverly, apologies for the cryptic abbreviations. A database geek with a knowledge of NFL history might be able to deduce that 'clt' is the key belonging to the Baltimore/Indianapolis Colts franchise. Johnny Unitas and Peyton Manning both played for 'clt'. But I also have a table that takes a franchise and a year and gives me back a nice abbreviation. E.g. 'clt1961' --> 'bal', but 'clt2005' --> 'ind'. (If I used any incorrect terminology in the above explanation, it's because I'm not a database geek.) Anyway, sometimes I forget to take that last extra step.
oti means doug can't get over that the titans don't play in houston anymore.
i'm thinking we should call the cardinals chistlphari and the how about the raiders the oaklaoak here are some others....
dka
sbe
Interesting indeed.
I think you meant "from Indy's 11.2 to San Francisco's 4.7".
This is a fascinating demonstration of how the division alignments and schedule strength affect who wins the Super Bowl. And given that there is some year-to-year carry over of true team strength, you could probably say that San Diego would be more likely to win the Super Bowl next year if it moved to the NFC South. That could be considered a short-term unfairness.
But I wonder if there's anything inherent to the current alignment long-term that makes some divisions and conferences strong/weak such that some teams will always have an advantage or disadvantage. Doesn't it seem like the AFC West is always strong? I suspect there are some complex economic, geographical, and social factors at work here.
Indeed I did. I guess I was looking at the wrong column.
I can see how scrapping divisions might be a "fairer" measurement of which team is best, but do we REALLY want to throw them out. Don't we get excited for Cowboys-Redskins, Bears-Packers, Broncos-Raiders, Steelers-Browns, etc.? I realize that some divisions (NFC West) don't have a ton of historic rivalries, but if, for example, Arizona and Seattle go down to the wire for the division crown three years in row, rivalries will develop.
Hi Monkeytime --
We could do a post season 32 team playoff. But I'd like for the regular season to have some impact on which teams make the playoffs. Call me old fashioned.
Eight teams in the playoffs seems like not enough to me. And, as Doug has indicated to me, teams with byes are more likely to win. Is it because they're better teams? I submit his current articles suggest that's not the case. So, I don't like byes, meaning 12 is out. In fact, it means any number not a positive integer power of 2 is out. Sixteen is the next stop.
Does that seem like too many? Like, perhaps, the input from the regular season is rendered less important? My solution (which the NFL seems poised to follow anyway) is to add more teams to the league.
cheers,
wthii
Hey Vince,
I agree completely with your last sentence, and that's what caused me to come up with this crazy scheme. Competition makes football interesting. Rivalries come and go. Hyping rivalries between weak teams may sell stadium seats, but it makes for a lousy football game. Hyping outdated rivalries is almost as bad as hyping league rivalry (das uber bowl) when the merger is almost 40 years old. It's balding and has a gut!
If there are natural rivalries, divisions ought to reflect this. Isn't there a natural rivalry between Cleveland and Detroit? Cincinnati and Indianapolis? San Francisco and Oakland? Which would you have rather watched last season, Chicago v. Green Bay, or Pittsburgh v. Denver? Right now, the latter is a better rivalry because it's a more competetive contest.
That's my schtick. But, be not deceived! I harbor no illusions that anyone will ever agree with me (though Doug is kind to humor these notions -- and tolerate these posts).
cheers,
wthii
Thinking outside the box a bit, it'd be neat to have a 31 week round robin season. It's kinda asinine to play some teams twice a year and others almost never. Jim Kelly never played against the Chargers. They simply weren't on the Bills preseason, regular season, or post season schedules between 1986 and 1996, inclusive. Yet, he played the Colts about 600 times over the same period. The Bucs never played a single snap in Ralph Wilson Stadium. There are a bunch of similar anomolies across the league. A round robin would be good for football, as it's every team for itself... no more advantages or disadvantages due to alignment. Sure, 31 weeks is a lot of weeks, but so what? If every team played 3 games a month for 10 months, it'd work. Top 2 teams play in the Super Bowl.
I know my idea probably sucks, but I still like it
It'd be great, but unfortunately I just don't think the players' bodies would be able to stand up to a 31 game season. They struggle with 16-20, frankly.
playoffs strictly to the power of two? More teams?? You don't want the NFL - you want college football it has the best of both worlds for you - over 100 teams and a playoff bracket of exactly two teams, no more no less. May I also strongly recommend the sport of college basketball to you as well, it has so much symmetry even chicks can get into it from a march to march basis.
Bill M. I love the idea - but i am thinking of expanding the round robin 31 teams: after which we eliminate one team (the worst) followed by a round robin of the 31 remaning teams (30 games), again eliminating one team, etc. I am thinking we can get the 2006 season done in about 496 weeks, not including bye weeks of course. We might have to add a few weeks if we want to make it double-elimination. Can we have all the games in Omaha?
"I think it’s safe to say that there is almost never an NFL team that is morally a 14-2 team or a 2-14 team."
I agree, but I disagree with the following sentances. Saying that if things break right there are 14-2 teams and if they break wrong there are 2-14 teams. I think there is usually a "best" and "worst" team or teams. And therefore are 16-0 teams who get bad breaks and end up 14-2, and 0-16 teams who get good breaks and end up 2-14. Am I the only one who thinks this way?
This was a great series of articles. I'm disappointed I missed out on the initial discussion. I have written a similar simulator that does include the NFL tiebreakers. I get very similar predictions using my software. I normally do 5000 simulations. I have the Titans and 49ers each winning the SB multiple times in the 2006 season, based on 5000 simulations.
The goal of my simulator is to provide estimates of the chances of the various teams of making the playoffs during the NFL season. The software has a GUI and is free to anyone to use. Have fun.
[...] A few weeks ago, footballoutsiders linked to my ten thousand seasons series. A general theme among the comments was that the simulation was inaccurate because it was based on season-long power ratings instead of last-few-weeks power ratings. Because teams’ true strengths vary so much during the course of a season, I should have used a smaller but more recent sample instead of using all the data. Less is more. I thought about that for awhile and pondered the possibility that those folks might have a good point. [...]