How Our NFL Predictions Work (2024)

The Details

FiveThirtyEight has an admitted fondness for the Elo rating — a simple system that judges teams or players based on head-to-head results — and we’ve used it to rate competitors in basketball, baseball, tennis and various other sports over the years. The sport we cut our teeth on, though, was professional football. Way back in 2014, we developed our NFL Elo ratings to forecast the outcome of every game. The nuts and bolts of that system are described below.

Game predictions

In essence, Elo assigns every team a power rating (the NFL average is around 1500). Those ratings are then used to generate win probabilities for games, based on the difference in quality between the two teams involved, plus adjustments for changes at starting quarterback, the location of the matchup (including travel distance) and any extra rest days either team had coming into the contest. After the game, each team’s rating changes based on the result, in relation to how unexpected the outcome was and the winning margin. This process is repeated for every game, from kickoff in September until the Super Bowl.

For any game between two teams (A and B) with certain pregame Elo ratings, the odds of Team A winning are:

\begin{equation*}Pr(A) = \frac{1}{10^{\frac{-Elo Diff}{400}} + 1}\end{equation*}

EloDiff is Team A’s rating minus Team B’s rating, plus or minus the difference in several adjustments:

A home-field adjustment of roughly 48 points, depending on who was at home, plus 4 points of Elo for every 1,000 miles traveled. The exact amount is based on a rolling 10-year average of home-field advantage, and changes during the season.Model tweak
Sept. 6, 2022 This means the New York Giants get about a 48-point Elo bonus when “hosting” the Jets (despite both teams calling MetLife Stadium home), while the New England Patriots would get roughly 58-point Elo bonus when, say, the Los Angeles Chargers come to visit. There is no base home-field adjustment for neutral-site games such as the Super Bowl¹ or international games, although the travel-distance adjustment is included for the Super Bowl. If games are played without a significant number of fans in attendance, the base home-field advantage will be reduced to 33 points.If games are played without a significant number of fans in attendance, the base home-field advantage will be reduced to 33 points.Model tweak
Sept. 6, 2020
A rest adjustment of 25 Elo points whenever a team is coming off of a bye week (including when top-seeded teams don’t play during the opening week of the playoffs). Our research shows that teams in these situations play better than would be expected from their standard Elo alone, even after controlling for home-field effects.
A playoff adjustment that multiplies EloDiff by 1.2 before computing the expected win probabilities and point spreads for playoff games. We found that, in the NFL playoffs, favorites tend to outplay underdogs by a wider margin than we’d expect from their regular-season ratings alone.
A quarterback adjustment that assigns every team and each individual QB a rolling performance rating, which can be used to adjust a team’s “effective” Elo upward or downward in the event of a major injury or other QB change. (See below for more details about how this adjustment works.)

We also tested effects for weather and coaches (including both head coaches and coordinators) but found that neither improved the predictive value of our model in backtesting by enough to warrant inclusion.

Fun fact: If you want to compare Elo’s predictions with point spreads like the Vegas line, you can also divide EloDiff by 25 to get the spread for the game. Just be sure to include all of the many adjustments above to get the most accurate predicted line.

The quarterback adjustment

New for 2019,New feature
Sept. 3, 2019 we added a way to account for changes in performance — and personnel — at quarterback, the game’s most important position. Here’s how it works:

Both teams and individual quarterbacks have rolling ratings based on their recent performance.
- Performance is measured according to “VALUE,” a regression between ESPN’s Total QBR yards above replacement and basic box score numbers (including rushing stats) from a given game, adjusted for the quality of opposing defenses.
  - The formula for VALUE is: -2.2 * Pass Attempts + 3.7 * Completions + (Passing Yards / 5) + 11.3 * Passing TDs – 14.1 * Interceptions – 8 * Times Sacked – 1.1 * Rush Attempts + 0.6 * Rushing Yards + 15.9 * Rushing TDs.³
  - This metric is also adjusted for opposing defensive quality by computing a rolling rating for team QB VALUE allowed, subtracting league average from the VALUE an opponent usually gives up per game, and using that to adjust a QB’s performance for the game in question. So for example, if a team usually gives up a VALUE 5 points higher than the average team, we would adjust an individual QB’s performance downward by 5 points of VALUE to account for the easier opposing defense.
- For individual QBs, the rolling rating is updated every 10 games. (i.e., Rating_new = 0.9 * Rating_old + 0.1 * Game_VALUE ).
- For teams, the rolling rating is updated every 20 games.
  - This implies that short-term “hot” and “cold” streaks by individual QBs have predictive value, which can trigger a nonzero pregame QB adjustment even when a team has had the same starter for each of its previous 20 games.
- The rolling rating represents the VALUE we’d expect a quarterback (whether at the individual or team level) to produce against a passing defense of average quality in the next start. To convert between VALUE and Elo, the rolling rating can be multiplied by 3.3 to get the number of Elo points a QB is expected to be worth compared with an undrafted rookie replacement.
The quarterback Elo adjustment is applied before each game by comparing the starting QB’s rolling VALUE rating with the team’s rolling rating and multiplying by 3.3.
- For example: When Aaron Rodgers was injured midway through the 2017 season, he had a rolling VALUE rating of 66. The Green Bay Packers’ team rolling VALUE rating was 68, and backup Brett Hundley had a personal rating of 14. So when adjusting the Packers’ Elo for their next game with Hundley starting instead of Rodgers, we would have applied an adjustment of 3.3 * (14 – 68) = -176⁴ to Green Bay’s base Elo rating of 1586 heading into its Week 7 game against the Saints. This effectively would have left the Packers as a 1409 Elo team with Hundley under center (before applying adjustments for home field, travel and rest), dropping Green Bay’s win probability from 63 percent to 39 percent for the game despite playing at home. In cases like these, the QB adjustment can have a massive effect!

The average team QB VALUE rating going into the 2019 season was about 49.5 (or about 163 Elo points), a leaguewide number that has increased substantially over the history of the NFL as passing has become more prevalent and efficient. So a rolling rating that would have made a QB one of the best in football in the 1990s would rank as only average now, even though the zero-point in our ratings remains the replacement-level performance of an undrafted rookie starter.

One last note on these ratings involves how they are set initially. We’ll explain preseason team Elo ratings below, but here is how preseason ratings are set for the quarterback adjustment:

Pregame and preseason ratings

So all of that is how Elo works at the game-by-game level and what goes into our quarterback adjustments. But where do teams’ preseason ratings come from, anyway?

We use two sources to set teams’ initial ratings going into a season:

At the start of each season, every existing team carries its Elo rating over from the end of the previous season, except that it is reverted one-third of the way toward a mean of 1505. That is our way of hedging for the offseason’s carousel of draft picks, free agency, trades and coaching changes. We don’t currently have any way to adjust for a team’s actual offseason moves, aside from changes at quarterback, but a heavy dose of regression to the mean is the next-best thing, since the NFL has built-in mechanisms (like the salary cap) that promote parity, dragging bad teams upward and knocking good ones down a peg or two.
For seasons since 1990, we also use Vegas win totals to help set preseason Elo ratings, converting over-under expected wins to an Elo scale. (This addition to the model helped significantly improve predictive accuracy in backtesting, by a little more than half the improvement that adding the QB adjustment did.) As a side note, this is partly why we mix the projected starting QB’s rolling rating into the preseason team QB rating — we assume that changes at quarterback are “baked into” Vegas over/unders and must be adjusted for to avoid double-counting the improvement added by an upgrade at QB.

These two factors are combined, with one-third weight given to regressed Elo and two-thirds weight given to Vegas-wins Elo. This blend is what forms a team’s preseason Elo rating.

Note that end-of-season ratings from the previous year are for “existing” teams. Expansion teams have their own set of rules. For newly founded clubs in the modern era, we assign them a rating of 1300 — which is effectively the Elo level at which NFL expansion teams have played since the 1970 AFL merger. We also assigned that number to new AFL teams in 1960, letting the ratings play out from scratch as the AFL operated in parallel with the NFL. When the AFL’s teams merged into the NFL, they retained the ratings they’d built up while playing separately.

For new teams in the early days of the NFL, things are a little more complicated. When the NFL began in 1920 as the “American Professional Football Association” (they renamed it “National Football League” in 1922), it was a hodgepodge of independent pro teams from existing leagues and opponents that in some cases were not even APFA members. For teams that had not previously played in a pro league, we assigned them a 1300 rating; for existing teams, we mixed that 1300 mark with a rating that gave them credit for the number of years they’d logged since first being founded as a pro team.

\begin{equation*}Init Rating = 1300\times\frac{2}{3}^{Yrs Since 1st Season} + 1505\times{(1-\frac{2}{3})}^{Yrs Since 1st Season}\end{equation*}

This adjustment applied to 28 franchises during the 1920s, plus the Detroit Lions (who joined the NFL in 1930 after being founded as a pro team in 1929) and the Cleveland Rams (who joined in 1937 after playing a season in the second AFL). No team has required this exact adjustment since, although we also use a version of it for historical teams that discontinued operations for a period of time.

Not that there haven’t been plenty of other odd situations to account for. During World War II, the Chicago Cardinals and Pittsburgh Steelers briefly merged into a common team that was known as “Card-Pitt,” and before that, the Steelers had merged with the Philadelphia Eagles to create the delightfully monikered “Steagles.” In those cases, we took the average of the two teams’ ratings from the end of the previous season and performed our year-to-year mean reversion on that number to generate a preseason Elo rating. After the mash-up ended and the teams were redivided, the Steelers and Cardinals (or Eagles) received the same mean-reverted preseason rating implied by their combined performance the season before.

And don’t forget about the Cleveland Browns and Baltimore Ravens. Technically, the NFL considers the current Browns to be a continuation of the franchise that began under Paul Brown in the mid-1940s. But that team’s roster was essentially transferred to the Ravens for their inaugural season in 1996, while the “New Browns” were stocked through an expansion draft in 1999. Because of this, we decided the 1996 Ravens’ preseason Elo should be the 1995 Browns’ end-of-year Elo, with the cross-season mean-reversion technique applied, and that the 1999 Browns’ initial Elo should be 1300, the same as any other expansion team.

Season simulations

Now that we know where a team and quarterback’s initial ratings for a season come from and how those ratings update as the schedule plays out, the final piece of our Elo puzzle is how all of that fits in with our NFL interactive graphic, which predicts the entire season.

At any point in the season, the interactive lists each team’s up-to-date Elo rating (as well as how that rating has changed over the past week and how any changes at QB alter the team’s effective Elo), plus the team’s expected full-season record and its odds of winning its division, making the playoffs and even winning the Super Bowl. This is all based on a set of simulations that play out the rest of the schedule using Elo to predict each game.

Specifically, we simulate the remainder of the season tens of thousands of times using the Monte Carlo method, tracking how often each simulated universe yields a given outcome for each team. It’s important to note that we run these simulations “hot” — that is, a team’s Elo rating is not set in stone throughout the simulation but changes after each simulated game based on its result, which is then used to simulate the next game, and so forth. This allows us to better capture the possible variation in how a team’s season can play out, realistically modeling the hot and cold streaks that a team can go on over the course of a season.

Our simulations also project which quarterback will start each game by incorporating injuries, suspensions and starters being rested. For example, we might know that a quarterback is out for Weeks 1 and 2 but back for certain in Week 3. Or our forecast might have some uncertainty around a quarterback’s injury and project that he has only a 10 percent chance of playing next week but a 50 percent chance of playing the following week, and so on. In cases where we don’t know for sure which quarterback will start a game, the team’s quarterback adjustment is a weighted average of the possible starting quarterback adjustments.

Late in the season, you will find that the interactive allows you to experiment with different postseason contingencies based on who you have selected to win a given game. This is done by drilling down to just the simulated universes in which the outcomes you chose happened and seeing how those universes ultimately played out. It’s a handy way of seeing exactly what your favorite team needs to get a favorable playoff scenario or just to study the ripple effects each game may have on the rest of the league.

Starting in 2021,Model tweak
Sept. 7, 2021 we’re also adding a few tweaks for meaningless games involving playoff teams in the final week of the season. Specifically:

Any non-undefeated team that has locked up a specific playoff seed before its final regular-season game will be docked 250 rating points in that game (in addition to any penalty it might incur for resting the starting quarterback).
Neither team’s rating will change after a final regular-season game involving a team that has locked up a specific playoff seed.

The complete history of the NFL

In conjunction with our Elo interactive, we also have a separate dashboard showing how every team’s Elo rating has risen or fallen throughout history. These charts will help you track when your team was at its best — or worst — along with its ebbs and flows in performance over time. The data in the charts goes back to 1920 (when applicable) and is updated with every game of the current season.An important disclaimer: The historical interactive ratings will differ from the ratings found in our current-season prediction interactive because the historical ratings do not contain our quarterback adjustments. (If you’re interested in looking at the historical QB adjustment data, it’s available on our data homepage.)

FAQs

What is the NFL prediction model? ›

The model creates probabilities for every game. Including the Playoffs. Utilising a vast array of variables and proven statistical methods, the NFL Predictions Model takes a purely statistical approach to predicting NFL outcomes.

Know More ›

What is the most accurate NFL prediction site? ›

The best NFL prediction site today is Oddspedia. It has a long list of NFL betting tips you can use to place single and accumulator bets. Furthermore, Oddspedia . Find today's best bets for every NFL game this season with our expert NFL betting previews, the best NFL odds, and the most accurate NFL score predictions.

Who is the best at predicting NFL games? ›

Past Champions

Year	Most Accurate
2020	Matt Bowen ESPN
2019	Kevin Seifert ESPN
2018	Jamey Eisenberg CBS Sports
2017	Jamey Eisenberg CBS Sports

6 more rows

Keep Reading ›

Who decides NFL odds? ›

How are NFL odds determined? Sportsbooks employ oddsmakers who adjust NFL betting lines based on many factors, including home advantage, injuries, and the weather.

Get More Info Here ›

What is the best predictive model for the NFL? ›

nfelo is a prediction model built on top of 538's Elo framework that uses unique dynamics about the NFL to improve prediction accuracy. It is one of the (if not the!) most accurate public models available on the internet.