Nfelo is an Elo based power ranking model for the NFL. The table above contains each team's current Elo rating along with a handful of additional measurements of performance like point differential and Expected Points Added (EPA) splits.
Elo is a rating system for predicting head-to-head competitions between opponents. Though originally created to measure chess players, it's since been adapted to measure a wide variety of competitions and sports.
All Elo systems start with an initial Elo value for each team (or player in the case of chess). Before each game, the Elo ratings of both teams are compared to create a prediction of the outcome. Once the game is played, the Elo model compares the actual result against its expectations, and updates the Elo ratings for each team according. Pretty simple, right?
In addition to the elegance in their simplicity, Elo rating systems also benefit from their ability to be extended and customized for the competitions they measure. This is exactly what nfelo does. It's Elo for the NFL, hence the name (NFL + Elo = nfelo).
nfelo takes the core Elo framework and adds to it:
Since Elo rating systems are not tailored to the NFL out of the box, a little more work needs to be done to make it actionable for different types of NFL betting and analysis. When Elo compares one team's rating to another, its prediction is expressed as a win probability for one side. In the case of nfelo, it's also the home team win probability. Here's how nfelo translates this win probability into different applications.
Since Pick'em and Confidence pools are effectively just a stack rank of who is most likely to win, nfelo is pretty well situated. The model takes the home team win probability and subtracts it from 1 to find the away team win probability. The higher of these two numbers is then taken to represent the overall winner's expected win probability. The confidence pool recommendations are then just a stack rank of winners based on who has the highest probability.
Translating win probabilities into an expected winning margin of victory (or spread) is a bit more complex. To do this, nfelo uses a second model built on historical money line odds and spreads to translate a winning probability (ie the money line odds) into a winning margin (ie the spread).
In instances where the model's predicted winning margin and the market spread differ, nfelo uses an additional model for measuring how likely a team is to cover the market number. This is the basis for creating an Expected Value (EV) for each game, which are displayed on the betting card screen.
A final use of NFL Elo ratings is in measuring strength of schedule, both played and upcoming. Translating Elo ratings into strength of schedule is as easy as calculating the average Elo ratings of the a team's opponent (either played or yet to be played).
Combining Elo ratings and schedules also allows you to calculate Super Bowl and Playoff odds. Comparing the Elo ratings of any two teams produces an expected win probability, which can be used to simulate future outcomes using methods like a Monte Carlo. This functionality hasn't yet been added the nfelo site, but you can expect to see it in the future.
Elo ratings, while advantageous in many ways, have their limitations in betting. Some of these limitations include:
In addition to NFL Elo ratings, the table above also includes a few other team metrics that can be useful for power ratings.
Of course, wins are the most important stat, but they're not always the most predictive. Bad breaks in critical situations can turn wins into losses, even if those breaks aren't likely to happen again in the future. For this reason, it can be helpful to compare a team's actual wins against expected wins derived from other metrics. The table shows "Pythagorean Wins", which are based on a point differential equation, "PFF Wins", which use PFF grades to predict how many wins a team should have, and "538 Wins", which is just a sum of FiveThirtyEight's Elo based predictions. These metrics tend to be more predictive of future performance than wins alone and provide context for who has gotten lucky or unlucky.
Point differential is, quite possibly, the simplest metric there is for measuring team performance. Despite this simplicity, it's also one of the most predictive, beating out advanced metrics like DVOA, or Expected Points Added (EPA). In the table, you'll find "Points For", "Points Against", and "Net Points", which is just the total margin of victory.
EPA, which stands for Expected Points Added, is an advanced metric that has grown in popularity in recent years. It's based on a machine learning model that uses factors like starting field position, time remaining the half, down and distance, hardline, and more to determine how many points an offense is expected to score on their current drive based on their current context. Expected Points Added measures this expected points value at the start and end of each play, which allows us to determine how much value was created or destroyed on any individual play.
Adding up all these values tells us how many points a team produced over or below what we'd expect a league average team to produce in similar situations. Additionally, since EPA assigns value at the individual play level, it allows for a variety of informative aggregations.
In this table, EPA per play is split across offensive and defensive passing and rushing plays. These splits are a great way to determine which teams perform the best in different phases of the game.
Elo is a rating system that, in the context of the NFL, evaluates teams by assigning them a numerical rating, ranging from 1000-2000, based on their pregame and postgame performance. After each game, the team's Elo rating is updated according to the result of the game. The maximum number of points a team can gain from winning is determined by a predetermined factor known as K. The average team has an Elo of 1505.
Elo does not stand for anything. It is actually named after its creator Arpad Elo. Arpad Elo originally created Elo ratings as a method for calculating the relative skill levels of chess players, but it can be extended to calculate the relative ratings of any two-player (or team), zero-sum game.
The Elo rating system is a measure of team strength based on head-to-head results, margin of victory and quality of opponent. It is used to predict the probability of one team winning over the other and can also be used to forecast the magnitude of the result. This process of using Elo ratings to create a prediction is known as an Elo forecast.
The original Elo rating system devised by Arpad Elo calculates Elo using the formula R'a = Ra + Ra. K*(Sa — Ea), where R'a is the new rating, Ra is the previous rating, Ea denotes the expected outcome and SA is the actual outcome of the match. Other models like nfelo, which have adapted Elo ratings to other domains make modifications to this formula.
Elo ratings provide a numerical representation of team strengths which enables predictions of game outcomes and comparison of teams. It is a team based power rating system, which generally has the same purpose and application of any other NFL power rating system. What sets nfelo, and other model based systems is that they are entirely quantitative, and, as a result, potentially less biased.
@greerreNFL
NFL Analytics and Betting
Follow