Islander
Footballguy
I have completed the first phase of my offseason analysis of the IDP projections that come out every week from John Norton. I mentioned in this thread that I would try to analyze retrospectively some numbers to find out if there were biases in the projections or not. Sometimes during the season I felt like there were biases, but it was mostly based on random observations during the season rather than extensive quantitative analysis. So I tried to prove or disprove my beliefs about the flaws (or lackthereof) of the weekly projections.
However, before I go to the results, let me mention of few things.
Like I mentioned in the thread referred to above, my intent is not to bash the projections. It’s the best source of IDP projections I know of, and I use them. If I was doing my own projections, I don’t pretend they would be better. However, in the spirit of continuous improvement, I think it’s fair to point out the problems and I hope it’s not received as an all-out complaint, but rather as a suggestion for improvement. If the projections improve, everybody here will benefit.
I also do not think any single person can come up with the perfect set of projections every week. There are so many players, so many stat categories to project, you have to consider matchups, etc. But in a perfect world, we would have one or several people doing weekly projections and FBGs on this board would point out where projections are clearly too optimistic or pessimistic, and the person who made the projections would tweak them based on feedback received to end up with a better set of projections come Sunday. In 2007 I would love to see weekly projections from more than one person (on offense as well). In addition, maybe we should set up a weekly thread where people can post “the projected number tackles for Vilma this week is way too high / too low because XYZ”. Then the projections would get refreshed if the projections-maker (Norton or others) agrees with the feedback.
Also, let me describe what I think “perfect” projections are. A perfect projection is the number that would turn out to be exact on average if the game was played 100 times. So if Vilma plays the Dolphins at home, let’s say Norton projects 7 tackles. To me 7 is a perfect projection is Vilma would get 700 tackles in the 100 games the Jets would play the Dolphins at home. Of course we cannot know how many tackles Vilma would get in those 100 games because they won’t play those games. But you have to keep this in mind in order to make perfect projections. If Norton projects 12 tackles for Vilma in that game, and Vilma happens to get 12 tackles in that game, I do not think it was a perfect projection. I think it was luck that it turned out Vilma got 12 (from reading previous comments especially in the shark pool, many people disagree with this statement). I view player stats as probability distributions. If in those hypothetical 100 games, let's assume Vilma would get the following number of tackles
0: 2 times
1: 3 times
2: 4 times
3: 5 times
4: 6 times
5: 9 times
6: 12 times
7: 18 times
8: 12 times
9: 9 times
10: 6 times
11: 5 times
12: 4 times
13: 3 times
14: 2 times
As you can see, it’s a probability distribution with a mean of 7.0. If that was the case, I would project 7.0 tackles. If the outcome is 12, that does not mean my projection of 7.0 was wrong. The outcome was different from the projection simply because it’s a random process. In addition, if the projection was 12 and the outcome is 12, that does not mean the projection was correct (in fact, a projection of 12 was way too high in this particular example, because if Vilma played that game 100 times, he would get 70 tackles, not 120 tackles, and the projection of 12 would be too high 91% of the time). Think of projecting the outcome of a die roll where people could bet over/under on your projection. The perfect projection is 3.5. If the outcome is 6, it does not mean your projection of 3.5 was wrong.
Several people use the weekly projections in different ways. Some use them simply as a cheatsheet so they don’t care about the numbers, they only care about the rankings, so Norton needs to consider how the rankings will come out based on the projections. Others convert the projected stats into their scoring system to come up with a new cheatsheet. If that’s all you do, then whether all projections are 10% overstated or 10% understated for all players, you don’t care. But if you use the projections for other purposes, that may impact what you do. So again, in the perfect world, all projections would be unbiased for all players, and regardless what you use projections for, you will have something very good to work with. However, I fully understand that some people don’t place high value about all players being over/underprojected 10% across the board – they might only care about projections of certain players relative to one another (is player X projected more fantasy points than player Y?).
Another argument I frequently read in the IDP forum but also in the shark pool: “well there is only a 0.5 pt difference between the projection of LB10 and LB20 or between WR10 and WR20, so it does not mean much”. Let me express my views on this. I totally disagree. Half a point is big. If you bet on sports, you already know that the casino has a small advantage (the juice, or the vig) and to be able to beat the casino, you need to exploit lines that may be off by as little as 0.5 pt. The difference between a 54% success rate and a 51% success rate is enormous. In the Vilma example above, if the projection was 6.5 tackles instead of 7.0 tackles, the projection would be too high 41% of the time and it would be too low 69% of the time. That is far from being a good projection. In my view, anything more than 0.3 or 0.4 difference in projected fantasy points is statistically relevant and I disagree with the argument that it’s not much. I have read from some posters before that they thought a difference in projections of 2 pts is not much. In my eyes, it is huge. If you were a casino offering a line that is 2 pts off (for over/under fantasy points of a certain player), you would go out of business in a heartbeat if you were taking bets from sharks that know their stuff.
I have not done all the number crunching I want to do yet, but I have some preliminary results that I want to throw out there as a basis of discussion before I move on. I think the FBG crowd that hangs out here in the fantasy offseason is sharper than what we see between Sept-Dec so hopefully this discussion will be of good quality.
Here is what I did.
I took the projections for all DLs, LBs, DBs for each week in 2006. I looked at the actual stats for each IDP of each game in 2006. For each player and each week, I compared the projection to the actual result. I compiled numbers by position, by week, by stat category, etc.
The first thing I wanted to find out is whether there is a bias in the overall projections (not the breakdown by player). Many people don’t care about this, as I mentioned above, but I do, so I calculated season totals.
*** Note: I am missing week #5 stats because the link on the FBG site is incorrectly pointing to the week #6 stats. I will fix my numbers once I have the week #5 stats, but until then, my season totals are for all weeks except week #5. Can someone on the FBG staff fix the link to the week #5 stats and year-to-date stats after 5 weeks? ***
Here are the results:
Position Stat Actual Projected A-P Average WeeksTooHigh WeeksTooLow
DL Tkl 5417 5679 -262 -16.4 12 4
DL Asst 1989 1713 276 17.3 4 11
DL Sack 716 917 -201 -12.6 14 1
DL INT 23 2.5 20.5 1.3 2 12
DL FF 174 178.5 -4.5 -0.3 9 6
DL FR 133 125.9 7.1 0.4 8 7
DL PD 384 198 186 11.6 1 14
DL Pts 9776 10103.3 -327.3 -20.5 10 6
LB Tkl 7037 8550 -1513 -94.6 16 0
LB Asst 2683 3157 -474 -29.6 14 2
LB Sack 306.5 274.5 32 2.0 5 11
LB INT 92 63.3 28.7 1.8 4 12
LB FF 137 138 -1 -0.1 8 7
LB FR 91 96 -5 -0.3 7 8
LB PD 469 468 1 0.1 8 7
LB Pts 11638 12141.2 -503.2 -31.4 12 3
DB Tkl 8796 9266 -470 -29.4 14 2
DB Asst 2101 1682 419 26.2 1 15
DB Sack 90.5 24 66.5 4.2 1 15
DB INT 369 420.9 -51.9 -3.2 13 3
DB FF 160 62.5 97.5 6.1 1 15
DB FR 105 35.1 69.9 4.4 1 15
DB PD 1580 1756 -176 -11.0 13 3
DB Pts 14857.5 13813.8 1043.7 65.2 3 13
Total Tkl 21250 23495 -2245 -140.3 16 0
Total Asst 6773 6552 221 13.8 6 10
Total Sack 1113 1215.5 -102.5 -6.4 12 4
Total INT 484 486.7 -2.7 -0.2 9 7
Total FF 471 379 92 5.8 1 15
Total FR 329 257 72 4.5 2 14
Total PD 2433 2422 11 0.7 7 9
Total Pts 36271.5 36058.3 213.2 13.3 8 8
In the table above, you can see that Norton consistently overprojected tackles for LBs. He projected 8550 tackles all year (except week #5) and there were only 7037 tackles by LBs (excluding week #5). So he was off by 1513 tackles, for an average of 94.6 tackles too high per week. He was too high all 16 weeks (next to last column of table), and he was not too low in any week. For some categories, the sum of the last 2 columns is not 16 (for example, PDs for DLs 1+14=15) because the projection was equal to the actual result in a particular week.
Those numbers pretty much match what I expected before compiling the results. Norton projected too many sacks for DLs, too many INTs and PDs for DBs, not many tackles for LBs. Those are the bread & butter of each of the three positions. In addition, Norton projects too few INTs and PDs for DLs, too few sacks and INTs for LBs, and too few sacks and FFs / FRs for DBs. Those are the stats that are generally not the bread and butter of each position. I have not done this analysis with 2005 stats, but I am pretty sure we would find similar results.
Note that for the total fantasy points for all positions total (last row of table), Norton did pretty well. In 8 weeks he was too high, and in 8 weeks he was too low. He was off by only 13.3 pts per week on average for all NFL players (only 0.6% off). This is pretty close to zero and one could say there was virtually no bias. But the breakdown by position / stat category could be improved.
When looking at aggregate stats only, the biases should be fairly easy to fix. Total tackles for all LBs are pretty stable week to week. Even for the not so stable categories, like INTs for DLs, if you know there are about 1.5 INT by a DL each week (in 2006 there were 23 INTs by DLs in 16 games), then I think the weekly projections should have about 1.5 INTs total for all DLs. Norton only projected 2.5 INTs for DLs all year – clearly too low.
Hopefully I did not screw up on my spreadsheets and calculations. But if I spot or if anybody spots a problem, please let me know and I will re-post the numbers.
That’s it for now, later on I will post results for other analyses I plan on doing. Among other things, I will try to show that I believe the top players at each position have overstated projections. In other words, not only do LBs have too many projected tackles in total, but it is even more the case for the top LBs in the NFL. For all positions, I believe the projected fantasy points of the top players are too high on average, even though for all players of the NFL in total, Norton’s projections were pretty much in line. Finally, I will try to show that if there is a player who is not that good but Norton put him very high on the projections for that week (probably due to a favorable matchup), then on average those players were projected too many fantasy points.
For the moment I would appreciate comments on what I have so far.
However, before I go to the results, let me mention of few things.
Like I mentioned in the thread referred to above, my intent is not to bash the projections. It’s the best source of IDP projections I know of, and I use them. If I was doing my own projections, I don’t pretend they would be better. However, in the spirit of continuous improvement, I think it’s fair to point out the problems and I hope it’s not received as an all-out complaint, but rather as a suggestion for improvement. If the projections improve, everybody here will benefit.
I also do not think any single person can come up with the perfect set of projections every week. There are so many players, so many stat categories to project, you have to consider matchups, etc. But in a perfect world, we would have one or several people doing weekly projections and FBGs on this board would point out where projections are clearly too optimistic or pessimistic, and the person who made the projections would tweak them based on feedback received to end up with a better set of projections come Sunday. In 2007 I would love to see weekly projections from more than one person (on offense as well). In addition, maybe we should set up a weekly thread where people can post “the projected number tackles for Vilma this week is way too high / too low because XYZ”. Then the projections would get refreshed if the projections-maker (Norton or others) agrees with the feedback.
Also, let me describe what I think “perfect” projections are. A perfect projection is the number that would turn out to be exact on average if the game was played 100 times. So if Vilma plays the Dolphins at home, let’s say Norton projects 7 tackles. To me 7 is a perfect projection is Vilma would get 700 tackles in the 100 games the Jets would play the Dolphins at home. Of course we cannot know how many tackles Vilma would get in those 100 games because they won’t play those games. But you have to keep this in mind in order to make perfect projections. If Norton projects 12 tackles for Vilma in that game, and Vilma happens to get 12 tackles in that game, I do not think it was a perfect projection. I think it was luck that it turned out Vilma got 12 (from reading previous comments especially in the shark pool, many people disagree with this statement). I view player stats as probability distributions. If in those hypothetical 100 games, let's assume Vilma would get the following number of tackles
0: 2 times
1: 3 times
2: 4 times
3: 5 times
4: 6 times
5: 9 times
6: 12 times
7: 18 times
8: 12 times
9: 9 times
10: 6 times
11: 5 times
12: 4 times
13: 3 times
14: 2 times
As you can see, it’s a probability distribution with a mean of 7.0. If that was the case, I would project 7.0 tackles. If the outcome is 12, that does not mean my projection of 7.0 was wrong. The outcome was different from the projection simply because it’s a random process. In addition, if the projection was 12 and the outcome is 12, that does not mean the projection was correct (in fact, a projection of 12 was way too high in this particular example, because if Vilma played that game 100 times, he would get 70 tackles, not 120 tackles, and the projection of 12 would be too high 91% of the time). Think of projecting the outcome of a die roll where people could bet over/under on your projection. The perfect projection is 3.5. If the outcome is 6, it does not mean your projection of 3.5 was wrong.
Several people use the weekly projections in different ways. Some use them simply as a cheatsheet so they don’t care about the numbers, they only care about the rankings, so Norton needs to consider how the rankings will come out based on the projections. Others convert the projected stats into their scoring system to come up with a new cheatsheet. If that’s all you do, then whether all projections are 10% overstated or 10% understated for all players, you don’t care. But if you use the projections for other purposes, that may impact what you do. So again, in the perfect world, all projections would be unbiased for all players, and regardless what you use projections for, you will have something very good to work with. However, I fully understand that some people don’t place high value about all players being over/underprojected 10% across the board – they might only care about projections of certain players relative to one another (is player X projected more fantasy points than player Y?).
Another argument I frequently read in the IDP forum but also in the shark pool: “well there is only a 0.5 pt difference between the projection of LB10 and LB20 or between WR10 and WR20, so it does not mean much”. Let me express my views on this. I totally disagree. Half a point is big. If you bet on sports, you already know that the casino has a small advantage (the juice, or the vig) and to be able to beat the casino, you need to exploit lines that may be off by as little as 0.5 pt. The difference between a 54% success rate and a 51% success rate is enormous. In the Vilma example above, if the projection was 6.5 tackles instead of 7.0 tackles, the projection would be too high 41% of the time and it would be too low 69% of the time. That is far from being a good projection. In my view, anything more than 0.3 or 0.4 difference in projected fantasy points is statistically relevant and I disagree with the argument that it’s not much. I have read from some posters before that they thought a difference in projections of 2 pts is not much. In my eyes, it is huge. If you were a casino offering a line that is 2 pts off (for over/under fantasy points of a certain player), you would go out of business in a heartbeat if you were taking bets from sharks that know their stuff.
I have not done all the number crunching I want to do yet, but I have some preliminary results that I want to throw out there as a basis of discussion before I move on. I think the FBG crowd that hangs out here in the fantasy offseason is sharper than what we see between Sept-Dec so hopefully this discussion will be of good quality.
Here is what I did.
I took the projections for all DLs, LBs, DBs for each week in 2006. I looked at the actual stats for each IDP of each game in 2006. For each player and each week, I compared the projection to the actual result. I compiled numbers by position, by week, by stat category, etc.
The first thing I wanted to find out is whether there is a bias in the overall projections (not the breakdown by player). Many people don’t care about this, as I mentioned above, but I do, so I calculated season totals.
*** Note: I am missing week #5 stats because the link on the FBG site is incorrectly pointing to the week #6 stats. I will fix my numbers once I have the week #5 stats, but until then, my season totals are for all weeks except week #5. Can someone on the FBG staff fix the link to the week #5 stats and year-to-date stats after 5 weeks? ***
Here are the results:
Position Stat Actual Projected A-P Average WeeksTooHigh WeeksTooLow
DL Tkl 5417 5679 -262 -16.4 12 4
DL Asst 1989 1713 276 17.3 4 11
DL Sack 716 917 -201 -12.6 14 1
DL INT 23 2.5 20.5 1.3 2 12
DL FF 174 178.5 -4.5 -0.3 9 6
DL FR 133 125.9 7.1 0.4 8 7
DL PD 384 198 186 11.6 1 14
DL Pts 9776 10103.3 -327.3 -20.5 10 6
LB Tkl 7037 8550 -1513 -94.6 16 0
LB Asst 2683 3157 -474 -29.6 14 2
LB Sack 306.5 274.5 32 2.0 5 11
LB INT 92 63.3 28.7 1.8 4 12
LB FF 137 138 -1 -0.1 8 7
LB FR 91 96 -5 -0.3 7 8
LB PD 469 468 1 0.1 8 7
LB Pts 11638 12141.2 -503.2 -31.4 12 3
DB Tkl 8796 9266 -470 -29.4 14 2
DB Asst 2101 1682 419 26.2 1 15
DB Sack 90.5 24 66.5 4.2 1 15
DB INT 369 420.9 -51.9 -3.2 13 3
DB FF 160 62.5 97.5 6.1 1 15
DB FR 105 35.1 69.9 4.4 1 15
DB PD 1580 1756 -176 -11.0 13 3
DB Pts 14857.5 13813.8 1043.7 65.2 3 13
Total Tkl 21250 23495 -2245 -140.3 16 0
Total Asst 6773 6552 221 13.8 6 10
Total Sack 1113 1215.5 -102.5 -6.4 12 4
Total INT 484 486.7 -2.7 -0.2 9 7
Total FF 471 379 92 5.8 1 15
Total FR 329 257 72 4.5 2 14
Total PD 2433 2422 11 0.7 7 9
Total Pts 36271.5 36058.3 213.2 13.3 8 8
In the table above, you can see that Norton consistently overprojected tackles for LBs. He projected 8550 tackles all year (except week #5) and there were only 7037 tackles by LBs (excluding week #5). So he was off by 1513 tackles, for an average of 94.6 tackles too high per week. He was too high all 16 weeks (next to last column of table), and he was not too low in any week. For some categories, the sum of the last 2 columns is not 16 (for example, PDs for DLs 1+14=15) because the projection was equal to the actual result in a particular week.
Those numbers pretty much match what I expected before compiling the results. Norton projected too many sacks for DLs, too many INTs and PDs for DBs, not many tackles for LBs. Those are the bread & butter of each of the three positions. In addition, Norton projects too few INTs and PDs for DLs, too few sacks and INTs for LBs, and too few sacks and FFs / FRs for DBs. Those are the stats that are generally not the bread and butter of each position. I have not done this analysis with 2005 stats, but I am pretty sure we would find similar results.
Note that for the total fantasy points for all positions total (last row of table), Norton did pretty well. In 8 weeks he was too high, and in 8 weeks he was too low. He was off by only 13.3 pts per week on average for all NFL players (only 0.6% off). This is pretty close to zero and one could say there was virtually no bias. But the breakdown by position / stat category could be improved.
When looking at aggregate stats only, the biases should be fairly easy to fix. Total tackles for all LBs are pretty stable week to week. Even for the not so stable categories, like INTs for DLs, if you know there are about 1.5 INT by a DL each week (in 2006 there were 23 INTs by DLs in 16 games), then I think the weekly projections should have about 1.5 INTs total for all DLs. Norton only projected 2.5 INTs for DLs all year – clearly too low.
Hopefully I did not screw up on my spreadsheets and calculations. But if I spot or if anybody spots a problem, please let me know and I will re-post the numbers.
That’s it for now, later on I will post results for other analyses I plan on doing. Among other things, I will try to show that I believe the top players at each position have overstated projections. In other words, not only do LBs have too many projected tackles in total, but it is even more the case for the top LBs in the NFL. For all positions, I believe the projected fantasy points of the top players are too high on average, even though for all players of the NFL in total, Norton’s projections were pretty much in line. Finally, I will try to show that if there is a player who is not that good but Norton put him very high on the projections for that week (probably due to a favorable matchup), then on average those players were projected too many fantasy points.
For the moment I would appreciate comments on what I have so far.