Fantasy Football - Footballguys Forums

pecorino

  1. His path to victory in 2020 is all but assured. No reason to waste your time by heading to the polls and voting. You might as well kick back, enjoy the tax cuts and make popcorn for his victory speech. Congrats on backing a winner.
  2. Trump is 40% of America and that number seems firm. This is shaping up to be a long-term cold Civil War.
  3. Yes. And I’ve heard all my life that churches should be taxed (Zappa was a huge spokesperson in that regard) yet it seems to be a dead issue. Wonder why...
  4. You’re welcome. So you’re saying you want another long post, this time on Calculus? eta: I smell a new sub forum.
  5. I find a lot to like about FBG polls.
  6. Good point. I did not even touch on the non-response rate or the other errors that can occur outside of random variation. My take on the 2016 polls is that 1) most were leaning or pretty clearly pointing towards Hillary but 2) they were far from slam dunks, as those confidence intervals included scenarios in which Trump comes out the winner, and 3) most apropos to this conversation, I believe we hit a new level where subjects either did not admit that they were voting for Trump or waffled at high rates, so when it came to actually voting, they swung to Trump. Plus you've got the Comey debacle, which hit in October and also threw a wrench into predictions.
  7. About polls and surveys:
1) Online polls, and any survey in which people choose for themselves whether to respond, are bunk and should be either tossed out altogether or consumed merely for their entertainment value. Think: a Presidential poll on November 7th, 2016 in which online visitors to CNN's website could click who they're voting for, versus the same poll at the Fox website. Might be fun to look at but I wouldn't make any predictions based on that data.
2) One major goal of Statistics in general and polling in particular is to try to capture a single value (often a percent) about a very large population at one snapshot of time. For instance, to take this out of the realm of politics: Among Americans over 18 (a population of hundreds of millions of people), what percent of them would favor abolishing the penny? I envision this percentage as an unknowable number, the true percentage among all of those folks. If you believe in God, you'd probably say that He'd know that percentage even though it is constantly changing as people leave the population by dying or enter the population by turning 18 or by immigrating here (that's another thread). That magical, elusive percentage is what a statistician would like to know and it is called a "parameter." If every single member of that population could be asked, then we should be able to get a good handle on that parameter. But that's a very time-consuming and costly proposition. So much so that we only do this kind of statistics rarely, and it is called a census. For the sake of illustration, let's suppose that we know the true parameter (even though statisticians almost never do) and that the true percentage of our population who want to abolish the penny is 44%. Instead of attempting a census on a very large population, statisticians realized that one could get a reasonable approximation for this parameter by taking a random sample of the population and using the percentage in that sample as the best approximation.
Randomness is key. If you sample people who are hanging around the mall or a church or an NFL game, you might get skewed results if the members of that sample are not representative of the whole population. Heaven forbid you sample attendees at a coin collecting convention. So the statistician sets out to conduct a random sample.
3) A key fact about sampling is that, even for a population in the hundreds of millions, one only needs to sample a relatively small fraction of them to be able to get a decent approximation of the parameter. Think of it like ocean water. Just because the ocean is enormous doesn't mean that we'd need a lot of water to take a reasonably representative sample. It just needs to be mixed well, and we need to choose a sample at random, not somewhere convenient like right off a pier. Turns out that a sample of only about 1000 subjects is enough for most situations, so if you look closely at most polls, they will say something like "1023 people were surveyed." This is rather amazing: a well-conducted random sample of only 1000 people will give a reasonably accurate estimate for the true percentage of all Americans who favor or oppose a proposition. Such a sample yields a "margin of error" of about +/- 3%, roughly.
4) Conducting a random sample of 1000 Americans over the age of 18, though, is a royal PIA. You cannot very well put all those names in a hat and pull out 1000 of them. I won't get into the gory details, but good samples tend to break this up into stages and do a stratified random sample. Suffice to say, it is easier to cut corners and make a specious sample than it is to do it well. This is why I trust the big pollsters like Gallup, Roper, Quinnipiac, etc.: they have the funds and expertise to conduct this random sample.
Now, as you must suspect, it is possible that if we only sample 1000 people, we could get extraordinarily unlucky and just happen to select all penny-lovers in our sample even though, in truth, 44% of our population want to abolish it. This can happen, but you can also hit the Powerball on three consecutive weeks. Sure it can happen, but it's very, very, very unlikely. The techniques of statistics allow us to quantify just how likely it is that the result of our random sample will land very far from the true parameter. The percentage of people in our sample who want to abolish the penny is called a "statistic" and let's, for the sake of argument, assume that it came out as 40%.
5) That statistic (the 40%) would vary from sample to sample. If we redid the sample of 1000 again, we might get 45%. Do it again and we might get 41%. But it varies according to a pattern which is very well understood. Imagine if you sampled over and over again (like millions of times): those percentages would dance around the true parameter percentage of 44%, with some hitting right on the money and some being pretty far away (maybe as far away as 30% or 60% but very, very unlikely that it would be much further from 44% unless our sample was tainted). Graphing all of those different percentages would reveal a bell curve (a normal distribution) with the peak at 44%, trailing off into the rare tails down towards 30% on the left and 60% on the right.
6) Here is the bummer: we usually only have the time and energy to do one sample. So let's assume we got 40% for our sample statistic. Remember that the parameter was 44%, but we need to pretend like we didn't know this because statisticians never know this "true" number. So we really need to rely on that 40% as our best estimate. If someone put a gun to your head and said "Predict the true parameter" you should guess 40% because that's what the sample said.
But we would not have much confidence in this result because of the sampling variability mentioned above. I'd feel much better if I could say "I think the true parameter is pretty close to 40%". In fact, if I were to give that +/- 3% wiggle room, I would report that I'm pretty confident that the parameter is somewhere between 37% and 43%. That range gives us 95% confidence that we've captured the parameter in that interval.
7) Whoops, the parameter is actually not in that interval. We got unlucky, and that happens about 5% of the time. We do a perfectly random sample, we get an estimate, we give ourselves the 3% wiggle room, and still we whiffed. But you never know when it is going to happen, because we do not know what the parameter is, remember. So we claim it is in that interval but we cannot be certain. This, by the way, is a central difference between mathematics and statistics. Mathematicians are certain (they prove things) while Statisticians wrestle with probabilities and can tell you when something is likely to be true or false.
The conclusion is: look for a random sample of at least 1000, go ahead and do your +/- margin of error, but do not assume that the "true percentage" lies in that range. We just don't know in any given sample, although our confidence grows with more samples or with larger ones.
  8. This is very close to correct but there are some semantic issues with what you wrote. But if the general public had this idea of polls and margin of error, I'd be satisfied.
  9. Let's cut to the chase. I suspect that a significant proportion of voters misunderstand margin of error, believing that it gives a range of percents that the "true" number must fall in. On that reading, if a well-designed randomized survey had Candidate #1 polling at 46% with a margin of error of +/- 3%, then the true percentage of votes Candidate #1 receives would be between 43% and 49%. And if Candidate #2 came in at 54% with the same margin of error, producing an interval of 51% to 57%, then that poll would be saying that Candidate #2 "should" win or "will" win or what-have-you. But none of that is right. These polls with their margins of error are just talking about probabilities, and the reported margin of error mostly quantifies the variation inherent in conducting a random sample. Other types of error (like non-response bias or people not being honest) are quite difficult to quantify, so I suspect most pollsters ignore them and simply follow the rule of thumb that a random sample of about 1000 produces a margin of error of about 3%. It's all too nuanced to summarize in one post (I have not told you what the +/- 3% margin of error means, only what it doesn't mean), but my suspicion is that many, many people misunderstand polls and margin of error. From your response, I cannot tell if you do or not since you just quoted another site. I'll keep going if there is interest but I've probably passed the tl;dr threshold.
  10. He is right about this one. Paper straws are the worst. https://www.yahoo.com/news/trump-plastic-straws-083548363.html
  11. Maybe it’s been said upthread, but this is still ridiculous. Lop off half of those candidates—less is more. Guess it’ll happen soon enough when the money runs out.
  12. I teach Statistics so I know a bit about the concept of margin of error in polling. I’d be edified if you would share with the board your understanding of that concept. Maybe a separate thread, even, unless you can give a reasonably concise answer. I find that a decent portion of our population has misconceptions about the more general notion of probabilities so I’m even more skeptical about folks understanding margin of error. It confounds my students often. But as a FBG, I suspect you actually do know what you are talking about. I’m on my phone or I’d do it myself. TIA.
  13. Is that correct? My quick search says it's over 22. We may be comparing apples to oranges, though.
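
For anyone who wants to poke at the margin-of-error discussion in the polling posts above (the penny example with a true 44% and samples of 1000), here is a minimal Python sketch. It is not from the posts themselves. It assumes the standard textbook 95% formula for a sample proportion, z * sqrt(p(1 - p) / n) with z = 1.96, and a perfectly random sample with no non-response or honesty problems. The function names, the seed, and the number of simulated polls are illustrative choices only.

```python
import math
import random


def margin_of_error(p_hat, n, z=1.96):
    """Approximate 95% margin of error for a sample proportion.

    Assumption: textbook normal-approximation formula, simple random sample.
    """
    return z * math.sqrt(p_hat * (1 - p_hat) / n)


def simulate_coverage(true_p=0.44, n=1000, polls=2000, seed=0):
    """Run many pretend polls and return the fraction whose
    interval (p_hat +/- margin of error) contains the true parameter.

    true_p=0.44 mirrors the hypothetical penny example; in real life
    the parameter is unknown, which is the whole point.
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(polls):
        # Each respondent independently says "abolish" with probability true_p.
        yes = sum(rng.random() < true_p for _ in range(n))
        p_hat = yes / n
        moe = margin_of_error(p_hat, n)
        if p_hat - moe <= true_p <= p_hat + moe:
            hits += 1
    return hits / polls


if __name__ == "__main__":
    print("Margin of error at 50%, n=1000:", round(margin_of_error(0.5, 1000), 4))
    print("Share of simulated polls that captured 44%:", simulate_coverage())
```

Under these assumptions, `margin_of_error(0.5, 1000)` comes out to roughly 0.031, which is where the "about +/- 3%" rule of thumb comes from, and the share of simulated polls that capture the true 44% should land near 0.95. The remaining polls are the "we whiffed" cases, and nothing in any single poll tells you whether it is one of them.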