Forum: >>> Magnum BBS <<<

meaning of 'statistical variation'

From Rich Ulrich@21:1/5 to All on Sat Nov 2 13:32:01 2024

XPost: alt.usage.english

Lede -- Dead-heat poll results are astonishing – and improbable,
these experts say. Robert Tait

https://www.theguardian.com/us-news/2024/nov/02/what-polls-mean-so-far-trump-harris-election-voters

Of the last 321 polls in the battlegrounds, 124 - nearly 40% -
showed margins of a single point or less, the pair wrote.
Pennsylvania was the most “troubling” case, with 20 out of 59 polls showing an exact tie, while another 26 showed margins of less than
1%.

This indicated “not just an astonishingly tight race, but also an
improbably tight race”, according to Clinton and Lapinski.

Large numbers of surveys would be expected to show a wider variety
of opinion, even in a close election, due to the randomness
inherent in polling. The absence of such variation suggests that
either pollsters are adjusting “weird” margins of 5% or more,
Clinton and Lapinski argued – or the following second possibility,
which they deemed more likely.

“Some of the tools pollsters are using in 2024 to address the
polling problems of 2020, such as weighting by partisanship, past
vote or other factors, may be flattening out the differences and
reducing the variation in reported poll results,” they write.

When the 'margin of error' is three points, then you expect the
observed results to VARY around the middle. Ignoring all
differences in method -- another survey, done one exactly the
same days by the same people is APT to differ by (say) three
points from the other. People with different methods and
sampling procedures would differ more.

If you flip a coin 1000 times, 500 heads is "most likely" but
if you get exactlly that number a bunch of times in a row,
someone is cheating. A range of results should center on
500, but 3% either way is EXPECTed to come up, fairly often.

The bell this rings for statisticians is the re-analyses done on
Gregor Mendel's data on Dominant/recessive genes. Mendel
wrote before modern statistics; he was a monk doing private
experiments; his work was buried in a minor publication, and
re-descovered years later. He published convincing data
showing ratios like 1-1, 3-1, 1-2-1 -- with TOO MUCH precision
for the number of plants he was growing.

Looking back, it seems that he very likely "fudged" his data in
one way or another. Reporting averages? Throwing out odd
results? His tables seem "unlikely" to have represented the
experiments as he described them.

I don't know how they selected their "321 polls" but I do know
that agencies with HIGH reputations are relatively few; and
the reputable ones do a pretty good job of "showing their
work", so I don't know what is going on here.

I can see how they may get more consistent results if they
do such things as "weighting by partisanship" -- this amounts
to a version of what is called "stratified sampling". Thus, they
are predicting changes in the overall outcome by looking at
changes in subgroups: Are suburban white college-educated
females changing their minds? - some of the surveys these
days are probably internet-based, and ask their volunteers
to give periodic responses, so there is a person-by-person
change measured, which could be precise.

What that LACKs is the randomness of selecting samples; and
where it potentially FAILs is that the generated outcome
makes iron-clad assumptions about turnout: This is our result
IF our percentages of assumed voters by category match
who votes. Unfortunately, WHO VOTEs seems to be the
big explanatory variable for WHO WINs in recent elections.

Nate Silver, of FiveThrtyEight, is the most prestigious of US
poll interpreters. Last week, he gave his statistical opinion,
that, despite all the close calls in polls, there is about a 60%
chance (IIRC) that one candidate or the other will win at
least 6 of the 7 swing states. Sounds about right to me.
Oh, he said that the data (last week) gave Trump about
a 53% chance of winning.

--
Rich Ulrich

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From David Jones@21:1/5 to Rich Ulrich on Sat Nov 2 23:36:09 2024

XPost: alt.usage.english

Rich Ulrich wrote:

Lede -- Dead-heat poll results are astonishing – and improbable,
these experts say. Robert Tait

https://www.theguardian.com/us-news/2024/nov/02/what-polls-mean-so-far-trump-harris-election-voters

Of the last 321 polls in the battlegrounds, 124 - nearly 40% -
showed margins of a single point or less, the pair wrote.
Pennsylvania was the most “troubling” case, with 20 out of 59 polls showing an exact tie, while another 26 showed margins of less than
1%.

This indicated “not just an astonishingly tight race, but also an
improbably tight race”, according to Clinton and Lapinski.

Large numbers of surveys would be expected to show a wider variety
of opinion, even in a close election, due to the randomness
inherent in polling. The absence of such variation suggests that
either pollsters are adjusting “weird” margins of 5% or more,
Clinton and Lapinski argued – or the following second possibility,
which they deemed more likely.

“Some of the tools pollsters are using in 2024 to address the
polling problems of 2020, such as weighting by partisanship, past
vote or other factors, may be flattening out the differences and
reducing the variation in reported poll results,” they write.

When the 'margin of error' is three points, then you expect the
observed results to VARY around the middle. Ignoring all
differences in method -- another survey, done one exactly the
same days by the same people is APT to differ by (say) three
points from the other. People with different methods and
sampling procedures would differ more.

If you flip a coin 1000 times, 500 heads is "most likely" but
if you get exactlly that number a bunch of times in a row,
someone is cheating. A range of results should center on
500, but 3% either way is EXPECTed to come up, fairly often.

The bell this rings for statisticians is the re-analyses done on
Gregor Mendel's data on Dominant/recessive genes. Mendel
wrote before modern statistics; he was a monk doing private
experiments; his work was buried in a minor publication, and
re-descovered years later. He published convincing data
showing ratios like 1-1, 3-1, 1-2-1 -- with TOO MUCH precision
for the number of plants he was growing.

Looking back, it seems that he very likely "fudged" his data in
one way or another. Reporting averages? Throwing out odd
results? His tables seem "unlikely" to have represented the
experiments as he described them.

I don't know how they selected their "321 polls" but I do know
that agencies with HIGH reputations are relatively few; and
the reputable ones do a pretty good job of "showing their
work", so I don't know what is going on here.

I can see how they may get more consistent results if they
do such things as "weighting by partisanship" -- this amounts
to a version of what is called "stratified sampling". Thus, they
are predicting changes in the overall outcome by looking at
changes in subgroups: Are suburban white college-educated
females changing their minds? - some of the surveys these
days are probably internet-based, and ask their volunteers
to give periodic responses, so there is a person-by-person
change measured, which could be precise.

What that LACKs is the randomness of selecting samples; and
where it potentially FAILs is that the generated outcome
makes iron-clad assumptions about turnout: This is our result
IF our percentages of assumed voters by category match
who votes. Unfortunately, WHO VOTEs seems to be the
big explanatory variable for WHO WINs in recent elections.

Nate Silver, of FiveThrtyEight, is the most prestigious of US
poll interpreters. Last week, he gave his statistical opinion,
that, despite all the close calls in polls, there is about a 60%
chance (IIRC) that one candidate or the other will win at
least 6 of the 7 swing states. Sounds about right to me.
Oh, he said that the data (last week) gave Trump about
a 53% chance of winning.

Of course I know nothing about this, but one possible explanation is
that the results are all obtained by some version of "shrinkage", as in https://en.wikipedia.org/wiki/Shrinkage_(statistics)
... possibly they are all shrinking estimates towards a 50-50 result.
But if they did do this, presumably they would have to adjust the usual
"margin of error" in some way.

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Rich Ulrich@21:1/5 to dajhawkxx@nowherel.com on Sun Nov 3 17:50:02 2024

XPost: alt.usage.english

On Sat, 2 Nov 2024 23:36:09 -0000 (UTC), "David Jones"
<dajhawkxx@nowherel.com> wrote:

Rich Ulrich wrote:

Lede -- Dead-heat poll results are astonishing – and improbable,
these experts say. Robert Tait

https://www.theguardian.com/us-news/2024/nov/02/what-polls-mean-so-far-trump-harris-election-voters

Of the last 321 polls in the battlegrounds, 124 - nearly 40% -
showed margins of a single point or less, the pair wrote.
Pennsylvania was the most “troubling” case, with 20 out of 59 polls >> showing an exact tie, while another 26 showed margins of less than
1%.

This indicated “not just an astonishingly tight race, but also an
improbably tight race”, according to Clinton and Lapinski.

Large numbers of surveys would be expected to show a wider variety
of opinion, even in a close election, due to the randomness
inherent in polling. The absence of such variation suggests that
either pollsters are adjusting “weird” margins of 5% or more,
Clinton and Lapinski argued – or the following second possibility,
which they deemed more likely.

“Some of the tools pollsters are using in 2024 to address the
polling problems of 2020, such as weighting by partisanship, past
vote or other factors, may be flattening out the differences and
reducing the variation in reported poll results,” they write.

When the 'margin of error' is three points, then you expect the
observed results to VARY around the middle. Ignoring all
differences in method -- another survey, done one exactly the
same days by the same people is APT to differ by (say) three
points from the other. People with different methods and
sampling procedures would differ more.

If you flip a coin 1000 times, 500 heads is "most likely" but
if you get exactlly that number a bunch of times in a row,
someone is cheating. A range of results should center on
500, but 3% either way is EXPECTed to come up, fairly often.

The bell this rings for statisticians is the re-analyses done on
Gregor Mendel's data on Dominant/recessive genes. Mendel
wrote before modern statistics; he was a monk doing private
experiments; his work was buried in a minor publication, and
re-descovered years later. He published convincing data
showing ratios like 1-1, 3-1, 1-2-1 -- with TOO MUCH precision
for the number of plants he was growing.

Looking back, it seems that he very likely "fudged" his data in
one way or another. Reporting averages? Throwing out odd
results? His tables seem "unlikely" to have represented the
experiments as he described them.

I don't know how they selected their "321 polls" but I do know
that agencies with HIGH reputations are relatively few; and
the reputable ones do a pretty good job of "showing their
work", so I don't know what is going on here.

I can see how they may get more consistent results if they
do such things as "weighting by partisanship" -- this amounts
to a version of what is called "stratified sampling". Thus, they
are predicting changes in the overall outcome by looking at
changes in subgroups: Are suburban white college-educated
females changing their minds? - some of the surveys these
days are probably internet-based, and ask their volunteers
to give periodic responses, so there is a person-by-person
change measured, which could be precise.

What that LACKs is the randomness of selecting samples; and
where it potentially FAILs is that the generated outcome
makes iron-clad assumptions about turnout: This is our result
IF our percentages of assumed voters by category match
who votes. Unfortunately, WHO VOTEs seems to be the
big explanatory variable for WHO WINs in recent elections.

Nate Silver, of FiveThrtyEight, is the most prestigious of US
poll interpreters. Last week, he gave his statistical opinion,
that, despite all the close calls in polls, there is about a 60%
chance (IIRC) that one candidate or the other will win at
least 6 of the 7 swing states. Sounds about right to me.
Oh, he said that the data (last week) gave Trump about
a 53% chance of winning.

Of course I know nothing about this, but one possible explanation is
that the results are all obtained by some version of "shrinkage", as in >https://en.wikipedia.org/wiki/Shrinkage_(statistics)
... possibly they are all shrinking estimates towards a 50-50 result.
But if they did do this, presumably they would have to adjust the usual >"margin of error" in some way.

Here is another commentary on the same thing. https://www.emptywheel.net/2024/11/03/male-pollsters-shocked-shocked-when-a-woman-pollster-discovers-women-voters/?fbclid=IwY2xjawGU1ChleHRuA2FlbQIxMQABHXSltRouQ_w_bulSgPL75SATJWOXBb2NZFhDmk3_5vXMG_wWdKCVNw5UhQ_aem_fJ-vwR17bW3o71gQRR9g6Q

What this article calls 'herding' does sound like it might be
'shrinkage' -- though, shrinkage is a particular procedure.
The author here partly seems to blame Nate Silver for making
everyone so cautious, though I don't know where his hand
comes into it. The article suggests, over-correcting for previous
errors in missing the pro-Trump vote.

New news: a well-respected pollster has newly measured a
bigger lead for Harris in Iowa, and that has stimulated more
reaction.

The people pushing for turnout have been a bigger presence
than I've seen before -- three phone calls, a door-knocking,
and a thingie hung on my doorknob. I'm in Pennsylvania, which
is targeted as a swing state with 19 electors.

--
Rich Ulrich

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Rich Ulrich@21:1/5 to rich.ulrich@comcast.net on Sun Nov 10 20:08:52 2024

XPost: alt.usage.english

On Sun, 03 Nov 2024 17:50:02 -0500, Rich Ulrich
<rich.ulrich@comcast.net> wrote:

Nate Silver, of FiveThrtyEight, is the most prestigious of US
poll interpreters. Last week, he gave his statistical opinion,
that, despite all the close calls in polls, there is about a 60%
chance (IIRC) that one candidate or the other will win at
least 6 of the 7 swing states. Sounds about right to me.

Trump wins 7 of 7 swing states, now that Arizona has been 'called'.

Turnout: Trump received ABOUT as many votes as he did in 2020,
probably more than 50% of the total votes -- a bit of a surprise
since "53%" of prospective voters say they dislike him. (Lesser
of two evils, I guess.)

Since Biden won by millions, that means that Harris received
notably fewer than Biden had -- turnout mattered. Trump's
edge was up (they say) even more in the states that were
not the swing states (and did not receive saturation-levels of
TV advertising and local visits).

--
Rich Ulrich

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Peter Moylan@21:1/5 to Rich Ulrich on Mon Nov 11 13:12:01 2024

XPost: alt.usage.english

On 11/11/24 12:08, Rich Ulrich wrote:

On Sun, 03 Nov 2024 17:50:02 -0500, Rich Ulrich
<rich.ulrich@comcast.net> wrote:

Nate Silver, of FiveThrtyEight, is the most prestigious of US
poll interpreters. Last week, he gave his statistical
opinion, that, despite all the close calls in polls, there is
about a 60% chance (IIRC) that one candidate or the other will
win at least 6 of the 7 swing states. Sounds about right to
me.

Trump wins 7 of 7 swing states, now that Arizona has been 'called'.

Turnout: Trump received ABOUT as many votes as he did in 2020,
probably more than 50% of the total votes -- a bit of a surprise
since "53%" of prospective voters say they dislike him. (Lesser of
two evils, I guess.)

Since Biden won by millions, that means that Harris received notably
fewer than Biden had -- turnout mattered. Trump's edge was up (they
say) even more in the states that were not the swing states (and did
not receive saturation-levels of TV advertising and local visits).

Opponents of compulsory voting have often claimed that, if everyone
votes, the results will be distorted by the votes of the unintelligent
and of the "don't care" voters.

Is the latest US result an illustration of what happens when only the intelligent voters vote?

--
Peter Moylan peter@pmoylan.org http://www.pmoylan.org
Newcastle, NSW

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

Who's Online
Recent Visitors
- Plume
  Sun Sep 14 09:34:52 2025
  from Uk via Raw
- Gretchiie
  Sun Sep 14 06:07:30 2025
  from Derry, Nh via Telnet
- Thlc
  Sat Sep 13 17:11:34 2025
  from Rognac, France via Telnet
- Thlc
  Sat Sep 13 17:04:03 2025
  from Rognac, France via Telnet
- Thlc
  Sat Sep 13 16:32:19 2025
  from Rognac, France via SSH
- Thlc
  Sat Sep 13 15:41:11 2025
  from Rognac, France via SSH
- Thlc
  Sat Sep 13 07:56:03 2025
  from Rognac, France via SSH
- Gretchiie
  Sat Sep 13 07:22:10 2025
  from Derry, Nh via Telnet

System Info

Sysop:	Keyop
Location:	Huddersfield, West Yorkshire, UK
Users:	546
Nodes:	16 (0 / 16)
Uptime:	166:40:20
Calls:	10,385
Calls today:	2
Files:	14,057
Messages:	6,416,529

meaning of 'statistical variation'

Who's Online

Recent Visitors

System Info