Chance News 111: Difference between revisions

From ChanceWiki
Jump to navigation Jump to search
Line 117: Line 117:
The authors of this op-ed are present and past presidents of the American Statistical Association.  They describe how Andreas Georgiou, the former chief statistician of Greece, is being prosecuted for his reporting on the Greek national debt.  The issue is not his work, which has actually produced a more accurate accounting, but rather the revelation that the debt is larger than previously understood, which will require stricter austerity measures.  Nussbaum and Wallman call for US government officials to stress the need for transparent statistical data, and more generally to condemn attacks on science.
The authors of this op-ed are present and past presidents of the American Statistical Association.  They describe how Andreas Georgiou, the former chief statistician of Greece, is being prosecuted for his reporting on the Greek national debt.  The issue is not his work, which has actually produced a more accurate accounting, but rather the revelation that the debt is larger than previously understood, which will require stricter austerity measures.  Nussbaum and Wallman call for US government officials to stress the need for transparent statistical data, and more generally to condemn attacks on science.


In the US, statisticians are not being prosecuted.  However, the story calls to mind how climate scientists have been  [https://www.theguardian.com/environment/2017/feb/22/climate-change-science-attacks-threats-trump subjected to harrassment].  Indeed, in February Congressman Lamar Smith (R-Texas), chair of House Science Committee, convened hearings on  [https://arstechnica.com/science/2017/02/house-science-committee-holds-hearing-on-making-epa-great-again/ “Making EPA Great Again”].
In the US, statisticians are not being prosecuted for their analyses.  However, the Greek saga calls to mind how climate scientists have been  [https://www.theguardian.com/environment/2017/feb/22/climate-change-science-attacks-threats-trump subjected to harrassment].  Indeed, in February Congressman Lamar Smith (R-Texas), chair of House Science Committee, convened hearings on  [https://arstechnica.com/science/2017/02/house-science-committee-holds-hearing-on-making-epa-great-again/ “Making EPA Great Again”].


In May, the director of the US Census Bureau John Thompson announced his resignation.  In an interview with NPR,[http://www.npr.org/sections/codeswitch/2017/07/15/536908867/could-a-census-without-a-leader-spell-trouble-in-2020 Could a census without a leader spell trouble In 2020?] (NPR "Code Switch", 15 July 2017), expressed confidence that work at the bureau would continue without disruption.  Some observers have expressed concern because the 2020 census is fast approaching.  Allocation of funds for many federal programs is
In May, the director of the US Census Bureau John Thompson announced his resignation.  In an interview with NPR,[http://www.npr.org/sections/codeswitch/2017/07/15/536908867/could-a-census-without-a-leader-spell-trouble-in-2020 Could a census without a leader spell trouble In 2020?] (NPR "Code Switch", 15 July 2017), expressed confidence that work at the bureau would continue without disruption.  Some observers have expressed concern because the 2020 census is fast approaching.  Allocation of funds for many federal programs is
[https://www.brookings.edu/wp-content/uploads/2016/06/0309_census_report.pdf dependent on census data].  The counts are also have implications for Congressional redistricting.  The Trump adminstration has so far been slow at [https://www.nytimes.com/interactive/2017/07/17/us/politics/trump-appointments.html filling top jobs].
[https://www.brookings.edu/wp-content/uploads/2016/06/0309_census_report.pdf dependent on census data].  The counts are also have implications for Congressional redistricting.  The Trump adminstration has so far been slow at [https://www.nytimes.com/interactive/2017/07/17/us/politics/trump-appointments.html filling top jobs].

Revision as of 19:20, 22 July 2017

Under contruction: July 1, 2017 to ...

Quotations

“In my darkest moods I follow what could be called the ‘Groucho principle’: because stories have gone through so many filters that encourage distortion and selection, the very fact that I am hearing a claim based on statistics is reason to disbelieve it.”

-- David Spiegelhalter (president of Royal Statistical Society), quoted in:
'Exaggerations' threaten public trust in science, says leading statistician, Guardian, 28 June 2017

"Aside from ruining the rest of the 21st century, the 2016 U.S. election inflicted spectacular collateral damage on the subject known as statistics."

-- Paul Alper, in: Occom, Mencken and Cohn, Higher Education Review, vol. 49, no.2

Forsooth

"Random" seat assignment?

Ryanair's 'random' seat allocation not random — scientists
by John von Radowitz, Irish Independent, 30 June 2017

Ryanair is an Irish low-cost airline. This Wikipedia entry reports that they have repeatedly faced criticism for misleading advertising.

The Irish Independent article concerns a recent controversy. When customers book Ryanair flights, they can pay to make a seat selection or else opt for "random" seat assignment, which is free. Advertising on the airline's website says, "Can't stand the middle seat? Don't leave it to chance, take your pick from a choice of seats. Get up to 50pc off reserved seats with prices starting at £2."

But is it up to chance? In light of customer complaints, the BBC consumer affairs show Watchdog sought expert opinion from Oxford University. To test the claim, researchers had four groups of four passengers book travel on four separate flights, all under the random seating option. On every flight, all of the passengers got middle seats. The odds of this happening were estimated at about 1:540,000,000. Compare this to the 1:45,000,000 odds of winning the UK National Lottery jackpot. The director of Oxford University's Statistical Consultancy, Dr. Jennifer Rogers, is quoted as saying, "This is a highly controversial topic and my analysis cast doubt on whether Ryanair's seat allocation can be purely random."

The article concludes with the following explanation from Ryanair, which qualifies as an extended Forsooth!

We haven't changed the random seat allocation policy.

The reason for more middle seats being allocated is that more and more passengers are taking our reserved seats (from just £2) and these passengers overwhelmingly prefer aisle and window seats which is why people who choose random (free of charge) seats are more likely to be allocated middle seats.

Some random seat passengers are confused by the appearance of empty seats beside them when they check-in up to four days prior to departure.

The reason they can't have these window or aisle seats is that these are more likely to be selected by reserved seat passengers, many of whom only check in 24 hours prior to departure.

Since our current load factor is 95pc, we have to keep these window and aisle seats free to facilitate those customers who are willing to pay (from £2) for them.

Submitted by Patrick O'Beirne

Sense and Sensibility and Statistics

From The Word Choices That Explain Why Jane Austen Endures
by Kathleen A. Flynn and Josh Katz, New York Times, 6 July 6, 2017.

Jane Austen' popularity has endured, and there may be something in her language that explains this. A principal components analysis of the words used by a large number books published from 1701 t0 1920 show that Austen's novels were unusual in her use of words related to time (always, fortnight and week) or emotion (awkward, decided, dislike, glad, sorry, suppose).

Jane Austen also uses a large number of intensifying words: quite, really, and very. These words are normally avoided by authors, but Austen uses them to develop a sense of irony. This fits in well with some non-statistical assessments.

Traditional literary approaches to Austen have long focused on this aspect of her work: “the incongruities between pretence and essence, between the large idea and the inadequate ego," as the critic Marvin Mudrick put it. A look at passages where words like very are used frequently often finds the stated meaning conceivably at odds with the real one, the exaggeration subtly inviting doubt."

A nice interactive plot shows the first two principal components and you can hover over individual data points to see the book title.

Submitted by Steve Simon

Gerrymandering

Partisan gerrymandering has benefited the GOP, analysis shows
NBCnews.com, 25 June 2017

Republican electoral gains in 2010 gave the party substantial control over the redistricting that followed the 2010 Census. NBC reports on a statistical analysis by the Associated Press showing the effect in subsequent elections and the state and national levels. For example, in the 2016 election, they estimate that Republicans won approximately 22 more seats in the US House of Representatives than they would have been obtained if representation was proportional to vote share. The AP website reports that Texas had the largest gain, nearly 4 extra seats.

Gerrymandering is of course nothing new. The reference to Texas brought to mind a famous story, retold in a 2002 story from the Economist (How to rig an election, 25 April 2002): "[It] used to be said of the old Texas 6th (which was a road from Houston to Dallas), that you could kill most of the constituents by driving down the road with the car doors open." However, the latest efforts are seen as particularly egregious. We read,

The AP’s analysis was based on an “efficiency gap” formula developed by University of Chicago law professor Nick Stephanopoulos and Eric McGhee, a researcher at the nonpartisan Public Policy Institute of California. Their mathematical model was cited last fall as “corroborative evidence” by a federal appeals court panel that struck down Wisconsin’s Assembly districts as an intentional partisan gerrymander in violation of Democratic voters’ rights to representation. The U.S. Supreme Court has agreed to hear an appeal.

Stephanopoulos and McGhee computed efficiency gaps for four decades of congressional and state House races starting in 1972, concluding the pro-Republican maps enacted after the 2010 Census resulted in “the most extreme gerrymanders in modern history.”

Here is how Stephanopoulos explained the efficiency gap idea in a 2014 article, Here's how we can end gerrymandering once and for all (New Republic, 2 July 2014):

The efficiency gap is simply the difference between the parties’ respective wasted votes in an election, divided by the total number of votes cast. Wasted votes are ballots that don’t contribute to victory for candidates, and they come in two forms: lost votes cast for candidates who are defeated, and surplus votes cast for winning candidates but in excess of what they needed to prevail. When a party gerrymanders a state, it tries to maximize the wasted votes for the opposing party while minimizing its own, thus producing a large efficiency gap. In a state with perfect partisan symmetry, both parties would have the same number of wasted votes.

NBC news also cites another analysis approach, introduced by the Princeton University Gerrymandering Project, launched this year by Sam Wang. It's inaugural post This analysis "found that the extreme Republican advantages in some states were no fluke. The Republican edge in Michigan's state House districts had only a 1-in-16,000 probability of occurring by chance; in Wisconsin's Assembly districts, there was a mere 1-in-60,000 likelihood of it happening randomly."

The FAQ on the Princeton Project site has a description of how the tests work.

Discussion
The figures quoted in the next to last paragraph are not "the probability of [the resulting advantage] occurring by chance." Why? How would you rewrite the statement?

More on mortality

Further criticism of social scientists and journalists jumping to conclusions based on mortality trends
by Andrew Gelman, "Statistical Modeling, Causal Inference, and Social Science" blog, 11 July 2017

In Deaths of despair from the last installment of Chance News, we described the Angus-Deaton work on middle-age mortality, including commentary by Gelman and others. In this latest post, Gelman includes links to some other articles suggested by his readers.

Gelman reiterates his support for the project, which he believe has raised important issues. But he notesthat the omission age-adjustment was strange, since "reporting trends in mortality rates without age adjusting is like reporting trends in nominal prices without adjusting for the consumer price index."

Gelman and collaborators have produced a wealth of graphics related to the story, which are packaged in another post, Easier-to-download graphs of age-adjusted mortality trends by sex, ethnicity, and age group.

Greece prosecutes former chief statistician

Greece’s troubling prosecution of its former chief statistician: Why Congress and State should speak up
by Barry D. Nussbaum and Katherine K. Wallman, Huffington Post, 7 July 2017

The authors of this op-ed are present and past presidents of the American Statistical Association. They describe how Andreas Georgiou, the former chief statistician of Greece, is being prosecuted for his reporting on the Greek national debt. The issue is not his work, which has actually produced a more accurate accounting, but rather the revelation that the debt is larger than previously understood, which will require stricter austerity measures. Nussbaum and Wallman call for US government officials to stress the need for transparent statistical data, and more generally to condemn attacks on science.

In the US, statisticians are not being prosecuted for their analyses. However, the Greek saga calls to mind how climate scientists have been subjected to harrassment. Indeed, in February Congressman Lamar Smith (R-Texas), chair of House Science Committee, convened hearings on “Making EPA Great Again”.

In May, the director of the US Census Bureau John Thompson announced his resignation. In an interview with NPR,Could a census without a leader spell trouble In 2020? (NPR "Code Switch", 15 July 2017), expressed confidence that work at the bureau would continue without disruption. Some observers have expressed concern because the 2020 census is fast approaching. Allocation of funds for many federal programs is dependent on census data. The counts are also have implications for Congressional redistricting. The Trump adminstration has so far been slow at filling top jobs.