Sandbox: Difference between revisions

From ChanceWiki
Jump to navigation Jump to search
Line 11: Line 11:
These stories describe problems with the Census Bureau' [http://usa.ipums.org/usa/ IPUMS] (Integrated Public Use Mircodata Series) data, which provides subsamples of Census data to outside researchers.  In order to protect the privacy of citizens, the records are altered slightly.  For example, incomes may be rounded and ages may be tweaked by a small amount.  Ideally this would make it impossible to identify any particular individual, while at the same time not introducing any important distortion into the overall demographic profile.
These stories describe problems with the Census Bureau' [http://usa.ipums.org/usa/ IPUMS] (Integrated Public Use Mircodata Series) data, which provides subsamples of Census data to outside researchers.  In order to protect the privacy of citizens, the records are altered slightly.  For example, incomes may be rounded and ages may be tweaked by a small amount.  Ideally this would make it impossible to identify any particular individual, while at the same time not introducing any important distortion into the overall demographic profile.


Unfortunately, it appears that serious distortions have resulted.  The Wall Street Journal article has an [http://online.wsj.com/article/SB10001424052748704533204575047241321811712.html#project%3Dnumbguy0204%26articleTabs%3Dinteractive interactive graphic]
Unfortunately, it appears that serious distortions have resulted.  For example, The Wall Street Journal article has an [http://online.wsj.com/article/SB10001424052748704533204575047241321811712.html#project%3Dnumbguy0204%26articleTabs%3Dinteractive interactive graphic], illustrating how data released in 2006 showed year-to-year  fluctuations in the percentage of women who were married;  those errors were corrected in 2007.


Submitted by Bill Peterson
Submitted by Bill Peterson

Revision as of 02:22, 26 February 2010

Census errors

Can you trust Census data?
Freakonomics blog, New York Times, 2 February 2010
Justin Wolfers

Census Bureau obscured personal data—Too well, some say
Numbers Guy blog, Wall Street Journal, 6 February 2010
Carl Bialik

These stories describe problems with the Census Bureau' IPUMS (Integrated Public Use Mircodata Series) data, which provides subsamples of Census data to outside researchers. In order to protect the privacy of citizens, the records are altered slightly. For example, incomes may be rounded and ages may be tweaked by a small amount. Ideally this would make it impossible to identify any particular individual, while at the same time not introducing any important distortion into the overall demographic profile.

Unfortunately, it appears that serious distortions have resulted. For example, The Wall Street Journal article has an interactive graphic, illustrating how data released in 2006 showed year-to-year fluctuations in the percentage of women who were married; those errors were corrected in 2007.

Submitted by Bill Peterson