On the Validity of Using Census Geocode Characteristics to Proxy Individual Socioeconomic Characteristics

Publication Abstract

Geronimus, Arline T., John Bound, and Lisa Neidert. 1996. "On the Validity of Using Census Geocode Characteristics to Proxy Individual Socioeconomic Characteristics." Journal of the American Statistical Association, 91(434): 529-537.

Investigators of social differentials in health outcomes commonly augment incomplete microdata by appending socioeconomic characteristics of residential areas (such as median income in a zip code) to proxy for individual characteristics. But little empirical attention has been paid to how well this aggregate information serves as a proxy for the individual characteristics of interest. The authors build on recent work addressing the biases inherent in proxies and consider two health-related examples within a statistical framework that illuminates the nature and sources of biases. Data from the Panel Study of Income Dynamics and the National Maternal and Infant Health Survey are linked to census data. The authors assess the validity of using the aggregate census information as a proxy for individual information when estimating main effects and when controlling for potential confounding between socioeconomic and sociodemographic factors in measures of general health status and infant mortality. They find a general, but not universal, tendency for aggregate proxies to exaggerate the effects of micro-level variables and to do more poorly than micro-level variables at controlling for confounding. The magnitude and direction of these biases vary across samples, however. The authors' statistical framework and empirical findings suggest the difficulties in and limits to interpreting proxies derived from aggregate census data as if they were micro-level variables. The statistical framework outlined for this study of health outcomes should be generally applicable to other situations where researchers have merged aggregate data with microdata samples.

Dataset(s): Panel Study of Income Dynamics (PSID): U.S., 1985. National Maternal and Infant Health Survey (NMIHS): U.S., 1988.
