Home > Research . Search . Country . Browse . Small Grants

PSC In The News

RSS Feed icon

Adhvaryu on how promoting worker welfare contributes to profitability in India's garment industry

Murphy says suburban communities that declined in the 1960s fared better than those declining since the Great Recession

Levy et al find state budget gains outweigh Medicaid expansion costs in Michigan

More News


Live coverage of former Census director on crucial issues surrounding Census 2020. TODAY 2 pm.

PDHP invites applications for Faculty Small Grants in support of population science

ISR seeking applicants for new Community Guides program

PRB policy communication training for pre-docs extends application deadline to March 12

More Highlights

Next Brown Bag

Mon, April 2, 2018, noon: Sean Reardon on Educational Inequality

Martha J. Bailey photo

How Does Automated Record Linkage Affect Inferences about Population Health?

a PSC Research Project

Investigators:   Martha J. Bailey, Catherine Massey, Eytan Adar

This project compares the performance of automated linking algorithms with the goal of improving their potential. Automated linking methods are required to complete the NSF-funded Longitudinal Intergenerational Family Electronic Micro-dataset (LIFE-M), which will link millions of US vital records to historical decennial census records to create an extensive longitudinal dataset covering individuals born in the US from 1880 to 1930. This analysis emanates from that need.

The project will produce systematic evidence regarding the performance of the most popular automated linking methods in terms of match rates, representativeness of the underlying population, erroneous match rates, and systematic measurement error. It will also examine how phonetic name-cleaning methods affect quality. Significantly, the project will analyze how match quality metrics vary for different underrepresented subgroups - including women, racial/ethnic minorities, and immigrants - to determine how specific linking methods could differentially affect inferences for different populations. Finally, the project will formulate recommended practices for researchers based upon the findings.

Funding Period: 09/15/2017 to 05/31/2019

Search . Browse