A case study of record linkage

Archived Abstract of Former PSC Researcher

Valliant, Richard L., Marianne Winglee, and Fritz Scheuren. 2005. "A case study of record linkage." Survey Methodology, 31(1): 4-12.

Record linkage is a process of pairing records from two files and trying to select the pairs that belong to the same entity. The basic framework uses a match weight to measure the likelihood of a correct match and a decision rule to assign record pairs as "true" or "false" match pairs. Weight thresholds for selecting a record pair as matched or unmatched depend on the desired control over linkage errors. Current methods to determine the selection thresholds and estimate linkage errors can provide divergent results, depending on the type of linkage error and the approach to linkage. This paper presents a case study that uses existing linkage methods to link record pairs but a new simulation approach (SimRate) to help determine selection thresholds and estimate linkage errors. SimRate uses the observed distribution of data in matched and unmatched pairs to generate a large simulated set of record pairs, assigns a match weight to each pair based on specified match rules, and uses the weight curves of the simulated pairs for error estimation.
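The weight-and-threshold idea in the abstract can be illustrated with a small simulation. What follows is a minimal sketch only, not the authors' SimRate implementation: the field names, the agreement probabilities, the assumption that fields agree independently, and the helper names (`match_weight`, `simulate_pairs`, `error_rates`) are all hypothetical choices made for illustration. In practice the agreement probabilities would be taken from the observed distribution of data in matched and unmatched pairs rather than assumed.

```python
import math
import random

# Assumed per-field agreement probabilities (illustrative values, not from the paper).
# M_PROBS[f]: P(field f agrees | pair is a true match)
# U_PROBS[f]: P(field f agrees | pair is not a match)
M_PROBS = {"surname": 0.95, "birth_year": 0.90, "zip": 0.85}
U_PROBS = {"surname": 0.02, "birth_year": 0.05, "zip": 0.10}


def match_weight(agreement):
    """Sum log2 likelihood ratios over fields (a Fellegi-Sunter style weight)."""
    weight = 0.0
    for field, agrees in agreement.items():
        m, u = M_PROBS[field], U_PROBS[field]
        if agrees:
            weight += math.log2(m / u)
        else:
            weight += math.log2((1 - m) / (1 - u))
    return weight


def simulate_pairs(probs, n, rng):
    """Draw n agreement patterns, treating fields as independent (an assumption)."""
    return [{f: rng.random() < p for f, p in probs.items()} for _ in range(n)]


def error_rates(threshold, matched_weights, unmatched_weights):
    """Estimate linkage error rates when pairs with weight >= threshold are linked."""
    false_match = sum(w >= threshold for w in unmatched_weights) / len(unmatched_weights)
    false_nonmatch = sum(w < threshold for w in matched_weights) / len(matched_weights)
    return false_match, false_nonmatch


rng = random.Random(0)  # fixed seed so the run is repeatable
matched_weights = [match_weight(a) for a in simulate_pairs(M_PROBS, 20000, rng)]
unmatched_weights = [match_weight(a) for a in simulate_pairs(U_PROBS, 20000, rng)]

# Trace the two error curves across candidate thresholds.
for t in (-5.0, 0.0, 5.0, 10.0):
    fm, fnm = error_rates(t, matched_weights, unmatched_weights)
    print(f"threshold={t:5.1f}  false-match={fm:.4f}  false-nonmatch={fnm:.4f}")
```

Raising the threshold can only lower the estimated false-match rate and raise the estimated false-nonmatch rate, so a threshold can be read off the two curves according to how much of each linkage error is tolerable.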
