Combining Information from Multiple Data Sources to Assess Population Health

Archived Abstract of Former PSC Researcher

Raghunathan, Trivellore, Allison Rosen, Kassandra Lynn Messer, Patricia A. Berglund, Kaushik Ghosh, Paul Imbriano, Susan Stewart, Irina Bondarenko, et al. Forthcoming. "Combining Information from Multiple Data Sources to Assess Population Health." Journal of Survey Statistics and Methodology.

Information about an extensive set of health conditions on a well-defined sample of subjects is essential for assessing population health, gauging the impact of various policies, modeling costs, and studying health disparities. Unfortunately, there is no single data source that provides accurate information about health conditions. We combine information from several administrative and survey data sets to obtain model-based dummy variables for 107 health conditions (diseases, preventive measures, and screening for diseases) for elderly (age 65 and older) subjects in the Medicare Current Beneficiary Survey (MCBS) over the fourteen-year period, 1999-2012. The MCBS has prevalence of diseases assessed based on Medicare claims and provides detailed information on all health conditions but is prone to underestimation bias. The National Health and Nutrition Examination Survey (NHANES), on the other hand, collects self-reports and physical/laboratory measures only for a subset of the 107 health conditions. Neither source provides complete information, but we use them together to derive model-based corrected dummy variables in MCBS for the full range of existing health conditions using a missing data and measurement error model framework. We create multiply imputed dummy variables and use them to construct the prevalence rate and trend estimates. The broader goal, however, is to use these corrected or modeled dummy variables for a multitude of policy analysis, cost modeling, and analysis of other relationships either using them as predictors or as outcome variables.


Population Health Methodology

Browse | Search | Next

PSC In The News

RSS Feed icon

Erin Cech explains her research on the "passion principle," and how America's obsession with pursuing and pushing towards a "#dreamjob" is flawed

Shaefer notes success of initial CARES Act stimulus and concerns over a new round of COVID-19 shutdown support under a Biden presidency

More News


Faul's three-nation research to examine relationships between social factors and epigenetics

Open for Registration: Principles of Text Analysis Workshop

More Highlights

Connect with PSC follow PSC on Twitter Like PSC on Facebook