Tukey's gh distribution for multiple imputation

Publication Abstract

He, Y.L., and Trivellore Raghunathan. 2006. "Tukey's gh distribution for multiple imputation." American Statistician, 60(3): 251-256.

Tukey proposed a class of distributions, the g-and-h family (gh family), based on a transformation of a standard normal variable to accommodate different skewness and elongation in the distribution of variables arising in practical applications. It is easy to draw values from this distribution even though it is hard to explicitly state the probability density function. Given this flexibility, the gh family may be extremely useful in creating multiple imputations for missing data. This article demonstrates how this family, as well as its generalizations, can be used in the multiple imputation analysis of incomplete data. The focus of this article is on a scalar variable with missing values. In the absence of any additional information, data are missing completely at random, and hence the correct analysis is the complete-case analysis. Thus, the application of the gh multiple imputation to the scalar cases affords comparison with the correct analysis and with other model-based multiple imputation methods. Comparisons are made using simulated datasets and the data from a survey of adolescents ascertaining driving after drinking alcohol.
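The claim that values are easy to draw from this family can be illustrated with a minimal sketch. It assumes the commonly cited form of the g-and-h transformation, X = A + B * ((exp(g*Z) - 1) / g) * exp(h*Z^2 / 2) with Z standard normal, which may differ in detail from the parametrization used in the article. The function names `tukey_gh_transform` and `draw_gh` and the parameter names `loc` and `scale` are hypothetical and chosen only for illustration; this is not the authors' imputation procedure, only the sampling step it builds on.

```python
import math
import random


def tukey_gh_transform(z, g, h, loc=0.0, scale=1.0):
    """Map a standard normal value z to a g-and-h value.

    Assumed form: loc + scale * ((exp(g*z) - 1) / g) * exp(h * z**2 / 2).
    g controls skewness, h controls elongation (tail heaviness).
    """
    if g == 0.0:
        # Limit of (exp(g*z) - 1) / g as g -> 0 is z itself.
        skew_part = z
    else:
        # expm1 keeps precision when g*z is close to zero.
        skew_part = math.expm1(g * z) / g
    return loc + scale * skew_part * math.exp(h * z * z / 2.0)


def draw_gh(n, g, h, loc=0.0, scale=1.0, seed=None):
    """Draw n values by transforming independent standard normal draws."""
    rng = random.Random(seed)
    return [
        tukey_gh_transform(rng.gauss(0.0, 1.0), g, h, loc, scale)
        for _ in range(n)
    ]


if __name__ == "__main__":
    # Illustrative parameter values only; they are not taken from the article.
    print(draw_gh(5, g=0.5, h=0.1, seed=1))
```

With g = 0 and h = 0 the transformation reduces to loc + scale * z, so the normal distribution is a special case. No density evaluation is needed at any point, which is what makes sampling straightforward even though the density has no simple closed form.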

DOI: 10.1198/000313006X126819
