Multiple imputation of baseline data in the cardiovascular health study

Alice M Arnold; Richard A Kronmal

doi:10.1093/aje/kwf156

Multiple imputation of baseline data in the cardiovascular health study

Am J Epidemiol. 2003 Jan 1;157(1):74-84. doi: 10.1093/aje/kwf156.

Authors

Alice M Arnold¹, Richard A Kronmal

Affiliation

¹ Department of Biostatistics, University of Washington, Seattle, WA, USA. arnolda@u.washington.edu

PMID: 12505893
DOI: 10.1093/aje/kwf156

Abstract

Most epidemiologic studies will encounter missing covariate data. Software packages typically used for analyzing data delete any cases with a missing covariate to perform a complete case analysis. The deletion of cases complicates variable selection when different variables are missing on different cases, reduces power, and creates the potential for bias in the resulting estimates. Recently, software has become available for producing multiple imputations of missing data that account for the between-imputation variability. The implementation of the software to impute missing baseline data in the setting of the Cardiovascular Health Study, a large, observational study, is described. Results of exploratory analyses using the imputed data were largely consistent with results using only complete cases, even in a situation where one third of the cases were excluded from the complete case analysis. There were few differences in the exploratory results across three imputations, and the combined results from the multiple imputations were very similar to results from a single imputation. An increase in power was evident and variable selection simplified when using the imputed data sets.

Publication types

Research Support, U.S. Gov't, P.H.S.

MeSH terms

Aged
Aged, 80 and over
Analysis of Variance
Black People
Cardiovascular Diseases / epidemiology*
Cardiovascular Diseases / etiology
Cause of Death
Cohort Studies
Data Collection / methods
Data Collection / standards
Data Interpretation, Statistical*
Epidemiologic Studies*
Female
Humans
Linear Models
Male
Mathematical Computing*
Models, Statistical*
Predictive Value of Tests
Proportional Hazards Models
Risk Factors
Software*
Survival Analysis
United States / epidemiology
Ventricular Remodeling

Abstract

Publication types

MeSH terms

Grants and funding