A weighted estimating equation for linear regression with missing covariate data

Stat Med. 2002 Aug 30;21(16):2421-36. doi: 10.1002/sim.1195.

Abstract

Linear regression is one of the most popular statistical techniques. In linear regression analysis, missing covariate data occur often. A recent approach to analyse such data is a weighted estimating equation. With weighted estimating equations, the contribution to the estimating equation from a complete observation is weighted by the inverse 'probability of being observed'. In this paper, we propose a weighted estimating equation in which we wrongly assume that the missing covariates are multivariate normal, but still produces consistent estimates as long as the probability of being observed is correctly modelled. In simulations, these weighted estimating equations appear to be highly efficient when compared to the most efficient weighted estimating equation as proposed by Robins et al. and Lipsitz et al. However, these weighted estimating equations, in which we wrongly assume that the missing covariates are multivariate normal, are much less computationally intensive than the weighted estimating equations given by Lipsitz et al. We compare the weighted estimating equations proposed in this paper to the efficient weighted estimating equations via an example and a simulation study. We only consider missing data which are missing at random; non-ignorably missing data are not addressed in this paper.

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Adolescent
  • Child
  • Child, Preschool
  • Computer Simulation
  • Fever / blood
  • Heart Diseases / blood
  • Hemoglobins / analysis
  • Humans
  • Infant
  • Likelihood Functions
  • Linear Models*
  • Models, Biological*
  • Patient Admission
  • Respiratory Tract Diseases / blood

Substances

  • Hemoglobins