Missing genetic information in case-control family data with general semi-parametric shared frailty model

Lifetime Data Anal. 2011 Apr;17(2):175-94. doi: 10.1007/s10985-010-9178-5. Epub 2010 Dec 12.

Abstract

Case-control family data are now widely used to examine the role of gene-environment interactions in the etiology of complex diseases. In these types of studies, exposure levels are obtained retrospectively and, frequently, information on most risk factors of interest is available on the probands but not on their relatives. In this work we consider correlated failure time data arising from population-based case-control family studies with missing genotypes of relatives. We present a new method for estimating the age-dependent marginalized hazard function. The proposed technique has two major advantages: (1) it is based on the pseudo full likelihood function rather than a pseudo composite likelihood function, which usually suffers from substantial efficiency loss; (2) the cumulative baseline hazard function is estimated using a two-stage estimator instead of an iterative process. We assess the performance of the proposed methodology with simulation studies, and illustrate its utility on a real data example.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Adult
  • BRCA2 Protein / genetics*
  • Breast Neoplasms / genetics*
  • Case-Control Studies
  • Computer Simulation
  • Family
  • Female
  • Genotype
  • Humans
  • Middle Aged
  • Models, Genetic*
  • Models, Statistical*
  • Ubiquitin-Protein Ligases / genetics*

Substances

  • BRCA2 Protein
  • BRCA2 protein, human
  • BRAP protein, human
  • Ubiquitin-Protein Ligases