Estimation of genetic and environmental factors for binary traits using family data

Stat Med. 2004 Feb 15;23(3):449-65. doi: 10.1002/sim.1603.


While the family-based analysis of genetic and environmental contributions to continuous or Gaussian traits is now straightforward using the linear mixed models approach, the corresponding analysis of complex binary traits is still rather limited. In the latter we usually rely on twin studies or pairs of relatives, but these studies often have limited sample size or have difficulties in dealing with the dependence between the pairs. Direct analysis of extended family data can potentially overcome these limitations. In this paper, we will describe various genetic models that can be analysed using an extended family structure. We use the generalized linear mixed model to deal with the family structure and likelihood-based methodology for parameter inference. The method is completely general, accommodating arbitrary family structures and incomplete data. We illustrate the methodology in great detail using the Swedish birth registry data on pre-eclampsia, a hypertensive condition induced by pregnancy. The statistical challenges include the specification of sensible models that contain a relatively large number of variance components compared to standard mixed models. In our illustration the models will account for maternal or foetal genetic effects, environmental effects, or a combination of these and we show how these effects can be readily estimated using family data.

MeSH terms

  • Cluster Analysis
  • Family*
  • Female
  • Humans
  • Likelihood Functions
  • Linear Models
  • Male
  • Normal Distribution
  • Pre-Eclampsia / etiology*
  • Pre-Eclampsia / genetics
  • Pregnancy
  • Sweden
  • Twin Studies as Topic