Missingness in the T1DGC MHC fine-mapping SNP data: association with HLA genotype and potential influence on genetic association studies

Diabetes Obes Metab. 2009 Feb;11 Suppl 1(Suppl 1):101-7. doi: 10.1111/j.1463-1326.2008.01010.x.

Abstract

Aim: The absence or 'missingness' of single nucleotide polymorphism (SNP) assay values because of genotype or related factors of interest may bias association and other studies. Missingness was determined for the Type 1 Diabetes Genetics Consortium (T1DGC) Major Histocompatibility Complex (MHC) data and was found to vary across the region, ranging up to 11.1% of the non-null proband SNPs, with a median of 0.3%. We consider factors related to missingness in the T1DGC data and briefly assess its possible influence on association studies.
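
As a rough illustration of the summary quoted above (per-SNP missingness, its maximum and its median), the following minimal sketch computes the fraction of missing calls for each SNP. The data layout (a dict of call lists with `None` marking a missing value), the SNP names and the function name `missing_rates` are assumptions made for the example and are not taken from the T1DGC data or the authors' code.

```python
from statistics import median


def missing_rates(calls):
    """Return the fraction of missing calls per SNP.

    `calls` maps a (hypothetical) SNP id to a list of genotype calls,
    where None marks a missing assay value.
    """
    rates = {}
    for snp, values in calls.items():
        if not values:  # skip SNPs with no assay attempts at all
            continue
        rates[snp] = sum(v is None for v in values) / len(values)
    return rates


# Toy data, invented for illustration only.
toy_calls = {
    "snp_a": ["AA", "AG", None, "GG"],
    "snp_b": ["CC", "CC", "CT", "CC"],
    "snp_c": [None, None, "TT", "TT"],
}

rates = missing_rates(toy_calls)
max_rate = max(rates.values())
median_rate = median(rates.values())
print(rates, max_rate, median_rate)
```

On real data the same summary would be taken over all SNPs in the region, which is how a low median can coexist with a much higher maximum.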

Methods: We assessed associations of missingness in the SNP assay data with human leucocyte antigen (HLA) genotype of the individual and with SNP genotypes of the parents. Within-cohort analyses were combined over all cohorts either (i) using Mantel-Haenszel tests for two-by-two tables or (ii) by combining test statistics for larger tables and regression models. Mixed-effect regression models were used to assess association of the SNP genotypes with affected status of the offspring after adjustment for parental SNP genotypes, cohort membership and HLA genotypes. Log-linear models were used to assess association of missingness in the unaffected sib assays with SNP genotypes of the probands.
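
The Mantel-Haenszel combination of two-by-two tables across cohorts can be sketched as follows. This is a textbook form of the statistic under stated assumptions, not the authors' implementation: each cohort contributes one table whose rows are taken to be carriage of an HLA allele (yes/no) and whose columns are SNP call missing/present, the counts are invented, and the function name `mantel_haenszel` is hypothetical.

```python
import math


def mantel_haenszel(tables, correction=False):
    """Mantel-Haenszel chi-square (1 df) over stratified 2x2 tables.

    Each table is ((a, b), (c, d)). Returns (chi2, p_value, pooled_odds_ratio).
    """
    sum_a = sum_e = sum_v = 0.0
    or_num = or_den = 0.0
    for (a, b), (c, d) in tables:
        n = a + b + c + d
        if n < 2:  # a stratum this small carries no variance information
            continue
        sum_a += a
        # Expected count and hypergeometric variance of cell a under no association.
        sum_e += (a + b) * (a + c) / n
        sum_v += (a + b) * (c + d) * (a + c) * (b + d) / (n * n * (n - 1))
        # Terms of the Mantel-Haenszel pooled odds ratio.
        or_num += a * d / n
        or_den += b * c / n
    if sum_v == 0:
        raise ValueError("no informative strata")
    diff = abs(sum_a - sum_e)
    if correction:  # optional continuity correction
        diff = max(diff - 0.5, 0.0)
    chi2 = diff * diff / sum_v
    # Upper tail of a chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    odds_ratio = or_num / or_den if or_den else math.inf
    return chi2, p_value, odds_ratio


# Two hypothetical cohorts: rows = allele carried / not carried,
# columns = SNP call missing / present.
cohort_tables = [
    ((10, 90), (2, 98)),
    ((8, 92), (1, 99)),
]
chi2, p_value, odds_ratio = mantel_haenszel(cohort_tables)
print(chi2, p_value, odds_ratio)
```

Stratifying by cohort in this way keeps between-cohort differences in allele frequency or assay quality from being mistaken for an association between HLA genotype and missingness.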

Results: Missingness of SNP values near the HLA class I (A, B and C) and class II (DR, DQ and DP) loci is strongly associated with carriage of corresponding HLA genotypes within these groups. Similar associations pertain to missing values among the microsatellite data. In at least some of these cases, regions of missingness coincided with known deletion regions corresponding to the associated HLA haplotype. We conjecture that other regions of associated missingness may point to similar haplotypic deletions. Analysis of association patterns of SNP genotypes with affected status of offspring does not indicate strong informative missingness. However, association of missingness in proband data with parental SNP genotypes may impact transmission disequilibrium test (TDT)-type analyses. Comparisons of affected and unaffected siblings point to possible susceptibility regions near BAT4-LY6G5B-BAT5 and NOTCH4, additional to the classical HLA-DR3/4 alleles.
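
To see why genotype-dependent missingness matters for TDT-type analyses, consider the standard TDT statistic, which compares how often heterozygous parents transmit versus do not transmit a given allele to affected offspring. The sketch below uses the usual McNemar-type form of the statistic; the counts and the missingness scenario are invented for illustration and do not come from the T1DGC data.

```python
import math


def tdt(transmitted, untransmitted):
    """Classical TDT chi-square (1 df) from heterozygous-parent transmissions.

    `transmitted` / `untransmitted` count how often the allele of interest
    was or was not passed to an affected offspring.
    Returns (chi2, p_value).
    """
    total = transmitted + untransmitted
    if total == 0:
        raise ValueError("no informative transmissions")
    chi2 = (transmitted - untransmitted) ** 2 / total
    # Upper tail of a chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Hypothetical scenario: no true association (50 vs 50 transmissions),
# but probands who received the allele lose their call 20% of the time,
# so those transmissions drop out of the analysis.
full_chi2, full_p = tdt(50, 50)
observed_transmitted = 50 - int(0.2 * 50)  # 10 transmissions lost
biased_chi2, biased_p = tdt(observed_transmitted, 50)
print(full_chi2, biased_chi2)
```

In this toy scenario the statistic moves away from zero even though nothing is associated with disease, which is the kind of distortion that missingness correlated with genotype could introduce.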

Conclusions: Potentially informative missingness in SNP assay values in the MHC region may affect association and related analyses based on the T1DGC data. These results suggest that it would be prudent to assess the degree to which missingness may abrogate assessed SNP disease markers in such studies. Initial analyses based on comparison of affected and unaffected status in offspring suggest that these analyses, at least, may be little affected.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Chromosome Mapping
  • Cohort Studies
  • Diabetes Mellitus, Type 1 / genetics*
  • Gene Deletion
  • Genetic Markers
  • Genetic Predisposition to Disease / genetics*
  • Genotype
  • HLA Antigens / genetics*
  • Homozygote
  • Humans
  • Major Histocompatibility Complex / genetics*
  • Parents
  • Polymorphism, Single Nucleotide / genetics*

Substances

  • Genetic Markers
  • HLA Antigens