Identification of novel candidate disease genes from de novo exonic copy number variants

Genome Med. 2017 Sep 21;9(1):83. doi: 10.1186/s13073-017-0472-7.

Abstract

Background: Exon-targeted microarrays can detect small (<1000 bp) intragenic copy number variants (CNVs), including those that affect only a single exon. This genome-wide high-sensitivity approach increases the molecular diagnosis for conditions with known disease-associated genes, enables better genotype-phenotype correlations, and facilitates variant allele detection allowing novel disease gene discovery.

Methods: We retrospectively analyzed data from 63,127 patients referred for clinical chromosomal microarray analysis (CMA) at Baylor Genetics laboratories, including 46,755 individuals tested using exon-targeted arrays, from 2007 to 2017. Small CNVs harboring a single gene or two to five non-disease-associated genes were identified; the genes involved were evaluated for a potential disease association.

Results: In this clinical population, among rare CNVs involving any single gene reported in 7200 patients (11%), we identified 145 de novo autosomal CNVs (117 losses and 28 intragenic gains), 257 X-linked deletion CNVs in males, and 1049 inherited autosomal CNVs (878 losses and 171 intragenic gains); 111 known disease genes were potentially disrupted by de novo autosomal or X-linked (in males) single-gene CNVs. Ninety-one genes, either recently proposed as candidate disease genes or not yet associated with diseases, were disrupted by 147 single-gene CNVs, including 37 de novo deletions and ten de novo intragenic duplications on autosomes and 100 X-linked CNVs in males. Clinical features in individuals with de novo or X-linked CNVs encompassing at most five genes (224 bp to 1.6 Mb in size) were compared to those in individuals with larger-sized deletions (up to 5 Mb in size) in the internal CMA database or loss-of-function single nucleotide variants (SNVs) detected by clinical or research whole-exome sequencing (WES). This enabled the identification of recently published genes (BPTF, NONO, PSMD12, TANGO2, and TRIP12), novel candidate disease genes (ARGLU1 and STK3), and further confirmation of disease association for two recently proposed disease genes (MEIS2 and PTCHD1). Notably, exon-targeted CMA detected several pathogenic single-exon CNVs missed by clinical WES analyses.

Conclusions: Together, these data document the efficacy of exon-targeted CMA for detection of genic and exonic CNVs, complementing and extending WES in clinical diagnostics, and the potential for discovery of novel disease genes by genome-wide assay.

Keywords: CNVs; Exon targeted array CGH; Intragenic copy number variants; de novo variants.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, N.I.H., Extramural

MeSH terms

  • Cohort Studies
  • DNA Copy Number Variations*
  • Exons*
  • Genetic Diseases, Inborn*
  • Genome, Human
  • Homeodomain Proteins / genetics
  • Humans
  • Intracellular Signaling Peptides and Proteins / genetics
  • Membrane Proteins / genetics
  • Neurodevelopmental Disorders / genetics
  • Protein-Serine-Threonine Kinases / genetics
  • Retrospective Studies
  • Transcription Factors / genetics
  • Whole Genome Sequencing

Substances

  • ARGLU1 protein, human
  • Homeodomain Proteins
  • Intracellular Signaling Peptides and Proteins
  • MEIS2 protein, human
  • Membrane Proteins
  • PTCHD1 protein, human
  • Transcription Factors
  • STK3 protein, human
  • Protein-Serine-Threonine Kinases