Learning a genome-wide score of human-mouse conservation at the functional genomics level

Nat Commun. 2021 May 3;12(1):2495. doi: 10.1038/s41467-021-22653-8.

Abstract

Identifying genomic regions with functional genomic properties that are conserved between human and mouse is an important challenge in the context of mouse model studies. To address this, we develop a method to learn a score of evidence of conservation at the functional genomics level by integrating information from a compendium of epigenomic, transcription factor binding, and transcriptomic data from human and mouse. The method, Learning Evidence of Conservation from Integrated Functional genomic annotations (LECIF), trains neural networks to generate this score for the human and mouse genomes. The resulting LECIF score highlights human and mouse regions with shared functional genomic properties and captures correspondence of biologically similar human and mouse annotations. Analysis with independent datasets shows the score also highlights loci associated with similar phenotypes in both species. LECIF will be a resource for mouse model studies by identifying loci whose functional genomic properties are likely conserved.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Computational Biology / methods*
  • DNA / genetics*
  • Epigenomics / methods
  • Genome, Human / genetics*
  • Genomics / methods
  • Humans
  • Mice
  • Sequence Analysis, DNA
  • Sequence Homology, Nucleic Acid*
  • Transcriptome / genetics*

Substances

  • DNA