A gene prioritization method based on a swine multi-omics knowledgebase and a deep learning model

Commun Biol. 2020 Sep 10;3(1):502. doi: 10.1038/s42003-020-01233-4.

Abstract

The analyses of multi-omics data have revealed candidate genes for objective traits. However, they are integrated poorly, especially in non-model organisms, and they pose a great challenge for prioritizing candidate genes for follow-up experimental verification. Here, we present a general convolutional neural network model that integrates multi-omics information to prioritize the candidate genes of objective traits. By applying this model to Sus scrofa, which is a non-model organism, but one of the most important livestock animals, the model precision was 72.9%, recall 73.5%, and F1-Measure 73.4%, demonstrating a good prediction performance compared with previous studies in Arabidopsis thaliana and Oryza sativa. Additionally, to facilitate the use of the model, we present ISwine ( http://iswine.iomics.pro/ ), which is an online comprehensive knowledgebase in which we incorporated almost all the published swine multi-omics data. Overall, the results suggest that the deep learning strategy will greatly facilitate analyses of multi-omics integration in the future.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Databases, Genetic
  • Deep Learning*
  • Genomics*
  • Neural Networks, Computer
  • Proteins / classification
  • Proteins / genetics*
  • Swine
  • Transcriptome / genetics*

Substances

  • Proteins