Gene prioritization for livestock diseases by data integration

Physiol Genomics. 2012 Mar 1;44(5):305-17. doi: 10.1152/physiolgenomics.00047.2011. Epub 2012 Jan 10.


Identifying causal genes that underlie complex traits such as susceptibility to disease is a primary aim of genetic and biomedical studies. Genetic mapping of quantitative trait loci (QTL) and gene expression profiling based on high-throughput technologies are common first approaches toward identifying associations between genes and traits; however, it is often difficult to assess whether the biological function of a putative candidate gene is consistent with a particular phenotype. Here, we have implemented a network-based disease gene prioritization approach for ranking genes associated with quantitative traits and diseases in livestock species. The approach uses ortholog mapping and integrates information on disease or trait phenotypes, gene-associated phenotypes, and protein-protein interactions. It was used to rank all known genes in the cattle genome for their potential roles in bovine mastitis. A gene-associated phenome profile and a transcriptome profile of the response to Escherichia coli infection in the mammary gland were integrated to make a global inference of bovine genes involved in mastitis. The top-ranked genes were highly enriched for pathways and biological processes underlying inflammation and immune responses, which supports the validity of our approach for identifying genes that are relevant to animal health and disease. The gene-associated phenotypes were also used for a local prioritization of candidate genes located in a QTL affecting susceptibility to mastitis. Our study provides a general framework for prioritizing genes associated with various complex traits in different species. To our knowledge, this is the first time that gene expression, ortholog mapping, protein interactions, and biomedical text data have been integrated systematically for ranking candidate genes in any livestock species.
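
The general idea of network-based prioritization described above can be illustrated with a minimal sketch. This is not the scoring scheme used in the study, which the abstract does not specify; it assumes a simple rule in which a gene's score mixes its own evidence (phenotype similarity to the disease plus expression response) with the mean evidence of its protein-protein interaction neighbours. The function name `prioritize`, the weight `alpha`, the placeholder gene names, and all numeric values are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def prioritize(genes, ppi_edges, phenotype_score, expression_score, alpha=0.5):
    """Rank genes by direct evidence plus mean evidence of PPI neighbours.

    Hypothetical scoring rule for illustration only:
    score = alpha * direct(gene) + (1 - alpha) * mean(direct(neighbours)).
    """
    # Build an undirected adjacency map from the interaction pairs.
    neighbours = defaultdict(set)
    for a, b in ppi_edges:
        neighbours[a].add(b)
        neighbours[b].add(a)

    def direct(gene):
        # Direct evidence: phenotype similarity plus expression response.
        # Genes without an entry contribute 0.0.
        return phenotype_score.get(gene, 0.0) + expression_score.get(gene, 0.0)

    ranked = []
    for gene in genes:
        nbrs = neighbours[gene]
        nbr_mean = sum(direct(n) for n in nbrs) / len(nbrs) if nbrs else 0.0
        ranked.append((gene, alpha * direct(gene) + (1 - alpha) * nbr_mean))

    # Highest combined score first.
    ranked.sort(key=lambda item: item[1], reverse=True)
    return ranked


# Toy inputs with placeholder gene names and made-up scores.
genes = ["geneA", "geneB", "geneC", "geneD"]
ppi_edges = [("geneA", "geneB"), ("geneB", "geneC")]
phenotype_score = {"geneA": 0.9, "geneB": 0.2}
expression_score = {"geneA": 0.5, "geneC": 0.4}

ranking = prioritize(genes, ppi_edges, phenotype_score, expression_score)
for gene, score in ranking:
    print(f"{gene}\t{score:.2f}")
```

In this toy example, geneB has weak direct evidence but ranks above geneC because it interacts with the strongly supported geneA, which is the kind of "guilt by association" effect that network-based integration is meant to capture. A local prioritization within a QTL could reuse the same function by restricting `genes` to those located in the interval.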

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Cattle
  • Cattle Diseases / genetics*
  • Data Interpretation, Statistical
  • Female
  • Gene Expression Profiling
  • Gene Regulatory Networks / physiology
  • Genetic Predisposition to Disease*
  • Genomics
  • Livestock / genetics*
  • Mastitis, Bovine / genetics*
  • Phenotype
  • Research
  • Systems Integration*
  • Validation Studies as Topic