Candidate gene prioritization with Endeavour

Léon-Charles Tranchevent; Amin Ardeshirdavani; Sarah ElShal; Daniel Alcaide; Jan Aerts; Didier Auboeuf; Yves Moreau

doi:10.1093/nar/gkw365

Nucleic Acids Res. 2016 Jul 8;44(W1):W117-21. doi: 10.1093/nar/gkw365. Epub 2016 Apr 30.

Authors

Léon-Charles Tranchevent¹, Amin Ardeshirdavani², Sarah ElShal², Daniel Alcaide², Jan Aerts², Didier Auboeuf³, Yves Moreau²

Affiliations

¹ INSERM U1210, CNRS UMR5239, Laboratoire de Biologie et de Modélisation de la Cellule, Ecole Normale Supérieure de Lyon, Université de Lyon, 69364 Lyon, France; yves.moreau@esat.kuleuven.be.
² Department of Electrical Engineering (ESAT), STADIUS Center for Dynamical Systems, Signal Processing and Data Analytics Department, KU Leuven, B-3001 Leuven, Belgium; iMinds Future Health Department, KU Leuven, B-3001 Leuven, Belgium.
³ INSERM U1210, CNRS UMR5239, Laboratoire de Biologie et de Modélisation de la Cellule, Ecole Normale Supérieure de Lyon, Université de Lyon, 69364 Lyon, France.

Abstract

Genomic studies and high-throughput experiments often produce large lists of candidate genes, of which only a small fraction is truly relevant to the disease, phenotype or biological process of interest. Gene prioritization tackles this problem by profiling candidate genes across multiple genomic data sources and integrating this heterogeneous information into a global ranking. We describe an extended version of our gene prioritization method, Endeavour, now available for six species and integrating 75 data sources. The performance of Endeavour, measured as Area Under the Curve (AUC) on cross-validation benchmarks using 'gold standard' gene sets, varies from 88% (for human phenotypes) to 95% (for worm gene function). We have also validated our approach using a time-stamped benchmark derived from the Human Phenotype Ontology, which provides a setting close to prospective validation. On this benchmark, which comprises 3854 novel gene-phenotype associations, we observe an AUC of 82%. Altogether, our results indicate that this extended version of Endeavour efficiently prioritizes candidate genes. The Endeavour web server is freely available at https://endeavour.esat.kuleuven.be/.
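
The idea of fusing per-source rankings into one global ranking, and of scoring that ranking with an AUC, can be illustrated with a toy sketch. This is not the Endeavour implementation: the fusion rule (a plain average of rank ratios), the handling of missing genes, the reading of "Area Under the Curve" as the ROC AUC, and all names and scores (`rank_ratios`, `fuse_rankings`, `auc`, the gene labels and values) are assumptions made for illustration only.

```python
from statistics import mean


def rank_ratios(scores):
    """Map each gene to rank / n, where rank 1 is the highest score."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    n = len(ordered)
    return {gene: (i + 1) / n for i, gene in enumerate(ordered)}


def fuse_rankings(sources):
    """Fuse several score tables into one ranking, best candidate first.

    Assumption: the fused value is the mean rank ratio over the sources
    that contain the gene. This is a stand-in, not Endeavour's own rule.
    """
    per_source = [rank_ratios(s) for s in sources]
    genes = set().union(*(r.keys() for r in per_source))
    fused = {g: mean([r[g] for r in per_source if g in r]) for g in genes}
    # Ties are broken alphabetically so the output is deterministic.
    return sorted(fused, key=lambda g: (fused[g], g))


def auc(ranking, positives):
    """Fraction of (positive, negative) pairs with the positive ranked first."""
    pos = [i for i, g in enumerate(ranking) if g in positives]
    neg = [i for i, g in enumerate(ranking) if g not in positives]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative gene")
    wins = sum(1 for p in pos for q in neg if p < q)
    return wins / (len(pos) * len(neg))


# Hypothetical scores from two made-up data sources (higher = more similar
# to the genes already known to be involved).
expression = {"G1": 0.9, "G2": 0.2, "G3": 0.6, "G4": 0.1}
interactions = {"G1": 0.7, "G2": 0.8, "G3": 0.3, "G4": 0.2}

ranking = fuse_rankings([expression, interactions])
score = auc(ranking, positives={"G1", "G3"})
print(ranking, score)  # ['G1', 'G2', 'G3', 'G4'] 0.75
```

In this toy case G1 ranks near the top in both sources and so leads the fused list, while G3 is pushed below G2 by its weak interaction score, which costs one of the four positive/negative pairs and gives an AUC of 0.75.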

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Animals
Benchmarking
Genetic Association Studies
Genetic Predisposition to Disease*
Genotype*
Humans
Internet
Phenotype
Software*