CAFA-evaluator: a Python tool for benchmarking ontological classification methods

Damiano Piovesan; Davide Zago; Parnal Joshi; M Clara De Paolis Kaluza; Mahta Mehdiabadi; Rashika Ramola; Alexander Miguel Monzon; Walter Reade; Iddo Friedberg; Predrag Radivojac; Silvio C E Tosatto

doi:10.1093/bioadv/vbae043

Bioinform Adv. 2024 Mar 14;4(1):vbae043. doi: 10.1093/bioadv/vbae043. eCollection 2024.

Affiliations

¹ Department of Biomedical Sciences, University of Padova, 35121 Padova, Italy.
² Program in Bioinformatics and Computational Biology, Iowa State University, Ames, IA 50011, United States.
³ Department of Veterinary Microbiology and Preventive Medicine, Iowa State University, Ames, IA 50011, United States.
⁴ Khoury College of Computer Sciences, Northeastern University, Boston, MA 02115, United States.
⁵ Department of Information Engineering, University of Padova, 35121 Padova, Italy.
⁶ Kaggle, San Francisco, CA, United States.

Abstract

We present CAFA-evaluator, a Python program designed to evaluate the performance of prediction methods on targets with hierarchical concept dependencies. It generalizes multi-label evaluation to modern ontologies where the prediction targets are drawn from a directed acyclic graph, and it achieves high efficiency by leveraging matrix computation and topological sorting. The program depends on only a small number of standard Python libraries, which makes CAFA-evaluator easy to maintain. The code replicates the Critical Assessment of protein Function Annotation (CAFA) benchmarking, which evaluates predictions of the consistent subgraphs in Gene Ontology. Owing to its reliability and accuracy, the organizers have selected CAFA-evaluator as the official CAFA evaluation software.
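The idea of evaluating consistent subgraphs of a directed acyclic graph can be illustrated with a minimal sketch. It is not the CAFA-evaluator code or its API: the toy ontology, the function names, and the rule that an ancestor inherits the highest score among its descendants are all assumptions made for illustration, and plain dictionaries stand in for the matrix computation mentioned above. The sketch orders terms children-first with a topological sort, propagates scores up to the ancestors, and then computes precision and recall at a single threshold.

```python
from collections import deque

# Hypothetical toy ontology: each term maps to its list of parent terms (a DAG).
PARENTS = {
    "root": [],
    "A": ["root"],
    "B": ["root"],
    "C": ["A", "B"],  # two parents, which a tree could not express
    "D": ["C"],
}


def children_first_order(parents):
    """Order terms so that every term comes before its parents (Kahn's algorithm)."""
    # For each term, count the children that still have to be processed.
    pending_children = {term: 0 for term in parents}
    for term_parents in parents.values():
        for parent in term_parents:
            pending_children[parent] += 1
    queue = deque(t for t, n in pending_children.items() if n == 0)
    order = []
    while queue:
        term = queue.popleft()
        order.append(term)
        for parent in parents[term]:
            pending_children[parent] -= 1
            if pending_children[parent] == 0:
                queue.append(parent)
    if len(order) != len(parents):
        raise ValueError("ontology contains a cycle")
    return order


def propagate(scores, parents):
    """Assumed rule: every ancestor gets at least the highest score of its descendants."""
    full = {term: scores.get(term, 0.0) for term in parents}
    for term in children_first_order(parents):
        for parent in parents[term]:
            full[parent] = max(full[parent], full[term])
    return full


def precision_recall(pred, truth, threshold):
    """Term-level precision and recall at one score threshold."""
    predicted = {t for t, s in pred.items() if s >= threshold}
    true_terms = {t for t, s in truth.items() if s > 0}
    true_positives = len(predicted & true_terms)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(true_terms) if true_terms else 0.0
    return precision, recall


# A method scores two terms; the ground truth annotates term C only.
prediction = propagate({"D": 0.4, "A": 0.9}, PARENTS)
ground_truth = propagate({"C": 1.0}, PARENTS)
precision, recall = precision_recall(prediction, ground_truth, threshold=0.5)
```

After propagation both the prediction and the ground truth are consistent subgraphs, so a term is never counted without its ancestors. Sweeping the threshold and keeping the best harmonic mean of precision and recall would give an F-max-style summary; that step is omitted here.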

Availability and implementation: https://pypi.org/project/cafaeval.