A major hurdle facing tuberculosis (TB) investigators who want to utilize a rapidly growing body of data from both systems biology approaches and omics technologies is the lack of a standard vocabulary for data annotation and reporting. Lacking a means to readily compare samples from different research groups, a significant quantity of potentially informative data is largely ignored by researchers. To facilitate standardizing data across studies, a simple ontology of TB terms was developed to provide a common vocabulary for annotating data sets. New terminology was developed to address animal models and experimental systems, and existing clinically focused terminology was modified and adapted. This ontology can be used to annotate host TB data in public databases and collaborations, thereby standardizing database searches and allowing researchers to more easily compare results. To demonstrate the utility of a standard TB ontology for host systems biology, a web application was developed to annotate and compare human and animal model gene expression data sets.
Keywords: GEO; Gene expression; Mycobacterium; Ontology; Transcriptomics.
Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.