Integration of RNA-seq and proteomics data with genomics for improved genome annotation in Apicomplexan parasites

Proteomics. 2015 Aug;15(15):2557-9. doi: 10.1002/pmic.201500253.

Abstract

While high quality genomic sequence data is available for many pathogenic organisms, the corresponding gene annotations are often plagued with inaccuracies that can hinder research that utilizes such genomic data. Experimental validation of gene models is clearly crucial in improving such gene annotations; the field of proteogenomics is an emerging area of research wherein proteomic data is applied to testing and improving genetic models. Krishna et al. [Proteomics 2015, 15, 2618-2628] investigated whether incorporation of RNA-seq data into proteogenomics analyses can contribute significantly to validation studies of genome annotation, in two important parasitic organisms Toxoplasma gondii and Neospora caninum. They applied a systematic approach to combine new and previously published proteomics data from T. gondii and N. caninum with transcriptomics data, leading to substantially improved gene models for these organisms. This study illustrates the importance of incorporating experimental data from both proteomics and RNA-seq studies into routine genome annotation protocols.

Keywords: Apicomplexa; Neospora; Proteogenomics; Protozoa; Toxoplasma.

Publication types

  • Research Support, N.I.H., Extramural
  • Comment

MeSH terms

  • Genomics / methods*
  • Neospora / genetics*
  • Neospora / metabolism*
  • Proteomics / methods*
  • Toxoplasma / genetics*
  • Toxoplasma / metabolism*