Background: Many species belonging to the genus Colletotrichum cause anthracnose disease on a wide range of plant species. In addition to its economic impact, the genus Colletotrichum is a useful model for studying the evolution of host specificity, speciation and reproductive behaviors. Genome projects of Colletotrichum species have already opened a new era for studying the evolution of pathogenesis in fungi.
Results: We sequenced and annotated the genomes of four strains in the Colletotrichum acutatum species complex (CAsc), a clade of broad-host-range pathogens within the genus. The four CAsc proteomes and secretomes, along with those representing an additional 13 species (six Colletotrichum spp. and seven other Sordariomycetes), were classified into protein families using a variety of tools. Hierarchical clustering of gene family and functional domain assignments, together with phylogenetic analyses, revealed lineage-specific losses of genes encoding carbohydrate-active enzymes (CAZymes) and proteases in Colletotrichum species that have a narrow host range, as well as duplications of these families in the CAsc. We also found a lineage-specific expansion of necrosis and ethylene-inducing peptide 1 (Nep1)-like protein (NLP) families within the CAsc.
Conclusions: This study illustrates the plasticity of Colletotrichum genomes and shows that major changes in host range are associated with relatively recent changes in gene content.
Keywords: Anthracnose; CAZyme; Colletotrichum spp.; Fungal genomics; Plant pathogen.