Large-scale genome analysis of bovine commensal Escherichia coli reveals that bovine-adapted E. coli lineages are serving as evolutionary sources of the emergence of human intestinal pathogenic strains

Genome Res. 2019 Sep;29(9):1495-1505. doi: 10.1101/gr.249268.119. Epub 2019 Aug 22.


How pathogens evolve their virulence to humans in nature is a scientific issue of great medical and biological importance. Shiga toxin (Stx)-producing Escherichia coli (STEC) and enteropathogenic E. coli (EPEC) are the major foodborne pathogens that can cause hemolytic uremic syndrome and infantile diarrhea, respectively. The locus of enterocyte effacement (LEE)-encoded type 3 secretion system (T3SS) is the major virulence determinant of EPEC and is also possessed by major STEC lineages. Cattle are thought to be the primary reservoir of STEC and EPEC. However, genome sequences of bovine commensal E. coli are limited, and the emerging process of STEC and EPEC is largely unknown. Here, we performed a large-scale genomic comparison of bovine commensal E. coli with human commensal and clinical strains, including EPEC and STEC, at a global level. The analyses identified two distinct lineages, in which bovine and human commensal strains are enriched, respectively, and revealed that STEC and EPEC strains have emerged in multiple sublineages of the bovine-associated lineage. In addition to the bovine-associated lineage-specific genes, including fimbriae, capsule, and nutrition utilization genes, specific virulence gene communities have been accumulated in stx- and LEE-positive strains, respectively, with notable overlaps of community members. Functional associations of these genes probably confer benefits to these E. coli strains in inhabiting and/or adapting to the bovine intestinal environment and drive their evolution to highly virulent human pathogens under the bovine-adapted genetic background. Our data highlight the importance of large-scale genome sequencing of animal strains in the studies of zoonotic pathogens.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Cattle
  • Enteropathogenic Escherichia coli / classification
  • Enteropathogenic Escherichia coli / genetics
  • Escherichia coli / classification*
  • Escherichia coli / genetics
  • Escherichia coli / pathogenicity
  • Escherichia coli Infections / microbiology*
  • Escherichia coli Proteins / genetics
  • Evolution, Molecular
  • Gene Regulatory Networks
  • Genome, Bacterial
  • Humans
  • Phylogeny
  • Shiga-Toxigenic Escherichia coli / classification
  • Shiga-Toxigenic Escherichia coli / genetics
  • Shiga-Toxigenic Escherichia coli / pathogenicity
  • Symbiosis
  • Virulence Factors / genetics*
  • Whole Genome Sequencing / methods*


  • Escherichia coli Proteins
  • Virulence Factors