Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs) system is a novel type of innate defense system in prokaryotes for destruction of exogenous elements. To gain further insight into behavior and organization of the system, the extensive analysis of the available sequenced genomes is necessary. The dynamic nature of CRISPR loci is possibly valuable for typing and relative analyses of strains and microbial population. There are a few orderly bioinformatics investigations about the structure of CRISPR sequences in the Escherichia coli strains. In this study, 57 CRISPR loci were selected from 32 Escherichia coli strains to investigate their structural characteristics and potential functions using bioinformatics tools. Our results showed that most strains contained several loci that mainly included conserved direct repeats, while the spacers were highly variable. Moreover, RNA analysis of the sequences indicated that all loci could form stable RNA secondary structures and showed homology mostly with phages compared to plasmids. Only three strains included cas genes around their loci.
Keywords: CRISPR; Cas; Direct repeat; Escherichia coli; RNA structure.
© 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.