Data on the link between genomic integration of IS1548 and lineage of the strain obtained by bioinformatic analyses of sequenced genomes of Streptococcus agalactiae available at the National Center for Biotechnology Information database

Data Brief. 2019 Dec 31:28:105066. doi: 10.1016/j.dib.2019.105066. eCollection 2020 Feb.

Abstract

IS1548, a 1316-bp element of the ISAs1 family, affects the expression of several genes of the opportunistic pathogen Streptococcus agalactiae. Furthermore, certain lineages of S. agalactiae are more frequently associated with particular diseases than others [1, 2]. We took advantage of the release of the genome sequences of a large number of epidemiologically unrelated S. agalactiae strains of various origins to analyze the prevalence of IS1548 among S. agalactiae strains. To this end, the S. agalactiae genomes available at the National Center for Biotechnology Information (NCBI) database were searched by BLAST using IS1548 DNA sequences as queries. A sequence type (ST), based on the allelic profile of seven housekeeping genes, was assigned to each strain possessing IS1548. These strains were then grouped into clonal complexes (CCs). The data obtained make it possible to compare the sequenced genomes of S. agalactiae according to their lineage and/or possession of IS1548, and to select the corresponding strains for comparative experimental studies. The data are related to the research article "Dual and divergent transcriptional impact of IS1548 insertion upstream of the peptidoglycan biosynthesis murB gene of Streptococcus agalactiae" [2].
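The typing step described above (an ST assigned from a seven-locus allelic profile, then STs grouped into CCs) can be illustrated with a minimal Python sketch. It is not the authors' pipeline. The locus names are those of the standard S. agalactiae MLST scheme; the allele numbers, the ST and CC labels, and the rule that a profile joins a CC when it shares at least six of seven alleles with a founder profile are all assumptions made for illustration, since the abstract does not state how CCs were defined.

```python
from typing import Dict, Optional, Tuple

# The seven housekeeping loci of the S. agalactiae MLST scheme.
LOCI = ("adhP", "pheS", "atr", "glnA", "sdhA", "glcK", "tkt")

Profile = Tuple[int, ...]

# Toy lookup table: hypothetical allele numbers and labels, NOT real PubMLST data.
TOY_ST_TABLE: Dict[Profile, str] = {
    (1, 1, 1, 1, 1, 1, 1): "ST-A",
    (1, 1, 1, 1, 1, 1, 2): "ST-B",
    (5, 4, 6, 3, 2, 1, 3): "ST-C",
}


def assign_st(alleles: Dict[str, int],
              table: Dict[Profile, str] = TOY_ST_TABLE) -> Optional[str]:
    """Return the ST whose profile matches all seven loci, or None if unknown."""
    profile = tuple(alleles[locus] for locus in LOCI)
    return table.get(profile)


def shared_alleles(p: Profile, q: Profile) -> int:
    """Count the loci at which two profiles carry the same allele."""
    return sum(a == b for a, b in zip(p, q))


def assign_cc(profile: Profile,
              founders: Dict[str, Profile],
              min_shared: int = 6) -> Optional[str]:
    """Assign a profile to the founder it shares the most alleles with.

    Assumption: membership requires at least `min_shared` identical loci.
    Returns None when no founder reaches that threshold.
    """
    best_cc: Optional[str] = None
    best_shared = min_shared - 1
    for cc, founder in founders.items():
        n = shared_alleles(profile, founder)
        if n > best_shared:
            best_cc, best_shared = cc, n
    return best_cc


if __name__ == "__main__":
    strain = {"adhP": 1, "pheS": 1, "atr": 1, "glnA": 1,
              "sdhA": 1, "glcK": 1, "tkt": 2}
    founders = {"CC-A": (1, 1, 1, 1, 1, 1, 1)}  # hypothetical founder profile
    profile = tuple(strain[locus] for locus in LOCI)
    print(assign_st(strain), assign_cc(profile, founders))
```

In practice the allelic profiles and ST definitions would come from a curated MLST database rather than a hard-coded table, and the CC grouping rule should follow whatever definition the study adopts.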

Keywords: Clonal complex; ISAs1 family; Mobile genetic element; Multi locus sequence typing; Population structure; Sequence type.