Genomic characterization of Haemophilus influenzae: a focus on the capsule locus

BMC Genomics. 2019 Oct 12;20(1):733. doi: 10.1186/s12864-019-6145-8.

Abstract

Background: Haemophilus influenzae (Hi) can cause invasive diseases such as meningitis, pneumonia, or sepsis. Typeable Hi includes six serotypes (a through f), each expressing a unique capsular polysaccharide. The capsule, encoded by the genes within the capsule locus, is a major virulence factor of typeable Hi. Non-typeable (NTHi) does not express capsule and is associated with invasive and non-invasive diseases.

Methods: A total of 395 typeable and 293 NTHi isolates were characterized by whole genome sequencing (WGS). Phylogenetic analysis and multilocus sequence typing were used to characterize the overall genetic diversity. Pair-wise comparisons were used to evaluate the capsule loci. A WGS serotyping method was developed to predict the Hi serotype. WGS serotyping results were compared to slide agglutination (SAST) or real-time PCR (rt-PCR) serotyping.

Results: Isolates of each Hi serotype clustered into one or two subclades, with each subclade being associated with a distinct sequence type (ST). NTHi isolates were genetically diverse, with seven subclades and 125 STs being detected. Regions I and III of the capsule locus were conserved among the six serotypes (≥82% nucleotide identity). In contrast, genes in Region II were less conserved, with only six gene pairs from all serotypes showing ≥56% nucleotide identity. The WGS serotyping method was 99.9% concordant with SAST and 100% concordant with rt-PCR in determining the Hi serotype.

Conclusions: Genomic analysis revealed a higher degree of genetic diversity among NTHi compared to typeable Hi. The WGS serotyping method accurately predicted the Hi capsule type and can serve as an alternative method for Hi serotyping.

Keywords: Capsule locus; Genetic diversity; Haemophilus influenzae; Multilocus sequence typing; Serotype; Whole genome sequencing.

MeSH terms

  • Bacterial Capsules / genetics*
  • Bacterial Typing Techniques
  • Genetic Variation
  • Genomics / methods*
  • Haemophilus influenzae / classification*
  • Haemophilus influenzae / genetics
  • Haemophilus influenzae / isolation & purification
  • Humans
  • Multilocus Sequence Typing
  • Phylogeny
  • Polysaccharides, Bacterial / genetics*
  • Serotyping
  • Virulence Factors / genetics
  • Whole Genome Sequencing

Substances

  • Polysaccharides, Bacterial
  • Virulence Factors