Diploid genomic architecture of Nitzschia inconspicua, an elite biomass production diatom

Sci Rep. 2021 Aug 2;11(1):15592. doi: 10.1038/s41598-021-95106-3.

Abstract

A near-complete diploid nuclear genome and accompanying circular mitochondrial and chloroplast genomes have been assembled from the elite commercial diatom species Nitzschia inconspicua. The 50 Mbp haploid size of the nuclear genome is nearly double that of model diatom Phaeodactylum tricornutum, but 30% smaller than closer relative Fragilariopsis cylindrus. Diploid assembly, which was facilitated by low levels of allelic heterozygosity (2.7%), included 14 candidate chromosome pairs composed of long, syntenic contigs, covering 93% of the total assembly. Telomeric ends were capped with an unusual 12-mer, G-rich, degenerate repeat sequence. Predicted proteins were highly enriched in strain-specific marker domains associated with cell-surface adhesion, biofilm formation, and raphe system gliding motility. Expanded species-specific families of carbonic anhydrases suggest potential enhancement of carbon concentration efficiency, and duplicated glycolysis and fatty acid synthesis pathways across cytosolic and organellar compartments may enhance peak metabolic output, contributing to competitive success over other organisms in mixed cultures. The N. inconspicua genome delivers a robust new reference for future functional and transcriptomic studies to illuminate the physiology of benthic pennate diatoms and harness their unique adaptations to support commercial algae biomass and bioproduct production.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Biomass*
  • Carbonic Anhydrases / genetics
  • Contig Mapping
  • Diatoms / classification
  • Diatoms / genetics*
  • Diploidy*
  • Genome Size
  • Genome*
  • Genome, Chloroplast
  • Genome, Mitochondrial
  • Open Reading Frames / genetics
  • Phylogeny
  • Repetitive Sequences, Nucleic Acid / genetics
  • Sequence Analysis, DNA
  • Synteny / genetics

Substances

  • Carbonic Anhydrases