Genome and transcriptome characterization of the glycoengineered Nicotiana benthamiana line ΔXT/FT

BMC Genomics. 2019 Jul 19;20(1):594. doi: 10.1186/s12864-019-5960-2.

Abstract

Background: The allotetraploid tobacco species Nicotiana benthamiana native to Australia has become a popular host for recombinant protein production. Although its usage grows every year, little is known on this plant's genomic and transcriptomic features. Most N. benthamiana accessions currently used in research lack proper documentation of their breeding history and provenance. One of these, the glycoengineered N. benthamiana line ΔXT/FT is increasingly used for the production of biopharmaceutical proteins.

Results: Based on an existing draft assembly of the N. benthamiana genome we predict 50,516 protein -encoding genes (62,216 transcripts) supported by expression data derived from 2.35 billion mRNA-seq reads. Using single-copy core genes we show high completeness of the predicted gene set. We functionally annotate more than two thirds of the gene set through sequence homology to genes from other Nicotiana species. We demonstrate that the expression profiles from leaf tissue of ΔXT/FT and its wild type progenitor only show minimal differences. We identify the transgene insertion sites in ΔXT/FT and show that one of the transgenes was inserted inside another predicted gene that most likely lost its function upon insertion. Based on publicly available mRNA-seq data, we confirm that the N. benthamiana accessions used by different research institutions most likely derive from a single source.

Conclusions: This work provides gene annotation of the N. benthamiana genome, a genomic and transcriptomic characterization of a transgenic N. benthamiana line in comparison to its wild-type progenitor, and sheds light onto the relatedness of N. benthamiana accessions that are used in laboratories around the world.

Keywords: Accession history; Gene prediction; Genome; Intraspecific variation; Nicotiana benthamiana; Transgene.

MeSH terms

  • Gene Expression Profiling*
  • Genetic Variation
  • Genomics*
  • Glycoproteins / genetics*
  • Molecular Sequence Annotation
  • Nicotiana / genetics*
  • Protein Engineering*
  • Transgenes / genetics

Substances

  • Glycoproteins