De Novo Long-Read Genome Assembly and Annotation of the Luna Moth (Actias luna) Fully Resolves Repeat-Rich Silk Genes

Genome Biol Evol. 2024 Jul 3;16(7):evae148. doi: 10.1093/gbe/evae148.

Abstract

We present the first long-read de novo assembly and annotation of the luna moth (Actias luna) and provide the full characterization of heavy chain fibroin (h-fibroin), a long and highly repetitive gene (>20 kb) essential in silk fiber production. There are >160,000 described species of moths and butterflies (Lepidoptera), but only within the last 5 years have we begun to recover high-quality annotated whole genomes across the order that capture h-fibroin. Using PacBio HiFi reads, we produce the first high-quality long-read reference genome for this species. The assembled genome has a length of 532 Mb, a contig N50 of 16.8 Mb, an L50 of 14 contigs, and 99.4% completeness (BUSCO). Our annotation using Bombyx mori protein and A. luna RNAseq evidence captured a total of 20,866 genes at 98.9% completeness with 10,267 functionally annotated proteins and a full-length h-fibroin annotation of 2,679 amino acid residues.

Keywords: Lepidoptera; PacBio; fibroin; genome; moth; silk.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Bombyx / genetics
  • Fibroins* / genetics
  • Genome, Insect*
  • Insect Proteins / genetics
  • Molecular Sequence Annotation*
  • Moths* / genetics
  • Repetitive Sequences, Nucleic Acid
  • Silk / genetics

Substances

  • Fibroins
  • Silk
  • Insect Proteins

Associated data

  • BioProject/PRJNA1072661
  • figshare/10.6084/m9.figshare.25483375
  • figshare/10.6084/m9.figshare.25483282
  • figshare/10.6084/m9.figshare.25483330
  • figshare/10.6084/m9.figshare.25483372