Novel Megaptera novaeangliae (Humpback whale) haplotype chromosome-level reference genome

Sci Data. 2024 Oct 10;11(1):1113. doi: 10.1038/s41597-024-03922-9.

Abstract

The sequencing of a kidney sample (KW2013002) from a stranded Megaptera novaeangliae (Humpback whale) calf is the first chromosome-level reference genome for this species1. The calf, a 457 cm and 2,500 lbs male, was found stranded in Hawai'i Kai, HI, in 2013 and was marked as abandoned/orphaned. In 2023, 1 g of kidney was sequenced with PacBio long-read DNA sequencing, chromatin conformation capture (Hi-C), RNA sequencing, and mitochondrial sequencing to comprehensively characterize the genome and transcriptome of M. novaeangliae. Data validation includes a synteny analysis, mitochondrial annotation, and a comparison of BUSCO scores (scaffold v. reference genome and Balaenoptera musculus (Blue whale) v. M. novaeangliae). BUSCO analysis was performed on an M. novaeangliae scaffold-level assembly to determine genomic completeness of the reference genome, with a scaffold BUSCO score of 91.2% versus a score of 95.4%. Synteny analysis was performed using the B. musculus genome as comparison to determine chromosome-level coverage and structure. Further, a time-based phylogenetic tree was constructed using the sequenced data and publicly available genomes.

Publication types

  • Dataset

MeSH terms

  • Animals
  • Genome*
  • Haplotypes*
  • Humpback Whale* / genetics
  • Male
  • Sequence Analysis, DNA