De novo phasing resolves haplotype sequences in complex plant genomes

Plant Biotechnol J. 2022 Jun;20(6):1031-1041. doi: 10.1111/pbi.13815. Epub 2022 Apr 9.

Abstract

Genome phasing is a recently developed assembly method that separates heterozygous eukaryotic genomic regions and builds haplotype-resolved assemblies. Most published de novo genomes ignore differences between haplotypes, so the available assemblies are consensus genomes consisting of haplotype mixtures, which increases the need for genome phasing. Here, we review the operating principles and characteristics of several freely available and widely used phasing tools (TrioCanu, FALCON-Phase, and ALLHiC). An examination of downstream analyses using haplotype-resolved genome assemblies in plants indicated significant differences among haplotypes in chromosomal rearrangements, sequence insertions, and the expression of specific alleles that contribute to the acquisition of the biological characteristics of plant species. Finally, we suggest directions for addressing the limitations of current genome-phasing methods. This review provides insights into the current progress, limitations, and future directions of de novo genome phasing, which will enable researchers to easily access and use genome phasing in studies involving highly heterozygous, complex plant genomes.

Keywords: allele-specific expression; autopolyploid; chromosomal rearrangement; de novo phasing; haplotype-resolved assembly; haplotype-specific sequence insertion; plant genome.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alleles
  • Genome, Plant* / genetics
  • Genomics*
  • Haplotypes / genetics
  • Plants / genetics
  • Sequence Analysis, DNA / methods