Pangenomics in crop improvement - from coding structural variations to finding regulatory variants with pangenome graphs

Plant Genome. 2022 Mar;15(1):e20177. doi: 10.1002/tpg2.20177. Epub 2021 Dec 13.

Abstract

Since the first crop pangenome was reported in 2014, advances in high-throughput and cost-effective DNA sequencing technologies have facilitated multiple such studies, including the pangenomes of oilseed rape (Brassica napus L.), soybean [Glycine max (L.) Merr.], rice (Oryza sativa L.), wheat (Triticum aestivum L.), and barley (Hordeum vulgare L.). Compared with single-reference genomes, pangenomes provide a more accurate representation of the genetic variation present in a species. By combining the genomic data of multiple accessions, pangenomes allow for the detection and annotation of complex DNA polymorphisms such as structural variations (SVs), one of the major determinants of genetic diversity within a species. In this review, we summarize the current literature on crop pangenomics, focusing on the application of pangenomes to find candidate SVs involved in traits of agronomic interest. We then highlight the potential of pangenomes in the discovery and functional characterization of noncoding regulatory sequences and their variations. We conclude with a summary and outlook on innovative data structures that represent the complete content of plant pangenomes, including annotations of coding and noncoding elements and outcomes of transcriptomic and epigenomic experiments.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Genome, Plant
  • Genomics
  • Glycine max / genetics
  • Hordeum* / genetics
  • Oryza* / genetics
  • Sequence Analysis, DNA
  • Triticum / genetics