A draft human pangenome reference
- PMID: 37165242
- PMCID: PMC10172123
- DOI: 10.1038/s41586-023-05896-x
A draft human pangenome reference
Abstract
Here the Human Pangenome Reference Consortium presents a first draft of the human pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals1. These assemblies cover more than 99% of the expected sequence in each genome and are more than 99% accurate at the structural and base pair levels. Based on alignments of the assemblies, we generate a draft pangenome that captures known variants and haplotypes and reveals new alleles at structurally complex loci. We also add 119 million base pairs of euchromatic polymorphic sequences and 1,115 gene duplications relative to the existing reference GRCh38. Roughly 90 million of the additional base pairs are derived from structural variation. Using our draft pangenome to analyse short-read data reduced small variant discovery errors by 34% and increased the number of structural variants detected per haplotype by 104% compared with GRCh38-based workflows, which enabled the typing of the vast majority of structural variant alleles per sample.
© 2023. The Author(s).
Conflict of interest statement
E.E.E. is a scientific advisory board (SAB) member of Variant Bio. P.F is a member of the SABs of Fabric Genomics and Eagle Genomics. E.E.K. is a member of the SAB of Encompass Biosciences, Foresite Labs and Galateo Bio and has received personal fees from Regeneron Pharmaceuticals, 23&Me and Illumina. A.B., A.C., P.-C.C., D.E.C., G.Baid, A.K., M.N. and K.S. are employees of Google and own Alphabet stock as part of the standard compensation package.
Figures
Comment in
-
Human pangenome supports analysis of complex genomic regions.Nature. 2023 May;617(7960):256-258. doi: 10.1038/d41586-023-01490-3. Nature. 2023. PMID: 37165235 No abstract available.
-
New Genomic Sequencing Resource Could Improve Care.Cancer Discov. 2023 Jul 7;13(7):1506-1507. doi: 10.1158/2159-8290.CD-NB2023-0042. Cancer Discov. 2023. PMID: 37249320
Similar articles
-
A pangenome reference of 36 Chinese populations.Nature. 2023 Jul;619(7968):112-121. doi: 10.1038/s41586-023-06173-7. Epub 2023 Jun 14. Nature. 2023. PMID: 37316654 Free PMC article.
-
Semi-automated assembly of high-quality diploid human reference genomes.Nature. 2022 Nov;611(7936):519-531. doi: 10.1038/s41586-022-05325-5. Epub 2022 Oct 19. Nature. 2022. PMID: 36261518 Free PMC article.
-
Pangenome graph construction from genome alignments with Minigraph-Cactus.Nat Biotechnol. 2024 Apr;42(4):663-673. doi: 10.1038/s41587-023-01793-w. Epub 2023 May 10. Nat Biotechnol. 2024. PMID: 37165083 Free PMC article.
-
The Human Pangenome Project: a global resource to map genomic diversity.Nature. 2022 Apr;604(7906):437-446. doi: 10.1038/s41586-022-04601-8. Epub 2022 Apr 20. Nature. 2022. PMID: 35444317 Free PMC article. Review.
-
Haplotyping-Assisted Diploid Assembly and Variant Detection with Linked Reads.Methods Mol Biol. 2023;2590:161-182. doi: 10.1007/978-1-0716-2819-5_11. Methods Mol Biol. 2023. PMID: 36335499 Review.
Cited by
-
Strategic targeting of Cas9 nickase induces large segmental duplications.Cell Genom. 2024 Aug 14;4(8):100610. doi: 10.1016/j.xgen.2024.100610. Epub 2024 Jul 24. Cell Genom. 2024. PMID: 39053455 Free PMC article.
-
Pig pangenome graph reveals functional features of non-reference sequences.J Anim Sci Biotechnol. 2024 Feb 22;15(1):32. doi: 10.1186/s40104-023-00984-4. J Anim Sci Biotechnol. 2024. PMID: 38389084 Free PMC article.
-
De Novo Genome Assemblies From Two Indigenous Americans from Arizona Identify New Polymorphisms in Non-Reference Sequences.Genome Biol Evol. 2024 Sep 3;16(9):evae188. doi: 10.1093/gbe/evae188. Genome Biol Evol. 2024. PMID: 39190003 Free PMC article.
-
AGAP duplicons associate with structural diversity at Chromosome 10q11.22.Genome Res. 2024 Oct 29;34(10):1487-1499. doi: 10.1101/gr.279454.124. Genome Res. 2024. PMID: 39322278
-
When less is more: sketching with minimizers in genomics.Genome Biol. 2024 Oct 14;25(1):270. doi: 10.1186/s13059-024-03414-4. Genome Biol. 2024. PMID: 39402664 Free PMC article. Review.
References
Publication types
MeSH terms
Grants and funding
- R01 HG006677/HG/NHGRI NIH HHS/United States
- U01 HG010961/HG/NHGRI NIH HHS/United States
- R35 GM130151/GM/NIGMS NIH HHS/United States
- U41 HG010972/HG/NHGRI NIH HHS/United States
- U01 HG010973/HG/NHGRI NIH HHS/United States
- R01 HG011649/HG/NHGRI NIH HHS/United States
- U01 HG010963/HG/NHGRI NIH HHS/United States
- R01 HG011274/HG/NHGRI NIH HHS/United States
- U24 HG009081/HG/NHGRI NIH HHS/United States
- U24 HG010262/HG/NHGRI NIH HHS/United States
- U01 HG010971/HG/NHGRI NIH HHS/United States
- U24 HG011853/HG/NHGRI NIH HHS/United States
- R01 HG010485/HG/NHGRI NIH HHS/United States
- U24 HG007497/HG/NHGRI NIH HHS/United States
- R01 HG002385/HG/NHGRI NIH HHS/United States
- ZIA HG200398/ImNIH/Intramural NIH HHS/United States
- R01 HG010169/HG/NHGRI NIH HHS/United States
- U41 HG007234/HG/NHGRI NIH HHS/United States
- OT2 OD033761/OD/NIH HHS/United States
- R01 GM123489/GM/NIGMS NIH HHS/United States
- WT_/Wellcome Trust/United Kingdom
LinkOut - more resources
Full Text Sources
Other Literature Sources
