Complete genomes of a multi-generational pedigree to expand studies of genetic and epigenetic inheritance

bioRxiv [Preprint]. 2025 Dec 16:2025.12.14.693655. doi: 10.64898/2025.12.14.693655.

Abstract

Pedigree analysis remains the gold standard for rare disease diagnostics, yet whole genome sequencing studies typically omit critical regions like centromeres, telomeres, and acrocentric chromosome p-arms. Here, we present telomere-to-telomere (T2T) reference genomes for four self-identified African American individuals of admixed ancestry spanning three generations. Our parent-of-origin assigned, chromosome-level assemblies revealed precise meiotic recombination breakpoints in previously inaccessible regions, including recombination events across acrocentric and subtelomeric sequences. Centromeric regions were highly stable, with multi-megabase arrays inherited intact across three generations, while the position of kinetochore assembly sites remained consistent and predominantly associated with the p-arm proximal region. The relative lengths of telomeres on individual chromosomes were maintained across generations. Using a targeted rDNA assembly approach, we reconstructed a complete megabase-scale ribosomal DNA (rDNA) array corresponding to the paternal chromosome 14. This openly available pedigree provides a benchmark dataset for studying recombination and genetic and epigenetic variation across the complete genome.

Keywords: Acrocentric short-arms; African Ancestry; Diploid genome assembly; Ribosomal DNA; Telomere-to-telomere; Telomeres; centromere; satellite DNA.

Publication types

  • Preprint