Decoding a highly mixed Kazakh genome

Hum Genet. 2020 May;139(5):557-568. doi: 10.1007/s00439-020-02132-8. Epub 2020 Feb 19.

Abstract

We provide a Kazakh whole genome sequence (MJS) and analyses with the largest comparative Kazakh genomic data available to date. We found 102,240 novel SNVs and a high level of heterozygosity. ADMIXTURE analysis confirmed that a significant proportion of the variation in this individual derives from all continents except Africa and Oceania. A principal component analysis showed that the neighboring Kalmyk, Uzbek, and Kyrgyz populations bear the strongest resemblance to the MJS genome, which reflects fairly recent Kazakh history. MJS's mitochondrial haplogroup, J1c2, probably represents an early European and Near Eastern influence on Central Asia. This was also supported by the heterozygous SNPs associated with European phenotypic features and by the strikingly similar Kazakh ancestral composition inferred by ADMIXTURE. Admixture (f3) analysis showed that MJS's genomic signature is best described as a cross between the Neolithic East Asian (Devil's Gate1) and the Bronze Age European (Halberstadt_LBA1) components rather than a contemporary admixture.
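
For readers unfamiliar with the admixture (f3) test mentioned above, the following is a minimal sketch of the basic statistic, not the authors' pipeline. It assumes per-SNP allele frequencies are already available for a target and two candidate source populations. The function name `f3_statistic` and the toy frequencies are hypothetical. The sketch uses the naive estimator, the mean of (c - a)(c - b) over SNPs, and omits the finite-sample correction and block-jackknife standard errors that published tools apply. A clearly negative value is conventionally read as evidence that the target is admixed between populations related to the two sources.

```python
from typing import Sequence


def f3_statistic(
    target: Sequence[float],
    source_a: Sequence[float],
    source_b: Sequence[float],
) -> float:
    """Naive f3(target; source_a, source_b) from per-SNP allele frequencies.

    Each argument holds the frequency of the same reference allele at the
    same SNPs, in the same order. No finite-sample bias correction is applied.
    """
    if not (len(target) == len(source_a) == len(source_b)):
        raise ValueError("all frequency vectors must have the same length")
    if len(target) == 0:
        raise ValueError("at least one SNP is required")

    # Average the product of the target's frequency differences
    # from each candidate source across all SNPs.
    total = sum(
        (c - a) * (c - b) for c, a, b in zip(target, source_a, source_b)
    )
    return total / len(target)


# Toy data (hypothetical): the target sits midway between the two sources,
# as expected for an equal mixture, so the statistic comes out negative.
freq_a = [0.1, 0.9, 0.2, 0.8]
freq_b = [0.9, 0.1, 0.6, 0.4]
freq_mix = [(a + b) / 2 for a, b in zip(freq_a, freq_b)]

f3_mix = f3_statistic(freq_mix, freq_a, freq_b)
print(f"f3(mix; A, B) = {f3_mix:.3f}")
```

With a single genome as the target, as in this study, the per-SNP frequencies can only be 0, 0.5, or 1, so the uncorrected estimate is noisy and should be treated as illustrative only.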

MeSH terms

  • China
  • DNA, Mitochondrial
  • Ethnicity / genetics*
  • Female
  • Genetics, Population*
  • Genome, Human*
  • Humans
  • Kazakhstan
  • Polymorphism, Single Nucleotide*
  • Principal Component Analysis
  • White People / genetics*

Substances

  • DNA, Mitochondrial