There have been concerted efforts toward cataloging rare and deleterious variants in different world populations using high-throughput genotyping and sequencing-based methods. The Indian population is underrepresented or its information with respect to clinically relevant variants is sparse in public data sets. The aim of this study was to estimate the burden of monogenic disease-causing variants in Indian populations. Toward this, we have assessed the frequency profile of monogenic phenotype-associated ClinVar variants. The study utilized a genotype data set (global screening array, Illumina) from 2795 individuals (multiple in-house genomics cohorts) representing diverse ethnic and geographically distinct Indian populations. Of the analyzed variants from Global Screening Array, ~9% were found to be informative and were either not known earlier or underrepresented in public databases in terms of their frequencies. These variants were linked to disorders, namely inborn errors of metabolism, monogenic diabetes, hereditary cancers, and various other hereditary conditions. We have also shown that our study cohort is genetically a better representative of the Indian population than its representation in the 1000 Genome Project (South Asians). We have created a database, ClinIndb, linked to the Leiden Open Variation Database, to help clinicians and researchers in diagnosis, counseling, and development of appropriate genetic screening tools relevant to the Indian populations and Indians living abroad.
Keywords: ClinVar variants; Indian populations; database; high-throughput genotyping; monogenic diseases.
© 2020 Wiley Periodicals LLC.