Variability in genes related to SARS-CoV-2 entry into host cells (ACE2, TMPRSS2, TMPRSS11A, ELANE, and CTSL) and its potential use in association studies

Life Sci. 2020 Nov 1:260:118313. doi: 10.1016/j.lfs.2020.118313. Epub 2020 Aug 21.

Abstract

Background: The prevalence and mortality of the outbreak of the COVID-19 pandemic show marked geographic variation. The presence of several subtypes of the coronavirus and the genetic differences in the populations could condition that variation. Thus, the objective of this study was to propose variants in genes that encode proteins related to the SARS-CoV-2 entry into the host cells as possible targets for genetic associations studies.

Methods: The allelic frequencies of the polymorphisms in the ACE2, TMPRSS2, TMPRSS11A, cathepsin L (CTSL), and elastase (ELANE) genes were obtained in four populations from the American, African, European, and Asian continents reported in the 1000 Genome Project. Moreover, we evaluated the potential biological effect of these variants using different web-based tools.

Results: In the coding sequences of these genes, we detected one probably-damaging polymorphism located in the TMPRSS2 gene (rs12329760) that produces a change of amino acid. Furthermore, forty-eight polymorphisms with possible functional consequences were detected in the non-coding sequences of the following genes: three in ACE2, seventeen in TMPRSS2, ten in TMPRSS11A, twelve in ELANE, and six in CTSL. These polymorphisms produce binding sites for transcription factors and microRNAs. The minor allele frequencies of these polymorphisms vary in each community; indeed, some of them are high in specific populations.

Conclusion: In summary, using data of the 1000 Genome Project and web-based tools, we propose some polymorphisms, which, depending on the population, could be used for genetic association studies.

Keywords: ACE2; COVID19; Cathepsin; Elastase; Polymorphisms; SARS-CoV2; TMPRSS11A; TMPRSS2.

MeSH terms

  • Angiotensin-Converting Enzyme 2
  • Betacoronavirus* / genetics
  • Betacoronavirus* / isolation & purification
  • COVID-19
  • Cathepsin L / genetics*
  • Coronavirus Infections / epidemiology
  • Coronavirus Infections / genetics*
  • Coronavirus Infections / pathology
  • Coronavirus Infections / virology
  • Gene Frequency
  • Genetic Association Studies
  • Humans
  • Leukocyte Elastase / genetics*
  • Linkage Disequilibrium
  • Membrane Proteins / genetics*
  • Pandemics
  • Peptidyl-Dipeptidase A / genetics*
  • Pneumonia, Viral / epidemiology
  • Pneumonia, Viral / genetics*
  • Pneumonia, Viral / pathology
  • Pneumonia, Viral / virology
  • Polymorphism, Genetic*
  • SARS-CoV-2
  • Serine Endopeptidases / genetics*
  • Serine Proteases / genetics*

Substances

  • Membrane Proteins
  • TMPRSS11A protein, human
  • Serine Proteases
  • Peptidyl-Dipeptidase A
  • ACE2 protein, human
  • Angiotensin-Converting Enzyme 2
  • Serine Endopeptidases
  • TMPRSS2 protein, human
  • ELANE protein, human
  • Leukocyte Elastase
  • CTSL protein, human
  • Cathepsin L