CADD: predicting the deleteriousness of variants throughout the human genome
- PMID: 30371827
- PMCID: PMC6323892
- DOI: 10.1093/nar/gky1016
CADD: predicting the deleteriousness of variants throughout the human genome
Abstract
Combined Annotation-Dependent Depletion (CADD) is a widely used measure of variant deleteriousness that can effectively prioritize causal variants in genetic analyses, particularly highly penetrant contributors to severe Mendelian disorders. CADD is an integrative annotation built from more than 60 genomic features, and can score human single nucleotide variants and short insertion and deletions anywhere in the reference assembly. CADD uses a machine learning model trained on a binary distinction between simulated de novo variants and variants that have arisen and become fixed in human populations since the split between humans and chimpanzees; the former are free of selective pressure and may thus include both neutral and deleterious alleles, while the latter are overwhelmingly neutral (or, at most, weakly deleterious) by virtue of having survived millions of years of purifying selection. Here we review the latest updates to CADD, including the most recent version, 1.4, which supports the human genome build GRCh38. We also present updates to our website that include simplified variant lookup, extended documentation, an Application Program Interface and improved mechanisms for integrating CADD scores into other tools or applications. CADD scores, software and documentation are available at https://cadd.gs.washington.edu.
Figures
Similar articles
-
CADD v1.7: using protein language models, regulatory CNNs and other nucleotide-level scores to improve genome-wide variant predictions.Nucleic Acids Res. 2024 Jan 5;52(D1):D1143-D1154. doi: 10.1093/nar/gkad989. Nucleic Acids Res. 2024. PMID: 38183205 Free PMC article.
-
A general framework for estimating the relative pathogenicity of human genetic variants.Nat Genet. 2014 Mar;46(3):310-5. doi: 10.1038/ng.2892. Epub 2014 Feb 2. Nat Genet. 2014. PMID: 24487276 Free PMC article.
-
Predicting variant deleteriousness in non-human species: applying the CADD approach in mouse.BMC Bioinformatics. 2018 Oct 12;19(1):373. doi: 10.1186/s12859-018-2337-5. BMC Bioinformatics. 2018. PMID: 30314430 Free PMC article.
-
CADD-Splice-improving genome-wide variant effect prediction using deep learning-derived splice scores.Genome Med. 2021 Feb 22;13(1):31. doi: 10.1186/s13073-021-00835-9. Genome Med. 2021. PMID: 33618777 Free PMC article.
-
Calcium Apatite Deposition Disease: Diagnosis and Treatment.Radiol Res Pract. 2016;2016:4801474. doi: 10.1155/2016/4801474. Epub 2016 Nov 30. Radiol Res Pract. 2016. PMID: 28042481 Free PMC article. Review.
Cited by
-
SLC22A4 Gene in Hereditary Non-syndromic Hearing Loss: Recurrence and Incomplete Penetrance of the p.C113Y Mutation in Northwest Africa.Front Genet. 2021 Feb 10;12:606630. doi: 10.3389/fgene.2021.606630. eCollection 2021. Front Genet. 2021. PMID: 33643381 Free PMC article.
-
Noncoding mutation in RPGRIP1 contributes to inherited retinal degenerations.Mol Vis. 2021 Mar 18;27:95-106. eCollection 2021. Mol Vis. 2021. PMID: 33907365 Free PMC article.
-
driveR: a novel method for prioritizing cancer driver genes using somatic genomics data.BMC Bioinformatics. 2021 May 24;22(1):263. doi: 10.1186/s12859-021-04203-7. BMC Bioinformatics. 2021. PMID: 34030627 Free PMC article.
-
Genetic variability shapes the alternative pathway complement activity and predisposition to complement-related diseases.Immunol Rev. 2023 Jan;313(1):71-90. doi: 10.1111/imr.13131. Epub 2022 Sep 11. Immunol Rev. 2023. PMID: 36089777 Free PMC article. Review.
-
A scalable Bayesian functional GWAS method accounting for multivariate quantitative functional annotations with applications for studying Alzheimer disease.HGG Adv. 2022 Sep 17;3(4):100143. doi: 10.1016/j.xhgg.2022.100143. eCollection 2022 Oct 13. HGG Adv. 2022. PMID: 36204489 Free PMC article.
References
-
- Shendure J., Balasubramanian S., Church G.M., Gilbert W., Rogers J., Schloss J.A., Waterston R.H.. DNA sequencing at 40: past, present and future. Nature. 2017; 550:345–353. - PubMed
-
- Cooper G.M., Shendure J.. Needles in stacks of needles: finding disease-causal variants in a wealth of genomic data. Nat. Rev. Genet. 2011; 12:628–640. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
