dbNSFP: a lightweight database of human nonsynonymous SNPs and their functional predictions
- PMID: 21520341
- PMCID: PMC3145015
- DOI: 10.1002/humu.21517
Abstract
With advances in sequencing technology, whole-exome sequencing is increasingly used to identify mutations that cause human diseases, especially rare Mendelian diseases. Among the analysis steps, functional prediction (of whether a variant is deleterious) plays an important role in filtering or prioritizing nonsynonymous SNPs (NSs) for further analysis. Unfortunately, different prediction algorithms use different information, and each has its own strengths and weaknesses. It has been suggested that investigators should use predictions from multiple algorithms instead of relying on a single one. However, querying predictions from different databases and Web servers for different algorithms is both tedious and time-consuming, especially when dealing with the huge number of NSs identified by exome sequencing. To facilitate this process, we developed dbNSFP (database for nonsynonymous SNPs' functional predictions). It compiles prediction scores from four new and popular algorithms (SIFT, Polyphen2, LRT, and MutationTaster), along with a conservation score (PhyloP) and other related information, for every potential NS in the human genome (a total of 75,931,005). It is the first integrated database of functional predictions from multiple algorithms for a comprehensive collection of human NSs. dbNSFP is freely available for download at http://sites.google.com/site/jpopgen/dbNSFP.
© 2011 Wiley-Liss, Inc.
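As a rough illustration of why a single pre-compiled table helps, the sketch below looks up a variant once and tallies how many algorithms score it above a cutoff. Everything specific here is an assumption made for the example: the column names, the score values, the "NA" missing-value marker, the convention that a higher score means more likely deleterious, and the 0.5 cutoff are hypothetical and do not describe the actual dbNSFP file layout or any algorithm's recommended threshold.

```python
import csv
import io

# Hypothetical excerpt: column names, scores, and the "NA" marker are
# illustrative assumptions, not the real dbNSFP file layout.
SAMPLE_TSV = (
    "chr\tpos\tref\talt\tSIFT\tPolyphen2\tLRT\tMutationTaster\n"
    "1\t1000\tA\tG\t0.97\t0.88\t0.99\t0.95\n"
    "1\t1000\tA\tT\t0.10\tNA\t0.20\t0.05\n"
    "2\t2500\tC\tT\t0.60\t0.40\t0.55\tNA\n"
)

ALGORITHMS = ("SIFT", "Polyphen2", "LRT", "MutationTaster")
CUTOFF = 0.5  # assumed: scores scaled so that higher means more likely deleterious


def load_index(text):
    """Index rows by (chr, pos, ref, alt) so each variant is a single lookup."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t")
    return {(r["chr"], int(r["pos"]), r["ref"], r["alt"]): r for r in reader}


def tally(row):
    """Return (algorithms at or above the cutoff, algorithms with a score)."""
    scores = [float(row[a]) for a in ALGORITHMS if row[a] != "NA"]
    return sum(s >= CUTOFF for s in scores), len(scores)


index = load_index(SAMPLE_TSV)
for key in [("1", 1000, "A", "G"), ("1", 1000, "A", "T"), ("2", 2500, "C", "T")]:
    votes, available = tally(index[key])
    print(key, f"{votes}/{available} algorithms at or above cutoff")
```

Reporting the tally alongside the number of available scores keeps missing predictions visible instead of silently counting them as benign.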
Similar articles
- dbNSFP v2.0: a database of human non-synonymous SNVs and their functional predictions and annotations. Hum Mutat. 2013 Sep;34(9):E2393-402. doi: 10.1002/humu.22376. Epub 2013 Jul 10. PMID: 23843252. Free PMC article.
- dbNSFP v3.0: A One-Stop Database of Functional Predictions and Annotations for Human Nonsynonymous and Splice-Site SNVs. Hum Mutat. 2016 Mar;37(3):235-41. doi: 10.1002/humu.22932. Epub 2016 Jan 5. PMID: 26555599. Free PMC article.
- dbNSFP v4: a comprehensive database of transcript-specific functional predictions and annotations for human nonsynonymous and splice-site SNVs. Genome Med. 2020 Dec 2;12(1):103. doi: 10.1186/s13073-020-00803-9. PMID: 33261662. Free PMC article.
- Congruency in the prediction of pathogenic missense mutations: state-of-the-art web-based tools. Brief Bioinform. 2013 Jul;14(4):448-59. doi: 10.1093/bib/bbt013. Epub 2013 Mar 15. PMID: 23505257. Review.
- A SNP-centric database for the investigation of the human genome. BMC Bioinformatics. 2004 Mar 26;5:33. doi: 10.1186/1471-2105-5-33. PMID: 15046636. Free PMC article. Review.
Cited by
- Explicable prioritization of genetic variants by integration of rule-based and machine learning algorithms for diagnosis of rare Mendelian disorders. Hum Genomics. 2024 Mar 21;18(1):28. doi: 10.1186/s40246-024-00595-8. PMID: 38509596. Free PMC article.
- Association between missense variants of uncertain significance in the CHEK2 gene and hereditary breast cancer: a cosegregation and bioinformatics analysis. Front Genet. 2024 Feb 27;14:1274108. doi: 10.3389/fgene.2023.1274108. eCollection 2023. PMID: 38476463. Free PMC article.
- CAGI, the Critical Assessment of Genome Interpretation, establishes progress and prospects for computational genetic variant interpretation methods. Genome Biol. 2024 Feb 22;25(1):53. doi: 10.1186/s13059-023-03113-6. PMID: 38389099. Free PMC article.
- Large language models assisted multi-effect variants mining on cerebral cavernous malformation familial whole genome sequencing. Comput Struct Biotechnol J. 2024 Feb 1;23:843-858. doi: 10.1016/j.csbj.2024.01.014. eCollection 2024 Dec. PMID: 38352937. Free PMC article.
- SIGMA leverages protein structural information to predict the pathogenicity of missense variants. Cell Rep Methods. 2024 Jan 22;4(1):100687. doi: 10.1016/j.crmeth.2023.100687. Epub 2024 Jan 10. PMID: 38211594. Free PMC article.
