2013
DOI: 10.1002/humu.22376
|View full text |Cite
|
Sign up to set email alerts
|

dbNSFP v2.0: A Database of Human Non-synonymous SNVs and Their Functional Predictions and Annotations

Abstract: dbNSFP is a database developed for functional prediction and annotation of all potential non-synonymous single-nucleotide variants (nsSNVs) in the human genome. This database significantly facilitates the process of querying predictions and annotations from different databases/web-servers for large amounts of nsSNVs discovered in exome-sequencing studies. Here we report a recent major update of the database to version 2.0. We have rebuilt the SNV collection based on GENCODE 9 and currently the database include… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
512
0
2

Year Published

2014
2014
2020
2020

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 557 publications
(517 citation statements)
references
References 43 publications
0
512
0
2
Order By: Relevance
“…6,7 Annotations were added using an in-house-built annotation database, according to the UCSC RefGene track, dbSNP137 and the dbNSFP (v2.0). 8 We filtered for genes that contained at least two nonsynonymous variants, variants leading to a premature stop codon or splice variants with allele frequencies of o0.01 in dbSNP137 and absence in our in-house database of~200 exomes. The variants located in these genes were further evaluated for the impact on protein function by SIFT and Polyphen2 as predicted by the dbNSFP database.…”
Section: Methodsmentioning
confidence: 99%
“…6,7 Annotations were added using an in-house-built annotation database, according to the UCSC RefGene track, dbSNP137 and the dbNSFP (v2.0). 8 We filtered for genes that contained at least two nonsynonymous variants, variants leading to a premature stop codon or splice variants with allele frequencies of o0.01 in dbSNP137 and absence in our in-house database of~200 exomes. The variants located in these genes were further evaluated for the impact on protein function by SIFT and Polyphen2 as predicted by the dbNSFP database.…”
Section: Methodsmentioning
confidence: 99%
“…Further characterization of all non-synonymous variants, new and previously described, included interrogation using the full list of functional prediction algorithms in the dbNSFP database (version 2.7, September 12, 2014) (42,43). This analysis revealed that the majority of new variants and of known, rare missense have potential significance for the protein function, defined as potentially damaging, not tolerated or highly conserved in a number of the functional prediction algorithms.…”
Section: Ngs Resequencing and Analysis Of Pooled Genomic Regionsmentioning
confidence: 99%
“…To compare the predictors' performances with the rfPred one, we have used the individual prediction scores (Polyphen2, SIFT, LRT, PhyloP and MutationTaster) coming from dbNSFPv2.0 23 and CADD scores computed via its web-interface 6 . We have then computed the ROC curves and the areas under the curves of the different classifiers thanks to the R package "Hmisc" 24 on the learning dataset and on the validation datasets.…”
Section: Statistical Model Evaluationmentioning
confidence: 99%
“…For some exome positions, one or several pre-calculated scores were missing in dbNSFP 2.0 23 . Because we wanted to be able to use rfPred even if one of the score is missing for a variant position, we have imputed the missing score value by a random forest approach implemented in the R package "yaImpute" 25 , based on a k-NN algorithm.…”
Section: Missing Data Handlingmentioning
confidence: 99%