2013
DOI: 10.1038/nbt.2486
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of methods for modeling transcription factor sequence specificity

Abstract: Genomic analyses often involve scanning for potential transcription-factor (TF) binding sites using models of the sequence specificity of DNA binding proteins. Many approaches have been developed to model and learn a protein’s binding specificity, but these methods have not been systematically compared. Here we applied 26 such approaches to in vitro protein binding microarray data for 66 mouse TFs belonging to various families. For 9 TFs, we also scored the resulting motif models on in vivo data, and found tha… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

14
486
1
4

Year Published

2015
2015
2023
2023

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 347 publications
(505 citation statements)
references
References 55 publications
14
486
1
4
Order By: Relevance
“…2). The best method reported in the original evaluation (Team_D, a k-mer-based model) and the best reported in the revised evaluation (FeatureREDUCE, a hybrid PWM/k-mer model) both had reasonable, but not the best, performance on in vivo data, which might be due to overfitting to PBM noise 17 . …”
Section: Training Deepbind and Scoring Sequencesmentioning
confidence: 93%
See 4 more Smart Citations
“…2). The best method reported in the original evaluation (Team_D, a k-mer-based model) and the best reported in the revised evaluation (FeatureREDUCE, a hybrid PWM/k-mer model) both had reasonable, but not the best, performance on in vivo data, which might be due to overfitting to PBM noise 17 . …”
Section: Training Deepbind and Scoring Sequencesmentioning
confidence: 93%
“…Ascertaining DNA sequence specificities To evaluate DeepBind's ability to characterize DNA-binding protein specificity, we used PBM data from the revised DREAM5 TF-DNA Motif Recognition Challenge by Weirauch et al 17 . The PBM data represent 86 different mouse transcription factors, each measured using two independent array designs.…”
Section: Training Deepbind and Scoring Sequencesmentioning
confidence: 99%
See 3 more Smart Citations