2017
DOI: 10.1093/nar/gkx1166
|View full text |Cite
|
Sign up to set email alerts
|

Integrating sequence and gene expression information predicts genome-wide DNA-binding proteins and suggests a cooperative mechanism

Abstract: DNA-binding proteins (DBPs) perform diverse biological functions ranging from transcription to pathogen sensing. Machine learning methods can not only identify DBPs de novo but also provide insights into their DNA-recognition dynamics. However, it remains unclear whether available methods that can accurately predict DNA-binding sites in known DBPs can also identify novel DBPs. Moreover, sequence information is blind to the cellular- and disease-specific contexts of DBP activities, whereas the under-utilized kn… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
17
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5

Relationship

2
3

Authors

Journals

citations
Cited by 10 publications
(17 citation statements)
references
References 66 publications
0
17
0
Order By: Relevance
“…DBPs and their functional annotations in this work are taken from our recent work, where Gentle integration of gene expression and sequence attributes (GIGEASA), a model to integrate gene expression and sequence information was reported . In that article, we studied the integration of gene expression and sequence information for DBP prediction for human , mouse , and Arabidopsis Thaliana .…”
Section: Methodsmentioning
confidence: 99%
See 4 more Smart Citations
“…DBPs and their functional annotations in this work are taken from our recent work, where Gentle integration of gene expression and sequence attributes (GIGEASA), a model to integrate gene expression and sequence information was reported . In that article, we studied the integration of gene expression and sequence information for DBP prediction for human , mouse , and Arabidopsis Thaliana .…”
Section: Methodsmentioning
confidence: 99%
“…In this work, we have used the most lenient of these DBP annotation, that is, based on GO term labels. These GO annotations were also collected from GIGEASA . Despite this lenient annotation, the ratio of the number of positive (binding proteins) vs negative class (nonbinding proteins) ranges from 1:30 to 1:10 under various training cycles (see below).…”
Section: Methodsmentioning
confidence: 99%
See 3 more Smart Citations