2019
DOI: 10.1038/s41598-019-44966-x
|View full text |Cite
|
Sign up to set email alerts
|

Modeling in-vivo protein-DNA binding by combining multiple-instance learning with a hybrid deep neural network

Abstract: Modeling in-vivo protein-DNA binding is not only fundamental for further understanding of the regulatory mechanisms, but also a challenging task in computational biology. Deep-learning based methods have succeed in modeling in-vivo protein-DNA binding, but they often (1) follow the fully supervised learning framework and overlook the weakly supervised information of genomic sequences that a bound DNA sequence may has multiple TFBS(s), and, (2) use one-hot encoding … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
21
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 52 publications
(22 citation statements)
references
References 45 publications
0
21
0
Order By: Relevance
“…It uses the noisy method to integrate the prediction values of all instances into the final prediction of the package. The improved method is termed as WSCNNLSTM [89].…”
Section: Application Of Hybrid Neural Networkmentioning
confidence: 99%
See 2 more Smart Citations
“…It uses the noisy method to integrate the prediction values of all instances into the final prediction of the package. The improved method is termed as WSCNNLSTM [89].…”
Section: Application Of Hybrid Neural Networkmentioning
confidence: 99%
“…In recent years, prediction of TFBSs based on sequence data has been achieved in some studies. Among these, the methods to determine the presence of motifs are mainly based on traditional machine learning methods such as EM and support-vector machines [99], [100], and deep learning methods such as CNN and LSTM [24], [89]. Due to the abstract and sequential nature of DNA sequences, prediction method of TFBSs solely based on sequence is not suitable for complex sequence data.…”
Section: Model Establishmentmentioning
confidence: 99%
See 1 more Smart Citation
“…Neural models have been rapidly developed since the MP model (McCulloch & Pitts, 1943) was proposed to mode the structure of human brain. Simulating learning activities for engineering applications is always an important goal in the development of neural models (see Snyder et al, 2016;Han et al, 2016;Zhang, Zhen, & Huang, 2019). Generally, learning activities can be considered as supervised learning and unsupervised learning.…”
Section: Introductionmentioning
confidence: 99%
“…To address this problem, we benchmarked the existing 20 DL tools [9][10][11][12][13][14][15][16][17][18][19][20][21][22][23][24][25][26][27][28] ( Supplementary Table S1) using 690 ENCODE DNA ChIP-Seq datasets (covering 91 TFs in 161 cell lines) and 55 RNA CLIP-Seq datasets (Fig. 1A-B and ( Supplementary Table S2) 28,23 .…”
mentioning
confidence: 99%