Protein Residue Contact Prediction Based on Deep Learning and Massive Statistical Features from Multi-Sequence Alignment

Zhang, Huiling; Hao, Min; Wu, Hao; Ting, Hing-Fung; Yi, Tang; Xi, Wenhui; Wei, Yanjie

doi:10.26599/tst.2021.9010064

Cited by 8 publications

(6 citation statements)

References 50 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The first feature set contains 526 feature channels: one-hot-encoder of the target sequence (1D features, 20*2 channels); position-specific frequency matrix (1D features, 21*2 channels, considering gap) and positional entropy ( Yang et al, 2020 ) (1D features, 1*2 channels); and coupling features ( Yang et al, 2020 ) (2D features, 441 channels) derived from the inverse of the shrunk covariance matrix of MSA. The second feature set contains 151 feature channels: one-hot-encoder of the target sequence (1D features, 20*2 channels), position-specific scoring matrix ( Altschul et al, 1997 ) (1D features; 20*2 channels; not considering gap), HMM profile ( Remmert et al, 2012 ) (1D features, 30*2 channels), secondary structure from SPOT-1D (Hanson et al, 2019) (1D features, 3*2 channels), solvent accessible surface area from SPOT-1D ( Hanson et al, 2019 ) (1D features, 1*2 channels), CCMPRED score (Seemayer et al, 2014) (2D features, 1 channel), mutual information ( Zhang et al, 2022 ) (2D feature, 1 channel), and statistical pair-wise contact potential ( Betancourt and Thirumalai, 1999 ) (2D feature, 1 channel). The first feature set, indicated as FeatSet1, is mainly composed of 2D direct coupling features (441 out of 526 total features) from the MSA, while the second feature set, indicated as FeatSet2, is mainly composed of 1D sequence-based features (148 out of 151 total features).…”

Section: Methodsmentioning

confidence: 99%

Inter-Residue Distance Prediction From Duet Deep Learning Models

et al. 2022

Self Cite

View full text Add to dashboard Cite

Residue distance prediction from the sequence is critical for many biological applications such as protein structure reconstruction, protein–protein interaction prediction, and protein design. However, prediction of fine-grained distances between residues with long sequence separations still remains challenging. In this study, we propose DuetDis, a method based on duet feature sets and deep residual network with squeeze-and-excitation (SE), for protein inter-residue distance prediction. DuetDis embraces the ability to learn and fuse features directly or indirectly extracted from the whole-genome/metagenomic databases and, therefore, minimize the information loss through ensembling models trained on different feature sets. We evaluate DuetDis and 11 widely used peer methods on a large-scale test set (610 proteins chains). The experimental results suggest that 1) prediction results from different feature sets show obvious differences; 2) ensembling different feature sets can improve the prediction performance; 3) high-quality multiple sequence alignment (MSA) used for both training and testing can greatly improve the prediction performance; and 4) DuetDis is more accurate than peer methods for the overall prediction, more reliable in terms of model prediction score, and more robust against shallow multiple sequence alignment (MSA).

show abstract

Section: Methodsmentioning

confidence: 99%

Inter-Residue Distance Prediction From Duet Deep Learning Models

et al. 2022

Self Cite

View full text Add to dashboard Cite

show abstract

“…And the robustness of the proposed algorithm to different disturbances acting on the ship is proved by simulation studies, and the obtained performance is comparable to the state-of-the-art methods based on template matching [ 1 ]. Another team of scholars has developed a protein residue contact prediction system based on deep learning and massive statistical features of multiple sequence alignments [ 2 ]. Ojugo created a predictive and intelligent decision support model for the diabetes pandemic using deep reinforcement learning algorithms [ 3 ].…”

Section: Related Workmentioning

confidence: 99%

Deep Q networks-based optimization of emergency resource scheduling for urban public health events

Zhao

Wang

2022

Neural Comput & Applic

View full text Add to dashboard Cite

In today's severe situation of the global new crown virus raging, there are still efficiency problems in emergency resource scheduling, and there are still deficiencies in rescue standards. For the happiness and well-being of people's lives, adhering to the principle of a community with a shared future for mankind, the emergency resource scheduling system for urban public health emergencies needs to be improved and perfected. This paper mainly studies the optimization model of urban emergency resource scheduling, which uses the deep reinforcement learning algorithm to build the emergency resource distribution system framework, and uses the Deep Q Network path planning algorithm to optimize the system, to achieve the purpose of optimizing and upgrading the efficient scheduling of emergency resources in the city. Finally, through simulation experiments, it is concluded that the deep learning algorithm studied is helpful to the emergency resource scheduling optimization system. However, with the gradual development of deep learning, some of its disadvantages are becoming increasingly obvious. An obvious flaw is that building a deep learning-based model generally requires a lot of CPU computing resources, making the cost too high.

show abstract

“…From the birth of machine learning to the present, according to the hierarchical structure of the model, its development process has gone through two stages: shallow learning and deep learning. In general, these models are considered nonlinear, only nonlinear transformers [12,13]. Deep learning is a model of a deep neural network with many layers of mystery.…”

Section: Deep Learningmentioning

confidence: 99%

Feature Extraction and Classification of Music Content Based on Deep Learning

Shi¹,

2022

Advances in Multimedia

View full text Add to dashboard Cite

To study the use of in-depth training in extracting and classifying the content of music samples, the work offers an algorithm for identifying and classifying musical genres based on a deep network of beliefs, enabling it to be used to extract and classify traditional Chinese musical instruments, and using real-world experiments to test its performance after training. The experimental results are as follows: the improved depth confidence network algorithm has the highest accuracy for music recognition and classification, which can reach 75.8%, higher than other traditional methods. The improved depth confidence network identifies and classifies Chinese traditional musical instruments through Softmax layer, and the accuracy is even as high as 99.2%; DBN is combined with Softmax neural network algorithm when only a few labeled samples in the training set are used for network fine-tuning, and the accuracy of the algorithm can still reach more than 90%, which can reduce the workload in the early stage. This study effectively solves the problem of too much workload and low accuracy in the process of music content recognition, classification, and extraction.

show abstract

Protein Residue Contact Prediction Based on Deep Learning and Massive Statistical Features from Multi-Sequence Alignment

Cited by 8 publications

References 50 publications

Inter-Residue Distance Prediction From Duet Deep Learning Models

Inter-Residue Distance Prediction From Duet Deep Learning Models

Deep Q networks-based optimization of emergency resource scheduling for urban public health events

Feature Extraction and Classification of Music Content Based on Deep Learning

Contact Info

Product

Resources

About