2015
DOI: 10.1007/978-3-319-16706-0_21
|View full text |Cite
|
Sign up to set email alerts
|

Protein Contact Prediction by Integrating Joint Evolutionary Coupling Analysis and Supervised Learning

Abstract: Motivation: Protein contact prediction is important for protein structure and functional study. Both evolutionary coupling (EC) analysis and supervised machine learning methods have been developed, making use of different information sources. However, contact prediction is still challenging especially for proteins without a large number of sequence homologs. Results: This article presents a group graphical lasso (GGL) method for contact prediction that integrates joint multi-family EC analysis and supervised l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
84
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
6

Relationship

2
4

Authors

Journals

citations
Cited by 54 publications
(84 citation statements)
references
References 45 publications
0
84
0
Order By: Relevance
“…We evaluate the accuracy of the top L/k ( k = 10, 5, 2, 1) predicted contacts where L is protein sequence length [10]. We define that a contact is short-, medium- and long-range when the sequence distance of the two residues in a contact falls into [6, 11], [12, 23], and ≥24, respectively.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…We evaluate the accuracy of the top L/k ( k = 10, 5, 2, 1) predicted contacts where L is protein sequence length [10]. We define that a contact is short-, medium- and long-range when the sequence distance of the two residues in a contact falls into [6, 11], [12, 23], and ≥24, respectively.…”
Section: Resultsmentioning
confidence: 99%
“…Therefore, contact prediction and contact-assisted protein folding has recently gained much attention in the community. However, for many proteins especially those without many sequence homologs, the predicted contacts by the state-of-the-art predictors such as CCMpred [4], PSICOV [5], Evfold [6], plmDCA[7], Gremlin[8], MetaPSICOV [9] and CoinDCA [10] are still of low quality and insufficient for accurate contact-assisted protein folding [11, 12]. This motivates us to develop a better contact prediction method, especially for proteins without a large number of sequence homologs.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…However, soluble proteins with that many sequence homologs likely have similar solved structures in protein database (PDB) and thus, may be modeled by template‐based methods. For proteins with few sequence homologs, pure coevolution methods such as CCMpred, PSICOV, Evfold, Gremlin, and CoinDCA do not fare well and their predictions are not very helpful to ab initio folding. Supervised learning such as PhyCMAP, DNCON, and SVMSEQ predicts contacts using a variety of protein features, on average outperforming pure coevolution methods on proteins with few sequence homologs.…”
Section: Introductionmentioning
confidence: 99%