Sequence and structure conservation analysis of the key coronavirus proteins supports the feasibility of discovering broad-spectrum antiviral medications.

Melo-Filho, Cleber C.; Bobrowski, Tesia; Martin, Holli-Joi; Sessions, Zoe; Popov, Konstantin; Moorman, Nathaniel J.; Baric, Ralph S.; Muratov, Eugene; Tropsha, Alexander

doi:10.26434/chemrxiv-2022-zg88d

Cited by 1 publication

(1 citation statement)

References 76 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…As biological macromolecules, protein is an indispensable component of all cells and tissues, accounting for ∼15.1% of the body weight . Enzyme catalysis, hormone regulation, antibody immunity, and other processes are inseparable from the participation of proteins, and which proteins are involved in these different processes depends on the structure of the proteins . For example, globulin with an approximately spherical shape can specifically identify the depression or fissure site of other compounds, whereas keratin, which consists of polypeptide chains in α-helix or β-folded conformations, plays a protective role in epithelial cells.…”

Section: Introductionmentioning

confidence: 99%

Enhancing Protein Function Prediction Performance by Utilizing AlphaFold-Predicted Protein Structures

Zhang

et al. 2022

J. Chem. Inf. Model.

View full text Add to dashboard Cite

The structure of a protein is of great importance in determining its functionality, and this characteristic can be leveraged to train data-driven prediction models. However, the limited number of available protein structures severely limits the performance of these models. AlphaFold2 and its open-source data set of predicted protein structures have provided a promising solution to this problem, and these predicted structures are expected to benefit the model performance by increasing the number of training samples. In this work, we constructed a new data set that acted as a benchmark and implemented a state-of-the-art structure-based approach for determining whether the performance of the function prediction model can be improved by putting additional AlphaFold-predicted structures into the training set and further compared the performance differences between two models separately trained with real structures only and AlphaFold-predicted structures only. Experimental results indicated that structure-based protein function prediction models could benefit from virtual training data consisting of AlphaFold-predicted structures. First, model performances were improved in all three categories of Gene Ontology terms (GO terms) after adding predicted structures as training samples. Second, the model trained only on AlphaFold-predicted virtual samples achieved comparable performances to the model based on experimentally solved real structures, suggesting that predicted structures were almost equally effective in predicting protein functionality.

show abstract

Section: Introductionmentioning

confidence: 99%

Enhancing Protein Function Prediction Performance by Utilizing AlphaFold-Predicted Protein Structures

Zhang

et al. 2022

J. Chem. Inf. Model.

View full text Add to dashboard Cite

show abstract

Sequence and structure conservation analysis of the key coronavirus proteins supports the feasibility of discovering broad-spectrum antiviral medications.

Cited by 1 publication

References 76 publications

Enhancing Protein Function Prediction Performance by Utilizing AlphaFold-Predicted Protein Structures

Enhancing Protein Function Prediction Performance by Utilizing AlphaFold-Predicted Protein Structures

Contact Info

Product

Resources

About