2022
DOI: 10.1021/acsbiomaterials.2c00737
|View full text |Cite
|
Sign up to set email alerts
|

CollagenTransformer: End-to-End Transformer Model to Predict Thermal Stability of Collagen Triple Helices Using an NLP Approach

Abstract: Collagen is one of the most important structural proteins in biology, and its structural hierarchy plays a crucial role in many mechanically important biomaterials. Here, we demonstrate how transformer models can be used to predict, directly from the primary amino acid sequence, the thermal stability of collagen triple helices, measured via the melting temperature T m. We report two distinct transformer architectures to compare performance. First, we train a small transformer model from scratch, using our coll… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
18
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 17 publications
(18 citation statements)
references
References 64 publications
(115 reference statements)
0
18
0
Order By: Relevance
“… The development of computational models for the prediction of T m values of collagen triple helices is currently an active area of research. [ 65 , 66 , 67 , 68 , 69 , 70 , 71 ] Several of these models succeeded in capturing the influence of inner residues on triple helix stability. These algorithms, however, do not take frameshifts, terminal capping groups and their charge into account.…”
Section: Discussionmentioning
confidence: 99%
“… The development of computational models for the prediction of T m values of collagen triple helices is currently an active area of research. [ 65 , 66 , 67 , 68 , 69 , 70 , 71 ] Several of these models succeeded in capturing the influence of inner residues on triple helix stability. These algorithms, however, do not take frameshifts, terminal capping groups and their charge into account.…”
Section: Discussionmentioning
confidence: 99%
“…(3) The development of computational models for the prediction of T m values of collagen triple helices is currently an active area of research. [65][66][67][68][69][70][71] Several of these models succeeded in capturing the influence of inner residues on triple helix stability. These algorithms, however, do not take frameshifts, terminal capping groups and their charge into account.…”
Section: Discussionmentioning
confidence: 99%
“…Cross‐comparisons between different works have therefore been virtually impossible. Our set of parameters allows for recalculating reported T m data to a common reference peptide and, thus, enables the comparison of results from diverse sources. The development of computational models for the prediction of T m values of collagen triple helices is currently an active area of research [65–71] . Several of these models succeeded in capturing the influence of inner residues on triple helix stability.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…“Attention” mechanisms, which direct neural networks to focus on the most relevant components of input data based on context rather than sequential order, improve training efficiency and performance in RNNs. Transformer models extend attention mechanisms to “self-attention” by removing sequential recurrent processing steps altogether and processing input tokens in parallel, thus shifting training toward semisupervised and unsupervised methods. , Transformer neural networks are the current state-of-the-art in natural language processing (NLP) tasks and have recently been extended to predict chemical and physical properties of materials. All supervised learning models are generally trained by assigning a loss function based on model performance and modifying trainable parameters through multiple iterations to minimize this loss. Artificial neural networks utilize back-propagation and gradient descent algorithms to optimally modify model parameters in each iteration.…”
Section: Computational Methods For Materiomics and Sustainable Materi...mentioning
confidence: 99%