The activities of most enzymes and drugs rely on interactions between proteins and small molecules. Accurate predictions of these interactions could massively accelerate pharmaceutical and biotechnological research. Machine learning models designed for this task are currently limited by the lack of information exchange between the protein and the small molecule during the creation of the required numerical representations. Here, we introduce ProSmith, a machine learning framework that employs a multimodal Transformer Network to simultaneously process protein amino acid sequences and small molecule strings in the same input. This approach facilitates the exchange of all relevant information between the two molecule types during the calculation of their numerical representations, allowing the model to account for their structural and functional interactions. Our final model combines gradient boosting predictions based on the resulting multimodal Transformer Network with independent predictions based on separate deep learning representations of the proteins and small molecules. The corresponding predictions outperform all previous models for predicting protein-small molecule interactions across three diverse tasks: predicting Michaelis constants KM; inferring potential substrates for enzymes; and predicting protein-drug affinities. The Python code provided can be used to easily implement and improve machine learning predictions involving arbitrary protein-small molecule interactions.
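To make the core idea concrete, the following minimal sketch illustrates how a protein sequence and a small molecule string can be concatenated into a single joint input, so that a Transformer's attention layers can exchange information between the two molecule types. The tokenization scheme and special symbols ([CLS], [SEP]) are illustrative assumptions, not the exact implementation used by ProSmith.

```python
def build_multimodal_input(protein_seq: str, smiles: str) -> list[str]:
    """Concatenate per-residue protein tokens and per-character small
    molecule (SMILES) tokens into one joint Transformer input sequence.

    Processing both modalities in the same input lets self-attention relate
    any amino acid position to any atom/bond symbol of the small molecule.
    """
    protein_tokens = list(protein_seq)   # one token per amino acid
    smiles_tokens = list(smiles)         # one token per SMILES character
    return ["[CLS]"] + protein_tokens + ["[SEP]"] + smiles_tokens + ["[SEP]"]


# Example: a short (hypothetical) protein fragment paired with ethanol ("CCO")
tokens = build_multimodal_input("MKTAYIAK", "CCO")
```

A real implementation would map these tokens to embeddings and feed them through Transformer layers; the sketch only shows the joint-input construction that enables cross-modal information exchange.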