Jennifer C. Ureta scite author profile

With the growing interest in using phages to combat antimicrobial resistance, computational methods for predicting phage-host interactions have been explored to help shortlist candidate phages. Most existing models consider entire proteomes and rely on manual feature engineering, which poses difficulty in selecting the most informative sequence properties to serve as input to the model. In this paper, we framed phage-host interaction prediction as a multiclass classification problem that takes as input the embeddings of a phage’s receptor-binding proteins, which are known to be the key machinery for host recognition, and predicts the host genus. We explored different protein language models to automatically encode these protein sequences into dense embeddings without the need for additional alignment or structural information. We show that the use of embeddings of receptor-binding proteins presents improvements over handcrafted genomic and protein sequence features. The highest performance was obtained using the transformer-based protein language model ProtT5, resulting in a 3% to 4% increase in weighted F1 and recall scores across different prediction confidence thresholds, compared to using selected handcrafted sequence features.

show abstract

CATE: An Embodied Conversational Agent for the Elderly

Bravo

Herrera

Valdez

et al. 2020

View full text Add to dashboard Cite

Protein embeddings improve phage-host interaction prediction

Gonzales¹,

Ureta²,

Shrestha³

2023

Preprint

View full text Add to dashboard Cite

With the growing interest in using phages to combat antimicrobial resistance, computational methods for predicting phage-host interactions have been explored to help shortlist candidate phages. Most existing models consider entire proteomes and rely on manual feature engineering, which poses difficulty in selecting the most informative sequence properties to serve as input to the model. In this paper, we framed phage-host interaction prediction as a multiclass classification problem, which takes as input the embeddings of a phage's receptor-binding proteins, which are known to be the key machinery for host recognition, and predicts the host genus. We explored different protein language models to automatically encode these protein sequences into dense embeddings without the need for additional alignment or structural information. We show that the use of embeddings of receptor-binding proteins presents improvements over handcrafted genomic and protein sequence features. The highest performance was obtained using the transformer-based protein language model ProtT5, resulting in a 3% to 4% increase of weighted F1 scores across different prediction confidence threshold,compared to using selected handcrafted sequence features.

show abstract

Identifying drug-resistant tuberculosis from chest X-ray images using a simple convolutional neural network

Ureta

Shrestha

2021

J. Phys.: Conf. Ser.

View full text Add to dashboard Cite

Tuberculosis(TB) is one of the top 10 causes of death worldwide, and drug-resistant TB is a major public health concern especially in resource-constrained countries. In such countries, molecular diagnosis of drug-resistant TB remains a challenge; and imaging tools such as X-rays, which are cheaply and widely available, can be a valuable supplemental resource for early detection and screening. This study uses a specialized convolutional neural network to perform binary classification of chest X-ray images to classify drug-resistant and drug-sensitive TB. The models were trained and validated using the TBPortals dataset which contains 2,973 labeled X-ray images from TB patients. The classifiers were able to identify the presence or absence of drug-resistant Tuberculosis with an AUROC between 0.66–0.67, which is an improvement over previous attempts using deep learning networks.

show abstract

At Home with Alexa: A Tale of Two Conversational Agents

Ureta

Brito

et al. 2020

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Jennifer C. Ureta

Protein embeddings improve phage-host interaction prediction

CATE: An Embodied Conversational Agent for the Elderly

Protein embeddings improve phage-host interaction prediction

Identifying drug-resistant tuberculosis from chest X-ray images using a simple convolutional neural network

At Home with Alexa: A Tale of Two Conversational Agents

Contact Info

Product

Resources

About