Tianfan Fu scite author profile

DeepPurpose: a deep learning library for drug–target interaction prediction

Summary: Accurate prediction of drug–target interactions (DTI) is crucial for drug discovery. Recently, deep learning (DL) models have shown promising performance for DTI prediction. However, these models can be difficult to use for both computer scientists entering the biomedical field and bioinformaticians with limited DL experience. We present DeepPurpose, a comprehensive and easy-to-use DL library for DTI prediction. DeepPurpose supports training of customized DTI prediction models by implementing 15 compound and protein encoders and over 50 neural architectures, along with providing many other useful features. We demonstrate state-of-the-art performance of DeepPurpose on several benchmark datasets.

Availability and implementation: https://github.com/kexinhuang12345/DeepPurpose

Contact: jimeng@illinois.edu

Supplementary information: Supplementary data are available at Bioinformatics online.


Deep feature for text-dependent speaker verification

Liu

Chen

et al. 2015

Speech Communication

153

119


Artificial intelligence foundation for therapeutic science

Huang

Gao

et al. 2022

Nat Chem Biol


Therapeutics Data Commons: Machine Learning Datasets and Tasks for Drug Discovery and Development

Huang

Gao

et al. 2021

Preprint


Machine learning for therapeutics is an emerging field with incredible opportunities for innovation and expansion. Despite the initial success, many key challenges remain open. Here, we introduce Therapeutics Data Commons (TDC), the first unifying framework to systematically access and evaluate machine learning across the entire range of therapeutics. At its core, TDC is a collection of curated datasets and learning tasks that can translate algorithmic innovation into biomedical and clinical implementation. To date, TDC includes machine learning-ready datasets from learning tasks, spanning the discovery and development of safe and effective medicines. TDC also provides an ecosystem of tools, libraries, leaderboards, and community resources, including data functions, strategies for systematic model evaluation, meaningful data splits, data processors, and molecule generation oracles. All datasets and learning tasks are integrated and accessible via an open-source library. We envision that TDC can facilitate algorithmic and scientific advances and accelerate development, validation, and transition into production and clinical implementation. TDC is a continuous, open-source initiative, and we invite contributions from the research community. TDC is publicly available at https://tdcommons.ai.


CORE: Automatic Molecule Optimization Using Copy & Refine Strategy

Xiao

Sun

2020

AAAI


Molecule optimization is about generating a molecule Y with more desirable properties based on an input molecule X. The state-of-the-art approaches partition the molecules into a large set of substructures S and grow the new molecule structure by iteratively predicting which substructure from S to add. However, since the set of available substructures S is large, such an iterative prediction task is often inaccurate, especially for substructures that are infrequent in the training data. To address this challenge, we propose a new generating strategy called “Copy&Refine” (CORE), where at each step the generator first decides whether to copy an existing substructure from input X or to generate a new substructure; the most promising substructure is then added to the new molecule. Combined with scaffolding tree generation and adversarial training, CORE can significantly improve several recent molecule optimization methods on various measures, including drug likeness (QED), dopamine receptor (DRD2) and penalized LogP. We tested CORE and baselines using the ZINC database, and CORE obtained up to 11% and 21% relative improvement over the baselines in success rate on the complete test set and the subset with infrequent substructures, respectively.
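The copy-or-generate decision described in this abstract can be illustrated with a toy sketch. This is not the authors' implementation: the function name `copy_or_generate_step`, the hard 0.5 threshold on the gate, and the lookup-table scores are all assumptions made for illustration, whereas the paper's generator is a learned neural model.

```python
from typing import Callable, Sequence, Tuple


def copy_or_generate_step(
    input_substructures: Sequence[str],
    vocabulary: Sequence[str],
    copy_gate: float,
    score: Callable[[str], float],
) -> Tuple[str, str]:
    """Pick the next substructure to add (hypothetical helper).

    copy_gate stands in for a learned probability of copying from the
    input molecule X; score stands in for a learned substructure scorer.
    """
    if not 0.0 <= copy_gate <= 1.0:
        raise ValueError("copy_gate must lie in [0, 1]")
    # Copy branch: restrict the choice to substructures already present in X.
    if copy_gate >= 0.5 and input_substructures:
        return "copy", max(input_substructures, key=score)
    # Generate branch: choose from the full substructure set S.
    return "generate", max(vocabulary, key=score)


# Toy scores; in a real model these would come from a trained network.
toy_scores = {"C1CCCCC1": 0.2, "c1ccccc1": 0.9, "C=O": 0.5, "N": 0.95}
x_substructures = ["c1ccccc1", "C=O"]
s_vocabulary = list(toy_scores)

print(copy_or_generate_step(x_substructures, s_vocabulary, 0.8, toy_scores.get))
print(copy_or_generate_step(x_substructures, s_vocabulary, 0.2, toy_scores.get))
```

Restricting the copy branch to the substructures of X keeps the candidate set small, which is the intuition the abstract gives for why copying helps with substructures that are infrequent in the training data.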


Scientific discovery in the age of artificial intelligence

Wang

et al. 2023

Nature

204


HINT: Hierarchical interaction network for clinical-trial-outcome predictions

Huang

Xiao

et al. 2022

Patterns


MolGenSurvey: A Systematic Survey in Machine Learning Models for Molecule Design

Du

Fu

Sun

et al. 2022

Preprint



Tianfan Fu

DeepPurpose: a deep learning library for drug–target interaction prediction
