Prior to clinical development, a comprehensive pharmacokinetic characterization of a novel drug is required to understand its exposure at the site of action and elimination. Accordingly, in vitro assays and animal pharmacokinetic studies are regularly employed to predict drug exposure in humans, which is often costly and time-consuming. For this reason, the prediction of human pharmacokinetics at the point of design would be of high value for drug discovery. Therefore, we have established a comprehensive data curation protocol that enables machine learning evaluation of 12 human in vivo pharmacokinetic parameters using only chemical structure information and available doses for 1001 unique compounds. These machine learning models were thoroughly investigated and validated using both an independent hold-out test set and AstraZeneca clinical data. In addition, the availability of preclinical predictions for a subset of internal clinical candidates allowed us to compare our in silico approach with state-of-the-art pharmacokinetic predictions. Based on this evaluation, three fit-for-purpose models for AUC PO (R test 2 = 0.63; RMSEtest = 0.76), C max PO (R test 2 = 0.68; RMSEtest = 0.62), and Vdss IV (R test 2 = 0.47; RMSEtest = 0.50) were identified. Based on the findings, our machine learning models have considerable potential for practical applications in drug discovery, such as influencing decision-making in drug discovery projects and progression of drug candidates toward the clinic.
Animal pharmacokinetic (PK) data as well as human and animal in vitro systems are utilized in drug discovery to define the rate and route of drug elimination. Accurate prediction and mechanistic understanding of drug clearance and disposition in animals provide a degree of confidence for extrapolation to humans. In addition, prediction of in vivo properties can be used to improve design during drug discovery, help select compounds with better properties, and reduce the number of in vivo experiments. In this study, we generated machine learning models able to predict rat in vivo PK parameters and concentration–time PK profiles based on the molecular chemical structure and either measured or predicted in vitro parameters. The models were trained on internal in vivo rat PK data for over 3000 diverse compounds from multiple projects and therapeutic areas, and the predicted endpoints include clearance and oral bioavailability. We compared the performance of various traditional machine learning algorithms and deep learning approaches, including graph convolutional neural networks. The best models for PK parameters achieved R 2 = 0.63 [root mean squared error (RMSE) = 0.26] for clearance and R 2 = 0.55 (RMSE = 0.46) for bioavailability. The models provide a fast and cost-efficient way to guide the design of molecules with optimal PK profiles, to enable the prediction of virtual compounds at the point of design, and to drive prioritization of compounds for in vivo assays.
PROteolysis TArgeting Chimeras (PROTACs) use the ubiquitin–proteasome system to degrade a protein of interest for therapeutic benefit. Advances made in targeted protein degradation technology have been remarkable, with several molecules having moved into clinical studies. However, robust routes to assess and better understand the safety risks of PROTACs need to be identified, which is an essential step toward delivering efficacious and safe compounds to patients. In this work, we used Cell Painting, an unbiased high-content imaging method, to identify phenotypic signatures of PROTACs. Chemical clustering and model prediction allowed the identification of a mitotoxicity signature that could not be expected by screening the individual PROTAC components. The data highlighted the benefit of unbiased phenotypic methods for identifying toxic signatures and the potential to impact drug design.
Tetracyclines are broad-spectrum antibiotics used to prevent or treat a variety of bacterial infections. Resistance is often mediated through mobile resistance genes, which encode one of the three main mechanisms: active efflux, ribosomal target protection or enzymatic degradation. In the last few decades, a large number of new tetracycline-resistance genes have been discovered in clinical settings. These genes are hypothesized to originate from environmental and commensal bacteria, but the diversity of tetracycline-resistance determinants that have not yet been mobilized into pathogens is unknown. In this study, we aimed to characterize the potential tetracycline resistome by screening genomic and metagenomic data for novel resistance genes. By using probabilistic models, we predicted 1254 unique putative tetracycline resistance genes, representing 195 gene families (<70 % amino acid sequence identity), whereof 164 families had not been described previously. Out of 17 predicted genes selected for experimental verification, 7 induced a resistance phenotype in an Escherichia coli host. Several of the predicted genes were located on mobile genetic elements or in regions that indicated mobility, suggesting that they easily can be shared between bacteria. Furthermore, phylogenetic analysis indicated several events of horizontal gene transfer between bacterial phyla. Our results also suggested that acquired efflux pumps originate from proteobacterial species, while ribosomal protection genes have been mobilized from Firmicutes and Actinobacteria . This study significantly expands the knowledge of known and putatively novel tetracycline resistance genes, their mobility and evolutionary history. The study also provides insights into the unknown resistome and genes that may be encountered in clinical settings in the future.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.