Recommender systems (RSs), which underwent rapid development and had an enormous impact on e-commerce, have the potential to become useful tools for drug discovery. In this paper, we applied RS methods for the prediction of the antiviral activity class (active/inactive) for compounds extracted from ChEMBL. Two main RS approaches were applied: collaborative filtering (Surprise implementation) and content-based filtering (sparse-group inductive matrix completion (SGIMC) method). The effectiveness of RS approaches was investigated for prediction of antiviral activity classes (“interactions”) for compounds and viruses, for which some of their interactions with other viruses or compounds are known, and for prediction of interaction profiles for new compounds. Both approaches achieved relatively good prediction quality for binary classification of individual interactions and compound profiles, as quantified by cross-validation and external validation receiver operating characteristic (ROC) score >0.9. Thus, even simple recommender systems may serve as an effective tool in antiviral drug discovery.
Most of the common molecular descriptors have numerous different implementations. This can influence the results of compound prioritization based on the multiparameter assessment (MPA) approach that allows a medicinal chemist to simultaneously analyze and achieve the desired balance of the diverse and often conflicting molecular and pharmacological properties. In this study, we analyzed the feasibility of using different implementations of common descriptors (logP, logS, TPSA, logBB, hERG, nHBA) interchangeably in predesigned sets of requirements in the course of multiparameter compound optimization. The influence of methods of descriptor calculation, continuity or discreteness of their values, their applicability domains, as well as of the nature of desirability functions in an MPA profile were examined in terms of the stability of MPA compound ranking. It was shown that the interchangeable use of different methods of descriptor calculation is reliably acceptable only for continuously distributed parameters transformed by a smooth desirability function. If a descriptor in an MPA scheme is discretely distributed, only the implementation that was used for building the scoring profile may be used for assessment. An inconsistency of assessment due to different applicability domains of descriptors was also demonstrated.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.