A clustering method based on finding the largest set of disconnected fragments that two chemical compounds have in common is shown to group structures in a way that is well suited to medicinal chemistry programs. We describe how markedly improved results can be obtained by using a similarity metric that accounts not just for the size of the shared fragments but also for their relative arrangement in the two parent compounds. A physicochemical atom-typing scheme is also shown to contribute significantly. Results from calculations on a test set of actives from nine important biological target proteins demonstrate the strengths of our clustering method and its advantages over other approaches that are widely used throughout the pharmaceutical industry.
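To make the idea of a metric that weighs both fragment size and fragment arrangement concrete, the following is a minimal sketch and not the metric used in this work. It assumes the shared fragments have already been found, and that the topological (bond-count) distance between each pair of fragments is known in both parent compounds. The function name, the Tanimoto-style size term, the `alpha` weight, and the penalty form are all hypothetical choices made for illustration.

```python
from itertools import combinations


def arrangement_aware_similarity(n_atoms_a, n_atoms_b, fragment_sizes,
                                 dist_a, dist_b, alpha=0.5):
    """Illustrative score only; the functional form is an assumption.

    n_atoms_a, n_atoms_b: heavy-atom counts of the two parent compounds.
    fragment_sizes: atom counts of the shared disconnected fragments.
    dist_a, dist_b: dicts mapping a fragment index pair (i, j), i < j,
        to the bond distance between those fragments in each parent.
    alpha: hypothetical weight on the arrangement penalty.
    """
    n_common = sum(fragment_sizes)
    if n_common == 0:
        return 0.0

    # Size term: Tanimoto-style ratio of shared atoms to the atom union.
    size_term = n_common / (n_atoms_a + n_atoms_b - n_common)

    pairs = list(combinations(range(len(fragment_sizes)), 2))
    if not pairs:
        # A single shared fragment has no relative arrangement to compare.
        return size_term

    # Arrangement term: mean change in inter-fragment distance between parents.
    mean_shift = sum(abs(dist_a[p] - dist_b[p]) for p in pairs) / len(pairs)
    return size_term / (1.0 + alpha * mean_shift)


# Two shared fragments (10 and 6 atoms) in parents of 20 and 22 atoms.
same = arrangement_aware_similarity(20, 22, [10, 6], {(0, 1): 2}, {(0, 1): 2})
moved = arrangement_aware_similarity(20, 22, [10, 6], {(0, 1): 2}, {(0, 1): 6})
print(same > moved)
```

In this sketch, two pairs of compounds that share the same fragments score differently when the fragments sit at different separations in the parents, which is the behaviour that a size-only measure cannot capture.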