WISDOM is an international initiative to enable a virtual screening pipeline on a Grid infrastructure. Its first attempt was to deploy large scale in silico docking on a public Grid infrastructure. Protein-ligand docking is about computing the binding energy of a protein target to a library of potential drugs using a scoring algorithm. Previous deployments were either limited to one cluster, to Grids of clusters in the tightly protected environment of a pharmaceutical laboratory or to desktop Grids. The first large scale docking experiment ran on the EGEE Grid production service against targets relevant to research on malaria and saw over 41 million compounds docked for the equivalent of 80 years of CPU time. Up to 1,700 computers were simultaneously used in 15 countries around the world. Issues related to the deployment and the monitoring of the in silico docking experiment as well as experience with Grid operation and services are reported in the paper. The main problem encountered for such a large scale deployment was the Grid infrastructure stability. Although the overall success rate was above 80%, a lot of monitoring and supervision was still required at the application level to resubmit the jobs that failed. But the experiment demonstrated how Grid infrastructures have a tremendous capacity to mobilize very large CPU resources for well targeted goals during a significant period of time. This success leads to a second computing challenge targeting avian flu neuraminidase N1.
Though different species of the genus Plasmodium may be responsible for malaria, the variant caused by P. falciparum is often very dangerous and even fatal if untreated. Hemoglobin degradation is one of the key metabolic processes for the survival of the Plasmodium parasite in its host. Plasmepsins, a family of aspartic proteases encoded by the Plasmodium genome, play a prominent role in host hemoglobin cleavage. In this paper we demonstrate the use of virtual screening, in particular molecular docking, employed at a very large scale to identify novel inhibitors for plasmepsins II and IV. A large grid infrastructure, the EGEE grid, was used to address the problem of large computation resources required for docking hundreds of thousands of chemical compounds on different plasmepsin targets of P. falciparum. A large compound library of about 1 million chemical compounds was docked on 5 different targets of plasmepsins using two different docking software, namely FlexX and AutoDock. Several strategies were employed to analyze the results of this virtual screening approach including docking scores, ideal binding modes, and interactions to key residues of the protein. Three different classes of structures with thiourea, diphenylurea, and guanidino scaffolds were identified to be promising hits. While the identification of diphenylurea compounds is in accordance with the literature and thus provides a sort of "positive control", the identification of novel compounds with a guanidino scaffold proves that high throughput docking can be effectively used to identify novel potential inhibitors of P. falciparum plasmepsins. Thus, with the work presented here, we do not only demonstrate the relevance of computational grids in drug discovery but also identify several promising small molecules which have the potential to serve as candidate inhibitors for P. falciparum plasmepsins. With the use of the EGEE grid infrastructure for the virtual screening campaign against the malaria causing parasite P. falciparum we have demonstrated that resource sharing on an eScience infrastructure such as EGEE provides a new model for doing collaborative research to fight diseases of the poor.
Encouraged by the success of the first EGEE biomedical data challenge against malaria (WISDOM), the second data challenge battling avian flu was kicked off in April 2006 to identify new drugs for the potential variants of the influenza A virus. Mobilizing thousands of CPUs on the Grid, the six-week-long high-throughput screening activity has fulfilled over 100 CPU years of computing power and produced around 600 gigabytes of results on the Grid for further biological analysis and testing. In the paper, we demonstrate the impact of a worldwide Grid infrastructure to efficiently deploy large-scale virtual screening to speed up the drug design process. Lessons learned through the data challenge activity are also discussed.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.