Multidrug-resistant bacteria are a severe threat to public health. Conventional antibiotics are becoming increasingly ineffective as a result of resistance, and it is imperative to find new antibacterial strategies. Natural antimicrobials, known as host defence peptides or antimicrobial peptides, defend host organisms against microbes but most have modest direct antibiotic activity. Enhanced variants have been developed using straightforward design and optimization strategies and are being tested clinically. Here, we describe advanced computer-assisted design strategies that address the difficult problem of relating primary sequence to peptide structure, and are delivering more potent, cost-effective, broad-spectrum peptides as potential next-generation antibiotics.
Artificial neural networks had their first heyday in molecular informatics and drug discovery approximately two decades ago. Currently, we are witnessing renewed interest in adapting advanced neural network architectures for pharmaceutical research by borrowing from the field of "deep learning". Compared with some of the other life sciences, their application in drug discovery is still limited. Here, we provide an overview of this emerging field of molecular informatics, present the basic concepts of prominent deep learning methods and offer motivation to explore these techniques for their usefulness in computer-assisted drug discovery and design. We specifically emphasize deep neural networks, restricted Boltzmann machine networks and convolutional networks.
We present a generative long short-term memory (LSTM) recurrent neural network (RNN) for combinatorial de novo peptide design. RNN models capture patterns in sequential data and generate new data instances from the learned context. Amino acid sequences represent a suitable input for these machine-learning models. Generative models trained on peptide sequences could therefore facilitate the design of bespoke peptide libraries. We trained RNNs with LSTM units on pattern recognition of helical antimicrobial peptides and used the resulting model for de novo sequence generation. Of these sequences, 82% were predicted to be active antimicrobial peptides compared to 65% of randomly sampled sequences with the same amino acid distribution as the training set. The generated sequences also lie closer to the training data than manually designed amphipathic helices. The results of this study showcase the ability of LSTM RNNs to construct new amino acid sequences within the applicability domain of the model and motivate their prospective application to peptide and protein design without the need for the exhaustive enumeration of sequence libraries.
Malaria blood stage parasites export a large number of proteins into their host erythrocyte to change it from a container of predominantly hemoglobin optimized for the transport of oxygen into a niche for parasite propagation. To understand this process, it is crucial to know which parasite proteins are exported into the host cell. This has been aided by the PEXEL/HT sequence, a five-residue motif found in many exported proteins, leading to the prediction of the exportome. However, several PEXEL/HT negative exported proteins (PNEPs) indicate that this exportome is incomplete and it remains unknown if and how many further PNEPs exist. Here we report the identification of new PNEPs in the most virulent malaria parasite Plasmodium falciparum. This includes proteins with a domain structure deviating from previously known PNEPs and indicates that PNEPs are not a rare exception. Unexpectedly, this included members of the MSP-7 related protein (MSRP) family, suggesting unanticipated functions of MSRPs. Analyzing regions mediating export of selected new PNEPs, we show that the first 20 amino acids of PNEPs without a classical N-terminal signal peptide are sufficient to promote export of a reporter, confirming the concept that this is a shared property of all PNEPs of this type. Moreover, we took advantage of newly found soluble PNEPs to show that this type of exported protein requires unfolding to move from the parasitophorous vacuole (PV) into the host cell. This indicates that soluble PNEPs, like PEXEL/HT proteins, are exported by translocation across the PV membrane (PVM), highlighting protein translocation in the parasite periphery as a general means in protein export of malaria parasites.
Supplementary data are available at Bioinformatics online.
Many apicomplexan parasites, including Plasmodium falciparum, harbor a so-called apicoplast, a complex plastid of red algal origin which was gained by a secondary endosymbiotic event. The exact molecular mechanisms directing the transport of nuclear-encoded proteins to the apicoplast of P. falciparum are not well understood. Recently, in silico analyses revealed a second copy of proteins homologous to components of the endoplasmic reticulum (ER)-associated protein degradation (ERAD) system in organisms with secondary plastids, including the malaria parasite P. falciparum. These proteins are predicted to be endowed with an apicoplast targeting signal and are suggested to play a role in the transport of nuclear-encoded proteins to the apicoplast. Here, we have studied components of this ERAD-derived putative preprotein translocon complex in malaria parasites. Using transfection technology coupled with fluorescence imaging techniques we can demonstrate that the N terminus of several ERAD-derived components targets green fluorescent protein to the apicoplast. Furthermore, we confirm that full-length PfsDer1-1 and PfsUba1 (homologues of yeast ERAD components) localize to the apicoplast, where PfsDer1-1 tightly associates with membranes. Conversely, PfhDer1-1 (a host-specific copy of the Der1-1 protein) localizes to the ER. Our data suggest that ERAD components have been "rewired" to provide a conduit for protein transport to the apicoplast. Our results are discussed in relation to the nature of the apicoplast protein transport machinery.
Understanding the structure–activity relationships and mechanisms of action of membranolytic anticancer peptides could help them advance to therapeutic success.
Malaria parasites actively remodel the infected red blood cell (irbc) by exporting proteins into the host cell cytoplasm. The human parasite Plasmodium falciparum exports particularly large numbers of proteins, including proteins that establish a vesicular network allowing the trafficking of proteins onto the surface of irbcs that are responsible for tissue sequestration. Like P. falciparum, the rodent parasite P. berghei ANKA sequesters via irbc interactions with the host receptor CD36. We have applied proteomic, genomic, and reverse-genetic approaches to identify P. berghei proteins potentially involved in the transport of proteins to the irbc surface. A comparative proteomics analysis of P. berghei non-sequestering and sequestering parasites was used to determine changes in the irbc membrane associated with sequestration. Subsequent tagging experiments identified 13 proteins (Plasmodium export element (PEXEL)-positive as well as PEXEL-negative) that are exported into the irbc cytoplasm and have distinct localization patterns: a dispersed and/or patchy distribution, a punctate vesicle-like pattern in the cytoplasm, or a distinct location at the irbc membrane. Members of the PEXEL-negative BIR and PEXEL-positive Pb-fam-3 show a dispersed localization in the irbc cytoplasm, but not at the irbc surface. Two of the identified exported proteins are transported to the irbc membrane and were named erythrocyte membrane associated proteins. EMAP1 is a member of the PEXEL-negative Pb-fam-1 family, and EMAP2 is a PEXEL-positive protein encoded by a single copy gene; neither protein plays a direct role in sequestration. Our observations clearly indicate that P. berghei traffics a diverse range of proteins to different cellular locations via mechanisms that are analogous to those employed by P. falciparum. This information can be exploited to generate transgenic humanized rodent P. berghei parasites expressing chimeric P. berghei/P. falciparum proteins on the surface of rodent irbc, thereby opening new avenues for in vivo screening adjunct therapies that block sequestration.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.