Structure determination of linear peptides of 5–50 amino acids in aqueous solution and interacting with proteins is a key aspect in structural biology. PEP-FOLD3 is a novel computational framework, that allows both (i) de novo free or biased prediction for linear peptides between 5 and 50 amino acids, and (ii) the generation of native-like conformations of peptides interacting with a protein when the interaction site is known in advance. PEP-FOLD3 is fast, and usually returns solutions in a few minutes. Testing PEP-FOLD3 on 56 peptides in aqueous solution led to experimental-like conformations for 80% of the targets. Using a benchmark of 61 peptide–protein targets starting from the unbound form of the protein receptor, PEP-FOLD3 was able to generate peptide poses deviating on average by 3.3Å from the experimental conformation and return a native-like pose in the first 10 clusters for 52% of the targets. PEP-FOLD3 is available at http://bioserv.rpbs.univ-paris-diderot.fr/services/PEP-FOLD3.
Protein misfolding and aggregation is observed in many amyloidogenic diseases affecting either the central nervous system or a variety of peripheral tissues. Structural and dynamic characterization of all species along the pathways from monomers to fibrils is challenging by experimental and computational means because they involve intrinsically disordered proteins in most diseases. Yet understanding how amyloid species become toxic is the challenge in developing a treatment for these diseases. Here we review what computer, in vitro, in vivo and pharmacological experiments tell us about the accumulation and deposition of the oligomers of the (Aβ, tau), α-synuclein, IAPP and superoxide dismutase 1 proteins, which have been the mainstream concept underlying Alzheimer's disease (AD), Parkinson's disease (PD), type II diabetes (T2D) and amyotrophic lateral sclerosis (ALS) research, respectively for over many years.While SOD1 is a globular protein with a well-defined 3D structure, the Aβ, tau and α-synuclein proteins belong to the class of intrinsically disordered proteins (IDPs). IDPs are also known to play a critical role in many cellular functions such as signal transduction, cell growth, binding with DNA and RNA, and transcription, and are implicated in the development of cardiovascular problems and cancers 29 . The IDPs involved in neurodegenerative diseases have a few aggregation-prone regions and overall all IDPs have a low mean hydrophobicity and a high mean net charge 30 .IDPs are structurally flexible and lack stable secondary structures in aqueous solution. When isolated, they behave as polymers in a good solvent and their radii of gyration are well described by the Flory scaling law. 31 The insolubility and high self-assembly propensity of IDPs implicated in degenerative diseases have prevented high-resolution structural determination by solution nuclear magnetic resolution (NMR) and X-ray diffraction experiments. Local information at all aggregation steps can be, however, obtained by chemical shifts, residual coupling constants, and J-couplings from NMR, exchange hydrogen/deuterium (H/D) NMR, Raman spectroscopy; and secondary structure from fast Fourier infrared spectroscopy (FTIR) or circular dichroism (CD). Long-range tertiary contacts can be deduced from paramagnetic relaxation enhancement (PRE) NMR spectroscopy and single molecule Förster resonance energy transfer (sm-FRET), and short-range distance contacts can be extracted by cross linked residues determined by mass spectrometry (MS). Low-resolution 3D information of monomers and oligomers can be obtained by ion-mobility mass-spectrometry data (IM/MS) providing cross-collision sections, dynamic light scattering (DLS), pulse field gradient NMR spectroscopy and fluorescence correlation spectroscopy (FCS) providing hydrodynamics radius, small-angle X-ray scattering (SAXS) and small-angle neutron scattering (SANS), atomic force microscopy (AFM) and transmission electron microscopy (TEM) providing height features of the aggregates, as reported by some o...
In the context of the renewed interest of peptides as therapeutics, it is important to have an on-line resource for 3D structure prediction of peptides with well-defined structures in aqueous solution. We present an updated version of PEP-FOLD allowing the treatment of both linear and disulphide bonded cyclic peptides with 9–36 amino acids. The server makes possible to define disulphide bonds and any residue–residue proximity under the guidance of the biologists. Using a benchmark of 34 cyclic peptides with one, two and three disulphide bonds, the best PEP-FOLD models deviate by an average RMS of 2.75 Å from the full NMR structures. Using a benchmark of 37 linear peptides, PEP-FOLD locates lowest-energy conformations deviating by 3 Å RMS from the NMR rigid cores. The evolution of PEP-FOLD comes as a new on-line service to supersede the previous server. The server is available at: http://bioserv.rpbs.univ-paris-diderot.fr/PEP-FOLD.
Peptides and mini proteins have many biological and biomedical implications, which motivates the development of accurate methods, suitable for large-scale experiments, to predict their experimental or native conformations solely from sequences. In this study, we report PEP-FOLD2, an improved coarse grained approach for peptide de novo structure prediction and compare it with PEP-FOLD1 and the state-of-the-art Rosetta program. Using a benchmark of 56 structurally diverse peptides with 25-52 amino acids and a total of 600 simulations for each system, PEP-FOLD2 generates higher quality models than PEP-FOLD1, and PEP-FOLD2 and Rosetta generate near-native or native models for 95% and 88% of the targets, respectively. In the situation where we do not have any experimental structures at hand, PEP-FOLD2 and Rosetta return a near-native or native conformation among the top five best scored models for 80% and 75% of the targets, respectively. While the PEP-FOLD2 prediction rate is better than the ROSETTA prediction rate by 5%, this improvement is non-negligible because PEP-FOLD2 explores a larger conformational space than ROSETTA and consists of a single coarse-grained phase. Our results indicate that if the coarse-grained PEP-FOLD2 method is approaching maturity, we are not at the end of the game of mini-protein structure prediction, but this opens new perspectives for large-scale in silico experiments.
Rational peptide design and large-scale prediction of peptide structure from sequence remain a challenge for chemical biologists. We present PEP-FOLD, an online service, aimed at de novo modelling of 3D conformations for peptides between 9 and 25 amino acids in aqueous solution. Using a hidden Markov model-derived structural alphabet (SA) of 27 four-residue letters, PEP-FOLD first predicts the SA letter profiles from the amino acid sequence and then assembles the predicted fragments by a greedy procedure driven by a modified version of the OPEP coarse-grained force field. Starting from an amino acid sequence, PEP-FOLD performs series of 50 simulations and returns the most representative conformations identified in terms of energy and population. Using a benchmark of 25 peptides with 9–23 amino acids, and considering the reproducibility of the runs, we find that, on average, PEP-FOLD locates lowest energy conformations differing by 2.6 Å Cα root mean square deviation from the full NMR structures. PEP-FOLD can be accessed at http://bioserv.rpbs.univ-paris-diderot.fr/PEP-FOLD
Growing evidence supports that amyloid β (Aβ) oligomers are the major causative agents leading to neural cell death in Alzheimer's disease. The polyphenol (-)-epigallocatechin gallate (EGCG) was recently reported to inhibit Aβ fibrillization and redirect Aβ aggregation into unstructured, off-pathway oligomers. Given the experimental challenge to characterize the structures of Aβ/EGCG complexes, we performed extensive atomistic replica exchange molecular dynamics simulations of Aβ1-42 dimer in the present and absence of EGCG in explicit solvent. Our equilibrium Aβ dimeric structures free of EGCG are consistent with the collision cross section from ion-mobility mass spectrometry and the secondary structure composition from circular dichroism experiment. In the presence of EGCG, the Aβ structures are characterized by increased inter-center-of-mass distances, reduced interchain and intrachain contacts, reduced β-sheet content, and increased coil and α-helix contents. Analysis of the free energy surfaces reveals that the Aβ dimer with EGCG adopts new conformations, affecting therefore its propensity to adopt fibril-prone states. Overall, this study provides, for the first time, insights on the equilibrium structures of Aβ1-42 dimer in explicit aqueous solution and an atomic picture of the EGCG-mediated conformational change on Aβ dimer.
We have revisited the protein coarse-grained optimized potential for efficient structure prediction (OPEP). The training and validation sets consist of 13 and 16 protein targets. Because optimization depends on details of how the ensemble of decoys is sampled, trial conformations are generated by molecular dynamics, threading, greedy, and Monte Carlo simulations, or taken from publicly available databases. The OPEP parameters are varied by a genetic algorithm using a scoring function which requires that the native structure has the lowest energy, and the native-like structures have energy higher than the native structure but lower than the remote conformations. Overall, we find that OPEP correctly identifies 24 native or native-like states for 29 targets and has very similar capability to the all-atom discrete optimized protein energy model (DOPE), found recently to outperform five currently used energy models.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.