Machine learning for protein folding and dynamics

Noé, Frank; Fabritiis, Gianni De; Clementi, Cecilia

doi:10.1016/j.sbi.2019.12.005

Cited by 149 publications

(116 citation statements)

References 78 publications

Supporting

Mentioning

108

Contrasting

Unclassified

Order By: Relevance

“…Standard autoencoders have been used in many applications to MD simulation data. 4,[30][31][32][33][34][35][36][37][38][39][40][41] They connect two separate neural networks, an encoder network and a decoder network, to perform an unsupervised dimensionality reduction on input data (e.g. a protein structure from a frame of an MD simulation).…”

Section: Resultsmentioning

confidence: 99%

DiffNets: deep learning the structural determinants of proteins biochemical properties by comparing different structural ensembles

Ward

Zimmerman

Meller

et al. 2020

Preprint

View full text Add to dashboard Cite

AbstractA mechanistic understanding of how mutations modulate proteins’ biochemical properties would advance our understanding of biology, provide insight for engineering proteins with particular functions, and facilitate efforts in precision medicine. However, such mechanistic insight remains elusive in many cases. For example, experimentally-derived structures of protein variants with dramatically different behaviors are often nearly identical, suggesting that one must consider the entire ensemble of structures that a protein adopts. Molecular dynamics (MD) simulations provide access to such ensembles, but identifying the relevant features of these complex entities remains difficult. Here we develop DiffNets, a deep learning framework that combines supervised autoencoders with expectation maximization to identify the structural preferences that are responsible for the biochemical differences between protein variants. As a proof of principle, we show that DiffNets identify the important structural preferences that confer increased stability to TEM β-lactamase variants without any a priori knowledge of the relevant structural features.

show abstract

Section: Resultsmentioning

confidence: 99%

DiffNets: deep learning the structural determinants of proteins biochemical properties by comparing different structural ensembles

Ward

Zimmerman

Meller

et al. 2020

Preprint

View full text Add to dashboard Cite

show abstract

“…To conclude, we have introduced a new approach to sieve out the spurious solutions from AIaugmented enhanced sampling simulations. 37,38 AI-based approaches have had indisputable impact across sciences, including their use in enhancing the efficiency of molecular simulations.…”

Section: Resultsmentioning

confidence: 99%

“…35,36 Artificial intelligence (AI) potentially provides a systematic means to differentiate signal from noise in generic data, and thus discover relevant CVs to accelerate the simulations. [37][38][39][40][41] A number of such AI-based approaches have been proposed recently [37][38][39]42,43 and remain the subject of extensive research. A common underlying theme in these methods is to exploit AI tools to gradually uncover the underlying effective geometry, parametrize it on-the-fly, and exploit it to bias the design of experiments with the MD simulator by emphasizing informative configuration space areas that have not been explored before.…”

Section: Introductionmentioning

confidence: 99%

Confronting pitfalls of AI-augmented molecular dynamics using statistical physics

Pant

Smith

Wang

et al. 2020

Preprint

View full text Add to dashboard Cite

Artificial intelligence (AI) based approaches have had indubitable impact across the sciences through the ability to make sense of data. Recently AI has also seen use for enhancing the efficiency of molecular simulations, wherein AI derived slow modes are used to accelerate the simulation in targeted ways. However, while typical fields where AI is used are characterized by a plethora of data, molecular simulations per construction suffer from limited sampling and thus limited data. As such the use of AI in molecular simulations can suffer from a dangerous situation where the AI optimization could get stuck in spurious regimes, leading to incorrect characterization of the reaction coordinate for a given problem at hand. When such an incorrectly characterized reaction coordinate is then used to perform additional simulations or even experiments, one could start to deviate further and further from the ground truth. To deal with this problem of spurious AI solutions, here we report a new and automated algorithm using ideas from statistical mechanics. It is based on the notion that a more reliable AI solution for many problems in chemistry and biophysics will be one that maximizes the time scale separation between slow and fast processes. To learn this timescale separation even from limited data, we use a maximum path entropy or Caliber based framework. We show the applicability of this automatic protocol for 3 classic benchmark problems. Here we capture the conformational dynamics of a model peptide, ligand unbinding dynamics from a protein and the extensive sampling of the folding/unfolding energy landscape of a GB1 peptide. We believe our work will lead to increased and robust use of trustworthy AI in molecular simulations of complex systems.

show abstract

“…[46] Furthermore,r ecent advances in machine learning have successfully provided reduced models to reproduce the equilibrium thermodynamics of macromolecules with less computational time compared to the computationally expensive atomistic or ab initio molecular dynamics simulations. [47] It is,i ndeed, expected that advances in artificial intelligence and machine learning will already open up entirely new perspectives in the next couple of years for the de novo design of synthetic macromolecules by reasonably accurate predictions of their energy landscapes. [48] With this information, synthetic chemists can avoid the time-consuming trial-anderror approach in the laboratory and directly synthesize the desired well-folded functional macromolecules predicted by theory.L ikewise,b ym eans of correctly predicted surface properties,t he intermolecular interactions and the selfassembly of larger molecules through multivalent interactions can be tailored, which will revolutionize both the life [47] and materials sciences.…”

Section: Angewandte Chemiementioning

confidence: 99%

A Periodic System of Supramolecular Elements

Schmidt

Würthner

2020

Angew Chem Int Ed

View full text Add to dashboard Cite

Chemistry “beyond the molecule” is based on weak, noncovalent, and reversible interactions. As a consequence of these bonds being weak, structural organization by folding and self‐assembly can only be fully exploited with larger molecules that can provide multiple binding sites. Such “supramolecules” can now be synthesized and their folding into desired conformations predicted. A new level of chemistry can now be realized through the creation of non‐natural entities composed of molecular building blocks with defined secondary structures. Herein we define these building blocks as “supramolecular elements”. We anticipate that further research on such large molecules will reveal construction principles dictated by recurring motifs that govern structure formation through folding and self‐assembly. These principles are comparable to the organization of atoms in the Periodic Table of Chemical Elements and may lead to the establishment of a Periodic System of Supramolecular Elements.

show abstract

Machine learning for protein folding and dynamics

Cited by 149 publications

References 78 publications

DiffNets: deep learning the structural determinants of proteins biochemical properties by comparing different structural ensembles

DiffNets: deep learning the structural determinants of proteins biochemical properties by comparing different structural ensembles

Confronting pitfalls of AI-augmented molecular dynamics using statistical physics

A Periodic System of Supramolecular Elements

Contact Info

Product

Resources

About