This Article describes a novel geometric methodology for analyzing free energy and kinetics of assembly driven by short-range pair-potentials in an implicit solvent and provides a proof-of-concept illustration of its unique capabilities. An atlas is a labeled partition of the assembly landscape into a roadmap of maximal, contiguous, nearly-equipotential-energy conformational regions or macrostates, together with their neighborhood relationships. The new methodology decouples the roadmap generation from sampling and produces: (1) a queryable atlas of local potential energy minima, their basin structure, energy barriers, and neighboring basins; (2) paths between a specified pair of basins, each path being a sequence of conformational regions or macrostates below a desired energy threshold; and (3) approximations of relative path lengths, basin volumes (configurational entropy), and path probabilities. Results demonstrating the core algorithm’s capabilities and high computational efficiency have been generated by a resource-light, curated open source software implementation EASAL (Efficient Atlasing and Search of Assembly Landscapes, 10.1145/3204472ACM Trans. Math. Softw.201844148; see software, Efficient Atlasing and Search of Assembly Landscapes2016; video, Video Illustrating the opensource software EASAL2016; and user guide, EASAL software user guide2016). Running on a laptop with Intel(R) Core(TM) i7-7700@3.60 GHz CPU with 16GB of RAM, EASAL atlases several hundred thousand conformational regions or macrostates in minutes using a single compute core. Subsequent path and basin computations each take seconds. A parallelized EASAL version running on the same laptop with 4 cores gives a 3× speedup for atlas generation. The core algorithm’s correctness, time complexity, and efficiency–accuracy trade-offs are formally guaranteed using modern distance geometry, geometric constraint systems and combinatorial rigidity. The methodology further links the shape of the input assembling units to a type of intuitive and queryable bar-code of the output atlas, which in turn determine stable assembled structures and kinetics. This succinct input–output relationship facilitates reverse analysis and control toward design. A novel feature that is crucial to both the high sampling efficiency and decoupling of roadmap generation from sampling is a recently developed theory of convex Cayley (distance-based) custom parametrizations specific to assembly, as opposed to folding. Representing microstates with macrostate-specific Cayley parameters, to generate microstate samples, avoids gradient-descent search used by all prevailing methods. Further, these parametrizations convexify conformational regions or macrostates. This ratchets up sampling efficiency, significantly reducing number of repeated and discarded samples. These features of the new stand-alone methodology can also be used to complement the strengths of prevailing methodologies including Molecular Dynamics, Monte Carlo, and Fast Fourier Transform based methods.
Viral shell assembly occurs spontaneously in solution, caused by weak inter-atomic interactions between the identical coatprotein monomers that constitute the shell. A sound measure of robustness of the assembly process is therefore based on determining the weak interactions whose disruption significantly disrupts successful assembly.We used data isolated from the X-ray structure of a T=1, Adeno-Associated Virus as input to a new software suite of algorithms, EASAL, (Efficient Atlasing and Search of Assembly Landscapes, developed by 3 of the authors). Since assembly is entropically driven, we predicted and ranked the crucial interactions for assembly, purely by focusing on key changes in the geometry of the assembly configuration space when specific interactions are dropped. Using the same data for mutagenesis experiments towards assembly disruption, the 2 other authors experimentally verified these predictions successfully.This paper briefly describes the problem background for supramolecular assembly, the key features and novelty of EASAL, the geometric features of the configuration space chosen to measure robustness of assembly, the process by which the input data for EASAL is extracted from X-ray crystallography viral structure data, and a validation of EASAL's robustness predictions with mutagenesis results for *
Icosahedral viruses are under a micrometer in diameter, their infectious genome encapsulated by a shell assembled by a multiscale process, starting from an integer multiple of 60 viral capsid or coat protein (VP) monomers. We predict and validate inter-atomic hotspot interactions between VP monomers that are important for the assembly of 3 types of icosahedral viral capsids: Adeno Associated Virus serotype 2 (AAV2) and Minute Virus of Mice (MVM), both T = 1 single stranded DNA viruses, and Bromo Mosaic Virus (BMV), a T = 3 single stranded RNA virus. Experimental validation is by in-vitro, site-directed mutagenesis data found in literature. We combine ab-initio predictions at two scales: at the interface-scale , we predict the importance ( cruciality ) of an interaction for successful subassembly across each interface between symmetry-related VP monomers; and at the capsid-scale , we predict the cruciality of an interface for successful capsid assembly. At the interface-scale, we measure cruciality by changes in the capsid free-energy landscape partition function when an interaction is removed. The partition function computation uses atlases of interface subassembly landscapes, rapidly generated by a novel geometric method and curated opensource software EASAL (efficient atlasing and search of assembly landscapes). At the capsid-scale, cruciality of an interface for successful assembly of the capsid is based on combinatorial entropy. Our study goes all the way from resource-light, multiscale computational predictions of crucial hotspot inter-atomic interactions to validation using data on site-directed mutagenesis’ effect on capsid assembly. By reliably and rapidly narrowing down target interactions, (no more than 1.5 hours per interface on a laptop with Intel Core i5-2500K @ 3.2 Ghz CPU and 8GB of RAM) our predictions can inform and reduce time-consuming in-vitro and in-vivo experiments, or more computationally intensive in-silico analyses.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.