We present ABACUS-R, a method based on deep learning for designing amino acid sequences that autonomously fold into a given target backbone. This method predicts the sidechain type of a central residue from its 3D local environment by using an encoder-decoder network trained with a multi-task learning strategy. The environmental features encoded by the network include the types but not the conformations of the sidechains of surrounding residues. This eliminates the needs for reconstructing and optimizing sidechain structures, and drastically simplifies the sequence design process. Thus iteratively applying the encoder-decoder to different central residues is able to produce self-consistent overall sequences for a target backbone. Extensive results of wet experiments, including five structures solved by X-ray crystallography, show that ABACUS-R outperforms state-of-the-art energy function-based de novo sequence design methods by significant margins in success rate and design precision. ABACUS-R constitutes a robust tool for wide applications in protein design and protein engineering.
In the version of this article initially published, reference 64 (Liu, Y. et al. ABACUS-R: Rotamer-free protein sequence design based on deep learning and self-consistency (Code Ocean, 2022); https://doi.org/10.24433/CO.3351944.v1) was omitted from the reference list and the Code availability section. It has been restored to the HTML and PDF versions of the article.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.