Domains are the three-dimensional building blocks of proteins. An individual domain can be found in a variety of protein architectures that perform unique functions and are subject to different evolutionary selective pressures. We describe an approach to evaluate the variability in amino acid sequences of a single domain across architectural contexts. The ability to distinguish the different evolutionary paths of one protein domain can help determine whether existing knowledge about a specific domain will apply to an uncharacterized protein. Such discrimination can lead to insights and hypotheses about function, as well as guide experimental priorities.We developed and tested our approach on CheW-like domains (PF01584), which mediate protein/protein interactions and are difficult to compare experimentally. CheW-like domains occur in CheW scaffolding proteins, CheA kinases, and CheV proteins that regulate bacterial chemotaxis. We chose 16 protein Architectures that included 94% of all CheW-like domains found in nature. Because some Architectures had more than one CheW-like domain, CheW-like domains were divided into 21 distinct Contexts. The CheW-like domain sequences were closely related within most Contexts; however, one Context was subdivided into three Types. The resulting 23 sequence Types coalesced into five or six Classes of CheW-like domains, which we described in detail.In addition, we created SimpLogo, an innovative method for visualizing amino acid composition across large sets of multiple sequence alignments of arbitrary length. SimpLogo offers substantial advantages over standard sequence logos for comparison and analysis of related protein sequences. The R package for SimpLogo is freely available.
Azorhizobium caulinodans is a nitrogen-fixing bacterium that forms root nodules on its host legume, Sesbania rostrata . This agriculturally significant symbiotic relationship is important in lowland rice cultivation, and allows for nitrogen fixation under flood conditions. Chemotaxis plays an important role in bacterial colonization of the rhizosphere. Plant roots release chemical compounds that are sensed by bacteria, triggering chemotaxis along a concentration gradient toward the roots. This gives motile bacteria a significant competitive advantage during root surface colonization. Although plant-associated bacterial genomes often encode multiple chemotaxis systems, A. caulinodans appears to encode only one. The che cluster on the A. caulinodans genome contains cheA , cheW , cheY2 , cheB , and cheR . Two other chemotaxis genes, cheY1 and cheZ , are located independently from the che operon. Both CheY1 and CheY2 are involved in chemotaxis, with CheY1 being the predominant signaling protein. A. caulinodans CheA contains an unusual set of C-terminal domains: a CheW-like/Receiver pair (termed W2-Rec), follows the more common single CheW-like domain. W2-Rec impacts both chemotaxis and CheA function. We found a preference for transfer of phosphoryl groups from CheA to CheY2, rather than to W2-Rec or CheY1, which appears to be involved in flagellar motor binding. Furthermore, we observed increased phosphoryl group stabilities on CheY1 compared to CheY2 or W2-Rec. Finally, CheZ enhanced dephosphorylation of CheY2 substantially more than CheY1, but had no effect on the dephosphorylation rate of W2-Rec. This network of phosphotransfer reactions highlights a previously uncharacterized scheme for regulation of chemotactic responses. IMPORTANCE Chemotaxis allows bacteria to move towards nutrients and away from toxins in their environment. Chemotactic movement provides a competitive advantage over non-specific motion. CheY is an essential mediator of the chemotactic response with phosphorylated and unphosphorylated forms of CheY differentially interacting with the flagellar motor to change swimming behavior. Previously established schemes of CheY dephosphorylation include action of a phosphatase and/or transfer of the phosphoryl group to another receiver domain that acts as a sink. Here, we propose A. caulinodans uses a concerted mechanism in which the Hpt domain of CheA, CheY2, and CheZ function together as a dual sink system to rapidly reset chemotactic signaling. To the best of our knowledge, this mechanism is unlike any that have previously been evaluated. Chemotaxis systems that utilize both receiver and Hpt domains as phosphate sinks likely occur in other bacterial species.
Domains are the three-dimensional building blocks of proteins. An individual domain can occur in a variety of domain architectures that perform unique functions and are subject to different evolutionary selective pressures. We describe an approach to evaluate the variability in amino acid sequences of a single domain across architectural contexts. The ability to distinguish different evolutionary outcomes of one protein domain can help determine whether existing knowledge about a specific domain will apply to an uncharacterized protein, lead to insights and hypotheses about function, and guide experimental priorities. We developed and tested our approach on CheW-like domains (PF01584), which mediate protein/protein interactions and are difficult to compare experimentally. CheW-like domains occur in CheW scaffolding proteins, CheA kinases, and CheV proteins that regulate bacterial chemotaxis. We analyzed 16 domain architectures that included 94% of all CheW-like domains found in nature. We identified six Classes of CheW-like domains with presumed functional differences. CheV and most CheW proteins contained Class 1 domains, whereas some CheW proteins contained Class 6 ($20%) or Class 2 ($1%) domains instead.Most CheA proteins contained Class 3 domains. CheA proteins with multiple Hpt domains contained Class 4 domains. CheA proteins with two CheW-like domains contained one Class 3 and one Class 5. We also created SimpLogo, an innovative method for visualizing amino acid composition across large sets of multiple sequence alignments of arbitrary length. SimpLogo offers substantial advantages over standard sequence logos for comparison and analysis of related protein sequences. The R package for SimpLogo is freely available.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.