Domains are the three-dimensional building blocks of proteins. An individual domain can be found in a variety of protein architectures that perform unique functions and are subject to different evolutionary selective pressures. We describe an approach to evaluate the variability in amino acid sequences of a single domain across architectural contexts. The ability to distinguish the different evolutionary paths of one protein domain can help determine whether existing knowledge about a specific domain will apply to an uncharacterized protein. Such discrimination can lead to insights and hypotheses about function, as well as guide experimental priorities.We developed and tested our approach on CheW-like domains (PF01584), which mediate protein/protein interactions and are difficult to compare experimentally. CheW-like domains occur in CheW scaffolding proteins, CheA kinases, and CheV proteins that regulate bacterial chemotaxis. We chose 16 protein Architectures that included 94% of all CheW-like domains found in nature. Because some Architectures had more than one CheW-like domain, CheW-like domains were divided into 21 distinct Contexts. The CheW-like domain sequences were closely related within most Contexts; however, one Context was subdivided into three Types. The resulting 23 sequence Types coalesced into five or six Classes of CheW-like domains, which we described in detail.In addition, we created SimpLogo, an innovative method for visualizing amino acid composition across large sets of multiple sequence alignments of arbitrary length. SimpLogo offers substantial advantages over standard sequence logos for comparison and analysis of related protein sequences. The R package for SimpLogo is freely available.
A collection of fungal isolates was obtained from a complex microbial mat, which occupied an iron-rich freshwater spring that feeds into Clear Creek, Golden, Colorado, USA. Two of the fungal isolates, a Glomeromycete (possible Entrophospora sp.) and a Dothideomycete (possible Phaeosphaeria sp.), were investigated for bioactive secondary metabolites. In total, six new compounds consisting of clearanols A–E (5, 6, 10–12) and disulochrin (7) were purified and their structures were determined. Disulochrin exhibited modest antibacterial activity against methicillin-resistant Staphylococcus aureus, whereas clearanol C showed weak inhibitory activity against Candida albicans biofilm formation.
The crystal structure of AdhP, a recombinantly expressed alcohol dehydrogenase from Escherichia coli K-12 (substrain MG1655), was determined to 2.01 Å resolution. The structure, which was solved using molecular replacement, also included the structural and catalytic zinc ions and the cofactor nicotinamide adenine dinucleotide (NAD). The crystals belonged to space group P2 1 , with unit-cell parameters a = 68.18, b = 118.92, c = 97.87 Å , = 106.41. The final R factor and R free were 0.138 and 0.184, respectively. The structure of the active site of AdhP suggested a number of residues that may participate in a proton relay, and the overall structure of AdhP, including the coordination to structural and active-site zinc ions, is similar to those of other tetrameric alcohol dehydrogenase enzymes.
Domains are the three-dimensional building blocks of proteins. An individual domain can occur in a variety of domain architectures that perform unique functions and are subject to different evolutionary selective pressures. We describe an approach to evaluate the variability in amino acid sequences of a single domain across architectural contexts. The ability to distinguish different evolutionary outcomes of one protein domain can help determine whether existing knowledge about a specific domain will apply to an uncharacterized protein, lead to insights and hypotheses about function, and guide experimental priorities. We developed and tested our approach on CheW-like domains (PF01584), which mediate protein/protein interactions and are difficult to compare experimentally. CheW-like domains occur in CheW scaffolding proteins, CheA kinases, and CheV proteins that regulate bacterial chemotaxis. We analyzed 16 domain architectures that included 94% of all CheW-like domains found in nature. We identified six Classes of CheW-like domains with presumed functional differences. CheV and most CheW proteins contained Class 1 domains, whereas some CheW proteins contained Class 6 ($20%) or Class 2 ($1%) domains instead.Most CheA proteins contained Class 3 domains. CheA proteins with multiple Hpt domains contained Class 4 domains. CheA proteins with two CheW-like domains contained one Class 3 and one Class 5. We also created SimpLogo, an innovative method for visualizing amino acid composition across large sets of multiple sequence alignments of arbitrary length. SimpLogo offers substantial advantages over standard sequence logos for comparison and analysis of related protein sequences. The R package for SimpLogo is freely available.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.