Over the last 25 years, biology has entered the genomic era and is becoming a science of ‘big data’. Most interpretations of genomic analyses rely on accurate functional annotations of the proteins encoded by more than 500 000 genomes sequenced to date. By different estimates, only half the predicted sequenced proteins carry an accurate functional annotation, and this percentage varies drastically between different organismal lineages. Such a large gap in knowledge hampers all aspects of biological enterprise and, thereby, is standing in the way of genomic biology reaching its full potential. A brainstorming meeting to address this issue funded by the National Science Foundation was held during 3–4 February 2022. Bringing together data scientists, biocurators, computational biologists and experimentalists within the same venue allowed for a comprehensive assessment of the current state of functional annotations of protein families. Further, major issues that were obstructing the field were identified and discussed, which ultimately allowed for the proposal of solutions on how to move forward.
The pyridoxal 5′‐phosphate (PLP) homeostasis protein (PLPHP) is a ubiquitous member of the COG0325 family with apparently no catalytic activity. Although the actual cellular role of this protein is unknown, it has been observed that mutations of the PLPHP encoding gene affect the activity of PLP‐dependent enzymes, B6 vitamers and amino acid levels. Here we report a detailed characterization of the Escherichia coli ortholog of PLPHP (YggS) with respect to its PLP binding and transfer properties, stability, and structure. YggS binds PLP very tightly and is able to slowly transfer it to a model PLP‐dependent enzyme, serine hydroxymethyltransferase. PLP binding to YggS elicits a conformational/flexibility change in the protein structure that is detectable in solution but not in crystals. We serendipitously discovered that the K36A variant of YggS, affecting the lysine residue that binds PLP at the active site, is able to bind PLP covalently. This observation led us to recognize that a number of lysine residues, located at the entrance of the active site, can replace Lys36 in its PLP binding role. These lysines form a cluster of charged residues that affect protein stability and conformation, playing an important role in PLP binding and possibly in YggS function.
Pyridoxal 5’-phosphate or PLP is a cofactor derived from B6 vitamers and essential for growth in all known organisms. PLP synthesis and salvage pathways are well characterized in a few model species even though key components, such as the vitamin B6 transporters, are still to be identified in many organisms including the model bacteria Escherichia coli or Bacillus subtilis . Using a comparative genomic approach, PLP synthesis and salvage pathways were predicted in 5840 bacterial and archaeal species with complete genomes. The distribution of the two known de novo biosynthesis pathways and previously identified cases of non-orthologous displacements were surveyed in the process. This analysis revealed that several PLP de novo pathway genes remain to be identified in many organisms, either because sequence similarity alone cannot be used to discriminate among several homologous candidates or due to non-orthologous displacements. Candidates for some of these pathway holes were identified using published TnSeq data, but many remain. We find that ~10 % of the analysed organisms rely on salvage but further analyses will be required to identify potential transporters. This work is a starting point to model the exchanges of B6 vitamers in communities, predict the sensitivity of a given organism to drugs targeting PLP synthesis enzymes, and identify numerous gaps in knowledge that will need to be tackled in the years to come.
Capturing the published corpus of information on all members of a given protein family should be an essential step in any study focusing on any specific member of that said family. This step is often performed only superficially or partially by experimentalists as the most common approaches and tools to pursue this objective are far from optimal. Using a previously gathered dataset of 284 references mentioning a member of the DUF34 (NIF3/Ngg1-interacting Factor 3), we evaluated the productivity of different databases and search tools, and devised a workflow that can be used by experimentalists to capture the most information in less time. To complement this workflow, web-based platforms allowing for the exploration of member distributions for several protein families across sequenced genomes or for the capture of gene neighborhood information were reviewed for their versatility, completeness and ease of use. Recommendations that can be used for experimentalist users, as well as educators, are provided and integrated within a customized, publicly accessible Wiki.
Pyridoxal 5′-phosphate (PLP) is the active form of vitamin B6 and a cofactor for many essential metabolic processes such as amino acid biosynthesis and one carbon metabolism. 4’-deoxypyridoxine (4dPN) is a long known B6 antimetabolite but its mechanism of action was not totally clear. By exploring different conditions in which PLP metabolism is affected in the model organism Escherichia coli K12, we showed that 4dPN cannot be used as a source of vitamin B6 as previously claimed and that it is toxic in several conditions where vitamin B6 homeostasis is affected, such as in a B6 auxotroph or in a mutant lacking the recently discovered PLP homeostasis gene, yggS. In addition, we found that 4dPN sensitivity is likely the result of multiple modes of toxicity, including inhibition of PLP-dependent enzyme activity by 4’-deoxypyridoxine phosphate (4dPNP) and inhibition of cumulative pyridoxine (PN) uptake. These toxicities are largely dependent on the phosphorylation of 4dPN by pyridoxal kinase (PdxK).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.