Rubisco is the primary carboxylase of the Calvin cycle, the most abundant enzyme in the biosphere, and one of the best-characterized enzymes. On the basis of correlations between Rubisco kinetic parameters, it is widely posited that constraints embedded in the catalytic mechanism enforce trade-offs between CO2 specificity, S C/O, and maximum carboxylation rate, k cat,C. However, the reasoning that established this view was based on data from ≈20 organisms. Here, we re-examine models of trade-offs in Rubisco catalysis using a data set from ≈300 organisms. Correlations between kinetic parameters are substantially attenuated in this larger data set, with the inverse relationship between k cat,C and S C/O being a key example. Nonetheless, measured kinetic parameters display extremely limited variation, consistent with a view of Rubisco as a highly constrained enzyme. More than 95% of k cat,C values are between 1 and 10 s–1, and no measured k cat,C exceeds 15 s–1. Similarly, S C/O varies by only 30% among Form I Rubiscos and <10% among C3 plant enzymes. Limited variation in S C/O forces a strong positive correlation between the catalytic efficiencies (k cat/K M) for carboxylation and oxygenation, consistent with a model of Rubisco catalysis in which increasing the rate of addition of CO2 to the enzyme–substrate complex requires an equal increase in the O2 addition rate. Altogether, these data suggest that Rubisco evolution is tightly constrained by the physicochemical limits of CO2/O2 discrimination.
The automatic extraction of chemical information from text requires the recognition of chemical entity mentions as one of its key steps. When developing supervised named entity recognition (NER) systems, the availability of a large, manually annotated text corpus is desirable. Furthermore, large corpora permit the robust evaluation and comparison of different approaches that detect chemicals in documents. We present the CHEMDNER corpus, a collection of 10,000 PubMed abstracts that contain a total of 84,355 chemical entity mentions labeled manually by expert chemistry literature curators, following annotation guidelines specifically defined for this task. The abstracts of the CHEMDNER corpus were selected to be representative for all major chemical disciplines. Each of the chemical entity mentions was manually labeled according to its structure-associated chemical entity mention (SACEM) class: abbreviation, family, formula, identifier, multiple, systematic and trivial. The difficulty and consistency of tagging chemicals in text was measured using an agreement study between annotators, obtaining a percentage agreement of 91. For a subset of the CHEMDNER corpus (the test set of 3,000 abstracts) we provide not only the Gold Standard manual annotations, but also mentions automatically detected by the 26 teams that participated in the BioCreative IV CHEMDNER chemical mention recognition task. In addition, we release the CHEMDNER silver standard corpus of automatically extracted mentions from 17,000 randomly selected PubMed abstracts. A version of the CHEMDNER corpus in the BioC format has been generated as well. We propose a standard for required minimum information about entity annotations for the construction of domain specific corpora on chemical and drug entities. The CHEMDNER corpus and annotation guidelines are available at: http://www.biocreative.org/resources/biocreative-iv/chemdner-corpus/
SummaryBacterial two-component systems (TCS) are key signal transduction networks regulating global responses to environmental change. Environmental signals may modulate the phosphorylation state of sensor kinases (SK). The phosphorylated SK transfers the phosphate to its cognate response regulator (RR), which causes physiological response to the signal. Frequently, the SK is bifunctional and, when unphosphorylated, it is also capable of dephosphorylating the RR. The phosphatase activity may also be modulated by environmental signals. Using the EnvZ/ OmpR system as an example, we constructed mathematical models to examine the steady-state and kinetic properties of the network. Mathematical modelling reveals that the TCS can show bistable behaviour for a given range of parameter values if unphosphorylated SK and RR form a dead-end complex that prevents SK autophosphorylation. Additionally, for bistability to exist the major dephosphorylation flux of the RR must not depend on the unphosphorylated SK. Structural modelling and published affinity studies suggest that the unphosphorylated SK EnvZ and the RR OmpR form a dead-end complex. However, bistability is not possible because the dephosphorylation of OmpR~P is mainly done by unphosphorylated EnvZ. The implications of this potential bistability in the design of the EnvZ/OmpR network and other TCS are discussed.
The number of software packages for kinetic modeling of biochemical networks continues to grow. Although most packages share a common core of functionality, the specific capabilities and user interfaces of different packages mean that choosing the best package for a given task is not trivial. We compare 12 software packages with respect to their functionality, reliability, efficiency, user-friendliness and compatibility. Although most programs performed reliably in all numerical tasks tested, SBML compatibility and the set-up of multicompartmentalization are problematic in many packages. For simple models, GEPASI seems the best choice for non-expert users. For large-scale models, environments such as Jarnac/JDesigner are preferable, because they allow modular implementation of models. Virtual Cell is the most versatile program and provides the simplest and clearest functionality for setting up multicompartmentalization.
SummarySignal transduction by a traditional two-component system involves a sensor protein that recognizes a physiological signal, autophosphorylates and transfers its phosphate, and a response regulator protein that receives the phosphate, alters its affinity toward specific target proteins or DNA sequences and causes change in metabolic activity or gene expression. In some cases the sensor protein, when unphosphorylated, has a positive effect upon the rate of dephosphorylation of the regulator protein (bifunctional sensor), whereas in other cases it has no such effect (monofunctional sensor). In this work we identify structural and functional differences between these two designs. In the first part of the paper we use sequence data for two-component systems from several organisms and homology modelling techniques to determine structural features for response regulators and for sensors. Our results indicate that each type of reference sensor (bifunctional and monofunctional) has a distinctive structural feature, which we use to make predictions regarding the functionality of other sensors. In the second part of the paper we use mathematical models to analyse and compare the physiological function of systems that differ in the type of sensor and are otherwise equivalent. Our results show that a bifunctional sensor is better than a monofunctional sensor both at amplifying changes in the phosphorylation level of the regulator caused by signals from the sensor and at attenuating changes caused by signals from small phosphodonors. Crosstalk to or from other two-component systems is better suppressed if the transmitting sensor is monofunctional, which is the more appropriate design when such cross-talk represents pathological noise. Crosstalk to or from other two-component systems is better amplified if the transmitting sensor is bifunctional, which is the more appropriate design when such cross-talk represents a physiological signal. These results provide a functional rationale for the selection of each design that is consistent with available experimental evidence for several two-component systems.
BackgroundIdentifying organism-environment interactions at the molecular level is crucial to understanding how organisms adapt to and change the chemical and molecular landscape of their habitats. In this work we investigated whether relative amino acid compositions could be used as a molecular signature of an environment and whether such a signature could also be observed at the level of the cellular amino acid composition of the microorganisms that inhabit that environment.Methodologies/Principal FindingsTo address these questions we collected and analyzed environmental amino acid determinations from the literature, and estimated from complete genomic sequences the global relative amino acid abundances of organisms that are cognate to the different types of environment. Environmental relative amino acid abundances clustered into broad groups (ocean waters, host-associated environments, grass land environments, sandy soils and sediments, and forest soils), indicating the presence of amino acid signatures specific for each environment. These signatures correlate to those found in organisms. Nevertheless, relative amino acid abundance of organisms was more influenced by GC content than habitat or phylogeny.ConclusionsOur results suggest that relative amino acid composition can be used as a signature of an environment. In addition, we observed that the relative amino acid composition of organisms is not highly determined by environment, reinforcing previous studies that find GC content to be the major factor correlating to amino acid composition in living organisms.
BackgroundModel organisms are used for research because they provide a framework on which to develop and optimize methods that facilitate and standardize analysis. Such organisms should be representative of the living beings for which they are to serve as proxy. However, in practice, a model organism is often selected ad hoc, and without considering its representativeness, because a systematic and rational method to include this consideration in the selection process is still lacking.Methodology/Principal FindingsIn this work we propose such a method and apply it in a pilot study of strengths and limitations of Saccharomyces cerevisiae as a model organism. The method relies on the functional classification of proteins into different biological pathways and processes and on full proteome comparisons between the putative model organism and other organisms for which we would like to extrapolate results. Here we compare S. cerevisiae to 704 other organisms from various phyla. For each organism, our results identify the pathways and processes for which S. cerevisiae is predicted to be a good model to extrapolate from. We find that animals in general and Homo sapiens in particular are some of the non-fungal organisms for which S. cerevisiae is likely to be a good model in which to study a significant fraction of common biological processes. We validate our approach by correctly predicting which organisms are phenotypically more distant from S. cerevisiae with respect to several different biological processes.Conclusions/SignificanceThe method we propose could be used to choose appropriate substitute model organisms for the study of biological processes in other species that are harder to study. For example, one could identify appropriate models to study either pathologies in humans or specific biological processes in species with a long development time, such as plants.
SUMMARYWe have developed an assay based on rice embryogenic callus for rapid functional characterization of metabolic genes. We validated the assay using a selection of well-characterized genes with known functions in the carotenoid biosynthesis pathway, allowing rapid visual screening of callus phenotypes based on tissue color. We then used the system to identify the functions of two uncharacterized genes: a chemically synthesized b-carotene ketolase gene optimized for maize codon usage, and a wild-type Arabidopsis thaliana ortholog of the cauliflower , we found that the wild-type Orange allele was sufficient to induce chromoplast differentiation. We also found that chromoplast differentiation was induced by increasing the availability of precursors and thus driving flux through the pathway, even in the absence of Orange. Remarkably, we found that diverse endosperm-specific promoters were highly active in rice callus despite their restricted activity in mature plants. Our callus system provides a unique opportunity to predict the effect of metabolic engineering in complex pathways, and provides a starting point for quantitative modeling and the rational design of engineering strategies using synthetic biology. We discuss the impact of our data on analysis and engineering of the carotenoid biosynthesis pathway.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.