Background: The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function. Results: Here, we report on the results of the third CAFA challenge, CAFA3, which featured an expanded analysis over the previous CAFA rounds, both in terms of the volume of data analyzed and the types of analysis performed. In a major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole-genome mutation screening in Candida albicans and Pseudomonas aeruginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility. We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory. Conclusion: We conclude that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than the expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. Finally, we report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens.
Summary: Cross-experiment comparisons in public data compendia are challenged by unmatched conditions and technical noise. The ADAGE method, which performs unsupervised integration with denoising autoencoder neural networks, can identify biological patterns, but because ADAGE models, like many neural networks, are over-parameterized, different ADAGE models perform equally well. To enhance model robustness and better build signatures consistent with biological pathways, we developed an ensemble ADAGE (eADAGE) that integrates stable signatures across models. We applied eADAGE to a compendium of Pseudomonas aeruginosa gene expression profiling experiments performed in 78 media. eADAGE revealed a phosphate starvation response controlled by PhoB in media with moderate phosphate and predicted that a second stimulus provided by the sensor kinase, KinB, is required for this PhoB activation. We validated this relationship using both targeted and unbiased genetic approaches. eADAGE, which captures stable biological patterns, enables cross-experiment comparisons that can highlight measured but undiscovered relationships.
What are bacteria doing during “reversible attachment,” the period of transient surface attachment when they initially engage a surface, besides attaching themselves to the surface? Can an attaching cell help any other cell attach? If so, does it help all cells or employ a more selective strategy to help either nearby cells (spatial neighbors) or its progeny (temporal neighbors)? Using community tracking methods at the single-cell resolution, we suggest answers to these questions based on how reversible attachment progresses during surface sensing for Pseudomonas aeruginosa strains PAO1 and PA14. Although PAO1 and PA14 exhibit similar trends of surface cell population increase, they show unanticipated differences when cells are considered at the lineage level and interpreted using the quantitative framework of an exactly solvable stochastic model. Reversible attachment comprises two regimes of behavior, processive and nonprocessive, corresponding to whether cells of the lineage stay on the surface long enough to divide, or not, before detaching. Stark differences between PAO1 and PA14 in the processive regime of reversible attachment suggest the existence of two surface colonization strategies. PAO1 lineages commit quickly to a surface compared to PA14 lineages, with early c-di-GMP-mediated exopolysaccharide (EPS) production that can facilitate the attachment of neighbors. PA14 lineages modulate their motility via cyclic AMP (cAMP) and retain memory of the surface so that their progeny are primed for improved subsequent surface attachment. Based on the findings of previous studies, we propose that the differences between PAO1 and PA14 are potentially rooted in downstream differences between Wsp-based and Pil-Chp-based surface-sensing systems, respectively. IMPORTANCE The initial pivotal phase of bacterial biofilm formation known as reversible attachment, where cells undergo a period of transient surface attachment, is at once universal and poorly understood. 
What is more, although we know that reversible attachment culminates ultimately in irreversible attachment, it is not clear how reversible attachment progresses phenotypically, as bacterial surface-sensing circuits fundamentally alter cellular behavior. We analyze diverse observed bacterial behavior one family at a time (defined as a full lineage of cells related to one another by division) using a unifying stochastic model and show that our findings lead to insights on the time evolution of reversible attachment and the social cooperative dimension of surface attachment in PAO1 and PA14 strains.
The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function. Here we report on the results of the third CAFA challenge, CAFA3, which featured an expanded analysis over the previous CAFA rounds, both in terms of the volume of data analyzed and the types of analysis performed. In a major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole-genome mutation screening in Candida albicans and Pseudomonas aeruginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility (P. aeruginosa only). We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory. We conclude that, while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. We finally report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens.
Predicting GO terms for a protein (protein-centric) and predicting which proteins are associated with a given function (term-centric) are related but different computational problems: the former is a multi-label classification problem with a structured output, while the latter is a binary classification task. Predicting the results of a genome-wide screen for a single or a small number of functions fits the term-centric formulation. To see how well all participating CAFA methods perform term-centric predictions, we mapped results from the protein-centric CAFA3 methods onto these terms. In addition, we held a separate CAFA challenge, CAFA-π, whose purpose was to attract additional submissions from algorithms that specialize in term-centric tasks. We performed screens for three functions in three species, which we then used to assess protein function prediction. In the bacterium Pseudomonas aeruginosa and the fungus Candida albicans we performed genome-wide screens capable of uncovering genes with two functions, biofilm formation (GO:0042710) and motility (for P. aeruginosa only) (GO:0001539), as described in Methods. In Drosophila melanogaster we performed targeted assays, guided by previous CAFA submissions, of a ...
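The protein-centric versus term-centric distinction in the excerpt above can be illustrated with a minimal Python sketch. Everything here is hypothetical: the protein names, the scores, the 0.5 threshold, and the choice of a rank-based AUC as the term-centric metric are assumptions made for illustration, not the CAFA3 assessment code. The sketch also ignores the structured (ontology) aspect of protein-centric output.

```python
# Hypothetical prediction scores: one row per protein, one column per GO term.
scores = {
    "protA": {"GO:0042710": 0.9, "GO:0001539": 0.2},
    "protB": {"GO:0042710": 0.4, "GO:0001539": 0.7},
    "protC": {"GO:0042710": 0.1, "GO:0001539": 0.3},
}


def protein_centric(scores, protein, threshold=0.5):
    """Multi-label view: the set of terms predicted for one protein."""
    return {term for term, s in scores[protein].items() if s >= threshold}


def term_centric(scores, term):
    """Binary view: (protein, score) pairs for one term, best score first."""
    pairs = [(protein, row.get(term, 0.0)) for protein, row in scores.items()]
    return sorted(pairs, key=lambda pair: -pair[1])


def auc(ranked, positives):
    """Fraction of (positive, negative) pairs ordered correctly; ties count half."""
    pos = [s for protein, s in ranked if protein in positives]
    neg = [s for protein, s in ranked if protein not in positives]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative protein")
    wins = sum(1.0 if a > b else 0.5 if a == b else 0.0 for a in pos for b in neg)
    return wins / (len(pos) * len(neg))


# Protein-centric question: which terms does protA receive?
terms_for_a = protein_centric(scores, "protA")

# Term-centric question: which proteins are ranked highest for biofilm formation?
biofilm_ranking = term_centric(scores, "GO:0042710")

# Score the ranking against a hypothetical screen result (protA and protB positive).
biofilm_auc = auc(biofilm_ranking, {"protA", "protB"})
```

Mapping protein-centric output onto a term, as the excerpt describes, corresponds here to reading a single column of the score table across all proteins and treating it as a binary ranking.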
Pseudomonas aeruginosa is the most prevalent opportunistic pathogen in the airways of cystic fibrosis (CF) patients. The pulmonary disorder is characterized by recurrent microbial infections and an exaggerated host inflammatory immune response led primarily by an influx of neutrophils. Under these conditions, chronic colonization with P. aeruginosa is associated with diminished pulmonary function and increased morbidity and mortality. P. aeruginosa has a wide array of genetic mechanisms that facilitate its persistent colonization of the airway despite extensive innate host immune responses. Loss-of-function mutations in the quorum sensing regulatory gene lasR have been shown to confer a survival advantage and a more pathogenic character to P. aeruginosa in CF patients. However, the strategies used by LasR-deficient P. aeruginosa to modulate neutrophil-mediated bactericidal functions are unknown. We sought to understand the role of LasR in P. aeruginosa-mediated neutrophil extracellular trap (NET) formation, an important anti-microbial mechanism deployed by neutrophils, the first-line responders in the infected airway. We observe mechanistic and phenotypic differences between NETs triggered by LasR-sufficient and LasR-deficient P. aeruginosa strains. We find that LasR-deficient P. aeruginosa strains fail to induce robust NET formation in both human and murine neutrophils, independently of bacterial motility or LPS expression. LasR does not mediate NET release via downstream quorum sensing signaling pathways but rather via transcriptional regulation of virulence factors, including, but not restricted to, LasB elastase and LasA protease. Finally, our studies uncover the differential requirements for NADPH oxidase in NET formation triggered by different P. aeruginosa strains.