Mining all publicly available expression data to compute dynamic microbial transcriptional regulatory networks

Sastry, Anand V.; Poudel, Saugat; Rychel, Kevin; Yoo, Reo; Cr, Lamoureux; Chauhan, Siddharth M.; Haiman, Zachary B.; T, Al Bulushi; Seif, Yara; Kim, Jaehyung

doi:10.1101/2021.07.01.450581

Cited by 43 publications

(102 citation statements)

References 75 publications

(74 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…After filtering the profiles based on quality criteria (see Methods), we compiled a transcriptomic compendium containing 364 samples (83 new + 281 public expression profiles) ( Figure 1d and Supplementary Figure S1c ). All the samples were shown to have Pearson’s correlation coefficient (PCC) of 0.97 between replicates 4 . To eliminate batch effects, each individual experiment was normalized to a reference condition prior to calculating the iModulons 4 .…”

Section: Resultsmentioning

confidence: 99%

“…All the samples were shown to have Pearson’s correlation coefficient (PCC) of 0.97 between replicates 4 . To eliminate batch effects, each individual experiment was normalized to a reference condition prior to calculating the iModulons 4 .…”

Section: Resultsmentioning

confidence: 99%

“…The RNAseq reads were processed and quality control was done. Further, the independent component analysis (ICA) was applied to generate the iModulons that were characterized to get the regulatory networks of P. aeruginosa (Adapted from Sastry et al 4 ). b) ICA calculates the independently modulated sets of genes (iModulons).…”

Section: Resultsmentioning

confidence: 99%

“…Thus, elucidation of these regulatory mechanisms would be beneficial to designing new or combinatorial therapies against P. aeruginosa infections. Today, machine learning approaches can be used to establish TRN structure in bacteria if there is sufficient transcriptomic data available for analysis 4 .…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Machine Learning of Pseudomonas aeruginosa transcriptomes identifies independently modulated sets of genes associated with known transcriptional regulators

Rajput

Tsunemoto

Sastry

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

The transcriptional regulatory network (TRN) of Pseudomonas aeruginosa plays a critical role in coordinating numerous cellular processes. We extracted and quality controlled all publicly available RNA-sequencing datasets for P. aeruginosa to find 281 high-quality transcriptomes. We produced 83 new RNAseq data sets under critical conditions to generate a comprehensive compendium of 364 transcriptomes. We used this compendium to reconstruct the TRN of P. aeruginosa using independent component analysis (ICA). We identified 104 independently modulated sets of genes (called iModulons), among which 81 (78%) reflect the effects of known transcriptional regulators. We show that iModulons: 1) play an important role in defining the genomic boundaries of biosynthetic gene clusters (BGCs); 2) show increased expression of the BGCs and associated secretion systems in conditions that emulate cystic fibrosis (CF); 3) show the presence of a novel BGC named RiPP (bacteriocin producer) which might have a role in worsening CF outcomes; 4) exhibit the interplay of amino acid metabolism regulation and central metabolism across carbon sources, and 5) clustered according to their activity changes to define iron and sulfur stimulons. Finally, we compare the iModulons of P. aeruginosa with those of E. coli to observe conserved regulons across two gram negative species. This comprehensive TRN framework covers almost every aspect of the transcriptional regulatory machinery in P. aeruginosa, and thus could prove foundational for future research of its physiological functions.

show abstract

Section: Resultsmentioning

confidence: 99%

Section: Resultsmentioning

confidence: 99%

Section: Resultsmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Machine Learning of Pseudomonas aeruginosa transcriptomes identifies independently modulated sets of genes associated with known transcriptional regulators

Rajput

Tsunemoto

Sastry

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

show abstract

“…The final high-quality S. acidocaldarius compendium contained 95 RNA-seq datasets (Figure 1B, 1C). As part of the quality control procedure previously described (Sastry et al, 2021b), we performed manual curation of experimental metadata to identify which samples were biological replicates. We also examined the literature to identify each sample's strain, media, additional treatments, environmental parameters/changes, and growth stage, if reported.…”

Section: Methodsmentioning

confidence: 99%

Machine learning uncovers a data-driven transcriptional regulatory network for the Crenarchaeal thermoacidophile Sulfolobus acidocaldarius

Chauhan

Poudel

Rychel

et al. 2021

Preprint

Self Cite

View full text Add to dashboard Cite

Dynamic cellular responses to environmental constraints are coordinated by the transcriptional regulatory network (TRN), which modulates gene expression. This network controls most fundamental cellular responses, including metabolism, motility, and stress responses. Here, we apply independent component analysis, an unsupervised machine learning approach, to 95 high-quality Sulfolobus acidocaldarius RNA-seq datasets and extract 45 independently modulated gene sets, or iModulons. Together, these iModulons contain 755 genes (32% of the genes identified on the genome) and explain over 70% of the variance in the expression compendium. We show that 5 modules represent the effects of known transcriptional regulators, and hypothesize that most of the remaining modules represent the effects of uncharacterized regulators. Further analysis of these gene sets results in: (1) the prediction of a DNA export system composed of 5 uncharacterized genes, (2) expansion of the LysM regulon, and (3) evidence for an as-yet-undiscovered global regulon. Our approach allows for a mechanistic, systems-level elucidation of an extremophile's responses to biological perturbations, which could inform research on gene-regulator interactions and facilitate regulator discovery in S. acidocaldarius. We also provide the first global TRN for S. acidocaldarius. Collectively, these results provide a roadmap towards regulatory network discovery in archaea.

show abstract

Mathematical models to study the biology of pathogens and the infectious diseases they cause

Xavier¹,

Monk²,

Poudel³

et al. 2022

iScience

Self Cite

View full text Add to dashboard Cite

Mining all publicly available expression data to compute dynamic microbial transcriptional regulatory networks

Cited by 43 publications

References 75 publications

Machine Learning of Pseudomonas aeruginosa transcriptomes identifies independently modulated sets of genes associated with known transcriptional regulators

Machine Learning of Pseudomonas aeruginosa transcriptomes identifies independently modulated sets of genes associated with known transcriptional regulators

Machine learning uncovers a data-driven transcriptional regulatory network for the Crenarchaeal thermoacidophile Sulfolobus acidocaldarius

Mathematical models to study the biology of pathogens and the infectious diseases they cause

Contact Info

Product

Resources

About