2021
DOI: 10.1101/2021.07.01.450581
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Mining all publicly available expression data to compute dynamic microbial transcriptional regulatory networks

Abstract: We are firmly in the era of biological big data. Millions of omics datasets are publicly accessible and can be employed to support scientific research or build a holistic view of an organism. Here, we introduce a workflow that converts all public gene expression data for a microbe into a dynamic representation of the organism's transcriptional regulatory network. This five-step process walks researchers through the mining, processing, curation, analysis, and characterization of all available expression data, u… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
87
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

3
4

Authors

Journals

citations
Cited by 43 publications
(102 citation statements)
references
References 75 publications
(74 reference statements)
0
87
0
Order By: Relevance
“…After filtering the profiles based on quality criteria (see Methods), we compiled a transcriptomic compendium containing 364 samples (83 new + 281 public expression profiles) ( Figure 1d and Supplementary Figure S1c ). All the samples were shown to have Pearson’s correlation coefficient (PCC) of 0.97 between replicates 4 . To eliminate batch effects, each individual experiment was normalized to a reference condition prior to calculating the iModulons 4 .…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…After filtering the profiles based on quality criteria (see Methods), we compiled a transcriptomic compendium containing 364 samples (83 new + 281 public expression profiles) ( Figure 1d and Supplementary Figure S1c ). All the samples were shown to have Pearson’s correlation coefficient (PCC) of 0.97 between replicates 4 . To eliminate batch effects, each individual experiment was normalized to a reference condition prior to calculating the iModulons 4 .…”
Section: Resultsmentioning
confidence: 99%
“…All the samples were shown to have Pearson’s correlation coefficient (PCC) of 0.97 between replicates 4 . To eliminate batch effects, each individual experiment was normalized to a reference condition prior to calculating the iModulons 4 .…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…The final high-quality S. acidocaldarius compendium contained 95 RNA-seq datasets (Figure 1B, 1C). As part of the quality control procedure previously described (Sastry et al, 2021b), we performed manual curation of experimental metadata to identify which samples were biological replicates. We also examined the literature to identify each sample's strain, media, additional treatments, environmental parameters/changes, and growth stage, if reported.…”
Section: Methodsmentioning
confidence: 99%