2021
DOI: 10.1093/nar/gkab433
|View full text |Cite
|
Sign up to set email alerts
|

noisyR: enhancing biological signal in sequencing datasets by characterizing random technical noise

Abstract: High-throughput sequencing enables an unprecedented resolution in transcript quantification, at the cost of magnifying the impact of technical noise. The consistent reduction of random background noise to capture functionally meaningful biological signals is still challenging. Intrinsic sequencing variability introducing low-level expression variations can obscure patterns in downstream analyses. We introduce noisyR, a comprehensive noise filter to assess the variation in signal distribution and achieve an opt… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
24
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

4
3

Authors

Journals

citations
Cited by 19 publications
(26 citation statements)
references
References 62 publications
0
24
0
Order By: Relevance
“…The metadata table contains additional experimental information and represents the starting point for the DE analyses/comparisons. The pre-processing step (preprocessExpressionMatrix function) handles the countsbased noise detection using noisyR [13], which outputs a denoised, un-normalised expression matrix (Fig. S2).…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…The metadata table contains additional experimental information and represents the starting point for the DE analyses/comparisons. The pre-processing step (preprocessExpressionMatrix function) handles the countsbased noise detection using noisyR [13], which outputs a denoised, un-normalised expression matrix (Fig. S2).…”
Section: Resultsmentioning
confidence: 99%
“…Visualisation tools such as Cytoscape [11] facilitate hypothesis generation, but require external definitions of the networks. Moreover, GRN inferences are often performed on the entire dataset, generating complex, difficult-to-interpret networks [12] and lacking in robustness due to intrinsic variability [13]. GeNeCK [14] attempts, in a web-based setting, to link GRN inference and interpretation, but is decoupled from other steps such as DE analysis.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The user can provide a feature set, including frequently used options such as the most abundant genes [49, 41], or highly variable genes [50, 16], or other custom subset specified by the user [57]. Also assessed are different sizes (number of entries) of the feature set; this incremental approach is essential for determining the largest subset of genes unaffected by noise [22, 21] which can alter downstream interpretations. The stability of each feature set is summarised using boxplots; the distribution of the ECC on the reduced UMAP space provides additional insights on the localisation of unstable regions on the UMAP topography.…”
Section: Discussionmentioning
confidence: 99%
“…While it is desirable to cluster cells based on biological signal, partitions can also be driven by technical nuisance effects (sequencing depth or noise [20, 21, 22]). Beyond the interplay of technical [23] and biological signals affecting clustering outputs, the results also depend on algorithmic decisions e.g.…”
Section: Introductionmentioning
confidence: 99%