2018
DOI: 10.1101/387365
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Data-adaptive pipeline for filtering and normalizing metabolomics data

Abstract: IntroductionUntargeted metabolomics datasets contain large proportions of uninformative features and are affected by a variety of nuisance technical effects that can bias subsequent statistical analyses. Thus, there is a need for versatile and data-adaptive methods for filtering and normalizing data prior to investigating the underlying biological phenomena. ConclusionOur proposed data-adaptive pipeline is intuitive and effectively reduces technical noise from untargeted metabolomics datasets. It is particular… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
2
2

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(6 citation statements)
references
References 31 publications
0
6
0
Order By: Relevance
“…Second, electronic noise becomes relatively more prominent for MS/MS spectra that originated from very low-intensity precursor ions. Therefore, a simple cut-off threshold at 1% base peak intensity does not suffice for metabolomics or exposome nontargeted studies that aim at low abundant molecules 20,22 .…”
Section: National Institute Of Standards and Technologymentioning
confidence: 99%
See 1 more Smart Citation
“…Second, electronic noise becomes relatively more prominent for MS/MS spectra that originated from very low-intensity precursor ions. Therefore, a simple cut-off threshold at 1% base peak intensity does not suffice for metabolomics or exposome nontargeted studies that aim at low abundant molecules 20,22 .…”
Section: National Institute Of Standards and Technologymentioning
confidence: 99%
“…Consequently, existing denoising methods are unsuitable for large scale, standardized de-noising in metabolomics. Typical metabolomics data processing software denoises spectra by simply discarding ions below 0.5-1% of the basepeak height [20][21][22] . Surprisingly, the chemistry information revealed by the fragment peaks are not considered when determining if a given fragment is true ion or noise.…”
Section: Introductionmentioning
confidence: 99%
“…But this is a tedious process 7 ; there is also Lawson et al's msPurity R package with a function called "SubtractMZ" to perform blank removal 110 . Data-adaptive filtering methods have also been suggested to remove features from blanks and low abundant features from samples with undetected values 111 . Another popular feature filtering method is based on the Coefficient of Variance (CV).…”
Section: Box 2 -Blank Removalmentioning
confidence: 99%
“…Then, acetonitrile was added to precipitate proteins and extracts were analyzed by LC-HRMS [8]. Data processing was performed in the R statistical programming environment using methods described elsewhere [15]. Detection of sample outliers, beyond a proportional expansion value of 1.2 for Hotelling's ellipse (PC1 and PC2), resulted in removal of two cases and 10 controls.…”
Section: Metabolomic Analysismentioning
confidence: 99%
“…Feature selection was performed separately for early-and late-diagnosis of ALL. Peak areas were log transformed and normalized with the Bioconductor R package 'scone' [15,19,20], which implemented and evaluated different scaling and regression-based-normalization methods for removing unwanted variation while preserving differences in case status. The normalization scheme selected by 'scone' used DESeq scaling and accounted for the following unwanted sources of variation: NBS age, blood volume, run order, and batch.…”
Section: Feature Selectionmentioning
confidence: 99%