2017
DOI: 10.1186/s12859-017-1738-1
|View full text |Cite
|
Sign up to set email alerts
|

Reliable Biomarker discovery from Metagenomic data via RegLRSD algorithm

Abstract: BackgroundBiomarker detection presents itself as a major means of translating biological data into clinical applications. Due to the recent advances in high throughput sequencing technologies, an increased number of metagenomics studies have suggested the dysbiosis in microbial communities as potential biomarker for certain diseases. The reproducibility of the results drawn from metagenomic data is crucial for clinical applications and to prevent incorrect biological conclusions. The variability in the sample … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 7 publications
(9 citation statements)
references
References 53 publications
0
8
0
Order By: Relevance
“…QIIME (version 1.8.0) was also used to measure beta diversity indices (unweighted UniFrac distance and weighted UniFrac distance) 27 . Line Discriminant Analysis (LDA) Effect Size (Lefse) analysis was used for screening significantly different biomarkers 28 . To study the differences in the abundance of microbial colonies between the two groups of samples, the species abundance data between groups were tested by Metastats 29 .…”
Section: Methodsmentioning
confidence: 99%
“…QIIME (version 1.8.0) was also used to measure beta diversity indices (unweighted UniFrac distance and weighted UniFrac distance) 27 . Line Discriminant Analysis (LDA) Effect Size (Lefse) analysis was used for screening significantly different biomarkers 28 . To study the differences in the abundance of microbial colonies between the two groups of samples, the species abundance data between groups were tested by Metastats 29 .…”
Section: Methodsmentioning
confidence: 99%
“…It is highly scalable and it has proved to achieve a discrete performance in reducing the false-positive detection, although as explicitly admitted by the developer, the false-negative rate is slightly higher. Other pipelines are also available for biomarker discoveries,63 however, a benchmark among them is beyond the scope of this review. Last, but not least, the complex tasks described above require high computational power and specific expertise in the field of biostatistics and informatics 64…”
Section: Selecting the Testmentioning
confidence: 99%
“…MetaAnalyst supports two kinds of analysis: (1) biomarker detection, and (2) phenotype classification. For biomarker detection, the MetaAnalyst packs 28 metagenomic biomarker discovery algorithms, namely, Shotgun-FunctionalizeR [ 19 ], Boruta [ 15 ], edgeR [ 23 ], DESeq2 [ 24 ], ENNB [ 16 ], MetagenomeSeq [ 17 ], MicrobiomeDDA [ 18 ], MetaStats [ 20 ], Raida [ 21 ], LEfSe [ 9 ], RPCA [ 10 ], RegLRSD [ 11 ] , RSPCA [ 46 ], Lasso [ 47 ], Relief [ 48 ], ReliefF [ 49 ], and the following hypothesis tests: Wilcoxon Rank Sum Test [ 50 ], t-Test [ 51 ], log t-Test [ 51 ], square t-Test [ 51 ], Welch’s Test [ 52 ], Chi-square Test [ 53 ], which are implemented using “stats” package R [ 30 ], Kolmogorov Smirnov Test [ 54 ], Levene Absolute Test [ 55 ], Levene Quadratic Test [ 55 ], Brown Forsythe Test [ 56 ], BSS/WSS (Between Sum of Squares over Within Sum of Squares) [ 57 ], and Pearson Correlation [ 58 ], which are implemented using MATLAB. Detailed description of these methods are provided in the User Manual.…”
Section: Methodsmentioning
confidence: 99%
“…Therefore, several algorithms and computational tools have been proposed for biomarker detection such as LEfSe [ 9 ], RPCA [ 10 ], RegLRSD [ 11 ], IMG/M [ 12 ], MeAtML [ 13 ], Fizzy [ 14 ], Boruta [ 15 ], ENNB [ 16 ], MetagenomeSeq [ 17 ], MicrobiomeDDA [ 18 ], ShotgunFunctionalizeR [ 19 ], MetaStats [ 20 ], Raida [ 21 ], FANTOM [ 22 ]. Due to the similarity between metagenomic data sequence-based transcriptomics, tools that were developed originally for analyzing RNA sequencing (RNA-seq) data such as edgeR [ 23 ] and DESeq2 [ 24 ] can be applied to analyzing metagenomic data.…”
Section: Introductionmentioning
confidence: 99%