2020
DOI: 10.1093/nargab/lqaa103
|View full text |Cite
|
Sign up to set email alerts
|

Editorial: Compositional data analysis and related methods applied to genomics—a first special issue fromNAR Genomics and Bioinformatics

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
13
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(13 citation statements)
references
References 30 publications
0
13
0
Order By: Relevance
“…Consequently, a positive relationship is very likely for metabarcoding datasets, and, with enough randomly assembled mock communities, the estimated slope will be very near 1. Such a comparison is inappropriate because the lack of independence among the observations renders most standard statistical analyses (e.g., ordinary regression or correlation analyses) inappropriate (see Gloor et al 2017, Erb et al 2020 mordax in Fig. 1D) but some taxa are strongly over-represented (A. brama in Fig.…”
Section: The Problemmentioning
confidence: 99%
“…Consequently, a positive relationship is very likely for metabarcoding datasets, and, with enough randomly assembled mock communities, the estimated slope will be very near 1. Such a comparison is inappropriate because the lack of independence among the observations renders most standard statistical analyses (e.g., ordinary regression or correlation analyses) inappropriate (see Gloor et al 2017, Erb et al 2020 mordax in Fig. 1D) but some taxa are strongly over-represented (A. brama in Fig.…”
Section: The Problemmentioning
confidence: 99%
“…where q = (q j ) D j=1 is the vector of individual event probabilities 4 . The multinomial encodes a constraint on n that leads to a mutual dependence between the parts.…”
Section: Sequencing Data Are Relativementioning
confidence: 99%
“…[1]) uses scale-free methods on data occurring in form of percentages, and its log-ratio methodology [2] has been applied to relative counts as well. While the sample spaces [3] of both data types are certainly not the same, the underlying problematic is identical: direct comparisons across samples can have paradoxical effects due to the lack of a common scale [4]. We have recently proposed to make use of information geometry to analyse compositional data [5].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The whole data analysis was conducted in R 3.4.4. Firstly we transformed the MG, MT and MP data using the central log ratio with the function clr 76 to overcome the inherent problems of compositional data 77,78 . In order to estimate the batch effect between the train and test samples, introduced by the different experimental procedure (mainly the robotic biomolecular extraction in the test samples and the read length), we regressed every entry in the MG and MT matrices with a linear model (with the function lm) as:…”
Section: Batch Effect Correctionmentioning
confidence: 99%