2016
DOI: 10.12688/f1000research.8900.2
|View full text |Cite
|
Sign up to set email alerts
|

DRIMSeq: a Dirichlet-multinomial framework for multivariate count outcomes in genomics

Abstract: There are many instances in genomics data analyses where measurements are made on a multivariate response. For example, alternative splicing can lead to multiple expressed isoforms from the same primary transcript. There are situations where differences (e.g. between normal and disease state) in the relative ratio of expressed isoforms may have significant phenotypic consequences or lead to prognostic capabilities. Similarly, knowledge of single nucleotide polymorphisms (SNPs) that affect splicing, so-called s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4

Citation Types

1
130
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
3
3
3

Relationship

1
8

Authors

Journals

citations
Cited by 156 publications
(144 citation statements)
references
References 61 publications
1
130
0
Order By: Relevance
“…For instance, Fordyce, Gompert, Forister, and Nice () rely on Dirichlet‐multinomial modelling (DMM) to analyze ecological count data, such as counts of behavioural and dietary choices of animals (also see Coblentz, Rosenblatt, & Novak, ). Similar models have been applied to large counts of DNA sequences—for instance, Fernandes et al (; aldex2 ), Nowicka and Robinson (; drim‐seq ), and Rosa et al (; hmp ) use DMM to estimate and compare feature‐specific relative abundances in transcriptomes and microbiomes. Additionally, DMM has been used to model mixtures of compositions, a situation that could arise in a laboratory‐derived microbial assemblage occurring as a contaminant within samples, or in mixtures of different communities in nature ( microbedmm , Holmes, Harris, & Quince, ; sourcetracker , Knights et al, ; biomico , Shafiei et al, ; feast , Shenhav et al, ; ecostructure , White, Dey, Mohan, Stephens, & Price, ).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…For instance, Fordyce, Gompert, Forister, and Nice () rely on Dirichlet‐multinomial modelling (DMM) to analyze ecological count data, such as counts of behavioural and dietary choices of animals (also see Coblentz, Rosenblatt, & Novak, ). Similar models have been applied to large counts of DNA sequences—for instance, Fernandes et al (; aldex2 ), Nowicka and Robinson (; drim‐seq ), and Rosa et al (; hmp ) use DMM to estimate and compare feature‐specific relative abundances in transcriptomes and microbiomes. Additionally, DMM has been used to model mixtures of compositions, a situation that could arise in a laboratory‐derived microbial assemblage occurring as a contaminant within samples, or in mixtures of different communities in nature ( microbedmm , Holmes, Harris, & Quince, ; sourcetracker , Knights et al, ; biomico , Shafiei et al, ; feast , Shenhav et al, ; ecostructure , White, Dey, Mohan, Stephens, & Price, ).…”
Section: Introductionmentioning
confidence: 99%
“…For instance, Fordyce, Gompert, Forister, and Nice (2011) rely on Dirichlet-multinomial modelling (DMM) to analyze ecological count data, such as counts of behavioural and dietary choices of animals (also see Coblentz, Rosenblatt, & Novak, 2017). Similar models have been applied to large counts of DNA sequences-for instance, Fernandes et al (2014;aldex2), Nowicka and Robinson (2016;drim-seq), and Rosa et al (2012;hmp) Consequently, we conducted a simulation experiment to learn the limits and benefits of DMM through the analysis of data that encompass much of the variety in attributes encountered across scientific domains (e.g. replication, number of observations, and so on; Figure 2).…”
Section: Introductionmentioning
confidence: 99%
“…Analysis of RNA-seq data for most biologists is a bottleneck because of reliance on the skills of often over-stretched bioinformaticians who are needed to process large datasets and apply complex analytical programs to experimental data. Many RNA-seq differential analysis programs do not have the flexibility to handle complex experimental designs (such as time-course or developmental series data) and are error prone (Love et al, 2014;Hardcastle and Kelly, 2010;Anders et al, 2012;Nowicka and Robinson, 2016). Results can be inconsistent due to the use of multiple different combinations of tools or pipelines by different bioinformaticians.…”
Section: Introductionmentioning
confidence: 99%
“…DEXSeq, on the other hand, utilizes a Negative Binomial distribution to model the counts per exon. It was originally targeted for identifying differential exon usage, but has been also evaluated in the context of transcript usage (Soneson and others, 2016;Nowicka and Robinson, 2016;Love and others, 2018). SUPPA2 uses biological replicates to estimate differences in isoform proportions across conditions and between biological replicates.…”
Section: Introductionmentioning
confidence: 99%