2019
DOI: 10.1101/2019.12.13.875625
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Deriving Ranges of Optimal Estimated Transcript Expression Due to Non-identifiability

Abstract: Current expression quantification methods suffer from a fundamental but under-characterized type of error: the most likely estimates for transcript abundances are not unique. Current quantification methods rely on probabilistic models, and the scenario where it admits multiple optimal solutions is called nonidentifiability. This means multiple estimates of transcript abundances generate the observed RNA-seq reads equally likely, and the underlying true expression cannot be determined. The non-identifiability p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
2
2
1

Relationship

1
4

Authors

Journals

citations
Cited by 7 publications
(9 citation statements)
references
References 47 publications
0
9
0
Order By: Relevance
“…Fifth, there are possibly multiple optimal solutions to the DTA problem that present equally likely viral transcripts with different relative abundances in the sample. A useful direction of future work is to explore the space of optimal solutions similar to the work done in [18]. Finally, the approach presented in this paper can extended to the general transcript assembly problem, where a topological ordering of the nodes in the splice graph will serve the same function as the unique Hamiltonian path of the segment graph did in the DTA problem.…”
Section: Discussionmentioning
confidence: 97%
See 3 more Smart Citations
“…Fifth, there are possibly multiple optimal solutions to the DTA problem that present equally likely viral transcripts with different relative abundances in the sample. A useful direction of future work is to explore the space of optimal solutions similar to the work done in [18]. Finally, the approach presented in this paper can extended to the general transcript assembly problem, where a topological ordering of the nodes in the splice graph will serve the same function as the unique Hamiltonian path of the segment graph did in the DTA problem.…”
Section: Discussionmentioning
confidence: 97%
“…Here, a path π is a subset of edges E that can be ordered ( v 1 , w 1 ), … , ( v | π | , w | π | ) such that w i = v i +1 for all i ∈ [| π | − 1] = {1, … , | π | − 1}. While splice graphs are DAGs and typically have a unique source and sink node as well, they do not necessarily contain a Hamiltonian path [9, 1618].…”
Section: Preliminaries and Problem Statementmentioning
confidence: 99%
See 2 more Smart Citations
“…However, the results presented in (1,2,7) and in this manuscript suggest that one important aspect of accurate quantification will be properly determining when sequenced fragments, despite exhibiting sequence similarity to annotated transcripts, are most likely to have arisen from a transcribed sequence that differs from the annotation. In addition to the improvements introduced by Srivastava et al (1), and in lieu of full transcript assembly, both transcript quantification over splicing graphs (12) and determination of optimal abundance ranges implied by non-identifiability in the underlying inference problem (13) are promising directions to pursue.…”
Section: Discussionmentioning
confidence: 99%