2017
DOI: 10.1093/bioinformatics/btx668
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of tools for long read RNA-seq splice-aware alignment

Abstract: Supplementary data are available at Bioinformatics online.

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
62
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
3
3
3

Relationship

0
9

Authors

Journals

citations
Cited by 67 publications
(64 citation statements)
references
References 20 publications
(19 reference statements)
0
62
0
Order By: Relevance
“…Conventional "RNA-seq" with short reads, which is rather inappropriately termed so since the RNA molecules are not directly sequenced (Hrdlickova, Toloue, & Tian, 2017), requires reverse transcription (RT) of the template RNA molecule into complementary DNA (cDNA), which is typically further amplified by PCR. The gene annotation process is simplified by the direct RNA sequencing, and it has therefore allowed the identification of more complex or novel transcript isoforms genome-wide (Byrne et al, 2017;Krizanovic, Echchiki, Roux, & Sikic, 2018), and the ability to differentiate transcript haplotypes as well as to identify 3′ poly (A) tail lengths (Workman et al, 2018). The gene annotation process is simplified by the direct RNA sequencing, and it has therefore allowed the identification of more complex or novel transcript isoforms genome-wide (Byrne et al, 2017;Krizanovic, Echchiki, Roux, & Sikic, 2018), and the ability to differentiate transcript haplotypes as well as to identify 3′ poly (A) tail lengths (Workman et al, 2018).…”
Section: Tr Anscrip Tome Analys Is and D Irec T Rna S Equen Cingmentioning
confidence: 99%
See 1 more Smart Citation
“…Conventional "RNA-seq" with short reads, which is rather inappropriately termed so since the RNA molecules are not directly sequenced (Hrdlickova, Toloue, & Tian, 2017), requires reverse transcription (RT) of the template RNA molecule into complementary DNA (cDNA), which is typically further amplified by PCR. The gene annotation process is simplified by the direct RNA sequencing, and it has therefore allowed the identification of more complex or novel transcript isoforms genome-wide (Byrne et al, 2017;Krizanovic, Echchiki, Roux, & Sikic, 2018), and the ability to differentiate transcript haplotypes as well as to identify 3′ poly (A) tail lengths (Workman et al, 2018). The gene annotation process is simplified by the direct RNA sequencing, and it has therefore allowed the identification of more complex or novel transcript isoforms genome-wide (Byrne et al, 2017;Krizanovic, Echchiki, Roux, & Sikic, 2018), and the ability to differentiate transcript haplotypes as well as to identify 3′ poly (A) tail lengths (Workman et al, 2018).…”
Section: Tr Anscrip Tome Analys Is and D Irec T Rna S Equen Cingmentioning
confidence: 99%
“…Direct RNA sequencing (again, it is unfortunate that true RNA sequencing has to add the word "direct" to differentiate itself from conventional "RNA-Seq"), on the other hand, directly detects the ribonucleobases passing through the pore without RT or PCR amplification, and this approach is free of possible biases or misamplifications introduced during such steps. The gene annotation process is simplified by the direct RNA sequencing, and it has therefore allowed the identification of more complex or novel transcript isoforms genome-wide (Byrne et al, 2017;Krizanovic, Echchiki, Roux, & Sikic, 2018), and the ability to differentiate transcript haplotypes as well as to identify 3′ poly (A) tail lengths (Workman et al, 2018). Such an application is also practical for the study of RNA viral genomes, because the genome has multiple reading frames, anti-sense gene locations, inefficient termination signals, and complex splice forms, and gene annotation is challenging using the conventional RNA-seq method (Depledge et al, 2018).…”
Section: Tr Anscrip Tome Analys Is and D Irec T Rna S Equen Cingmentioning
confidence: 99%
“…Then high (30X) and low (4X) coverage simulations were implemented for the highly and the lowly expressed isoforms, respectively, and the simulated reads were mixed to build the dataset. To benchmark the aligners on the datasets from various species and in various coverages more comprehensively, we referred to a previous study [30] to randomly select three sets of genes respectively from the three species to simulate 9 datasets. Each of the datasets was built with one of the gene-sets and a fixed coverage (4X, 10X or 30X).…”
Section: Results On Simulated Datasetsmentioning
confidence: 99%
“…Circular Consensus Sequence (CCS) reads are a type of ROI and are generated by collapsing multiple subreads when ≥ 2 full-pass subreads are present. ONT produces longer reads with even higher error rates (error rates for "1D" raw reads: > 25%; error rates for "2D" consensus reads: 12-20%) (Križanović et al, 2018). Error-correction methods using short reads (such as the error correction tool LSC (Au et al, 2012)) have been created to correct the high rate of errors in long reads; however, error correction may create artifacts so that the corrected long reads may no longer be true single-molecule reads (Sharon et al, 2013).…”
Section: Introductionmentioning
confidence: 99%