2016
DOI: 10.1101/084145
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Challenges and advances for transcriptome assembly in non-model species

Abstract: Analyses of high-throughput transcriptome sequences of non-model organisms are based on two main approaches: de novo assembly and genome-guided assembly using mapping to assign reads prior to assembly. Given the limits of mapping reads to a reference when it is highly divergent, as is frequently the case for non-model species, we evaluate whether using blastn would outperform mapping methods for read assignment in such situations (>15% divergence). We demonstrate its high performance by using simulated read… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
10
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(10 citation statements)
references
References 47 publications
(34 reference statements)
0
10
0
Order By: Relevance
“…Full-length cDNA sequencing with SMRT technology (Iso-Seq) can be used to rapidly reconstruct the grape berry transcriptome, enabling the identification of cultivar-specific isoforms, refinement of the Cabernet Sauvignon genome annotation, and the creation of a reference for transcriptome-wide expression profiling. In contrast to transcriptome reconstruction using short-read sequencing that requires de novo assembly, Iso-Seq delivers fulllength transcripts that eliminate the introduction of assembly errors and artefacts like chimeric transcripts and incomplete fragments due to PolyA capture (Chang et al 2014;Huang et al 2016;Moreton et al 2016;Smith-Unna et al 2016;Geniza and Jaiswal 2017;Ungaro et al 2017). The incorporation of high-coverage short-read sequencing is still necessary to benefit from the complete transcript sequencing enable by Iso-Seq.…”
Section: Discussionmentioning
confidence: 99%
“…Full-length cDNA sequencing with SMRT technology (Iso-Seq) can be used to rapidly reconstruct the grape berry transcriptome, enabling the identification of cultivar-specific isoforms, refinement of the Cabernet Sauvignon genome annotation, and the creation of a reference for transcriptome-wide expression profiling. In contrast to transcriptome reconstruction using short-read sequencing that requires de novo assembly, Iso-Seq delivers fulllength transcripts that eliminate the introduction of assembly errors and artefacts like chimeric transcripts and incomplete fragments due to PolyA capture (Chang et al 2014;Huang et al 2016;Moreton et al 2016;Smith-Unna et al 2016;Geniza and Jaiswal 2017;Ungaro et al 2017). The incorporation of high-coverage short-read sequencing is still necessary to benefit from the complete transcript sequencing enable by Iso-Seq.…”
Section: Discussionmentioning
confidence: 99%
“…In addition to reporting synthetic metrics related to assembly structure, TransRate reports individual metrics related to specific elements of assembly quality. One such metric estimates the rate of chimerism, a phenomenon which is known to be problematic in de novo assembly (Ungaro et al, 2017;Singhal, 2013). Rates of chimerism are relatively constant between all assemblers, ranging from 10% for the Shannon assembly, to 12% for the SPAdes75 assembly.…”
Section: Assembly Structurementioning
confidence: 99%
“…The ease with which these tools may be used to produce and characterize transcriptome assemblies belies the true complexity underlying the overall process (Ungaro et al, 2017;Wang and Gribskov, 2017;Moreton et al, 2015;Yang and Smith, 2013). Indeed, the subtle (and not so subtle) methodological challenges associated with transcriptome reconstruction may result in highly variable assembly quality.…”
Section: Introductionmentioning
confidence: 99%
“…We conclude that the read assignation performed before the assembly 229 step largely contributes to limit the production of chimeras. This shows that the use of SRC_c 230 impacts the de novo assembled transcriptome quality and contributes to address one of the most 231 delicate de novo assembly challenge [43]. The MEGAN6 contigs assignation from M2 shows more 232 contigs than SRC_c could assign to host and symbiont (Tab.…”
mentioning
confidence: 97%