2014
DOI: 10.1038/hdy.2014.85
|View full text |Cite
|
Sign up to set email alerts
|

Whole-genome re-sequencing of non-model organisms: lessons from unmapped reads

Abstract: Unmapped reads are often discarded from the analysis of whole-genome re-sequencing, but new biological information and insights can be uncovered through their analysis. In this paper, we investigate unmapped reads from the re-sequencing data of 33 pea aphid genomes from individuals specialized on different host plants. The unmapped reads for each individual were retrieved following mapping to the Acyrthosiphon pisum reference genome and its mitochondrial and symbiont genomes. These sets of unmapped reads were … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

2
41
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 42 publications
(43 citation statements)
references
References 33 publications
(33 reference statements)
2
41
0
Order By: Relevance
“…human) transcriptomics such as differential gene expression or alternative splicing analysis [9,10]. Reads that do not map onto the human genome are considered noise or contamination and therefore generally ignored [11,12] (collectively about 9% of total reads, Fig. 1).…”
Section: Data Descriptionmentioning
confidence: 99%
“…human) transcriptomics such as differential gene expression or alternative splicing analysis [9,10]. Reads that do not map onto the human genome are considered noise or contamination and therefore generally ignored [11,12] (collectively about 9% of total reads, Fig. 1).…”
Section: Data Descriptionmentioning
confidence: 99%
“…Unmapped reads would not be a problem if these reads were all redundant, consisted of low quality bases, or originated from regions of little biological relevance. However, the majority of unmapped reads are high quality, contain relevant biological information [10], and there are many genomic regions with no mapped reads (e.g., see Figure 1). Reads are unmappable because they are too short to have high homology to only a single location in the genome, reads have a high error rate, the reference and subject genomes are substantially different, and insertions and deletions cause mapping problems.…”
Section: Introductionmentioning
confidence: 99%
“…Such information can be e.g. pathogens, symbionts or sequences/genes missing in the reference genome [1][2][3][4][5].…”
mentioning
confidence: 99%