2013
DOI: 10.1186/gb-2013-14-6-r66

Separating homeologs by phasing in the tetraploid wheat transcriptome

Abstract: Background: The high level of identity among duplicated homoeologous genomes in tetraploid pasta wheat presents substantial challenges for de novo transcriptome assembly. To solve this problem, we develop a specialized bioinformatics workflow that optimizes transcriptome assembly and separation of merged homoeologs. To evaluate our strategy, we sequence and assemble the transcriptome of one of the diploid ancestors of pasta wheat, and compare both assemblies with a benchmark set of 13,472 full-length, non-redund…

Cited by 132 publications (143 citation statements)
References 58 publications (99 reference statements)
“…Analysis of a manually curated set of 52 tetraploid wheat homeologues showed that they share 97.3% ± 1.2% DNA sequence identity and that the distance between adjacent variants decreases exponentially, with an average separation of approximately 38 bp. This determines that 8% of single-nucleotide polymorphisms (SNPs) between A and B genome homeologous are over 100 bp apart (Krasileva et al, 2013). This would prevent reads containing these widely spaced SNPs from being unambiguously mapped to one homeologue, explaining why we observe a residual level of expression from the deleted chromosome in the nullitetrasomic lines.…”
Section: Accurate Read Mapping Enables Homeologue Specificity (mentioning)
confidence: 98%
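The two figures in the statement above (a mean spacing of about 38 bp and 8% of SNPs more than 100 bp apart) can be checked against each other with a short calculation. The sketch below assumes the spacing between adjacent variants follows a pure exponential distribution, which is a simplification made here for illustration and not necessarily the model fitted in the cited paper; the function and constant names are hypothetical.

```python
import math

# Values quoted in the citation statement above.
MEAN_SPACING_BP = 38.0   # average separation between adjacent variants
THRESHOLD_BP = 100.0     # spacing beyond which variants are "widely spaced"


def fraction_spaced_beyond(distance_bp: float, mean_bp: float = MEAN_SPACING_BP) -> float:
    """Return P(spacing > distance_bp) for an exponential distribution.

    Assumption: spacings are exponentially distributed with mean `mean_bp`,
    so the survival function is exp(-distance / mean).
    """
    if distance_bp < 0 or mean_bp <= 0:
        raise ValueError("distance must be >= 0 and mean must be > 0")
    return math.exp(-distance_bp / mean_bp)


if __name__ == "__main__":
    frac = fraction_spaced_beyond(THRESHOLD_BP)
    # Prints the modelled share of adjacent variants more than 100 bp apart.
    print(f"Modelled fraction of spacings over {THRESHOLD_BP:.0f} bp: {frac:.1%}")
```

Under this assumption the modelled fraction comes out at roughly 7%, the same order as the 8% quoted, so the two numbers are at least mutually plausible.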
“…To assess whether kallisto could correctly assign reads to the relevant homeologue, we used a unique genetic resource available in wheat: nullitetrasomic lines (Sears, 1954). Normal bread wheat contains three copies of most genes, one on each of the A, B, and D homeologous chromosomes, and these genes share over 95% identity in coding sequences (Krasileva et al, 2013). In nullitetrasomic lines, one chromosome is specifically deleted (nulli) and compensated by an additional copy of a homeologous chromosome (tetra).…”
Section: Accurate Read Mapping Enables Homeologue Specificity (mentioning)
confidence: 99%
“…For example, the length of the Drosophila transcriptome is not much smaller than that of mammals (Nfonsam et al, 2012). At the same time, in cereals the average transcriptome length is two to three times greater than in the mouse (Krasileva et al, 2013), so for these plant species it is advisable to set the sequencing depth to at least 4-5 × 10⁹ bp.…”
Section: transcriptome sequencing depth, transcriptome length… (unclassified)
“…In the bulks, the individual reads align across the myriad of sources (Pontius et al 2002). More recently, we have shifted to a phased transcriptome which has gene models separated by the corresponding genome (Krasileva et al 2013).…”
Section: Wheat Genomics (mentioning)
confidence: 99%
“…We sequence the parental genotypes (Avocet S and Avocet S + Yr15, Fig. 22.1a, b) and generate a consensus reference by aligning the reads to the UniGenes and gene models described above (Krasileva et al 2013). Although genome-specific references are used when possible, there are still cases where multiple homoeologues will align to a common reference (as illustrated in Fig.…”
Section: SNP Selection and Marker Design (mentioning)
confidence: 99%