BackgroundNext generation sequencing provides a powerful tool to study genome structure in species whose genomes are far from being completely sequenced. In this work we describe and compare different computational approaches to evaluate the repetitive component of the genome of sunflower, by using medium/low coverage Illumina or 454 libraries.ResultsBy varying sequencing technology (Illumina or 454), coverage (0.55 x-1.25 x), assemblers and assembly procedures, six different genomic databases were produced. The annotation of these databases showed that they were composed of different proportions of repetitive DNA families. The final assembly of the sequences belonging to the six databases produced a whole genome set of 283,800 contigs. The redundancy of each contig was estimated by mapping the whole genome set with a large Illumina read set and measuring the number of matched Illumina reads. The repetitive component amounted to 81% of the sunflower genome, that is composed mainly of numerous families of Gypsy and Copia retrotransposons. Also many families of non autonomous retrotransposons and DNA transposons (especially of the Helitron superfamily) were identified.ConclusionsThe results substantially matched those previously obtained by using a Sanger-sequenced shotgun library and a standard 454 whole-genome-shotgun approach, indicating the reliability of the proposed procedures also for other species. The repetitive sequences were collected to produce a database, SUNREP, that will be useful for the annotation of the sunflower genome sequence and for studying the genome evolution in dicotyledons.
The Rosoideae is a subfamily of the Rosaceae that contains a number of species of economic importance, including the soft fruit species strawberry (Fragaria ×ananassa), red (Rubus idaeus) and black (Rubus occidentalis) raspberries, blackberries (Rubus spp.) and one of the most economically important cut flower genera, the roses (Rosa spp.). Molecular genetics and genomics resources for the Rosoideae have developed rapidly over the past two decades, beginning with the development and application of a number of molecular marker types including restriction fragment length polymorphisms, amplified fragment length polymorphisms and microsatellites, and culminating in the recent publication of the genome sequence of the woodland strawberry, Fragaria vesca, and the development of high throughput single nucleotide polymorphism (SNP)-genotyping resources for Fragaria, Rosa and Rubus. These tools have been used to identify genes and other functional elements that control traits of economic importance, to study the evolution of plant genome structure within the subfamily, and are beginning to facilitate genomic-assisted breeding through the development and deployment of markers linked to traits such as aspects of fruit quality, disease resistance and the timing of flowering. In this review, we report on the developments that have been made over the last 20 years in the field of molecular genetics and structural genomics within the Rosoideae, comment on how the knowledge gained will improve the efficiency of cultivar development and discuss how these advances will enhance our understanding of the biological processes determining agronomically important traits in all Rosoideae species.
Retrotransposons are an ubiquitous component of plant genomes, especially abundant in species with large genomes. Populus trichocarpa has a relatively small genome, which was entirely sequenced; however, studies focused on poplar retrotransposons dynamics are rare. With the aim to study the retrotransposon component of the poplar genome, we have scanned the complete genome sequence searching full-length long-terminal repeat (LTR) retrotransposons, i. e., characterised by two long terminal repeats at the 5′ and 3′ ends. A computational approach based on detection of conserved structural features, on building multiple alignments, and on similarity searches was used to identify 1,479 putative full-length LTR retrotransposons. Ty1-copia elements were more numerous than Ty3-gypsy. However, many LTR retroelements were not assigned to any superfamily because lacking of diagnostic features and non-autonomous. LTR retrotransposon remnants were by far more numerous than full-length elements, indicating that during the evolution of poplar, large amplification of these elements was followed by DNA loss. Within superfamilies, Ty3-gypsy families are made of more members than Ty1-copia ones. Retrotransposition occurred with increasing frequency following the separation of Populus sections, with different waves of retrotransposition activity between Ty3-gypsy and Ty1-copia elements. Recently inserted elements appear more frequently expressed than older ones. Finally, different levels of activity of retrotransposons were observed according to their position and their density in the linkage groups. On the whole, the results support the view of retrotransposons as a community of different organisms in the genome, whose activity (both retrotransposition and DNA loss) has heavily impacted and probably continues to impact poplar genome structure and size. © 2011 Springer-Verlag
Low temperature is a major factor limiting rice growth and yield, and seedling is one of the developmental stages at which sensitivity to chilling stress is higher. Tolerance to chilling is a complex quantitative trait, so one of the most effective approaches to identify genes and pathways involved is to compare the stress-induced expression changes between tolerant and sensitive genotypes. Phenotypic responses to chilling of 13 Japonica cultivars were evaluated, and Thaibonnet and Volano were selected as sensitive and tolerant genotypes, respectively. To thoroughly profile the short-term response of the two cultivars to chilling, RNA-Seq was performed on Thaibonnet and Volano seedlings after 0 (not stressed), 2, and 10 h at 10 °C. Differential expression analysis revealed that the ICE-DREB1/CBF pathway plays a primary role in chilling tolerance, mainly due to some important transcription factors involved (some of which had never been reported before). Moreover, the expression trends of some genes that were radically different between Thaibonnet and Volano (i.e., calcium-dependent protein kinases OsCDPK21 and OsCDPK23, cytochrome P450 monooxygenase CYP76M8, etc.) suggest their involvement in low temperature tolerance too. Density of differentially expressed genes along rice genome was determined and linked to the position of known QTLs: remarkable co-locations were reported, delivering an overview of genomic regions determinant for low temperature response at seedling stage. Our study contributes to a better understanding of the molecular mechanisms underlying rice response to chilling and provides a solid background for development of low temperature-tolerant germplasm.
Rubus idaeus L. (red raspberry), is a perennial woody plant species of the Rosaceae family that is widely cultivated in the temperate regions of world and is thus an economically important soft fruit species. It is prized for its flavour and aroma, as well as a high content of healthful compounds such as vitamins and antioxidants. Breeding programs exist globally for red raspberry, but variety development is a long and challenging process. Genomic and molecular tools for red raspberry are valuable resources for breeding. Here, a chromosome-length genome sequence assembly and related gene predictions for the red raspberry cultivar ‘Anitra’ are presented, comprising PacBio long read sequencing scaffolded using Hi-C sequence data. The assembled genome sequence totalled 291.7 Mbp, with 247.5 Mbp (84.8%) incorporated into seven sequencing scaffolds with an average length of 35.4 Mbp. A total of 39,448 protein-coding genes were predicted, 75% of which were functionally annotated. The seven chromosome scaffolds were anchored to a previously published genetic linkage map with a high degree of synteny and comparisons to genomes of closely related species within the Rosoideae revealed chromosome-scale rearrangements that have occurred over relatively short evolutionary periods. A chromosome-level genomic sequence of R. idaeus will be a valuable resource for the knowledge of its genome structure and function in red raspberry and will be a useful and important resource for researchers and plant breeders.
Improved knowledge of genome composition, especially of its repetitive component, generates important informations in both theoretical and applied research. In this study, we provide the first insight into the local organization of the sunflower genome by sequencing and annotating 349,380 bp from 3 BAC clones, each including one single-copy gene. These analyses resulted in the identification of 11 putative gene sequences, 18 full-length LTR retrotransposons, 6 incomplete LTR retrotransposons, 2 non-autonomous LTR-retroelements (LINEs), 2 putative DNA transposons fragments and one putative helitron. Among LTR-retrotransposons, non-autonomous elements (the so-called LARDs), which do not carry any protein-encoding sequence, were discovered for the first time in the sunflower. The insertion time of intact retroelements was measured, based on sister LTRs divergence. All isolated elements were inserted relatively recently, especially those belonging to the Gypsy superfamily. Retrotransposon families related to those identified in the BAC clones are present also in other species of Helianthus, both annual and perennial, and even in other Asteraceae. In one of the three BAC clones, we found five copies of a lipid transfer protein (LTP) encoding gene within less than 100,000 bp, four of which are potentially functional. Two of these are interrupted by LTR retrotransposons, in the intron and in the coding sequence, respectively. The divergence between sister LTRs of the retrotransposons inserted within the genes indicates that LTP gene duplication started earlier than 1.749 MYRS ago. On the whole, the results reported in this study confirm that the sunflower is an excellent system to study transposons dynamics and evolution.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.