BackgroundDespite the economic importance of sugarcane in sugar and bioenergy production, there is not yet a reference genome available. Most of the sugarcane transcriptomic studies have been based on Saccharum officinarum gene indices (SoGI), expressed sequence tags (ESTs) and de novo assembled transcript contigs from short-reads; hence knowledge of the sugarcane transcriptome is limited in relation to transcript length and number of transcript isoforms.ResultsThe sugarcane transcriptome was sequenced using PacBio isoform sequencing (Iso-Seq) of a pooled RNA sample derived from leaf, internode and root tissues, of different developmental stages, from 22 varieties, to explore the potential for capturing full-length transcript isoforms. A total of 107,598 unique transcript isoforms were obtained, representing about 71% of the total number of predicted sugarcane genes. The majority of this dataset (92%) matched the plant protein database, while just over 2% was novel transcripts, and over 2% was putative long non-coding RNAs. About 56% and 23% of total sequences were annotated against the gene ontology and KEGG pathway databases, respectively. Comparison with de novo contigs from Illumina RNA-Sequencing (RNA-Seq) of the internode samples from the same experiment and public databases showed that the Iso-Seq method recovered more full-length transcript isoforms, had a higher N50 and average length of largest 1,000 proteins; whereas a greater representation of the gene content and RNA diversity was captured in RNA-Seq. Only 62% of PacBio transcript isoforms matched 67% of de novo contigs, while the non-matched proportions were attributed to the inclusion of leaf/root tissues and the normalization in PacBio, and the representation of more gene content and RNA classes in the de novo assembly, respectively. About 69% of PacBio transcript isoforms and 41% of de novo contigs aligned with the sorghum genome, indicating the high conservation of orthologs in the genic regions of the two genomes.ConclusionsThe transcriptome dataset should contribute to improved sugarcane gene models and sugarcane protein predictions; and will serve as a reference database for analysis of transcript expression in sugarcane.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-017-3757-8) contains supplementary material, which is available to authorized users.
Sugarcane (Saccharum spp. hybrids) is a potential lignocellulosic feedstock for biofuel production due to its exceptional biomass accumulation ability, high convertible carbohydrate content and a favorable energy input/output ratio. Genetic modification of biofuel traits to improve biomass conversion requires an understanding of the regulation of carbohydrate and lignin biosynthesis. RNA-Seq was used to investigate the transcripts differentially expressed between the immature and mature tissues of the sugarcane genotypes varying in fiber content. Most of the differentially expressed transcripts were found to be down-regulated during stem maturation, highlighting their roles in active secondary cell-wall development in the younger tissues of both high and low fiber genotypes. Several cellulose synthase genes (including CesA2, CesA4, CesA7 and COBRA-like protein), lignin biosynthesis-related genes (ρ-coumarate 3-hydroxylase, ferulate 5-hydroxylase, cinnamyl alcohol dehydrogenase and gentiobiase) and transcription regulators for the secondary cell-wall synthesis (including LIM, MYB, PLATZ, IAA24, C2H2 and C2C2 DOF zinc finger gene families) were exclusively differentially expressed between immature and mature tissues of high fiber genotypes. These findings reveal target genes for subsequent research on the regulation of cellulose and lignin metabolism.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.