transcripts ranged from 1000 and 2000 bp, and 12.6% transcripts were longer than 2000 bp (Fig. 2b). The number of transcripts annotated with seven public databases is summarised in Fig. 2c. A total of 311,224 (73.01%) and 227,392 (53.34%) transcripts received annotations from NCBI (https://www.ncbi.nlm.nih.gov/) non-redundant protein (NR) and Swiss-Prot databases (http://www.expasy.org/sprot/), respectively. Gene ontology (http:// www.geneontology.org), protein family (http://pfam.xfam.org/) and Kyoto Encyclopedia of Genes and Genomes (http://www.genome.jp/kegg) annotation analysis indicated that 201,698 (47.31%), 200,757 (47.09%) and 127,605 (29.93%) transcripts matched these databases, respectively. The genomic similarity analysis by BLASTx showed that the most abundant transcripts were annotated as Sesamum indicum, which accounted for 59.6%, followed by Erythranthe guttata (15.7%); only 1.7%, 1.7% and 1.5% sequences had hit against in Coffea canephora, Salvia miltiorrhiz and Vitis vinifera, respectively; 19.8% of annotated sequences were similar to other plant species (Fig. 2d). A box diagram of fragments per kilobase of transcript sequence per million base pairs (FPKM) indicated that gene expression levels (FPKM > 0.3) were not evenly distributed at the three developmental stages (Supplementary Fig. S1). There was an obvious increase at the Bud I and Bud II stages compared to the flowering stage 50%.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.