Abstract:Paulownia catalpifolia is an important, fast-growing timber species known for its high density, color and texture. However, few transcriptomic and genetic studies have been conducted in P. catalpifolia. In this study, single-molecule real-time sequencing technology was applied to obtain the full-length transcriptome of P. catalpifolia leaves treated with varying degrees of drought stress. The sequencing data were then used to search for microsatellites, or simple sequence repeats (SSRs). A total of 28.83 Gb da… Show more
“…A total number of 514 TFs were obtained and categorized into 23 families, with C3H (137) ranked the highest, followed by SNF2 (72), SET (63), CSD (63), and Others (41). Since all eukaryotic TF families were historically identified and SSR polymorphic genetic markers, also known as microsatellites, show significant species-specific differences and have been widely used for genetic map construction, functional gene mining, genetic diversity analyses, and molecular markerrelated studies (Shen et al, 2014;Feng et al, 2021). A total of 4,397 SSRs identified in A. sanguinea, exceeding the SSRs detected in T. weissflogii (3,295 SSRs) and P. tricornutum (1,390 SSRs) on numbers (Rastogi et al, 2018;Cheng et al, 2021).…”
Section: Discussionmentioning
confidence: 99%
“…SSR polymorphic genetic markers, also known as microsatellites, show significant species-specific differences and have been widely used for genetic map construction, functional gene mining, genetic diversity analyses, and molecular marker-related studies ( Shen et al, 2014 ; Feng et al, 2021 ). A total of 4,397 SSRs identified in A. sanguinea , exceeding the SSRs detected in T. weissflogii (3,295 SSRs) and P. tricornutum (1,390 SSRs) on numbers ( Rastogi et al, 2018 ; Cheng et al, 2021 ).…”
The dinoflagellate Akashiwo sanguinea is a harmful algal species and commonly observed in estuarine and coastal waters around the world. Harmful algal blooms (HABs) caused by this species lead to serious environmental impacts in the coastal waters of China since 1998 followed by huge economic losses. However, the full-length transcriptome information of A. sanguinea is still not fully explored, which hampers basic genetic and functional studies. Herein, single-molecule real-time (SMRT) sequencing technology was performed to characterize the full-length transcript in A. sanguinea. Totally, 83.03 Gb SMRT sequencing clean reads were generated, 983,960 circular consensus sequences (CCS) with average lengths of 3,061 bp were obtained, and 81.71% (804,016) of CCS were full-length non-chimeric reads (FLNC). Furthermore, 26,461 contigs were obtained after being corrected with Illumina library sequencing, with 20,037 (75.72%) successfully annotated in the five public databases. A total of 13,441 long non-coding RNA (lncRNA) transcripts, 3,137 alternative splicing (AS) events, 514 putative transcription factors (TFs) members from 23 TF families, and 4,397 simple sequence repeats (SSRs) were predicted, respectively. Our findings provided a sizable insights into gene sequence characteristics of A. sanguinea, which can be used as a reference sequence resource for A. sanguinea draft genome annotation, and will contribute to further molecular biology research on this harmful bloom algae.
“…A total number of 514 TFs were obtained and categorized into 23 families, with C3H (137) ranked the highest, followed by SNF2 (72), SET (63), CSD (63), and Others (41). Since all eukaryotic TF families were historically identified and SSR polymorphic genetic markers, also known as microsatellites, show significant species-specific differences and have been widely used for genetic map construction, functional gene mining, genetic diversity analyses, and molecular markerrelated studies (Shen et al, 2014;Feng et al, 2021). A total of 4,397 SSRs identified in A. sanguinea, exceeding the SSRs detected in T. weissflogii (3,295 SSRs) and P. tricornutum (1,390 SSRs) on numbers (Rastogi et al, 2018;Cheng et al, 2021).…”
Section: Discussionmentioning
confidence: 99%
“…SSR polymorphic genetic markers, also known as microsatellites, show significant species-specific differences and have been widely used for genetic map construction, functional gene mining, genetic diversity analyses, and molecular marker-related studies ( Shen et al, 2014 ; Feng et al, 2021 ). A total of 4,397 SSRs identified in A. sanguinea , exceeding the SSRs detected in T. weissflogii (3,295 SSRs) and P. tricornutum (1,390 SSRs) on numbers ( Rastogi et al, 2018 ; Cheng et al, 2021 ).…”
The dinoflagellate Akashiwo sanguinea is a harmful algal species and commonly observed in estuarine and coastal waters around the world. Harmful algal blooms (HABs) caused by this species lead to serious environmental impacts in the coastal waters of China since 1998 followed by huge economic losses. However, the full-length transcriptome information of A. sanguinea is still not fully explored, which hampers basic genetic and functional studies. Herein, single-molecule real-time (SMRT) sequencing technology was performed to characterize the full-length transcript in A. sanguinea. Totally, 83.03 Gb SMRT sequencing clean reads were generated, 983,960 circular consensus sequences (CCS) with average lengths of 3,061 bp were obtained, and 81.71% (804,016) of CCS were full-length non-chimeric reads (FLNC). Furthermore, 26,461 contigs were obtained after being corrected with Illumina library sequencing, with 20,037 (75.72%) successfully annotated in the five public databases. A total of 13,441 long non-coding RNA (lncRNA) transcripts, 3,137 alternative splicing (AS) events, 514 putative transcription factors (TFs) members from 23 TF families, and 4,397 simple sequence repeats (SSRs) were predicted, respectively. Our findings provided a sizable insights into gene sequence characteristics of A. sanguinea, which can be used as a reference sequence resource for A. sanguinea draft genome annotation, and will contribute to further molecular biology research on this harmful bloom algae.
“…This result was also consistent with the results found for other species, such as Persea americana Mill. 33 , Styrax japonicus 34 , and Paulownia catalpifolia 35 . The differences in the transcriptomic information and basic SSR characteristics between C. chekiangoleosa and other species might be due to species specificity, sequencing technology, reference genome quality and other factors.…”
Section: Discussionmentioning
confidence: 99%
“…This result was also consistent with the results found for other species, such as Persea americana Mill. 33 , Styrax japonicus 34 , and Paulownia catalpifolia 35 . The differences in the transcriptomic and tetranucleotide repeat types 18 .…”
Camellia chekiangoleosa is a popular variety of Oil-camellia that has high oil production and ornamental value. Microsatellite (SSR) markers are the preferred tool for the molecular marker-assisted breeding of C. chekiangoleosa. By focusing on the problems of the low development efficiency of polymorphic SSR markers and the lack of available functional markers in Oil-camellia, we identified 97,510 SSR loci based on the full-length transcriptome sequence of C. chekiangoleosa. An analysis of SSR characteristics showed that mononucleotide (51.29%) and dinucleotide (34.36%) SSRs were the main repeat types. The main SSR distribution areas based on proportion covered were ordered as follows: 5'UTR > 3'UTR > CDS. By comparing our data with those in databases such as GO and KEGG, we obtained functional annotations of unigene sequences containing SSR sites. The data showed that the amplification efficiency of the SSR primers was 51.72%, and the development efficiency of polymorphic SSR primers was 26.72%. Experiments verified that dinucleotide and pentanucleotide SSRs located in UTR regions could produce more polymorphic markers. An investigation into the genetic diversity of several C. chekiangoleosa populations also suggested that the developed SSR markers had higher levels of polymorphism. This study will provide a reference and high-quality markers for the large-scale development of functional SSR markers and genetic research in Oil-camellia.
“…This result was also consistent with the results found for other species, such as Persea americana Mill. 33 , Styrax japonicus 34 , and Paulownia catalpifolia 35 . The differences in the transcriptomic information and basic SSR characteristics between C. chekiangoleosa and other species might be due to species speci city, sequencing technology, reference genome quality and other factors.…”
Camellia chekiangoleosa is a popular variety of oil-tea camellia and has high oil production and ornamental value. Microsatellite (SSR) markers are the preferred tool for the molecular marker-assisted breeding of C. chekiangoleosa. By focusing on the problems of the low development efficiency of polymorphic SSR markers and lack of available functional markers in oil-tea camellia, we identified 97510 SSR loci based on the full-length transcriptome sequence of C. chekiangoleosa. An analysis of microsatellite characteristics showed that mononucleotide (51.29%) and dinucleotide (34.36%) SSRs were the main repeat types. The main SSR distribution areas based on proportion covered were ordered as follows: 5'UTR > 3'UTR > CDS. By comparing our data with that in databases such as GO and KEGG, we obtained functional annotations of unigene sequences containing SSR sites. The data showed that the amplification efficiency of the SSR primers was 51.72%, and the development efficiency of polymorphic SSR primers was 26.72%. Experiments verified that dinucleotide and pentanucleotide SSRs located in UTR regions could produce more polymorphic markers. An investigation into the genetic diversity of several C. chekiangoleosa populations also suggested that the developed SSR markers had higher levels of polymorphism. This study will provide a reference and high-quality markers for the large-scale development of functional SSR markers and genetic research in oil-tea camellia.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.