BackgroundThe development of long-read sequencing technologies, such as single-molecule real-time (SMRT) sequencing by PacBio, has produced a revolution in the sequencing of small genomes. Sequencing organelle genomes using PacBio long-read data is a cost effective, straightforward approach. Nevertheless, the availability of simple-to-use software to perform the assembly from raw reads is limited at present.ResultsWe present Organelle-PBA, a Perl program designed specifically for the assembly of chloroplast and mitochondrial genomes. For chloroplast genomes, the program selects the chloroplast reads from a whole genome sequencing pool, maps the reads to a reference sequence from a closely related species, and then performs read correction and de novo assembly using Sprai. Organelle-PBA completes the assembly process with the additional step of scaffolding by SSPACE-LongRead. The program then detects the chloroplast inverted repeats and reassembles and re-orients the assembly based on the organelle origin of the reference. We have evaluated the performance of the software using PacBio reads from different species, read coverage, and reference genomes. Finally, we present the assembly of two novel chloroplast genomes from the species Picea glauca (Pinaceae) and Sinningia speciosa (Gesneriaceae).ConclusionOrganelle-PBA is an easy-to-use Perl-based software pipeline that was written specifically to assemble mitochondrial and chloroplast genomes from whole genome PacBio reads. The program is available at https://github.com/aubombarely/Organelle_PBA.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-016-3412-9) contains supplementary material, which is available to authorized users.
Cannabis sativa has a complex history reflected in both selection on naturally occurring compounds and historical trade routes among humans. Iran is a rich resource of natural populationswhich hold the promise to characterize historical patterns of population structure and genetic diversity within Cannabis. Recent advances in high-throughput DNA sequencing technologies have dramatically increased our ability to produce information to the point that it is now feasible to inexpensively obtain population level genotype information at a large scale. In the present investigation, we have explored the use of Genotyping-By-Sequencing (GBS) in Iranian cannabis. We genotyped 98 cannabis samples 36 from Iranian locations and 26 accessions from two germplasm collections. In total, 24,710 high-quality Single Nucleotide Polymorphisms (SNP) were identified. Clustering analysis by Principal Component Analysis (PCA) identified two genetic clusters among Iranian populations and fineSTRUCTURE analysis identified 19 populations with some geographic partitioning. We defined Iranian cannabis in two main groups using the results of the PCA and discovered some strong signal to define some locations as population according to fineSTRUCTURE analyses. However, single nucleotide variant analysis uncovered a relatively moderate level of variation among Iranian cannabis.
Modern crop breeding is based on the use of genetically and phenotypically diverse plant material and, consequently, a proper understanding of population structure and genetic diversity is essential for the effective development of breeding programs. An example is avocado, a woody perennial fruit crop native to Mesoamerica with an increasing popularity worldwide. Despite its commercial success, there are important gaps in the molecular tools available to support on-going avocado breeding programs. In order to fill this gap, in this study, an avocado ‘Hass’ draft assembly was developed and used as reference to study 71 avocado accessions which represent the three traditionally recognized avocado horticultural races or subspecies (Mexican, Guatemalan and West Indian). An average of 5.72 M reads per individual and a total of 7,108 single nucleotide polymorphism (SNP) markers were produced for the 71 accessions analyzed. These molecular markers were used in a study of genetic diversity and population structure. The results broadly separate the accessions studied according to their botanical race in four main groups: Mexican, Guatemalan, West Indian and an additional group of Guatemalan × Mexican hybrids. The high number of SNP markers developed in this study will be a useful genomic resource for the avocado community.
Common bean (Phaseolus vulgaris L.) is a major source of proteins and one of the most important edible foods for more than three hundred million people in the world. The common bean plants are frequently attacked by spider mite (Tetranychus urticae Koch), leading to a significant decrease in plant growth and economic performance. The use of resistant cultivars and the identification of the genes involved in plant-mite resistance are practical solutions to this problem. Hence, a comprehensive study of the molecular interactions between resistant and susceptible common bean cultivars and spider mite can shed light into the understanding of mechanisms and biological pathways of resistance. In this study, one resistant (Naz) and one susceptible (Akhtar) cultivars were selected for a transcriptome comparison at different time points (0, 1 and 5 days) after spider mite feeding. The comparison of cultivars in different time points revealed several key genes, which showed a change increase in transcript abundance via spider mite infestation. These included genes involved in flavonoid biosynthesis process; a conserved MYB-bHLH-WD40 (MBW) regulatory complex; transcription factors (TFs) TT2, TT8, TCP, Cys2/His2-type and C2H2-type zinc finger proteins; the ethylene response factors (ERFs) ERF1 and ERF9; genes related to metabolism of auxin and jasmonic acid (JA); pathogenesis-related (PR) proteins and heat shock proteins.
Among the Lamiaceae family, the genus Thymus is an economically important genera due to its medicinal and aromatic properties. Most Thymus molecular research has focused on the determining the phylogenetic relationships between different species, but no published work has focused on the evolution of the transcriptome across the genus to elucidate genes involved in terpenoid biosynthesis. Hence, in this study, the transcriptomes of five different Thymus species were generated and analyzed to mine putative genes involved in thymol and carvacrol biosynthesis. High-throughput sequencing produced ~43 million high-quality reads per sample, which were assembled de novo using several tools, then further subjected to a quality evaluation. The best assembly for each species was used as queries to search within the UniProt, KEGG (Kyoto Encyclopedia of Genes and Genomes), COG (Clusters of Orthologous Groups) and TF (Transcription Factors) databases. Mining the transcriptomes resulted in the identification of 592 single-copy orthogroups used for phylogenetic analysis. The data showed strongly support a close genetic relationship between Thymus vulgaris and Thymus daenensis. Additionally, this study dates the speciation events between 1.5–2.1 and 9–10.2 MYA according to different methodologies. Our study provides a global overview of genes related to the terpenoid pathway in Thymus, and can help establish an understanding of the relationship that exists among Thymus species.
Leonurus cardiaca is well known for its medicinal importance. In this investigation, genotypic characterization of this species from six eco-geographical regions of Iran was evaluated by four molecular techniques (AFLP, RAPD, ISSR and IRAP). A total of 899 polymorphic fragments were detected by used molecular markers (AFLP = 356, RAPD = 325, ISSR = 113 and IRAP = 105) with an overall average polymorphism of 81.24%. Genetic variation calculated using Shannon's Information index (I) and Nei's gene diversity index (H) showed high genetic diversity in studied germplasm. Also, analysis of molecular variance showed high genetic variation among (55%) and within populations (45%). UPGMA dendrogram constructed from combined data of molecular markers distinguished studied populations in accordance with the results obtained by each marker which all individuals were clearly differentiated into two major clusters. The correlation coefficients were statistically significant for all marker systems with the highest correlation between similarity matrixes of RAPD and ISSR markers (r = 0.82). The present results have an important implication for L. cardiaca germplasm characterization, improvement, and conservation. Furthermore, the characterized individuals exhibited a great deal of molecular variation and they seem to have a rich gene pool for breeding programs.
Chelidonium majus is a traditional medicinal plant, which commonly known as a rich resource for the major benzylisoquinoline alkaloids (BIAs), including morphine, sanguinarine, and berberine. To understand the biosynthesis of C . majus BIAs, we performed de novo transcriptome sequencing of its leaf and root tissues using Illumina technology. Following comprehensive evaluation of de novo transcriptome assemblies produced with five programs including Trinity, Bridger, BinPacker, IDBA-tran, and Velvet/Oases using a series of k-mer sizes (from 25 to 91), BinPacker was found to produce the best assembly using a k-mer of 25. This study reports the results of differential gene expression (DGE), functional annotation, gene ontology (GO) analysis, classification of transcription factor (TF)s, and SSR and miRNA discovery. Our DGE analysis identified 6,028 transcripts that were up-regulated in the leaf, and 4,722 transcripts that were up-regulated in the root. Further investigations showed that most of the genes involved in the BIA biosynthetic pathway are significantly expressed in the root compared to the leaf. GO analysis showed that the predominant GO domain is “cellular component”, while TF analysis found bHLH to be the most highly represented TF family. Our study further identified 10 SSRs, out of a total of 39,841, that showed linkage to five unigenes encoding enzymes in the BIA pathway, and 10 conserved miRNAs that were previously not detected in this plant. The comprehensive transcriptome information presented herein provides a foundation for further explorations on study of the molecular mechanisms of BIA synthesis in C . majus .
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.