Comparative analysis of the organization and expression patterns of divergent and convergent gene pairs in multiple plant genomes can identify patterns that are shared by more than one species or are unique to a particular species. Here, we study the coexpression and interspecies conservation of divergent and convergent gene pairs in three plant species: rice (Oryza sativa), Arabidopsis (Arabidopsis thaliana), and black cottonwood (Populus trichocarpa). Strongly correlated expression levels between divergent and convergent genes were found to be quite common in all three species, and the frequency of strong correlation appears to be independent of intergenic distance. Conservation of divergent or convergent arrangement among these species appears to be quite rare. However, conserved arrangement is significantly more frequent when the genes display strongly correlated expression levels or have one or more Gene Ontology (GO) classes in common. A correlation between intergenic distance in divergent and convergent gene pairs and shared GO classes was observed, in varying degrees, in rice and Populus but not in Arabidopsis. Furthermore, multiple GO classes were either overrepresented or underrepresented in Arabidopsis and Populus gene pairs, while only two GO classes were underrepresented in rice divergent gene pairs. Three cis-regulatory elements common to both Arabidopsis and rice were overrepresented in the intergenic regions of strongly correlated divergent gene pairs compared to those of noncorrelated pairs. Our results suggest that shared as well as unique mechanisms operate in shaping the organization and function of divergent and convergent gene pairs in different plant species.
Improving drought tolerance of crop plants is a major goal of plant breeders. In this study, we characterized biomass and drought-related traits of 220 Medicago truncatula HapMap accessions. Characterized traits included shoot biomass, maximum leaf size, specific leaf weight, stomatal density, trichome density and shoot carbon-13 isotope discrimination (δ(13) C) of well-watered M. truncatula plants, and leaf performance in vitro under dehydration stress. Genome-wide association analyses were carried out using the general linear model (GLM), the standard mixed linear model (MLM) and compressed MLM (CMLM) in TASSEL, which revealed significant overestimation of P-values by CMLM. For each trait, candidate genes and chromosome regions containing SNP markers were found that are in significant association with the trait. For plant biomass, a 0.5 Mbp region on chromosome 2 harbouring a plasma membrane intrinsic protein, PIP2, was discovered that could potentially be targeted to increase dry matter yield. A protein disulfide isomerase-like protein was found to be tightly associated with both shoot biomass and leaf size. A glutamate-cysteine ligase and an aldehyde dehydrogenase family protein with Arabidopsis homologs strongly expressed in the guard cells were two of the top genes identified by stomata density genome-wide association studies analysis.
BackgroundAlfalfa (Medicago sativa L.) is the primary forage legume crop species in the United States and plays essential economic and ecological roles in agricultural systems across the country. Modern alfalfa is the result of hybridization between tetraploid M. sativa ssp. sativa and M. sativa ssp. falcata. Due to its large and complex genome, there are few genomic resources available for alfalfa improvement.ResultsA de novo transcriptome assembly from two alfalfa subspecies, M. sativa ssp. sativa (B47) and M. sativa ssp. falcata (F56) was developed using Illumina RNA-seq technology. Transcripts from roots, nitrogen-fixing root nodules, leaves, flowers, elongating stem internodes, and post-elongation stem internodes were assembled into the Medicago sativa Gene Index 1.2 (MSGI 1.2) representing 112,626 unique transcript sequences. Nodule-specific and transcripts involved in cell wall biosynthesis were identified. Statistical analyses identified 20,447 transcripts differentially expressed between the two subspecies. Pair-wise comparisons of each tissue combination identified 58,932 sequences differentially expressed in B47 and 69,143 sequences differentially expressed in F56. Comparing transcript abundance in floral tissues of B47 and F56 identified expression differences in sequences involved in anthocyanin and carotenoid synthesis, which determine flower pigmentation. Single nucleotide polymorphisms (SNPs) unique to each M. sativa subspecies (110,241) were identified.ConclusionsThe Medicago sativa Gene Index 1.2 increases the expressed sequence data available for alfalfa by ninefold and can be expanded as additional experiments are performed. The MSGI 1.2 transcriptome sequences, annotations, expression profiles, and SNPs were assembled into the Alfalfa Gene Index and Expression Database (AGED) at http://plantgrn.noble.org/AGED/, a publicly available genomic resource for alfalfa improvement and legume research.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-015-1718-7) contains supplementary material, which is available to authorized users.
Genome-enabled biotechnologies have the potential to accelerate breeding efforts in long-lived perennial crop species. Despite the transformative potential of molecular tools in pecan and other outcrossing tree species, highly heterozygous genomes, significant presence–absence gene content variation, and histories of interspecific hybridization have constrained breeding efforts. To overcome these challenges, here, we present diploid genome assemblies and annotations of four outbred pecan genotypes, including a PacBio HiFi chromosome-scale assembly of both haplotypes of the ‘Pawnee’ cultivar. Comparative analysis and pan-genome integration reveal substantial and likely adaptive interspecific genomic introgressions, including an over-retained haplotype introgressed from bitternut hickory into pecan breeding pedigrees. Further, by leveraging our pan-genome presence–absence and functional annotation database among genomes and within the two outbred haplotypes of the ‘Lakota’ genome, we identify candidate genes for pest and pathogen resistance. Combined, these analyses and resources highlight significant progress towards functional and quantitative genomics in highly diverse and outbred crops.
No abstract
Retrotransposons comprise a significant fraction of the rice genome. Despite their prevalence, the effects of retrotransposon insertions are not well understood, especially with regard to how they affect the expression of genes. In this study, we identified one-sixth of rice genes as being associated with retrotransposons, with insertions either in the gene itself or within its putative promoter region. Among genes with insertions in the promoter region, the likelihood of the gene being expressed was shown to be directly proportional to the distance of the retrotransposon from the translation start site. In addition, retrotransposon insertions in the transcribed region of the gene were found to be positively correlated with the presence of alternative splicing forms. Furthermore, preferential association of retrotransposon insertions with genes in several functional classes was identified. Some of the retrotransposons that are part of full-length cDNA (fl-cDNA) contribute splice sites and give rise to novel exons. Several interesting trends concerning the effects of retrotransposon insertions on gene expression were identified. Taken together, our data suggests that retrotransposon association with genes have a role in gene regulation. The data presented in this study provides a foundation for experimental studies to determine the role of retrotransposons in gene regulation.
BackgroundPods of the vanilla orchid (Vanilla planifolia) accumulate large amounts of the flavor compound vanillin (3-methoxy, 4-hydroxy-benzaldehyde) as a glucoside during the later stages of their development. At earlier stages, the developing seeds within the pod synthesize a novel lignin polymer, catechyl (C) lignin, in their coats. Genomic resources for determining the biosynthetic routes to these compounds and other flavor components in V. planifolia are currently limited.ResultsUsing next-generation sequencing technologies, we have generated very large gene sequence datasets from vanilla pods at different times of development, and representing different tissue types, including the seeds, hairs, placental and mesocarp tissues. This developmental series was chosen as being the most informative for interrogation of pathways of vanillin and C-lignin biosynthesis in the pod and seed, respectively. The combined 454/Illumina RNA-seq platforms provide both deep sequence coverage and high quality de novo transcriptome assembly for this non-model crop species.ConclusionsThe annotated sequence data provide a foundation for understanding multiple aspects of the biochemistry and development of the vanilla bean, as exemplified by the identification of candidate genes involved in lignin biosynthesis. Our transcriptome data indicate that C-lignin formation in the seed coat involves coordinate expression of monolignol biosynthetic genes with the exception of those encoding the caffeoyl coenzyme A 3-O-methyltransferase for conversion of caffeoyl to feruloyl moieties. This database provides a general resource for further studies on this important flavor species.Electronic supplementary materialThe online version of this article (doi:10.1186/1471-2164-15-964) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.