TGICL is a pipeline for analysis of large Expressed Sequence Tags (EST) and mRNA databases in which the sequences are first clustered based on pairwise sequence similarity, and then assembled by individual clusters (optionally with quality values) to produce longer, more complete consensus sequences. The system can run on multi-CPU architectures including SMP and PVM.
Legumes (Fabaceae or Leguminosae) are unique among cultivated plants for their ability to carry out endosymbiotic nitrogen fixation with rhizobial bacteria, a process that takes place in a specialized structure known as the nodule. Legumes belong to one of the two main groups of eurosids, the Fabidae, which includes most species capable of endosymbiotic nitrogen fixation 1. Legumes comprise several evolutionary lineages derived from a common ancestor 60 million years ago (Mya). Papilionoids are the largest clade, dating nearly to the origin of legumes and containing most cultivated species 2. Medicago truncatula (Mt) is a long-established model for the study of legume biology. Here we describe the draft sequence of the Mt euchromatin based on a recently completed BAC-assembly supplemented with Illumina-shotgun sequence, together capturing ~94% of all Mt genes. A whole-genome duplication (WGD) approximately 58 Mya played a major role in shaping the Mt genome and thereby contributed to the evolution of endosymbiotic nitrogen fixation. Subsequent to the WGD, the Mt genome experienced higher levels of rearrangement than two other sequenced legumes, Glycine max (Gm) and Lotus japonicus (Lj). Mt is a close relative of alfalfa (M. sativa), a widely cultivated crop with limited genomics tools and complex autotetraploid genetics. As such, the Mt genome sequence provides significant opportunities to expand alfalfa’s genomic toolbox.
SummaryIn natural ecosystems, the roots of many plants exist in association with arbuscular mycorrhizal (AM) fungi, and the resulting symbiosis has profound effects on the plant. The most frequently documented response is an increase in phosphorus nutrition; however, other effects have been noted, including increased resistance to abiotic and biotic stresses. Here we used a 16 000-feature oligonucleotide array and real-time quantitative RT-PCR to explore transcriptional changes triggered in Medicago truncatula roots and shoots as a result of AM symbiosis. By controlling the experimental conditions, phosphorus-related effects were minimized, and both local and systemic transcriptional responses to the AM fungus were revealed. The transcriptional response of the roots and shoots differed in both the magnitude of gene induction and the predicted functional categories of the mycorrhiza-regulated genes. In the roots, genes regulated in response to three different AM fungi were identified, and, through split-root experiments, an additional layer of regulation, in the colonized or noncolonized sections of the mycorrhizal root system, was uncovered. Transcript profiles of the shoots of mycorrhizal plants indicated the systemic induction of many genes predicted to be involved in stress or defense responses, and suggested that mycorrhizal plants might display enhanced disease resistance. Experimental evidence supports this prediction, and mycorrhizal M. truncatula plants showed increased resistance to a virulent bacterial pathogen, Xanthomonas campestris. Thus, the symbiosis is accompanied by a complex pattern of local and systemic changes in gene expression, including the induction of a functional defense response.
We sequenced 2.2 Mb representing triplicated genome segments of Brassica oleracea, which are each paralogous with one another and homologous with a segmentally duplicated region of the Arabidopsis thaliana genome. Sequence annotation identified 177 conserved collinear genes in the B. oleracea genome segments. Analysis of synonymous base substitution rates indicated that the triplicated Brassica genome segments diverged from a common ancestor soon after divergence of the Arabidopsis and Brassica lineages. This conclusion was corroborated by phylogenetic analysis of protein families. Using A. thaliana as an outgroup, 35% of the genes inferred to be present when genome triplication occurred in the Brassica lineage have been lost, most likely via a deletion mechanism, in an interspersed pattern. Genes encoding proteins involved in signal transduction or transcription were not found to be significantly more extensively retained than those encoding proteins classified with other functions, but putative proteins predicted in the A. thaliana genome were underrepresented in B. oleracea. We identified one example of gene loss from the Arabidopsis lineage. We found evidence for the frequent insertion of gene fragments of nuclear genomic origin and identified four apparently intact genes in noncollinear positions in the B. oleracea and A. thaliana genomes.
Genome sequencing of the model legumes, Medicago truncatula and Lotus japonicus , provides an opportunity for large-scale sequence-based comparison of two genomes in the same plant family. Here we report synteny comparisons between these species, including details about chromosome relationships, large-scale synteny blocks, microsynteny within blocks, and genome regions lacking clear correspondence. The Lotus and Medicago genomes share a minimum of 10 large-scale synteny blocks, each with substantial collinearity and frequently extending the length of whole chromosome arms. The proportion of genes syntenic and collinear within each synteny block is relatively homogeneous. Medicago–Lotus comparisons also indicate similar and largely homogeneous gene densities, although gene-containing regions in Mt occupy 20–30% more space than Lj counterparts, primarily because of larger numbers of Mt retrotransposons. Because the interpretation of genome comparisons is complicated by large-scale genome duplications, we describe synteny, synonymous substitutions and phylogenetic analyses to identify and date a probable whole-genome duplication event. There is no direct evidence for any recent large-scale genome duplication in either Medicago or Lotus but instead a duplication predating speciation. Phylogenetic comparisons place this duplication within the Rosid I clade, clearly after the split between legumes and Salicaceae (poplar).
Phosphorus (P) is an essential element for plant growth. Crop production of common bean (Phaseolus vulgaris), the most important legume for human consumption, is often limited by low P in the soil. Functional genomics were used to investigate global gene expression and metabolic responses of bean plants grown under P-deficient and P-sufficient conditions. P-deficient plants showed enhanced root to shoot ratio accompanied by reduced leaf area and net photosynthesis rates. Transcript profiling was performed through hybridization of nylon filter arrays spotted with cDNAs of 2,212 unigenes from a P deficiency root cDNA library. A total of 126 genes, representing different functional categories, showed significant differential expression in response to P: 62% of these were induced in P-deficient roots. A set of 372 bean transcription factor (TF) genes, coding for proteins with Inter-Pro domains characteristic or diagnostic for TF, were identified from The Institute of Genomic Research/Dana Farber Cancer Institute Common Bean Gene Index. Using real-time reverse transcription-polymerase chain reaction analysis, 17 TF genes were differentially expressed in P-deficient roots; four TF genes, including MYB TFs, were induced. Nonbiased metabolite profiling was used to assess the degree to which changes in gene expression in P-deficient roots affect overall metabolism. Stress-related metabolites such as polyols accumulated in P-deficient roots as well as sugars, which are known to be essential for P stress gene induction. Candidate genes have been identified that may contribute to root adaptation to P deficiency and be useful for improvement of common bean.
Approximately 80% of the maize genome comprises highly repetitive sequences interspersed with single-copy, gene-rich sequences, and standard genome sequencing strategies are not readily adaptable to this type of genome. Methodologies that enrich for genic sequences might more rapidly generate useful results from complex genomes. Equivalent numbers of clones from maize selected by techniques called methylation filtering and High C0t selection were sequenced to generate approximately 200,000 reads (approximately 132 megabases), which were assembled into contigs. Combination of the two techniques resulted in a sixfold reduction in the effective genome size and a fourfold increase in the gene identification rate in comparison to a nonenriched library.
We have developed a rice (Oryza sativa) genome annotation database (Osa1) that provides structural and functional annotation for this emerging model species. Using the sequence of O. sativa subsp. japonica cv Nipponbare from the International Rice Genome Sequencing Project, pseudomolecules, or virtual contigs, of the 12 rice chromosomes were constructed. Our most recent release, version 3, represents our third build of the pseudomolecules and is composed of 98% finished sequence. Genes were identified using a series of computational methods developed for Arabidopsis (Arabidopsis thaliana) that were modified for use with the rice genome. In release 3 of our annotation, we identified 57,915 genes, of which 14,196 are related to transposable elements. Of these 43,719 nontransposable element-related genes, 18,545 (42.4%) were annotated with a putative function, 5,777 (13.2%) were annotated as encoding an expressed protein with no known function, and the remaining 19,397 (44.4%) were annotated as encoding a hypothetical protein. Multiple splice forms (5,873) were detected for 2,538 genes, resulting in a total of 61,250 gene models in the rice genome. We incorporated experimental evidence into 18,252 gene models to improve the quality of the structural annotation. A series of functional data types has been annotated for the rice genome that includes alignment with genetic markers, assignment of gene ontologies, identification of flanking sequence tags, alignment with homologs from related species, and syntenic mapping with other cereal species. All structural and functional annotation data are available through interactive search and display windows as well as through download of flat files. To integrate the data with other genome projects, the annotation data are available through a Distributed Annotation System and a Genome Browser. All data can be obtained through the project Web pages at http://rice.tigr.org.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.