The higher plant Arabidopsis thaliana (Arabidopsis) is an important model for identifying plant genes and determining their function. To assist biological investigations and to define chromosome structure, a coordinated effort to sequence the Arabidopsis genome was initiated in late 1996. Here we report one of the first milestones of this project, the sequence of chromosome 4. Analysis of 17.38 megabases of unique sequence, representing about 17% of the genome, reveals 3,744 protein coding genes, 81 transfer RNAs and numerous repeat elements. Heterochromatic regions surrounding the putative centromere, which has not yet been completely sequenced, are characterized by an increased frequency of a variety of repeats, new repeats, reduced recombination, lowered gene density and lowered gene expression. Roughly 60% of the predicted protein-coding genes have been functionally characterized on the basis of their homology to known genes. Many genes encode predicted proteins that are homologous to human and Caenorhabditis elegans proteins.
SummaryThe cultivated Brassica species are the group of crops most closely related to Arabidopsis thaliana (Arabidopsis). They represent models for the application in crops of genomic information gained in Arabidopsis and provide an opportunity for the investigation of polyploid genome formation and evolution. The scientific literature contains contradictory evidence for the dynamics of the evolution of polyploid genomes. We aimed at overcoming the inherent complexity of Brassica genomes and clarify the effects of polyploidy on the evolution of genome microstructure in specific segments of the genome. To do this, we have constructed bacterial artificial chromosome (BAC) libraries from genomic DNA of B. rapa subspecies trilocularis (JBr) and B. napus var Tapidor (JBnB) to supplement an existing BAC library from B. oleracea. These allowed us to analyse both recent polyploidization (under 10 000 years in B. napus) and more ancient polyploidization events (ca. 20 Myr for B. rapa and B. oleracea relative to Arabidopsis), with an analysis of the events occurring on an intermediate time scale (over the ca. 4 Myr since the divergence of the B. rapa and B. oleracea lineages). Using the Arabidopsis genome sequence and clones from the JBr library, we have analysed aspects of gene conservation and microsynteny between six regions of the genome of B. rapa with the homoeologous regions of the genomes of B. oleracea and Arabidopsis. Extensive divergence of gene content was observed between the B. rapa paralogous segments and their homoeologous segments within the genome of Arabidopsis. A pattern of interspersed gene loss was identified that is similar, but not identical, to that observed in B. oleracea. The conserved genes show highly conserved collinearity with their orthologues across genomes, but a small number of species-specific rearrangements were identified. Thus the evolution of genome microstructure is an ongoing process. Brassica napus is a recently formed polyploid resulting from the hybridization of B. rapa (containing the Brassica A genome) and B. oleracea (containing the Brassica C genome). Using clones from the JBnB library, we have analysed the microstructure of the corresponding segments of the B. napus genome. The results show that there has been little or no change to the microstructure of the analysed segments of the Brassica A and C genomes as a consequence of the hybridization event forming natural B. napus. The observations indicate that, upon polyploid formation, these segments of the genome did not undergo a burst of evolution discernible at the scale of microstructure.
The plant Arabidopsis thaliana (Arabidopsis) has become an important model species for the study of many aspects of plant biology. The relatively small size of the nuclear genome and the availability of extensive physical maps of the five chromosomes provide a feasible basis for initiating sequencing of the five chromosomes. The YAC (yeast artificial chromosome)-based physical map of chromosome 4 was used to construct a sequence-ready map of cosmid and BAC (bacterial artificial chromosome) clones covering a 1.9-megabase (Mb) contiguous region, and the sequence of this region is reported here. Analysis of the sequence revealed an average gene density of one gene every 4.8 kilobases (kb), and 54% of the predicted genes had significant similarity to known genes. Other interesting features were found, such as the sequence of a disease-resistance gene locus, the distribution of retroelements, the frequent occurrence of clustered gene families, and the sequence of several classes of genes not previously encountered in plants.
Using contiguous genomic DNA sequences of Arabidopsis thaliana, we were able to identify a region of conserved structure in the genome of rice. The conserved, and presumptive homoeologous segments, are 194 kb and 219-300 kb in size in Arabidopsis and rice, respectively. They contain five homologous genes, distinguished in order by a single inversion. These represent the first homoeologous segments identified in the genomes of a dicot and a monocot, demonstrating that fine-scale conservation of genome structure exists and is detectable across this major divide in the angiosperms. The conserved framework of genes identified is interspersed with non-conserved genes, indicating that mechanisms beyond segmental inversions and translocations need to be invoked to fully explain plant genome evolution, and that the benefits of comparative genomics over such large taxonomic distances may be limited.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.