MCScan is an algorithm able to scan multiple genomes or subgenomes in order to identify putative homologous chromosomal regions, and align these regions using genes as anchors. The MCScanX toolkit implements an adjusted MCScan algorithm for detection of synteny and collinearity that extends the original software by incorporating 14 utility programs for visualization of results and additional downstream analyses. Applications of MCScanX to several sequenced plant genomes and gene families are shown as examples. MCScanX can be used to effectively analyze chromosome structural changes, and reveal the history of gene family expansions that might contribute to the adaptation of lineages and taxa. An integrated view of various modes of gene duplication can supplement the traditional gene tree analysis in specific families. The source code and documentation of MCScanX are freely available at http://chibba.pgml.uga.edu/mcscan2/.
Oilseed rape (Brassica napus L.) was formed~7500 years ago by hybridization between B. rapa and B. oleracea, followed by chromosome doubling, a process known as allopolyploidy. Together with more ancient polyploidizations, this conferred an aggregate 72× genome multiplication since the origin of angiosperms and high gene content. We examined the B. napus genome and the consequences of its recent duplication. The constituent A n and C n subgenomes are engaged in subtle structural, functional, and epigenetic cross-talk, with abundant homeologous exchanges. Incipient gene loss and expression divergence have begun. Selection in B. napus oilseed types has accelerated the loss of glucosinolate genes, while preserving expansion of oil biosynthesis genes. These processes provide insights into allopolyploid evolution and its relationship with crop domestication and improvement.T he Brassicaceae are a large eudicot family (1) and include the model plant Arabidopsis thaliana. Brassicas have a propensity for genome duplications ( Fig. 1) and genome mergers (2). They are major contributors to the human diet and were among the earliest cultigens (3).B. napus (genome A n A n C n C n ) was formed by recent allopolyploidy between ancestors of B. oleracea (Mediterranean cabbage, genome C o C o ) and B. rapa (Asian cabbage or turnip, genome A r A r ) and is polyphyletic (2, 4), with spontaneous formation regarded by Darwin as an example of unconscious selection (5). Cultivation began in Europe during the Middle Ages and spread worldwide. Diversifying selection gave rise to oilseed rape (canola), rutabaga, fodder rape, and kale morphotypes grown for oil, fodder, and food (4, 6).The homozygous B. napus genome of European winter oilseed cultivar 'Darmor-bzh' was assembled with long-read [>700 base pairs (bp)] 454 GS-FLX+ Titanium (Roche, Basel, Switzerland) and Sanger sequence (tables S1 to S5 and figs. S1 to S3) (7). Correction and gap filling used 79 Gb of Illumina (San Diego, CA) HiSeq sequence. A final assembly of 849.7 Mb was obtained with SOAP (8) and Newbler (Roche), with 89% nongapped sequence (tables S2 and S3). Unique mapping of 5× nonassembled 454 sequences from B. rapa ('Chiifu') or B. oleracea (' TO1000') assigned most of the 20,702 B. napus scaffolds to either the A n (8294) or the C n (9984) subgenomes (tables S4 and S5 and fig. S3). The assembly covers~79% of the 1130-Mb genome and includes 95.6% of Brassica expressed sequence tags (ESTs) (7). A single-nucleotide polymorphism (SNP) map (tables S6 to S9 and figs. S4 to S8) genetically anchored 712.3 Mb (84%) of the genome assembly, yielding pseudomolecules for the 19 chromosomes (table S10).The assembled C n subgenome (525.8 Mb) is larger than the A n subgenome (314.2 Mb), consistent with the relative sizes of the assembled C o genome of B. oleracea (540 Mb, 85% of thẽ 630-Mb genome) and the A r genome of B. rapa (312 Mb, 59% of the~530-Mb genome) (9-11). The B. napus assembly contains 34.8% transposable elements (TEs), less than the 40% estimated from raw reads (table...
Sorghum, an African grass related to sugar cane and maize, is grown for food, feed, fibre and fuel. We present an initial analysis of the approximately 730-megabase Sorghum bicolor (L.) Moench genome, placing approximately 98% of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information. Genetic recombination is largely confined to about one-third of the sorghum genome with gene order and density similar to those of rice. Retrotransposon accumulation in recombinationally recalcitrant heterochromatin explains the approximately 75% larger genome size of sorghum compared with rice. Although gene and repetitive DNA distributions have been preserved since palaeopolyploidization approximately 70 million years ago, most duplicated gene sets lost one member before the sorghum-rice divergence. Concerted evolution makes one duplicated chromosomal segment appear to be only a few million years old. About 24% of genes are grass-specific and 7% are sorghum-specific. Recent gene and microRNA duplications may contribute to sorghum's drought tolerance.
The genome of the mesopolyploid crop species Brassica rapaThe Brassica rapa Genome Sequencing Project Consortium 1 Abstract:The Brassicaceae family which includes Arabidopsis thaliana, is a natural priority for reaching beyond botanical models to more deeply sample angiosperm genomic and functional diversity. Here we report the draft genome sequence and its annoation of Brassica rapa, one of the two ancestral species of oilseed rape. We modeled 41,174 protein-coding genes in the B. rapa genome. B. rapa has experienced only the second genome triplication reported to date, with its close relationship to A. thaliana providing a useful outgroup for investigating many consequences of triplication for its structural and functional evolution. The extent of gene loss (fractionation) among triplicated genome segments varies, with one copy containing a greater proportion of genes expected to have been present in its ancestor (70%) than the remaining two (46% and 36%). Both a generally rapid evolutionary rate, and specific copy number amplifications of particular gene families, may contribute to the remarkable propensity of Brassica species for the development of new morphological variants. The B. rapa genome provides a new resource for comparative and evolutionary analysis of the Brassicaceae genomes and also a platform for genetic improvement of Brassica oil and vegetable crops.2
Tomato (Solanum lycopersicum) is a major crop plant and a model system for fruit development. Solanum is one of the largest angiosperm genera(1) and includes annual and perennial plants from diverse habitats. Here we present a high-quality genome sequence of domesticated tomato, a draft sequence of its closest wild relative, Solanum pimpinellifolium(2), and compare them to each other and to the potato genome (Solanum tuberosum). The two tomato genomes show only 0.6% nucleotide divergence and signs of recent admixture, but show more than 8% divergence from potato, with nine large and several smaller inversions. In contrast to Arabidopsis, but similar to soybean, tomato and potato small RNAs map predominantly to gene-rich chromosomal regions, including gene promoters. The Solanum lineage has experienced two consecutive genome triplications: one that is ancient and shared with rosids, and a more recent one. These triplications set the stage for the neofunctionalization of genes controlling fruit characteristics, such as colour and fleshiness
Conservation of gene order in vertebrates is evident after hundreds of millions of years of divergence, but comparisons of the Arabidopsis thaliana sequence to partial gene orders of other angiosperms (flowering plants) sharing common ancestry approximately 170-235 million years ago yield conflicting results. This difference may be largely due to the propensity of angiosperms to undergo chromosomal duplication ('polyploidization') and subsequent gene loss ('diploidization'); these evolutionary mechanisms have profound consequences for comparative biology. Here we integrate a phylogenetic approach (relating chromosomal duplications to the tree of life) with a genomic approach (mitigating information lost to diploidization) to show that a genome-wide duplication post-dates the divergence of Arabidopsis from most dicots. We also show that an inferred ancestral gene order for Arabidopsis reveals more synteny with other dicots (exemplified by cotton), and that additional, more ancient duplication events affect more distant taxonomic comparisons. By using partial sequence data for many diverse taxa to better relate the evolutionary history of completely sequenced genomes to the tree of life, we foster comparative approaches to the study of genome organization, consequences of polyploidy, and the molecular basis of quantitative traits.
Polyploidization has provided much genetic variation for plant adaptive evolution, but the mechanisms by which the molecular evolution of polyploid genomes establishes genetic architecture underlying species differentiation are unclear. Brassica is an ideal model to increase knowledge of polyploid evolution. Here we describe a draft genome sequence of Brassica oleracea, comparing it with that of its sister species B. rapa to reveal numerous chromosome rearrangements and asymmetrical gene loss in duplicated genomic blocks, asymmetrical amplification of transposable elements, differential gene co-retention for specific pathways and variation in gene expression, including alternative splicing, among a large number of paralogous and orthologous genes. Genes related to the production of anticancer phytochemicals and morphological variations illustrate consequences of genome duplication and gene divergence, imparting biochemical and morphological variation to B. oleracea. This study provides insights into Brassica genome evolution and will underpin research into the many important crops in this genus.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.