The analysis of the first plant genomes provided unexpected evidence for genome duplication events in species that had previously been considered as true diploids on the basis of their genetics [1][2][3] . These polyploidization events may have had important consequences in plant evolution, in particular for species radiation and adaptation and for the modulation of functional capacities 4-10 . Here we report a high-quality draft of the genome sequence of grapevine (Vitis vinifera) obtained from a highly homozygous genotype. The draft sequence of the grapevine genome is the fourth one produced so far for flowering plants, the second for a woody species and the first for a fruit crop (cultivated for both fruit and beverage). Grapevine was selected because of its important place in the cultural heritage of humanity beginning during the Neolithic period 11 . Several large expansions of gene families with roles in aromatic features are observed. The grapevine genome has not undergone recent genome duplication, thus enabling the discovery of ancestral traits and features of the genetic organization of flowering plants. This analysis reveals the contribution of three ancestral genomes to the grapevine haploid content. This ancestral arrangement is common to many dicotyledonous plants but is absent from the genome of rice, which is a monocotyledon. Furthermore, we explain the chronology of previously described whole-genome duplication events in the evolution of flowering plants.All grapevine varieties are highly heterozygous; preliminary data showed that there was as much as 13% sequence divergence between alleles, which would hinder reliable contig assembly when a wholegenome shotgun strategy was used for sequencing. Our consortium therefore selected the grapevine PN40024 genotype for sequencing. This line, originally derived from Pinot Noir, has been bred close to full homozygosity (estimated at about 93%) by successive selfings, permitting a high-quality whole-genome shotgun assembly.A total of 6.2 million end-reads were produced by our consortium, representing an 8.4-fold coverage of the genome. Within the assembly, performed with Arachne 12 , 316 supercontigs represent putative allelic haplotypes that constitute 11.6 million bases (Mb). These values are in good fit with the 7% residual heterozygosity of PN40024 assessed by using genetic markers. When considering only one of the haplotypes in each heterozygous region, the assembly (Table 1a) consists of 19,577 contigs (N 50 5 65.9 kilobases (kb), where N 50 corresponds to the size of the shorter supercontig or contig in a subset representing half of the assembly size) and 3,514 supercontigs (N 50 5 2.07 Mb) totalling 487 Mb. This value is close to the 475 Mb previously reported for the grapevine genome size 13 .Using a set of 409 molecular markers from the reference grapevine map 14 , 69% of the assembled 487 Mb, arranged into 45 ultracontigs
The complete sequence of the Arabidopsis thaliana genome revealed thousands of previously unsuspected genes, many of which cannot be ascribed even putative functions. One of the largest and most enigmatic gene families discovered in this way is characterized by tandem arrays of pentatricopeptide repeats (PPRs). We describe a detailed bioinformatic analysis of 441 members of the Arabidopsis PPR family plus genomic and genetic data on the expression (microarray data), localization (green fluorescent protein and red fluorescent protein fusions), and general function (insertion mutants and RNA binding assays) of many family members. The basic picture that arises from these studies is that PPR proteins play constitutive, often essential roles in mitochondria and chloroplasts, probably via binding to organellar transcripts. These results confirm, but massively extend, the very sparse observations previously obtained from detailed characterization of individual mutants in other organisms.
A family of 40 terpenoid synthase genes ( AtTPS) was discovered by genome sequence analysis in Arabidopsis thaliana. This is the largest and most diverse group of TPS genes currently known for any species. AtTPS genes cluster into five phylogenetic subfamilies of the plant TPS superfamily. Surprisingly, thirty AtTPS closely resemble, in all aspects of gene architecture, sequence relatedness and phylogenetic placement, the genes for plant monoterpene synthases, sesquiterpene synthases or diterpene synthases of secondary metabolism. Rapid evolution of these AtTPS resulted from repeated gene duplication and sequence divergence with minor changes in gene architecture. In contrast, only two AtTPS genes have known functions in basic (primary) metabolism, namely gibberellin biosynthesis. This striking difference in rates of gene diversification in primary and secondary metabolism is relevant for an understanding of the evolution of terpenoid natural product diversity. Eight AtTPS genes are interrupted and are likely to be inactive pseudogenes. The localization of AtTPS genes on all five chromosomes reflects the dynamics of the Arabidopsis genome; however, several AtTPS genes are clustered and organized in tandem repeats. Furthermore, some AtTPS genes are localized with prenyltransferase genes ( AtGGPPS, geranylgeranyl diphosphate synthase) in contiguous genomic clusters encoding consecutive steps in terpenoid biosynthesis. The clustered organization may have implications for TPS gene evolution and the evolution of pathway segments for the synthesis of terpenoid natural products. Phylogenetic analyses highlight events in the divergence of the TPS paralogs and suggest orthologous genes and a model for the evolution of the TPS gene family.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.