Sweetpotato [Ipomoea batatas (L.) Lam.] is a globally important staple food crop, especially for sub-Saharan Africa. Agronomic improvement of sweetpotato has lagged behind other major food crops due to a lack of genomic and genetic resources and inherent challenges in breeding a heterozygous, clonally propagated polyploid. Here, we report the genome sequences of its two diploid relatives, I. trifida and I. triloba, and show that these high-quality genome assemblies are robust references for hexaploid sweetpotato. Comparative and phylogenetic analyses reveal insights into the ancient whole-genome triplication history of Ipomoea and evolutionary relationships within the Batatas complex. Using resequencing data from 16 genotypes widely used in African breeding programs, genes and alleles associated with carotenoid biosynthesis in storage roots are identified, which may enable efficient breeding of varieties with high provitamin A content. These resources will facilitate genome-enabled breeding in this important food security crop.
Premise of the study:Targeted sequencing using next-generation sequencing (NGS) platforms offers enormous potential for plant systematics by enabling economical acquisition of multilocus data sets that can resolve difficult phylogenetic problems. However, because discovery of single-copy nuclear (SCN) loci from NGS data requires both bioinformatics skills and access to high-performance computing resources, the application of NGS data has been limited.Methods and Results:We developed MarkerMiner 1.0, a fully automated, open-access bioinformatic workflow and application for discovery of SCN loci in angiosperms. Our new tool identified as many as 1993 SCN loci from transcriptomic data sampled as part of four independent test cases representing marker development projects at different phylogenetic scales.Conclusions:MarkerMiner is an easy-to-use and effective tool for discovery of putative SCN loci. It can be run locally or via the Web, and its tabular and alignment outputs facilitate efficient downstream assessments of phylogenetic utility, locus selection, intron-exon boundary prediction, and primer or probe development.
Catnip or catmint (Nepeta spp.) is a flowering plant in the mint family (Lamiaceae) famed for its ability to attract cats. This phenomenon is caused by the compound nepetalactone, a volatile iridoid that also repels insects. Iridoids are present in many Lamiaceae species but were lost in the ancestor of the Nepetoideae, the subfamily containing Nepeta. Using comparative genomics, ancestral sequence reconstructions, and phylogenetic analyses, we probed the re-emergence of iridoid biosynthesis in Nepeta. The results of these investigations revealed mechanisms for the loss and subsequent re-evolution of iridoid biosynthesis in the Nepeta lineage. We present evidence for a chronology of events that led to the formation of nepetalactone biosynthesis and its metabolic gene cluster. This study provides insights into the interplay between enzyme and genome evolution in the origins, loss, and re-emergence of plant chemical diversity.
The evolution of chemical complexity has been a major driver of plant diversification, with novel compounds serving as key innovations. The species-rich mint family (Lamiaceae) produces an enormous variety of compounds that act as attractants and defense molecules in nature and are used widely by humans as flavor additives, fragrances, and anti-herbivory agents. To elucidate the mechanisms by which such diversity evolved, we combined leaf transcriptome data from 48 Lamiaceae species and four outgroups with a robust phylogeny and chemical analyses of three terpenoid classes (monoterpenes, sesquiterpenes, and iridoids) that share and compete for precursors. Our integrated chemical-genomic-phylogenetic approach revealed that: (1) gene family expansion rather than increased enzyme promiscuity of terpene synthases is correlated with mono- and sesquiterpene diversity; (2) differential expression of core genes within the iridoid biosynthetic pathway is associated with iridoid presence/absence; (3) generally, production of iridoids and canonical monoterpenes appears to be inversely correlated; and (4) iridoid biosynthesis is significantly associated with expression of geraniol synthase, which diverts metabolic flux away from canonical monoterpenes, suggesting that competition for common precursors can be a central control point in specialized metabolism. These results suggest that multiple mechanisms contributed to the evolution of chemodiversity in this economically important family.
Background
Teak, a member of the Lamiaceae family, produces one of the most expensive hardwoods in the world. High demand coupled with deforestation have caused a decrease in natural teak forests, and future supplies will be reliant on teak plantations. Hence, selection of teak tree varieties for clonal propagation with superior growth performance is of great importance, and access to high-quality genetic and genomic resources can accelerate the selection process by identifying genes underlying desired traits.
Findings
To facilitate teak research and variety improvement, we generated a highly contiguous, chromosomal-scale genome assembly using high-coverage Pacific Biosciences long reads coupled with high-throughput chromatin conformation capture. Of the 18 teak chromosomes, we generated 17 near-complete pseudomolecules with one chromosome present as two chromosome arm scaffolds. Genome annotation yielded 31,168 genes encoding 46,826 gene models, of which, 39,930 and 41,155 had Pfam domain and expression evidence, respectively. We identified 14 clusters of tandem-duplicated terpene synthases (TPSs), genes central to the biosynthesis of terpenes, which are involved in plant defense and pollinator attraction. Transcriptome analysis revealed 10 TPSs highly expressed in woody tissues, of which, 8 were in tandem, revealing the importance of resolving tandemly duplicated genes and the quality of the assembly and annotation. We also validated the enzymatic activity of four TPSs to demonstrate the function of key TPSs.
Conclusions
In summary, this high-quality chromosomal-scale assembly and functional annotation of the teak genome will facilitate the discovery of candidate genes related to traits critical for sustainable production of teak and for anti-insecticidal natural products.
Ancient duplication events and retained gene duplicates have contributed to the evolution of many novel plant traits and, consequently, to the diversity and complexity within and across plant lineages. While mounting evidence highlights the importance of whole-genome duplication (WGD; polyploidy) and its key role as an evolutionary driver, gene duplication dynamics and mechanisms, both of which are fundamental to our understanding of evolutionary process and patterns of plant diversity, remain poorly characterized in many clades. We use newly available transcriptomic data and a robust phylogeny to investigate the prevalence, occurrence, and timing of gene duplications in Lamiaceae (mints), a species-rich and chemically diverse clade with many ecologically, economically, and culturally important species. We also infer putative WGDs—an extreme mechanism of gene duplication—using large-scale data sets from synonymous divergence (KS), phylotranscriptomic, and divergence time analyses. We find evidence for widespread but asymmetrical levels of gene duplication and ancient polyploidy in Lamiaceae that correlate with species richness, including pronounced levels of gene duplication and putative ancient WGDs (7–18 events) within the large subclade Nepetoideae and up to 10 additional WGD events in other subclades. Our results help disentangle WGD-derived gene duplicates from those produced by other mechanisms and illustrate the non-uniformity of duplication dynamics in mints, setting the stage for future investigations that explore their impacts on trait diversity and species diversification. Our results also provide a practical context for evaluating the benefits and limitations of transcriptome-based approaches to inferring WGD, and we offer recommendations for researchers interested in investigating ancient WGDs in other plant groups.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.