The well-supported molecular phylogeny provides evidence for delineation of subfamilies within core Asparagales. With advances in technology and bioinformatics tools, the use of massively parallel sequencing will continue to become easier and more affordable for phylogenomic and molecular evolutionary biology investigations.
Whole plastid genomes (plastomes) are being sequenced rapidly from across the green plant tree of life, and phylogenetic analyses of these are increasing resolution and support for relationships that were unresolved in earlier studies. The cool-season grass subfamily, Pooideae, includes important temperate cereals, turf grasses and forage species, yet some aspects of deep phylogeny in the lineage are unresolved. We newly sequenced 25 Pooideae plastomes, and conducted phylogenomic analyses of these and 20 existing plastomes from the subfamily. Most aspects of deep relationship in Pooideae are maximally supported in our analyses, including those among early-diverging tribes.
Technological advances have allowed phylogenomic studies of plants, such as full chloroplast genome (plastome) analysis, to become increasingly popular and economically feasible. Although next–generation short–read sequencing allows for full plastomes to be sequenced relatively rapidly, it requires additional attention using software to assemble these reads into comprehensive sequences. Here we compare the use of three de novo assemblers combined with three contig assembly methods. Seven plastome sequences were analyzed. Three of these were Sanger–sequenced. The other four were assembled from short, single–end read files generated from next–generation libraries. These plastomes represented a total of six grass species (Poaceae), one of which was sequenced in duplicate by the two methods to allow direct comparisons for accuracy. Enumeration of missing sequence and ambiguities allowed for assessments of completeness and efficiency. All methods that used de Bruijn–based de novo assemblers were shown to produce assemblies comparable to the Sanger–sequenced plastomes but were not equally efficient. Contig assembly methods that utilized automatable and repeatable processes were generally more efficient and advantageous when applied to larger scale projects with many full plastomes. However, contig assembly methods that were less automatable and required more manual attention did show utility in determining plastomes with lower read depth that were not able to be assembled when automatable procedures were implemented. Although the methods here were used exclusively to generate grass plastomes, these could be applied to other taxonomic groups if previously sequenced plastomes were available. In addition to comparing sequencing methods, a supplemental guide for short–read plastome assembly and applicable scripts were generated for this study.
The internal transcribed spacers of the nuclear ribosomal RNA gene cluster, termed ITS1 and ITS2, are the most frequently used nuclear markers for phylogenetic analyses across many eukaryotic groups including most plant families. The reasons for the popularity of these markers include: 1.) Ease of amplification due to high copy number of the gene clusters, 2.) Available cost-effective methods and highly conserved primers, 3.) Rapidly evolving markers (i.e. variable between closely related species), and 4.) The assumption (and/or treatment) that these sequences are non-functional, neutrally evolving phylogenetic markers. Here, our analyses of ITS1 and ITS2 for 50 species suggest that both sequences are instead under selective constraints to preserve proper secondary structure, likely to maintain complete self-splicing functions, and thus are not neutrally-evolving phylogenetic markers. Our results indicate the majority of sequence sites are co-evolving with other positions to form proper secondary structure, which has implications for phylogenetic inference. We also found that the lowest energy state and total number of possible alternate secondary structures are highly significantly different between ITS regions and random sequences with an identical overall length and Guanine-Cytosine (GC) content. Lastly, we review recent evidence highlighting some additional problematic issues with using these regions as the sole markers for phylogenetic studies, and thus strongly recommend additional markers and cost-effective approaches for future studies to estimate phylogenetic relationships.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.