In recent years, the use of molecular data in algal systematics has increased as high-throughput sequencing (HTS) has become more accessible, generating very large data sets at a reasonable cost. In this perspectives paper, our goal is to describe how HTS technologies can advance algal systematics. Following an introduction to some common HTS technologies, we discuss how metabarcoding can accelerate algal species discovery. We show how various HTS methods can be applied to generate datasets for accurate species delimitation, and how HTS can be applied to historical type specimens to assist the nomenclature process. Finally, we discuss how HTS data such as organellar genomes and transcriptomes can be used to construct well resolved phylogenies, leading to a stable and natural classification of algal groups. We include examples of bioinformatic workflows that may be applied to process data for each purpose, along with common programs used to achieve each step. We also discuss possible strategies and the new skill set that will be required to fully embrace HTS as a part of algal systematics, along with considerations of cost and experimental design. HTS technology has revolutionized many fields in biology, and will certainly do the same in algal systematics.
Morphological identification of species in the order Gelidiales can be difficult and controversial because of phenotypic plasticity, the low numbers of reproductive specimens and poorly defined taxonomic boundaries. A DNA barcoding survey of Brazilian specimens of Gelidiales, employing neighbor-joining and Automatic Barcode Gap Discovery analyses, indicated the presence of 23 statistically robust primary species hypotheses (PSH). In addition to the cytochrome oxidase I gene (COI-5P), the chloroplast universal plastid amplicon (UPA) marker was also sequenced and submitted to the same analyses. Representatives of each COI-5P/UPA PSH were selected for rbcL sequence analysis to further corroborate the occurrence of 23 species and to infer their phylogenetic relationships. These analyses confirmed the identity of six species previously cited for Brazil: Gelidiella acerosa, G. ligulata, Gelidium crinale, G. floridanum, Pterocladiella bartlettii and P. capillacea. Three new reports for Brazil were also detected: Gelidium microdonticum, Pterocladiella beachiae and P. australafricanensis. Fourteen species remain unidentified and require detailed morphological evaluation.
Gracilariaceae has a worldwide distribution including numerous economically important species. We applied high-throughput sequencing to obtain organellar genomes (mitochondria and chloroplast) from 10 species of Gracilariaceae and, combined with published genomes, to infer phylogenies and compare genome architecture among species representing main lineages. We obtained similar topologies between chloroplast and mitochondrial genomes phylogenies. However, the chloroplast phylogeny was better resolved with full support. In this phylogeny, Melanthalia intermedia is sister to a monophyletic clade including Gracilaria and Gracilariopsis, which were both resolved as monophyletic genera. Mitochondrial and chloroplast genomes were highly conserved in gene synteny, and variation mainly occurred in regions where insertions of plasmid-derived sequences (PDS) were found. In mitochondrial genomes, PDS insertions were observed in two regions where the transcription direction changes: between the genes cob and trnL, and trnA and trnN. In chloroplast genomes, PDS insertions were in different positions, but generally found between psdD and rrs genes. Gracilariaceae is a good model system to study the impact of PDS in genome evolution due to the frequent presence of these insertions in organellar genomes. Furthermore, the bacterial leuC/leuD operon was found in chloroplast genomes of Gracilaria tenuistipitata, G. chilensis, and M. intermedia, and in extrachromosomal plasmid of G. vermiculophylla. Phylogenetic trees show two different origins of leuC/leuD: genes found in chloroplast and plasmid were placed with proteobacteria, and genes encoded in the nucleus were close to Viridiplantae and cyanobacteria.
Endosymbiosis, the establishment of a former free-living prokaryotic or eukaryotic cell as an organelle inside a host cell, can dramatically alter the genomic architecture of the endosymbiont. Plastids or chloroplasts, the light-harvesting organelle of photosynthetic eukaryotes, are excellent models to study this phenomenon because plastid origin has occurred multiple times in evolution. Here, we investigate the genomic signature of molecular processes acting through secondary plastid endosymbiosis—the origination of a new plastid from a free-living eukaryotic alga. We used phylogenetic comparative methods to study gene loss and changes in selective regimes on plastid genomes, focusing on green algae that have given rise to three independent lineages with secondary plastids (euglenophytes, chlorarachniophytes, and Lepidodinium). Our results show an overall increase in gene loss associated with secondary endosymbiosis, but this loss is tightly constrained by retention of genes essential for plastid function. The data show that secondary plastids have experienced temporary relaxation of purifying selection during secondary endosymbiosis. However, this process is tightly constrained, with selection relaxed only relative to the background in primary plastids. Purifying selection remains strong in absolute terms even during the endosymbiosis events. Selection intensity rebounds to pre-endosymbiosis levels following endosymbiosis events, demonstrating the changes in selection efficiency during different origin phases of secondary plastids. Independent endosymbiosis events in the euglenophytes, chlorarachniophytes, and Lepidodinium differ in their degree of relaxation of selection, highlighting the different evolutionary contexts of these events. This study reveals the selection-drift interplay during secondary endosymbiosis, and evolutionary parallels during organellogenesis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.