Complete mitochondrial genomes have been shown to be reliable markers for phylogeny reconstruction among diverse animal groups. However, the relative difficulty and high cost associated with obtaining de novo full mitogenomes have frequently led to conspicuously low taxon sampling in ensuing studies. Here, we report the successful use of an economical and accessible method for assembling complete or near-complete mitogenomes through shot-gun next-generation sequencing of a single library made from pooled total DNA extracts of numerous target species. To avoid the use of separate indexed libraries for each specimen, and an associated increase in cost, we incorporate standard polymerase chain reaction-based “bait” sequences to identify the assembled mitogenomes. The method was applied to study the higher level phylogenetic relationships in the weevils (Coleoptera: Curculionoidea), producing 92 newly assembled mitogenomes obtained in a single Illumina MiSeq run. The analysis supported a separate origin of wood-boring behavior by the subfamilies Scolytinae, Platypodinae, and Cossoninae. This finding contradicts morphological hypotheses proposing a close relationship between the first two of these but is congruent with previous molecular studies, reinforcing the utility of mitogenomes in phylogeny reconstruction. Our methodology provides a technically simple procedure for generating densely sampled trees from whole mitogenomes and is widely applicable to groups of animals for which bait sequences are the only required prior genome knowledge.
Advances in phylogenomics contribute towards resolving long-standing evolutionary questions. Notwithstanding, genetic diversity contained within more than a billion biological specimens deposited in natural history museums remains recalcitrant to analysis owing to challenges posed by its intrinsically degraded nature. Yet that tantalizing resource could be critical in overcoming taxon sampling constraints hindering our ability to address major evolutionary questions. We addressed this impediment by developing phyloHyRAD, a new bioinformatic pipeline enabling locus recovery at a broad evolutionary scale from HyRAD-X exome capture of museum specimens of low DNA integrity using a benchtop RAD-derived exome-complexity-reduction probe set developed from high DNA integrity specimens. Our new pipeline can also successfully align raw RNAseq transcriptomic and UCE reads with the RAD-derived probe catalog. Using this method, we generated a robust timetree for Carabinae beetles, the lack of which had precluded study of macroevolutionary trends pertaining to their biogeography and wing-morphology evolution. We successfully recovered up to 2945 loci with a mean of 1788 loci across the exome of specimens of varying age. Coverage was not significantly linked to specimen age, demonstrating the wide exploitability of museum specimens. We also recovered fragmentary mitogenomes compatible with Sanger-sequenced mtDNA. Our phylogenomic timetree revealed a Lower Cretaceous origin for crown group Carabinae, with the extinct Aplothorax nested within the genus Calosoma demonstrating the junior synonymy of Aplothorax syn. nov., resulting in the new combination Calosoma (Ctenosta) burchellii (Waterhouse, 1841) comb. nov. This study compellingly illustrates that HyRAD-X and phyloHyRAD efficiently provide genomic-level datasets informative at deep evolutionary scales.
Metagenomic analyses are challenging in metazoans, but high-copy number and repeat regions can be assembled from low-coverage sequencing by “genome skimming,” which is applied here as a new way of characterizing metagenomes obtained in an ecological or taxonomic context. Illumina shotgun sequencing on two pools of Coleoptera (beetles) of approximately 200 species each were assembled into tens of thousands of scaffolds. Repeated low-coverage sequencing recovered similar scaffold sets consistently, although approximately 70% of scaffolds could not be identified against existing genome databases. Identifiable scaffolds included mitochondrial DNA, conserved sequences with hits to expressed sequence tag and protein databases, and known repeat elements of high and low complexity, including numerous copies of rRNA and histone genes. Assemblies of histones captured a diversity of gene order and primary sequence in Coleoptera. Scaffolds with similarity to multiple sites in available coleopteran genome sequences for Dendroctonus and Tribolium revealed high specificity of scaffolds to either of these genomes, in particular for high-copy number repeats. Numerous “clusters” of scaffolds mapped to the same genomic site revealed intra- and/or intergenomic variation within a metagenome pool. In addition to effect of taxonomic composition of the metagenomes, the number of mapped scaffolds also revealed structural differences between the two reference genomes, although the significance of this striking finding remains unclear. Finally, apparently exogenous sequences were recovered, including potential food plants, fungal pathogens, and bacterial symbionts. The “metagenome skimming” approach is useful for capturing the genomic diversity of poorly studied, species-rich lineages and opens new prospects in environmental genomics.
A phylogenetic tree at the species level is still far off for highly diverse insect orders, including the Coleoptera, but the taxonomic breadth of public sequence databases is growing. In addition, new types of data may contribute to increasing taxon coverage, such as metagenomic shotgun sequencing for assembly of mitogenomes from bulk specimen samples. The current study explores the application of these techniques for large-scale efforts to build the tree of Coleoptera. We used shotgun data from 17 different ecological and taxonomic datasets (5 unpublished) to assemble a total of 1942 mitogenome contigs of >3000 bp. These sequences were combined into a single dataset together with all mitochondrial data available at GenBank, in addition to nuclear markers widely used in molecular phylogenetics. The resulting matrix of nearly 16,000 species with two or more loci produced trees (RAxML) showing overall congruence with the Linnaean taxonomy at hierarchical levels from suborders to genera. We tested the role of full-length mitogenomes in stabilizing the tree from GenBank data, as mitogenomes might link terminals with non-overlapping gene representation. However, the mitogenome data were only partly useful in this respect, presumably because of the purely automated approach to assembly and gene delimitation, but improvements in future may be possible by using multiple assemblers and manual curation. In conclusion, the combination of data mining and metagenomic sequencing of bulk samples provided the largest phylogenetic tree of Coleoptera to date, which represents a summary of existing phylogenetic knowledge and a defensible tree of great utility, in particular for studies at the intra-familial level, despite some shortcomings for resolving basal nodes.
The subfamily Carabinae is a diverse clade distributed across all biogeographical regions except Antarctica. In a seminal work, René Jeannel hypothesized a Gondwanan origin for this group, but this has hitherto remained untested with molecular data. We test this hypothesis by using a supermatrix approach. We also infer the most comprehensive phylogeny of the genus Calosoma, the only lineage within Carabinae comprising predominantly flying species. We use a recent timetree of Coleoptera to infer divergence time estimates in Carabinae. Our results identify four main lineages within Calosoma and reject the monophyly of several species groups erected by Jeannel. The subfamily Carabinae is estimated to have arisen in the Jurassic as suggested by Jeannel, and this dating is congruent, to some extent, with a vicariant hypothesis linked to the timing of the fragmentation of Gondwana. The main lineages of Calosoma are suggested to have diverged from each other in the Palaeogene, suggesting a dynamic biogeography, possibly shaped by dispersal rather than vicariance. This pattern could have resulted from the unique morphological evolution in Calosoma, allowing certain lineages to actively fly. Our divergence times within Carabinae are markedly inconsistent with previous studies, therefore reiterating the need for a fine-scale, fossil-based timetree of Adephaga.
Ambrosia beetles (Coleoptera: Curculionidae: Scolytinae and Platypodinae) rely on a symbiosis with fungi for their nutrition. Symbiotic fungi are preserved and transported in specialized storage structures called mycangia. Although pivotal in the symbiosis, mycangia have been notoriously difficult to study, given their minute size and membranous structure. We compared the application of novel visualization methods for the study of mycangia, namely micro-computed tomography (micro-CT) and laser ablation tomography (LATscan) with traditional paraffin sectioning. Micro-CT scanning has shown the greatest promise in new organ discovery, while sectioning remains the only method with sufficient resolution for cellular visualization. All three common types of mycangia (oral, mesonotal, and pronotal) were successfully visualized and presented for different species of ambrosia beetles: Ambrosiodmus minor (Stebbing) 1909, Euplatypus compositus (Say) 1823, Premnobius cavipennis Eichhoff 1878, Scolytoplatypus raja Blandford 1893, Xylosandrus crassiusculus (Motschulsky) 1866 and X. amputatus (Blandford) 1894. A reconstruction of the mycangium and the surrounding musculature in X. amputatus is also presented. The advantages of micro-CT compared to the previously commonly used microtome sectioning include the easy visualization and recording of three-dimensional structures, their position in reference to other internal structures, the ability to distinguish natural aberrations from technical artifacts, and the unprecedented visualizations of the anatomic context of mycangia enabled by the integrated software.
A phylogenetic tree at the species level is still far off for highly diverse insect orders, including the Coleoptera, but the taxonomic breadth of public sequence databases is growing. In addition, new types of data may contribute to increasing taxon coverage, such as metagenomic shotgun 2 sequencing for assembly of mitogenomes from bulk specimen samples. The current study explores the application of these techniques for large-scale efforts to build the tree of Coleoptera. We used shotgun data from 17 different ecological and taxonomic datasets (5 unpublished) to assemble a total of 1942 mitogenome contigs of >3000 bp. These sequences were combined into a single dataset together with all mitochondrial data available at GenBank, in addition to nuclear markers widely used in molecular phylogenetics. The resulting matrix of nearly 16000 species with two or more loci produced trees (RAxML) showing overall congruence with the Linnaean taxonomy at hierarchical levels from suborders to genera. We tested the role of full-length mitogenomes in stabilizing the tree from GenBank data, as mitogenomes might link terminals with non-overlapping gene representation. However, the mitogenome data were only partly useful in this respect, presumably because of the purely automated approach to assembly and gene delimitation, but improvements in future may be possible by using multiple assemblers and manual curation. In conclusion, the combination of data mining and metagenomic sequencing of bulk samples provided the largest phylogenetic tree of Coleoptera to date, which represents a summary of existing phylogenetic knowledge and a defensible tree of great utility, in particular for studies at the intra-familial level, despite some shortcomings for resolving basal nodes.
Establishing well-supported monophyletic groups is a key requirement for producing a natural classification that reflects evolutionary descent. In a phylogenetic framework this is best achieved through dense taxon sampling and the analysis of a robust character dataset, combined with statistical testing of topological hypotheses. This study assesses the monophyly of tribes and subfamilies within the diverse 'broad-nosed weevils' (Curculionidae: Entiminae, Cyclominae and Hyperinae) through analysis of single-locus sequence data for mitochondrial cox1 and rrnL genes, in combination with a 'backbone' of complete and near-complete mitochondrial genome sequences. Maximum likelihood phylogenetic analyses incorporating topological constraints for various higher-taxa were statistically tested using the AU, SH, and KH tests, which indicated that three tribes within Entiminae, as presently classified, are not monophyletic. Moderate and high bootstrap support was also consistent with two entimine tribes (Peritelini and Cylydrorhinini) being each recovered as monophyletic in an unconstrained analysis. Furthermore, one genus of cyclomine weevils (Aphela) is recovered outside the clade of 'broad-nosed weevils', although its taxonomic placement remains uncertain. It is apparent that the present approach may be hampered by limited taxon sampling in the 'backbone' dataset, rendering it difficult for divergent taxa to robustly match to their closest lineages. However, with improved taxon sampling of the mitogenome tree, the general approach can be a useful taxonomic tool for weevils.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.