Complete mitochondrial genomes have been shown to be reliable markers for phylogeny reconstruction among diverse animal groups. However, the relative difficulty and high cost associated with obtaining de novo full mitogenomes have frequently led to conspicuously low taxon sampling in ensuing studies. Here, we report the successful use of an economical and accessible method for assembling complete or near-complete mitogenomes through shot-gun next-generation sequencing of a single library made from pooled total DNA extracts of numerous target species. To avoid the use of separate indexed libraries for each specimen, and an associated increase in cost, we incorporate standard polymerase chain reaction-based “bait” sequences to identify the assembled mitogenomes. The method was applied to study the higher level phylogenetic relationships in the weevils (Coleoptera: Curculionoidea), producing 92 newly assembled mitogenomes obtained in a single Illumina MiSeq run. The analysis supported a separate origin of wood-boring behavior by the subfamilies Scolytinae, Platypodinae, and Cossoninae. This finding contradicts morphological hypotheses proposing a close relationship between the first two of these but is congruent with previous molecular studies, reinforcing the utility of mitogenomes in phylogeny reconstruction. Our methodology provides a technically simple procedure for generating densely sampled trees from whole mitogenomes and is widely applicable to groups of animals for which bait sequences are the only required prior genome knowledge.
Advances in phylogenomics contribute towards resolving long-standing evolutionary questions. Notwithstanding, genetic diversity contained within more than a billion biological specimens deposited in natural history museums remains recalcitrant to analysis owing to challenges posed by its intrinsically degraded nature. Yet that tantalizing resource could be critical in overcoming taxon sampling constraints hindering our ability to address major evolutionary questions. We addressed this impediment by developing phyloHyRAD, a new bioinformatic pipeline enabling locus recovery at a broad evolutionary scale from HyRAD-X exome capture of museum specimens of low DNA integrity using a benchtop RAD-derived exome-complexity-reduction probe set developed from high DNA integrity specimens. Our new pipeline can also successfully align raw RNAseq transcriptomic and UCE reads with the RAD-derived probe catalog. Using this method, we generated a robust timetree for Carabinae beetles, the lack of which had precluded study of macroevolutionary trends pertaining to their biogeography and wing-morphology evolution. We successfully recovered up to 2945 loci with a mean of 1788 loci across the exome of specimens of varying age. Coverage was not significantly linked to specimen age, demonstrating the wide exploitability of museum specimens. We also recovered fragmentary mitogenomes compatible with Sanger-sequenced mtDNA. Our phylogenomic timetree revealed a Lower Cretaceous origin for crown group Carabinae, with the extinct Aplothorax nested within the genus Calosoma demonstrating the junior synonymy of Aplothorax syn. nov., resulting in the new combination Calosoma (Ctenosta) burchellii (Waterhouse, 1841) comb. nov. This study compellingly illustrates that HyRAD-X and phyloHyRAD efficiently provide genomic-level datasets informative at deep evolutionary scales.
Metagenomic analyses are challenging in metazoans, but high-copy number and repeat regions can be assembled from low-coverage sequencing by “genome skimming,” which is applied here as a new way of characterizing metagenomes obtained in an ecological or taxonomic context. Illumina shotgun sequencing on two pools of Coleoptera (beetles) of approximately 200 species each were assembled into tens of thousands of scaffolds. Repeated low-coverage sequencing recovered similar scaffold sets consistently, although approximately 70% of scaffolds could not be identified against existing genome databases. Identifiable scaffolds included mitochondrial DNA, conserved sequences with hits to expressed sequence tag and protein databases, and known repeat elements of high and low complexity, including numerous copies of rRNA and histone genes. Assemblies of histones captured a diversity of gene order and primary sequence in Coleoptera. Scaffolds with similarity to multiple sites in available coleopteran genome sequences for Dendroctonus and Tribolium revealed high specificity of scaffolds to either of these genomes, in particular for high-copy number repeats. Numerous “clusters” of scaffolds mapped to the same genomic site revealed intra- and/or intergenomic variation within a metagenome pool. In addition to effect of taxonomic composition of the metagenomes, the number of mapped scaffolds also revealed structural differences between the two reference genomes, although the significance of this striking finding remains unclear. Finally, apparently exogenous sequences were recovered, including potential food plants, fungal pathogens, and bacterial symbionts. The “metagenome skimming” approach is useful for capturing the genomic diversity of poorly studied, species-rich lineages and opens new prospects in environmental genomics.
A phylogenetic tree at the species level is still far off for highly diverse insect orders, including the Coleoptera, but the taxonomic breadth of public sequence databases is growing. In addition, new types of data may contribute to increasing taxon coverage, such as metagenomic shotgun sequencing for assembly of mitogenomes from bulk specimen samples. The current study explores the application of these techniques for large-scale efforts to build the tree of Coleoptera. We used shotgun data from 17 different ecological and taxonomic datasets (5 unpublished) to assemble a total of 1942 mitogenome contigs of >3000 bp. These sequences were combined into a single dataset together with all mitochondrial data available at GenBank, in addition to nuclear markers widely used in molecular phylogenetics. The resulting matrix of nearly 16,000 species with two or more loci produced trees (RAxML) showing overall congruence with the Linnaean taxonomy at hierarchical levels from suborders to genera. We tested the role of full-length mitogenomes in stabilizing the tree from GenBank data, as mitogenomes might link terminals with non-overlapping gene representation. However, the mitogenome data were only partly useful in this respect, presumably because of the purely automated approach to assembly and gene delimitation, but improvements in future may be possible by using multiple assemblers and manual curation. In conclusion, the combination of data mining and metagenomic sequencing of bulk samples provided the largest phylogenetic tree of Coleoptera to date, which represents a summary of existing phylogenetic knowledge and a defensible tree of great utility, in particular for studies at the intra-familial level, despite some shortcomings for resolving basal nodes.
Ambrosia beetles (Coleoptera: Curculionidae: Scolytinae and Platypodinae) rely on a symbiosis with fungi for their nutrition. Symbiotic fungi are preserved and transported in specialized storage structures called mycangia. Although pivotal in the symbiosis, mycangia have been notoriously difficult to study, given their minute size and membranous structure. We compared the application of novel visualization methods for the study of mycangia, namely micro-computed tomography (micro-CT) and laser ablation tomography (LATscan) with traditional paraffin sectioning. Micro-CT scanning has shown the greatest promise in new organ discovery, while sectioning remains the only method with sufficient resolution for cellular visualization. All three common types of mycangia (oral, mesonotal, and pronotal) were successfully visualized and presented for different species of ambrosia beetles: Ambrosiodmus minor (Stebbing) 1909, Euplatypus compositus (Say) 1823, Premnobius cavipennis Eichhoff 1878, Scolytoplatypus raja Blandford 1893, Xylosandrus crassiusculus (Motschulsky) 1866 and X. amputatus (Blandford) 1894. A reconstruction of the mycangium and the surrounding musculature in X. amputatus is also presented. The advantages of micro-CT compared to the previously commonly used microtome sectioning include the easy visualization and recording of three-dimensional structures, their position in reference to other internal structures, the ability to distinguish natural aberrations from technical artifacts, and the unprecedented visualizations of the anatomic context of mycangia enabled by the integrated software.
The subfamily Carabinae is a diverse clade distributed across all biogeographical regions except Antarctica. In a seminal work, René Jeannel hypothesized a Gondwanan origin for this group, but this has hitherto remained untested with molecular data. We test this hypothesis by using a supermatrix approach. We also infer the most comprehensive phylogeny of the genus Calosoma, the only lineage within Carabinae comprising predominantly flying species. We use a recent timetree of Coleoptera to infer divergence time estimates in Carabinae. Our results identify four main lineages within Calosoma and reject the monophyly of several species groups erected by Jeannel. The subfamily Carabinae is estimated to have arisen in the Jurassic as suggested by Jeannel, and this dating is congruent, to some extent, with a vicariant hypothesis linked to the timing of the fragmentation of Gondwana. The main lineages of Calosoma are suggested to have diverged from each other in the Palaeogene, suggesting a dynamic biogeography, possibly shaped by dispersal rather than vicariance. This pattern could have resulted from the unique morphological evolution in Calosoma, allowing certain lineages to actively fly. Our divergence times within Carabinae are markedly inconsistent with previous studies, therefore reiterating the need for a fine-scale, fossil-based timetree of Adephaga.
The New World scarab beetle tribe Phanaeini contains coprophagous, necrophagous, mycetophagous and suspected myrmecophilous species. We analyse the largest tribal molecular dataset assembled, incorporating, for the first time, the enigmatic monobasic genus Megatharsis, the thalassinus group of the subgenus Coprophanaeus (Metallophanaeus), and the subgenus Dendropaemon (Eurypodea) (formerly Tetramereia), unveiling their macroevolutionary and biogeographical history in light of Cenozoic abiotic changes and inferring shifts in feeding biology through time. We recover the contentious genus Gromphas outside an otherwise monophyletic Phanaeini. We infer Megatharsis in a clade containing the apparent myrmecophilous genus Dendropaemon, within the Coprophanaeus clade, and demonstrate that the subgenus Coprophanaeus (Metallophanaeus) is polyphyletic, whilst species groups within the subgenus Coprophanaeus (Coprophanaeus) are monophyletic. Our divergence time analyses and ancestral range estimation indicate an eastern South American origin for Phanaeini in the early Eocene, with subsequent colonization of Central America and the Nearctic during the Oligocene, long before a Panamanian land bridge. A shift to necrophagy in Coprophanaeus is possibly linked to increasing Neotropical small vertebrate diversity since the Eocene and, astonishingly, myrmecophily evolved from necrophagy 35 Mya. These drastic shifts in lifestyle are not concordant with variations in diversification rates and appear unlinked to Quaternary extinction of large mammals.
A phylogenetic tree at the species level is still far off for highly diverse insect orders, including the Coleoptera, but the taxonomic breadth of public sequence databases is growing. In addition, new types of data may contribute to increasing taxon coverage, such as metagenomic shotgun 2 sequencing for assembly of mitogenomes from bulk specimen samples. The current study explores the application of these techniques for large-scale efforts to build the tree of Coleoptera. We used shotgun data from 17 different ecological and taxonomic datasets (5 unpublished) to assemble a total of 1942 mitogenome contigs of >3000 bp. These sequences were combined into a single dataset together with all mitochondrial data available at GenBank, in addition to nuclear markers widely used in molecular phylogenetics. The resulting matrix of nearly 16000 species with two or more loci produced trees (RAxML) showing overall congruence with the Linnaean taxonomy at hierarchical levels from suborders to genera. We tested the role of full-length mitogenomes in stabilizing the tree from GenBank data, as mitogenomes might link terminals with non-overlapping gene representation. However, the mitogenome data were only partly useful in this respect, presumably because of the purely automated approach to assembly and gene delimitation, but improvements in future may be possible by using multiple assemblers and manual curation. In conclusion, the combination of data mining and metagenomic sequencing of bulk samples provided the largest phylogenetic tree of Coleoptera to date, which represents a summary of existing phylogenetic knowledge and a defensible tree of great utility, in particular for studies at the intra-familial level, despite some shortcomings for resolving basal nodes.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.