Conventional assessments of ecosystem sample composition are based on morphology-based or DNA barcode identification of individuals. Both approaches are costly and time-consuming, especially when applied to the large number of specimens and taxa commonly included in ecological investigations. Next-generation sequencing approaches can overcome the bottleneck of individual specimen isolation and identification by simultaneously sequencing specimens of all taxa in a bulk mixture. Here we apply multiple parallel amplification primers, multiple DNA barcode markers, 454-pyrosequencing, and Illumina MiSeq sequencing to the same sample to maximize recovery of the arthropod macrobiome and the bacterial and other microbial microbiome of a bulk arthropod sample. We validate this method with a complex sample containing 1,066 morphologically distinguishable arthropods from a tropical terrestrial ecosystem with high taxonomic diversity. Multiamplicon next-generation DNA barcoding was able to recover sequences corresponding to 91% of the distinguishable individuals in a bulk environmental sample, as well as many species present as undistinguishable tissue. 454-pyrosequencing was able to recover 10 more families of arthropods and 30 more species than did conventional Sanger sequencing of each individual specimen. The use of other loci (16S and 18S ribosomal DNA gene regions) also added the detection of species of microbes associated with these terrestrial arthropods. This method greatly decreases the time and money necessary to perform DNA-based comparisons of biodiversity among ecosystem samples. This methodology opens the door to much cheaper and increased capacity for ecological and evolutionary studies applicable to a wide range of socio-economic issues, as well as a basic understanding of how the world works.cytochrome c oxidase subunit I | Costa Rica | insect | Malaise trap | NGS
Genetic information is a valuable component of biosystematics, especially specimen identification through the use of species-specific DNA barcodes. Although many genomics applications have shifted to High-Throughput Sequencing (HTS) or Next-Generation Sequencing (NGS) technologies, sample identification (e.g., via DNA barcoding) is still most often done with Sanger sequencing. Here, we present a scalable double dual-indexing approach using an Illumina Miseq platform to sequence DNA barcode markers. We achieved 97.3% success by using half of an Illumina Miseq flowcell to obtain 658 base pairs of the cytochrome c oxidase I DNA barcode in 1,010 specimens from eleven orders of arthropods. Our approach recovers a greater proportion of DNA barcode sequences from individuals than does conventional Sanger sequencing, while at the same time reducing both per specimen costs and labor time by nearly 80%. In addition, the use of HTS allows the recovery of multiple sequences per specimen, for deeper analysis of genetic variation in target gene regions.
The purpose of this review is to present the most common and emerging DNA-based methods used to generate data for biodiversity and biomonitoring studies. As environmental assessment and monitoring programmes may require biodiversity information at multiple levels, we pay particular attention to the DNA metabarcoding method and discuss a number of bioinformatic tools and considerations for producing DNA-based indicators using operational taxonomic units (OTUs), taxa at a variety of ranks and community composition. By developing the capacity to harness the advantages provided by the newest technologies, investigators can "scale up" by increasing the number of samples and replicates processed, the frequency of sampling over time and space, and even the depth of sampling such as by sequencing more reads per sample or more markers per sample. The ability to scale up is made possible by the reduced hands-on time and cost per sample provided by the newest kits, platforms and software tools. Results gleaned from broad-scale monitoring will provide opportunities to address key scientific questions linked to biodiversity and its dynamics across time and space as well as being more relevant for policymakers, enabling science-based decision-making, and provide a greater socio-economic impact. As genomic approaches are continually evolving, we provide this guide to methods used in biodiversity genomics.
The nuclear ribosomal internal transcribed spacer (ITS) region is the most commonly chosen genetic marker for the molecular identification of fungi in environmental sequencing and molecular ecology studies. Several analytical issues complicate such efforts, one of which is the formation of chimeric—artificially joined—DNA sequences during PCR amplification or sequence assembly. Several software tools are currently available for chimera detection, but rely to various degrees on the presence of a chimera-free reference dataset for optimal performance. However, no such dataset is available for use with the fungal ITS region. This study introduces a comprehensive, automatically updated reference dataset for fungal ITS sequences based on the UNITE database for the molecular identification of fungi. This dataset supports chimera detection throughout the fungal kingdom and for full-length ITS sequences as well as partial (ITS1 or ITS2 only) datasets. The performance of the dataset on a large set of artificial chimeras was above 99.5%, and we subsequently used the dataset to remove nearly 1,000 compromised fungal ITS sequences from public circulation. The dataset is available at http://unite.ut.ee/repository.php and is subject to web-based third-party curation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.