Development and validation of a multi-locus DNA metabarcoding method to identify endangered species in complex samples

Arulandhu, Alfred J.; Staats, Martijn; Hagelaar, Rico; Voorhuijzen, Marleen M.; Prins, T.W.; Scholtens, I.M.J.; Costessi, Adalberto; Duijsings, Daniël; Rechenmann, François; Gaspar, Frédéric B.; Crespo, Maria Teresa Barreto; Holst‐Jensen, Arne; Birck, Matthew; Burns, Malcolm; Haynes, Edward; Hochegger, Rupert; Klingl, Alexander; Lundberg, Lisa; Natale, Chiara; Niekamp, Hauke; Perri, E.; Barbante, Alessandra; Rosec, Jean-Philippe; Seyfarth, Ralf; Sovová, Tereza; Moorleghem, Christoff Van; Ruth, Saskia M. van; Peelen, Tamara; Kok, E.J.

doi:10.1093/gigascience/gix080

Cited by 71 publications

(74 citation statements)

References 43 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Stat et al, 2017). However, few metabarcode pipelines are explicitly designed to process multilocus high-throughput sequencing data (but see Arulandhu et al, 2017). As such, researchers must sort and process multiple eDNA metabarcodes independently, substantially increasing computation time with each additional metabarcode.…”

Section: Introductionmentioning

confidence: 99%

Anacapa Toolkit: an environmental DNA toolkit for processing multilocus metabarcode datasets

Curd

Gold

Kandlikar

et al. 2018

Preprint

106

View full text Add to dashboard Cite

are co-equal second authors.Robert Wayne and Rachel S. Meyer are co-equal senior authors. Abstract 1. Environmental DNA (eDNA) metabarcoding is a promising method to monitor species and community diversity that is rapid, affordable and non-invasive. The longstanding needs of the eDNA community are modular informatics tools, comprehensive and customizable reference databases, flexibility across high-throughput sequencing platforms, fast multilocus metabarcode processing and accurate taxonomic assignment. Improvements in bioinformatics tools make addressing each of these demands within a single toolkit a reality.2. The new modular metabarcode sequence toolkit Anacapa (https ://github.com/ limey-bean/Anaca pa/) addresses the above needs, allowing users to build comprehensive reference databases and assign taxonomy to raw multilocus metabarcode sequence data. A novel aspect of Anacapa is its database building module, "Creating Reference libraries Using eXisting tools" (CRUX), which generates comprehensive reference databases for specific user-defined metabarcoding loci. The Quality Control and ASV Parsing module sorts and processes multiple metabarcoding loci and processes merged, unmerged and unpaired reads maximizing recovered diversity. DADA2 then detects amplicon sequence variants (ASVs) and the Anacapa Classifier module aligns these ASVs to CRUX-generated reference databases using Bowtie2. Lastly, taxonomy is assigned to ASVs with confidence scores using a Bayesian Lowest Common Ancestor (BLCA) method. The Anacapa Toolkit also includes an r package, ranacapa, for automated results exploration through standard biodiversity statistical analysis.3. Benchmarking tests verify that the Anacapa Toolkit effectively and efficiently generates comprehensive reference databases that capture taxonomic diversity, and can assign taxonomy to both MiSeq and HiSeq-length sequence data. We demonstrate the value of the Anacapa Toolkit in assigning taxonomy to seawater eDNA samples collected in southern California.

show abstract

Section: Introductionmentioning

confidence: 99%

Anacapa Toolkit: an environmental DNA toolkit for processing multilocus metabarcode datasets

Curd

Gold

Kandlikar

et al. 2018

Preprint

106

View full text Add to dashboard Cite

show abstract

“…Due to the absence of ITS2 location annotations of some raw data, these sequences were moved to the candidate dataset first, and then a Hidden Markov Model was trained based on well-annotated ITS2 sequences to predict the potential ITS2 regions of these candidate sequences, before these ITS2 sequences could be included into our curated database. For all ITS2 sequences extracted based on the annotations, quality filter was performed in accordance with criteria as follow ( Figure 1A ): (1) length below 100 bp, (2) length above 900 bp, (3) belonging to reduplicate entries, (4) with more than three ambiguous base pairs, (5) belonging to environment samples or unclassified samples. The quality control steps filtered ITS2 entries with either low sequence quality or obscure taxonomy annotation.…”

Section: Methodsmentioning

confidence: 99%

“…Researches on plants would be carried out with the aid to the knowledge accumulated through the deep examination of the plants. On the other hand, as for researches on biodiversity and conservation of endangered fauna and flora, building accurate knowledge-base of plants is essential for their rational protection, which would also aid for preventing illegal trade of endangered plants (4). Therefore, accurate and rapid identification of plants would be essential for safe and rational utilization of plant resources and effective study and protection of plant biodiversity.…”

Section: Introductionmentioning

confidence: 99%

Holmes-ITS2: Consolidated ITS2 resources and search engines for plant DNA-based marker analyses

Bai

et al. 2018

Preprint

View full text Add to dashboard Cite

11Plants are valuable resources for a variety of products in modern societies. Plant species 12 identification is an integral part of research and practical application on plants. In parallel with 13 high-throughput sequencing technology, the high-throughput screening of species is in high 14 demand. Highly accurate and efficient DNA-based marker identification is essential for the 15 effective analysis of plant species or biological constituents of a mixture of plants as well. 16Therefore, it is of general interests and significance to generate a comprehensive and 17 accurate DNA-based marker sequence resource, as well as to build efficient sequence search 18 engines, for the accurate and fast identification of plant species. 19 20In this work, we have firstly established a high-quality ITS2 sequence database of plant 21 species containing more than 150,000 entries, through the systematical collection and 22 manually collation of the published ITS2 sequencing data of plant species, data quality control, 23 as well as representative sequence refinement based on clustering method. Secondly, an 24 accurate and efficient plant species identification system based on ITS2 sequence was 25 constructed, which is the proper combination of sequence search algorithms including BLAST 26 and Kraken. Through the deployment of high-performance and frequently updated web service, 27 it's expected to serve for a wide range of researchers involving the taxonomy classification of 28 plant species, as well as for deciphering of plant mixed systems including herbal materials in 29 TCM preparations. 30 31The Holmes-ITS2 web service is freely accessible at: http://its2.tcm.microbioinformatics.org/. 32The input of this web service could be multiple sequences in a single fasta format, to search 33 for matching ITS2 biomarker sequences already annotated in the database. This 34 sequence-based search is based on two engines: BLAST, and k-mer based Kraken. 35Alternatively, users can directly search for species name for the corresponding ITS2 biomarker 36 sequences. The web service has been put to the test by more than 50 experts from China, 37Denmark and US, and the average running time for the search ranges from 3-30 seconds for 38 up to 100 sequences as a batch query. 39 40 peer-reviewed)

show abstract

“…Arulandhu et al. (2017) developed a multi‐locus DNA metabarcoding method based on Illumina MiSeq and 12 DNA barcode markers, which demonstrated universal applicability to species identification across a wide range of plant and animal taxa in samples containing degraded DNA. In this work, 29 animal and 17 plant reference species, representing a phylogenetically wide range of taxa (Mammalia, Actinopterygii, Malacostraca, Bivalvia, Aves, Reptilia, Amphibia, Insecta, Angiospermae, and Cycadopsida) were used to prepare complex mixtures containing two to 11 taxonomically well‐characterized species present in relative concentrations (1% to 47%).…”

Section: Advanced Genome‐based Approachesmentioning

confidence: 99%

“…Nevertheless, Arulandhu et al. (2017) concluded that the proposed multi‐locus DNA metabarcoding method could provide reliable and detailed data on the composition of highly complex food products and supplements.…”

Section: Advanced Genome‐based Approachesmentioning

confidence: 99%

Botanical origin authentication of dietary supplements by DNA‐based approaches

Grazina

Amaral

Mafra

2020

Comp Rev Food Sci Food Safe

View full text Add to dashboard Cite

Herbal products, such as dietary supplements, have become a subject of increasing global importance for their health benefits and economic considerations. However, they have also been targets of adulteration practices, being the accurate identification of botanicals in herbal products of utmost importance to protect the health and expectations of consumers. Particularly, in the case of dietary supplements, which can have different types of formulations, the identification of plant material used in their production is often a research challenge. DNA-based techniques have played a crucial role on the development of a wide range of tools for the authentication of herbal products.Therefore, this review intends to describe their main progresses, critically discussing their advantages and drawbacks when applied to authenticate herbal products, focusing on dietary supplements. DNA barcoding is particularly emphasized because it has provided the highest number of applications, followed by the advances on highresolution melting analysis combined with DNA barcodes. A special emphasis is also given to the promising approaches relying on DNA metabarcoding and isothermal amplification.

show abstract

Development and validation of a multi-locus DNA metabarcoding method to identify endangered species in complex samples

Cited by 71 publications

References 43 publications

Anacapa Toolkit: an environmental DNA toolkit for processing multilocus metabarcode datasets

Anacapa Toolkit: an environmental DNA toolkit for processing multilocus metabarcode datasets

Holmes-ITS2: Consolidated ITS2 resources and search engines for plant DNA-based marker analyses

Botanical origin authentication of dietary supplements by DNA‐based approaches

Contact Info

Product

Resources

About