2022
DOI: 10.1111/1755-0998.13741
|View full text |Cite
|
Sign up to set email alerts
|

crabs—A software program to generate curated reference databases for metabarcoding sequencing data

Abstract: The measurement of biodiversity is an integral aspect of life science research. With the establishment of second‐ and third‐generation sequencing technologies, an increasing amount of metabarcoding data is being generated as we seek to describe the extent and patterns of biodiversity in multiple contexts. The reliability and accuracy of taxonomically assigning metabarcoding sequencing data have been shown to be critically influenced by the quality and completeness of reference databases. Custom, curated, eukar… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
36
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7

Relationship

1
6

Authors

Journals

citations
Cited by 21 publications
(52 citation statements)
references
References 86 publications
1
36
0
Order By: Relevance
“…We created an annotated reference database of amplicon sequences associated with each primer set using the CRABS software and workflow (Jeunen et al., 2022). This effort involved downloading all arthropod mitochondrial sequences from GenBank and the Barcode of Life Database (BOLD) in May 2023, performing in silico PCR and pairwise global alignment analysis on them to extract amplicons, and performing the recommended cleaning steps to construct a final curated reference database (Jeunen et al., 2022).…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…We created an annotated reference database of amplicon sequences associated with each primer set using the CRABS software and workflow (Jeunen et al., 2022). This effort involved downloading all arthropod mitochondrial sequences from GenBank and the Barcode of Life Database (BOLD) in May 2023, performing in silico PCR and pairwise global alignment analysis on them to extract amplicons, and performing the recommended cleaning steps to construct a final curated reference database (Jeunen et al., 2022).…”
Section: Methodsmentioning
confidence: 99%
“…We created an annotated reference database of amplicon sequences associated with each primer set using the CRABS software and workflow (Jeunen et al., 2022). This effort involved downloading all arthropod mitochondrial sequences from GenBank and the Barcode of Life Database (BOLD) in May 2023, performing in silico PCR and pairwise global alignment analysis on them to extract amplicons, and performing the recommended cleaning steps to construct a final curated reference database (Jeunen et al., 2022). We used the ecotag function within OBITools to match to the reference database and assign taxonomy, using thresholds of ≥99% match to accept identifications at the species level, ≥98% at the genus level, ≥95% at the family, and ≥90% at the order level.…”
Section: Methodsmentioning
confidence: 99%
“…Taxonomy assignments from BLAST results (blastn; megablast; nr/nt database; exclude "uncultured/environmental sample sequences") were processed by an in-house python script to determine the last common ancestor (LCA; Jeunen et al, 2020). For Sintax taxonomy assignments, custom curated reference databases were generated for each of the three assays using CRABS version 1.0.1 (Jeunen, Dowle, et al, 2022; GitHub: https://github.com/gjeun en/refer ence_datab ase_creator; Supporting Information 4 in Appendix S1). Reference databases were generated by downloading sequences from the NCBI nt and MitoFish databases.…”
Section: Bioinformatic Analysismentioning
confidence: 99%
“…Finally, at the most local level, it is necessary to increase awareness and train bioinformaticians and end‐users on the problems specific to reference databases. For example, tools can be implemented to improve the quality control and curation workflows of reference databases such as taxci (Rulik et al, 2017), MetaCurator (Richardson et al, 2020), Anacapa (Curd et al, 2019), BCdatabaser (Keller et al, 2020), RESCRIPt (Robeson et al, 2021), DB4Q2 (Dubois et al, 2022), NEA_fish_DB (Claver et al, 2022), CRABS (Jeunen et al, 2022) and refdb (Keck & Altermatt, 2022). The technical solutions discussed in this paper should be used by scientists willing to compile their own database.…”
Section: Conclusion and Recommendationsmentioning
confidence: 99%