2018
DOI: 10.1093/bioinformatics/bty482
|View full text |Cite
|
Sign up to set email alerts
|

Metaxa2 Database Builder: enabling taxonomic identification from metagenomic or metabarcoding data using any genetic marker

Abstract: MotivationCorrect taxonomic identification of DNA sequences is central to studies of biodiversity using both shotgun metagenomic and metabarcoding approaches. However, no genetic marker gives sufficient performance across all the biological kingdoms, hampering studies of taxonomic diversity in many groups of organisms. This has led to the adoption of a range of genetic markers for DNA metabarcoding. While many taxonomic classification software tools can be re-trained on these genetic markers, they are often de… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
7
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 37 publications
(26 citation statements)
references
References 41 publications
0
26
0
Order By: Relevance
“…For this, we first removed exceptionally long or short sequence entries as well as any entries containing three or more consecutive uncalled base pairs from the locus‐specific data sets. For the rbcL , trnL and whole chloroplast genome data sets, we then isolated archetypical trnL and rbcL reference sequences for each locus using the primers employed during pollen metabarcoding and used these sequences in combination with the HMM‐based metaxa2 Database Builder tool (v1.0 4th beta; Bengtsson‐Palme et al, ) to extract the amplicon of interest. However, trnH sequences were too divergent for this approach.…”
Section: Methodsmentioning
confidence: 99%
“…For this, we first removed exceptionally long or short sequence entries as well as any entries containing three or more consecutive uncalled base pairs from the locus‐specific data sets. For the rbcL , trnL and whole chloroplast genome data sets, we then isolated archetypical trnL and rbcL reference sequences for each locus using the primers employed during pollen metabarcoding and used these sequences in combination with the HMM‐based metaxa2 Database Builder tool (v1.0 4th beta; Bengtsson‐Palme et al, ) to extract the amplicon of interest. However, trnH sequences were too divergent for this approach.…”
Section: Methodsmentioning
confidence: 99%
“…When target taxa are missing from the reference database, users run the risk of making false positive and false negative taxonomic assignments [ 16 ]. The mis-assignment of a metabarcode to the wrong species with high confidence, because the target species is missing from the database has been called a false-positive assignment or over prediction in the literature [ 17 ]. False negatives also occur when there are gaps in the database leading to low confidence taxonomic assignments.…”
Section: Introductionmentioning
confidence: 99%
“…PIDS is potentially an efficient DNA fingerprint management platform that can be used for crop breeding research, variety protection, and research in the fields of medicine, crime, and microbiology. Although the primers used for different species will be different, the requirements for a DNA fingerprint database are similar [21,22]. Species type fields can be added to the sample information table to distinguish samples.…”
Section: Discussionmentioning
confidence: 99%
“…DNA fingerprints can be managed systematically by a computer, and can be organized in DNA fingerprint databases [21]. DNA fingerprint databases are essential and important tools for plant molecular research because they provide powerful technical and information support for crop breeding, variety quality control, variety right protection, and molecular marker-assisted breeding [22,23].…”
Section: Introductionmentioning
confidence: 99%