2013
DOI: 10.1111/2041-210x.12120
|View full text |Cite
|
Sign up to set email alerts
|

SOAPBarcode: revealing arthropod biodiversity through assembly of Illumina shotgun sequences of PCR amplicons

Abstract: Summary1. Metabarcoding of mixed arthropod samples for biodiversity assessment has mostly been carried out on the 454 GS FLX sequencer (Roche, Branford, Connecticut, USA), due to its ability to produce long reads (≥400 bp) that are believed to allow higher taxonomic resolution. The Illumina sequencing platforms, with their much higher throughputs, could potentially reduce sequencing costs and improve sequence quality, but the associated shorter read length (typically <150 bp) has deterred their usage in next-g… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
69
2
1

Year Published

2014
2014
2023
2023

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 53 publications
(73 citation statements)
references
References 27 publications
1
69
2
1
Order By: Relevance
“…However, due to the significant decrease in quality as the individual reads exceed 200 nucleotides, the maximum length of sequence that can be generated with an error rate low enough to enable accurate identification is approximately 450 nucleotides (2 × 300 nucleotides paired reads on a MiSeq). Whilst the methodology is well established for bacterial communities sequencing a 250 nucleotide fragment of the v4 region of the 16S rRNA gene (Kozich et al 2013), this is considerably shorter than the standard barcode length for invertebrates of 650 bp, and in some applications the reduction in length results in poor species-level resolution (Liu et al 2013). Some researchers have explored the use of shorter regions of the standard barcode, or study alternate genes that may provide improved species resolution with shorter fragments; however, this presents the perennial issue of lack of reference sequences (Deagle et al 2014).…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…However, due to the significant decrease in quality as the individual reads exceed 200 nucleotides, the maximum length of sequence that can be generated with an error rate low enough to enable accurate identification is approximately 450 nucleotides (2 × 300 nucleotides paired reads on a MiSeq). Whilst the methodology is well established for bacterial communities sequencing a 250 nucleotide fragment of the v4 region of the 16S rRNA gene (Kozich et al 2013), this is considerably shorter than the standard barcode length for invertebrates of 650 bp, and in some applications the reduction in length results in poor species-level resolution (Liu et al 2013). Some researchers have explored the use of shorter regions of the standard barcode, or study alternate genes that may provide improved species resolution with shorter fragments; however, this presents the perennial issue of lack of reference sequences (Deagle et al 2014).…”
Section: Discussionmentioning
confidence: 99%
“…Some researchers have explored the use of shorter regions of the standard barcode, or study alternate genes that may provide improved species resolution with shorter fragments; however, this presents the perennial issue of lack of reference sequences (Deagle et al 2014). The large number of sequencing reads generated enables some solutions; the use of multiple, shorter overlapping amplicons to assemble longer amplicons or amplifying the standard COI barcode and producing a barcode sequence by assembling together sequences from each end with shotgun sequence of the whole amplicon (Liu et al 2013). Next-generation sequencing technologies are rapidly evolving (e.g., MinIon from Oxford Nanopore), and it is probable that within a short period of time they will be able to generate suitable read-lengths for DNA barcoding, enabling the use of existing databases.…”
Section: Discussionmentioning
confidence: 99%
“…Although NGS can produce sequences more cheaply than Sanger sequencing, current NGS sequence read lengths are typically 200-400 bp, shorter than the standard 658 bp DNA barcoding region, limiting their utility for species-resolution identifications (Liu et al 2013). This limitation is being overcome as newly available platforms employing single molecule real time (SMRT) sequencing technology now yield read lengths of 3000-20 000 bp (Roberts et al 2013), covering the entire DNA barcoding region.…”
Section: Dna Barcodingmentioning
confidence: 99%
“…The task of collating COI genotypes across all metazoans (and particularly small and cryptic invertebrates in remote regions) is arguably more difficult than creating reference data for the overall fewer conserved 18S rDNA genotypes. Critically, the remedy for this situation is to increase α taxonomic [78,79] approaches in the Antarctic region linked to sequencing efforts, the latter in the future being likely realized using shotgun sequencing approaches [16,58,80].…”
Section: Metabarcoding Marker Choice For Antarctic Invertebratesmentioning
confidence: 99%