2020
DOI: 10.1186/s12859-020-03852-4
|View full text |Cite
|
Sign up to set email alerts
|

Natrix: a Snakemake-based workflow for processing, clustering, and taxonomically assigning amplicon sequencing reads

Abstract: Background Sequencing of marker genes amplified from environmental samples, known as amplicon sequencing, allows us to resolve some of the hidden diversity and elucidate evolutionary relationships and ecological processes among complex microbial communities. The analysis of large numbers of samples at high sequencing depths generated by high throughput sequencing technologies requires efficient, flexible, and reproducible bioinformatics pipelines. Only a few existing workflows can be run in a user-friendly, sc… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
14
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
8

Relationship

4
4

Authors

Journals

citations
Cited by 18 publications
(14 citation statements)
references
References 27 publications
0
14
0
Order By: Relevance
“…Since NOREC4DNA reads FASTA files line by line, sorting the clustered sequences by how often the sequences have been sequenced will allow NOREC4DNA to work more reliably. While most modern sequencing pipelines perform these steps accordingly, we use the Snakemake-based pipeline ‘Natrix’ [ 33 ].…”
Section: Methodsmentioning
confidence: 99%
“…Since NOREC4DNA reads FASTA files line by line, sorting the clustered sequences by how often the sequences have been sequenced will allow NOREC4DNA to work more reliably. While most modern sequencing pipelines perform these steps accordingly, we use the Snakemake-based pipeline ‘Natrix’ [ 33 ].…”
Section: Methodsmentioning
confidence: 99%
“…Adapter removal, quality trimming, and demultiplexing using index sequences were performed by the sequencing company (Fasteris). Sequence processing was performed using a provisional version of the Natrix pipeline (Welzel et al, 2020). Base quality of raw sequence reads was rechecked using the fastqc software (v0.11.5; Andrews, 2010) and reads with an average Phred quality score below 25 or with at least one base with a Phred quality score below 15 were removed using prinseq‐lite (v0.20.4; Schmieder & Edwards, 2011).…”
Section: Methodsmentioning
confidence: 99%
“…For the amplicon analysis, we used the standardized workflow Natrix, including (i) quality filtering, (ii) clustering, and (iii) taxonomy annotation ( Welzel et al, 2020 ). The quality of the sequencing reads was re-checked using FastQC (v0.11.8), and low-quality tails were removed from the reads using PRINSEQ ( Schmieder and Edwards, 2011 ) (v0.20.4).…”
Section: Methodsmentioning
confidence: 99%