2014
DOI: 10.7287/peerj.preprints.314
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

dDocent: a RADseq, variant-calling pipeline designed for population genomics of non-model organisms

Abstract: 14Restriction-site associated DNA sequencing (RADseq) has become a powerful and useful 15 approach for population genomics. Currently, no software exists that utilizes both paired-end 16 reads from RADseq data to efficiently produce population-informative variant calls, 17 especially for organisms with large effective population sizes and high levels of genetic 18 polymorphism but for which no genomic resources exist. dDocent is an analysis pipeline with 19 a user-friendly, command-line interface designed to p… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
148
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 116 publications
(148 citation statements)
references
References 10 publications
0
148
0
Order By: Relevance
“…Population genetic summary statistics, with the exception of F ST , did change quantitatively due to missing data but not qualitatively (not shown) (Cariou, Duret, & Charlat, ). Filtering steps were conducted using VCFtools (Danecek et al., ), custom Python code, and code adapted from Jon Puritz's laboratory (Puritz, Hollenbeck, & Gold, ). Input files and formats for subsequent analysis of population structure were created using a combination of custom Python code, custom R code, and the radiator R package (Gosselin, ).…”
Section: Methodsmentioning
confidence: 99%
“…Population genetic summary statistics, with the exception of F ST , did change quantitatively due to missing data but not qualitatively (not shown) (Cariou, Duret, & Charlat, ). Filtering steps were conducted using VCFtools (Danecek et al., ), custom Python code, and code adapted from Jon Puritz's laboratory (Puritz, Hollenbeck, & Gold, ). Input files and formats for subsequent analysis of population structure were created using a combination of custom Python code, custom R code, and the radiator R package (Gosselin, ).…”
Section: Methodsmentioning
confidence: 99%
“…), de novo assembly of RAD loci using cd‐hit (Fu et al ., ), read mapping back onto the de novo assembly using BWA mem (Li & Durbin, ), SNP calling using FreeBayes (Garrison & Marth, ) and, finally, variant filtering using VCFtools (Danecek et al ., ). We used dDocent, rather than Stacks (Catchen et al ., ), the other commonly used program, because dDocent identifies more SNPs as a consequence of using both forward and reverse reads during alignment and quality trimming instead of removing entire reads (Puritz et al ., ). We used default settings for most of the dDocent pipeline, except sequences were clustered at 95%, instead of 90% similarity.…”
Section: Methodsmentioning
confidence: 97%
“…Sequence reads were then prepared for the SNP analyses using version 2.17 of the dDocent pipeline (Puritz et al ., ) for initial filtering (phred quality scores < 33) and trimming using Trimmomatic (Bolger et al . ), de novo assembly of RAD loci using cd‐hit (Fu et al ., ), read mapping back onto the de novo assembly using BWA mem (Li & Durbin, ), SNP calling using FreeBayes (Garrison & Marth, ) and, finally, variant filtering using VCFtools (Danecek et al ., ).…”
Section: Methodsmentioning
confidence: 99%
“…Quality trimming, assembly of a reduced representation reference, read mapping, and SNP calling were performed using the dDocent pipeline (Puritz et al. ). Two randomly chosen individuals from each sampling location ( n = 10 total) were used for de novo assembly of a reduced representation reference consisting of putatively orthologous loci.…”
Section: Methodsmentioning
confidence: 99%