2017
DOI: 10.1186/s12859-017-2000-6
|View full text |Cite
|
Sign up to set email alerts
|

A comparison of genotyping-by-sequencing analysis methods on low-coverage crop datasets shows advantages of a new workflow, GB-eaSy

Abstract: BackgroundGenotyping-by-sequencing (GBS), a method to identify genetic variants and quickly genotype samples, reduces genome complexity by using restriction enzymes to divide the genome into fragments whose ends are sequenced on short-read sequencing platforms. While cost-effective, this method produces extensive missing data and requires complex bioinformatics analysis. GBS is most commonly used on crop plant genomes, and because crop plants have highly variable ploidy and repeat content, the performance of G… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
52
1

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 68 publications
(54 citation statements)
references
References 34 publications
1
52
1
Order By: Relevance
“…There are multiple reasons we chose to develop our own computational pipeline for GBS rather than using existing workflows. Foremost, the prominent GBS analysis pipelines were developed and optimized for use in crop species (Sonah et al 2013;Catchen et al 2013;Glaubitz et al 2014;Torkamaneh et al 2017;Wickland et al 2017), some of which are polyploid and have differing levels of variation and LD than outbred rodent populations. Additionally, there were elements of each pipeline that did not meet our needs or lacked customizability.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…There are multiple reasons we chose to develop our own computational pipeline for GBS rather than using existing workflows. Foremost, the prominent GBS analysis pipelines were developed and optimized for use in crop species (Sonah et al 2013;Catchen et al 2013;Glaubitz et al 2014;Torkamaneh et al 2017;Wickland et al 2017), some of which are polyploid and have differing levels of variation and LD than outbred rodent populations. Additionally, there were elements of each pipeline that did not meet our needs or lacked customizability.…”
Section: Discussionmentioning
confidence: 99%
“…However, those publications used protocols that had not been optimized, leaving significant room for improvement in genotype quality and marker density. Additionally, although several tools and workflows for the analysis of GBS data have been described, including Stacks (Catchen et al 2013), IGST-GBS (Sonah et al 2013), TASSEL-GBS (Glaubitz et al 2014), Fast-GBS (Torkamaneh et al 2017), and GB-eaSy (Wickland et al 2017), the majority were developed and optimized for use in plant species. Given the lack of well-developed genomic resources in these species, they do not leverage the wealth of genomic data available for model organisms such as rats.…”
mentioning
confidence: 99%
“…This software was developed mainly for organisms without reference genomes and high-depth RAD sequencing. Additionally, Stacks is between the pipelines, with high accuracy for SNPs calling on this kind of data [34,35].…”
Section: Ddradseq Data Analysesmentioning
confidence: 99%
“…Pooled libraries were sequenced on one lane of the Illumina HiSeq 4000 in 2×150 bp paired-end mode yielding approximately 467 million paired-end reads (>140 Gb). Single nucleotide polymorphism (SNP) calling was performed using the GBeaSy analysis pipeline [115] with the following filter settings: minimum read length of 30bp after barcode and adapter trim, minimum phred-scaled variant quality of 30 and minimum read depth of 5 at the sample level. This yielded a total of 3,775,496 SNPs that were further filtered using VCFtools 0.1.13 [116].…”
Section: Methodsmentioning
confidence: 99%