With the goal of evaluating genotyping-by-sequencing (GBS) in a species with a complex octoploid genome, GBS was used to survey genome-wide single-nucleotide polymorphisms (SNPs) in three biparental strawberry (Fragaria ×ananassa) populations. GBS sequence data were aligned to the F. vesca 'Fvb' reference genome in order to call SNPs. Numbers of polymorphic SNPs per population ranged from 1,163 to 3,190. Linkage maps consisting of 30-65 linkage groups were produced from the SNP sets derived from each parent. The linkage groups covered 99% of the Fvb reference genome, with three to seven linkage groups from a given parent aligned to any particular chromosome. A phylogenetic analysis performed using the POLiMAPS pipeline revealed linkage groups that were most similar to ancestral species F. vesca for each chromosome. Linkage groups that were most similar to a second ancestral species, F. iinumae, were only resolved for With the goal of evaluating genotyping-by-sequencing (GBS) in a species with a complex 18 octoploid genome, GBS was used to survey genome-wide single-nucleotide polymorphisms 19 (SNPs) in three biparental strawberry (Fragaria ×ananassa) populations. GBS sequence data 20 were aligned to the F. vesca 'Fvb' reference genome in order to call SNPs. Numbers of 21 polymorphic SNPs per population ranged from 1,163 to 3,190. Linkage maps consisting of 30-22 65 linkage groups were produced from the SNP sets derived from each parent. The linkage 23 groups covered 99% of the Fvb reference genome, with three to seven linkage groups from a 24 given parent aligned to any particular chromosome. A phylogenetic analysis performed using the 25 POLiMAPS pipeline revealed linkage groups that were most similar to ancestral species F. vesca 26 for each chromosome. Linkage groups that were most similar to a second ancestral species, F. 27 iinumae, were only resolved for Fvb 4. The quantity of missing data and heterogeneity in 28 genome coverage inherent in GBS complicated the analysis, but POLiMAPS resolved F.29 ×ananassa chromosomal regions derived from diploid ancestor F. vesca.
BACKGROUND
31Genotyping-by-sequencing (GBS) is a powerful, cost-effective method for identifying 32 single-nucleotide polymorphisms (SNPs) on a whole-genome scale. The GBS technique 33 commonly used involves a form of reduced representation genome sequencing based on partial 34 restriction enzyme digestion, usually with a methylation-sensitive restriction enzyme, followed 35 by barcoded adaptor ligation and next-generation sequencing of highly multiplexed samples, 36 typically 48, 96, or 384 samples per lane (Elshire et al., 2011; Davey et al., 2011). Applications 37 of GBS range from germplasm diversity and population structure assessment to molecular 38 marker discovery. The high throughput and low per-sample cost of GBS makes it an attractive 39 option for plant breeding populations, as it can be used to saturate genetic maps (Russell et al., 40 2014, Ward et al., 2013, perform QTL mapping and genome-wide association analyses (GWAS) 41 for...