2017
DOI: 10.1093/bioinformatics/btx691
|View full text |Cite
|
Sign up to set email alerts
|

Simulating the dynamics of targeted capture sequencing with CapSim

Abstract: Supplementary data are available at Bioinformatics online.

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
8
1

Relationship

1
8

Authors

Journals

citations
Cited by 14 publications
(10 citation statements)
references
References 5 publications
0
10
0
Order By: Relevance
“…Once genome or transcriptome references are produced or downloaded, the next step is selecting the target loci for bait design. Good starting points for identifying loci with the right amount of genetic variation are the bait design tools MarkerMiner 1.0 (Chamala et al, 2015), BaitFisher (Mayer et al, 2016) and MrBait (Chafin et al, 2018), as well as the simulation package CapSim (Cao et al, 2018). BaitFisher (with its filtering program BaitFilter) and MrBait allow for the design of baits targeting a broad taxonomic spectrum and different enrichment strategies.…”
Section: Designing Bait Setsmentioning
confidence: 99%
“…Once genome or transcriptome references are produced or downloaded, the next step is selecting the target loci for bait design. Good starting points for identifying loci with the right amount of genetic variation are the bait design tools MarkerMiner 1.0 (Chamala et al, 2015), BaitFisher (Mayer et al, 2016) and MrBait (Chafin et al, 2018), as well as the simulation package CapSim (Cao et al, 2018). BaitFisher (with its filtering program BaitFilter) and MrBait allow for the design of baits targeting a broad taxonomic spectrum and different enrichment strategies.…”
Section: Designing Bait Setsmentioning
confidence: 99%
“…We used simulated sequencing data to assess the accuracy of our genotyping algorithm. Generation of simulated data is described in Cao et al (2017) [ 24 ]. We first introduced SNPs and small indels to the reference human genome (hg19) to create 4 diploid genomes - Genome1, Genome2, Genome3 and Genome4.…”
Section: Methodsmentioning
confidence: 99%
“…We then introduced repeat variations into these genomes (Additional file 1 : Table S1). We simulated PacBio whole genome sequencing (WGS) data for Genome 2 and we simulated Illumina targeted sequencing data for these 142 loci for all 4 simulated genomes according to Cao et al (2017) [ 24 ]. For simulation of capture sequencing data, we sampled fragments from each genome according to the length distributions observed in real sequencing data (mean 800 bp and standard deviation 100 bp).…”
Section: Methodsmentioning
confidence: 99%
“…This can be done by specifying the total number of baits, the number/percentage of baits per category of input file, and if different categories are to be included, for example, known SNPs, genomic regions of particular interest, and masked regions. The tiling can also be specified, as SciTools Web Tools from IDT; or ArrayOligoSelector (Bozdech et al, 2003), and simulation programs (Cao et al, 2018) or external providers (e.g., Arbor Biosciences; Roche), to select the final set of baits. Therefore, by following the short pipeline of supeRbaits, large bait sets for population genomics can be generated with the desired bait properties and placement, in a fast and transparent way.…”
Section: The R-pack Ag E Super Bait S and Its Appli C Ati Onmentioning
confidence: 99%