2015
DOI: 10.1101/019372
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

svclassify: a method to establish benchmark structural variant calls

Abstract: Background: The human genome contains variants ranging in size from small single nucleotide polymorphisms (SNPs) to large structural variants (SVs). High-quality benchmark small variant calls for the pilot National Institute of Standards and Technology (NIST) Reference Material (NA12878) have been developed by the Genome in a Bottle Consortium, but no similar high-quality benchmark SV calls exist for this genome. Since SV callers output highly discordant results, we developed methods to combine multiple forms … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

1
88
1

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
4

Relationship

1
8

Authors

Journals

citations
Cited by 47 publications
(90 citation statements)
references
References 29 publications
1
88
1
Order By: Relevance
“…27,28 A subset of CNVs from NA12878 were confirmed and further refined to those with support from multiple technologies using SVClassify . 29 The unique collection of Sanger sequencing from the HuRef sample has also been used to characterize SVs. 30,31 Long reads were used to broadly characterize SVs in a haploid hydatidiform mole cell line.…”
Section: Introductionmentioning
confidence: 99%
“…27,28 A subset of CNVs from NA12878 were confirmed and further refined to those with support from multiple technologies using SVClassify . 29 The unique collection of Sanger sequencing from the HuRef sample has also been used to characterize SVs. 30,31 Long reads were used to broadly characterize SVs in a haploid hydatidiform mole cell line.…”
Section: Introductionmentioning
confidence: 99%
“…For testing the structural variant callers, we downloaded the Genome in a Bottle high-confidence deletion dataset to use as our truth set against sample NA12878 [17]. At the time of initial pipeline development, Manta v1.0.1 [10] was found to have the highest sensitivity (91%) amongst the structural variant software we internally tested.…”
Section: Structural Variant Callingmentioning
confidence: 99%
“…Up to the point of this work, there was few work focusing on predicting SV segments using base-pair RD information for a single sample. To compare model performances with the SVM model used in the SVclassify work [17], we generated SV-containing and non-SV-containing labels based on the segmentation results for a simplified classification task. The classification performance was evaluated using AUC, sensitivity and false discovery rate (FDR).…”
Section: Evaluation Metricmentioning
confidence: 99%