2017
DOI: 10.1101/190454
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Identifying structural variants using linked-read sequencing data

Abstract: Structural variation, including large deletions, duplications, inversions, translocations, and other rearrangements, is common in human and cancer genomes. A number of methods have been developed to identify structural variants from Illumina short-read sequencing data. However, reliable identification of structural variants remains challenging because many variants have breakpoints in repetitive regions of the genome and thus are difficult to identify with short reads. The recently developed linked-read sequen… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
34
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 30 publications
(34 citation statements)
references
References 36 publications
0
34
0
Order By: Relevance
“…Overall, the method combines the advantage of long‐read approach to the reliability of short‐read sequencing. This method has already shown its efficacy in SV detection (Elyanow, Wu, & Raphael, ; Zheng et al, ).…”
Section: Introductionmentioning
confidence: 99%
“…Overall, the method combines the advantage of long‐read approach to the reliability of short‐read sequencing. This method has already shown its efficacy in SV detection (Elyanow, Wu, & Raphael, ; Zheng et al, ).…”
Section: Introductionmentioning
confidence: 99%
“…Furthermore, when a genome G derives from a diploid reference R we can not measure a set A(G) of adjacencies in G directly, but rather we can only measure an obfuscated version of the set A N (G) of novel adjacencies in G. That is, for every novel adjacency {j H , k ® H ® } À A N (G) we can only measure an unlabeled (i.e., with involved extremities missing the A/B labels) adjacency {j , k ® } (e.g., for a derived genome G shown in Figure 4 instead of measuring a novel adjacency {3 h A , 7 h B } À A N (G) we measure an unlabeled novel adjacency {3 h , 7 h }). There exist several methods capable of producing the unlabeled novel adjacencies both from a standard short-read bulk sequencing data [51,48,30,10,60] as well as from 3rd-generation sequencing technologies [49,16,52,66,50,27,17].…”
Section: Single Derived Genomementioning
confidence: 99%
“…Via a series of withincompartment molecular biology and subsequent standard steps of library construction and sequencing, barcoded short reads are produced that retain the long-range information of the long fragments of the initial DNA extract. Due to the combination of high base pair-level sequence accuracy and long-range information, 10x/Illumina data therefore support excellent SNP and small indel detection and phasing (Zheng et al 2016), as well as breakpoint detection of large events in cancer (Zheng et al 2016;Spies et al 2017;Elyanow et al 2017). For diploid genome reconstruction, 10x developed the de novo assembler, Supernova, which has been shown to produce whole human genome assemblies from 56-fold coverage 10x/Illumina data (Weisenfeld et al 2017;Zhang et al 2019a).…”
Section: Introductionmentioning
confidence: 99%