2018
DOI: 10.1093/bioinformatics/bty586
|View full text |Cite
|
Sign up to set email alerts
|

Fast characterization of segmental duplications in genome assemblies

Abstract: MotivationSegmental duplications (SDs) or low-copy repeats, are segments of DNA > 1 Kbp with high sequence identity that are copied to other regions of the genome. SDs are among the most important sources of evolution, a common cause of genomic structural variation and several are associated with diseases of genomic origin including schizophrenia and autism. Despite their functional importance, SDs present one of the major hurdles for de novo genome assembly due to the ambiguity they cause in building and trav… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
57
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
3
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 65 publications
(58 citation statements)
references
References 47 publications
1
57
0
Order By: Relevance
“…We identified segmental duplications in the Zoey and CanFam3.1 assemblies based on assembly self-alignment (43) and read-depth (44) approaches (see Supplementary Information, Section 3).…”
Section: Annotation Of Genome Featuresmentioning
confidence: 99%
“…We identified segmental duplications in the Zoey and CanFam3.1 assemblies based on assembly self-alignment (43) and read-depth (44) approaches (see Supplementary Information, Section 3).…”
Section: Annotation Of Genome Featuresmentioning
confidence: 99%
“…SegDups are two or more large genomic duplications (≥1-Kb long with ≥90% identity), but their numbers have not been determined so far (Fromer et al, 2012;Numanagic et al, 2018). In short-read sequencing data analysis, SegDups are known to be one of the main sources of false variant calls (Gong et al, 2020).…”
Section: Segdup Overlapping Ratio For Autosomal Cnvsmentioning
confidence: 99%
“…The Jaccard coefficient is generally used to measure the similarity of two discrete objects. Numanagic et al proposed the SEDEF framework based on the Jaccard coefficient, which can accurately predict segmental duplications (SDs) [40]. Wallace et al introduced the Jaccard coefficient into the prediction of disease-disease relationship and deduced the information of the interaction network [41].…”
Section: Jaccard Similarity Indexmentioning
confidence: 99%