2019
DOI: 10.1093/bioinformatics/btz376
|View full text |Cite
|
Sign up to set email alerts
|

TideHunter: efficient and sensitive tandem repeat detection from noisy long-reads using seed-and-chain

Abstract: Motivation Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) sequencing technologies can produce long-reads up to tens of kilobases, but with high error rates. In order to reduce sequencing error, Rolling Circle Amplification (RCA) has been used to improve library preparation by amplifying circularized template molecules. Linear products of the RCA contain multiple tandem copies of the template molecule. By integrating additional in silico processing steps, these tandem sequ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
31
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 29 publications
(31 citation statements)
references
References 28 publications
0
31
0
Order By: Relevance
“…We first used Tandem Repeat Finder (TRF; Benson 1999) to identify satellite sequences in our assemblies that are enriched in putative centromeric regions (i.e., ends of telocentric chromosomes and scaffolding breakpoints in the middle of metacentric chromosomes). Additionally, we identify short simple satellites (<20 bp) directly from raw Illumina reads using k--Seek , and we also identify tandem repeats directly from long--read data (Nanopore and PacBio) using TideHunter (Gao et al 2019). The latter two approaches provided analyses of satellite and repeat enrichment independent of our genome assemblies.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…We first used Tandem Repeat Finder (TRF; Benson 1999) to identify satellite sequences in our assemblies that are enriched in putative centromeric regions (i.e., ends of telocentric chromosomes and scaffolding breakpoints in the middle of metacentric chromosomes). Additionally, we identify short simple satellites (<20 bp) directly from raw Illumina reads using k--Seek , and we also identify tandem repeats directly from long--read data (Nanopore and PacBio) using TideHunter (Gao et al 2019). The latter two approaches provided analyses of satellite and repeat enrichment independent of our genome assemblies.…”
Section: Resultsmentioning
confidence: 99%
“…S5). Using this pipeline, our TRF analysis recovered the same 21--bp and 99--bp motif (or higher order variants of the two motifs) in D. miranda, with a higher order variant (Benson 1999), k--Seek , and TideHunter (Gao et al 2019) for each Drosophila species. HOR = higher order repeat.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…Few TR detection methods for long-read sequencing data have been developed so far. Examples include PacmonSTR [ 9 ], NCRF [ 10 ], TideHunter [ 11 ], NanoSatellite [ 12 ], and Tandem-genotypes [ 13 ]. However, these tools have some limitations, either because they are technology-specific (PacmonSTR and NanoSatellite), because they are not intended to be used genome-wide (NCRF, TideHunter, and NanoSatellite), or because they require substantial preprocessing steps preventing their large-scale use (Tandem-genotypes).…”
Section: Introductionmentioning
confidence: 99%
“…Multiple consensus sequences are then generated by applying the heaviest bundling algorithm to the alignment graph . Recently, with the advent and growing popularity of long-read sequencing technologies on the Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) platforms, there is a renewed appreciation and interest of POA, and this algorithm is now broadly used for error correction and assembly of error-prone long reads (Loman et al, 2015;Vaser et al, 2017;Volden et al, 2018;Gao et al, 2019;Ruan and Li, 2019).…”
Section: Introductionmentioning
confidence: 99%