2018
DOI: 10.1093/bioinformatics/bty426
|View full text |Cite
|
Sign up to set email alerts
|

RIFRAF: a frame-resolving consensus algorithm

Abstract: Supplementary data are available at Bioinformatics online.

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
3
1

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 26 publications
0
5
0
Order By: Relevance
“…If the amplicon spans a coding sequence, then Rifraf.jl (12) can be used to infer a frame-shift corrected template sequence, as long as a reference sequence with a trusted reading frame is available.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…If the amplicon spans a coding sequence, then Rifraf.jl (12) can be used to infer a frame-shift corrected template sequence, as long as a reference sequence with a trusted reading frame is available.…”
Section: Methodsmentioning
confidence: 99%
“…(3) show that, for a 2.6 kb amplicon, under their quality filtering conditions, 80% of the errors are indels and 20% are substitution errors, and the indel errors are concentrated in homopolymer regions, increasing in rate with the length of the homopolymer. While high indel rates can be computationally challenging to deal with, since sequence alignment can be slow, they are favorable from a statistical perspective, because the errors appear in predictable places, making them more correctable (12).…”
Section: Introductionmentioning
confidence: 99%
“…We exploit kmer seeding (k=30), and this approximate pairwise alignment algorithm scales linearly with sequence length. If the amplicon spans a coding sequence, then Rifraf.jl (10) can be used to infer a frame-shift corrected template sequence, as long as a reference sequence with a trusted reading frame is available.…”
Section: C3 Fine Cluster Splittingmentioning
confidence: 99%
“…(3) show that, for a 2.6kb amplicon, with a post-filtering error rate of 0.5%, 80% are indel and 20% substitution errors, and the indel errors are concentrated in homopolymer regions, increasing in rate with the length of the homopolymer. While high indel rates can be computationally challenging to deal with, since sequence alignment can be slow, they are favorable from a statistical perspective, because the errors appear in predictable places, making them more correctable (10). Amplicon denoising (11)(12)(13)(14)(15)(16)(17) refers to a process that takes a large set of reads, corrupted by sequencing errors, and attempts to distill the noiseless variants and their frequencies.…”
Section: Introductionmentioning
confidence: 99%
“…In future applications of CIDER-Seq where reference genomes are available (i.e. applications apart from conventional metagenomics), this low-level of frameshift error can be further corrected using existing long-read sequencing-specific frameshift-correction algorithms such as Frame-Pro (29) or RIFRAF (30).…”
Section: Resultsmentioning
confidence: 99%