proovframe: frameshift-correction for long-read (meta)genomics

Hackl, Thomas; Trigodet, Florian; Eren, A. Murat; Biller, Steven J.; Eppley, John M.; Luo, Elaine; Burger, Andrew; DeLong, Edward F.; Fischer, Matthias

doi:10.1101/2021.08.23.457338

Cited by 19 publications

(18 citation statements)

References 51 publications


“…A commonly adopted solution has been to include short-read data for post-assembly error correction 15,22 , although it increases the cost and complexity overhead. Another solution has been to apply reference-based polishing to correct frameshift errors [23][24][25] but, although this provides a practical solution that enables gene calling, it does not provide true near-finished genomes. Finished microbial genomes, as defined by Bowers et al 2017 in the MIMAG (minimum information about a metagenome-assembled genome) standard 26 , are genomes that have "...a single, validated, contiguous sequence per replicon, without gaps or ambiguities" and "a consensus error rate equivalent to Q50 or better".…”

mentioning

confidence: 99%

Oxford Nanopore R10.4 long-read sequencing enables the generation of near-finished bacterial genomes from pure cultures and metagenomes without short-read or reference polishing

et al. 2022


Long-read Oxford Nanopore sequencing has democratized microbial genome sequencing and enables the recovery of highly contiguous microbial genomes from isolates or metagenomes. However, to obtain near-finished genomes it has been necessary to include short-read polishing to correct insertions and deletions derived from homopolymer regions. Here, we show that Oxford Nanopore R10.4 can be used to generate near-finished microbial genomes from isolates or metagenomes without short-read or reference polishing.
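The "Q50 or better" threshold quoted above from the MIMAG standard is a Phred-scaled quality score, where Q = -10 log10(p) and p is the per-base consensus error probability. The short sketch below converts such a score into an error rate and an expected error count; the function names and the 5 Mbp genome size are illustrative assumptions, not values taken from the cited papers.

```python
def phred_to_error_rate(q: float) -> float:
    """Convert a Phred-scaled quality score Q into a per-base error probability."""
    return 10 ** (-q / 10)


def expected_errors(q: float, genome_length: int) -> float:
    """Expected number of erroneous bases in a sequence of the given length."""
    return phred_to_error_rate(q) * genome_length


if __name__ == "__main__":
    # Hypothetical 5 Mbp bacterial genome, chosen only for illustration.
    print(phred_to_error_rate(50))
    print(expected_errors(50, 5_000_000))
```

Under this definition, Q50 corresponds to one error per 100,000 bases, so a 5 Mbp genome at exactly Q50 would be expected to carry about 50 erroneous bases.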


“…A commonly adopted solution has been to include short-read data for post-assembly error correction 12,19 , although it increases the cost and complexity overhead. Another solution has been to apply reference-based polishing to correct frameshift errors [20][21][22] , but while it provides a practical solution, which allows gene calling, it does not provide true near-perfect genomes.…”

mentioning

confidence: 99%

Oxford Nanopore R10.4 long-read sequencing enables near-perfect bacterial genomes from pure cultures and metagenomes without short-read or reference polishing

Sereika, Kirkegaard, Karst, et al. 2021

Preprint


Long-read Oxford Nanopore sequencing has democratized microbial genome sequencing and enables the recovery of highly contiguous microbial genomes from isolates or metagenomes. However, to obtain near-perfect genomes it has been necessary to include short-read polishing to correct insertions and deletions derived from homopolymer regions. Here, we show that Oxford Nanopore R10.4 can be used to generate near-perfect microbial genomes from isolates or metagenomes without short-read or reference polishing.


“…Full details are available in the original PointFinder 61 methods. We base our modifications to PointFinder on the previously demonstrated observation that frameshifts and stop codons in third-generation assemblies are more likely to reflect sequencing and assembly errors than true sequence variation 62,63 . We modify PointFinder to not halt its search for variants along a resistance loci if it encounters a stop codon.…”

Section: Methods

mentioning

confidence: 99%

“…We modify PointFinder to not halt its search for variants along a resistance loci if it encounters a stop codon. We additionally modify PointFinder to shift alignments around indels, maintaining the reading frame, in an approach similar to more general frameshift correction tools 62,63 . Our modified PointFinder has been incorporated into ResFinder version 4.2 and can be activated with the ‘-ii’ (Ignore Indels) and ‘-ic’ (Ignore stop Codons) flags.…”

Section: Methods

mentioning

confidence: 99%

BugSplit: highly accurate taxonomic binning of metagenomic assemblies enables genome-resolved metagenomics

Chandrakumar, Gauthier, Nelson, et al. 2021

Preprint


A large gap remains between sequencing a microbial community and characterizing all of the organisms inside of it. Here we develop a novel method to taxonomically bin metagenomic assemblies through alignment of contigs against a reference database. We show that this workflow, BugSplit, bins metagenome-assembled contigs to species with a 33% absolute improvement in F1-score when compared to alternative tools. We perform nanopore mNGS on patients with COVID-19, and using a reference database predating COVID-19, demonstrate that BugSplit's taxonomic binning enables sensitive and specific detection of a novel coronavirus not possible with other approaches. When applied to nanopore mNGS data from cases of Klebsiella pneumoniae bacteremia and Neisseria gonorrhoeae infection, BugSplit's taxonomic binning accurately separates pathogen sequences from those of the host and microbiota, and unlocks the possibility of sequence typing, in silico serotyping, and antimicrobial resistance prediction of each organism within a sample. BugSplit is available at https://bugseq.com/academic.
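The citation statements above treat frameshifts in long-read assemblies as likely sequencing or assembly errors. As a rough illustration of what "frame-breaking" means, the sketch below scans a gapped pairwise alignment and reports indel runs whose length is not a multiple of three. It is a toy example under stated assumptions: the function name and the input format (two equal-length strings with "-" for gaps) are hypothetical, and it does not reproduce the algorithms used by proovframe, PointFinder, or ResFinder.

```python
from itertools import groupby


def frame_breaking_indels(ref_aln: str, qry_aln: str):
    """Return (column, kind, length) for each indel run that breaks the reading frame.

    Both inputs are rows of one pairwise alignment, with '-' marking gaps.
    'insertion' means extra bases in the query; 'deletion' means missing bases.
    """
    if len(ref_aln) != len(qry_aln):
        raise ValueError("aligned sequences must have equal length")

    # Label every alignment column by the kind of event it represents.
    labels = []
    for r, q in zip(ref_aln, qry_aln):
        if r == "-" and q != "-":
            labels.append("insertion")
        elif q == "-" and r != "-":
            labels.append("deletion")
        else:
            labels.append("match")

    # Collapse consecutive columns of the same kind into runs and keep
    # only the indel runs whose length is not a multiple of three.
    events = []
    column = 0
    for kind, run in groupby(labels):
        length = len(list(run))
        if kind != "match" and length % 3 != 0:
            events.append((column, kind, length))
        column += length
    return events


if __name__ == "__main__":
    # Hypothetical alignment: one extra base in the query after two codons.
    print(frame_breaking_indels("ATGAAA-CCCTAA", "ATGAAAGCCCTAA"))
```

A whole-codon deletion (three gap columns in the query) is not reported, because it leaves the downstream reading frame intact.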
