2014
DOI: 10.1093/nar/gku473
|View full text |Cite
|
Sign up to set email alerts
|

RAMICS: trainable, high-speed and biologically relevant alignment of high-throughput sequencing reads to coding DNA

Abstract: The challenge presented by high-throughput sequencing necessitates the development of novel tools for accurate alignment of reads to reference sequences. Current approaches focus on using heuristics to map reads quickly to large genomes, rather than generating highly accurate alignments in coding regions. Such approaches are, thus, unsuited for applications such as amplicon-based analysis and the realignment phase of exome sequencing and RNA-seq, where accurate and biologically relevant alignment of coding reg… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
6

Relationship

2
4

Authors

Journals

citations
Cited by 7 publications
(7 citation statements)
references
References 55 publications
0
7
0
Order By: Relevance
“…Sequences with a primer ID that was represented in fewer than three reads were discarded, along with those containing degenerate bases. The remaining sequences from each time point were aligned with MAFFT, Muscle, or RAMICS ( 60 ). Phylogenetic trees were drawn with FastTree v2.1 ( 61 ) and visualized using the python library ete3 ( 62 ).…”
Section: Methodsmentioning
confidence: 99%
“…Sequences with a primer ID that was represented in fewer than three reads were discarded, along with those containing degenerate bases. The remaining sequences from each time point were aligned with MAFFT, Muscle, or RAMICS ( 60 ). Phylogenetic trees were drawn with FastTree v2.1 ( 61 ) and visualized using the python library ete3 ( 62 ).…”
Section: Methodsmentioning
confidence: 99%
“…A custom python script was used to bin all reads containing an identical Primer ID tag, align the reads within each bin using MAFFT 32 and produce a consensus sequence based on a majority rule. Sequences with just one Primer ID representative, as well as consensus sequences derived from less than three reads and those containing degenerate bases, were excluded and the remaining sequences from each time point aligned with MAFFT, Muscle 33 or RAMICS 34 . The calculation of amino acid frequencies and hamming distances (for each sequence relative to the consensus sequence from the first time point –inferred PI virus), were performed using custom python scripts.…”
Section: Methodsmentioning
confidence: 99%
“…PhiX reads not removed by the Illumina MiSeq reporter software (version 3) or through duplicate removal were filtered by mapping all reads to PhiX-174 using RAMICS (Wright and Travers, 2014). Similarly, reads matching the human genome (Hg19; http://tinyurl.com/jay436s) were filtered using consecutively Bowtie2 and RAMICS.…”
Section: Methodsmentioning
confidence: 99%