2017
DOI: 10.1186/s13059-017-1319-7
|View full text |Cite
|
Sign up to set email alerts
|

Alignment-free sequence comparison: benefits, applications, and tools

Abstract: Alignment-free sequence analyses have been applied to problems ranging from whole-genome phylogeny to the classification of protein families, identification of horizontally transferred genes, and detection of recombined sequences. The strength of these methods makes them particularly useful for next-generation sequencing data processing and analysis. However, many researchers are unclear about how these methods work, how they compare to alignment-based methods, and what their potential is for use for their res… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

3
444
0
5

Year Published

2018
2018
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 442 publications
(477 citation statements)
references
References 183 publications
3
444
0
5
Order By: Relevance
“…Another alternative is UPGMA, though as a hierarchical clustering method it would not be expected to recover the same topology as a phylogenetic method (but perhaps the same clusters). However, in the present era, we see the main advantage of this class of methods as being able to avoid having to create an alignment from mapping 49 . If one is able to calculate genetic distances from assemblies or even directly from reads, the relatively costly and challenging step of creating a large multiple sequence alignment can be avoided.…”
Section: Resultsmentioning
confidence: 99%
“…Another alternative is UPGMA, though as a hierarchical clustering method it would not be expected to recover the same topology as a phylogenetic method (but perhaps the same clusters). However, in the present era, we see the main advantage of this class of methods as being able to avoid having to create an alignment from mapping 49 . If one is able to calculate genetic distances from assemblies or even directly from reads, the relatively costly and challenging step of creating a large multiple sequence alignment can be avoided.…”
Section: Resultsmentioning
confidence: 99%
“…The short length and low sequence identity of defensin sequences can make them challenging to study using traditional methods such as phylogenetics . Repeated mutation at the same location can obscure phylogenetic relationships by saturation effects and long branch attraction .…”
Section: Introductionmentioning
confidence: 99%
“…[16][17][18] The short length and low sequence identity of defensin sequences can make them challenging to study using traditional methods such as phylogenetics. 19 Repeated mutation at the same location can obscure phylogenetic relationships by saturation effects and long branch attraction. 20 It has not been possible to resolve the phylogeny of the whole superfamily; although, smaller phylogenies of the α-, βand big-defensin families have been published previously.…”
mentioning
confidence: 99%
“…They first identify homologous regions between two genomes, at either the DNA or protein level, and then use these alignments to build phylogenetic trees to infer relationships (Konstantinidis and Tiedje, 2005) (Parks et al, 2018). This approach has several limitations, including low sensitivity to detect remotely related species, slow computing, and low scalability (reviewed in (Zielezinski et al, 2017)). To overcome these limitations, several alignment-free methods were developed that are both computationally efficient and resistant to noise (Zielezinski et al, 2017;Ren et al, 2018;Jain et al, 2018).…”
Section: Introductionmentioning
confidence: 99%
“…This approach has several limitations, including low sensitivity to detect remotely related species, slow computing, and low scalability (reviewed in (Zielezinski et al, 2017)). To overcome these limitations, several alignment-free methods were developed that are both computationally efficient and resistant to noise (Zielezinski et al, 2017;Ren et al, 2018;Jain et al, 2018). Based on their methodologies, there are three main categories of alignment-free genome similarity comparisons (reviewed in (Zielezinski et al, 2017)).…”
Section: Introductionmentioning
confidence: 99%