2010
DOI: 10.1186/1471-2105-11-599
|View full text |Cite
|
Sign up to set email alerts
|

A genome alignment algorithm based on compression

Abstract: BackgroundTraditional genome alignment methods consider sequence alignment as a variation of the string edit distance problem, and perform alignment by matching characters of the two sequences. They are often computationally expensive and unable to deal with low information regions. Furthermore, they lack a well-principled objective function to measure the performance of sets of parameters. Since genomic sequences carry genetic information, this article proposes that the information content of each nucleotide … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2011
2011
2024
2024

Publication Types

Select...
6
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 11 publications
(8 citation statements)
references
References 44 publications
0
8
0
Order By: Relevance
“…Such an alignment generally assumes a model and a set of parameters of the differences between the sequences. It is widely recognised that the accuracy of the alignment is sensitive to these parameters [ 37 39 ]. However, in the context of real-time analysis of MinION sequencing, it is not possible to select in advance a sensible set of parameters.…”
Section: Methodsmentioning
confidence: 99%
“…Such an alignment generally assumes a model and a set of parameters of the differences between the sequences. It is widely recognised that the accuracy of the alignment is sensitive to these parameters [ 37 39 ]. However, in the context of real-time analysis of MinION sequencing, it is not possible to select in advance a sensible set of parameters.…”
Section: Methodsmentioning
confidence: 99%
“…Such an alignment generally assumes a model and a set of parameters of the differences between the sequences. It is widely recognised that the accuracy of the alignment is sensitive to these parameters [43][44][45]. However, in the context of real-time analysis of MinION sequencing, it is not possible to select in advance a sensible set of parameters.…”
Section: Sensitive Alignment Of Noisy Sequences With Probabilistic Fimentioning
confidence: 99%
“…Also, retrieving Single Nucleotide Polymorphisms (SNP) data-detecting and localizing one base mutations on genomic data [160]. Other application is aligning DNA sequences using compression [161]; Using data compression to detect large transformations between the DNA of different individuals or species, also known as rearrangement detection, has also been shown to work efficiently [162]; It has also been used for efficient storage of data structures in pan-genome analysis, namely using de Bruijn graphs [163,164]. Here, the problem is to deal with large amounts of information and its fast retrieval.…”
Section: Discussionmentioning
confidence: 99%
“…First, the performance of de novo assembly, for example employing de Bruijn graphs [5] to create (parts of) a genome from raw sequence reads without the aid of a reference genome, can be improved by decreasing the memory usage by one order of magnitude [6,7]; Second, through compression, expert models can be trained on a specific sequence and later be exploited in a similar sequence, to produce higher quality aligners [3,8].…”
Section: Introductionmentioning
confidence: 99%