Algorithms for extracting motifs from biological weighted sequences

Iliopoulos, Costas S.; Perdikuri, Katerina; Theodoridis, Evangelos; Tsakalidis, Athanasios K.; Tsichlas, Kostas

doi:10.1016/j.jda.2006.03.018

Cited by 7 publications

(5 citation statements)

References 24 publications

(32 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The weighted suffix tree for a given weighted X=X [1]X [2]...X[n], of length n can be built by following the steps given below.…”

Section: Construction Of Weighted Suffix Treementioning

confidence: 99%

“…The molecular weighted sequence shown in ( Figure. 1) is found in the numerous applications of computational molecular biology [1] and it is defined as sequence of either nucleotides or amino acids, where each character in every position is assigned a certain weight. In computational biology few important biological processes such as DNA assembly process [12], pattern matching, and identification of repeated patterns in biological weighted sequences are modeled by molecular weighted sequences and also very help full in the translation of gene expression and regulation.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

A Dynamic Approach to Weighted Suffix Tree Construction Algorithm

Pandey¹,

Niyogi²,

Mittal³

2011

IJDPS

View full text Add to dashboard Cite

In present time weighted suffix tree is consider as a one of the most important existing data structure used for analyzing molecular weighted sequence. Although a static partitioning based parallel algorithm existed for the construction of weighted suffix tree, but for very long weighted DNA sequences it takes significant amount of time. However, in our implementation of dynamic partition based parallel weighted suffix tree construction algorithm on cluster computing makes it possible to significantly accelerate the construction of weighted suffix tree.

show abstract

“…The weighted suffix tree for a given weighted X=X [1]X [2]...X[n], of length n can be built by following the steps given below.…”

Section: Construction Of Weighted Suffix Treementioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

A Dynamic Approach to Weighted Suffix Tree Construction Algorithm

Pandey¹,

Niyogi²,

Mittal³

2011

IJDPS

View full text Add to dashboard Cite

show abstract

“…They may correspond to approximate repetitions randomly dispersed along the sequence, or to repetitions that occur in a periodic or approximately periodic fashion. The length and number of repeated elements one wishes to be able to identify may be highly variable [18].…”

Section: Introductionmentioning

confidence: 99%

Enhanced Self-Organizing Map Neural Network for DNA Sequence Classification

Mohamed¹,

Al-Mehdhar²,

Bamatraf³

et al. 2013

IIM

View full text Add to dashboard Cite

The artificial neural networks (ANNs), among different soft computing methodologies are widely used to meet the challenges thrown by the main objectives of data mining classification techniques, due to their robust, powerful, distributed, fault tolerant computing and capability to learn in a data-rich environment. ANNs has been used in several fields, showing high performance as classifiers. The problem of dealing with non numerical data is one major obstacle prevents using them with various data sets and several domains. Another problem is their complex structure and how hands to interprets. Self-Organizing Map (SOM) is type of neural systems that can be easily interpreted, but still can't be used with non numerical data directly. This paper presents an enhanced SOM structure to cope with non numerical data. It used DNA sequences as the training dataset. Results show very good performance compared to other classifiers. For better evaluation both micro-array structure and their sequential representation as proteins were targeted as dataset accuracy is measured accordingly.

show abstract

“…The same technique adopted to find out mutations that triggers a disease and is also a substantial part of tracing the evolution of a certain organism [2]. The dynamic programming based smith-waterman algorithm is considered as the only comparison algorithm guaranteed that return an optimal result which is suitable for both of protein and DNA sequences [4]. However, this algorithm took considerable amount of time even to compare two small length sequences and also not suitable for molecular weighted sequences.…”

Section: Introductionmentioning

confidence: 99%

“…Thus, we try to alleviate the limitation of this algorithm in terms of its execution time by implementing extended burrow wheeler transform based molecular weighted sequence comparison algorithm. 13 The implementation of this intuitive idea for large weighted molecular sequence [4] can have take enormous amount of computation time and memory. This can be proved by considering a practical example of comparing molecular weighted sequence of 300 positions with molecular weighted pattern of 10 positions and in each position there is a possibility of four characters.…”

Section: Introductionmentioning

confidence: 99%