2014
DOI: 10.1186/1471-2105-15-339
|View full text |Cite
|
Sign up to set email alerts
|

An algorithm of discovering signatures from DNA databases on a computer cluster

Abstract: BackgroundSignatures are short sequences that are unique and not similar to any other sequence in a database that can be used as the basis to identify different species. Even though several signature discovery algorithms have been proposed in the past, these algorithms require the entirety of databases to be loaded in the memory, thus restricting the amount of data that they can process. It makes those algorithms unable to process databases with large amounts of data. Also, those algorithms use sequential mode… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2015
2015
2017
2017

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(6 citation statements)
references
References 30 publications
0
6
0
Order By: Relevance
“…Here we present a signatures-based search method and the corresponding software tool, matK -QR Classifier on the Windows platform (http://www.neeri.res.in/matk_classifier/index.htm), which is different from the early tools in the set of predictive nucleotide patterns and the method of sequence classification. Molecular signatures (here we referred as a regular expression) are defined as DNA nucleotide motifs that are unique and present in target species but different from other species sequences [25]. …”
Section: Introductionmentioning
confidence: 99%
“…Here we present a signatures-based search method and the corresponding software tool, matK -QR Classifier on the Windows platform (http://www.neeri.res.in/matk_classifier/index.htm), which is different from the early tools in the set of predictive nucleotide patterns and the method of sequence classification. Molecular signatures (here we referred as a regular expression) are defined as DNA nucleotide motifs that are unique and present in target species but different from other species sequences [25]. …”
Section: Introductionmentioning
confidence: 99%
“…If the detection limit is improved in one of the branches, the worse results of other branches can be ignored when combining them-an idea derived from Lee and Sheu (2014).…”
Section: Discussionmentioning
confidence: 99%
“…Normal mode analysis has been calculated using the coarse-grained Elastic Network Model (ENM)[ 23 ],[ 24 ],[ 25 ],[ 26 ],[ 27 ]. Normal modes and molecular dynamics, with the help of powerful computational tools that allow further structural and dynamics analysis[ 28 ],[ 29 ], become standard techniques for analyzing and enhancing our understanding of molecular mechanisms. Briefly, ENM represents the N residues of a protein by their α-carbons (nodes), connected by uniform springs to their neighbours within a cut-off distance r c .…”
Section: Methodsmentioning
confidence: 99%