2020
DOI: 10.12928/telkomnika.v18i2.14883

Genomic repeats detection using Boyer-Moore algorithm on Apache Spark Streaming

Abstract: Genomic repeats, i.e., pattern searching in the string processing process to find repeated base pairs in the order of Deoxyribonucleic Acid (DNA), requires a long processing time. This research builds a big-data computational model to look for patterns in strings by modifying and implementing the Boyer-Moore algorithm on Apache Spark Streaming for human DNA sequences from the Ensemble site. Moreover, we perform some experiments on cloud computing by varying different specifications of computer clusters with in…
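To make the pattern-searching step in the abstract concrete, here is a minimal sketch of a textbook Boyer-Moore search that uses only the bad-character rule, applied to a short DNA-like string. It is not the authors' modified algorithm and does not involve Apache Spark Streaming; the function names `bad_character_table` and `boyer_moore_search` and the sample sequences are assumptions made for illustration.

```python
def bad_character_table(pattern):
    """Map each character to its last (rightmost) index in the pattern."""
    return {ch: i for i, ch in enumerate(pattern)}


def boyer_moore_search(text, pattern):
    """Return the start index of every occurrence of pattern in text.

    Illustrative sketch: only the bad-character rule is used (no good-suffix
    rule), and the window moves by one position after a full match so that
    overlapping repeats are also reported.
    """
    m, n = len(pattern), len(text)
    if m == 0 or m > n:
        return []

    last = bad_character_table(pattern)
    matches = []
    s = 0  # current alignment of the pattern against the text
    while s <= n - m:
        # Compare right to left, as Boyer-Moore does.
        j = m - 1
        while j >= 0 and pattern[j] == text[s + j]:
            j -= 1
        if j < 0:
            matches.append(s)
            s += 1
        else:
            # Shift so the mismatched text character lines up with its last
            # occurrence in the pattern, or skip past it if it is absent.
            s += max(1, j - last.get(text[s + j], -1))
    return matches


if __name__ == "__main__":
    # Hypothetical sample sequence, not data from the paper.
    print(boyer_moore_search("ACGTACGTAC", "ACGT"))
```

In a distributed setting such a function would typically be applied to chunks of the sequence independently, with chunk boundaries overlapping by at least the pattern length minus one so that matches spanning two chunks are not missed; whether the paper handles boundaries this way is not stated in the abstract.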

Cited by 2 publications (2 citation statements)
References 19 publications
“…RDDs are well-suited for a diversity of applications. Figure 1 presents the spark-cluster framework [10]. A spark comprises a driver node that is equivalent to a master node and several worker nodes that are correspondent to slave nodes.…”
Section: Background of the Study, 2.1 Spark (mentioning)
confidence: 99%
“…This means that the proposed method in this research is an improvement of the previous one. The other study involving Apache Spark on discovery patterns was performed by Jiang et al [13], Riza et al [14], and Pérez-Chacón et al [15].…”
Section: Introduction (mentioning)
confidence: 99%