2021
DOI: 10.1108/ijicc-10-2020-0157
|View full text |Cite
|
Sign up to set email alerts
|

Improving the performance of query processing using proposed resilient distributed processing technique

Abstract: PurposeResilient distributed processing technique (RDPT), in which mapper and reducer are simplified with the Spark contexts and support distributed parallel query processing.Design/methodology/approachThe proposed work is implemented with Pig Latin with Spark contexts to develop query processing in a distributed environment.FindingsQuery processing in Hadoop influences the distributed processing with the MapReduce model. MapReduce caters to the works on different nodes with the implementation of complex mappe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 16 publications
0
1
0
Order By: Relevance
“…With the increasing demand for data storage and computing performance, Hadoop is not sufficient to satisfy the requirements, which is designed to be deployed on low-cost hardware and is highly fault-tolerant, while some performance has been sacrificed. But the appearance of Apache Spark and Apache Flink, making up for this shortcoming to a large extent by maintaining task state in memory, greatly accelerates the speed of data processing, and makes the ecosystem of Hadoop more mature (Lakshmi and Usha Rani, 2021). Through the investigation of the published literature, the research about Apache Spark and Apache Flink is becoming popular, while research of Hadoop goes through a downward tendency from 2016.…”
Section: Discussionmentioning
confidence: 99%
“…With the increasing demand for data storage and computing performance, Hadoop is not sufficient to satisfy the requirements, which is designed to be deployed on low-cost hardware and is highly fault-tolerant, while some performance has been sacrificed. But the appearance of Apache Spark and Apache Flink, making up for this shortcoming to a large extent by maintaining task state in memory, greatly accelerates the speed of data processing, and makes the ecosystem of Hadoop more mature (Lakshmi and Usha Rani, 2021). Through the investigation of the published literature, the research about Apache Spark and Apache Flink is becoming popular, while research of Hadoop goes through a downward tendency from 2016.…”
Section: Discussionmentioning
confidence: 99%