2022
DOI: 10.3390/math10193517
|View full text |Cite
|
Sign up to set email alerts
|

Performance Evaluation of Query Plan Recommendation with Apache Hadoop and Apache Spark

Abstract: Access plan recommendation is a query optimization approach that executes new queries using prior created query execution plans (QEPs). The query optimizer divides the query space into clusters in the mentioned method. However, traditional clustering algorithms take a significant amount of execution time for clustering such large datasets. The MapReduce distributed computing model provides efficient solutions for storing and processing vast quantities of data. Apache Spark and Apache Hadoop frameworks are used… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 21 publications
0
3
0
Order By: Relevance
“…Venturing into the realm of natural language processing, the transformer model [82] gained widespread acclaim. It established unparalleled standards in terms of discerning the dependencies between sequences, which paved the way for swift parallel computations and accelerated sequence information extraction [83].…”
Section: Long-and Short-term Sequence Recommendationmentioning
confidence: 99%
“…Venturing into the realm of natural language processing, the transformer model [82] gained widespread acclaim. It established unparalleled standards in terms of discerning the dependencies between sequences, which paved the way for swift parallel computations and accelerated sequence information extraction [83].…”
Section: Long-and Short-term Sequence Recommendationmentioning
confidence: 99%
“…Data locality and Hadoop [10] parameter tuning in dependable and uniform cluster environments make up the majority of the related attempt for enhancing Map reduce performance. [11] Cautioned have been used to categorize this system.…”
Section: Literature Surveymentioning
confidence: 99%
“…For example, Spark can be used to run big data queries, so query-wise performance prediction is also important. Azhir et al [14] used Spark and Hadoop to cluster query datasets of various sizes and evaluate query performance. Yadav et al [15] analyzed the impact of data size on the query execution time for Spark, which is a popular big data query framework.…”
Section: Related Workmentioning
confidence: 99%