2019
DOI: 10.1016/j.jnca.2019.06.009
|View full text |Cite
|
Sign up to set email alerts
|

Analytical composite performance models for Big Data applications

Abstract: In the era of Big Data, whose digital industry is facing the massive growth of data size and development of data intensive software, more and more companies are moving to use new frameworks and paradigms capable of handling data at scale. The outstanding MapReduce (MR) paradigm and its implementation framework, Hadoop are among the most referred ones, and basis for later and more advanced frameworks like Tez and Spark. Accurate prediction of the execution time of a Big Data application helps improving design t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 16 publications
(5 citation statements)
references
References 38 publications
0
5
0
Order By: Relevance
“…In [43], the lumping technique was used to alleviate the scalability issue of predicting YARN environment, introducing a significant error of about 15%. The contribution was built upon the efforts for developing analytical models for single queue scenarios in [44], [45]. Our findings from related work are summarized in Table 9.…”
Section: Related Workmentioning
confidence: 99%
“…In [43], the lumping technique was used to alleviate the scalability issue of predicting YARN environment, introducing a significant error of about 15%. The contribution was built upon the efforts for developing analytical models for single queue scenarios in [44], [45]. Our findings from related work are summarized in Table 9.…”
Section: Related Workmentioning
confidence: 99%
“…The "Action Program for Promoting the Development of Big Data" clearly supports companies in developing third-party data analysis and discovery services, technology outsourcing services, and knowledge process outsourcing services based on big data, and encourages large and medium-sized enterprises to coordinate development and collaborative innovation. Therefore, SMEs can adopt flexible and diverse forms such as third-party leasing and purchasing professional services or using Cloud deployement creatively to achieve comprehensive coverage and improvement of big data infrastructure [6].…”
Section: Increase Investment and Improve Big Data Infrastructurementioning
confidence: 99%
“…Castiglione et al [6] focus on predicting metrics relevant for operating highly concurrent applications in cloud infrastructure such as performance, number of virtual machines or energy efficiency. Aliabadi et al [2] focus on predicting the performance of batch applications in different Big Data analysis frameworks using Stochastic Activity Networks. These approaches for batch processing do not support the metrics required for stream processing, which we focus on.…”
Section: Related Workmentioning
confidence: 99%