2020
DOI: 10.1016/j.micpro.2020.103425
|View full text |Cite|
|
Sign up to set email alerts
|

WITHDRAWN: Comparative Research on Active Learning of Big Aata based on Mapreduce and Spark

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 20 publications
0
3
0
Order By: Relevance
“…In this paper, we propose a new hybrid method for generating a data warehouse that can meet the decision-making needs of decision-makers. In the first phase, the new method exploits the speed of the spark framework-which is much faster than MapReduce according to several research works [17]- [26] in analyzing large amounts of unstructured and distributed data, to generate a general schema for each collection. This allows to extract the structure of a large amount of data in a reasonable time, thus revealing the richness of the data stored in a document-oriented database.…”
Section: Methodsmentioning
confidence: 99%
“…In this paper, we propose a new hybrid method for generating a data warehouse that can meet the decision-making needs of decision-makers. In the first phase, the new method exploits the speed of the spark framework-which is much faster than MapReduce according to several research works [17]- [26] in analyzing large amounts of unstructured and distributed data, to generate a general schema for each collection. This allows to extract the structure of a large amount of data in a reasonable time, thus revealing the richness of the data stored in a document-oriented database.…”
Section: Methodsmentioning
confidence: 99%
“…As the big data technology develops, and the improvement of public policy evaluation technology continues in the public management, the evaluation of public value is still immature, and public value cannot be fully expressed. Especially for those abstract public values, such as government integrity and procedural justice, although we can know its importance, it is difficult to analyze them through which data [8][9][10]. Even if the relevant data are collected, the authenticity and reliability of the data cannot be guaranteed, which requires our continuous improvement and development in practice.…”
Section: Introductionmentioning
confidence: 99%
“…The dependency between the requests was not synchronized by the traditional mapreduce based scheduling approaches which reduced the efficiency of scheduling. Hadoop was an open source implementation, which was used for map reduction for application development and processing for computing distribution [7] But there is a several important issues remains unsolved [8]. These limitations were overcome by executing spark based computing in which the inter dependencies between the requests were analyzed and optimized to achieve effective query scheduling and has the ability to process huge data faster [9].…”
Section: Introductionmentioning
confidence: 99%