2020 5th International Conference on Computer and Communication Systems (ICCCS) 2020
DOI: 10.1109/icccs49078.2020.9118442
|View full text |Cite
|
Sign up to set email alerts
|

Implementation of Distributed Crawler System Based on Spark for Massive Data Mining

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 6 publications
0
3
0
Order By: Relevance
“…Chaturvedi et al [95] 2020 evolution mining and change mining are used to changing distributed systems in this study.…”
Section: Discussion and Comparisonmentioning
confidence: 99%
See 1 more Smart Citation
“…Chaturvedi et al [95] 2020 evolution mining and change mining are used to changing distributed systems in this study.…”
Section: Discussion and Comparisonmentioning
confidence: 99%
“…Web crawlers were maturely contributed with major search engines including search fields during a period of the rapid growth of Internet technologies and rising social desires of people [21]. F. Liu and W. Xin [95] demonstrates the architecture incorporation of the Spark-based distributed crawler system, provides a corresponding framework diagram, which presents the distributed framework platform in depth utilizing Spark's RDD elastic computational model and task assignment algorithm. However, we can fix the issue of insufficient resource consumption and poor collection performance using this Spark-based distributed crawler method, and then resolve the contradiction between the present exponential growth of data size as well as the speed of collecting information [96,97].…”
Section: Literature Reviewmentioning
confidence: 99%
“…The mechanism makes DPoS faster than normal proof of stake (PoS), and PBFT runs efficiently with the use of fewer preselected generals. The whole calculation process is similar to the spark mechanism, [55][56][57] but we use the blockchain consensus mechanism and HP-B to replace the functions of the central server (driver) and resident distributed data set in the spark mechanism to improve the security of the model training process, strengthen the verification of the node submission model, and maintain the consistency of the storage model parameters.…”
Section: Dpos + Pbft Consensus Mechanismmentioning
confidence: 99%