2019
DOI: 10.1007/s00500-019-03796-9
|View full text |Cite
|
Sign up to set email alerts
|

Performance analysis of efficient data distribution in P2P environment using hybrid clustering techniques

Abstract: In this paper, K-means algorithm has been applied for distributed large data using hybrid clustering techniques. K-means is a simple and scalable algorithm which can be applied on large datasets. It is one of the well-known unsupervised clustering algorithms that fail in providing structured to unstructured data to enable extraction of valuable information. Peer-to-peer (P2P) technologies divide the data or resources between the peers for managing the network bandwidth, network participants and processing powe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 19 publications
(16 reference statements)
0
2
0
Order By: Relevance
“…Transaction records from purchase transactions, sensor data, and log data are all examples of data that is created by machines (i.e., web logs, click logs, email logs). The most significant sources of big data include buy transaction records, online data, social media data, click stream data, mobile phone GPS signals, and sensor data [33]. It is the amount of data that cannot be stored and processed by a single computer that is referred to as big data.…”
Section: Multivariate Empirical Mode Decomposition-based Gradient Sup...mentioning
confidence: 99%
“…Transaction records from purchase transactions, sensor data, and log data are all examples of data that is created by machines (i.e., web logs, click logs, email logs). The most significant sources of big data include buy transaction records, online data, social media data, click stream data, mobile phone GPS signals, and sensor data [33]. It is the amount of data that cannot be stored and processed by a single computer that is referred to as big data.…”
Section: Multivariate Empirical Mode Decomposition-based Gradient Sup...mentioning
confidence: 99%
“…These platforms offer opportunities for new modes of production and resource allocation, scalable technological infrastructures, and a deeper focus on sustainability (Bauwens et al, 2017). More, according to Raju et al (2019) P2P technologies "divide the data or resources between the peers for managing the network bandwidth, network participants and processing powers. During the data distribution process in the P2P environments, accuracy, computation complexity and distributed clustering accuracy are the important issues as they reduce the entire system performance" (p.1).…”
Section: Introductionmentioning
confidence: 99%