NEMICO: Mining Network Data through Cloud-Based Data Mining Techniques

Baralis, Elena; Cagliero, Luca; Cerquitelli, Tania; Chiusano, Silvia; Garza, Paolo; Grimaudo, Luigi; Pulvirenti, Fabio

doi:10.1109/ucc.2014.72

Cited by 3 publications

(2 citation statements)

References 9 publications

(8 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Data mining plays a significant role in data analysis and knowledge extraction [1]; it has become an efficient tool for pattern discovery due to its applicability in a variety of circumstances such as association rule mining (ARM) [2], clustering analysis [3], and classification [4]. Mining frequent patterns (FPs) [2] are fundamental in ARM.…”

Section: Introductionmentioning

confidence: 99%

An Efficient Bit-Based Approach for Mining Skyline Periodic Itemset Patterns

Li,

2023

Electronics

View full text Add to dashboard Cite

Periodic itemset patterns (PIPs) are widely used in predicting the occurrence of periodic events. However, extensive redundancy arises due to a large number of patterns. Mining skyline periodic itemset patterns (SPIPs) can reduce the number of PIPs and guarantee the accuracy of prediction. The existing SPIP mining algorithm uses FP-Growth to generate frequent patterns (FPs), and then identify SPIPs from FPs. Such separate steps lead to a massive time consumption, so we propose an efficient bit-based approach named BitSPIM to mine SPIPs. The proposed method introduces efficient bitwise representations and makes full use of the data obtained in the previous steps to accelerate the identification of SPIPs. A novel cutting mechanism is applied to eliminate unnecessary steps. A series of comparative experiments were conducted on various datasets with different attributes to verify the efficiency of BitSPIM. The experiment results demonstrate that our algorithm significantly outperforms the latest SPIP mining approach.

show abstract

Section: Introductionmentioning

confidence: 99%

An Efficient Bit-Based Approach for Mining Skyline Periodic Itemset Patterns

Li,

2023

Electronics

View full text Add to dashboard Cite

show abstract

“…When dealing with Big Data collections, such as the network datasets, the computational cost of the data mining process (and in some cases the feasibility of the process itself) can potentially become a critical bottleneck in data analysis. To date, parallel and distributed approaches have been adopted to increase efficiency and scalability of network traffic mining algoritms [1], [2], [3], [4].…”

Section: Introductionmentioning

confidence: 99%

SaFe-NeC: A scalable and flexible system for network data characterization

Apiletti

Baralis

Cerquitelli

et al. 2016

NOMS 2016 - 2016 IEEE/IFIP Network Operations and Management Symposium

Self Cite

View full text Add to dashboard Cite

Nowadays, large volumes of data and measurements are being continuously generated by computer and telecommunication networks, but such volumes make it difficult to extract meaningful knowledge from them. This paper presents SaFe-NeC, an innovative methodology for analyzing network traffic by exploiting data mining techniques, i.e. clustering and classification algorithms, focusing on self-learning capabilities of state-of-theart scalable approaches. Self-learning algorithms, coupled with self-assessment indicators and domain-driven semantics enriching data mining results, are able to build a model of the data with minimal user intervention and highlight possibly meaningful interpretations to domain experts. Furthermore, a self-evolving model evaluation phase is included to continuously track the quality degradation of the model itself, whose rebuilding is triggered as soon as quality indicators fall below a threshold of tolerance. The proposed methodology can exploit the computational advantages of distributed computing frameworks, as the current implementation runs on Apache Spark. Preliminary experimental results on a real traffic dataset show the full potential of the proposed methodology to characterize network traffic data.

show abstract

The improved Apriori algorithm based on matrix pruning and weight analysis

Lang¹

2018

AIP Conference Proceedings

View full text Add to dashboard Cite

NEMICO: Mining Network Data through Cloud-Based Data Mining Techniques

Cited by 3 publications

References 9 publications

An Efficient Bit-Based Approach for Mining Skyline Periodic Itemset Patterns

An Efficient Bit-Based Approach for Mining Skyline Periodic Itemset Patterns

SaFe-NeC: A scalable and flexible system for network data characterization

The improved Apriori algorithm based on matrix pruning and weight analysis

Contact Info

Product

Resources

About