Hanqing Hu scite author profile

Many real‐world data mining applications have to deal with unlabeled streaming data. They are unlabeled because the sheer volume of the stream makes it impractical to label a significant portion of the data. The data streams can evolve over time and these changes are called concept drifts. Concept drifts have different characteristics, which can be used to categorize them into different types. A trade‐off between performance and cost exists among many concept drift detection approaches. On the one hand, high accuracy detection approach usually requires labeled data, possibly involving high cost for labeling. On the other hand, a variety of methods have been devoted to the topic of concept drift detection with unlabeled data, but these approaches often are most suited for only a subset of the concept drift types. The objective of this survey is to present these methods, categorize them and give recommendations of usage based on their behaviors under different types of concept drift. This article is categorized under: Fundamental Concepts of Data and Knowledge > Data Concepts Fundamental Concepts of Data and Knowledge > Key Design Issues in Data Mining Explainable AI > Classification

show abstract

Smart Preprocessing Improves Data Stream Mining

Kantardzic

2016

View full text Add to dashboard Cite

Fuzzy comprehensive evaluation on high-quality development of China’s rural economy based on entropy weight

2020

IFS

View full text Add to dashboard Cite

Factors Influencing Information Industry Development in a Coastal Economy

Jin

et al. 2019

Journal of Coastal Research

View full text Add to dashboard Cite

Selecting samples for labeling in unbalanced streaming data environments

Kantardzic

Sethi

2013

View full text Add to dashboard Cite

A Graph Database of Yelp Dataset Challenge 2018 and Using Cypher for Basic Statistics and Graph Pattern Exploration

Kronmueller

Chang

et al. 2018

View full text Add to dashboard Cite

Detecting Different Types of Concept Drifts with Ensemble Framework

Kantardzic

Lyu

2018

View full text Add to dashboard Cite

12 3 4 5

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Hanqing Hu

Orthogonally adapted Harris hawks optimization for parameter estimation of photovoltaic models

No Free Lunch Theorem for concept drift detection in streaming data classification: A review

Smart Preprocessing Improves Data Stream Mining

Fuzzy comprehensive evaluation on high-quality development of China’s rural economy based on entropy weight

Factors Influencing Information Industry Development in a Coastal Economy

Selecting samples for labeling in unbalanced streaming data environments

A Graph Database of Yelp Dataset Challenge 2018 and Using Cypher for Basic Statistics and Graph Pattern Exploration

Detecting Different Types of Concept Drifts with Ensemble Framework

Contact Info

Product

Resources

About