Context: The amount and diversity of data have increased drastically in recent years. However, in certain situations, the data on which a Machine Learning model was trained differ significantly from the data it is tested on, a problem known as Concept Drift (CD). Because CD can be a serious issue, there has been a wealth of research on how to detect and work around it. However, most of the literature focuses on classification tasks. Objective: To conduct a Systematic Literature Review (SLR) of CD in the context of regression. Research questions: How can CD be detected, and how can CD-handling techniques be built for regression problems using machine learning? Method: We ran an automatic search process on reference databases, selecting papers from 2010 to August 2020, following the methodological process proposed by Kitchenham and Charters (2007). Results: We selected 41 papers. Drift detection methods based on ensembles and neural networks, notably OS-ELM, were the most frequent in the selected papers and were reported to perform best. However, only two papers confirm such superiority statistically. Furthermore, we identify open CD problems such as the batch size, drift points, and where drift happens. Conclusions: This SLR highlights the existing literature on CD applied to regression.