2021
DOI: 10.1088/1742-6596/1828/1/012015
|View full text |Cite
|
Sign up to set email alerts
|

Low-code AutoML-augmented Data Pipeline – A Review and Experiments

Abstract: There is a lack of knowledge concerning the low-code autoML (automated machine learning) frameworks that can be used to enrich data for several purposes concerning either data engineering or software engineering. In this paper, 34 autoML frameworks have been reviewed based on the latest commits and augmentation properties of their GitHub content. The PyCaret framework was the result of the review due to requirements concerning adaptability by Google Colaboratory (Colab) and the BI (business intelligence) tool.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
16
0
2

Year Published

2021
2021
2024
2024

Publication Types

Select...
8

Relationship

0
8

Authors

Journals

citations
Cited by 39 publications
(21 citation statements)
references
References 1 publication
0
16
0
2
Order By: Relevance
“…Eleven ML models were built using PyCaret, an open-source wrapper over several ML libraries in Python in a low-code environment. 11 After screening of the 11 models, we adopted 2 excellent ensemble learning methods: an extra tree classifier (model_ET) and a light gradient boosting machine (model_LGBM), 12 , 13 which uses many random decision trees and built a majority vote-like system. A brief explanation of the ensemble learning is provided in Supplemental File 1 .…”
Section: Methodsmentioning
confidence: 99%
“…Eleven ML models were built using PyCaret, an open-source wrapper over several ML libraries in Python in a low-code environment. 11 After screening of the 11 models, we adopted 2 excellent ensemble learning methods: an extra tree classifier (model_ET) and a light gradient boosting machine (model_LGBM), 12 , 13 which uses many random decision trees and built a majority vote-like system. A brief explanation of the ensemble learning is provided in Supplemental File 1 .…”
Section: Methodsmentioning
confidence: 99%
“…PyCaret is a Python-based machine learning framework for automating machine learning workflows [11,13]. Anomaly detection is the process of discovering unusual things, events, or observations that raise concerns because they differ significantly from the overall set of data points.…”
Section: Pycaret -Anomaly Detection Modulementioning
confidence: 99%
“…However, when data streams vary over time, standard approaches become ineffective, necessitating the use of an outlier identification algorithm that can handle dynamic data streams. PyCaret is a Python machine learning package that automates machine learning operations and is open-source [13]. Low-code autoML framework PyCaret (automatic machine learning) can be utilized to enhance data for a variety of applications, but there is a dearth of expertise about them.…”
Section: Introductionmentioning
confidence: 99%
“…PyCaret dapat juga dikustomisasi untuk keperluan tertentu seperti yang dilakukan oleh (do Nascimento et al, 2021) untuk keperluan analisis prediktif pada laboratorium kesehatan. PyCaret merupakan salah satu dari banyak framework autoML lainnya seperti PyTorch, Theano, dan lain sebagainya (Gain & Hotti, 2021). Menurut riset (Gain & Hotti, 2021), PyCaret memiliki keunggulan karena ia kompatibel dengan Google Colaboratory (Colab) dan aplikasi BI (business intelligence).…”
Section: Metodeunclassified
“…PyCaret merupakan salah satu dari banyak framework autoML lainnya seperti PyTorch, Theano, dan lain sebagainya (Gain & Hotti, 2021). Menurut riset (Gain & Hotti, 2021), PyCaret memiliki keunggulan karena ia kompatibel dengan Google Colaboratory (Colab) dan aplikasi BI (business intelligence). PyCaret sendiri memiliki beberapa modul yaitu klasifikasi, regresi, clustering, Association Rule Mining, deteksi anomali, dan Natural Language Processing.…”
Section: Metodeunclassified