2019
DOI: 10.34218/ijcet.10.3.2019.002
|View full text |Cite
|
Sign up to set email alerts
|

Machine Learning Algorithms for Heterogeneous Data: A Comparative Study

Abstract: In the present digital era massive amount of data is being continuously generated at exceptional and increasing scales. This data has become an important and indispensable part of every economy, industry, organization, business and individual. Further handling of these large datasets due to the heterogeneity in their formats is one of the major challenge. There is a need for efficient data processing techniques to handle the heterogeneous data and also to meet the computational requirements to process this hug… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(4 citation statements)
references
References 20 publications
0
4
0
Order By: Relevance
“…Concerning SVM, it gives poor performance with large dataset (Nataraja and Ramesh, 2019). Finally, VB and GLM are often inaccurate when the data are heterogeneous and when the responses are non-linear (Nataraja and Ramesh, 2019). Other studies from agriculture-related sectors also reported similar trend in the performance of machine learning algorithms.…”
Section: Discussionmentioning
confidence: 82%
See 2 more Smart Citations
“…Concerning SVM, it gives poor performance with large dataset (Nataraja and Ramesh, 2019). Finally, VB and GLM are often inaccurate when the data are heterogeneous and when the responses are non-linear (Nataraja and Ramesh, 2019). Other studies from agriculture-related sectors also reported similar trend in the performance of machine learning algorithms.…”
Section: Discussionmentioning
confidence: 82%
“…Moreover, the numerical features in our dataset are in very different scales, it is necessary to normalize them for ANNs and KNN to reduce effects of disparate ranges (Sarle et al, 1991;Han et al, 2011;Ali et al, 2019). Concerning SVM, it gives poor performance with large dataset (Nataraja and Ramesh, 2019). Finally, VB and GLM are often inaccurate when the data are heterogeneous and when the responses are non-linear (Nataraja and Ramesh, 2019).…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…On the other hand, integrating different methods by compounding their strength definitely would be more beneficial for overcoming this challenge related to heterogeneous data analysis. [ 150 ] For example, a novel CNN‐based algorithm based on both structured and unstructured data from hospitals has been proposed to predict chronic disease outbreaks in disease‐prone areas. The data heterogeneity problem was resolved in this study, and the disease prediction accuracy was 94.8%.…”
Section: Challenging Issues Faced With Ai and Microfluidics For Biote...mentioning
confidence: 99%