2020 6th International Conference on Science and Technology (ICST) 2020
DOI: 10.1109/icst50505.2020.9732824
|View full text |Cite
|
Sign up to set email alerts
|

Data Cleansing Processing using Pentaho Data Integration: Case Study Data Deduplication

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 3 publications
0
1
0
Order By: Relevance
“…The authors have also given an idea about the tools used for Big Data storage and processing along with the algorithms used in the Big Data processing. Hadoop, HDFS [3], CDH [4], Mon-goDB [5], Apache spark [6], Apache Solr [7], [8],Alteryx [9] Designer, Data Meer [9], Google Big Query [9] etc are the tools used to store process and analyze the Big Data and Support Vector Machine [10], Neural Network, Logistic regression [11], Linear Regression [11], Nearest Neighbor, Decision tree, Naive Bayes are the Algorithms used in healthcare analysis.…”
Section: Introductionmentioning
confidence: 99%
“…The authors have also given an idea about the tools used for Big Data storage and processing along with the algorithms used in the Big Data processing. Hadoop, HDFS [3], CDH [4], Mon-goDB [5], Apache spark [6], Apache Solr [7], [8],Alteryx [9] Designer, Data Meer [9], Google Big Query [9] etc are the tools used to store process and analyze the Big Data and Support Vector Machine [10], Neural Network, Logistic regression [11], Linear Regression [11], Nearest Neighbor, Decision tree, Naive Bayes are the Algorithms used in healthcare analysis.…”
Section: Introductionmentioning
confidence: 99%