2015 IEEE International Conference on Information Reuse and Integration 2015
DOI: 10.1109/iri.2015.12
|View full text |Cite
|
Sign up to set email alerts
|

A Multi-dimensional Comparison of Toolkits for Machine Learning with Big Data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
6
4

Relationship

1
9

Authors

Journals

citations
Cited by 33 publications
(13 citation statements)
references
References 22 publications
0
12
0
Order By: Relevance
“…New reviews are constantly being added to large repositories of reviews across various websites at a high rate, over 1.5 million per month in the case of Yelp 4 . Consequently, distributed and streaming applications of machine learning algorithms 14 should be explored to effectively model the large corpus of online reviews which exist in the real world [38]. Mahout has been used for large-scale recommendation systems [39], which would be useful to apply to review spam detection, as reviewers may be related to each other on different review websites.…”
Section: Comparative Analysis and Suggestionsmentioning
confidence: 99%
“…New reviews are constantly being added to large repositories of reviews across various websites at a high rate, over 1.5 million per month in the case of Yelp 4 . Consequently, distributed and streaming applications of machine learning algorithms 14 should be explored to effectively model the large corpus of online reviews which exist in the real world [38]. Mahout has been used for large-scale recommendation systems [39], which would be useful to apply to review spam detection, as reviewers may be related to each other on different review websites.…”
Section: Comparative Analysis and Suggestionsmentioning
confidence: 99%
“…As these tools have advantages and drawbacks, and many have overlapping features, deciding on which framework to use is not easy. In this regard, several papers provide comparisons between some of these tools including MLlib [51,69].…”
Section: Research Highlightsmentioning
confidence: 99%
“…They implemented IAL (Independent Assisted Living) and patient monitoring system. Richter et al, (2015) have conducted several comparative studies on toolkits for machine learning in big data, including Mahout MapReduce, Mahout Samsara, Spark MLlib, H2O, and SAMOA. The study showed that on average, Spark MLlib and H2O has better performances than other toolkits in terms of the extensibility, scalability, usability, fault tolerance and speed.…”
Section: Related Workmentioning
confidence: 99%