2014
DOI: 10.1142/s0219649214500051
|View full text |Cite
|
Sign up to set email alerts
|

Machine Learning Techniques in Web Content Mining: A Comparative Analysis

Abstract: With incessantly growing amount of information published over Web pages, the World Wide Web (WWW) has become prolific in the field of data mining research. The heterogeneous and semi-structured nature of Web data has made the process of automated discovery a challenging issue. Web Content Mining (WCM) essentially uses data mining techniques to effectively discover knowledge from Web page contents. The intent of this study is to provide a comparative analysis of Machine Learning (ML) techniques available in the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
1
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(2 citation statements)
references
References 26 publications
0
1
0
Order By: Relevance
“…Anami, Basavaraj S., Ramesh S. Wadawadagi, and Veerappa B. Pagi. [16] used Self Organizing Maps for the text and document clustering using Bag of Words with n-grams.…”
Section: Related Work:-mentioning
confidence: 99%
“…Anami, Basavaraj S., Ramesh S. Wadawadagi, and Veerappa B. Pagi. [16] used Self Organizing Maps for the text and document clustering using Bag of Words with n-grams.…”
Section: Related Work:-mentioning
confidence: 99%
“…However, current technologies are increasingly interconnected, so the content must be designed for machines to read, rather than just humans [1]. Machine-adapted labeling of content on web pages is a base for automated content extraction, data mining, content transformation, and other needs [2]. However, the existing HTML standard is slowly moving away from presentation over data.…”
Section: Introductionmentioning
confidence: 99%