2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC) 2012
DOI: 10.1109/icsmc.2012.6378072

Automatic content extraction and visualization of Thai websites for improved information representation

Abstract: This paper presents an integrated approach that automatically generates a tag-cloud overview of the content of Thai websites. The approach is intended to address information overload by letting users assess from the overview whether the information meets their needs. It combines Web content extraction, Thai word segmentation, and information presentation to generate a Thai-language tag cloud summarizing the key content of a webpage. F…
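The three stages named in the abstract (content extraction, word segmentation, tag-cloud presentation) can be illustrated with a minimal Python sketch. This is not the paper's implementation: the abstract does not say which extraction or segmentation algorithms were used, so the sketch assumes a plain HTML text extractor, a greedy longest-match segmenter over a small caller-supplied dictionary, and linear scaling of font size by word frequency. The names `extract_text`, `segment`, and `tag_weights`, and the font-size range, are hypothetical.

```python
from collections import Counter
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text, skipping <script> and <style> contents."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


def extract_text(html):
    """Stage 1 (assumed): reduce a page to its visible text."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def segment(text, dictionary):
    """Stage 2 (assumed): greedy longest-match segmentation.

    Thai is written without spaces between words, so words are matched
    against a dictionary. Characters that start no known word are skipped.
    """
    if not dictionary:
        return []
    max_len = max(map(len, dictionary))
    words, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            if text[i:i + size] in dictionary:
                words.append(text[i:i + size])
                i += size
                break
        else:
            i += 1  # no dictionary word starts here
    return words


def tag_weights(words, min_size=10, max_size=40):
    """Stage 3 (assumed): map each word to a font size by frequency."""
    counts = Counter(words)
    if not counts:
        return {}
    low, top = min(counts.values()), max(counts.values())
    span = max(top - low, 1)  # avoid division by zero when all counts match
    return {
        word: min_size + (count - low) * (max_size - min_size) // span
        for word, count in counts.items()
    }
```

Calling `tag_weights(segment(extract_text(page), dictionary))` yields a word-to-font-size mapping that a renderer could draw as a tag cloud. A production system would likely replace the greedy matcher with a dedicated Thai tokenizer and filter stop words before counting.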

Citations: cited by 3 publications (1 citation statement)
References: 17 publications
“…Although English and non-English news pages have similar subjects and layouts, the reader modes were not activated. There are several studies tried to resolve this problem by creating a new algorithm or a model for the local language [7]- [9]. However, it is difficult to create new methods for all languages owing to the shortage of developers fluent in each language, especially low-resource languages.…”
Section: Introduction
confidence: 99%