2011 International Conference on Machine Learning and Cybernetics 2011
DOI: 10.1109/icmlc.2011.6016978

Thai word segmentation for visualization of Thai Web sites

Abstract: Information overload is a problem in the Information Age, and information visualization is an approach to provide an overview of the content of a web site. A tag cloud is one way to represent information as an image of a group of words. However, there are limitations on tag cloud generation, and one of them is due to the characteristics of the language. In order to extract tags or words for a tag cloud, word segmentation is required. This paper proposes a Thai word segmentation approach for the vi…
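
The abstract is truncated before it describes the proposed approach, so the sketch below does not reproduce the paper's method. It only illustrates why segmentation has to come before tag extraction for text written without spaces, using greedy dictionary-based longest matching, a common baseline. The function name `segment_longest_match`, the tiny dictionary, and the sample sentence are all hypothetical.

```python
from collections import Counter


def segment_longest_match(text, dictionary):
    """Greedy longest-match segmentation (illustrative baseline only).

    At each position, take the longest dictionary word that starts there.
    If no dictionary word matches, emit a single character and move on.
    """
    max_len = max((len(word) for word in dictionary), default=1)
    tokens = []
    i = 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if candidate in dictionary:
                break
        else:
            # No dictionary entry starts here: fall back to one character.
            candidate = text[i]
        tokens.append(candidate)
        i += len(candidate)
    return tokens


# Hypothetical toy dictionary: cat, eat, fish, sleep.
toy_dictionary = {"แมว", "กิน", "ปลา", "นอน"}
sample_text = "แมวกินปลาแมวนอน"  # written without spaces, as Thai text is

tokens = segment_longest_match(sample_text, toy_dictionary)
# Word frequencies are the raw material for sizing tags in a tag cloud.
tag_counts = Counter(tokens)
print(tokens)
print(tag_counts.most_common(1))
```

Greedy matching depends entirely on dictionary coverage and can pick the wrong boundary when words overlap, which is one reason the choice of segmentation approach matters for tag quality.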

Cited by 5 publications (4 citation statements)
References 14 publications
“…However, unlike Chinese and Japanese, Thai WS did not receive much research attention. There are only six notable publications (Chormai et al, 2019; Nararatwong et al, 2018; Noyunsan et al; Thanadechteemapat and Fung; Tongtep and Theeramunkong) on Thai WS for the past ten years. On the other hand, there are at least eight papers from well-established conferences on Chinese and Japanese WS (Li et al, 2019; Aguirre and Aguiar, 2019; Ma et al, 2018; Gong et al, 2017; Chen et al, 2017; Zhou et al, 2017; Cai et al, 2017) within only the last two years.…”
Section: Introduction (mentioning)
confidence: 99%
“…The main problem is the lack of spacing and separation between the Thai words, and Thai word segmentation is required in order to present the key words in the tag clouds. However, this paper only focuses on the Web content extraction technique and solutions for the issue of Thai word segmentation have been reported by the authors [3] and other researchers in other venues. The structure of this paper starts with an introduction on the background and aims of this paper.…”
Section: Introduction (mentioning)
confidence: 99%
“…This technique has already been published in [23] and the objective is to segment Thai words in the extracted key content. The corpus should be verified whether the segmented words included in the corpus are consistent before it is utilized.…”
Section: B Thai Word Segmentation (mentioning)
confidence: 99%
“…The tags can be generated by two methods [23]. The first one is to use pre-defined words from a database, which are usually created by the Web content authors.…”
Section: Information Presentation Based On Tag Cloud (mentioning)
confidence: 99%