News Relation Discovery Based on Association Rule Mining with Combining Factors

Kittiphattanabawon, Nichnan; Theeramunkong, Thanaruk; Nantajeewarawat, Ekawit

doi:10.1587/transinf.e94.d.404

Cited by 6 publications

(2 citation statements)

References 24 publications

(24 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…A representative pattern is generated in either linear-or exponential scale. As the graphic relationship, the relation discovered among news documents [29] is presented on the measurement values with its confidence and the size ratio.…”

Section: Measure Characterization By Pairwise Comparisonmentioning

confidence: 99%

Characterization of Interestingness Measures Using Correlation Analysis and Association Rule Mining

Somyanonthanakul

Theeramunkong

2020

IEICE Trans. Inf. & Syst.

Self Cite

View full text Add to dashboard Cite

Objective interestingness measures play a vital role in association rule mining of a large-scaled database because they are used for extracting, filtering, and ranking the patterns. In the past, several measures have been proposed but their similarities or relations are not sufficiently explored. This work investigates sixty-one objective interestingness measures on the pattern of A → B, to analyze their similarity and dissimilarity as well as their relationship. Three-probability patterns, P(A), P(B), and P(AB), are enumerated in both linear and exponential scales and each measure's values of those conditions are calculated, forming synthesis data for investigation. The behavior of each measure is explored by pairwise comparison based on these three-probability patterns. The relationship among the sixty-one interestingness measures has been characterized with correlation analysis and association rule mining. In the experiment, relationships are summarized using heat-map and association rule mined. As the result, selection of an appropriate interestingness measure can be realized using the generated heat-map and association rules.

show abstract

Section: Measure Characterization By Pairwise Comparisonmentioning

confidence: 99%

Characterization of Interestingness Measures Using Correlation Analysis and Association Rule Mining

Somyanonthanakul

Theeramunkong

2020

IEICE Trans. Inf. & Syst.

Self Cite

View full text Add to dashboard Cite

show abstract

“…However, most works on these approaches focused on classification, not clustering. Some researchers suggested a combined term weighting to avoid bias from a single source of information [17]. While most works on term weighting focused on classification, it is still an open question on which statistical approaches are suitable for unsupervised learning, like the clustering task.…”

Section: Introductionmentioning

confidence: 99%

Improving Seeded k-Means Clustering with Deviation- and Entropy-Based Term Weightings

Buatoom

Kongprawechnon

Theeramunkong

2020

IEICE Trans. Inf. & Syst.

Self Cite

View full text Add to dashboard Cite

The outcome of document clustering depends on the scheme used to assign a weight to each term in a document. While recent works have tried to use distributions related to class to enhance the discrimination ability. It is worth exploring whether a deviation approach or an entropy approach is more effective. This paper presents a comparison between deviation-based distribution and entropy-based distribution as constraints in term weighting. In addition, their potential combinations are investigated to find optimal solutions in guiding the clustering process. In the experiments, the seeded k-means method is used for clustering, and the performances of deviation-based, entropy-based, and hybrid approaches, are analyzed using two English and one Thai text datasets. The result showed that the deviation-based distribution outperformed the entropy-based distribution, and a suitable combination of these distributions increases the clustering accuracy by 10%.

show abstract

Effect of Weighting Factors and Unit-Selection Factors on Text Summarization

Ketui

Theeramunkong

2014

Lecture Notes in Computer Science

View full text Add to dashboard Cite

News Relation Discovery Based on Association Rule Mining with Combining Factors

Cited by 6 publications

References 24 publications

Characterization of Interestingness Measures Using Correlation Analysis and Association Rule Mining

Characterization of Interestingness Measures Using Correlation Analysis and Association Rule Mining

Improving Seeded k-Means Clustering with Deviation- and Entropy-Based Term Weightings

Effect of Weighting Factors and Unit-Selection Factors on Text Summarization

Contact Info

Product

Resources

About