Seedling developmental defects upon blocking CINNAMATE‐4‐HYDROXYLASE are caused by perturbations in auxin transport

The volume of unstructured text data generated by various social media has been increasing rapidly; therefore, use of text mining to support decision making has also been increasing. Especially, issue Clustering-determining a new relation with various issues through clustering-has gained attention from many researchers. However, traditional issue clustering methods can only be performed based on the co-occurrence frequency of issue keywords in many documents. Therefore, an association between issues that have a low co-occurrence frequency cannot be discovered using traditional issue clustering methods, even if those issues are strongly related in other perspectives. Therefore, issue clustering that fits each of criteria needs to be performed by the perspective of analysis and the purpose of use. In this study, a multi-dimensional issue clustering is proposed to overcome the limitation of traditional issue clustering. We assert, specifically in this study, that issue clustering should be performed for a particular purpose. We analyze the results of applying our methodology to two specific perspectives on issue clustering, (i) consumers' interests, and (ii) related R&D terms.

show abstract

Improving Performance of Recommendation Systems Using Topic Modeling

Choi¹,

Hyun²,

Kim³

2015

Journal of Intelligence and Information Systems

View full text Add to dashboard Cite

Detecting Spam Data for Securing the Reliability of Text Analysis

Hyun¹,

Kim²

2017

The Journal of Korean Institute of Communications and Informati

View full text Add to dashboard Cite

Recently, tremendous amounts of unstructured text data that is distributed through news, blogs, and social media has gained much attention from many researchers and practitioners as this data contains abundant information about various consumers' opinions. However, as the usefulness of text data is increasing, more and more attempts to gain profits by distorting text data maliciously or nonmaliciously are also increasing. This increase in spam text data not only burdens users who want to obtain useful information with a large amount of inappropriate information, but also damages the reliability of information and information providers. Therefore, efforts must be made to improve the reliability of information and the quality of analysis results by detecting and removing spam data in advance. For this purpose, many studies to detect spam have been actively conducted in areas such as opinion spam detection, spam e-mail detection, and web spam detection. In this study, we introduce core concepts and current research trends of spam detection and propose a methodology to detect the spam tag of a blog as one of the challenging attempts to improve the reliability of blog information.

show abstract

A Methodology For Investigating Public Opinion Using Multilevel Text Analysis

Wong

Lim²,

Hyun³

et al. 2015

View full text Add to dashboard Cite

Issue Reorganization Using The Measure Of Relevance

Shun¹,

Hyun²,

Kim³

et al. 2014

View full text Add to dashboard Cite

Detecting blog spam hashtags using topic modeling

Hyun

Kim

2016

View full text Add to dashboard Cite

Methodology for Issue-related R&D Keywords Packaging Using Text Mining

Hyun¹,

Shun²,

Kim³

2015

Journal of Internet Computing and Services

View full text Add to dashboard Cite

Considerable research efforts are being directed towards analyzing unstructured data such as text files and log files using commercial and noncommercial analytical tools. In particular, researchers are trying to extract meaningful knowledge through text mining in not only business but also many other areas such as politics, economics, and cultural studies. For instance, several studies have examined national pending issues by analyzing large volumes of text on various social issues. However, it is difficult to provide successful information services that can identify R&D documents on specific national pending issues. While users may specify certain keywords relating to national pending issues, they usually fail to retrieve appropriate R&D information primarily due to discrepancies between these terms and the corresponding terms actually used in the R&D documents. Thus, we need an intermediate logic to overcome these discrepancies, also to identify and package appropriate R&D information on specific national pending issues. To address this requirement, three methodologies are proposed in this study-a hybrid methodology for extracting and integrating keywords pertaining to national pending issues, a methodology for packaging R&D information that corresponds to national pending issues, and a methodology for constructing an associative issue network based on relevant R&D information. Data analysis techniques such as text mining, social network analysis, and association rules mining are utilized for establishing these methodologies. As the experiment result, the keyword enhancement rate by the proposed integration methodology reveals to be about 42.8%. For the second objective, three key analyses were conducted and a number of association rules between national pending issue keywords and R&D keywords were derived. The experiment regarding to the third objective, which is issue clustering based on R&D keywords is still in progress and expected to give tangible results in the future.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yoonjin Hyun

Discovery of Market Convergence Opportunity Combining Text Mining and Social Network Analysis: Evidence from Large-Scale Product Databases

A Multi-Dimensional Issue Clustering from the Perspective Consumers' Interests and R&D

Improving Performance of Recommendation Systems Using Topic Modeling

Detecting Spam Data for Securing the Reliability of Text Analysis

A Methodology For Investigating Public Opinion Using Multilevel Text Analysis

Issue Reorganization Using The Measure Of Relevance

Detecting blog spam hashtags using topic modeling

Methodology for Issue-related R&D Keywords Packaging Using Text Mining

Contact Info

Product

Resources

About