KOAS: Korean Text Offensiveness Analysis System

Park, San-Hee; Kim, Kang-Min; Cho, Seonhee; Park, Jun Hyung; Park, Hyuntae; Kim, Hyuna; Chung, Seongwon; Lee, Sangkeun

doi:10.18653/v1/2021.emnlp-demo.9

Cited by 16 publications

(21 citation statements)

References 13 publications

“…Korean stop words were downloaded from three well-known sources and merged [1]. To find the nouns, the Open Korean Text part-of-speech tagger offered in the Korean NLP in Python module in Python (Park and Cho, 2014) was used. These preprocessed nouns were then converted into single- (unigrams) and double-word (bigrams) combinations for analysis.…”

Section: Methods (mentioning)

confidence: 99%

Everyday life information seeking in South Korea during the COVID-19 pandemic: daily topics of information needs in social Q&A

Kim

2022

OIR

Purpose: This study investigated information needs on COVID-19 by identifying topics discussed on social questions and answers (Q&A) about daily routines, problems, and health issues for survival. A layered model of contexts for everyday life information seeking (ELIS) was adapted for interpreting topics to better understand the contexts in which users could relate information needs.

Design/methodology/approach: Questions and answers posted on Naver Knowledge-iN were collected and analyzed during the first nine months following the outbreak. Time distribution, topic modeling, and association rule mining were applied to examine the topics on COVID-19 and their temporal variation.

Findings: Numerous topics related to the cognitive context (symptoms and masks) and situational contexts (international affairs, financial support, study, and work) were discovered. Topics related to social context were discussed moderately, but the number of questions on this topic increased with time. Strong associations were observed between terms related to symptoms, indicating their importance as a COVID-19 topic in health.

Originality/value: This study investigated topics of information needs using social Q&A data in which not only information inquiry but also information sharing coexist. The findings can help bridge the theory of ELIS to topic modeling in practice. The insights gained from this study can be used by information service providers for developing guidance and programs about how to survive during a pandemic.

Peer review: The peer review history for this article is available at: https://publons.com/publon/10.1108/OIR-10-2021-0547.
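The preprocessing described in the citation statement for this article (stop-word removal, then unigram and bigram construction over extracted nouns) might look roughly like the sketch below. It is an illustration only, not the study's code: the stop-word set, the sample noun list, and the helper names `remove_stop_words` and `ngrams` are all hypothetical. The noun list stands in for the output of a noun extractor such as KoNLPy's `Okt().nouns(text)`, which is left out so that the example runs without a Java dependency.

```python
from collections import Counter
from typing import Iterable, List, Set, Tuple

# Hypothetical stop-word set; the cited study merged three external lists.
STOP_WORDS: Set[str] = {"것", "수", "등"}


def remove_stop_words(nouns: Iterable[str], stop_words: Set[str]) -> List[str]:
    """Drop every noun that appears in the stop-word set, keeping order."""
    return [noun for noun in nouns if noun not in stop_words]


def ngrams(tokens: List[str], n: int) -> List[Tuple[str, ...]]:
    """Return all runs of n consecutive tokens as tuples."""
    return list(zip(*(tokens[i:] for i in range(n))))


# Assumed output of a noun extractor for a single post (made-up data).
nouns = ["코로나", "것", "증상", "마스크", "수", "증상"]

filtered = remove_stop_words(nouns, STOP_WORDS)
unigrams = Counter(ngrams(filtered, 1))  # single-word counts
bigrams = Counter(ngrams(filtered, 2))   # adjacent word-pair counts
```

Because stop words are removed before pairing, a bigram here can join two nouns that were separated by a stop word in the original text. Whether the cited study made the same choice is not stated in the excerpt.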

“…It was demanding to manually investigate the plethora of text and comments of news articles, so natural language processing (NLP) procedures, including (1) tokenization, (2) stop words, and (3) stemming, were used in this study with the assistance of the Korean natural language processing in the Python (KoNLPy) package, 17,18 Korean natural language processing procedures were performed in a form that allows morphological analysis.…”

Section: Methods (mentioning)

confidence: 99%

Contents and sentiment analysis of newspaper articles and comments on telemedicine in Korea: Before and after of COVID-19 outbreak

Kang

Song

2022

Health Informatics J

Telemedicine is rapidly growing to meet the increased needs for high-quality health care during the COVID-19 pandemic. However, telemedicine is still a sensitive issue as it is related to medical privatization. The use of telemedicine after the COVID-19 outbreak might be influenced by public opinion, and this may be an important key in implementing telemedicine. In this study, we aimed to assess if telemedicine-related newspaper articles and comments changed positively during the COVID-19 pandemic. From January 1, 2019, to March 1, 2020 (before COVID-19), a total of 1073 telemedicine-related articles were found in the Korean news network. Although the post-COVID-19 article collection period (from March 2, 2020, to September 30, 2020) was about half that of the pre-COVID-19, about twice the number (1934) of telemedicine-related articles were collected. And telemedicine-related news articles had a more positive tone post-COVID-19 than pre-COVID-19 (52.9% after vs 40.4% before). In conclusion, this study presented the association between the COVID-19 outbreak and changes in the media’s perception of telemedicine in Korea. This study presented that, as telemedicine begins to be utilized due to COVID-19, news media and readers who embrace it are beginning to view telemedicine positively, suggesting that COVID-19 has a positive foundation for the spread of telemedicine.

“…Finally, use thirdparty word segmentation tools to perform word segmentation tasks. Our study choose Jieba [18] for Chinese segmentation, konlpy for Korean [19] and Sudachipy for Japanese [20].…”

Section: B Oov Understanding (mentioning)

confidence: 99%

Graph Embedding-based Matching Multilingual Out-of-Vocabulary Terms on Social Media

Gu

Jung

2023

Preprint

Our study aims to detect multilingual Out-of-Vocabulary (OOV) terms and to match them across languages. Based on the original OOV issue, many multilingual OOVs also emerged at the same time. In order to solve this problem, this paper proposes a graph embedding-based matching among multilingual OOV. The method is divided into two parts. The first part is to extract OOV from the network corpus and understand it. In the second part, the OOV in the first part is taken as the target node, and the understood part is taken as the feature node of the target node to construct the graph and embed the graph. Our study uses Chinese, Korean, and Japanese for the experiment. The method proposed in our study reached an F1-score of 93.94%. Our study also compares this method with other embedding algorithms, and its F1-score is higher than the average F1-score of the other algorithms by 9.62%.
