Characterising text mining: a systematic mapping review of the Portuguese language

Souza, Ellen; Costa, D. G. C.; Castro, Dayvid; Vitório, Douglas; Teles, Ingryd; Almeida, Rafaela Duque de; Alves, Tiago L.; Oliveira, Adriano L. I.; Gusmão, Cristine

doi:10.1049/iet-sen.2016.0226

Cited by 11 publications

(9 citation statements)

References 20 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Despite significant advances, there are challenges; for instance, due to informal language, idioms, and culturally specific terms, there are few comprehensive linguistic models for different domains and geographic areas [Khurana et al 2023, Pedroso et al 2022]. [Souza et al 2018] conducted a systematic mapping of studies related to the application of text mining to the Portuguese language from 1996 to 2014. The study used an automated search approach in digital libraries and a manual search in several conference proceedings held in Brazil (e.g., PROPOR, BraSNAM, and STIL).…”

Section: Related Workmentioning

confidence: 99%

“…The limitations of algorithms and tools for these languages are an important obstacle in this scenario. Likewise, few studies have focused on Brazilian events, as in the case of [Souza et al 2018], whose mapping covered only up to 2014, and [Júnior et al 2020] which was based on studies of international conferences. Therefore, the need for a systematic mapping directed to the NLP in social media analysis comes from the lack of works that show the state of the art focused on Brazilian academic events, in order to fill this gap and provide a comprehensive view of the state of the art in the national context.…”

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

Natural Language Processing and Social Media: a systematic mapping on Brazilian leading events

Araújo,

Leite,

Silva

et al. 2023

Anais Do XX Encontro Nacional De Inteligência Artificial E Computacional (ENIAC 2023)

View full text Add to dashboard Cite

The number of social media platforms has increased significantly, as well as the number of active users. More than 18.2 million text messages are transmitted every minute on these platforms. Given the amount of data available, Natural Language Processing (NLP) techniques have been used by several researchers to analyze this large amount of unstructured data. Thus, it is essential to understand social media analysis’s main trends and challenges. From this perspective, this study presents a systematic mapping of NLP for social media analysis considering papers published in five well-established academic Brazilian events: BRACIS, BraSNAM, ENIAC, STIL, and PROPOR. The study aims to identify the main tools and techniques used, tasks performed, data sources, and evaluation measures. For this purpose, 186 studies were analyzed and carefully selected among the 654 papers published in these events in the three years (2020 to 2022). The results show a glimpse of the current scenario on the subject and point out areas that can be improved in future research with techniques for tasks such as text classification, sentiment analysis, and named-entity recognition. Therefore, this work can be helpful for academics interested in exploring the potential NLP for social media analysis and having a clear view of gaps, challenges, and research opportunities in this area. Nevertheless, it should guide the productive sector in this knowledge transfer, reducing the gap between the state of the art and practice, consequently increasing the competitiveness and innovation of social media analysis tools.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Natural Language Processing and Social Media: a systematic mapping on Brazilian leading events

Araújo,

Leite,

Silva

et al. 2023

Anais Do XX Encontro Nacional De Inteligência Artificial E Computacional (ENIAC 2023)

View full text Add to dashboard Cite

show abstract

“…Previamente, três algoritmos de AM foram avaliados para a construc ¸ão do classificador de sentimentos: Multinomial Naïve Bayes (MNB), Support Vector Machine (SVM) e Random Forest (RF). Essa escolha se deu pelo fato de que estes são os três algoritmos mais utilizados para classificac ¸ão de texto em Português [Souza et al 2018]. Além disso, seis combinac ¸ões de técnicas de pré-processamento também foram avaliadas: 1) unigram; 2) bigram; 3) unigram + bigram; 4) unigram + remoc ¸ão de stopwords; 5) bigram + remoc ¸ão de stopwords; 6) unigram + bigram + remoc ¸ão de stopwords.…”

Section: Avaliac ¸ãO Dos Classificadoresunclassified

Análise do posicionamento dos usuários do Twitter acerca da Vacinação Infantil contra a COVID-19 no Brasil

Vitório¹,

Albuquerque²,

Souza³

et al. 2022

Anais Do XI Brazilian Workshop on Social Network Analysis and Mining (BraSNAM 2022)

Self Cite

View full text Add to dashboard Cite

Com o início da vacinação contra a COVID-19, esta se tornou um assunto bastante debatido nas redes sociais. Porém, a vacinação infantil só veio a ter início cerca de um ano após a vacinação de adultos, o que acabou adiando esse debate específico para o fim de 2021. Este trabalho, portanto, visa analisar o posicionamento, se favorável ou contrário, dos usuários do Twitter no Brasil acerca da aplicação das vacinas nas crianças de 5 a 11 anos. Utilizando técnicas de Análise de Sentimentos, pôde-se perceber que a maior parte dos usuários se mostrou favorável ao início da vacinação no país. Também foram levantados pontos que levaram os usuários a se posicionar daquela forma.

show abstract

“…We have strictly followed the guidelines proposed by Kitchenham and Charters, 23 Kitchenham et al, 22 Petersen et al, 24 and Petersen et al 25 to achieve an impartial review. These guidelines have been widely adopted in SLR, surveys, and SMS 41‐44 …”

Section: Planning and Conducting The Mappingmentioning

confidence: 99%

Design of frameworks for self‐adaptive service‐oriented applications: A systematic analysis

et al. 2021

View full text Add to dashboard Cite

Self‐adaptive service‐oriented Applications (Self‐Apps) must be able to understand themselves or the environment in which they are executed, and propose solutions to meet changing conditions. The development of these applications is not a trivial task, since it encompasses issues from different research areas. Despite the importance of frameworks for Self‐Apps, there is a lack of comprehensive analysis of how the design of such applications is performed, and regarding the standardization of concepts and coverage of minimum requirements for Self‐Apps. The main contribution of this article is to present this comprehensive analysis, providing the state of the art for this subject. This analysis was built through a Systematic Mapping Study, based on a total of 65 studies, from which we identify the main attributes for Quality of Service (QoS), search strategies, and service management strategies employed in the design of frameworks for Self‐Apps. The main aspects of requirements involved in the design of Self‐Apps were pointed out to stakeholders. For example, these applications must implement a method for evaluation of QoS based on metrics. We also put forward the S‐Frame, a modular solution that brings together the main features for the design of Self‐Apps, and describe the main challenges concerning these applications.

show abstract

Characterising text mining: a systematic mapping review of the Portuguese language

Cited by 11 publications

References 20 publications

Natural Language Processing and Social Media: a systematic mapping on Brazilian leading events

Natural Language Processing and Social Media: a systematic mapping on Brazilian leading events

Análise do posicionamento dos usuários do Twitter acerca da Vacinação Infantil contra a COVID-19 no Brasil

Design of frameworks for self‐adaptive service‐oriented applications: A systematic analysis

Contact Info

Product

Resources

About