The large amount of text that is generated daily on the web through comments on social networks, blog posts and open-ended question surveys, among others, demonstrates that text data is used frequently, and therefore; its processing becomes a challenge for researchers. The topic modeling is one of the emerging techniques in text mining; it is based on the discovery of latent data and the search for relationships among text documents. In this paper, the objective of the research is to evaluate a generic methodology based on topic modeling and text network modeling, that allows researchers to gather valuable information from surveys that use open-ended questions. To achieve this, this methodology has been evaluated through the use of a case study in which the responses to a teacher self-assessment survey in an Ecuadorian university have been studied. The main contribution of the article is the inclusion of clustering algorithms in order to complement the results obtained when executing topic modeling. The proposed methodology is based on four phases: (a) Construction of a text database, (b) Text mining and topic modeling, (c) Topic network modeling and (d) The relevance of the identified topics. In previous works, it has been observed that the human interpretative contribution plays an important role in the process, especially in phases (a) and (d). For this reason, the visualization interfaces, such as graphs and dendograms, are of critical importance for researchers in order allow topic to efficiently analyze the results of the topic modeling. As a result of this case study, a compendium of the main strategies that teachers carry out in their classes with the aim of improving student retention is presented. In addition, the proposed methodology can be extended to the analysis of the unstructured textual information found in blogs, social networks, forums, etc. INDEX TERMS Latent Dirichlet allocation, open-ended questions, teacher self-assessment, topic modeling, topic network. DIEGO BUENAÑO-FERNANDEZ received the engineering degree in computer systems from the National Polytechnic School, Quito, in 1999, and the master's degree in business administration from Latin American Christian University, in 2012. He is currently pursuing the Ph.D. degree, in the Ph.D. program, in computer science with the University of Alicante, Spain. He is also the Dean of the Faculty of Engineering and Applied Sciences, Universidad de Las Américas, Quito, Ecuador. He also teaches the subjects operating systems and electronic business. His research line is related to data mining in educational environments.
Information Technology and Communications (ICT) is presented as the main element in order to achieve more efficient and sustainable city resource management, while making sure that the needs of the citizens to improve their quality of life are satisfied. A key element will be the creation of new systems that allow the acquisition of context information, automatically and transparently, in order to provide it to decision support systems. In this paper, we present a novel distributed system for obtaining, representing and providing the flow and movement of people in densely populated geographical areas. In order to accomplish these tasks, we propose the design of a smart sensor network based on RFID communication technologies, reliability patterns and integration techniques. Contrary to other proposals, this system represents a comprehensive solution that permits the acquisition of user information in a transparent and reliable way in a non-controlled and heterogeneous environment. This knowledge will be useful in moving towards the design of smart cities in which decision support on transport strategies, business evaluation or initiatives in the tourism sector will be supported by real relevant information. As a final result, a case study will be presented which will allow the validation of the proposal.
This article is concerned with the basic syntactic and semantic structure of Riau Indonesian, a hitherto undescribed dialect differing in many respects from standard Indonesian and Malay. Riau Indonesian appears to exhibit an abundance of zero-markings of various kinds, in which a wide variety of syntactic constructions and semantic categories lack overt morphosyntactic expression. Two alternative descriptions are provided: an “easy” description, couched in traditional grammatical terminology and a Eurocentric perspective, and a “simple” description, positing a single open syntactic category, and unconstrained rules of semantic interpretation. The latter, simple description, is argued to be superior.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.