Édgar Casasola Murillo scite author profile

Abstract: Mining and exploitation of data in social networks has been the focus of many efforts, but despite the resources and energy invested, still remains a lot for doing given its complexity, which requires the adoption of a multidisciplinary approach.Specifically, on what concerns to this research, the content of the texts published regularly, and at a very rapid pace, at sites of microblogs (eg Twitter.com) can be used to analyze global and local trends. These trends are marked by microblogs emerging topics that are distinguished from others by a sudden and accelerated rate of posts related to the same topic; in other words, by an increment of popularity in relatively short periods, a day or a few hours, for example Wanner et al.The problem, then, is twofold, first to extract the topics, then to identify which of those topics are trending. A recent solution, known as Bursty Biterm Topic Model (BBTM) is an algorithm for identifying trending topics, with a good level of performance in Twitter, but it requires great amount of computer processing. Hence, this research aims to evaluate if it is possible to reduce the amount of processing required and getting equally good results. This reduction carry out by a discrimination of co-occurrences of words (biterms) used by BBTM to model trending topics. In contrast to our previous work, in this research, we carry on a more complete and exhaustive set of experiments. Spanish Abstract: La minería y explotación de datos contenidos en las redes sociales ha sido foco de múltiples esfuerzos. Sin embargo, a pesar de los recursos y energía invertidos aún queda mucho por hacer dada su complejidad. Específicamente, esta investigación se centra en el contenido de los textos publicados regularmente, en los sitios de microblogs (por ejemplo, en Twitter.com) los cuales pueden ser utilizados para analizar tendencias. Estas ultimas son marcadas por temas emergentes que se distinguen de los demás por un súbito y acelerado aumento de publicaciones relacionadas al mismo tema; en otras palabras, por un incremento de popularidad en periodos relativamente cortos, de un día o de unas cuantas horas. En consecuencia, el problema es doble, primero extraer los temas sobre los cuáles se escribe y luego identificar cuáles de esos temas son tendencia. Una solución reciente, conocida como Bursty Biterm Topic Model (BBTM) es un algoritmo que utiliza coocurrencia de palabras (bitérminos) para identificación de temas emergentes y que cuenta con un buen nivel de resultados en Twitter. Sin embargo, su complejidad computacional es alta y requiere de una considerable cantidad de procesamiento computacional. De ahí, que esta investigación busca evaluar si es posible reducir la cantidad de procesamiento requerido y obtener resultados cuya calidad sean igualmente buenos. Esta reducción es llevada a cabo por una discriminación de las coocurrencias de palabras (bitérminos) utilizadas por BBTM para modelar los temas emergentes. En contraste al trabajo realizado previamente, en esta investigación, se llevan a cabo experimentos más completos y exhaustivos.

show abstract

Evaluation of Potential Spanish Text Markers on Social Posts as Features for Polarity Classification

Murillo

León

Raventós³

2018

CLEIej

View full text Add to dashboard Cite

Abstract: This work describes the identification and evaluation process of potential text markers for sentiment analysis. Evaluation of the markers and its use as part of the feature extraction process from plain text that is needed for sentiment analysis is presented. Evaluation of text marker obtained as a result of systematic analysis from a corpus over a second one allowed us to identify that emphasized positive words are strong indicators for positive text. The second corpus allowed us to evaluate the relation between the polarity of emphasized words and the text they appear in. Evaluation of the markers for polarity detection task in combination with a polarized dictionary produced polarity classification average precision of 56% using only three markers.This are promising results compared to the top 69% obtained using more features and specialized dictionaries for the same task. Spanish Abstract: Este trabajo describe el proceso de identificación y evaluación de posibles marcadores de texto para análisis de sentimiento. Se presenta la evaluación de los marcadores y su uso como parte del proceso de extracción de características para análisis de sentimiento.La evaluación de los marcadores de texto que obtenidos como resultado del análisis sistemático de un corpus sobre un segundo corpus permitió identificar que las palabras positivas con énfasis son fuertes indicadores para identificación de texto positivo. El segundo corpus nos permitió evaluar la relación existente entre la polaridad de las palabras con énfasis y el texto en que aparecen. La evaluación de los marcadores en combinación con un diccionario de polaridad permitieron obtener una precisión del 56%usando solo tres marcadores. Este es un resultado prometedor si se compara con el 69% que se logra obtener para la misma tarea con un número mayor de características y varios diccionarios especializados.

show abstract

The case for open source in information and network technology education: experiences from the EuropeAid @lis technology net project

Willmott¹,

Padget²,

Poggi³

et al. 2007

IJCEELL

View full text Add to dashboard Cite

Open source, open formats and open content have a significant role to play in the development of educational programs. Until now the debate has focused on open methods for delivery and licensing of educational material. But open methods are just as important for education itself, and IT in particular, as they are for delivery. We reflect on the experiences of the @LIS-TechNET project and argue that in emerging regions such as Latin America, a focus on open methods in IT education provides a more sustainable long-term benefit to the IT sector and the economy as a whole than proprietary methods. Rey, M. (2007) 'The case for open source in information and network technology education: experiences from the EuropeAid @lis technology net project', Int.

show abstract

Transferencia de la polaridad semántica de frases idiomáticas a comentarios de opinión

Murillo

León

2017

View full text Add to dashboard Cite

Las reacciones en redes sociales a propósito de un comentario pueden ser clasificadas en positivas, negativas o neutras, según la opinión del usuario o emisor sobre el tema en juego. Las frases idiomáticas son portadoras de un valor emocional, que va de positivo a negativo (lo que conocemos como polaridad). En nuestra investigación, establecimos que la aparición de una frase idiomática negativa en una opinión en redes sociales está directamente vinculada con la polaridad de la totalidad del comentario. Es decir, que la polaridad de la expresión es transferida al comentario propiamente dicho.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.