Most sentiment analysis models that use supervised learning algorithms consume a lot of labeled data in the training phase in order to give satisfactory results. This is usually expensive and leads to high labor costs in real-world applications. This work consists in proposing a hybrid sentiment analysis model based on a Long Short-Term Memory network, a rulebased sentiment analysis lexicon and the Term Frequency-Inverse Document Frequency weighting method. These three (input) models are combined in a binary classification model. In the latter, each of these algorithms has been implemented: Logistic Regression, k-Nearest Neighbors, Random Forest, Support Vector Machine and Naive Bayes. Then, the model has been trained on a limited amount of data from the IMDB dataset. The results of the evaluation on the IMDB data show a significant improvement in the Accuracy and F1 score compared to the best scores recorded by the three input models separately. On the other hand, the proposed model was able to transfer the knowledge gained on the IMDB dataset to better handle a new data from Twitter US Airlines Sentiments dataset.
Understanding the determinants of satisfaction in P2P hosting is crucial, especially with the emergence of platforms such as Airbnb, which has become the largest platform for short-term rental accommodation. Although many studies have been carried out in this direction, there are still gaps to be filled, particularly with regard to the apprehension of customers taking into account their category. In this study, we took a machine learning-based approach to examine 100,000 customer reviews left on the Airbnb platform to identify different dimensions that shape customer satisfaction according to each category studied (individuals, couples, and families). However, the data collected do not give any information on the category to which the customer belongs to. So, we applied natural language processing (NLP) algorithms to the reviews in order to find clues that could help us segment them, and then we trained two regression models, multiple linear regression and support vector regression, in order to calculate the coefficients acting on each of the 6 elementary scores (precision, cleanliness, check-in, communication, location, and value) noted on Airbnb, taking into account the category of customers who evaluated the performance of their accommodation. The results suggest that customers are not equally interested in satisfaction metrics. In addition, disparities were noted for the same indicator depending on the category to which the client belongs to. In light of these results, we suggest that improvements be made to the rating system adopted by Airbnb to make it suitable for each category to which the client belongs to.
With the emergence of social networks and their adoption by a large number of users, the importance of influencers continues to grow and companies are in a frantic race to recruit those most likely to promote their reputation and brand image. However, in the existing literature, there is little work that conducts quantitative studies on this subject in developing countries. For this reason, we conducted a study that attempts to understand the importance of influencers in reshaping public opinion of a company or brand. We chose as a subject of study a large Moroccan company operating in the telecommunications sector that hired a popular influencer among young Moroccans. We then adopted an approach based on scraping and analyzing the occurrences of the influencer's posts on Instagram and the content of the company's website and then publishing a questionnaire to 180 respondents in the age range of most of the followers of the influencer in question. The results suggest that a positive relationship exists between the influencer and brand reputation, meaning that if the person is following the influencer who has published content on the brand, that person is expected to be systematically aware of the brand, and vice versa.
<span lang="EN-US">Word embedding has become the most popular method of lexical description in a given context in the natural language processing domain, especially through the word to vector (Word2Vec) and global vectors (GloVe) implementations. Since GloVe is a pre-trained model that provides access to word mapping vectors on many dimensionalities, a large number of applications rely on its prowess, especially in the field of sentiment analysis. However, in the literature, we found that in many cases, GloVe is implemented with arbitrary dimensionalities (often 300d) regardless of the length of the text to be analyzed. In this work, we conducted a study that identifies the effect of the dimensionality of word embedding mapping vectors on short and long texts in a sentiment analysis context. The results suggest that as the dimensionality of the vectors increases, the performance metrics of the model also increase for long texts. In contrast, for short texts, we recorded a threshold at which dimensionality does not matter.</span>
Due to the social and economic fallout from the COVID-19 pandemic, we sought to gauge the attitudes of social network users, in this case, Twitter, towards the topic using a sentiment analysis approach. We collected 178,683 tweets using the Twitter API based on queries for the high-frequency hashtag #covid19. After the preprocessing step, we classified them in a binary way (positive and negative) and according to their intensity (valence) using the VADER model and then the NRCLex dictionary, which allows us to classify feelings according to their affective class. The results suggest that overall, the feelings detected through the tweets are positive. In addition, users seem to be interestedin the pandemic as a trend rather than as a topic related to other social or economic aspects.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.