Text mining refers to the discovery of previously unknown knowledge in text collections. In recent years, the field has received great attention owing to the abundance of textual data. Researchers in this area must cope with issues arising from the particularities of natural language. This survey discusses such semantic issues along with the approaches and methodologies proposed in the existing literature. It covers syntactic matters and tokenization concerns, and focuses on the text representation techniques, categorisation tasks and similarity measures that have been suggested.
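To make the notions of text representation and similarity measure concrete, the following is a minimal sketch, not taken from the survey itself. It assumes a bag-of-words representation with a deliberately naive tokenizer and uses cosine similarity as one common choice of measure; the function names `tokenize`, `bag_of_words` and `cosine_similarity` are hypothetical and chosen for illustration only.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep runs of letters/digits (a naive tokenizer, assumed for illustration)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def bag_of_words(text):
    """Represent a document as a mapping from token to raw term count."""
    return Counter(tokenize(text))


def cosine_similarity(a, b):
    """Cosine of the angle between two term-count vectors; 0.0 if either vector is empty."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


doc1 = bag_of_words("Text mining")
doc2 = bag_of_words("Mining text data")
similarity = cosine_similarity(doc1, doc2)
```

Raw term counts ignore word order and meaning, which is precisely the kind of semantic limitation that richer representations try to address; weighting schemes such as TF-IDF or dense embeddings could be substituted for `bag_of_words` without changing the similarity function.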
In this article, we introduce the idea of expert recommendations, whose objective is to relate review comments to users' tasks or expectations. We propose to improve a recommendation system by using fine-grained information, such as opinions and suggestions, extracted from user reviews of products with natural language processing techniques. While typical recommender systems compare a user profile with some reference characteristics to rate unseen items, they rarely make use of the content of the reviews that users have written about a given product. We present the application of an opinion extraction system to extract opinions and suggestions from the content of the reviews, the use of the results to compare other products with the reviewed one, and finally the recommendation of better products to the user. The recommendations are assigned a confidence weight using a social trust network.