In this paper a study concerning the evaluation and analysis of natural language tweets is presented. Based on our experience in text summarisation, we carry out a deep analysis on user's perception through the evaluation of tweets manual and automatically generated from news. Specifically, we consider two key issues of a tweet: its informativeness and its interestingness. Therefore, we analyse: 1) do users equally perceive manual and automatic tweets?; 2) what linguistic features a good tweet may have to be interesting, as well as informative? The main challenge of this proposal is the analysis of tweets to help companies in their positioning and reputation on the Web. Our results show that: 1) automatically informative and interesting natural language tweets can be generated as a result of summarisation approaches; and 2) we can characterise good and bad tweets based on specific linguistic features not present in other types of tweets.Keywords: Natural Language Processing, Text Summarisation, Natural Language Tweet Generation, User Study, Linguistic Analysis, Descriptive Statistics
Introduction, Context and MotivationIn the current digital knowledge society, the overload of information has become a problem to companies, which cannot cope with all the available information. As a consequence, companies may not be exploiting the Web, and taking advantage of it accordingly, thus affecting key aspects, such as their visibility, reputation, marketing campaigns, customer's feedback, etc. With the
Preprint submitted to Computers in IndustryOctober 11, 2015 This is a previous version of the article published in Computers in Industry. 2016Industry. , 78: 3-15. doi:10.1016Industry. /j.compind.2015 birth of the Web 2.0, there has been a shift in the way the information is produced and consumed by users and companies. The Web 2.0 has established a wide range of on-line mechanisms and platforms through which companies can obtain direct feedback from users. These mechanisms (e.g., reviews, social net- With more than 241 million active users per month 1 , 184 million of which uses Twitter through their mobile device, and more than 500 million tweets daily 2 , Twitter 3 has become an excellent social media for on-line real-time news attention 4 . The length restriction imposed on tweets (140 characters) force messages to be concise, though it is also possible to link out to external information to enrich the tweet. Moreover, hashtags (e.g. #UA Universidad) allow to categorise information, to identify the trending topics, and more importantly to enable a rapid on-line information flow. According to [3]