“…It is necessary, therefore, to empirically test the validity of these categories not only for theoretical purposes but also to be used for the annotation of large datasets, which can be later used for computational purposes such as Machine Learning, Automatic Text Classification, Multilingual Information Retrieval and Sentiment Analysis, among others. In this paper, we undertake this task in the context of the MULTINOT project, aimed at the creation and empirical validation of discourse features in English and Spanish through corpus analysis and annotation (see Lavid et al 2015;Lavid, Moratón 2016).…”