Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing 2018
DOI: 10.18653/v1/d18-1370
|View full text |Cite
|
Sign up to set email alerts
|

Investigating the Role of Argumentation in the Rhetorical Analysis of Scientific Publications with Neural Multi-Task Learning Models

Abstract: Exponential growth in the number of scientific publications yields the need for effective automatic analysis of rhetorical aspects of scientific writing. Acknowledging the argumentative nature of scientific text, in this work we investigate the link between the argumentative structure of scientific publications and rhetorical aspects such as discourse categories or citation contexts. To this end, we (1) augment a corpus of scientific publications annotated with four layers of rhetoric annotations with argument… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
4
1

Relationship

1
9

Authors

Journals

citations
Cited by 18 publications
(15 citation statements)
references
References 47 publications
0
12
0
Order By: Relevance
“…Domain knowledge is necessary to annotate scientific publications, and therefore annotation on scientific publications is difficult for the non-expert (e.g., crowdsourcing workers). Indeed, existing data sets for various tasks in scientific publication mining ( Lauscher et al, 2018 ; Hua et al, 2019 ; Yang et al, 2019 ; Yasunaga et al, 2019 ) are limited in terms of size, which additionally suggests that obtaining a sufficient number of data for supervised machine learning on scientific text is expensive and time-consuming. To remedy this bottleneck of annotation cost, we propose a self-supervised approach in which we use direct inline figure references in the article body to heuristically pair article paragraphs with figure captions and use those pairs as distant supervision.…”
Section: Related Workmentioning
confidence: 99%
“…Domain knowledge is necessary to annotate scientific publications, and therefore annotation on scientific publications is difficult for the non-expert (e.g., crowdsourcing workers). Indeed, existing data sets for various tasks in scientific publication mining ( Lauscher et al, 2018 ; Hua et al, 2019 ; Yang et al, 2019 ; Yasunaga et al, 2019 ) are limited in terms of size, which additionally suggests that obtaining a sufficient number of data for supervised machine learning on scientific text is expensive and time-consuming. To remedy this bottleneck of annotation cost, we propose a self-supervised approach in which we use direct inline figure references in the article body to heuristically pair article paragraphs with figure captions and use those pairs as distant supervision.…”
Section: Related Workmentioning
confidence: 99%
“…Green (2017b) extracted argumentative units from biomedical and biological articles using a semantic rule-based approach. Lauscher et al (2018a) and Lauscher et al (2018c) proposed several neural multi-task learning models based on Bi-LSTM to identify premises and conclusions. Other papers propose different approaches to identify argumentative zones, including supervised and weakly-supervised approaches with a rich set of linguistics features (e.g., (Guo et al, 2011)).…”
Section: Automatic Argument Unit Identificationmentioning
confidence: 99%
“…The usefulness of deep networks has been tested and proven in many NLP tasks, such as machine translation (Young et al, 2018 ), sentiment analysis (Zhang et al, 2018a ), text classification (Conneau et al, 2017 ; Zhang et al, 2018b ), relations extraction (Huang and Wang, 2017 ), as well as in AM (Cocarascu and Toni, 2017 , 2018 ; Daxenberger et al, 2017 ; Galassi et al, 2018 ; Lauscher et al, 2018 ; Lugini and Litman, 2018 ; Schulz et al, 2018 ). While a straightforward approach to exploit domain knowledge in AM is to apply a set of hand-crafted rules on the output of some first stage classifier (such as a neural network), NeSy or SRL approaches can directly enforce (hard or soft) constraints during training , so that a solution that does not satisfy them is penalized, or even ruled out.…”
Section: Combining Symbolic and Sub-symbolic Approachesmentioning
confidence: 99%