Proceedings of the 15th Conference of the European Chapter of The Association for Computational Linguistics: Volume 2 2017
DOI: 10.18653/v1/e17-2042
|View full text |Cite
|
Sign up to set email alerts
|

The Content Types Dataset: a New Resource to Explore Semantic and Functional Characteristics of Texts

Abstract: This paper presents a new resource, called Content Types Dataset, to promote the analysis of texts as a composition of units with specific semantic and functional roles. By developing this dataset, we also introduce a new NLP task for the automatic classification of Content Types. The annotation scheme and the dataset are described together with two sets of classification experiments.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
3
0
1

Year Published

2018
2018
2021
2021

Publication Types

Select...
3
1
1

Relationship

2
3

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 18 publications
0
3
0
1
Order By: Relevance
“…More recently, Yu et al (2016) used a hybrid deep neural network combined with a Hidden Markov Model (DNN-HMM) to segment speech transcripts from broadcast news to a sequence of stories. Similar to our approach, (Sprugnoli et al, 2017) used CRFs and SVMs for the classification of automatic classification of Content Types, a novel task that was introduced to provide cues to access the structure of a document's types of functional content.…”
Section: Related Workmentioning
confidence: 99%
“…More recently, Yu et al (2016) used a hybrid deep neural network combined with a Hidden Markov Model (DNN-HMM) to segment speech transcripts from broadcast news to a sequence of stories. Similar to our approach, (Sprugnoli et al, 2017) used CRFs and SVMs for the classification of automatic classification of Content Types, a novel task that was introduced to provide cues to access the structure of a document's types of functional content.…”
Section: Related Workmentioning
confidence: 99%
“…"Bieber fever" refers to avid supporters of Justin Bieber) and those discussing an infectious disease outbreak (43,51,52). Machine learning methods can also be used to distinguish between ambiguities in dates and locations, such as past and present outbreaks in articles that discuss historical context (53,54). Novel applications for ML methods are also being developed, such as structuring disease case information into epidemiological line lists (a listing of individuals affected by the disease and related information; i.e.…”
Section: Artificial Intelligence Applicationsmentioning
confidence: 99%
“…la « fièvre Justin Bieber » fait référence aux fans du chanteur) et les articles portant sur l'éclosion d'une maladie infectieuse (43,51,52). L'apprentissage automatique peut également servir à éclaircir les ambiguïtés dans les dates et les lieux, comme les éclosions passées et les éclosions en cours, dans les articles discutant du contexte historique (53,54). De plus, on élabore des applications avant-gardistes de l'apprentissage automatique, telles que la structuration des données sur les cas sous forme de listes de parcours des éléments épidémiologiques (la liste des patients infectés et l'information connexe : état de santé, sexe, lieu, date d'apparition de la maladie, hospitalisation, etc.)…”
Section: Applications De L'intelligence Artificielleunclassified