2021
DOI: 10.48550/arxiv.2101.05716
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SICKNL: A Dataset for Dutch Natural Language Inference

Abstract: We present SICK-NL (read: signal), a dataset targeting Natural Language Inference in Dutch. SICK-NL is obtained by translating the SICK dataset of Marelli et al. (2014) from English into Dutch. Having a parallel inference dataset allows us to compare both monolingual and multilingual NLP models for English and Dutch on the two tasks. In the paper, we motivate and detail the translation process, perform a baseline evaluation on both the original SICK dataset and its Dutch incarnation SICK-NL, taking inspiration… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 5 publications
0
1
0
Order By: Relevance
“…Until recently, the algorithms behind these techniques were mainly trained on English texts. Due to the enormous rise in research on AI algorithms, more of these techniques are also available for the Dutch language [243,244]. The Veracity of data may affect the result of the analysis done.…”
Section: The Status Quomentioning
confidence: 99%
“…Until recently, the algorithms behind these techniques were mainly trained on English texts. Due to the enormous rise in research on AI algorithms, more of these techniques are also available for the Dutch language [243,244]. The Veracity of data may affect the result of the analysis done.…”
Section: The Status Quomentioning
confidence: 99%