Abstract:We present SICK-NL (read: signal), a dataset targeting Natural Language Inference in Dutch. SICK-NL is obtained by translating the SICK dataset of Marelli et al. (2014) from English into Dutch. Having a parallel inference dataset allows us to compare both monolingual and multilingual NLP models for English and Dutch on the two tasks. In the paper, we motivate and detail the translation process, perform a baseline evaluation on both the original SICK dataset and its Dutch incarnation SICK-NL, taking inspiration… Show more
“…Until recently, the algorithms behind these techniques were mainly trained on English texts. Due to the enormous rise in research on AI algorithms, more of these techniques are also available for the Dutch language [243,244]. The Veracity of data may affect the result of the analysis done.…”
“…Until recently, the algorithms behind these techniques were mainly trained on English texts. Due to the enormous rise in research on AI algorithms, more of these techniques are also available for the Dutch language [243,244]. The Veracity of data may affect the result of the analysis done.…”
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.