Proceedings of the Thirteenth Workshop on Innovative Use of NLP For Building Educational Applications 2018
DOI: 10.18653/v1/w18-0539
|View full text |Cite
|
Sign up to set email alerts
|

Deep Learning Architecture for Complex Word Identification

Abstract: We describe a system for the CWI-task that includes information on 5 aspects of the (complex) lexical item, namely distributional information of the item itself, morphological structure, psychological measures, corpus-counts and topical information. We constructed a deep learning architecture that combines those features and apply it to the probabilistic and binary classification task for all English sets and Spanish. We achieved reasonable performance on all sets with best performances seen on the probabilist… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
6
3

Relationship

1
8

Authors

Journals

citations
Cited by 16 publications
(9 citation statements)
references
References 12 publications
0
9
0
Order By: Relevance
“…Our system is ranked directly above TMU [62]. This system is based on the frequency of the target word in a Wikipedia Corpus and a learner corpus subsequently trained on a random forest classifier, as well as a deep learning architecture with word/char embeddings, word length and frequency counts named NLP-CIC [63].…”
Section: Discussionmentioning
confidence: 99%
“…Our system is ranked directly above TMU [62]. This system is based on the frequency of the target word in a Wikipedia Corpus and a learner corpus subsequently trained on a random forest classifier, as well as a deep learning architecture with word/char embeddings, word length and frequency counts named NLP-CIC [63].…”
Section: Discussionmentioning
confidence: 99%
“…The submitted systems mainly use traditional machine learning classifiers(e.g. SVM, Random Forests) with features (Butnaru and Ionescu, 2018;Kajiwara and Komachi, 2018), deep learning methods (Hartmann and Dos Santos, 2018;De Hertog and Tack, 2018) and ensemble methods (Gooding and Kochmar, 2018;Aroyehun et al, 2018). More recently, (Gooding and Kochmar, 2019) propose a new perspective by treating CWI as a sequence labeling task that can detect both complex words and phrases.…”
Section: Complex Word Identificationmentioning
confidence: 99%
“…ITEC addresses both the binary and probabilistic classification task for the English and Spanish multilingual datasets (De Hertog and Tack, 2018). They have used 5 different aspects of the target word in the process of feature extractions, namely, word embedding, morphological structure, psychological measures, corpus counts, and topical information.…”
Section: Shared Task Systemsmentioning
confidence: 99%