2020
DOI: 10.1371/journal.pone.0237767
|View full text |Cite
|
Sign up to set email alerts
|

Learning about phraseology from corpora: A linguistically motivated approach for Multiword Expression identification

Abstract: Multiword Expressions (MWEs) are idiosyncratic combinations of words which pose important challenges to Natural Language Processing. Some kinds of MWEs, such as verbal ones, are particularly hard to identify in corpora, due to their high degree of morphosyntactic flexibility. This paper describes a linguistically motivated method to gather detailed information about verb+noun MWEs (VNMWEs) from corpora. Although the main focus of this study is Spanish, the method is easily adaptable to other languages. Monolin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(2 citation statements)
references
References 18 publications
0
2
0
Order By: Relevance
“…syntactic and/or morphological variability (Peng et al, 2014;Constant et al, 2017). In general, quantifying any variability has traditionally required obtaining frequencies of the variants from a full corpus, as done by Inurrieta et al (2020). However, as we only have a small number of examples for each idiom, these properties are not modeled in our approach.…”
Section: Background and Related Workmentioning
confidence: 99%
“…syntactic and/or morphological variability (Peng et al, 2014;Constant et al, 2017). In general, quantifying any variability has traditionally required obtaining frequencies of the variants from a full corpus, as done by Inurrieta et al (2020). However, as we only have a small number of examples for each idiom, these properties are not modeled in our approach.…”
Section: Background and Related Workmentioning
confidence: 99%
“…Apparently, multiword expressions pose challenges to Natural Language Processing (Inurrieta et al, 2020). Verbal idioms seem to be harder to identify and interpret.…”
Section: Idioms Language Change and Natural Language Processingmentioning
confidence: 99%