Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Langua 2021
DOI: 10.18653/v1/2021.naacl-main.438
|View full text |Cite
|
Sign up to set email alerts
|

User-Generated Text Corpus for Evaluating Japanese Morphological Analysis and Lexical Normalization

Abstract: Morphological analysis (MA) and lexical normalization (LN) are both important tasks for Japanese user-generated text (UGT). To evaluate and compare different MA/LN systems, we have constructed a publicly available Japanese UGT corpus. Our corpus comprises 929 sentences annotated with morphological and normalization information, along with category information we classified for frequent UGTspecific phenomena. Experiments on the corpus demonstrated the low performance of existing MA/LN methods for non-general wo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
references
References 10 publications
(24 reference statements)
0
0
0
Order By: Relevance