2016
DOI: 10.13053/cys-20-3-2453
|View full text |Cite
|
Sign up to set email alerts
|

Social Media – Processing Romanian Chat and Discourse Analysis

Abstract: In order to obtain a balanced corpus, a subcorpus of 2,576 sentences illustrating contemporary social media language has been added to the Dependency Treebank for Romanian. The texts were taken from the chat. The subject of this paper is to describe the second step of processing non-standard texts with a hybrid POS-tagger for Romanian and with a Malt parser, both until now trained on standard language and on other styles of communication. The results obtained show that the UAIC tools are comparable with the to… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
references
References 7 publications
0
0
0
Order By: Relevance