2020
DOI: 10.2139/ssrn.3740497
|View full text |Cite
|
Sign up to set email alerts
|

Comparative Study of Data Clustering Algorithms and Analysis of The Keywords Extraction Efficiency: Learner Corpus Case

Abstract: The paper focuses on the task of clustering essays produced by ESL (English as a Second Language) learners. The data was taken from a learner corpus REALEC. The division of texts by certain characteristics can be useful to speed up the analysis of a single corpus or access to the necessary sections of a large number of documents. The study discusses not only some existing approaches to clustering text data, as well as the possibility of clustering texts produced by ESL learners, but also ways to extract keywor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
references
References 10 publications
0
0
0
Order By: Relevance