2020
DOI: 10.11591/ijeecs.v17.i1.pp248-255
|View full text |Cite
|
Sign up to set email alerts
|

Author identification for Under-Resourced language (KadazanDusun)

Abstract: <span>This paper presents the task of Author Identification for KadazanDusun language by using tweets as the source of data to perform Author Identification task of short text on KadazanDusun, which is considered as one the under-resourced language in Malaysia. The aim of this paper is to demonstrate Author Identification of short text on KadazanDusun. Besides, this paper also examines the performance of two machine learning algorithms on the KadazanDusun data set by analyzing the stylometric features. S… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 18 publications
(27 reference statements)
0
4
0
Order By: Relevance
“…Another interesting tweet-based author identification using n-grams and word n-grams that treats the single-labelled multi-class problem in order to identify author. This study (Tarmizi 2020) demonstrates outstanding deployment of SVM over Naïve Bayes classifier.…”
Section: Statistical and Linguistic Modelsmentioning
confidence: 76%
“…Another interesting tweet-based author identification using n-grams and word n-grams that treats the single-labelled multi-class problem in order to identify author. This study (Tarmizi 2020) demonstrates outstanding deployment of SVM over Naïve Bayes classifier.…”
Section: Statistical and Linguistic Modelsmentioning
confidence: 76%
“…Tarmizi et al [6] present the task of Author Identification for KadazanDusun language by using tweets as the source of data. The feature extraction used is a combination of n-grams which n is from 1 to 5.…”
Section: Of 13mentioning
confidence: 99%
“…The calculations are continuing to prognosticating the correlation by calculating the covariance by implementing (2). Table 3 shows these calculation steps for x (number of friends), and y (friends and retweet) as two variables.…”
Section: ∑ ̅mentioning
confidence: 99%
“…Online social networks grew to become a significant phenomenon throughout the last years [1,2]. Where it has a vital impact on sharing, receiving and breaking news based on users' relationships [3].…”
Section: Introductionmentioning
confidence: 99%