Vector representation based on a supervised codebook for Nepali documents classification

Sitaula, Chiranjibi; Basnet, Anish; Aryal, Sunil

doi:10.7717/peerj-cs.412

Cited by 14 publications

(18 citation statements)

References 31 publications

(42 reference statements)


“…First, most of the existing works [ 2 , 4 , 5 , 12 ] on COVID-19-related tweets are performed in high-resource languages such as English and Arabic. The approach used by high-resource language might be inapplicable to low-resource languages such as Nepali, which is based on Devanagari script and has 36 consonants (33 are distinct consonants and 3 are combined consonants), 13 vowels, and 10 numerals ( Figure 1 ) [ 1 , 15 , 16 ]. Second, their investigation mainly targets either clustering the tweets into various themes/topics or classifying their polarity into three classes (negative, positive, or neutral) using the well-established feature extraction methods such as BERT, Word2Vec, and Glove.…”

Section: Introduction (mentioning)

confidence: 99%

“…For example, semantic features based on the COVID-19-related tweets could learn more informative features. For this, we employ the probabilistic feature extraction approach as suggested by Sitaula et al [ 1 ] recently, which calculates the probability of each input word across all categories and finally and attains the feature vector depending on the number of categories present in the dataset. Last, with the help of the domain-agnostic method, we capture the semantic information using the cross-domain approach, which means that we transfer the knowledge to current COVID-19 domain from another domain such as news categories.…”

Section: Introduction (mentioning)

confidence: 99%
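The probabilistic feature extraction described in the statement above can be illustrated with a minimal sketch. It assumes that the "probability of each input word across all categories" is the word's relative frequency per category in a labelled training set, and that a document vector is the average of its words' distributions. Both are assumptions made for illustration; the quoted text does not give the exact formula, and the function names and toy data are hypothetical.

```python
from collections import Counter, defaultdict


def fit_word_category_probs(docs, labels):
    """Estimate, for each word, its relative frequency across categories.

    Assumption: probability = count(word, category) / count(word).
    """
    categories = sorted(set(labels))
    counts = defaultdict(Counter)  # word -> Counter mapping category -> count
    for tokens, label in zip(docs, labels):
        for tok in tokens:
            counts[tok][label] += 1
    probs = {}
    for word, per_cat in counts.items():
        total = sum(per_cat.values())
        probs[word] = [per_cat[c] / total for c in categories]
    return categories, probs


def document_vector(tokens, categories, probs):
    """Average the per-word category distributions; unseen words are skipped."""
    known = [probs[t] for t in tokens if t in probs]
    if not known:
        return [0.0] * len(categories)
    return [sum(col) / len(known) for col in zip(*known)]


# Toy, made-up training data with two categories.
train_docs = [["match", "goal", "team"], ["vote", "party", "team"]]
train_labels = ["sports", "politics"]

categories, probs = fit_word_category_probs(train_docs, train_labels)
vec = document_vector(["goal", "team"], categories, probs)
```

The resulting vector has one entry per category, which matches the statement that the feature size depends on the number of categories in the dataset.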


Deep Learning‐Based Methods for Sentiment Analysis on Nepali COVID‐19‐Related Tweets

Sitaula, Basnet, Mainali et al. 2021

Computational Intelligence and Neuroscience

Self Cite


COVID-19 has claimed several human lives to this date. People are dying not only because of physical infection of the virus but also because of mental illness, which is linked to people’s sentiments and psychologies. People’s written texts/posts scattered on the web could help understand their psychology and the state they are in during this pandemic. In this paper, we analyze people’s sentiment based on the classification of tweets collected from the social media platform, Twitter, in Nepal. For this, we, first, propose to use three different feature extraction methods—fastText-based (ft), domain-specific (ds), and domain-agnostic (da)—for the representation of tweets. Among these three methods, two methods (“ds” and “da”) are the novel methods used in this study. Second, we propose three different convolution neural networks (CNNs) to implement the proposed features. Last, we ensemble such three CNNs models using ensemble CNN, which works in an end-to-end manner, to achieve the end results. For the evaluation of the proposed feature extraction methods and CNN models, we prepare a Nepali Twitter sentiment dataset, called NepCOV19Tweets, with 3 classes (positive, neutral, and negative). The experimental results on such dataset show that our proposed feature extraction methods possess the discriminating characteristics for the sentiment classification. Moreover, the proposed CNN models impart robust and stable performance on the proposed features. Also, our dataset can be used as a benchmark to study the COVID-19-related sentiment analysis in the Nepali language.


“…During word representation learning, fastText considers not only the word itself but also groups of characters from that word and subword information such as character unigrams, bigrams, and trigrams [ 20 ]. However, GloVe and word2vec fail to provide any vector representation for words that are not in the model dictionary [ 21 ]. As a result, in this study, fastText is used as a word representation model.…”

Section: Proposed Methods (mentioning)

confidence: 99%
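The subword idea in the statement above can be sketched as follows. This is an illustrative reimplementation, not the fastText library's own code: it extracts character n-grams of lengths 1 to 3 (the range named in the quote) after wrapping the word in the boundary markers `<` and `>`, and shows that a word missing from a vocabulary still shares n-grams with a known word. The function name and example words are hypothetical.

```python
def char_ngrams(word, min_n=1, max_n=3):
    """Return the set of character n-grams of a word, with boundary markers."""
    marked = f"<{word}>"
    grams = set()
    for n in range(min_n, max_n + 1):
        for i in range(len(marked) - n + 1):
            grams.add(marked[i:i + n])
    return grams


# A word assumed to be in the vocabulary and one assumed to be unseen.
known = char_ngrams("classify")
unseen = char_ngrams("classifier")

# The shared n-grams are what lets a subword model build a vector for the
# unseen word, whereas a purely word-level lookup would have nothing to return.
shared = known & unseen
```

In a subword model, the vector of an out-of-vocabulary word can then be composed from the vectors of its n-grams, which is the property the citing paper relies on when preferring fastText over GloVe and word2vec.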

Automated Amharic News Categorization Using Deep Learning Models

Endalie, Haile 2021

Computational Intelligence and Neuroscience


For decades, machine learning techniques have been used to process Amharic texts. The potential application of deep learning on Amharic document classification has not been exploited due to a lack of language resources. In this paper, we present a deep learning model for Amharic news document classification. The proposed model uses fastText to generate text vectors to represent semantic meaning of texts and solve the problem of traditional methods. The text vectors matrix is then fed into the embedding layer of a convolutional neural network (CNN), which automatically extracts features. We conduct experiments on a data set with six news categories, and our approach produced a classification accuracy of 93.79%. We compared our method to well-known machine learning algorithms such as support vector machine (SVM), multilayer perceptron (MLP), decision tree (DT), XGBoost (XGB), and random forest (RF) and achieved good results.


“…Various research have also been conducted for the recognition of handwritten characters and texts from other languages such as recognizing Baybayin scripts using SVM ( Sitaula, Basnet & Aryal, 2021 ), analyzing handwritten Hebrew document ( Biller et al, 2016 ), recognizing English handwritings ( Pham et al, 2020 ), and many more. For English handwritten digit and character recognition tasks, CNN-based architectures have yielded better performance than other techniques ( Baldominos, Saez & Isasi, 2019 ; Ranzato et al, 2007 ; Cireşan et al, 2011 ), and so on.…”

Section: Related Work (mentioning)

confidence: 99%

Convolutional neural network-based ensemble methods to recognize Bangla handwritten character

Shibly, Tisha, Tani et al. 2021

PeerJ Computer Science


In this era of advancements in deep learning, an autonomous system that recognizes handwritten characters and texts can be eventually integrated with the software to provide better user experience. Like other languages, Bangla handwritten text extraction also has various applications such as post-office automation, signboard recognition, and many more. A large-scale and efficient isolated Bangla handwritten character classifier can be the first building block to create such a system. This study aims to classify the handwritten Bangla characters. The proposed methods of this study are divided into three phases. In the first phase, seven convolutional neural networks i.e., CNN-based architectures are created. After that, the best performing CNN model is identified, and it is used as a feature extractor. Classifiers are then obtained by using shallow machine learning algorithms. In the last phase, five ensemble methods have been used to achieve better performance in the classification task. To systematically assess the outcomes of this study, a comparative analysis of the performances has also been carried out. Among all the methods, the stacked generalization ensemble method has achieved better performance than the other implemented methods. It has obtained accuracy, precision, and recall of 98.68%, 98.69%, and 98.68%, respectively on the Ekush dataset. Moreover, the use of CNN architectures and ensemble methods in large-scale Bangla handwritten character recognition has also been justified by obtaining consistent results on the BanglaLekha-Isolated dataset. Such efficient systems can move the handwritten recognition to the next level so that the handwriting can easily be automated.
