2020 11th International Conference on Computing, Communication and Networking Technologies (ICCCNT) 2020
DOI: 10.1109/icccnt49239.2020.9225358

Toward an Enhanced Bengali Text Classification Using Saint and Common Form

Cited by 9 publications (2 citation statements)
References 11 publications
“…List of punctuations that we removed from our dataset are given in Table 3. − Data tokenization: tokenization is the method of splitting or tokenizing a string [9]. Words are the token of a sentence and the sentences are the token of a paragraph.…”
Section: Data Pre-processing (mentioning)
confidence: 99%
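
The two levels of tokenization described in the statement above (sentences as tokens of a paragraph, words as tokens of a sentence) can be sketched in a few lines of Python. This is only an illustration, not the citing authors' code: the delimiter set, the punctuation pattern, and the function names `sentence_tokens` and `word_tokens` are assumptions, and the punctuation list from the paper's Table 3 is not reproduced here.

```python
import re

# Assumed sentence delimiters: the Bengali danda plus ., ? and !
SENTENCE_END = re.compile(r"[।.?!]+")
# Assumed punctuation to strip; NOT the list from the cited paper's Table 3
PUNCTUATION = re.compile(r"[,;:\"'()\[\]{}-]+")


def sentence_tokens(paragraph):
    """Split a paragraph into sentences (the tokens of a paragraph)."""
    return [s.strip() for s in SENTENCE_END.split(paragraph) if s.strip()]


def word_tokens(sentence):
    """Remove punctuation, then split a sentence into words on whitespace."""
    cleaned = PUNCTUATION.sub(" ", sentence)
    return cleaned.split()


if __name__ == "__main__":
    paragraph = "আমি ভাত খাই। তুমি কি যাবে?"
    for sentence in sentence_tokens(paragraph):
        print(word_tokens(sentence))
```

Whitespace splitting is the simplest possible word tokenizer; a library tokenizer would handle more edge cases.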
“…Though they have used six classifiers, among all of those accuracy of NB was much efficient than other classifier's. Furthermore, count-vectorizer, tokenizing words, removal of stop words, part-of-speech (POS) tagging were the major steps for data preprocessing [9]-[11]. Different libraries and tools such as natural language toolkit (NLTK), TextBlob, Waikato environment for knowledge analysis (WEKA), and Beautiful Soup had been used for data preprocessing [12], [13].…”
Section: Introduction (mentioning)
confidence: 99%
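
To make the preprocessing steps named in the statement above more concrete, here is a minimal plain-Python sketch of stop-word removal followed by count vectorization. It stands in for what a count vectorizer does and is not the pipeline used in the cited works: the stop-word set, the sample documents, and the names `build_vocabulary` and `count_vectorize` are all hypothetical.

```python
from collections import Counter

# Hypothetical stop-word list, for illustration only
STOP_WORDS = {"the", "is", "a", "of"}


def build_vocabulary(docs):
    """Map each non-stop-word token to a column index, in sorted order."""
    tokens = {
        tok
        for doc in docs
        for tok in doc.lower().split()
        if tok not in STOP_WORDS
    }
    return {tok: i for i, tok in enumerate(sorted(tokens))}


def count_vectorize(docs, vocab):
    """Turn each document into a list of term counts, one column per vocabulary entry."""
    rows = []
    for doc in docs:
        counts = Counter(t for t in doc.lower().split() if t in vocab)
        # vocab preserves insertion order, which matches the column indices
        rows.append([counts[tok] for tok in vocab])
    return rows


if __name__ == "__main__":
    docs = ["the match is a good match", "price of rice"]
    vocab = build_vocabulary(docs)
    print(vocab)
    print(count_vectorize(docs, vocab))
```

The resulting count rows are the kind of feature vectors that a classifier such as Naive Bayes would take as input.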