2021
DOI: 10.48550/arxiv.2102.02478
Preprint

Bangla Text Dataset and Exploratory Analysis for Online Harassment Detection

Cited by 1 publication (2 citation statements)
References 0 publications
“…In this method, for keyword matching we used a predefined list of vulgar words with automatically filtered-out words that had no probability of occurrence in vulgar context, as explained in Section 3.3. The method achieved accuracies of 0.2, 0.245, 0.3, 0.324, 0.363, 0.385, 0.427, 0.449, 0.467, and 0.475 within the top 10, 20, 30, 40, 50, 60, 70, 80, 90, and 100 extracted words, respectively (see Table 4 and Figure 9). Moreover, for the longer word lists, the method, despite filtering out on average only half of the actually non-vulgar words, achieved accuracies close to purely human-based filtering.…”
Section: Baseline 2: Keyword-matching Methods Based On Tf-idf Term Ex... (mentioning)
confidence: 97%
“…User comments from publicly viewable Facebook posts made by athletes, officials, and celebrities were analyzed in a study by Ahmed et al. [40]. The researchers distinguished between Bengali-only comments and those written in English or a mix of English and other languages.…”
Section: Literature Review (mentioning)
confidence: 99%