2022
DOI: 10.3390/electronics11152418
|View full text |Cite
|
Sign up to set email alerts
|

Chinese Spam Detection Using a Hybrid BiGRU-CNN Network with Joint Textual and Phonetic Embedding

Abstract: The proliferation of spam in China has a negative impact on internet users’ experiences online. Existing methods for detecting spam are primarily based on machine learning. However, it has been discovered that these methods are susceptible to adversarial textual spam that has frequently been imperceptibly modified by spammers. Spammers continually modify their strategies to circumvent spam detection systems. Text with Chinese homophonic substitution may be easily understood by users according to its context. C… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 40 publications
0
3
0
Order By: Relevance
“…Phishing emails frequently exhibit linguistic inconsistencies, such as misspellings, grammar errors, and unnatural language usage. Studies by [10] and [11] have explored the linguistic features that are indicative of phishing. These features include grammatical errors, unusual syntax, and inconsistent language usage, which are often present in phishing emails.…”
Section: I) Linguistic Cues and Stylistic Anomaliesmentioning
confidence: 99%
“…Phishing emails frequently exhibit linguistic inconsistencies, such as misspellings, grammar errors, and unnatural language usage. Studies by [10] and [11] have explored the linguistic features that are indicative of phishing. These features include grammatical errors, unusual syntax, and inconsistent language usage, which are often present in phishing emails.…”
Section: I) Linguistic Cues and Stylistic Anomaliesmentioning
confidence: 99%
“…Fu Biehe [12] integrated a bad text classifier based on key information reduction, a bad text classifier based on improved KNN, and a bad text classifier based on sensitive decision tree through Bagging to recognize bad text containing variant words. Yao J [13] et al used a BiGRU-CNN hybrid network with word-phonetic co-embedding to recognize spam emails containing variant words. spam emails containing homophones.…”
Section: Introductionmentioning
confidence: 99%
“…The advent of smartphones with enhance features has contributed to huge adoption of short messaging by users due to its portability, mobility, ubiquity of services and its low cost continues to promote SMSM to become the most used means of communication globally. , Short Message Service (SMS) is text service in mobile communications with protocols that allow exchange of short text messages between fixed line or mobile phone devices (Ojugo & Ekurume, 2021aYao et al, 2022) Omede & Okpeki, 2023;Sahmoud & Mikki, 2022). An estimated 23-billion SMS was sent daily in 2014 globally; While, a total 8.3-trillion SMS was sent worldwide in the same year with net market revenue of over $128Billion in 2011.…”
Section: Introductionmentioning
confidence: 99%