2020
DOI: 10.1007/978-981-15-5421-6_39

Techniques, Applications, and Issues in Mining Large-Scale Text Databases

Cited by 22 publications (6 citation statements)
References 21 publications
“…• Text filtering: This step consists of removing undesired data from the collected datasets, such as duplicate and corrupted information, hyperlinks, and foreign language text, if required. While the removal of duplicate or corrupted data and hyperlinks in text data can be trivial, language detection is a more complex task to perform at scale [153]. To aid text filtering applications and reduce the requirement of manual language labeling, language filtering of text data can be performed using automated tools such as Google's Compact Language Detector [154], langid.py [155] or similar open-source software.…”
Section: B Aspect Extraction Techniques (mentioning)
confidence: 99%
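The filtering step described in the statement above (dropping duplicates, stripping hyperlinks, and keeping only text in a target language) can be sketched roughly as follows. This is a minimal illustration, not the cited authors' pipeline: the names `strip_urls`, `filter_texts`, and `toy_detector` are hypothetical, the URL pattern is deliberately simple, and the language detector is a crude ASCII-based stand-in so that the example runs with the standard library alone.

```python
import re
from typing import Callable, Iterable, List

# Simple, assumed URL pattern; real-world hyperlinks may need a stricter regex.
URL_RE = re.compile(r"https?://\S+|www\.\S+")


def strip_urls(text: str) -> str:
    """Remove hyperlinks and collapse the whitespace they leave behind."""
    return " ".join(URL_RE.sub(" ", text).split())


def filter_texts(
    texts: Iterable[str],
    detect_language: Callable[[str], str],
    keep_lang: str = "en",
) -> List[str]:
    """Strip URLs, drop empty and duplicate entries, keep one language."""
    seen = set()
    kept = []
    for raw in texts:
        cleaned = strip_urls(raw)
        if not cleaned:
            continue  # nothing left once the hyperlinks are removed
        key = cleaned.casefold()
        if key in seen:
            continue  # case-insensitive exact duplicate
        seen.add(key)
        if detect_language(cleaned) != keep_lang:
            continue  # text in a language we do not want
        kept.append(cleaned)
    return kept


def toy_detector(text: str) -> str:
    """Hypothetical stand-in detector: treats ASCII-only text as English."""
    return "en" if text.isascii() else "other"


docs = [
    "Great phone, see https://example.com/review",
    "great phone, see",
    "Очень хороший телефон",
    "Battery life is poor",
]
print(filter_texts(docs, toy_detector))
```

Passing the detector in as a callable keeps the sketch independent of any one tool. In practice the stand-in would presumably be replaced by a wrapper around a real detector such as langid.py, for example `lambda t: langid.classify(t)[0]`, assuming that package is installed.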
“…These social skills or social intelligence also enhances commitment and learning in an individual specially in job sectors (Torabi, 2021;Mohadesi, 2021). It helps in extraction of large-scale text data and social computing as the process of extracting data from large text corpus is difficult (Avasthi et al, 2021;Wang et al, 2007).…”
Section: Literature Review (mentioning)
confidence: 99%
“…Multilingual text is an other open challenges. The proposed work focuses on all these challenges (Avasthi et al, 2020)…”
Section: Literature Review (mentioning)
confidence: 99%