2018
DOI: 10.1108/jkm-11-2016-0489
|View full text |Cite
|
Sign up to set email alerts
|

Twitter mining for ontology-based domain discovery incorporating machine learning

Abstract: Purpose This paper aims to obtain the domain of the textual content generated by users of online social network (OSN) platforms. Understanding a users’ domain (s) of interest is a significant step towards addressing their domain-based trustworthiness through an accurate understanding of their content in their OSNs. Design/methodology/approach This study uses a Twitter mining approach for domain-based classification of users and their textual content. The proposed approach incorporates machine learning module… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
46
0
3

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
2
1

Relationship

4
4

Authors

Journals

citations
Cited by 72 publications
(52 citation statements)
references
References 60 publications
0
46
0
3
Order By: Relevance
“…In this context, a thread of efforts has steered towards extracting knowledge from UGC to inform actionable intelligence (Abu-Salih, Wongthongtham, & Chan, 2018;Hultgren, Jennex, Persano,&Ornatowski,2016).Asnoted,OSNshavealreadybeenextensivelyusedasapowerful tooltopromoteknowledgeextractionandmanagementinseveraldomains (Arularasan,Suresh,& Seerangan,2018;Kasemsap,2019;Nishikant,PrabinKumar,&ShashiKant,2018).Givensuchan impact,understandingwaystobroadenthisscopeandextradatafromnewsourcessuchasUGCisof 104 interesttomanypractitionersandresearchersalike.Infact,eventhoughidentifying,reviewing,and interpretingsocialcontentconsumessubstantialtimeandeffort (Chang,Diaz,&Hung,2015),itstill attractswideinterestduetothepotentialtoapplyKMtoobtainhighqualitycontent,andactionable intelligenceinmanydisciplinesincludingpolitics (Cruz,2019),e-commerce(Schaupp&Bélanger, 2019),e-learning (Hosseingholizadeh,Sharif,&Kouhsari,2018),andhealthcare (Surendro,Satya, &Yodihartomo,2018).Othereffortshaveprovidedunconventionalandadvancedperceptionsto framethisconstantgrowthofUGC,alongsideotherBigDataislands.Forexample, Jennex(2017) presentedarevisedversionofthetraditionalKMpyramidthatincorporatesBigData,Internetof Things(IoT)andBusinessIntelligence(BI)toprovideanoverarchingparadigmtowardbetterdecision supportstrategies.…”
Section: Incorporation Of Knowledge Managementmentioning
confidence: 99%
“…In this context, a thread of efforts has steered towards extracting knowledge from UGC to inform actionable intelligence (Abu-Salih, Wongthongtham, & Chan, 2018;Hultgren, Jennex, Persano,&Ornatowski,2016).Asnoted,OSNshavealreadybeenextensivelyusedasapowerful tooltopromoteknowledgeextractionandmanagementinseveraldomains (Arularasan,Suresh,& Seerangan,2018;Kasemsap,2019;Nishikant,PrabinKumar,&ShashiKant,2018).Givensuchan impact,understandingwaystobroadenthisscopeandextradatafromnewsourcessuchasUGCisof 104 interesttomanypractitionersandresearchersalike.Infact,eventhoughidentifying,reviewing,and interpretingsocialcontentconsumessubstantialtimeandeffort (Chang,Diaz,&Hung,2015),itstill attractswideinterestduetothepotentialtoapplyKMtoobtainhighqualitycontent,andactionable intelligenceinmanydisciplinesincludingpolitics (Cruz,2019),e-commerce(Schaupp&Bélanger, 2019),e-learning (Hosseingholizadeh,Sharif,&Kouhsari,2018),andhealthcare (Surendro,Satya, &Yodihartomo,2018).Othereffortshaveprovidedunconventionalandadvancedperceptionsto framethisconstantgrowthofUGC,alongsideotherBigDataislands.Forexample, Jennex(2017) presentedarevisedversionofthetraditionalKMpyramidthatincorporatesBigData,Internetof Things(IoT)andBusinessIntelligence(BI)toprovideanoverarchingparadigmtowardbetterdecision supportstrategies.…”
Section: Incorporation Of Knowledge Managementmentioning
confidence: 99%
“…Random Forest Decision Tree classification as the major learning algorithm implemented in this undertaking is further utilized as a training data and test results to predict the MMORS river condition with its corresponding water pollution level classification indicated as -Excellent‖, -Good‖, -Poor‖, -Very Poor, and -Worst‖ This section describes the different metrics used by the researcher in evaluating the classifier model performance [8]; its effectiveness and the quality of its prediction. Several tests of data with known water quality parameter values are used to test the accuracy of the generated sample by distinguishing the reliability of the data and their validity in accordance to the comparison of an observed accuracy with an expected accuracy rate that is likely to meet based on the Confusion Matrix [9]. The classifier can also be evaluated in terms of Precision, Recall, and F-measure and the assessment of interrater-reliability [10] .Cohen's Kappa is used which is shown in Table IV.…”
Section: E Prediction and Validationmentioning
confidence: 99%
“…It is also directly linked to the capability to discard user's messages that are classi ed as spam i.e. unsolicited and repeated junk messages (Abu-Salih et al, 2020;Abu-Salih et al, 2018;Abu-Salih, et al, 2019b;. These tweets come usually from bots and have a malicious intention to create rumours and chaos (Shin et al, 2017).…”
Section: Related Workmentioning
confidence: 99%
“…A SA system is highly sensitive to the domain in which the data used to train are extracted. It can obtain poor results if the training dataset is not political (Abu-Salih et al, 2018;. Due to the lack of a sentiment lexicon for non-English languages, the creation of a new polarity lexicon is decided for the Spanish political event issuing from two different sources.…”
Section: -New Polarity Political Lexiconmentioning
confidence: 99%