2020 6th International Conference on Web Research (ICWR) 2020
DOI: 10.1109/icwr49608.2020.9122288
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Duplicate Bug Report Detection using Information Retrieval-based versus Machine Learning-based Approaches

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 16 publications
(6 citation statements)
references
References 48 publications
0
4
0
Order By: Relevance
“…In [51], Neysiani and Babamir proposed a study aimed at assessing the best DBR detection (or retrieval) approaches. They analyzed both IR-based and Machine Learning (ML) approaches.…”
Section: A Mini-systematic Survey About Dbr Detection and Retrievalmentioning
confidence: 99%
“…In [51], Neysiani and Babamir proposed a study aimed at assessing the best DBR detection (or retrieval) approaches. They analyzed both IR-based and Machine Learning (ML) approaches.…”
Section: A Mini-systematic Survey About Dbr Detection and Retrievalmentioning
confidence: 99%
“…Neysiani et al compared IR-based and ML-based methods for bug report deduplication [12], and the experimental results showed no significant difference in terms of accuracy or runtime efficiency. Campbell et al conducted a quantitative analysis of commonly used bug classification methods, including signature-based approaches (such as functions, addresses, and linked libraries) and text-tokenized methods.…”
Section: Related Surveymentioning
confidence: 99%
“…For example, for bugs with priority p1, the accuracy, precision, recall, and F measure were 0.732, 0.871, 0.732, and 0.796, respectively. Neysiani et al proposed a feature extraction model to aid in bug triage deduplication [12]. The model aggregates various features extracted from bug reports, including multiple text features extracted using TF-IDF, time features, context features, and classification features.…”
Section: Information Retrieval Approaches For Deduplication and Triagementioning
confidence: 99%
“…For the above reasons, researches proposed various techniques based on text mining (Chaparro, 2017;Zhang, Chen, Yang, Lee and Luo, 2016) and machine learning (ML) (Zhang, Wang, Hao, Xie, Zhang and Mei, 2015;Tan, Xu, Wang, Zhang, Xu and Luo, 2020) to automate bug report processing. The widely employed ML techniques include Naïve Bayes (NB) (Lamkanfi, Demeyer, Giger and Goethals, 2010;Abdelmoez, Kholief and Elsalmy, 2012), Random Forest (RF) and Support Vector Machine (SVM) (Neysiani and Babamir, 2020), and k-nearest neighbors (K-NN) (Hamdy and El-Laithy, 2019). However, the performance of the ML techniques is not satisfactory (Ramay, Umer, Yin, Zhu and Illahi, 2019).…”
Section: Introductionmentioning
confidence: 99%