Proceedings of the 2018 ACM SIGIR International Conference on Theory of Information Retrieval 2018
DOI: 10.1145/3234944.3234949
|View full text |Cite
|
Sign up to set email alerts
|

Towards Word Embeddings for Improved Duplicate Bug Report Retrieval in Software Repositories

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
40
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 21 publications
(40 citation statements)
references
References 6 publications
0
40
0
Order By: Relevance
“…(1) DWEN [3]: a deep learning based duplicate bug report detection method, which extracts text features from bug reports and builds a neural network model for predicting whether or not two bug reports are duplicate. (2) DBTM [27]: an information retrieval based duplicate bug report detection method, which models a bug report as a textual document describing certain technical issue(s), and models duplicate bug reports as the ones about the same technical issue(s).…”
Section: Incident Linkagementioning
confidence: 99%
“…(1) DWEN [3]: a deep learning based duplicate bug report detection method, which extracts text features from bug reports and builds a neural network model for predicting whether or not two bug reports are duplicate. (2) DBTM [27]: an information retrieval based duplicate bug report detection method, which models a bug report as a textual document describing certain technical issue(s), and models duplicate bug reports as the ones about the same technical issue(s).…”
Section: Incident Linkagementioning
confidence: 99%
“…As shown by the dictionary sources, the method is applied to android bug reports and an 11.55% performance improvement is achieved compared with REP [45]. A similar study [46] uses word embedding.…”
Section: Bug Report Deduplicationmentioning
confidence: 99%
“…Other machine learning methods [55,56], such as hidden Markov models (HMMs) or deep networks, are proposed. They build a model that identifies the features of duplicate bug reports and utilize it.…”
Section: Bug Report Deduplicationmentioning
confidence: 99%
“…A combination of tf-idf and topics learned by LDA have also been used (Nguyen et al, 2012;Budhiraja et al, 2018b). Recently, word embeddings have been used to compute similarity of two reports (Yang et al, 2016;Budhiraja et al, 2018a). However, in these approaches, the sequence information of a natural language sentence is not captured.…”
Section: Related Workmentioning
confidence: 99%