2020
DOI: 10.1109/tse.2018.2877612
|View full text |Cite
|
Sign up to set email alerts
|

Deep Semantic Feature Learning for Software Defect Prediction

Abstract: Software defect prediction, which predicts defective code regions, can assist developers in finding bugs and prioritizing their testing efforts. Traditional defect prediction features often fail to capture the semantic differences between different programs. This degrades the performance of the prediction models built on these traditional features. Thus, the capability to capture the semantics in programs is required to build accurate prediction models. To bridge the gap between semantics and defect prediction… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
133
0
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 177 publications
(134 citation statements)
references
References 95 publications
(233 reference statements)
0
133
0
1
Order By: Relevance
“…However, some of previous studies argued that these threshold‐dependent performance measures are problematic. For example, these measures depend on an arbitrarily selected threshold, and these measures are sensitive to class imbalanced problem existed in most of the gathered SDP datasets.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…However, some of previous studies argued that these threshold‐dependent performance measures are problematic. For example, these measures depend on an arbitrarily selected threshold, and these measures are sensitive to class imbalanced problem existed in most of the gathered SDP datasets.…”
Section: Methodsmentioning
confidence: 99%
“…Jing et al considered subclass discriminant analysis (SDA) method. Wang et al resorted to deep learning. They utilized deep belief network (DBN) to automatically learn semantic features from token vectors extracted from abstract syntax trees (ASTs) of program modules.…”
Section: Background and Related Workmentioning
confidence: 99%
“…-In this paper, we proposed a new CPDP approach called TCNN to obtain the transferable semantic (TCNN-generated) features for cross-project prediction. The key improvement is that, considering the data distribution divergence between projects, TCNN transforms CNN by imbedding the representations of project-specific data to an RKHS for distribution matching -Comprehensive experiments results showed that our TCNN can achieve better prediction performance over classic CPDP methods (e.g., NNFilter [13], data gravitation (DG) [14], TCA+ [10]) and state-of-the-art DL-based approaches (e.g., DBN [9], defect prediction through CNN (DPCNN) [8]) on 90 pairs of CPDP tasks formed by 10 open-source projects.…”
Section: Source Projectmentioning
confidence: 99%
“…In traditional CPDP methods, handcrafted features are commonly adopted to perform CPDP (e.g., Halstead features based on operators and operands [5], McCabe features based on dependencies [6], and CKfeatures based on the object-oriented concept [7]). In recent years, some researchers [8,9] suggested that the generic convolutional neural network (CNN) and deep belief network (DBN) models could extract semantic and structural features from project programs and applied them to perform SDP for better prediction performance. We call these features deep-learning-generated (DL-generated) features.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation