2018
DOI: 10.1007/s00500-018-3093-1
|View full text |Cite
|
Sign up to set email alerts
|

Cross-company defect prediction via semi-supervised clustering-based data filtering and MSTrA-based transfer learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
6
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 42 publications
(25 citation statements)
references
References 41 publications
0
23
0
Order By: Relevance
“…To solve the shortage of labeled data, a feasible solution is making use of the information of both labeled and unlabeled software instances, and SSDP methods have been presented [33], [44]- [46]. Seliya and Khoshgoftaar [47] proposed an expectation maximization based semi-supervised method.…”
Section: B Semi-supervised Defect Prediction (Ssdp)mentioning
confidence: 99%
See 1 more Smart Citation
“…To solve the shortage of labeled data, a feasible solution is making use of the information of both labeled and unlabeled software instances, and SSDP methods have been presented [33], [44]- [46]. Seliya and Khoshgoftaar [47] proposed an expectation maximization based semi-supervised method.…”
Section: B Semi-supervised Defect Prediction (Ssdp)mentioning
confidence: 99%
“…Based on the relationship graph, NSGLP employs a label propagation algorithm to predict defectproneness of unlabeled instances. Yu et al [46] proposed a semi-supervised clustering-based data filtering (MsTrA+) method to filter out irrelevant cross-company data.…”
Section: B Semi-supervised Defect Prediction (Ssdp)mentioning
confidence: 99%
“…Ryu et al [12] proposed a transfer cost-sensitive boosting approach (TCSBoost) for CPDP with few labeled target data. Yu et al [31] proposed an effective solution for Cross-company defect prediction. They firstly provided a novel semi-supervised clustering-based data filtering approach to filter out irrelevant cross-company instances.…”
Section: Cpdp Approachesmentioning
confidence: 99%
“…(1) Although the 31 datasets in our experiment have been widely used in many software defect prediction studies, we still cannot claim that our conclusion can be generalized to other datasets. (2) We only study the seven regression algorithms without additional optimization for a given dataset. 3We only employ FPA as the evaluation measure.…”
Section: Threats To Validitymentioning
confidence: 99%
“…Therefore, defect prediction is often used to help to reasonably allocate limited development and maintenance resources [1]. So far, many efficient software defect prediction methods using statistical methods or machine learning techniques have been proposed [2][3][4], but they are usually confined to predicting a given software module being faulty or non-faulty by means of some binary classification techniques. 1 However, predicting the defect-prone of a given software module does not provide enough logistics to software testing in practice [5][6].…”
Section: Introductionmentioning
confidence: 99%