<p>Cross-project defect prediction (CPDP) is a field of study that allows predicting defects in software projects for which the availability of data is limited and produces generalizable prediction models. Due to the heterogeneity of cross projects, CPDP is particularly challenging and several methods have been employed to address this problem. Nevertheless, the class-imbalanced characteristic of the cross-project defect data also increases the learning difficulty of such a task but has not been investigated in depth. This paper proposed a novel, cost-cognitive ensemble method for CPDP, which includes four phases: bagging balanced resampling phase, base classifiers learning phase, cost value cognitive phase, and base classifiers ensemble phase. These phases create a composition of classifiers that are used for predicting defects. Results of an empirical evaluation on 10 datasets from the PROMISE repository indicated that our method achieves the best overall performance with respect to conventional methods. Moreover, our method could cognize the cost value automatically during the model training, it is shown to be more effective and practical.</p> <p> </p>
<p>In software engineering, software personnel faced many large-scale software and complex systems, these need programmers to quickly and accurately read and understand the code, and efficiently complete the tasks of software change or maintenance tasks. Code-NN is the first model to use deep learning to accomplish the task of code summary generation, but it is not used the structural information in the code itself. In the past five years, researchers have designed different code summarization systems based on neural networks. They generally use the end-to-end neural machine translation framework, but many current research methods do not make full use of the structural information of the code. This paper raises a new model called G-DCS to automatically generate a summary of java code; the generated summary is designed to help programmers quickly comprehend the effect of java methods. G-DCS uses natural language processing technology, and training the model uses a code corpus. This model could generate code summaries directly from the code files in the coded corpus. Compared with the traditional method, it uses the information of structural on the code. Through Graph Convolutional Neural Network (GCN) extracts the structural information on the code to generate the code sequence, which makes the generated code summary more accurate. The corpus used for training was obtained from GitHub. Evaluation criteria using BLEU-n. Experimental results show that our approach outperforms models that do not utilize code structure information.</p> <p> </p>
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.