Hoa Khanh Dam scite author profile

Although there has been substantial research in software analytics for effort estimation in traditional software projects, little work has been done for estimation in agile projects, especially estimating user stories or issues. Story points are the most common unit of measure used for estimating the effort involved in implementing a user story or resolving an issue. In this paper, we offer for the first time a comprehensive dataset for story points-based estimation that contains 23,313 issues from 16 open source projects. We also propose a prediction model for estimating story points based on a novel combination of two powerful deep learning architectures: long short-term memory and recurrent highway network. Our prediction system is endto-end trainable from raw input data to prediction outcomes without any manual feature engineering. An empirical evaluation demonstrates that our approach consistently outperforms three common effort estimation baselines and two alternatives in both Mean Absolute Error and the Standardized Accuracy.

show abstract

DeepJIT: An End-to-End Deep Learning Framework for Just-in-Time Defect Prediction

Hoang

Dam

Kamei

et al. 2019

141

View full text Add to dashboard Cite

Lessons Learned from Using a Deep Tree-Based Model for Software Defect Prediction in Practice

Dam

Pham

et al. 2019

104

View full text Add to dashboard Cite

Defects are common in software systems and can potentially cause various problems to software users. Different methods have been developed to quickly predict the most likely locations of defects in large code bases. Most of them focus on designing features (e.g. complexity metrics) that correlate with potentially defective code. Those approaches however do not sufficiently capture the syntax and different levels of semantics of source code, an important capability for building accurate prediction models. In this paper, we develop a novel prediction model which is capable of automatically learning features for representing source code and using them for defect prediction. Our prediction system is built upon the powerful deep learning, tree-structured Long Short Term Memory network which directly matches with the Abstract Syntax Tree representation of source code. An evaluation on two datasets, one from open source projects contributed by Samsung and the other from the public PROMISE repository, demonstrates the effectiveness of our approach for both within-project and cross-project predictions. CCS CONCEPTS• Software and its engineering → Software creation and management; KEYWORDSSoftware engineering, software analytics, defect prediction ACM Reference Format:

show abstract

An Empirical Study of Model-Agnostic Techniques for Defect Prediction Models

Jiarpakdee

Tantithamthavorn

Dam

et al. 2022

IIEEE Trans. Software Eng.

View full text Add to dashboard Cite

Comparing Agent-Oriented Methodologies

Dam

Winikoff

2004

120

View full text Add to dashboard Cite

Automatic Feature Learning for Predicting Vulnerable Software Components

Dam

Tran

Pham

et al. 2021

IIEEE Trans. Software Eng.

103

View full text Add to dashboard Cite

Generating Pseudo-Code from Source Code Using Deep Learning

Alhefdhi¹,

Dam²,

Hata³

et al. 2018

View full text Add to dashboard Cite

Explainable software analytics

Dam

Tran

Ghose

2018

View full text Add to dashboard Cite

Software analytics has been the subject of considerable recent attention but is yet to receive significant industry traction. One of the key reasons is that software practitioners are reluctant to trust predictions produced by the analytics machinery without understanding the rationale for those predictions. While complex models such as deep learning and ensemble methods improve predictive performance, they have limited explainability. In this paper, we argue that making software analytics models explainable to software practitioners is as important as achieving accurate predictions. Explainability should therefore be a key measure for evaluating software analytics models. We envision that explainability will be a key driver for developing software analytics models that are useful in practice. We outline a research roadmap for this space, building on social science, explainable artificial intelligence and software engineering.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Hoa Khanh Dam

A Deep Learning Model for Estimating Story Points

DeepJIT: An End-to-End Deep Learning Framework for Just-in-Time Defect Prediction

Lessons Learned from Using a Deep Tree-Based Model for Software Defect Prediction in Practice

An Empirical Study of Model-Agnostic Techniques for Defect Prediction Models

Comparing Agent-Oriented Methodologies

Automatic Feature Learning for Predicting Vulnerable Software Components

Generating Pseudo-Code from Source Code Using Deep Learning

Explainable software analytics

Contact Info

Product

Resources

About