2017
DOI: 10.1109/tse.2017.2654244
|View full text |Cite
|
Sign up to set email alerts
|

Using Natural Language Processing to Automatically Detect Self-Admitted Technical Debt

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

5
120
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
3
3
3

Relationship

1
8

Authors

Journals

citations
Cited by 147 publications
(125 citation statements)
references
References 47 publications
5
120
0
Order By: Relevance
“…Besides a simple pattern-matching of keywords in comments [10], [40], different approaches for detecting SATDrelated comments have been proposed in the literature. Specifically, Maldonado et al [11] used a Natural Language Processing approach to classify SATD. Also, Ren et al [41] proposed the use of CNN to classify SATD, outperforming previously-proposed approaches.…”
Section: A Self-admitted Technical Debt (Satd) and Its Removalmentioning
confidence: 99%
“…Besides a simple pattern-matching of keywords in comments [10], [40], different approaches for detecting SATDrelated comments have been proposed in the literature. Specifically, Maldonado et al [11] used a Natural Language Processing approach to classify SATD. Also, Ren et al [41] proposed the use of CNN to classify SATD, outperforming previously-proposed approaches.…”
Section: A Self-admitted Technical Debt (Satd) and Its Removalmentioning
confidence: 99%
“…Since our dataset is unbalanced (i.e., only a small percentage of commits are CI skipped), we would like to put our results in context by comparing it to a baseline that takes this imbalanced data into account. Similar to prior work [9], [32], we calculate the performance of the baseline model as follows: the precision of this baseline model is calculated by taking the total number of CI skip commits over the total number of commits of each project. For example, project jMotif-GI has a total number of 345 commits, of those, only 42 commits are commits that are explicitly labeled as CI skip commits.…”
Section: Rq1: How Effective Is Our Rule-based Technique In Detecting mentioning
confidence: 99%
“…In some cases, the list of file types we use may not be comprehensive. We also provide a list of all the file extensions that are used in our study 9 .…”
Section: Internal Validitymentioning
confidence: 99%
“…In [10], customers accesses to businesses URLs are analyzed using a word2vec-based method to propose better services to customers. Finally, NLP is also used to detect design and requirement debts [13] from comments of ten open source projects.…”
Section: Related Workmentioning
confidence: 99%