2019
DOI: 10.1007/s10664-019-09694-w
|View full text |Cite
|
Sign up to set email alerts
|

Classifying code comments in Java software systems

Abstract: Code comments are a key software component containing information about the underlying implementation. Several studies have shown that code comments enhance the readability of the code. Nevertheless, not all the comments have the same goal and target audience. In this paper, we investigate how 14 diverse Java open and closed source software projects use code comments, with the aim of understanding their purpose. Through our analysis, we produce a taxonomy of source code comments; subsequently, we investigate h… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
30
0
1

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 48 publications
(32 citation statements)
references
References 29 publications
(46 reference statements)
1
30
0
1
Order By: Relevance
“…The most relevant approaches are References 7,8, which classify Java and C/C++ source code comments using a J48 decision tree algorithm and a NB multinominal classifier, respectively. Both use specific text preprocessing and feature engineering with rule‐based taxonomy classification, and achieve a weighted average precision and recall of 93% to 96% (on imbalanced datasets).…”
Section: Discussion Of Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…The most relevant approaches are References 7,8, which classify Java and C/C++ source code comments using a J48 decision tree algorithm and a NB multinominal classifier, respectively. Both use specific text preprocessing and feature engineering with rule‐based taxonomy classification, and achieve a weighted average precision and recall of 93% to 96% (on imbalanced datasets).…”
Section: Discussion Of Resultsmentioning
confidence: 99%
“…Pascarella et al employ supervised ML to build an automated classification approach 8 . Their study presents a publicly available dataset with manually classified comments on which they apply probabilistic classifiers, for example, Naive Bayes (NB), and decision tree algorithms, for example, J48 and RF, to detect and classify comments according to the hierarchical taxonomy of code comments introduced before.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…We use the approach proposed by Pascarella et al to classify the header and internal comments of public, private, and protected methods, respectively. As Tables 6 to 8 show, according to the work of Pascarella et al, each comment can be classified at coarse granularity, such as Purpose, Notice, Under developer, Style and IDE, Metadata, and Discarded. Meanwhile, each categorization can be divided into finer categories.…”
Section: Empirical Studymentioning
confidence: 99%