2020
DOI: 10.1016/j.jss.2020.110740
|View full text |Cite
|
Sign up to set email alerts
|

On the generation, structure, and semantics of grammar patterns in source code identifiers

Abstract: Identifiers make up a majority of the text in code. They are one of the most basic mediums through which developers describe the code they create and understand the code that others create. Therefore, understanding the patterns latent in identifier naming practices and how accurately we are able to automatically model these patterns is vital if researchers are to support developers and automated analysis approaches in comprehending and creating identifiers correctly and optimally. This paper investigates ident… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

4
61
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 29 publications
(65 citation statements)
references
References 55 publications
4
61
0
Order By: Relevance
“…In addition, the ensemble has been made fully available (see Section IV), and is intended for long term support by the research team as we expand the training set and include identifiers from different contexts (e.g., test code). 2) Confirmation of observations we made in prior work [25] that indicate 1) the importance of the position of a word in an identifier, 2) the importance of the context of an identifier when annotating using part-of-speech, and 3) the complementarity of three part-of-speech taggers. 3) An expanded set of manually-annotated identifiers, based on the original set constructed in [25], that can be used to train and create other tagging approaches or for other natural language problems.…”
Section: Introductionsupporting
confidence: 68%
See 4 more Smart Citations
“…In addition, the ensemble has been made fully available (see Section IV), and is intended for long term support by the research team as we expand the training set and include identifiers from different contexts (e.g., test code). 2) Confirmation of observations we made in prior work [25] that indicate 1) the importance of the position of a word in an identifier, 2) the importance of the context of an identifier when annotating using part-of-speech, and 3) the complementarity of three part-of-speech taggers. 3) An expanded set of manually-annotated identifiers, based on the original set constructed in [25], that can be used to train and create other tagging approaches or for other natural language problems.…”
Section: Introductionsupporting
confidence: 68%
“…2) Confirmation of observations we made in prior work [25] that indicate 1) the importance of the position of a word in an identifier, 2) the importance of the context of an identifier when annotating using part-of-speech, and 3) the complementarity of three part-of-speech taggers. 3) An expanded set of manually-annotated identifiers, based on the original set constructed in [25], that can be used to train and create other tagging approaches or for other natural language problems. As with the implementation, this has been made fully available to the research community (see Section IV).…”
Section: Introductionsupporting
confidence: 68%
See 3 more Smart Citations