2019
DOI: 10.1007/s12046-019-1223-9

Natural language processing in mining unstructured data from software repositories: a review

Abstract: With the increasing popularity of open-source platforms, software data is easily available from various open-source tools like GitHub, CVS, SVN, etc. More than 80 percent of the data present in them is unstructured. Mining data from these repositories helps project managers, developers and businesses gain interesting insights. Most of the software artefacts present in these repositories are in natural language form, which makes natural language processing (NLP) an important part of mining to get the…

citations
Cited by 11 publications
(2 citation statements)
references
References 62 publications
“…A more recent review was done, broadly focusing on adopting NLP to mine unstructured data in software repositories (Gupta and Gupta, 2019). The review was done by looking into general applications of mining repositories, with a sub-focus on traceability efforts.…”
Section: Related Work (mentioning)
confidence: 99%
“…These are neural networks that have been trained on large amounts of text data. They can be tuned to perform well on specific tasks, such as sentiment analysis [3], [4].…”
Section: Introduction (mentioning)
confidence: 99%