2017
DOI: 10.1007/978-3-319-66562-7_15
|View full text |Cite
|
Sign up to set email alerts
|

Predicting the Programming Language: Extracting Knowledge from Stack Overflow Posts

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
9
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(9 citation statements)
references
References 12 publications
0
9
0
Order By: Relevance
“…Overflow questions to predict their programming language. This classifier achieves an accuracy of 81.1%, a precision of 0.83 and a recall of 0.81 which is much higher than the previous best model (Baquero et al [13]).…”
Section: ) a Classifier That Uses Only Textual Information In Stackmentioning
confidence: 68%
See 1 more Smart Citation
“…Overflow questions to predict their programming language. This classifier achieves an accuracy of 81.1%, a precision of 0.83 and a recall of 0.81 which is much higher than the previous best model (Baquero et al [13]).…”
Section: ) a Classifier That Uses Only Textual Information In Stackmentioning
confidence: 68%
“…Baquero et al [13] proposed a classifier to predict the programming language of a Stack Overflow question. They extracted a set of 18000 questions from Stack Overflow that contained text and code snippets, 1000 questions for each of 18 programming languages.…”
Section: Related Workmentioning
confidence: 99%
“…Dam and Zaytsev [19] utilized statistical language models such as n-grams and skip grams in natural language processing (NLP) for programming language identification. Baquero et al [4] proposed a model to predict the programming language from both comment text data and code snippets of Stack Overflow questions. They used Word2Vec for text feature extraction and n-gram for source code feature extraction.…”
Section: Related Workmentioning
confidence: 99%
“…Therefore, some This article is part of the topical collection "Deep learning approaches for data analysis: A practical perspective" guest edited by D. Jude Hemanth, Lipo Wang and Anastasia Angelopoulou. automatic source code classification methods have been developed based on text classification [1][2][3][4]. In these studies, the source codes are considered as text, and classification methods are applied with the help of natural language processing (NLP) techniques.…”
Section: Introductionmentioning
confidence: 99%
“…In this paper, we are interested in a tool that can classify a code snippet which is a small block reusable code with at least two lines of code, a much more challenging task. The only previous work that studies classification of the programming languages from a code snippet or a few lines of source code is the work of Baquero et al [11]. However, they achieve low accuracy showing that identifying programming languages from a small source code or a code snippet is much harder than larger pieces.…”
Section: Introductionmentioning
confidence: 99%