Proceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of 2019
DOI: 10.1145/3338906.3338939
|View full text |Cite
|
Sign up to set email alerts
|

Going big: a large-scale study on what big data developers ask

Abstract: Software developers are increasingly required to write big data code. However, they find big data software development challenging. To help these developers it is necessary to understand big data topics that they are interested in and the difficulty of finding answers for questions in these topics. In this work, we conduct a large-scale study on Stackoverflow to understand the interest and difficulties of big data developers. To conduct the study, we develop a set of big data tags to extract big data posts fro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

5
137
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 78 publications
(142 citation statements)
references
References 35 publications
5
137
0
Order By: Relevance
“…Finally, we refined by keeping tags that are significantly relevant to Docker and excludeding others. We used two heuristics and from previous work [4], [8] and [42] to measure the significance and relevance of each tag in the Docker tag set .…”
Section: 11mentioning
confidence: 99%
See 3 more Smart Citations
“…Finally, we refined by keeping tags that are significantly relevant to Docker and excludeding others. We used two heuristics and from previous work [4], [8] and [42] to measure the significance and relevance of each tag in the Docker tag set .…”
Section: 11mentioning
confidence: 99%
“…It provides 93,183 answer posts where 32,632 answer posts are considered as accepted answer posts. To reduce the noise, only accepted answers are taken into consideration by following earlier approaches [4], [8] and [33]. Finally, the data set contains 113,922 SoF questions and answers of which 81,290 (71.4%) are questions and 32,632 (28.6%) are accepted answers.…”
Section: 13mentioning
confidence: 99%
See 2 more Smart Citations
“…Traditional big data systems, for which MapReduce is a popular backbone [3], minimize the complexity of writing massively distributed programs by facilitating processing on multiple nodes using succinct functional-like constructs. This makes writing parallel code easier, as writing such code can be difficult due to possible data races, thread interference, and contention [1,4,28].…”
Section: Introductionmentioning
confidence: 99%