Proceedings of the 21st International Conference on Evaluation and Assessment in Software Engineering 2017
DOI: 10.1145/3084226.3084273
On Using Active Learning and Self-training when Mining Performance Discussions on Stack Overflow

Abstract: Abundant data is the key to successful machine learning. However, supervised learning requires annotated data that are often hard to obtain. In a classification task with limited resources, Active Learning (AL) promises to guide annotators to examples that bring the most value for a classifier. AL can be successfully combined with self-training, i.e., extending a training set with the unlabelled examples for which a classifier is the most certain. We report our experiences on using AL in a systematic manner to…
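As a reading aid, here is a minimal sketch (not the paper's implementation) of the pool-based active learning loop the abstract alludes to, using uncertainty sampling with a scikit-learn classifier; all data, labels, and variable names below are illustrative assumptions.

```python
# Minimal sketch of pool-based active learning with uncertainty sampling.
# The texts, labels, and names are made up for illustration only.
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression

seed_texts = ["app is slow under load", "how to parse json",
              "query latency spikes", "rename a git branch"]
seed_labels = [1, 0, 1, 0]  # 1 = performance discussion, 0 = other
pool_texts = ["high cpu usage in loop", "center a div with css",
              "throughput drops after upgrade", "regex for email"]

vec = TfidfVectorizer()
X_seed = vec.fit_transform(seed_texts)
clf = LogisticRegression(max_iter=1000).fit(X_seed, seed_labels)

# Uncertainty sampling: route the pool examples with the lowest
# top-class probability to the human annotators first.
proba = clf.predict_proba(vec.transform(pool_texts))
order = np.argsort(proba.max(axis=1))   # least confident first
print([pool_texts[i] for i in order[:2]])  # next batch to annotate
```

Uncertainty sampling is one common query strategy; the value of AL here is that each annotated example is chosen to maximally improve the classifier rather than drawn at random.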

Cited by 4 publications (4 citation statements)
References 26 publications
“…The obvious first step is to extend the dataset used for both the classification model and the CycleGAN. As the data labelling is labor-intensive, we plan to rely on our previous experience in active learning to focus annotation effort for maximum return on investment [4]. With more data, we can train the classifier to predict additional classes, including input related to emergency response.…”
Section: Discussion
Mentioning confidence: 99%
“…Active learning techniques select the most informative unlabeled examples to predict their label and include them in the training set [63]. Many researchers have successfully combined active learning with self-training to reduce the human labeling effort and enhance classification performance [33,64]. Motivated by the existing research [33], we integrate active learning with self-training to select the most informative and highest-confidence examples.…”
Section: Active Self-training Based Sentiment Learner
Mentioning confidence: 99%
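The statement above describes absorbing high-confidence pseudo-labels into the training set. A minimal sketch of that self-training step, compatible with the classifier from the earlier sketch; the 0.9 threshold and all names are assumptions, not values from the cited work.

```python
# Sketch of one self-training step: pseudo-label the unlabeled pool and
# keep only predictions above a confidence threshold. The 0.9 threshold
# and the variable names are illustrative assumptions.
import numpy as np
from scipy.sparse import vstack

def self_train_step(clf, X_labeled, y_labeled, X_pool, threshold=0.9):
    proba = clf.predict_proba(X_pool)
    conf = proba.max(axis=1)
    keep = conf >= threshold                  # most-certain pseudo-labels only
    X_aug = vstack([X_labeled, X_pool[keep]])
    y_aug = np.concatenate([y_labeled, proba[keep].argmax(axis=1)])
    clf.fit(X_aug, y_aug)                     # retrain on the extended set
    return clf, ~keep                         # classifier + mask of remaining pool
```

In a combined AL/self-training loop, each round typically sends the least-confident pool examples to annotators and the most-confident ones through this step.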
“…In software engineering (SE), there are studies reporting low values for expert agreement/reliability using Krippendorff's alpha and/or ICC, e.g., Borg et al. [4], Anvaari et al. [1], and Kitchenham et al. [27]. Evaluations depend on the interpretation of the construct under study, i.e., they involve some degree of subjectivity [5,47].…”
Section: Assessment Of Responses
Mentioning confidence: 99%
“…Values of α ≥ 0.800 are suggested for drawing reliable conclusions, while values 0.667 ≤ α < 0.800 support tentative conclusions only [29]. We used the R function kripp.alpha to measure the level of agreement among the respondents (raters) on the criteria (subjects) of the top 6 most evaluated tools. We considered the level of measurement for the data to be ratio, since the possible values (from 0 to 10 at intervals of 0.5, i.e., 21 levels) are equally spaced, ordered units with an absolute zero.…”
Section: Krippendorff's Alpha
Mentioning confidence: 99%
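The statement above computes agreement with R's kripp.alpha at the ratio level. A small equivalent sketch using the third-party Python package krippendorff instead; the rating matrix is made up (rows are raters, columns are rated criteria, NaN marks a missing rating), and the thresholds follow the [29] guidance quoted above.

```python
# Sketch of a Krippendorff's alpha computation at the ratio level, using
# the third-party `krippendorff` package (pip install krippendorff).
# The ratings are made up; scores run from 0 to 10 in steps of 0.5.
import numpy as np
import krippendorff

ratings = np.array([
    [8.0, 6.5, 9.0, 7.5],      # rater 1
    [7.5, 6.0, 9.0, 8.0],      # rater 2
    [8.0, 6.5, np.nan, 7.5],   # rater 3, one missing rating
])

alpha = krippendorff.alpha(reliability_data=ratings,
                           level_of_measurement="ratio")
if alpha >= 0.800:
    verdict = "reliable conclusions"
elif alpha >= 0.667:
    verdict = "tentative conclusions only"
else:
    verdict = "below the tentative threshold"
print(f"alpha = {alpha:.3f} ({verdict})")
```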