Empirical comparison of text-based mobile apps similarity measurement techniques

Al-Subaihin, Afnan A.; Sarro, Federica; Black, Sue; Capra, Licia

doi:10.1007/s10664-019-09726-5

Cited by 23 publications

(11 citation statements)

References 72 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The original study extracted features from mobile apps descriptions using four different techniques and compared their effectiveness for clustering apps by using these features and only one clustering approach, i.e. hierarchical clustering [1]. The study found that extracting features using Latent Dirichlet Allocation (LDA) consistently performs well among the investigated feature extraction techniques.…”

Section: Replication Study Designmentioning

confidence: 99%

“…In order to answer these RQs, we used the same dataset as the original study [1] 3 . This dataset contains 12,664 Android mobile applications belonging to 24 categories, which have been randomly sampled from the Google Play app store.…”

Section: Datasetmentioning

confidence: 99%

“…This dataset contains 12,664 Android mobile applications belonging to 24 categories, which have been randomly sampled from the Google Play app store. A detailed description of how this data was collected can be found elsewhere [1]. In this study, we opted to use the GA clustering approach proposed by Maulik and Bandyopadhyay [13].…”

Section: Datasetmentioning

confidence: 99%

“…In this paper, we carry out a partial replication of the original study [1] to investigate whether the results can be improved using a Genetic Algorithm-based clustering algorithm, as evolutionary approaches were shown to be successful as clustering techniques in other application domains [10]. Specifically, we investigate four of the five research questions posed in the original study, but we shift the focus on the clustering approach rather than the feature extraction method.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Exploring the Use of Genetic Algorithm Clustering for Mobile App Categorisation

Al-Subaihin

Sarro

2020

Search-Based Software Engineering

Self Cite

View full text Add to dashboard Cite

Search-based approaches have been successfully used as clustering algorithms in several domains. However, little research has looked into their effectiveness for clustering tasks commonly faced in Software Engineering (SE). This short replication paper presents a preliminary investigation on the use of Genetic Algorithm (GA) to the problem of mobile application categorisation. Our results show the feasibility of GA-based clustering for this task, which we hope will foster new avenues for Search-Based Software Engineering (SBSE) research in this area.

show abstract

Section: Replication Study Designmentioning

confidence: 99%

Section: Datasetmentioning

confidence: 99%

Section: Datasetmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Exploring the Use of Genetic Algorithm Clustering for Mobile App Categorisation

Al-Subaihin

Sarro

2020

Search-Based Software Engineering

Self Cite

View full text Add to dashboard Cite

show abstract

“…In terms of large-scale similarity detection based on slightly modified digital fingerprints for Chinese, there is currently no similar research at home and abroad. A. Al-Subaihin [20] proposed a new approach to batch text similarity detection is proposed by combining some ideas from dimensionality reduction techniques and information gain theory. It was focused on search engines need to detect duplicated and near-duplicated web pages.…”

Section: Introductionmentioning

confidence: 99%